Genomic expression patterns of cardiac tissues from dogs with dilated cardiomyopathy.
Oyama, Mark A; Chittur, Sridar
2005-07-01
To evaluate global genome expression patterns of left ventricular tissues from dogs with dilated cardiomyopathy (DCM). Tissues obtained from the left ventricle of 2 Doberman Pinschers with end-stage DCM and 5 healthy control dogs. Transcriptional activities of 23,851 canine DNA sequences were determined by use of an oligonucleotide microarray. Genome expression patterns of DCM tissue were evaluated by measuring the relative amount of complementary RNA hybridization to the microarray probes and comparing it with gene expression for tissues from 5 healthy control dogs. 478 transcripts were differentially expressed (> or = 2.5-fold change). In DCM tissue, expression of 173 transcripts was upregulated and expression of 305 transcripts was downregulated, compared with expression for control tissues. Of the 478 transcripts, 167 genes could be specifically identified. These genes were grouped into 1 of 8 categories on the basis of their primary physiologic function. Grouping revealed that pathways involving cellular energy production, signaling and communication, and cell structure were generally downregulated, whereas pathways involving cellular defense and stress responses were upregulated. Many previously unreported genes that may contribute to the pathophysiologic aspects of heart disease were identified. Evaluation of global expression patterns provides a molecular portrait of heart failure, yields insights into the pathophysiologic aspects of DCM, and identifies intriguing genes and pathways for further study.
USDA-ARS?s Scientific Manuscript database
Polymorphic genetic markers were identified and characterized using a partial genomic library of Heliothis virescens enriched for simple sequence repeats (SSR) and nucleotide sequences of expressed sequence tags (EST). Nucleotide sequences of 192 clones from the partial genomic library yielded 147 u...
Alterations in DNA methylation have been proposed as a mechanism for the complex toxicological effects of arsenic. In this study, whole genome DNA methylation and gene expression changes were evaluated in lungs from female mice exposed for 90 days to 50 ppm arsenate (As) in drink...
Genome-wide Gene Expression Profiling of Acute Metal Exposures in Male Zebrafish
2014-10-23
Data in Brief Genome-wide gene expression profiling of acute metal exposures in male zebrafish Christine E. Baer a,⁎, Danielle L. Ippolito b, Naissan... Zebrafish Whole organism Nickel Chromium Cobalt Toxicogenomics To capture global responses to metal poisoning and mechanistic insights into metal...toxicity, gene expression changes were evaluated in whole adult male zebrafish following acute 24 h high dose exposure to three metals with known human
Lowry, David B.; Logan, Tierney L.; Santuari, Luca; Hardtke, Christian S.; Richards, James H.; DeRose-Wilson, Leah J.; McKay, John K.; Sen, Saunak; Juenger, Thomas E.
2013-01-01
The regulation of gene expression is crucial for an organism’s development and response to stress, and an understanding of the evolution of gene expression is of fundamental importance to basic and applied biology. To improve this understanding, we conducted expression quantitative trait locus (eQTL) mapping in the Tsu-1 (Tsushima, Japan) × Kas-1 (Kashmir, India) recombinant inbred line population of Arabidopsis thaliana across soil drying treatments. We then used genome resequencing data to evaluate whether genomic features (promoter polymorphism, recombination rate, gene length, and gene density) are associated with genes responding to the environment (E) or with genes with genetic variation (G) in gene expression in the form of eQTLs. We identified thousands of genes that responded to soil drying and hundreds of main-effect eQTLs. However, we identified very few statistically significant eQTLs that interacted with the soil drying treatment (GxE eQTL). Analysis of genome resequencing data revealed associations of several genomic features with G and E genes. In general, E genes had lower promoter diversity and local recombination rates. By contrast, genes with eQTLs (G) had significantly greater promoter diversity and were located in genomic regions with higher recombination. These results suggest that genomic architecture may play an important a role in the evolution of gene expression. PMID:24045022
Genomic Imprinting Was Evolutionarily Conserved during Wheat Polyploidization[OPEN
Yang, Guanghui; Liu, Zhenshan; Gao, Lulu; Yu, Kuohai; Feng, Man; Peng, Huiru; Sun, Qixin; Ni, Zhongfu
2018-01-01
Genomic imprinting is an epigenetic phenomenon that causes genes to be differentially expressed depending on their parent of origin. To evaluate the evolutionary conservation of genomic imprinting and the effects of ploidy on this process, we investigated parent-of-origin-specific gene expression patterns in the endosperm of diploid (Aegilops spp), tetraploid, and hexaploid wheat (Triticum spp) at various stages of development via high-throughput transcriptome sequencing. We identified 91, 135, and 146 maternally or paternally expressed genes (MEGs or PEGs, respectively) in diploid, tetraploid, and hexaploid wheat, respectively, 52.7% of which exhibited dynamic expression patterns at different developmental stages. Gene Ontology enrichment analysis suggested that MEGs and PEGs were involved in metabolic processes and DNA-dependent transcription, respectively. Nearly half of the imprinted genes exhibited conserved expression patterns during wheat hexaploidization. In addition, 40% of the homoeolog pairs originating from whole-genome duplication were consistently maternally or paternally biased in the different subgenomes of hexaploid wheat. Furthermore, imprinted expression was found for 41.2% and 50.0% of homolog pairs that evolved by tandem duplication after genome duplication in tetraploid and hexaploid wheat, respectively. These results suggest that genomic imprinting was evolutionarily conserved between closely related Triticum and Aegilops species and in the face of polyploid hybridization between species in these genera. PMID:29298834
NASA Astrophysics Data System (ADS)
Trajano, L. A. S. N.; Sergio, L. P. S.; Silva, C. L.; Carvalho, L.; Mencalha, A. L.; Stumbo, A. C.; Fonseca, A. S.
2016-07-01
Low-level lasers are used for the treatment of diseases in soft and bone tissues, but few data are available regarding their effects on genomic stability. In this study, we investigated mRNA expression from genes involved in DNA repair and genomic stabilization in myoblasts exposed to low-level infrared laser. C2C12 myoblast cultures in different fetal bovine serum concentrations were exposed to low-level infrared laser (10, 35 and 70 J cm-2), and collected for the evaluation of DNA repair gene expression. Laser exposure increased gene expression related to base excision repair (8-oxoguanine DNA glycosylase and apurinic/apyrimidinic endonuclease 1), nucleotide excision repair (excision repair cross-complementation group 1 and xeroderma pigmentosum C protein) and genomic stabilization (ATM serine/threonine kinase and tumor protein p53) in normal and low fetal bovine serum concentrations. Results suggest that genomic stability could be part of a biostimulation effect of low-level laser therapy in injured muscles.
Consequences of reductive evolution for gene expression in an obligate endosymbiont.
Wilcox, Jennifer L; Dunbar, Helen E; Wolfinger, Russell D; Moran, Nancy A
2003-06-01
The smallest cellular genomes are found in obligate symbiotic and pathogenic bacteria living within eukaryotic hosts. In comparison with large genomes of free-living relatives, these reduced genomes are rearranged and have lost most regulatory elements. To test whether reduced bacterial genomes incur reduced regulatory capacities, we used full-genome microarrays to evaluate transcriptional response to environmental stress in Buchnera aphidicola, the obligate endosymbiont of aphids. The 580 genes of the B. aphidicola genome represent a subset of the 4500 genes known from the related organism, Escherichia coli. Although over 20 orthologues of E. coli heat stress (HS) genes are retained by B. aphidicola, only five were differentially expressed after near-lethal heat stress treatments, and only modest shifts were observed. Analyses of upstream regulatory regions revealed loss or degradation of most HS (sigma32) promoters. Genomic rearrangements downstream of an intact HS promoter yielded upregulation of a functionally unrelated and an inactivated gene. Reanalyses of comparable experimental array data for E. coli and Bacillus subtilis revealed that genome-wide differential expression was significantly lower in B. aphidicola. Our demonstration of a diminished stress response validates reports of temperature sensitivity in B. aphidicola and suggests that this reduced bacterial genome exhibits transcriptional inflexibility.
Plum pox virus (PPV) genome expression in genetically engineered RNAi plants
USDA-ARS?s Scientific Manuscript database
An important approach to controlling sharka disease caused by Plum pox virus (PPV) is the development of PPV resistant plants using small interfering RNAs (siRNA) technology. In order to evaluate siRNA induced gene silencing, we studied, based on knowledge of the PPV genome sequence, virus genome t...
Genomic Imprinting Was Evolutionarily Conserved during Wheat Polyploidization.
Yang, Guanghui; Liu, Zhenshan; Gao, Lulu; Yu, Kuohai; Feng, Man; Yao, Yingyin; Peng, Huiru; Hu, Zhaorong; Sun, Qixin; Ni, Zhongfu; Xin, Mingming
2018-01-01
Genomic imprinting is an epigenetic phenomenon that causes genes to be differentially expressed depending on their parent of origin. To evaluate the evolutionary conservation of genomic imprinting and the effects of ploidy on this process, we investigated parent-of-origin-specific gene expression patterns in the endosperm of diploid ( Aegilops spp), tetraploid, and hexaploid wheat ( Triticum spp) at various stages of development via high-throughput transcriptome sequencing. We identified 91, 135, and 146 maternally or paternally expressed genes (MEGs or PEGs, respectively) in diploid, tetraploid, and hexaploid wheat, respectively, 52.7% of which exhibited dynamic expression patterns at different developmental stages. Gene Ontology enrichment analysis suggested that MEGs and PEGs were involved in metabolic processes and DNA-dependent transcription, respectively. Nearly half of the imprinted genes exhibited conserved expression patterns during wheat hexaploidization. In addition, 40% of the homoeolog pairs originating from whole-genome duplication were consistently maternally or paternally biased in the different subgenomes of hexaploid wheat. Furthermore, imprinted expression was found for 41.2% and 50.0% of homolog pairs that evolved by tandem duplication after genome duplication in tetraploid and hexaploid wheat, respectively. These results suggest that genomic imprinting was evolutionarily conserved between closely related Triticum and Aegilops species and in the face of polyploid hybridization between species in these genera. © 2018 American Society of Plant Biologists. All rights reserved.
Attitudes regarding privacy of genomic information in personalized cancer therapy
Rogith, Deevakar; Yusuf, Rafeek A; Hovick, Shelley R; Peterson, Susan K; Burton-Chase, Allison M; Li, Yisheng; Meric-Bernstam, Funda; Bernstam, Elmer V
2014-01-01
Objective To evaluate attitudes regarding privacy of genomic data in a sample of patients with breast cancer. Methods Female patients with breast cancer (n=100) completed a questionnaire assessing attitudes regarding concerns about privacy of genomic data. Results Most patients (83%) indicated that genomic data should be protected. However, only 13% had significant concerns regarding privacy of such data. Patients expressed more concern about insurance discrimination than employment discrimination (43% vs 28%, p<0.001). They expressed less concern about research institutions protecting the security of their molecular data than government agencies or drug companies (20% vs 38% vs 44%; p<0.001). Most did not express concern regarding the association of their genomic data with their name and personal identity (49% concerned), billing and insurance information (44% concerned), or clinical data (27% concerned). Significantly fewer patients were concerned about the association with clinical data than other data types (p<0.001). In the absence of direct benefit, patients were more willing to consent to sharing of deidentified than identified data with researchers not involved in their care (76% vs 60%; p<0.001). Most (85%) patients were willing to consent to DNA banking. Discussion While patients are opposed to indiscriminate release of genomic data, privacy does not appear to be their primary concern. Furthermore, we did not find any specific predictors of privacy concerns. Conclusions Patients generally expressed low levels of concern regarding privacy of genomic data, and many expressed willingness to consent to sharing their genomic data with researchers. PMID:24737606
Gene networks are rapidly growing in size and number, raising the question of which networks are most appropriate for particular applications. Here, we evaluate 21 human genome-wide interaction networks for their ability to recover 446 disease gene sets identified through literature curation, gene expression profiling, or genome-wide association studies. While all networks have some ability to recover disease genes, we observe a wide range of performance with STRING, ConsensusPathDB, and GIANT networks having the best performance overall.
Forreryd, Andy; Johansson, Henrik; Albrekt, Ann-Sofie; Lindstedt, Malin
2014-05-16
Allergic contact dermatitis (ACD) develops upon exposure to certain chemical compounds termed skin sensitizers. To reduce the occurrence of skin sensitizers, chemicals are regularly screened for their capacity to induce sensitization. The recently developed Genomic Allergen Rapid Detection (GARD) assay is an in vitro alternative to animal testing for identification of skin sensitizers, classifying chemicals by evaluating transcriptional levels of a genomic biomarker signature. During assay development and biomarker identification, genome-wide expression analysis was applied using microarrays covering approximately 30,000 transcripts. However, the microarray platform suffers from drawbacks in terms of low sample throughput, high cost per sample and time consuming protocols and is a limiting factor for adaption of GARD into a routine assay for screening of potential sensitizers. With the purpose to simplify assay procedures, improve technical parameters and increase sample throughput, we assessed the performance of three high throughput gene expression platforms--nCounter®, BioMark HD™ and OpenArray®--and correlated their performance metrics against our previously generated microarray data. We measured the levels of 30 transcripts from the GARD biomarker signature across 48 samples. Detection sensitivity, reproducibility, correlations and overall structure of gene expression measurements were compared across platforms. Gene expression data from all of the evaluated platforms could be used to classify most of the sensitizers from non-sensitizers in the GARD assay. Results also showed high data quality and acceptable reproducibility for all platforms but only medium to poor correlations of expression measurements across platforms. In addition, evaluated platforms were superior to the microarray platform in terms of cost efficiency, simplicity of protocols and sample throughput. We evaluated the performance of three non-array based platforms using a limited set of transcripts from the GARD biomarker signature. We demonstrated that it was possible to achieve acceptable discriminatory power in terms of separation between sensitizers and non-sensitizers in the GARD assay while reducing assay costs, simplify assay procedures and increase sample throughput by using an alternative platform, providing a first step towards the goal to prepare GARD for formal validation and adaption of the assay for industrial screening of potential sensitizers.
Partnering for functional genomics research conference: Abstracts of poster presentations
DOE Office of Scientific and Technical Information (OSTI.GOV)
NONE
1998-06-01
This reports contains abstracts of poster presentations presented at the Functional Genomics Research Conference held April 16--17, 1998 in Oak Ridge, Tennessee. Attention is focused on the following areas: mouse mutagenesis and genomics; phenotype screening; gene expression analysis; DNA analysis technology development; bioinformatics; comparative analyses of mouse, human, and yeast sequences; and pilot projects to evaluate methodologies.
Reference genome sequence of the model plant Setaria
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bennetzen, Jeffrey L; Schmutz, Jeremy; Wang, Hao
We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ~400-Mb assembly covers ~80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species thatmore » demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).« less
Reference genome sequence of the model plant Setaria.
Bennetzen, Jeffrey L; Schmutz, Jeremy; Wang, Hao; Percifield, Ryan; Hawkins, Jennifer; Pontaroli, Ana C; Estep, Matt; Feng, Liang; Vaughn, Justin N; Grimwood, Jane; Jenkins, Jerry; Barry, Kerrie; Lindquist, Erika; Hellsten, Uffe; Deshpande, Shweta; Wang, Xuewen; Wu, Xiaomei; Mitros, Therese; Triplett, Jimmy; Yang, Xiaohan; Ye, Chu-Yu; Mauro-Herrera, Margarita; Wang, Lin; Li, Pinghua; Sharma, Manoj; Sharma, Rita; Ronald, Pamela C; Panaud, Olivier; Kellogg, Elizabeth A; Brutnell, Thomas P; Doust, Andrew N; Tuskan, Gerald A; Rokhsar, Daniel; Devos, Katrien M
2012-05-13
We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ∼400-Mb assembly covers ∼80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).
Social Status Modulates Gene Expression and Metabolite Profiles in the Fathead Minnow Males
The fathead minnow (FHM) is a valuable small fish model for genomic research in ecotoxicology. Our recent studies have successfully used genomic and metabolomic analyses to evaluate responses to endocrine disrupting compounds (EDCs) in urine of the FHM, but these results indicate...
Evaluation of a toxicogenomic approach to the local lymph node assay (LLNA).
Boverhof, Darrell R; Gollapudi, B Bhaskar; Hotchkiss, Jon A; Osterloh-Quiroz, Mandy; Woolhiser, Michael R
2009-02-01
Genomic technologies have the potential to enhance and complement existing toxicology endpoints; however, assessment of these approaches requires a systematic evaluation including a robust experimental design with genomic endpoints anchored to traditional toxicology endpoints. The present study was conducted to assess the sensitivity of genomic responses when compared with the traditional local lymph node assay (LLNA) endpoint of lymph node cell proliferation and to evaluate the responses for their ability to provide insights into mode of action. Female BALB/c mice were treated with the sensitizer trimellitic anhydride (TMA), following the standard LLNA dosing regimen, at doses of 0.1, 1, or 10% and traditional tritiated thymidine ((3)HTdR) incorporation and gene expression responses were monitored in the auricular lymph nodes. Additional mice dosed with either vehicle or 10% TMA and sacrificed on day 4 or 10, were also included to examine temporal effects on gene expression. Analysis of (3)HTdR incorporation revealed TMA-induced stimulation indices of 2.8, 22.9, and 61.0 relative to vehicle with an EC(3) of 0.11%. Examination of the dose-response gene expression responses identified 9, 833, and 2122 differentially expressed genes relative to vehicle for the 0.1, 1, and 10% TMA dose groups, respectively. Calculation of EC(3) values for differentially expressed genes did not identify a response that was more sensitive than the (3)HTdR value, although a number of genes displayed comparable sensitivity. Examination of temporal responses revealed 1760, 1870, and 953 differentially expressed genes at the 4-, 6-, and 10-day time points respectively. Functional analysis revealed many responses displayed dose- and time-specific induction patterns within the functional categories of cellular proliferation and immune response, including numerous immunoglobin genes which were highly induced at the day 10 time point. Overall, these experiments have systematically illustrated the potential utility of genomic endpoints to enhance the LLNA and support further exploration of this approach through examination of a more diverse array of chemicals.
Schmidt, Ellen M; Zhang, Ji; Zhou, Wei; Chen, Jin; Mohlke, Karen L; Chen, Y Eugene; Willer, Cristen J
2015-08-15
The majority of variation identified by genome wide association studies falls in non-coding genomic regions and is hypothesized to impact regulatory elements that modulate gene expression. Here we present a statistically rigorous software tool GREGOR (Genomic Regulatory Elements and Gwas Overlap algoRithm) for evaluating enrichment of any set of genetic variants with any set of regulatory features. Using variants from five phenotypes, we describe a data-driven approach to determine the tissue and cell types most relevant to a trait of interest and to identify the subset of regulatory features likely impacted by these variants. Last, we experimentally evaluate six predicted functional variants at six lipid-associated loci and demonstrate significant evidence for allele-specific impact on expression levels. GREGOR systematically evaluates enrichment of genetic variation with the vast collection of regulatory data available to explore novel biological mechanisms of disease and guide us toward the functional variant at trait-associated loci. GREGOR, including source code, documentation, examples, and executables, is available at http://genome.sph.umich.edu/wiki/GREGOR. cristen@umich.edu Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
USDA-ARS?s Scientific Manuscript database
Oxalate oxidases catalyze the degradation of oxalic acid (OA). Highly resistant transgenic soybean carrying an oxalate oxidase (OxO) gene and its susceptible parent soybean line, AC Colibri, were tested for genome-wide gene expression in response to the necrotrophic, OA producing pathogen Sclerotini...
2015-10-24
zebrafish reference genome sequence and its relationship to the human genome . Nature. 2013;496(7446):498–503. 21. Linney E, Upchurch L, Donerly S. Zebrafish...To obtain a broader understanding of the effects of dichlorvos on liver metabolism, we per- formed a genome -wide analysis of gene expression in the ...condition) for whole genome transcript ana- lysis, and fixed another set of fish for histological evaluation (n = 5/condition). We determined the target
DOE Office of Scientific and Technical Information (OSTI.GOV)
Henrique Barreta, Marcos; Laboratorio de Biotecnologia e Reproducao Animal-BioRep, Universidade Federal de Santa Maria, Santa Maria, RS; Garziera Gasperin, Bernardo
2012-10-01
This study investigated the expression of genes controlling homologous recombination (HR), and non-homologous end-joining (NHEJ) DNA-repair pathways in bovine embryos of different developmental potential. It also evaluated whether bovine embryos can respond to DNA double-strand breaks (DSBs) induced with ultraviolet irradiation by regulating expression of genes involved in HR and NHEJ repair pathways. Embryos with high, intermediate or low developmental competence were selected based on the cleavage time after in vitro insemination and were removed from in vitro culture before (36 h), during (72 h) and after (96 h) the expected period of embryonic genome activation. All studied genes weremore » expressed before, during and after the genome activation period regardless the developmental competence of the embryos. Higher mRNA expression of 53BP1 and RAD52 was found before genome activation in embryos with low developmental competence. Expression of 53BP1, RAD51 and KU70 was downregulated at 72 h and upregulated at 168 h post-insemination in response to DSBs induced by ultraviolet irradiation. In conclusion, important genes controlling HR and NHEJ DNA-repair pathways are expressed in bovine embryos, however genes participating in these pathways are only regulated after the period of embryo genome activation in response to ultraviolet-induced DSBs.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bennetzen, Jeffrey L; Yang, Xiaohan; Ye, Chuyu
We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The {approx}400-Mb assembly covers {approx}80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species thatmore » demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).« less
Clinical and Biological Relevance of Genomic Heterogeneity in Chronic Lymphocytic Leukemia
Friedman, Daphne R.; Lucas, Joseph E.; Weinberg, J. Brice
2013-01-01
Background Chronic lymphocytic leukemia (CLL) is typically regarded as an indolent B-cell malignancy. However, there is wide variability with regards to need for therapy, time to progressive disease, and treatment response. This clinical variability is due, in part, to biological heterogeneity between individual patients’ leukemias. While much has been learned about this biological variation using genomic approaches, it is unclear whether such efforts have sufficiently evaluated biological and clinical heterogeneity in CLL. Methods To study the extent of genomic variability in CLL and the biological and clinical attributes of genomic classification in CLL, we evaluated 893 unique CLL samples from fifteen publicly available gene expression profiling datasets. We used unsupervised approaches to divide the data into subgroups, evaluated the biological pathways and genetic aberrations that were associated with the subgroups, and compared prognostic and clinical outcome data between the subgroups. Results Using an unsupervised approach, we determined that approximately 600 CLL samples are needed to define the spectrum of diversity in CLL genomic expression. We identified seven genomically-defined CLL subgroups that have distinct biological properties, are associated with specific chromosomal deletions and amplifications, and have marked differences in molecular prognostic markers and clinical outcomes. Conclusions Our results indicate that investigations focusing on small numbers of patient samples likely provide a biased outlook on CLL biology. These findings may have important implications in identifying patients who should be treated with specific targeted therapies, which could have efficacy against CLL cells that rely on specific biological pathways. PMID:23468975
Clinical and biological relevance of genomic heterogeneity in chronic lymphocytic leukemia.
Friedman, Daphne R; Lucas, Joseph E; Weinberg, J Brice
2013-01-01
Chronic lymphocytic leukemia (CLL) is typically regarded as an indolent B-cell malignancy. However, there is wide variability with regards to need for therapy, time to progressive disease, and treatment response. This clinical variability is due, in part, to biological heterogeneity between individual patients' leukemias. While much has been learned about this biological variation using genomic approaches, it is unclear whether such efforts have sufficiently evaluated biological and clinical heterogeneity in CLL. To study the extent of genomic variability in CLL and the biological and clinical attributes of genomic classification in CLL, we evaluated 893 unique CLL samples from fifteen publicly available gene expression profiling datasets. We used unsupervised approaches to divide the data into subgroups, evaluated the biological pathways and genetic aberrations that were associated with the subgroups, and compared prognostic and clinical outcome data between the subgroups. Using an unsupervised approach, we determined that approximately 600 CLL samples are needed to define the spectrum of diversity in CLL genomic expression. We identified seven genomically-defined CLL subgroups that have distinct biological properties, are associated with specific chromosomal deletions and amplifications, and have marked differences in molecular prognostic markers and clinical outcomes. Our results indicate that investigations focusing on small numbers of patient samples likely provide a biased outlook on CLL biology. These findings may have important implications in identifying patients who should be treated with specific targeted therapies, which could have efficacy against CLL cells that rely on specific biological pathways.
Genomic markers for decision making: what is preventing us from using markers?
Coyle, Vicky M; Johnston, Patrick G
2010-02-01
The advent of novel genomic technologies that enable the evaluation of genomic alterations on a genome-wide scale has significantly altered the field of genomic marker research in solid tumors. Researchers have moved away from the traditional model of identifying a particular genomic alteration and evaluating the association between this finding and a clinical outcome measure to a new approach involving the identification and measurement of multiple genomic markers simultaneously within clinical studies. This in turn has presented additional challenges in considering the use of genomic markers in oncology, such as clinical study design, reproducibility and interpretation and reporting of results. This Review will explore these challenges, focusing on microarray-based gene-expression profiling, and highlights some common failings in study design that have impacted on the use of putative genomic markers in the clinic. Despite these rapid technological advances there is still a paucity of genomic markers in routine clinical use at present. A rational and focused approach to the evaluation and validation of genomic markers is needed, whereby analytically validated markers are investigated in clinical studies that are adequately powered and have pre-defined patient populations and study endpoints. Furthermore, novel adaptive clinical trial designs, incorporating putative genomic markers into prospective clinical trials, will enable the evaluation of these markers in a rigorous and timely fashion. Such approaches have the potential to facilitate the implementation of such markers into routine clinical practice and consequently enable the rational and tailored use of cancer therapies for individual patients.
Camps, Jordi; Nguyen, Quang Tri; Padilla-Nash, Hesed M.; Knutsen, Turid; McNeil, Nicole E.; Wangsa, Danny; Hummon, Amanda B.; Grade, Marian; Ried, Thomas; Difilippantonio, Michael J.
2016-01-01
To evaluate the mechanisms and consequences of chromosomal aberrations in colorectal cancer (CRC), we used a combination of spectral karyotyping, array comparative genomic hybridization (aCGH), and array-based global gene expression profiling on 31 primary carcinomas and 15 established cell lines. Importantly, aCGH showed that the genomic profiles of primary tumors are recapitulated in the cell lines. We revealed a preponderance of chromosome breakpoints at sites of copy number variants (CNVs) in the CRC cell lines, a novel mechanism of DNA breakage in cancer. The integration of gene expression and aCGH led to the identification of 157 genes localized within high-level copy number changes whose transcriptional deregulation was significantly affected across all of the samples, thereby suggesting that these genes play a functional role in CRC. Genomic amplification at 8q24 was the most recurrent event and led to the overexpression of MYC and FAM84B. Copy number dependent gene expression resulted in deregulation of known cancer genes such as APC, FGFR2, and ERBB2. The identification of only 36 genes whose localization near a breakpoint could account for their observed deregulated expression demonstrates that the major mechanism for transcriptional deregulation in CRC is genomic copy number changes resulting from chromosomal aberrations. PMID:19691111
Ziemke, Michael; Patil, Tejas; Nolan, Kyle; Tippimanchai, Darinee; Malkoski, Stephen P
2017-07-01
Smad4 is a tumor suppressor that transduces transforming growth factor beta signaling and regulates genomic stability. We previously found that Smad4 knockdown in vitro inhibited DNA repair and increased sensitivity to DNA topoisomerase inhibitors. In this study, we assessed the association between reduced Smad4 expression and DNA topoisomerase inhibitor sensitivity in human non-small cell lung cancer (NSCLC) patients and evaluated the relationship between genomic alterations of Smad4 and molecular alterations in DNA repair molecules. We retrospectively identified NSCLC patients who received etoposide or gemcitabine. Chemotherapeutic response was quantified by RECIST 1.1 criteria and Smad4 expression was assessed by immunohistochemistry. Relationships between Smad4 mutation and DNA repair molecule mutations were evaluated using publically available datasets. We identified 28 individuals who received 30 treatments with gemcitabine or etoposide containing regimens for NSCLC. Reduced Smad4 expression was seen in 13/28 patients and was not associated with significant differences in clinical or pathologic parameters. Patients with reduced Smad4 expression had a larger response to DNA topoisomerase inhibitor containing regimens then patients with high Smad4 expression (-25.7% vs. -6.8% in lesion size, p=0.03); this relationship was more pronounced with gemcitabine containing regimens. The overall treatment response was higher in patients with reduced Smad4 expression (8/14 vs 2/16 p=0.02). Analysis of data from The Cancer Genome Atlas revealed that Smad4 mutation or homozygous loss was mutually exclusive with genomic alterations in DNA repair molecules. Reduced Smad4 expression may predict responsiveness to regimens that contain DNA topoisomerase inhibitors. That Smad4 signaling alterations are mutually exclusive with alterations in DNA repair machinery is consistent with an important role of Smad4 in regulating DNA repair. Copyright © 2017 Elsevier B.V. All rights reserved.
Jung, Ki-Hong; Dardick, Christopher; Bartley, Laura E; Cao, Peijian; Phetsom, Jirapa; Canlas, Patrick; Seo, Young-Su; Shultz, Michael; Ouyang, Shu; Yuan, Qiaoping; Frank, Bryan C; Ly, Eugene; Zheng, Li; Jia, Yi; Hsia, An-Ping; An, Kyungsook; Chou, Hui-Hsien; Rocke, David; Lee, Geun Cheol; Schnable, Patrick S; An, Gynheung; Buell, C Robin; Ronald, Pamela C
2008-10-06
Studies of gene function are often hampered by gene-redundancy, especially in organisms with large genomes such as rice (Oryza sativa). We present an approach for using transcriptomics data to focus functional studies and address redundancy. To this end, we have constructed and validated an inexpensive and publicly available rice oligonucleotide near-whole genome array, called the rice NSF45K array. We generated expression profiles for light- vs. dark-grown rice leaf tissue and validated the biological significance of the data by analyzing sources of variation and confirming expression trends with reverse transcription polymerase chain reaction. We examined trends in the data by evaluating enrichment of gene ontology terms at multiple false discovery rate thresholds. To compare data generated with the NSF45K array with published results, we developed publicly available, web-based tools (www.ricearray.org). The Oligo and EST Anatomy Viewer enables visualization of EST-based expression profiling data for all genes on the array. The Rice Multi-platform Microarray Search Tool facilitates comparison of gene expression profiles across multiple rice microarray platforms. Finally, we incorporated gene expression and biochemical pathway data to reduce the number of candidate gene products putatively participating in the eight steps of the photorespiration pathway from 52 to 10, based on expression levels of putatively functionally redundant genes. We confirmed the efficacy of this method to cope with redundancy by correctly predicting participation in photorespiration of a gene with five paralogs. Applying these methods will accelerate rice functional genomics.
Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data.
Tintle, Nathan L; Sitarik, Alexandra; Boerema, Benjamin; Young, Kylie; Best, Aaron A; Dejongh, Matthew
2012-08-08
Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed. We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size. Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.
Generation of Knock-in Mouse by Genome Editing.
Fujii, Wataru
2017-01-01
Knock-in mice are useful for evaluating endogenous gene expressions and functions in vivo. Instead of the conventional gene-targeting method using embryonic stem cells, an exogenous DNA sequence can be inserted into the target locus in the zygote using genome editing technology. In this chapter, I describe the generation of epitope-tagged mice using engineered endonuclease and single-stranded oligodeoxynucleotide through the mouse zygote as an example of how to generate a knock-in mouse by genome editing.
Detection of DNA Methylation by Whole-Genome Bisulfite Sequencing.
Li, Qing; Hermanson, Peter J; Springer, Nathan M
2018-01-01
DNA methylation plays an important role in the regulation of the expression of transposons and genes. Various methods have been developed to assay DNA methylation levels. Bisulfite sequencing is considered to be the "gold standard" for single-base resolution measurement of DNA methylation levels. Coupled with next-generation sequencing, whole-genome bisulfite sequencing (WGBS) allows DNA methylation to be evaluated at a genome-wide scale. Here, we described a protocol for WGBS in plant species with large genomes. This protocol has been successfully applied to assay genome-wide DNA methylation levels in maize and barley. This protocol has also been successfully coupled with sequence capture technology to assay DNA methylation levels in a targeted set of genomic regions.
Kaur, Navneet; Hasegawa, Daniel K; Ling, Kai-Shu; Wintermantel, William M
2016-10-01
The relationships between plant viruses and their vectors have evolved over the millennia, and yet, studies on viruses began <150 years ago and investigations into the virus and vector interactions even more recently. The advent of next generation sequencing, including rapid genome and transcriptome analysis, methods for evaluation of small RNAs, and the related disciplines of proteomics and metabolomics offer a significant shift in the ability to elucidate molecular mechanisms involved in virus infection and transmission by insect vectors. Genomic technologies offer an unprecedented opportunity to examine the response of insect vectors to the presence of ingested viruses through gene expression changes and altered biochemical pathways. This review focuses on the interactions between viruses and their whitefly or thrips vectors and on potential applications of genomics-driven control of the insect vectors. Recent studies have evaluated gene expression in vectors during feeding on plants infected with begomoviruses, criniviruses, and tospoviruses, which exhibit very different types of virus-vector interactions. These studies demonstrate the advantages of genomics and the potential complementary studies that rapidly advance our understanding of the biology of virus transmission by insect vectors and offer additional opportunities to design novel genetic strategies to manage insect vectors and the viruses they transmit.
Garcia-Bloj, Benjamin; Fry, Jacqueline; Wichmann, Ignacio
2015-01-01
Gastric cancer is the fifth most common cancer and the third leading cause of cancer-related death, whose patterns vary among geographical regions and ethnicities. It is a multifactorial disease, and its development depends on infection by Helicobacter pylori (H. pylori) and Epstein-Barr virus (EBV), host genetic factors, and environmental factors. The heterogeneity of the disease has begun to be unraveled by a comprehensive mutational evaluation of primary tumors. The low-abundance of mutations suggests that other mechanisms participate in the evolution of the disease, such as those found through analyses of noncoding genomics. Noncoding genomics includes single nucleotide polymorphisms (SNPs), regulation of gene expression through DNA methylation of promoter sites, miRNAs, other noncoding RNAs in regulatory regions, and other topics. These processes and molecules ultimately control gene expression. Potential biomarkers are appearing from analyses of noncoding genomics. This review focuses on noncoding genomics and potential biomarkers in the context of gastric cancer and the gastric precancerous cascade. PMID:26379360
Park, Minji; Cho, Yong-Joon; Lee, Yang Won; Jung, Won Hee
2017-03-01
Malassezia species are opportunistic pathogenic fungi that are frequently associated with seborrhoeic dermatitis, including dandruff. Most Malassezia species are lipid dependent, a property that is compensated by breaking down host sebum into fatty acids by lipases. In this study, we aimed to sequence and analyse the whole genome of Malassezia restricta KCTC 27527, a clinical isolate from a Korean patient with severe dandruff, to search for lipase orthologues and identify the lipase that is the most frequently expressed on the scalp of patients with dandruff. The genome of M. restricta KCTC 27527 was sequenced using the Illumina MiSeq and PacBio platforms. Lipase orthologues were identified by comparison with known lipase genes in the genomes of Malassezia globosa and Malassezia sympodialis. The expression of the identified lipase genes was directly evaluated in swab samples from the scalps of 56 patients with dandruff. We found that, among the identified lipase-encoding genes, the gene encoding lipase homolog MRES_03670, named LIP5 in this study, was the most frequently expressed lipase in the swab samples. Our study provides an overview of the genome of a clinical isolate of M. restricta and fundamental information for elucidating the role of lipases during fungus-host interaction. © 2016 Blackwell Verlag GmbH.
Distinct contributions of replication and transcription to mutation rate variation of human genomes.
Cui, Peng; Ding, Feng; Lin, Qiang; Zhang, Lingfang; Li, Ang; Zhang, Zhang; Hu, Songnian; Yu, Jun
2012-02-01
Here, we evaluate the contribution of two major biological processes--DNA replication and transcription--to mutation rate variation in human genomes. Based on analysis of the public human tissue transcriptomics data, high-resolution replicating map of Hela cells and dbSNP data, we present significant correlations between expression breadth, replication time in local regions and SNP density. SNP density of tissue-specific (TS) genes is significantly higher than that of housekeeping (HK) genes. TS genes tend to locate in late-replicating genomic regions and genes in such regions have a higher SNP density compared to those in early-replication regions. In addition, SNP density is found to be positively correlated with expression level among HK genes. We conclude that the process of DNA replication generates stronger mutational pressure than transcription-associated biological processes do, resulting in an increase of mutation rate in TS genes while having weaker effects on HK genes. In contrast, transcription-associated processes are mainly responsible for the accumulation of mutations in highly-expressed HK genes. Copyright © 2012 Beijing Genomics Institute. Published by Elsevier Ltd. All rights reserved.
SPP1 and AGER as potential prognostic biomarkers for lung adenocarcinoma.
Zhang, Weiguo; Fan, Junli; Chen, Qiang; Lei, Caipeng; Qiao, Bin; Liu, Qin
2018-05-01
Overdue treatment and prognostic evaluation lead to low survival rates in patients with lung adenocarcinoma (LUAD). To date, effective biomarkers for prognosis are still required. The aim of the present study was to screen differentially expressed genes (DEGs) as biomarkers for prognostic evaluation of LUAD. DEGs in tumor and normal samples were identified and analyzed for Kyoto Encyclopedia of Genes and Genomes/Gene Ontology functional enrichments. The common genes that are up and downregulated were selected for prognostic analysis using RNAseq data in The Cancer Genome Atlas. Differential expression analysis was performed with 164 samples in GSE10072 and GSE7670 datasets. A total of 484 DEGs that were present in GSE10072 and GSE7670 datasets were screened, including secreted phosphoprotein 1 (SPP1) that was highly expressed and DEGs ficolin 3, advanced glycosylation end-product specific receptor (AGER), transmembrane protein 100 that were lowly expressed in tumor tissues. These four key genes were subsequently verified using an independent dataset, GSE19804. The gene expression model was consistent with GSE10072 and GSE7670 datasets. The dysregulation of highly expressed SPP1 and lowly expressed AGER significantly reduced the median survival time of patients with LUAD. These findings suggest that SPP1 and AGER are risk factors for LUAD, and these two genes may be utilized in the prognostic evaluation of patients with LUAD. Additionally, the key genes and functional enrichments may provide a reference for investigating the molecular expression mechanisms underlying LUAD.
USDA-ARS?s Scientific Manuscript database
Oligionucleotide microarrays (GeneChip Bovine Genome Arrays, Affymetrix Inc., Santa Clara, CA) were used to evaluate gene expression profiles in anterior pituitary glands collected from 4 anestrous and 4 cycling postpartum primiparous beef cows to provide insight into genes associated with transitio...
USDA-ARS?s Scientific Manuscript database
Common bean (Phaseolus vulgaris) and soybean (Glycine max) both belong to the Phaseoleae tribe and share significant coding sequence homology. This suggests that the GeneChip(R) Soybean Genome Array (soybean GeneChip) may be used for gene expression studies using common bean. To evaluate the utility...
Wu, Lang; Shi, Wei; Long, Jirong; Guo, Xingyi; Michailidou, Kyriaki; Beesley, Jonathan; Bolla, Manjeet K; Shu, Xiao-Ou; Lu, Yingchang; Cai, Qiuyin; Al-Ejeh, Fares; Rozali, Esdy; Wang, Qin; Dennis, Joe; Li, Bingshan; Zeng, Chenjie; Feng, Helian; Gusev, Alexander; Barfield, Richard T; Andrulis, Irene L; Anton-Culver, Hoda; Arndt, Volker; Aronson, Kristan J; Auer, Paul L; Barrdahl, Myrto; Baynes, Caroline; Beckmann, Matthias W; Benitez, Javier; Bermisheva, Marina; Blomqvist, Carl; Bogdanova, Natalia V; Bojesen, Stig E; Brauch, Hiltrud; Brenner, Hermann; Brinton, Louise; Broberg, Per; Brucker, Sara Y; Burwinkel, Barbara; Caldés, Trinidad; Canzian, Federico; Carter, Brian D; Castelao, J Esteban; Chang-Claude, Jenny; Chen, Xiaoqing; Cheng, Ting-Yuan David; Christiansen, Hans; Clarke, Christine L; Collée, Margriet; Cornelissen, Sten; Couch, Fergus J; Cox, David; Cox, Angela; Cross, Simon S; Cunningham, Julie M; Czene, Kamila; Daly, Mary B; Devilee, Peter; Doheny, Kimberly F; Dörk, Thilo; Dos-Santos-Silva, Isabel; Dumont, Martine; Dwek, Miriam; Eccles, Diana M; Eilber, Ursula; Eliassen, A Heather; Engel, Christoph; Eriksson, Mikael; Fachal, Laura; Fasching, Peter A; Figueroa, Jonine; Flesch-Janys, Dieter; Fletcher, Olivia; Flyger, Henrik; Fritschi, Lin; Gabrielson, Marike; Gago-Dominguez, Manuela; Gapstur, Susan M; García-Closas, Montserrat; Gaudet, Mia M; Ghoussaini, Maya; Giles, Graham G; Goldberg, Mark S; Goldgar, David E; González-Neira, Anna; Guénel, Pascal; Hahnen, Eric; Haiman, Christopher A; Håkansson, Niclas; Hall, Per; Hallberg, Emily; Hamann, Ute; Harrington, Patricia; Hein, Alexander; Hicks, Belynda; Hillemanns, Peter; Hollestelle, Antoinette; Hoover, Robert N; Hopper, John L; Huang, Guanmengqian; Humphreys, Keith; Hunter, David J; Jakubowska, Anna; Janni, Wolfgang; John, Esther M; Johnson, Nichola; Jones, Kristine; Jones, Michael E; Jung, Audrey; Kaaks, Rudolf; Kerin, Michael J; Khusnutdinova, Elza; Kosma, Veli-Matti; Kristensen, Vessela N; Lambrechts, Diether; Le Marchand, Loic; Li, Jingmei; Lindström, Sara; Lissowska, Jolanta; Lo, Wing-Yee; Loibl, Sibylle; Lubinski, Jan; Luccarini, Craig; Lux, Michael P; MacInnis, Robert J; Maishman, Tom; Kostovska, Ivana Maleva; Mannermaa, Arto; Manson, JoAnn E; Margolin, Sara; Mavroudis, Dimitrios; Meijers-Heijboer, Hanne; Meindl, Alfons; Menon, Usha; Meyer, Jeffery; Mulligan, Anna Marie; Neuhausen, Susan L; Nevanlinna, Heli; Neven, Patrick; Nielsen, Sune F; Nordestgaard, Børge G; Olopade, Olufunmilayo I; Olson, Janet E; Olsson, Håkan; Peterlongo, Paolo; Peto, Julian; Plaseska-Karanfilska, Dijana; Prentice, Ross; Presneau, Nadege; Pylkäs, Katri; Rack, Brigitte; Radice, Paolo; Rahman, Nazneen; Rennert, Gad; Rennert, Hedy S; Rhenius, Valerie; Romero, Atocha; Romm, Jane; Rudolph, Anja; Saloustros, Emmanouil; Sandler, Dale P; Sawyer, Elinor J; Schmidt, Marjanka K; Schmutzler, Rita K; Schneeweiss, Andreas; Scott, Rodney J; Scott, Christopher G; Seal, Sheila; Shah, Mitul; Shrubsole, Martha J; Smeets, Ann; Southey, Melissa C; Spinelli, John J; Stone, Jennifer; Surowy, Harald; Swerdlow, Anthony J; Tamimi, Rulla M; Tapper, William; Taylor, Jack A; Terry, Mary Beth; Tessier, Daniel C; Thomas, Abigail; Thöne, Kathrin; Tollenaar, Rob A E M; Torres, Diana; Truong, Thérèse; Untch, Michael; Vachon, Celine; Van Den Berg, David; Vincent, Daniel; Waisfisz, Quinten; Weinberg, Clarice R; Wendt, Camilla; Whittemore, Alice S; Wildiers, Hans; Willett, Walter C; Winqvist, Robert; Wolk, Alicja; Xia, Lucy; Yang, Xiaohong R; Ziogas, Argyrios; Ziv, Elad; Dunning, Alison M; Pharoah, Paul D P; Simard, Jacques; Milne, Roger L; Edwards, Stacey L; Kraft, Peter; Easton, Douglas F; Chenevix-Trench, Georgia; Zheng, Wei
2018-06-18
The breast cancer risk variants identified in genome-wide association studies explain only a small fraction of the familial relative risk, and the genes responsible for these associations remain largely unknown. To identify novel risk loci and likely causal genes, we performed a transcriptome-wide association study evaluating associations of genetically predicted gene expression with breast cancer risk in 122,977 cases and 105,974 controls of European ancestry. We used data from the Genotype-Tissue Expression Project to establish genetic models to predict gene expression in breast tissue and evaluated model performance using data from The Cancer Genome Atlas. Of the 8,597 genes evaluated, significant associations were identified for 48 at a Bonferroni-corrected threshold of P < 5.82 × 10 -6 , including 14 genes at loci not yet reported for breast cancer. We silenced 13 genes and showed an effect for 11 on cell proliferation and/or colony-forming efficiency. Our study provides new insights into breast cancer genetics and biology.
Tsuchiya, Masa; Giuliani, Alessandro; Hashimoto, Midori; Erenpreisa, Jekaterina; Yoshikawa, Kenichi
2016-01-01
Background A fundamental issue in bioscience is to understand the mechanism that underlies the dynamic control of genome-wide expression through the complex temporal-spatial self-organization of the genome to regulate the change in cell fate. We address this issue by elucidating a physically motivated mechanism of self-organization. Principal Findings Building upon transcriptome experimental data for seven distinct cell fates, including early embryonic development, we demonstrate that self-organized criticality (SOC) plays an essential role in the dynamic control of global gene expression regulation at both the population and single-cell levels. The novel findings are as follows: i) Mechanism of cell-fate changes: A sandpile-type critical transition self-organizes overall expression into a few transcription response domains (critical states). A cell-fate change occurs by means of a dissipative pulse-like global perturbation in self-organization through the erasure of initial-state critical behaviors (criticality). Most notably, the reprogramming of early embryo cells destroys the zygote SOC control to initiate self-organization in the new embryonal genome, which passes through a stochastic overall expression pattern. ii) Mechanism of perturbation of SOC controls: Global perturbations in self-organization involve the temporal regulation of critical states. Quantitative evaluation of this perturbation in terminal cell fates reveals that dynamic interactions between critical states determine the critical-state coherent regulation. The occurrence of a temporal change in criticality perturbs this between-states interaction, which directly affects the entire genomic system. Surprisingly, a sub-critical state, corresponding to an ensemble of genes that shows only marginal changes in expression and consequently are considered to be devoid of any interest, plays an essential role in generating a global perturbation in self-organization directed toward the cell-fate change. Conclusion and Significance ‘Whole-genome’ regulation of gene expression through self-regulatory SOC control complements gene-by-gene fine tuning and represents a still largely unexplored non-equilibrium statistical mechanism that is responsible for the massive reprogramming of genome expression. PMID:27997556
Genome-Wide Identification and Expression Analysis of WRKY Gene Family in Capsicum annuum L.
Diao, Wei-Ping; Snyder, John C; Wang, Shu-Bin; Liu, Jin-Bing; Pan, Bao-Gui; Guo, Guang-Jun; Wei, Ge
2016-01-01
The WRKY family of transcription factors is one of the most important families of plant transcriptional regulators with members regulating multiple biological processes, especially in regulating defense against biotic and abiotic stresses. However, little information is available about WRKYs in pepper (Capsicum annuum L.). The recent release of completely assembled genome sequences of pepper allowed us to perform a genome-wide investigation for pepper WRKY proteins. In the present study, a total of 71 WRKY genes were identified in the pepper genome. According to structural features of their encoded proteins, the pepper WRKY genes (CaWRKY) were classified into three main groups, with the second group further divided into five subgroups. Genome mapping analysis revealed that CaWRKY were enriched on four chromosomes, especially on chromosome 1, and 15.5% of the family members were tandemly duplicated genes. A phylogenetic tree was constructed depending on WRKY domain' sequences derived from pepper and Arabidopsis. The expression of 21 selected CaWRKY genes in response to seven different biotic and abiotic stresses (salt, heat shock, drought, Phytophtora capsici, SA, MeJA, and ABA) was evaluated by quantitative RT-PCR; Some CaWRKYs were highly expressed and up-regulated by stress treatment. Our results will provide a platform for functional identification and molecular breeding studies of WRKY genes in pepper.
USDA-ARS?s Scientific Manuscript database
A comprehensive transcriptome survey, or “Gene Atlas,” provides information essential for a complete understanding of the genomic biology of an organism. Using a digital gene expression approach, we developed a Gene Atlas of RNA abundance in 92 adult, juvenile and fetal cattle tissues. The samples...
Validation of reference genes for gene expression studies in soybean aphid, Aphis glycines Matsumura
USDA-ARS?s Scientific Manuscript database
Quantitative real-time PCR (qRT-PCR) is a common tool for quantifying mRNA transcripts. To normalize results, a reference gene is mandatory. Aphis glycines is a significant soybean pest, yet gene expression and functional genomics studies are hindered by a lack of stable reference genes. We evalu...
Unsupervised Outlier Profile Analysis
Ghosh, Debashis; Li, Song
2014-01-01
In much of the analysis of high-throughput genomic data, “interesting” genes have been selected based on assessment of differential expression between two groups or generalizations thereof. Most of the literature focuses on changes in mean expression or the entire distribution. In this article, we explore the use of C(α) tests, which have been applied in other genomic data settings. Their use for the outlier expression problem, in particular with continuous data, is problematic but nevertheless motivates new statistics that give an unsupervised analog to previously developed outlier profile analysis approaches. Some simulation studies are used to evaluate the proposal. A bivariate extension is described that can accommodate data from two platforms on matched samples. The proposed methods are applied to data from a prostate cancer study. PMID:25452686
Kakitani, Makoto; Oshima, Takeshi; Horikoshi, Kaori; Yoshitome, Tetsuo; Ueda, Akiko; Kajikawa, Miwa; Iba, Yumi; Ozone, Yoshinao; Ijima, Yuki; Yoshino, Tohko; Itoh, Mikiko; Seki, Sachiko; Aoki, Ayako; Ishihara, Toshie; Shionoya, Michiyo; Makino, Utako; Kitada, Rina; Ohguma, Atsuko; Ohta, Takami; Yoshida, Yoshimasa; Kudoh, Hiroe; Hanaoka, Kazunori; Sibuya, Kazunori; Ishida, Isao; Kakeda, Minoru; Yagi, Mikio; Yoneya, Takashi; Tomizuka, Kazuma
2005-01-01
A major challenge of the post-genomic era is the functional characterization of anonymous open reading frames (ORFs) identified by the Human Genome Project. In this context, there is a strong requirement for the development of technologies that enhance our ability to analyze gene functions at the level of the whole organism. Here, we describe a rapid and efficient procedure to generate transgenic chimaeric mice that continuously secrete a foreign protein into the systemic circulation. The transgene units were inserted into the genomic site adjacent to the endogenous immunoglobulin (Ig) κ locus by homologous recombination, using a modified mouse embryonic stem (ES) cell line that exhibits a high frequency of homologous recombination at the Igκ region. The resultant ES clones were injected into embryos derived from a B-cell-deficient host strain, thus producing chimaerism-independent, B-cell-specific transgene expression. This feature of the system eliminates the time-consuming breeding typically implemented in standard transgenic strategies and allows for evaluating the effect of ectopic transgene expression directly in the resulting chimaeric mice. To demonstrate the utility of this system we showed high-level protein expression in the sera and severe phenotypes in human EPO (hEPO) and murine thrombopoietin (mTPO) transgenic chimaeras. PMID:15914664
2013-01-01
Background Modern banana cultivars are primarily interspecific triploid hybrids of two species, Musa acuminata and Musa balbisiana, which respectively contribute the A- and B-genomes. The M. balbisiana genome has been associated with improved vigour and tolerance to biotic and abiotic stresses and is thus a target for Musa breeding programs. However, while a reference M. acuminata genome has recently been released (Nature 488:213–217, 2012), little sequence data is available for the corresponding B-genome. To address these problems we carried out Next Generation gDNA sequencing of the wild diploid M. balbisiana variety ‘Pisang Klutuk Wulung’ (PKW). Our strategy was to align PKW gDNA reads against the published A-genome and to extract the mapped consensus sequences for subsequent rounds of evaluation and gene annotation. Results The resulting B-genome is 79% the size of the A-genome, and contains 36,638 predicted functional gene sequences which is nearly identical to the 36,542 of the A-genome. There is substantial sequence divergence from the A-genome at a frequency of 1 homozygous SNP per 23.1 bp, and a high degree of heterozygosity corresponding to one heterozygous SNP per 55.9 bp. Using expressed small RNA data, a similar number of microRNA sequences were predicted in both A- and B-genomes, but additional novel miRNAs were detected, including some that are unique to each genome. The usefulness of this B-genome sequence was evaluated by mapping RNA-seq data from a set of triploid AAA and AAB hybrids simultaneously to both genomes. Results for the plantains demonstrated the expected 2:1 distribution of reads across the A- and B-genomes, but for the AAA genomes, results show they contain regions of significant homology to the B-genome supporting proposals that there has been a history of interspecific recombination between homeologous A and B chromosomes in Musa hybrids. Conclusions We have generated and annotated a draft reference Musa B-genome and demonstrate that this can be used for molecular genetic mapping of gene transcripts and small RNA expression data from several allopolyploid banana cultivars. This draft therefore represents a valuable resource to support the study of metabolism in inter- and intraspecific triploid Musa hybrids and to help direct breeding programs. PMID:24094114
Comparative systems biology across an evolutionary gradient within the Shewanella genus.
Konstantinidis, Konstantinos T; Serres, Margrethe H; Romine, Margaret F; Rodrigues, Jorge L M; Auchtung, Jennifer; McCue, Lee-Ann; Lipton, Mary S; Obraztsova, Anna; Giometti, Carol S; Nealson, Kenneth H; Fredrickson, James K; Tiedje, James M
2009-09-15
To what extent genotypic differences translate to phenotypic variation remains a poorly understood issue of paramount importance for several cornerstone concepts of microbiology including the species definition. Here, we take advantage of the completed genomic sequences, expressed proteomic profiles, and physiological studies of 10 closely related Shewanella strains and species to provide quantitative insights into this issue. Our analyses revealed that, despite extensive horizontal gene transfer within these genomes, the genotypic and phenotypic similarities among the organisms were generally predictable from their evolutionary relatedness. The power of the predictions depended on the degree of ecological specialization of the organisms evaluated. Using the gradient of evolutionary relatedness formed by these genomes, we were able to partly isolate the effect of ecology from that of evolutionary divergence and to rank the different cellular functions in terms of their rates of evolution. Our ranking also revealed that whole-cell protein expression differences among these organisms, when the organisms were grown under identical conditions, were relatively larger than differences at the genome level, suggesting that similarity in gene regulation and expression should constitute another important parameter for (new) species description. Collectively, our results provide important new information toward beginning a systems-level understanding of bacterial species and genera.
Bourdon-Lacombe, Julie A; Moffat, Ivy D; Deveau, Michelle; Husain, Mainul; Auerbach, Scott; Krewski, Daniel; Thomas, Russell S; Bushel, Pierre R; Williams, Andrew; Yauk, Carole L
2015-07-01
Toxicogenomics promises to be an important part of future human health risk assessment of environmental chemicals. The application of gene expression profiles (e.g., for hazard identification, chemical prioritization, chemical grouping, mode of action discovery, and quantitative analysis of response) is growing in the literature, but their use in formal risk assessment by regulatory agencies is relatively infrequent. Although additional validations for specific applications are required, gene expression data can be of immediate use for increasing confidence in chemical evaluations. We believe that a primary reason for the current lack of integration is the limited practical guidance available for risk assessment specialists with limited experience in genomics. The present manuscript provides basic information on gene expression profiling, along with guidance on evaluating the quality of genomic experiments and data, and interpretation of results presented in the form of heat maps, pathway analyses and other common approaches. Moreover, potential ways to integrate information from gene expression experiments into current risk assessment are presented using published studies as examples. The primary objective of this work is to facilitate integration of gene expression data into human health risk assessments of environmental chemicals. Crown Copyright © 2015. Published by Elsevier Inc. All rights reserved.
Calcium Signaling Pathway Genes RUNX2 and CACNA1C Are Associated With Calcific Aortic Valve Disease
Guauque-Olarte, Sandra; Messika-Zeitoun, David; Droit, Arnaud; Lamontagne, Maxime; Tremblay-Marchand, Joël; Lavoie-Charland, Emilie; Gaudreault, Nathalie; Arsenault, Benoit J.; Dubé, Marie-Pierre; Tardif, Jean-Claude; Body, Simon C.; Seidman, Jonathan G.; Boileau, Catherine; Mathieu, Patrick; Pibarot, Philippe; Bossé, Yohan
2016-01-01
Background Calcific aortic valve stenosis (AS) is a life-threatening disease with no medical therapy. The genetic architecture of AS remains elusive. This study combines genome-wide association studies, gene expression, and expression quantitative trait loci mapping in human valve tissues to identify susceptibility genes of AS. Methods and Results A meta-analysis was performed combining the results of 2 genome-wide association studies in 474 and 486 cases from Quebec City (Canada) and Paris (France), respectively. Corresponding controls consisted of 2988 and 1864 individuals with European ancestry from the database of genotypes and phenotypes. mRNA expression levels were evaluated in 9 calcified and 8 normal aortic valves by RNA sequencing. The results were integrated with valve expression quantitative trait loci data obtained from 22 AS patients. Twenty-five single-nucleotide polymorphisms had P<5×10−6 in the genome-wide association studies meta-analysis. The calcium signaling pathway was the top gene set enriched for genes mapped to moderately AS-associated single-nucleotide polymorphisms. Genes in this pathway were found differentially expressed in valves with and without AS. Two single-nucleotide polymorphisms located in RUNX2 (runt-related transcription factor 2), encoding an osteogenic transcription factor, demonstrated some association with AS (genome-wide association studies P=5.33×10−5). The mRNA expression levels of RUNX2 were upregulated in calcified valves and associated with eQTL-SNPs. CACNA1C encoding a subunit of a voltage-dependent calcium channel was upregulated in calcified valves. The eQTL-SNP with the most significant association with AS located in CACNA1C was associated with higher expression of the gene. Conclusions This integrative genomic study confirmed the role of RUNX2 as a potential driver of AS and identified a new AS susceptibility gene, CACNA1C, belonging to the calcium signaling pathway. PMID:26553695
Altobelli, Gioia; Bogdarina, Irina G; Stupka, Elia; Clark, Adrian J L; Langley-Evans, Simon
2013-01-01
A large body of evidence from human and animal studies demonstrates that the maternal diet during pregnancy can programme physiological and metabolic functions in the developing fetus, effectively determining susceptibility to later disease. The mechanistic basis of such programming is unclear but may involve resetting of epigenetic marks and fetal gene expression. The aim of this study was to evaluate genome-wide DNA methylation and gene expression in the livers of newborn rats exposed to maternal protein restriction. On day one postnatally, there were 618 differentially expressed genes and 1183 differentially methylated regions (FDR 5%). The functional analysis of differentially expressed genes indicated a significant effect on DNA repair/cycle/maintenance functions and of lipid, amino acid metabolism and circadian functions. Enrichment for known biological functions was found to be associated with differentially methylated regions. Moreover, these epigenetically altered regions overlapped genetic loci associated with metabolic and cardiovascular diseases. Both expression changes and DNA methylation changes were largely reversed by supplementing the protein restricted diet with folic acid. Although the epigenetic and gene expression signatures appeared to underpin largely different biological processes, the gene expression profile of DNA methyl transferases was altered, providing a potential link between the two molecular signatures. The data showed that maternal protein restriction is associated with widespread differential gene expression and DNA methylation across the genome, and that folic acid is able to reset both molecular signatures.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gallaher, Sean D.; Fitz-Gibbon, Sorel T.; Strenkert, Daniela
Chlamydomonas reinhardtii is a unicellular chlorophyte alga that is widely studied as a reference organism for understanding photosynthesis, sensory and motile cilia, and for development of an algal-based platform for producing biofuels and bio-products. Its highly repetitive, ~205-kbp circular chloroplast genome and ~15.8-kbp linear mitochondrial genome were sequenced prior to the advent of high-throughput sequencing technologies. Here, high coverage shotgun sequencing was used to assemble both organellar genomes de novo. These new genomes correct dozens of errors in the prior genome sequences and annotations. Gen-ome sequencing coverage indicates that each cell contains on average 83 copies of the chloroplast genomemore » and 130 copies of the mitochondrial genome. Using protocols and analyses optimized for organellar tran-scripts, RNA-Seq was used to quantify their relative abundances across 12 different growth conditions. Forty-six percent of total cellular mRNA is attributable to high expression from a few dozen chloroplast genes. RNA-Seq data were used to guide gene annotation, to demonstrate polycistronic gene expression, and to quantify splicing of psaA and psbA introns. In contrast to a conclusion from a recent study, we found that chloroplast transcripts are not edited. Unexpectedly, cytosine-rich polynucleotide tails were observed at the 3’-end of all mitochondrial transcripts. A comparative genomics analysis of eight laboratory strains and 11 wild isolates of C. reinhardtii identified 2658 variants in the organellargenomes, which is 1/10th as much genetic diversity as is found in the nucleus.« less
Javid, Mahsa; Sasanakietkul, Thanyawat; Nicolson, Norman G; Gibson, Courtney E; Callender, Glenda G; Korah, Reju; Carling, Tobias
2018-02-01
Efficient DNA damage repair by MutL-homolog DNA mismatch repair (MMR) enzymes, MLH1, MLH3, PMS1 and PMS2, are required to maintain thyrocyte genomic integrity. We hypothesized that persistent oxidative stress and consequent transcriptional dysregulation observed in thyroid follicles will lead to MMR deficiency and potentiate papillary thyroid tumorigenesis. MMR gene expression was analyzed by targeted microarray in 18 papillary thyroid cancer (PTC), 9 paracarcinoma normal thyroid (PCNT) and 10 normal thyroid (NT) samples. The findings were validated by qRT-PCR, and in follicular thyroid cancers (FTC) and follicular thyroid adenomas (FTA) for comparison. FOXO transcription factor expression was also analyzed. Protein expression was assessed by immunohistochemistry. Genomic integrity was evaluated by whole-exome sequencing-derived read-depth analysis and Mann-Whitney U test. Clinical correlations were assessed using Fisher's exact and t tests. Microarray and qRT-PCR revealed reduced expression of all four MMR genes in PTC compared with PCNT and of PMS2 compared with NT. FTC and FTA showed upregulation in MLH1, MLH3 and PMS2. PMS2 protein expression correlated with the mRNA expression pattern. FOXO1 showed lower expression in PMS2-deficient PTCs (log2-fold change -1.72 vs. -0.55, U = 11, p < 0.05 two-tailed). Rate of LOH, a measure of genomic instability, was higher in PMS2-deficient PTCs (median 3 and 1, respectively; U = 26, p < 0.05 two-tailed). No correlation was noted between MMR deficiency and clinical characteristics. MMR deficiency, potentially promoted by FOXO1 suppression, may explain the etiology for PTC development in some patients. FTC and FTA retain MMR activity and are likely caused by a different tumorigenic pathway.
Lee, Mikyung; Kim, Yangseok
2009-12-16
Genomic alterations frequently occur in many cancer patients and play important mechanistic roles in the pathogenesis of cancer. Furthermore, they can modify the expression level of genes due to altered copy number in the corresponding region of the chromosome. An accumulating body of evidence supports the possibility that strong genome-wide correlation exists between DNA content and gene expression. Therefore, more comprehensive analysis is needed to quantify the relationship between genomic alteration and gene expression. A well-designed bioinformatics tool is essential to perform this kind of integrative analysis. A few programs have already been introduced for integrative analysis. However, there are many limitations in their performance of comprehensive integrated analysis using published software because of limitations in implemented algorithms and visualization modules. To address this issue, we have implemented the Java-based program CHESS to allow integrative analysis of two experimental data sets: genomic alteration and genome-wide expression profile. CHESS is composed of a genomic alteration analysis module and an integrative analysis module. The genomic alteration analysis module detects genomic alteration by applying a threshold based method or SW-ARRAY algorithm and investigates whether the detected alteration is phenotype specific or not. On the other hand, the integrative analysis module measures the genomic alteration's influence on gene expression. It is divided into two separate parts. The first part calculates overall correlation between comparative genomic hybridization ratio and gene expression level by applying following three statistical methods: simple linear regression, Spearman rank correlation and Pearson's correlation. In the second part, CHESS detects the genes that are differentially expressed according to the genomic alteration pattern with three alternative statistical approaches: Student's t-test, Fisher's exact test and Chi square test. By successive operations of two modules, users can clarify how gene expression levels are affected by the phenotype specific genomic alterations. As CHESS was developed in both Java application and web environments, it can be run on a web browser or a local machine. It also supports all experimental platforms if a properly formatted text file is provided to include the chromosomal position of probes and their gene identifiers. CHESS is a user-friendly tool for investigating disease specific genomic alterations and quantitative relationships between those genomic alterations and genome-wide gene expression profiling.
G-cimp status prediction of glioblastoma samples using mRNA expression data.
Baysan, Mehmet; Bozdag, Serdar; Cam, Margaret C; Kotliarova, Svetlana; Ahn, Susie; Walling, Jennifer; Killian, Jonathan K; Stevenson, Holly; Meltzer, Paul; Fine, Howard A
2012-01-01
Glioblastoma Multiforme (GBM) is a tumor with high mortality and no known cure. The dramatic molecular and clinical heterogeneity seen in this tumor has led to attempts to define genetically similar subgroups of GBM with the hope of developing tumor specific therapies targeted to the unique biology within each of these subgroups. Recently, a subset of relatively favorable prognosis GBMs has been identified. These glioma CpG island methylator phenotype, or G-CIMP tumors, have distinct genomic copy number aberrations, DNA methylation patterns, and (mRNA) expression profiles compared to other GBMs. While the standard method for identifying G-CIMP tumors is based on genome-wide DNA methylation data, such data is often not available compared to the more widely available gene expression data. In this study, we have developed and evaluated a method to predict the G-CIMP status of GBM samples based solely on gene expression data.
G-Cimp Status Prediction Of Glioblastoma Samples Using mRNA Expression Data
Baysan, Mehmet; Bozdag, Serdar; Cam, Margaret C.; Kotliarova, Svetlana; Ahn, Susie; Walling, Jennifer; Killian, Jonathan K.; Stevenson, Holly; Meltzer, Paul; Fine, Howard A.
2012-01-01
Glioblastoma Multiforme (GBM) is a tumor with high mortality and no known cure. The dramatic molecular and clinical heterogeneity seen in this tumor has led to attempts to define genetically similar subgroups of GBM with the hope of developing tumor specific therapies targeted to the unique biology within each of these subgroups. Recently, a subset of relatively favorable prognosis GBMs has been identified. These glioma CpG island methylator phenotype, or G-CIMP tumors, have distinct genomic copy number aberrations, DNA methylation patterns, and (mRNA) expression profiles compared to other GBMs. While the standard method for identifying G-CIMP tumors is based on genome-wide DNA methylation data, such data is often not available compared to the more widely available gene expression data. In this study, we have developed and evaluated a method to predict the G-CIMP status of GBM samples based solely on gene expression data. PMID:23139755
Lamontagne, Maxime; Timens, Wim; Hao, Ke; Bossé, Yohan; Laviolette, Michel; Steiling, Katrina; Campbell, Joshua D; Couture, Christian; Conti, Massimo; Sherwood, Karen; Hogg, James C; Brandsma, Corry-Anke; van den Berge, Maarten; Sandford, Andrew; Lam, Stephen; Lenburg, Marc E; Spira, Avrum; Paré, Peter D; Nickle, David; Sin, Don D; Postma, Dirkje S
2014-11-01
COPD is a complex chronic disease with poorly understood pathogenesis. Integrative genomic approaches have the potential to elucidate the biological networks underlying COPD and lung function. We recently combined genome-wide genotyping and gene expression in 1111 human lung specimens to map expression quantitative trait loci (eQTL). To determine causal associations between COPD and lung function-associated single nucleotide polymorphisms (SNPs) and lung tissue gene expression changes in our lung eQTL dataset. We evaluated causality between SNPs and gene expression for three COPD phenotypes: FEV(1)% predicted, FEV(1)/FVC and COPD as a categorical variable. Different models were assessed in the three cohorts independently and in a meta-analysis. SNPs associated with a COPD phenotype and gene expression were subjected to causal pathway modelling and manual curation. In silico analyses evaluated functional enrichment of biological pathways among newly identified causal genes. Biologically relevant causal genes were validated in two separate gene expression datasets of lung tissues and bronchial airway brushings. High reliability causal relations were found in SNP-mRNA-phenotype triplets for FEV(1)% predicted (n=169) and FEV(1)/FVC (n=80). Several genes of potential biological relevance for COPD were revealed. eQTL-SNPs upregulating cystatin C (CST3) and CD22 were associated with worse lung function. Signalling pathways enriched with causal genes included xenobiotic metabolism, apoptosis, protease-antiprotease and oxidant-antioxidant balance. By using integrative genomics and analysing the relationships of COPD phenotypes with SNPs and gene expression in lung tissue, we identified CST3 and CD22 as potential causal genes for airflow obstruction. This study also augmented the understanding of previously described COPD pathways. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Genome-Wide Identification and Expression Analysis of WRKY Gene Family in Capsicum annuum L.
Diao, Wei-Ping; Snyder, John C.; Wang, Shu-Bin; Liu, Jin-Bing; Pan, Bao-Gui; Guo, Guang-Jun; Wei, Ge
2016-01-01
The WRKY family of transcription factors is one of the most important families of plant transcriptional regulators with members regulating multiple biological processes, especially in regulating defense against biotic and abiotic stresses. However, little information is available about WRKYs in pepper (Capsicum annuum L.). The recent release of completely assembled genome sequences of pepper allowed us to perform a genome-wide investigation for pepper WRKY proteins. In the present study, a total of 71 WRKY genes were identified in the pepper genome. According to structural features of their encoded proteins, the pepper WRKY genes (CaWRKY) were classified into three main groups, with the second group further divided into five subgroups. Genome mapping analysis revealed that CaWRKY were enriched on four chromosomes, especially on chromosome 1, and 15.5% of the family members were tandemly duplicated genes. A phylogenetic tree was constructed depending on WRKY domain' sequences derived from pepper and Arabidopsis. The expression of 21 selected CaWRKY genes in response to seven different biotic and abiotic stresses (salt, heat shock, drought, Phytophtora capsici, SA, MeJA, and ABA) was evaluated by quantitative RT-PCR; Some CaWRKYs were highly expressed and up-regulated by stress treatment. Our results will provide a platform for functional identification and molecular breeding studies of WRKY genes in pepper. PMID:26941768
Romero Navarro, J. Alberto; Phillips-Mora, Wilbert; Arciniegas-Leal, Adriana; Mata-Quirós, Allan; Haiminen, Niina; Mustiga, Guiliana; Livingstone III, Donald; van Bakel, Harm; Kuhn, David N.; Parida, Laxmi; Kasarskis, Andrew; Motamayor, Juan C.
2017-01-01
Chocolate is a highly valued and palatable confectionery product. Chocolate is primarily made from the processed seeds of the tree species Theobroma cacao. Cacao cultivation is highly relevant for small-holder farmers throughout the tropics, yet its productivity remains limited by low yields and widespread pathogens. A panel of 148 improved cacao clones was assembled based on productivity and disease resistance, and phenotypic single-tree replicated clonal evaluation was performed for 8 years. Using high-density markers, the diversity of clones was expressed relative to 10 known ancestral cacao populations, and significant effects of ancestry were observed in productivity and disease resistance. Genome-wide association (GWA) was performed, and six markers were significantly associated with frosty pod disease resistance. In addition, genomic selection was performed, and consistent with the observed extensive linkage disequilibrium, high predictive ability was observed at low marker densities for all traits. Finally, quantitative trait locus mapping and differential expression analysis of two cultivars with contrasting disease phenotypes were performed to identify genes underlying frosty pod disease resistance, identifying a significant quantitative trait locus and 35 differentially expressed genes using two independent differential expression analyses. These results indicate that in breeding populations of heterozygous and recently admixed individuals, mapping approaches can be used for low complexity traits like pod color cacao, or in other species single gene disease resistance, however genomic selection for quantitative traits remains highly effective relative to mapping. Our results can help guide the breeding process for sustainable improved cacao productivity. PMID:29184558
Castro-Rojas, Carlos; Ortiz-Lópezj, Rocío; Rojas-Martínez, Augusto
2014-06-01
Gastric cancer (GC) is often diagnosed at later stages due to the lack of specificity of symptoms associated with the neoplasm, causing high mortality rates worldwide. The first line of adjuvant and neoadjuvant treatment includes cytotoxic fluoropyrimidines and platin-containing compounds which cause the formation of DNA adducts. The clinical outcome with these antineoplastic agents depends mainly on tumor sensitivity, which is conditioned by the expression level of the drug targets and the DNA-repair system enzymes. In addition, some germ line polymorphisms, in genes linked to drug metabolism and response to chemotherapy, have been associated with poor responses and the development of adverse effects, even with fatal outcomes in GC patients. The identification of genomic biomarkers, such as individual gene polymorphisms or differential expression patterns of specific genes, in a patient-by-patient context with potential clinical application is the main focus of current pharmacogenomic research, which aims at developing a rational and personalized therapy (i.e., a therapy that ensures maximum efficacy with no predictable side effects). However, because of the future application of genomic technologies in the clinical setting, it is necessary to establish the prognostic value of these genomic biomarkers with genotype-phenotype association studies and to evaluate their prevalence in the population under treatment. These issues are important for their cost-effectiveness evaluation, which determines the feasibility of using these medical genomic research products for GC treatment in the clinical setting.
Homoeolog-specific transcriptional bias in allopolyploid wheat
2010-01-01
Background Interaction between parental genomes is accompanied by global changes in gene expression which, eventually, contributes to growth vigor and the broader phenotypic diversity of allopolyploid species. In order to gain a better understanding of the effects of allopolyploidization on the regulation of diverged gene networks, we performed a genome-wide analysis of homoeolog-specific gene expression in re-synthesized allohexaploid wheat created by the hybridization of a tetraploid derivative of hexaploid wheat with the diploid ancestor of the wheat D genome Ae. tauschii. Results Affymetrix wheat genome arrays were used for both the discovery of divergent homoeolog-specific mutations and analysis of homoeolog-specific gene expression in re-synthesized allohexaploid wheat. More than 34,000 detectable parent-specific features (PSF) distributed across the wheat genome were used to assess AB genome (could not differentiate A and B genome contributions) and D genome parental expression in the allopolyploid transcriptome. In re-synthesized polyploid 81% of PSFs detected mid-parent levels of gene expression, and only 19% of PSFs showed the evidence of non-additive expression. Non-additive expression in both AB and D genomes was strongly biased toward up-regulation of parental type of gene expression with only 6% and 11% of genes, respectively, being down-regulated. Of all the non-additive gene expression, 84% can be explained by differences in the parental genotypes used to make the allopolyploid. Homoeolog-specific co-regulation of several functional gene categories was found, particularly genes involved in photosynthesis and protein biosynthesis in wheat. Conclusions Here, we have demonstrated that the establishment of interactions between the diverged regulatory networks in allopolyploids is accompanied by massive homoeolog-specific up- and down-regulation of gene expression. This study provides insights into interactions between homoeologous genomes and their role in growth vigor, development, and fertility of allopolyploid species. PMID:20849627
Superior Cross-Species Reference Genes: A Blueberry Case Study
Die, Jose V.; Rowland, Lisa J.
2013-01-01
The advent of affordable Next Generation Sequencing technologies has had major impact on studies of many crop species, where access to genomic technologies and genome-scale data sets has been extremely limited until now. The recent development of genomic resources in blueberry will enable the application of high throughput gene expression approaches that should relatively quickly increase our understanding of blueberry physiology. These studies, however, require a highly accurate and robust workflow and make necessary the identification of reference genes with high expression stability for correct target gene normalization. To create a set of superior reference genes for blueberry expression analyses, we mined a publicly available transcriptome data set from blueberry for orthologs to a set of Arabidopsis genes that showed the most stable expression in a developmental series. In total, the expression stability of 13 putative reference genes was evaluated by qPCR and a set of new references with high stability values across a developmental series in fruits and floral buds of blueberry were identified. We also demonstrated the need to use at least two, preferably three, reference genes to avoid inconsistencies in results, even when superior reference genes are used. The new references identified here provide a valuable resource for accurate normalization of gene expression in Vaccinium spp. and may be useful for other members of the Ericaceae family as well. PMID:24058469
Biomarkers identified for prostate cancer patients through genome-scale screening.
Wang, Lei-Yun; Cui, Jia-Jia; Zhu, Tao; Shao, Wei-Hua; Zhao, Yi; Wang, Sai; Zhang, Yu-Peng; Wu, Ji-Chu; Zhang, Le
2017-11-03
Prostate cancer is a threat to men and usually occurs in aged males. Though prostate specific antigen level and Gleason score are utilized for evaluation of the prostate cancer in clinic, the biomarkers for this malignancy have not been widely recognized. Furthermore, the outcome varies across individuals receiving comparable treatment regimens and the underlying mechanism is still unclear. We supposed that genetic feature may be responsible for, at least in part, this process and conducted a two-cohort study to compare the genetic difference in tumorous and normal tissues of prostate cancer patients. The Gene Expression Omnibus dataset were used and a total of 41 genes were found significantly differently expressed in tumor tissues as compared with normal prostate tissues. Four genes (SPOCK3, SPON1, PTN and TGFB3) were selected for further evaluation after Gene Ontology analysis, Kyoto Encyclopedia of Genes and Genomes pathway analysis and clinical association analysis. MIR1908 was also found decreased expression level in prostate cancer whose target genes were found expressing in both prostate tumor and normal tissues. These results indicated that these potential biomarkers deserve attention in prostate cancer patients and the underlying mechanism should be further investigated.
Marcon, Helena Sanches; Domingues, Douglas Silva; Silva, Juliana Costa; Borges, Rafael Junqueira; Matioli, Fábio Filippi; Fontes, Marcos Roberto de Mattos; Marino, Celso Luis
2015-08-14
In Eucalyptus genus, studies on genome composition and transposable elements (TEs) are particularly scarce. Nearly half of the recently released Eucalyptus grandis genome is composed by retrotransposons and this data provides an important opportunity to understand TE dynamics in Eucalyptus genome and transcriptome. We characterized nine families of transcriptionally active LTR retrotransposons from Copia and Gypsy superfamilies in Eucalyptus grandis genome and we depicted genomic distribution and copy number in two Eucalyptus species. We also evaluated genomic polymorphism and transcriptional profile in three organs of five Eucalyptus species. We observed contrasting genomic and transcriptional behavior in the same family among different species. RLC_egMax_1 was the most prevalent family and RLC_egAngela_1 was the family with the lowest copy number. Most families of both superfamilies have their insertions occurring <3 million years, except one Copia family, RLC_egBianca_1. Protein theoretical models suggest different properties between Copia and Gypsy domains. IRAP and REMAP markers suggested genomic polymorphisms among Eucalyptus species. Using EST analysis and qRT-PCRs, we observed transcriptional activity in several tissues and in all evaluated species. In some families, osmotic stress increases transcript values. Our strategy was successful in isolating transcriptionally active retrotransposons in Eucalyptus, and each family has a particular genomic and transcriptional pattern. Overall, our results show that retrotransposon activity have differentially affected genome and transcriptome among Eucalyptus species.
Bianchi-Frias, Daniella; Basom, Ryan; Delrow, Jeffrey J; Coleman, Ilsa M; Dakhova, Olga; Qu, Xiaoyu; Fang, Min; Franco, Omar E.; Ericson, Nolan G.; Bielas, Jason H.; Hayward, Simon W.; True, Lawrence; Morrissey, Colm; Brown, Lisha; Bhowmick, Neil A.; Rowley, David; Ittmann, Michael; Nelson, Peter S.
2017-01-01
Prostate cancer-associated stroma (CAS) plays an active role in malignant transformation, tumor progression, and metastasis. Molecular analyses of CAS have demonstrated significant changes in gene expression; however, conflicting evidence exists on whether genomic alterations in benign cells comprising the tumor microenvironment (TME) underlie gene expression changes and oncogenic phenotypes. This study evaluates the nuclear and mitochondrial DNA integrity of prostate carcinoma cells, CAS, matched benign epithelium and benign epithelium-associated stroma by whole genome copy number analyses, targeted sequencing of TP53, and fluorescence in situ hybridization. Comparative genomic hybridization (aCGH) of CAS revealed a copy-neutral diploid genome with only rare and small somatic copy number aberrations (SCNAs). In contrast, several expected recurrent SCNAs were evident in the adjacent prostate carcinoma cells, including gains at 3q, 7p, and 8q, and losses at 8p and 10q. No somatic TP53 mutations were observed in CAS. Mitochondrial DNA (mtDNA) extracted from carcinoma cells and stroma identified 23 somatic mtDNA mutations in neoplastic epithelial cells but only one mutation in stroma. Finally, genomic analyses identified no SCNAs, no loss of heterozygosity (LOH) or copy-neutral LOH in cultured cancer-associated fibroblasts (CAFs), which are known to promote prostate cancer progression in vivo. PMID:26753621
Yoshida, Asuka; Samal, Siba K.
2017-01-01
Avian paramyxovirus serotype 3 (APMV-3) causes infection in a wide variety of avian species, but it does not cause apparent diseases in chickens. On the contrary, APMV-1, also known as Newcastle disease virus (NDV), can cause severe disease in chickens. Currently, natural low virulence strains of NDV are used as live-attenuated vaccines throughout the world. NDV is also being evaluated as a vaccine vector against poultry pathogens. However, due to routine vaccination programs, chickens often possess pre-existing antibodies against NDV, which may cause the chickens to be less sensitive to recombinant NDV vaccines expressing antigens of other avian pathogens. Therefore, it may be possible for an APMV-3 vector vaccine to circumvent this issue. In this study, we determined the optimal insertion site in the genome of APMV-3 for high level expression of a foreign gene. We generated recombinant APMV-3 viruses expressing the green fluorescent protein (GFP) by inserting the GFP gene at five different intergenic regions in the genome. The levels of GFP transcription and translation were evaluated. Interestingly, the levels of GFP transcription and translation did not follow the 3′-to-5′ attenuation mechanism of non-segmented, negative-sense RNA viruses. The insertion of GFP gene into the P-M gene junction resulted in higher level of expression of GFP than when the gene was inserted into the upstream N-P gene junction. Unlike NDV, insertion of GFP did not attenuate the growth efficiency of AMPV-3. Thus, APMV-3 could be a more useful vaccine vector for avian pathogens than NDV. PMID:28473820
Yoshida, Asuka; Samal, Siba K
2017-01-01
Avian paramyxovirus serotype 3 (APMV-3) causes infection in a wide variety of avian species, but it does not cause apparent diseases in chickens. On the contrary, APMV-1, also known as Newcastle disease virus (NDV), can cause severe disease in chickens. Currently, natural low virulence strains of NDV are used as live-attenuated vaccines throughout the world. NDV is also being evaluated as a vaccine vector against poultry pathogens. However, due to routine vaccination programs, chickens often possess pre-existing antibodies against NDV, which may cause the chickens to be less sensitive to recombinant NDV vaccines expressing antigens of other avian pathogens. Therefore, it may be possible for an APMV-3 vector vaccine to circumvent this issue. In this study, we determined the optimal insertion site in the genome of APMV-3 for high level expression of a foreign gene. We generated recombinant APMV-3 viruses expressing the green fluorescent protein (GFP) by inserting the GFP gene at five different intergenic regions in the genome. The levels of GFP transcription and translation were evaluated. Interestingly, the levels of GFP transcription and translation did not follow the 3'-to-5' attenuation mechanism of non-segmented, negative-sense RNA viruses. The insertion of GFP gene into the P-M gene junction resulted in higher level of expression of GFP than when the gene was inserted into the upstream N-P gene junction. Unlike NDV, insertion of GFP did not attenuate the growth efficiency of AMPV-3. Thus, APMV-3 could be a more useful vaccine vector for avian pathogens than NDV.
SIDR: simultaneous isolation and parallel sequencing of genomic DNA and total RNA from single cells.
Han, Kyung Yeon; Kim, Kyu-Tae; Joung, Je-Gun; Son, Dae-Soon; Kim, Yeon Jeong; Jo, Areum; Jeon, Hyo-Jeong; Moon, Hui-Sung; Yoo, Chang Eun; Chung, Woosung; Eum, Hye Hyeon; Kim, Sangmin; Kim, Hong Kwan; Lee, Jeong Eon; Ahn, Myung-Ju; Lee, Hae-Ock; Park, Donghyun; Park, Woong-Yang
2018-01-01
Simultaneous sequencing of the genome and transcriptome at the single-cell level is a powerful tool for characterizing genomic and transcriptomic variation and revealing correlative relationships. However, it remains technically challenging to analyze both the genome and transcriptome in the same cell. Here, we report a novel method for simultaneous isolation of genomic DNA and total RNA (SIDR) from single cells, achieving high recovery rates with minimal cross-contamination, as is crucial for accurate description and integration of the single-cell genome and transcriptome. For reliable and efficient separation of genomic DNA and total RNA from single cells, the method uses hypotonic lysis to preserve nuclear lamina integrity and subsequently captures the cell lysate using antibody-conjugated magnetic microbeads. Evaluating the performance of this method using real-time PCR demonstrated that it efficiently recovered genomic DNA and total RNA. Thorough data quality assessments showed that DNA and RNA simultaneously fractionated by the SIDR method were suitable for genome and transcriptome sequencing analysis at the single-cell level. The integration of single-cell genome and transcriptome sequencing by SIDR (SIDR-seq) showed that genetic alterations, such as copy-number and single-nucleotide variations, were more accurately captured by single-cell SIDR-seq compared with conventional single-cell RNA-seq, although copy-number variations positively correlated with the corresponding gene expression levels. These results suggest that SIDR-seq is potentially a powerful tool to reveal genetic heterogeneity and phenotypic information inferred from gene expression patterns at the single-cell level. © 2018 Han et al.; Published by Cold Spring Harbor Laboratory Press.
SIDR: simultaneous isolation and parallel sequencing of genomic DNA and total RNA from single cells
Han, Kyung Yeon; Kim, Kyu-Tae; Joung, Je-Gun; Son, Dae-Soon; Kim, Yeon Jeong; Jo, Areum; Jeon, Hyo-Jeong; Moon, Hui-Sung; Yoo, Chang Eun; Chung, Woosung; Eum, Hye Hyeon; Kim, Sangmin; Kim, Hong Kwan; Lee, Jeong Eon; Ahn, Myung-Ju; Lee, Hae-Ock; Park, Donghyun; Park, Woong-Yang
2018-01-01
Simultaneous sequencing of the genome and transcriptome at the single-cell level is a powerful tool for characterizing genomic and transcriptomic variation and revealing correlative relationships. However, it remains technically challenging to analyze both the genome and transcriptome in the same cell. Here, we report a novel method for simultaneous isolation of genomic DNA and total RNA (SIDR) from single cells, achieving high recovery rates with minimal cross-contamination, as is crucial for accurate description and integration of the single-cell genome and transcriptome. For reliable and efficient separation of genomic DNA and total RNA from single cells, the method uses hypotonic lysis to preserve nuclear lamina integrity and subsequently captures the cell lysate using antibody-conjugated magnetic microbeads. Evaluating the performance of this method using real-time PCR demonstrated that it efficiently recovered genomic DNA and total RNA. Thorough data quality assessments showed that DNA and RNA simultaneously fractionated by the SIDR method were suitable for genome and transcriptome sequencing analysis at the single-cell level. The integration of single-cell genome and transcriptome sequencing by SIDR (SIDR-seq) showed that genetic alterations, such as copy-number and single-nucleotide variations, were more accurately captured by single-cell SIDR-seq compared with conventional single-cell RNA-seq, although copy-number variations positively correlated with the corresponding gene expression levels. These results suggest that SIDR-seq is potentially a powerful tool to reveal genetic heterogeneity and phenotypic information inferred from gene expression patterns at the single-cell level. PMID:29208629
Convergent Genomic Studies Identify Association of GRIK2 and NPAS2 with Chronic Fatigue Syndrome
Smith, Alicia K.; Fang, Hong; Whistler, Toni; Unger, Elizabeth R.; Rajeevan, Mangalathu S.
2011-01-01
Background There is no consistent evidence of specific gene(s) or molecular pathways that contribute to the pathogenesis, therapeutic intervention or diagnosis of chronic fatigue syndrome (CFS). While multiple studies support a role for genetic variation in CFS, genome-wide efforts to identify associated loci remain unexplored. We employed a novel convergent functional genomics approach that incorporates the findings from single-nucleotide polymorphism (SNP) and mRNA expression studies to identify associations between CFS and novel candidate genes for further investigation. Methods We evaluated 116,204 SNPs in 40 CFS and 40 nonfatigued control subjects along with mRNA expression of 20,160 genes in a subset of these subjects (35 CFS subjects and 27 controls) derived from a population-based study. Results Sixty-five SNPs were nominally associated with CFS (p < 0.001), and 165 genes were differentially expressed (≥4-fold; p ≤ 0.05) in peripheral blood mononuclear cells of CFS subjects. Two genes, glutamate receptor, ionotropic, kinase 2 (GRIK2) and neuronal PAS domain protein 2 (NPAS2), were identified by both SNP and gene expression analyses. Subjects with the G allele of rs2247215 (GRIK2) were more likely to have CFS (p = 0.0005), and CFS subjects showed decreased GRIK2 expression (10-fold; p = 0.015). Subjects with the T allele of rs356653 (NPAS2) were more likely to have CFS (p = 0.0007), and NPAS2 expression was increased (10-fold; p = 0.027) in those with CFS. Conclusion Using an integrated genomic strategy, this study suggests a possible role for genes involved in glutamatergic neurotransmission and circadian rhythm in CFS and supports further study of novel candidate genes in independent populations of CFS subjects. PMID:21912186
Engineered Chloroplast Genome just got Smarter
Jin, Shuangxia; Daniell, Henry
2015-01-01
Chloroplasts are known to sustain life on earth by providing food, fuel and oxygen through the process of photosynthesis. However, the chloroplast genome has also been smartly engineered to confer valuable agronomic traits and/or serve as bioreactors for production of industrial enzymes, biopharmaceuticals, bio-products or vaccines. The recent breakthrough in hyper-expression of biopharmaceuticals in edible leaves has facilitated the advancement to clinical studies by major pharmaceutical companies. This review critically evaluates progress in developing new tools to enhance or simplify expression of targeted genes in chloroplasts. These tools hold the promise to further the development of novel fuels and products, enhance the photosynthetic process, and increase our understanding of retrograde signaling and cellular processes. PMID:26440432
Williams, Ruth M; Senanayake, Upeka; Artibani, Mara; Taylor, Gunes; Wells, Daniel; Ahmed, Ahmed Ashour; Sauka-Spengler, Tatjana
2018-02-23
CRISPR/Cas9 genome engineering has revolutionised all aspects of biological research, with epigenome engineering transforming gene regulation studies. Here, we present an optimised, adaptable toolkit enabling genome and epigenome engineering in the chicken embryo, and demonstrate its utility by probing gene regulatory interactions mediated by neural crest enhancers. First, we optimise novel efficient guide-RNA mini expression vectors utilising chick U6 promoters, provide a strategy for rapid somatic gene knockout and establish a protocol for evaluation of mutational penetrance by targeted next-generation sequencing. We show that CRISPR/Cas9-mediated disruption of transcription factors causes a reduction in their cognate enhancer-driven reporter activity. Next, we assess endogenous enhancer function using both enhancer deletion and nuclease-deficient Cas9 (dCas9) effector fusions to modulate enhancer chromatin landscape, thus providing the first report of epigenome engineering in a developing embryo. Finally, we use the synergistic activation mediator (SAM) system to activate an endogenous target promoter. The novel genome and epigenome engineering toolkit developed here enables manipulation of endogenous gene expression and enhancer activity in chicken embryos, facilitating high-resolution analysis of gene regulatory interactions in vivo . © 2018. Published by The Company of Biologists Ltd.
Hovey, Raymond; Lentes, Sabine; Ehrenreich, Armin; Salmon, Kirsty; Saba, Karla; Gottschalk, Gerhard; Gunsalus, Robert P; Deppenmeier, Uwe
2005-05-01
Methansarcina mazei Gö1 DNA arrays were constructed and used to evaluate the genomic expression patterns of cells grown on either of two alternative methanogenic substrates, acetate or methanol, as sole carbon and energy source. Analysis of differential transcription across the genome revealed two functionally grouped sets of genes that parallel the central biochemical pathways in, and reflect many known features of, acetate and methanol metabolism. These include the acetate-induced genes encoding acetate activating enzymes, acetyl-CoA synthase/CO dehydrogenase, and carbonic anhydrase. Interestingly, additional genes expressed at significantly higher levels during growth on acetate included two energy-conserving complexes (the Ech hydrogenase, and the A1A0-type ATP synthase). Many previously unknown features included the induction by acetate of genes coding for ferredoxins and flavoproteins, an aldehyde:ferredoxin oxidoreductase, enzymes for the synthesis of aromatic amino acids, and components of iron, cobalt and oligopeptide uptake systems. In contrast, methanol-grown cells exhibited elevated expression of genes assigned to the methylotrophic pathway of methanogenesis. Expression of genes for components of the translation apparatus was also elevated in cells grown in the methanol medium relative to acetate, and was correlated with the faster growth rate observed on the former substrate. These experiments provide the first comprehensive insight into substrate-dependent gene expression in a methanogenic archaeon. This genome-wide approach, coupled with the complementary molecular and biochemical tools, should greatly accelerate the exploration of Methanosarcina cell physiology, given the present modest level of our knowledge of these large archaeal genomes.
Vidal, Ramon Oliveira; Mondego, Jorge Maurício Costa; Pot, David; Ambrósio, Alinne Batista; Andrade, Alan Carvalho; Pereira, Luiz Filipe Protasio; Colombo, Carlos Augusto; Vieira, Luiz Gonzaga Esteves; Carazzolle, Marcelo Falsarella; Pereira, Gonçalo Amarante Guimarães
2010-01-01
Polyploidization constitutes a common mode of evolution in flowering plants. This event provides the raw material for the divergence of function in homeologous genes, leading to phenotypic novelty that can contribute to the success of polyploids in nature or their selection for use in agriculture. Mounting evidence underlined the existence of homeologous expression biases in polyploid genomes; however, strategies to analyze such transcriptome regulation remained scarce. Important factors regarding homeologous expression biases remain to be explored, such as whether this phenomenon influences specific genes, how paralogs are affected by genome doubling, and what is the importance of the variability of homeologous expression bias to genotype differences. This study reports the expressed sequence tag assembly of the allopolyploid Coffea arabica and one of its direct ancestors, Coffea canephora. The assembly was used for the discovery of single nucleotide polymorphisms through the identification of high-quality discrepancies in overlapped expressed sequence tags and for gene expression information indirectly estimated by the transcript redundancy. Sequence diversity profiles were evaluated within C. arabica (Ca) and C. canephora (Cc) and used to deduce the transcript contribution of the Coffea eugenioides (Ce) ancestor. The assignment of the C. arabica haplotypes to the C. canephora (CaCc) or C. eugenioides (CaCe) ancestral genomes allowed us to analyze gene expression contributions of each subgenome in C. arabica. In silico data were validated by the quantitative polymerase chain reaction and allele-specific combination TaqMAMA-based method. The presence of differential expression of C. arabica homeologous genes and its implications in coffee gene expression, ontology, and physiology are discussed. PMID:20864545
He, Yajun; Mao, Shaoshuai; Gao, Yulong; Zhu, Liying; Wu, Daoming; Cui, Yixin; Li, Jiana; Qian, Wei
2016-01-01
WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. olearecea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to various stress tolerance. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to the three multiple stresses simultaneously, which suggests that these 3 WRKY may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress related QTL regions, indicating tandem duplicate WRKYs in the adaptive responses to environmental stimuli during the evolution process. Our results provide a framework for future studies regarding the function of WRKY genes in response to stress in B. napus. PMID:27322342
He, Yajun; Mao, Shaoshuai; Gao, Yulong; Zhu, Liying; Wu, Daoming; Cui, Yixin; Li, Jiana; Qian, Wei
2016-01-01
WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. olearecea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to various stress tolerance. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to the three multiple stresses simultaneously, which suggests that these 3 WRKY may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress related QTL regions, indicating tandem duplicate WRKYs in the adaptive responses to environmental stimuli during the evolution process. Our results provide a framework for future studies regarding the function of WRKY genes in response to stress in B. napus.
Amar, David; Frades, Itziar; Danek, Agnieszka; Goldberg, Tatyana; Sharma, Sanjeev K; Hedley, Pete E; Proux-Wera, Estelle; Andreasson, Erik; Shamir, Ron; Tzfadia, Oren; Alexandersson, Erik
2014-12-05
For most organisms, even if their genome sequence is available, little functional information about individual genes or proteins exists. Several annotation pipelines have been developed for functional analysis based on sequence, 'omics', and literature data. However, researchers encounter little guidance on how well they perform. Here, we used the recently sequenced potato genome as a case study. The potato genome was selected since its genome is newly sequenced and it is a non-model plant even if there is relatively ample information on individual potato genes, and multiple gene expression profiles are available. We show that the automatic gene annotations of potato have low accuracy when compared to a "gold standard" based on experimentally validated potato genes. Furthermore, we evaluate six state-of-the-art annotation pipelines and show that their predictions are markedly dissimilar (Jaccard similarity coefficient of 0.27 between pipelines on average). To overcome this discrepancy, we introduce a simple GO structure-based algorithm that reconciles the predictions of the different pipelines. We show that the integrated annotation covers more genes, increases by over 50% the number of highly co-expressed GO processes, and obtains much higher agreement with the gold standard. We find that different annotation pipelines produce different results, and show how to integrate them into a unified annotation that is of higher quality than each single pipeline. We offer an improved functional annotation of both PGSC and ITAG potato gene models, as well as tools that can be applied to additional pipelines and improve annotation in other organisms. This will greatly aid future functional analysis of '-omics' datasets from potato and other organisms with newly sequenced genomes. The new potato annotations are available with this paper.
Optimized gene editing technology for Drosophila melanogaster using germ line-specific Cas9.
Ren, Xingjie; Sun, Jin; Housden, Benjamin E; Hu, Yanhui; Roesel, Charles; Lin, Shuailiang; Liu, Lu-Ping; Yang, Zhihao; Mao, Decai; Sun, Lingzhu; Wu, Qujie; Ji, Jun-Yuan; Xi, Jianzhong; Mohr, Stephanie E; Xu, Jiang; Perrimon, Norbert; Ni, Jian-Quan
2013-11-19
The ability to engineer genomes in a specific, systematic, and cost-effective way is critical for functional genomic studies. Recent advances using the CRISPR-associated single-guide RNA system (Cas9/sgRNA) illustrate the potential of this simple system for genome engineering in a number of organisms. Here we report an effective and inexpensive method for genome DNA editing in Drosophila melanogaster whereby plasmid DNAs encoding short sgRNAs under the control of the U6b promoter are injected into transgenic flies in which Cas9 is specifically expressed in the germ line via the nanos promoter. We evaluate the off-targets associated with the method and establish a Web-based resource, along with a searchable, genome-wide database of predicted sgRNAs appropriate for genome engineering in flies. Finally, we discuss the advantages of our method in comparison with other recently published approaches.
Van Coillie, Samya; Liang, Lunxi; Zhang, Yao; Wang, Huanbin; Fang, Jing-Yuan; Xu, Jie
2016-04-05
High-throughput methods such as co-immunoprecipitationmass spectrometry (coIP-MS) and yeast 2 hybridization (Y2H) have suggested a broad range of unannotated protein-protein interactions (PPIs), and interpretation of these PPIs remains a challenging task. The advancements in cancer genomic researches allow for the inference of "coactivation pairs" in cancer, which may facilitate the identification of PPIs involved in cancer. Here we present OncoBinder as a tool for the assessment of proteomic interaction data based on the functional synergy of oncoproteins in cancer. This decision tree-based method combines gene mutation, copy number and mRNA expression information to infer the functional status of protein-coding genes. We applied OncoBinder to evaluate the potential binders of EGFR and ERK2 proteins based on the gastric cancer dataset of The Cancer Genome Atlas (TCGA). As a result, OncoBinder identified high confidence interactions (annotated by Kyoto Encyclopedia of Genes and Genomes (KEGG) or validated by low-throughput assays) more efficiently than co-expression based method. Taken together, our results suggest that evaluation of gene functional synergy in cancer may facilitate the interpretation of proteomic interaction data. The OncoBinder toolbox for Matlab is freely accessible online.
Evaluation of an FRDA-EGFP genomic reporter assay in transgenic mice.
Sarsero, Joseph P; Holloway, Timothy P; Li, Lingli; McLenachan, Samuel; Fowler, Kerry J; Bertoncello, Ivan; Voullaire, Lucille; Gazeas, Sophie; Ioannou, Panos A
2005-04-01
Friedreich ataxia is an autosomal recessive neurodegenerative disorder caused by a GAA trinucleotide expansion in the first intron of the Friedreich ataxia gene (FRDA) that causes reduced synthesis of frataxin, a mitochondrial protein likely to be involved in biosynthesis of iron-sulfur clusters. This leads to increased oxidative stress, progressive loss of large sensory neurons, and hypertrophic cardiomyopathy. To elucidate the mechanisms regulating FRDA expression and to develop an in vivo assay for agents that might upregulate FRDA expression in a therapeutically relevant manner, we have generated transgenic mice with a BAC genomic reporter construct consisting of an in-frame fusion between FRDA and the gene coding for enhanced green fluorescent protein (EGFP). Production of full-length frataxin-EGFP fusion protein was demonstrated by immunoblotting. EGFP expression was observed as early as day E3.5 of development. Most tissues of adult transgenic mice were fluorescent. The level of FRDA-EGFP expression in peripheral blood, bone marrow, and cells obtained from enzymatically disaggregated tissues was quantitated by flow cytometry. There was a twofold increase in EGFP expression in mice homozygous for the transgene when compared to hemizygous mice. These transgenic mice are a valuable tool for the examination of spatial and temporal aspects of FRDA gene expression and for the preclinical evaluation of pharmacological inducers of FRDA expression in a whole-animal model. In addition, tissues from these mice should also be valuable for stem cell transplantation studies.
Application of industrial scale genomics to discovery of therapeutic targets in heart failure.
Mehraban, F; Tomlinson, J E
2001-12-01
In recent years intense activity in both academic and industrial sectors has provided a wealth of information on the human genome with an associated impressive increase in the number of novel gene sequences deposited in sequence data repositories and patent applications. This genomic industrial revolution has transformed the way in which drug target discovery is now approached. In this article we discuss how various differential gene expression (DGE) technologies are being utilized for cardiovascular disease (CVD) drug target discovery. Other approaches such as sequencing cDNA from cardiovascular derived tissues and cells coupled with bioinformatic sequence analysis are used with the aim of identifying novel gene sequences that may be exploited towards target discovery. Additional leverage from gene sequence information is obtained through identification of polymorphisms that may confer disease susceptibility and/or affect drug responsiveness. Pharmacogenomic studies are described wherein gene expression-based techniques are used to evaluate drug response and/or efficacy. Industrial-scale genomics supports and addresses not only novel target gene discovery but also the burgeoning issues in pharmaceutical and clinical cardiovascular medicine relative to polymorphic gene responses.
Baxter, Laura L; Hsu, Benjamin J; Umayam, Lowell; Wolfsberg, Tyra G; Larson, Denise M; Frith, Martin C; Kawai, Jun; Hayashizaki, Yoshihide; Carninci, Piero; Pavan, William J
2007-06-01
As part of the RIKEN mouse encyclopedia project, two cDNA libraries were prepared from melanocyte-derived cell lines, using techniques of full-length clone selection and subtraction/normalization to enrich for rare transcripts. End sequencing showed that these libraries display over 83% complete coding sequence at the 5' end and 96-97% complete coding sequence at the 3' end. Evaluation of the libraries, derived from B16F10Y tumor cells and melan-c cells, revealed that they contain clones for a majority of the genes previously demonstrated to function in melanocyte biology. Analysis of genomic locations for transcripts revealed that the distribution of melanocyte genes is non-random throughout the genome. Three genomic regions identified that showed significant clustering of melanocyte-expressed genes contain one or more genes previously shown to regulate melanocyte development or function. A catalog of genes expressed in these libraries is presented, providing a valuable resource of cDNA clones and sequence information that can be used for identification of new genes important for melanocyte development, function, and disease.
Meyer, Vera; Wanka, Franziska; van Gent, Janneke; Arentshorst, Mark; van den Hondel, Cees A. M. J. J.; Ram, Arthur F. J.
2011-01-01
Filamentous fungi are the cause of serious human and plant diseases but are also exploited in biotechnology as production platforms. Comparative genomics has documented their genetic diversity, and functional genomics and systems biology approaches are under way to understand the functions and interaction of fungal genes and proteins. In these approaches, gene functions are usually inferred from deletion or overexpression mutants. However, studies at these extreme points give only limited information. Moreover, many overexpression studies use metabolism-dependent promoters, often causing pleiotropic effects and thus limitations in their significance. We therefore established and systematically evaluated a tunable expression system for Aspergillus niger that is independent of carbon and nitrogen metabolism and silent under noninduced conditions. The system consists of two expression modules jointly targeted to a defined genomic locus. One module ensures constitutive expression of the tetracycline-dependent transactivator rtTA2S-M2, and one module harbors the rtTA2S-M2-dependent promoter that controls expression of the gene of interest (the Tet-on system). We show here that the system is tight, responds within minutes after inducer addition, and allows fine-tuning based on the inducer concentration or gene copy number up to expression levels higher than the expression levels of the gpdA promoter. We also validate the Tet-on system for the generation of conditional overexpression mutants and demonstrate its power when combined with a gene deletion approach. Finally, we show that the system is especially suitable when the functions of essential genes must be examined. PMID:21378046
Tang, Roderick S.; Schickli, Jeanne H.; MacPhail, Mia; Fernandes, Fiona; Bicha, Leenas; Spaete, Joshua; Fouchier, Ron A. M.; Osterhaus, Albert D. M. E.; Spaete, Richard; Haller, Aurelia A.
2003-01-01
A live attenuated bovine parainfluenza virus type 3 (PIV3), harboring the fusion (F) and hemagglutinin-neuraminidase (HN) genes of human PIV3, was used as a virus vector to express surface glycoproteins derived from two human pathogens, human metapneumovirus (hMPV) and respiratory syncytial virus (RSV). RSV and hMPV are both paramyxoviruses that cause respiratory disease in young children, the elderly, and immunocompromised individuals. RSV has been known for decades to cause acute lower respiratory tract infections in young children, which often result in hospitalization, while hMPV has only been recently identified as a novel human respiratory pathogen. In this study, the ability of bovine/human PIV3 to express three different foreign transmembrane surface glycoproteins and to induce a protective immune response was evaluated. The RNA-dependent RNA polymerase of paramyxoviruses binds to a single site at the 3′ end of the viral RNA genome to initiate transcription of viral genes. The genome position of the viral gene determines its level of gene expression. The promoter-proximal gene is transcribed with the highest frequency, and each downstream gene is transcribed less often due to attenuation of transcription at each gene junction. This feature of paramyxoviruses was exploited using the PIV3 vector by inserting the foreign viral genes at the 3′ terminus, at position 1 or 2, of the viral RNA genome. These locations were expected to yield high levels of foreign viral protein expression stimulating a protective immune response. The immunogenicity and protection results obtained with a hamster model showed that bovine/human PIV3 can be employed to generate bivalent PIV3/RSV or PIV3/hMPV vaccine candidates that will be further evaluated for safety and efficacy in primates. PMID:14512532
Pharmacogenomics and its potential impact on drug and formulation development.
Regnstrom, Karin; Burgess, Diane J
2005-01-01
Recent advances in genomic research have provided the basis for new insights into the importance of genetic and genomic markers during the different stages of drug development. A new field of research, pharmacogenomics, which studies the relationship between drug effects and the genome, has emerged. Structural pharmacogenomics maps the complete DNA sequences of whole genomes (genotypes) including individual variations, and functional pharmacogenomics assesses the expression levels of thousands of genes in one single experiment. Together, these two areas of pharmacogenomics have generated massive databases, which have become a challenge for the research field of informatics and have fostered a new branch of research, bioinformatics. If skillfully used, the databases generated by pharmacogenomics together with data mining on the Web promise to improve the drug development process in a variety of areas: identification of drug targets, evaluation of toxicity, classification of diseases, evaluation of formulations, assessment of drug response and treatment, post-marketing applications, and development of personalized medicines.
Ahmadi, Samira; Davami, Fatemeh; Davoudi, Noushin; Nematpour, Fatemeh; Ahmadi, Maryam; Ebadat, Saeedeh; Azadmanesh, Kayhan; Barkhordari, Farzaneh; Mahboudi, Fereidoun
2017-01-01
Establishing stable Chinese Hamster Ovary (CHO) cells producing monoclonal antibodies (mAbs) usually pass through the random integration of vectors to the cell genome, which is sensitive to gene silencing. One approach to overcome this issue is to target a highly transcribed region in the genome. Transposons are useful devices to target active parts of genomes, and PiggyBac (PB) transposon can be considered as a good option. In the present study, three PB transposon donor vectors containing both heavy and light chains were constructed, one contained independent expression cassettes while the others utilized either an Internal Ribosome Entry Site (IRES) or 2A element to express mAb. Conventional cell pools were created by transferring donor vectors into the CHO cells, whereas transposon-based cells were generated by transfecting the cells with donor vectors with a companion of a transposase-encoding helper vector, with 1:2.5 helper/donor vectors ratio. To evaluate the influence of helper/donor vectors ratio on expression, the second transposon-based cell pools were generated with 1:5 helper/donor ratio. Expression levels in the transposon-based cells were two to five -folds more than those created by conventional method except for the IRES-mediated ones, in which the observed difference increased more than 100-fold. The results were dependent on both donor vector design and vectors ratios.
Ahmadi, Samira; Davami, Fatemeh; Davoudi, Noushin; Nematpour, Fatemeh; Ahmadi, Maryam; Ebadat, Saeedeh; Azadmanesh, Kayhan; Barkhordari, Farzaneh
2017-01-01
Establishing stable Chinese Hamster Ovary (CHO) cells producing monoclonal antibodies (mAbs) usually pass through the random integration of vectors to the cell genome, which is sensitive to gene silencing. One approach to overcome this issue is to target a highly transcribed region in the genome. Transposons are useful devices to target active parts of genomes, and PiggyBac (PB) transposon can be considered as a good option. In the present study, three PB transposon donor vectors containing both heavy and light chains were constructed, one contained independent expression cassettes while the others utilized either an Internal Ribosome Entry Site (IRES) or 2A element to express mAb. Conventional cell pools were created by transferring donor vectors into the CHO cells, whereas transposon-based cells were generated by transfecting the cells with donor vectors with a companion of a transposase-encoding helper vector, with 1:2.5 helper/donor vectors ratio. To evaluate the influence of helper/donor vectors ratio on expression, the second transposon-based cell pools were generated with 1:5 helper/donor ratio. Expression levels in the transposon-based cells were two to five -folds more than those created by conventional method except for the IRES-mediated ones, in which the observed difference increased more than 100-fold. The results were dependent on both donor vector design and vectors ratios. PMID:28662065
Structural and quantitative expression analyses of HERV gene family in human tissues.
Ahn, Kung; Kim, Heui-Soo
2009-08-31
Human endogenous retroviruses (HERVs) have been implicated in the pathogenesis of several human diseases as multi-copy members in the human genome. Their gene expression profiling could provide us with important insights into the pathogenic relationship between HERVs and cancer. In this study, we have evaluated the genomic structure and quantitatively determined the expression patterns in the env gene of a variety of HERV family members located on six specific loci by the RetroTector 10 program, as well as real-time RT-PCR amplification. The env gene transcripts evidenced significant differences in the human tumor/normal adjacent tissues (colon, liver, uterus, lung and testis). As compared to the adjacent normal tissues, high levels of expression were noted in testis tumor tissues for HERV-K, in liver and lung tumor tissues for HERV-R, in liver, lung, and testis tumor tissues for HERV-H, and in colon and liver tumor tissues for HERV-P. These data warrant further studies with larger groups of patients to develop biomarkers for specific human cancers.
Comprehensive Evaluation of the Contribution of X Chromosome Genes to Platinum Sensitivity
Gamazon, Eric R.; Im, Hae Kyung; O’Donnell, Peter H.; Ziliak, Dana; Stark, Amy L.; Cox, Nancy J.; Dolan, M. Eileen; Huang, Rong Stephanie
2011-01-01
Utilizing a genome-wide gene expression dataset generated from Affymetrix GeneChip® Human Exon 1.0ST array, we comprehensively surveyed the role of 322 X chromosome gene expression traits on cellular sensitivity to cisplatin and carboplatin. We identified 31 and 17 X chromosome genes whose expression levels are significantly correlated (after multiple testing correction) with sensitivity to carboplatin and cisplatin, respectively, in the combined HapMap CEU and YRI populations (false discovery rate, FDR<0.05). Of those, 14 overlap for both cisplatin and carboplatin. Employing an independent gene expression quantification method, the Illumina Sentrix Human-6 Expression BeadChip, measured on the same HapMap cell lines, we found that 4 and 2 of these genes are significantly associated with carboplatin and cisplatin sensitivity respectively in both analyses. Two genes, CTPS2 and DLG3, were identified by both genome-wide gene expression analyses as correlated with cellular sensitivity to both platinating agents. The expression of DLG3 gene was also found to correlate with cellular sensitivity to platinating agents in NCI60 cancer cell lines. In addition, we evaluated the role of X chromosome gene expression to the observed differences in sensitivity to the platinums between CEU and YRI derived cell lines. Of the 34 distinct genes significantly correlated with either carboplatin or cisplatin sensitivity, 14 are differentially expressed (defined as p<0.05) between CEU and YRI. Thus, sex chromosome genes play a role in cellular sensitivity to platinating agents and differences in the expression level of these genes are an important source of variation that should be included in comprehensive pharmacogenomic studies. PMID:21252287
Almlöf, Jonas Carlsson; Lundmark, Per; Lundmark, Anders; Ge, Bing; Maouche, Seraya; Göring, Harald H. H.; Liljedahl, Ulrika; Enström, Camilla; Brocheton, Jessy; Proust, Carole; Godefroy, Tiphaine; Sambrook, Jennifer G.; Jolley, Jennifer; Crisp-Hihn, Abigail; Foad, Nicola; Lloyd-Jones, Heather; Stephens, Jonathan; Gwilliam, Rhian; Rice, Catherine M.; Hengstenberg, Christian; Samani, Nilesh J.; Erdmann, Jeanette; Schunkert, Heribert; Pastinen, Tomi; Deloukas, Panos; Goodall, Alison H.; Ouwehand, Willem H.; Cambien, François; Syvänen, Ann-Christine
2012-01-01
A large number of genome-wide association studies have been performed during the past five years to identify associations between SNPs and human complex diseases and traits. The assignment of a functional role for the identified disease-associated SNP is not straight-forward. Genome-wide expression quantitative trait locus (eQTL) analysis is frequently used as the initial step to define a function while allele-specific gene expression (ASE) analysis has not yet gained a wide-spread use in disease mapping studies. We compared the power to identify cis-acting regulatory SNPs (cis-rSNPs) by genome-wide allele-specific gene expression (ASE) analysis with that of traditional expression quantitative trait locus (eQTL) mapping. Our study included 395 healthy blood donors for whom global gene expression profiles in circulating monocytes were determined by Illumina BeadArrays. ASE was assessed in a subset of these monocytes from 188 donors by quantitative genotyping of mRNA using a genome-wide panel of SNP markers. The performance of the two methods for detecting cis-rSNPs was evaluated by comparing associations between SNP genotypes and gene expression levels in sample sets of varying size. We found that up to 8-fold more samples are required for eQTL mapping to reach the same statistical power as that obtained by ASE analysis for the same rSNPs. The performance of ASE is insensitive to SNPs with low minor allele frequencies and detects a larger number of significantly associated rSNPs using the same sample size as eQTL mapping. An unequivocal conclusion from our comparison is that ASE analysis is more sensitive for detecting cis-rSNPs than standard eQTL mapping. Our study shows the potential of ASE mapping in tissue samples and primary cells which are difficult to obtain in large numbers. PMID:23300628
Genome engineering and gene expression control for bacterial strain development.
Song, Chan Woo; Lee, Joungmin; Lee, Sang Yup
2015-01-01
In recent years, a number of techniques and tools have been developed for genome engineering and gene expression control to achieve desired phenotypes of various bacteria. Here we review and discuss the recent advances in bacterial genome manipulation and gene expression control techniques, and their actual uses with accompanying examples. Genome engineering has been commonly performed based on homologous recombination. During such genome manipulation, the counterselection systems employing SacB or nucleases have mainly been used for the efficient selection of desired engineered strains. The recombineering technology enables simple and more rapid manipulation of the bacterial genome. The group II intron-mediated genome engineering technology is another option for some bacteria that are difficult to be engineered by homologous recombination. Due to the increasing demands on high-throughput screening of bacterial strains having the desired phenotypes, several multiplex genome engineering techniques have recently been developed and validated in some bacteria. Another approach to achieve desired bacterial phenotypes is the repression of target gene expression without the modification of genome sequences. This can be performed by expressing antisense RNA, small regulatory RNA, or CRISPR RNA to repress target gene expression at the transcriptional or translational level. All of these techniques allow efficient and rapid development and screening of bacterial strains having desired phenotypes, and more advanced techniques are expected to be seen. Copyright © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
USDA-ARS?s Scientific Manuscript database
The existence of two separate lineages of Escherichia coli O157:H7 has previously been reported, and research indicates that lineage I might be more pathogenic towards human hosts than lineage II. We have previously shown that lineage I expresses higher levels of Shiga toxin 2 (Stx2). To evaluate w...
Picking Cell Lines for High-Throughput Transcriptomic Toxicity Screening (SOT)
High throughput, whole genome transcriptomic profiling is a promising approach to comprehensively evaluate chemicals for potential biological effects. To be useful for in vitro toxicity screening, gene expression must be quantified in a set of representative cell types that captu...
Evaluation of Sequencing Approaches for High-Throughput Transcriptomics - (BOSC)
Whole-genome in vitro transcriptomics has shown the capability to identify mechanisms of action and estimates of potency for chemical-mediated effects in a toxicological framework, but with limited throughput and high cost. The generation of high-throughput global gene expression...
Transcriptomic resources for environmental risk assessment: a case study in the Venice lagoon.
Milan, M; Pauletto, M; Boffo, L; Carrer, C; Sorrentino, F; Ferrari, G; Pavan, L; Patarnello, T; Bargelloni, L
2015-02-01
The development of new resources to evaluate the environmental status is becoming increasingly important representing a key challenge for ocean and coastal management. Recently, the employment of transcriptomics in aquatic toxicology has led to increasing initiatives proposing to integrate eco-toxicogenomics in the evaluation of marine ecosystem health. However, several technical issues need to be addressed before introducing genomics as a reliable tool in regulatory ecotoxicology. The Venice lagoon constitutes an excellent case, in which the assessment of environmental risks derived from the nearby industrial activities represents a crucial task. In this context, the potential role of genomics to assist environmental monitoring was investigated through the definition of reliable gene expression markers associated to chemical contamination in Manila clams, and their subsequent employment for the classification of Venice lagoon areas. Overall, the present study addresses key issues to evaluate the future outlooks of genomics in the environmental monitoring and risk assessment. Copyright © 2014 Elsevier Ltd. All rights reserved.
Xylella fastidiosa gene expression analysis by DNA microarrays.
Travensolo, Regiane F; Carareto-Alves, Lucia M; Costa, Maria V C G; Lopes, Tiago J S; Carrilho, Emanuel; Lemos, Eliana G M
2009-04-01
Xylella fastidiosa genome sequencing has generated valuable data by identifying genes acting either on metabolic pathways or in associated pathogenicity and virulence. Based on available information on these genes, new strategies for studying their expression patterns, such as microarray technology, were employed. A total of 2,600 primer pairs were synthesized and then used to generate fragments using the PCR technique. The arrays were hybridized against cDNAs labeled during reverse transcription reactions and which were obtained from bacteria grown under two different conditions (liquid XDM(2) and liquid BCYE). All data were statistically analyzed to verify which genes were differentially expressed. In addition to exploring conditions for X. fastidiosa genome-wide transcriptome analysis, the present work observed the differential expression of several classes of genes (energy, protein, amino acid and nucleotide metabolism, transport, degradation of substances, toxins and hypothetical proteins, among others). The understanding of expressed genes in these two different media will be useful in comprehending the metabolic characteristics of X. fastidiosa, and in evaluating how important certain genes are for the functioning and survival of these bacteria in plants.
Gao, Xue-Ke; Zhang, Shuai; Luo, Jun-Yu; Wang, Chun-Yi; Lü, Li-Min; Zhang, Li-Juan; Zhu, Xiang-Zhen; Wang, Li; Lu, Hui; Cui, Jin-Jie
2017-12-30
Lysiphlebia japonica (Ashmead) is a predominant parasitoid of cotton-melon aphids in the fields of northern China with a proven ability to effectively control cotton aphid populations in early summer. For accurate normalization of gene expression in L. japonica using quantitative reverse transcriptase-polymerase chain reaction (RT-qPCR), reference genes with stable gene expression patterns are essential. However, no appropriate reference genes is L. japonica have been investigated to date. In the present study, 12 selected housekeeping genes from L. japonica were cloned. We evaluated the stability of these genes under various experimental treatments by RT-qPCR using four independent (geNorm, NormFinder, BestKeeper and Delta Ct) and one comparative (RefFinder) algorithm. We identified genes showing the most stable levels of expression: DIMT, 18S rRNA, and RPL13 during different stages; AK, RPL13, and TBP among sexes; EF1A, PPI, and RPL27 in different tissues, and EF1A, RPL13, and PPI in adults fed on different diets. Moreover, the expression profile of a target gene (odorant receptor 1, OR1) studied during the developmental stages confirms the reliability of the chosen selected reference genes. This study provides for the first time a comprehensive list of suitable reference genes for gene expression studies in L. japonica and will benefit subsequent genomics and functional genomics research on this natural enemy. Copyright © 2017. Published by Elsevier B.V.
Franco, Sulamita de Freitas; Baroni, Renata Moro; Carazzolle, Marcelo Falsarella; Teixeira, Paulo José Pereira Lima; Reis, Osvaldo; Pereira, Gonçalo Amarante Guimarães; Mondego, Jorge Maurício Costa
2015-10-30
Thaumatin-like proteins (TLPs) are found in diverse eukaryotes. Plant TLPs, known as Pathogenicity Related Protein (PR-5), are considered fungal inhibitors. However, genes encoding TLPs are frequently found in fungal genomes. In this work, we have identified that Moniliophthora perniciosa, a basidiomycete pathogen that causes the Witches' Broom Disease (WBD) of cacao, presents thirteen putative TLPs from which four are expressed during WBD progression. One of them is similar to small TLPs, which are present in phytopathogenic basidiomycete, such as wheat stem rust fungus Puccinia graminis. Fungi genomes annotation and phylogenetic data revealed a larger number of TLPs in basidiomycetes when comparing with ascomycetes, suggesting that these proteins could be involved in specific traits of mushroom-forming species. Based on the present data, we discuss the contribution of TLPs in the combat against fungal competitors and hypothesize a role of these proteins in M. perniciosa pathogenicity. Copyright © 2015 Elsevier Inc. All rights reserved.
Sapriel, Guillaume; Quinet, Michelle; Heijde, Marc; Jourdren, Laurent; Tanty, Véronique; Luo, Guangzuo; Le Crom, Stéphane; Lopez, Pascal Jean
2009-01-01
Background Diatoms are largely responsible for production of biogenic silica in the global ocean. However, in surface seawater, Si(OH)4 can be a major limiting factor for diatom productivity. Analyzing at the global scale the genes networks involved in Si transport and metabolism is critical in order to elucidate Si biomineralization, and to understand diatoms contribution to biogeochemical cycles. Methodology/Principal Findings Using whole genome expression analyses we evaluated the transcriptional response to Si availability for the model species Phaeodactylum tricornutum. Among the differentially regulated genes we found genes involved in glutamine-nitrogen pathways, encoding putative extracellular matrix components, or involved in iron regulation. Some of these compounds may be good candidates for intracellular intermediates involved in silicic acid storage and/or intracellular transport, which are very important processes that remain mysterious in diatoms. Expression analyses and localization studies gave the first picture of the spatial distribution of a silicic acid transporter in a diatom model species, and support the existence of transcriptional and post-transcriptional regulations. Conclusions/Significance Our global analyses revealed that about one fourth of the differentially expressed genes are organized in clusters, underlying a possible evolution of P. tricornutum genome, and perhaps other pennate diatoms, toward a better optimization of its response to variable environmental stimuli. High fitness and adaptation of diatoms to various Si levels in marine environments might arise in part by global regulations from gene (expression level) to genomic (organization in clusters, dosage compensation by gene duplication), and by post-transcriptional regulation and spatial distribution of SIT proteins. PMID:19829693
2012-01-01
Background Alteration in gene expression resulting from allopolyploidization is a prominent feature in plants, but its spectrum and extent are not fully known. Common wheat (Triticum aestivum) was formed via allohexaploidization about 10,000 years ago, and became the most important crop plant. To gain further insights into the genome-wide transcriptional dynamics associated with the onset of common wheat formation, we conducted microarray-based genome-wide gene expression analysis on two newly synthesized allohexaploid wheat lines with chromosomal stability and a genome constitution analogous to that of the present-day common wheat. Results Multi-color GISH (genomic in situ hybridization) was used to identify individual plants from two nascent allohexaploid wheat lines between Triticum turgidum (2n = 4x = 28; genome BBAA) and Aegilops tauschii (2n = 2x = 14; genome DD), which had a stable chromosomal constitution analogous to that of common wheat (2n = 6x = 42; genome BBAADD). Genome-wide analysis of gene expression was performed for these allohexaploid lines along with their parental plants from T. turgidum and Ae. tauschii, using the Affymetrix Gene Chip Wheat Genome-Array. Comparison with the parental plants coupled with inclusion of empirical mid-parent values (MPVs) revealed that whereas the great majority of genes showed the expected parental additivity, two major patterns of alteration in gene expression in the allohexaploid lines were identified: parental dominance expression and non-additive expression. Genes involved in each of the two altered expression patterns could be classified into three distinct groups, stochastic, heritable and persistent, based on their transgenerational heritability and inter-line conservation. Strikingly, whereas both altered patterns of gene expression showed a propensity of inheritance, identity of the involved genes was highly stochastic, consistent with the involvement of diverse Gene Ontology (GO) terms. Nonetheless, those genes showing non-additive expression exhibited a significant enrichment for vesicle-function. Conclusions Our results show that two patterns of global alteration in gene expression are conditioned by allohexaploidization in wheat, that is, parental dominance expression and non-additive expression. Both altered patterns of gene expression but not the identity of the genes involved are likely to play functional roles in stabilization and establishment of the newly formed allohexaploid plants, and hence, relevant to speciation and evolution of T. aestivum. PMID:22277161
Global Genetic Response in a Cancer Cell: Self-Organized Coherent Expression Dynamics
Tsuchiya, Masa; Hashimoto, Midori; Takenaka, Yoshiko; Motoike, Ikuko N.; Yoshikawa, Kenichi
2014-01-01
Understanding the basic mechanism of the spatio-temporal self-control of genome-wide gene expression engaged with the complex epigenetic molecular assembly is one of major challenges in current biological science. In this study, the genome-wide dynamical profile of gene expression was analyzed for MCF-7 breast cancer cells induced by two distinct ErbB receptor ligands: epidermal growth factor (EGF) and heregulin (HRG), which drive cell proliferation and differentiation, respectively. We focused our attention to elucidate how global genetic responses emerge and to decipher what is an underlying principle for dynamic self-control of genome-wide gene expression. The whole mRNA expression was classified into about a hundred groups according to the root mean square fluctuation (rmsf). These expression groups showed characteristic time-dependent correlations, indicating the existence of collective behaviors on the ensemble of genes with respect to mRNA expression and also to temporal changes in expression. All-or-none responses were observed for HRG and EGF (biphasic statistics) at around 10–20 min. The emergence of time-dependent collective behaviors of expression occurred through bifurcation of a coherent expression state (CES). In the ensemble of mRNA expression, the self-organized CESs reveals distinct characteristic expression domains for biphasic statistics, which exhibits notably the presence of criticality in the expression profile as a route for genomic transition. In time-dependent changes in the expression domains, the dynamics of CES reveals that the temporal development of the characteristic domains is characterized as autonomous bistable switch, which exhibits dynamic criticality (the temporal development of criticality) in the genome-wide coherent expression dynamics. It is expected that elucidation of the biophysical origin for such critical behavior sheds light on the underlying mechanism of the control of whole genome. PMID:24831017
Comparative Bacterial Proteomics: Analysis of the Core Genome Concept
Callister, Stephen J.; McCue, Lee Ann; Turse, Joshua E.; Monroe, Matthew E.; Auberry, Kenneth J.; Smith, Richard D.; Adkins, Joshua N.; Lipton, Mary S.
2008-01-01
While comparative bacterial genomic studies commonly predict a set of genes indicative of common ancestry, experimental validation of the existence of this core genome requires extensive measurement and is typically not undertaken. Enabled by an extensive proteome database developed over six years, we have experimentally verified the expression of proteins predicted from genomic ortholog comparisons among 17 environmental and pathogenic bacteria. More exclusive relationships were observed among the expressed protein content of phenotypically related bacteria, which is indicative of the specific lifestyles associated with these organisms. Although genomic studies can establish relative orthologous relationships among a set of bacteria and propose a set of ancestral genes, our proteomics study establishes expressed lifestyle differences among conserved genes and proposes a set of expressed ancestral traits. PMID:18253490
Evaluation of microRNA alignment techniques
Kaspi, Antony; El-Osta, Assam
2016-01-01
Genomic alignment of small RNA (smRNA) sequences such as microRNAs poses considerable challenges due to their short length (∼21 nucleotides [nt]) as well as the large size and complexity of plant and animal genomes. While several tools have been developed for high-throughput mapping of longer mRNA-seq reads (>30 nt), there are few that are specifically designed for mapping of smRNA reads including microRNAs. The accuracy of these mappers has not been systematically determined in the case of smRNA-seq. In addition, it is unknown whether these aligners accurately map smRNA reads containing sequence errors and polymorphisms. By using simulated read sets, we determine the alignment sensitivity and accuracy of 16 short-read mappers and quantify their robustness to mismatches, indels, and nontemplated nucleotide additions. These were explored in the context of a plant genome (Oryza sativa, ∼500 Mbp) and a mammalian genome (Homo sapiens, ∼3.1 Gbp). Analysis of simulated and real smRNA-seq data demonstrates that mapper selection impacts differential expression results and interpretation. These results will inform on best practice for smRNA mapping and enable more accurate smRNA detection and quantification of expression and RNA editing. PMID:27284164
Oud, Bart; Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T
2012-01-01
Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages. PMID:22152095
Oud, Bart; van Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T
2012-03-01
Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
Evaluation of xenobiotic-induced changes in gene expression as a method to identify and classify potential toxicants is being pursued by industry and regulatory agencies worldwide. A workshop was held at the Research Triangle Park campus of the Environmental Protection Agency to...
Ranganathan, Vinod; Wahlin, Karl; Maruotti, Julien; Zack, Donald J
2014-08-08
The repurposed CRISPR-Cas9 system has recently emerged as a revolutionary genome-editing tool. Here we report a modification in the expression of the guide RNA (gRNA) required for targeting that greatly expands the targetable genome. gRNA expression through the commonly used U6 promoter requires a guanosine nucleotide to initiate transcription, thus constraining genomic-targeting sites to GN19NGG. We demonstrate the ability to modify endogenous genes using H1 promoter-expressed gRNAs, which can be used to target both AN19NGG and GN19NGG genomic sites. AN19NGG sites occur ~15% more frequently than GN19NGG sites in the human genome and the increase in targeting space is also enriched at human genes and disease loci. Together, our results enhance the versatility of the CRISPR technology by more than doubling the number of targetable sites within the human genome and other eukaryotic species.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Kejian, E-mail: kejian.wang.bio@gmail.com; Weng, Zuquan; Sun, Liya
Adverse drug reaction (ADR) is of great importance to both regulatory agencies and the pharmaceutical industry. Various techniques, such as quantitative structure–activity relationship (QSAR) and animal toxicology, are widely used to identify potential risks during the preclinical stage of drug development. Despite these efforts, drugs with safety liabilities can still pass through safety checkpoints and enter the market. This situation raises the concern that conventional chemical structure analysis and phenotypic screening are not sufficient to avoid all clinical adverse events. Genomic expression data following in vitro drug treatments characterize drug actions and thus have become widely used in drug repositioning. Inmore » the present study, we explored prediction of ADRs based on the drug-induced gene-expression profiles from cultured human cells in the Connectivity Map (CMap) database. The results showed that drugs inducing comparable ADRs generally lead to similar CMap expression profiles. Based on such ADR-gene expression association, we established prediction models for various ADRs, including severe myocardial and infectious events. Drugs with FDA boxed warnings of safety liability were effectively identified. We therefore suggest that drug-induced gene expression change, in combination with effective computational methods, may provide a new dimension of information to facilitate systematic drug safety evaluation. - Highlights: • Drugs causing common toxicity lead to similar in vitro gene expression changes. • We built a model to predict drug toxicity with drug-specific expression profiles. • Drugs with FDA black box warnings were effectively identified by our model. • In vitro assay can detect severe toxicity in the early stage of drug development.« less
Sanzol, Javier
2010-05-14
Gene duplication is central to genome evolution. In plants, genes can be duplicated through small-scale events and large-scale duplications often involving polyploidy. The apple belongs to the subtribe Pyrinae (Rosaceae), a diverse lineage that originated via allopolyploidization. Both small-scale duplications and polyploidy may have been important mechanisms shaping the genome of this species. This study evaluates the gene duplication and polyploidy history of the apple by characterizing duplicated genes in this species using EST data. Overall, 68% of the apple genes were clustered into families with a mean copy-number of 4.6. Analysis of the age distribution of gene duplications supported a continuous mode of small-scale duplications, plus two episodes of large-scale duplicates of vastly different ages. The youngest was consistent with the polyploid origin of the Pyrinae 37-48 MYBP, whereas the older may be related to gamma-triplication; an ancient hexapolyploidization previously characterized in the four sequenced eurosid genomes and basal to the eurosid-asterid divergence. Duplicated genes were studied for functional diversification with an emphasis on young paralogs; those originated during or after the formation of the Pyrinae lineage. Unequal assignment of single-copy genes and gene families to Gene Ontology categories suggested functional bias in the pattern of gene retention of paralogs. Young paralogs related to signal transduction, metabolism, and energy pathways have been preferentially retained. Non-random retention of duplicated genes seems to have mediated the expansion of gene families, some of which may have substantially increased their members after the origin of the Pyrinae. The joint analysis of over-duplicated functional categories and phylogenies, allowed evaluation of the role of both polyploidy and small-scale duplications during this process. Finally, gene expression analysis indicated that 82% of duplicated genes, including 80% of young paralogs, showed uncorrelated expression profiles, suggesting extensive subfunctionalization and a role of gene duplication in the acquisition of novel patterns of gene expression. This study reports a genome-wide analysis of the mode of gene duplication in the apple, and provides evidence for its role in genome functional diversification by characterising three major processes: selective retention of paralogs, amplification of gene families, and changes in gene expression.
Chidambaranathan, Parameswaran; Jagannadham, Prasanth Tej Kumar; Satheesh, Viswanathan; Kohli, Deshika; Basavarajappa, Santosh Halasabala; Chellapilla, Bharadwaj; Kumar, Jitendra; Jain, Pradeep Kumar; Srinivasan, R
2018-05-01
The heat stress transcription factors (Hsfs) play a prominent role in thermotolerance and eliciting the heat stress response in plants. Identification and expression analysis of Hsfs gene family members in chickpea would provide valuable information on heat stress responsive Hsfs. A genome-wide analysis of Hsfs gene family resulted in the identification of 22 Hsf genes in chickpea in both desi and kabuli genome. Phylogenetic analysis distinctly separated 12 A, 9 B, and 1 C class Hsfs, respectively. An analysis of cis-regulatory elements in the upstream region of the genes identified many stress responsive elements such as heat stress elements (HSE), abscisic acid responsive element (ABRE) etc. In silico expression analysis showed nine and three Hsfs were also expressed in drought and salinity stresses, respectively. Q-PCR expression analysis of Hsfs under heat stress at pod development and at 15 days old seedling stage showed that CarHsfA2, A6, and B2 were significantly upregulated in both the stages of crop growth and other four Hsfs (CarHsfA2, A6a, A6c, B2a) showed early transcriptional upregulation for heat stress at seedling stage of chickpea. These subclasses of Hsfs identified in this study can be further evaluated as candidate genes in the characterization of heat stress response in chickpea.
Targeted Imaging of the Atypical Chemokine Receptor 3 (ACKR3/CXCR7) in Human Cancer Xenografts.
Behnam Azad, Babak; Lisok, Ala; Chatterjee, Samit; Poirier, John T; Pullambhatla, Mrudula; Luker, Gary D; Pomper, Martin G; Nimmagadda, Sridhar
2016-06-01
The atypical chemokine receptor ACKR3 (formerly CXCR7), overexpressed in various cancers compared with normal tissues, plays a pivotal role in adhesion, angiogenesis, tumorigenesis, metastasis, and tumor cell survival. ACKR3 modulates the tumor microenvironment and regulates tumor growth. The therapeutic potential of ACKR3 has also been demonstrated in various murine models of human cancer. Literature findings underscore the importance of ACKR3 in disease progression and suggest it as an important diagnostic marker for noninvasive imaging of ACKR3-overexpressing malignancies. There are currently no reports on direct receptor-specific detection of ACKR3 expression. Here we report the evaluation of a radiolabeled ACKR3-targeted monoclonal antibody (ACKR3-mAb) for the noninvasive in vivo nuclear imaging of ACKR3 expression in human breast, lung, and esophageal squamous cell carcinoma cancer xenografts. ACKR3 expression data were extracted from Cancer Cell Line Encyclopedia, The Cancer Genome Atlas, and the Clinical Lung Cancer Genome Project. (89)Zr-ACKR3-mAb was evaluated in vitro and subsequently in vivo by PET and ex vivo biodistribution studies in mice xenografted with breast (MDA-MB-231-ACKR3 [231-ACKR3], MDA-MB-231 [231], MCF7), lung (HCC95), or esophageal (KYSE520) cancer cells. In addition, ACKR3-mAb was radiolabeled with (125)I and evaluated by SPECT imaging and ex vivo biodistribution studies. ACKR3 transcript levels were highest in lung squamous cell carcinoma among the 21 cancer type data extracted from The Cancer Genome Atlas. Also, Clinical Lung Cancer Genome Project data showed that lung squamous cell carcinoma had the highest CXCR7 transcript levels compared with other lung cancer subtypes. The (89)Zr-ACKR3-mAb was produced in 80% ± 5% radiochemical yields with greater than 98% radiochemical purity. In vitro cell uptake of (89)Zr-ACKR3-mAb correlated with gradient levels of cell surface ACKR3 expression observed by flow cytometry. In vivo PET imaging and ex vivo biodistribution studies in mice with breast, lung, and esophageal cancer xenografts consistently showed enhanced (89)Zr-ACKR3-mAb uptake in high-ACKR3-expressing tumors. SPECT imaging of (125)I-ACKR3-mAb showed the versatility of ACKR3-mAb for in vivo monitoring of ACKR3 expression. Data from this study suggest ACKR3 to be a viable diagnostic marker and demonstrate the utility of radiolabeled ACKR3-mAb for in vivo visualization of ACKR3-overexpressing malignancies. © 2016 by the Society of Nuclear Medicine and Molecular Imaging, Inc.
Using expression genetics to study the neurobiology of ethanol and alcoholism.
Farris, Sean P; Wolen, Aaron R; Miles, Michael F
2010-01-01
Recent simultaneous progress in human and animal model genetics and the advent of microarray whole genome expression profiling have produced prodigious data sets on genetic loci, potential candidate genes, and differential gene expression related to alcoholism and ethanol behaviors. Validated target genes or gene networks functioning in alcoholism are still of meager proportions. Genetical genomics, which combines genetic analysis of both traditional phenotypes and whole genome expression data, offers a potential methodology for characterizing brain gene networks functioning in alcoholism. This chapter will describe concepts, approaches, and recent findings in the field of genetical genomics as it applies to alcohol research. Copyright 2010 Elsevier Inc. All rights reserved.
Experimental evidence supports a sex-specific selective sieve in mitochondrial genome evolution.
Innocenti, Paolo; Morrow, Edward H; Dowling, Damian K
2011-05-13
Mitochondria are maternally transmitted; hence, their genome can only make a direct and adaptive response to selection through females, whereas males represent an evolutionary dead end. In theory, this creates a sex-specific selective sieve, enabling deleterious mutations to accumulate in mitochondrial genomes if they exert male-specific effects. We tested this hypothesis, expressing five mitochondrial variants alongside a standard nuclear genome in Drosophila melanogaster, and found striking sexual asymmetry in patterns of nuclear gene expression. Mitochondrial polymorphism had few effects on nuclear gene expression in females but major effects in males, modifying nearly 10% of transcripts. These were mostly male-biased in expression, with enrichment hotspots in the testes and accessory glands. Our results suggest an evolutionary mechanism that results in mitochondrial genomes harboring male-specific mutation loads.
Parvovirus B19 DNA CpG Dinucleotide Methylation and Epigenetic Regulation of Viral Expression
Bonvicini, Francesca; Manaresi, Elisabetta; Di Furio, Francesca; De Falco, Luisa; Gallinella, Giorgio
2012-01-01
CpG DNA methylation is one of the main epigenetic modifications playing a role in the control of gene expression. For DNA viruses whose genome has the ability to integrate in the host genome or to maintain as a latent episome, a correlation has been found between the extent of DNA methylation and viral quiescence. No information is available for Parvovirus B19, a human pathogenic virus, which is capable of both lytic and persistent infections. Within Parvovirus B19 genome, the inverted terminal regions display all the characteristic signatures of a genomic CpG island; therefore we hypothesised a role of CpG dinucleotide methylation in the regulation of viral genome expression. The analysis of CpG dinucleotide methylation of Parvovirus B19 DNA was carried out by an aptly designed quantitative real-time PCR assay on bisulfite-modified DNA. The effects of CpG methylation on the regulation of viral genome expression were first investigated by transfection of either unmethylated or in vitro methylated viral DNA in a model cell line, showing that methylation of viral DNA was correlated to lower expression levels of the viral genome. Then, in the course of in vitro infections in different cellular environments, it was observed that absence of viral expression and genome replication were both correlated to increasing levels of CpG methylation of viral DNA. Finally, the presence of CpG methylation was documented in viral DNA present in bioptic samples, indicating the occurrence and a possible role of this epigenetic modification in the course of natural infections. The presence of an epigenetic level of regulation of viral genome expression, possibly correlated to the silencing of the viral genome and contributing to the maintenance of the virus in tissues, can be relevant to the balance and outcome of the different types of infection associated to Parvovirus B19. PMID:22413013
Schwaenen, Carsten; Nessling, Michelle; Wessendorf, Swen; Salvi, Tatjana; Wrobel, Gunnar; Radlwimmer, Bernhard; Kestler, Hans A.; Haslinger, Christian; Stilgenbauer, Stephan; Döhner, Hartmut; Bentz, Martin; Lichter, Peter
2004-01-01
B cell chronic lymphocytic leukemia (B-CLL) is characterized by a highly variable clinical course. Recurrent chromosomal imbalances provide significant prognostic markers. Risk-adapted therapy based on genomic alterations has become an option that is currently being tested in clinical trials. To supply a robust tool for such large scale studies, we developed a comprehensive DNA microarray dedicated to the automated analysis of recurrent genomic imbalances in B-CLL by array-based comparative genomic hybridization (matrix–CGH). Validation of this chip in a series of 106 B-CLL cases revealed a high specificity and sensitivity that fulfils the criteria for application in clinical oncology. This chip is immediately applicable within clinical B-CLL treatment trials that evaluate whether B-CLL cases with distinct chromosomal abnormalities should be treated with chemotherapy of different intensities and/or stem cell transplantation. Through the control set of DNA fragments equally distributed over the genome, recurrent genomic imbalances were discovered: trisomy of chromosome 19 and gain of the MYCN oncogene correlating with an elevation of MYCN mRNA expression. PMID:14730057
Tzelepi, Vassiliki; Grivas, Petros; Kefalopoulou, Zinovia; Kalofonos, Haralabos; Varakis, John N; Melachrinou, Maria; Sotiropoulou-Bonikou, Georgia
2009-04-01
Epidemiological and molecular data suggest the involvement of estrogen signaling in colorectal tissue, mediated mainly through estrogen receptor beta (ERbeta). Estrogens may mediate their effects in epithelial cells indirectly by acting on stromal cells. Expression of ERalpha, ERbeta1, and the ER coregulators, amplified in breast cancer-1 (AIB-1) and transcriptional intermediary factor 2 (TIF-2), was evaluated in myofibroblasts of 107 colorectal carcinomas, 77 paired samples of normal mucosa, and 29 adenomas by immunohistochemistry. Double immunostaining with a-SMA was used to identify the myofibroblasts of normal tissue, adenomas, and cancer microenvironment. ERalpha was not expressed in stromal cells. Nuclear expression of ERbeta1, AIB-1, and TIF-2 in myofibroblasts gradually increased from normal mucosa, through adenomas, to carcinomas. Cytoplasmic ERbeta1 and TIF-2 expression was enhanced in carcinomas compared to normal mucosa and adenomas. Enhanced nuclear and cytoplasmic ERbeta1 expression and elevated nuclear AIB-1 expression were more frequently noted in myofibroblasts of carcinomas of advanced stage. ERbeta1 expression in cancer-associated myofibroblasts correlated to AIB-1 and TIF-2 expression. None of the markers correlated with patients' prognosis. Our findings imply that ERbeta1-dependent (genomic and non-genomic) and ER-coregulator-dependent (AIB-1, TIF-2) signal transductions in myofibroblasts may be involved in the initiation and progression of colorectal carcinomas.
USDA-ARS?s Scientific Manuscript database
Dual luciferase reporter systems are valuable tools for functional genomic studies, but have not previously been developed for use in tick cell culture. We evaluated expression of available luciferase constructs in tick cell cultures derived from Rhipicephalus (Boophilus) microplus, an important vec...
Partial-genome evaluation of postweaning feed intake and efficiency of crossbred beef cattle
USDA-ARS?s Scientific Manuscript database
Effects of individual single nucleotide polymorphisms (SNP), and variation explained by sets of SNP associated with dry matter intake (DMI), metabolic mid-test weight (MBW), BW gain (GN) and feed efficiency expressed as phenotypic and genetic residual feed intake (RFIp; RFIg) were estimated from wei...
Evaluation of Androgen Receptor Function in Prostate Cancer Prognosis and Therapeutic Stratification
2012-10-01
Miettinen, Wang et al. 2011) (Braun, Goltz et al. 2011) (Rosen, Sesterhenn et al. 2012), rabbit polyclonal anti-PSA antibody (DAKO, A056201-2... Goltz , et al. (2011). "ERG protein expression and genomic rearrangement status in primary and metastatic prostate cancer - a comparative study of two
Genomic Expression Patterns in Menstrually-Related Migraine in Adolescents
Hershey, Andrew; Horn, Paul; Kabbouche, Marielle; O'Brien, Hope; Powers, Scott
2011-01-01
Background Exacerbation of migraine with menses is common in adolescent girls and women with migraine, occurring in up to 60% of females with migraine. These migraines are oftentimes longer and more disabling and may be related to estrogen levels and hormonal fluctuations. Objective This study identifies the unique genomic expression pattern of menstrually-related migraine (MRM) in comparison to migraine occurring outside the menstrual period and headache free controls. Methods Whole blood samples were obtained from female subjects having an acute migraine during their menstrual period (MRM) or outside of their menstrual period (nonMRM) and controls (C) – females having a menstrual period without any history of headache. The mRNA was isolated from these samples and genomic profile was assessed. Affymetrix Human Exon ST 1.0 arrays were used to examine the genomic expression pattern differences between these three groups. Results Blood genomic expression patterns were obtained on 56 subjects (MRM = 18, nonMRM = 18 and C = 20). Unique genomic expression patterns were observed for both MRM and nonMRM. For MRM, 77 genes were identified that were unique to MRM, while 61 genes were commonly expressed for MRM and nonMRM and 127 genes appeared to have a unique expression pattern for nonMRM. In addition, there were 279 genes that differentially expressed for MRM compared to nonMRM that were not differentially expressed for nonMRM. Gene ontology of these samples indicated many of these groups of genes were functionally related and included categories of immunomodulation/inflammation, mitochondrial function and DNA homeostasis. Conclusions Blood genomic patterns can accurately differentiate MRM from nonMRM. These results indicate that MRM involves a unique molecular biology pathway that can be identified with a specific biomarker and suggest that individuals with MRM have a different underlying genetic etiology. PMID:22220971
Does Simulated Spaceflight Modify Epigenetic Status During Bone Remodeling?
NASA Technical Reports Server (NTRS)
Thomas, Nicholas J.; Stevick, Rebecca J.; Tran, Luan H.; Nalavadi, Mohit O.; Almeida, Eduardo A.C.; Globus, Ruth K.; Alwood, Joshua S.
2015-01-01
Little is known about the effects of spaceflight conditions on epigenetics. The term epigenetics describes changes to the genome that can affect expression of a gene without changes to the sequence of DNA. Epigenetic processes are thought to underlie cellular differentiation, where transcription of specific genes occurs in response to key stimuli, and may be heritable - passing from one cell to its daughter cell. We hypothesize that the mechanical environment during spaceflight, namely microgravity-induced weightlessness or exercise regulate gene expression in the osteoblast-lineage cells both to control bone formation by osteoblasts and bone resorption by osteoclasts, which continually shapes bone structure throughout life. Similarly we intend to evaluate how radiation regulates these same bone cell activity and differentiation related genes. We further hypothesize that the regulation in bone cell gene expression is at least partially controlled through epigenetic mechanisms of methylation or small non-coding RNA (microRNAs). We have acquired preliminary data suggesting that global genome methylation is modified in response to axial compression of the tibia - a model of exercise. We intend to pursue these hypotheses wherein we will evaluate changes in gene expression and, congruently, changes in epigenetic state in bones from mice subjected to the aforementioned conditions: hindlimb unloading to simulate weightlessness, axial compression of the tibia, or radiation exposure in order to gain insight into the role of epigenetics in spaceflight-induced bone loss.
DNA methylome signature in rheumatoid arthritis.
Nakano, Kazuhisa; Whitaker, John W; Boyle, David L; Wang, Wei; Firestein, Gary S
2013-01-01
Epigenetics can influence disease susceptibility and severity. While DNA methylation of individual genes has been explored in autoimmunity, no unbiased systematic analyses have been reported. Therefore, a genome-wide evaluation of DNA methylation loci in fibroblast-like synoviocytes (FLS) isolated from the site of disease in rheumatoid arthritis (RA) was performed. Genomic DNA was isolated from six RA and five osteoarthritis (OA) FLS lines and evaluated using the Illumina HumanMethylation450 chip. Cluster analysis of data was performed and corrected using Benjamini-Hochberg adjustment for multiple comparisons. Methylation was confirmed by pyrosequencing and gene expression was determined by qPCR. Pathway analysis was performed using the Kyoto Encyclopedia of Genes and Genomes. RA and control FLS segregated based on DNA methylation, with 1859 differentially methylated loci. Hypomethylated loci were identified in key genes relevant to RA, such as CHI3L1, CASP1, STAT3, MAP3K5, MEFV and WISP3. Hypermethylation was also observed, including TGFBR2 and FOXO1. Hypomethylation of individual genes was associated with increased gene expression. Grouped analysis identified 207 hypermethylated or hypomethylated genes with multiple differentially methylated loci, including COL1A1, MEFV and TNF. Hypomethylation was increased in multiple pathways related to cell migration, including focal adhesion, cell adhesion, transendothelial migration and extracellular matrix interactions. Confirmatory studies with OA and normal FLS also demonstrated segregation of RA from control FLS based on methylation pattern. Differentially methylated genes could alter FLS gene expression and contribute to the pathogenesis of RA. DNA methylation of critical genes suggests that RA FLS are imprinted and implicate epigenetic contributions to inflammatory arthritis.
Knowledge-driven genomic interactions: an application in ovarian cancer.
Kim, Dokyoon; Li, Ruowang; Dudek, Scott M; Frase, Alex T; Pendergrass, Sarah A; Ritchie, Marylyn D
2014-01-01
Effective cancer clinical outcome prediction for understanding of the mechanism of various types of cancer has been pursued using molecular-based data such as gene expression profiles, an approach that has promise for providing better diagnostics and supporting further therapies. However, clinical outcome prediction based on gene expression profiles varies between independent data sets. Further, single-gene expression outcome prediction is limited for cancer evaluation since genes do not act in isolation, but rather interact with other genes in complex signaling or regulatory networks. In addition, since pathways are more likely to co-operate together, it would be desirable to incorporate expert knowledge to combine pathways in a useful and informative manner. Thus, we propose a novel approach for identifying knowledge-driven genomic interactions and applying it to discover models associated with cancer clinical phenotypes using grammatical evolution neural networks (GENN). In order to demonstrate the utility of the proposed approach, an ovarian cancer data from the Cancer Genome Atlas (TCGA) was used for predicting clinical stage as a pilot project. We identified knowledge-driven genomic interactions associated with cancer stage from single knowledge bases such as sources of pathway-pathway interaction, but also knowledge-driven genomic interactions across different sets of knowledge bases such as pathway-protein family interactions by integrating different types of information. Notably, an integration model from different sources of biological knowledge achieved 78.82% balanced accuracy and outperformed the top models with gene expression or single knowledge-based data types alone. Furthermore, the results from the models are more interpretable because they are framed in the context of specific biological pathways or other expert knowledge. The success of the pilot study we have presented herein will allow us to pursue further identification of models predictive of clinical cancer survival and recurrence. Understanding the underlying tumorigenesis and progression in ovarian cancer through the global view of interactions within/between different biological knowledge sources has the potential for providing more effective screening strategies and therapeutic targets for many types of cancer.
Su, Ling; Liu, Xin; Hao, Yujin
2013-01-01
The plant-specific LBD (LATERAL ORGAN BOUNDARIES domain) genes belong to a major family of transcription factor that encode a zinc finger-like domain. It has been shown that LBD genes play crucial roles in the growth and development of Arabidopsis and other plant species. However, no detailed information concerning this family is available for apple. In the present study, we analyzed the apple (Malus domestica) genome and identified 58 LBD genes. This gene family was tested for its phylogenetic relationships with homologous genes in the Arabidopsis genome, as well as its location in the genome, structure and expression. We also transformed one MdLBD gene into Arabidopsis to evaluate its function. Like Arabidopsis, apple LBD genes also have a conserved CX2CX6CX3C zinc finger-like domain in the N terminus and can be divided into two classes. The expression profile indicated that apple LBD genes exhibited a variety of expression patterns, suggesting that they have diverse functions. At the same time, the expression analysis implied that members of this apple gene family were responsive to hormones and stress and that they may participate in hormone-mediated plant organogenesis, which was demonstrated with the overexpression of the apple LBD gene MdLBD11, resulting in an abnormal phenotype. This phenotype included upward curling leaves, delayed flowering, downward-pointing flowers, siliques and other abnormal traits. Based on these data, we concluded that the MdLBD genes may play an important role in apple growth and development as in Arabidopsis and other species. PMID:23468909
Wang, Xiaofei; Zhang, Shizhong; Su, Ling; Liu, Xin; Hao, Yujin
2013-01-01
The plant-specific LBD (LATERAL ORGAN BOUNDARIES domain) genes belong to a major family of transcription factor that encode a zinc finger-like domain. It has been shown that LBD genes play crucial roles in the growth and development of Arabidopsis and other plant species. However, no detailed information concerning this family is available for apple. In the present study, we analyzed the apple (Malus domestica) genome and identified 58 LBD genes. This gene family was tested for its phylogenetic relationships with homologous genes in the Arabidopsis genome, as well as its location in the genome, structure and expression. We also transformed one MdLBD gene into Arabidopsis to evaluate its function. Like Arabidopsis, apple LBD genes also have a conserved CX2CX6CX3C zinc finger-like domain in the N terminus and can be divided into two classes. The expression profile indicated that apple LBD genes exhibited a variety of expression patterns, suggesting that they have diverse functions. At the same time, the expression analysis implied that members of this apple gene family were responsive to hormones and stress and that they may participate in hormone-mediated plant organogenesis, which was demonstrated with the overexpression of the apple LBD gene MdLBD11, resulting in an abnormal phenotype. This phenotype included upward curling leaves, delayed flowering, downward-pointing flowers, siliques and other abnormal traits. Based on these data, we concluded that the MdLBD genes may play an important role in apple growth and development as in Arabidopsis and other species.
2014-01-01
Background Genome-wide microarrays have been useful for predicting chemical-genetic interactions at the gene level. However, interpreting genome-wide microarray results can be overwhelming due to the vast output of gene expression data combined with off-target transcriptional responses many times induced by a drug treatment. This study demonstrates how experimental and computational methods can interact with each other, to arrive at more accurate predictions of drug-induced perturbations. We present a two-stage strategy that links microarray experimental testing and network training conditions to predict gene perturbations for a drug with a known mechanism of action in a well-studied organism. Results S. cerevisiae cells were treated with the antifungal, fluconazole, and expression profiling was conducted under different biological conditions using Affymetrix genome-wide microarrays. Transcripts were filtered with a formal network-based method, sparse simultaneous equation models and Lasso regression (SSEM-Lasso), under different network training conditions. Gene expression results were evaluated using both gene set and single gene target analyses, and the drug’s transcriptional effects were narrowed first by pathway and then by individual genes. Variables included: (i) Testing conditions – exposure time and concentration and (ii) Network training conditions – training compendium modifications. Two analyses of SSEM-Lasso output – gene set and single gene – were conducted to gain a better understanding of how SSEM-Lasso predicts perturbation targets. Conclusions This study demonstrates that genome-wide microarrays can be optimized using a two-stage strategy for a more in-depth understanding of how a cell manifests biological reactions to a drug treatment at the transcription level. Additionally, a more detailed understanding of how the statistical model, SSEM-Lasso, propagates perturbations through a network of gene regulatory interactions is achieved. PMID:24444313
Cui, Yi; Han, Jin; Xiao, Zhifeng; Qi, Yiduo; Zhao, Yannan; Chen, Bing; Fang, Yongxiang; Liu, Sumei; Wu, Xianming; Dai, Jianwu
2017-01-01
Recently, with the development of the space program there are growing concerns about the influence of spaceflight on tissue engineering. The purpose of this study was thus to determine the variations of neural stem cells (NSCs) during spaceflight. RNA-Sequencing (RNA-Seq) based transcriptomic profiling of NSCs identified many differentially expressed mRNAs and miRNAs between space and earth groups. Subsequently, those genes with differential expression were subjected to bioinformatic evaluation using gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes pathway (KEGG) and miRNA-mRNA network analyses. The results showed that NSCs maintain greater stemness ability during spaceflight although the growth rate of NSCs was slowed down. Furthermore, the results indicated that NSCs tended to differentiate into neuron in outer space conditions. Detailed genomic analyses of NSCs during spaceflight will help us to elucidate the molecular mechanisms behind their differentiation and proliferation when they are in outer space.
Gene therapies that restore dystrophin expression for the treatment of Duchenne muscular dystrophy
Robinson-Hamm, Jacqueline N.; Gersbach, Charles A.
2016-01-01
Duchenne muscular dystrophy is one of the most common inherited genetic diseases and is caused by mutations to the DMD gene that encodes the dystrophin protein. Recent advances in genome editing and gene therapy offer hope for the development of potential therapeutics. Truncated versions of the DMD gene can be delivered to the affected tissues with viral vectors and show promising results in a variety of animal models. Genome editing with the CRISPR/Cas9 system has recently been used to restore dystrophin expression by deleting one or more exons of the DMD gene in patient cells and in a mouse model that led to functional improvement of muscle strength. Exon skipping with oligonucleotides has been successful in several animal models and evaluated in multiple clinical trials. Next-generation oligonucleotide formulations offer significant promise to build on these results. All these approaches to restoring dystrophin expression are encouraging, but many hurdles remain. This review summarizes the current state of these technologies and summarizes considerations for their future development. PMID:27542949
Suppression of HBV replication by the expression of nickase- and nuclease dead-Cas9.
Kurihara, Takeshi; Fukuhara, Takasuke; Ono, Chikako; Yamamoto, Satomi; Uemura, Kentaro; Okamoto, Toru; Sugiyama, Masaya; Motooka, Daisuke; Nakamura, Shota; Ikawa, Masato; Mizokami, Masashi; Maehara, Yoshihiko; Matsuura, Yoshiharu
2017-07-21
Complete removal of hepatitis B virus (HBV) DNA from nuclei is difficult by the current therapies. Recent reports have shown that a novel genome-editing tool using Cas9 with a single-guide RNA (sgRNA) system can cleave the HBV genome in vitro and in vivo. However, induction of a double-strand break (DSB) on the targeted genome by Cas9 risks undesirable off-target cleavage on the host genome. Nickase-Cas9 cleaves a single strand of DNA, and thereby two sgRNAs are required for inducing DSBs. To avoid Cas9-induced off-target mutagenesis, we examined the effects of the expressions of nickase-Cas9 and nuclease dead Cas9 (d-Cas9) with sgRNAs on HBV replication. The expression of nickase-Cas9 with a pair of sgRNAs cleaved the target HBV genome and suppressed the viral-protein expression and HBV replication in vitro. Moreover, nickase-Cas9 with the sgRNA pair cleaved the targeted HBV genome in mouse liver. Interestingly, d-Cas9 expression with the sgRNAs also suppressed HBV replication in vitro without cleaving the HBV genome. These results suggest the possible use of nickase-Cas9 and d-Cas9 with a pair of sgRNAs for eliminating HBV DNA from the livers of chronic hepatitis B patients with low risk of undesirable off-target mutation on the host genome.
Intergenic disease-associated regions are abundant in novel transcripts.
Bartonicek, N; Clark, M B; Quek, X C; Torpy, J R; Pritchard, A L; Maag, J L V; Gloss, B S; Crawford, J; Taft, R J; Hayward, N K; Montgomery, G W; Mattick, J S; Mercer, T R; Dinger, M E
2017-12-28
Genotyping of large populations through genome-wide association studies (GWAS) has successfully identified many genomic variants associated with traits or disease risk. Unexpectedly, a large proportion of GWAS single nucleotide polymorphisms (SNPs) and associated haplotype blocks are in intronic and intergenic regions, hindering their functional evaluation. While some of these risk-susceptibility regions encompass cis-regulatory sites, their transcriptional potential has never been systematically explored. To detect rare tissue-specific expression, we employed the transcript-enrichment method CaptureSeq on 21 human tissues to identify 1775 multi-exonic transcripts from 561 intronic and intergenic haploblocks associated with 392 traits and diseases, covering 73.9 Mb (2.2%) of the human genome. We show that a large proportion (85%) of disease-associated haploblocks express novel multi-exonic non-coding transcripts that are tissue-specific and enriched for GWAS SNPs as well as epigenetic markers of active transcription and enhancer activity. Similarly, we captured transcriptomes from 13 melanomas, targeting nine melanoma-associated haploblocks, and characterized 31 novel melanoma-specific transcripts that include fusion proteins, novel exons and non-coding RNAs, one-third of which showed allelically imbalanced expression. This resource of previously unreported transcripts in disease-associated regions ( http://gwas-captureseq.dingerlab.org ) should provide an important starting point for the translational community in search of novel biomarkers, disease mechanisms, and drug targets.
Cognitive Endophenotypes Inform Genome-Wide Expression Profiling in Schizophrenia
Zheutlin, Amanda B.; Viehman, Rachael W.; Fortgang, Rebecca; Borg, Jacqueline; Smith, Desmond J.; Suvisaari, Jaana; Therman, Sebastian; Hultman, Christina M.; Cannon, Tyrone D.
2015-01-01
OBJECTIVE We performed a whole-genome expression study to clarify the nature of the biological processes mediating between inherited genetic variations and cognitive dysfunction in schizophrenia. METHOD Gene expression was assayed from peripheral blood mononuclear cells using Illumina Human WG6 v3.0 chips in twins discordant for schizophrenia or bipolar disorder and control twins. After quality control, expression levels of 18,559 genes were screened for association with California Verbal Learning Test (CVLT) performance, and any memory-related probes were then evaluated for variation by diagnostic status in the discovery sample (N = 190), and in an independent replication sample (N = 73). Heritability of gene expression using the twin design was also assessed. RESULTS After Bonferroni correction (p < 2.69 × 10−6), CVLT performance was significantly related to expression levels for 76 genes, 43 of which were differentially expressed in schizophrenia patients, with comparable effect sizes in the same direction in the replication sample. For 41 of these 43 transcripts, expression levels were heritable. Nearly all identified genes contain common or de novo mutations associated with schizophrenia in prior studies. CONCLUSION Genes increasing risk for schizophrenia appear to do so in part via effects on signaling cascades influencing memory. The genes implicated in these processes are enriched for those related to RNA processing and DNA replication and include genes influencing G-protein coupled signal transduction, cytokine signaling, and oligodendrocyte function. PMID:26710095
Cognitive endophenotypes inform genome-wide expression profiling in schizophrenia.
Zheutlin, Amanda B; Viehman, Rachael W; Fortgang, Rebecca; Borg, Jacqueline; Smith, Desmond J; Suvisaari, Jaana; Therman, Sebastian; Hultman, Christina M; Cannon, Tyrone D
2016-01-01
We performed a whole-genome expression study to clarify the nature of the biological processes mediating between inherited genetic variations and cognitive dysfunction in schizophrenia. Gene expression was assayed from peripheral blood mononuclear cells using Illumina Human WG6 v3.0 chips in twins discordant for schizophrenia or bipolar disorder and control twins. After quality control, expression levels of 18,559 genes were screened for association with the California Verbal Learning Test (CVLT) performance, and any memory-related probes were then evaluated for variation by diagnostic status in the discovery sample (N = 190), and in an independent replication sample (N = 73). Heritability of gene expression using the twin design was also assessed. After Bonferroni correction (p < 2.69 × 10-6), CVLT performance was significantly related to expression levels for 76 genes, 43 of which were differentially expressed in schizophrenia patients, with comparable effect sizes in the same direction in the replication sample. For 41 of these 43 transcripts, expression levels were heritable. Nearly all identified genes contain common or de novo mutations associated with schizophrenia in prior studies. Genes increasing risk for schizophrenia appear to do so in part via effects on signaling cascades influencing memory. The genes implicated in these processes are enriched for those related to RNA processing and DNA replication and include genes influencing G-protein coupled signal transduction, cytokine signaling, and oligodendrocyte function. (c) 2015 APA, all rights reserved).
Integrated analysis of chromosome copy number variation and gene expression in cervical carcinoma
Yan, Deng; Yi, Song; Chiu, Wang Chi; Qin, Liu Gui; Kin, Wong Hoi; Kwok Hung, Chung Tony; Linxiao, Han; Wai, Choy Kwong; Yi, Sui; Tao, Yang; Tao, Tang
2017-01-01
Objective This study was conducted to explore chromosomal copy number variations (CNV) and transcript expression and to examine pathways in cervical pathogenesis using genome-wide high resolution microarrays. Methods Genome-wide chromosomal CNVs were investigated in 6 cervical cancer cell lines by Human Genome CGH Microarray Kit (4x44K). Gene expression profiles in cervical cancer cell lines, primary cervical carcinoma and normal cervical epithelium tissues were also studied using the Whole Human Genome Microarray Kit (4x44K). Results Fifty common chromosomal CNVs were identified in the cervical cancer cell lines. Correlation analysis revealed that gene up-regulation or down-regulation is significantly correlated with genomic amplification (P=0.009) or deletion (P=0.006) events. Expression profiles were identified through cluster analysis. Gene annotation analysis pinpointed cell cycle pathways was significantly (P=1.15E-08) affected in cervical cancer. Common CNVs were associated with cervical cancer. Conclusion Chromosomal CNVs may contribute to their transcript expression in cervical cancer. PMID:29312578
Integrated analysis of chromosome copy number variation and gene expression in cervical carcinoma.
Yan, Deng; Yi, Song; Chiu, Wang Chi; Qin, Liu Gui; Kin, Wong Hoi; Kwok Hung, Chung Tony; Linxiao, Han; Wai, Choy Kwong; Yi, Sui; Tao, Yang; Tao, Tang
2017-12-12
This study was conducted to explore chromosomal copy number variations (CNV) and transcript expression and to examine pathways in cervical pathogenesis using genome-wide high resolution microarrays. Genome-wide chromosomal CNVs were investigated in 6 cervical cancer cell lines by Human Genome CGH Microarray Kit (4x44K). Gene expression profiles in cervical cancer cell lines, primary cervical carcinoma and normal cervical epithelium tissues were also studied using the Whole Human Genome Microarray Kit (4x44K). Fifty common chromosomal CNVs were identified in the cervical cancer cell lines. Correlation analysis revealed that gene up-regulation or down-regulation is significantly correlated with genomic amplification ( P =0.009) or deletion ( P =0.006) events. Expression profiles were identified through cluster analysis. Gene annotation analysis pinpointed cell cycle pathways was significantly ( P =1.15E-08) affected in cervical cancer. Common CNVs were associated with cervical cancer. Chromosomal CNVs may contribute to their transcript expression in cervical cancer.
GermOnline 4.0 is a genomics gateway for germline development, meiosis and the mitotic cell cycle.
Lardenois, Aurélie; Gattiker, Alexandre; Collin, Olivier; Chalmel, Frédéric; Primig, Michael
2010-01-01
GermOnline 4.0 is a cross-species database portal focusing on high-throughput expression data relevant for germline development, the meiotic cell cycle and mitosis in healthy versus malignant cells. It is thus a source of information for life scientists as well as clinicians who are interested in gene expression and regulatory networks. The GermOnline gateway provides unlimited access to information produced with high-density oligonucleotide microarrays (3'-UTR GeneChips), genome-wide protein-DNA binding assays and protein-protein interaction studies in the context of Ensembl genome annotation. Samples used to produce high-throughput expression data and to carry out genome-wide in vivo DNA binding assays are annotated via the MIAME-compliant Multiomics Information Management and Annotation System (MIMAS 3.0). Furthermore, the Saccharomyces Genomics Viewer (SGV) was developed and integrated into the gateway. SGV is a visualization tool that outputs genome annotation and DNA-strand specific expression data produced with high-density oligonucleotide tiling microarrays (Sc_tlg GeneChips) which cover the complete budding yeast genome on both DNA strands. It facilitates the interpretation of expression levels and transcript structures determined for various cell types cultured under different growth and differentiation conditions. Database URL: www.germonline.org/
GermOnline 4.0 is a genomics gateway for germline development, meiosis and the mitotic cell cycle
Lardenois, Aurélie; Gattiker, Alexandre; Collin, Olivier; Chalmel, Frédéric; Primig, Michael
2010-01-01
GermOnline 4.0 is a cross-species database portal focusing on high-throughput expression data relevant for germline development, the meiotic cell cycle and mitosis in healthy versus malignant cells. It is thus a source of information for life scientists as well as clinicians who are interested in gene expression and regulatory networks. The GermOnline gateway provides unlimited access to information produced with high-density oligonucleotide microarrays (3′-UTR GeneChips), genome-wide protein–DNA binding assays and protein–protein interaction studies in the context of Ensembl genome annotation. Samples used to produce high-throughput expression data and to carry out genome-wide in vivo DNA binding assays are annotated via the MIAME-compliant Multiomics Information Management and Annotation System (MIMAS 3.0). Furthermore, the Saccharomyces Genomics Viewer (SGV) was developed and integrated into the gateway. SGV is a visualization tool that outputs genome annotation and DNA-strand specific expression data produced with high-density oligonucleotide tiling microarrays (Sc_tlg GeneChips) which cover the complete budding yeast genome on both DNA strands. It facilitates the interpretation of expression levels and transcript structures determined for various cell types cultured under different growth and differentiation conditions. Database URL: www.germonline.org/ PMID:21149299
Orthogonal control of expression mean and variance by epigenetic features at different genomic loci
Dey, Siddharth S.; Foley, Jonathan E.; Limsirichai, Prajit; ...
2015-05-05
While gene expression noise has been shown to drive dramatic phenotypic variations, the molecular basis for this variability in mammalian systems is not well understood. Gene expression has been shown to be regulated by promoter architecture and the associated chromatin environment. However, the exact contribution of these two factors in regulating expression noise has not been explored. Using a dual-reporter lentiviral model system, we deconvolved the influence of the promoter sequence to systematically study the contribution of the chromatin environment at different genomic locations in regulating expression noise. By integrating a large-scale analysis to quantify mRNA levels by smFISH andmore » protein levels by flow cytometry in single cells, we found that mean expression and noise are uncorrelated across genomic locations. Furthermore, we showed that this independence could be explained by the orthogonal control of mean expression by the transcript burst size and noise by the burst frequency. Finally, we showed that genomic locations displaying higher expression noise are associated with more repressed chromatin, thereby indicating the contribution of the chromatin environment in regulating expression noise.« less
Prognostic significance of FAM83D gene expression across human cancer types
Walian, Peter J.; Hang, Bo; Mao, Jian-Hua
2015-12-15
The family with sequence similarity 83, member D (FAM83D) gene has been proposed as a new prognostic marker for breast cancer. In this work, we further evaluate the prognostic significance of FAM83D expression in different breast cancer subtypes using a meta-analysis. Patients with higher FAM83D mRNA levels have significantly decreased overall and metastatic relapse-free survival, particularly in the group of patients with ER-positive, or luminal subtype tumors. We also assessed FAM83D alterations and its prognostic significance across 22 human cancer types using The Cancer Genome Atlas (TCGA). FAM83D is frequently gained in the majority of human cancer types, resulting inmore » the elevated expression of FAM83D. Higher levels of FAM83D mRNA expression are significantly associated with decreased overall survival in several cancer types. Finally, we demonstrate that TP53 mutation in human cancers is coupled to a significant increase in the expression of FAM83D, and that a higher level of FAM83D expression is positively correlated with an increase in genome instability in many cancer types. These results identify FAM83D as a potential novel oncogene across multiple human cancer types.« less
Tao, Xiang; Lai, Xian-Jun; Zhang, Yi-Zheng; Tan, Xue-Mei; Wang, Haiyan
2014-01-01
Background Transposable elements (TEs) are the most abundant genomic components in eukaryotes and affect the genome by their replications and movements to generate genetic plasticity. Sweet potato performs asexual reproduction generally and the TEs may be an important genetic factor for genome reorganization. Complete identification of TEs is essential for the study of genome evolution. However, the TEs of sweet potato are still poorly understood because of its complex hexaploid genome and difficulty in genome sequencing. The recent availability of the sweet potato transcriptome databases provides an opportunity for discovering and characterizing the expressed TEs. Methodology/Principal Findings We first established the integrated-transcriptome database by de novo assembling four published sweet potato transcriptome databases from three cultivars in China. Using sequence-similarity search and analysis, a total of 1,405 TEs including 883 retrotransposons and 522 DNA transposons were predicted and categorized. Depending on mapping sets of RNA-Seq raw short reads to the predicted TEs, we compared the quantities, classifications and expression activities of TEs inter- and intra-cultivars. Moreover, the differential expressions of TEs in seven tissues of Xushu 18 cultivar were analyzed by using Illumina digital gene expression (DGE) tag profiling. It was found that 417 TEs were expressed in one or more tissues and 107 in all seven tissues. Furthermore, the copy number of 11 transposase genes was determined to be 1–3 copies in the genome of sweet potato by Real-time PCR-based absolute quantification. Conclusions/Significance Our result provides a new method for TE searching on species with transcriptome sequences while lacking genome information. The searching, identification and expression analysis of TEs will provide useful TE information in sweet potato, which are valuable for the further studies of TE-mediated gene mutation and optimization in asexual reproduction. It contributes to elucidating the roles of TEs in genome evolution. PMID:24608103
Tissue-specific NETs alter genome organization and regulation even in a heterologous system.
de Las Heras, Jose I; Zuleger, Nikolaj; Batrakou, Dzmitry G; Czapiewski, Rafal; Kerr, Alastair R W; Schirmer, Eric C
2017-01-02
Different cell types exhibit distinct patterns of 3D genome organization that correlate with changes in gene expression in tissue and differentiation systems. Several tissue-specific nuclear envelope transmembrane proteins (NETs) have been found to influence the spatial positioning of genes and chromosomes that normally occurs during tissue differentiation. Here we study 3 such NETs: NET29, NET39, and NET47, which are expressed preferentially in fat, muscle and liver, respectively. We found that even when exogenously expressed in a heterologous system they can specify particular genome organization patterns and alter gene expression. Each NET affected largely different subsets of genes. Notably, the liver-specific NET47 upregulated many genes in HT1080 fibroblast cells that are normally upregulated in hepatogenesis, showing that tissue-specific NETs can favor expression patterns associated with the tissue where the NET is normally expressed. Similarly, global profiling of peripheral chromatin after exogenous expression of these NETs using lamin B1 DamID revealed that each NET affected the nuclear positioning of distinct sets of genomic regions with a significant tissue-specific component. Thus NET influences on genome organization can contribute to gene expression changes associated with differentiation even in the absence of other factors and overt cellular differentiation changes.
TEcandidates: Prediction of genomic origin of expressed Transposable Elements using RNA-seq data.
Valdebenito-Maturana, Braulio; Riadi, Gonzalo
2018-06-01
In recent years, Transposable Elements (TEs) have been related to gene regulation. However, estimating the origin of expression of TEs through RNA-seq is complicated by multimapping reads coming from their repetitive sequences. Current approaches that address multimapping reads are focused in expression quantification and not in finding the origin of expression. Addressing the genomic origin of expressed TEs could further aid in understanding the role that TEs might have in the cell. We have developed a new pipeline called TEcandidates, based on de novo transcriptome assembly to assess the instances of TEs being expressed, along with their location, to include in downstream DE analysis. TEcandidates takes as input the RNA-seq data, the genome sequence and the TE annotation file, and returns a list of coordinates of candidate TEs being expressed, the TEs that have been removed, and the genome sequence with removed TEs as masked. This masked genome is suited to include TEs in downstream expression analysis, as the ambiguity of reads coming from TEs is significantly reduced in the mapping step of the analysis. The script which runs the pipeline can be downloaded at http://www.mobilomics.org/tecandidates/downloads or http://github.com/TEcandidates/TEcandidates. griadi@utalca.cl. Supplementary data are available at Bioinformatics online.
Comparative Transcriptomes and EVO-DEVO Studies Depending on Next Generation Sequencing.
Liu, Tiancheng; Yu, Lin; Liu, Lei; Li, Hong; Li, Yixue
2015-01-01
High throughput technology has prompted the progressive omics studies, including genomics and transcriptomics. We have reviewed the improvement of comparative omic studies, which are attributed to the high throughput measurement of next generation sequencing technology. Comparative genomics have been successfully applied to evolution analysis while comparative transcriptomics are adopted in comparison of expression profile from two subjects by differential expression or differential coexpression, which enables their application in evolutionary developmental biology (EVO-DEVO) studies. EVO-DEVO studies focus on the evolutionary pressure affecting the morphogenesis of development and previous works have been conducted to illustrate the most conserved stages during embryonic development. Old measurements of these studies are based on the morphological similarity from macro view and new technology enables the micro detection of similarity in molecular mechanism. Evolutionary model of embryo development, which includes the "funnel-like" model and the "hourglass" model, has been evaluated by combination of these new comparative transcriptomic methods with prior comparative genomic information. Although the technology has promoted the EVO-DEVO studies into a new era, technological and material limitation still exist and further investigations require more subtle study design and procedure.
Directed combinatorial mutagenesis of Escherichia coli for complex phenotype engineering
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Rongming; Liang, Liya; Garst, Andrew D.
Strain engineering for industrial production requires a targeted improvement of multiple complex traits, which range from pathway flux to tolerance to mixed sugar utilization. Here, we report the use of an iterative CRISPR EnAbled Trackable genome Engineering (iCREATE) method to engineer rapid glucose and xylose co-consumption and tolerance to hydrolysate inhibitors in E. coli. Deep mutagenesis libraries were rationally designed, constructed, and screened to target ~40,000 mutations across 30 genes. These libraries included global and high-level regulators that regulate global gene expression, transcription factors that play important roles in genome-level transcription, enzymes that function in the sugar transport system, NAD(P)Hmore » metabolism, and the aldehyde reduction system. Specific mutants that conferred increased growth in mixed sugars and hydrolysate tolerance conditions were isolated, confirmed, and evaluated for changes in genome-wide expression levels. As a result, we tested the strain with positive combinatorial mutations for 3-hydroxypropionic acid (3HP) production under high furfural and high acetate hydrolysate fermentation, which demonstrated a 7- and 8-fold increase in 3HP productivity relative to the parent strain, respectively.« less
Directed combinatorial mutagenesis of Escherichia coli for complex phenotype engineering
Liu, Rongming; Liang, Liya; Garst, Andrew D.; ...
2018-03-29
Strain engineering for industrial production requires a targeted improvement of multiple complex traits, which range from pathway flux to tolerance to mixed sugar utilization. Here, we report the use of an iterative CRISPR EnAbled Trackable genome Engineering (iCREATE) method to engineer rapid glucose and xylose co-consumption and tolerance to hydrolysate inhibitors in E. coli. Deep mutagenesis libraries were rationally designed, constructed, and screened to target ~40,000 mutations across 30 genes. These libraries included global and high-level regulators that regulate global gene expression, transcription factors that play important roles in genome-level transcription, enzymes that function in the sugar transport system, NAD(P)Hmore » metabolism, and the aldehyde reduction system. Specific mutants that conferred increased growth in mixed sugars and hydrolysate tolerance conditions were isolated, confirmed, and evaluated for changes in genome-wide expression levels. As a result, we tested the strain with positive combinatorial mutations for 3-hydroxypropionic acid (3HP) production under high furfural and high acetate hydrolysate fermentation, which demonstrated a 7- and 8-fold increase in 3HP productivity relative to the parent strain, respectively.« less
NASA Astrophysics Data System (ADS)
Coccini, Teresa; Fabbri, Marco; Roda, Elisa; Grazia Sacco, Maria; Manzo, Luigi; Gribaldo, Laura
2011-07-01
Silica nanoparticles (NPs) incorporating cadmium (Cd) have been developed for a range of potential application including drug delivery devices. Occupational Cd inhalation has been associated with emphysema, pulmonary fibrosis and lung tumours. Mechanistically, Cd can induce oxidative stress and mediate cell-signalling pathways that are involved in inflammation.This in vivo study aimed at investigating pulmonary molecular effects of NPs doped with Cd (NP-Cd, 1 mg/animal) compared to soluble CdCl2 (400 μg/animal), in Sprague Dawley rats treated intra-tracheally, 7 and 30 days after administration. NPs of silica containing Cd salt were prepared starting from commercial nano-size silica powder (HiSil™ T700 Degussa) with average pore size of 20 nm and surface area of 240 m2/g. Toxicogenomic analysis was performed by the DNA microarray technology (using Agilent Whole Rat Genome Microarray 4×44K) to evaluate changes in gene expression of the entire genome. These findings indicate that the whole genome analysis may represent a valuable approach to assess the whole spectrum of biological responses to cadmium containing nanomaterials.
The spotted gar genome illuminates vertebrate evolution and facilitates human-teleost comparisons.
Braasch, Ingo; Gehrke, Andrew R; Smith, Jeramiah J; Kawasaki, Kazuhiko; Manousaki, Tereza; Pasquier, Jeremy; Amores, Angel; Desvignes, Thomas; Batzel, Peter; Catchen, Julian; Berlin, Aaron M; Campbell, Michael S; Barrell, Daniel; Martin, Kyle J; Mulley, John F; Ravi, Vydianathan; Lee, Alison P; Nakamura, Tetsuya; Chalopin, Domitille; Fan, Shaohua; Wcisel, Dustin; Cañestro, Cristian; Sydes, Jason; Beaudry, Felix E G; Sun, Yi; Hertel, Jana; Beam, Michael J; Fasold, Mario; Ishiyama, Mikio; Johnson, Jeremy; Kehr, Steffi; Lara, Marcia; Letaw, John H; Litman, Gary W; Litman, Ronda T; Mikami, Masato; Ota, Tatsuya; Saha, Nil Ratan; Williams, Louise; Stadler, Peter F; Wang, Han; Taylor, John S; Fontenot, Quenton; Ferrara, Allyse; Searle, Stephen M J; Aken, Bronwen; Yandell, Mark; Schneider, Igor; Yoder, Jeffrey A; Volff, Jean-Nicolas; Meyer, Axel; Amemiya, Chris T; Venkatesh, Byrappa; Holland, Peter W H; Guiguen, Yann; Bobe, Julien; Shubin, Neil H; Di Palma, Federica; Alföldi, Jessica; Lindblad-Toh, Kerstin; Postlethwait, John H
2016-04-01
To connect human biology to fish biomedical models, we sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before teleost genome duplication (TGD). The slowly evolving gar genome has conserved in content and size many entire chromosomes from bony vertebrate ancestors. Gar bridges teleosts to tetrapods by illuminating the evolution of immunity, mineralization and development (mediated, for example, by Hox, ParaHox and microRNA genes). Numerous conserved noncoding elements (CNEs; often cis regulatory) undetectable in direct human-teleost comparisons become apparent using gar: functional studies uncovered conserved roles for such cryptic CNEs, facilitating annotation of sequences identified in human genome-wide association studies. Transcriptomic analyses showed that the sums of expression domains and expression levels for duplicated teleost genes often approximate the patterns and levels of expression for gar genes, consistent with subfunctionalization. The gar genome provides a resource for understanding evolution after genome duplication, the origin of vertebrate genomes and the function of human regulatory sequences.
Bluvshteĭn, G A; Lysenko, V G; Zakharova, N B; Kitaev, I V
2012-01-01
The objective of this study is to develop a marker panel of abnormalities of immune regulatory systems and cell genome in patients with gastric cancer for assessment of efficacy of surgical treatment in combination with pharmaco-nutritional therapy in early postoperative period. Expression of p53, sFAS, FASL, CEA, and CA 19-9, nutritial status, as well as incidence of purulent-septic complications in early postoperative period were determined in 40 patients with gastric cancer (21 patients--the test group and 19 patients--the comparative group) prior to a curative surgical intervention, postoperative days 1 and 7, while the pharmaco-nutritional therapy (the test group) or partial parenteral nutrition (the comparative group) has been performed. The pharmaco-nutritional therapy significantly decreases in activity of tumor suppressing genome, lymphocyte apoptosis, expression of oncology-associated markers, incidence of purulent-septic complications, as well as exacerbation risk for hypotrophy in patients with gastric cancer in early postoperative period. To assess the efficacy of the surgical intervention performed in patients with gastric cancer, an expression of mutant p53 and markers of lymphocyte apoptosis (sFAS/FASL) is reasonable to be evaluated together with determination of oncomarkers (CEA and CA 19-9).
Building a genome analysis pipeline to predict disease risk and prevent disease.
Bromberg, Y
2013-11-01
Reduced costs and increased speed and accuracy of sequencing can bring the genome-based evaluation of individual disease risk to the bedside. While past efforts have identified a number of actionable mutations, the bulk of genetic risk remains hidden in sequence data. The biggest challenge facing genomic medicine today is the development of new techniques to predict the specifics of a given human phenome (set of all expressed phenotypes) encoded by each individual variome (full set of genome variants) in the context of the given environment. Numerous tools exist for the computational identification of the functional effects of a single variant. However, the pipelines taking advantage of full genomic, exomic, transcriptomic (and other) sequences have only recently become a reality. This review looks at the building of methodologies for predicting "variome"-defined disease risk. It also discusses some of the challenges for incorporating such a pipeline into everyday medical practice. © 2013. Published by Elsevier Ltd. All rights reserved.
A Genomic Score Prognostic of Outcome in Trauma Patients
Warren, H Shaw; Elson, Constance M; Hayden, Douglas L; Schoenfeld, David A; Cobb, J Perren; Maier, Ronald V; Moldawer, Lyle L; Moore, Ernest E; Harbrecht, Brian G; Pelak, Kimberly; Cuschieri, Joseph; Herndon, David N; Jeschke, Marc G; Finnerty, Celeste C; Brownstein, Bernard H; Hennessy, Laura; Mason, Philip H; Tompkins, Ronald G
2009-01-01
Traumatic injuries frequently lead to infection, organ failure, and death. Health care providers rely on several injury scoring systems to quantify the extent of injury and to help predict clinical outcome. Physiological, anatomical, and clinical laboratory analytic scoring systems (Acute Physiology and Chronic Health Evaluation [APACHE], Injury Severity Score [ISS]) are utilized, with limited success, to predict outcome following injury. The recent development of techniques for measuring the expression level of all of a person’s genes simultaneously may make it possible to develop an injury scoring system based on the degree of gene activation. We hypothesized that a peripheral blood leukocyte gene expression score could predict outcome, including multiple organ failure, following severe blunt trauma. To test such a scoring system, we measured gene expression of peripheral blood leukocytes from patients within 12 h of traumatic injury. cRNA derived from whole blood leukocytes obtained within 12 h of injury provided gene expression data for the entire genome that were used to create a composite gene expression score for each patient. Total blood leukocytes were chosen because they are active during inflammation, which is reflective of poor outcome. The gene expression score combines the activation levels of all the genes into a single number which compares the patient’s gene expression to the average gene expression in uninjured volunteers. Expression profiles from healthy volunteers were averaged to create a reference gene expression profile which was used to compute a difference from reference (DFR) score for each patient. This score described the overall genomic response of patients within the first 12 h following severe blunt trauma. Regression models were used to compare the association of the DFR, APACHE, and ISS scores with outcome. We hypothesized that patients with a total gene response more different from uninjured volunteers would tend to have poorer outcome than those more similar. Our data show that for measures of poor outcome, such as infections, organ failures, and length of hospital stay, this is correct. DFR scores were associated significantly with adverse outcome, including multiple organ failure, duration of ventilation, length of hospital stay, and infection rate. The association remained significant after adjustment for injury severity as measured by APACHE or ISS. A single score representing changes in gene expression in peripheral blood leukocytes within hours of severe blunt injury is associated with adverse clinical outcomes that develop later in the hospital course. Assessment of genome-wide gene expression provides useful clinical information that is different from that provided by currently utilized anatomic or physiologic scores. PMID:19593405
Potential Role of microRNAs in Cardiovascular Disease: Are They up to Their Hype?
Duggal, Bhanu; Gupta, Manveen K; Naga Prasad, Sathyamangla V
Cardiovascular diseases remain the foremost cause of mortality globally. As molecular medicine unravels the alterations in genomic expression and regulation of the underlying atherosclerotic process, it opens new vistas for discovering novel diagnostic biomarkers and therapeutics for limiting the disease process. miRNAs have emerged as powerful regulators of protein translation by regulating gene expression at the post-transcriptional level. Overexpression and under-expression of specific miRNAs are being evaluated as a novel approach to diagnosis and treatment of cardiovascular disease. This review sheds light on the current knowledge of the miRNA evaluated in cardiovascular disease. In this review we summarize the data, including the more recent data, regarding miRNAs in cardiovascular disease and their potential role in future in diagnostic and therapeutic strategies.
Neighboring Genes Show Correlated Evolution in Gene Expression
Ghanbarian, Avazeh T.; Hurst, Laurence D.
2015-01-01
When considering the evolution of a gene’s expression profile, we commonly assume that this is unaffected by its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between neighboring genes in gene expression profiles in extant taxa. Indeed, in all eukaryotic genomes genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their expression or is gene expression evolution autonomous? To address this here we consider evolution of human gene expression since the human-chimp common ancestor, allowing for both variation in estimation of current expression level and error in Bayesian estimation of the ancestral state. We find that in all tissues and both sexes, the change in gene expression of a focal gene on average predicts the change in gene expression of neighbors. The effect is highly pronounced in the immediate vicinity (<100 kb) but extends much further. Sex-specific expression change is also genomically clustered. As genes increasing their expression in humans tend to avoid nuclear lamina domains and be enriched for the gene activator 5-hydroxymethylcytosine, we conclude that, most probably owing to chromatin level control of gene expression, a change in gene expression of one gene likely affects the expression evolution of neighbors, what we term expression piggybacking, an analog of hitchhiking. PMID:25743543
Naval-Sanchez, Marina; Nguyen, Quan; McWilliam, Sean; Porto-Neto, Laercio R; Tellam, Ross; Vuocolo, Tony; Reverter, Antonio; Perez-Enciso, Miguel; Brauning, Rudiger; Clarke, Shannon; McCulloch, Alan; Zamani, Wahid; Naderi, Saeid; Rezaei, Hamid Reza; Pompanon, Francois; Taberlet, Pierre; Worley, Kim C; Gibbs, Richard A; Muzny, Donna M; Jhangiani, Shalini N; Cockett, Noelle; Daetwyler, Hans; Kijas, James
2018-02-28
Domestication fundamentally reshaped animal morphology, physiology and behaviour, offering the opportunity to investigate the molecular processes driving evolutionary change. Here we assess sheep domestication and artificial selection by comparing genome sequence from 43 modern breeds (Ovis aries) and their Asian mouflon ancestor (O. orientalis) to identify selection sweeps. Next, we provide a comparative functional annotation of the sheep genome, validated using experimental ChIP-Seq of sheep tissue. Using these annotations, we evaluate the impact of selection and domestication on regulatory sequences and find that sweeps are significantly enriched for protein coding genes, proximal regulatory elements of genes and genome features associated with active transcription. Finally, we find individual sites displaying strong allele frequency divergence are enriched for the same regulatory features. Our data demonstrate that remodelling of gene expression is likely to have been one of the evolutionary forces that drove phenotypic diversification of this common livestock species.
USDA-ARS?s Scientific Manuscript database
Apis mellifera syriaca is the native honeybee subspecies of Jordan and much of the Levant Region. It expresses behavioral adaptations to a regional climate with very high temperatures, nectar dearth in summer, attacks of the Oriental wasp and is resistant to Varroa mites. The A. m. syriaca control r...
Exposure to certain phthalate esters (PEs) during sexual differentiation induces reproductive tract malformations in male rats due to reductions in fetal testicular testosterone (T) production and expression of steroidogenesis-and insl3-related genes. In the current study, we use...
Tsuchiya, Masa; Giuliani, Alessandro; Hashimoto, Midori; Erenpreisa, Jekaterina; Yoshikawa, Kenichi
2015-01-01
Background The underlying mechanism of dynamic control of the genome-wide expression is a fundamental issue in bioscience. We addressed it in terms of phase transition by a systemic approach based on both density analysis and characteristics of temporal fluctuation for the time-course mRNA expression in differentiating MCF-7 breast cancer cells. Methodology In a recent work, we suggested criticality as an essential aspect of dynamic control of genome-wide gene expression. Criticality was evident by a unimodal-bimodal transition through flattened unimodal expression profile. The flatness on the transition suggests the existence of a critical transition at which up- and down-regulated expression is balanced. Mean field (averaging) behavior of mRNAs based on the temporal expression changes reveals a sandpile type of transition in the flattened profile. Furthermore, around the transition, a self-similar unimodal-bimodal transition of the whole expression occurs in the density profile of an ensemble of mRNA expression. These singular and scaling behaviors identify the transition as the expression phase transition driven by self-organized criticality (SOC). Principal Findings Emergent properties of SOC through a mean field approach are revealed: i) SOC, as a form of genomic phase transition, consolidates distinct critical states of expression, ii) Coupling of coherent stochastic oscillations between critical states on different time-scales gives rise to SOC, and iii) Specific gene clusters (barcode genes) ranging in size from kbp to Mbp reveal similar SOC to genome-wide mRNA expression and ON-OFF synchronization to critical states. This suggests that the cooperative gene regulation of topological genome sub-units is mediated by the coherent phase transitions of megadomain-scaled conformations between compact and swollen chromatin states. Conclusion and Significance In summary, our study provides not only a systemic method to demonstrate SOC in whole-genome expression, but also introduces novel, physically grounded concepts for a breakthrough in the study of biological regulation. PMID:26067993
Fügi, Matthias A; Gunasekera, Kapila; Ochsenreiter, Torsten; Guan, Xueli; Wenk, Markus R; Mäser, Pascal
2014-05-01
Sterols are an essential class of lipids in eukaryotes, where they serve as structural components of membranes and play important roles as signaling molecules. Sterols are also of high pharmacological significance: cholesterol-lowering drugs are blockbusters in human health, and inhibitors of ergosterol biosynthesis are widely used as antifungals. Inhibitors of ergosterol synthesis are also being developed for Chagas's disease, caused by Trypanosoma cruzi. Here we develop an in silico pipeline to globally evaluate sterol metabolism and perform comparative genomics. We generate a library of hidden Markov model-based profiles for 42 sterol biosynthetic enzymes, which allows expressing the genomic makeup of a given species as a numerical vector. Hierarchical clustering of these vectors functionally groups eukaryote proteomes and reveals convergent evolution, in particular metabolic reduction in obligate endoparasites. We experimentally explore sterol metabolism by testing a set of sterol biosynthesis inhibitors against trypanosomatids, Plasmodium falciparum, Giardia, and mammalian cells, and by quantifying the expression levels of sterol biosynthetic genes during the different life stages of T. cruzi and Trypanosoma brucei. The phenotypic data correlate with genomic makeup for simvastatin, which showed activity against trypanosomatids. Other findings, such as the activity of terbinafine against Giardia, are not in agreement with the genotypic profile.
Fügi, Matthias A.; Gunasekera, Kapila; Ochsenreiter, Torsten; Guan, Xueli; Wenk, Markus R.; Mäser, Pascal
2014-01-01
Sterols are an essential class of lipids in eukaryotes, where they serve as structural components of membranes and play important roles as signaling molecules. Sterols are also of high pharmacological significance: cholesterol-lowering drugs are blockbusters in human health, and inhibitors of ergosterol biosynthesis are widely used as antifungals. Inhibitors of ergosterol synthesis are also being developed for Chagas’s disease, caused by Trypanosoma cruzi. Here we develop an in silico pipeline to globally evaluate sterol metabolism and perform comparative genomics. We generate a library of hidden Markov model-based profiles for 42 sterol biosynthetic enzymes, which allows expressing the genomic makeup of a given species as a numerical vector. Hierarchical clustering of these vectors functionally groups eukaryote proteomes and reveals convergent evolution, in particular metabolic reduction in obligate endoparasites. We experimentally explore sterol metabolism by testing a set of sterol biosynthesis inhibitors against trypanosomatids, Plasmodium falciparum, Giardia, and mammalian cells, and by quantifying the expression levels of sterol biosynthetic genes during the different life stages of T. cruzi and Trypanosoma brucei. The phenotypic data correlate with genomic makeup for simvastatin, which showed activity against trypanosomatids. Other findings, such as the activity of terbinafine against Giardia, are not in agreement with the genotypic profile. PMID:24627128
With the advent of sequence information for entire eukaryotic genomes, it is now possible to analyze gene expression on a genomic scale. The primary tool for genomic analysis of gene expression is the gene microarray. We have used commercially available and custom cDNA microarray...
Angstadt, Andrea Y; Motsinger-Reif, Alison; Thomas, Rachael; Kisseberth, William C; Guillermo Couto, C; Duval, Dawn L; Nielsen, Dahlia M; Modiano, Jaime F; Breen, Matthew
2011-11-01
Osteosarcoma (OS) is the most commonly diagnosed malignant bone tumor in humans and dogs, characterized in both species by extremely complex karyotypes exhibiting high frequencies of genomic imbalance. Evaluation of genomic signatures in human OS using array comparative genomic hybridization (aCGH) has assisted in uncovering genetic mechanisms that result in disease phenotype. Previous low-resolution (10-20 Mb) aCGH analysis of canine OS identified a wide range of recurrent DNA copy number aberrations, indicating extensive genomic instability. In this study, we profiled 123 canine OS tumors by 1 Mb-resolution aCGH to generate a dataset for direct comparison with current data for human OS, concluding that several high frequency aberrations in canine and human OS are orthologous. To ensure complete coverage of gene annotation, we identified the human refseq genes that map to these orthologous aberrant dog regions and found several candidate genes warranting evaluation for OS involvement. Specifically, subsequenct FISH and qRT-PCR analysis of RUNX2, TUSC3, and PTEN indicated that expression levels correlated with genomic copy number status, showcasing RUNX2 as an OS associated gene and TUSC3 as a possible tumor suppressor candidate. Together these data demonstrate the ability of genomic comparative oncology to identify genetic abberations which may be important for OS progression. Large scale screening of genomic imbalance in canine OS further validates the use of the dog as a suitable model for human cancers, supporting the idea that dysregulation discovered in canine cancers will provide an avenue for complementary study in human counterparts. Copyright © 2011 Wiley-Liss, Inc.
The Eucalyptus terpene synthase gene family.
Külheim, Carsten; Padovan, Amanda; Hefer, Charles; Krause, Sandra T; Köllner, Tobias G; Myburg, Alexander A; Degenhardt, Jörg; Foley, William J
2015-06-11
Terpenoids are abundant in the foliage of Eucalyptus, providing the characteristic smell as well as being valuable economically and influencing ecological interactions. Quantitative and qualitative inter- and intra- specific variation of terpenes is common in eucalypts. The genome sequences of Eucalyptus grandis and E. globulus were mined for terpene synthase genes (TPS) and compared to other plant species. We investigated the relative expression of TPS in seven plant tissues and functionally characterized five TPS genes from E. grandis. Compared to other sequenced plant genomes, Eucalyptus grandis has the largest number of putative functional TPS genes of any sequenced plant. We discovered 113 and 106 putative functional TPS genes in E. grandis and E. globulus, respectively. All but one TPS from E. grandis were expressed in at least one of seven plant tissues examined. Genomic clusters of up to 20 genes were identified. Many TPS are expressed in tissues other than leaves which invites a re-evaluation of the function of terpenes in Eucalyptus. Our data indicate that terpenes in Eucalyptus may play a wider role in biotic and abiotic interactions than previously thought. Tissue specific expression is common and the possibility of stress induction needs further investigation. Phylogenetic comparison of the two investigated Eucalyptus species gives insight about recent evolution of different clades within the TPS gene family. While the majority of TPS genes occur in orthologous pairs some clades show evidence of recent gene duplication, as well as loss of function.
Tae, Donghyun; Seok, Junhee
2018-05-29
In this paper, we introduce multiple-matching Evidence-based Translator (mEBT) to discover genomic responses from murine expression data for human immune studies, which are significant in the given condition of mice and likely have similar responses in the corresponding condition of human. mEBT is evaluated over multiple data sets and shows improved inter-species agreement. mEBT is expected to be useful for research groups who use murine models to study human immunity. http://cdal.korea.ac.kr/mebt/. jseok14@korea.ac.kr. Supplementary data are available at Bioinformatics online.
Comparison of CRISPR/Cas9 expression constructs for efficient targeted mutagenesis in rice.
Mikami, Masafumi; Toki, Seiichi; Endo, Masaki
2015-08-01
The CRISPR/Cas9 system is an efficient tool used for genome editing in a variety of organisms. Despite several recent reports of successful targeted mutagenesis using the CRISPR/Cas9 system in plants, in each case the target gene of interest, the Cas9 expression system and guide-RNA (gRNA) used, and the tissues used for transformation and subsequent mutagenesis differed, hence the reported frequencies of targeted mutagenesis cannot be compared directly. Here, we evaluated mutation frequency in rice using different Cas9 and/or gRNA expression cassettes under standardized experimental conditions. We introduced Cas9 and gRNA expression cassettes separately or sequentially into rice calli, and assessed the frequency of mutagenesis at the same endogenous targeted sequences. Mutation frequencies differed significantly depending on the Cas9 expression cassette used. In addition, a gRNA driven by the OsU6 promoter was superior to one driven by the OsU3 promoter. Using an all-in-one expression vector harboring the best combined Cas9/gRNA expression cassette resulted in a much improved frequency of targeted mutagenesis in rice calli, and bi-allelic mutant plants were produced in the T0 generation. The approach presented here could be adapted to optimize the construction of Cas9/gRNA cassettes for genome editing in a variety of plants.
The Genomic and Transcriptomic Landscape of a HeLa Cell Line
Landry, Jonathan J. M.; Pyl, Paul Theodor; Rausch, Tobias; Zichner, Thomas; Tekkedil, Manu M.; Stütz, Adrian M.; Jauch, Anna; Aiyar, Raeka S.; Pau, Gregoire; Delhomme, Nicolas; Gagneur, Julien; Korbel, Jan O.; Huber, Wolfgang; Steinmetz, Lars M.
2013-01-01
HeLa is the most widely used model cell line for studying human cellular and molecular biology. To date, no genomic reference for this cell line has been released, and experiments have relied on the human reference genome. Effective design and interpretation of molecular genetic studies performed using HeLa cells require accurate genomic information. Here we present a detailed genomic and transcriptomic characterization of a HeLa cell line. We performed DNA and RNA sequencing of a HeLa Kyoto cell line and analyzed its mutational portfolio and gene expression profile. Segmentation of the genome according to copy number revealed a remarkably high level of aneuploidy and numerous large structural variants at unprecedented resolution. Some of the extensive genomic rearrangements are indicative of catastrophic chromosome shattering, known as chromothripsis. Our analysis of the HeLa gene expression profile revealed that several pathways, including cell cycle and DNA repair, exhibit significantly different expression patterns from those in normal human tissues. Our results provide the first detailed account of genomic variants in the HeLa genome, yielding insight into their impact on gene expression and cellular function as well as their origins. This study underscores the importance of accounting for the strikingly aberrant characteristics of HeLa cells when designing and interpreting experiments, and has implications for the use of HeLa as a model of human biology. PMID:23550136
Lux, M P; Nabieva, N; Hildebrandt, T; Rebscher, H; Kümmel, S; Blohmer, J-U; Schrauder, M G
2018-02-01
Many women with early-stage, hormone receptor-positive breast cancer may not benefit from adjuvant chemotherapy. Gene expression tests can reduce chemotherapy over- and undertreatment by providing prognostic information on the likelihood of recurrence and, with Oncotype DX, predictive information on chemotherapy benefit. These tests are currently not reimbursed by German healthcare payers. An analysis was conducted to evaluate the budget impact of gene expression tests in Germany. Costs of gene expression tests and medical and non-medical costs associated with treatment were assessed from healthcare payer and societal perspectives. Costs were estimated from data collected at a university hospital and were combined with decision impact data for Oncotype DX, MammaPrint, Prosigna and EndoPredict (EPclin). Changes in chemotherapy use and budget impact were evaluated over 1 year for 20,000 women. Chemotherapy was associated with substantial annual costs of EUR 19,003 and EUR 84,412 per therapy from the healthcare payer and societal perspective, respectively. Compared with standard care, only Oncotype DX was associated with cost savings to healthcare payers and society (EUR 5.9 million and EUR 253 million, respectively). Scenario analysis showed that both women at high clinical but low genomic risk and low clinical but high genomic risk were important contributors to costs. Oncotype DX was the only gene expression test that was estimated to reduce costs versus standard care in Germany. The reimbursement of Oncotype DX testing in standard clinical practice in Germany should be considered. Copyright © 2017 Elsevier Ltd. All rights reserved.
Cis-regulatory somatic mutations and gene-expression alteration in B-cell lymphomas.
Mathelier, Anthony; Lefebvre, Calvin; Zhang, Allen W; Arenillas, David J; Ding, Jiarui; Wasserman, Wyeth W; Shah, Sohrab P
2015-04-23
With the rapid increase of whole-genome sequencing of human cancers, an important opportunity to analyze and characterize somatic mutations lying within cis-regulatory regions has emerged. A focus on protein-coding regions to identify nonsense or missense mutations disruptive to protein structure and/or function has led to important insights; however, the impact on gene expression of mutations lying within cis-regulatory regions remains under-explored. We analyzed somatic mutations from 84 matched tumor-normal whole genomes from B-cell lymphomas with accompanying gene expression measurements to elucidate the extent to which these cancers are disrupted by cis-regulatory mutations. We characterize mutations overlapping a high quality set of well-annotated transcription factor binding sites (TFBSs), covering a similar portion of the genome as protein-coding exons. Our results indicate that cis-regulatory mutations overlapping predicted TFBSs are enriched in promoter regions of genes involved in apoptosis or growth/proliferation. By integrating gene expression data with mutation data, our computational approach culminates with identification of cis-regulatory mutations most likely to participate in dysregulation of the gene expression program. The impact can be measured along with protein-coding mutations to highlight key mutations disrupting gene expression and pathways in cancer. Our study yields specific genes with disrupted expression triggered by genomic mutations in either the coding or the regulatory space. It implies that mutated regulatory components of the genome contribute substantially to cancer pathways. Our analyses demonstrate that identifying genomically altered cis-regulatory elements coupled with analysis of gene expression data will augment biological interpretation of mutational landscapes of cancers.
Ushijima, Masaru; Mashima, Tetsuo; Tomida, Akihiro; Dan, Shingo; Saito, Sakae; Furuno, Aki; Tsukahara, Satomi; Seimiya, Hiroyuki; Yamori, Takao; Matsuura, Masaaki
2013-03-01
Genome-wide transcriptional expression analysis is a powerful strategy for characterizing the biological activity of anticancer compounds. It is often instructive to identify gene sets involved in the activity of a given drug compound for comparison with different compounds. Currently, however, there is no comprehensive gene expression database and related application system that is; (i) specialized in anticancer agents; (ii) easy to use; and (iii) open to the public. To develop a public gene expression database of antitumor agents, we first examined gene expression profiles in human cancer cells after exposure to 35 compounds including 25 clinically used anticancer agents. Gene signatures were extracted that were classified as upregulated or downregulated after exposure to the drug. Hierarchical clustering showed that drugs with similar mechanisms of action, such as genotoxic drugs, were clustered. Connectivity map analysis further revealed that our gene signature data reflected modes of action of the respective agents. Together with the database, we developed analysis programs that calculate scores for ranking changes in gene expression and for searching statistically significant pathways from the Kyoto Encyclopedia of Genes and Genomes database in order to analyze the datasets more easily. Our database and the analysis programs are available online at our website (http://scads.jfcr.or.jp/db/cs/). Using these systems, we successfully showed that proteasome inhibitors are selectively classified as endoplasmic reticulum stress inducers and induce atypical endoplasmic reticulum stress. Thus, our public access database and related analysis programs constitute a set of efficient tools to evaluate the mode of action of novel compounds and identify promising anticancer lead compounds. © 2012 Japanese Cancer Association.
Cox, Murray P; Dong, Ting; Shen, Genggeng; Dalvi, Yogesh; Scott, D Barry; Ganley, Austen R D
2014-03-01
Polyploidy, a state in which the chromosome complement has undergone an increase, is a major force in evolution. Understanding the consequences of polyploidy has received much attention, and allopolyploids, which result from the union of two different parental genomes, are of particular interest because they must overcome a suite of biological responses to this merger, known as "genome shock." A key question is what happens to gene expression of the two gene copies following allopolyploidization, but until recently the tools to answer this question on a genome-wide basis were lacking. Here we utilize high throughput transcriptome sequencing to produce the first genome-wide picture of gene expression response to allopolyploidy in fungi. A novel pipeline for assigning sequence reads to the gene copies was used to quantify their expression in a fungal allopolyploid. We find that the transcriptional response to allopolyploidy is predominantly conservative: both copies of most genes are retained; over half the genes inherit parental gene expression patterns; and parental differential expression is often lost in the allopolyploid. Strikingly, the patterns of gene expression change are highly concordant with the genome-wide expression results of a cotton allopolyploid. The very different nature of these two allopolyploids implies a conserved, eukaryote-wide transcriptional response to genome merger. We provide evidence that the transcriptional responses we observe are mostly driven by intrinsic differences between the regulatory systems in the parent species, and from this propose a mechanistic model in which the cross-kingdom conservation in transcriptional response reflects conservation of the mutational processes underlying eukaryotic gene regulatory evolution. This work provides a platform to develop a universal understanding of gene expression response to allopolyploidy and suggests that allopolyploids are an exceptional system to investigate gene regulatory changes that have evolved in the parental species prior to allopolyploidization.
HIV promoter integration site primarily modulates transcriptional burst size rather than frequency.
Skupsky, Ron; Burnett, John C; Foley, Jonathan E; Schaffer, David V; Arkin, Adam P
2010-09-30
Mammalian gene expression patterns, and their variability across populations of cells, are regulated by factors specific to each gene in concert with its surrounding cellular and genomic environment. Lentiviruses such as HIV integrate their genomes into semi-random genomic locations in the cells they infect, and the resulting viral gene expression provides a natural system to dissect the contributions of genomic environment to transcriptional regulation. Previously, we showed that expression heterogeneity and its modulation by specific host factors at HIV integration sites are key determinants of infected-cell fate and a possible source of latent infections. Here, we assess the integration context dependence of expression heterogeneity from diverse single integrations of a HIV-promoter/GFP-reporter cassette in Jurkat T-cells. Systematically fitting a stochastic model of gene expression to our data reveals an underlying transcriptional dynamic, by which multiple transcripts are produced during short, infrequent bursts, that quantitatively accounts for the wide, highly skewed protein expression distributions observed in each of our clonal cell populations. Interestingly, we find that the size of transcriptional bursts is the primary systematic covariate over integration sites, varying from a few to tens of transcripts across integration sites, and correlating well with mean expression. In contrast, burst frequencies are scattered about a typical value of several per cell-division time and demonstrate little correlation with the clonal means. This pattern of modulation generates consistently noisy distributions over the sampled integration positions, with large expression variability relative to the mean maintained even for the most productive integrations, and could contribute to specifying heterogeneous, integration-site-dependent viral production patterns in HIV-infected cells. Genomic environment thus emerges as a significant control parameter for gene expression variation that may contribute to structuring mammalian genomes, as well as be exploited for survival by integrating viruses.
Ni, Haifeng; Zhou, Zhen; Jiang, Bo; Yuan, Xiaoyang; Cao, Xiaolin; Huang, Guangwu; Li, Yong
2017-03-01
This study aimed to investigate the inactivation of the parkin gene by promoter methylation and its relationship with genome instability in nasopharyngeal carcinoma. Parkin was considered as a tumor suppressor gene in various types of cancers. However, its role in nasopharyngeal carcinoma is unexplored. Genomic instabilities were detected in nasopharyngeal carcinoma tissues by the random amplified polymorphic DNA. The methylation-specific polymerase chain reaction, semi-quantitative reverse transcription polymerase chain reaction, and immunohistochemical analysis were used to detect methylation and mRNA and protein expression of parkin in 54 cases of nasopharyngeal carcinoma tissues and 16 cases of normal nasopharyngeal epithelia tissues, and in 5 nasopharyngeal carcinoma cell lines (CNE1, CNE2, TWO3, C666, and HONE1) and 1 normal nasopharyngeal epithelia cell line (NP69). mRNA expression of parkin in CNE1 and CNE2 was analyzed before and after methyltransferase inhibitor 5-aza-2-deoxycytidine treatment. The relationship between promoter methylation and mRNA expression, demethylation and mRNA expression, and mRNA and protein expression of the gene and clinical factors and genomic instabilities were analyzed. The mRNA and protein expression levels were significantly reduced in 54 cases of human nasopharyngeal carcinoma compared with 16 cases of normal nasopharyngeal epithelia. Parkin-methylated cases showed significantly lower mRNA and protein expression levels compared with unmethylated cases. After 5-aza-2-deoxycytidine treatment, parkin mRNA expression was restored in CNE1 and CNE2; 92.59% (50/54) of nasopharyngeal carcinoma demonstrated genomic instability. Parkin is frequently inactivated by promoter methylation, and its mRNA and protein expression correlate with lymph node metastasis and genomic instability. Parkin deficiency probably promotes tumorigenesis in nasopharyngeal carcinoma.
Lipinski, Kamil A; Kaniak-Golik, Aneta; Golik, Pawel
2010-01-01
As a legacy of their endosymbiotic eubacterial origin, mitochondria possess a residual genome, encoding only a few proteins and dependent on a variety of factors encoded by the nuclear genome for its maintenance and expression. As a facultative anaerobe with well understood genetics and molecular biology, Saccharomyces cerevisiae is the model system of choice for studying nucleo-mitochondrial genetic interactions. Maintenance of the mitochondrial genome is controlled by a set of nuclear-coded factors forming intricately interconnected circuits responsible for replication, recombination, repair and transmission to buds. Expression of the yeast mitochondrial genome is regulated mostly at the post-transcriptional level, and involves many general and gene-specific factors regulating splicing, RNA processing and stability and translation. A very interesting aspect of the yeast mitochondrial system is the relationship between genome maintenance and gene expression. Deletions of genes involved in many different aspects of mitochondrial gene expression, notably translation, result in an irreversible loss of functional mtDNA. The mitochondrial genetic system viewed from the systems biology perspective is therefore very fragile and lacks robustness compared to the remaining systems of the cell. This lack of robustness could be a legacy of the reductive evolution of the mitochondrial genome, but explanations involving selective advantages of increased evolvability have also been postulated. Copyright © 2009 Elsevier B.V. All rights reserved.
Ferreira, Joshua P; Peacock, Ryan W S; Lawhorn, Ingrid E B; Wang, Clifford L
2011-12-01
The human cytomegalovirus and elongation factor 1α promoters are constitutive promoters commonly employed by mammalian expression vectors. These promoters generally produce high levels of expression in many types of cells and tissues. To generate a library of synthetic promoters capable of generating a range of low, intermediate, and high expression levels, the TATA and CAAT box elements of these promoters were mutated. Other promoter variants were also generated by random mutagenesis. Evaluation using plasmid vectors integrated at a single site in the genome revealed that these various synthetic promoters were capable of expression levels spanning a 40-fold range. Retroviral vectors were equipped with the synthetic promoters and evaluated for their ability to reproduce the graded expression demonstrated by plasmid integration. A vector with a self-inactivating long terminal repeat could neither reproduce the full range of expression levels nor produce stable expression. Using a second vector design, the different synthetic promoters enabled stable expression over a broad range of expression levels in different cell lines. The online version of this article (doi:10.1007/s11693-011-9089-0) contains supplementary material, which is available to authorized users.
Methods for Genome-Wide Analysis of Gene Expression Changes in Polyploids
Wang, Jianlin; Lee, Jinsuk J.; Tian, Lu; Lee, Hyeon-Se; Chen, Meng; Rao, Sheetal; Wei, Edward N.; Doerge, R. W.; Comai, Luca; Jeffrey Chen, Z.
2007-01-01
Polyploidy is an evolutionary innovation, providing extra sets of genetic material for phenotypic variation and adaptation. It is predicted that changes of gene expression by genetic and epigenetic mechanisms are responsible for novel variation in nascent and established polyploids (Liu and Wendel, 2002; Osborn et al., 2003; Pikaard, 2001). Studying gene expression changes in allopolyploids is more complicated than in autopolyploids, because allopolyploids contain more than two sets of genomes originating from divergent, but related, species. Here we describe two methods that are applicable to the genome-wide analysis of gene expression differences resulting from genome duplication in autopolyploids or interactions between homoeologous genomes in allopolyploids. First, we describe an amplified fragment length polymorphism (AFLP)–complementary DNA (cDNA) display method that allows the discrimination of homoeologous loci based on restriction polymorphisms between the progenitors. Second, we describe microarray analyses that can be used to compare gene expression differences between the allopolyploids and respective progenitors using appropriate experimental design and statistical analysis. We demonstrate the utility of these two complementary methods and discuss the pros and cons of using the methods to analyze gene expression changes in autopolyploids and allopolyploids. Furthermore, we describe these methods in general terms to be of wider applicability for comparative gene expression in a variety of evolutionary, genetic, biological, and physiological contexts. PMID:15865985
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vega, Fernando E.; Brown, Stuart M.; Chen, Hao
The coffee berry borer, Hypothenemus hampei, is the most economically important insect pest of coffee worldwide. We present an analysis of the draft genome of the coffee berry borer, the third genome for a Coleopteran species. The genome size is ca. 163 Mb with 19,222 predicted protein-coding genes. Analysis was focused on genes involved in primary digestion as well as gene families involved in detoxification of plant defense molecules and insecticides, such as carboxylesterases, cytochrome P450, gluthathione S-transferases, ATP-binding cassette transporters, and a gene that confers resistance to the insecticide dieldrin. A broad range of enzymes capable of degrading complexmore » polysaccharides were identified. We also evaluated the pathogen defense system and found homologs to antimicrobial genes reported in the Drosophila genome. Ten cases of horizontal gene transfer were identified with evidence for expression, integration into the H. hampei genome, and phylogenetic evidence that the sequences are more closely related to bacterial rather than eukaryotic genes. We find the draft genome analysis broadly expands our knowledge on the biology of a devastating tropical insect pest and suggests new pest management strategies.« less
Vega, Fernando E.; Brown, Stuart M.; Chen, Hao; Shen, Eric; Nair, Mridul B.; Ceja-Navarro, Javier A.; Brodie, Eoin L.; Infante, Francisco; Dowd, Patrick F.; Pain, Arnab
2015-01-01
The coffee berry borer, Hypothenemus hampei, is the most economically important insect pest of coffee worldwide. We present an analysis of the draft genome of the coffee berry borer, the third genome for a Coleopteran species. The genome size is ca. 163 Mb with 19,222 predicted protein-coding genes. Analysis was focused on genes involved in primary digestion as well as gene families involved in detoxification of plant defense molecules and insecticides, such as carboxylesterases, cytochrome P450, gluthathione S-transferases, ATP-binding cassette transporters, and a gene that confers resistance to the insecticide dieldrin. A broad range of enzymes capable of degrading complex polysaccharides were identified. We also evaluated the pathogen defense system and found homologs to antimicrobial genes reported in the Drosophila genome. Ten cases of horizontal gene transfer were identified with evidence for expression, integration into the H. hampei genome, and phylogenetic evidence that the sequences are more closely related to bacterial rather than eukaryotic genes. The draft genome analysis broadly expands our knowledge on the biology of a devastating tropical insect pest and suggests new pest management strategies. PMID:26228545
Vega, Fernando E.; Brown, Stuart M.; Chen, Hao; ...
2015-07-31
The coffee berry borer, Hypothenemus hampei, is the most economically important insect pest of coffee worldwide. We present an analysis of the draft genome of the coffee berry borer, the third genome for a Coleopteran species. The genome size is ca. 163 Mb with 19,222 predicted protein-coding genes. Analysis was focused on genes involved in primary digestion as well as gene families involved in detoxification of plant defense molecules and insecticides, such as carboxylesterases, cytochrome P450, gluthathione S-transferases, ATP-binding cassette transporters, and a gene that confers resistance to the insecticide dieldrin. A broad range of enzymes capable of degrading complexmore » polysaccharides were identified. We also evaluated the pathogen defense system and found homologs to antimicrobial genes reported in the Drosophila genome. Ten cases of horizontal gene transfer were identified with evidence for expression, integration into the H. hampei genome, and phylogenetic evidence that the sequences are more closely related to bacterial rather than eukaryotic genes. We find the draft genome analysis broadly expands our knowledge on the biology of a devastating tropical insect pest and suggests new pest management strategies.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ya Wang
2010-05-31
The major goal of this study is to determine the effects of the Fhit pathway on low dose ({le} 0.1 Gy) ionizing radiation (IR)-induced genetic instability. Reduction of Fhit protein expression is observed in most solid tumors particularly in those tumors resulting from exposure to environmental carcinogens. Therefore, characterization of the role of the Fhit-dependent pathway in preventing low dose IR-induced genetic instability will provide useful parameters for evaluating the low dose IR-induced risk of mutagenesis and carcinogenesis. We pursued 3 specific aims to study our hypothesis that the Fhit-dependent pathways maintain genomic integrity through adjusting checkpoint response and repairmore » genes expression following low dose IR. Aim 1: Determine whether Fhit interaction with RPA is necessary for Fhit to affect the cellular response to low dose IR. We combined the approaches of in vitro (GST pull-down and site-directed mutagenesis) and in vivo (observing the co-localization and immunoprecipitation of Fhit and RPA in Fhit knock out mouse cells transfected with mutant Fhit which has lost ability to interact with RPA in vitro). Aim 2: Determine the role of genes whose expression is affected by Fhit in low dose irradiated cells. We analyzed the distinct signature of gene expression in low dose irradiated Fhit-/- cells compared with Fhit+/+ cells by combining microarray, gene transfection and siRNA approaches. Aim 3: Determine the role of Fhit in genetic susceptibility to low dose IR in vivo. We compared the gene mutation frequency and the fragile site stability in the cells isolated from the Fhit+/+ and Fhit-/- mice at 1.5 years following low dose IR. These results determine the role of the Fhit-dependent pathway in maintaining genomic integrity in vitro and in vivo, which provide a basis for choosing surrogate markers in the Fhit-dependent pathway to evaluate low dose IR-induced risk of mutagenesis and carcinogenesis.« less
Yang, Hongli; Liu, Jing; Huang, Shunmou; Guo, Tingting; Deng, Linbin; Hua, Wei
2014-03-15
Selection of reference genes in Brassica napus, a tetraploid (4×) species, is a very difficult task without information on genome and transcriptome. By now, only several traditional reference genes which show significant expression differentiation under different conditions are used in B. napus. In the present study, based on genome and transcriptome data of the rapeseed Zhongshuang-11 cultivar, 14 candidate reference genes were screened for investigation in different tissues, cultivars, and treated conditions of B. napus. These genes were as follows: ELF5, ENTH, F-BOX7, F-BOX2, FYPP1, GDI1, GYF, MCP2d, OTP80, PPR, SPOC, Unknown1, Unknown2 and UBA. Among them, excluding GYF and FYPP1, another 12 genes, were identified to perform better than traditional reference genes ACTIN7 and GAPDH. To further validate the accuracy of the newly developed reference genes in normalization, expression levels of BnCAT1 (B. napus catalase 1) in different rapeseed tissues and seedlings under stress conditions were normalized by the three most stable reference genes PPR, GDI1, and ENTH and little difference existed in normalization results. To the best of our knowledge, this is the first time B. napus reference genes have been provided with the help of complete genome and transcriptome information. The new reference genes provided in this study are more accurate than previously reported reference genes in quantifying expression levels of B. napus genes. Crown Copyright © 2014. Published by Elsevier B.V. All rights reserved.
2007-05-01
Benign and Malignant Nerve Sheath Tumors in Neurofibromatosis Patients PRINCIPAL INVESTIGATOR: Matt van de Rijn, M.D., Ph.D. Torsten...Annual 3. DATES COVERED 1 May 2006 –30 Apr 2007 4. TITLE AND SUBTITLE 5a. CONTRACT NUMBER Genomic and Expression Profiling of Benign and Malignant Nerve...Award Number: DAMD17-03-1-0297 Title: Genomic and Expression Profiling of Benign and Malignant Nerve Sheath Tumors in Neurofibromatosis
Wood, David L. A.; Nones, Katia; Steptoe, Anita; Christ, Angelika; Harliwong, Ivon; Newell, Felicity; Bruxner, Timothy J. C.; Miller, David; Cloonan, Nicole; Grimmond, Sean M.
2015-01-01
Genetic variation modulates gene expression transcriptionally or post-transcriptionally, and can profoundly alter an individual’s phenotype. Measuring allelic differential expression at heterozygous loci within an individual, a phenomenon called allele-specific expression (ASE), can assist in identifying such factors. Massively parallel DNA and RNA sequencing and advances in bioinformatic methodologies provide an outstanding opportunity to measure ASE genome-wide. In this study, matched DNA and RNA sequencing, genotyping arrays and computationally phased haplotypes were integrated to comprehensively and conservatively quantify ASE in a single human brain and liver tissue sample. We describe a methodological evaluation and assessment of common bioinformatic steps for ASE quantification, and recommend a robust approach to accurately measure SNP, gene and isoform ASE through the use of personalized haplotype genome alignment, strict alignment quality control and intragenic SNP aggregation. Our results indicate that accurate ASE quantification requires careful bioinformatic analyses and is adversely affected by sample specific alignment confounders and random sampling even at moderate sequence depths. We identified multiple known and several novel ASE genes in liver, including WDR72, DSP and UBD, as well as genes that contained ASE SNPs with imbalance direction discordant with haplotype phase, explainable by annotated transcript structure, suggesting isoform derived ASE. The methods evaluated in this study will be of use to researchers performing highly conservative quantification of ASE, and the genes and isoforms identified as ASE of interest to researchers studying those loci. PMID:25965996
The Plant Genome Integrative Explorer Resource: PlantGenIE.org.
Sundell, David; Mannapperuma, Chanaka; Netotea, Sergiu; Delhomme, Nicolas; Lin, Yao-Cheng; Sjödin, Andreas; Van de Peer, Yves; Jansson, Stefan; Hvidsten, Torgeir R; Street, Nathaniel R
2015-12-01
Accessing and exploring large-scale genomics data sets remains a significant challenge to researchers without specialist bioinformatics training. We present the integrated PlantGenIE.org platform for exploration of Populus, conifer and Arabidopsis genomics data, which includes expression networks and associated visualization tools. Standard features of a model organism database are provided, including genome browsers, gene list annotation, Blast homology searches and gene information pages. Community annotation updating is supported via integration of WebApollo. We have produced an RNA-sequencing (RNA-Seq) expression atlas for Populus tremula and have integrated these data within the expression tools. An updated version of the ComPlEx resource for performing comparative plant expression analyses of gene coexpression network conservation between species has also been integrated. The PlantGenIE.org platform provides intuitive access to large-scale and genome-wide genomics data from model forest tree species, facilitating both community contributions to annotation improvement and tools supporting use of the included data resources to inform biological insight. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Noncoding RNAs in DNA Repair and Genome Integrity
Wan, Guohui; Liu, Yunhua; Han, Cecil; Zhang, Xinna
2014-01-01
Abstract Significance: The well-studied sequences in the human genome are those of protein-coding genes, which account for only 1%–2% of the total genome. However, with the advent of high-throughput transcriptome sequencing technology, we now know that about 90% of our genome is extensively transcribed and that the vast majority of them are transcribed into noncoding RNAs (ncRNAs). It is of great interest and importance to decipher the functions of these ncRNAs in humans. Recent Advances: In the last decade, it has become apparent that ncRNAs play a crucial role in regulating gene expression in normal development, in stress responses to internal and environmental stimuli, and in human diseases. Critical Issues: In addition to those constitutively expressed structural RNA, such as ribosomal and transfer RNAs, regulatory ncRNAs can be classified as microRNAs (miRNAs), Piwi-interacting RNAs (piRNAs), small interfering RNAs (siRNAs), small nucleolar RNAs (snoRNAs), and long noncoding RNAs (lncRNAs). However, little is known about the biological features and functional roles of these ncRNAs in DNA repair and genome instability, although a number of miRNAs and lncRNAs are regulated in the DNA damage response. Future Directions: A major goal of modern biology is to identify and characterize the full profile of ncRNAs with regard to normal physiological functions and roles in human disorders. Clinically relevant ncRNAs will also be evaluated and targeted in therapeutic applications. Antioxid. Redox Signal. 20, 655–677. PMID:23879367
Generation and validation of homozygous fluorescent knock-in cells using CRISPR-Cas9 genome editing.
Koch, Birgit; Nijmeijer, Bianca; Kueblbeck, Moritz; Cai, Yin; Walther, Nike; Ellenberg, Jan
2018-06-01
Gene tagging with fluorescent proteins is essential for investigations of the dynamic properties of cellular proteins. CRISPR-Cas9 technology is a powerful tool for inserting fluorescent markers into all alleles of the gene of interest (GOI) and allows functionality and physiological expression of the fusion protein. It is essential to evaluate such genome-edited cell lines carefully in order to preclude off-target effects caused by (i) incorrect insertion of the fluorescent protein, (ii) perturbation of the fusion protein by the fluorescent proteins or (iii) nonspecific genomic DNA damage by CRISPR-Cas9. In this protocol, we provide a step-by-step description of our systematic pipeline to generate and validate homozygous fluorescent knock-in cell lines.We have used the paired Cas9D10A nickase approach to efficiently insert tags into specific genomic loci via homology-directed repair (HDR) with minimal off-target effects. It is time-consuming and costly to perform whole-genome sequencing of each cell clone to check for spontaneous genetic variations occurring in mammalian cell lines. Therefore, we have developed an efficient validation pipeline of the generated cell lines consisting of junction PCR, Southern blotting analysis, Sanger sequencing, microscopy, western blotting analysis and live-cell imaging for cell-cycle dynamics. This protocol takes between 6 and 9 weeks. With this protocol, up to 70% of the targeted genes can be tagged homozygously with fluorescent proteins, thus resulting in physiological levels and phenotypically functional expression of the fusion proteins.
Neighboring Genes Show Correlated Evolution in Gene Expression.
Ghanbarian, Avazeh T; Hurst, Laurence D
2015-07-01
When considering the evolution of a gene's expression profile, we commonly assume that this is unaffected by its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between neighboring genes in gene expression profiles in extant taxa. Indeed, in all eukaryotic genomes genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their expression or is gene expression evolution autonomous? To address this here we consider evolution of human gene expression since the human-chimp common ancestor, allowing for both variation in estimation of current expression level and error in Bayesian estimation of the ancestral state. We find that in all tissues and both sexes, the change in gene expression of a focal gene on average predicts the change in gene expression of neighbors. The effect is highly pronounced in the immediate vicinity (<100 kb) but extends much further. Sex-specific expression change is also genomically clustered. As genes increasing their expression in humans tend to avoid nuclear lamina domains and be enriched for the gene activator 5-hydroxymethylcytosine, we conclude that, most probably owing to chromatin level control of gene expression, a change in gene expression of one gene likely affects the expression evolution of neighbors, what we term expression piggybacking, an analog of hitchhiking. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
ISOL@: an Italian SOLAnaceae genomics resource.
Chiusano, Maria Luisa; D'Agostino, Nunzio; Traini, Alessandra; Licciardello, Concetta; Raimondo, Enrico; Aversano, Mario; Frusciante, Luigi; Monti, Luigi
2008-03-26
Present-day '-omics' technologies produce overwhelming amounts of data which include genome sequences, information on gene expression (transcripts and proteins) and on cell metabolic status. These data represent multiple aspects of a biological system and need to be investigated as a whole to shed light on the mechanisms which underpin the system functionality. The gathering and convergence of data generated by high-throughput technologies, the effective integration of different data-sources and the analysis of the information content based on comparative approaches are key methods for meaningful biological interpretations. In the frame of the International Solanaceae Genome Project, we propose here ISOLA, an Italian SOLAnaceae genomics resource. ISOLA (available at http://biosrv.cab.unina.it/isola) represents a trial platform and it is conceived as a multi-level computational environment.ISOLA currently consists of two main levels: the genome and the expression level. The cornerstone of the genome level is represented by the Solanum lycopersicum genome draft sequences generated by the International Tomato Genome Sequencing Consortium. Instead, the basic element of the expression level is the transcriptome information from different Solanaceae species, mainly in the form of species-specific comprehensive collections of Expressed Sequence Tags (ESTs). The cross-talk between the genome and the expression levels is based on data source sharing and on tools that enhance data quality, that extract information content from the levels' under parts and produce value-added biological knowledge. ISOLA is the result of a bioinformatics effort that addresses the challenges of the post-genomics era. It is designed to exploit '-omics' data based on effective integration to acquire biological knowledge and to approach a systems biology view. Beyond providing experimental biologists with a preliminary annotation of the tomato genome, this effort aims to produce a trial computational environment where different aspects and details are maintained as they are relevant for the analysis of the organization, the functionality and the evolution of the Solanaceae family.
Sri, Tanu; Mayee, Pratiksha; Singh, Anandita
2015-09-01
Whole genome sequence analyses allow unravelling such evolutionary consequences of meso-triplication event in Brassicaceae (∼14-20 million years ago (MYA)) as differential gene fractionation and diversification in homeologous sub-genomes. This study presents a simple gene-centric approach involving microsynteny and natural genetic variation analysis for understanding SUPPRESSOR of OVEREXPRESSION of CONSTANS 1 (SOC1) homeolog evolution in Brassica. Analysis of microsynteny in Brassica rapa homeologous regions containing SOC1 revealed differential gene fractionation correlating to reported fractionation status of sub-genomes of origin, viz. least fractionated (LF), moderately fractionated 1 (MF1) and most fractionated (MF2), respectively. Screening 18 cultivars of 6 Brassica species led to the identification of 8 genomic and 27 transcript variants of SOC1, including splice-forms. Co-occurrence of both interrupted and intronless SOC1 genes was detected in few Brassica species. In silico analysis characterised Brassica SOC1 as MADS intervening, K-box, C-terminal (MIKC(C)) transcription factor, with highly conserved MADS and I domains relative to K-box and C-terminal domain. Phylogenetic analyses and multiple sequence alignments depicting shared pattern of silent/non-silent mutations assigned Brassica SOC1 homologs into groups based on shared diploid base genome. In addition, a sub-genome structure in uncharacterised Brassica genomes was inferred. Expression analysis of putative MF2 and LF (Brassica diploid base genome A (AA)) sub-genome-specific SOC1 homeologs of Brassica juncea revealed near identical expression pattern. However, MF2-specific homeolog exhibited significantly higher expression implying regulatory diversification. In conclusion, evidence for polyploidy-induced sequence and regulatory evolution in Brassica SOC1 is being presented wherein differential homeolog expression is implied in functional diversification.
Cross-study projections of genomic biomarkers: an evaluation in cancer genomics.
Lucas, Joseph E; Carvalho, Carlos M; Chen, Julia Ling-Yu; Chi, Jen-Tsan; West, Mike
2009-01-01
Human disease studies using DNA microarrays in both clinical/observational and experimental/controlled studies are having increasing impact on our understanding of the complexity of human diseases. A fundamental concept is the use of gene expression as a "common currency" that links the results of in vitro controlled experiments to in vivo observational human studies. Many studies--in cancer and other diseases--have shown promise in using in vitro cell manipulations to improve understanding of in vivo biology, but experiments often simply fail to reflect the enormous phenotypic variation seen in human diseases. We address this with a framework and methods to dissect, enhance and extend the in vivo utility of in vitro derived gene expression signatures. From an experimentally defined gene expression signature we use statistical factor analysis to generate multiple quantitative factors in human cancer gene expression data. These factors retain their relationship to the original, one-dimensional in vitro signature but better describe the diversity of in vivo biology. In a breast cancer analysis, we show that factors can reflect fundamentally different biological processes linked to molecular and clinical features of human cancers, and that in combination they can improve prediction of clinical outcomes.
Regulation of human genome expression and RNA splicing by human papillomavirus 16 E2 protein.
Gauson, Elaine J; Windle, Brad; Donaldson, Mary M; Caffarel, Maria M; Dornan, Edward S; Coleman, Nicholas; Herzyk, Pawel; Henderson, Scott C; Wang, Xu; Morgan, Iain M
2014-11-01
Human papillomavirus 16 (HPV16) is causative in human cancer. The E2 protein regulates transcription from and replication of the viral genome; the role of E2 in regulating the host genome has been less well studied. We have expressed HPV16 E2 (E2) stably in U2OS cells; these cells tolerate E2 expression well and gene expression analysis identified 74 genes showing differential expression specific to E2. Analysis of published gene expression data sets during cervical cancer progression identified 20 of the genes as being altered in a similar direction as the E2 specific genes. In addition, E2 altered the splicing of many genes implicated in cancer and cell motility. The E2 expressing cells showed no alteration in cell growth but were altered in cell motility, consistent with the E2 induced altered splicing predicted to affect this cellular function. The results present a model system for investigating E2 regulation of the host genome. Copyright © 2014 Elsevier Inc. All rights reserved.
Liang, Danna; Liu, Min; Hu, Qijing; He, Min; Qi, Xiaohua; Xu, Qiang; Zhou, Fucai; Chen, Xuehao
2015-01-01
Cucumber, a very important vegetable crop worldwide, is easily damaged by pests. Aphids (Aphis gossypii Glover) are among the most serious pests in cucumber production and often cause severe loss of yield and make fruit quality get worse. Identifying genes that render cucumbers resistant to aphid-induced damage and breeding aphid-resistant cucumber varieties have become the most promising control strategies. In this study, a Illumina Genome Analyzer platform was applied to monitor changes in gene expression in the whole genome of the cucumber cultivar ‘EP6392’ which is resistant to aphids. Nine DGE libraries were constructed from infected and uninfected leaves. In total, 49 differentially expressed genes related to cucumber aphid resistance were screened during the treatment period. These genes are mainly associated with signal transduction, plant-pathogen interactions, flavonoid biosynthesis, amino acid metabolism and sugar metabolism pathways. Eight of the 49 genes may be associated with aphid resistance. Finally, expression of 9 randomly selected genes was evaluated by qRT-PCR to verify the results for the tag-mapped genes. With the exception of 1-aminocyclopropane-1-carboxylate oxidase homolog 6, the expression of the chosen genes was in agreement with the results of the tag-sequencing analysis patterns. PMID:25959296
Regis, David P.; Dobaño, Carlota; Quiñones-Olson, Paola; Liang, Xiaowu; Graber, Norma L.; Stefaniak, Maureen E.; Campo, Joseph J.; Carucci, Daniel J.; Roth, David A.; He, Huaping; Felgner, Philip L.; Doolan, Denise L.
2009-01-01
We have evaluated a technology called Transcriptionally Active PCR (TAP) for high throughput identification and prioritization of novel target antigens from genomic sequence data using the Plasmodium parasite, the causative agent of malaria, as a model. First, we adapted the TAP technology for the highly AT-rich Plasmodium genome, using well-characterized P. falciparum and P. yoelii antigens and a small panel of uncharacterized open reading frames from the P. falciparum genome sequence database. We demonstrated that TAP fragments encoding six well-characterized P. falciparum antigens and five well-characterized P. yoelii antigens could be amplified in an equivalent manner from both plasmid DNA and genomic DNA templates, and that uncharacterized open reading frames could also be amplified from genomic DNA template. Second, we showed that the in vitro expression of the TAP fragments was equivalent or superior to that of supercoiled plasmid DNA encoding the same antigen. Third, we evaluated the in vivo immunogenicity of TAP fragments encoding a subset of the model P. falciparum and P. yoelii antigens. We found that antigen-specific antibody and cellular immune responses induced by the TAP fragments in mice were equivalent or superior to those induced by the corresponding plasmid DNA vaccines. Finally, we developed and demonstrated proof-of-principle for an in vitro humoral immunoscreening assay for down-selection of novel target antigens. These data support the potential of a TAP approach for rapid high throughput functional screening and identification of potential candidate vaccine antigens from genomic sequence data. PMID:18164079
Genotype Imputation for Latinos Using the HapMap and 1000 Genomes Project Reference Panels.
Gao, Xiaoyi; Haritunians, Talin; Marjoram, Paul; McKean-Cowdin, Roberta; Torres, Mina; Taylor, Kent D; Rotter, Jerome I; Gauderman, William J; Varma, Rohit
2012-01-01
Genotype imputation is a vital tool in genome-wide association studies (GWAS) and meta-analyses of multiple GWAS results. Imputation enables researchers to increase genomic coverage and to pool data generated using different genotyping platforms. HapMap samples are often employed as the reference panel. More recently, the 1000 Genomes Project resource is becoming the primary source for reference panels. Multiple GWAS and meta-analyses are targeting Latinos, the most populous, and fastest growing minority group in the US. However, genotype imputation resources for Latinos are rather limited compared to individuals of European ancestry at present, largely because of the lack of good reference data. One choice of reference panel for Latinos is one derived from the population of Mexican individuals in Los Angeles contained in the HapMap Phase 3 project and the 1000 Genomes Project. However, a detailed evaluation of the quality of the imputed genotypes derived from the public reference panels has not yet been reported. Using simulation studies, the Illumina OmniExpress GWAS data from the Los Angles Latino Eye Study and the MACH software package, we evaluated the accuracy of genotype imputation in Latinos. Our results show that the 1000 Genomes Project AMR + CEU + YRI reference panel provides the highest imputation accuracy for Latinos, and that also including Asian samples in the panel can reduce imputation accuracy. We also provide the imputation accuracy for each autosomal chromosome using the 1000 Genomes Project panel for Latinos. Our results serve as a guide to future imputation based analysis in Latinos.
Regis, David P; Dobaño, Carlota; Quiñones-Olson, Paola; Liang, Xiaowu; Graber, Norma L; Stefaniak, Maureen E; Campo, Joseph J; Carucci, Daniel J; Roth, David A; He, Huaping; Felgner, Philip L; Doolan, Denise L
2008-03-01
We have evaluated a technology called transcriptionally active PCR (TAP) for high throughput identification and prioritization of novel target antigens from genomic sequence data using the Plasmodium parasite, the causative agent of malaria, as a model. First, we adapted the TAP technology for the highly AT-rich Plasmodium genome, using well-characterized P. falciparum and P. yoelii antigens and a small panel of uncharacterized open reading frames from the P. falciparum genome sequence database. We demonstrated that TAP fragments encoding six well-characterized P. falciparum antigens and five well-characterized P. yoelii antigens could be amplified in an equivalent manner from both plasmid DNA and genomic DNA templates, and that uncharacterized open reading frames could also be amplified from genomic DNA template. Second, we showed that the in vitro expression of the TAP fragments was equivalent or superior to that of supercoiled plasmid DNA encoding the same antigen. Third, we evaluated the in vivo immunogenicity of TAP fragments encoding a subset of the model P. falciparum and P. yoelii antigens. We found that antigen-specific antibody and cellular immune responses induced by the TAP fragments in mice were equivalent or superior to those induced by the corresponding plasmid DNA vaccines. Finally, we developed and demonstrated proof-of-principle for an in vitro humoral immunoscreening assay for down-selection of novel target antigens. These data support the potential of a TAP approach for rapid high throughput functional screening and identification of potential candidate vaccine antigens from genomic sequence data.
Zhang, Min; Zhang, Lin; Zou, Jinfeng; Yao, Chen; Xiao, Hui; Liu, Qing; Wang, Jing; Wang, Dong; Wang, Chenguang; Guo, Zheng
2009-07-01
According to current consistency metrics such as percentage of overlapping genes (POG), lists of differentially expressed genes (DEGs) detected from different microarray studies for a complex disease are often highly inconsistent. This irreproducibility problem also exists in other high-throughput post-genomic areas such as proteomics and metabolism. A complex disease is often characterized with many coordinated molecular changes, which should be considered when evaluating the reproducibility of discovery lists from different studies. We proposed metrics percentage of overlapping genes-related (POGR) and normalized POGR (nPOGR) to evaluate the consistency between two DEG lists for a complex disease, considering correlated molecular changes rather than only counting gene overlaps between the lists. Based on microarray datasets of three diseases, we showed that though the POG scores for DEG lists from different studies for each disease are extremely low, the POGR and nPOGR scores can be rather high, suggesting that the apparently inconsistent DEG lists may be highly reproducible in the sense that they are actually significantly correlated. Observing different discovery results for a disease by the POGR and nPOGR scores will obviously reduce the uncertainty of the microarray studies. The proposed metrics could also be applicable in many other high-throughput post-genomic areas.
Redefining the genetics of Murine Gammaherpesvirus 68 via transcriptome-based annotation
Johnson, L. Steven; Willert, Erin K.; Virgin, Herbert W.
2010-01-01
Summary Viral genetic studies often focus on large open reading frames (ORFs) identified during genome annotation (ORF-based annotation). Here we provide a tool and software set for defining gene expression by murine gammaherpesvirus 68 (γHV68) nucleotide-by-nucleotide across the 119,450 basepair (bp) genome. These tools allowed us to determine that viral RNA expression was significantly more complex than predicted from ORF-based annotation, including over 73,000 nucleotides of unexpected transcription within 30 expressed genomic regions (EGRs). Approximately 90% of this RNA expression was antisense to genomic regions containing known large ORFs. We verified the existence of novel transcripts in three EGRs using standard methods to validate the approach and determined which parts of the transcriptome depend on protein or viral DNA synthesis. This redefines the genetic map of γHV68, indicates that herpesviruses contain significantly more genetic complexity than predicted from ORF-based genome annotations, and provides new tools and approaches for viral genetic studies. PMID:20542255
Xu, Jian-zhong; Zhang, Wei-guo
2016-01-01
With the availability of the whole genome sequence of Escherichia coli or Corynebacterium glutamicum, strategies for directed DNA manipulation have developed rapidly. DNA manipulation plays an important role in understanding the function of genes and in constructing novel engineering bacteria according to requirement. DNA manipulation involves modifying the autologous genes and expressing the heterogenous genes. Two alternative approaches, using electroporation linear DNA or recombinant suicide plasmid, allow a wide variety of DNA manipulation. However, the over-expression of the desired gene is generally executed via plasmid-mediation. The current review summarizes the common strategies used for genetically modifying E. coli and C. glutamicum genomes, and discusses the technical problem of multi-layered DNA manipulation. Strategies for gene over-expression via integrating into genome are proposed. This review is intended to be an accessible introduction to DNA manipulation within the bacterial genome for novices and a source of the latest experimental information for experienced investigators. PMID:26834010
Tyagi, Shivi; Himani; Sembi, Jaspreet K; Upadhyay, Santosh Kumar
2018-04-01
Glutathione peroxidases (GPXs) are redox sensor proteins that maintain a steady-state of H 2 O 2 in plant cells. They exhibit distinct sub-cellular localization and have diverse functionality in response to different stimuli. In this study, a total of 14 TaGPX genes and three splice variants were identified in the genome of Triticum aestivum and evaluated for various physicochemical properties. The TaGPX genes were scattered on the various chromosomes of the A, B, and D sub-genomes and clustered into five homeologous groups based on high sequence homology. The majority of genes were derived from the B sub-genome and localized on chromosome 2. The intron-exon organization, motif and domain architecture, and phylogenetic analyses revealed the conserved nature of TaGPXs. The occurrence of both development-related and stress-responsive cis-acting elements in the promoter region, the differential expression of these genes during various developmental stages, and the modulation of expression in the presence of biotic and abiotic stresses suggested their diverse role in T. aestivum. The majority of TaGPX genes showed higher expression in various leaf developmental stages. However, TaGPX1-A1 was upregulated in the presence of each abiotic stress treatment. A co-expression analysis revealed the interaction of TaGPXs with numerous development and stress-related genes, which indicated their vital role in numerous biological processes. Our study revealed the opportunities for further characterization of individual TaGPX proteins, which might be useful in designing future crop improvement strategies. Copyright © 2018 Elsevier GmbH. All rights reserved.
Verma, Jitendra Kumar; Wardhan, Vijay; Singh, Deepali; Chakraborty, Subhra; Chakraborty, Niranjan
2018-03-28
Architectural proteins play key roles in genome construction and regulate the expression of many genes, albeit the modulation of genome plasticity by these proteins is largely unknown. A critical screening of the architectural proteins in five crop species, viz., Oryza sativa , Zea mays , Sorghum bicolor , Cicer arietinum , and Vitis vinifera , and in the model plant Arabidopsis thaliana along with evolutionary relevant species such as Chlamydomonas reinhardtii , Physcomitrella patens , and Amborella trichopoda , revealed 9, 20, 10, 7, 7, 6, 1, 4, and 4 Alba (acetylation lowers binding affinity) genes, respectively. A phylogenetic analysis of the genes and of their counterparts in other plant species indicated evolutionary conservation and diversification. In each group, the structural components of the genes and motifs showed significant conservation. The chromosomal location of the Alba genes of rice ( OsAlba ), showed an unequal distribution on 8 of its 12 chromosomes. The expression profiles of the OsAlba genes indicated a distinct tissue-specific expression in the seedling, vegetative, and reproductive stages. The quantitative real-time PCR (qRT-PCR) analysis of the OsAlba genes confirmed their stress-inducible expression under multivariate environmental conditions and phytohormone treatments. The evaluation of the regulatory elements in 68 Alba genes from the 9 species studied led to the identification of conserved motifs and overlapping microRNA (miRNA) target sites, suggesting the conservation of their function in related proteins and a divergence in their biological roles across species. The 3D structure and the prediction of putative ligands and their binding sites for OsAlba proteins offered a key insight into the structure-function relationship. These results provide a comprehensive overview of the subtle genetic diversification of the OsAlba genes, which will help in elucidating their functional role in plants.
Mallik, Saurav; Zhao, Zhongming
2017-12-28
For transcriptomic analysis, there are numerous microarray-based genomic data, especially those generated for cancer research. The typical analysis measures the difference between a cancer sample-group and a matched control group for each transcript or gene. Association rule mining is used to discover interesting item sets through rule-based methodology. Thus, it has advantages to find causal effect relationships between the transcripts. In this work, we introduce two new rule-based similarity measures-weighted rank-based Jaccard and Cosine measures-and then propose a novel computational framework to detect condensed gene co-expression modules ( C o n G E M s) through the association rule-based learning system and the weighted similarity scores. In practice, the list of evolved condensed markers that consists of both singular and complex markers in nature depends on the corresponding condensed gene sets in either antecedent or consequent of the rules of the resultant modules. In our evaluation, these markers could be supported by literature evidence, KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway and Gene Ontology annotations. Specifically, we preliminarily identified differentially expressed genes using an empirical Bayes test. A recently developed algorithm-RANWAR-was then utilized to determine the association rules from these genes. Based on that, we computed the integrated similarity scores of these rule-based similarity measures between each rule-pair, and the resultant scores were used for clustering to identify the co-expressed rule-modules. We applied our method to a gene expression dataset for lung squamous cell carcinoma and a genome methylation dataset for uterine cervical carcinogenesis. Our proposed module discovery method produced better results than the traditional gene-module discovery measures. In summary, our proposed rule-based method is useful for exploring biomarker modules from transcriptomic data.
Central genomic regulation of the expression of oestrous behaviour in dairy cows: a review.
Woelders, H; van der Lende, T; Kommadath, A; te Pas, M F W; Smits, M A; Kaal, L M T E
2014-05-01
The expression of oestrous behaviour in Holstein Friesian dairy cows has progressively decreased over the past 50 years. Reduced oestrus expression is one of the factors contributing to the current suboptimal reproductive efficiency in dairy farming. Variation between and within cows in the expression of oestrous behaviour is associated with variation in peripheral blood oestradiol concentrations during oestrus. In addition, there is evidence for a priming role of progesterone for the full display of oestrous behaviour. A higher rate of metabolic clearance of ovarian steroids could be one of the factors leading to lower peripheral blood concentrations of oestradiol and progesterone in high-producing dairy cows. Oestradiol acts on the brain by genomic, non-genomic and growth factor-dependent mechanisms. A firm base of understanding of the ovarian steroid-driven central genomic regulation of female sexual behaviour has been obtained from studies on rodents. These studies have resulted in the definition of five modules of oestradiol-activated genes in the brain, referred to as the GAPPS modules. In a recent series of studies, gene expression in the anterior pituitary and four brain areas (amygdala, hippocampus, dorsal hypothalamus and ventral hypothalamus) in oestrous and luteal phase cows, respectively, has been measured, and the relation with oestrous behaviour of these cows was analysed. These studies identified a number of genes of which the expression was associated with the intensity of oestrous behaviour. These genes could be grouped according to the GAPPS modules, suggesting close similarity of the regulation of oestrous behaviour in cows and female sexual behaviour in rodents. A better understanding of the central genomic regulation of the expression of oestrous behaviour in dairy cows may in due time contribute to improved (genomic) selection strategies for appropriate oestrus expression in high-producing dairy cows.
Anis, Eman A; Dhar, Madhu; Legendre, Alfred M; Wilkes, Rebecca P
2017-06-01
Objectives The goals of the study were: (1) to develop and evaluate non-replicating lentivirus vectors coding for feline coronavirus (FCoV)-specific micro (mi)RNA as a potential antiviral therapy for feline infectious peritonitis (FIP); (2) to assess the feasibility of transducing hematopoietic stem cells (HSCs) with ex vivo introduction of the miRNA-expressing lentivirus vector; and (3) to assess the ability of the expressed miRNA to inhibit FCoV replication in HSCs in vitro. Methods HSCs were obtained from feline bone marrow and replicated in vitro. Three lentiviruses were constructed, each expressing a different anti-FCoV miRNA. HSCs were stably transduced with the miRNA-expressing lentivirus vector that produced the most effective viral inhibition in a feline cell line. The effectiveness of the transduction and the expression of anti-FCoV miRNA were tested by infecting the HSCs with two different strains of FCoV. The inhibition of coronavirus replication was determined by relative quantification of the inhibition of intracellular viral genomic RNA synthesis using real-time, reverse-transcription PCR. The assessment of virus replication inhibition was determined via titration of extracellular virus using the TCID 50 assay. Results Inhibition of FCoV was most significant in feline cells expressing miRNA-L2 that targeted the viral leader sequence, 48 h postinfection. miRNA-L2 expression in stably transduced HSCs resulted in 90% and 92% reductions in FIPV WSU 79-1146 genomic RNA synthesis and extracellular virus production, respectively, as well as 74% and 80% reduction in FECV WSU 79-1683 genomic RNA synthesis and extracellular virus production, respectively, as compared with an infected negative control sample producing non-targeting miRNA. Conclusions and relevance These preliminary results show that genetic modification of HSCs for constitutive production of anti-coronavirus miRNA will reduce FCoV replication.
Schwaenen, Carsten; Viardot, Andreas; Berger, Hilmar; Barth, Thomas F E; Bentink, Stefan; Döhner, Hartmut; Enz, Martina; Feller, Alfred C; Hansmann, Martin-Leo; Hummel, Michael; Kestler, Hans A; Klapper, Wolfram; Kreuz, Markus; Lenze, Dido; Loeffler, Markus; Möller, Peter; Müller-Hermelink, Hans-Konrad; Ott, German; Rosolowski, Maciej; Rosenwald, Andreas; Ruf, Sandra; Siebert, Reiner; Spang, Rainer; Stein, Harald; Truemper, Lorenz; Lichter, Peter; Bentz, Martin; Wessendorf, Swen
2009-01-01
Follicular lymphoma (FL) is characterized by a large number of chromosomal aberrations. However, their exact genomic extension and involved target genes remain to be determined. For this purpose, we used array-based intermediate-high resolution genomic profiling in combination with Affymetrix gene expression analysis. Tumor specimens from 128 FL patients were analyzed for the presence of genomic aberrations and the results were correlated to clinical data sets and mRNA expression levels. In 114 (89%) of the 128 analyzed cases, a total of 688 genomic aberrations (384 gains/amplifications and 304 losses) were detected. Frequent genomic aberrations were: -1p36 (18%), +2p15 (24%), -3q (14%), -6q (25%), +7p (19%), +7q (23%), +8q (14%), -9p (16%), -11q (15%), +12q (20%), -13q (11%), -17p (16%), +18p (18%), and +18q (28%). Critical segments of these imbalances were delineated to genomic fragments with a minimum size down to 0.2 Mb. By comparison of these with mRNA gene expression data, putative candidate genes were identified. Moreover, we found that deletions affecting the tumor suppressor gene CDKN2A/B on 9p21 were detected in nontransformed FL grade I-II. For this aberration as well as for -6q25 and -6q26, an association with inferior survival was observed.
[Evolution of genomic imprinting in mammals: what a zoo!].
Proudhon, Charlotte; Bourc'his, Déborah
2010-05-01
Genomic imprinting imposes an obligate mode of biparental reproduction in mammals. This phenomenon results from the monoparental expression of a subset of genes. This specific gene regulation mechanism affects viviparous mammals, especially eutherians, but also marsupials to a lesser extent. Oviparous mammals, or monotremes, do not seem to demonstrate monoparental allele expression. This phylogenic confinement suggests that the evolution of the placenta imposed a selective pressure for the emergence of genomic imprinting. This physiological argument is now complemented by recent genomic evidence facilitated by the sequencing of the platypus genome, a rare modern day case of a monotreme. Analysis of the platypus genome in comparison to eutherian genomes shows a chronological and functional coincidence between the appearance of genomic imprinting and transposable element accumulation. The systematic comparative analyses of genomic sequences in different species is essential for the further understanding of genomic imprinting emergence and divergent evolution along mammalian speciation.
Hattori, Hiroyoshi; Janky, Rekin's; Nietfeld, Wilfried; Aerts, Stein; Madan Babu, M; Venkitaraman, Ashok R
2014-01-01
The human DNA damage response (DDR) triggers profound changes in gene expression, whose nature and regulation remain uncertain. Although certain micro-(mi)RNA species including miR34, miR-18, miR-16 and miR-143 have been implicated in the DDR, there is as yet no comprehensive description of genome-wide changes in the expression of miRNAs triggered by DNA breakage in human cells. We have used next-generation sequencing (NGS), combined with rigorous integrative computational analyses, to describe genome-wide changes in the expression of miRNAs during the human DDR. The changes affect 150 of 1523 miRNAs known in miRBase v18 from 4-24 h after the induction of DNA breakage, in cell-type dependent patterns. The regulatory regions of the most-highly regulated miRNA species are enriched in conserved binding sites for p53. Indeed, genome-wide changes in miRNA expression during the DDR are markedly altered in TP53-/- cells compared to otherwise isogenic controls. The expression levels of certain damage-induced, p53-regulated miRNAs in cancer samples correlate with patient survival. Our work reveals genome-wide and cell type-specific alterations in miRNA expression during the human DDR, which are regulated by the tumor suppressor protein p53. These findings provide a genomic resource to identify new molecules and mechanisms involved in the DDR, and to examine their role in tumor suppression and the clinical outcome of cancer patients.
Platre, Matthieu Pierre; Barberon, Marie; Caillieux, Erwann; Colot, Vincent
2016-01-01
Summary Multicellular organisms are composed of many cell types that acquire their specific fate through a precisely controlled pattern of gene expression in time and space dictated in part by cell type-specific promoter activity. Understanding the contribution of highly specialized cell types in the development of a whole organism requires the ability to isolate or analyze different cell types separately. We have characterized and validated a large collection of root cell type-specific promoters and have generated cell type-specific marker lines. These benchmarked promoters can be readily used to evaluate cell type-specific complementation of mutant phenotypes, or to knockdown gene expression using targeted expression of artificial miRNA. We also generated vectors and characterized transgenic lines for cell type-specific induction of gene expression and cell type-specific isolation of nuclei for RNA and chromatin profiling. Vectors and seeds from transgenic Arabidopsis plants will be freely available, and will promote rapid progress in cell type-specific functional genomics. We demonstrate the power of this promoter set for analysis of complex biological processes by investigating the contribution of root cell types in the IRT1-dependent root iron uptake. Our findings revealed the complex spatial expression pattern of IRT1 in both root epidermis and phloem companion cells and the requirement for IRT1 to be expressed in both cell types for proper iron homeostasis. PMID:26662936
Frasson, Amanda Piccoli; Dos Santos, Odelta; Meirelles, Lúcia Collares; Macedo, Alexandre José; Tasca, Tiana
2016-01-01
Trichomonas vaginalis is a protozoan that parasitizes the human urogenital tract causing trichomoniasis, the most common non-viral sexually transmitted disease. The parasite has unique genomic characteristics such as a large genome size and expanded gene families. Ectonucleoside triphosphate diphosphohydrolase (E-NTPDase) is an enzyme responsible for hydrolyzing nucleoside tri- and diphosphates and has already been biochemically characterized in T. vaginalis. Considering the important role of this enzyme in the production of extracellular adenosine for parasite uptake, we evaluated the gene expression of five putative NTPDases in T. vaginalis. We showed that all five putative TvNTPDase genes (TvNTPDase1-5) were expressed by both fresh clinical and long-term grown isolates. The amino acid alignment predicted the presence of the five crucial apyrase conserved regions, transmembrane domains, signal peptides, phosphorylation and catalytic sites. Moreover, a phylogenetic analysis showed that TvNTPDase sequences make up a clade with NTPDases intracellularly located. Biochemical NTPDase activity (ATP and ADP hydrolysis) is responsive to the serum-restrictive conditions and the gene expression of TvNTPDases was mostly increased, mainly TvNTPDase2 and TvNTPDase4, although there was not a clear pattern of expression among them. In summary, the present report demonstrates the gene expression patterns of predicted NTPDases in T. vaginalis. © FEMS 2015. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Libbrecht, Maxwell W.; Ay, Ferhat; Hoffman, Michael M.; Gilbert, David M.; Bilmes, Jeffrey A.; Noble, William Stafford
2015-01-01
The genomic neighborhood of a gene influences its activity, a behavior that is attributable in part to domain-scale regulation. Previous genomic studies have identified many types of regulatory domains. However, due to the difficulty of integrating genomics data sets, the relationships among these domain types are poorly understood. Semi-automated genome annotation (SAGA) algorithms facilitate human interpretation of heterogeneous collections of genomics data by simultaneously partitioning the human genome and assigning labels to the resulting genomic segments. However, existing SAGA methods cannot integrate inherently pairwise chromatin conformation data. We developed a new computational method, called graph-based regularization (GBR), for expressing a pairwise prior that encourages certain pairs of genomic loci to receive the same label in a genome annotation. We used GBR to exploit chromatin conformation information during genome annotation by encouraging positions that are close in 3D to occupy the same type of domain. Using this approach, we produced a model of chromatin domains in eight human cell types, thereby revealing the relationships among known domain types. Through this model, we identified clusters of tightly regulated genes expressed in only a small number of cell types, which we term “specific expression domains.” We found that domain boundaries marked by promoters and CTCF motifs are consistent between cell types even when domain activity changes. Finally, we showed that GBR can be used to transfer information from well-studied cell types to less well-characterized cell types during genome annotation, making it possible to produce high-quality annotations of the hundreds of cell types with limited available data. PMID:25677182
Libbrecht, Maxwell W; Ay, Ferhat; Hoffman, Michael M; Gilbert, David M; Bilmes, Jeffrey A; Noble, William Stafford
2015-04-01
The genomic neighborhood of a gene influences its activity, a behavior that is attributable in part to domain-scale regulation. Previous genomic studies have identified many types of regulatory domains. However, due to the difficulty of integrating genomics data sets, the relationships among these domain types are poorly understood. Semi-automated genome annotation (SAGA) algorithms facilitate human interpretation of heterogeneous collections of genomics data by simultaneously partitioning the human genome and assigning labels to the resulting genomic segments. However, existing SAGA methods cannot integrate inherently pairwise chromatin conformation data. We developed a new computational method, called graph-based regularization (GBR), for expressing a pairwise prior that encourages certain pairs of genomic loci to receive the same label in a genome annotation. We used GBR to exploit chromatin conformation information during genome annotation by encouraging positions that are close in 3D to occupy the same type of domain. Using this approach, we produced a model of chromatin domains in eight human cell types, thereby revealing the relationships among known domain types. Through this model, we identified clusters of tightly regulated genes expressed in only a small number of cell types, which we term "specific expression domains." We found that domain boundaries marked by promoters and CTCF motifs are consistent between cell types even when domain activity changes. Finally, we showed that GBR can be used to transfer information from well-studied cell types to less well-characterized cell types during genome annotation, making it possible to produce high-quality annotations of the hundreds of cell types with limited available data. © 2015 Libbrecht et al.; Published by Cold Spring Harbor Laboratory Press.
Gu, Joyce Xiuweu-Xu; Wei, Michael Yang; Rao, Pulivarthi H.; Lau, Ching C.; Behl, Sanjiv; Man, Tsz-Kwong
2007-01-01
With the increasing application of various genomic technologies in biomedical research, there is a need to integrate these data to correlate candidate genes/regions that are identified by different genomic platforms. Although there are tools that can analyze data from individual platforms, essential software for integration of genomic data is still lacking. Here, we present a novel Java-based program called CGI (Cytogenetics-Genomics Integrator) that matches the BAC clones from array-based comparative genomic hybridization (aCGH) to genes from RNA expression profiling datasets. The matching is computed via a fast, backend MySQL database containing UCSC Genome Browser annotations. This program also provides an easy-to-use graphical user interface for visualizing and summarizing the correlation of DNA copy number changes and RNA expression patterns from a set of experiments. In addition, CGI uses a Java applet to display the copy number values of a specific BAC clone in aCGH experiments side by side with the expression levels of genes that are mapped back to that BAC clone from the microarray experiments. The CGI program is built on top of extensible, reusable graphic components specifically designed for biologists. It is cross-platform compatible and the source code is freely available under the General Public License. PMID:19936083
Gu, Joyce Xiuweu-Xu; Wei, Michael Yang; Rao, Pulivarthi H; Lau, Ching C; Behl, Sanjiv; Man, Tsz-Kwong
2007-10-06
With the increasing application of various genomic technologies in biomedical research, there is a need to integrate these data to correlate candidate genes/regions that are identified by different genomic platforms. Although there are tools that can analyze data from individual platforms, essential software for integration of genomic data is still lacking. Here, we present a novel Java-based program called CGI (Cytogenetics-Genomics Integrator) that matches the BAC clones from array-based comparative genomic hybridization (aCGH) to genes from RNA expression profiling datasets. The matching is computed via a fast, backend MySQL database containing UCSC Genome Browser annotations. This program also provides an easy-to-use graphical user interface for visualizing and summarizing the correlation of DNA copy number changes and RNA expression patterns from a set of experiments. In addition, CGI uses a Java applet to display the copy number values of a specific BAC clone in aCGH experiments side by side with the expression levels of genes that are mapped back to that BAC clone from the microarray experiments. The CGI program is built on top of extensible, reusable graphic components specifically designed for biologists. It is cross-platform compatible and the source code is freely available under the General Public License.
Gramene 2016: comparative plant genomics and pathway resources
Tello-Ruiz, Marcela K.; Stein, Joshua; Wei, Sharon; Preece, Justin; Olson, Andrew; Naithani, Sushma; Amarasinghe, Vindhya; Dharmawardhana, Palitha; Jiao, Yinping; Mulvaney, Joseph; Kumari, Sunita; Chougule, Kapeel; Elser, Justin; Wang, Bo; Thomason, James; Bolser, Daniel M.; Kerhornou, Arnaud; Walts, Brandon; Fonseca, Nuno A.; Huerta, Laura; Keays, Maria; Tang, Y. Amy; Parkinson, Helen; Fabregat, Antonio; McKay, Sheldon; Weiser, Joel; D'Eustachio, Peter; Stein, Lincoln; Petryszak, Robert; Kersey, Paul J.; Jaiswal, Pankaj; Ware, Doreen
2016-01-01
Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the database website adopted a new Drupal management platform. The genomes section features 39 fully assembled reference genomes that are integrated using ontology-based annotation and comparative analyses, and accessed through both visual and programmatic interfaces. Additional community data, such as genetic variation, expression and methylation, are also mapped for a subset of genomes. The Plant Reactome pathway portal (http://plantreactome.gramene.org) provides a reference resource for analyzing plant metabolic and regulatory pathways. In addition to ∼200 curated rice reference pathways, the portal hosts gene homology-based pathway projections for 33 plant species. Both the genome and pathway browsers interface with the EMBL-EBI's Expression Atlas to enable the projection of baseline and differential expression data from curated expression studies in plants. Gramene's archive website (http://archive.gramene.org) continues to provide previously reported resources on comparative maps, markers and QTL. To further aid our users, we have also introduced a live monthly educational webinar series and a Gramene YouTube channel carrying video tutorials. PMID:26553803
The consequences of chromosomal aneuploidy on the transcriptome of cancer cells☆
Ried, Thomas; Hu, Yue; Difilippantonio, Michael J.; Ghadimi, B. Michael; Grade, Marian; Camps, Jordi
2016-01-01
Chromosomal aneuploidies are a defining feature of carcinomas, i.e., tumors of epithelial origin. Such aneuploidies result in tumor specific genomic copy number alterations. The patterns of genomic imbalances are tumor specific, and to a certain extent specific for defined stages of tumor development. Genomic imbalances occur already in premalignant precursor lesions, i.e., before the transition to invasive disease, and their distribution is maintained in metastases, and in cell lines derived from primary tumors. These observations are consistent with the interpretation that tumor specific genomic imbalances are drivers of malignant transformation. Naturally, this precipitates the question of how such imbalances influence the expression of resident genes. A number of laboratories have systematically integrated copy number alterations with gene expression changes in primary tumors and metastases, cell lines, and experimental models of aneuploidy to address the question as to whether genomic imbalances deregulate the expression of one or few key genes, or rather affect the cancer transcriptome more globally. The majority of these studies showed that gene expression levels follow genomic copy number. Therefore, gross genomic copy number changes, including aneuploidies of entire chromosome arms and chromosomes, result in a massive deregulation of the transcriptome of cancer cells. This article is part of a Special Issue entitled: Chromatin in time and space. PMID:22426433
Cerveau, Nicolas; Gilbert, Clément; Liu, Chao; Garrett, Roger A; Grève, Pierre; Bouchon, Didier; Cordaux, Richard
2015-06-10
Transposable elements (TEs) are DNA pieces that are present in almost all the living world at variable genomic density. Due to their mobility and density, TEs are involved in a large array of genomic modifications. In eukaryotes, TE expression has been studied in detail in several species. In prokaryotes, studies of IS expression are generally linked to particular copies that induce a modification of neighboring gene expression. Here we investigated global patterns of IS transcription in the Alphaproteobacterial endosymbiont Wolbachia wVulC, using both RT-PCR and bioinformatic analyses. We detected several transcriptional promoters in all IS groups. Nevertheless, only one of the potentially functional IS groups possesses a promoter located upstream of the transposase gene, that could lead up to the production of a functional protein. We found that the majority of IS groups are expressed whatever their functional status. RT-PCR analyses indicate that the transcription of two IS groups lacking internal promoters upstream of the transposase start codon may be driven by the genomic environment. We confirmed this observation with the transcription analysis of individual copies of one IS group. These results suggest that the genomic environment is important for IS expression and it could explain, at least partly, copy number variability of the various IS groups present in the wVulC genome and, more generally, in bacterial genomes. Copyright © 2015 Elsevier B.V. All rights reserved.
Perspectives: Gene Expression in Fisheries Management
Nielsen, Jennifer L.; Pavey, Scott A.
2010-01-01
Functional genes and gene expression have been connected to physiological traits linked to effective production and broodstock selection in aquaculture, selective implications of commercial fish harvest, and adaptive changes reflected in non-commercial fish populations subject to human disturbance and climate change. Gene mapping using single nucleotide polymorphisms (SNPs) to identify functional genes, gene expression (analogue microarrays and real-time PCR), and digital sequencing technologies looking at RNA transcripts present new concepts and opportunities in support of effective and sustainable fisheries. Genomic tools have been rapidly growing in aquaculture research addressing aspects of fish health, toxicology, and early development. Genomic technologies linking effects in functional genes involved in growth, maturation and life history development have been tied to selection resulting from harvest practices. Incorporating new and ever-increasing knowledge of fish genomes is opening a different perspective on local adaptation that will prove invaluable in wild fish conservation and management. Conservation of fish stocks is rapidly incorporating research on critical adaptive responses directed at the effects of human disturbance and climate change through gene expression studies. Genomic studies of fish populations can be generally grouped into three broad categories: 1) evolutionary genomics and biodiversity; 2) adaptive physiological responses to a changing environment; and 3) adaptive behavioral genomics and life history diversity. We review current genomic research in fisheries focusing on those that use microarrays to explore differences in gene expression among phenotypes and within or across populations, information that is critically important to the conservation of fish and their relationship to humans.
The Immunological Genome Project: networks of gene expression in immune cells.
Heng, Tracy S P; Painter, Michio W
2008-10-01
The Immunological Genome Project combines immunology and computational biology laboratories in an effort to establish a complete 'road map' of gene-expression and regulatory networks in all immune cells.
Developing molecular tools for Chlamydomonas reinhardtii
NASA Astrophysics Data System (ADS)
Noor-Mohammadi, Samaneh
Microalgae have garnered increasing interest over the years for their ability to produce compounds ranging from biofuels to neutraceuticals. A main focus of researchers has been to use microalgae as a natural bioreactor for the production of valuable and complex compounds. Recombinant protein expression in the chloroplasts of green algae has recently become more routine; however, the heterologous expression of multiple proteins or complete biosynthetic pathways remains a significant challenge. To take full advantage of these organisms' natural abilities, sophisticated molecular tools are needed to be able to introduce and functionally express multiple gene biosynthetic pathways in its genome. To achieve the above objective, we have sought to establish a method to construct, integrate and express multigene operons in the chloroplast and nuclear genome of the model microalgae Chlamydomonas reinhardtii. Here we show that a modified DNA Assembler approach can be used to rapidly assemble multiple-gene biosynthetic pathways in yeast and then integrate these assembled pathways at a site-specific location in the chloroplast, or by random integration in the nuclear genome of C. reinhardtii. As a proof of concept, this method was used to successfully integrate and functionally express up to three reporter proteins (AphA6, AadA, and GFP) in the chloroplast of C. reinhardtii and up to three reporter proteins (Ble, AphVIII, and GFP) in its nuclear genome. An analysis of the relative gene expression of the engineered strains showed significant differences in the mRNA expression levels of the reporter genes and thus highlights the importance of proper promoter/untranslated-region selection when constructing a target pathway. In addition, this work focuses on expressing the cofactor regeneration enzyme phosphite dehydrogenase (PTDH) in the chloroplast and nuclear genomes of C. reinhardtii. The PTDH enzyme converts phosphite into phosphate and NAD(P)+ into NAD(P)H. The reduced nicotinamide cofactor NAD(P)H plays a pivotal role in many biochemical oxidation and reduction reactions, thus this enzyme would allow regeneration of NAD(P)H in a microalgae strain over-expressing a NAD(P)H-dependent oxidoreductase. A phosphite dehydrogenase gene was introduced into the chloroplast genome (codon optimized) and nuclear genome of C. reinhardtii by biolistic transformation and electroporation in separate events, respectively. Successful expression of the heterologous protein was confirmed by transcript analysis and protein analysis. In conclusion, this new method represents a useful genetic tool in the construction and integration of complex biochemical pathways into the chloroplast or nuclear genome of microalgae, and this should aid current efforts to engineer algae for recombinant protein expression, biofuels production and production of other desirable natural products.
Economic evaluation of genomic selection in small ruminants: a sheep meat breeding program.
Shumbusho, F; Raoul, J; Astruc, J M; Palhiere, I; Lemarié, S; Fugeray-Scarbel, A; Elsen, J M
2016-06-01
Recent genomic evaluation studies using real data and predicting genetic gain by modeling breeding programs have reported moderate expected benefits from the replacement of classic selection schemes by genomic selection (GS) in small ruminants. The objectives of this study were to compare the cost, monetary genetic gain and economic efficiency of classic selection and GS schemes in the meat sheep industry. Deterministic methods were used to model selection based on multi-trait indices from a sheep meat breeding program. Decisional variables related to male selection candidates and progeny testing were optimized to maximize the annual monetary genetic gain (AMGG), that is, a weighted sum of meat and maternal traits annual genetic gains. For GS, a reference population of 2000 individuals was assumed and genomic information was available for evaluation of male candidates only. In the classic selection scheme, males breeding values were estimated from own and offspring phenotypes. In GS, different scenarios were considered, differing by the information used to select males (genomic only, genomic+own performance, genomic+offspring phenotypes). The results showed that all GS scenarios were associated with higher total variable costs than classic selection (if the cost of genotyping was 123 euros/animal). In terms of AMGG and economic returns, GS scenarios were found to be superior to classic selection only if genomic information was combined with their own meat phenotypes (GS-Pheno) or with their progeny test information. The predicted economic efficiency, defined as returns (proportional to number of expressions of AMGG in the nucleus and commercial flocks) minus total variable costs, showed that the best GS scenario (GS-Pheno) was up to 15% more efficient than classic selection. For all selection scenarios, optimization increased the overall AMGG, returns and economic efficiency. As a conclusion, our study shows that some forms of GS strategies are more advantageous than classic selection, provided that GS is already initiated (i.e. the initial reference population is available). Optimizing decisional variables of the classic selection scheme could be of greater benefit than including genomic information in optimized designs.
Issa, Amalia M; Hutchinson, Janis F; Tufail, Waqas; Fletcher, Erica; Ajike, Roseline; Tenorio, Jose
2011-07-01
Several novel pharmacogenomic diagnostic tests are commercially available for breast and colorectal cancer, and are increasingly being used in clinical practice for improving treatment decisions. However, there is little evidence evaluating the value of these new genomic technologies from the perspective of patients. As part of an ongoing effort to understand the continuum of the process of adoption of genomic diagnostics, our aim in this study was to examine the value of genomic diagnostics to breast and colorectal cancer patients, and their willingness to adopt and use genomic diagnostics. We conducted six focus groups of breast and colorectal cancer patients from the oncology clinics at The Methodist Hospital, Houston, TX, USA. An adapted Q-sort instrument was also administered to focus group participants. The majority of breast and colorectal cancer patients are interested in using novel genomic diagnostics for deciding about treatment options. Most participants in our study expressed a willingness to pay out-of-pocket for genomic testing (z = 0.736). Reliability and validity of genomic testing were of significant concern (z = 1.32) for the majority of breast and colorectal cancer patients. Participants identified several facilitators and barriers within health systems that might either facilitate or impede the widespread adoption and use of genomic diagnostics in healthcare delivery. This study demonstrates breast and colorectal cancer patients' willingness to adopt and pay for novel genomic diagnostics, as well as identifies several salient factors associated with patient preferences for genomic diagnostics.
[Research progress in neuropsychopharmacology updated for the post-genomic era].
Nakanishi, Toru
2009-11-01
Neuropsychopharmacological research in the post genomic (genomic sequence) era has been developing rapidly through the use of novel techniques including DNA chips. We have applied these techniques to investigate the anti-tumor effect of NSAIDs, isolate novel genes specifically expressed in rheumatoid arthritis, and analyze gene expression profiles in mesenchymal stem cells. Recently, we have developed a novel system of quantitative PCR for detection of BDNF mRNA isoforms. By using this system, we identified the exon-specific mode of expression in acute and chronic pain. In addition, we have made gene expression profiles of KO mice of beta2 subunits in acetylcholine receptors.
Calla, Bernarda; Blahut-Beatty, Laureen; Koziol, Lisa; Zhang, Yunfang; Neece, David J; Carbajulca, Doris; Garcia, Alexandre; Simmonds, Daina H; Clough, Steven J
2014-08-01
Oxalate oxidases (OxO) catalyse the degradation of oxalic acid (OA). Highly resistant transgenic soybean carrying an OxO gene and its susceptible parent soybean line, AC Colibri, were tested for genome-wide gene expression in response to the necrotrophic, OA-producing pathogen Sclerotinia sclerotiorum using soybean cDNA microarrays. The genes with changed expression at statistically significant levels (overall F-test P-value cut-off of 0.0001) were classified into functional categories and pathways, and were analysed to evaluate the differences in transcriptome profiles. Although many genes and pathways were found to be similarly activated or repressed in both genotypes after inoculation with S. sclerotiorum, the OxO genotype displayed a measurably faster induction of basal defence responses, as observed by the differential changes in defence-related and secondary metabolite genes compared with its susceptible parent AC Colibri. In addition, the experiment presented provides data on several other transcripts that support the hypothesis that S. sclerotiorum at least partially elicits the hypersensitive response, induces lignin synthesis (cinnamoyl CoA reductase) and elicits as yet unstudied signalling pathways (G-protein-coupled receptor and related). Of the nine genes showing the most extreme opposite directions of expression between genotypes, eight were related to photosynthesis and/or oxidation, highlighting the importance of redox in the control of this pathogen. © 2014 BSPP AND JOHN WILEY & SONS LTD.
Shaw, Lindsay M; Turner, Adrian S; Laurie, David A
2012-07-01
Flowering time is a trait that has been extensively altered during wheat domestication, enabling it to be highly productive in diverse environments and providing a rich source of variation for studying adaptation mechanisms. Hexaploid wheat is ancestrally a long-day plant, but many environments require varieties with photoperiod insensitivity (PI) that can flower in short days. PI results from mutations in the Ppd-1 gene on the A, B or D genomes, with individual mutations conferring different degrees of earliness. The basis of this is poorly understood. Using a common genetic background, the effects of A, B and D genome PI mutations on genes of the circadian clock and photoperiod pathway were studied using genome-specific expression assays. Ppd-1 PI mutations did not affect the clock or immediate clock outputs, but affected TaCO1 and TaFT1, with a reduction in TaCO1 expression as TaFT1 expression increased. Therefore, although Ppd-1 is related to PRR genes of the Arabidopsis circadian clock, Ppd-1 affects flowering by an alternative route, most likely by upregulating TaFT1 with a feedback effect that reduces TaCO1 expression. Individual genes in the circadian clock and photoperiod pathway were predominantly expressed from one genome, and there was no genome specificity in Ppd-1 action. Lines combining PI mutations on two or three genomes had enhanced earliness with higher levels, but not earlier induction, of TaFT1, showing that there is a direct quantitative relationship between Ppd-1 mutations, TaFT1 expression and flowering. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.
Yu, Hua; Jiao, Bingke; Lu, Lu; Wang, Pengfei; Chen, Shuangcheng; Liang, Chengzhi; Liu, Wei
2018-01-01
Accurately reconstructing gene co-expression network is of great importance for uncovering the genetic architecture underlying complex and various phenotypes. The recent availability of high-throughput RNA-seq sequencing has made genome-wide detecting and quantifying of the novel, rare and low-abundance transcripts practical. However, its potential merits in reconstructing gene co-expression network have still not been well explored. Using massive-scale RNA-seq samples, we have designed an ensemble pipeline, called NetMiner, for building genome-scale and high-quality Gene Co-expression Network (GCN) by integrating three frequently used inference algorithms. We constructed a RNA-seq-based GCN in one species of monocot rice. The quality of network obtained by our method was verified and evaluated by the curated gene functional association data sets, which obviously outperformed each single method. In addition, the powerful capability of network for associating genes with functions and agronomic traits was shown by enrichment analysis and case studies. In particular, we demonstrated the potential value of our proposed method to predict the biological roles of unknown protein-coding genes, long non-coding RNA (lncRNA) genes and circular RNA (circRNA) genes. Our results provided a valuable and highly reliable data source to select key candidate genes for subsequent experimental validation. To facilitate identification of novel genes regulating important biological processes and phenotypes in other plants or animals, we have published the source code of NetMiner, making it freely available at https://github.com/czllab/NetMiner.
Molecular Pathways: Extracting Medical Knowledge from High Throughput Genomic Data
Goldstein, Theodore; Paull, Evan O.; Ellis, Matthew J.; Stuart, Joshua M.
2013-01-01
High-throughput genomic data that measures RNA expression, DNA copy number, mutation status and protein levels provide us with insights into the molecular pathway structure of cancer. Genomic lesions (amplifications, deletions, mutations) and epigenetic modifications disrupt biochemical cellular pathways. While the number of possible lesions is vast, different genomic alterations may result in concordant expression and pathway activities, producing common tumor subtypes that share similar phenotypic outcomes. How can these data be translated into medical knowledge that provides prognostic and predictive information? First generation mRNA expression signatures such as Genomic Health's Oncotype DX already provide prognostic information, but do not provide therapeutic guidance beyond the current standard of care – which is often inadequate in high-risk patients. Rather than building molecular signatures based on gene expression levels, evidence is growing that signatures based on higher-level quantities such as from genetic pathways may provide important prognostic and diagnostic cues. We provide examples of how activities for molecular entities can be predicted from pathway analysis and how the composite of all such activities, referred to here as the “activitome,” help connect genomic events to clinical factors in order to predict the drivers of poor outcome. PMID:23430023
Kogelman, Lisette J A; Zhernakova, Daria V; Westra, Harm-Jan; Cirera, Susanna; Fredholm, Merete; Franke, Lude; Kadarmideen, Haja N
2015-10-20
Obesity is a multi-factorial health problem in which genetic factors play an important role. Limited results have been obtained in single-gene studies using either genomic or transcriptomic data. RNA sequencing technology has shown its potential in gaining accurate knowledge about the transcriptome, and may reveal novel genes affecting complex diseases. Integration of genomic and transcriptomic variation (expression quantitative trait loci [eQTL] mapping) has identified causal variants that affect complex diseases. We integrated transcriptomic data from adipose tissue and genomic data from a porcine model to investigate the mechanisms involved in obesity using a systems genetics approach. Using a selective gene expression profiling approach, we selected 36 animals based on a previously created genomic Obesity Index for RNA sequencing of subcutaneous adipose tissue. Differential expression analysis was performed using the Obesity Index as a continuous variable in a linear model. eQTL mapping was then performed to integrate 60 K porcine SNP chip data with the RNA sequencing data. Results were restricted based on genome-wide significant single nucleotide polymorphisms, detected differentially expressed genes, and previously detected co-expressed gene modules. Further data integration was performed by detecting co-expression patterns among eQTLs and integration with protein data. Differential expression analysis of RNA sequencing data revealed 458 differentially expressed genes. The eQTL mapping resulted in 987 cis-eQTLs and 73 trans-eQTLs (false discovery rate < 0.05), of which the cis-eQTLs were associated with metabolic pathways. We reduced the eQTL search space by focusing on differentially expressed and co-expressed genes and disease-associated single nucleotide polymorphisms to detect obesity-related genes and pathways. Building a co-expression network using eQTLs resulted in the detection of a module strongly associated with lipid pathways. Furthermore, we detected several obesity candidate genes, for example, ENPP1, CTSL, and ABHD12B. To our knowledge, this is the first study to perform an integrated genomics and transcriptomics (eQTL) study using, and modeling, genomic and subcutaneous adipose tissue RNA sequencing data on obesity in a porcine model. We detected several pathways and potential causal genes for obesity. Further validation and investigation may reveal their exact function and association with obesity.
Schnitzler, Christine E; Pang, Kevin; Powers, Meghan L; Reitzel, Adam M; Ryan, Joseph F; Simmons, David; Tada, Takashi; Park, Morgan; Gupta, Jyoti; Brooks, Shelise Y; Blakesley, Robert W; Yokoyama, Shozo; Haddock, Steven Hd; Martindale, Mark Q; Baxevanis, Andreas D
2012-12-21
Calcium-activated photoproteins are luciferase variants found in photocyte cells of bioluminescent jellyfish (Phylum Cnidaria) and comb jellies (Phylum Ctenophora). The complete genomic sequence from the ctenophore Mnemiopsis leidyi, a representative of the earliest branch of animals that emit light, provided an opportunity to examine the genome of an organism that uses this class of luciferase for bioluminescence and to look for genes involved in light reception. To determine when photoprotein genes first arose, we examined the genomic sequence from other early-branching taxa. We combined our genomic survey with gene trees, developmental expression patterns, and functional protein assays of photoproteins and opsins to provide a comprehensive view of light production and light reception in Mnemiopsis. The Mnemiopsis genome has 10 full-length photoprotein genes situated within two genomic clusters with high sequence conservation that are maintained due to strong purifying selection and concerted evolution. Photoprotein-like genes were also identified in the genomes of the non-luminescent sponge Amphimedon queenslandica and the non-luminescent cnidarian Nematostella vectensis, and phylogenomic analysis demonstrated that photoprotein genes arose at the base of all animals. Photoprotein gene expression in Mnemiopsis embryos begins during gastrulation in migrating precursors to photocytes and persists throughout development in the canals where photocytes reside. We identified three putative opsin genes in the Mnemiopsis genome and show that they do not group with well-known bilaterian opsin subfamilies. Interestingly, photoprotein transcripts are co-expressed with two of the putative opsins in developing photocytes. Opsin expression is also seen in the apical sensory organ. We present evidence that one opsin functions as a photopigment in vitro, absorbing light at wavelengths that overlap with peak photoprotein light emission, raising the hypothesis that light production and light reception may be functionally connected in ctenophore photocytes. We also present genomic evidence of a complete ciliary phototransduction cascade in Mnemiopsis. This study elucidates the genomic organization, evolutionary history, and developmental expression of photoprotein and opsin genes in the ctenophore Mnemiopsis leidyi, introduces a novel dual role for ctenophore photocytes in both bioluminescence and phototransduction, and raises the possibility that light production and light reception are linked in this early-branching non-bilaterian animal.
2012-01-01
Background Calcium-activated photoproteins are luciferase variants found in photocyte cells of bioluminescent jellyfish (Phylum Cnidaria) and comb jellies (Phylum Ctenophora). The complete genomic sequence from the ctenophore Mnemiopsis leidyi, a representative of the earliest branch of animals that emit light, provided an opportunity to examine the genome of an organism that uses this class of luciferase for bioluminescence and to look for genes involved in light reception. To determine when photoprotein genes first arose, we examined the genomic sequence from other early-branching taxa. We combined our genomic survey with gene trees, developmental expression patterns, and functional protein assays of photoproteins and opsins to provide a comprehensive view of light production and light reception in Mnemiopsis. Results The Mnemiopsis genome has 10 full-length photoprotein genes situated within two genomic clusters with high sequence conservation that are maintained due to strong purifying selection and concerted evolution. Photoprotein-like genes were also identified in the genomes of the non-luminescent sponge Amphimedon queenslandica and the non-luminescent cnidarian Nematostella vectensis, and phylogenomic analysis demonstrated that photoprotein genes arose at the base of all animals. Photoprotein gene expression in Mnemiopsis embryos begins during gastrulation in migrating precursors to photocytes and persists throughout development in the canals where photocytes reside. We identified three putative opsin genes in the Mnemiopsis genome and show that they do not group with well-known bilaterian opsin subfamilies. Interestingly, photoprotein transcripts are co-expressed with two of the putative opsins in developing photocytes. Opsin expression is also seen in the apical sensory organ. We present evidence that one opsin functions as a photopigment in vitro, absorbing light at wavelengths that overlap with peak photoprotein light emission, raising the hypothesis that light production and light reception may be functionally connected in ctenophore photocytes. We also present genomic evidence of a complete ciliary phototransduction cascade in Mnemiopsis. Conclusions This study elucidates the genomic organization, evolutionary history, and developmental expression of photoprotein and opsin genes in the ctenophore Mnemiopsis leidyi, introduces a novel dual role for ctenophore photocytes in both bioluminescence and phototransduction, and raises the possibility that light production and light reception are linked in this early-branching non-bilaterian animal. PMID:23259493
Kasi, Devi; Catherine, Christy; Lee, Seung-Won; Lee, Kyung-Ho; Kim, Yu Jung; Ro Lee, Myeong; Ju, Jung Won; Kim, Dong-Myung
2017-05-01
The rapidly evolving cloning and sequencing technologies have enabled understanding of genomic structure of parasite genomes, opening up new ways of combatting parasite-related diseases. To make the most of the exponentially accumulating genomic data, however, it is crucial to analyze the proteins encoded by these genomic sequences. In this study, we adopted an engineered cell-free protein synthesis system for large-scale expression screening of an expression sequence tag (EST) library of Clonorchis sinensis to identify potential antigens that can be used for diagnosis and treatment of clonorchiasis. To allow high-throughput expression and identification of individual genes comprising the library, a cell-free synthesis reaction was designed such that both the template DNA and the expressed proteins were co-immobilized on the same microbeads, leading to microbead-based linkage of the genotype and phenotype. This reaction configuration allowed streamlined expression, recovery, and analysis of proteins. This approach enabled us to identify 21 antigenic proteins. © 2017 American Institute of Chemical Engineers Biotechnol. Prog., 33:832-837, 2017. © 2017 American Institute of Chemical Engineers.
Guo, D; Li, H L; Tang, X; Peng, S Q
2014-12-18
In plants, homeodomain proteins play a critical role in regulating various aspects of plant growth and development. KNOX proteins are members of the homeodomain protein family. The KNOX transcription factors have been reported from Arabidopsis, rice, and other higher plants. The recent publication of the draft genome sequence of cassava (Manihot esculenta Krantz) has allowed a genome-wide search for M. esculenta KNOX (MeKNOX) transcription factors and the comparison of these positively identified proteins with their homologs in model plants. In the present study, we identified 12 MeKNOX genes in the cassava genome and grouped them into two distinct subfamilies based on their domain composition and phylogenetic analysis. Furthermore, semi-quantitative reverse transcription polymerase chain reaction analysis was performed to elucidate the expression profiles of these genes in different tissues and during various stages of root development. The analysis of MeKNOX expression profiles of indicated that 12 MeKNOX genes display differential expressions either in their transcript abundance or expression patterns.
Laubinger, Sascha; Zeller, Georg; Henz, Stefan R; Sachsenberg, Timo; Widmer, Christian K; Naouar, Naïra; Vuylsteke, Marnik; Schölkopf, Bernhard; Rätsch, Gunnar; Weigel, Detlef
2008-01-01
Gene expression maps for model organisms, including Arabidopsis thaliana, have typically been created using gene-centric expression arrays. Here, we describe a comprehensive expression atlas, Arabidopsis thaliana Tiling Array Express (At-TAX), which is based on whole-genome tiling arrays. We demonstrate that tiling arrays are accurate tools for gene expression analysis and identified more than 1,000 unannotated transcribed regions. Visualizations of gene expression estimates, transcribed regions, and tiling probe measurements are accessible online at the At-TAX homepage. PMID:18613972
2011-01-01
Background Green plant leaves have always fascinated biologists as hosts for photosynthesis and providers of basic energy to many food webs. Today, comprehensive databases of gene expression data enable us to apply increasingly more advanced computational methods for reverse-engineering the regulatory network of leaves, and to begin to understand the gene interactions underlying complex emergent properties related to stress-response and development. These new systems biology methods are now also being applied to organisms such as Populus, a woody perennial tree, in order to understand the specific characteristics of these species. Results We present a systems biology model of the regulatory network of Populus leaves. The network is reverse-engineered from promoter information and expression profiles of leaf-specific genes measured over a large set of conditions related to stress and developmental. The network model incorporates interactions between regulators, such as synergistic and competitive relationships, by evaluating increasingly more complex regulatory mechanisms, and is therefore able to identify new regulators of leaf development not found by traditional genomics methods based on pair-wise expression similarity. The approach is shown to explain available gene function information and to provide robust prediction of expression levels in new data. We also use the predictive capability of the model to identify condition-specific regulation as well as conserved regulation between Populus and Arabidopsis. Conclusions We outline a computationally inferred model of the regulatory network of Populus leaves, and show how treating genes as interacting, rather than individual, entities identifies new regulators compared to traditional genomics analysis. Although systems biology models should be used with care considering the complexity of regulatory programs and the limitations of current genomics data, methods describing interactions can provide hypotheses about the underlying cause of emergent properties and are needed if we are to identify target genes other than those constituting the "low hanging fruit" of genomic analysis. PMID:21232107
2010-01-01
Background Expressed Sequence Tag (EST) has been a cost-effective tool in molecular biology and represents an abundant valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 expressed sequence tag (EST) sequences with high quality. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons and that have been deposited in NCBI (accession IDs: GW868575 - GW873047), among which 1,294 unique ESTs were with known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs) in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test the transferability on peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. A high level of marker polymorphism was observed in the plum species (65%) and low in the peach (46%), and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further study such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species. PMID:20626882
Feltus, F Alex
2014-06-01
Understanding the control of any trait optimally requires the detection of causal genes, gene interaction, and mechanism of action to discover and model the biochemical pathways underlying the expressed phenotype. Functional genomics techniques, including RNA expression profiling via microarray and high-throughput DNA sequencing, allow for the precise genome localization of biological information. Powerful genetic approaches, including quantitative trait locus (QTL) and genome-wide association study mapping, link phenotype with genome positions, yet genetics is less precise in localizing the relevant mechanistic information encoded in DNA. The coupling of salient functional genomic signals with genetically mapped positions is an appealing approach to discover meaningful gene-phenotype relationships. Techniques used to define this genetic-genomic convergence comprise the field of systems genetics. This short review will address an application of systems genetics where RNA profiles are associated with genetically mapped genome positions of individual genes (eQTL mapping) or as gene sets (co-expression network modules). Both approaches can be applied for knowledge independent selection of candidate genes (and possible control mechanisms) underlying complex traits where multiple, likely unlinked, genomic regions might control specific complex traits. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Graumann, Franziska; Churin, Yuri; Tschuschner, Annette; Reifenberg, Kurt; Glebe, Dieter; Roderfeld, Martin; Roeb, Elke
2015-01-01
The Hepatitis B virus genome persists in the nucleus of virus infected hepatocytes where it serves as template for viral mRNA synthesis. Epigenetic modifications, including methylation of the CpG islands contribute to the regulation of viral gene expression. The present study investigates the effects of spontaneous age dependent loss of hepatitis B surface protein- (HBs) expression due to HBV-genome specific methylation as well as its proximate positive effects in HBs transgenic mice. Liver and serum of HBs transgenic mice aged 5-33 weeks were analyzed by Western blot, immunohistochemistry, serum analysis, PCR, and qRT-PCR. From the third month of age hepatic loss of HBs was observed in 20% of transgenic mice. The size of HBs-free area and the relative number of animals with these effects increased with age and struck about 55% of animals aged 33 weeks. Loss of HBs-expression was strongly correlated with amelioration of serum parameters ALT and AST. In addition lower HBs-expression went on with decreased ER-stress. The loss of surface protein expression started on transcriptional level and appeared to be regulated epigenetically by DNA methylation. The amount of the HBs-expression correlated negatively with methylation of HBV DNA in the mouse genome. Our data suggest that methylation of specific CpG sites controls gene expression even in HBs-transgenic mice with truncated HBV genome. More important, the loss of HBs expression and intracellular aggregation ameliorated cell stress and liver integrity. Thus, targeted modulation of HBs expression may offer new therapeutic approaches. Furthermore, HBs-transgenic mice depict a non-infectious mouse model to study one possible mechanism of HBs gene silencing by hypermethylation.
USDA-ARS?s Scientific Manuscript database
Dual luciferase reporter systems are valuable tools for functional genomic studies, but have not previously been developed for use in tick cell culture. We evaluated expression of available luciferase constructs in tick cell cultures derived from Rhipicephalus (Boophilus) microplus, an important vec...
Jiang, Shu-Ye; Ma, Ali; Ramamoorthy, Rengasamy; Ramachandran, Srinivasan
2013-01-01
Expression profiling is one of the most important tools for dissecting biological functions of genes and the upregulation or downregulation of gene expression is sufficient for recreating phenotypic differences. Expression divergence of genes significantly contributes to phenotypic variations. However, little is known on the molecular basis of expression divergence and evolution among rice genotypes with contrasting phenotypes. In this study, we have implemented an integrative approach using bioinformatics and experimental analyses to provide insights into genomic variation, expression divergence, and evolution between salinity-sensitive rice variety Nipponbare and tolerant rice line Pokkali under normal and high salinity stress conditions. We have detected thousands of differentially expressed genes between these two genotypes and thousands of up- or downregulated genes under high salinity stress. Many genes were first detected with expression evidence using custom microarray analysis. Some gene families were preferentially regulated by high salinity stress and might play key roles in stress-responsive biological processes. Genomic variations in promoter regions resulted from single nucleotide polymorphisms, indels (1–10 bp of insertion/deletion), and structural variations significantly contributed to the expression divergence and regulation. Our data also showed that tandem and segmental duplication, CACTA and hAT elements played roles in the evolution of gene expression divergence and regulation between these two contrasting genotypes under normal or high salinity stress conditions. PMID:24121498
Kim, Ji-Yeon; Lee, Eunjin; Park, Kyunghee; Park, Woong-Yang; Jung, Hae Hyun; Ahn, Jin Seok; Im, Young-Hyuck; Park, Yeon Hee
2017-04-25
Breast cancer (BC) has been genetically profiled through large-scale genome analyses. However, the role and clinical implications of genetic alterations in metastatic BC (MBC) have not been evaluated. Therefore, we conducted whole-exome sequencing (WES) and RNA-Seq of 37 MBC samples and targeted deep sequencing of another 29 MBCs. We evaluated somatic mutations from WES and targeted sequencing and assessed gene expression and performed pathway analysis from RNA-Seq. In this analysis, PIK3CA was the most commonly mutated gene in estrogen receptor (ER)-positive BC, while in ER-negative BC, TP53 was the most commonly mutated gene (p = 0.018 and p < 0.001, respectively). TP53 stopgain/loss and frameshift mutation was related to low expression of TP53 in contrast nonsynonymous mutation was related to high expression. The impact of TP53 mutation on clinical outcome varied with regard to ER status. In ER-positive BCs, wild type TP53 had a better prognosis than mutated TP53 (median overall survival (OS) (wild type vs. mutated): 88.5 ± 54.4 vs. 32.6 ± 10.7 (months), p = 0.002). In contrast, mutated TP53 had a protective effect in ER-negative BCs (median OS: 0.10 vs. 32.6 ± 8.2, p = 0.026). However, PIK3CA mutation did not affect patient survival. In gene expression analysis, CALM1, a potential regulator of AKT, was highly expressed in PIK3CA-mutated BCs. In conclusion, mutation of TP53 was associated with expression status and affect clinical outcome according to ER status in MBC. Although mutation of PIK3CA was not related to survival in this study, mutation of PIK3CA altered the expression of other genes and pathways including CALM1 and may be a potential predictive marker of PI3K inhibitor effectiveness.
Tserga, Aggeliki; Binder, Alexandra M; Michels, Karin B
2017-12-01
Folic acid is an essential component of 1-carbon metabolism, which generates methyl groups for DNA methylation. Disruption of genomic imprinting leads to biallelic expression which may affect disease susceptibility possibly reflected in high levels of S -adenosyl-homocysteine (SAH) and low levels of S -adenosyl-methionine (SAM). We investigated the association between folic acid supplementation during pregnancy and loss of imprinting (LOI) of IGF2 and H19 genes in placentas and cord blood of 90 mother-child dyads in association with the methylenetetrahydrofolate reductase ( MTHFR ) genotype. Pyrosequencing was used to evaluate deviation from monoallelic expression among 47 placentas heterozygous for H19 and 37 placentas and cord blood tissues heterozygous for IGF2 and H19 methylation levels of 48 placentas. We detected relaxation of imprinting (ROI) and LOI of H19 in placentas not associated with differences in methylation levels of the H19ICR. Placentas retained monoallelic allele-specific gene expression of IGF2 , but 32.4% of cord blood samples displayed LOI of IGF2 and 10.8% showed ROI. High SAH levels were significantly associated with low H19 methylation. An interesting positive association between SAM/SAH ratio and high H19 methylation levels was detected among infants with low B 12 levels. Our data suggest profound differences in regulation of imprinting in placenta and cord blood; a lack of correlation of the methylome, transcriptome, and proteome; and a complex regulatory feedback network between free methyl groups and genomic imprinting at birth.-Tserga, A., Binder, A. M., Michels, K. B. Impact of folic acid intake during pregnancy on genomic imprinting of IGF2/H19 and 1-carbon metabolism. © FASEB.
Jung, Seung-Hyun; Shin, Seung-Hun; Yim, Seon-Hee; Choi, Hye-Sun; Lee, Sug-Hyung; Chung, Yeun-Jun
2009-07-31
Recently, microarray-based comparative genomic hybridization (array-CGH) has emerged as a very efficient technology with higher resolution for the genome-wide identification of copy number alterations (CNA). Although CNAs are thought to affect gene expression, there is no platform currently available for the integrated CNA-expression analysis. To achieve high-resolution copy number analysis integrated with expression profiles, we established human 30k oligoarray-based genome-wide copy number analysis system and explored the applicability of this system for integrated genome and transcriptome analysis using MDA-MB-231 cell line. We compared the CNAs detected by the oligoarray with those detected by the 3k BAC array for validation. The oligoarray identified the single copy difference more accurately and sensitively than the BAC array. Seventeen CNAs detected by both platforms in MDA-MB-231 such as gains of 5p15.33-13.1, 8q11.22-8q21.13, 17p11.2, and losses of 1p32.3, 8p23.3-8p11.21, and 9p21 were consistently identified in previous studies on breast cancer. There were 122 other small CNAs (mean size 1.79 mb) that were detected by oligoarray only, not by BAC-array. We performed genomic qPCR targeting 7 CNA regions, detected by oligoarray only, and one non-CNA region to validate the oligoarray CNA detection. All qPCR results were consistent with the oligoarray-CGH results. When we explored the possibility of combined interpretation of both DNA copy number and RNA expression profiles, mean DNA copy number and RNA expression levels showed a significant correlation. In conclusion, this 30k oligoarray-CGH system can be a reasonable choice for analyzing whole genome CNAs and RNA expression profiles at a lower cost.
Zacapala-Gómez, Ana E; Navarro-Tito, Napoleón; Alarcón-Romero, Luz Del C; Ortuño-Pineda, Carlos; Illades-Aguiar, Berenice; Castañeda-Saucedo, Eduardo; Ortiz-Ortiz, Julio; Garibay-Cerdenares, Olga L; Jiménez-López, Marco A; Mendoza-Catalán, Miguel A
2018-03-27
Cervical cancer (CC) is the fourth cause of mortality by neoplasia in women worldwide. The use of immunomarkers is an alternative tool to complement currently used algorithms for detection of cancer, and to improve selection of therapeutic schemes. Aberrant expression of Ezrin and E-cadherin play an important role in tumor invasion. In this study we analyzed Ezrin and E-cadherin expression in liquid-based cervical cytology samples, and evaluated their potential use as prognostic immunomarkers. Immunocytochemical staining of Ezrin and E-cadherin was performed in cervical samples of 125 patients. The cytological or histological diagnostic was performed by Papanicolaou staining or H&E staining, respectively. HPV genotyping was determined using INNO-LIPA Genotyping Extra kit and the HPV physical status by in situ hybridization. Ezrin expression in HaCaT, HeLa and SiHa cell lines was determined by immunocytochemistry, immunofluorescence and Western blot. High Ezrin expression was observed in cervical cancer samples (70%), samples with multiple infection by HR-HPV (43%), and samples with integrated viral genome (47%). High Ezrin expression was associated with degree of SIL, viral genotype and physical status. In contrast, low E-cadherin expression was found in cervical cancer samples (95%), samples with multiple infection by HR-HPV/LR-HPV (87%) and integrated viral genome (72%). Low E-cadherin expression was associated with degree of SIL and viral genotype. Interestingly, Ezrin nuclear staining was associated with degree of SIL and viral genotype. High Ezrin expression, high percent of nuclear Ezrin and low E-cadherin expression behaved as risk factors for progression to HSIL and cervical cancer. Ezrin and E-cadherin expression profile in cervical cytology samples could be a potential prognostic marker, useful for identifying cervical lesions with a high-risk of progression to cervical cancer.
Evaluating cell lines as tumour models by comparison of genomic profiles
Domcke, Silvia; Sinha, Rileen; Levine, Douglas A.; Sander, Chris; Schultz, Nikolaus
2013-01-01
Cancer cell lines are frequently used as in vitro tumour models. Recent molecular profiles of hundreds of cell lines from The Cancer Cell Line Encyclopedia and thousands of tumour samples from the Cancer Genome Atlas now allow a systematic genomic comparison of cell lines and tumours. Here we analyse a panel of 47 ovarian cancer cell lines and identify those that have the highest genetic similarity to ovarian tumours. Our comparison of copy-number changes, mutations and mRNA expression profiles reveals pronounced differences in molecular profiles between commonly used ovarian cancer cell lines and high-grade serous ovarian cancer tumour samples. We identify several rarely used cell lines that more closely resemble cognate tumour profiles than commonly used cell lines, and we propose these lines as the most suitable models of ovarian cancer. Our results indicate that the gap between cell lines and tumours can be bridged by genomically informed choices of cell line models for all tumour types. PMID:23839242
Nunes, Luiz R; Rosato, Yoko B; Muto, Nair H; Yanai, Giane M; da Silva, Vivian S; Leite, Daniela B; Gonçalves, Edmilson R; de Souza, Alessandra A; Coletta-Filho, Helvécio D; Machado, Marcos A; Lopes, Silvio A; de Oliveira, Regina Costa
2003-04-01
Genetically distinct strains of the plant bacterium Xylella fastidiosa (Xf) are responsible for a variety of plant diseases, accounting for severe economic damage throughout the world. Using as a reference the genome of Xf 9a5c strain, associated with citrus variegated chlorosis (CVC), we developed a microarray-based comparison involving 12 Xf isolates, providing a thorough assessment of the variation in genomic composition across the group. Our results demonstrate that Xf displays one of the largest flexible gene pools characterized to date, with several horizontally acquired elements, such as prophages, plasmids, and genomic islands (GIs), which contribute up to 18% of the final genome. Transcriptome analysis of bacteria grown under different conditions shows that most of these elements are transcriptionally active, and their expression can be influenced in a coordinated manner by environmental stimuli. Finally, evaluation of the genetic composition of these laterally transferred elements identified differences that may help to explain the adaptability of Xf strains to infect such a wide range of plant species.
Comprehensive analysis of RNA-seq data reveals the complexity of the transcriptome in Brassica rapa.
Tong, Chaobo; Wang, Xiaowu; Yu, Jingyin; Wu, Jian; Li, Wanshun; Huang, Junyan; Dong, Caihua; Hua, Wei; Liu, Shengyi
2013-10-07
The species Brassica rapa (2n=20, AA) is an important vegetable and oilseed crop, and serves as an excellent model for genomic and evolutionary research in Brassica species. With the availability of whole genome sequence of B. rapa, it is essential to further determine the activity of all functional elements of the B. rapa genome and explore the transcriptome on a genome-wide scale. Here, RNA-seq data was employed to provide a genome-wide transcriptional landscape and characterization of the annotated and novel transcripts and alternative splicing events across tissues. RNA-seq reads were generated using the Illumina platform from six different tissues (root, stem, leaf, flower, silique and callus) of the B. rapa accession Chiifu-401-42, the same line used for whole genome sequencing. First, these data detected the widespread transcription of the B. rapa genome, leading to the identification of numerous novel transcripts and definition of 5'/3' UTRs of known genes. Second, 78.8% of the total annotated genes were detected as expressed and 45.8% were constitutively expressed across all tissues. We further defined several groups of genes: housekeeping genes, tissue-specific expressed genes and co-expressed genes across tissues, which will serve as a valuable repository for future crop functional genomics research. Third, alternative splicing (AS) is estimated to occur in more than 29.4% of intron-containing B. rapa genes, and 65% of them were commonly detected in more than two tissues. Interestingly, genes with high rate of AS were over-represented in GO categories relating to transcriptional regulation and signal transduction, suggesting potential importance of AS for playing regulatory role in these genes. Further, we observed that intron retention (IR) is predominant in the AS events and seems to preferentially occurred in genes with short introns. The high-resolution RNA-seq analysis provides a global transcriptional landscape as a complement to the B. rapa genome sequence, which will advance our understanding of the dynamics and complexity of the B. rapa transcriptome. The atlas of gene expression in different tissues will be useful for accelerating research on functional genomics and genome evolution in Brassica species.
Xu, Min; Wang, Yemin; Zhao, Zhilong; Gao, Guixi; Huang, Sheng-Xiong; Kang, Qianjin; He, Xinyi; Lin, Shuangjun; Pang, Xiuhua; Deng, Zixin
2016-01-01
ABSTRACT Genome sequencing projects in the last decade revealed numerous cryptic biosynthetic pathways for unknown secondary metabolites in microbes, revitalizing drug discovery from microbial metabolites by approaches called genome mining. In this work, we developed a heterologous expression and functional screening approach for genome mining from genomic bacterial artificial chromosome (BAC) libraries in Streptomyces spp. We demonstrate mining from a strain of Streptomyces rochei, which is known to produce streptothricins and borrelidin, by expressing its BAC library in the surrogate host Streptomyces lividans SBT5, and screening for antimicrobial activity. In addition to the successful capture of the streptothricin and borrelidin biosynthetic gene clusters, we discovered two novel linear lipopeptides and their corresponding biosynthetic gene cluster, as well as a novel cryptic gene cluster for an unknown antibiotic from S. rochei. This high-throughput functional genome mining approach can be easily applied to other streptomycetes, and it is very suitable for the large-scale screening of genomic BAC libraries for bioactive natural products and the corresponding biosynthetic pathways. IMPORTANCE Microbial genomes encode numerous cryptic biosynthetic gene clusters for unknown small metabolites with potential biological activities. Several genome mining approaches have been developed to activate and bring these cryptic metabolites to biological tests for future drug discovery. Previous sequence-guided procedures relied on bioinformatic analysis to predict potentially interesting biosynthetic gene clusters. In this study, we describe an efficient approach based on heterologous expression and functional screening of a whole-genome library for the mining of bioactive metabolites from Streptomyces. The usefulness of this function-driven approach was demonstrated by the capture of four large biosynthetic gene clusters for metabolites of various chemical types, including streptothricins, borrelidin, two novel lipopeptides, and one unknown antibiotic from Streptomyces rochei Sal35. The transfer, expression, and screening of the library were all performed in a high-throughput way, so that this approach is scalable and adaptable to industrial automation for next-generation antibiotic discovery. PMID:27451447
O'Brien, M.A.; Costin, B.N.; Miles, M.F.
2014-01-01
Postgenomic studies of the function of genes and their role in disease have now become an area of intense study since efforts to define the raw sequence material of the genome have largely been completed. The use of whole-genome approaches such as microarray expression profiling and, more recently, RNA-sequence analysis of transcript abundance has allowed an unprecedented look at the workings of the genome. However, the accurate derivation of such high-throughput data and their analysis in terms of biological function has been critical to truly leveraging the postgenomic revolution. This chapter will describe an approach that focuses on the use of gene networks to both organize and interpret genomic expression data. Such networks, derived from statistical analysis of large genomic datasets and the application of multiple bioinformatics data resources, poten-tially allow the identification of key control elements for networks associated with human disease, and thus may lead to derivation of novel therapeutic approaches. However, as discussed in this chapter, the leveraging of such networks cannot occur without a thorough understanding of the technical and statistical factors influencing the derivation of genomic expression data. Thus, while the catch phrase may be “it's the network … stupid,” the understanding of factors extending from RNA isolation to genomic profiling technique, multivariate statistics, and bioinformatics are all critical to defining fully useful gene networks for study of complex biology. PMID:23195313
Fu, Liezhen; Wen, Luan; Luu, Nga; Shi, Yun-Bo
2016-01-01
Genome editing with designer nucleases such as TALEN and CRISPR/Cas enzymes has broad applications. Delivery of these designer nucleases into organisms induces various genetic mutations including deletions, insertions and nucleotide substitutions. Characterizing those mutations is critical for evaluating the efficacy and specificity of targeted genome editing. While a number of methods have been developed to identify the mutations, none other than sequencing allows the identification of the most desired mutations, i.e., out-of-frame insertions/deletions that disrupt genes. Here we report a simple and efficient method to visualize and quantify the efficiency of genomic mutations induced by genome-editing. Our approach is based on the expression of a two-color fusion protein in a vector that allows the insertion of the edited region in the genome in between the two color moieties. We show that our approach not only easily identifies developing animals with desired mutations but also efficiently quantifies the mutation rate in vivo. Furthermore, by using LacZα and GFP as the color moieties, our approach can even eliminate the need for a fluorescent microscope, allowing the analysis with simple bright field visualization. Such an approach will greatly simplify the screen for effective genome-editing enzymes and identify the desired mutant cells/animals. PMID:27748423
Global Shifts in Genome and Proteome Composition Are Very Tightly Coupled
Brbić, Maria; Warnecke, Tobias; Kriško, Anita; Supek, Fran
2015-01-01
The amino acid composition (AAC) of proteomes differs greatly between microorganisms and is associated with the environmental niche they inhabit, suggesting that these changes may be adaptive. Similarly, the oligonucleotide composition of genomes varies and may confer advantages at the DNA/RNA level. These influences overlap in protein-coding sequences, making it difficult to gauge their relative contributions. We disentangle these effects by systematically evaluating the correspondence between intergenic nucleotide composition, where protein-level selection is absent, the AAC, and ecological parameters of 909 prokaryotes. We find that G + C content, the most frequently used measure of genomic composition, cannot capture diversity in AAC and across ecological contexts. However, di-/trinucleotide composition in intergenic DNA predicts amino acid frequencies of proteomes to the point where very little cross-species variability remains unexplained (91% of variance accounted for). Qualitatively similar results were obtained for 49 fungal genomes, where 80% of the variability in AAC could be explained by the composition of introns and intergenic regions. Upon factoring out oligonucleotide composition and phylogenetic inertia, the residual AAC is poorly predictive of the microbes’ ecological preferences, in stark contrast with the original AAC. Moreover, highly expressed genes do not exhibit more prominent environment-related AAC signatures than lowly expressed genes, despite contributing more to the effective proteome. Thus, evolutionary shifts in overall AAC appear to occur almost exclusively through factors shaping the global oligonucleotide content of the genome. We discuss these results in light of contravening evidence from biophysical data and further reading frame-specific analyses that suggest that adaptation takes place at the protein level. PMID:25971281
Sleeping Beauty transposon-based system for rapid generation of HBV-replicating stable cell lines.
Wu, Yong; Zhang, Tian-Ying; Fang, Lin-Lin; Chen, Zi-Xuan; Song, Liu-Wei; Cao, Jia-Li; Yang, Lin; Yuan, Quan; Xia, Ning-Shao
2016-08-01
The stable HBV-replicating cell lines, which carry replication-competent HBV genome stably integrated into the genome of host cell, are widely used to evaluate the effects of antiviral agents. However, current methods to generate HBV-replicating cell lines, which are mostly dependent on random integration of foreign DNA via plasmid transfection, are less-efficient and time-consuming. To address this issue, we constructed an all-in-one Sleeping Beauty transposon system (denoted pTSMP-HBV vector) for robust generation of stable cell lines carrying replication-competent HBV genome of different genotype. This vector contains a Sleeping Beauty transposon containing HBV 1.3-copy genome with an expression cassette of the SV40 promoter driving red fluorescent protein (mCherry) and self-cleaving P2A peptide linked puromycin resistance gene (PuroR). In addition, a PGK promoter-driven SB100X hyperactive transposase cassette is placed in the outside of the transposon in the same plasmid.The HBV-replicating stable cells could be obtained from pTSMP-HBV transfected HepG2 cells by red fluorescence-activated cell sorting and puromycin resistant cell selection within 4-week. Using this system, we successfully constructed four cell lines carrying replication-competent HBV genome of genotypes A-D. The replication and viral protein expression profiles of these cells were systematically characterized. In conclusion, our study provides a high-efficiency strategy to generate HBV-replicating stable cell lines, which may facilitate HBV-related virological study. Copyright © 2016. Published by Elsevier B.V.
USDA-ARS?s Scientific Manuscript database
Elder age and chronic alcohol consumption are important risk factors for the development of colon cancer. Each factor can alter genomic and gene-specific DNA methylation. This study examined the effects of aging and chronic alcohol consumption on genomic and p16-specific methylation, and p16 express...
USDA-ARS?s Scientific Manuscript database
Elder age and chronic alcohol consumption are important risk factors for the development of colon cancer. Each factor can alter genomic and gene-specific DNA methylation. This study examined the effects of aging and chronic alcohol consumption on genomic and p16-specific methylation, and p16 express...
Oduru, Sreedhar; Campbell, Janee L; Karri, SriTulasi; Hendry, William J; Khan, Shafiq A; Williams, Simon C
2003-01-01
Background Complete genome annotation will likely be achieved through a combination of computer-based analysis of available genome sequences combined with direct experimental characterization of expressed regions of individual genomes. We have utilized a comparative genomics approach involving the sequencing of randomly selected hamster testis cDNAs to begin to identify genes not previously annotated on the human, mouse, rat and Fugu (pufferfish) genomes. Results 735 distinct sequences were analyzed for their relatedness to known sequences in public databases. Eight of these sequences were derived from previously unidentified genes and expression of these genes in testis was confirmed by Northern blotting. The genomic locations of each sequence were mapped in human, mouse, rat and pufferfish, where applicable, and the structure of their cognate genes was derived using computer-based predictions, genomic comparisons and analysis of uncharacterized cDNA sequences from human and macaque. Conclusion The use of a comparative genomics approach resulted in the identification of eight cDNAs that correspond to previously uncharacterized genes in the human genome. The proteins encoded by these genes included a new member of the kinesin superfamily, a SET/MYND-domain protein, and six proteins for which no specific function could be predicted. Each gene was expressed primarily in testis, suggesting that they may play roles in the development and/or function of testicular cells. PMID:12783626
Genomic expression patterns in medication overuse headaches
Hershey, Andrew D; Burdine, Danny; Kabbouche, Marielle A; Powers, Scott W
2016-01-01
Background Chronic daily headache (CDH) and chronic migraine (CM) are one of the most frequent problems encountered in neurology, are often difficult to treat, and frequently complicated by medication-overuse headache (MOH). Proper recognition of MOH may alter treatment outcome and prevent long term disability. Objective This study identifies the unique genomic expression pattern MOH that respond to cessation of the overused medication. Methods Baseline occurrence of MOH and typical pattern of response to medication cessation were measured from a large database. Whole blood samples from patients with CM with or without MOH were obtained and their genomic profile was assessed. Affymetrix human U133 plus2 arrays were used to examine the genomic expression patterns prior to treatment and 6–12 weeks later. Headache characterisation and response to treatment based on headache frequency and disability were compared. Results Of 1311 patients reporting daily or continuous headaches, 513 (39.1%) reported overusing analgesic medication. At follow-up, 44.5% had a 50% or greater reduction in headache frequency, while 41.6% had no change. Blood genomic expression patterns were obtained on 33 patients with 19 (57.6%) overusing analgesic medication with a unique genomic expression pattern in MOH that responded to cessation of analgesics. Gene ontology of these samples indicated a significant number were involved with brain and immunological tissues, including multiple signalling pathways and apoptosis. Conclusions Blood genomic patterns can accurately identify MOH patients that respond to medication cessation. These results suggest that MOH involves a unique molecular biology pathway that can be identified with a specific biomarker. PMID:20974594
Viral delivery of genome-modifying proteins for cellular reprogramming.
Mikkelsen, Jacob Giehm
2018-06-18
Following the successful development of virus-based gene vehicles for genetic therapies, exploitation of viruses as carriers of genetic tools for cellular reprogramming and genome editing should be right up the street. However, whereas persistent, potentially life-long gene expression is the main goal of conventional genetic therapies, tools and bits for genome engineering should ideally be short-lived and active only for a limited time. Although viral vector systems have already been adapted for potent genome editing both in vitro and in vivo, regulatable gene expression systems or self-limiting expression circuits need to be implemented limiting exposure of chromatin to genome-modifying enzymes. As an alternative approach, emerging virus-based protein delivery technologies support direct protein delivery, providing a short, robust boost of enzymatic activity in transduced cells. Is this potentially the perfect way of shipping loads of cargo to many recipients and still maintain short-term activity? Copyright © 2018 Elsevier Ltd. All rights reserved.
Wada, Takuya; Oku, Koichiro; Nagano, Soichiro; Isobe, Sachiko; Suzuki, Hideyuki; Mori, Miyuki; Takata, Kinuko; Hirata, Chiharu; Shimomura, Katsumi; Tsubone, Masao; Katayama, Takao; Hirashima, Keita; Uchimura, Yosuke; Ikegami, Hidetoshi; Sueyoshi, Takayuki; Obu, Ko-ichi; Hayashida, Tatsuya; Shibato, Yasushi
2017-01-01
A strawberry Multi-parent Advanced Generation Intercrosses (MAGIC) population, derived from crosses using six strawberry cultivars was successfully developed. The population was composed of 338 individuals; genome conformation was evaluated by expressed sequence tag-derived simple short repeat (EST-SSR) markers. Cluster analysis and principal component analysis (PCA) based on EST-SSR marker polymorphisms revealed that the MAGIC population was a mosaic of the six founder cultivars and covered the genomic regions of the six founders evenly. Fruit quality related traits, including days to flowering (DTF), fruit weight (FW), fruit firmness (FF), fruit color (FC), soluble solid content (SC), and titratable acidity (TA), of the MAGIC population were evaluated over two years. All traits showed normal transgressive segregation beyond the founder cultivars and most traits, except for DTF, distributed normally. FC exhibited the highest correlation coefficient overall and was distributed normally regardless of differences in DTF, FW, FF, SC, and TA. These facts were supported by PCA using fruit quality related values as explanatory variables, suggesting that major genetic factors, which are not influenced by fluctuations in other fruit traits, could control the distribution of FC. This MAGIC population is a promising resource for genome-wide association studies and genomic selection for efficient strawberry breeding. PMID:29085247
Computer vision and machine learning for robust phenotyping in genome-wide studies
Zhang, Jiaoping; Naik, Hsiang Sing; Assefa, Teshale; Sarkar, Soumik; Reddy, R. V. Chowda; Singh, Arti; Ganapathysubramanian, Baskar; Singh, Asheesh K.
2017-01-01
Traditional evaluation of crop biotic and abiotic stresses are time-consuming and labor-intensive limiting the ability to dissect the genetic basis of quantitative traits. A machine learning (ML)-enabled image-phenotyping pipeline for the genetic studies of abiotic stress iron deficiency chlorosis (IDC) of soybean is reported. IDC classification and severity for an association panel of 461 diverse plant-introduction accessions was evaluated using an end-to-end phenotyping workflow. The workflow consisted of a multi-stage procedure including: (1) optimized protocols for consistent image capture across plant canopies, (2) canopy identification and registration from cluttered backgrounds, (3) extraction of domain expert informed features from the processed images to accurately represent IDC expression, and (4) supervised ML-based classifiers that linked the automatically extracted features with expert-rating equivalent IDC scores. ML-generated phenotypic data were subsequently utilized for the genome-wide association study and genomic prediction. The results illustrate the reliability and advantage of ML-enabled image-phenotyping pipeline by identifying previously reported locus and a novel locus harboring a gene homolog involved in iron acquisition. This study demonstrates a promising path for integrating the phenotyping pipeline into genomic prediction, and provides a systematic framework enabling robust and quicker phenotyping through ground-based systems. PMID:28272456
Tirosh, Y; Morpurgo, N; Cohen, M; Linial, M; Bloch, G
2012-06-01
We identified a predicted compact cysteine-rich sequence in the honey bee genome that we called 'Raalin'. Raalin transcripts are enriched in the brain of adult honey bee workers and drones, with only minimum expression in other tissues or in pre-adult stages. Open-reading frame (ORF) homologues of Raalin were identified in the transcriptomes of fruit flies, mosquitoes and moths. The Raalin-like gene from Drosophila melanogaster encodes for a short secreted protein that is maximally expressed in the adult brain with negligible expression in other tissues or pre-imaginal stages. Raalin-like sequences have also been found in the recently sequenced genomes of six ant species, but not in the jewel wasp Nasonia vitripennis. As in the honey bee, the Raalin-like sequences of ants do not have an ORF. A comparison of the genome region containing Raalin in the genomes of bees, ants and the wasp provides evolutionary support for an extensive genome rearrangement in this sequence. Our analyses identify a new family of ancient cysteine-rich short sequences in insects in which insertions and genome rearrangements may have disrupted this locus in the branch leading to the Hymenoptera. The regulated expression of this transcript suggests that it has a brain-specific function. © 2012 The Authors. Insect Molecular Biology © 2012 The Royal Entomological Society.
Gao, Ri; Wang, Haibin; Dong, Bin; Yang, Xiaodong; Chen, Sumei; Jiang, Jiafu; Zhang, Zhaohe; Liu, Chen; Zhao, Nan; Chen, Fadi
2016-10-09
Autopolyploidy is widespread in higher plants and plays an important role in the process of evolution. The present study successfully induced autotetraploidys from Chrysanthemum lavandulifolium by colchicine. The plant morphology, genomic, transcriptomic, and epigenetic changes between tetraploid and diploid plants were investigated. Ligulate flower, tubular flower and leaves of tetraploid plants were greater than those of the diploid plants. Compared with diploid plants, the genome changed as a consequence of polyploidization in tetraploid plants, namely, 1.1% lost fragments and 1.6% novel fragments occurred. In addition, DNA methylation increased after genome doubling in tetraploid plants. Among 485 common transcript-derived fragments (TDFs), which existed in tetraploid and diploid progenitors, 62 fragments were detected as differentially expressed TDFs, 6.8% of TDFs exhibited up-regulated gene expression in the tetraploid plants and 6.0% exhibited down-regulation. The present study provides a reference for further studying the autopolyploidization role in the evolution of C. lavandulifolium. In conclusion, the autopolyploid C. lavandulifolium showed a global change in morphology, genome and gene expression compared with corresponding diploid.
Tajaddod, Mansoureh; Tanzer, Andrea; Licht, Konstantin; Wolfinger, Michael T; Badelt, Stefan; Huber, Florian; Pusch, Oliver; Schopoff, Sandy; Janisiw, Michael; Hofacker, Ivo; Jantsch, Michael F
2016-10-25
Short interspersed elements (SINEs) represent the most abundant group of non-long-terminal repeat transposable elements in mammalian genomes. In primates, Alu elements are the most prominent and homogenous representatives of SINEs. Due to their frequent insertion within or close to coding regions, SINEs have been suggested to play a crucial role during genome evolution. Moreover, Alu elements within mRNAs have also been reported to control gene expression at different levels. Here, we undertake a genome-wide analysis of insertion patterns of human Alus within transcribed portions of the genome. Multiple, nearby insertions of SINEs within one transcript are more abundant in tandem orientation than in inverted orientation. Indeed, analysis of transcriptome-wide expression levels of 15 ENCODE cell lines suggests a cis-repressive effect of inverted Alu elements on gene expression. Using reporter assays, we show that the negative effect of inverted SINEs on gene expression is independent of known sensors of double-stranded RNAs. Instead, transcriptional elongation seems impaired, leading to reduced mRNA levels. Our study suggests that there is a bias against multiple SINE insertions that can promote intramolecular base pairing within a transcript. Moreover, at a genome-wide level, mRNAs harboring inverted SINEs are less expressed than mRNAs harboring single or tandemly arranged SINEs. Finally, we demonstrate a novel mechanism by which inverted SINEs can impact on gene expression by interfering with RNA polymerase II.
Genomic profiling of human penile carcinoma predicts worse prognosis and survival.
Busso-Lopes, Ariane F; Marchi, Fábio A; Kuasne, Hellen; Scapulatempo-Neto, Cristovam; Trindade-Filho, José Carlos S; de Jesus, Carlos Márcio N; Lopes, Ademar; Guimarães, Gustavo C; Rogatto, Silvia R
2015-02-01
The molecular mechanisms underlying penile carcinoma are still poorly understood, and the detection of genetic markers would be of great benefit for these patients. In this study, we assessed the genomic profile aiming at identifying potential prognostic biomarkers in penile carcinoma. Globally, 46 penile carcinoma samples were considered to evaluate DNA copy-number alterations via array comparative genomic hybridization (aCGH) combined with human papillomavirus (HPV) genotyping. Specific genes were investigated by using qPCR, FISH, and RT-qPCR. Genomic alterations mapped at 3p and 8p were related to worse prognostic features, including advanced T and clinical stage, recurrence and death from the disease. Losses of 3p21.1-p14.3 and gains of 3q25.31-q29 were associated with reduced cancer-specific and disease-free survival. Genomic alterations detected for chromosome 3 (LAMP3, PPARG, TNFSF10 genes) and 8 (DLC1) were evaluated by qPCR. DLC1 and PPARG losses were associated with poor prognosis characteristics. Losses of DLC1 were an independent risk factor for recurrence on multivariate analysis. The gene-expression analysis showed downexpression of DLC1 and PPARG and overexpression of LAMP3 and TNFSF10 genes. Chromosome Y losses and MYC gene (8q24) gains were confirmed by FISH. HPV infection was detected in 34.8% of the samples, and 19 differential genomic regions were obtained related to viral status. At first time, we described recurrent copy-number alterations and its potential prognostic value in penile carcinomas. We also showed a specific genomic profile according to HPV infection, supporting the hypothesis that penile tumors present distinct etiologies according to virus status. ©2014 American Association for Cancer Research.
Giuliani, Alessandro; Tomita, Masaru
2010-01-01
Cell fate decision remarkably generates specific cell differentiation path among the multiple possibilities that can arise through the complex interplay of high-dimensional genome activities. The coordinated action of thousands of genes to switch cell fate decision has indicated the existence of stable attractors guiding the process. However, origins of the intracellular mechanisms that create “cellular attractor” still remain unknown. Here, we examined the collective behavior of genome-wide expressions for neutrophil differentiation through two different stimuli, dimethyl sulfoxide (DMSO) and all-trans-retinoic acid (atRA). To overcome the difficulties of dealing with single gene expression noises, we grouped genes into ensembles and analyzed their expression dynamics in correlation space defined by Pearson correlation and mutual information. The standard deviation of correlation distributions of gene ensembles reduces when the ensemble size is increased following the inverse square root law, for both ensembles chosen randomly from whole genome and ranked according to expression variances across time. Choosing the ensemble size of 200 genes, we show the two probability distributions of correlations of randomly selected genes for atRA and DMSO responses overlapped after 48 hours, defining the neutrophil attractor. Next, tracking the ranked ensembles' trajectories, we noticed that only certain, not all, fall into the attractor in a fractal-like manner. The removal of these genome elements from the whole genomes, for both atRA and DMSO responses, destroys the attractor providing evidence for the existence of specific genome elements (named “genome vehicle”) responsible for the neutrophil attractor. Notably, within the genome vehicles, genes with low or moderate expression changes, which are often considered noisy and insignificant, are essential components for the creation of the neutrophil attractor. Further investigations along with our findings might provide a comprehensive mechanistic view of cell fate decision. PMID:20725638
Association of genetic variants and expression levels of porcine FABP4 and FABP5 genes.
Ballester, M; Puig-Oliveras, A; Castelló, A; Revilla, M; Fernández, A I; Folch, J M
2017-12-01
The FABP4 and FABP5 genes, coding for fatty acid transport proteins, have long been studied as positional candidate genes for SSC4 QTL affecting fat deposition and composition traits in pigs. Polymorphisms in these genes, FABP4:g.2634_2635insC and FABP5:g.3000T>G, have previously been associated with fatness traits in an Iberian by Landrace cross (IBMAP). The aim of the present work was to evaluate the functional implication of these genetic variants. For this purpose, FABP4 and FABP5 mRNA expression levels in 114 BC1_LD animals (25% Iberian × 75% Landrace) were analyzed using real-time quantitative PCR in backfat and muscle. FABP4 gene expression in backfat, but not in muscle, was associated with FABP4:g.2634_2635insC. In contrast, FABP5:g.3000T>G was not associated with gene expression levels. An expression-based genome-wide association study highlighted the FABP4:g.2634_2635insC polymorphism as the polymorphism most associated with FABP4 gene expression in backfat. Furthermore, other genomic regions associated in trans with the mRNA expression of FABP4 in backfat and FABP5 in muscle were also identified. Finally, two putative transcription binding sites for PPARG and NR4A2 may be affected by the FABP4:g.2634_2635insC polymorphism, modifying FABP4 gene expression. Our results reinforce FABP4 as a candidate gene for fatness traits on SSC4. © 2017 Stichting International Foundation for Animal Genetics.
Alonso, Conchita; Pérez, Ricardo; Bazaga, Pilar; Herrera, Carlos M.
2015-01-01
DNA cytosine methylation is a widespread epigenetic mechanism in eukaryotes, and plant genomes commonly are densely methylated. Genomic methylation can be associated with functional consequences such as mutational events, genomic instability or altered gene expression, but little is known on interspecific variation in global cytosine methylation in plants. In this paper, we compare global cytosine methylation estimates obtained by HPLC and use a phylogenetically-informed analytical approach to test for significance of evolutionary signatures of this trait across 54 angiosperm species in 25 families. We evaluate whether interspecific variation in global cytosine methylation is statistically related to phylogenetic distance and also whether it is evolutionarily correlated with genome size (C-value). Global cytosine methylation varied widely between species, ranging between 5.3% (Arabidopsis) and 39.2% (Narcissus). Differences between species were related to their evolutionary trajectories, as denoted by the strong phylogenetic signal underlying interspecific variation. Global cytosine methylation and genome size were evolutionarily correlated, as revealed by the significant relationship between the corresponding phylogenetically independent contrasts. On average, a ten-fold increase in genome size entailed an increase of about 10% in global cytosine methylation. Results show that global cytosine methylation is an evolving trait in angiosperms whose evolutionary trajectory is significantly linked to changes in genome size, and suggest that the evolutionary implications of epigenetic mechanisms are likely to vary between plant lineages. PMID:25688257
Mining the archives: a cross-platform analysis of gene ...
Formalin-fixed paraffin-embedded (FFPE) tissue samples represent a potentially invaluable resource for genomic research into the molecular basis of disease. However, use of FFPE samples in gene expression studies has been limited by technical challenges resulting from degradation of nucleic acids. Here we evaluated gene expression profiles derived from fresh-frozen (FRO) and FFPE mouse liver tissues using two DNA microarray protocols and two whole transcriptome sequencing (RNA-seq) library preparation methodologies. The ribo-depletion protocol outperformed the other three methods by having the highest correlations of differentially expressed genes (DEGs) and best overlap of pathways between FRO and FFPE groups. We next tested the effect of sample time in formalin (18 hours or 3 weeks) on gene expression profiles. Hierarchical clustering of the datasets indicated that test article treatment, and not preservation method, was the main driver of gene expression profiles. Meta- and pathway analyses indicated that biological responses were generally consistent for 18-hour and 3-week FFPE samples compared to FRO samples. However, clear erosion of signal intensity with time in formalin was evident, and DEG numbers differed by platform and preservation method. Lastly, we investigated the effect of age in FFPE block on genomic profiles. RNA-seq analysis of 8-, 19-, and 26-year-old control blocks using the ribo-depletion protocol resulted in comparable quality metrics, inc
Li, Ming D; Wang, Ju; Niu, Tianhua; Ma, Jennie Z; Seneviratne, Chamindi; Ait-Daoud, Nassima; Saadvandi, Jim; Morris, Rana; Weiss, David; Campbell, Jan; Haning, William; Mawhinney, David J; Weis, Denis; McCann, Michael; Stock, Christopher; Kahn, Roberta; Iturriaga, Erin; Yu, Elmer; Elkashef, Ahmed; Johnson, Bankole A
2014-12-12
Developing efficacious medications to treat methamphetamine dependence is a global challenge in public health. Topiramate (TPM) is undergoing evaluation for this indication. The molecular mechanisms underlying its effects are largely unknown. Examining the effects of TPM on genome-wide gene expression in methamphetamine addicts is a clinically and scientifically important component of understanding its therapeutic profile. In this double-blind, placebo-controlled clinical trial, 140 individuals who met the DSM-IV criteria for methamphetamine dependence were randomized to receive either TPM or placebo, of whom 99 consented to participate in our genome-wide expression study. The RNA samples were collected from whole blood for 50 TPM- and 49 placebo-treated participants at three time points: baseline and the ends of weeks 8 and 12. Genome-wide expression profiles and pathways of the two groups were compared for the responders and non-responders at Weeks 8 and 12. To minimize individual variations, expression of all examined genes at Weeks 8 and 12 were normalized to the values at baseline prior to identification of differentially expressed genes and pathways. At the single-gene level, we identified 1054, 502, 204, and 404 genes at nominal P values < 0.01 in the responders vs. non-responders at Weeks 8 and 12 for the TPM and placebo groups, respectively. Among them, expression of 159, 38, 2, and 21 genes was still significantly different after Bonferroni corrections for multiple testing. Many of these genes, such as GRINA, PRKACA, PRKCI, SNAP23, and TRAK2, which are involved in glutamate receptor and GABA receptor signaling, are direct targets for TPM. In contrast, no TPM drug targets were identified in the 38 significant genes for the Week 8 placebo group. Pathway analyses based on nominally significant genes revealed 27 enriched pathways shared by the Weeks 8 and 12 TPM groups. These pathways are involved in relevant physiological functions such as neuronal function/synaptic plasticity, signal transduction, cardiovascular function, and inflammation/immune function. Topiramate treatment of methamphetamine addicts significantly modulates the expression of genes involved in multiple biological processes underlying addiction behavior and other physiological functions.
Oakley, Todd H; Gu, Zhenglong; Abouheif, Ehab; Patel, Nipam H; Li, Wen-Hsiung
2005-01-01
Understanding the evolution of gene function is a primary challenge of modern evolutionary biology. Despite an expanding database from genomic and developmental studies, we are lacking quantitative methods for analyzing the evolution of some important measures of gene function, such as gene-expression patterns. Here, we introduce phylogenetic comparative methods to compare different models of gene-expression evolution in a maximum-likelihood framework. We find that expression of duplicated genes has evolved according to a nonphylogenetic model, where closely related genes are no more likely than more distantly related genes to share common expression patterns. These results are consistent with previous studies that found rapid evolution of gene expression during the history of yeast. The comparative methods presented here are general enough to test a wide range of evolutionary hypotheses using genomic-scale data from any organism.
Genome-Wide Discovery of Long Non-Coding RNAs in Rainbow Trout.
Al-Tobasei, Rafet; Paneru, Bam; Salem, Mohamed
2016-01-01
The ENCODE project revealed that ~70% of the human genome is transcribed. While only 1-2% of the RNAs encode for proteins, the rest are non-coding RNAs. Long non-coding RNAs (lncRNAs) form a diverse class of non-coding RNAs that are longer than 200 nt. Emerging evidence indicates that lncRNAs play critical roles in various cellular processes including regulation of gene expression. LncRNAs show low levels of gene expression and sequence conservation, which make their computational identification in genomes difficult. In this study, more than two billion Illumina sequence reads were mapped to the genome reference using the TopHat and Cufflinks software. Transcripts shorter than 200 nt, with more than 83-100 amino acids ORF, or with significant homologies to the NCBI nr-protein database were removed. In addition, a computational pipeline was used to filter the remaining transcripts based on a protein-coding-score test. Depending on the filtering stringency conditions, between 31,195 and 54,503 lncRNAs were identified, with only 421 matching known lncRNAs in other species. A digital gene expression atlas revealed 2,935 tissue-specific and 3,269 ubiquitously-expressed lncRNAs. This study annotates the lncRNA rainbow trout genome and provides a valuable resource for functional genomics research in salmonids.
Lee, Hong Jo; Lee, Hyung Chul; Kim, Young Min; Hwang, Young Sun; Park, Young Hyun; Park, Tae Sub; Han, Jae Yong
2016-02-01
Targeted genome recombination has been applied in diverse research fields and has a wide range of possible applications. In particular, the discovery of specific loci in the genome that support robust and ubiquitous expression of integrated genes and the development of genome-editing technology have facilitated rapid advances in various scientific areas. In this study, we produced transgenic (TG) chickens that can induce recombinase-mediated gene cassette exchange (RMCE), one of the site-specific recombination technologies, and confirmed RMCE in TG chicken-derived cells. As a result, we established TG chicken lines that have, Flipase (Flp) recognition target (FRT) pairs in the chicken genome, mediated by piggyBac transposition. The transgene integration patterns were diverse in each TG chicken line, and the integration diversity resulted in diverse levels of expression of exogenous genes in each tissue of the TG chickens. In addition, the replaced gene cassette was expressed successfully and maintained by RMCE in the FRT predominant loci of TG chicken-derived cells. These results indicate that targeted genome recombination technology with RMCE could be adaptable to TG chicken models and that the technology would be applicable to specific gene regulation by cis-element insertion and customized expression of functional proteins at predicted levels without epigenetic influence. © FASEB.
Lee, Moon Young; Park, Chanjae; Berent, Robyn M.; Park, Paul J.; Fuchs, Robert; Syn, Hannah; Chin, Albert; Townsend, Jared; Benson, Craig C.; Redelman, Doug; Shen, Tsai-wei; Park, Jong Kun; Miano, Joseph M.; Sanders, Kenton M.; Ro, Seungil
2015-01-01
Genome-scale expression data on the absolute numbers of gene isoforms offers essential clues in cellular functions and biological processes. Smooth muscle cells (SMCs) perform a unique contractile function through expression of specific genes controlled by serum response factor (SRF), a transcription factor that binds to DNA sites known as the CArG boxes. To identify SRF-regulated genes specifically expressed in SMCs, we isolated SMC populations from mouse small intestine and colon, obtained their transcriptomes, and constructed an interactive SMC genome and CArGome browser. To our knowledge, this is the first online resource that provides a comprehensive library of all genetic transcripts expressed in primary SMCs. The browser also serves as the first genome-wide map of SRF binding sites. The browser analysis revealed novel SMC-specific transcriptional variants and SRF target genes, which provided new and unique insights into the cellular and biological functions of the cells in gastrointestinal (GI) physiology. The SRF target genes in SMCs, which were discovered in silico, were confirmed by proteomic analysis of SMC-specific Srf knockout mice. Our genome browser offers a new perspective into the alternative expression of genes in the context of SRF binding sites in SMCs and provides a valuable reference for future functional studies. PMID:26241044
Chen, Lin; Dong, Chuanju; Kong, Shengnan; Zhang, Jiangfan; Li, Xuejun; Xu, Peng
2017-09-05
Bone morphogenetic proteins (Bmps) are a group of signaling molecules known to play important roles during formation and maintenance of various organs, not only bone, but also muscle, blood and so on. Common carp (Cyprinus carpio) is one of the most intensively studied fish due to its economic and environmental importance. Besides, common carp has encountered an additional round of whole genome duplication (WGD) compared with many closely related diploid teleost, which make it one of the most important models for genome evolutionary studies in teleost. Comprehensive genome resources of common carp have been developed recently, which facilitate the thorough characterization of bmp gene family in the tetraploidized common carp genome. We identified a total of 44 bmps from the common carp genome, which are twice as many as that of zebrafish. Phylogenetic analysis revealed that most of bmps are highly conserved. Comparative analysis was performed across six typical vertebrate genomes. It appeared that all the bmp genes in common carp were duplicated. Obviously, the expansion of the bmp gene family in common carp was due to the latest additional round of whole genome duplication and made it more abundant than other diploid teleosts. Expression signatures were assessed in major tissues, including gill, intestine, liver, spleen, skin, heart, gonad, muscle, kidney, head kidney, brain and blood, which demonstrated the comprehensive expression profiles of bmp genes in the tetraploidized genome. Significant gene expression divergences were observed which revealed substantial functional divergences of those duplicated bmp genes post the latest WGD event. The conserved synteny blocks of bmp5s revealed the genome rearrangement of common carp post the 4R WGD. The whole set of bmp gene family in common carp provides insight into gene fate of tetraploidized common carp genome post recent WGD. Copyright © 2017. Published by Elsevier B.V.
A deep auto-encoder model for gene expression prediction.
Xie, Rui; Wen, Jia; Quitadamo, Andrew; Cheng, Jianlin; Shi, Xinghua
2017-11-17
Gene expression is a key intermediate level that genotypes lead to a particular trait. Gene expression is affected by various factors including genotypes of genetic variants. With an aim of delineating the genetic impact on gene expression, we build a deep auto-encoder model to assess how good genetic variants will contribute to gene expression changes. This new deep learning model is a regression-based predictive model based on the MultiLayer Perceptron and Stacked Denoising Auto-encoder (MLP-SAE). The model is trained using a stacked denoising auto-encoder for feature selection and a multilayer perceptron framework for backpropagation. We further improve the model by introducing dropout to prevent overfitting and improve performance. To demonstrate the usage of this model, we apply MLP-SAE to a real genomic datasets with genotypes and gene expression profiles measured in yeast. Our results show that the MLP-SAE model with dropout outperforms other models including Lasso, Random Forests and the MLP-SAE model without dropout. Using the MLP-SAE model with dropout, we show that gene expression quantifications predicted by the model solely based on genotypes, align well with true gene expression patterns. We provide a deep auto-encoder model for predicting gene expression from SNP genotypes. This study demonstrates that deep learning is appropriate for tackling another genomic problem, i.e., building predictive models to understand genotypes' contribution to gene expression. With the emerging availability of richer genomic data, we anticipate that deep learning models play a bigger role in modeling and interpreting genomics.
Qiu, Xing; Hu, Rui; Wu, Zhixin
2014-01-01
Normalization procedures are widely used in high-throughput genomic data analyses to remove various technological noise and variations. They are known to have profound impact to the subsequent gene differential expression analysis. Although there has been some research in evaluating different normalization procedures, few attempts have been made to systematically evaluate the gene detection performances of normalization procedures from the bias-variance trade-off point of view, especially with strong gene differentiation effects and large sample size. In this paper, we conduct a thorough study to evaluate the effects of normalization procedures combined with several commonly used statistical tests and MTPs under different configurations of effect size and sample size. We conduct theoretical evaluation based on a random effect model, as well as simulation and biological data analyses to verify the results. Based on our findings, we provide some practical guidance for selecting a suitable normalization procedure under different scenarios. PMID:24941114
The opportunities and challenges of large-scale molecular approaches to songbird neurobiology
Mello, C.V.; Clayton, D.F.
2014-01-01
High-through put methods for analyzing genome structure and function are having a large impact in song-bird neurobiology. Methods include genome sequencing and annotation, comparative genomics, DNA microarrays and transcriptomics, and the development of a brain atlas of gene expression. Key emerging findings include the identification of complex transcriptional programs active during singing, the robust brain expression of non-coding RNAs, evidence of profound variations in gene expression across brain regions, and the identification of molecular specializations within song production and learning circuits. Current challenges include the statistical analysis of large datasets, effective genome curations, the efficient localization of gene expression changes to specific neuronal circuits and cells, and the dissection of behavioral and environmental factors that influence brain gene expression. The field requires efficient methods for comparisons with organisms like chicken, which offer important anatomical, functional and behavioral contrasts. As sequencing costs plummet, opportunities emerge for comparative approaches that may help reveal evolutionary transitions contributing to vocal learning, social behavior and other properties that make songbirds such compelling research subjects. PMID:25280907
Biased Gene Fractionation and Dominant Gene Expression among the Subgenomes of Brassica rapa
Cheng, Feng; Wu, Jian; Fang, Lu; Sun, Silong; Liu, Bo; Lin, Ke; Bonnema, Guusje; Wang, Xiaowu
2012-01-01
Polyploidization, both ancient and recent, is frequent among plants. A “two-step theory" was proposed to explain the meso-triplication of the Brassica “A" genome: Brassica rapa. By accurately partitioning of this genome, we observed that genes in the less fractioned subgenome (LF) were dominantly expressed over the genes in more fractioned subgenomes (MFs: MF1 and MF2), while the genes in MF1 were slightly dominantly expressed over the genes in MF2. The results indicated that the dominantly expressed genes tended to be resistant against gene fractionation. By re-sequencing two B. rapa accessions: a vegetable turnip (VT117) and a Rapid Cycling line (L144), we found that genes in LF had less non-synonymous or frameshift mutations than genes in MFs; however mutation rates were not significantly different between MF1 and MF2. The differences in gene expression patterns and on-going gene death among the three subgenomes suggest that “two-step" genome triplication and differential subgenome methylation played important roles in the genome evolution of B. rapa. PMID:22567157
Biased gene fractionation and dominant gene expression among the subgenomes of Brassica rapa.
Cheng, Feng; Wu, Jian; Fang, Lu; Sun, Silong; Liu, Bo; Lin, Ke; Bonnema, Guusje; Wang, Xiaowu
2012-01-01
Polyploidization, both ancient and recent, is frequent among plants. A "two-step theory" was proposed to explain the meso-triplication of the Brassica "A" genome: Brassica rapa. By accurately partitioning of this genome, we observed that genes in the less fractioned subgenome (LF) were dominantly expressed over the genes in more fractioned subgenomes (MFs: MF1 and MF2), while the genes in MF1 were slightly dominantly expressed over the genes in MF2. The results indicated that the dominantly expressed genes tended to be resistant against gene fractionation. By re-sequencing two B. rapa accessions: a vegetable turnip (VT117) and a Rapid Cycling line (L144), we found that genes in LF had less non-synonymous or frameshift mutations than genes in MFs; however mutation rates were not significantly different between MF1 and MF2. The differences in gene expression patterns and on-going gene death among the three subgenomes suggest that "two-step" genome triplication and differential subgenome methylation played important roles in the genome evolution of B. rapa.
Functional analysis and transcriptional output of the Göttingen minipig genome.
Heckel, Tobias; Schmucki, Roland; Berrera, Marco; Ringshandl, Stephan; Badi, Laura; Steiner, Guido; Ravon, Morgane; Küng, Erich; Kuhn, Bernd; Kratochwil, Nicole A; Schmitt, Georg; Kiialainen, Anna; Nowaczyk, Corinne; Daff, Hamina; Khan, Azinwi Phina; Lekolool, Isaac; Pelle, Roger; Okoth, Edward; Bishop, Richard; Daubenberger, Claudia; Ebeling, Martin; Certa, Ulrich
2015-11-14
In the past decade the Göttingen minipig has gained increasing recognition as animal model in pharmaceutical and safety research because it recapitulates many aspects of human physiology and metabolism. Genome-based comparison of drug targets together with quantitative tissue expression analysis allows rational prediction of pharmacology and cross-reactivity of human drugs in animal models thereby improving drug attrition which is an important challenge in the process of drug development. Here we present a new chromosome level based version of the Göttingen minipig genome together with a comparative transcriptional analysis of tissues with pharmaceutical relevance as basis for translational research. We relied on mapping and assembly of WGS (whole-genome-shotgun sequencing) derived reads to the reference genome of the Duroc pig and predict 19,228 human orthologous protein-coding genes. Genome-based prediction of the sequence of human drug targets enables the prediction of drug cross-reactivity based on conservation of binding sites. We further support the finding that the genome of Sus scrofa contains about ten-times less pseudogenized genes compared to other vertebrates. Among the functional human orthologs of these minipig pseudogenes we found HEPN1, a putative tumor suppressor gene. The genomes of Sus scrofa, the Tibetan boar, the African Bushpig, and the Warthog show sequence conservation of all inactivating HEPN1 mutations suggesting disruption before the evolutionary split of these pig species. We identify 133 Sus scrofa specific, conserved long non-coding RNAs (lncRNAs) in the minipig genome and show that these transcripts are highly conserved in the African pigs and the Tibetan boar suggesting functional significance. Using a new minipig specific microarray we show high conservation of gene expression signatures in 13 tissues with biomedical relevance between humans and adult minipigs. We underline this relationship for minipig and human liver where we could demonstrate similar expression levels for most phase I drug-metabolizing enzymes. Higher expression levels and metabolic activities were found for FMO1, AKR/CRs and for phase II drug metabolizing enzymes in minipig as compared to human. The variability of gene expression in equivalent human and minipig tissues is considerably higher in minipig organs, which is important for study design in case a human target belongs to this variable category in the minipig. The first analysis of gene expression in multiple tissues during development from young to adult shows that the majority of transcriptional programs are concluded four weeks after birth. This finding is in line with the advanced state of human postnatal organ development at comparative age categories and further supports the minipig as model for pediatric drug safety studies. Genome based assessment of sequence conservation combined with gene expression data in several tissues improves the translational value of the minipig for human drug development. The genome and gene expression data presented here are important resources for researchers using the minipig as model for biomedical research or commercial breeding. Potential impact of our data for comparative genomics, translational research, and experimental medicine are discussed.
Identification of true EST alignments for recognising transcribed regions.
Ma, Chuang; Wang, Jia; Li, Lun; Duan, Mo-Jie; Zhou, Yan-Hong
2011-01-01
Transcribed regions can be determined by aligning Expressed Sequence Tags (ESTs) with genome sequences. The kernel of this strategy is to effectively distinguish true EST alignments from spurious ones. In this study, three measures including Direction Check, Identity Check and Terminal Check were introduced to more effectively eliminate spurious EST alignments. On the basis of these introduced measures and other widely used measures, a computational tool, named ESTCleanser, has been developed to identify true EST alignments for obtaining reliable transcribed regions. The performance of ESTCleanser has been evaluated on the well-annotated human ENCyclopedia of DNA Elements (ENCODE) regions using human ESTs in the dbEST database. The evaluation results show that the accuracy of ESTCleanser at exon and intron levels is more remarkably enhanced than that of UCSC-spliced EST alignments. This work would be helpful to EST-based researches on finding new genes, complementing genome annotation, recognising alternative splicing events and Single Nucleotide Polymorphisms (SNPs), etc.
Molecular medicine and the development of cancer chemopreventive agents.
Izzotti, Alberto
2012-07-01
Chemoprevention is effective in inhibiting the onset of cancer in experimental animal models, but the transferability of similar results to humans is questionable. Therefore, reliable intermediate molecular biomarkers are needed to evaluate the efficacy of chemopreventive agents before the onset of cancer. The use of genomic biomarkers is limited by their poor predictive value. Although post-genomic biomarkers (i.e., gene-expression analyses) are useful for evaluating the safety, efficacy, and mechanistic basis of chemopreventive agents, the biomarkers are often poorly related to the phenotype, due to posttranscriptional regulation. Proteome analyses can evaluate preclinical phenotype alterations, but only at low protein counts. MicroRNA alterations, which are essential for the development of cancer, may be modulated by chemopreventive agents. Furthermore, microRNA delivery may be used to counteract carcinogenesis. Exposure to cigarette smoke induces microRNA let-7 downregulation and cell proliferation that can be converted to cell growth arrest and apoptosis upon let-7a transfection. Therefore, microRNAs are reliable biomarkers for evaluating chemoprevention efficacy and may be used to counteract carcinogenesis. © 2012 New York Academy of Sciences.
Decoherence in yeast cell populations and its implications for genome-wide expression noise.
Briones, M R S; Bosco, F
2009-01-20
Gene expression "noise" is commonly defined as the stochastic variation of gene expression levels in different cells of the same population under identical growth conditions. Here, we tested whether this "noise" is amplified with time, as a consequence of decoherence in global gene expression profiles (genome-wide microarrays) of synchronized cells. The stochastic component of transcription causes fluctuations that tend to be amplified as time progresses, leading to a decay of correlations of expression profiles, in perfect analogy with elementary relaxation processes. Measuring decoherence, defined here as a decay in the auto-correlation function of yeast genome-wide expression profiles, we found a slowdown in the decay of correlations, opposite to what would be expected if, as in mixing systems, correlations decay exponentially as the equilibrium state is reached. Our results indicate that the populational variation in gene expression (noise) is a consequence of temporal decoherence, in which the slow decay of correlations is a signature of strong interdependence of the transcription dynamics of different genes.
Studying a Complex Tumor—Potential and Pitfalls
Zheng, Siyuan; Chheda, Milan G.; Verhaak, Roel G.W.
2012-01-01
Glioblastoma multiforme (GBM) is a histopathologically heterogeneous disease with few treatment options. Therapy based on genomic alterations is rapidly gaining popularity because of the high response rate and high specificity. DNA copy number and exon sequencing studies of GBM samples have revealed recurrent genomic alterations in genes such as TP53, EGFR and IDH1 but to date this has not resulted in novel GBM therapies. Identification of expression subtypes have resulted in new insights such as the association between genomic abnormalities and expression signatures. This review describes the types of genomic studies that have been performed and that are underway, the most prominent results and the implications of genomic research for development of clinical treatment modalities. PMID:22290264
Heterologous Production of a Novel Cyclic Peptide Compound, KK-1, in Aspergillus oryzae.
Yoshimi, Akira; Yamaguchi, Sigenari; Fujioka, Tomonori; Kawai, Kiyoshi; Gomi, Katsuya; Machida, Masayuki; Abe, Keietsu
2018-01-01
A novel cyclic peptide compound, KK-1, was originally isolated from the plant-pathogenic fungus Curvularia clavata . It consists of 10 amino acid residues, including five N -methylated amino acid residues, and has potent antifungal activity. Recently, the genome-sequencing analysis of C. clavata was completed, and the biosynthetic genes involved in KK-1 production were predicted by using a novel gene cluster mining tool, MIDDAS-M. These genes form an approximately 75-kb cluster, which includes nine open reading frames, containing a non-ribosomal peptide synthetase (NRPS) gene. To determine whether the predicted genes were responsible for the biosynthesis of KK-1, we performed heterologous production of KK-1 in Aspergillus oryzae by introduction of the cluster genes into the genome of A. oryzae . The NRPS gene was split in two fragments and then reconstructed in the A. oryzae genome, because the gene was quite large (approximately 40 kb). The remaining seven genes in the cluster, excluding the regulatory gene kkR , were simultaneously introduced into the strain of A. oryzae in which NRPS had already been incorporated. To evaluate the heterologous production of KK-1 in A. oryzae , gene expression was analyzed by RT-PCR and KK-1 productivity was quantified by HPLC. KK-1 was produced in variable quantities by a number of transformed strains, along with expression of the cluster genes. The amount of KK-1 produced by the strain with the greatest expression of all genes was lower than that produced by the original producer, C. clavata . Therefore, expression of the cluster genes is necessary and sufficient for the heterologous production of KK-1 in A. oryzae , although there may be unknown factors limiting productivity in this species.
Heterologous Production of a Novel Cyclic Peptide Compound, KK-1, in Aspergillus oryzae
Yoshimi, Akira; Yamaguchi, Sigenari; Fujioka, Tomonori; Kawai, Kiyoshi; Gomi, Katsuya; Machida, Masayuki; Abe, Keietsu
2018-01-01
A novel cyclic peptide compound, KK-1, was originally isolated from the plant-pathogenic fungus Curvularia clavata. It consists of 10 amino acid residues, including five N-methylated amino acid residues, and has potent antifungal activity. Recently, the genome-sequencing analysis of C. clavata was completed, and the biosynthetic genes involved in KK-1 production were predicted by using a novel gene cluster mining tool, MIDDAS-M. These genes form an approximately 75-kb cluster, which includes nine open reading frames, containing a non-ribosomal peptide synthetase (NRPS) gene. To determine whether the predicted genes were responsible for the biosynthesis of KK-1, we performed heterologous production of KK-1 in Aspergillus oryzae by introduction of the cluster genes into the genome of A. oryzae. The NRPS gene was split in two fragments and then reconstructed in the A. oryzae genome, because the gene was quite large (approximately 40 kb). The remaining seven genes in the cluster, excluding the regulatory gene kkR, were simultaneously introduced into the strain of A. oryzae in which NRPS had already been incorporated. To evaluate the heterologous production of KK-1 in A. oryzae, gene expression was analyzed by RT-PCR and KK-1 productivity was quantified by HPLC. KK-1 was produced in variable quantities by a number of transformed strains, along with expression of the cluster genes. The amount of KK-1 produced by the strain with the greatest expression of all genes was lower than that produced by the original producer, C. clavata. Therefore, expression of the cluster genes is necessary and sufficient for the heterologous production of KK-1 in A. oryzae, although there may be unknown factors limiting productivity in this species. PMID:29686660
Ariani, Andrea; Gepts, Paul
2015-10-01
Plant aquaporins are a large and diverse family of water channel proteins that are essential for several physiological processes in living organisms. Numerous studies have linked plant aquaporins with a plethora of processes, such as nutrient acquisition, CO2 transport, plant growth and development, and response to abiotic stresses. However, little is known about this protein family in common bean. Here, we present a genome-wide identification of the aquaporin gene family in common bean (Phaseolus vulgaris L.), a legume crop essential for human nutrition. We identified 41 full-length coding aquaporin sequences in the common bean genome, divided by phylogenetic analysis into five sub-families (PIPs, TIPs, NIPs, SIPs and XIPs). Residues determining substrate specificity of aquaporins (i.e., NPA motifs and ar/R selectivity filter) seem conserved between common bean and other plant species, allowing inference of substrate specificity for these proteins. Thanks to the availability of RNA-sequencing datasets, expression levels in different organs and in leaves of wild and domesticated bean accessions were evaluated. Three aquaporins (PvTIP1;1, PvPIP2;4 and PvPIP1;2) have the overall highest mean expressions, with PvTIP1;1 having the highest expression among all aquaporins. We performed an EST database mining to identify drought-responsive aquaporins in common bean. This analysis showed a significant increase in expression for PvTIP1;1 in drought stress conditions compared to well-watered environments. The pivotal role suggested for PvTIP1;1 in regulating water homeostasis and drought stress response in the common bean should be verified by further field experimentation under drought stress.
BAD phosphorylation determines ovarian cancer chemo-sensitivity and patient survival
Marchion, Douglas C.; Cottrill, Hope M.; Xiong, Yin; Chen, Ning; Bicaku, Elona; Fulp, William J.; Bansal, Nisha; Chon, Hye Sook; Stickles, Xiaomang B.; Kamath, Siddharth G.; Hakam, Ardeshir; Li, Lihua; Su, Dan; Moreno, Carolina; Judson, Patricia L.; Berchuck, Andrew; Wenham, Robert M.; Apte, Sachin M.; Gonzalez-Bosquet, Jesus; Bloom, Gregory C.; Eschrich, Steven A.; Sebti, Said; Chen, Dung-Tsa; Lancaster, Johnathan M.
2011-01-01
Purpose Despite initial sensitivity to chemotherapy, ovarian cancers (OVCA) often develop drug-resistance, which limits patient survival. Using specimens and/or genomic data from 289 patients and a panel of cancer cell lines, we explored genome-wide expression changes that underlie the evolution of OVCA chemo-resistance and characterized the BCL2 antagonist of cell death (BAD) apoptosis pathway as a determinant of chemo-sensitivity and patient survival. Experimental Design Serial OVCA cell cisplatin treatments were performed in parallel with measurements of genome-wide expression changes. Pathway analysis was performed on genes associated with increasing cisplatin-resistance (EC50). BAD-pathway expression and BAD-protein phosphorylation were evaluated in patient samples and cell lines as determinants of chemo-sensitivity and/or clinical outcome and as therapeutic targets. Results Induced in vitro OVCA cisplatin-resistance was associated with BAD-pathway expression (P < 0.001). In OVCA cell lines and primary specimens, BAD-protein phosphorylation was associated with platinum-resistance (n = 147, P < 0.0001) and also with overall patient survival (n = 134, P = 0.0007). Targeted modulation of BAD-phosphorylation levels influenced cisplatin sensitivity. A 47-gene BAD-pathway score was associated with in vitro phosphorylated-BAD levels and with survival in 142 patients with advanced-stage (III/IV) serous OVCA. Integration of BAD-phosphorylation or BAD-pathway score with OVCA surgical cytoreductive status was significantly associated with overall survival by log-rank test (P = 0.004 and <0.0001, respectively). Conclusion The BAD apoptosis pathway influences OVCA chemo-sensitivity and overall survival, likely via modulation of BAD-phosphorylation. The pathway has clinical relevance as a biomarker of therapeutic response, patient survival, and as a promising therapeutic target. PMID:21849418
Nucleotide, cytogenetic and expression impact of the human chromosome 8p23.1 inversion polymorphism.
Bosch, Nina; Morell, Marta; Ponsa, Immaculada; Mercader, Josep Maria; Armengol, Lluís; Estivill, Xavier
2009-12-14
The human chromosome 8p23.1 region contains a 3.8-4.5 Mb segment which can be found in different orientations (defined as genomic inversion) among individuals. The identification of single nucleotide polymorphisms (SNPs) tightly linked to the genomic orientation of a given region should be useful to indirectly evaluate the genotypes of large genomic orientations in the individuals. We have identified 16 SNPs, which are in linkage disequilibrium (LD) with the 8p23.1 inversion as detected by fluorescent in situ hybridization (FISH). The variability of the 8p23.1 orientation in 150 HapMap samples was predicted using this set of SNPs and was verified by FISH in a subset of samples. Four genes (NEIL2, MSRA, CTSB and BLK) were found differentially expressed (p<0.0005) according to the orientation of the 8p23.1 region. Finally, we have found variable levels of mosaicism for the orientation of the 8p23.1 as determined by FISH. By means of dense SNP genotyping of the region, haplotype-based computational analyses and FISH experiments we could infer and verify the orientation status of alleles in the 8p23.1 region by detecting two short haplotype stretches at both ends of the inverted region, which are likely the relic of the chromosome in which the original inversion occurred. Moreover, an impact of 8p23.1 inversion on gene expression levels cannot be ruled out, since four genes from this region have statistically significant different expression levels depending on the inversion status. FISH results in lymphoblastoid cell lines suggest the presence of mosaicism regarding the 8p23.1 inversion.
The Cancer Genome Atlas Comprehensive Molecular Characterization of Renal Cell Carcinoma.
Ricketts, Christopher J; De Cubas, Aguirre A; Fan, Huihui; Smith, Christof C; Lang, Martin; Reznik, Ed; Bowlby, Reanne; Gibb, Ewan A; Akbani, Rehan; Beroukhim, Rameen; Bottaro, Donald P; Choueiri, Toni K; Gibbs, Richard A; Godwin, Andrew K; Haake, Scott; Hakimi, A Ari; Henske, Elizabeth P; Hsieh, James J; Ho, Thai H; Kanchi, Rupa S; Krishnan, Bhavani; Kwiatkowski, David J; Lui, Wembin; Merino, Maria J; Mills, Gordon B; Myers, Jerome; Nickerson, Michael L; Reuter, Victor E; Schmidt, Laura S; Shelley, C Simon; Shen, Hui; Shuch, Brian; Signoretti, Sabina; Srinivasan, Ramaprasad; Tamboli, Pheroze; Thomas, George; Vincent, Benjamin G; Vocke, Cathy D; Wheeler, David A; Yang, Lixing; Kim, William Y; Robertson, A Gordon; Spellman, Paul T; Rathmell, W Kimryn; Linehan, W Marston
2018-04-03
Renal cell carcinoma (RCC) is not a single disease, but several histologically defined cancers with different genetic drivers, clinical courses, and therapeutic responses. The current study evaluated 843 RCC from the three major histologic subtypes, including 488 clear cell RCC, 274 papillary RCC, and 81 chromophobe RCC. Comprehensive genomic and phenotypic analysis of the RCC subtypes reveals distinctive features of each subtype that provide the foundation for the development of subtype-specific therapeutic and management strategies for patients affected with these cancers. Somatic alteration of BAP1, PBRM1, and PTEN and altered metabolic pathways correlated with subtype-specific decreased survival, while CDKN2A alteration, increased DNA hypermethylation, and increases in the immune-related Th2 gene expression signature correlated with decreased survival within all major histologic subtypes. CIMP-RCC demonstrated an increased immune signature, and a uniform and distinct metabolic expression pattern identified a subset of metabolically divergent (MD) ChRCC that associated with extremely poor survival. Published by Elsevier Inc.
Opazo, Juan C.; Toloza-Villalobos, Jessica; Burmester, Thorsten; Venkatesh, Byrappa; Storz, Jay F.
2015-01-01
Comparative analyses of vertebrate genomes continue to uncover a surprising diversity of genes in the globin gene superfamily, some of which have very restricted phyletic distributions despite their antiquity. Genomic analysis of the globin gene repertoire of cartilaginous fish (Chondrichthyes) should be especially informative about the duplicative origins and ancestral functions of vertebrate globins, as divergence between Chondrichthyes and bony vertebrates represents the most basal split within the jawed vertebrates. Here, we report a comparative genomic analysis of the vertebrate globin gene family that includes the complete globin gene repertoire of the elephant shark (Callorhinchus milii). Using genomic sequence data from representatives of all major vertebrate classes, integrated analyses of conserved synteny and phylogenetic relationships revealed that the last common ancestor of vertebrates possessed a repertoire of at least seven globin genes: single copies of androglobin and neuroglobin, four paralogous copies of globin X, and the single-copy progenitor of the entire set of vertebrate-specific globins. Combined with expression data, the genomic inventory of elephant shark globins yielded four especially surprising findings: 1) there is no trace of the neuroglobin gene (a highly conserved gene that is present in all other jawed vertebrates that have been examined to date), 2) myoglobin is highly expressed in heart, but not in skeletal muscle (reflecting a possible ancestral condition in vertebrates with single-circuit circulatory systems), 3) elephant shark possesses two highly divergent globin X paralogs, one of which is preferentially expressed in gonads, and 4) elephant shark possesses two structurally distinct α-globin paralogs, one of which is preferentially expressed in the brain. Expression profiles of elephant shark globin genes reveal distinct specializations of function relative to orthologs in bony vertebrates and suggest hypotheses about ancestral functions of vertebrate globins. PMID:25743544
Genome-wide methylation analysis identifies genes silenced in non-seminoma cell lines
Noor, Dzul Azri Mohamed; Jeyapalan, Jennie N; Alhazmi, Safiah; Carr, Matthew; Squibb, Benjamin; Wallace, Claire; Tan, Christopher; Cusack, Martin; Hughes, Jaime; Reader, Tom; Shipley, Janet; Sheer, Denise; Scotting, Paul J
2016-01-01
Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome-wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours’ biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina infinium HumanMethylome450 bead chip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2-deoxycytidine and reverse transcription–quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes. PMID:29263807
Genome-wide methylation analysis identifies genes silenced in non-seminoma cell lines.
Noor, Dzul Azri Mohamed; Jeyapalan, Jennie N; Alhazmi, Safiah; Carr, Matthew; Squibb, Benjamin; Wallace, Claire; Tan, Christopher; Cusack, Martin; Hughes, Jaime; Reader, Tom; Shipley, Janet; Sheer, Denise; Scotting, Paul J
2016-01-01
Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome-wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours' biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina infinium HumanMethylome450 bead chip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2-deoxycytidine and reverse transcription-quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes.
Arthur, Victoria L; Shuldiner, Emily; Remmers, Elaine F; Hinks, Anne; Grom, Alexei A; Foell, Dirk; Martini, Alberto; Gattorno, Marco; Özen, Seza; Prahalad, Sampath; Zeft, Andrew S; Bohnsack, John F; Ilowite, Norman T; Mellins, Elizabeth D; Russo, Ricardo; Len, Claudio; Oliveira, Sheila; Yeung, Rae S M; Rosenberg, Alan M; Wedderburn, Lucy R; Anton, Jordi; Haas, Johannes-Peter; Rösen-Wolff, Angela; Minden, Kirsten; Szymanski, Ann Marie; Thomson, Wendy; Kastner, Daniel L; Woo, Patricia; Ombrello, Michael J
2018-04-02
To determine whether systemic juvenile idiopathic arthritis (sJIA) susceptibility loci identified by candidate gene studies demonstrated association with sJIA in the largest study population assembled to date. Single nucleotide polymorphisms (SNPs) from 11 previously reported sJIA risk loci were examined for association in 9 populations, including 770 sJIA cases and 6947 control subjects. The effect of sJIA-associated SNPs on gene expression was evaluated in silico in paired whole genome and RNA sequencing data from lymphoblastoid cell lines (LCL) of 373 European 1000 Genomes Project subjects. The relationship between sJIA-associated SNPs and response to anakinra treatment was evaluated in 38 US patients for whom treatment response data were available. We found no association of the 26 SNPs previously reported as sJIA-associated. Expanded analysis of the regions containing the 26 SNPs revealed only one significant association, the promoter region of IL1RN (p<1E-4). sJIA-associated SNPs correlated with IL1RN expression in LCLs, with an inverse correlation between sJIA risk and IL1RN expression. The presence of homozygous IL1RN high expression alleles correlated strongly with non-response to anakinra therapy (OR 28.7 [3.2, 255.8]). IL1RN was the only candidate locus associated with sJIA in our study. The implicated SNPs are among the strongest known determinants of IL1RN and IL1RA levels, linking low expression with increased sJIA risk. Homozygous high expression alleles predicted non-response to anakinra therapy, nominating them as candidate biomarkers to guide sJIA treatment. This is an important first step towards the personalized treatment of sJIA. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Singh, Deepak K.; Rath, Pramod C.
2012-01-01
We report strong somatic and germ line expression of LINE RNAs in eight different tissues of rat by using a novel ~2.8 kb genomic PstI-LINE DNA (P1-LINE) isolated from the rat brain. P1-LINE is present in a 93 kb LINE-SINE-cluster in sub-telomeric region of chromosome 12 (12p12) and as multiple truncated copies interspersed in all rat chromosomes. P1-LINEs occur as inverted repeats at multiple genomic loci in tissue-specific and mosaic patterns. P1-LINE RNAs are strongly expressed in brain, liver, lungs, heart, kidney, testes, spleen and thymus into large to small heterogeneous RNAs (~5.0 to 0.2 kb) in tissue-specific and dynamic patterns in individual rats. P1-LINE DNA is strongly methylated at CpG-dinucleotides in most genomic copies in all the tissues and weakly hypomethylated in few copies in some tissues. Small (700–75 nt) P1-LINE RNAs expressed in all tissues may be possible precursors for small regulatory RNAs (PIWI-interacting/piRNAs) bioinformatically derived from P1-LINE. The strong and dynamic expression of LINE RNAs from multiple chromosomal loci and the putative piRNAs in somatic tissues of rat under normal physiological conditions may define functional chromosomal domains marked by LINE RNAs as long noncoding RNAs (lncRNAs) unrestricted by DNA methylation. The tissue-specific, dynamic RNA expression and mosaic genomic distribution of LINEs representing a steady-state genomic flux of retrotransposon RNAs suggest for biological role of LINE RNAs as long ncRNAs and small piRNAs in mammalian tissues independent of their cellular fate for translation, reverse-transcription and retrotransposition. This may provide evolutionary advantages to LINEs and mammalian genomes. PMID:23064113
Marko, Nicholas F.; Weil, Robert J.
2012-01-01
Introduction Gene expression data is often assumed to be normally-distributed, but this assumption has not been tested rigorously. We investigate the distribution of expression data in human cancer genomes and study the implications of deviations from the normal distribution for translational molecular oncology research. Methods We conducted a central moments analysis of five cancer genomes and performed empiric distribution fitting to examine the true distribution of expression data both on the complete-experiment and on the individual-gene levels. We used a variety of parametric and nonparametric methods to test the effects of deviations from normality on gene calling, functional annotation, and prospective molecular classification using a sixth cancer genome. Results Central moments analyses reveal statistically-significant deviations from normality in all of the analyzed cancer genomes. We observe as much as 37% variability in gene calling, 39% variability in functional annotation, and 30% variability in prospective, molecular tumor subclassification associated with this effect. Conclusions Cancer gene expression profiles are not normally-distributed, either on the complete-experiment or on the individual-gene level. Instead, they exhibit complex, heavy-tailed distributions characterized by statistically-significant skewness and kurtosis. The non-Gaussian distribution of this data affects identification of differentially-expressed genes, functional annotation, and prospective molecular classification. These effects may be reduced in some circumstances, although not completely eliminated, by using nonparametric analytics. This analysis highlights two unreliable assumptions of translational cancer gene expression analysis: that “small” departures from normality in the expression data distributions are analytically-insignificant and that “robust” gene-calling algorithms can fully compensate for these effects. PMID:23118863
The Genomic Impact of DNA CpG Methylation on Gene Expression; Relationships in Prostate Cancer.
Long, Mark D; Smiraglia, Dominic J; Campbell, Moray J
2017-02-14
The process of DNA CpG methylation has been extensively investigated for over 50 years and revealed associations between changing methylation status of CpG islands and gene expression. As a result, DNA CpG methylation is implicated in the control of gene expression in developmental and homeostasis processes, as well as being a cancer-driver mechanism. The development of genome-wide technologies and sophisticated statistical analytical approaches has ushered in an era of widespread analyses, for example in the cancer arena, of the relationships between altered DNA CpG methylation, gene expression, and tumor status. The remarkable increase in the volume of such genomic data, for example, through investigators from the Cancer Genome Atlas (TCGA), has allowed dissection of the relationships between DNA CpG methylation density and distribution, gene expression, and tumor outcome. In this manner, it is now possible to test that the genome-wide correlations are measurable between changes in DNA CpG methylation and gene expression. Perhaps surprisingly is that these associations can only be detected for hundreds, but not thousands, of genes, and the direction of the correlations are both positive and negative. This, perhaps, suggests that CpG methylation events in cancer systems can act as disease drivers but the effects are possibly more restricted than suspected. Additionally, the positive and negative correlations suggest direct and indirect events and an incomplete understanding. Within the prostate cancer TCGA cohort, we examined the relationships between expression of genes that control DNA methylation, known targets of DNA methylation and tumor status. This revealed that genes that control the synthesis of S -adenosyl-l-methionine (SAM) associate with altered expression of DNA methylation targets in a subset of aggressive tumors.
Zhu, Bin; Shao, Yujiao; Pan, Qi; Ge, Xianhong; Li, Zaiyun
2015-01-01
Aneuploidy with loss of entire chromosomes from normal complement disrupts the balanced genome and is tolerable only by polyploidy plants. In this study, the monosomic and nullisomic plants losing one or two copies of C2 chromosome from allotetraploid Brassica napus L. (2n = 38, AACC) were produced and compared for their phenotype and transcriptome. The monosomics gave a plant phenotype very similar to the original donor, but the nullisomics had much smaller stature and also shorter growth period. By the comparative analyses on the global transcript profiles with the euploid donor, genome-wide alterations in gene expression were revealed in two aneuploids, and their majority of differentially expressed genes (DEGs) resulted from the trans-acting effects of the zero and one copy of C2 chromosome. The higher number of up-regulated genes than down-regulated genes on other chromosomes suggested that the genome responded to the C2 loss via enhancing the expression of certain genes. Particularly, more DEGs were detected in the monosomics than nullisomics, contrasting with their phenotypes. The gene expression of the other chromosomes was differently affected, and several dysregulated domains in which up- or downregulated genes obviously clustered were identifiable. But the mean gene expression (MGE) for homoeologous chromosome A2 reduced with the C2 loss. Some genes and their expressions on C2 were correlated with the phenotype deviations in the aneuploids. These results provided new insights into the transcriptomic perturbation of the allopolyploid genome elicited by the loss of individual chromosome. PMID:26442076
Gramene 2016: comparative plant genomics and pathway resources.
Tello-Ruiz, Marcela K; Stein, Joshua; Wei, Sharon; Preece, Justin; Olson, Andrew; Naithani, Sushma; Amarasinghe, Vindhya; Dharmawardhana, Palitha; Jiao, Yinping; Mulvaney, Joseph; Kumari, Sunita; Chougule, Kapeel; Elser, Justin; Wang, Bo; Thomason, James; Bolser, Daniel M; Kerhornou, Arnaud; Walts, Brandon; Fonseca, Nuno A; Huerta, Laura; Keays, Maria; Tang, Y Amy; Parkinson, Helen; Fabregat, Antonio; McKay, Sheldon; Weiser, Joel; D'Eustachio, Peter; Stein, Lincoln; Petryszak, Robert; Kersey, Paul J; Jaiswal, Pankaj; Ware, Doreen
2016-01-04
Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the database website adopted a new Drupal management platform. The genomes section features 39 fully assembled reference genomes that are integrated using ontology-based annotation and comparative analyses, and accessed through both visual and programmatic interfaces. Additional community data, such as genetic variation, expression and methylation, are also mapped for a subset of genomes. The Plant Reactome pathway portal (http://plantreactome.gramene.org) provides a reference resource for analyzing plant metabolic and regulatory pathways. In addition to ∼ 200 curated rice reference pathways, the portal hosts gene homology-based pathway projections for 33 plant species. Both the genome and pathway browsers interface with the EMBL-EBI's Expression Atlas to enable the projection of baseline and differential expression data from curated expression studies in plants. Gramene's archive website (http://archive.gramene.org) continues to provide previously reported resources on comparative maps, markers and QTL. To further aid our users, we have also introduced a live monthly educational webinar series and a Gramene YouTube channel carrying video tutorials. Published by Oxford University Press on behalf of Nucleic Acids Research 2015. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Liu, Yunxian; Hilakivi-Clarke, Leena; Zhang, Yukun; Wang, Xiao; Pan, Yuan-Xiang; Xuan, Jianhua; Fleck, Stefanie C; Doerge, Daniel R; Helferich, William G
2015-08-01
Soy flour diet (MS) prevented isoflavones from stimulating MCF-7 tumor growth in athymic nude mice, indicating that other bioactive compounds in soy can negate the estrogenic properties of isoflavones. The underlying signal transduction pathways to explain the protective effects of soy flour consumption were studied here. Ovariectomized athymic nude mice inoculated with MCF-7 human breast cancer cells were fed either Soy flour diet (MS) or purified isoflavone mix diet (MI), both with equivalent amounts of genistein. Positive controls received estradiol pellets and negative controls received sham pellets. GeneChip Human Genome U133 Plus 2.0 Array platform was used to evaluate gene expressions, and results were analyzed using bioinformatics approaches. Tumors in MS-fed mice exhibited higher expression of tumor growth suppressing genes ATP2A3 and BLNK and lower expression of oncogene MYC. Tumors in MI-fed mice expressed a higher level of oncogene MYB and a lower level of MHC-I and MHC-II, allowing tumor cells to escape immunosurveillance. MS-induced gene expression alterations were predictive of prolonged survival among estrogen-receptor-positive breast cancer patients, whilst MI-induced gene changes were predictive of shortened survival. Our findings suggest that dietary soy flour affects gene expression differently than purified isoflavones, which may explain why soy foods prevent isoflavones-induced stimulation of MCF-7 tumor growth in athymic nude mice. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A new estimator of the discovery probability.
Favaro, Stefano; Lijoi, Antonio; Prünster, Igor
2012-12-01
Species sampling problems have a long history in ecological and biological studies and a number of issues, including the evaluation of species richness, the design of sampling experiments, and the estimation of rare species variety, are to be addressed. Such inferential problems have recently emerged also in genomic applications, however, exhibiting some peculiar features that make them more challenging: specifically, one has to deal with very large populations (genomic libraries) containing a huge number of distinct species (genes) and only a small portion of the library has been sampled (sequenced). These aspects motivate the Bayesian nonparametric approach we undertake, since it allows to achieve the degree of flexibility typically needed in this framework. Based on an observed sample of size n, focus will be on prediction of a key aspect of the outcome from an additional sample of size m, namely, the so-called discovery probability. In particular, conditionally on an observed basic sample of size n, we derive a novel estimator of the probability of detecting, at the (n+m+1)th observation, species that have been observed with any given frequency in the enlarged sample of size n+m. Such an estimator admits a closed-form expression that can be exactly evaluated. The result we obtain allows us to quantify both the rate at which rare species are detected and the achieved sample coverage of abundant species, as m increases. Natural applications are represented by the estimation of the probability of discovering rare genes within genomic libraries and the results are illustrated by means of two expressed sequence tags datasets. © 2012, The International Biometric Society.
Zhang, Wenli; Muck-Hausl, Martin; Wang, Jichang; Sun, Chuanbo; Gebbing, Maren; Miskey, Csaba; Ivics, Zoltan; Izsvak, Zsuzsanna; Ehrhardt, Anja
2013-01-01
We recently developed adenovirus/transposase hybrid-vectors utilizing the previously described hyperactive Sleeping Beauty (SB) transposase HSB5 for somatic integration and we could show stabilized transgene expression in mice and a canine model for hemophilia B. However, the safety profile of these hybrid-vectors with respect to vector dose and genotoxicity remains to be investigated. Herein, we evaluated this hybrid-vector system in C57Bl/6 mice with escalating vector dose settings. We found that in all mice which received the hyperactive SB transposase, transgene expression levels were stabilized in a dose-dependent manner and that the highest vector dose was accompanied by fatalities in mice. To analyze potential genotoxic side-effects due to somatic integration into host chromosomes, we performed a genome-wide integration site analysis using linker-mediated PCR (LM-PCR) and linear amplification-mediated PCR (LAM-PCR). Analysis of genomic DNA samples obtained from HSB5 treated female and male mice revealed a total of 1327 unique transposition events. Overall the chromosomal distribution pattern was close-to-random and we observed a random integration profile with respect to integration into gene and non-gene areas. Notably, when using the LM-PCR protocol, 27 extra-chromosomal integration events were identified, most likely caused by transposon excision and subsequent transposition into the delivered adenoviral vector genome. In total, this study provides a careful evaluation of the safety profile of adenovirus/Sleeping Beauty transposase hybrid-vectors. The obtained information will be useful when designing future preclinical studies utilizing hybrid-vectors in small and large animal models. PMID:24124483
Chou, Wen-Chi; Ma, Qin; Yang, Shihui; ...
2015-03-12
The identification of transcription units (TUs) encoded in a bacterial genome is essential to elucidation of transcriptional regulation of the organism. To gain a detailed understanding of the dynamically composed TU structures, we have used four strand-specific RNA-seq (ssRNA-seq) datasets collected under two experimental conditions to derive the genomic TU organization of Clostridium thermocellum using a machine-learning approach. Our method accurately predicted the genomic boundaries of individual TUs based on two sets of parameters measuring the RNA-seq expression patterns across the genome: expression-level continuity and variance. A total of 2590 distinct TUs are predicted based on the four RNA-seq datasets.more » Moreover, among the predicted TUs, 44% have multiple genes. We assessed our prediction method on an independent set of RNA-seq data with longer reads. The evaluation confirmed the high quality of the predicted TUs. Functional enrichment analyses on a selected subset of the predicted TUs revealed interesting biology. To demonstrate the generality of the prediction method, we have also applied the method to RNA-seq data collected on Escherichia coli and achieved high prediction accuracies. The TU prediction program named SeqTU is publicly available athttps://code.google.com/p/seqtu/. We expect that the predicted TUs can serve as the baseline information for studying transcriptional and post-transcriptional regulation in C. thermocellum and other bacteria.« less
Saulnier Sholler, Giselle L; Bond, Jeffrey P; Bergendahl, Genevieve; Dutta, Akshita; Dragon, Julie; Neville, Kathleen; Ferguson, William; Roberts, William; Eslin, Don; Kraveka, Jacqueline; Kaplan, Joel; Mitchell, Deanna; Parikh, Nehal; Merchant, Melinda; Ashikaga, Takamaru; Hanna, Gina; Lescault, Pamela Jean; Siniard, Ashley; Corneveaux, Jason; Huentelman, Matthew; Trent, Jeffrey
2015-01-01
The primary objective of the study was to evaluate the feasibility and safety of a process which would utilize genome-wide expression data from tumor biopsies to support individualized treatment decisions. Current treatment options for recurrent neuroblastoma are limited and ineffective, with a survival rate of <10%. Molecular profiling may provide data which will enable the practitioner to select the most appropriate therapeutic option for individual patients, thus improving outcomes. Sixteen patients with neuroblastoma were enrolled of which fourteen were eligible for this study. Feasibility was defined as completion of tumor biopsy, pathological evaluation, RNA quality control, gene expression profiling, bioinformatics analysis, generation of a drug prediction report, molecular tumor board yielding a treatment plan, independent medical monitor review, and treatment initiation within a 21 day period. All eligible biopsies passed histopathology and RNA quality control. Expression profiling by microarray and RNA sequencing were mutually validated. The average time from biopsy to report generation was 5.9 days and from biopsy to initiation of treatment was 12.4 days. No serious adverse events were observed and all adverse events were expected. Clinical benefit was seen in 64% of patients as stabilization of disease for at least one cycle of therapy or partial response. The overall response rate was 7% and the progression free survival was 59 days. This study demonstrates the feasibility and safety of performing real-time genomic profiling to guide treatment decision making for pediatric neuroblastoma patients. PMID:25720842
A Protocol for Epigenetic Imprinting Analysis with RNA-Seq Data.
Zou, Jinfeng; Xiang, Daoquan; Datla, Raju; Wang, Edwin
2018-01-01
Genomic imprinting is an epigenetic regulatory mechanism that operates through expression of certain genes from maternal or paternal in a parent-of-origin-specific manner. Imprinted genes have been identified in diverse biological systems that are implicated in some human diseases and in embryonic and seed developmental programs in plants. The molecular underpinning programs and mechanisms involved in imprinting are yet to be explored in depth in plants. The recent advances in RNA-Seq-based methods and technologies offer an opportunity to systematically analyze epigenetic imprinting that operates at the whole genome level in the model and crop plants. We are interested using Arabidopsis model system, to investigate gene expression patterns associated with parent of origin and their implications to imprinting during embryo and seed development. Toward this, we have generated early embryo development RNA-Seq-based transcriptome datasets in F1s from a genetic cross between two diverse Arabidopsis thaliana ecotypes Col-0 and Tsu-1. With the data, we developed a protocol for evaluating the maternal and paternal contributions of genes during the early stages of embryo development after fertilization. This protocol is also designed to consider the contamination from other potential seed tissues, sequencing quality, proper processing of sequenced reads and variant calling, and appropriate inference of the parental contributions based on the parent-of-origin-specific single-nucleotide polymorphisms within the expressed genes. The approach, methods and the protocol developed in this study can be used for evaluating the effects of epigenetic imprinting in plants.
Transcriptional activity of transposable elements in coelacanth.
Forconi, Mariko; Chalopin, Domitille; Barucca, Marco; Biscotti, Maria Assunta; De Moro, Gianluca; Galiana, Delphine; Gerdol, Marco; Pallavicini, Alberto; Canapa, Adriana; Olmo, Ettore; Volff, Jean-Nicolas
2014-09-01
The morphological stasis of coelacanths has long suggested a slow evolutionary rate. General genomic stasis might also imply a decrease of transposable elements activity. To evaluate the potential activity of transposable elements (TEs) in "living fossil" species, transcriptomic data of Latimeria chalumnae and its Indonesian congener Latimeria menadoensis were compared through the RNA-sequencing mapping procedures in three different organs (liver, testis, and muscle). The analysis of coelacanth transcriptomes highlights a significant percentage of transcribed TEs in both species. Major contributors are LINE retrotransposons, especially from the CR1 family. Furthermore, some particular elements such as a LF-SINE and a LINE2 sequences seem to be more expressed than other elements. The amount of TEs expressed in testis suggests possible transposition burst in incoming generations. Moreover, significant amount of TEs in liver and muscle transcriptomes were also observed. Analyses of elements displaying marked organ-specific expression gave us the opportunity to highlight exaptation cases, that is, the recruitment of TEs as new cellular genes, but also to identify a new Latimeria-specific family of Short Interspersed Nuclear Elements called CoeG-SINEs. Overall, transcriptome results do not seem to be in line with a slow-evolving genome with poor TE activity. © 2013 Wiley Periodicals, Inc.
Differential retention of metabolic genes following whole-genome duplication.
Gout, Jean-François; Duret, Laurent; Kahn, Daniel
2009-05-01
Classical studies in Metabolic Control Theory have shown that metabolic fluxes usually exhibit little sensitivity to changes in individual enzyme activity, yet remain sensitive to global changes of all enzymes in a pathway. Therefore, little selective pressure is expected on the dosage or expression of individual metabolic genes, yet entire pathways should still be constrained. However, a direct estimate of this selective pressure had not been evaluated. Whole-genome duplications (WGDs) offer a good opportunity to address this question by analyzing the fates of metabolic genes during the massive gene losses that follow. Here, we take advantage of the successive rounds of WGD that occurred in the Paramecium lineage. We show that metabolic genes exhibit different gene retention patterns than nonmetabolic genes. Contrary to what was expected for individual genes, metabolic genes appeared more retained than other genes after the recent WGD, which was best explained by selection for gene expression operating on entire pathways. Metabolic genes also tend to be less retained when present at high copy number before WGD, contrary to other genes that show a positive correlation between gene retention and preduplication copy number. This is rationalized on the basis of the classical concave relationship relating metabolic fluxes with enzyme expression.
MEXPRESS: visualizing expression, DNA methylation and clinical TCGA data.
Koch, Alexander; De Meyer, Tim; Jeschke, Jana; Van Criekinge, Wim
2015-08-26
In recent years, increasing amounts of genomic and clinical cancer data have become publically available through large-scale collaborative projects such as The Cancer Genome Atlas (TCGA). However, as long as these datasets are difficult to access and interpret, they are essentially useless for a major part of the research community and their scientific potential will not be fully realized. To address these issues we developed MEXPRESS, a straightforward and easy-to-use web tool for the integration and visualization of the expression, DNA methylation and clinical TCGA data on a single-gene level ( http://mexpress.be ). In comparison to existing tools, MEXPRESS allows researchers to quickly visualize and interpret the different TCGA datasets and their relationships for a single gene, as demonstrated for GSTP1 in prostate adenocarcinoma. We also used MEXPRESS to reveal the differences in the DNA methylation status of the PAM50 marker gene MLPH between the breast cancer subtypes and how these differences were linked to the expression of MPLH. We have created a user-friendly tool for the visualization and interpretation of TCGA data, offering clinical researchers a simple way to evaluate the TCGA data for their genes or candidate biomarkers of interest.
GTA: a game theoretic approach to identifying cancer subnetwork markers.
Farahmand, S; Goliaei, S; Ansari-Pour, N; Razaghi-Moghadam, Z
2016-03-01
The identification of genetic markers (e.g. genes, pathways and subnetworks) for cancer has been one of the most challenging research areas in recent years. A subset of these studies attempt to analyze genome-wide expression profiles to identify markers with high reliability and reusability across independent whole-transcriptome microarray datasets. Therefore, the functional relationships of genes are integrated with their expression data. However, for a more accurate representation of the functional relationships among genes, utilization of the protein-protein interaction network (PPIN) seems to be necessary. Herein, a novel game theoretic approach (GTA) is proposed for the identification of cancer subnetwork markers by integrating genome-wide expression profiles and PPIN. The GTA method was applied to three distinct whole-transcriptome breast cancer datasets to identify the subnetwork markers associated with metastasis. To evaluate the performance of our approach, the identified subnetwork markers were compared with gene-based, pathway-based and network-based markers. We show that GTA is not only capable of identifying robust metastatic markers, it also provides a higher classification performance. In addition, based on these GTA-based subnetworks, we identified a new bonafide candidate gene for breast cancer susceptibility.
CRISPR-Cas9 provides the means to perform genome editing and facilitates loss-of-function screens. However, we and others demonstrated that expression of the Cas9 endonuclease induces a gene-independent response that correlates with the number of target sequences in the genome. An alternative approach to suppressing gene expression is to block transcription using a catalytically inactive Cas9 (dCas9). Here we directly compare genome editing by CRISPR-Cas9 (cutting, CRISPRc) and gene suppression using KRAB-dCas9 (CRISPRi) in loss-of-function screens to identify cell essential genes.
Li, XiaoChing; Wang, Xiu-Jie; Tannenhauser, Jonathan; Podell, Sheila; Mukherjee, Piali; Hertel, Moritz; Biane, Jeremy; Masuda, Shoko; Nottebohm, Fernando; Gaasterland, Terry
2007-01-01
Vocal learning and neuronal replacement have been studied extensively in songbirds, but until recently, few molecular and genomic tools for songbird research existed. Here we describe new molecular/genomic resources developed in our laboratory. We made cDNA libraries from zebra finch (Taeniopygia guttata) brains at different developmental stages. A total of 11,000 cDNA clones from these libraries, representing 5,866 unique gene transcripts, were randomly picked and sequenced from the 3′ ends. A web-based database was established for clone tracking, sequence analysis, and functional annotations. Our cDNA libraries were not normalized. Sequencing ESTs without normalization produced many developmental stage-specific sequences, yielding insights into patterns of gene expression at different stages of brain development. In particular, the cDNA library made from brains at posthatching day 30–50, corresponding to the period of rapid song system development and song learning, has the most diverse and richest set of genes expressed. We also identified five microRNAs whose sequences are highly conserved between zebra finch and other species. We printed cDNA microarrays and profiled gene expression in the high vocal center of both adult male zebra finches and canaries (Serinus canaria). Genes differentially expressed in the high vocal center were identified from the microarray hybridization results. Selected genes were validated by in situ hybridization. Networks among the regulated genes were also identified. These resources provide songbird biologists with tools for genome annotation, comparative genomics, and microarray gene expression analysis. PMID:17426146
Gene expression profiling--Opening the black box of plant ecosystem responses to global change
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leakey, A.D.B.; Ainsworth, E.A.; Bernard, S.M.
The use of genomic techniques to address ecological questions is emerging as the field of genomic ecology. Experimentation under environmentally realistic conditions to investigate the molecular response of plants to meaningful changes in growth conditions and ecological interactions is the defining feature of genomic ecology. Since the impact of global change factors on plant performance are mediated by direct effects at the molecular, biochemical and physiological scales, gene expression analysis promises important advances in understanding factors that have previously been consigned to the 'black box' of unknown mechanism. Various tools and approaches are available for assessing gene expression in modelmore » and non-model species as part of global change biology studies. Each approach has its own unique advantages and constraints. A first generation of genomic ecology studies in managed ecosystems and mesocosms have provided a testbed for the approach and have begun to reveal how the experimental design and data analysis of gene expression studies can be tailored for use in an ecological context.« less
Heo, Min-Ji; Jung, Hwi-Min; Um, Jaeyong; Lee, Sang-Woo; Oh, Min-Kyu
2017-02-17
Genome editing using CRISPR/Cas9 was successfully demonstrated in Esherichia coli to effectively produce n-butanol in a defined medium under microaerobic condition. The butanol synthetic pathway genes including those encoding oxygen-tolerant alcohol dehydrogenase were overexpressed in metabolically engineered E. coli, resulting in 0.82 g/L butanol production. To increase butanol production, carbon flux from acetyl-CoA to citric acid cycle should be redirected to acetoacetyl-CoA. For this purpose, the 5'-untranslated region sequence of gltA encoding citrate synthase was designed using an expression prediction program, UTR designer, and modified using the CRISPR/Cas9 genome editing method to reduce its expression level. E. coli strains with decreased citrate synthase expression produced more butanol and the citrate synthase activity was correlated with butanol production. These results demonstrate that redistributing carbon flux using genome editing is an efficient engineering tool for metabolite overproduction.
A genome-wide 20 K citrus microarray for gene expression analysis
Martinez-Godoy, M Angeles; Mauri, Nuria; Juarez, Jose; Marques, M Carmen; Santiago, Julia; Forment, Javier; Gadea, Jose
2008-01-01
Background Understanding of genetic elements that contribute to key aspects of citrus biology will impact future improvements in this economically important crop. Global gene expression analysis demands microarray platforms with a high genome coverage. In the last years, genome-wide EST collections have been generated in citrus, opening the possibility to create new tools for functional genomics in this crop plant. Results We have designed and constructed a publicly available genome-wide cDNA microarray that include 21,081 putative unigenes of citrus. As a functional companion to the microarray, a web-browsable database [1] was created and populated with information about the unigenes represented in the microarray, including cDNA libraries, isolated clones, raw and processed nucleotide and protein sequences, and results of all the structural and functional annotation of the unigenes, like general description, BLAST hits, putative Arabidopsis orthologs, microsatellites, putative SNPs, GO classification and PFAM domains. We have performed a Gene Ontology comparison with the full set of Arabidopsis proteins to estimate the genome coverage of the microarray. We have also performed microarray hybridizations to check its usability. Conclusion This new cDNA microarray replaces the first 7K microarray generated two years ago and allows gene expression analysis at a more global scale. We have followed a rational design to minimize cross-hybridization while maintaining its utility for different citrus species. Furthermore, we also provide access to a website with full structural and functional annotation of the unigenes represented in the microarray, along with the ability to use this site to directly perform gene expression analysis using standard tools at different publicly available servers. Furthermore, we show how this microarray offers a good representation of the citrus genome and present the usefulness of this genomic tool for global studies in citrus by using it to catalogue genes expressed in citrus globular embryos. PMID:18598343
Transcriptome Assembly, Gene Annotation and Tissue Gene Expression Atlas of the Rainbow Trout
Salem, Mohamed; Paneru, Bam; Al-Tobasei, Rafet; Abdouni, Fatima; Thorgaard, Gary H.; Rexroad, Caird E.; Yao, Jianbo
2015-01-01
Efforts to obtain a comprehensive genome sequence for rainbow trout are ongoing and will be complemented by transcriptome information that will enhance genome assembly and annotation. Previously, transcriptome reference sequences were reported using data from different sources. Although the previous work added a great wealth of sequences, a complete and well-annotated transcriptome is still needed. In addition, gene expression in different tissues was not completely addressed in the previous studies. In this study, non-normalized cDNA libraries were sequenced from 13 different tissues of a single doubled haploid rainbow trout from the same source used for the rainbow trout genome sequence. A total of ~1.167 billion paired-end reads were de novo assembled using the Trinity RNA-Seq assembler yielding 474,524 contigs > 500 base-pairs. Of them, 287,593 had homologies to the NCBI non-redundant protein database. The longest contig of each cluster was selected as a reference, yielding 44,990 representative contigs. A total of 4,146 contigs (9.2%), including 710 full-length sequences, did not match any mRNA sequences in the current rainbow trout genome reference. Mapping reads to the reference genome identified an additional 11,843 transcripts not annotated in the genome. A digital gene expression atlas revealed 7,678 housekeeping and 4,021 tissue-specific genes. Expression of about 16,000–32,000 genes (35–71% of the identified genes) accounted for basic and specialized functions of each tissue. White muscle and stomach had the least complex transcriptomes, with high percentages of their total mRNA contributed by a small number of genes. Brain, testis and intestine, in contrast, had complex transcriptomes, with a large numbers of genes involved in their expression patterns. This study provides comprehensive de novo transcriptome information that is suitable for functional and comparative genomics studies in rainbow trout, including annotation of the genome. PMID:25793877
Bozinovic, Goran; Oleksiak, Marjorie F.
2010-01-01
Transcriptomics and population genomics are two complementary genomic approaches that can be used to gain insight into pollutant effects in natural populations. Transcriptomics identify altered gene expression pathways while population genomics approaches more directly target the causative genomic polymorphisms. Neither approach is restricted to a pre-determined set of genes or loci. Instead, both approaches allow a broad overview of genomic processes. Transcriptomics and population genomic approaches have been used to explore genomic responses in populations of fish from polluted environments and have identified sets of candidate genes and loci that appear biologically important in response to pollution. Often differences in gene expression or loci between polluted and reference populations are not conserved among polluted populations suggesting a biological complexity that we do not yet fully understand. As genomic approaches become less expensive with the advent of new sequencing and genotyping technologies, they will be more widely used in complimentary studies. However, while these genomic approaches are immensely powerful for identifying candidate gene and loci, the challenge of determining biological mechanisms that link genotypes and phenotypes remains. PMID:21072843
APOBEC3B upregulation and genomic mutation patterns in serous ovarian carcinoma
Leonard, Brandon; Hart, Steven N.; Burns, Michael B.; Carpenter, Michael A.; Temiz, Nuri A.; Rathore, Anurag; Vogel, Rachel Isaksson; Nikas, Jason B.; Law, Emily K.; Brown, William L.; Li, Ying; Zhang, Yuji; Maurer, Matthew J.; Oberg, Ann L.; Cunningham, Julie M.; Shridhar, Viji; Bell, Debra A.; April, Craig; Bentley, David; Bibikova, Marina; Cheetham, R. Keira; Fan, Jian-Bing; Grocock, Russell; Humphray, Sean; Kingsbury, Zoya; Peden, John; Chien, Jeremy; Swisher, Elizabeth M.; Hartmann, Lynn C.; Kalli, Kimberly R.; Goode, Ellen L.; Sicotte, Hugues; Kaufmann, Scott H.; Harris, Reuben S.
2013-01-01
Ovarian cancer is a clinically and molecularly heterogeneous disease. The driving forces behind this variability are unknown. Here we report wide variation in expression of the DNA cytosine deaminase APOBEC3B, with elevated expression in a majority of ovarian cancer cell lines (3 standard deviations above the mean of normal ovarian surface epithelial cells) and high grade primary ovarian cancers. APOBEC3B is active in the nucleus of several ovarian cancer cell lines and elicits a biochemical preference for deamination of cytosines in 5′TC dinucleotides. Importantly, examination of whole-genome sequence from 16 ovarian cancers reveals that APOBEC3B expression correlates with total mutation load as well as elevated levels of transversion mutations. In particular, high APOBEC3B expression correlates with C-to-A and C-to-G transversion mutations within 5′TC dinucleotide motifs in early-stage high grade serous ovarian cancer genomes, suggesting that APOBEC3B-catalyzed genomic uracil lesions are further processed by downstream DNA ‘repair’ enzymes including error-prone translesion polymerases. These data identify a potential role for APOBEC3B in serous ovarian cancer genomic instability. PMID:24154874
Gene expression levels as endophenotypes in genome-wide association studies of Alzheimer disease
Zou, F.; Carrasquillo, M. M.; Pankratz, V. S.; Belbin, O.; Morgan, K.; Allen, M.; Wilcox, S. L.; Ma, L.; Walker, L. P.; Kouri, N.; Burgess, J. D.; Younkin, L. H.; Younkin, Samuel G.; Younkin, C. S.; Bisceglio, G. D.; Crook, J. E.; Dickson, D. W.; Petersen, R. C.; Graff-Radford, N.; Younkin, Steven G.; Ertekin-Taner, N.
2010-01-01
Background: Late-onset Alzheimer disease (LOAD) is a common disorder with a substantial genetic component. We postulate that many disease susceptibility variants act by altering gene expression levels. Methods: We measured messenger RNA (mRNA) expression levels of 12 LOAD candidate genes in the cerebella of 200 subjects with LOAD. Using the genotypes from our LOAD genome-wide association study for the cis-single nucleotide polymorphisms (SNPs) (n = 619) of these 12 LOAD candidate genes, we tested for associations with expression levels as endophenotypes. The strongest expression cis-SNP was tested for AD association in 7 independent case-control series (2,280 AD and 2,396 controls). Results: We identified 3 SNPs that associated significantly with IDE (insulin degrading enzyme) expression levels. A single copy of the minor allele for each significant SNP was associated with ∼twofold higher IDE expression levels. The most significant SNP, rs7910977, is 4.2 kb beyond the 3′ end of IDE. The association observed with this SNP was significant even at the genome-wide level (p = 2.7 × 10−8). Furthermore, the minor allele of rs7910977 associated significantly (p = 0.0046) with reduced LOAD risk (OR = 0.81 with a 95% CI of 0.70-0.94), as expected biologically from its association with elevated IDE expression. Conclusions: These results provide strong evidence that IDE is a late-onset Alzheimer disease (LOAD) gene with variants that modify risk of LOAD by influencing IDE expression. They also suggest that the use of expression levels as endophenotypes in genome-wide association studies may provide a powerful approach for the identification of disease susceptibility alleles. GLOSSARY AD = Alzheimer disease; CI = confidence interval; GWAS = genome-wide association study; LOAD = late-onset Alzheimer disease; mRNA = messenger RNA; OR = odds ratio; SNP = single nucleotide polymorphism. PMID:20142614
Wen, Jianguo; Tao, Wenjing; Hao, Suyang; Zu, Youli
2017-06-13
Sickle cell disease (SCD) is a disorder of red blood cells (RBCs) expressing abnormal hemoglobin-S (HbS) due to genetic inheritance of homologous HbS gene. However, people with the sickle cell trait (SCT) carry a single allele of HbS and do not usually suffer from SCD symptoms, thus providing a rationale to treat SCD. To validate gene therapy potential, hematopoietic stem cells were isolated from the SCD patient blood and treated with CRISPR/Cas9 approach. To precisely dissect genome-editing effects, erythroid progenitor cells were cloned from single colonies of CRISPR-treated cells and then expanded for simultaneous gene, protein, and cellular function studies. Genotyping and sequencing analysis revealed that the genome-edited erythroid progenitor colonies were converted to SCT genotype from SCD genotype. HPLC protein assays confirmed reinstallation of normal hemoglobin at a similar level with HbS in the cloned genome-edited erythroid progenitor cells. For cell function evaluation, in vitro RBC differentiation of the cloned erythroid progenitor cells was induced. As expected, cell sickling assays indicated function reinstitution of the genome-edited offspring SCD RBCs, which became more resistant to sickling under hypoxia condition. This study is an exploration of genome editing of SCD HSPCs.
Häcker, Irina; Harrell Ii, Robert A; Eichner, Gerrit; Pilitt, Kristina L; O'Brochta, David A; Handler, Alfred M; Schetelig, Marc F
2017-03-07
Site-specific genome modification (SSM) is an important tool for mosquito functional genomics and comparative gene expression studies, which contribute to a better understanding of mosquito biology and are thus a key to finding new strategies to eliminate vector-borne diseases. Moreover, it allows for the creation of advanced transgenic strains for vector control programs. SSM circumvents the drawbacks of transposon-mediated transgenesis, where random transgene integration into the host genome results in insertional mutagenesis and variable position effects. We applied the Cre/lox recombinase-mediated cassette exchange (RMCE) system to Aedes aegypti, the vector of dengue, chikungunya, and Zika viruses. In this context we created four target site lines for RMCE and evaluated their fitness costs. Cre-RMCE is functional in a two-step mechanism and with good efficiency in Ae. aegypti. The advantages of Cre-RMCE over existing site-specific modification systems for Ae. aegypti, phiC31-RMCE and CRISPR, originate in the preservation of the recombination sites, which 1) allows successive modifications and rapid expansion or adaptation of existing systems by repeated targeting of the same site; and 2) provides reversibility, thus allowing the excision of undesired sequences. Thereby, Cre-RMCE complements existing genomic modification tools, adding flexibility and versatility to vector genome targeting.
Jackson, Christopher J; Norman, John E; Schnare, Murray N; Gray, Michael W; Keeling, Patrick J; Waller, Ross F
2007-01-01
Background Dinoflagellates comprise an ecologically significant and diverse eukaryotic phylum that is sister to the phylum containing apicomplexan endoparasites. The mitochondrial genome of apicomplexans is uniquely reduced in gene content and size, encoding only three proteins and two ribosomal RNAs (rRNAs) within a highly compacted 6 kb DNA. Dinoflagellate mitochondrial genomes have been comparatively poorly studied: limited available data suggest some similarities with apicomplexan mitochondrial genomes but an even more radical type of genomic organization. Here, we investigate structure, content and expression of dinoflagellate mitochondrial genomes. Results From two dinoflagellates, Crypthecodinium cohnii and Karlodinium micrum, we generated over 42 kb of mitochondrial genomic data that indicate a reduced gene content paralleling that of mitochondrial genomes in apicomplexans, i.e., only three protein-encoding genes and at least eight conserved components of the highly fragmented large and small subunit rRNAs. Unlike in apicomplexans, dinoflagellate mitochondrial genes occur in multiple copies, often as gene fragments, and in numerous genomic contexts. Analysis of cDNAs suggests several novel aspects of dinoflagellate mitochondrial gene expression. Polycistronic transcripts were found, standard start codons are absent, and oligoadenylation occurs upstream of stop codons, resulting in the absence of termination codons. Transcripts of at least one gene, cox3, are apparently trans-spliced to generate full-length mRNAs. RNA substitutional editing, a process previously identified for mRNAs in dinoflagellate mitochondria, is also implicated in rRNA expression. Conclusion The dinoflagellate mitochondrial genome shares the same gene complement and fragmentation of rRNA genes with its apicomplexan counterpart. However, it also exhibits several unique characteristics. Most notable are the expansion of gene copy numbers and their arrangements within the genome, RNA editing, loss of stop codons, and use of trans-splicing. PMID:17897476
Genome evolution and speciation genetics of clawed frogs (Xenopus and Silurana).
Evans, Ben J
2008-05-01
Speciation of clawed frogs occurred through bifurcation and reticulation of evolutionary lineages, and resulted in extant species with different ploidy levels. Duplicate gene evolution and expression in these animals provides a unique perspective into the earliest genomic transformations after vertebrate whole genome duplication (WGD) and suggests that functional constraints are relaxed compared to before duplication but still consistently strong for millions of years following WGD. Additionally, extensive quantitative expression divergence between duplicate genes occurred after WGD. Diversification of clawed frogs was potentially catalyzed by transposition and divergent resolution--processes that occur through different genetic mechanisms but that have analogous implications for genome structure. How sex determination is maintained after genome duplication is fundamental to our understanding of why allopolyploidization is so prevalent in this group, and why clawed frogs violate Haldane's Rule for hybrid sterility. Future studies of expression subfunctionalization in polyploids will shed light on the role and purviews of cis- and trans-regulatory elements in gene regulation.
Sexual dimorphism in parental imprint ontogeny and contribution to embryonic development.
Bourc'his, Déborah; Proudhon, Charlotte
2008-01-30
Genomic imprinting refers to the functional non-equivalence of parental genomes in mammals that results from the parent-of-origin allelic expression of a subset of genes. Parent-specific expression is dependent on the germ line acquisition of DNA methylation marks at imprinting control regions (ICRs), coordinated by the DNA-methyltransferase homolog DNMT3L. We discuss here how the gender-specific stages of DNMT3L expression may have influenced the various sexually dimorphic aspects of genomic imprinting: (1) the differential developmental timing of methylation establishment at paternally and maternally imprinted genes in each parental germ line, (2) the differential dependence on DNMT3L of parental methylation imprint establishment, (3) the unequal duration of paternal versus maternal methylation imprints during germ cell development, (4) the biased distribution of methylation-dependent ICRs towards the maternal genome, (5) the different genomic organization of paternal versus maternal ICRs, and finally (6) the overwhelming contribution of maternal germ line imprints to development compared to their paternal counterparts.
Aquatic Plant Genomics: Advances, Applications, and Prospects
Li, Gaojie; Yang, Jingjing
2017-01-01
Genomics is a discipline in genetics that studies the genome composition of organisms and the precise structure of genes and their expression and regulation. Genomics research has resolved many problems where other biological methods have failed. Here, we summarize advances in aquatic plant genomics with a focus on molecular markers, the genes related to photosynthesis and stress tolerance, comparative study of genomes and genome/transcriptome sequencing technology. PMID:28900619
Umemura, Myco; Koike, Hideaki; Yamane, Noriko; Koyama, Yoshinori; Satou, Yuki; Kikuzato, Ikuya; Teruya, Morimi; Tsukahara, Masatoshi; Imada, Yumi; Wachi, Youji; Miwa, Yukino; Yano, Shuichi; Tamano, Koichi; Kawarabayasi, Yutaka; Fujimori, Kazuhiro E.; Machida, Masayuki; Hirano, Takashi
2012-01-01
Aspergillus oryzae has been utilized for over 1000 years in Japan for the production of various traditional foods, and a large number of A. oryzae strains have been isolated and/or selected for the effective fermentation of food ingredients. Characteristics of genetic alterations among the strains used are of particular interest in studies of A. oryzae. Here, we have sequenced the whole genome of an industrial fungal isolate, A. oryzae RIB326, by using a next-generation sequencing system and compared the data with those of A. oryzae RIB40, a wild-type strain sequenced in 2005. The aim of this study was to evaluate the mutation pressure on the non-syntenic blocks (NSBs) of the genome, which were previously identified through comparative genomic analysis of A. oryzae, Aspergillus fumigatus, and Aspergillus nidulans. We found that genes within the NSBs of RIB326 accumulate mutations more frequently than those within the SBs, regardless of their distance from the telomeres or of their expression level. Our findings suggest that the high mutation frequency of NSBs might contribute to maintaining the diversity of the A. oryzae genome. PMID:22912434
Umemura, Myco; Koike, Hideaki; Yamane, Noriko; Koyama, Yoshinori; Satou, Yuki; Kikuzato, Ikuya; Teruya, Morimi; Tsukahara, Masatoshi; Imada, Yumi; Wachi, Youji; Miwa, Yukino; Yano, Shuichi; Tamano, Koichi; Kawarabayasi, Yutaka; Fujimori, Kazuhiro E; Machida, Masayuki; Hirano, Takashi
2012-10-01
Aspergillus oryzae has been utilized for over 1000 years in Japan for the production of various traditional foods, and a large number of A. oryzae strains have been isolated and/or selected for the effective fermentation of food ingredients. Characteristics of genetic alterations among the strains used are of particular interest in studies of A. oryzae. Here, we have sequenced the whole genome of an industrial fungal isolate, A. oryzae RIB326, by using a next-generation sequencing system and compared the data with those of A. oryzae RIB40, a wild-type strain sequenced in 2005. The aim of this study was to evaluate the mutation pressure on the non-syntenic blocks (NSBs) of the genome, which were previously identified through comparative genomic analysis of A. oryzae, Aspergillus fumigatus, and Aspergillus nidulans. We found that genes within the NSBs of RIB326 accumulate mutations more frequently than those within the SBs, regardless of their distance from the telomeres or of their expression level. Our findings suggest that the high mutation frequency of NSBs might contribute to maintaining the diversity of the A. oryzae genome.
Tomo, Naoki; Goto, Toshiyuki; Morikawa, Yuko
2013-03-26
Yeast is recognized as a generally safe microorganism and is utilized for the production of pharmaceutical products, including vaccines. We previously showed that expression of human immunodeficiency virus type 1 (HIV-1) Gag protein in Saccharomyces cerevisiae spheroplasts released Gag virus-like particles (VLPs) extracellularly, suggesting that the production system could be used in vaccine development. In this study, we further establish HIV-1 genome packaging into Gag VLPs in a yeast cell system. The nearly full-length HIV-1 genome containing the entire 5' long terminal repeat, U3-R-U5, did not transcribe gag mRNA in yeast. Co-expression of HIV-1 Tat, a transcription activator, did not support the transcription. When the HIV-1 promoter U3 was replaced with the promoter for the yeast glyceraldehyde-3-phosphate dehydrogenase gene, gag mRNA transcription was restored, but no Gag protein expression was observed. Co-expression of HIV-1 Rev, a factor that facilitates nuclear export of gag mRNA, did not support the protein synthesis. Progressive deletions of R-U5 and its downstream stem-loop-rich region (SL) to the gag start ATG codon restored Gag protein expression, suggesting that a highly structured noncoding RNA generated from the R-U5-SL region had an inhibitory effect on gag mRNA translation. When a plasmid containing the HIV-1 genome with the R-U5-SL region was coexpressed with an expression plasmid for Gag protein, the HIV-1 genomic RNA was transcribed and incorporated into Gag VLPs formed by Gag protein assembly, indicative of the trans-packaging of HIV-1 genomic RNA into Gag VLPs in a yeast cell system. The concentration of HIV-1 genomic RNA in Gag VLPs released from yeast was approximately 500-fold higher than that in yeast cytoplasm. The deletion of R-U5 to the gag gene resulted in the failure of HIV-1 RNA packaging into Gag VLPs, indicating that the packaging signal of HIV-1 genomic RNA present in the R-U5 to gag region functions similarly in yeast cells. Our data indicate that selective trans-packaging of HIV-1 genomic RNA into Gag VLPs occurs in a yeast cell system, analogous to a mammalian cell system, suggesting that yeast may provide an alternative packaging system for lentiviral RNA.
Genomic Perspectives of Transcriptional Regulation in Forebrain Development
Nord, Alex S.; Pattabiraman, Kartik; Visel, Axel; ...
2015-01-07
The forebrain is the seat of higher-order brain functions, and many human neuropsychiatric disorders are due to genetic defects affecting forebrain development, making it imperative to understand the underlying genetic circuitry. We report that recent progress now makes it possible to begin fully elucidating the genomic regulatory mechanisms that control forebrain gene expression. Here, we discuss the current knowledge of how transcription factors drive gene expression programs through their interactions with cis-acting genomic elements, such as enhancers; how analyses of chromatin and DNA modifications provide insights into gene expression states; and how these approaches yield insights into the evolution ofmore » the human brain.« less
[Prediction of Promoter Motifs in Virophages].
Gong, Chaowen; Zhou, Xuewen; Pan, Yingjie; Wang, Yongjie
2015-07-01
Virophages have crucial roles in ecosystems and are the transport vectors of genetic materials. To shed light on regulation and control mechanisms in virophage--host systems as well as evolution between virophages and their hosts, the promoter motifs of virophages were predicted on the upstream regions of start codons using an analytical tool for prediction of promoter motifs: Multiple EM for Motif Elicitation. Seventeen potential promoter motifs were identified based on the E-value, location, number and length of promoters in genomes. Sputnik and zamilon motif 2 with AT-rich regions were distributed widely on genomes, suggesting that these motifs may be associated with regulation of the expression of various genes. Motifs containing the TCTA box were predicted to be late promoter motif in mavirus; motifs containing the ATCT box were the potential late promoter motif in the Ace Lake mavirus . AT-rich regions were identified on motif 2 in the Organic Lake virophage, motif 3 in Yellowstone Lake virophage (YSLV)1 and 2, motif 1 in YSLV3, and motif 1 and 2 in YSLV4, respectively. AT-rich regions were distributed widely on the genomes of virophages. All of these motifs may be promoter motifs of virophages. Our results provide insights into further exploration of temporal expression of genes in virophages as well as associations between virophages and giant viruses.
Chacon, Diego; Beck, Dominik; Perera, Dilmi; Wong, Jason W H; Pimanda, John E
2014-01-01
The BloodChIP database (http://www.med.unsw.edu.au/CRCWeb.nsf/page/BloodChIP) supports exploration and visualization of combinatorial transcription factor (TF) binding at a particular locus in human CD34-positive and other normal and leukaemic cells or retrieval of target gene sets for user-defined combinations of TFs across one or more cell types. Increasing numbers of genome-wide TF binding profiles are being added to public repositories, and this trend is likely to continue. For the power of these data sets to be fully harnessed by experimental scientists, there is a need for these data to be placed in context and easily accessible for downstream applications. To this end, we have built a user-friendly database that has at its core the genome-wide binding profiles of seven key haematopoietic TFs in human stem/progenitor cells. These binding profiles are compared with binding profiles in normal differentiated and leukaemic cells. We have integrated these TF binding profiles with chromatin marks and expression data in normal and leukaemic cell fractions. All queries can be exported into external sites to construct TF-gene and protein-protein networks and to evaluate the association of genes with cellular processes and tissue expression.
Wang, Hui; Guo, Ruoyu; Ki, Jang-Seu
2018-03-01
Endocrine disrupting chemicals (EDCs) have toxic effects on algae; however, their molecular genomic responses have not been sufficiently elucidated. Here, we evaluated genome-scaled responses of the dinoflagellate alga Prorocentrum minimum exposed to an EDC, polychlorinated biphenyl (PCB), using a 6.0 K microarray. Based on two-fold change cut-off, we identified that 609 genes (∼10.2%) responded to the PCB treatment. KEGG pathway analysis showed that differentially expressed genes (DEGs) were related to ribosomes, biosynthesis of amino acids, spliceosomes, and cellular processes. Many DEGs were involved in cell cycle progression, apoptosis, signal transduction, ion binding, and cellular transportation. In contrast, only a few genes related to photosynthesis and oxidative stress were expressed in response to PCB exposure. This was supported by that fact that there were no obvious changes in the photosynthetic efficiency and reactive oxygen species (ROS) production. These results suggest that PCB might not cause chloroplast and oxidative damage, but could lead to cell cycle arrest and apoptosis. In addition, various signal transduction and transport pathways might be disrupted in the cells, which could further contribute to cell death. These results expand the genomic understanding of the effects of EDCs on this dinoflagellate protist. Copyright © 2017 Elsevier Ltd. All rights reserved.
Raisuddin, Sheikh; Kwok, Kevin W H; Leung, Kenneth M Y; Schlenk, Daniel; Lee, Jae-Seong
2007-07-20
There is an increasing body of evidence to support the significant role of invertebrates in assessing impacts of environmental contaminants on marine ecosystems. Therefore, in recent years massive efforts have been directed to identify viable and ecologically relevant invertebrate toxicity testing models. Tigriopus, a harpacticoid copepod has a number of promising characteristics which make it a candidate worth consideration in such efforts. Tigriopus and other copepods are widely distributed and ecologically important organisms. Their position in marine food chains is very prominent, especially with regard to the transfer of energy. Copepods also play an important role in the transportation of aquatic pollutants across the food chains. In recent years there has been a phenomenal increase in the knowledge base of Tigriopus spp., particularly in the areas of their ecology, geophylogeny, genomics and their behavioural, biochemical and molecular responses following exposure to environmental stressors and chemicals. Sequences of a number of important marker genes have been studied in various Tigriopus spp., notably T. californicus and T. japonicus. These genes belong to normal biophysiological functions (e.g. electron transport system enzymes) as well as stress and toxic chemical exposure responses (heat shock protein 20, glutathione reductase, glutathione S-transferase). Recently, 40,740 expressed sequenced tags (ESTs) from T. japonicus, have been sequenced and of them, 5,673 ESTs showed significant hits (E-value, >1.0E-05) to the red flour beetle Tribolium genome database. Metals and organic pollutants such as antifouling agents, pesticides, polycyclic aromatic hydrocarbons (PAH) and polychrlorinated biphenyls (PCB) have shown reproducible biological responses when tested in Tigriopus spp. Promising results have been obtained when Tigriopus was used for assessment of risk associated with exposure to endocrine-disrupting chemicals (EDCs). Application of environmental gene expression techniques has allowed evaluation of transcriptional changes in T. japonicus with the ultimate aim of understanding the mechanisms of action of environmental stressors. Through a better understanding of toxicological mechanisms, ecotoxicologists may use this ecologically relevant species in risk assessment studies in marine systems. The combination of uses as a whole-animal bioassay and gene expression studies indicate that Tigriopus may serve as an excellent tool to evaluate the impacts of marine pollution throughout the coastal region. The purpose of this review is to illustrate the potential of using Tigriopus to fulfill the niche as an important invertebrate marine model organism for ecotoxicology and environmental genomics. In addition, the knowledge gaps and areas for further studies have also been discussed.
Jorjani, Hadi; Zavolan, Mihaela
2014-04-01
Accurate identification of transcription start sites (TSSs) is an essential step in the analysis of transcription regulatory networks. In higher eukaryotes, the capped analysis of gene expression technology enabled comprehensive annotation of TSSs in genomes such as those of mice and humans. In bacteria, an equivalent approach, termed differential RNA sequencing (dRNA-seq), has recently been proposed, but the application of this approach to a large number of genomes is hindered by the paucity of computational analysis methods. With few exceptions, when the method has been used, annotation of TSSs has been largely done manually. In this work, we present a computational method called 'TSSer' that enables the automatic inference of TSSs from dRNA-seq data. The method rests on a probabilistic framework for identifying both genomic positions that are preferentially enriched in the dRNA-seq data as well as preferentially captured relative to neighboring genomic regions. Evaluating our approach for TSS calling on several publicly available datasets, we find that TSSer achieves high consistency with the curated lists of annotated TSSs, but identifies many additional TSSs. Therefore, TSSer can accelerate genome-wide identification of TSSs in bacterial genomes and can aid in further characterization of bacterial transcription regulatory networks. TSSer is freely available under GPL license at http://www.clipz.unibas.ch/TSSer/index.php
Genomic analyses of the CAM plant pineapple.
Zhang, Jisen; Liu, Juan; Ming, Ray
2014-07-01
The innovation of crassulacean acid metabolism (CAM) photosynthesis in arid and/or low CO2 conditions is a remarkable case of adaptation in flowering plants. As the most important crop that utilizes CAM photosynthesis, the genetic and genomic resources of pineapple have been developed over many years. Genetic diversity studies using various types of DNA markers led to the reclassification of the two genera Ananas and Pseudananas and nine species into one genus Ananas and two species, A. comosus and A. macrodontes with five botanical varieties in A. comosus. Five genetic maps have been constructed using F1 or F2 populations, and high-density genetic maps generated by genotype sequencing are essential resources for sequencing and assembling the pineapple genome and for marker-assisted selection. There are abundant expression sequence tag resources but limited genomic sequences in pineapple. Genes involved in the CAM pathway has been analysed in several CAM plants but only a few of them are from pineapple. A reference genome of pineapple is being generated and will accelerate genetic and genomic research in this major CAM crop. This reference genome of pineapple provides the foundation for studying the origin and regulatory mechanism of CAM photosynthesis, and the opportunity to evaluate the classification of Ananas species and botanical cultivars. © The Author 2014. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Catfish Genome Consortium; Wang, Shaolin; Peatman, Eric
2010-03-23
Background-Through the Community Sequencing Program, a catfish EST sequencing project was carried out through a collaboration between the catfish research community and the Department of Energy's Joint Genome Institute. Prior to this project, only a limited EST resource from catfish was available for the purpose of SNP identification. Results-A total of 438,321 quality ESTs were generated from 8 channel catfish (Ictalurus punctatus) and 4 blue catfish (Ictalurus furcatus) libraries, bringing the number of catfish ESTs to nearly 500,000. Assembly of all catfish ESTs resulted in 45,306 contigs and 66,272 singletons. Over 35percent of the unique sequences had significant similarities tomore » known genes, allowing the identification of 14,776 unique genes in catfish. Over 300,000 putative SNPs have been identified, of which approximately 48,000 are high-quality SNPs identified from contigs with at least four sequences and the minor allele presence of at least two sequences in the contig. The EST resource should be valuable for identification of microsatellites, genome annotation, large-scale expression analysis, and comparative genome analysis. Conclusions-This project generated a large EST resource for catfish that captured the majority of the catfish transcriptome. The parallel analysis of ESTs from two closely related Ictalurid catfishes should also provide powerful means for the evaluation of ancient and recent gene duplications, and for the development of high-density microarrays in catfish. The inter- and intra-specific SNPs identified from all catfish EST dataset assembly will greatly benefit the catfish introgression breeding program and whole genome association studies.« less
Gendreau, Kerry L; Haney, Robert A; Schwager, Evelyn E; Wierschin, Torsten; Stanke, Mario; Richards, Stephen; Garb, Jessica E
2017-02-16
Black widow spiders are infamous for their neurotoxic venom, which can cause extreme and long-lasting pain. This unusual venom is dominated by latrotoxins and latrodectins, two protein families virtually unknown outside of the black widow genus Latrodectus, that are difficult to study given the paucity of spider genomes. Using tissue-, sex- and stage-specific expression data, we analyzed the recently sequenced genome of the house spider (Parasteatoda tepidariorum), a close relative of black widows, to investigate latrotoxin and latrodectin diversity, expression and evolution. We discovered at least 47 latrotoxin genes in the house spider genome, many of which are tandem-arrayed. Latrotoxins vary extensively in predicted structural domains and expression, implying their significant functional diversification. Phylogenetic analyses show latrotoxins have substantially duplicated after the Latrodectus/Parasteatoda split and that they are also related to proteins found in endosymbiotic bacteria. Latrodectin genes are less numerous than latrotoxins, but analyses show their recruitment for venom function from neuropeptide hormone genes following duplication, inversion and domain truncation. While latrodectins and other peptides are highly expressed in house spider and black widow venom glands, latrotoxins account for a far smaller percentage of house spider venom gland expression. The house spider genome sequence provides novel insights into the evolution of venom toxins once considered unique to black widows. Our results greatly expand the size of the latrotoxin gene family, reinforce its narrow phylogenetic distribution, and provide additional evidence for the lateral transfer of latrotoxins between spiders and bacterial endosymbionts. Moreover, we strengthen the evidence for the evolution of latrodectin venom genes from the ecdysozoan Ion Transport Peptide (ITP)/Crustacean Hyperglycemic Hormone (CHH) neuropeptide superfamily. The lower expression of latrotoxins in house spiders relative to black widows, along with the absence of a vertebrate-targeting α-latrotoxin gene in the house spider genome, may account for the extreme potency of black widow venom.
Moin, Mazahar; Bakshi, Achala; Saha, Anusree; Dutta, Mouboni; Kirti, P B
2017-07-01
The epitome of any genome research is to identify all the existing genes in a genome and investigate their roles. Various techniques have been applied to unveil the functions either by silencing or over-expressing the genes by targeted expression or random mutagenesis. Rice is the most appropriate model crop for generating a mutant resource for functional genomic studies because of the availability of high-quality genome sequence and relatively smaller genome size. Rice has syntenic relationships with members of other cereals. Hence, characterization of functionally unknown genes in rice will possibly provide key genetic insights and can lead to comparative genomics involving other cereals. The current review attempts to discuss the available gain-of-function mutagenesis techniques for functional genomics, emphasizing the contemporary approach, activation tagging and alterations to this method for the enhancement of yield and productivity of rice. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Analysis of Epstein-Barr Virus Genomes and Expression Profiles in Gastric Adenocarcinoma.
Borozan, Ivan; Zapatka, Marc; Frappier, Lori; Ferretti, Vincent
2018-01-15
Epstein-Barr virus (EBV) is a causative agent of a variety of lymphomas, nasopharyngeal carcinoma (NPC), and ∼9% of gastric carcinomas (GCs). An important question is whether particular EBV variants are more oncogenic than others, but conclusions are currently hampered by the lack of sequenced EBV genomes. Here, we contribute to this question by mining whole-genome sequences of 201 GCs to identify 13 EBV-positive GCs and by assembling 13 new EBV genome sequences, almost doubling the number of available GC-derived EBV genome sequences and providing the first non-Asian EBV genome sequences from GC. Whole-genome sequence comparisons of all EBV isolates sequenced to date (85 from tumors and 57 from healthy individuals) showed that most GC and NPC EBV isolates were closely related although American Caucasian GC samples were more distant, suggesting a geographical component. However, EBV GC isolates were found to contain some consistent changes in protein sequences regardless of geographical origin. In addition, transcriptome data available for eight of the EBV-positive GCs were analyzed to determine which EBV genes are expressed in GC. In addition to the expected latency proteins (EBNA1, LMP1, and LMP2A), specific subsets of lytic genes were consistently expressed that did not reflect a typical lytic or abortive lytic infection, suggesting a novel mechanism of EBV gene regulation in the context of GC. These results are consistent with a model in which a combination of specific latent and lytic EBV proteins promotes tumorigenesis. IMPORTANCE Epstein-Barr virus (EBV) is a widespread virus that causes cancer, including gastric carcinoma (GC), in a small subset of individuals. An important question is whether particular EBV variants are more cancer associated than others, but more EBV sequences are required to address this question. Here, we have generated 13 new EBV genome sequences from GC, almost doubling the number of EBV sequences from GC isolates and providing the first EBV sequences from non-Asian GC. We further identify sequence changes in some EBV proteins common to GC isolates. In addition, gene expression analysis of eight of the EBV-positive GCs showed consistent expression of both the expected latency proteins and a subset of lytic proteins that was not consistent with typical lytic or abortive lytic expression. These results suggest that novel mechanisms activate expression of some EBV lytic proteins and that their expression may contribute to oncogenesis. Copyright © 2018 American Society for Microbiology.
Ferreira de Carvalho, Julie; Oplaat, Carla; Pappas, Nikolaos; Derks, Martijn; de Ridder, Dick; Verhoeven, Koen J F
2016-03-08
Asexual reproduction has the potential to enhance deleterious mutation accumulation and to constrain adaptive evolution. One source of mutations that can be especially relevant in recent asexuals is activity of transposable elements (TEs), which may have experienced selection for high transposition rates in sexual ancestor populations. Predictions of genomic divergence under asexual reproduction therefore likely include a large contribution of transposable elements but limited adaptive divergence. For plants empirical insight into genome divergence under asexual reproduction remains limited. Here, we characterize expression divergence between clone members of a single apomictic lineage of the common dandelion (Taraxacum officinale) to contribute to our knowledge of genome evolution under asexuality. Using RNA-Seq, we show that about one third of heritable divergence within the apomictic lineage is driven by TEs and TE-related gene activity. In addition, we identify non-random transcriptional differences in pathways related to acyl-lipid and abscisic acid metabolisms which might reflect functional divergence within the apomictic lineage. We analyze SNPs in the transcriptome to assess genetic divergence between the apomictic clone members and reveal that heritable expression differences between the accessions are not explained simply by genome-wide genetic divergence. The present study depicts a first effort towards a more complete understanding of apomictic plant genome evolution. We identify abundant TE activity and ecologically relevant functional genes and pathways affecting heritable within-lineage expression divergence. These findings offer valuable resources for future work looking at epigenetic silencing and Cis-regulation of gene expression with particular emphasis on the effects of TE activity on asexual species' genome.
Galick, Heather A.; Marsden, Carolyn G.; Kathe, Scott; Dragon, Julie A.; Volk, Lindsay; Nemec, Antonia A.; Wallace, Susan S.; Prakash, Aishwarya; Doublié, Sylvie; Sweasy, Joann B.
2017-01-01
Base excision repair (BER) is a key genome maintenance pathway. The NEIL1 DNA glycosylase recognizes oxidized bases, and likely removes damage in advance of the replication fork. The rs5745906 SNP of the NEIL1 gene is a rare human germline variant that encodes the NEIL1 G83D protein, which is devoid of DNA glycosylase activity. Here we show that expression of G83D NEIL1 in MCF10A immortalized but non-transformed mammary epithelial cells leads to replication fork stress. Upon treatment with hydrogen peroxide, we observe increased levels of stalled replication forks in cells expressing G83D NEIL1 versus cells expressing the wild-type (WT) protein. Double-strand breaks (DSBs) arise in G83D-expressing cells during the S and G2/M phases of the cell cycle. Interestingly, these breaks result in genomic instability in the form of high levels of chromosomal aberrations and micronuclei. Cells expressing G83D also grow in an anchorage independent manner, suggesting that the genomic instability results in a carcinogenic phenotype. Our results are consistent with the idea that an inability to remove oxidative damage in an efficient manner at the replication fork leads to genomic instability and mutagenesis. We suggest that individuals who harbor the G83D NEIL1 variant face an increased risk for human cancer. PMID:29156764
Núñez-Acuña, Gustavo; Aguilar-Espinoza, Andrea; Gallardo-Escárate, Cristian
2013-03-01
Despite the great relevance of mitochondrial genome analysis in evolutionary studies, there is scarce information on how the transcripts associated with the mitogenome are expressed and their role in the genetic structuring of populations. This work reports the complete mitochondrial genome of the marine gastropod Concholepas concholepas, obtained by 454 pryosequencing, and an analysis of mitochondrial transcripts of two populations 1000 km apart along the Chilean coast. The mitochondrion of C. concholepas is 15,495 base pairs (bp) in size and contains the 37 subunits characteristic of metazoans, as well as a non-coding region of 330 bp. In silico analysis of mitochondrial gene variability showed significant differences among populations. In terms of levels of relative abundance of transcripts associated with mitochondrion in the two populations (assessed by qPCR), the genes associated with complexes III and IV of the mitochondrial genome had the highest levels of expression in the northern population while transcripts associated with the ATP synthase complex had the highest levels of expression in the southern population. Moreover, fifteen polymorphic SNPs were identified in silico between the mitogenomes of the two populations. Four of these markers implied different amino acid substitutions (non-synonymous SNPs). This work contributes novel information regarding the mitochondrial genome structure and mRNA expression levels of C. concholepas. Copyright © 2012 Elsevier Inc. All rights reserved.
Wei, Min; Yokoyama, Tadashi; Minamisawa, Kiwamu; Mitsui, Hisayuki; Itakura, Manabu; Kaneko, Takakazu; Tabata, Satoshi; Saeki, Kazuhiko; Omori, Hirofumi; Tajima, Shigeyuki; Uchiumi, Toshiki; Abe, Mikiko; Ohwada, Takuji
2008-08-01
Initial interaction between rhizobia and legumes actually starts via encounters of both partners in the rhizosphere. In this study, the global expression profiles of Bradyrhizobium japonicum USDA 110 in response to soybean (Glycine max) seed extracts (SSE) and genistein, a major soybean-released isoflavone for nod genes induction of B. japonicum, were compared. SSE induced many genomic loci as compared with genistein (5.0 microM), nevertheless SSE-supplemented medium contained 4.7 microM genistein. SSE markedly induced four predominant genomic regions within a large symbiosis island (681 kb), which include tts genes (type III secretion system) and various nod genes. In addition, SSE-treated cells expressed many genomic loci containing genes for polygalacturonase (cell-wall degradation), exopolysaccharide synthesis, 1-aminocyclopropane-1-carboxylate deaminase, ribosome proteins family and energy metabolism even outside symbiosis island. On the other hand, genistein-treated cells exclusively showed one expression cluster including common nod gene operon within symbiosis island and six expression loci including multidrug resistance, which were shared with SSE-treated cells. Twelve putatively regulated genes were indeed validated by quantitative RT-PCR. Several SSE-induced genomic loci likely participate in the initial interaction with legumes. Thus, these results can provide a basic knowledge for screening novel genes relevant to the B. japonicum- soybean symbiosis.
ArrayExpress update--trends in database growth and links to data analysis tools.
Rustici, Gabriella; Kolesnikov, Nikolay; Brandizi, Marco; Burdett, Tony; Dylag, Miroslaw; Emam, Ibrahim; Farne, Anna; Hastings, Emma; Ison, Jon; Keays, Maria; Kurbatova, Natalja; Malone, James; Mani, Roby; Mupo, Annalisa; Pedro Pereira, Rui; Pilicheva, Ekaterina; Rung, Johan; Sharma, Anjan; Tang, Y Amy; Ternent, Tobias; Tikhonov, Andrew; Welter, Danielle; Williams, Eleanor; Brazma, Alvis; Parkinson, Helen; Sarkans, Ugis
2013-01-01
The ArrayExpress Archive of Functional Genomics Data (http://www.ebi.ac.uk/arrayexpress) is one of three international functional genomics public data repositories, alongside the Gene Expression Omnibus at NCBI and the DDBJ Omics Archive, supporting peer-reviewed publications. It accepts data generated by sequencing or array-based technologies and currently contains data from almost a million assays, from over 30 000 experiments. The proportion of sequencing-based submissions has grown significantly over the last 2 years and has reached, in 2012, 15% of all new data. All data are available from ArrayExpress in MAGE-TAB format, which allows robust linking to data analysis and visualization tools, including Bioconductor and GenomeSpace. Additionally, R objects, for microarray data, and binary alignment format files, for sequencing data, have been generated for a significant proportion of ArrayExpress data.
Golden Gate Assembly of CRISPR gRNA expression array for simultaneously targeting multiple genes.
Vad-Nielsen, Johan; Lin, Lin; Bolund, Lars; Nielsen, Anders Lade; Luo, Yonglun
2016-11-01
The engineered CRISPR/Cas9 technology has developed as the most efficient and broadly used genome editing tool. However, simultaneously targeting multiple genes (or genomic loci) in the same individual cells using CRISPR/Cas9 remain one technical challenge. In this article, we have developed a Golden Gate Assembly method for the generation of CRISPR gRNA expression arrays, thus enabling simultaneous gene targeting. Using this method, the generation of CRISPR gRNA expression array can be accomplished in 2 weeks, and contains up to 30 gRNA expression cassettes. We demonstrated in the study that simultaneously targeting 10 genomic loci or simultaneously inhibition of multiple endogenous genes could be achieved using the multiplexed gRNA expression array vector in human cells. The complete set of plasmids is available through the non-profit plasmid repository Addgene.
Genomic and post-genomic effects of anti-glaucoma drugs preservatives in trabecular meshwork.
Izzotti, Alberto; La Maestra, Sebastiano; Micale, Rosanna Tindara; Longobardi, Maria Grazia; Saccà, Sergio Claudio
2015-02-01
Oxidative stress plays an important role in glaucoma. Some preservatives of anti-glaucoma drugs, commonly used in glaucoma therapy, can prevent or induce oxidative stress in the trabecular meshwork. The aim of this study is to evaluate cellular and molecular damage induced in trabecular meshwork by preservatives contained in anti-glaucoma drugs. Cell viability (MTT test), DNA fragmentation (Comet test), oxidative DNA damage (8-oxo-dG), and gene expression (cDNA microarray) have been evaluated in trabecular meshwork specimens and in human trabecular meshwork cells treated with benzalkonium chloride, polyQuad, purite, and sofzia-like mixture. Moreover, antimicrobial effectiveness and safety of preservative contents in drugs was tested. In ex vivo experiments, benzalkonium chloride and polyQuad induced high level of DNA damage in trabecular meshwork specimens, while the effect of purite and sofzia were more attenuated. The level of DNA fragmentation induced by benzalkonium chloride was 2.4-fold higher in subjects older than 50 years than in younger subjects. Benzalkonium chloride, and polyQuad significantly increased oxidative DNA damage as compared to sham-treated specimens. Gene expression was altered by benzalkonium chloride, polyQuad, and purite but not by sofzia. In in vitro experiments, benzalkonium chloride and polyQuad dramatically decreased trabecular meshwork cell viability, increased DNA fragmentation, and altered gene expression. A lesser effect was also exerted by purite and sofzia. Genes targeted by these alterations included Fas and effector caspase-3. The efficacy of the preservatives in inhibiting bacterial growth increased the adverse effects in trabecular meshwork in terms of DNA damage and alteration of gene expression. Presented data indicates the delicate balance between efficacy and safety of drug preservatives as not yet optimized. Copyright © 2014 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Castanera, Raul; Lopez-Varas, Leticia; Borgognone, Alessandra
Transposable elements (TEs) are exceptional contributors to eukaryotic genome diversity. Their ubiquitous presence impacts the genomes of nearly all species and mediates genome evolution by causing mutations and chromosomal rearrangements and by modulating gene expression. We performed an exhaustive analysis of the TE content in 18 fungal genomes, including strains of the same species and species of the same genera. Our results depicted a scenario of exceptional variability, with species having 0.02 to 29.8% of their genome consisting of transposable elements. A detailed analysis performed on two strains of Pleurotus ostreatus uncovered a genome that is populated mainly by Classmore » I elements, especially LTR-retrotransposons amplified in recent bursts from 0 to 2 million years (My) ago. The preferential accumulation of TEs in clusters led to the presence of genomic regions that lacked intra- and inter-specific conservation. In addition, we investigated the effect of TE insertions on the expression of their nearby upstream and downstream genes. Our results showed that an important number of genes under TE influence are significantly repressed, with stronger repression when genes are localized within transposon clusters. Our transcriptional analysis performed in four additional fungal models revealed that this TE-mediated silencing was present only in species with active cytosine methylation machinery. We hypothesize that this phenomenon is related to epigenetic defense mechanisms that are aimed to suppress TE expression and control their proliferation.« less
Castanera, Raul; Lopez-Varas, Leticia; Borgognone, Alessandra; ...
2016-06-13
Transposable elements (TEs) are exceptional contributors to eukaryotic genome diversity. Their ubiquitous presence impacts the genomes of nearly all species and mediates genome evolution by causing mutations and chromosomal rearrangements and by modulating gene expression. We performed an exhaustive analysis of the TE content in 18 fungal genomes, including strains of the same species and species of the same genera. Our results depicted a scenario of exceptional variability, with species having 0.02 to 29.8% of their genome consisting of transposable elements. A detailed analysis performed on two strains of Pleurotus ostreatus uncovered a genome that is populated mainly by Classmore » I elements, especially LTR-retrotransposons amplified in recent bursts from 0 to 2 million years (My) ago. The preferential accumulation of TEs in clusters led to the presence of genomic regions that lacked intra- and inter-specific conservation. In addition, we investigated the effect of TE insertions on the expression of their nearby upstream and downstream genes. Our results showed that an important number of genes under TE influence are significantly repressed, with stronger repression when genes are localized within transposon clusters. Our transcriptional analysis performed in four additional fungal models revealed that this TE-mediated silencing was present only in species with active cytosine methylation machinery. We hypothesize that this phenomenon is related to epigenetic defense mechanisms that are aimed to suppress TE expression and control their proliferation.« less
Verma, Jitendra Kumar; Wardhan, Vijay; Singh, Deepali; Chakraborty, Subhra; Chakraborty, Niranjan
2018-01-01
Architectural proteins play key roles in genome construction and regulate the expression of many genes, albeit the modulation of genome plasticity by these proteins is largely unknown. A critical screening of the architectural proteins in five crop species, viz., Oryza sativa, Zea mays, Sorghum bicolor, Cicer arietinum, and Vitis vinifera, and in the model plant Arabidopsis thaliana along with evolutionary relevant species such as Chlamydomonas reinhardtii, Physcomitrella patens, and Amborella trichopoda, revealed 9, 20, 10, 7, 7, 6, 1, 4, and 4 Alba (acetylation lowers binding affinity) genes, respectively. A phylogenetic analysis of the genes and of their counterparts in other plant species indicated evolutionary conservation and diversification. In each group, the structural components of the genes and motifs showed significant conservation. The chromosomal location of the Alba genes of rice (OsAlba), showed an unequal distribution on 8 of its 12 chromosomes. The expression profiles of the OsAlba genes indicated a distinct tissue-specific expression in the seedling, vegetative, and reproductive stages. The quantitative real-time PCR (qRT-PCR) analysis of the OsAlba genes confirmed their stress-inducible expression under multivariate environmental conditions and phytohormone treatments. The evaluation of the regulatory elements in 68 Alba genes from the 9 species studied led to the identification of conserved motifs and overlapping microRNA (miRNA) target sites, suggesting the conservation of their function in related proteins and a divergence in their biological roles across species. The 3D structure and the prediction of putative ligands and their binding sites for OsAlba proteins offered a key insight into the structure–function relationship. These results provide a comprehensive overview of the subtle genetic diversification of the OsAlba genes, which will help in elucidating their functional role in plants. PMID:29597290
Yi, Yanglei; Zhang, Zhenhua; Zhao, Fan; Liu, Huan; Yu, Lijun; Zha, Jiwei; Wang, Gaoxue
2018-07-01
This study evaluated the probiotic potential of B. velezensis JW through experimental and genomic analysis approaches. Strain JW showed antimicrobial activity against a broad range of fish pathogenic bacteria including Aeromonas hydrophila, Aeromonas salmonicida, Lactococcus garvieae, Streptococcus agalactiae, and Vibrio Parahemolyticus. Fish (Carassius auratus) were fed with the diets containing 0 (control), 10 7 , and 10 9 cfu/g of B. velezensis JW for 4 weeks. Various immune parameters were examined at 1, 2, 3, and 4 weeks of post-feeding. Results showed that JW supplemented diets significantly increased acid phosphatase (ACP), alkaline phosphatase (AKP), and glutathione peroxidase (GSH-PX) activity. The mRNA expression of immune-related genes in the head kidney of C. auratus was measured. Among them, the interferon gamma gene (IFN- γ) and tumor necrosis factor-α (TNF-α) showed higher expression after 3 and 4 weeks of feeding (P < 0.05). The expression of interleukin-1 (IL-1) only being significantly upregulated by 10 9 cfu/g of JW after 1 week of feeding (P < 0.05). The upregulation of interleukin-4 (IL-4) increased over time from 1st to 4th week. The expression of interleukin-10 (IL-10) and interleukin-12 (IL-12) showed an opposite expression pattern with IL-10 significantly upregulated and IL-12 significantly downregulated by JW containing diets at 2, 3, and 4 weeks of post-feeding (P < 0.05). Moreover, fish fed with JW supplemented diets showed significantly improved survival rate after A. hydrophila infection. The analysis of the genome of JW revealed several features aiding host health and being relevant to the GIT adaptation. Four bacteriocins, three Polyketide Synthetase (PKS), and five Nonribosomal Peptide-Synthetase (NRPS) gene clusters were identified in the genome. In summary, the above results clearly proved that B. velezensis JW has the potential to be developed as a probiotic agent in aquaculture. Copyright © 2018 Elsevier Ltd. All rights reserved.
Acosta-Pech, Rocío; Crossa, José; de Los Campos, Gustavo; Teyssèdre, Simon; Claustres, Bruno; Pérez-Elizalde, Sergio; Pérez-Rodríguez, Paulino
2017-07-01
A new genomic model that incorporates genotype × environment interaction gave increased prediction accuracy of untested hybrid response for traits such as percent starch content, percent dry matter content and silage yield of maize hybrids. The prediction of hybrid performance (HP) is very important in agricultural breeding programs. In plant breeding, multi-environment trials play an important role in the selection of important traits, such as stability across environments, grain yield and pest resistance. Environmental conditions modulate gene expression causing genotype × environment interaction (G × E), such that the estimated genetic correlations of the performance of individual lines across environments summarize the joint action of genes and environmental conditions. This article proposes a genomic statistical model that incorporates G × E for general and specific combining ability for predicting the performance of hybrids in environments. The proposed model can also be applied to any other hybrid species with distinct parental pools. In this study, we evaluated the predictive ability of two HP prediction models using a cross-validation approach applied in extensive maize hybrid data, comprising 2724 hybrids derived from 507 dent lines and 24 flint lines, which were evaluated for three traits in 58 environments over 12 years; analyses were performed for each year. On average, genomic models that include the interaction of general and specific combining ability with environments have greater predictive ability than genomic models without interaction with environments (ranging from 12 to 22%, depending on the trait). We concluded that including G × E in the prediction of untested maize hybrids increases the accuracy of genomic models.
Do Deregulated Cas Proteins Induce Genomic Instability in Early-Stage Ovarian Cancer
2006-12-01
use Western blot analysis of tumor lysates to correlate expression of HEF1, p130Cas, Aurora A, and phospho-Aurora A. This analysis is in progress. In...and importantly, evaluated a number of different detection/image analysis systems to ensure reproducible quantitative results. We have used a pilot...reproducible Interestingly, preliminary statistical analysis using Spearman and Pearson correlation indicates at least one striking correlation
TP53 and ATM mRNA expression in skin and skeletal muscle after low-level laser exposure.
Guedes de Almeida, Luciana; Sergio, Luiz Philippe da Silva; de Paoli, Flavia; Mencalha, Andre Luiz; da Fonseca, Adenilson de Souza
2017-08-01
Low-level lasers are widespread in regenerative medicine, but the molecular mechanisms involved in their biological effects are not fully understood, particularly those on DNA stability. Therefore, this study aimed to investigate mRNA expression of genes related to DNA genomic stability in skin and skeletal muscle tissue from Wistar rats exposed to low-level red and infrared lasers. For this, TP53 (Tumor Protein 53) and ATM (Ataxia Telangiectasia Mutated gene) mRNA expressions were evaluated by real-time quantitative PCR (RT-qPCR) technique 24 hours after low-level red and infrared laser exposure. Our data showed that relative TP53 mRNA expression was not significantly altered in both tissues exposed to lasers. For ATM, relative mRNA expression in skin tissue was not significantly altered, but in muscle tissue, laser exposure increased relative ATM mRNA expression. Low-level red and infrared laser radiations alter ATM mRNA expression related to DNA stability in skeletal muscle tissue.
USDA-ARS?s Scientific Manuscript database
Tomato Functional Genomics Database (TFGD; http://ted.bti.cornell.edu) provides a comprehensive systems biology resource to store, mine, analyze, visualize and integrate large-scale tomato functional genomics datasets. The database is expanded from the previously described Tomato Expression Database...
APPLICATION OF DNA MICROARRAYS TO REPRODUCTIVE TOXICOLOGY AND THE DEVELOPMENT OF A TESTIS ARRAY
With the advent of sequence information for entire mammalian genomes, it is now possible to analyze gene expression and gene polymorphisms on a genomic scale. The primary tool for analysis of gene expression is the DNA microarray. We have used commercially available cDNA micro...
A simple Gateway-assisted construction system of TALEN genes for plant genome editing.
Kusano, Hiroaki; Onodera, Hitomi; Kihira, Miho; Aoki, Hiromi; Matsuzaki, Hikaru; Shimada, Hiroaki
2016-07-25
TALEN is an artificial nuclease being applied for sequence-specific genome editing. For the plant genome editing, a pair of TALEN genes is expressed in the cells, and a binary plasmid for Agrobacterium-mediated transformation should be assembled. We developed a novel procedure using the Gateway-assisted plasmids, named Emerald-Gateway TALEN system. We constructed entry vectors, pPlat plasmids, for construction of a desired TALEN gene using Platinum Gate TALEN kit. We also created destination plasmid, pDual35SGw1301, which allowed two TALEN genes to both DNA strands to recruit using Gateway technology. Resultant TALEN genes were evaluated by the single-strand annealing (SSA) assay in E. coli cells. By this assay, the TALENs recognized the corresponding targets in the divided luciferase gene, and induced a specific recombination to generate an active luciferase gene. Using the TALEN genes constructed, we created a transformant potato cells in which a site-specific mutation occurred at the target site of the GBSS gene. This suggested that our system worked effectively and was applicable as a convenient tool for the plant genome editing.
Piras, Bryan A; O'Connor, Daniel M; French, Brent A
2013-01-01
AAV9 is a powerful gene delivery vehicle capable of providing long-term gene expression in a variety of cell types, particularly cardiomyocytes. The use of AAV-delivery for RNA interference is an intense area of research, but a comprehensive analysis of knockdown in cardiac and liver tissues after systemic delivery of AAV9 has yet to be reported. We sought to address this question by using AAV9 to deliver a short-hairpin RNA targeting the enhanced green fluorescent protein (GFP) in transgenic mice that constitutively overexpress GFP in all tissues. The expression cassette was initially tested in vitro and we demonstrated a 61% reduction in mRNA and a 90% reduction in GFP protein in dual-transfected 293 cells. Next, the expression cassette was packaged as single-stranded genomes in AAV9 capsids to test cardiac GFP knockdown with several doses ranging from 1.8×10(10) to 1.8×10(11) viral genomes per mouse and a dose-dependent response was obtained. We then analyzed GFP expression in both heart and liver after delivery of 4.4×10(11) viral genomes per mouse. We found that while cardiac knockdown was highly efficient, with a 77% reduction in GFP mRNA and a 71% reduction in protein versus control-treated mice, there was no change in liver expression. This was despite a 4.5-fold greater number of viral genomes in the liver than in the heart. This study demonstrates that single-stranded AAV9 vectors expressing shRNA can be used to achieve highly efficient cardiac-selective knockdown of GFP expression that is sustained for at least 7 weeks after the systemic injection of 8 day old mice, with no change in liver expression and no evidence of liver damage despite high viral genome presence in the liver.
An expanding universe of the non-coding genome in cancer biology.
Xue, Bin; He, Lin
2014-06-01
Neoplastic transformation is caused by accumulation of genetic and epigenetic alterations that ultimately convert normal cells into tumor cells with uncontrolled proliferation and survival, unlimited replicative potential and invasive growth [Hanahan,D. et al. (2011) Hallmarks of cancer: the next generation. Cell, 144, 646-674]. Although the majority of the cancer studies have focused on the functions of protein-coding genes, emerging evidence has started to reveal the importance of the vast non-coding genome, which constitutes more than 98% of the human genome. A number of non-coding RNAs (ncRNAs) derived from the 'dark matter' of the human genome exhibit cancer-specific differential expression and/or genomic alterations, and it is increasingly clear that ncRNAs, including small ncRNAs and long ncRNAs (lncRNAs), play an important role in cancer development by regulating protein-coding gene expression through diverse mechanisms. In addition to ncRNAs, nearly half of the mammalian genomes consist of transposable elements, particularly retrotransposons. Once depicted as selfish genomic parasites that propagate at the expense of host fitness, retrotransposon elements could also confer regulatory complexity to the host genomes during development and disease. Reactivation of retrotransposons in cancer, while capable of causing insertional mutagenesis and genome rearrangements to promote oncogenesis, could also alter host gene expression networks to favor tumor development. Taken together, the functional significance of non-coding genome in tumorigenesis has been previously underestimated, and diverse transcripts derived from the non-coding genome could act as integral functional components of the oncogene and tumor suppressor network. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
2012-01-01
Background The Azadirachta indica (neem) tree is a source of a wide number of natural products, including the potent biopesticide azadirachtin. In spite of its widespread applications in agriculture and medicine, the molecular aspects of the biosynthesis of neem terpenoids remain largely unexplored. The current report describes the draft genome and four transcriptomes of A. indica and attempts to contextualise the sequence information in terms of its molecular phylogeny, transcript expression and terpenoid biosynthesis pathways. A. indica is the first member of the family Meliaceae to be sequenced using next generation sequencing approach. Results The genome and transcriptomes of A. indica were sequenced using multiple sequencing platforms and libraries. The A. indica genome is AT-rich, bears few repetitive DNA elements and comprises about 20,000 genes. The molecular phylogenetic analyses grouped A. indica together with Citrus sinensis from the Rutaceae family validating its conventional taxonomic classification. Comparative transcript expression analysis showed either exclusive or enhanced expression of known genes involved in neem terpenoid biosynthesis pathways compared to other sequenced angiosperms. Genome and transcriptome analyses in A. indica led to the identification of repeat elements, nucleotide composition and expression profiles of genes in various organs. Conclusions This study on A. indica genome and transcriptomes will provide a model for characterization of metabolic pathways involved in synthesis of bioactive compounds, comparative evolutionary studies among various Meliaceae family members and help annotate their genomes. A better understanding of molecular pathways involved in the azadirachtin synthesis in A. indica will pave ways for bulk production of environment friendly biopesticides. PMID:22958331
Parvovirus-Derived Endogenous Viral Elements in Two South American Rodent Genomes
2014-01-01
We describe endogenous viral elements (EVEs) derived from parvoviruses (family Parvoviridae) in the genomes of the long-tailed chinchilla (Chinchilla lanigera) and the degu (Octodon degus). The novel EVEs include dependovirus-related elements and representatives of a clearly distinct parvovirus lineage that also has endogenous representatives in marsupial genomes. In the degu, one dependovirus-derived EVE was found to carry an intact reading frame and was differentially expressed in vivo, with increased expression in the liver. PMID:25078696
The biology and evolution of transposable elements in parasites.
Thomas, M Carmen; Macias, Francisco; Alonso, Carlos; López, Manuel C
2010-07-01
Transposable elements (TEs) are dynamic elements that can reshape host genomes by generating rearrangements with the potential to create or disrupt genes, to shuffle existing genes, and to modulate their patterns of expression. In the genomes of parasites that infect mammals several TEs have been identified that probably have been maintained throughout evolution due to their contribution to gene function and regulation of gene expression. This review addresses how TEs are organized, how they colonize the genomes of mammalian parasites, the functional role these elements play in parasite biology, and the interactions between these elements and the parasite genome. Copyright 2010 Elsevier Ltd. All rights reserved.
Barlow, Denise P.
2014-01-01
Genomic imprinting affects a subset of genes in mammals and results in a monoallelic, parental-specific expression pattern. Most of these genes are located in clusters that are regulated through the use of insulators or long noncoding RNAs (lncRNAs). To distinguish the parental alleles, imprinted genes are epigenetically marked in gametes at imprinting control elements through the use of DNA methylation at the very least. Imprinted gene expression is subsequently conferred through lncRNAs, histone modifications, insulators, and higher-order chromatin structure. Such imprints are maintained after fertilization through these mechanisms despite extensive reprogramming of the mammalian genome. Genomic imprinting is an excellent model for understanding mammalian epigenetic regulation. PMID:24492710
Camacho, Luísa; Basavarajappa, Mallikarjuna S.; Chang, Ching-Wei; Han, Tao; Kobets, Tetyana; Koturbash, Igor; Surratt, Gordon; Lewis, Sherry M.; Vanlandingham, Michelle M.; Fuscoe, James C.; da Costa, Gonçalo Gamboa; Pogribny, Igor P.; Delclos, K. Barry
2015-01-01
Bisphenol A (BPA), an industrial chemical used in the manufacture of polycarbonate and epoxy resins, binds to the nuclear estrogen receptor with an affinity 4–5 orders of magnitude lower than that of estradiol. We reported previously that “high BPA” (100,000 and 300,000 μg/kg body weight (bw)/day), but not “low BPA” [2.5–2700 μg/kg bw/day], induced clear adverse effects in NCTR Sprague-Dawley rats gavaged daily from gestation day 6 through postnatal day 90. The “high BPA” effects partially overlapped those of ethinyl estradiol (EE2, 0.5 and 5.0 μg/kg bw/day). To evaluate further the potential of “low BPA” to induce biological effects, here we assessed the global genomic DNA methylation and gene expression in the prostate and female mammary glands, tissues identified previously as potential targets of BPA, and uterus, a sensitive estrogen-responsive tissue. Both doses of EE2 modulated gene expression, including of known estrogen-responsive genes, and PND 4 global gene expression data showed a partial overlap of the “high BPA” effects with those of EE2. The “low BPA” doses modulated the expression of several genes; however, the absence of a dose response reduces the likelihood that these changes were causally linked to the treatment. These results are consistent with the toxicity outcomes. PMID:25862956
2013-01-01
Background Sequence-specific DNA-binding proteins, with their paramount importance in the regulation of expression of the genetic material, are encoded by approximately 5% of the genes in an animal’s genome. But it is unclear to what extent alternative transcripts from these genes may further increase the complexity of the transcription factor complement. Results Of the 938 potential C. elegans transcription factor genes, 197 were annotated in WormBase as encoding at least two distinct isoforms. Evaluation of prior evidence identified, with different levels of confidence, 50 genes with alternative transcript starts, 23 with alternative transcript ends, 35 with alternative splicing and 34 with alternative transcripts generated by a combination of mechanisms, leaving 55 that were discounted. Expression patterns were determined for transcripts for a sample of 29 transcription factor genes, concentrating on those with alternative transcript starts for which the evidence was strongest. Seamless fosmid recombineering was used to generate reporter gene fusions with minimal modification to assay expression of specific transcripts while maintaining the broad genomic DNA context and alternative transcript production. Alternative transcription factor gene transcripts were typically expressed with identical or substantially overlapping distributions rather than in distinct domains. Conclusions Increasingly sensitive sequencing technologies will reveal rare transcripts but many of these are clearly non-productive. The majority of the transcription factor gene alternative transcripts that are productive may represent tolerable noise rather than encoding functionally distinct isoforms. PMID:23586691
Swindell, William R; Johnston, Andrew; Carbajal, Steve; Han, Gangwen; Wohn, Christian; Lu, Jun; Xing, Xianying; Nair, Rajan P; Voorhees, John J; Elder, James T; Wang, Xiao-Jing; Sano, Shigetoshi; Prens, Errol P; DiGiovanni, John; Pittelkow, Mark R; Ward, Nicole L; Gudjonsson, Johann E
2011-04-04
Development of a suitable mouse model would facilitate the investigation of pathomechanisms underlying human psoriasis and would also assist in development of therapeutic treatments. However, while many psoriasis mouse models have been proposed, no single model recapitulates all features of the human disease, and standardized validation criteria for psoriasis mouse models have not been widely applied. In this study, whole-genome transcriptional profiling is used to compare gene expression patterns manifested by human psoriatic skin lesions with those that occur in five psoriasis mouse models (K5-Tie2, imiquimod, K14-AREG, K5-Stat3C and K5-TGFbeta1). While the cutaneous gene expression profiles associated with each mouse phenotype exhibited statistically significant similarity to the expression profile of psoriasis in humans, each model displayed distinctive sets of similarities and differences in comparison to human psoriasis. For all five models, correspondence to the human disease was strong with respect to genes involved in epidermal development and keratinization. Immune and inflammation-associated gene expression, in contrast, was more variable between models as compared to the human disease. These findings support the value of all five models as research tools, each with identifiable areas of convergence to and divergence from the human disease. Additionally, the approach used in this paper provides an objective and quantitative method for evaluation of proposed mouse models of psoriasis, which can be strategically applied in future studies to score strengths of mouse phenotypes relative to specific aspects of human psoriasis.
CRISPR-Cas9D10A Nickase-Assisted Genome Editing in Lactobacillus casei
Song, Xin; Huang, He; Xiong, Zhiqiang
2017-01-01
ABSTRACT Lactobacillus casei has drawn increasing attention as a health-promoting probiotic, while effective genetic manipulation tools are often not available, e.g., the single-gene knockout in L. casei still depends on the classic homologous recombination-dependent double-crossover strategy, which is quite labor-intensive and time-consuming. In the present study, a rapid and precise genome editing plasmid, pLCNICK, was established for L. casei genome engineering based on CRISPR-Cas9D10A. In addition to the P23-Cas9D10A and Pldh-sgRNA (single guide RNA) expression cassettes, pLCNICK includes the homologous arms of the target gene as repair templates. The ability and efficiency of chromosomal engineering using pLCNICK were evaluated by in-frame deletions of four independent genes and chromosomal insertion of an enhanced green fluorescent protein (eGFP) expression cassette at the LC2W_1628 locus. The efficiencies associated with in-frame deletions and chromosomal insertion is 25 to 62%. pLCNICK has been proved to be an effective, rapid, and precise tool for genome editing in L. casei, and its potential application in other lactic acid bacteria (LAB) is also discussed in this study. IMPORTANCE The lack of efficient genetic tools has limited the investigation and biotechnological application of many LAB. The CRISPR-Cas9D10A nickase-based genome editing in Lactobacillus casei, an important food industrial microorganism, was demonstrated in this study. This genetic tool allows efficient single-gene deletion and insertion to be accomplished by one-step transformation, and the cycle time is reduced to 9 days. It facilitates a rapid and precise chromosomal manipulation in L. casei and overcomes some limitations of previous methods. This editing system can serve as a basic technological platform and offers the possibility to start a comprehensive investigation on L. casei. As a broad-host-range plasmid, pLCNICK has the potential to be adapted to other Lactobacillus species for genome editing. PMID:28864652
The eastern oyster genome: A resource for comparative genomics in shellfish aquaculture species
USDA-ARS?s Scientific Manuscript database
Oyster aquaculture is an important sector of world food production. As such, it is imperative to develop a high quality reference genome for the eastern oyster, Crassostrea virginica, to assist in the elucidation of the genomic basis of commercially important traits. All genetic, gene expression and...
The bovine lactation genome: Insights into the evolution of mammalian milk
USDA-ARS?s Scientific Manuscript database
The newly assembled Bos Taurus genome sequence enables the linkage of bovine milk and lactation data with other mammalian genomes. Using publicly available milk proteome data and mammary expressed sequence tags, 197 milk protein genes and over 6,000 mammary genes were identified in the bovine genome...
Zhao, Jie
2010-01-01
Arabinogalactan proteins (AGPs) comprise a family of hydroxyproline-rich glycoproteins that are implicated in plant growth and development. In this study, 69 AGPs are identified from the rice genome, including 13 classical AGPs, 15 arabinogalactan (AG) peptides, three non-classical AGPs, three early nodulin-like AGPs (eNod-like AGPs), eight non-specific lipid transfer protein-like AGPs (nsLTP-like AGPs), and 27 fasciclin-like AGPs (FLAs). The results from expressed sequence tags, microarrays, and massively parallel signature sequencing tags are used to analyse the expression of AGP-encoding genes, which is confirmed by real-time PCR. The results reveal that several rice AGP-encoding genes are predominantly expressed in anthers and display differential expression patterns in response to abscisic acid, gibberellic acid, and abiotic stresses. Based on the results obtained from this analysis, an attempt has been made to link the protein structures and expression patterns of rice AGP-encoding genes to their functions. Taken together, the genome-wide identification and expression analysis of the rice AGP gene family might facilitate further functional studies of rice AGPs. PMID:20423940
CD274 (PDL1) and JAK2 genomic amplifications in pulmonary squamous-cell and adenocarcinoma patients.
Clavé, Sergi; Pijuan, Lara; Casadevall, David; Taus, Álvaro; Gimeno, Javier; Hernández-Llodrà, Silvia; Rodríguez-Rivera, María; Lorenzo, Marta; Menéndez, Silvia; Albanell, Joan; Espinet, Blanca; Arriola, Edurne; Salido, Marta
2018-01-01
CD274 (PDL1) and JAK2 (9p24.1) gene amplifications have been recently described in pulmonary carcinomas in association with programmed death-ligand 1 (PD-L1) expression. Furthermore, PTEN loss has been explored preclinically in relation to PD-L1 expression. Our aim was to determine whether these genomic alterations affect PD-L1 expression levels in non-small-cell lung cancer. PD-L1 and PTEN expression determined by immunohistochemistry (IHC), and CD274, JAK2 and PTEN copy number alterations (CNAs) determined by fluorescence in-situ hybridisation, were studied in 171 pulmonary carcinoma specimens. PD-L1 expression was positive in 40 cases (23.3%), and CD274 amplification was present in 14 tumours (8.8%). Concordance between both events was found in 12 of 14 amplified cases (P = 0.0001). We found nine JAK2-amplified cases (5.7%), seven with PD-L1 expression (P = 0.0006). Moreover, six of the seven cases had JAK2 and CD274 coamplification (9p24.1 genomic amplification). Remarkably, the average PD-L1 IHC score was higher in these amplified cases (230 versus 80; P = 0.001). Non-statistical associations were observed between PD-L1 expression and PTEN loss and PTEN deletions. We describe a subset of patients (8.2%) who had 9p24.1 amplifications resulting in high expression of PD-L1. Our results provide evidence for genomic up-regulation of PD-L1 expression in non-small-cell lung cancer. © 2017 John Wiley & Sons Ltd.
Jere, Khuzwayo C.; O'Neill, Hester G.; Potgieter, A. Christiaan; van Dijk, Alberdina A.
2014-01-01
Rotavirus virus-like particles (RV-VLPs) are potential alternative non-live vaccine candidates due to their high immunogenicity. They mimic the natural conformation of native viral proteins but cannot replicate because they do not contain genomic material which makes them safe. To date, most RV-VLPs have been derived from cell culture adapted strains or common G1 and G3 rotaviruses that have been circulating in communities for some time. In this study, chimaeric RV-VLPs were generated from the consensus sequences of African rotaviruses (G2, G8, G9 or G12 strains associated with either P[4], P[6] or P[8] genotypes) characterised directly from human stool samples without prior adaptation of the wild type strains to cell culture. Codon-optimised sequences for insect cell expression of genome segments 2 (VP2), 4 (VP4), 6 (VP6) and 9 (VP7) were cloned into a modified pFASTBAC vector, which allowed simultaneous expression of up to four genes using the Bac-to-Bac Baculovirus Expression System (BEVS; Invitrogen). Several combinations of the genome segments originating from different field strains were cloned to produce double-layered RV-VLPs (dRV-VLP; VP2/6), triple-layered RV-VLPs (tRV-VLP; VP2/6/7 or VP2/6/7/4) and chimaeric tRV-VLPs. The RV-VLPs were produced by infecting Spodoptera frugiperda 9 and Trichoplusia ni cells with recombinant baculoviruses using multi-cistronic, dual co-infection and stepwise-infection expression strategies. The size and morphology of the RV-VLPs, as determined by transmission electron microscopy, revealed successful production of RV-VLPs. The novel approach of producing tRV-VLPs, by using the consensus insect cell codon-optimised nucleotide sequence derived from dsRNA extracted directly from clinical specimens, should speed-up vaccine research and development by by-passing the need to adapt rotaviruses to cell culture. Other problems associated with cell culture adaptation, such as possible changes in epitopes, can also be circumvented. Thus, it is now possible to generate tRV-VLPs for evaluation as non-live vaccine candidates for any human or animal field rotavirus strain. PMID:25268783
Muff, Roman; Rath, Prisni; Ram Kumar, Ram Mohan; Husmann, Knut; Born, Walter; Baudis, Michael; Fuchs, Bruno
2015-01-01
Osteosarcoma is a rare but highly malignant cancer of the bone. As a consequence, the number of established cell lines used for experimental in vitro and in vivo osteosarcoma research is limited and the value of these cell lines relies on their stability during culture. Here we investigated the stability in gene expression by microarray analysis and array genomic hybridization of three low metastatic cell lines and derivatives thereof with increased metastatic potential using cells of different passages. The osteosarcoma cell lines showed altered gene expression during in vitro culture, and it was more pronounced in two metastatic cell lines compared to the respective parental cells. Chromosomal instability contributed in part to the altered gene expression in SAOS and LM5 cells with low and high metastatic potential. To identify metastasis-relevant genes in a background of passage-dependent altered gene expression, genes involved in "Pathways in cancer" that were consistently regulated under all passage comparisons were evaluated. Genes belonging to "Hedgehog signaling pathway" and "Wnt signaling pathway" were significantly up-regulated, and IHH, WNT10B and TCF7 were found up-regulated in all three metastatic compared to the parental cell lines. Considerable instability during culture in terms of gene expression and chromosomal aberrations was observed in osteosarcoma cell lines. The use of cells from different passages and a search for genes consistently regulated in early and late passages allows the analysis of metastasis-relevant genes despite the observed instability in gene expression in osteosarcoma cell lines during culture.
Genome-wide analysis of the heat stress response in Zebu (Sahiwal) cattle.
Mehla, Kusum; Magotra, Ankit; Choudhary, Jyoti; Singh, A K; Mohanty, A K; Upadhyay, R C; Srinivasan, Surendran; Gupta, Pankaj; Choudhary, Neelam; Antony, Bristo; Khan, Farheen
2014-01-10
Environmental-induced hyperthermia compromises animal production with drastic economic consequences to global animal agriculture and jeopardizes animal welfare. Heat stress is a major stressor that occurs as a result of an imbalance between heat production within the body and its dissipation and it affects animals at cellular, molecular and ecological levels. The molecular mechanism underlying the physiology of heat stress in the cattle remains undefined. The present study sought to evaluate mRNA expression profiles in the cattle blood in response to heat stress. In this study we report the genes that were differentially expressed in response to heat stress using global scale genome expression technology (Microarray). Four Sahiwal heifers were exposed to 42°C with 90% humidity for 4h followed by normothermia. Gene expression changes include activation of heat shock transcription factor 1 (HSF1), increased expression of heat shock proteins (HSP) and decreased expression and synthesis of other proteins, immune system activation via extracellular secretion of HSP. A cDNA microarray analysis found 140 transcripts to be up-regulated and 77 down-regulated in the cattle blood after heat treatment (P<0.05). But still a comprehensive explanation for the direction of fold change and the specific genes involved in response to acute heat stress still remains to be explored. These findings may provide insights into the underlying mechanism of physiology of heat stress in cattle. Understanding the biology and mechanisms of heat stress is critical to developing approaches to ameliorate current production issues for improving animal performance and agriculture economics. © 2013 Elsevier B.V. All rights reserved.
Song, Jie; Hu, Yajie; Hu, Yunguang; Wang, Jingjing; Zhang, Xiaolong; Wang, Lichun; Guo, Lei; Wang, Yancui; Ning, Ruotong; Liao, Yun; Zhang, Ying; Zheng, Huiwen; Shi, Haijing; He, Zhanlong; Li, Qihan; Liu, Longding
2016-03-02
Coxsackievirus A16 (CA16) is a dominant pathogen that results in hand, foot, and mouth disease and causes outbreaks worldwide, particularly in the Asia-Pacific region. However, the underlying molecular mechanisms remain unclear. Our previous study has demonstrated that the basic CA16 pathogenic process was successfully mimicked in rhesus monkey infant. The present study focused on the global gene expression changes in peripheral blood mononuclear cells of rhesus monkey infants with hand, foot, and mouth disease induced by CA16 infection at different time points. Genome-wide expression analysis was performed with Agilent whole-genome microarrays and established bioinformatics tools. Nine hundred and forty-eight significant differentially expressed genes that were associated with 5 gene ontology categories, including cell communication, cell cycle, immune system process, regulation of transcription and metabolic process were identified. Subsequently, the mapping of genes related to the immune system process by PANTHER pathway analysis revealed the predominance of inflammation mediated by chemokine and cytokine signaling pathways and the interleukin signaling pathway. Ultimately, co-expressed genes and their networks were analyzed. The results revealed the gene expression profile of the immune system in response to CA16 in rhesus monkey infants and suggested that such an immune response was generated as a result of the positive mobilization of the immune system. This initial microarray study will provide insights into the molecular mechanism of CA16 infection and will facilitate the identification of biomarkers for the evaluation of vaccines against this virus. Copyright © 2016 Elsevier B.V. All rights reserved.
Nuclear envelope and genome interactions in cell fate
Talamas, Jessica A.; Capelson, Maya
2015-01-01
The eukaryotic cell nucleus houses an organism’s genome and is the location within the cell where all signaling induced and development-driven gene expression programs are ultimately specified. The genome is enclosed and separated from the cytoplasm by the nuclear envelope (NE), a double-lipid membrane bilayer, which contains a large variety of trans-membrane and associated protein complexes. In recent years, research regarding multiple aspects of the cell nucleus points to a highly dynamic and coordinated concert of efforts between chromatin and the NE in regulation of gene expression. Details of how this concert is orchestrated and how it directs cell differentiation and disease are coming to light at a rapid pace. Here we review existing and emerging concepts of how interactions between the genome and the NE may contribute to tissue specific gene expression programs to determine cell fate. PMID:25852741
Role of the DNA Damage Response in Human Papillomavirus RNA Splicing and Polyadenylation.
Nilsson, Kersti; Wu, Chengjun; Schwartz, Stefan
2018-06-12
Human papillomaviruses (HPVs) have evolved to use the DNA repair machinery to replicate its DNA genome in differentiated cells. HPV activates the DNA damage response (DDR) in infected cells. Cellular DDR factors are recruited to the HPV DNA genome and position the cellular DNA polymerase on the HPV DNA and progeny genomes are synthesized. Following HPV DNA replication, HPV late gene expression is activated. Recent research has shown that the DDR factors also interact with RNA binding proteins and affects RNA processing. DDR factors activated by DNA damage and that associate with HPV DNA can recruit splicing factors and RNA binding proteins to the HPV DNA and induce HPV late gene expression. This induction is the result of altered alternative polyadenylation and splicing of HPV messenger RNA (mRNA). HPV uses the DDR machinery to replicate its DNA genome and to activate HPV late gene expression at the level of RNA processing.
LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes.
Wang, Dapeng; Zhang, Yubin; Fan, Zhonghua; Liu, Guiming; Yu, Jun
2012-01-01
Animal genes of different lineages, such as vertebrates and arthropods, are well-organized and blended into dynamic chromosomal structures that represent a primary regulatory mechanism for body development and cellular differentiation. The majority of genes in a genome are actually clustered, which are evolutionarily stable to different extents and biologically meaningful when evaluated among genomes within and across lineages. Until now, many questions concerning gene organization, such as what is the minimal number of genes in a cluster and what is the driving force leading to gene co-regulation, remain to be addressed. Here, we provide a user-friendly database-LCGbase (a comprehensive database for lineage-based co-regulated genes)-hosting information on evolutionary dynamics of gene clustering and ordering within animal kingdoms in two different lineages: vertebrates and arthropods. The database is constructed on a web-based Linux-Apache-MySQL-PHP framework and effective interactive user-inquiry service. Compared to other gene annotation databases with similar purposes, our database has three comprehensible advantages. First, our database is inclusive, including all high-quality genome assemblies of vertebrates and representative arthropod species. Second, it is human-centric since we map all gene clusters from other genomes in an order of lineage-ranks (such as primates, mammals, warm-blooded, and reptiles) onto human genome and start the database from well-defined gene pairs (a minimal cluster where the two adjacent genes are oriented as co-directional, convergent, and divergent pairs) to large gene clusters. Furthermore, users can search for any adjacent genes and their detailed annotations. Third, the database provides flexible parameter definitions, such as the distance of transcription start sites between two adjacent genes, which is extendable to genes that flanking the cluster across species. We also provide useful tools for sequence alignment, gene ontology (GO) annotation, promoter identification, gene expression (co-expression), and evolutionary analysis. This database not only provides a way to define lineage-specific and species-specific gene clusters but also facilitates future studies on gene co-regulation, epigenetic control of gene expression (DNA methylation and histone marks), and chromosomal structures in a context of gene clusters and species evolution. LCGbase is freely available at http://lcgbase.big.ac.cn/LCGbase.
Comprehensive Genomic Characterization of Upper Tract Urothelial Carcinoma.
Moss, Tyler J; Qi, Yuan; Xi, Liu; Peng, Bo; Kim, Tae-Beom; Ezzedine, Nader E; Mosqueda, Maribel E; Guo, Charles C; Czerniak, Bogdan A; Ittmann, Michael; Wheeler, David A; Lerner, Seth P; Matin, Surena F
2017-10-01
Upper urinary tract urothelial cancer (UTUC) may have unique etiologic and genomic factors compared to bladder cancer. To characterize the genomic landscape of UTUC and provide insights into its biology using comprehensive integrated genomic analyses. We collected 31 untreated snap-frozen UTUC samples from two institutions and carried out whole-exome sequencing (WES) of DNA, RNA sequencing (RNAseq), and protein analysis. Adjusting for batch effects, consensus mutation calls from independent pipelines identified DNA mutations, gene expression clusters using unsupervised consensus hierarchical clustering (UCHC), and protein expression levels that were correlated with relevant clinical variables, The Cancer Genome Atlas, and other published data. WES identified mutations in FGFR3 (74.1%; 92% low-grade, 60% high-grade), KMT2D (44.4%), PIK3CA (25.9%), and TP53 (22.2%). APOBEC and CpG were the most common mutational signatures. UCHC of RNAseq data segregated samples into four molecular subtypes with the following characteristics. Cluster 1: no PIK3CA mutations, nonsmokers, high-grade
Miller, Ian J.; Vanee, Niti; Fong, Stephen S.; Lim-Fong, Grace E.
2016-01-01
ABSTRACT The uncultured bacterial symbiont “Candidatus Endobugula sertula” is known to produce cytotoxic compounds called bryostatins, which protect the larvae of its host, Bugula neritina. The symbiont has never been successfully cultured, and it was thought that its genome might be significantly reduced. Here, we took a shotgun metagenomics and metatranscriptomics approach to assemble and characterize the genome of “Ca. Endobugula sertula.” We found that it had specific metabolic deficiencies in the biosynthesis of certain amino acids but few other signs of genome degradation, such as small size, abundant pseudogenes, and low coding density. We also identified homologs to genes associated with insect pathogenesis in other gammaproteobacteria, and these genes may be involved in host-symbiont interactions and vertical transmission. Metatranscriptomics revealed that these genes were highly expressed in a reproductive host, along with bry genes for the biosynthesis of bryostatins. We identified two new putative bry genes fragmented from the main bry operon, accounting for previously missing enzymatic functions in the pathway. We also determined that a gene previously assigned to the pathway, bryS, is not expressed in reproductive tissue, suggesting that it is not involved in the production of bryostatins. Our findings suggest that “Ca. Endobugula sertula” may be able to live outside the host if its metabolic deficiencies are alleviated by medium components, which is consistent with recent findings that it may be possible for “Ca. Endobugula sertula” to be transmitted horizontally. IMPORTANCE The bryostatins are potent protein kinase C activators that have been evaluated in clinical trials for a number of indications, including cancer and Alzheimer's disease. There is, therefore, considerable interest in securing a renewable supply of these compounds, which is currently only possible through aquaculture of Bugula neritina and total chemical synthesis. However, these approaches are labor-intensive and low-yielding and thus preclude the use of bryostatins as a viable therapeutic agent. Our genome assembly and transcriptome analysis for “Ca. Endobugula sertula” shed light on the metabolism of this symbiont, potentially aiding isolation and culturing efforts. Our identification of additional bry genes may also facilitate efforts to express the complete pathway heterologously. PMID:27590822
Liu, Lijun; Ramsay, Trevor; Zinkgraf, Matthew; Sundell, David; Street, Nathaniel Robert; Filkov, Vladimir; Groover, Andrew
2015-06-01
Identifying transcription factor target genes is essential for modeling the transcriptional networks underlying developmental processes. Here we report a chromatin immunoprecipitation sequencing (ChIP-seq) resource consisting of genome-wide binding regions and associated putative target genes for four Populus homeodomain transcription factors expressed during secondary growth and wood formation. Software code (programs and scripts) for processing the Populus ChIP-seq data are provided within a publically available iPlant image, including tools for ChIP-seq data quality control and evaluation adapted from the human Encyclopedia of DNA Elements (ENCODE) project. Basic information for each transcription factor (including members of Class I KNOX, Class III HD ZIP, BEL1-like families) binding are summarized, including the number and location of binding regions, distribution of binding regions relative to gene features, associated putative target genes, and enriched functional categories of putative target genes. These ChIP-seq data have been integrated within the Populus Genome Integrative Explorer (PopGenIE) where they can be analyzed using a variety of web-based tools. We present an example analysis that shows preferential binding of transcription factor ARBORKNOX1 to the nearest neighbor genes in a pre-calculated co-expression network module, and enrichment for meristem-related genes within this module including multiple orthologs of Arabidopsis KNOTTED-like Arabidopsis 2/6. © 2015 Society for Experimental Biology and John Wiley & Sons Ltd This article has been contributed to by US Government employees and their work is in the public domain in the USA.
Gene and miRNA expression profiles in autism spectrum disorders.
Ghahramani Seno, Mohammad M; Hu, Pingzhao; Gwadry, Fuad G; Pinto, Dalila; Marshall, Christian R; Casallo, Guillermo; Scherer, Stephen W
2011-03-22
Accumulating data indicate that there is significant genetic heterogeneity underlying the etiology in individuals diagnosed with autism spectrum disorder (ASD). Some rare and highly-penetrant gene variants and copy number variation (CNV) regions including NLGN3, NLGN4, NRXN1, SHANK2, SHANK3, PTCHD1, 1q21.1, maternally-inherited duplication of 15q11-q13, 16p11.2, amongst others, have been identified to be involved in ASD. Genome-wide association studies have identified other apparently low risk loci and in some other cases, ASD arises as a co-morbid phenotype with other medical genetic conditions (e.g. fragile X). The progress studying the genetics of ASD has largely been accomplished using genomic analyses of germline-derived DNA. Here, we used gene and miRNA expression profiling using cell-line derived total RNA to evaluate possible transcripts and networks of molecules involved in ASD. Our analysis identified several novel dysregulated genes and miRNAs in ASD compared with controls, including HEY1, SOX9, miR-486 and miR-181b. All of these are involved in nervous system development and function and some others, for example, are involved in NOTCH signaling networks (e.g. HEY1). Further, we found significant enrichment in molecules associated with neurological disorders such as Rett syndrome and those associated with nervous system development and function including long-term potentiation. Our data will provide a valuable resource for discovery purposes and for comparison to other gene expression-based, genome-wide DNA studies and other functional data. Copyright © 2010 Elsevier B.V. All rights reserved.
2008-05-01
DAMD17-03-1-0297 Title: Genomic and Expression Pr ofiling of Benign and Malignant Nerve Sheath Tumors in Neurofibromatosis Patients...have determined the gene expression signature for benign and malignant peripheral nerve sheath tumors and found that the major trend in transformation...However, EGFR data in soft tissue neoplasms is limited. Using a variety of benign and malignant spindle cell neoplasms, we assessed EGFR status by
Complete Mitochondrial Genome of the Medicinal Mushroom Ganoderma lucidum
Chen, Haimei; Chen, Xiangdong; Lan, Jin; Liu, Chang
2013-01-01
Ganoderma lucidum is one of the well-known medicinal basidiomycetes worldwide. The mitochondrion, referred to as the second genome, is an organelle found in most eukaryotic cells and participates in critical cellular functions. Elucidating the structure and function of this genome is important to understand completely the genetic contents of G. lucidum. In this study, we assembled the mitochondrial genome of G. lucidum and analyzed the differential expressions of its encoded genes across three developmental stages. The mitochondrial genome is a typical circular DNA molecule of 60,630 bp with a GC content of 26.67%. Genome annotation identified genes that encode 15 conserved proteins, 27 tRNAs, small and large rRNAs, four homing endonucleases, and two hypothetical proteins. Except for genes encoding trnW and two hypothetical proteins, all genes were located on the positive strand. For the repeat structure analysis, eight forward, two inverted, and three tandem repeats were detected. A pair of fragments with a total length around 5.5 kb was found in both the nuclear and mitochondrial genomes, which suggests the possible transfer of DNA sequences between two genomes. RNA-Seq data for samples derived from three stages, namely, mycelia, primordia, and fruiting bodies, were mapped to the mitochondrial genome and qualified. The protein-coding genes were expressed higher in mycelia or primordial stages compared with those in the fruiting bodies. The rRNA abundances were significantly higher in all three stages. Two regions were transcribed but did not contain any identified protein or tRNA genes. Furthermore, three RNA-editing sites were detected. Genome synteny analysis showed that significant genome rearrangements occurred in the mitochondrial genomes. This study provides valuable information on the gene contents of the mitochondrial genome and their differential expressions at various developmental stages of G. lucidum. The results contribute to the understanding of the functions and evolution of fungal mitochondrial DNA. PMID:23991034
Context-specific metabolic networks are consistent with experiments.
Becker, Scott A; Palsson, Bernhard O
2008-05-16
Reconstructions of cellular metabolism are publicly available for a variety of different microorganisms and some mammalian genomes. To date, these reconstructions are "genome-scale" and strive to include all reactions implied by the genome annotation, as well as those with direct experimental evidence. Clearly, many of the reactions in a genome-scale reconstruction will not be active under particular conditions or in a particular cell type. Methods to tailor these comprehensive genome-scale reconstructions into context-specific networks will aid predictive in silico modeling for a particular situation. We present a method called Gene Inactivity Moderated by Metabolism and Expression (GIMME) to achieve this goal. The GIMME algorithm uses quantitative gene expression data and one or more presupposed metabolic objectives to produce the context-specific reconstruction that is most consistent with the available data. Furthermore, the algorithm provides a quantitative inconsistency score indicating how consistent a set of gene expression data is with a particular metabolic objective. We show that this algorithm produces results consistent with biological experiments and intuition for adaptive evolution of bacteria, rational design of metabolic engineering strains, and human skeletal muscle cells. This work represents progress towards producing constraint-based models of metabolism that are specific to the conditions where the expression profiling data is available.
Oh, Dong-Ha; Hong, Hyewon; Lee, Sang Yeol; Yun, Dae-Jin; Bohnert, Hans J.; Dassanayake, Maheshi
2014-01-01
Schrenkiella parvula (formerly Thellungiella parvula), a close relative of Arabidopsis (Arabidopsis thaliana) and Brassica crop species, thrives on the shores of Lake Tuz, Turkey, where soils accumulate high concentrations of multiple-ion salts. Despite the stark differences in adaptations to extreme salt stresses, the genomes of S. parvula and Arabidopsis show extensive synteny. S. parvula completes its life cycle in the presence of Na+, K+, Mg2+, Li+, and borate at soil concentrations lethal to Arabidopsis. Genome structural variations, including tandem duplications and translocations of genes, interrupt the colinearity observed throughout the S. parvula and Arabidopsis genomes. Structural variations distinguish homologous gene pairs characterized by divergent promoter sequences and basal-level expression strengths. Comparative RNA sequencing reveals the enrichment of ion-transport functions among genes with higher expression in S. parvula, while pathogen defense-related genes show higher expression in Arabidopsis. Key stress-related ion transporter genes in S. parvula showed increased copy number, higher transcript dosage, and evidence for subfunctionalization. This extremophyte offers a framework to identify the requisite adjustments of genomic architecture and expression control for a set of genes found in most plants in a way to support distinct niche adaptation and lifestyles. PMID:24563282
[Correlation of genomic DNA methylation level with unexplained early spontaneous abortion].
Chao, Yuan; Weng, Lidong; Zeng, Rong
2014-10-01
To investigate the correlation of genomic DNA methylation level with unexplained early spontaneous abortion and analyze the role of DNMT1, DNMT3A and DNMT3B. Forty-five villus samples from spontaneous abortion cases (with 33 maternal peripheral blood samples) and 44 villus samples from induced abortion (with 34 maternal peripheral blood samples) were examined with high-pressure liquid chromatography (HPLC) to measure the overall methylation level of the genomic DNA. The expressions of DNMT mRNAs were detected using fluorescence quantitative-PCR in the villus samples from 33 induced abortion cases and 30 spontaneous abortion cases. Genomic DNA methylation level was significantly lower in the villus in spontaneous abortion group than in induced abortion group (P<0.01), but similar in the maternal blood samples between the two groups (P>0.05). The mean mRNA expression levels of DNMT1 and DNMT3A in the villus were significantly lower in spontaneous abortion group than in induced abortion group (P<0.05), but DNMT3B expression showed no significant difference between them (P>0.05). Insufficient genomic DNA methylation in the villus does exist in human early spontaneous abortion, and this insufficiency is probably associated with down-regulated expressions of DNMT1 and DNMT3A.
Kawazoe, Akihito; Shitara, Kohei; Kuboki, Yasutoshi; Bando, Hideaki; Kojima, Takashi; Yoshino, Takayuki; Ohtsu, Atsushi; Ochiai, Atsushi; Togashi, Yosuke; Nishikawa, Hiroyoshi; Doi, Toshihiko; Kuwata, Takeshi
2018-06-01
Recently, the U.S. Food and Drug Administration approved pembrolizumab for patients (pts) with PD-L1-positive metastatic gastric cancer (MGC) based on 22C3 immunohistochemistry (IHC) assay. However, little is known about detailed clinicopathological features of 22C3 PD-L1 expression in MGC. Pts with histologically confirmed MGC were eligible for this prospective observational study. PD-L1 expression (22C3) on tumor cell (TC) or immune cell (IC) and mismatch repair (MMR) were analyzed by IHC. Epstein-Barr virus (EBV) was detected by in situ hybridization. The expressions of tyrosine kinase receptors (RTKs) and cancer genome alterations were evaluated by IHC or next-generation sequencing. A total of 225 pts were analyzed in this study. PD-L1 expression on TC, PD-L1 on IC, MMR-deficient (D-MMR), and EBV positivity were identified in 8.4, 65.3, 6.2, and 6.2% cases, respectively. PD-L1 expression in TC was more frequently observed in pts with D-MMR (P < 0.001), PIK3CA mutation (P = 0.020), and KRAS mutation (P = 0.002), and PD-L1 on IC was associated with EBV positivity (P = 0.034), and lymph-node metastasis (P < 0.001). PD-L1 expression on either IC or TC was less frequently observed in pts with peritoneal metastasis and Borrmann Type 4. A significant association was not observed between PD-L1 expression and RTKs expression or presence of other gene alterations. PD-L1 expression on either TC or IC was not prognostic factor. 22C3 PD-L1 expression in MGC was associated with distinct clinicopathological features, but was not a prognostic factor.
Technological advances and genomics in metazoan parasites.
Knox, D P
2004-02-01
Molecular biology has provided the means to identify parasite proteins, to define their function, patterns of expression and the means to produce them in quantity for subsequent functional analyses. Whole genome and expressed sequence tag programmes, and the parallel development of powerful bioinformatics tools, allow the execution of genome-wide between stage or species comparisons and meaningful gene-expression profiling. The latter can be undertaken with several new technologies such as DNA microarray and serial analysis of gene expression. Proteome analysis has come to the fore in recent years providing a crucial link between the gene and its protein product. RNA interference and ballistic gene transfer are exciting developments which can provide the means to precisely define the function of individual genes and, of importance in devising novel parasite control strategies, the effect that gene knockdown will have on parasite survival.
GENOMIC ORGANIZATION OF THE SP22 GENE AND A UNIQUE PATTERN OF EXPRESSION IN SPERMATOGENIC CELLS
GENOMIC ORGANIZATION OF THE SP22 GENE AND A UNIQUE PATTERN OF EXPRESSION IN SPERMATOGENIC CELLS.
JE Welch*, RR Barbee*, JD Suarez*, NL Roberts*, and GR Klinefelter. Reproductive Toxicology Division, NHEERL, U.S. EPA, Research Triangle Park, NC, USA.
Our laboratory has rep...
Chuzel, Léa; Ganatra, Mehul B.; Schermerhorn, Kelly M.; Gardner, Andrew F.; Anton, Brian P.
2017-01-01
ABSTRACT We report the genome sequence of the dairy yeast Kluyveromyces lactis strain GG799 obtained using the Pacific Biosciences RS II platform. K. lactis strain GG799 is a common host for the expression of proteins at both laboratory and industrial scales. PMID:28751387
USDA-ARS?s Scientific Manuscript database
The availability of sequenced insect genomes has allowed for discovery and functional characterization of novel genes and proteins. We report use of the Tribolium castaneum (Herbst) (red flour beetle) genome to identify, clone, express, and characterize a novel endo-ß-1,4-glucanase we named TcEG1 (...
A gene expression atlas of developing oat seeds for enhancing nutritional composition
USDA-ARS?s Scientific Manuscript database
Oat (Avena sativa L.) genome resources are less abundant than for wheat and barley, but next generation sequencing (NGS) technologies have great potential to accelerate new genome information for oat in a cost-effective manner. We are employing RNA-Seq to develop a gene expression atlas of developin...
Genomic and Functional Approaches to Understanding Cancer Aneuploidy.
Taylor, Alison M; Shih, Juliann; Ha, Gavin; Gao, Galen F; Zhang, Xiaoyang; Berger, Ashton C; Schumacher, Steven E; Wang, Chen; Hu, Hai; Liu, Jianfang; Lazar, Alexander J; Cherniack, Andrew D; Beroukhim, Rameen; Meyerson, Matthew
2018-04-09
Aneuploidy, whole chromosome or chromosome arm imbalance, is a near-universal characteristic of human cancers. In 10,522 cancer genomes from The Cancer Genome Atlas, aneuploidy was correlated with TP53 mutation, somatic mutation rate, and expression of proliferation genes. Aneuploidy was anti-correlated with expression of immune signaling genes, due to decreased leukocyte infiltrates in high-aneuploidy samples. Chromosome arm-level alterations show cancer-specific patterns, including loss of chromosome arm 3p in squamous cancers. We applied genome engineering to delete 3p in lung cells, causing decreased proliferation rescued in part by chromosome 3 duplication. This study defines genomic and phenotypic correlates of cancer aneuploidy and provides an experimental approach to study chromosome arm aneuploidy. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Quantitative and functional interrogation of parent-of-origin allelic expression biases in the brain
Perez, Julio D; Rubinstein, Nimrod D; Fernandez, Daniel E; Santoro, Stephen W; Needleman, Leigh A; Ho-Shing, Olivia; Choi, John J; Zirlinger, Mariela; Chen, Shau-Kwaun; Liu, Jun S; Dulac, Catherine
2015-01-01
The maternal and paternal genomes play different roles in mammalian brains as a result of genomic imprinting, an epigenetic regulation leading to differential expression of the parental alleles of some genes. Here we investigate genomic imprinting in the cerebellum using a newly developed Bayesian statistical model that provides unprecedented transcript-level resolution. We uncover 160 imprinted transcripts, including 41 novel and independently validated imprinted genes. Strikingly, many genes exhibit parentally biased—rather than monoallelic—expression, with different magnitudes according to age, organ, and brain region. Developmental changes in parental bias and overall gene expression are strongly correlated, suggesting combined roles in regulating gene dosage. Finally, brain-specific deletion of the paternal, but not maternal, allele of the paternally-biased Bcl-x, (Bcl2l1) results in loss of specific neuron types, supporting the functional significance of parental biases. These findings reveal the remarkable complexity of genomic imprinting, with important implications for understanding the normal and diseased brain. DOI: http://dx.doi.org/10.7554/eLife.07860.001 PMID:26140685
Predicting human genetic interactions from cancer genome evolution.
Lu, Xiaowen; Megchelenbrink, Wout; Notebaart, Richard A; Huynen, Martijn A
2015-01-01
Synthetic Lethal (SL) genetic interactions play a key role in various types of biological research, ranging from understanding genotype-phenotype relationships to identifying drug-targets against cancer. Despite recent advances in empirical measuring SL interactions in human cells, the human genetic interaction map is far from complete. Here, we present a novel approach to predict this map by exploiting patterns in cancer genome evolution. First, we show that empirically determined SL interactions are reflected in various gene presence, absence, and duplication patterns in hundreds of cancer genomes. The most evident pattern that we discovered is that when one member of an SL interaction gene pair is lost, the other gene tends not to be lost, i.e. the absence of co-loss. This observation is in line with expectation, because the loss of an SL interacting pair will be lethal to the cancer cell. SL interactions are also reflected in gene expression profiles, such as an under representation of cases where the genes in an SL pair are both under expressed, and an over representation of cases where one gene of an SL pair is under expressed, while the other one is over expressed. We integrated the various previously unknown cancer genome patterns and the gene expression patterns into a computational model to identify SL pairs. This simple, genome-wide model achieves a high prediction power (AUC = 0.75) for known genetic interactions. It allows us to present for the first time a comprehensive genome-wide list of SL interactions with a high estimated prediction precision, covering up to 591,000 gene pairs. This unique list can potentially be used in various application areas ranging from biotechnology to medical genetics.
FANTOM5 CAGE profiles of human and mouse reprocessed for GRCh38 and GRCm38 genome assemblies.
Abugessaisa, Imad; Noguchi, Shuhei; Hasegawa, Akira; Harshbarger, Jayson; Kondo, Atsushi; Lizio, Marina; Severin, Jessica; Carninci, Piero; Kawaji, Hideya; Kasukawa, Takeya
2017-08-29
The FANTOM5 consortium described the promoter-level expression atlas of human and mouse by using CAGE (Cap Analysis of Gene Expression) with single molecule sequencing. In the original publications, GRCh37/hg19 and NCBI37/mm9 assemblies were used as the reference genomes of human and mouse respectively; later, the Genome Reference Consortium released newer genome assemblies GRCh38/hg38 and GRCm38/mm10. To increase the utility of the atlas in forthcoming researches, we reprocessed the data to make them available on the recent genome assemblies. The data include observed frequencies of transcription starting sites (TSSs) based on the realignment of CAGE reads, and TSS peaks that are converted from those based on the previous reference. Annotations of the peak names were also updated based on the latest public databases. The reprocessed results enable us to examine frequencies of transcription initiations on the recent genome assemblies and to refer promoters with updated information across the genome assemblies consistently.
Yamagishi, J; Isobe, R; Takebuchi, T; Bando, H
2003-03-01
We describe, for the first time, the generation of a viral DNA chip for simultaneous expression measurements of nearly all known open reading frames (ORFs) in the best-studied members of the family Baculoviridae, Autographa californica multiple nucleopolyhedrovirus (AcMNPV) and Bombyx mori nucleopolyhedrovirus (BmNPV). In this study, a viral DNA chip (Ac-BmNPV chip) was fabricated and used to characterize the viral gene expression profile for AcMNPV in different cell types. The viral chip is composed of microarrays of viral DNA prepared by robotic deposition of PCR-amplified viral DNA fragments on glass for ORFs in the NPV genome. Viral gene expression was monitored by hybridization to the DNA fragment microarrays with fluorescently labeled cDNAs prepared from infected Spodoptera frugiperda, Sf9 cells and Trichoplusia ni, TnHigh-Five cells, the latter a major producer of baculovirus and recombinant proteins. A comparison of expression profiles of known ORFs in AcMNPV elucidated six genes (ORF150, p10, pk2, and three late gene expression factor genes lef-3, p35 and lef- 6) the expression of each of which was regulated differently in the two cell lines. Most of these genes are known to be closely involved in the viral life cycle such as in DNA replication, late gene expression and the release of polyhedra from infected cells. These results imply that the differential expression of these viral genes accounts for the differences in viral replication between these two cell lines. Thus, these fabricated microarrays of NPV DNA which allow a rapid analysis of gene expression at the viral genome level should greatly speed the functional analysis of large genomes of NPV.
Brown, William M; Consedine, Nathan S
2004-01-01
The favored level of parental investment in a child may differ for genes of maternal and paternal origin in the child. This conflict can be expressed in the phenomenon of genomic imprinting that refers to situations in which the same gene is differentially expressed depending on its parent of origin. Two disorders that show the effects of genomic imprinting--both at 15q11-q13--are Angelman Syndrome (AS) which is due to the absence of expression of maternally-inherited genes and Prader-Willi syndromes (PWS) which is due to the absence of expression of paternally-inherited genes. However, although both disorders can arise from the deletion of the same genetic region, the gustatory, behavioral, and affective characteristics of AS and PWS children are remarkably distinct. Recent research inspired by kinship theory has suggested the origins of these phenotypic differences may lie in the differential investment of each parent's genome in the AS or PWS child. Specifically, it is thought that each set of parental genes have different 'ideas' regarding how the child should behave towards the mother and how much investment they should look to extract. In normal cases, the trade-off between the competing parental genomes produces a behavioral equilibrium in the child. However, in pathological instances, particularly where gene expression is one-sided, the evolved behavioral strategies favored by the contributing genome will dominate the child's behavior. To date, research in the area of genomic conflict in AS and PWS children has primarily focusing on differences in post-natal nutrition-related behaviors. The current paper extends this framework by offering an emotion and evolutionary signaling interpretation of the affective characteristics of AS children. A review of the affective characteristics of the two syndromes (PWS and AS) is presented before kinship and emotions theory are used to examine the functions that differential affect expression may serve in altering maternal investment. We expected that because the ultimate goal of paternal genes is to increase the child rearing burden of mothers, the Angelman behavioral phenotype should exhibit the emotion signaling characteristics that elicit levels of investment more consistent with paternal genetic interests. AS children display more positive, relative to negative, affect expressions (i.e. AS children laugh and smile more frequently than PWS children). In affect signaling theories, positive affect signals (i.e., smiling, laughing) have evolved to manipulate the sensory systems of receivers to increase social resources. In contrast, because the expression of some negative affects may indicate to the mother that the infant is not viable, negative affect expression is characteristically low among AS children. However, AS children may nonetheless have high levels of non-expressed anxiety because of its role in assisting the child (and its paternal genome) to maintain vigilance for changes in investment on the part of the mother. Overall, our kinship and emotion signaling analysis of AS children suggests that their global pattern of affect signaling represents one manifestation of an array of possible evolved strategies within the parental genome. Specifically, because AS exhibits the effects of paternally-inherited genes unhindered by the expression of maternally-inherited genes, the AS infant manifests a pattern of expression and non-expression that maximize maternal investment and thus paternal fitness. This theory is a significant departure from the standard but erroneous conjecture that a mother and child's inclusive fitness interests are one and the same. Copyright 2004 Elsevier Ltd.
Pingault, Lise; Choulet, Frédéric; Alberti, Adriana; Glover, Natasha; Wincker, Patrick; Feuillet, Catherine; Paux, Etienne
2015-02-10
Because of its size, allohexaploid nature, and high repeat content, the bread wheat genome is a good model to study the impact of the genome structure on gene organization, function, and regulation. However, because of the lack of a reference genome sequence, such studies have long been hampered and our knowledge of the wheat gene space is still limited. The access to the reference sequence of the wheat chromosome 3B provided us with an opportunity to study the wheat transcriptome and its relationships to genome and gene structure at a level that has never been reached before. By combining this sequence with RNA-seq data, we construct a fine transcriptome map of the chromosome 3B. More than 8,800 transcription sites are identified, that are distributed throughout the entire chromosome. Expression level, expression breadth, alternative splicing as well as several structural features of genes, including transcript length, number of exons, and cumulative intron length are investigated. Our analysis reveals a non-monotonic relationship between gene expression and structure and leads to the hypothesis that gene structure is determined by its function, whereas gene expression is subject to energetic cost. Moreover, we observe a recombination-based partitioning at the gene structure and function level. Our analysis provides new insights into the relationships between gene and genome structure and function. It reveals mechanisms conserved with other plant species as well as superimposed evolutionary forces that shaped the wheat gene space, likely participating in wheat adaptation.
MVisAGe Identifies Concordant and Discordant Genomic Alterations of Driver Genes in Squamous Tumors.
Walter, Vonn; Du, Ying; Danilova, Ludmila; Hayward, Michele C; Hayes, D Neil
2018-06-15
Integrated analyses of multiple genomic datatypes are now common in cancer profiling studies. Such data present opportunities for numerous computational experiments, yet analytic pipelines are limited. Tools such as the cBioPortal and Regulome Explorer, although useful, are not easy to access programmatically or to implement locally. Here, we introduce the MVisAGe R package, which allows users to quantify gene-level associations between two genomic datatypes to investigate the effect of genomic alterations (e.g., DNA copy number changes on gene expression). Visualizing Pearson/Spearman correlation coefficients according to the genomic positions of the underlying genes provides a powerful yet novel tool for conducting exploratory analyses. We demonstrate its utility by analyzing three publicly available cancer datasets. Our approach highlights canonical oncogenes in chr11q13 that displayed the strongest associations between expression and copy number, including CCND1 and CTTN , genes not identified by copy number analysis in the primary reports. We demonstrate highly concordant usage of shared oncogenes on chr3q, yet strikingly diverse oncogene usage on chr11q as a function of HPV infection status. Regions of chr19 that display remarkable associations between methylation and gene expression were identified, as were previously unreported miRNA-gene expression associations that may contribute to the epithelial-to-mesenchymal transition. Significance: This study presents an important bioinformatics tool that will enable integrated analyses of multiple genomic datatypes. Cancer Res; 78(12); 3375-85. ©2018 AACR . ©2018 American Association for Cancer Research.
Jouffe, Vincent; Rowe, Suzanne; Liaubet, Laurence; Buitenhuis, Bart; Hornshøj, Henrik; SanCristobal, Magali; Mormède, Pierre; de Koning, D J
2009-07-16
Microarray studies can supplement QTL studies by suggesting potential candidate genes in the QTL regions, which by themselves are too large to provide a limited selection of candidate genes. Here we provide a case study where we explore ways to integrate QTL data and microarray data for the pig, which has only a partial genome sequence. We outline various procedures to localize differentially expressed genes on the pig genome and link this with information on published QTL. The starting point is a set of 237 differentially expressed cDNA clones in adrenal tissue from two pig breeds, before and after treatment with adrenocorticotropic hormone (ACTH). Different approaches to localize the differentially expressed (DE) genes to the pig genome showed different levels of success and a clear lack of concordance for some genes between the various approaches. For a focused analysis on 12 genes, overlapping QTL from the public domain were presented. Also, differentially expressed genes underlying QTL for ACTH response were described. Using the latest version of the draft sequence, the differentially expressed genes were mapped to the pig genome. This enabled co-location of DE genes and previously studied QTL regions, but the draft genome sequence is still incomplete and will contain many errors. A further step to explore links between DE genes and QTL at the pathway level was largely unsuccessful due to the lack of annotation of the pig genome. This could be improved by further comparative mapping analyses but this would be time consuming. This paper provides a case study for the integration of QTL data and microarray data for a species with limited genome sequence information and annotation. The results illustrate the challenges that must be addressed but also provide a roadmap for future work that is applicable to other non-model species.
Filloux, Denis; Murrell, Sasha; Koohapitagtam, Maneerat; Golden, Michael; Julian, Charlotte; Galzi, Serge; Uzest, Marilyne; Rodier-Goud, Marguerite; D’Hont, Angélique; Vernerey, Marie Stephanie; Wilkin, Paul; Peterschmitt, Michel; Winter, Stephan; Murrell, Ben; Martin, Darren P.; Roumagnac, Philippe
2015-01-01
Endogenous viral sequences are essentially ‘fossil records’ that can sometimes reveal the genomic features of long extinct virus species. Although numerous known instances exist of single-stranded DNA (ssDNA) genomes becoming stably integrated within the genomes of bacteria and animals, there remain very few examples of such integration events in plants. The best studied of these events are those which yielded the geminivirus-related DNA elements found within the nuclear genomes of various Nicotiana species. Although other ssDNA virus-like sequences are included within the draft genomes of various plant species, it is not entirely certain that these are not contaminants. The Nicotiana geminivirus-related DNA elements therefore remain the only definitively proven instances of endogenous plant ssDNA virus sequences. Here, we characterize two new classes of endogenous plant virus sequence that are also apparently derived from ancient geminiviruses in the genus Begomovirus. These two endogenous geminivirus-like elements (EGV1 and EGV2) are present in the Dioscorea spp. of the Enantiophyllum clade. We used fluorescence in situ hybridization to confirm that the EGV1 sequences are integrated in the D. alata genome and showed that one or two ancestral EGV sequences likely became integrated more than 1.4 million years ago during or before the diversification of the Asian and African Enantiophyllum Dioscorea spp. Unexpectedly, we found evidence of natural selection actively favouring the maintenance of EGV-expressed replication-associated protein (Rep) amino acid sequences, which clearly indicates that functional EGV Rep proteins were probably expressed for prolonged periods following endogenization. Further, the detection in D. alata of EGV gene transcripts, small 21–24 nt RNAs that are apparently derived from these transcripts, and expressed Rep proteins, provides evidence that some EGV genes are possibly still functionally expressed in at least some of the Enantiophyllum clade species. PMID:27774276
Li, Hong-Mei; Yang, Hong; Wen, Dong-Yue; Luo, Yi-Huan; Liang, Chun-Yan; Pan, Deng-Hua; Ma, Wei; Chen, Gang; He, Yun; Chen, Jun-Qiang
2017-05-01
The role of long non-coding RNA (lncRNA) HOX transcript antisense RNA (HOTAIR) in thyroid carcinoma (TC) remains unclear. The current study was aimed to assess the clinical value of HOTAIR expression levels in TC based on publically available data and to evaluate its potential signaling pathways. The expression data of HOTAIR and clinical information concerning TC were downloaded from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO), respectively. Furthermore, 3 online biological databases, Starbase, Cbioportal, and Multi Experiment Matrix, were used to identify HOTAIR-related genes in TC. Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Panther pathway analyses were then undertaken to study the most enriched signaling pathways in TC (EASE score<0.1, Bonferroni<0.05). The TCGA results demonstrated that the expression level of HOTAIR in TC tissues was significantly increased compared with non-cancerous tissues (p<0.001). HOTAIR over-expression was significantly associated with poor survival in TC patients (p=0.03). Meta-analyses of GEO datasets revealed a trend consistent with the above results on HOTAIR expression levels in TC (SMD=0.23; 95%CI, 0.00-0.45; p=0.047). Finally, the results of functional analysis for HOTAIR-related genes indicated that HOTAIR might participate in tumorigenesis via the Wnt signaling pathway. In conclusion, our study demonstrates that HOTAIR may be involved in thyroid carcinogenesis, and the over-expression of HOTAIR could act as a biomarker associated with a poor outcome in TC patients. Moreover, the Wnt signaling pathway may be the key pathway regulated by HOTAIR in TC. © Georg Thieme Verlag KG Stuttgart · New York.
Roymondal, Uttam; Das, Shibsankar; Sahoo, Satyabrata
2009-01-01
We present an expression measure of a gene, devised to predict the level of gene expression from relative codon bias (RCB). There are a number of measures currently in use that quantify codon usage in genes. Based on the hypothesis that gene expressivity and codon composition is strongly correlated, RCB has been defined to provide an intuitively meaningful measure of an extent of the codon preference in a gene. We outline a simple approach to assess the strength of RCB (RCBS) in genes as a guide to their likely expression levels and illustrate this with an analysis of Escherichia coli (E. coli) genome. Our efforts to quantitatively predict gene expression levels in E. coli met with a high level of success. Surprisingly, we observe a strong correlation between RCBS and protein length indicating natural selection in favour of the shorter genes to be expressed at higher level. The agreement of our result with high protein abundances, microarray data and radioactive data demonstrates that the genomic expression profile available in our method can be applied in a meaningful way to the study of cell physiology and also for more detailed studies of particular genes of interest. PMID:19131380
A Genome-Wide Landscape of Retrocopies in Primate Genomes.
Navarro, Fábio C P; Galante, Pedro A F
2015-07-29
Gene duplication is a key factor contributing to phenotype diversity across and within species. Although the availability of complete genomes has led to the extensive study of genomic duplications, the dynamics and variability of gene duplications mediated by retrotransposition are not well understood. Here, we predict mRNA retrotransposition and use comparative genomics to investigate their origin and variability across primates. Analyzing seven anthropoid primate genomes, we found a similar number of mRNA retrotranspositions (∼7,500 retrocopies) in Catarrhini (Old Word Monkeys, including humans), but a surprising large number of retrocopies (∼10,000) in Platyrrhini (New World Monkeys), which may be a by-product of higher long interspersed nuclear element 1 activity in these genomes. By inferring retrocopy orthology, we dated most of the primate retrocopy origins, and estimated a decrease in the fixation rate in recent primate history, implying a smaller number of species-specific retrocopies. Moreover, using RNA-Seq data, we identified approximately 3,600 expressed retrocopies. As expected, most of these retrocopies are located near or within known genes, present tissue-specific and even species-specific expression patterns, and no expression correlation to their parental genes. Taken together, our results provide further evidence that mRNA retrotransposition is an active mechanism in primate evolution and suggest that retrocopies may not only introduce great genetic variability between lineages but also create a large reservoir of potentially functional new genomic loci in primate genomes. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
GPNMB expression in uveal melanoma: a potential for targeted therapy.
Williams, Michelle D; Esmaeli, Bita; Soheili, Aydin; Simantov, Ronit; Gombos, Dan S; Bedikian, Agop Y; Hwu, Patrick
2010-06-01
Uveal melanoma is an aggressive disease without effective adjuvant therapy for metastases. Despite genomic differences between cutaneous and uveal melanomas, therapies based on shared biological factors could be effective against both tumor types. High expression of glycoprotein-NMB (GPNMB) in cutaneous melanomas led to the development of CDX-011 (glembatumumab vedotin), a fully human monoclonal antibody against the extracellular domain of GPNMB conjugated to the cytotoxic microtubule toxin monomethylauristatin E. Ongoing phase II trials suggest that CDX-011 has activity against advanced cutaneous melanomas. To determine the potential role of CDX-011 in uveal melanomas, we studied their GPNMB expression. Paraffin-embedded tissues from 22 uveal melanomas treated by enucleation from 2004-2007 at one institution were evaluated immunohistochemically for expression of GPNMB using biotinylated CDX-011 (unconjugated) antibody. Melanoma cells were evaluated for percentage and intensity of expression. Spectral imaging was used in one case with high melanin content. Clinical data were reviewed. Twelve women and 10 men with a median age of 58.7 years (range: 28-83 years) were included. Eighteen of 21 tumors evaluated immunohistochemically (85.7%) expressed GPNMB in 10-90% of tumor cells with variable intensity (5 tumors, 1+; 11, 2+; and 2, 3+). Eleven of 18 tumors (61.1%) expressed GPNMB in >or=50% of cells. Spectral imaging showed diffuse CDX-011 (unconjugated) reactivity in the remaining case. Uveal melanoma, like cutaneous melanoma, commonly expresses GPNMB. Ongoing clinical trials of CDX-011 should be extended to patients with metastatic uveal melanoma to determine potential efficacy in this subset of patients with melanoma.
RNAi Functions in Adaptive Reprogramming of the Genome | Center for Cancer Research
The regulation of transcribing DNA into RNA, including the production, processing, and degradation of RNA transcripts, affects the expression and the regulation of the genome in ways that are just beginning to be unraveled. A surprising discovery in recent years is that the vast majority of the genome is transcribed to yield an abundance of RNA transcripts. Many transcripts are regulated by the exosome, a multi-protein complex that degrades RNAs, and may also be targeted, under certain conditions, by the RNA interference (RNAi) pathway. These RNA degrading activities can recruit factors to silence certain regions of the genome by condensing the DNA into tightly-packed heterochromatin. For some chromosomal regions, such as centromeres and telomeres, which lie at the center and ends of chromosomes, respectively, silencing must be stably enforced through each cell generation. For other regions, silencing mechanisms must be easily reversible to activate gene expression in response to changing environmental or developmental conditions. Thus, the regulation of gene silencing is key to maintaining the integrity of the genome and proper cellular expression patterns, which, when disrupted can underlie many diseases, including cancer.
Sugano, Shigeo S; Suzuki, Hiroko; Shimokita, Eisuke; Chiba, Hirofumi; Noji, Sumihare; Osakabe, Yuriko; Osakabe, Keishi
2017-04-28
Mushroom-forming basidiomycetes produce a wide range of metabolites and have great value not only as food but also as an important global natural resource. Here, we demonstrate CRISPR/Cas9-based genome editing in the model species Coprinopsis cinerea. Using a high-throughput reporter assay with cryopreserved protoplasts, we identified a novel promoter, CcDED1 pro , with seven times stronger activity in this assay than the conventional promoter GPD2. To develop highly efficient genome editing using CRISPR/Cas9 in C. cinerea, we used the CcDED1 pro to express Cas9 and a U6-snRNA promoter from C. cinerea to express gRNA. Finally, CRISPR/Cas9-mediated GFP mutagenesis was performed in a stable GFP expression line. Individual genome-edited lines were isolated, and loss of GFP function was detected in hyphae and fruiting body primordia. This novel method of high-throughput CRISPR/Cas9-based genome editing using cryopreserved protoplasts should be a powerful tool in the study of edible mushrooms.
Makita, Yuko; Kawashima, Mika; Lau, Nyok Sean; Othman, Ahmad Sofiman; Matsui, Minami
2018-01-19
Natural rubber is an economically important material. Currently the Pará rubber tree, Hevea brasiliensis is the main commercial source. Little is known about rubber biosynthesis at the molecular level. Next-generation sequencing (NGS) technologies brought draft genomes of three rubber cultivars and a variety of RNA sequencing (RNA-seq) data. However, no current genome or transcriptome databases (DB) are organized by gene. A gene-oriented database is a valuable support for rubber research. Based on our original draft genome sequence of H. brasiliensis RRIM600, we constructed a rubber tree genome and transcriptome DB. Our DB provides genome information including gene functional annotations and multi-transcriptome data of RNA-seq, full-length cDNAs including PacBio Isoform sequencing (Iso-Seq), ESTs and genome wide transcription start sites (TSSs) derived from CAGE technology. Using our original and publically available RNA-seq data, we calculated co-expressed genes for identifying functionally related gene sets and/or genes regulated by the same transcription factor (TF). Users can access multi-transcriptome data through both a gene-oriented web page and a genome browser. For the gene searching system, we provide keyword search, sequence homology search and gene expression search; users can also select their expression threshold easily. The rubber genome and transcriptome DB provides rubber tree genome sequence and multi-transcriptomics data. This DB is useful for comprehensive understanding of the rubber transcriptome. This will assist both industrial and academic researchers for rubber and economically important close relatives such as R. communis, M. esculenta and J. curcas. The Rubber Transcriptome DB release 2017.03 is accessible at http://matsui-lab.riken.jp/rubber/ .
Kravatsky, Yuri V; Chechetkin, Vladimir R; Tchurikov, Nikolai A; Kravatskaya, Galina I
2015-02-01
The broad class of tasks in genetics and epigenetics can be reduced to the study of various features that are distributed over the genome (genome tracks). The rapid and efficient processing of the huge amount of data stored in the genome-scale databases cannot be achieved without the software packages based on the analytical criteria. However, strong inhomogeneity of genome tracks hampers the development of relevant statistics. We developed the criteria for the assessment of genome track inhomogeneity and correlations between two genome tracks. We also developed a software package, Genome Track Analyzer, based on this theory. The theory and software were tested on simulated data and were applied to the study of correlations between CpG islands and transcription start sites in the Homo sapiens genome, between profiles of protein-binding sites in chromosomes of Drosophila melanogaster, and between DNA double-strand breaks and histone marks in the H. sapiens genome. Significant correlations between transcription start sites on the forward and the reverse strands were observed in genomes of D. melanogaster, Caenorhabditis elegans, Mus musculus, H. sapiens, and Danio rerio. The observed correlations may be related to the regulation of gene expression in eukaryotes. Genome Track Analyzer is freely available at http://ancorr.eimb.ru/. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Schadt, Eric E; Edwards, Stephen W; GuhaThakurta, Debraj; Holder, Dan; Ying, Lisa; Svetnik, Vladimir; Leonardson, Amy; Hart, Kyle W; Russell, Archie; Li, Guoya; Cavet, Guy; Castle, John; McDonagh, Paul; Kan, Zhengyan; Chen, Ronghua; Kasarskis, Andrew; Margarint, Mihai; Caceres, Ramon M; Johnson, Jason M; Armour, Christopher D; Garrett-Engele, Philip W; Tsinoremas, Nicholas F; Shoemaker, Daniel D
2004-01-01
Background Computational and microarray-based experimental approaches were used to generate a comprehensive transcript index for the human genome. Oligonucleotide probes designed from approximately 50,000 known and predicted transcript sequences from the human genome were used to survey transcription from a diverse set of 60 tissues and cell lines using ink-jet microarrays. Further, expression activity over at least six conditions was more generally assessed using genomic tiling arrays consisting of probes tiled through a repeat-masked version of the genomic sequence making up chromosomes 20 and 22. Results The combination of microarray data with extensive genome annotations resulted in a set of 28,456 experimentally supported transcripts. This set of high-confidence transcripts represents the first experimentally driven annotation of the human genome. In addition, the results from genomic tiling suggest that a large amount of transcription exists outside of annotated regions of the genome and serves as an example of how this activity could be measured on a genome-wide scale. Conclusions These data represent one of the most comprehensive assessments of transcriptional activity in the human genome and provide an atlas of human gene expression over a unique set of gene predictions. Before the annotation of the human genome is considered complete, however, the previously unannotated transcriptional activity throughout the genome must be fully characterized. PMID:15461792
Opazo, Juan C; Lee, Alison P; Hoffmann, Federico G; Toloza-Villalobos, Jessica; Burmester, Thorsten; Venkatesh, Byrappa; Storz, Jay F
2015-07-01
Comparative analyses of vertebrate genomes continue to uncover a surprising diversity of genes in the globin gene superfamily, some of which have very restricted phyletic distributions despite their antiquity. Genomic analysis of the globin gene repertoire of cartilaginous fish (Chondrichthyes) should be especially informative about the duplicative origins and ancestral functions of vertebrate globins, as divergence between Chondrichthyes and bony vertebrates represents the most basal split within the jawed vertebrates. Here, we report a comparative genomic analysis of the vertebrate globin gene family that includes the complete globin gene repertoire of the elephant shark (Callorhinchus milii). Using genomic sequence data from representatives of all major vertebrate classes, integrated analyses of conserved synteny and phylogenetic relationships revealed that the last common ancestor of vertebrates possessed a repertoire of at least seven globin genes: single copies of androglobin and neuroglobin, four paralogous copies of globin X, and the single-copy progenitor of the entire set of vertebrate-specific globins. Combined with expression data, the genomic inventory of elephant shark globins yielded four especially surprising findings: 1) there is no trace of the neuroglobin gene (a highly conserved gene that is present in all other jawed vertebrates that have been examined to date), 2) myoglobin is highly expressed in heart, but not in skeletal muscle (reflecting a possible ancestral condition in vertebrates with single-circuit circulatory systems), 3) elephant shark possesses two highly divergent globin X paralogs, one of which is preferentially expressed in gonads, and 4) elephant shark possesses two structurally distinct α-globin paralogs, one of which is preferentially expressed in the brain. Expression profiles of elephant shark globin genes reveal distinct specializations of function relative to orthologs in bony vertebrates and suggest hypotheses about ancestral functions of vertebrate globins. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Jiang, Peng; Nelson, Jeffrey D.; Leng, Ning; Collins, Michael; Swanson, Scott; Dewey, Colin N.; Thomson, James A.; Stewart, Ron
2016-01-01
The axolotl (Ambystoma mexicanum) has long been the subject of biological research, primarily owing to its outstanding regenerative capabilities. However, the gene expression programs governing its embryonic development are particularly underexplored, especially when compared to other amphibian model species. Therefore, we performed whole transcriptome polyA+ RNA sequencing experiments on 17 stages of embryonic development. As the axolotl genome is unsequenced and its gene annotation is incomplete, we built de novo transcriptome assemblies for each stage and garnered functional annotation by comparing expressed contigs with known genes in other organisms. In evaluating the number of differentially expressed genes over time, we identify three waves of substantial transcriptome upheaval each followed by a period of relative transcriptome stability. The first wave of upheaval is between the one and two cell stage. We show that the number of differentially expressed genes per unit time is higher between the one and two cell stage than it is across the mid-blastula transition (MBT), the period of zygotic genome activation. We use total RNA sequencing to demonstrate that the vast majority of genes with increasing polyA+ signal between the one and two cell stage result from polyadenylation rather than de novo transcription. The first stable phase begins after the two cell stage and continues until the mid-blastula transition, corresponding with the pre-MBT phase of transcriptional quiescence in amphibian development. Following this is a peak of differential gene expression corresponding with the activation of the zygotic genome and a phase of transcriptomic stability from stages 9 to 11. We observe a third wave of transcriptomic change between stages 11 and 14, followed by a final stable period. The last two stable phases have not been documented in amphibians previously and correspond to times of major morphogenic change in the axolotl embryo: gastrulation and neurulation. These results yield new insights into global gene expression during early stages of amphibian embryogenesis and will help to further develop the axolotl as a model species for developmental and regenerative biology. PMID:27475628
CMV induces HERV-K and HERV-W expression in kidney transplant recipients.
Bergallo, Massimiliano; Galliano, Ilaria; Montanari, Paola; Gambarino, Stefano; Mareschi, Katia; Ferro, Francesca; Fagioli, Franca; Tovo, Pier-Angelo; Ravanini, Paolo
2015-07-01
Human endogenous retrovirus (HERVs) constitute approximately 8% of the human genome. Induction of HERV transcription is possible under certain circumstances, and may have a possible role in some pathological conditions. The aim of this study was to evaluate HERV-K and -W pol gene expression in kidney transplant recipients and to investigate the possible relationship between HERVs gene expression and CMV infection in these patients. Thirty-three samples of kidney transplant patients and twenty healthy blood donors were used to analyze, HERV-K and -W pol gene RNA expression by relative quantitative relative Real-Time PCR. We demonstrated that HERVs pol gene expression levels were higher in kidney transplant recipients than in healthy subjects. Moreover, HERV-K and -W pol gene expression was significantly higher in the group of kidney transplant recipients with high CMV viral load than in the groups with no or moderate CMV viral load. Our data suggest that CMV may facilitate in vivo HERV activation. Published by Elsevier B.V.
Nayduch, Dana; Lee, Matthew B; Saski, Christopher A
2014-01-01
Unlike other important vectors such as mosquitoes and sandflies, genetic and genomic tools for Culicoides biting midges are lacking, despite the fact that they vector a large number of arboviruses and other pathogens impacting humans and domestic animals world-wide. In North America, female Culicoides sonorensis midges are important vectors of bluetongue virus (BTV) and epizootic hemorrhagic disease virus (EHDV), orbiviruses that cause significant disease in livestock and wildlife. Libraries of tissue-specific transcripts expressed in response to feeding and oral orbivirus challenge in C. sonorensis have previously been reported, but extensive genome-wide expression profiling in the midge has not. Here, we successfully used deep sequencing technologies to construct the first adult female C. sonorensis reference transcriptome, and utilized genome-wide expression profiling to elucidate the genetic response to blood and sucrose feeding over time. The adult female midge unigene consists of 19,041 genes, of which less than 7% are differentially expressed during the course of a sucrose meal, while up to 52% of the genes respond significantly in blood-fed midges, indicating hematophagy induces complex physiological processes. Many genes that were differentially expressed during blood feeding were associated with digestion (e.g. proteases, lipases), hematophagy (e.g., salivary proteins), and vitellogenesis, revealing many major metabolic and biological factors underlying these critical processes. Additionally, key genes in the vitellogenesis pathway were identified, which provides the first glimpse into the molecular basis of anautogeny for C. sonorensis. This is the first extensive transcriptome for this genus, which will serve as a framework for future expression studies, RNAi, and provide a rich dataset contributing to the ultimate goal of informing a reference genome assembly and annotation. Moreover, this study will serve as a foundation for subsequent studies of genome-wide expression analyses during early orbivirus infection and dissecting the molecular mechanisms behind vector competence in midges.
Applications of the 1000 Genomes Project resources
Zheng-Bradley, Xiangqun
2017-01-01
Abstract The 1000 Genomes Project created a valuable, worldwide reference for human genetic variation. Common uses of the 1000 Genomes dataset include genotype imputation supporting Genome-wide Association Studies, mapping expression Quantitative Trait Loci, filtering non-pathogenic variants from exome, whole genome and cancer genome sequencing projects, and genetic analysis of population structure and molecular evolution. In this article, we will highlight some of the multiple ways that the 1000 Genomes data can be and has been utilized for genetic studies. PMID:27436001
LSD1 dual function in mediating epigenetic corruption of the vitamin D signaling in prostate cancer.
Battaglia, Sebastiano; Karasik, Ellen; Gillard, Bryan; Williams, Jennifer; Winchester, Trisha; Moser, Michael T; Smiraglia, Dominic J; Foster, Barbara A
2017-01-01
Lysine-specific demethylase 1A (LSD1) is a key regulator of the androgen (AR) and estrogen receptors (ER), and LSD1 levels correlate with tumor aggressiveness. Here, we demonstrate that LSD1 regulates vitamin D receptor (VDR) activity and is a mediator of 1,25(OH) 2 -D 3 (vitamin D) action in prostate cancer (PCa). Athymic nude mice were xenografted with CWR22 cells and monitored weekly after testosterone pellet removal. Expression of LSD1 and VDR (IHC) were correlated with tumor growth using log-rank test. TRAMP tumors and prostates from wild-type (WT) mice were used to evaluate VDR and LSD1 expression via IHC and western blotting. The presence of VDR and LSD1 in the same transcriptional complex was evaluated via immunoprecipitation (IP) using nuclear cell lysate. The effect of LSD1 and 1,25(OH) 2 -D 3 on cell viability was evaluated in C4-2 and BC1A cells via trypan blue exclusion. The role of LSD1 in VDR-mediated gene transcription was evaluated for Cdkn1a , E2f1 , Cyp24a1 , and S100g via qRT-PCR-TaqMan and via chromatin immunoprecipitation assay. Methylation of Cdkn1a TSS was measured via bisulfite sequencing, and methylation of a panel of cancer-related genes was quantified using methyl arrays. The Cancer Genome Atlas data were retrieved to identify genes whose status correlates with LSD1 and DNA methyltransferase 1 (DNMT1). Results were correlated with patients' survival data from two separate cohorts of primary and metastatic PCa. LSD1 and VDR protein levels are elevated in PCa tumors and correlate with faster tumor growth in xenograft mouse models. Knockdown of LSD1 reduces PCa cell viability, and gene expression data suggest a dual coregulatory role of LSD1 for VDR, acting as a coactivator and corepressor in a locus-specific manner. LSD1 modulates VDR-dependent transcription by mediating the recruitment of VDR and DNMT1 at the TSS of VDR-targeted genes and modulates the epigenetic status of transcribed genes by altering H3K4me2 and H3K9Ac and DNA methylation. Lastly, LSD1 and DNMT1 belong to a genome-wide signature whose expression correlates with shorter progression-free survival and overall survival in primary and metastatic patients' samples, respectively. Results demonstrate that LSD1 has a dual coregulatory role as corepressor and coactivator for VDR and defines a genomic signature whose targeting might have clinical relevance for PCa patients.
fastBMA: scalable network inference and transitive reduction.
Hung, Ling-Hong; Shi, Kaiyuan; Wu, Migao; Young, William Chad; Raftery, Adrian E; Yeung, Ka Yee
2017-10-01
Inferring genetic networks from genome-wide expression data is extremely demanding computationally. We have developed fastBMA, a distributed, parallel, and scalable implementation of Bayesian model averaging (BMA) for this purpose. fastBMA also includes a computationally efficient module for eliminating redundant indirect edges in the network by mapping the transitive reduction to an easily solved shortest-path problem. We evaluated the performance of fastBMA on synthetic data and experimental genome-wide time series yeast and human datasets. When using a single CPU core, fastBMA is up to 100 times faster than the next fastest method, LASSO, with increased accuracy. It is a memory-efficient, parallel, and distributed application that scales to human genome-wide expression data. A 10 000-gene regulation network can be obtained in a matter of hours using a 32-core cloud cluster (2 nodes of 16 cores). fastBMA is a significant improvement over its predecessor ScanBMA. It is more accurate and orders of magnitude faster than other fast network inference methods such as the 1 based on LASSO. The improved scalability allows it to calculate networks from genome scale data in a reasonable time frame. The transitive reduction method can improve accuracy in denser networks. fastBMA is available as code (M.I.T. license) from GitHub (https://github.com/lhhunghimself/fastBMA), as part of the updated networkBMA Bioconductor package (https://www.bioconductor.org/packages/release/bioc/html/networkBMA.html) and as ready-to-deploy Docker images (https://hub.docker.com/r/biodepot/fastbma/). © The Authors 2017. Published by Oxford University Press.
Novel integrative genomic tool for interrogating lithium response in bipolar disorder
Hunsberger, J G; Chibane, F L; Elkahloun, A G; Henderson, R; Singh, R; Lawson, J; Cruceanu, C; Nagarajan, V; Turecki, G; Squassina, A; Medeiros, C D; Del Zompo, M; Rouleau, G A; Alda, M; Chuang, D-M
2015-01-01
We developed a novel integrative genomic tool called GRANITE (Genetic Regulatory Analysis of Networks Investigational Tool Environment) that can effectively analyze large complex data sets to generate interactive networks. GRANITE is an open-source tool and invaluable resource for a variety of genomic fields. Although our analysis is confined to static expression data, GRANITE has the capability of evaluating time-course data and generating interactive networks that may shed light on acute versus chronic treatment, as well as evaluating dose response and providing insight into mechanisms that underlie therapeutic versus sub-therapeutic doses or toxic doses. As a proof-of-concept study, we investigated lithium (Li) response in bipolar disorder (BD). BD is a severe mood disorder marked by cycles of mania and depression. Li is one of the most commonly prescribed and decidedly effective treatments for many patients (responders), although its mode of action is not yet fully understood, nor is it effective in every patient (non-responders). In an in vitro study, we compared vehicle versus chronic Li treatment in patient-derived lymphoblastoid cells (LCLs) (derived from either responders or non-responders) using both microRNA (miRNA) and messenger RNA gene expression profiling. We present both Li responder and non-responder network visualizations created by our GRANITE analysis in BD. We identified by network visualization that the Let-7 family is consistently downregulated by Li in both groups where this miRNA family has been implicated in neurodegeneration, cell survival and synaptic development. We discuss the potential of this analysis for investigating treatment response and even providing clinicians with a tool for predicting treatment response in their patients, as well as for providing the industry with a tool for identifying network nodes as targets for novel drug discovery. PMID:25646593
Novel integrative genomic tool for interrogating lithium response in bipolar disorder.
Hunsberger, J G; Chibane, F L; Elkahloun, A G; Henderson, R; Singh, R; Lawson, J; Cruceanu, C; Nagarajan, V; Turecki, G; Squassina, A; Medeiros, C D; Del Zompo, M; Rouleau, G A; Alda, M; Chuang, D-M
2015-02-03
We developed a novel integrative genomic tool called GRANITE (Genetic Regulatory Analysis of Networks Investigational Tool Environment) that can effectively analyze large complex data sets to generate interactive networks. GRANITE is an open-source tool and invaluable resource for a variety of genomic fields. Although our analysis is confined to static expression data, GRANITE has the capability of evaluating time-course data and generating interactive networks that may shed light on acute versus chronic treatment, as well as evaluating dose response and providing insight into mechanisms that underlie therapeutic versus sub-therapeutic doses or toxic doses. As a proof-of-concept study, we investigated lithium (Li) response in bipolar disorder (BD). BD is a severe mood disorder marked by cycles of mania and depression. Li is one of the most commonly prescribed and decidedly effective treatments for many patients (responders), although its mode of action is not yet fully understood, nor is it effective in every patient (non-responders). In an in vitro study, we compared vehicle versus chronic Li treatment in patient-derived lymphoblastoid cells (LCLs) (derived from either responders or non-responders) using both microRNA (miRNA) and messenger RNA gene expression profiling. We present both Li responder and non-responder network visualizations created by our GRANITE analysis in BD. We identified by network visualization that the Let-7 family is consistently downregulated by Li in both groups where this miRNA family has been implicated in neurodegeneration, cell survival and synaptic development. We discuss the potential of this analysis for investigating treatment response and even providing clinicians with a tool for predicting treatment response in their patients, as well as for providing the industry with a tool for identifying network nodes as targets for novel drug discovery.
Clinical and Genetic Implications of Mutation Burden in Squamous Cell Carcinoma of the Lung.
Okamoto, Tatsuro; Takada, Kazuki; Sato, Seijiro; Toyokawa, Gouji; Tagawa, Tetsuzo; Shoji, Fumihiro; Nakanishi, Ryota; Oki, Eiji; Koike, Terumoto; Nagahashi, Masayuki; Ichikawa, Hiroshi; Shimada, Yoshifumi; Watanabe, Satoshi; Kikuchi, Toshiaki; Akazawa, Kouhei; Lyle, Stephen; Takabe, Kazuaki; Okuda, Shujiro; Sugio, Kenji; Wakai, Toshifumi; Tsuchida, Masanori; Maehara, Yoshihiko
2018-06-01
Lung squamous cell carcinoma (LSCC) is a major histological subtype of lung cancer. In this study, we investigated genomic alterations in LSCC and evaluated the clinical implications of mutation burden (MB) in LSCC. Genomic alterations were determined in Japanese patients with LSCC (N = 67) using next-generation sequencing of 415 known cancer genes. MB was defined as the number of non-synonymous mutations per 1 Mbp. Programmed death-ligand 1 (PD-L1) protein expression in cancer cells was evaluated by immunohistochemical analysis. TP53 gene mutations were the most common alteration (n = 51/67, 76.1%), followed by gene alterations in cyclin-dependent kinase inhibitor 2B (CDKN2B; 35.8%), CDKN2A (31.3%), phosphatase and tensin homolog (30.0%), and sex-determining region Y-box 2 (SOX2, 28.3%). Histological differentiation was significantly poorer in tumors with high MB (greater than or equal to the median MB) compared with that in tumors with low MB (less than the median MB; p = 0.0446). The high MB group had more tumors located in the upper or middle lobe than tumors located in the lower lobe (p = 0.0019). Moreover, cancers in the upper or middle lobes had significantly higher MB than cancers in the lower lobes (p = 0.0005), and tended to show higher PD-L1 protein expression (p = 0.0573). SOX2 and tyrosine kinase non-receptor 2 amplifications were associated with high MB (p = 0.0065 and p = 0.0010, respectively). The MB level differed according to the tumor location in LSCC, suggesting that the location of cancer development may influence the genomic background of the tumor.
Harkness, Justine M; Kader, Muhamuda; DeLuca, Neal A
2014-06-01
Herpes simplex virus 1 (HSV-1) can undergo a productive infection in nonneuronal and neuronal cells such that the genes of the virus are transcribed in an ordered cascade. HSV-1 can also establish a more quiescent or latent infection in peripheral neurons, where gene expression is substantially reduced relative to that in productive infection. HSV mutants defective in multiple immediate early (IE) gene functions are highly defective for later gene expression and model some aspects of latency in vivo. We compared the expression of wild-type (wt) virus and IE gene mutants in nonneuronal cells (MRC5) and adult murine trigeminal ganglion (TG) neurons using the Illumina platform for cDNA sequencing (RNA-seq). RNA-seq analysis of wild-type virus revealed that expression of the genome mostly followed the previously established kinetics, validating the method, while highlighting variations in gene expression within individual kinetic classes. The accumulation of immediate early transcripts differed between MRC5 cells and neurons, with a greater abundance in neurons. Analysis of a mutant defective in all five IE genes (d109) showed dysregulated genome-wide low-level transcription that was more highly attenuated in MRC5 cells than in TG neurons. Furthermore, a subset of genes in d109 was more abundantly expressed over time in neurons. While the majority of the viral genome became relatively quiescent, the latency-associated transcript was specifically upregulated. Unexpectedly, other genes within repeat regions of the genome, as well as the unique genes just adjacent the repeat regions, also remained relatively active in neurons. The relative permissiveness of TG neurons to viral gene expression near the joint region is likely significant during the establishment and reactivation of latency. During productive infection, the genes of HSV-1 are transcribed in an ordered cascade. HSV can also establish a more quiescent or latent infection in peripheral neurons. HSV mutants defective in multiple immediate early (IE) genes establish a quiescent infection that models aspects of latency in vivo. We simultaneously quantified the expression of all the HSV genes in nonneuronal and neuronal cells by RNA-seq analysis. The results for productive infection shed further light on the nature of genes and promoters of different kinetic classes. In quiescent infection, there was greater transcription across the genome in neurons than in nonneuronal cells. In particular, the transcription of the latency-associated transcript (LAT), IE genes, and genes in the unique regions adjacent to the repeats persisted in neurons. The relative activity of this region of the genome in the absence of viral activators suggests a more dynamic state for quiescent genomes persisting in neurons. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Pernicious plans revealed: Plasmodium falciparum genome wide expression analysis.
Llinás, Manuel; DeRisi, Joseph L
2004-08-01
The asexual intraerythrocytic developmental cycle (IDC) of Plasmodium falciparum is responsible for the majority of the clinical manifestations of malaria in humans. Although malaria has been studied for over a century, the elucidation of the full genome sequence of P. falciparum has now allowed for in-depth studies of gene expression throughout the entire intraerythrocytic stage. As the mainstays of anti-malarial chemotherapy become increasingly ineffective, we need a deeper understanding of fundamental plasmodial bioregulatory mechanisms to successfully subvert them. Recent gene expression studies have begun to examine different aspects of the IDC and are providing key insights into the basic mechanisms of Plasmodium gene regulation and are helping to define gene functions. However, to date, no transcription factor has been fully characterized from Plasmodium and the definitive identification of cis-acting regulatory elements along with their corresponding trans-acting partners is still lacking. The characterization of the transcriptome of P. falciparum is the first major step towards the understanding of the genome wide regulation of gene expression in this parasite. IDC expression data for almost every gene in the P. falciparum genome can now be publicly queried at and. The results of these studies suggest promising leads for identifying novel targets for anti-malarial therapeutics and vaccines in addition to providing a solid foundation for the ongoing elucidation of plasmodial gene expression.
Genome-wide characterization of the Pectate Lyase-like (PLL) genes in Brassica rapa.
Jiang, Jingjing; Yao, Lina; Miao, Ying; Cao, Jiashu
2013-11-01
Pectate lyases (PL) depolymerize demethylated pectin (pectate, EC 4.2.2.2) by catalyzing the eliminative cleavage of α-1,4-glycosidic linked galacturonan. Pectate Lyase-like (PLL) genes are one of the largest and most complex families in plants. However, studies on the phylogeny, gene structure, and expression of PLL genes are limited. To understand the potential functions of PLL genes in plants, we characterized their intron-exon structure, phylogenetic relationships, and protein structures, and measured their expression patterns in various tissues, specifically the reproductive tissues in Brassica rapa. Sequence alignments revealed two characteristic motifs in PLL genes. The chromosome location analysis indicated that 18 of the 46 PLL genes were located in the least fractionated sub-genome (LF) of B. rapa, while 16 were located in the medium fractionated sub-genome (MF1) and 12 in the more fractionated sub-genome (MF2). Quantitative RT-PCR analysis showed that BrPLL genes were expressed in various tissues, with most of them being expressed in flowers. Detailed qRT-PCR analysis identified 11 pollen specific PLL genes and several other genes with unique spatial expression patterns. In addition, some duplicated genes showed similar expression patterns. The phylogenetic analysis identified three PLL gene subfamilies in plants, among which subfamily II might have evolved from gene neofunctionalization or subfunctionalization. Therefore, this study opens the possibility for exploring the roles of PLL genes during plant development.
Coate, Jeremy E; Doyle, Jeff J
2010-01-01
Evolutionary biologists are increasingly comparing gene expression patterns across species. Due to the way in which expression assays are normalized, such studies provide no direct information about expression per gene copy (dosage responses) or per cell and can give a misleading picture of genes that are differentially expressed. We describe an assay for estimating relative expression per cell. When used in conjunction with transcript profiling data, it is possible to compare the sizes of whole transcriptomes, which in turn makes it possible to compare expression per cell for each gene in the transcript profiling data set. We applied this approach, using quantitative reverse transcriptase-polymerase chain reaction and high throughput RNA sequencing, to a recently formed allopolyploid and showed that its leaf transcriptome was approximately 1.4-fold larger than either progenitor transcriptome (70% of the sum of the progenitor transcriptomes). In contrast, the allopolyploid genome is 94.3% as large as the sum of its progenitor genomes and retains > or =93.5% of the sum of its progenitor gene complements. Thus, "transcriptome downsizing" is greater than genome downsizing. Using this transcriptome size estimate, we inferred dosage responses for several thousand genes and showed that the majority exhibit partial dosage compensation. Homoeologue silencing is nonrandomly distributed across dosage responses, with genes showing extreme responses in either direction significantly more likely to have a silent homoeologue. This experimental approach will add value to transcript profiling experiments involving interspecies and interploidy comparisons by converting expression per transcriptome to expression per genome, eliminating the need for assumptions about transcriptome size.
Jiang, Jinjin; Wang, Yue; Zhu, Bao; Fang, Tingting; Fang, Yujie; Wang, Youping
2015-01-27
Brassica includes many successfully cultivated crop species of polyploid origin, either by ancestral genome triplication or by hybridization between two diploid progenitors, displaying complex repetitive sequences and transposons. The U's triangle, which consists of three diploids and three amphidiploids, is optimal for the analysis of complicated genomes after polyploidization. Next-generation sequencing enables the transcriptome profiling of polyploids on a global scale. We examined the gene expression patterns of three diploids (Brassica rapa, B. nigra, and B. oleracea) and three amphidiploids (B. napus, B. juncea, and B. carinata) via digital gene expression analysis. In total, the libraries generated between 5.7 and 6.1 million raw reads, and the clean tags of each library were mapped to 18547-21995 genes of B. rapa genome. The unambiguous tag-mapped genes in the libraries were compared. Moreover, the majority of differentially expressed genes (DEGs) were explored among diploids as well as between diploids and amphidiploids. Gene ontological analysis was performed to functionally categorize these DEGs into different classes. The Kyoto Encyclopedia of Genes and Genomes analysis was performed to assign these DEGs into approximately 120 pathways, among which the metabolic pathway, biosynthesis of secondary metabolites, and peroxisomal pathway were enriched. The non-additive genes in Brassica amphidiploids were analyzed, and the results indicated that orthologous genes in polyploids are frequently expressed in a non-additive pattern. Methyltransferase genes showed differential expression pattern in Brassica species. Our results provided an understanding of the transcriptome complexity of natural Brassica species. The gene expression changes in diploids and allopolyploids may help elucidate the morphological and physiological differences among Brassica species.
Díaz-Castillo, Carlos; Xia, Xiao-Qin; Ranz, José M.
2012-01-01
Why gene order is conserved over long evolutionary timespans remains elusive. A common interpretation is that gene order conservation might reflect the existence of functional constraints that are important for organismal performance. Alteration of the integrity of genomic regions, and therefore of those constraints, would result in detrimental effects. This notion seems especially plausible in those genomes that can easily accommodate gene reshuffling via chromosomal inversions since genomic regions free of constraints are likely to have been disrupted in one or more lineages. Nevertheless, no empirical test has been performed to this notion. Here, we disrupt one of the largest conserved genomic regions of the Drosophila genome by chromosome engineering and examine the phenotypic consequences derived from such disruption. The targeted region exhibits multiple patterns of functional enrichment suggestive of the presence of constraints. The carriers of the disrupted collinear block show no defects in their viability, fertility, and parameters of general homeostasis, although their odorant perception is altered. This change in odorant perception does not correlate with modifications of the level of expression and sex bias of the genes within the genomic region disrupted. Our results indicate that even in highly rearranged genomes, like those of Diptera, unusually high levels of gene order conservation cannot be systematically attributed to functional constraints, which raises the possibility that other mechanisms can be in place and therefore the underpinnings of the maintenance of gene organization might be more diverse than previously thought. PMID:22319453
Liu, Zhijing; Feng, Qiang; Sun, Pengpeng; Lu, Yan; Yang, Minlan; Zhang, Xiaowei; Jin, Xiangshu; Li, Yulin; Lu, Shi-Jiang; Quan, Chengshi
2017-12-01
To investigate the role of DNA methylation during erythrocyte production by human embryonic stem cells (hESCs). We employed an erythroid differentiation model from hESCs, and then tracked the genome-wide DNA methylation maps and gene expression patterns through an Infinium HumanMethylation450K BeadChip and an Ilumina Human HT-12 v4 Expression Beadchip, respectively. A negative correlation between DNA methylation and gene expression was substantially enriched during the later differentiation stage and was present in both the promoter and the gene body. Moreover, erythropoietic genes with differentially methylated CpG sites that were primarily enriched in nonisland regions were upregulated, and demethylation of their gene bodies was associated with the presence of enhancers and DNase I hypersensitive sites. Finally, the components of JAK-STAT-NF-κB signaling were DNA hypomethylated and upregulated, which targets the key genes for erythropoiesis. Erythroid lineage commitment by hESCs requires genome-wide DNA methylation modifications to remodel gene expression dynamics.
Loots, Gabriela G
2008-01-01
Despite remarkable recent advances in genomics that have enabled us to identify most of the genes in the human genome, comparable efforts to define transcriptional cis-regulatory elements that control gene expression are lagging behind. The difficulty of this task stems from two equally important problems: our knowledge of how regulatory elements are encoded in genomes remains elementary, and there is a vast genomic search space for regulatory elements, since most of mammalian genomes are noncoding. Comparative genomic approaches are having a remarkable impact on the study of transcriptional regulation in eukaryotes and currently represent the most efficient and reliable methods of predicting noncoding sequences likely to control the patterns of gene expression. By subjecting eukaryotic genomic sequences to computational comparisons and subsequent experimentation, we are inching our way toward a more comprehensive catalog of common regulatory motifs that lie behind fundamental biological processes. We are still far from comprehending how the transcriptional regulatory code is encrypted in the human genome and providing an initial global view of regulatory gene networks, but collectively, the continued development of comparative and experimental approaches will rapidly expand our knowledge of the transcriptional regulome.
Conifer genomics and adaptation: at the crossroads of genetic diversity and genome function.
Prunier, Julien; Verta, Jukka-Pekka; MacKay, John J
2016-01-01
Conifers have been understudied at the genomic level despite their worldwide ecological and economic importance but the situation is rapidly changing with the development of next generation sequencing (NGS) technologies. With NGS, genomics research has simultaneously gained in speed, magnitude and scope. In just a few years, genomes of 20-24 gigabases have been sequenced for several conifers, with several others expected in the near future. Biological insights have resulted from recent sequencing initiatives as well as genetic mapping, gene expression profiling and gene discovery research over nearly two decades. We review the knowledge arising from conifer genomics research emphasizing genome evolution and the genomic basis of adaptation, and outline emerging questions and knowledge gaps. We discuss future directions in three areas with potential inputs from NGS technologies: the evolutionary impacts of adaptation in conifers based on the adaptation-by-speciation model; the contributions of genetic variability of gene expression in adaptation; and the development of a broader understanding of genetic diversity and its impacts on genome function. These research directions promise to sustain research aimed at addressing the emerging challenges of adaptation that face conifer trees. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Núñez-Hernández, Fernando; Pérez, Lester J; Vera, Gonzalo; Córdoba, Sarai; Segalés, Joaquim; Sánchez, Armand; Núñez, José I
2015-05-01
Porcine circovirus type 2 (PCV2) is a ssDNA virus causing PCV2-systemic disease (PCV2-SD), one of the most important diseases in swine. MicroRNAs (miRNAs) are a new class of small non-coding RNAs that regulate gene expression post-transcriptionally. Viral miRNAs have recently been described and the number of viral miRNAs has been increasing in the past few years. In this study, small RNA libraries were constructed from two tissues of subclinically PCV2 infected pigs to explore if PCV2 can encode viral miRNAs. The deep sequencing data revealed that PCV2 does not express miRNAs in an in vivo subclinical infection.
Parvovirus-derived endogenous viral elements in two South American rodent genomes.
Arriagada, Gloria; Gifford, Robert J
2014-10-01
We describe endogenous viral elements (EVEs) derived from parvoviruses (family Parvoviridae) in the genomes of the long-tailed chinchilla (Chinchilla lanigera) and the degu (Octodon degus). The novel EVEs include dependovirus-related elements and representatives of a clearly distinct parvovirus lineage that also has endogenous representatives in marsupial genomes. In the degu, one dependovirus-derived EVE was found to carry an intact reading frame and was differentially expressed in vivo, with increased expression in the liver. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Zong, Li; Qin, Yanli; Jia, Haodi; Ye, Lei; Wang, Yongxiang; Zhang, Jiming; Wands, Jack R; Tong, Shuping; Li, Jisu
2017-05-01
Hepatitis B virus (HBV) transcribes two subsets of 3.5-kb RNAs: precore RNA for hepatitis B e antigen (HBeAg) expression, and pregenomic RNA for core and P protein translation as well as genome replication. HBeAg expression could be prevented by mutations in the precore region, while an upstream open reading frame (uORF) has been proposed as a negative regulator of core protein translation. We employed replication competent HBV DNA constructs and transient transfection experiments in Huh7 cells to verify the uORF effect and to explore the alternative function of precore RNA. Optimized Kozak sequence for the uORF or extra ATG codons as present in some HBV genotypes reduced core protein expression. G1896A nonsense mutation promoted more efficient core protein expression than mutated precore ATG, while a +1 frameshift mutation was ineffective. In conclusion, various HBeAg-negative precore mutations and mutations affecting uORF differentially regulate core protein expression and genome replication. Copyright © 2017 Elsevier Inc. All rights reserved.
Genome-wide expression profiling in pediatric septic shock
Wong, Hector R.
2013-01-01
For nearly a decade, our research group has had the privilege of developing and mining a multi-center, microarray-based, genome-wide expression database of critically ill children (≤ 10 years of age) with septic shock. Using bioinformatic and systems biology approaches, the expression data generated through this discovery-oriented, exploratory approach have been leveraged for a variety of objectives, which will be reviewed. Fundamental observations include wide spread repression of gene programs corresponding to the adaptive immune system, and biologically significant differential patterns of gene expression across developmental age groups. The data have also identified gene expression-based subclasses of pediatric septic shock having clinically relevant phenotypic differences. The data have also been leveraged for the discovery of novel therapeutic targets, and for the discovery and development of novel stratification and diagnostic biomarkers. Almost a decade of genome-wide expression profiling in pediatric septic shock is now demonstrating tangible results. The studies have progressed from an initial discovery-oriented and exploratory phase, to a new phase where the data are being translated and applied to address several areas of clinical need. PMID:23329198
Pritchard, Victoria L; Viitaniemi, Heidi M; McCairns, R J Scott; Merilä, Juha; Nikinmaa, Mikko; Primmer, Craig R; Leder, Erica H
2017-01-05
Much adaptive evolutionary change is underlain by mutational variation in regions of the genome that regulate gene expression rather than in the coding regions of the genes themselves. An understanding of the role of gene expression variation in facilitating local adaptation will be aided by an understanding of underlying regulatory networks. Here, we characterize the genetic architecture of gene expression variation in the threespine stickleback (Gasterosteus aculeatus), an important model in the study of adaptive evolution. We collected transcriptomic and genomic data from 60 half-sib families using an expression microarray and genotyping-by-sequencing, and located expression quantitative trait loci (eQTL) underlying the variation in gene expression in liver tissue using an interval mapping approach. We identified eQTL for several thousand expression traits. Expression was influenced by polymorphism in both cis- and trans-regulatory regions. Trans-eQTL clustered into hotspots. We did not identify master transcriptional regulators in hotspot locations: rather, the presence of hotspots may be driven by complex interactions between multiple transcription factors. One observed hotspot colocated with a QTL recently found to underlie salinity tolerance in the threespine stickleback. However, most other observed hotspots did not colocate with regions of the genome known to be involved in adaptive divergence between marine and freshwater habitats. Copyright © 2017 Pritchard et al.
Pritchard, Victoria L.; Viitaniemi, Heidi M.; McCairns, R. J. Scott; Merilä, Juha; Nikinmaa, Mikko; Primmer, Craig R.; Leder, Erica H.
2016-01-01
Much adaptive evolutionary change is underlain by mutational variation in regions of the genome that regulate gene expression rather than in the coding regions of the genes themselves. An understanding of the role of gene expression variation in facilitating local adaptation will be aided by an understanding of underlying regulatory networks. Here, we characterize the genetic architecture of gene expression variation in the threespine stickleback (Gasterosteus aculeatus), an important model in the study of adaptive evolution. We collected transcriptomic and genomic data from 60 half-sib families using an expression microarray and genotyping-by-sequencing, and located expression quantitative trait loci (eQTL) underlying the variation in gene expression in liver tissue using an interval mapping approach. We identified eQTL for several thousand expression traits. Expression was influenced by polymorphism in both cis- and trans-regulatory regions. Trans-eQTL clustered into hotspots. We did not identify master transcriptional regulators in hotspot locations: rather, the presence of hotspots may be driven by complex interactions between multiple transcription factors. One observed hotspot colocated with a QTL recently found to underlie salinity tolerance in the threespine stickleback. However, most other observed hotspots did not colocate with regions of the genome known to be involved in adaptive divergence between marine and freshwater habitats. PMID:27836907
Contribution of transposable elements in the plant's genome.
Sahebi, Mahbod; Hanafi, Mohamed M; van Wijnen, Andre J; Rice, David; Rafii, M Y; Azizi, Parisa; Osman, Mohamad; Taheri, Sima; Bakar, Mohd Faizal Abu; Isa, Mohd Noor Mat; Noor, Yusuf Muhammad
2018-07-30
Plants maintain extensive growth flexibility under different environmental conditions, allowing them to continuously and rapidly adapt to alterations in their environment. A large portion of many plant genomes consists of transposable elements (TEs) that create new genetic variations within plant species. Different types of mutations may be created by TEs in plants. Many TEs can avoid the host's defense mechanisms and survive alterations in transposition activity, internal sequence and target site. Thus, plant genomes are expected to utilize a variety of mechanisms to tolerate TEs that are near or within genes. TEs affect the expression of not only nearby genes but also unlinked inserted genes. TEs can create new promoters, leading to novel expression patterns or alternative coding regions to generate alternate transcripts in plant species. TEs can also provide novel cis-acting regulatory elements that act as enhancers or inserts within original enhancers that are required for transcription. Thus, the regulation of plant gene expression is strongly managed by the insertion of TEs into nearby genes. TEs can also lead to chromatin modifications and thereby affect gene expression in plants. TEs are able to generate new genes and modify existing gene structures by duplicating, mobilizing and recombining gene fragments. They can also facilitate cellular functions by sharing their transposase-coding regions. Hence, TE insertions can not only act as simple mutagens but can also alter the elementary functions of the plant genome. Here, we review recent discoveries concerning the contribution of TEs to gene expression in plant genomes and discuss the different mechanisms by which TEs can affect plant gene expression and reduce host defense mechanisms. Copyright © 2018 Elsevier B.V. All rights reserved.
James M. Slavicek
1991-01-01
Genomic expression of the Lymantriu dispar multinucleocapsid nuclear polyhedrosis virus (LdMNPV) was studied. Viral specific transcripts expressed in cell culture at various times from 2 through 72 h postinfection were identified and their genomic origins mapped through Northern analysis. Sixty-five distinct transcripts were identified in this...
Molecular Biology In Young Women With Breast Cancer: From Tumor Gene Expression To DNA Mutations.
Gómez-Flores-Ramos, Liliana; Castro-Sánchez, Andrea; Peña-Curiel, Omar; Mohar-Betancourt, Alejandro
2017-01-01
Young women with breast cancer (YWBC) represent roughly 15% of breast cancer (BC) cases in Latin America and other developing regions. Breast tumors occurring at an early age are more aggressive and have an overall worse prognosis compared to breast tumors in postmenopausal women. The expression of relevant proliferation biomarkers such as endocrine receptors and human epidermal growth factor receptor 2 appears to be unique in YWBC. Moreover, histopathological, molecular, genetic, and genomic studies have shown that YWBC exhibit a higher frequency of aggressive subtypes, differential tumor gene expression, increased genetic susceptibility, and specific genomic signatures, compared to older women with BC. This article reviews the current knowledge on tumor biology and genomic signatures in YWBC.
Ultraconserved regions encoding ncRNAs are altered in human leukemias and carcinomas.
Calin, George A; Liu, Chang-gong; Ferracin, Manuela; Hyslop, Terry; Spizzo, Riccardo; Sevignani, Cinzia; Fabbri, Muller; Cimmino, Amelia; Lee, Eun Joo; Wojcik, Sylwia E; Shimizu, Masayoshi; Tili, Esmerina; Rossi, Simona; Taccioli, Cristian; Pichiorri, Flavia; Liu, Xiuping; Zupo, Simona; Herlea, Vlad; Gramantieri, Laura; Lanza, Giovanni; Alder, Hansjuerg; Rassenti, Laura; Volinia, Stefano; Schmittgen, Thomas D; Kipps, Thomas J; Negrini, Massimo; Croce, Carlo M
2007-09-01
Noncoding RNA (ncRNA) transcripts are thought to be involved in human tumorigenesis. We report that a large fraction of genomic ultraconserved regions (UCRs) encode a particular set of ncRNAs whose expression is altered in human cancers. Genome-wide profiling revealed that UCRs have distinct signatures in human leukemias and carcinomas. UCRs are frequently located at fragile sites and genomic regions involved in cancers. We identified certain UCRs whose expression may be regulated by microRNAs abnormally expressed in human chronic lymphocytic leukemia, and we proved that the inhibition of an overexpressed UCR induces apoptosis in colon cancer cells. Our findings argue that ncRNAs and interaction between noncoding genes are involved in tumorigenesis to a greater extent than previously thought.
Rondon, Michelle R.; Raffel, Sandra J.; Goodman, Robert M.; Handelsman, Jo
1999-01-01
As the study of microbes moves into the era of functional genomics, there is an increasing need for molecular tools for analysis of a wide diversity of microorganisms. Currently, biological study of many prokaryotes of agricultural, medical, and fundamental scientific interest is limited by the lack of adequate genetic tools. We report the application of the bacterial artificial chromosome (BAC) vector to prokaryotic biology as a powerful approach to address this need. We constructed a BAC library in Escherichia coli from genomic DNA of the Gram-positive bacterium Bacillus cereus. This library provides 5.75-fold coverage of the B. cereus genome, with an average insert size of 98 kb. To determine the extent of heterologous expression of B. cereus genes in the library, we screened it for expression of several B. cereus activities in the E. coli host. Clones expressing 6 of 10 activities tested were identified in the library, namely, ampicillin resistance, zwittermicin A resistance, esculin hydrolysis, hemolysis, orange pigment production, and lecithinase activity. We analyzed selected BAC clones genetically to identify rapidly specific B. cereus loci. These results suggest that BAC libraries will provide a powerful approach for studying gene expression from diverse prokaryotes. PMID:10339608
Shanley, Thomas P; Cvijanovich, Natalie; Lin, Richard; Allen, Geoffrey L; Thomas, Neal J; Doctor, Allan; Kalyanaraman, Meena; Tofil, Nancy M; Penfil, Scott; Monaco, Marie; Odoms, Kelli; Barnes, Michael; Sakthivel, Bhuvaneswari; Aronow, Bruce J; Wong, Hector R
2007-01-01
We have conducted longitudinal studies focused on the expression profiles of signaling pathways and gene networks in children with septic shock. Genome-level expression profiles were generated from whole blood-derived RNA of children with septic shock (n = 30) corresponding to day one and day three of septic shock, respectively. Based on sequential statistical and expression filters, day one and day three of septic shock were characterized by differential regulation of 2,142 and 2,504 gene probes, respectively, relative to controls (n = 15). Venn analysis demonstrated 239 unique genes in the day one dataset, 598 unique genes in the day three dataset, and 1,906 genes common to both datasets. Functional analyses demonstrated time-dependent, differential regulation of genes involved in multiple signaling pathways and gene networks primarily related to immunity and inflammation. Notably, multiple and distinct gene networks involving T cell- and MHC antigen-related biology were persistently downregulated on both day one and day three. Further analyses demonstrated large scale, persistent downregulation of genes corresponding to functional annotations related to zinc homeostasis. These data represent the largest reported cohort of patients with septic shock subjected to longitudinal genome-level expression profiling. The data further advance our genome-level understanding of pediatric septic shock and support novel hypotheses. PMID:17932561
Shivaraj, S. M.; Deshmukh, Rupesh K.; Rai, Rhitu; Bélanger, Richard; Agrawal, Pawan K.; Dash, Prasanta K.
2017-01-01
Membrane intrinsic proteins (MIPs) form transmembrane channels and facilitate transport of myriad substrates across the cell membrane in many organisms. Majority of plant MIPs have water transporting ability and are commonly referred as aquaporins (AQPs). In the present study, we identified aquaporin coding genes in flax by genome-wide analysis, their structure, function and expression pattern by pan-genome exploration. Cross-genera phylogenetic analysis with known aquaporins from rice, arabidopsis, and poplar showed five subgroups of flax aquaporins representing 16 plasma membrane intrinsic proteins (PIPs), 17 tonoplast intrinsic proteins (TIPs), 13 NOD26-like intrinsic proteins (NIPs), 2 small basic intrinsic proteins (SIPs), and 3 uncharacterized intrinsic proteins (XIPs). Amongst aquaporins, PIPs contained hydrophilic aromatic arginine (ar/R) selective filter but TIP, NIP, SIP and XIP subfamilies mostly contained hydrophobic ar/R selective filter. Analysis of RNA-seq and microarray data revealed high expression of PIPs in multiple tissues, low expression of NIPs, and seed specific expression of TIP3 in flax. Exploration of aquaporin homologs in three closely related Linum species bienne, grandiflorum and leonii revealed presence of 49, 39 and 19 AQPs, respectively. The genome-wide identification of aquaporins, first in flax, provides insight to elucidate their physiological and developmental roles in flax. PMID:28447607
Shivaraj, S M; Deshmukh, Rupesh K; Rai, Rhitu; Bélanger, Richard; Agrawal, Pawan K; Dash, Prasanta K
2017-04-27
Membrane intrinsic proteins (MIPs) form transmembrane channels and facilitate transport of myriad substrates across the cell membrane in many organisms. Majority of plant MIPs have water transporting ability and are commonly referred as aquaporins (AQPs). In the present study, we identified aquaporin coding genes in flax by genome-wide analysis, their structure, function and expression pattern by pan-genome exploration. Cross-genera phylogenetic analysis with known aquaporins from rice, arabidopsis, and poplar showed five subgroups of flax aquaporins representing 16 plasma membrane intrinsic proteins (PIPs), 17 tonoplast intrinsic proteins (TIPs), 13 NOD26-like intrinsic proteins (NIPs), 2 small basic intrinsic proteins (SIPs), and 3 uncharacterized intrinsic proteins (XIPs). Amongst aquaporins, PIPs contained hydrophilic aromatic arginine (ar/R) selective filter but TIP, NIP, SIP and XIP subfamilies mostly contained hydrophobic ar/R selective filter. Analysis of RNA-seq and microarray data revealed high expression of PIPs in multiple tissues, low expression of NIPs, and seed specific expression of TIP3 in flax. Exploration of aquaporin homologs in three closely related Linum species bienne, grandiflorum and leonii revealed presence of 49, 39 and 19 AQPs, respectively. The genome-wide identification of aquaporins, first in flax, provides insight to elucidate their physiological and developmental roles in flax.
Gervais, Julie; Plissonneau, Clémence; Linglin, Juliette; Meyer, Michel; Labadie, Karine; Cruaud, Corinne; Fudal, Isabelle; Rouxel, Thierry; Balesdent, Marie-Hélène
2017-10-01
Leptosphaeria maculans, the causal agent of stem canker disease, colonizes oilseed rape (Brassica napus) in two stages: a short and early colonization stage corresponding to cotyledon or leaf colonization, and a late colonization stage during which the fungus colonizes systemically and symptomlessly the plant during several months before stem canker appears. To date, the determinants of the late colonization stage are poorly understood; L. maculans may either successfully escape plant defences, leading to stem canker development, or the plant may develop an 'adult-stage' resistance reducing canker incidence. To obtain an insight into these determinants, we performed an RNA-sequencing (RNA-seq) pilot project comparing fungal gene expression in infected cotyledons and in symptomless or necrotic stems. Despite the low fraction of fungal material in infected stems, sufficient fungal transcripts were detected and a large number of fungal genes were expressed, thus validating the feasibility of the approach. Our analysis showed that all avirulence genes previously identified are under-expressed during stem colonization compared with cotyledon colonization. A validation RNA-seq experiment was then performed to investigate the expression of candidate effector genes during systemic colonization. Three hundred and seven 'late' effector candidates, under-expressed in the early colonization stage and over-expressed in the infected stems, were identified. Finally, our analysis revealed a link between the regulation of expression of effectors and their genomic location: the 'late' effector candidates, putatively involved in systemic colonization, are located in gene-rich genomic regions, whereas the 'early' effector genes, over-expressed in the early colonization stage, are located in gene-poor regions of the genome. © 2016 BSPP AND JOHN WILEY & SONS LTD.
Decoding transcriptional enhancers: Evolving from annotation to functional interpretation
Engel, Krysta L.; Mackiewicz, Mark; Hardigan, Andrew A.; Myers, Richard M.; Savic, Daniel
2016-01-01
Deciphering the intricate molecular processes that orchestrate the spatial and temporal regulation of genes has become an increasingly major focus of biological research. The differential expression of genes by diverse cell types with a common genome is a hallmark of complex cellular functions, as well as the basis for multicellular life. Importantly, a more coherent understanding of gene regulation is critical for defining developmental processes, evolutionary principles and disease etiologies. Here we present our current understanding of gene regulation by focusing on the role of enhancer elements in these complex processes. Although functional genomic methods have provided considerable advances to our understanding of gene regulation, these assays, which are usually performed on a genome-wide scale, typically provide correlative observations that lack functional interpretation. Recent innovations in genome editing technologies have placed gene regulatory studies at an exciting crossroads, as systematic, functional evaluation of enhancers and other transcriptional regulatory elements can now be performed in a coordinated, high-throughput manner across the entire genome. This review provides insights on transcriptional enhancer function, their role in development and disease, and catalogues experimental tools commonly used to study these elements. Additionally, we discuss the crucial role of novel techniques in deciphering the complex gene regulatory landscape and how these studies will shape future research. PMID:27224938
Decoding transcriptional enhancers: Evolving from annotation to functional interpretation.
Engel, Krysta L; Mackiewicz, Mark; Hardigan, Andrew A; Myers, Richard M; Savic, Daniel
2016-09-01
Deciphering the intricate molecular processes that orchestrate the spatial and temporal regulation of genes has become an increasingly major focus of biological research. The differential expression of genes by diverse cell types with a common genome is a hallmark of complex cellular functions, as well as the basis for multicellular life. Importantly, a more coherent understanding of gene regulation is critical for defining developmental processes, evolutionary principles and disease etiologies. Here we present our current understanding of gene regulation by focusing on the role of enhancer elements in these complex processes. Although functional genomic methods have provided considerable advances to our understanding of gene regulation, these assays, which are usually performed on a genome-wide scale, typically provide correlative observations that lack functional interpretation. Recent innovations in genome editing technologies have placed gene regulatory studies at an exciting crossroads, as systematic, functional evaluation of enhancers and other transcriptional regulatory elements can now be performed in a coordinated, high-throughput manner across the entire genome. This review provides insights on transcriptional enhancer function, their role in development and disease, and catalogues experimental tools commonly used to study these elements. Additionally, we discuss the crucial role of novel techniques in deciphering the complex gene regulatory landscape and how these studies will shape future research. Copyright © 2016 Elsevier Ltd. All rights reserved.
Feltus, F A; Singh, H P; Lohithaswa, H C; Schulze, S R; Silva, T D; Paterson, A H
2006-04-01
Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species.
Feltus, F.A.; Singh, H.P.; Lohithaswa, H.C.; Schulze, S.R.; Silva, T.D.; Paterson, A.H.
2006-01-01
Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species. PMID:16607031
Particle bombardment - mediated gene transfer and GFP transient expression in Seteria viridis.
Mookkan, Muruganantham
2018-04-03
Setaria viridis is one of the most important model grasses in studying monocot plant biology. Transient gene expression study is a very important tool in plant biotechnology, functional genomics, and CRISPR-Cas9 genome editing technology via particle bombardment. In this study, a particle bombardment-mediated protocol was developed to introduce DNA into Setaria viridis in vitro leaf explants. In addition, physical and biological parameters, such as helium pressure, distance from stopping screen to the target tissues, DNA concentration, and number of bombardments, were tested and optimized. Optimum concentration of transient GFP expression was achieved using 1.5 ug plasmid DNA with 0.6 mm gold particles and 6 cm bombardment distance, using 1,100 psi. Doubling the bombardment instances provides the maximum number of foci of transient GFP expression. This simple protocol will be helpful for genomics studies in the S. viridis monocot model.
Proteogenomic characterization of human colon and rectal cancer
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Bing; Wang, Jing; Wang, Xiaojing
2014-09-18
We analyzed proteomes of colon and rectal tumors previously characterized by the Cancer Genome Atlas (TCGA) and performed integrated proteogenomic analyses. Protein sequence variants encoded by somatic genomic variations displayed reduced expression compared to protein variants encoded by germline variations. mRNA transcript abundance did not reliably predict protein expression differences between tumors. Proteomics identified five protein expression subtypes, two of which were associated with the TCGA "MSI/CIMP" transcriptional subtype, but had distinct mutation and methylation patterns and associated with different clinical outcomes. Although CNAs showed strong cis- and trans-effects on mRNA expression, relatively few of these extend to the proteinmore » level. Thus, proteomics data enabled prioritization of candidate driver genes. Our analyses identified HNF4A, a novel candidate driver gene in tumors with chromosome 20q amplifications. Integrated proteogenomic analysis provides functional context to interpret genomic abnormalities and affords novel insights into cancer biology.« less
Bowman, Megan J.; Park, Wonkeun; Bauer, Philip J.; Udall, Joshua A.; Page, Justin T.; Raney, Joshua; Scheffler, Brian E.; Jones, Don. C.; Campbell, B. Todd
2013-01-01
An RNA-Seq experiment was performed using field grown well-watered and naturally rain fed cotton plants to identify differentially expressed transcripts under water-deficit stress. Our work constitutes the first application of the newly published diploid D5 Gossypium raimondii sequence in the study of tetraploid AD1 upland cotton RNA-seq transcriptome analysis. A total of 1,530 transcripts were differentially expressed between well-watered and water-deficit stressed root tissues, in patterns that confirm the accuracy of this technique for future studies in cotton genomics. Additionally, putative sequence based genome localization of differentially expressed transcripts detected A2 genome specific gene expression under water-deficit stress. These data will facilitate efforts to understand the complex responses governing transcriptomic regulatory mechanisms and to identify candidate genes that may benefit applied plant breeding programs. PMID:24324815
Cytoskeleton structure and total methylation of mouse cardiac and lung tissue during space flight.
Ogneva, Irina V; Loktev, Sergey S; Sychev, Vladimir N
2018-01-01
The purpose of this work was to evaluate the protein and mRNA expression levels of multiple cytoskeletal proteins in the cardiac and lung tissue of mice that were euthanized onboard the United States Orbital Segment of the International Space Station 37 days after the start of the SpaceX-4 mission (September 2014, USA). The results showed no changes in the cytoskeletal protein content in the cardiac and lung tissue of the mice, but there were significant changes in the mRNA expression levels of the associated genes, which may be due to an increase in total genome methylation. The mRNA expression levels of DNA methylases, the cytosine demethylases Tet1 and Tet3, histone acetylase and histone deacetylase did not change, and the mRNA expression level of cytosine demethylase Tet2 was significantly decreased.
Cytoskeleton structure and total methylation of mouse cardiac and lung tissue during space flight
Loktev, Sergey S.; Sychev, Vladimir N.
2018-01-01
The purpose of this work was to evaluate the protein and mRNA expression levels of multiple cytoskeletal proteins in the cardiac and lung tissue of mice that were euthanized onboard the United States Orbital Segment of the International Space Station 37 days after the start of the SpaceX-4 mission (September 2014, USA). The results showed no changes in the cytoskeletal protein content in the cardiac and lung tissue of the mice, but there were significant changes in the mRNA expression levels of the associated genes, which may be due to an increase in total genome methylation. The mRNA expression levels of DNA methylases, the cytosine demethylases Tet1 and Tet3, histone acetylase and histone deacetylase did not change, and the mRNA expression level of cytosine demethylase Tet2 was significantly decreased. PMID:29768411
Reflections on the US FDA's Warning on Direct-to-Consumer Genetic Testing.
Yim, Seon-Hee; Chung, Yeun-Jun
2014-12-01
In November 2013, the US Food and Drug Administration (FDA) sent a warning letter to 23andMe, Inc. and ordered the company to discontinue marketing of the 23andMe Personal Genome Service (PGS) until it receives FDA marketing authorization for the device. The FDA considers the PGS as an unclassified medical device, which requires premarket approval or de novo classification. Opponents of the FDA's action expressed their concerns, saying that the FDA is overcautious and paternalistic, which violates consumers' rights and might stifle the consumer genomics field itself, and insisted that the agency should not restrict direct-to-consumer (DTC) genomic testing without empirical evidence of harm. Proponents support the agency's action as protection of consumers from potentially invalid and almost useless information. This action was also significant, since it reflected the FDA's attitude towards medical application of next-generation sequencing techniques. In this review, we followed up on the FDA-23andMe incident and evaluated the problems and prospects for DTC genetic testing.
CpG islands: algorithms and applications in methylation studies.
Zhao, Zhongming; Han, Leng
2009-05-15
Methylation occurs frequently at 5'-cytosine of the CpG dinucleotides in vertebrate genomes; however, this epigenetic feature is rarely observed in CpG islands (CGIs) or CpG clusters in the promoter regions of genes. Aberrant methylation of the promoter-associated CGIs might influence gene expression and cause carcinogenesis. Because of the functional importance, multiple algorithms have been available for identifying CGIs in a genome or a sequence. They can be categorized into the traditional algorithms (e.g., Gardiner-Garden and Frommer (1987), Takai and Jones (2002), and CpGPRoD (2002)) or statistical property based algorithms (CpGcluster (2006) and CG cluster (2007)). We reviewed the features of these algorithms and evaluated their performance on identifying functional CGIs using genome-wide methylation data. Moreover, identification of CGIs is an initial step in many recent studies for predicting methylation status as well as in the design of methylation detection platforms. We reviewed the benchmarks and features used in these studies.
Ingestion of gallium phosphide nanowires has no adverse effect on Drosophila tissue function.
Adolfsson, Karl; Schneider, Martina; Hammarin, Greger; Häcker, Udo; Prinz, Christelle N
2013-07-19
Engineered nanoparticles have been under increasing scrutiny in recent years. High aspect ratio nanoparticles such as carbon nanotubes and nanowires have raised safety concerns due to their geometrical similarity to asbestos fibers. III-V epitaxial semiconductor nanowires are expected to be utilized in devices such as LEDs and solar cells and will thus be available to the public. In addition, clean-room staff fabricating and characterizing the nanowires are at risk of exposure, emphasizing the importance of investigating their possible toxicity. Here we investigated the effects of gallium phosphide nanowires on the fruit fly Drosophila melanogaster. Drosophila larvae and/or adults were exposed to gallium phosphide nanowires by ingestion with food. The toxicity and tissue interaction of the nanowires was evaluated by investigating tissue distribution, activation of immune response, genome-wide gene expression, life span, fecundity and somatic mutation rates. Our results show that gallium phosphide nanowires applied through the diet are not taken up into Drosophila tissues, do not elicit a measurable immune response or changes in genome-wide gene expression and do not significantly affect life span or somatic mutation rate.
Ingestion of gallium phosphide nanowires has no adverse effect on Drosophila tissue function
NASA Astrophysics Data System (ADS)
Adolfsson, Karl; Schneider, Martina; Hammarin, Greger; Häcker, Udo; Prinz, Christelle N.
2013-07-01
Engineered nanoparticles have been under increasing scrutiny in recent years. High aspect ratio nanoparticles such as carbon nanotubes and nanowires have raised safety concerns due to their geometrical similarity to asbestos fibers. III-V epitaxial semiconductor nanowires are expected to be utilized in devices such as LEDs and solar cells and will thus be available to the public. In addition, clean-room staff fabricating and characterizing the nanowires are at risk of exposure, emphasizing the importance of investigating their possible toxicity. Here we investigated the effects of gallium phosphide nanowires on the fruit fly Drosophila melanogaster. Drosophila larvae and/or adults were exposed to gallium phosphide nanowires by ingestion with food. The toxicity and tissue interaction of the nanowires was evaluated by investigating tissue distribution, activation of immune response, genome-wide gene expression, life span, fecundity and somatic mutation rates. Our results show that gallium phosphide nanowires applied through the diet are not taken up into Drosophila tissues, do not elicit a measurable immune response or changes in genome-wide gene expression and do not significantly affect life span or somatic mutation rate.
USDA-ARS?s Scientific Manuscript database
Single nucleotide polymorphism was employed in the construction of a high-resolution, expressed sequence tag (EST) map of Aegilops tauschii, the diploid source of the wheat D genome. Comparison of the map with the rice and sorghum genome sequences revealed 50 inversions and translocations; 2, 8, and...
Quach, Truyen N; Nguyen, Hanh T M; Valliyodan, Babu; Joshi, Trupti; Xu, Dong; Nguyen, Henry T
2015-06-01
Nuclear factor-Y (NF-Y), a heterotrimeric transcription factor, is composed of NF-YA, NF-YB and NF-YC proteins. In plants, there are usually more than 10 genes for each family and their members have been identified to be key regulators in many developmental and physiological processes controlling gametogenesis, embryogenesis, nodule development, seed development, abscisic acid (ABA) signaling, flowering time, primary root elongation, blue light responses, endoplasmic reticulum (ER) stress response and drought tolerance. Taking the advantages of the recent soybean genome draft and information on functional characterizations of nuclear factor Y (NF-Y) transcription factor family in plants, we identified 21 GmNF-YA, 32 GmNF-YB, and 15 GmNF-YC genes in the soybean (Glycine max) genome. Phylogenetic analyses show that soybean's proteins share strong homology to Arabidopsis and many of them are closely related to functionally characterized NF-Y in plants. Expression analysis in various tissues of flower, leaf, root, seeds of different developmental stages, root hairs under rhizobium inoculation, and drought-treated roots and leaves revealed that certain groups of soybean NF-Y are likely involved in specific developmental and stress responses. This study provides extensive evaluation of the soybean NF-Y family and is particularly useful for further functional characterization of GmNF-Y proteins in seed development, nodulation and drought adaptation of soybean.
Kim, Min-Seok; Jeong, Seok Won; Choi, Seong-Jin; Han, Jin-Young; Kim, Sung-Hwan; Yoon, Seokjoo; Oh, Jung-Hwa; Lee, Kyuhong
2017-02-15
The antimicrobial biocide polyhexamethyleneguanidine (PHMG) phosphate is the main ingredient in the commercially available humidifier disinfectant. PHMG phosphate-based humidifier disinfectants can cause pulmonary fibrosis and induce inflammatory and fibrotic responses both in vivo and in vitro. However, toxicological mechanisms including genomic alterations induced by inhalation exposure to PHMG phosphate have not been elucidated. Therefore, this study evaluated the toxicological effects of the PHMG phosphate-containing humidifier disinfectant. We used DNA microarray to identify global gene expression changes in rats treated with PHMG phosphate-containing humidifier disinfectant for 4 weeks and 10 weeks. Functional significance of differentially expressed genes (DEGs) was estimated by gene ontology (GO) analysis. Four weeks post-exposure, 320 and 392 DEGs were identified in female and male rats, respectively (>2-fold, p<0.05). Ten weeks post-exposure, 1290 and 995 DEGs were identified in females and males, respectively. Of these, 119 and 556 genes overlapped between females and males at 4 weeks and 10 weeks, respectively, post-PHMG phosphate exposure. In addition, 21 genes were upregulated and 4 genes were downregulated in response to PHMG phosphate in a time-dependent manner. Thus, we predict that changes in genomic responses could be a significant molecular mechanism underlying PHMG phosphate toxicity. Further studies are required to determine the detailed mechanism of PHMG phosphate-induced pulmonary toxicity. Copyright © 2016. Published by Elsevier B.V.
An undergraduate laboratory class using CRISPR/Cas9 technology to mutate drosophila genes.
Adame, Vanesa; Chapapas, Holly; Cisneros, Marilyn; Deaton, Carol; Deichmann, Sophia; Gadek, Chauncey; Lovato, TyAnna L; Chechenova, Maria B; Guerin, Paul; Cripps, Richard M
2016-05-06
CRISPR/Cas9 genome editing technology is used in the manipulation of genome sequences and gene expression. Because of the ease and rapidity with which genes can be mutated using CRISPR/Cas9, we sought to determine if a single-semester undergraduate class could be successfully taught, wherein students isolate mutants for specific genes using CRISPR/Cas9. Six students were each assigned a single Drosophila gene, for which no mutants currently exist. Each student designed and created plasmids to encode single guide RNAs that target their selected gene; injected the plasmids into Cas9-expressing embryos, in order to delete the selected gene; carried out a three-generation cross to test for germline transmission of a mutated allele and generate a stable stock of the mutant; and characterized the mutant alleles by PCR and sequencing. Three genes out of six were successfully mutated. Pre- and post- survey evaluations of the students in the class revealed that student attitudes towards their research competencies increased, although the changes were not statistically significant. We conclude that it is feasible to develop a laboratory genome editing class, to provide effective laboratory training to undergraduate students, and to generate mutant lines for use by the broader scientific community. © 2016 by The International Union of Biochemistry and Molecular Biology, 44:263-275, 2016. © 2016 The International Union of Biochemistry and Molecular Biology.
Insights into the Ecology and Evolution of Polyploid Plants through Network Analysis.
Gallagher, Joseph P; Grover, Corrinne E; Hu, Guanjing; Wendel, Jonathan F
2016-06-01
Polyploidy is a widespread phenomenon throughout eukaryotes, with important ecological and evolutionary consequences. Although genes operate as components of complex pathways and networks, polyploid changes in genes and gene expression have typically been evaluated as either individual genes or as a part of broad-scale analyses. Network analysis has been fruitful in associating genomic and other 'omic'-based changes with phenotype for many systems. In polyploid species, network analysis has the potential not only to facilitate a better understanding of the complex 'omic' underpinnings of phenotypic and ecological traits common to polyploidy, but also to provide novel insight into the interaction among duplicated genes and genomes. This adds perspective to the global patterns of expression (and other 'omic') change that accompany polyploidy and to the patterns of recruitment and/or loss of genes following polyploidization. While network analysis in polyploid species faces challenges common to other analyses of duplicated genomes, present technologies combined with thoughtful experimental design provide a powerful system to explore polyploid evolution. Here, we demonstrate the utility and potential of network analysis to questions pertaining to polyploidy with an example involving evolution of the transgressively superior cotton fibres found in polyploid Gossypium hirsutum. By combining network analysis with prior knowledge, we provide further insights into the role of profilins in fibre domestication and exemplify the potential for network analysis in polyploid species. © 2016 John Wiley & Sons Ltd.
Systematic Evaluation of Molecular Networks for Discovery of Disease Genes.
Huang, Justin K; Carlin, Daniel E; Yu, Michael Ku; Zhang, Wei; Kreisberg, Jason F; Tamayo, Pablo; Ideker, Trey
2018-04-25
Gene networks are rapidly growing in size and number, raising the question of which networks are most appropriate for particular applications. Here, we evaluate 21 human genome-wide interaction networks for their ability to recover 446 disease gene sets identified through literature curation, gene expression profiling, or genome-wide association studies. While all networks have some ability to recover disease genes, we observe a wide range of performance with STRING, ConsensusPathDB, and GIANT networks having the best performance overall. A general tendency is that performance scales with network size, suggesting that new interaction discovery currently outweighs the detrimental effects of false positives. Correcting for size, we find that the DIP network provides the highest efficiency (value per interaction). Based on these results, we create a parsimonious composite network with both high efficiency and performance. This work provides a benchmark for selection of molecular networks in human disease research. Copyright © 2018 Elsevier Inc. All rights reserved.
Bioinformatics and expressional analysis of cDNA clones from floral buds
NASA Astrophysics Data System (ADS)
Pawełkowicz, Magdalena Ewa; Skarzyńska, Agnieszka; Cebula, Justyna; Hincha, Dirck; ZiÄ bska, Karolina; PlÄ der, Wojciech; Przybecki, Zbigniew
2017-08-01
The application of genomic approaches may serve as an initial step in understanding the complexity of biochemical network and cellular processes responsible for regulation and execution of many developmental tasks. The molecular mechanism of sex expression in cucumber is still not elucidated. A study of differential expression was conducted to identify genes involved in sex determination and floral organ morphogenesis. Herein, we present generation of expression sequence tags (EST) obtained by differential hybridization (DH) and subtraction technique (cDNA-DSC) and their characteristic features such as molecular function, involvement in biology processes, expression and mapping position on the genome.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hirsch, Candice N.; Hirsch, Cory D.; Brohammer, Alex B.
Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft genome assembly of the inbred line PH207 to complement and compare with the existing B73 reference sequence. B73 is a founder of the Stiff Stalk germplasm pool, while PH207 is a founder of Iodent germplasm, both of which have contributed substantially to the production of temperate commercial maize and are combined to make heterotic hybrids. Comparison ofmore » these two assemblies revealed over 2500 genes present in only one of the two genotypes and 136 gene families that have undergone extensive expansion or contraction. Transcriptome profiling revealed extensive expression variation, with as many as 10,564 differentially expressed transcripts and 7128 transcripts expressed in only one of the two genotypes in a single tissue. Genotype-specific genes were more likely to have tissue/condition-specific expression and lower transcript abundance. The availability of a high-quality genome assembly for the elite maize inbred PH207 expands our knowledge of the breadth of natural genome and transcriptome variation in elite maize inbred lines across heterotic pools.« less
Hirsch, Candice N.; Hirsch, Cory D.; Brohammer, Alex B.; ...
2016-11-01
Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft genome assembly of the inbred line PH207 to complement and compare with the existing B73 reference sequence. B73 is a founder of the Stiff Stalk germplasm pool, while PH207 is a founder of Iodent germplasm, both of which have contributed substantially to the production of temperate commercial maize and are combined to make heterotic hybrids. Comparison ofmore » these two assemblies revealed over 2500 genes present in only one of the two genotypes and 136 gene families that have undergone extensive expansion or contraction. Transcriptome profiling revealed extensive expression variation, with as many as 10,564 differentially expressed transcripts and 7128 transcripts expressed in only one of the two genotypes in a single tissue. Genotype-specific genes were more likely to have tissue/condition-specific expression and lower transcript abundance. The availability of a high-quality genome assembly for the elite maize inbred PH207 expands our knowledge of the breadth of natural genome and transcriptome variation in elite maize inbred lines across heterotic pools.« less
Soifer, Ilya; Barad, Omer; Shem-Tov, Doron; Baruch, Kobi; Lu, Fei; Hernandez, Alvaro G.; Wright, Chris L.; Koehler, Klaus; Buell, C. Robin; de Leon, Natalia
2016-01-01
Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft genome assembly of the inbred line PH207 to complement and compare with the existing B73 reference sequence. B73 is a founder of the Stiff Stalk germplasm pool, while PH207 is a founder of Iodent germplasm, both of which have contributed substantially to the production of temperate commercial maize and are combined to make heterotic hybrids. Comparison of these two assemblies revealed over 2500 genes present in only one of the two genotypes and 136 gene families that have undergone extensive expansion or contraction. Transcriptome profiling revealed extensive expression variation, with as many as 10,564 differentially expressed transcripts and 7128 transcripts expressed in only one of the two genotypes in a single tissue. Genotype-specific genes were more likely to have tissue/condition-specific expression and lower transcript abundance. The availability of a high-quality genome assembly for the elite maize inbred PH207 expands our knowledge of the breadth of natural genome and transcriptome variation in elite maize inbred lines across heterotic pools. PMID:27803309
Genome-, Transcriptome- and Proteome-Wide Analyses of the Gliadin Gene Families in Triticum urartu
Wang, Dongzhi; Yang, Wenlong; Sun, Jiazhu; Zhang, Aimin; Zhan, Kehui
2015-01-01
Gliadins are the major components of storage proteins in wheat grains, and they play an essential role in the dough extensibility and nutritional quality of flour. Because of the large number of the gliadin family members, the high level of sequence identity, and the lack of abundant genomic data for Triticum species, identifying the full complement of gliadin family genes in hexaploid wheat remains challenging. Triticum urartu is a wild diploid wheat species and considered the A-genome donor of polyploid wheat species. The accession PI428198 (G1812) was chosen to determine the complete composition of the gliadin gene families in the wheat A-genome using the available draft genome. Using a PCR-based cloning strategy for genomic DNA and mRNA as well as a bioinformatics analysis of genomic sequence data, 28 gliadin genes were characterized. Of these genes, 23 were α-gliadin genes, three were γ-gliadin genes and two were ω-gliadin genes. An RNA sequencing (RNA-Seq) survey of the dynamic expression patterns of gliadin genes revealed that their synthesis in immature grains began prior to 10 days post-anthesis (DPA), peaked at 15 DPA and gradually decreased at 20 DPA. The accumulation of proteins encoded by 16 of the expressed gliadin genes was further verified and quantified using proteomic methods. The phylogenetic analysis demonstrated that the homologs of these α-gliadin genes were present in tetraploid and hexaploid wheat, which was consistent with T. urartu being the A-genome progenitor species. This study presents a systematic investigation of the gliadin gene families in T. urartu that spans the genome, transcriptome and proteome, and it provides new information to better understand the molecular structure, expression profiles and evolution of the gliadin genes in T. urartu and common wheat. PMID:26132381
Genome-, Transcriptome- and Proteome-Wide Analyses of the Gliadin Gene Families in Triticum urartu.
Zhang, Yanlin; Luo, Guangbin; Liu, Dongcheng; Wang, Dongzhi; Yang, Wenlong; Sun, Jiazhu; Zhang, Aimin; Zhan, Kehui
2015-01-01
Gliadins are the major components of storage proteins in wheat grains, and they play an essential role in the dough extensibility and nutritional quality of flour. Because of the large number of the gliadin family members, the high level of sequence identity, and the lack of abundant genomic data for Triticum species, identifying the full complement of gliadin family genes in hexaploid wheat remains challenging. Triticum urartu is a wild diploid wheat species and considered the A-genome donor of polyploid wheat species. The accession PI428198 (G1812) was chosen to determine the complete composition of the gliadin gene families in the wheat A-genome using the available draft genome. Using a PCR-based cloning strategy for genomic DNA and mRNA as well as a bioinformatics analysis of genomic sequence data, 28 gliadin genes were characterized. Of these genes, 23 were α-gliadin genes, three were γ-gliadin genes and two were ω-gliadin genes. An RNA sequencing (RNA-Seq) survey of the dynamic expression patterns of gliadin genes revealed that their synthesis in immature grains began prior to 10 days post-anthesis (DPA), peaked at 15 DPA and gradually decreased at 20 DPA. The accumulation of proteins encoded by 16 of the expressed gliadin genes was further verified and quantified using proteomic methods. The phylogenetic analysis demonstrated that the homologs of these α-gliadin genes were present in tetraploid and hexaploid wheat, which was consistent with T. urartu being the A-genome progenitor species. This study presents a systematic investigation of the gliadin gene families in T. urartu that spans the genome, transcriptome and proteome, and it provides new information to better understand the molecular structure, expression profiles and evolution of the gliadin genes in T. urartu and common wheat.
Dong, Yanhan; Li, Ying; Zhao, Miaomiao; Jing, Maofeng; Liu, Xinyu; Liu, Muxing; Guo, Xianxian; Zhang, Xing; Chen, Yue; Liu, Yongfeng; Liu, Yanhong; Ye, Wenwu; Zhang, Haifeng; Wang, Yuanchao; Zheng, Xiaobo; Wang, Ping; Zhang, Zhengguang
2015-01-01
Genome dynamics of pathogenic organisms are driven by pathogen and host co-evolution, in which pathogen genomes are shaped to overcome stresses imposed by hosts with various genetic backgrounds through generation of a variety of isolates. This same principle applies to the rice blast pathogen Magnaporthe oryzae and the rice host; however, genetic variations among different isolates of M. oryzae remain largely unknown, particularly at genome and transcriptome levels. Here, we applied genomic and transcriptomic analytical tools to investigate M. oryzae isolate 98-06 that is the most aggressive in infection of susceptible rice cultivars. A unique 1.4 Mb of genomic sequences was found in isolate 98-06 in comparison to reference strain 70-15. Genome-wide expression profiling revealed the presence of two critical expression patterns of M. oryzae based on 64 known pathogenicity-related (PaR) genes. In addition, 134 candidate effectors with various segregation patterns were identified. Five tested proteins could suppress BAX-mediated programmed cell death in Nicotiana benthamiana leaves. Characterization of isolate-specific effector candidates Iug6 and Iug9 and PaR candidate Iug18 revealed that they have a role in fungal propagation and pathogenicity. Moreover, Iug6 and Iug9 are located exclusively in the biotrophic interfacial complex (BIC) and their overexpression leads to suppression of defense-related gene expression in rice, suggesting that they might participate in biotrophy by inhibiting the SA and ET pathways within the host. Thus, our studies identify novel effector and PaR proteins involved in pathogenicity of the highly aggressive M. oryzae field isolate 98-06, and reveal molecular and genomic dynamics in the evolution of M. oryzae and rice host interactions. PMID:25837042
Picking Cell Lines for High-Throughput Transcriptomic Toxicity ...
High throughput, whole genome transcriptomic profiling is a promising approach to comprehensively evaluate chemicals for potential biological effects. To be useful for in vitro toxicity screening, gene expression must be quantified in a set of representative cell types that captures the diversity of potential responses across chemicals. The ideal dataset to select these cell types would consist of hundreds of cell types treated with thousands of chemicals, but does not yet exist. However, basal gene expression data may be useful as a surrogate for representing the relevant biological space necessary for cell type selection. The goal of this study was to identify a small (< 20) number of cell types that capture a large, quantifiable fraction of basal gene expression diversity. Three publicly available collections of Affymetrix U133+2.0 cellular gene expression data were used: 1) 59 cell lines from the NCI60 set; 2) 303 primary cell types from the Mabbott et al (2013) expression atlas; and 3) 1036 cell lines from the Cancer Cell Line Encyclopedia. The data were RMA normalized, log-transformed, and the probe sets mapped to HUGO gene identifiers. The results showed that <20 cell lines capture only a small fraction of the total diversity in basal gene expression when evaluated using either the entire set of 20960 HUGO genes or a subset of druggable genes likely to be chemical targets. The fraction of the total gene expression variation explained was consistent when
Arabidopsis gene expression patterns during spaceflight
NASA Astrophysics Data System (ADS)
Paul, A.-L.; Ferl, R. J.
The exposure of Arabidopsis thaliana (Arabidopsis) plants to spaceflight environments resulted in the differential expression of hundreds of genes. A 5 day mission on orbiter Columbia in 1999 (STS-93) carried transgenic Arabidopsis plants engineered with a transgene composed of the alcohol dehydrogenase (Adh) gene promoter linked to the β -Glucuronidase (GUS) reporter gene. The plants were used to evaluate the effects of spaceflight on two fronts. First, expression patterns visualized with the Adh/GUS transgene were used to address specifically the possibility that spaceflight induces a hypoxic stress response, and to assess whether any spaceflight response was similar to control terrestrial hypoxia-induced gene expression patterns. (Paul et al., Plant Physiol. 2001, 126:613). Second, genome-wide patterns of native gene expression were evaluated utilizing the Affymetrix ATH1 GeneChip? array of 8,000 Arabidopsis genes. As a control for the veracity of the array analyses, a selection of genes identified with the arrays was further characterized with quantitative Real-Time RT PCR (ABI - TaqmanTM). Comparison of the patterns of expression for arrays of hybridized with RNA isolated from plants exposed to spaceflight compared to the control arrays revealed hundreds of genes that were differentially expressed in response to spaceflight, yet most genes that are hallmarks of hypoxic stress were unaffected. These results will be discussed in light of current models for plant responses to the spaceflight environment, and with regard to potential future flight opportunities.
The genomic landscape shaped by selection on transposable elements across 18 mouse strains.
Nellåker, Christoffer; Keane, Thomas M; Yalcin, Binnaz; Wong, Kim; Agam, Avigail; Belgard, T Grant; Flint, Jonathan; Adams, David J; Frankel, Wayne N; Ponting, Chris P
2012-06-15
Transposable element (TE)-derived sequence dominates the landscape of mammalian genomes and can modulate gene function by dysregulating transcription and translation. Our current knowledge of TEs in laboratory mouse strains is limited primarily to those present in the C57BL/6J reference genome, with most mouse TEs being drawn from three distinct classes, namely short interspersed nuclear elements (SINEs), long interspersed nuclear elements (LINEs) and the endogenous retrovirus (ERV) superfamily. Despite their high prevalence, the different genomic and gene properties controlling whether TEs are preferentially purged from, or are retained by, genetic drift or positive selection in mammalian genomes remain poorly defined. Using whole genome sequencing data from 13 classical laboratory and 4 wild-derived mouse inbred strains, we developed a comprehensive catalogue of 103,798 polymorphic TE variants. We employ this extensive data set to characterize TE variants across the Mus lineage, and to infer neutral and selective processes that have acted over 2 million years. Our results indicate that the majority of TE variants are introduced though the male germline and that only a minority of TE variants exert detectable changes in gene expression. However, among genes with differential expression across the strains there are twice as many TE variants identified as being putative causal variants as expected. Most TE variants that cause gene expression changes appear to be purged rapidly by purifying selection. Our findings demonstrate that past TE insertions have often been highly deleterious, and help to prioritize TE variants according to their likely contribution to gene expression or phenotype variation.
Williams-Devane, ClarLynda R; Wolf, Maritja A; Richard, Ann M
2009-06-01
A publicly available toxicogenomics capability for supporting predictive toxicology and meta-analysis depends on availability of gene expression data for chemical treatment scenarios, the ability to locate and aggregate such information by chemical, and broad data coverage within chemical, genomics, and toxicological information domains. This capability also depends on common genomics standards, protocol description, and functional linkages of diverse public Internet data resources. We present a survey of public genomics resources from these vantage points and conclude that, despite progress in many areas, the current state of the majority of public microarray databases is inadequate for supporting these objectives, particularly with regard to chemical indexing. To begin to address these inadequacies, we focus chemical annotation efforts on experimental content contained in the two primary public genomic resources: ArrayExpress and Gene Expression Omnibus. Automated scripts and extensive manual review were employed to transform free-text experiment descriptions into a standardized, chemically indexed inventory of experiments in both resources. These files, which include top-level summary annotations, allow for identification of current chemical-associated experimental content, as well as chemical-exposure-related (or "Treatment") content of greatest potential value to toxicogenomics investigation. With these chemical-index files, it is possible for the first time to assess the breadth and overlap of chemical study space represented in these databases, and to begin to assess the sufficiency of data with shared protocols for chemical similarity inferences. Chemical indexing of public genomics databases is a first important step toward integrating chemical, toxicological and genomics data into predictive toxicology.
Comparative Genomics of Non-TNL Disease Resistance Genes from Six Plant Species.
Nepal, Madhav P; Andersen, Ethan J; Neupane, Surendra; Benson, Benjamin V
2017-09-30
Disease resistance genes (R genes), as part of the plant defense system, have coevolved with corresponding pathogen molecules. The main objectives of this project were to identify non-Toll interleukin receptor, nucleotide-binding site, leucine-rich repeat (nTNL) genes and elucidate their evolutionary divergence across six plant genomes. Using reference sequences from Arabidopsis , we investigated nTNL orthologs in the genomes of common bean, Medicago , soybean, poplar, and rice. We used Hidden Markov Models for sequence identification, performed model-based phylogenetic analyses, visualized chromosomal positioning, inferred gene clustering, and assessed gene expression profiles. We analyzed 908 nTNL R genes in the genomes of the six plant species, and classified them into 12 subgroups based on the presence of coiled-coil (CC), nucleotide binding site (NBS), leucine rich repeat (LRR), resistance to Powdery mildew 8 (RPW8), and BED type zinc finger domains. Traditionally classified CC-NBS-LRR (CNL) genes were nested into four clades (CNL A-D) often with abundant, well-supported homogeneous subclades of Type-II R genes. CNL-D members were absent in rice, indicating a unique R gene retention pattern in the rice genome. Genomes from Arabidopsis , common bean, poplar and soybean had one chromosome without any CNL R genes. Medicago and Arabidopsis had the highest and lowest number of gene clusters, respectively. Gene expression analyses suggested unique patterns of expression for each of the CNL clades. Differential gene expression patterns of the nTNL genes were often found to correlate with number of introns and GC content, suggesting structural and functional divergence.
Comparative Genomics of Non-TNL Disease Resistance Genes from Six Plant Species
Andersen, Ethan J.; Neupane, Surendra; Benson, Benjamin V.
2017-01-01
Disease resistance genes (R genes), as part of the plant defense system, have coevolved with corresponding pathogen molecules. The main objectives of this project were to identify non-Toll interleukin receptor, nucleotide-binding site, leucine-rich repeat (nTNL) genes and elucidate their evolutionary divergence across six plant genomes. Using reference sequences from Arabidopsis, we investigated nTNL orthologs in the genomes of common bean, Medicago, soybean, poplar, and rice. We used Hidden Markov Models for sequence identification, performed model-based phylogenetic analyses, visualized chromosomal positioning, inferred gene clustering, and assessed gene expression profiles. We analyzed 908 nTNL R genes in the genomes of the six plant species, and classified them into 12 subgroups based on the presence of coiled-coil (CC), nucleotide binding site (NBS), leucine rich repeat (LRR), resistance to Powdery mildew 8 (RPW8), and BED type zinc finger domains. Traditionally classified CC-NBS-LRR (CNL) genes were nested into four clades (CNL A-D) often with abundant, well-supported homogeneous subclades of Type-II R genes. CNL-D members were absent in rice, indicating a unique R gene retention pattern in the rice genome. Genomes from Arabidopsis, common bean, poplar and soybean had one chromosome without any CNL R genes. Medicago and Arabidopsis had the highest and lowest number of gene clusters, respectively. Gene expression analyses suggested unique patterns of expression for each of the CNL clades. Differential gene expression patterns of the nTNL genes were often found to correlate with number of introns and GC content, suggesting structural and functional divergence. PMID:28973974
Muthamilarasan, Mehanathan; Khan, Yusuf; Jaishankar, Jananee; Shweta, Shweta; Lata, Charu; Prasad, Manoj
2015-01-01
Several underutilized grasses have excellent potential for use as bioenergy feedstock due to their lignocellulosic biomass. Genomic tools have enabled identification of lignocellulose biosynthesis genes in several sequenced plants. However, the non-availability of whole genome sequence of bioenergy grasses hinders the study on bioenergy genomics and their genomics-assisted crop improvement. Foxtail millet (Setaria italica L.; Si) is a model crop for studying systems biology of bioenergy grasses. In the present study, a systematic approach has been used for identification of gene families involved in cellulose (CesA/Csl), callose (Gsl) and monolignol biosynthesis (PAL, C4H, 4CL, HCT, C3H, CCoAOMT, F5H, COMT, CCR, CAD) and construction of physical map of foxtail millet. Sequence alignment and phylogenetic analysis of identified proteins showed that monolignol biosynthesis proteins were highly diverse, whereas CesA/Csl and Gsl proteins were homologous to rice and Arabidopsis. Comparative mapping of foxtail millet lignocellulose biosynthesis genes with other C4 panicoid genomes revealed maximum homology with switchgrass, followed by sorghum and maize. Expression profiling of candidate lignocellulose genes in response to different abiotic stresses and hormone treatments showed their differential expression pattern, with significant higher expression of SiGsl12, SiPAL2, SiHCT1, SiF5H2, and SiCAD6 genes. Further, due to the evolutionary conservation of grass genomes, the insights gained from the present study could be extrapolated for identifying genes involved in lignocellulose biosynthesis in other biofuel species for further characterization. PMID:26583030
Patel, Hardip; Forêt, Sylvain; Karlsen, Bård Ove; Jørgensen, Tor Erik; Hall-Spencer, Jason M
2018-01-01
Abstract Cnidarians harbor a variety of small regulatory RNAs that include microRNAs (miRNAs) and PIWI-interacting RNAs (piRNAs), but detailed information is limited. Here, we report the identification and expression of novel miRNAs and putative piRNAs, as well as their genomic loci, in the symbiotic sea anemone Anemonia viridis. We generated a draft assembly of the A. viridis genome with putative size of 313 Mb that appeared to be composed of about 36% repeats, including known transposable elements. We detected approximately equal fractions of DNA transposons and retrotransposons. Deep sequencing of small RNA libraries constructed from A. viridis adults sampled at a natural CO2 gradient off Vulcano Island, Italy, identified 70 distinct miRNAs. Eight were homologous to previously reported miRNAs in cnidarians, whereas 62 appeared novel. Nine miRNAs were recognized as differentially expressed along the natural seawater pH gradient. We found a highly abundant and diverse population of piRNAs, with a substantial fraction showing ping–pong signatures. We identified nearly 22% putative piRNAs potentially targeting transposable elements within the A. viridis genome. The A. viridis genome appeared similar in size to that of other hexacorals with a very high divergence of transposable elements resembling that of the sea anemone genus Exaiptasia. The genome encodes and expresses a high number of small regulatory RNAs, which include novel miRNAs and piRNAs. Differentially expressed small RNAs along the seawater pH gradient indicated regulatory gene responses to environmental stressors. PMID:29385567
Muthamilarasan, Mehanathan; Khan, Yusuf; Jaishankar, Jananee; Shweta, Shweta; Lata, Charu; Prasad, Manoj
2015-01-01
Several underutilized grasses have excellent potential for use as bioenergy feedstock due to their lignocellulosic biomass. Genomic tools have enabled identification of lignocellulose biosynthesis genes in several sequenced plants. However, the non-availability of whole genome sequence of bioenergy grasses hinders the study on bioenergy genomics and their genomics-assisted crop improvement. Foxtail millet (Setaria italica L.; Si) is a model crop for studying systems biology of bioenergy grasses. In the present study, a systematic approach has been used for identification of gene families involved in cellulose (CesA/Csl), callose (Gsl) and monolignol biosynthesis (PAL, C4H, 4CL, HCT, C3H, CCoAOMT, F5H, COMT, CCR, CAD) and construction of physical map of foxtail millet. Sequence alignment and phylogenetic analysis of identified proteins showed that monolignol biosynthesis proteins were highly diverse, whereas CesA/Csl and Gsl proteins were homologous to rice and Arabidopsis. Comparative mapping of foxtail millet lignocellulose biosynthesis genes with other C4 panicoid genomes revealed maximum homology with switchgrass, followed by sorghum and maize. Expression profiling of candidate lignocellulose genes in response to different abiotic stresses and hormone treatments showed their differential expression pattern, with significant higher expression of SiGsl12, SiPAL2, SiHCT1, SiF5H2, and SiCAD6 genes. Further, due to the evolutionary conservation of grass genomes, the insights gained from the present study could be extrapolated for identifying genes involved in lignocellulose biosynthesis in other biofuel species for further characterization.
2011-01-01
Background Our previously published reports have described an effective biocontrol agent named Pseudomonas sp. M18 as its 16S rDNA sequence and several regulator genes share homologous sequences with those of P. aeruginosa, but there are several unusual phenotypic features. This study aims to explore its strain specific genomic features and gene expression patterns at different temperatures. Results The complete M18 genome is composed of a single chromosome of 6,327,754 base pairs containing 5684 open reading frames. Seven genomic islands, including two novel prophages and five specific non-phage islands were identified besides the conserved P. aeruginosa core genome. Each prophage contains a putative chitinase coding gene, and the prophage II contains a capB gene encoding a putative cold stress protein. The non-phage genomic islands contain genes responsible for pyoluteorin biosynthesis, environmental substance degradation and type I and III restriction-modification systems. Compared with other P. aeruginosa strains, the fewest number (3) of insertion sequences and the most number (3) of clustered regularly interspaced short palindromic repeats in M18 genome may contribute to the relative genome stability. Although the M18 genome is most closely related to that of P. aeruginosa strain LESB58, the strain M18 is more susceptible to several antimicrobial agents and easier to be erased in a mouse acute lung infection model than the strain LESB58. The whole M18 transcriptomic analysis indicated that 10.6% of the expressed genes are temperature-dependent, with 22 genes up-regulated at 28°C in three non-phage genomic islands and one prophage but none at 37°C. Conclusions The P. aeruginosa strain M18 has evolved its specific genomic structures and temperature dependent expression patterns to meet the requirement of its fitness and competitiveness under selective pressures imposed on the strain in rhizosphere niche. PMID:21884571
Applications of the 1000 Genomes Project resources.
Zheng-Bradley, Xiangqun; Flicek, Paul
2017-05-01
The 1000 Genomes Project created a valuable, worldwide reference for human genetic variation. Common uses of the 1000 Genomes dataset include genotype imputation supporting Genome-wide Association Studies, mapping expression Quantitative Trait Loci, filtering non-pathogenic variants from exome, whole genome and cancer genome sequencing projects, and genetic analysis of population structure and molecular evolution. In this article, we will highlight some of the multiple ways that the 1000 Genomes data can be and has been utilized for genetic studies. © The Author 2016. Published by Oxford University Press.
Widespread antisense transcription of Populus genome under drought.
Yuan, Yinan; Chen, Su
2018-06-06
Antisense transcription is widespread in many genomes and plays important regulatory roles in gene expression. The objective of our study was to investigate the extent and functional relevance of antisense transcription in forest trees. We employed Populus, a model tree species, to probe the antisense transcriptional response of tree genome under drought, through stranded RNA-seq analysis. We detected nearly 48% of annotated Populus gene loci with antisense transcripts and 44% of them with co-transcription from both DNA strands. Global distribution of reads pattern across annotated gene regions uncovered that antisense transcription was enriched in untranslated regions while sense reads were predominantly mapped in coding exons. We further detected 1185 drought-responsive sense and antisense gene loci and identified a strong positive correlation between the expression of antisense and sense transcripts. Additionally, we assessed the antisense expression in introns and found a strong correlation between intronic expression and exonic expression, confirming antisense transcription of introns contributes to transcriptional activity of Populus genome under drought. Finally, we functionally characterized drought-responsive sense-antisense transcript pairs through gene ontology analysis and discovered that functional groups including transcription factors and histones were concordantly regulated at both sense and antisense transcriptional level. Overall, our study demonstrated the extensive occurrence of antisense transcripts of Populus genes under drought and provided insights into genome structure, regulation pattern and functional significance of drought-responsive antisense genes in forest trees. Datasets generated in this study serve as a foundation for future genetic analysis to improve our understanding of gene regulation by antisense transcription.
Yang, Laurence; Tan, Justin; O'Brien, Edward J; Monk, Jonathan M; Kim, Donghyuk; Li, Howard J; Charusanti, Pep; Ebrahim, Ali; Lloyd, Colton J; Yurkovich, James T; Du, Bin; Dräger, Andreas; Thomas, Alex; Sun, Yuekai; Saunders, Michael A; Palsson, Bernhard O
2015-08-25
Finding the minimal set of gene functions needed to sustain life is of both fundamental and practical importance. Minimal gene lists have been proposed by using comparative genomics-based core proteome definitions. A definition of a core proteome that is supported by empirical data, is understood at the systems-level, and provides a basis for computing essential cell functions is lacking. Here, we use a systems biology-based genome-scale model of metabolism and expression to define a functional core proteome consisting of 356 gene products, accounting for 44% of the Escherichia coli proteome by mass based on proteomics data. This systems biology core proteome includes 212 genes not found in previous comparative genomics-based core proteome definitions, accounts for 65% of known essential genes in E. coli, and has 78% gene function overlap with minimal genomes (Buchnera aphidicola and Mycoplasma genitalium). Based on transcriptomics data across environmental and genetic backgrounds, the systems biology core proteome is significantly enriched in nondifferentially expressed genes and depleted in differentially expressed genes. Compared with the noncore, core gene expression levels are also similar across genetic backgrounds (two times higher Spearman rank correlation) and exhibit significantly more complex transcriptional and posttranscriptional regulatory features (40% more transcription start sites per gene, 22% longer 5'UTR). Thus, genome-scale systems biology approaches rigorously identify a functional core proteome needed to support growth. This framework, validated by using high-throughput datasets, facilitates a mechanistic understanding of systems-level core proteome function through in silico models; it de facto defines a paleome.
Aspler, Anne L; Bolshin, Carly; Vernon, Suzanne D; Broderick, Gordon
2008-09-26
Genomic profiling of peripheral blood reveals altered immunity in chronic fatigue syndrome (CFS) however interpretation remains challenging without immune demographic context. The object of this work is to identify modulation of specific immune functional components and restructuring of co-expression networks characteristic of CFS using the quantitative genomics of peripheral blood. Gene sets were constructed a priori for CD4+ T cells, CD8+ T cells, CD19+ B cells, CD14+ monocytes and CD16+ neutrophils from published data. A group of 111 women were classified using empiric case definition (U.S. Centers for Disease Control and Prevention) and unsupervised latent cluster analysis (LCA). Microarray profiles of peripheral blood were analyzed for expression of leukocyte-specific gene sets and characteristic changes in co-expression identified from topological evaluation of linear correlation networks. Median expression for a set of 6 genes preferentially up-regulated in CD19+ B cells was significantly lower in CFS (p = 0.01) due mainly to PTPRK and TSPAN3 expression. Although no other gene set was differentially expressed at p < 0.05, patterns of co-expression in each group differed markedly. Significant co-expression of CD14+ monocyte with CD16+ neutrophil (p = 0.01) and CD19+ B cell sets (p = 0.00) characterized CFS and fatigue phenotype groups. Also in CFS was a significant negative correlation between CD8+ and both CD19+ up-regulated (p = 0.02) and NK gene sets (p = 0.08). These patterns were absent in controls. Dissection of blood microarray profiles points to B cell dysfunction with coordinated immune activation supporting persistent inflammation and antibody-mediated NK cell modulation of T cell activity. This has clinical implications as the CD19+ genes identified could provide robust and biologically meaningful basis for the early detection and unambiguous phenotyping of CFS.
2011-01-01
Background One member of the W family of human endogenous retroviruses (HERV) appears to have been functionally adopted by the human host. Nevertheless, a highly diversified and regulated transcription from a range of HERV-W elements has been observed in human tissues and cells. Aberrant expression of members of this family has also been associated with human disease such as multiple sclerosis (MS) and schizophrenia. It is not known whether this broad expression of HERV-W elements represents transcriptional leakage or specific transcription initiated from the retroviral promoter in the long terminal repeat (LTR) region. Therefore, potential influences of genomic context, structure and orientation on the expression levels of individual HERV-W elements in normal human tissues were systematically investigated. Results Whereas intronic HERV-W elements with a pseudogene structure exhibited a strong anti-sense orientation bias, intronic elements with a proviral structure and solo LTRs did not. Although a highly variable expression across tissues and elements was observed, systematic effects of context, structure and orientation were also observed. Elements located in intronic regions appeared to be expressed at higher levels than elements located in intergenic regions. Intronic elements with proviral structures were expressed at higher levels than those elements bearing hallmarks of processed pseudogenes or solo LTRs. Relative to their corresponding genes, intronic elements integrated on the sense strand appeared to be transcribed at higher levels than those integrated on the anti-sense strand. Moreover, the expression of proviral elements appeared to be independent from that of their corresponding genes. Conclusions Intronic HERV-W provirus integrations on the sense strand appear to have elicited a weaker negative selection than pseudogene integrations of transcripts from such elements. Our current findings suggest that the previously observed diversified and tissue-specific expression of elements in the HERV-W family is the result of both directed transcription (involving both the LTR and internal sequence) and leaky transcription of HERV-W elements in normal human tissues. PMID:21226900
Genome-Wide Tuning of Protein Expression Levels to Rapidly Engineer Microbial Traits.
Freed, Emily F; Winkler, James D; Weiss, Sophie J; Garst, Andrew D; Mutalik, Vivek K; Arkin, Adam P; Knight, Rob; Gill, Ryan T
2015-11-20
The reliable engineering of biological systems requires quantitative mapping of predictable and context-independent expression over a broad range of protein expression levels. However, current techniques for modifying expression levels are cumbersome and are not amenable to high-throughput approaches. Here we present major improvements to current techniques through the design and construction of E. coli genome-wide libraries using synthetic DNA cassettes that can tune expression over a ∼10(4) range. The cassettes also contain molecular barcodes that are optimized for next-generation sequencing, enabling rapid and quantitative tracking of alleles that have the highest fitness advantage. We show these libraries can be used to determine which genes and expression levels confer greater fitness to E. coli under different growth conditions.
Westhoff, Connie M.; Uy, Jon Michael; Aguad, Maria; Smeland‐Wagman, Robin; Kaufman, Richard M.; Rehm, Heidi L.; Green, Robert C.; Silberstein, Leslie E.
2015-01-01
BACKGROUND There are 346 serologically defined red blood cell (RBC) antigens and 33 serologically defined platelet (PLT) antigens, most of which have known genetic changes in 45 RBC or six PLT genes that correlate with antigen expression. Polymorphic sites associated with antigen expression in the primary literature and reference databases are annotated according to nucleotide positions in cDNA. This makes antigen prediction from next‐generation sequencing data challenging, since it uses genomic coordinates. STUDY DESIGN AND METHODS The conventional cDNA reference sequences for all known RBC and PLT genes that correlate with antigen expression were aligned to the human reference genome. The alignments allowed conversion of conventional cDNA nucleotide positions to the corresponding genomic coordinates. RBC and PLT antigen prediction was then performed using the human reference genome and whole genome sequencing (WGS) data with serologic confirmation. RESULTS Some major differences and alignment issues were found when attempting to convert the conventional cDNA to human reference genome sequences for the following genes: ABO, A4GALT, RHD, RHCE, FUT3, ACKR1 (previously DARC), ACHE, FUT2, CR1, GCNT2, and RHAG. However, it was possible to create usable alignments, which facilitated the prediction of all RBC and PLT antigens with a known molecular basis from WGS data. Traditional serologic typing for 18 RBC antigens were in agreement with the WGS‐based antigen predictions, providing proof of principle for this approach. CONCLUSION Detailed mapping of conventional cDNA annotated RBC and PLT alleles can enable accurate prediction of RBC and PLT antigens from whole genomic sequencing data. PMID:26634332
Genomic and physiological footprint of the Deepwater Horizon oil spill on resident marsh fishes.
Whitehead, Andrew; Dubansky, Benjamin; Bodinier, Charlotte; Garcia, Tzintzuni I; Miles, Scott; Pilley, Chet; Raghunathan, Vandana; Roach, Jennifer L; Walker, Nan; Walter, Ronald B; Rice, Charles D; Galvez, Fernando
2012-12-11
The biological consequences of the Deepwater Horizon oil spill are unknown, especially for resident organisms. Here, we report results from a field study tracking the effects of contaminating oil across space and time in resident killifish during the first 4 mo of the spill event. Remote sensing and analytical chemistry identified exposures, which were linked to effects in fish characterized by genome expression and associated gill immunohistochemistry, despite very low concentrations of hydrocarbons remaining in water and tissues. Divergence in genome expression coincides with contaminating oil and is consistent with genome responses that are predictive of exposure to hydrocarbon-like chemicals and indicative of physiological and reproductive impairment. Oil-contaminated waters are also associated with aberrant protein expression in gill tissues of larval and adult fish. These data suggest that heavily weathered crude oil from the spill imparts significant biological impacts in sensitive Louisiana marshes, some of which remain for over 2 mo following initial exposures.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ohm, Robin A.; de Jong, Jan F.; Lugones, Luis G.
2010-07-12
The wood degrading fungus Schizophyllum commune is a model system for mushroom development. Here, we describe the 38.5 Mb assembled genome of this basidiomycete and application of whole genome expression analysis to study the 13,210 predicted genes. Comparative analyses of the S. commune genome revealed unique wood degrading machinery and mating type loci with the highest number of reported genes. Gene expression analyses revealed that one third of the 471 identified transcription factor genes were differentially expressed during sexual development. Two of these transcription factor genes were deleted. Inactivation of fst4 resulted in the inability to form mushrooms, whereas inactivationmore » of fst3 resulted in more but smaller mushrooms than wild-type. These data illustrate that mechanisms underlying mushroom formation can be dissected using S. commune as a model. This will impact commercial production of mushrooms and the industrial use of these fruiting bodies to produce enzymes and pharmaceuticals.« less
Li, Zhiqian; Zhang, Chen; Guo, Yurui; Niu, Weili; Wang, Yuejin; Xu, Yan
2017-09-21
The HD-Zip family has a diversity of functions during plant development. In this study, we identify 33 HD-Zip transcription factors in grape and detect their expressions in ovules and somatic embryos, as well as in various vegetative organs. A genome-wide survey for HD-Zip transcription factors in Vitis was conducted based on the 12 X grape genome (V. vinifera L.). A total of 33 members were identified and classified into four subfamilies (I-IV) based on phylogeny analysis with Arabidopsis, rice and maize. VvHDZs in the same subfamily have similar protein motifs and intron/exon structures. An evaluation of duplication events suggests several HD-Zip genes arose before the divergence of the grape and Arabidopsis lineages. The 33 members of HD-Zip were differentially expressed in ovules of the stenospermic grape, Thompson Seedless and of the seeded grape, Pinot noir. Most have higher expressions during ovule abortion in Thompson Seedless. In addition, transcripts of the HD-Zip family were also detected in somatic embryogenesis of Thompson Seedless and in different vegetative organs of Thompson Seedless at varying levels. Additionally, VvHDZ28 is located in the nucleus and had transcriptional activity consistent with the typical features of the HD-Zip family. Our results provide a foundation for future grape HD-Zip gene function research. The identification and expression profiles of the HD-Zip transcription factors in grape, reveal their diverse roles during ovule abortion and organ development. Our results lay a foundation for functional analysis of grape HDZ genes.
Expression quantitative trait loci: replication, tissue- and sex-specificity in mice.
van Nas, Atila; Ingram-Drake, Leslie; Sinsheimer, Janet S; Wang, Susanna S; Schadt, Eric E; Drake, Thomas; Lusis, Aldons J
2010-07-01
By treating the transcript abundance as a quantitative trait, gene expression can be mapped to local or distant genomic regions relative to the gene encoding the transcript. Local expression quantitative trait loci (eQTL) generally act in cis (that is, control the expression of only the contiguous structural gene), whereas distal eQTL act in trans. Distal eQTL are more difficult to identify with certainty due to the fact that significant thresholds are very high since all regions of the genome must be tested, and confounding factors such as batch effects can produce false positives. Here, we compare findings from two large genetic crosses between mouse strains C3H/HeJ and C57BL/6J to evaluate the reliability of distal eQTL detection, including "hotspots" influencing the expression of multiple genes in trans. We found that >63% of local eQTL and >18% of distal eQTL were replicable at a threshold of LOD > 4.3 between crosses and 76% of local and >24% of distal eQTL at a threshold of LOD > 6. Additionally, at LOD > 4.3 four tissues studied (adipose, brain, liver, and muscle) exhibited >50% preservation of local eQTL and >17% preservation of distal eQTL. We observed replicated distal eQTL hotspots between the crosses on chromosomes 9 and 17. Finally, >69% of local eQTL and >10% of distal eQTL were preserved in most tissues between sexes. We conclude that most local eQTL are highly replicable between mouse crosses, tissues, and sex as compared to distal eQTL, which exhibited modest replicability.
Zhang, Ying; Zhang, Wei; Li, Xinglan; Li, Dapeng; Zhang, Xiaoling; Yin, Yajie; Deng, Xiangyun; Sheng, Xiugui
2016-06-01
Endometrial cancer (EC) is the most prevalent malignancy worldwide. Although several efforts had been made to explore the molecular mechanism responsible for EC progression, it is still not fully understood. To evaluate the clinical characteristics and prognostic factors of patients with EC, and further to search for novel genes associated with EC progression. We recruited 328 patients with EC and analyzed prognostic factors using Cox proportional hazard regression model. Further, a gene expression profile of EC was used to identify the differentially expressed genes (DEGs) between normal samples and tumor samples. Subsequently, Kyoto Encyclopedia of Genes and Genomes pathway enrichment analysis ( http://www.genome.jp/kegg/ ) for DEGs were performed, and then protein-protein interaction (PPI) network of DEGs as well as the subnetwork of PPI were constructed with plug-in, MCODE by mapping DEGs into the Search Tool for the Retrieval of Interacting Genes database. Our results showed that body mass index (BMI), hypertension, myometrial invasion, pathological type, and Glut4 positive expression were prognostic factors in EC (P < 0.05). Bioinformatics analysis showed that upregulated DEGs were associated with cell cycle, and downregulated DEGs were related to MAPK pathway. Meanwhile, PPI network analysis revealed that upregulated CDK1 and CCNA2 as well as downregulated JUN and FOS were listed in top two nodes with high degrees. Patients with EC should be given more focused attentions in respect of pathological type, BMI, hypertension, and Glut4-positive expression. In addition, CDK1, CCNA2, JUN, and FOS might play important roles in EC development.
Swindell, William R.; Johnston, Andrew; Carbajal, Steve; Han, Gangwen; Wohn, Christian; Lu, Jun; Xing, Xianying; Nair, Rajan P.; Voorhees, John J.; Elder, James T.; Wang, Xiao-Jing; Sano, Shigetoshi; Prens, Errol P.; DiGiovanni, John; Pittelkow, Mark R.; Ward, Nicole L.; Gudjonsson, Johann E.
2011-01-01
Development of a suitable mouse model would facilitate the investigation of pathomechanisms underlying human psoriasis and would also assist in development of therapeutic treatments. However, while many psoriasis mouse models have been proposed, no single model recapitulates all features of the human disease, and standardized validation criteria for psoriasis mouse models have not been widely applied. In this study, whole-genome transcriptional profiling is used to compare gene expression patterns manifested by human psoriatic skin lesions with those that occur in five psoriasis mouse models (K5-Tie2, imiquimod, K14-AREG, K5-Stat3C and K5-TGFbeta1). While the cutaneous gene expression profiles associated with each mouse phenotype exhibited statistically significant similarity to the expression profile of psoriasis in humans, each model displayed distinctive sets of similarities and differences in comparison to human psoriasis. For all five models, correspondence to the human disease was strong with respect to genes involved in epidermal development and keratinization. Immune and inflammation-associated gene expression, in contrast, was more variable between models as compared to the human disease. These findings support the value of all five models as research tools, each with identifiable areas of convergence to and divergence from the human disease. Additionally, the approach used in this paper provides an objective and quantitative method for evaluation of proposed mouse models of psoriasis, which can be strategically applied in future studies to score strengths of mouse phenotypes relative to specific aspects of human psoriasis. PMID:21483750
Bruce, A. Gregory; Barcy, Serge; DiMaio, Terri; Gan, Emilia; Garrigues, H. Jacques; Lagunoff, Michael; Rose, Timothy M.
2017-01-01
The transcriptome of the Kaposi’s sarcoma-associated herpesvirus (KSHV/HHV8) after primary latent infection of human blood (BEC), lymphatic (LEC) and immortalized (TIME) endothelial cells was analyzed using RNAseq, and compared to long-term latency in BCBL-1 lymphoma cells. Naturally expressed transcripts were obtained without artificial induction, and a comprehensive annotation of the KSHV genome was determined. A set of unique coding sequence (UCDS) features and a process to resolve overlapping transcripts were developed to accurately quantitate transcript levels from specific promoters. Similar patterns of KSHV expression were detected in BCBL-1 cells undergoing long-term latent infections and in primary latent infections of both BEC and LEC cultures. High expression levels of poly-adenylated nuclear (PAN) RNA and spliced and unspliced transcripts encoding the K12 Kaposin B/C complex and associated microRNA region were detected, with an elevated expression of a large set of lytic genes in all latently infected cultures. Quantitation of non-overlapping regions of transcripts across the complete KSHV genome enabled for the first time accurate evaluation of the KSHV transcriptome associated with viral latency in different cell types. Hierarchical clustering applied to a gene correlation matrix identified modules of co-regulated genes with similar correlation profiles, which corresponded with biological and functional similarities of the encoded gene products. Gene modules were differentially upregulated during latency in specific cell types indicating a role for cellular factors associated with differentiated and/or proliferative states of the host cell to influence viral gene expression. PMID:28335496
Integrative Analysis Reveals Relationships of Genetic and Epigenetic Alterations in Osteosarcoma
Skårn, Magne; Namløs, Heidi M.; Barragan-Polania, Ana H.; Cleton-Jansen, Anne-Marie; Serra, Massimo; Liestøl, Knut; Hogendoorn, Pancras C. W.; Hovig, Eivind; Myklebost, Ola; Meza-Zepeda, Leonardo A.
2012-01-01
Background Osteosarcomas are the most common non-haematological primary malignant tumours of bone, and all conventional osteosarcomas are high-grade tumours showing complex genomic aberrations. We have integrated genome-wide genetic and epigenetic profiles from the EuroBoNeT panel of 19 human osteosarcoma cell lines based on microarray technologies. Principal Findings The cell lines showed complex patterns of DNA copy number changes, where genomic copy number gains were significantly associated with gene-rich regions and losses with gene-poor regions. By integrating the datasets, 350 genes were identified as having two types of aberrations (gain/over-expression, hypo-methylation/over-expression, loss/under-expression or hyper-methylation/under-expression) using a recurrence threshold of 6/19 (>30%) cell lines. The genes showed in general alterations in either DNA copy number or DNA methylation, both within individual samples and across the sample panel. These 350 genes are involved in embryonic skeletal system development and morphogenesis, as well as remodelling of extracellular matrix. The aberrations of three selected genes, CXCL5, DLX5 and RUNX2, were validated in five cell lines and five tumour samples using PCR techniques. Several genes were hyper-methylated and under-expressed compared to normal osteoblasts, and expression could be reactivated by demethylation using 5-Aza-2′-deoxycytidine treatment for four genes tested; AKAP12, CXCL5, EFEMP1 and IL11RA. Globally, there was as expected a significant positive association between gain and over-expression, loss and under-expression as well as hyper-methylation and under-expression, but gain was also associated with hyper-methylation and under-expression, suggesting that hyper-methylation may oppose the effects of increased copy number for detrimental genes. Conclusions Integrative analysis of genome-wide genetic and epigenetic alterations identified dependencies and relationships between DNA copy number, DNA methylation and mRNA expression in osteosarcomas, contributing to better understanding of osteosarcoma biology. PMID:23144859
Soyer, Jessica L; El Ghalid, Mennat; Glaser, Nicolas; Ollivier, Bénédicte; Linglin, Juliette; Grandaubert, Jonathan; Balesdent, Marie-Hélène; Connolly, Lanelle R; Freitag, Michael; Rouxel, Thierry; Fudal, Isabelle
2014-03-01
Plant pathogens secrete an arsenal of small secreted proteins (SSPs) acting as effectors that modulate host immunity to facilitate infection. SSP-encoding genes are often located in particular genomic environments and show waves of concerted expression at diverse stages of plant infection. To date, little is known about the regulation of their expression. The genome of the Ascomycete Leptosphaeria maculans comprises alternating gene-rich GC-isochores and gene-poor AT-isochores. The AT-isochores harbor mosaics of transposable elements, encompassing one-third of the genome, and are enriched in putative effector genes that present similar expression patterns, namely no expression or low-level expression during axenic cultures compared to strong induction of expression during primary infection of oilseed rape (Brassica napus). Here, we investigated the involvement of one specific histone modification, histone H3 lysine 9 methylation (H3K9me3), in epigenetic regulation of concerted effector gene expression in L. maculans. For this purpose, we silenced the expression of two key players in heterochromatin assembly and maintenance, HP1 and DIM-5 by RNAi. By using HP1-GFP as a heterochromatin marker, we observed that almost no chromatin condensation is visible in strains in which LmDIM5 was silenced by RNAi. By whole genome oligoarrays we observed overexpression of 369 or 390 genes, respectively, in the silenced-LmHP1 and -LmDIM5 transformants during growth in axenic culture, clearly favouring expression of SSP-encoding genes within AT-isochores. The ectopic integration of four effector genes in GC-isochores led to their overexpression during growth in axenic culture. These data strongly suggest that epigenetic control, mediated by HP1 and DIM-5, represses the expression of at least part of the effector genes located in AT-isochores during growth in axenic culture. Our hypothesis is that changes of lifestyle and a switch toward pathogenesis lift chromatin-mediated repression, allowing a rapid response to new environmental conditions.
da Silva, Carlos R. M.; Andrade, Alan C.; Marraccini, Pierre; Teixeira, João B.; Carazzolle, Marcelo F.; Pereira, Gonçalo A. G.; Pereira, Luiz Filipe P.; Vanzela, André L. L.; Wang, Lu; Jordan, I. King; Carareto, Claudia M. A.
2013-01-01
Plant genomes are massively invaded by transposable elements (TEs), many of which are located near host genes and can thus impact gene expression. In flowering plants, TE expression can be activated (de-repressed) under certain stressful conditions, both biotic and abiotic, as well as by genome stress caused by hybridization. In this study, we examined the effects of these stress agents on TE expression in two diploid species of coffee, Coffea canephora and C. eugenioides, and their allotetraploid hybrid C. arabica. We also explored the relationship of TE repression mechanisms to host gene regulation via the effects of exonized TE sequences. Similar to what has been seen for other plants, overall TE expression levels are low in Coffea plant cultivars, consistent with the existence of effective TE repression mechanisms. TE expression patterns are highly dynamic across the species and conditions assayed here are unrelated to their classification at the level of TE class or family. In contrast to previous results, cell culture conditions per se do not lead to the de-repression of TE expression in C. arabica. Results obtained here indicate that differing plant drought stress levels relate strongly to TE repression mechanisms. TEs tend to be expressed at significantly higher levels in non-irrigated samples for the drought tolerant cultivars but in drought sensitive cultivars the opposite pattern was shown with irrigated samples showing significantly higher TE expression. Thus, TE genome repression mechanisms may be finely tuned to the ideal growth and/or regulatory conditions of the specific plant cultivars in which they are active. Analysis of TE expression levels in cell culture conditions underscored the importance of nonsense-mediated mRNA decay (NMD) pathways in the repression of Coffea TEs. These same NMD mechanisms can also regulate plant host gene expression via the repression of genes that bear exonized TE sequences. PMID:24244387
Mosier, Annika
2018-01-22
Annika Mosier, graduate student from Stanford University presents a talk titled "In Situ Expression of Acidic and Thermophilic Carbohydrate Active Enzymes by Filamentous Fungi" at the JGI User 7th Annual Genomics of Energy & Environment Meeting on March 22, 2012 in Walnut Creek, CA.
USDA-ARS?s Scientific Manuscript database
Expressed sequence tag (EST) simple sequence repeats (SSRs) in Prunus were mined, and flanking primers designed and used for genome-wide characterization and selection of primers to optimize marker distribution and reliability. A total of 12,618 contigs were assembled from 84,727 ESTs, along with 34...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Christian, A T; Coleman, M A; Tucker, J D
2001-02-08
Gene Recovery Microdissection (GRM) is a unique and cost-effective process for producing chromosome region-specific libraries of expressed genes. It accelerates the pace, reduces the cost, and extends the capabilities of functional genomic research, the means by which scientists will put to life-saving, life-enhancing use their knowledge of any plant or animal genome.
Xu, Yuantao; Wu, Guizhi; Hao, Baohai; Chen, Lingling; Deng, Xiuxin; Xu, Qiang
2015-11-23
With the availability of rapidly increasing number of genome and transcriptome sequences, lineage-specific genes (LSGs) can be identified and characterized. Like other conserved functional genes, LSGs play important roles in biological evolution and functions. Two set of citrus LSGs, 296 citrus-specific genes (CSGs) and 1039 orphan genes specific to sweet orange, were identified by comparative analysis between the sweet orange genome sequences and 41 genomes and 273 transcriptomes. With the two sets of genes, gene structure and gene expression pattern were investigated. On average, both the CSGs and orphan genes have fewer exons, shorter gene length and higher GC content when compared with those evolutionarily conserved genes (ECs). Expression profiling indicated that most of the LSGs expressed in various tissues of sweet orange and some of them exhibited distinct temporal and spatial expression patterns. Particularly, the orphan genes were preferentially expressed in callus, which is an important pluripotent tissue of citrus. Besides, part of the CSGs and orphan genes expressed responsive to abiotic stress, indicating their potential functions during interaction with environment. This study identified and characterized two sets of LSGs in citrus, dissected their sequence features and expression patterns, and provided valuable clues for future functional analysis of the LSGs in sweet orange.
Sirigineedi, Sasibhushan; Vijayagowri, Esvaran; Murthy, Geetha N; Rao, Guruprasada; Ponnuvel, Kangayam M
2014-12-01
A comparison of the cDNA sequences (1 056 bp) of Bombyx mori DnaJ 5 homolog with B. mori genome revealed that unlike in other Hsps, it has an intron of 234 bp. The DnaJ 5 homolog contains 351 amino acids, of which 70 contain the conserved DnaJ domain at the N-terminal end. This homolog of B. mori has all desirable functional domains similar to other insects, and the 13 different DnaJ homologs identified in B. mori genome were distributed on different chromosomes. The expressed sequence tag database analysis of Hsp40 gene expression revealed higher expression in wing disc followed by diapause-induced eggs. Microarray analysis revealed higher expression of DnaJ 5 homolog at 18th h after oviposition in diapause-induced eggs. Further validation of DnaJ 5 expression through qPCR in diapause-induced and nondiapause eggs at different time intervals revealed higher expression in diapause eggs at 18 and 24 h after oviposition, which coincided with the expression of Hsp70 as the Hsp 40 is its co-chaperone. This study thus provides an outline of the genome organization of Hsp40 gene, and its role in egg diapause induction in B. mori. © 2013 Institute of Zoology, Chinese Academy of Sciences.
Inouye, Michael; Ripatti, Samuli; Kettunen, Johannes; Lyytikäinen, Leo-Pekka; Oksala, Niku; Laurila, Pirkka-Pekka; Kangas, Antti J.; Soininen, Pasi; Savolainen, Markku J.; Viikari, Jorma; Kähönen, Mika; Perola, Markus; Salomaa, Veikko; Raitakari, Olli; Lehtimäki, Terho; Taskinen, Marja-Riitta; Järvelin, Marjo-Riitta; Ala-Korpela, Mika; Palotie, Aarno; de Bakker, Paul I. W.
2012-01-01
Association testing of multiple correlated phenotypes offers better power than univariate analysis of single traits. We analyzed 6,600 individuals from two population-based cohorts with both genome-wide SNP data and serum metabolomic profiles. From the observed correlation structure of 130 metabolites measured by nuclear magnetic resonance, we identified 11 metabolic networks and performed a multivariate genome-wide association analysis. We identified 34 genomic loci at genome-wide significance, of which 7 are novel. In comparison to univariate tests, multivariate association analysis identified nearly twice as many significant associations in total. Multi-tissue gene expression studies identified variants in our top loci, SERPINA1 and AQP9, as eQTLs and showed that SERPINA1 and AQP9 expression in human blood was associated with metabolites from their corresponding metabolic networks. Finally, liver expression of AQP9 was associated with atherosclerotic lesion area in mice, and in human arterial tissue both SERPINA1 and AQP9 were shown to be upregulated (6.3-fold and 4.6-fold, respectively) in atherosclerotic plaques. Our study illustrates the power of multi-phenotype GWAS and highlights candidate genes for atherosclerosis. PMID:22916037
Li, Ming; Bui, Michelle; Yang, Ting; Bowman, Christian S.; White, Bradley J.; Akbari, Omar S.
2017-01-01
The development of CRISPR/Cas9 technologies has dramatically increased the accessibility and efficiency of genome editing in many organisms. In general, in vivo germline expression of Cas9 results in substantially higher activity than embryonic injection. However, no transgenic lines expressing Cas9 have been developed for the major mosquito disease vector Aedes aegypti. Here, we describe the generation of multiple stable, transgenic Ae. aegypti strains expressing Cas9 in the germline, resulting in dramatic improvements in both the consistency and efficiency of genome modifications using CRISPR. Using these strains, we disrupted numerous genes important for normal morphological development, and even generated triple mutants from a single injection. We have also managed to increase the rates of homology-directed repair by more than an order of magnitude. Given the exceptional mutagenic efficiency and specificity of the Cas9 strains we engineered, they can be used for high-throughput reverse genetic screens to help functionally annotate the Ae. aegypti genome. Additionally, these strains represent a step toward the development of novel population control technologies targeting Ae. aegypti that rely on Cas9-based gene drives. PMID:29138316
Li, Ming; Bui, Michelle; Yang, Ting; Bowman, Christian S; White, Bradley J; Akbari, Omar S
2017-12-05
The development of CRISPR/Cas9 technologies has dramatically increased the accessibility and efficiency of genome editing in many organisms. In general, in vivo germline expression of Cas9 results in substantially higher activity than embryonic injection. However, no transgenic lines expressing Cas9 have been developed for the major mosquito disease vector Aedes aegypti Here, we describe the generation of multiple stable, transgenic Ae. aegypti strains expressing Cas9 in the germline, resulting in dramatic improvements in both the consistency and efficiency of genome modifications using CRISPR. Using these strains, we disrupted numerous genes important for normal morphological development, and even generated triple mutants from a single injection. We have also managed to increase the rates of homology-directed repair by more than an order of magnitude. Given the exceptional mutagenic efficiency and specificity of the Cas9 strains we engineered, they can be used for high-throughput reverse genetic screens to help functionally annotate the Ae. aegypti genome. Additionally, these strains represent a step toward the development of novel population control technologies targeting Ae. aegypti that rely on Cas9-based gene drives. Copyright © 2017 the Author(s). Published by PNAS.
Variant ribosomal RNA alleles are conserved and exhibit tissue-specific expression
Parks, Matthew M.; Kurylo, Chad M.; Dass, Randall A.; Bojmar, Linda; Lyden, David; Vincent, C. Theresa; Blanchard, Scott C.
2018-01-01
The ribosome, the integration point for protein synthesis in the cell, is conventionally considered a homogeneous molecular assembly that only passively contributes to gene expression. Yet, epigenetic features of the ribosomal DNA (rDNA) operon and changes in the ribosome’s molecular composition have been associated with disease phenotypes, suggesting that the ribosome itself may possess inherent regulatory capacity. Analyzing whole-genome sequencing data from the 1000 Genomes Project and the Mouse Genomes Project, we find that rDNA copy number varies widely across individuals, and we identify pervasive intra- and interindividual nucleotide variation in the 5S, 5.8S, 18S, and 28S ribosomal RNA (rRNA) genes of both human and mouse. Conserved rRNA sequence heterogeneities map to functional centers of the assembled ribosome, variant rRNA alleles exhibit tissue-specific expression, and ribosomes bearing variant rRNA alleles are present in the actively translating ribosome pool. These findings provide a critical framework for exploring the possibility that the expression of genomically encoded variant rRNA alleles gives rise to physically and functionally heterogeneous ribosomes that contribute to mammalian physiology and human disease. PMID:29503865
Brizuela, Leonardo; Richardson, Aaron; Marsischky, Gerald; Labaer, Joshua
2002-01-01
Thanks to the results of the multiple completed and ongoing genome sequencing projects and to the newly available recombination-based cloning techniques, it is now possible to build gene repositories with no precedent in their composition, formatting, and potential. This new type of gene repository is necessary to address the challenges imposed by the post-genomic era, i.e., experimentation on a genome-wide scale. We are building the FLEXGene (Full Length EXpression-ready) repository. This unique resource will contain clones representing the complete ORFeome of different organisms, including Homo sapiens as well as several pathogens and model organisms. It will consist of a comprehensive, characterized (sequence-verified), and arrayed gene repository. This resource will allow full exploitation of the genomic information by enabling genome-wide scale experimentation at the level of functional/phenotypic assays as well as at the level of protein expression, purification, and analysis. Here we describe the rationale and construction of this resource and focus on the data obtained from the Saccharomyces cerevisiae project.
Gusev, Oleg; Suetsugu, Yoshitaka; Cornette, Richard; Kawashima, Takeshi; Logacheva, Maria D.; Kondrashov, Alexey S.; Penin, Aleksey A.; Hatanaka, Rie; Kikuta, Shingo; Shimura, Sachiko; Kanamori, Hiroyuki; Katayose, Yuichi; Matsumoto, Takashi; Shagimardanova, Elena; Alexeev, Dmitry; Govorun, Vadim; Wisecaver, Jennifer; Mikheyev, Alexander; Koyanagi, Ryo; Fujie, Manabu; Nishiyama, Tomoaki; Shigenobu, Shuji; Shibata, Tomoko F.; Golygina, Veronika; Hasebe, Mitsuyasu; Okuda, Takashi; Satoh, Nori; Kikawada, Takahiro
2014-01-01
Anhydrobiosis represents an extreme example of tolerance adaptation to water loss, where an organism can survive in an ametabolic state until water returns. Here we report the first comparative analysis examining the genomic background of extreme desiccation tolerance, which is exclusively found in larvae of the only anhydrobiotic insect, Polypedilum vanderplanki. We compare the genomes of P. vanderplanki and a congeneric desiccation-sensitive midge P. nubifer. We determine that the genome of the anhydrobiotic species specifically contains clusters of multi-copy genes with products that act as molecular shields. In addition, the genome possesses several groups of genes with high similarity to known protective proteins. However, these genes are located in distinct paralogous clusters in the genome apart from the classical orthologues of the corresponding genes shared by both chironomids and other insects. The transcripts of these clustered paralogues contribute to a large majority of the mRNA pool in the desiccating larvae and most likely define successful anhydrobiosis. Comparison of expression patterns of orthologues between two chironomid species provides evidence for the existence of desiccation-specific gene expression systems in P. vanderplanki. PMID:25216354