identify additional genes: Topics by Science.gov

Sample records for identify additional genes

High-throughput discovery of novel developmental phenotypes

PubMed Central

Dickinson, Mary E.; Flenniken, Ann M.; Ji, Xiao; Teboul, Lydia; Wong, Michael D.; White, Jacqueline K.; Meehan, Terrence F.; Weninger, Wolfgang J.; Westerberg, Henrik; Adissu, Hibret; Baker, Candice N.; Bower, Lynette; Brown, James M.; Caddle, L. Brianna; Chiani, Francesco; Clary, Dave; Cleak, James; Daly, Mark J.; Denegre, James M.; Doe, Brendan; Dolan, Mary E.; Edie, Sarah M.; Fuchs, Helmut; Gailus-Durner, Valerie; Galli, Antonella; Gambadoro, Alessia; Gallegos, Juan; Guo, Shiying; Horner, Neil R.; Hsu, Chih-wei; Johnson, Sara J.; Kalaga, Sowmya; Keith, Lance C.; Lanoue, Louise; Lawson, Thomas N.; Lek, Monkol; Mark, Manuel; Marschall, Susan; Mason, Jeremy; McElwee, Melissa L.; Newbigging, Susan; Nutter, Lauryl M.J.; Peterson, Kevin A.; Ramirez-Solis, Ramiro; Rowland, Douglas J.; Ryder, Edward; Samocha, Kaitlin E.; Seavitt, John R.; Selloum, Mohammed; Szoke-Kovacs, Zsombor; Tamura, Masaru; Trainor, Amanda G; Tudose, Ilinca; Wakana, Shigeharu; Warren, Jonathan; Wendling, Olivia; West, David B.; Wong, Leeyean; Yoshiki, Atsushi; MacArthur, Daniel G.; Tocchini-Valentini, Glauco P.; Gao, Xiang; Flicek, Paul; Bradley, Allan; Skarnes, William C.; Justice, Monica J.; Parkinson, Helen E.; Moore, Mark; Wells, Sara; Braun, Robert E.; Svenson, Karen L.; de Angelis, Martin Hrabe; Herault, Yann; Mohun, Tim; Mallon, Ann-Marie; Henkelman, R. Mark; Brown, Steve D.M.; Adams, David J.; Lloyd, K.C. Kent; McKerlie, Colin; Beaudet, Arthur L.; Bucan, Maja; Murray, Stephen A.

2016-01-01

Approximately one third of all mammalian genes are essential for life. Phenotypes resulting from mouse knockouts of these genes have provided tremendous insight into gene function and congenital disorders. As part of the International Mouse Phenotyping Consortium effort to generate and phenotypically characterize 5000 knockout mouse lines, we have identified 410 lethal genes during the production of the first 1751 unique gene knockouts. Using a standardised phenotyping platform that incorporates high-resolution 3D imaging, we identified novel phenotypes at multiple time points for previously uncharacterized genes and additional phenotypes for genes with previously reported mutant phenotypes. Unexpectedly, our analysis reveals that incomplete penetrance and variable expressivity are common even on a defined genetic background. In addition, we show that human disease genes are enriched for essential genes identified in our screen, thus providing a novel dataset that facilitates prioritization and validation of mutations identified in clinical sequencing efforts. PMID:27626380
Combining Genome-Scale Experimental and Computational Methods To Identify Essential Genes in Rhodobacter sphaeroides

DOE PAGES

Burger, Brian T.; Imam, Saheed; Scarborough, Matthew J.; ...

2017-06-06

Rhodobacter sphaeroides is one of the best-studied alphaproteobacteria from biochemical, genetic, and genomic perspectives. To gain a better systems-level understanding of this organism, we generated a large transposon mutant library and used transposon sequencing (Tn-seq) to identify genes that are essential under several growth conditions. Using newly developed Tn-seq analysis software (TSAS), we identified 493 genes as essential for aerobic growth on a rich medium. We then used the mutant library to identify conditionally essential genes under two laboratory growth conditions, identifying 85 additional genes required for aerobic growth in a minimal medium and 31 additional genes required for photosyntheticmore » growth. In all instances, our analyses confirmed essentiality for many known genes and identified genes not previously considered to be essential. We used the resulting Tn-seq data to refine and improve a genome-scale metabolic network model (GEM) for R. sphaeroides. Together, we demonstrate how genetic, genomic, and computational approaches can be combined to obtain a systems-level understanding of the genetic framework underlying metabolic diversity in bacterial species.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)

Burger, Brian T.; Imam, Saheed; Scarborough, Matthew J.

Rhodobacter sphaeroides is one of the best-studied alphaproteobacteria from biochemical, genetic, and genomic perspectives. To gain a better systems-level understanding of this organism, we generated a large transposon mutant library and used transposon sequencing (Tn-seq) to identify genes that are essential under several growth conditions. Using newly developed Tn-seq analysis software (TSAS), we identified 493 genes as essential for aerobic growth on a rich medium. We then used the mutant library to identify conditionally essential genes under two laboratory growth conditions, identifying 85 additional genes required for aerobic growth in a minimal medium and 31 additional genes required for photosyntheticmore » growth. In all instances, our analyses confirmed essentiality for many known genes and identified genes not previously considered to be essential. We used the resulting Tn-seq data to refine and improve a genome-scale metabolic network model (GEM) for R. sphaeroides. Together, we demonstrate how genetic, genomic, and computational approaches can be combined to obtain a systems-level understanding of the genetic framework underlying metabolic diversity in bacterial species.« less
High-throughput discovery of novel developmental phenotypes.

PubMed

Dickinson, Mary E; Flenniken, Ann M; Ji, Xiao; Teboul, Lydia; Wong, Michael D; White, Jacqueline K; Meehan, Terrence F; Weninger, Wolfgang J; Westerberg, Henrik; Adissu, Hibret; Baker, Candice N; Bower, Lynette; Brown, James M; Caddle, L Brianna; Chiani, Francesco; Clary, Dave; Cleak, James; Daly, Mark J; Denegre, James M; Doe, Brendan; Dolan, Mary E; Edie, Sarah M; Fuchs, Helmut; Gailus-Durner, Valerie; Galli, Antonella; Gambadoro, Alessia; Gallegos, Juan; Guo, Shiying; Horner, Neil R; Hsu, Chih-Wei; Johnson, Sara J; Kalaga, Sowmya; Keith, Lance C; Lanoue, Louise; Lawson, Thomas N; Lek, Monkol; Mark, Manuel; Marschall, Susan; Mason, Jeremy; McElwee, Melissa L; Newbigging, Susan; Nutter, Lauryl M J; Peterson, Kevin A; Ramirez-Solis, Ramiro; Rowland, Douglas J; Ryder, Edward; Samocha, Kaitlin E; Seavitt, John R; Selloum, Mohammed; Szoke-Kovacs, Zsombor; Tamura, Masaru; Trainor, Amanda G; Tudose, Ilinca; Wakana, Shigeharu; Warren, Jonathan; Wendling, Olivia; West, David B; Wong, Leeyean; Yoshiki, Atsushi; MacArthur, Daniel G; Tocchini-Valentini, Glauco P; Gao, Xiang; Flicek, Paul; Bradley, Allan; Skarnes, William C; Justice, Monica J; Parkinson, Helen E; Moore, Mark; Wells, Sara; Braun, Robert E; Svenson, Karen L; de Angelis, Martin Hrabe; Herault, Yann; Mohun, Tim; Mallon, Ann-Marie; Henkelman, R Mark; Brown, Steve D M; Adams, David J; Lloyd, K C Kent; McKerlie, Colin; Beaudet, Arthur L; Bućan, Maja; Murray, Stephen A

2016-09-22

Approximately one-third of all mammalian genes are essential for life. Phenotypes resulting from knockouts of these genes in mice have provided tremendous insight into gene function and congenital disorders. As part of the International Mouse Phenotyping Consortium effort to generate and phenotypically characterize 5,000 knockout mouse lines, here we identify 410 lethal genes during the production of the first 1,751 unique gene knockouts. Using a standardized phenotyping platform that incorporates high-resolution 3D imaging, we identify phenotypes at multiple time points for previously uncharacterized genes and additional phenotypes for genes with previously reported mutant phenotypes. Unexpectedly, our analysis reveals that incomplete penetrance and variable expressivity are common even on a defined genetic background. In addition, we show that human disease genes are enriched for essential genes, thus providing a dataset that facilitates the prioritization and validation of mutations identified in clinical sequencing efforts.
A genome-wide association study of corneal astigmatism: The CREAM Consortium.

PubMed

Shah, Rupal L; Li, Qing; Zhao, Wanting; Tedja, Milly S; Tideman, J Willem L; Khawaja, Anthony P; Fan, Qiao; Yazar, Seyhan; Williams, Katie M; Verhoeven, Virginie J M; Xie, Jing; Wang, Ya Xing; Hess, Moritz; Nickels, Stefan; Lackner, Karl J; Pärssinen, Olavi; Wedenoja, Juho; Biino, Ginevra; Concas, Maria Pina; Uitterlinden, André; Rivadeneira, Fernando; Jaddoe, Vincent W V; Hysi, Pirro G; Sim, Xueling; Tan, Nicholas; Tham, Yih-Chung; Sensaki, Sonoko; Hofman, Albert; Vingerling, Johannes R; Jonas, Jost B; Mitchell, Paul; Hammond, Christopher J; Höhn, René; Baird, Paul N; Wong, Tien-Yin; Cheng, Chinfsg-Yu; Teo, Yik Ying; Mackey, David A; Williams, Cathy; Saw, Seang-Mei; Klaver, Caroline C W; Guggenheim, Jeremy A; Bailey-Wilson, Joan E

2018-01-01

To identify genes and genetic markers associated with corneal astigmatism. A meta-analysis of genome-wide association studies (GWASs) of corneal astigmatism undertaken for 14 European ancestry (n=22,250) and 8 Asian ancestry (n=9,120) cohorts was performed by the Consortium for Refractive Error and Myopia. Cases were defined as having >0.75 diopters of corneal astigmatism. Subsequent gene-based and gene-set analyses of the meta-analyzed results of European ancestry cohorts were performed using VEGAS2 and MAGMA software. Additionally, estimates of single nucleotide polymorphism (SNP)-based heritability for corneal and refractive astigmatism and the spherical equivalent were calculated for Europeans using LD score regression. The meta-analysis of all cohorts identified a genome-wide significant locus near the platelet-derived growth factor receptor alpha ( PDGFRA ) gene: top SNP: rs7673984, odds ratio=1.12 (95% CI:1.08-1.16), p=5.55×10 -9 . No other genome-wide significant loci were identified in the combined analysis or European/Asian ancestry-specific analyses. Gene-based analysis identified three novel candidate genes for corneal astigmatism in Europeans-claudin-7 ( CLDN7 ), acid phosphatase 2, lysosomal ( ACP2 ), and TNF alpha-induced protein 8 like 3 ( TNFAIP8L3 ). In addition to replicating a previously identified genome-wide significant locus for corneal astigmatism near the PDGFRA gene, gene-based analysis identified three novel candidate genes, CLDN7 , ACP2 , and TNFAIP8L3 , that warrant further investigation to understand their role in the pathogenesis of corneal astigmatism. The much lower number of genetic variants and genes demonstrating an association with corneal astigmatism compared to published spherical equivalent GWAS analyses suggest a greater influence of rare genetic variants, non-additive genetic effects, or environmental factors in the development of astigmatism.
Automated Update, Revision, and Quality Control of the Maize Genome Annotations Using MAKER-P Improves the B73 RefGen_v3 Gene Models and Identifies New Genes1[OPEN

PubMed Central

Law, MeiYee; Childs, Kevin L.; Campbell, Michael S.; Stein, Joshua C.; Olson, Andrew J.; Holt, Carson; Panchy, Nicholas; Lei, Jikai; Jiao, Dian; Andorf, Carson M.; Lawrence, Carolyn J.; Ware, Doreen; Shiu, Shin-Han; Sun, Yanni; Jiang, Ning; Yandell, Mark

2015-01-01

The large size and relative complexity of many plant genomes make creation, quality control, and dissemination of high-quality gene structure annotations challenging. In response, we have developed MAKER-P, a fast and easy-to-use genome annotation engine for plants. Here, we report the use of MAKER-P to update and revise the maize (Zea mays) B73 RefGen_v3 annotation build (5b+) in less than 3 h using the iPlant Cyberinfrastructure. MAKER-P identified and annotated 4,466 additional, well-supported protein-coding genes not present in the 5b+ annotation build, added additional untranslated regions to 1,393 5b+ gene models, identified 2,647 5b+ gene models that lack any supporting evidence (despite the use of large and diverse evidence data sets), identified 104,215 pseudogene fragments, and created an additional 2,522 noncoding gene annotations. We also describe a method for de novo training of MAKER-P for the annotation of newly sequenced grass genomes. Collectively, these results lead to the 6a maize genome annotation and demonstrate the utility of MAKER-P for rapid annotation, management, and quality control of grasses and other difficult-to-annotate plant genomes. PMID:25384563
Gene Expression Changes in Cervical Squamous Cell Carcinoma After Initiation of Chemoradiation and Correlation With Clinical Outcome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Klopp, Ann H.; Jhingran, Anuja; Ramdas, Latha

2008-05-01

Purpose: The purpose of this study was to investigate early gene expression changes after chemoradiation in a human solid tumor, allowing identification of chemoradiation-induced gene expression changes in the tumor as well as the tumor microenvironment. In addition we aimed to identify a gene expression profile that was associated with clinical outcome. Methods and Materials: Microarray experiments were performed on cervical cancer specimens obtained before and 48 h after chemoradiation from 12 patients with Stage IB2 to IIIB squamous cell carcinoma of the cervix treated between April 2001 and August 2002. Results: A total of 262 genes were identified thatmore » were significantly changed after chemoradiation. Genes involved in DNA repair were identified including DDB2, ERCC4, GADD45A, and XPC. In addition, significantly regulated cell-to-cell signaling pathways included insulin-like growth factor-1 (IGF-1), interferon, and vascular endothelial growth factor signaling. At a median follow-up of 41 months, 5 of 12 patients had experienced either local or distant failure. Supervised clustering analysis identified a 58-gene set from the pretreatment samples that were differentially expressed between patients with and without recurrence. Genes involved in integrin signaling and apoptosis pathways were identified in this gene set. Immortalization-upregulated protein (IMUP), IGF-2, and ARHD had particularly marked differences in expression between patients with and without recurrence. Conclusions: Genetic profiling identified genes regulated by chemoradiation including DNA damage and cell-to-cell signaling pathways. Genes associated with recurrence were identified that will require validation in an independent patient data set to determine whether the 58-gene set associated with clinical outcome could be useful as a prognostic assay.« less
Pathway-driven gene stability selection of two rheumatoid arthritis GWAS identifies and validates new susceptibility genes in receptor mediated signalling pathways.

PubMed

Eleftherohorinou, Hariklia; Hoggart, Clive J; Wright, Victoria J; Levin, Michael; Coin, Lachlan J M

2011-09-01

Rheumatoid arthritis (RA) is the commonest chronic, systemic, inflammatory disorder affecting ∼1% of the world population. It has a strong genetic component and a growing number of associated genes have been discovered in genome-wide association studies (GWAS), which nevertheless only account for 23% of the total genetic risk. We aimed to identify additional susceptibility loci through the analysis of GWAS in the context of biological function. We bridge the gap between pathway and gene-oriented analyses of GWAS, by introducing a pathway-driven gene stability-selection methodology that identifies potential causal genes in the top-associated disease pathways that may be driving the pathway association signals. We analysed the WTCCC and the NARAC studies of ∼5000 and ∼2000 subjects, respectively. We examined 700 pathways comprising ∼8000 genes. Ranking pathways by significance revealed that the NARAC top-ranked ∼6% laid within the top 10% of WTCCC. Gene selection on those pathways identified 58 genes in WTCCC and 61 in NARAC; 21 of those were common (P(overlap)< 10(-21)), of which 16 were novel discoveries. Among the identified genes, we validated 10 known RA associations in WTCCC and 13 in NARAC, not discovered using single-SNP approaches on the same data. Gene ontology functional enrichment analysis on the identified genes showed significant over-representation of signalling activity (P< 10(-29)) in both studies. Our findings suggest a novel model of RA genetic predisposition, which involves cell-membrane receptors and genes in second messenger signalling systems, in addition to genes that regulate immune responses, which have been the focus of interest previously.
Automated update, revision, and quality control of the maize genome annotations using MAKER-P improves the B73 RefGen_v3 gene models and identifies new genes.

PubMed

Law, MeiYee; Childs, Kevin L; Campbell, Michael S; Stein, Joshua C; Olson, Andrew J; Holt, Carson; Panchy, Nicholas; Lei, Jikai; Jiao, Dian; Andorf, Carson M; Lawrence, Carolyn J; Ware, Doreen; Shiu, Shin-Han; Sun, Yanni; Jiang, Ning; Yandell, Mark

2015-01-01

The large size and relative complexity of many plant genomes make creation, quality control, and dissemination of high-quality gene structure annotations challenging. In response, we have developed MAKER-P, a fast and easy-to-use genome annotation engine for plants. Here, we report the use of MAKER-P to update and revise the maize (Zea mays) B73 RefGen_v3 annotation build (5b+) in less than 3 h using the iPlant Cyberinfrastructure. MAKER-P identified and annotated 4,466 additional, well-supported protein-coding genes not present in the 5b+ annotation build, added additional untranslated regions to 1,393 5b+ gene models, identified 2,647 5b+ gene models that lack any supporting evidence (despite the use of large and diverse evidence data sets), identified 104,215 pseudogene fragments, and created an additional 2,522 noncoding gene annotations. We also describe a method for de novo training of MAKER-P for the annotation of newly sequenced grass genomes. Collectively, these results lead to the 6a maize genome annotation and demonstrate the utility of MAKER-P for rapid annotation, management, and quality control of grasses and other difficult-to-annotate plant genomes. © 2015 American Society of Plant Biologists. All Rights Reserved.
A genome-wide association study of corneal astigmatism: The CREAM Consortium

PubMed Central

Shah, Rupal L.; Li, Qing; Zhao, Wanting; Tedja, Milly S.; Tideman, J. Willem L.; Khawaja, Anthony P.; Fan, Qiao; Yazar, Seyhan; Williams, Katie M.; Verhoeven, Virginie J.M.; Xie, Jing; Wang, Ya Xing; Hess, Moritz; Nickels, Stefan; Lackner, Karl J.; Pärssinen, Olavi; Wedenoja, Juho; Biino, Ginevra; Concas, Maria Pina; Uitterlinden, André; Rivadeneira, Fernando; Jaddoe, Vincent W.V.; Hysi, Pirro G.; Sim, Xueling; Tan, Nicholas; Tham, Yih-Chung; Sensaki, Sonoko; Hofman, Albert; Vingerling, Johannes R.; Jonas, Jost B.; Mitchell, Paul; Hammond, Christopher J.; Höhn, René; Baird, Paul N.; Wong, Tien-Yin; Cheng, Chinfsg-Yu; Teo, Yik Ying; Mackey, David A.; Williams, Cathy; Saw, Seang-Mei; Klaver, Caroline C.W.; Bailey-Wilson, Joan E.

2018-01-01

Purpose To identify genes and genetic markers associated with corneal astigmatism. Methods A meta-analysis of genome-wide association studies (GWASs) of corneal astigmatism undertaken for 14 European ancestry (n=22,250) and 8 Asian ancestry (n=9,120) cohorts was performed by the Consortium for Refractive Error and Myopia. Cases were defined as having >0.75 diopters of corneal astigmatism. Subsequent gene-based and gene-set analyses of the meta-analyzed results of European ancestry cohorts were performed using VEGAS2 and MAGMA software. Additionally, estimates of single nucleotide polymorphism (SNP)-based heritability for corneal and refractive astigmatism and the spherical equivalent were calculated for Europeans using LD score regression. Results The meta-analysis of all cohorts identified a genome-wide significant locus near the platelet-derived growth factor receptor alpha (PDGFRA) gene: top SNP: rs7673984, odds ratio=1.12 (95% CI:1.08–1.16), p=5.55×10−9. No other genome-wide significant loci were identified in the combined analysis or European/Asian ancestry-specific analyses. Gene-based analysis identified three novel candidate genes for corneal astigmatism in Europeans—claudin-7 (CLDN7), acid phosphatase 2, lysosomal (ACP2), and TNF alpha-induced protein 8 like 3 (TNFAIP8L3). Conclusions In addition to replicating a previously identified genome-wide significant locus for corneal astigmatism near the PDGFRA gene, gene-based analysis identified three novel candidate genes, CLDN7, ACP2, and TNFAIP8L3, that warrant further investigation to understand their role in the pathogenesis of corneal astigmatism. The much lower number of genetic variants and genes demonstrating an association with corneal astigmatism compared to published spherical equivalent GWAS analyses suggest a greater influence of rare genetic variants, non-additive genetic effects, or environmental factors in the development of astigmatism. PMID:29422769
Screening for ATM Mutations in an African-American Population to Identify a Predictor of Breast Cancer Susceptibility

DTIC Science & Technology

2006-07-01

ATM genetic variant identified affects radiosensitivity and levels of the protein encoded by the ATM gene for each mutation examined. 15. SUBJECT...women without breast cancer. An additional objective is to determine the functional impact upon the protein encoded by the ATM gene for each mutation ...each ATM variant identified affects radiosensitivity and levels of the protein encoded by the ATM gene for mutations identified. Body STATEMENT
Discovery of gene-gene interactions across multiple independent data sets of late onset Alzheimer disease from the Alzheimer Disease Genetics Consortium.

PubMed

Hohman, Timothy J; Bush, William S; Jiang, Lan; Brown-Gentry, Kristin D; Torstenson, Eric S; Dudek, Scott M; Mukherjee, Shubhabrata; Naj, Adam; Kunkle, Brian W; Ritchie, Marylyn D; Martin, Eden R; Schellenberg, Gerard D; Mayeux, Richard; Farrer, Lindsay A; Pericak-Vance, Margaret A; Haines, Jonathan L; Thornton-Wells, Tricia A

2016-02-01

Late-onset Alzheimer disease (AD) has a complex genetic etiology, involving locus heterogeneity, polygenic inheritance, and gene-gene interactions; however, the investigation of interactions in recent genome-wide association studies has been limited. We used a biological knowledge-driven approach to evaluate gene-gene interactions for consistency across 13 data sets from the Alzheimer Disease Genetics Consortium. Fifteen single nucleotide polymorphism (SNP)-SNP pairs within 3 gene-gene combinations were identified: SIRT1 × ABCB1, PSAP × PEBP4, and GRIN2B × ADRA1A. In addition, we extend a previously identified interaction from an endophenotype analysis between RYR3 × CACNA1C. Finally, post hoc gene expression analyses of the implicated SNPs further implicate SIRT1 and ABCB1, and implicate CDH23 which was most recently identified as an AD risk locus in an epigenetic analysis of AD. The observed interactions in this article highlight ways in which genotypic variation related to disease may depend on the genetic context in which it occurs. Further, our results highlight the utility of evaluating genetic interactions to explain additional variance in AD risk and identify novel molecular mechanisms of AD pathogenesis. Copyright © 2016 Elsevier Inc. All rights reserved.
Bioinformatic analysis of the effects and mechanisms of decitabine and cytarabine on acute myeloid leukemia

PubMed Central

Zhou, Shiyong; Liu, Pengfei; Zhang, Huilai

2017-01-01

Acute myeloid leukemia (AML) is a frequently occurring malignant disease of the blood and may result from a variety of genetic disorders. The present study aimed to identify the underlying mechanisms associated with the therapeutic effects of decitabine and cytarabine on AML, using microarray analysis. The microarray datasets GSE40442 and GSE40870 were downloaded from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) and differentially methylated sites were identified in AML cells treated with decitabine compared with those treated with cytarabine via the Linear Models for Microarray Data package, following data pre-processing. Gene Ontology (GO) analysis of DEGs was performed using the Database for Annotation, Visualization and Integrated Analysis Discovery. Genes corresponding to the differentially methylated sites were obtained using the annotation package of the methylation microarray platform. The overlapping genes were identified, which exhibited the opposite variation trend between gene expression and DNA methylation. Important transcription factor (TF)-gene pairs were screened out, and a regulated network subsequently constructed. A total of 190 DEGs and 540 differentially methylated sites were identified in AML cells treated with decitabine compared with those treated with cytarabine. A total of 36 GO terms of DEGs were enriched, including nucleosomes, protein-DNA complexes and the nucleosome assembly. The 540 differentially methylated sites were located on 240 genes, including the acid-repeat containing protein (ACRC) gene that was additionally differentially expressed. In addition, 60 TF pairs and overlapped methylated sites, and 140 TF-pairs and DEGs were screened out. The regulated network included 68 nodes and 140 TF-gene pairs. The present study identified various genes including ACRC and proliferating cell nuclear antigen, in addition to various TFs, including TATA-box binding protein associated factor 1 and CCCTC-binding factor, which may be potential therapeutic targets of AML. PMID:28498449
Bioinformatic analysis of the effects and mechanisms of decitabine and cytarabine on acute myeloid leukemia.

PubMed

Zhou, Shiyong; Liu, Pengfei; Zhang, Huilai

2017-07-01

Acute myeloid leukemia (AML) is a frequently occurring malignant disease of the blood and may result from a variety of genetic disorders. The present study aimed to identify the underlying mechanisms associated with the therapeutic effects of decitabine and cytarabine on AML, using microarray analysis. The microarray datasets GSE40442 and GSE40870 were downloaded from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) and differentially methylated sites were identified in AML cells treated with decitabine compared with those treated with cytarabine via the Linear Models for Microarray Data package, following data pre‑processing. Gene Ontology (GO) analysis of DEGs was performed using the Database for Annotation, Visualization and Integrated Analysis Discovery. Genes corresponding to the differentially methylated sites were obtained using the annotation package of the methylation microarray platform. The overlapping genes were identified, which exhibited the opposite variation trend between gene expression and DNA methylation. Important transcription factor (TF)‑gene pairs were screened out, and a regulated network subsequently constructed. A total of 190 DEGs and 540 differentially methylated sites were identified in AML cells treated with decitabine compared with those treated with cytarabine. A total of 36 GO terms of DEGs were enriched, including nucleosomes, protein‑DNA complexes and the nucleosome assembly. The 540 differentially methylated sites were located on 240 genes, including the acid‑repeat containing protein (ACRC) gene that was additionally differentially expressed. In addition, 60 TF pairs and overlapped methylated sites, and 140 TF‑pairs and DEGs were screened out. The regulated network included 68 nodes and 140 TF‑gene pairs. The present study identified various genes including ACRC and proliferating cell nuclear antigen, in addition to various TFs, including TATA‑box binding protein associated factor 1 and CCCTC‑binding factor, which may be potential therapeutic targets of AML.
Multi-variant study of obesity risk genes in African Americans: The Jackson Heart Study.

PubMed

Liu, Shijian; Wilson, James G; Jiang, Fan; Griswold, Michael; Correa, Adolfo; Mei, Hao

2016-11-30

Genome-wide association study (GWAS) has been successful in identifying obesity risk genes by single-variant association analysis. For this study, we designed steps of analysis strategy and aimed to identify multi-variant effects on obesity risk among candidate genes. Our analyses were focused on 2137 African American participants with body mass index measured in the Jackson Heart Study and 657 common single nucleotide polymorphisms (SNPs) genotyped at 8 GWAS-identified obesity risk genes. Single-variant association test showed that no SNPs reached significance after multiple testing adjustment. The following gene-gene interaction analysis, which was focused on SNPs with unadjusted p-value<0.10, identified 6 significant multi-variant associations. Logistic regression showed that SNPs in these associations did not have significant linear interactions; examination of genetic risk score evidenced that 4 multi-variant associations had significant additive effects of risk SNPs; and haplotype association test presented that all multi-variant associations contained one or several combinations of particular alleles or haplotypes, associated with increased obesity risk. Our study evidenced that obesity risk genes generated multi-variant effects, which can be additive or non-linear interactions, and multi-variant study is an important supplement to existing GWAS for understanding genetic effects of obesity risk genes. Copyright © 2016 Elsevier B.V. All rights reserved.
A complex regulatory network controls aerobic ethanol oxidation in Pseudomonas aeruginosa: indication of four levels of sensor kinases and response regulators.

PubMed

Mern, Demissew S; Ha, Seung-Wook; Khodaverdi, Viola; Gliese, Nicole; Görisch, Helmut

2010-05-01

In addition to the known response regulator ErbR (former AgmR) and the two-component regulatory system EraSR (former ExaDE), three additional regulatory proteins have been identified as being involved in controlling transcription of the aerobic ethanol oxidation system in Pseudomonas aeruginosa. Two putative sensor kinases, ErcS and ErcS', and a response regulator, ErdR, were found, all of which show significant similarity to the two-component flhSR system that controls methanol and formaldehyde metabolism in Paracoccus denitrificans. All three identified response regulators, EraR (formerly ExaE), ErbR (formerly AgmR) and ErdR, are members of the luxR family. The three sensor kinases EraS (formerly ExaD), ErcS and ErcS' do not contain a membrane domain. Apparently, they are localized in the cytoplasm and recognize cytoplasmic signals. Inactivation of gene ercS caused an extended lag phase on ethanol. Inactivation of both genes, ercS and ercS', resulted in no growth at all on ethanol, as did inactivation of erdR. Of the three sensor kinases and three response regulators identified thus far, only the EraSR (formerly ExaDE) system forms a corresponding kinase/regulator pair. Using reporter gene constructs of all identified regulatory genes in different mutants allowed the hierarchy of a hypothetical complex regulatory network to be established. Probably, two additional sensor kinases and two additional response regulators, which are hidden among the numerous regulatory genes annotated in the genome of P. aeruginosa, remain to be identified.
Comparing GWAS Results of Complex Traits Using Full Genetic Model and Additive Models for Revealing Genetic Architecture

PubMed Central

Monir, Md. Mamun; Zhu, Jun

2017-01-01

Most of the genome-wide association studies (GWASs) for human complex diseases have ignored dominance, epistasis and ethnic interactions. We conducted comparative GWASs for total cholesterol using full model and additive models, which illustrate the impacts of the ignoring genetic variants on analysis results and demonstrate how genetic effects of multiple loci could differ across different ethnic groups. There were 15 quantitative trait loci with 13 individual loci and 3 pairs of epistasis loci identified by full model, whereas only 14 loci (9 common loci and 5 different loci) identified by multi-loci additive model. Again, 4 full model detected loci were not detected using multi-loci additive model. PLINK-analysis identified two loci and GCTA-analysis detected only one locus with genome-wide significance. Full model identified three previously reported genes as well as several new genes. Bioinformatics analysis showed some new genes are related with cholesterol related chemicals and/or diseases. Analyses of cholesterol data and simulation studies revealed that the full model performs were better than the additive-model performs in terms of detecting power and unbiased estimations of genetic variants of complex traits. PMID:28079101
Comprehensive analysis of MHC class I genes from the U-, S-, and Z-lineages in Atlantic salmon.

PubMed

Lukacs, Morten F; Harstad, Håvard; Bakke, Hege G; Beetz-Sargent, Marianne; McKinnel, Linda; Lubieniecki, Krzysztof P; Koop, Ben F; Grimholt, Unni

2010-03-05

We have previously sequenced more than 500 kb of the duplicated MHC class I regions in Atlantic salmon. In the IA region we identified the loci for the MHC class I gene Sasa-UBA in addition to a soluble MHC class I molecule, Sasa-ULA. A pseudolocus for Sasa-UCA was identified in the nonclassical IB region. Both regions contained genes for antigen presentation, as wells as orthologues to other genes residing in the human MHC region. The genomic localisation of two MHC class I lineages (Z and S) has been resolved. 7 BACs were sequenced using a combination of standard Sanger and 454 sequencing. The new sequence data extended the IA region with 150 kb identifying the location of one Z-lineage locus, ZAA. The IB region was extended with 350 kb including three new Z-lineage loci, ZBA, ZCA and ZDA in addition to a UGA locus. An allelic version of the IB region contained a functional UDA locus in addition to the UCA pseudolocus. Additionally a BAC harbouring two MHC class I genes (UHA) was placed on linkage group 14, while a BAC containing the S-lineage locus SAA (previously known as UAA) was placed on LG10. Gene expression studies showed limited expression range for all class I genes with exception of UBA being dominantly expressed in gut, spleen and gills, and ZAA with high expression in blood. Here we describe the genomic organization of MHC class I loci from the U-, Z-, and S-lineages in Atlantic salmon. Nine of the described class I genes are located in the extension of the duplicated IA and IB regions, while three class I genes are found on two separate linkage groups. The gene organization of the two regions indicates that the IB region is evolving at a different pace than the IA region. Expression profiling, polymorphic content, peptide binding properties and phylogenetic relationship show that Atlantic salmon has only one MHC class Ia gene (UBA), in addition to a multitude of nonclassical MHC class I genes from the U-, S- and Z-lineages.
Discovery of a Phosphonoacetic Acid Derived Natural Product by Pathway Refactoring.

PubMed

Freestone, Todd S; Ju, Kou-San; Wang, Bin; Zhao, Huimin

2017-02-17

The activation of silent natural product gene clusters is a synthetic biology problem of great interest. As the rate at which gene clusters are identified outpaces the discovery rate of new molecules, this unknown chemical space is rapidly growing, as too are the rewards for developing technologies to exploit it. One class of natural products that has been underrepresented is phosphonic acids, which have important medical and agricultural uses. Hundreds of phosphonic acid biosynthetic gene clusters have been identified encoding for unknown molecules. Although methods exist to elicit secondary metabolite gene clusters in native hosts, they require the strain to be amenable to genetic manipulation. One method to circumvent this is pathway refactoring, which we implemented in an effort to discover new phosphonic acids from a gene cluster from Streptomyces sp. strain NRRL F-525. By reengineering this cluster for expression in the production host Streptomyces lividans, utility of refactoring is demonstrated with the isolation of a novel phosphonic acid, O-phosphonoacetic acid serine, and the characterization of its biosynthesis. In addition, a new biosynthetic branch point is identified with a phosphonoacetaldehyde dehydrogenase, which was used to identify additional phosphonic acid gene clusters that share phosphonoacetic acid as an intermediate.
Genes contributing to the development of alcoholism: an overview.

PubMed

Edenberg, Howard J

2012-01-01

Genetic factors (i.e., variations in specific genes) account for a substantial portion of the risk for alcoholism. However, identifying those genes and the specific variations involved is challenging. Researchers have used both case-control and family studies to identify genes related to alcoholism risk. In addition, different strategies such as candidate gene analyses and genome-wide association studies have been used. The strongest effects have been found for specific variants of genes that encode two enzymes involved in alcohol metabolism-alcohol dehydrogenase and aldehyde dehydrogenase. Accumulating evidence indicates that variations in numerous other genes have smaller but measurable effects.

Development and validation of a gene profile predicting benefit of postmastectomy radiotherapy in patients with high-risk breast cancer: a study of gene expression in the DBCG82bc cohort.

PubMed

Tramm, Trine; Mohammed, Hayat; Myhre, Simen; Kyndi, Marianne; Alsner, Jan; Børresen-Dale, Anne-Lise; Sørlie, Therese; Frigessi, Arnoldo; Overgaard, Jens

2014-10-15

To identify genes predicting benefit of radiotherapy in patients with high-risk breast cancer treated with systemic therapy and randomized to receive or not receive postmastectomy radiotherapy (PMRT). The study was based on the Danish Breast Cancer Cooperative Group (DBCG82bc) cohort. Gene-expression analysis was performed in a training set of frozen tumor tissue from 191 patients. Genes were identified through the Lasso method with the endpoint being locoregional recurrence (LRR). A weighted gene-expression index (DBCG-RT profile) was calculated and transferred to quantitative real-time PCR (qRT-PCR) in corresponding formalin-fixed, paraffin-embedded (FFPE) samples, before validation in FFPE from 112 additional patients. Seven genes were identified, and the derived DBCG-RT profile divided the 191 patients into "high LRR risk" and "low LRR risk" groups. PMRT significantly reduced risk of LRR in "high LRR risk" patients, whereas "low LRR risk" patients showed no additional reduction in LRR rate. Technical transfer of the DBCG-RT profile to FFPE/qRT-PCR was successful, and the predictive impact was successfully validated in another 112 patients. A DBCG-RT gene profile was identified and validated, identifying patients with very low risk of LRR and no benefit from PMRT. The profile may provide a method to individualize treatment with PMRT. ©2014 American Association for Cancer Research.
Common Viral Integration Sites Identified in Avian Leukosis Virus-Induced B-Cell Lymphomas

PubMed Central

Justice, James F.; Morgan, Robin W.

2015-01-01

ABSTRACT Avian leukosis virus (ALV) induces B-cell lymphoma and other neoplasms in chickens by integrating within or near cancer genes and perturbing their expression. Four genes—MYC, MYB, Mir-155, and TERT—have previously been identified as common integration sites in these virus-induced lymphomas and are thought to play a causal role in tumorigenesis. In this study, we employ high-throughput sequencing to identify additional genes driving tumorigenesis in ALV-induced B-cell lymphomas. In addition to the four genes implicated previously, we identify other genes as common integration sites, including TNFRSF1A, MEF2C, CTDSPL, TAB2, RUNX1, MLL5, CXorf57, and BACH2. We also analyze the genome-wide ALV integration landscape in vivo and find increased frequency of ALV integration near transcriptional start sites and within transcripts. Previous work has shown ALV prefers a weak consensus sequence for integration in cultured human cells. We confirm this consensus sequence for ALV integration in vivo in the chicken genome. PMID:26670384
Novel mutations in the SOX10 gene in the first two Chinese cases of type IV Waardenburg syndrome.

PubMed

Jiang, Lu; Chen, Hongsheng; Jiang, Wen; Hu, Zhengmao; Mei, Lingyun; Xue, Jingjie; He, Chufeng; Liu, Yalan; Xia, Kun; Feng, Yong

2011-05-20

We analyzed the clinical features and family-related gene mutations for the first two Chinese cases of type IV Waardenburg syndrome (WS4). Two families were analyzed in this study. The analysis included a medical history, clinical analysis, a hearing test and a physical examination. In addition, the EDNRB, EDN3 and SOX10 genes were sequenced in order to identify the pathogenic mutation responsible for the WS4 observed in these patients. The two WS4 cases presented with high phenotypic variability. Two novel heterozygous mutations (c.254G>A and c.698-2A>T) in the SOX10 gene were detected. The mutations identified in the patients were not found in unaffected family members or in 200 unrelated control subjects. This is the first report of WS4 in Chinese patients. In addition, two novel mutations in SOX10 gene have been identified. Crown Copyright © 2011. Published by Elsevier Inc. All rights reserved.
Specific PCR primers directed to identify cryI and cryIII genes within a Bacillus thuringiensis strain collection.

PubMed Central

Cerón, J; Ortíz, A; Quintero, R; Güereca, L; Bravo, A

1995-01-01

In this paper we describe a PCR strategy that can be used to rapidly identify Bacillus thuringiensis strains that harbor any of the known cryI or cryIII genes. Four general PCR primers which amplify DNA fragments from the known cryI or cryIII genes were selected from conserved regions. Once a strain was identified as an organism that contains a particular type of cry gene, it could be easily characterized by performing additional PCR with specific cryI and cryIII primers selected from variable regions. The method described in this paper can be used to identify the 10 different cryI genes and the five different cryIII genes. One feature of this screening method is that each cry gene is expected to produce a PCR product having a precise molecular weight. The genes which produce PCR products having different sizes probably represent strains that harbor a potentially novel cry gene. Finally, we present evidence that novel crystal genes can be identified by the method described in this paper. PMID:8526493
Identifying Cancer Driver Genes Using Replication-Incompetent Retroviral Vectors

PubMed Central

Bii, Victor M.; Trobridge, Grant D.

2016-01-01

Identifying novel genes that drive tumor metastasis and drug resistance has significant potential to improve patient outcomes. High-throughput sequencing approaches have identified cancer genes, but distinguishing driver genes from passengers remains challenging. Insertional mutagenesis screens using replication-incompetent retroviral vectors have emerged as a powerful tool to identify cancer genes. Unlike replicating retroviruses and transposons, replication-incompetent retroviral vectors lack additional mutagenesis events that can complicate the identification of driver mutations from passenger mutations. They can also be used for almost any human cancer due to the broad tropism of the vectors. Replication-incompetent retroviral vectors have the ability to dysregulate nearby cancer genes via several mechanisms including enhancer-mediated activation of gene promoters. The integrated provirus acts as a unique molecular tag for nearby candidate driver genes which can be rapidly identified using well established methods that utilize next generation sequencing and bioinformatics programs. Recently, retroviral vector screens have been used to efficiently identify candidate driver genes in prostate, breast, liver and pancreatic cancers. Validated driver genes can be potential therapeutic targets and biomarkers. In this review, we describe the emergence of retroviral insertional mutagenesis screens using replication-incompetent retroviral vectors as a novel tool to identify cancer driver genes in different cancer types. PMID:27792127
Early BrdU-responsive genes constitute a novel class of senescence-associated genes in human cells

DOE Office of Scientific and Technical Information (OSTI.GOV)

Minagawa, Sachi; Nakabayashi, Kazuhiko; Fujii, Michihiko

2005-04-01

We identified genes that immediately respond to 5-bromodeoxyuridine (BrdU) in SUSM-1, an immortal fibroblastic line, with DNA microarray and Northern blot analysis. At least 29 genes were found to alter gene expression greater than twice more or less than controls within 36 h after addition of BrdU. They took several different expression patterns upon addition of BrdU, and the majority showed a significant alteration within 12 h. When compared among SUSM-1, HeLa, and TIG-7 normal human fibroblasts, 19 genes behaved similarly upon addition of BrdU. In addition, 14 genes, 9 of which are novel as regards senescence, behaved similarly inmore » senescent TIG-7 cells. The genes do not seem to have a role in proliferation or cell cycle progression. These results suggest that the early BrdU-responsive genes represent early signs of cellular senescence and can be its new biomarkers.« less
Molecular genetics of Alzheimer disease.

PubMed

St George-Hyslop, P H

1999-01-01

Epidemiological and individual case studies indicate that genetic factors play a significant role in the genesis of Alzheimer Disease (AD). To date, molecular genetic studies in families multiply affected with AD have identified three genes (Presenilin 1-PS1, Presenilin 2-PS2, and beta-amyloid precursor protein--betaAPP) associated with highly penetrant early onset AD, and one gene (Apolipoprotein E) associated with late onset AD. A fifth potential AD susceptibility locus has been mapped to a broad region of chromosome 12, but the responsible gene defect has not yet been identified. Case-control studies comparing the frequency of alleles in numerous other candidate genes have identified a number of additional potential AD genes. However, methodological difficulties and conflicting results in follow-up studies, make it unclear whether allelic variations in these genes are truly pathogenic. Nevertheless, analysis of the biochemical effects of mutations in PS1, PS2, betaAPP at least, suggest a common biochemical effect-namely disturbances in the processing of betaAPP protein. In addition to utility in defining potential therapeutic targets, in some circumstances these genes can also potentially be used as adjunctives in clinical presymptomatic, symptomatic or pharmacogenomic diagnosis.
Activation of Ftz-F1-Responsive Genes through Ftz/Ftz-F1 Dependent Enhancers

PubMed Central

Field, Amanda; Xiang, Jie; Anderson, W. Ray; Graham, Patricia; Pick, Leslie

2016-01-01

The orphan nuclear receptor Ftz-F1 is expressed in all somatic nuclei in Drosophila embryos, but mutations result in a pair-rule phenotype. This was explained by the interaction of Ftz-F1 with the homeodomain protein Ftz that is expressed in stripes in the primordia of segments missing in either ftz-f1 or ftz mutants. Ftz-F1 and Ftz were shown to physically interact and coordinately activate the expression of ftz itself and engrailed by synergistic binding to composite Ftz-F1/Ftz binding sites. However, attempts to identify additional target genes on the basis of Ftz-F1/ Ftz binding alone has met with only limited success. To discern rules for Ftz-F1 target site selection in vivo and to identify additional target genes, a microarray analysis was performed comparing wildtype and ftz-f1 mutant embryos. Ftz-F1-responsive genes most highly regulated included engrailed and nine additional genes expressed in patterns dependent on both ftz and ftz-f1. Candidate enhancers for these genes were identified by combining BDTNP Ftz ChIP-chip data with a computational search for Ftz-F1 binding sites. Of eight enhancer reporter genes tested in transgenic embryos, six generated expression patterns similar to the corresponding endogenous gene and expression was lost in ftz mutants. These studies identified a new set of Ftz-F1 targets, all of which are co-regulated by Ftz. Comparative analysis of enhancers containing Ftz/Ftz-F1 binding sites that were or were not bona fide targets in vivo suggested that GAF negatively regulates enhancers that contain Ftz/Ftz-F1 binding sites but are not actually utilized. These targets include other regulatory factors as well as genes involved directly in morphogenesis, providing insight into how pair-rule genes establish the body pattern. PMID:27723822
Sleeping Beauty transposon mutagenesis identifies genes that cooperate with mutant Smad4 in gastric cancer development

PubMed Central

Takeda, Haruna; Rust, Alistair G.; Ward, Jerrold M.; Yew, Christopher Chin Kuan; Jenkins, Nancy A.; Copeland, Neal G.

2016-01-01

Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4+/− mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC. PMID:27006499
Sleeping Beauty transposon mutagenesis identifies genes that cooperate with mutant Smad4 in gastric cancer development.

PubMed

Takeda, Haruna; Rust, Alistair G; Ward, Jerrold M; Yew, Christopher Chin Kuan; Jenkins, Nancy A; Copeland, Neal G

2016-04-05

Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4(+/-) mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC.
Resistance gene candidates identified by PCR with degenerate oligonucleotide primers map to clusters of resistance genes in lettuce.

PubMed

Shen, K A; Meyers, B C; Islam-Faridi, M N; Chin, D B; Stelly, D M; Michelmore, R W

1998-08-01

The recent cloning of genes for resistance against diverse pathogens from a variety of plants has revealed that many share conserved sequence motifs. This provides the possibility of isolating numerous additional resistance genes by polymerase chain reaction (PCR) with degenerate oligonucleotide primers. We amplified resistance gene candidates (RGCs) from lettuce with multiple combinations of primers with low degeneracy designed from motifs in the nucleotide binding sites (NBSs) of RPS2 of Arabidopsis thaliana and N of tobacco. Genomic DNA, cDNA, and bacterial artificial chromosome (BAC) clones were successfully used as templates. Four families of sequences were identified that had the same similarity to each other as to resistance genes from other species. The relationship of the amplified products to resistance genes was evaluated by several sequence and genetic criteria. The amplified products contained open reading frames with additional sequences characteristic of NBSs. Hybridization of RGCs to genomic DNA and to BAC clones revealed large numbers of related sequences. Genetic analysis demonstrated the existence of clustered multigene families for each of the four RGC sequences. This parallels classical genetic data on clustering of disease resistance genes. Two of the four families mapped to known clusters of resistance genes; these two families were therefore studied in greater detail. Additional evidence that these RGCs could be resistance genes was gained by the identification of leucine-rich repeat (LRR) regions in sequences adjoining the NBS similar to those in RPM1 and RPS2 of A. thaliana. Fluorescent in situ hybridization confirmed the clustered genomic distribution of these sequences. The use of PCR with degenerate oligonucleotide primers is therefore an efficient method to identify numerous RGCs in plants.
POEM: Identifying Joint Additive Effects on Regulatory Circuits.

PubMed

Botzman, Maya; Nachshon, Aharon; Brodt, Avital; Gat-Viks, Irit

2016-01-01

Expression Quantitative Trait Locus (eQTL) mapping tackles the problem of identifying variation in DNA sequence that have an effect on the transcriptional regulatory network. Major computational efforts are aimed at characterizing the joint effects of several eQTLs acting in concert to govern the expression of the same genes. Yet, progress toward a comprehensive prediction of such joint effects is limited. For example, existing eQTL methods commonly discover interacting loci affecting the expression levels of a module of co-regulated genes. Such "modularization" approaches, however, are focused on epistatic relations and thus have limited utility for the case of additive (non-epistatic) effects. Here we present POEM (Pairwise effect On Expression Modules), a methodology for identifying pairwise eQTL effects on gene modules. POEM is specifically designed to achieve high performance in the case of additive joint effects. We applied POEM to transcription profiles measured in bone marrow-derived dendritic cells across a population of genotyped mice. Our study reveals widespread additive, trans-acting pairwise effects on gene modules, characterizes their organizational principles, and highlights high-order interconnections between modules within the immune signaling network. These analyses elucidate the central role of additive pairwise effect in regulatory circuits, and provide computational tools for future investigations into the interplay between eQTLs. The software described in this article is available at csgi.tau.ac.il/POEM/.
POEM: Identifying Joint Additive Effects on Regulatory Circuits

PubMed Central

Botzman, Maya; Nachshon, Aharon; Brodt, Avital; Gat-Viks, Irit

2016-01-01

Motivation: Expression Quantitative Trait Locus (eQTL) mapping tackles the problem of identifying variation in DNA sequence that have an effect on the transcriptional regulatory network. Major computational efforts are aimed at characterizing the joint effects of several eQTLs acting in concert to govern the expression of the same genes. Yet, progress toward a comprehensive prediction of such joint effects is limited. For example, existing eQTL methods commonly discover interacting loci affecting the expression levels of a module of co-regulated genes. Such “modularization” approaches, however, are focused on epistatic relations and thus have limited utility for the case of additive (non-epistatic) effects. Results: Here we present POEM (Pairwise effect On Expression Modules), a methodology for identifying pairwise eQTL effects on gene modules. POEM is specifically designed to achieve high performance in the case of additive joint effects. We applied POEM to transcription profiles measured in bone marrow-derived dendritic cells across a population of genotyped mice. Our study reveals widespread additive, trans-acting pairwise effects on gene modules, characterizes their organizational principles, and highlights high-order interconnections between modules within the immune signaling network. These analyses elucidate the central role of additive pairwise effect in regulatory circuits, and provide computational tools for future investigations into the interplay between eQTLs. Availability: The software described in this article is available at csgi.tau.ac.il/POEM/. PMID:27148351
Early gene expression during natural spinal cord regeneration in the salamander Ambystoma mexicanum.

PubMed

Monaghan, James R; Walker, John A; Page, Robert B; Putta, Srikrishna; Beachy, Christopher K; Voss, S Randal

2007-04-01

In contrast to mammals, salamanders have a remarkable ability to regenerate their spinal cord and recover full movement and function after tail amputation. To identify genes that may be associated with this greater regenerative ability, we designed an oligonucleotide microarray and profiled early gene expression during natural spinal cord regeneration in Ambystoma mexicanum. We sampled tissue at five early time points after tail amputation and identified genes that registered significant changes in mRNA abundance during the first 7 days of regeneration. A list of 1036 statistically significant genes was identified. Additional statistical and fold change criteria were applied to identify a smaller list of 360 genes that were used to describe predominant expression patterns and gene functions. Our results show that a diverse injury response is activated in concert with extracellular matrix remodeling mechanisms during the early acute phase of natural spinal cord regeneration. We also report gene expression similarities and differences between our study and studies that have profiled gene expression after spinal cord injury in rat. Our study illustrates the utility of a salamander model for identifying genes and gene functions that may enhance regenerative ability in mammals.
Gene co-expression network analysis in Rhodobacter capsulatus and application to comparative expression analysis of Rhodobacter sphaeroides

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pena-Castillo, Lourdes; Mercer, Ryan; Gurinovich, Anastasia

2014-08-28

The genus Rhodobacter contains purple nonsulfur bacteria found mostly in freshwater environments. Representative strains of two Rhodobacter species, R. capsulatus and R. sphaeroides, have had their genomes fully sequenced and both have been the subject of transcriptional profiling studies. Gene co-expression networks can be used to identify modules of genes with similar expression profiles. Functional analysis of gene modules can then associate co-expressed genes with biological pathways, and network statistics can determine the degree of module preservation in related networks. In this paper, we constructed an R. capsulatus gene co-expression network, performed functional analysis of identified gene modules, and investigatedmore » preservation of these modules in R. capsulatus proteomics data and in R. sphaeroides transcriptomics data. Results: The analysis identified 40 gene co-expression modules in R. capsulatus. Investigation of the module gene contents and expression profiles revealed patterns that were validated based on previous studies supporting the biological relevance of these modules. We identified two R. capsulatus gene modules preserved in the protein abundance data. We also identified several gene modules preserved between both Rhodobacter species, which indicate that these cellular processes are conserved between the species and are candidates for functional information transfer between species. Many gene modules were non-preserved, providing insight into processes that differentiate the two species. In addition, using Local Network Similarity (LNS), a recently proposed metric for expression divergence, we assessed the expression conservation of between-species pairs of orthologs, and within-species gene-protein expression profiles. Conclusions: Our analyses provide new sources of information for functional annotation in R. capsulatus because uncharacterized genes in modules are now connected with groups of genes that constitute a joint functional annotation. We identified R. capsulatus modules enriched with genes for ribosomal proteins, porphyrin and bacteriochlorophyll anabolism, and biosynthesis of secondary metabolites to be preserved in R. sphaeroides whereas modules related to RcGTA production and signalling showed lack of preservation in R. sphaeroides. In addition, we demonstrated that network statistics may also be applied within-species to identify congruence between mRNA expression and protein abundance data for which simple correlation measurements have previously had mixed results.« less
Analyzing gene expression profiles in dilated cardiomyopathy via bioinformatics methods.

PubMed

Wang, Liming; Zhu, L; Luan, R; Wang, L; Fu, J; Wang, X; Sui, L

2016-10-10

Dilated cardiomyopathy (DCM) is characterized by ventricular dilatation, and it is a common cause of heart failure and cardiac transplantation. This study aimed to explore potential DCM-related genes and their underlying regulatory mechanism using methods of bioinformatics. The gene expression profiles of GSE3586 were downloaded from Gene Expression Omnibus database, including 15 normal samples and 13 DCM samples. The differentially expressed genes (DEGs) were identified between normal and DCM samples using Limma package in R language. Pathway enrichment analysis of DEGs was then performed. Meanwhile, the potential transcription factors (TFs) and microRNAs (miRNAs) of these DEGs were predicted based on their binding sequences. In addition, DEGs were mapped to the cMap database to find the potential small molecule drugs. A total of 4777 genes were identified as DEGs by comparing gene expression profiles between DCM and control samples. DEGs were significantly enriched in 26 pathways, such as lymphocyte TarBase pathway and androgen receptor signaling pathway. Furthermore, potential TFs (SP1, LEF1, and NFAT) were identified, as well as potential miRNAs (miR-9, miR-200 family, and miR-30 family). Additionally, small molecules like isoflupredone and trihexyphenidyl were found to be potential therapeutic drugs for DCM. The identified DEGs (PRSS12 and FOXG1), potential TFs, as well as potential miRNAs, might be involved in DCM.
Analyzing gene expression profiles in dilated cardiomyopathy via bioinformatics methods

PubMed Central

Wang, Liming; Zhu, L.; Luan, R.; Wang, L.; Fu, J.; Wang, X.; Sui, L.

2016-01-01

Dilated cardiomyopathy (DCM) is characterized by ventricular dilatation, and it is a common cause of heart failure and cardiac transplantation. This study aimed to explore potential DCM-related genes and their underlying regulatory mechanism using methods of bioinformatics. The gene expression profiles of GSE3586 were downloaded from Gene Expression Omnibus database, including 15 normal samples and 13 DCM samples. The differentially expressed genes (DEGs) were identified between normal and DCM samples using Limma package in R language. Pathway enrichment analysis of DEGs was then performed. Meanwhile, the potential transcription factors (TFs) and microRNAs (miRNAs) of these DEGs were predicted based on their binding sequences. In addition, DEGs were mapped to the cMap database to find the potential small molecule drugs. A total of 4777 genes were identified as DEGs by comparing gene expression profiles between DCM and control samples. DEGs were significantly enriched in 26 pathways, such as lymphocyte TarBase pathway and androgen receptor signaling pathway. Furthermore, potential TFs (SP1, LEF1, and NFAT) were identified, as well as potential miRNAs (miR-9, miR-200 family, and miR-30 family). Additionally, small molecules like isoflupredone and trihexyphenidyl were found to be potential therapeutic drugs for DCM. The identified DEGs (PRSS12 and FOXG1), potential TFs, as well as potential miRNAs, might be involved in DCM. PMID:27737314
Congruence of Additive and Non-Additive Effects on Gene Expression Estimated from Pedigree and SNP Data

PubMed Central

Powell, Joseph E.; Henders, Anjali K.; McRae, Allan F.; Kim, Jinhee; Hemani, Gibran; Martin, Nicholas G.; Dermitzakis, Emmanouil T.; Gibson, Greg

2013-01-01

There is increasing evidence that heritable variation in gene expression underlies genetic variation in susceptibility to disease. Therefore, a comprehensive understanding of the similarity between relatives for transcript variation is warranted—in particular, dissection of phenotypic variation into additive and non-additive genetic factors and shared environmental effects. We conducted a gene expression study in blood samples of 862 individuals from 312 nuclear families containing MZ or DZ twin pairs using both pedigree and genotype information. From a pedigree analysis we show that the vast majority of genetic variation across 17,994 probes is additive, although non-additive genetic variation is identified for 960 transcripts. For 180 of the 960 transcripts with non-additive genetic variation, we identify expression quantitative trait loci (eQTL) with dominance effects in a sample of 339 unrelated individuals and replicate 31% of these associations in an independent sample of 139 unrelated individuals. Over-dominance was detected and replicated for a trans association between rs12313805 and ETV6, located 4MB apart on chromosome 12. Surprisingly, only 17 probes exhibit significant levels of common environmental effects, suggesting that environmental and lifestyle factors common to a family do not affect expression variation for most transcripts, at least those measured in blood. Consistent with the genetic architecture of common diseases, gene expression is predominantly additive, but a minority of transcripts display non-additive effects. PMID:23696747
Congruence of additive and non-additive effects on gene expression estimated from pedigree and SNP data.

PubMed

Powell, Joseph E; Henders, Anjali K; McRae, Allan F; Kim, Jinhee; Hemani, Gibran; Martin, Nicholas G; Dermitzakis, Emmanouil T; Gibson, Greg; Montgomery, Grant W; Visscher, Peter M

2013-05-01

There is increasing evidence that heritable variation in gene expression underlies genetic variation in susceptibility to disease. Therefore, a comprehensive understanding of the similarity between relatives for transcript variation is warranted--in particular, dissection of phenotypic variation into additive and non-additive genetic factors and shared environmental effects. We conducted a gene expression study in blood samples of 862 individuals from 312 nuclear families containing MZ or DZ twin pairs using both pedigree and genotype information. From a pedigree analysis we show that the vast majority of genetic variation across 17,994 probes is additive, although non-additive genetic variation is identified for 960 transcripts. For 180 of the 960 transcripts with non-additive genetic variation, we identify expression quantitative trait loci (eQTL) with dominance effects in a sample of 339 unrelated individuals and replicate 31% of these associations in an independent sample of 139 unrelated individuals. Over-dominance was detected and replicated for a trans association between rs12313805 and ETV6, located 4MB apart on chromosome 12. Surprisingly, only 17 probes exhibit significant levels of common environmental effects, suggesting that environmental and lifestyle factors common to a family do not affect expression variation for most transcripts, at least those measured in blood. Consistent with the genetic architecture of common diseases, gene expression is predominantly additive, but a minority of transcripts display non-additive effects.
Genome-Wide Association Mapping Combined with Reverse Genetics Identifies New Effectors of Low Water Potential-Induced Proline Accumulation in Arabidopsis1[W][OPEN

PubMed Central

Verslues, Paul E.; Lasky, Jesse R.; Juenger, Thomas E.; Liu, Tzu-Wen; Kumar, M. Nagaraj

2014-01-01

Arabidopsis (Arabidopsis thaliana) exhibits natural genetic variation in drought response, including varying levels of proline (Pro) accumulation under low water potential. As Pro accumulation is potentially important for stress tolerance and cellular redox control, we conducted a genome-wide association (GWAS) study of low water potential-induced Pro accumulation using a panel of natural accessions and publicly available single-nucleotide polymorphism (SNP) data sets. Candidate genomic regions were prioritized for subsequent study using metrics considering both the strength and spatial clustering of the association signal. These analyses found many candidate regions likely containing gene(s) influencing Pro accumulation. Reverse genetic analysis of several candidates identified new Pro effector genes, including thioredoxins and several genes encoding Universal Stress Protein A domain proteins. These new Pro effector genes further link Pro accumulation to cellular redox and energy status. Additional new Pro effector genes found include the mitochondrial protease LON1, ribosomal protein RPL24A, protein phosphatase 2A subunit A3, a MADS box protein, and a nucleoside triphosphate hydrolase. Several of these new Pro effector genes were from regions with multiple SNPs, each having moderate association with Pro accumulation. This pattern supports the use of summary approaches that incorporate clusters of SNP associations in addition to consideration of individual SNP probability values. Further GWAS-guided reverse genetics promises to find additional effectors of Pro accumulation. The combination of GWAS and reverse genetics to efficiently identify new effector genes may be especially applicable for traits difficult to analyze by other genetic screening methods. PMID:24218491

CHK2, A Candidate Prostate Cancer Susceptibility Gene

DTIC Science & Technology

2003-01-01

To identify prostate cancer susceptibility genes, we applied a mutation screening of candidate gene approach. We screened for mutations in CHEK2 , the...families, 400 sporadic cases, and 423 unaffected men as control. A total of 28 (4.8%) germline CHEK2 mutations were found among 578 patients and...additional 11 in 9 families. Sixteen of 18 unique CHEK2 mutations identified in this study were not detected among 423 unaffected men, suggesting a
Identification of additive, dominant, and epistatic variation conferred by key genes in cellulose biosynthesis pathway in Populus tomentosa†

PubMed Central

Du, Qingzhang; Tian, Jiaxing; Yang, Xiaohui; Pan, Wei; Xu, Baohua; Li, Bailian; Ingvarsson, Pär K.; Zhang, Deqiang

2015-01-01

Economically important traits in many species generally show polygenic, quantitative inheritance. The components of genetic variation (additive, dominant and epistatic effects) of these traits conferred by multiple genes in shared biological pathways remain to be defined. Here, we investigated 11 full-length genes in cellulose biosynthesis, on 10 growth and wood-property traits, within a population of 460 unrelated Populus tomentosa individuals, via multi-gene association. To validate positive associations, we conducted single-marker analysis in a linkage population of 1,200 individuals. We identified 118, 121, and 43 associations (P< 0.01) corresponding to additive, dominant, and epistatic effects, respectively, with low to moderate proportions of phenotypic variance (R2). Epistatic interaction models uncovered a combination of three non-synonymous sites from three unique genes, representing a significant epistasis for diameter at breast height and stem volume. Single-marker analysis validated 61 associations (false discovery rate, Q ≤ 0.10), representing 38 SNPs from nine genes, and its average effect (R2 = 3.8%) nearly 2-fold higher than that identified with multi-gene association, suggesting that multi-gene association can capture smaller individual variants. Moreover, a structural gene–gene network based on tissue-specific transcript abundances provides a better understanding of the multi-gene pathway affecting tree growth and lignocellulose biosynthesis. Our study highlights the importance of pathway-based multiple gene associations to uncover the nature of genetic variance for quantitative traits and may drive novel progress in molecular breeding. PMID:25428896
Genes in one megabase of the HLA class I region

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wei, H.; Fan, Wu-Fang; Xu, Hongxia

1993-11-15

To define the gene content of the HLA class I region, cDNA selection was applied to three overlapping yeast artificial chromosomes (YACs) that spanned 1 megabase (Mb) of this region of the human major histocompatibility complex. These YACs extended from the region centromeric to HLA-E to the region telomeric to HLA-F. In additions to the recognized class I genes and pseudogenes and the anonymous non-class-I genes described recently by the authors and others, 20 additional anonymous cDNA clones were identified from this 1-Mb region. They also identified a long repetitive DNA element in the region between HLA-B and HLA-E. Homologuesmore » of this outside of the HLA complex. The portion of the HLA class I region represented by these YACs shows an average gene density as high as the class II and class III regions. Thus, the high gene density portion of the HLA complex is extended to more than 3 Mb.« less
Genome-Wide Screens Reveal New Gene Products That Influence Genetic Competence in Streptococcus mutans

PubMed Central

O'Brien, Greg; Maricic, Natalie; Kesterson, Alexandria; Grace, Megan

2017-01-01

ABSTRACT A network of genes and at least two peptide signaling molecules tightly control when Streptococcus mutans becomes competent to take up DNA from its environment. Widespread changes in the expression of genes occur when S. mutans is presented with competence signal peptides in vitro, including the increased production of the alternative sigma factor, ComX, which activates late competence genes. Still, the way that gene products that are regulated by competence peptides influence DNA uptake and cellular physiology are not well understood. Here, we developed and employed comprehensive transposon mutagenesis of the S. mutans genome, with a screen to identify mutants that aberrantly expressed comX, coupled with transposon sequencing (Tn-seq) to gain a more thorough understanding of the factors modulating comX expression and progression to the competent state. The screens effectively identified genes known to affect competence, e.g., comR, comS, comD, comE, cipB, clpX, rcrR, and ciaH, but disclosed an additional 20 genes that were not previously competence associated. The competence phenotypes of mutants were characterized, including by fluorescence microscopy to determine at which stage the mutants were impaired for comX activation. Among the novel genes studied were those implicated in cell division, the sensing of cell envelope stress, cell envelope biogenesis, and RNA stability. Our results provide a platform for determining the specific chemical and physical cues that are required for genetic competence in S. mutans, while highlighting the effectiveness of using Tn-seq in S. mutans to discover and study novel biological processes. IMPORTANCE Streptococcus mutans acquires DNA from its environment by becoming genetically competent, a physiologic state triggered by cell-cell communication using secreted peptides. Competence is important for acquiring novel genetic traits and has a strong influence on the expression of virulence-associated traits of S. mutans. Here, we used transposon mutagenesis and genomic technologies to identify novel genes involved in competence development. In addition to identifying genes previously known to be required for comX expression, 20 additional genes were identified and characterized. The findings create opportunities to diminish the pathogenic potential of S. mutans, while validating technologies that can rapidly advance our understanding of the physiology, biology, and genetics of S. mutans and related pathogens. PMID:29109185
Genome-wide screens reveal new gene products that influence genetic competence in Streptococcus mutans.

PubMed

Shields, Robert C; O'Brien, Greg; Maricic, Natalie; Kesterson, Alexandria; Grace, Megan; Hagen, Stephen J; Burne, Robert A

2017-11-06

A network of genes and at least two peptide signaling molecules tightly control when Streptococcus mutans becomes competent to take up DNA from its environment. Widespread changes in the expression of genes occur when S. mutans is presented with competence signal peptides in vitro , including increased production of the alternative sigma factor, ComX, which activates late competence genes. Still, the way that gene products that are regulated by competence peptides influence DNA uptake and cellular physiology are not well understood. Here, we developed and employed comprehensive transposon mutagenesis of the S. mutans genome with a screen to identify mutants that aberrantly expressed comX , coupled with transposon sequencing (Tn-seq) to gain a more thorough understanding of the factors modulating comX expression and progression to the competent state. The screens effectively identified genes known to affect competence, e.g. comR , comS , comD , comE , cipB , clpX , rcrR , ciaH , but disclosed an additional 20 genes that were not previously competence-associated. The competence phenotypes of mutants were characterized, including using fluorescence microscopy to determine at which stage the mutants were impaired for comX activation. Among the novel genes studied were those implicated in cell division, sensing of cell envelope stress, cell envelope biogenesis, and RNA stability. Our results provide a platform for determining the specific chemical and physical cues that are required for genetic competence in S. mutans , while highlighting the effectiveness of using Tn-seq in S. mutans to discover and study novel biological processes. IMPORTANCE Streptococcus mutans acquires DNA from its environment by becoming genetically competent, a physiologic state triggered by cell-cell communication using secreted peptides. Competence is important for acquiring novel genetic traits and has a strong influence on the expression of virulence-associated traits of S. mutans Here, we used transposon mutagenesis and genomic technologies to identify novel genes involved in competence development. In addition to identifying genes previously known to be required for comX expression, 20 additional genes were identified and characterized. The findings create opportunities to diminish the pathogenic potential of S. mutans , while validating technologies that can rapidly advance our understanding of the physiology, biology and genetics of S. mutans and related pathogens. Copyright © 2017 American Society for Microbiology.
Comparative sequence analysis of a region on human chromosome 13q14, frequently deleted in B-cell chronic lymphocytic leukemia, and its homologous region on mouse chromosome 14.

PubMed

Kapanadze, B; Makeeva, N; Corcoran, M; Jareborg, N; Hammarsund, M; Baranova, A; Zabarovsky, E; Vorontsova, O; Merup, M; Gahrton, G; Jansson, M; Yankovsky, N; Einhorn, S; Oscier, D; Grandér, D; Sangfelt, O

2000-12-15

Previous studies have indicated the presence of a putative tumor suppressor gene on human chromosome 13q14, commonly deleted in patients with B-cell chronic lymphocytic leukemia (B-CLL). We have recently identified a minimally deleted region encompassing parts of two adjacent genes, termed LEU1 and LEU2 (leukemia-associated genes 1 and 2), and several additional transcripts. In addition, 50 kb centromeric to this region we have identified another gene, LEU5/RFP2. To elucidate further the complex genomic organization of this region, we have identified, mapped, and sequenced the homologous region in the mouse. Fluorescence in situ hybridization analysis demonstrated that the region maps to mouse chromosome 14. The overall organization and gene order in this region were found to be highly conserved in the mouse. Sequence comparison between the human deletion hotspot region and its homologous mouse region revealed a high degree of sequence conservation with an overall score of 74%. However, our data also show that in terms of transcribed sequences, only two of those, human LEU2 and LEU5/RFP2, are clearly conserved, strengthening the case for these genes as putative candidate B-CLL tumor suppressor genes.
A framework to identify gene expression profiles in a model of inflammation induced by lipopolysaccharide after treatment with thalidomide

PubMed Central

2012-01-01

Background Thalidomide is an anti-inflammatory and anti-angiogenic drug currently used for the treatment of several diseases, including erythema nodosum leprosum, which occurs in patients with lepromatous leprosy. In this research, we use DNA microarray analysis to identify the impact of thalidomide on gene expression responses in human cells after lipopolysaccharide (LPS) stimulation. We employed a two-stage framework. Initially, we identified 1584 altered genes in response to LPS. Modulation of this set of genes was then analyzed in the LPS stimulated cells treated with thalidomide. Results We identified 64 genes with altered expression induced by thalidomide using the rank product method. In addition, the lists of up-regulated and down-regulated genes were investigated by means of bioinformatics functional analysis, which allowed for the identification of biological processes affected by thalidomide. Confirmatory analysis was done in five of the identified genes using real time PCR. Conclusions The results showed some genes that can further our understanding of the biological mechanisms in the action of thalidomide. Of the five genes evaluated with real time PCR, three were down regulated and two were up regulated confirming the initial results of the microarray analysis. PMID:22695124
Mutational Landscape of Candidate Genes in Familial Prostate Cancer

PubMed Central

Johnson, Anna M.; Zuhlke, Kimberly A.; Plotts, Chris; McDonnell, Shannon K.; Middha, Sumit; Riska, Shaun M.; Thibodeau, Stephen N.; Douglas, Julie A.; Cooney, Kathleen A.

2014-01-01

Background Family history is a major risk factor for prostate cancer (PCa), suggesting a genetic component to the disease. However, traditional linkage and association studies have failed to fully elucidate the underlying genetic basis of familial PCa. Methods Here we use a candidate gene approach to identify potential PCa susceptibility variants in whole exome sequencing data from familial PCa cases. Six hundred ninety-seven candidate genes were identified based on function, location near a known chromosome 17 linkage signal, and/or previous association with prostate or other cancers. Single nucleotide variants (SNVs) in these candidate genes were identified in whole exome sequence data from 33 PCa cases from 11 multiplex PCa families (3 cases/family). Results Overall, 4856 candidate gene SNVs were identified, including 1052 missense and 10 nonsense variants. Twenty missense variants were shared by all 3 family members in each family in which they were observed. Additionally, 15 missense variants were shared by 2 of 3 family members and predicted to be deleterious by 5 different algorithms. Four missense variants, BLM Gln123Arg, PARP2 Arg283Gln, LRCC46 Ala295Thr and KIF2B Pro91Leu, and 1 nonsense variant, CYP3A43 Arg441Ter, showed complete co-segregation with PCa status. Twelve additional variants displayed partial co-segregation with PCa. Conclusions Forty-three nonsense and shared, missense variants were identified in our candidate genes. Further research is needed to determine the contribution of these variants to PCa susceptibility. PMID:25111073
Genes responding to water deficit in apple (Malus × domestica Borkh.) roots.

PubMed

Bassett, Carole Leavel; Baldo, Angela M; Moore, Jacob T; Jenkins, Ryan M; Soffe, Doug S; Wisniewski, Michael E; Norelli, John L; Farrell, Robert E

2014-07-08

Individual plants adapt to their immediate environment using a combination of biochemical, morphological and life cycle strategies. Because woody plants are long-lived perennials, they cannot rely on annual life cycle strategies alone to survive abiotic stresses. In this study we used suppression subtractive hybridization to identify genes both up- and down-regulated in roots during water deficit treatment and recovery. In addition we followed the expression of select genes in the roots, leaves, bark and xylem of 'Royal Gala' apple subjected to a simulated drought and subsequent recovery. In agreement with studies from both herbaceous and woody plants, a number of common drought-responsive genes were identified, as well as a few not previously reported. Three genes were selected for more in depth analysis: a high affinity nitrate transporter (MdNRT2.4), a mitochondrial outer membrane translocase (MdTOM7.1), and a gene encoding an NPR1 homolog (MpNPR1-2). Quantitative expression of these genes in apple roots, bark and leaves was consistent with their roles in nutrition and defense. Additional genes from apple roots responding to drought were identified using suppression subtraction hybridization compared to a previous EST analysis from the same organ. Genes up- and down-regulated during drought recovery in roots were also identified. Elevated levels of a high affinity nitrate transporter were found in roots suggesting that nitrogen uptake shifted from low affinity transport due to the predicted reduction in nitrate concentration in drought-treated roots. Suppression of a NPR1 gene in leaves of drought-treated apple trees may explain in part the increased disease susceptibility of trees subjected to dehydrative conditions.
Exome sequencing of a large family identifies potential candidate genes contributing risk to bipolar disorder.

PubMed

Zhang, Tianxiao; Hou, Liping; Chen, David T; McMahon, Francis J; Wang, Jen-Chyong; Rice, John P

2018-03-01

Bipolar disorder is a mental illness with lifetime prevalence of about 1%. Previous genetic studies have identified multiple chromosomal linkage regions and candidate genes that might be associated with bipolar disorder. The present study aimed to identify potential susceptibility variants for bipolar disorder using 6 related case samples from a four-generation family. A combination of exome sequencing and linkage analysis was performed to identify potential susceptibility variants for bipolar disorder. Our study identified a list of five potential candidate genes for bipolar disorder. Among these five genes, GRID1(Glutamate Receptor Delta-1 Subunit), which was previously reported to be associated with several psychiatric disorders and brain related traits, is particularly interesting. Variants with functional significance in this gene were identified from two cousins in our bipolar disorder pedigree. Our findings suggest a potential role for these genes and the related rare variants in the onset and development of bipolar disorder in this one family. Additional research is needed to replicate these findings and evaluate their patho-biological significance. Copyright © 2017 Elsevier B.V. All rights reserved.
What is currently known about the genetics of venous thromboembolism at the dawn of next generation sequencing technologies.

PubMed

Trégouët, David-Alexandre; Morange, Pierre-Emmanuel

2018-02-01

Venous thromboembolism (VTE) has a strong genetic component. This review summarizes what is known at the seventeen genes that are now well established to harbour VTE-associated genetic variants. In addition, it discusses additional candidate genes that deserve further validation before being claimed as VTE associated genes. Finally, several research strategies are briefly described to identify other molecular determinants of the disease. © 2017 John Wiley & Sons Ltd.
Regulation of gene expression in the mammalian eye and its relevance to eye disease.

PubMed

Scheetz, Todd E; Kim, Kwang-Youn A; Swiderski, Ruth E; Philp, Alisdair R; Braun, Terry A; Knudtson, Kevin L; Dorrance, Anne M; DiBona, Gerald F; Huang, Jian; Casavant, Thomas L; Sheffield, Val C; Stone, Edwin M

2006-09-26

We used expression quantitative trait locus mapping in the laboratory rat (Rattus norvegicus) to gain a broad perspective of gene regulation in the mammalian eye and to identify genetic variation relevant to human eye disease. Of >31,000 gene probes represented on an Affymetrix expression microarray, 18,976 exhibited sufficient signal for reliable analysis and at least 2-fold variation in expression among 120 F(2) rats generated from an SR/JrHsd x SHRSP intercross. Genome-wide linkage analysis with 399 genetic markers revealed significant linkage with at least one marker for 1,300 probes (alpha = 0.001; estimated empirical false discovery rate = 2%). Both contiguous and noncontiguous loci were found to be important in regulating mammalian eye gene expression. We investigated one locus of each type in greater detail and identified putative transcription-altering variations in both cases. We found an inserted cREL binding sequence in the 5' flanking sequence of the Abca4 gene associated with an increased expression level of that gene, and we found a mutation of the gene encoding thyroid hormone receptor beta2 associated with a decreased expression level of the gene encoding short-wavelength sensitive opsin (Opn1sw). In addition to these positional studies, we performed a pairwise analysis of gene expression to identify genes that are regulated in a coordinated manner and used this approach to validate two previously undescribed genes involved in the human disease Bardet-Biedl syndrome. These data and analytical approaches can be used to facilitate the discovery of additional genes and regulatory elements involved in human eye disease.
The sieve element occlusion gene family in dicotyledonous plants

PubMed Central

Jekat, Stephan B; Nordzieke, Steffen; Reineke, Anna R; Müller, Boje; Bornberg-Bauer, Erich; Noll, Gundula A

2011-01-01

Sieve element occlusion (SEO) genes encoding forisome subunits have been identified in Medicago truncatula and other legumes. Forisomes are structural phloem proteins uniquely found in Fabaceae sieve elements. They undergo a reversible conformational change after wounding, from a condensed to a dispersed state, thereby blocking sieve tube translocation and preventing the loss of photoassimilates. Recently, we identified SEO genes in several non-Fabaceae plants (lacking forisomes) and concluded that they most probably encode conventional non-forisome P-proteins. Molecular and phylogenetic analysis of the SEO gene family has identified domains that are characteristic for SEO proteins. Here, we extended our phylogenetic analysis by including additional SEO genes from several diverse species based on recently published genomic data. Our results strengthen the original assumption that SEO genes seem to be widespread in dicotyledonous angiosperms, and further underline the divergent evolution of SEO genes within the Fabaceae. PMID:21422825
The sieve element occlusion gene family in dicotyledonous plants.

PubMed

Ernst, Antonia M; Rüping, Boris; Jekat, Stephan B; Nordzieke, Steffen; Reineke, Anna R; Müller, Boje; Bornberg-Bauer, Erich; Prüfer, Dirk; Noll, Gundula A

2011-01-01

Sieve element occlusion (SEO) genes encoding forisome subunits have been identified in Medicago truncatula and other legumes. Forisomes are structural phloem proteins uniquely found in Fabaceae sieve elements. They undergo a reversible conformational change after wounding, from a condensed to a dispersed state, thereby blocking sieve tube translocation and preventing the loss of photoassimilates. Recently, we identified SEO genes in several non-Fabaceae plants (lacking forisomes) and concluded that they most probably encode conventional non-forisome P-proteins. Molecular and phylogenetic analysis of the SEO gene family has identified domains that are characteristic for SEO proteins. Here, we extended our phylogenetic analysis by including additional SEO genes from several diverse species based on recently published genomic data. Our results strengthen the original assumption that SEO genes seem to be widespread in dicotyledonous angiosperms, and further underline the divergent evolution of SEO genes within the Fabaceae.
Similarity of markers identified from cancer gene expression studies: observations from GEO.

PubMed

Shi, Xingjie; Shen, Shihao; Liu, Jin; Huang, Jian; Zhou, Yong; Ma, Shuangge

2014-09-01

Gene expression profiling has been extensively conducted in cancer research. The analysis of multiple independent cancer gene expression datasets may provide additional information and complement single-dataset analysis. In this study, we conduct multi-dataset analysis and are interested in evaluating the similarity of cancer-associated genes identified from different datasets. The first objective of this study is to briefly review some statistical methods that can be used for such evaluation. Both marginal analysis and joint analysis methods are reviewed. The second objective is to apply those methods to 26 Gene Expression Omnibus (GEO) datasets on five types of cancers. Our analysis suggests that for the same cancer, the marker identification results may vary significantly across datasets, and different datasets share few common genes. In addition, datasets on different cancers share few common genes. The shared genetic basis of datasets on the same or different cancers, which has been suggested in the literature, is not observed in the analysis of GEO data. © The Author 2013. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Golbus, Jessica R.; Puckelwartz, Megan J.; Dellefave-Castillo, Lisa

Background—Cardiomyopathy is highly heritable but genetically diverse. At present, genetic testing for cardiomyopathy uses targeted sequencing to simultaneously assess the coding regions of more than 50 genes. New genes are routinely added to panels to improve the diagnostic yield. With the anticipated $1000 genome, it is expected that genetic testing will shift towards comprehensive genome sequencing accompanied by targeted gene analysis. Therefore, we assessed the reliability of whole genome sequencing and targeted analysis to identify cardiomyopathy variants in 11 subjects with cardiomyopathy. Methods and Results—Whole genome sequencing with an average of 37× coverage was combined with targeted analysis focused onmore » 204 genes linked to cardiomyopathy. Genetic variants were scored using multiple prediction algorithms combined with frequency data from public databases. This pipeline yielded 1-14 potentially pathogenic variants per individual. Variants were further analyzed using clinical criteria and/or segregation analysis. Three of three previously identified primary mutations were detected by this analysis. In six subjects for whom the primary mutation was previously unknown, we identified mutations that segregated with disease, had clinical correlates, and/or had additional pathological correlation to provide evidence for causality. For two subjects with previously known primary mutations, we identified additional variants that may act as modifiers of disease severity. In total, we identified the likely pathological mutation in 9 of 11 (82%) subjects. We conclude that these pilot data demonstrate that ~30-40× coverage whole genome sequencing combined with targeted analysis is feasible and sensitive to identify rare variants in cardiomyopathy-associated genes.« less
Genetic signatures of adaptation revealed from transcriptome sequencing of Arctic and red foxes.

PubMed

Kumar, Vikas; Kutschera, Verena E; Nilsson, Maria A; Janke, Axel

2015-08-07

The genus Vulpes (true foxes) comprises numerous species that inhabit a wide range of habitats and climatic conditions, including one species, the Arctic fox (Vulpes lagopus) which is adapted to the arctic region. A close relative to the Arctic fox, the red fox (Vulpes vulpes), occurs in subarctic to subtropical habitats. To study the genetic basis of their adaptations to different environments, transcriptome sequences from two Arctic foxes and one red fox individual were generated and analyzed for signatures of positive selection. In addition, the data allowed for a phylogenetic analysis and divergence time estimate between the two fox species. The de novo assembly of reads resulted in more than 160,000 contigs/transcripts per individual. Approximately 17,000 homologous genes were identified using human and the non-redundant databases. Positive selection analyses revealed several genes involved in various metabolic and molecular processes such as energy metabolism, cardiac gene regulation, apoptosis and blood coagulation to be under positive selection in foxes. Branch site tests identified four genes to be under positive selection in the Arctic fox transcriptome, two of which are fat metabolism genes. In the red fox transcriptome eight genes are under positive selection, including molecular process genes, notably genes involved in ATP metabolism. Analysis of the three transcriptomes and five Sanger re-sequenced genes in additional individuals identified a lower genetic variability within Arctic foxes compared to red foxes, which is consistent with distribution range differences and demographic responses to past climatic fluctuations. A phylogenomic analysis estimated that the Arctic and red fox lineages diverged about three million years ago. Transcriptome data are an economic way to generate genomic resources for evolutionary studies. Despite not representing an entire genome, this transcriptome analysis identified numerous genes that are relevant to arctic adaptation in foxes. Similar to polar bears, fat metabolism seems to play a central role in adaptation of Arctic foxes to the cold climate, as has been identified in the polar bear, another arctic specialist.
Oxaloacetate and malate production in engineered Escherichia coli by expression of codon-optimized phosphoenolpyruvate carboxylase2 gene from Dunaliella salina.

PubMed

Park, Soohyun; Chang, Kwang Suk; Jin, Eonseon; Pack, Seung Pil; Lee, Jinwon

2013-01-01

A new phosphoenolpyruvate carboxylase (PEPC) gene of Dunaliella salina is identified using homology analysis was conducted using PEPC gene of Chlamydomonas reinhardtii and Arabidopsis thaliana. Recombinant E. coli SGJS115 with increased production of malate and oxaloacetate was developed by introducing codon-optimized phosphoenolpyruvate carboxylase2 (OPDSPEPC2) gene of Dunaliella salina. E. coli SGJS115 yielded a 9.9 % increase in malate production. In addition, E. coli SGJS115 exhibited two times increase in the yield of oxaloacetate over the E. coli SGJS114 having identified PEPC2 gene obtained from Dunaliella salina.
IFT27, encoding a small GTPase component of IFT particles, is mutated in a consanguineous family with Bardet–Biedl syndrome

PubMed Central

Aldahmesh, Mohammed A.; Li, Yuanyuan; Alhashem, Amal; Anazi, Shams; Alkuraya, Hisham; Hashem, Mais; Awaji, Ali A.; Sogaty, Sameera; Alkharashi, Abdullah; Alzahrani, Saeed; Al Hazzaa, Selwa A.; Xiong, Yong; Kong, Shanshan; Sun, Zhaoxia; Alkuraya, Fowzan S.

2014-01-01

Bardet–Biedl syndrome (BBS) is an autosomal recessive ciliopathy with multisystem involvement. So far, 18 BBS genes have been identified and the majority of them are essential for the function of BBSome, a protein complex involved in transporting membrane proteins into and from cilia. Yet defects in the identified genes cannot account for all the BBS cases. The genetic heterogeneity of this disease poses significant challenge to the identification of additional BBS genes. In this study, we coupled human genetics with functional validation in zebrafish and identified IFT27 as a novel BBS gene (BBS19). This is the first time an intraflagellar transport (IFT) gene is implicated in the pathogenesis of BBS, highlighting the genetic complexity of this disease. PMID:24488770
Identifying a gene expression signature of cluster headache in blood

PubMed Central

Eising, Else; Pelzer, Nadine; Vijfhuizen, Lisanne S.; Vries, Boukje de; Ferrari, Michel D.; ‘t Hoen, Peter A. C.; Terwindt, Gisela M.; van den Maagdenberg, Arn M. J. M.

2017-01-01

Cluster headache is a relatively rare headache disorder, typically characterized by multiple daily, short-lasting attacks of excruciating, unilateral (peri-)orbital or temporal pain associated with autonomic symptoms and restlessness. To better understand the pathophysiology of cluster headache, we used RNA sequencing to identify differentially expressed genes and pathways in whole blood of patients with episodic (n = 19) or chronic (n = 20) cluster headache in comparison with headache-free controls (n = 20). Gene expression data were analysed by gene and by module of co-expressed genes with particular attention to previously implicated disease pathways including hypocretin dysregulation. Only moderate gene expression differences were identified and no associations were found with previously reported pathogenic mechanisms. At the level of functional gene sets, associations were observed for genes involved in several brain-related mechanisms such as GABA receptor function and voltage-gated channels. In addition, genes and modules of co-expressed genes showed a role for intracellular signalling cascades, mitochondria and inflammation. Although larger study samples may be required to identify the full range of involved pathways, these results indicate a role for mitochondria, intracellular signalling and inflammation in cluster headache. PMID:28074859

Genotypic variability-based genome-wide association study identifies non-additive loci HLA-C and IL12B for psoriasis.

PubMed

Wei, Wen-Hua; Massey, Jonathan; Worthington, Jane; Barton, Anne; Warren, Richard B

2018-03-01

Genome-wide association studies (GWASs) have identified a number of loci for psoriasis but largely ignored non-additive effects. We report a genotypic variability-based GWAS (vGWAS) that can prioritize non-additive loci without requiring prior knowledge of interaction types or interacting factors in two steps, using a mixed model to partition dichotomous phenotypes into an additive component and non-additive environmental residuals on the liability scale and then the Levene's (Brown-Forsythe) test to assess equality of the residual variances across genotype groups genome widely. The vGWAS identified two genome-wide significant (P < 5.0e-08) non-additive loci HLA-C and IL12B that were also genome-wide significant in an accompanying GWAS in the discovery cohort. Both loci were statistically replicated in vGWAS of an independent cohort with a small sample size. HLA-C and IL12B were reported in moderate gene-gene and/or gene-environment interactions in several occasions. We found a moderate interaction with age-of-onset of psoriasis, which was replicated indirectly. The vGWAS also revealed five suggestive loci (P < 6.76e-05) including FUT2 that was associated with psoriasis with environmental aspects triggered by virus infection and/or metabolic factors. Replication and functional investigation are needed to validate the suggestive vGWAS loci.
Meta-Analysis of Placental Transcriptome Data Identifies a Novel Molecular Pathway Related to Preeclampsia.

PubMed

van Uitert, Miranda; Moerland, Perry D; Enquobahrie, Daniel A; Laivuori, Hannele; van der Post, Joris A M; Ris-Stalpers, Carrie; Afink, Gijs B

2015-01-01

Studies using the placental transcriptome to identify key molecules relevant for preeclampsia are hampered by a relatively small sample size. In addition, they use a variety of bioinformatics and statistical methods, making comparison of findings challenging. To generate a more robust preeclampsia gene expression signature, we performed a meta-analysis on the original data of 11 placenta RNA microarray experiments, representing 139 normotensive and 116 preeclamptic pregnancies. Microarray data were pre-processed and analyzed using standardized bioinformatics and statistical procedures and the effect sizes were combined using an inverse-variance random-effects model. Interactions between genes in the resulting gene expression signature were identified by pathway analysis (Ingenuity Pathway Analysis, Gene Set Enrichment Analysis, Graphite) and protein-protein associations (STRING). This approach has resulted in a comprehensive list of differentially expressed genes that led to a 388-gene meta-signature of preeclamptic placenta. Pathway analysis highlights the involvement of the previously identified hypoxia/HIF1A pathway in the establishment of the preeclamptic gene expression profile, while analysis of protein interaction networks indicates CREBBP/EP300 as a novel element central to the preeclamptic placental transcriptome. In addition, there is an apparent high incidence of preeclampsia in women carrying a child with a mutation in CREBBP/EP300 (Rubinstein-Taybi Syndrome). The 388-gene preeclampsia meta-signature offers a vital starting point for further studies into the relevance of these genes (in particular CREBBP/EP300) and their concomitant pathways as biomarkers or functional molecules in preeclampsia. This will result in a better understanding of the molecular basis of this disease and opens up the opportunity to develop rational therapies targeting the placental dysfunction causal to preeclampsia.
Genetic organization of the unc-22 IV gene and the adjacent region in Caenorhabditis elegans.

PubMed

Rogalski, T M; Baillie, D L

1985-01-01

The genetic organization of the region immediately adjacent to the unc-22 IV gene in Caenorhabditis elegans has been studied. We have identified twenty essential genes in this interval of approximately 1.5-map units on Linkage Group IV. The mutations that define these genes were positioned by recombination mapping and complementation with several deficiencies. With few exceptions, the positions obtained by these two methods agreed. Eight of the twenty essential genes identified are represented by more than one allele. Three possible internal deletions of the unc-22 gene have been located by intra-genic mapping. In addition, the right end point of a deficiency or an inversion affecting the adjacent genes let-56 and unc-22 has been positioned inside the unc-22 gene.
GeneSeqToFamily: a Galaxy workflow to find gene families based on the Ensembl Compara GeneTrees pipeline.

PubMed

Thanki, Anil S; Soranzo, Nicola; Haerty, Wilfried; Davey, Robert P

2018-03-01

Gene duplication is a major factor contributing to evolutionary novelty, and the contraction or expansion of gene families has often been associated with morphological, physiological, and environmental adaptations. The study of homologous genes helps us to understand the evolution of gene families. It plays a vital role in finding ancestral gene duplication events as well as identifying genes that have diverged from a common ancestor under positive selection. There are various tools available, such as MSOAR, OrthoMCL, and HomoloGene, to identify gene families and visualize syntenic information between species, providing an overview of syntenic regions evolution at the family level. Unfortunately, none of them provide information about structural changes within genes, such as the conservation of ancestral exon boundaries among multiple genomes. The Ensembl GeneTrees computational pipeline generates gene trees based on coding sequences, provides details about exon conservation, and is used in the Ensembl Compara project to discover gene families. A certain amount of expertise is required to configure and run the Ensembl Compara GeneTrees pipeline via command line. Therefore, we converted this pipeline into a Galaxy workflow, called GeneSeqToFamily, and provided additional functionality. This workflow uses existing tools from the Galaxy ToolShed, as well as providing additional wrappers and tools that are required to run the workflow. GeneSeqToFamily represents the Ensembl GeneTrees pipeline as a set of interconnected Galaxy tools, so they can be run interactively within the Galaxy's user-friendly workflow environment while still providing the flexibility to tailor the analysis by changing configurations and tools if necessary. Additional tools allow users to subsequently visualize the gene families produced by the workflow, using the Aequatus.js interactive tool, which has been developed as part of the Aequatus software project.
The Genetic and Molecular Organization of the Dopa Decarboxylase Gene Cluster of Drosophila Melanogaster

PubMed Central

Stathakis, D. G.; Pentz, E. S.; Freeman, M. E.; Kullman, J.; Hankins, G. R.; Pearlson, N. J.; Wright, TRF.

1995-01-01

We report the complete molecular organization of the Dopa decarboxylase gene cluster. Mutagenesis screens recovered 77 new Df(2L)TW130 recessive lethal mutations. These new alleles combined with 263 previously isolated mutations in the cluster to define 18 essential genes. In addition, seven new deficiencies were isolated and characterized. Deficiency mapping, restriction fragment length polymorphism (RFLP) analysis and P-element-mediated germline transformation experiments determined the gene order for all 18 loci. Genomic and cDNA restriction endonuclease mapping, Northern blot analysis and DNA sequencing provided information on exact gene location, mRNA size and transcriptional direction for most of these loci. In addition, this analysis identified two transcription units that had not previously been identified by extensive mutagenesis screening. Most of the loci are contained within two dense subclusters. We discuss the effectiveness of mutagens and strategies used in our screens, the variable mutability of loci within the genome of Drosophila melanogaster, the cytological and molecular organization of the Ddc gene cluster, the validity of the one band-one gene hypothesis and a possible purpose for the clustering of genes in the Ddc region. PMID:8647399
Rare Genetic Forms of Obesity: Clinical Approach and Current Treatments in 2016

PubMed Central

Huvenne, Hélène; Dubern, Béatrice; Clément, Karine; Poitou, Christine

2016-01-01

Obesity results from a synergistic relationship between genes and the environment. The phenotypic expression of genetic factors involved in obesity is variable, allowing to distinguish several clinical pictures of obesity. Monogenic obesity is described as rare and severe early-onset obesity with abnormal feeding behavior and endocrine disorders. This is mainly due to autosomal recessive mutations in genes of the leptin-melanocortin pathway which plays a key role in the hypothalamic control of food intake. Melanocortin 4 receptor(MC4R)-linked obesity is characterized by the variable severity of obesity and no notable additional phenotypes. Mutations in the MC4R gene are involved in 2-3% of obese children and adults; the majority of these are heterozygous. Syndromic obesity is associated with mental retardation, dysmorphic features, and organ-specific developmental abnormalities. Additional genes participating in the development of hypothalamus and central nervous system have been regularly identified. But to date, not all involved genes have been identified so far. New diagnostic tools, such as whole-exome sequencing, will probably help to identify other genes. Managing these patients is challenging. Indeed, specific treatments are available only for specific types of monogenic obesity, such as leptin deficiency. Data on bariatric surgery are limited and controversial. New molecules acting on the leptin-melanocortin pathway are currently being developed. PMID:27241181
Integrated analysis of epigenomic and genomic changes by DNA methylation dependent mechanisms provides potential novel biomarkers for prostate cancer.

PubMed

White-Al Habeeb, Nicole M A; Ho, Linh T; Olkhov-Mitsel, Ekaterina; Kron, Ken; Pethe, Vaijayanti; Lehman, Melanie; Jovanovic, Lidija; Fleshner, Neil; van der Kwast, Theodorus; Nelson, Colleen C; Bapat, Bharati

2014-09-15

Epigenetic silencing mediated by CpG methylation is a common feature of many cancers. Characterizing aberrant DNA methylation changes associated with tumor progression may identify potential prognostic markers for prostate cancer (PCa). We treated two PCa cell lines, 22Rv1 and DU-145 with the demethylating agent 5-Aza 2'-deoxycitidine (DAC) and global methylation status was analyzed by performing methylation-sensitive restriction enzyme based differential methylation hybridization strategy followed by genome-wide CpG methylation array profiling. In addition, we examined gene expression changes using a custom microarray. Gene Set Enrichment Analysis (GSEA) identified the most significantly dysregulated pathways. In addition, we assessed methylation status of candidate genes that showed reduced CpG methylation and increased gene expression after DAC treatment, in Gleason score (GS) 8 vs. GS6 patients using three independent cohorts of patients; the publically available The Cancer Genome Atlas (TCGA) dataset, and two separate patient cohorts. Our analysis, by integrating methylation and gene expression in PCa cell lines, combined with patient tumor data, identified novel potential biomarkers for PCa patients. These markers may help elucidate the pathogenesis of PCa and represent potential prognostic markers for PCa patients.
Identification of a p-cresol degradation pathway by a GFP-based transposon in Pseudomonas and its dominant expression in colonies.

PubMed

Cho, Ah Ra; Lim, Eun Jin; Veeranagouda, Yaligara; Lee, Kyoung

2011-11-01

In this study, the chromosome-encoded pcuRCAXB genes that are required for p-cresol degradation have been identified by using a newly constructed green fluorescent protein (GFP)-based promoter probe transposon in the long-chain alkylphenol degrader Pseudomonas alkylphenolia. The deduced amino acid sequences of the genes showed the highest identities at the levels of 65-93% compared with those in the databases. The transposon was identified to be inserted in the pcuA gene, with the promoterless gfp gene being under the control of the pcu catabolic gene promoter. The expression of GFP was positively induced by p-cresol and was about 10 times higher by cells grown on agar than those in liquid culture. In addition, phydroxybenzoic acid was detected during p-cresol degradation. These results indicate that P. alkylphenolia additionally possesses a protocatechuate ortho-cleavage route for pcresol degradation that is dominantly expressed in colonies.
Global analysis of the Burkholderia thailandensis quorum sensing-controlled regulon.

PubMed

Majerczyk, Charlotte; Brittnacher, Mitchell; Jacobs, Michael; Armour, Christopher D; Radey, Mathew; Schneider, Emily; Phattarasokul, Somsak; Bunt, Richard; Greenberg, E Peter

2014-04-01

Burkholderia thailandensis contains three acyl-homoserine lactone quorum sensing circuits and has two additional LuxR homologs. To identify B. thailandensis quorum sensing-controlled genes, we carried out transcriptome sequencing (RNA-seq) analyses of quorum sensing mutants and their parent. The analyses were grounded in the fact that we identified genes coding for factors shown previously to be regulated by quorum sensing among a larger set of quorum-controlled genes. We also found that genes coding for contact-dependent inhibition were induced by quorum sensing and confirmed that specific quorum sensing mutants had a contact-dependent inhibition defect. Additional quorum-controlled genes included those for the production of numerous secondary metabolites, an uncharacterized exopolysaccharide, and a predicted chitin-binding protein. This study provides insights into the roles of the three quorum sensing circuits in the saprophytic lifestyle of B. thailandensis, and it provides a foundation on which to build an understanding of the roles of quorum sensing in the biology of B. thailandensis and the closely related pathogenic Burkholderia pseudomallei and Burkholderia mallei.
Exome Sequence Analysis of 14 Families With High Myopia.

PubMed

Kloss, Bethany A; Tompson, Stuart W; Whisenhunt, Kristina N; Quow, Krystina L; Huang, Samuel J; Pavelec, Derek M; Rosenberg, Thomas; Young, Terri L

2017-04-01

To identify causal gene mutations in 14 families with autosomal dominant (AD) high myopia using exome sequencing. Select individuals from 14 large Caucasian families with high myopia were exome sequenced. Gene variants were filtered to identify potential pathogenic changes. Sanger sequencing was used to confirm variants in original DNA, and to test for disease cosegregation in additional family members. Candidate genes and chromosomal loci previously associated with myopic refractive error and its endophenotypes were comprehensively screened. In 14 high myopia families, we identified 73 rare and 31 novel gene variants as candidates for pathogenicity. In seven of these families, two of the novel and eight of the rare variants were within known myopia loci. A total of 104 heterozygous nonsynonymous rare variants in 104 genes were identified in 10 out of 14 probands. Each variant cosegregated with affection status. No rare variants were identified in genes known to cause myopia or in genes closest to published genome-wide association study association signals for refractive error or its endophenotypes. Whole exome sequencing was performed to determine gene variants implicated in the pathogenesis of AD high myopia. This study provides new genes for consideration in the pathogenesis of high myopia, and may aid in the development of genetic profiling of those at greatest risk for attendant ocular morbidities of this disorder.
A chronological expression profile of gene activity during embryonic mouse brain development.

PubMed

Goggolidou, P; Soneji, S; Powles-Glover, N; Williams, D; Sethi, S; Baban, D; Simon, M M; Ragoussis, I; Norris, D P

2013-12-01

The brain is a functionally complex organ, the patterning and development of which are key to adult health. To help elucidate the genetic networks underlying mammalian brain patterning, we conducted detailed transcriptional profiling during embryonic development of the mouse brain. A total of 2,400 genes were identified as showing differential expression between three developmental stages. Analysis of the data identified nine gene clusters to demonstrate analogous expression profiles. A significant group of novel genes of as yet undiscovered biological function were detected as being potentially relevant to brain development and function, in addition to genes that have previously identified roles in the brain. Furthermore, analysis for genes that display asymmetric expression between the left and right brain hemispheres during development revealed 35 genes as putatively asymmetric from a combined data set. Our data constitute a valuable new resource for neuroscience and neurodevelopment, exposing possible functional associations between genes, including novel loci, and encouraging their further investigation in human neurological and behavioural disorders.
Parkinson's disease candidate gene prioritization based on expression profile of midbrain dopaminergic neurons

PubMed Central

2010-01-01

Background Parkinson's disease is the second most common neurodegenerative disorder. The pathological hallmark of the disease is degeneration of midbrain dopaminergic neurons. Genetic association studies have linked 13 human chromosomal loci to Parkinson's disease. Identification of gene(s), as part of the etiology of Parkinson's disease, within the large number of genes residing in these loci can be achieved through several approaches, including screening methods, and considering appropriate criteria. Since several of the indentified Parkinson's disease genes are expressed in substantia nigra pars compact of the midbrain, expression within the neurons of this area could be a suitable criterion to limit the number of candidates and identify PD genes. Methods In this work we have used the combination of findings from six rodent transcriptome analysis studies on the gene expression profile of midbrain dopaminergic neurons and the PARK loci in OMIM (Online Mendelian Inheritance in Man) database, to identify new candidate genes for Parkinson's disease. Results Merging the two datasets, we identified 20 genes within PARK loci, 7 of which are located in an orphan Parkinson's disease locus and one, which had been identified as a disease gene. In addition to identifying a set of candidates for further genetic association studies, these results show that the criteria of expression in midbrain dopaminergic neurons may be used to narrow down the number of genes in PARK loci for such studies. PMID:20716345
Genome-Wide Association Identifies SLC2A9 and NLN Gene Regions as Associated with Entropion in Domestic Sheep

PubMed Central

Mousel, Michelle R.; Reynolds, James O.; White, Stephen N.

2015-01-01

Entropion is an inward rolling of the eyelid allowing contact between the eyelashes and cornea that may lead to blindness if not corrected. Although many mammalian species, including humans and dogs, are afflicted by congenital entropion, no specific genes or gene regions related to development of entropion have been reported in any mammalian species to date. Entropion in domestic sheep is known to have a genetic component therefore, we used domestic sheep as a model system to identify genomic regions containing genes associated with entropion. A genome-wide association was conducted with congenital entropion in 998 Columbia, Polypay, and Rambouillet sheep genotyped with 50,000 SNP markers. Prevalence of entropion was 6.01%, with all breeds represented. Logistic regression was performed in PLINK with additive allelic, recessive, dominant, and genotypic inheritance models. Two genome-wide significant (empirical P<0.05) SNP were identified, specifically markers in SLC2A9 (empirical P = 0.007; genotypic model) and near NLN (empirical P = 0.026; dominance model). Six additional genome-wide suggestive SNP (nominal P<1x10-5) were identified including markers in or near PIK3CB (P = 2.22x10-6; additive model), KCNB1 (P = 2.93x10-6; dominance model), ZC3H12C (P = 3.25x10-6; genotypic model), JPH1 (P = 4.68x20-6; genotypic model), and MYO3B (P = 5.74x10-6; recessive model). This is the first report of specific gene regions associated with congenital entropion in any mammalian species, to our knowledge. Further, none of these genes have previously been associated with any eyelid traits. These results represent the first genome-wide analysis of gene regions associated with entropion and provide target regions for the development of sheep genetic markers for marker-assisted selection. PMID:26098909
Genome-Wide Association Identifies SLC2A9 and NLN Gene Regions as Associated with Entropion in Domestic Sheep.

PubMed

Mousel, Michelle R; Reynolds, James O; White, Stephen N

2015-01-01

Entropion is an inward rolling of the eyelid allowing contact between the eyelashes and cornea that may lead to blindness if not corrected. Although many mammalian species, including humans and dogs, are afflicted by congenital entropion, no specific genes or gene regions related to development of entropion have been reported in any mammalian species to date. Entropion in domestic sheep is known to have a genetic component therefore, we used domestic sheep as a model system to identify genomic regions containing genes associated with entropion. A genome-wide association was conducted with congenital entropion in 998 Columbia, Polypay, and Rambouillet sheep genotyped with 50,000 SNP markers. Prevalence of entropion was 6.01%, with all breeds represented. Logistic regression was performed in PLINK with additive allelic, recessive, dominant, and genotypic inheritance models. Two genome-wide significant (empirical P<0.05) SNP were identified, specifically markers in SLC2A9 (empirical P = 0.007; genotypic model) and near NLN (empirical P = 0.026; dominance model). Six additional genome-wide suggestive SNP (nominal P<1x10(-5)) were identified including markers in or near PIK3CB (P = 2.22x10(-6); additive model), KCNB1 (P = 2.93x10(-6); dominance model), ZC3H12C (P = 3.25x10(-6); genotypic model), JPH1 (P = 4.68x20(-6); genotypic model), and MYO3B (P = 5.74x10(-6); recessive model). This is the first report of specific gene regions associated with congenital entropion in any mammalian species, to our knowledge. Further, none of these genes have previously been associated with any eyelid traits. These results represent the first genome-wide analysis of gene regions associated with entropion and provide target regions for the development of sheep genetic markers for marker-assisted selection.
A genome-scale map of expression for a mouse brain section obtained using voxelation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chin, Mark H.; Geng, Alex B.; Khan, Arshad H.

Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological diseases. We have reconstructed 2- dimensional images of gene expression for 20,000 genes in a coronal slice of the mouse brain at the level of the striatum by using microarrays in combination with voxelation at a resolution of 1 mm3. Good reliability of the microarray results were confirmed using multiple replicates, subsequent quantitative RT-PCR voxelation, mass spectrometry voxelation and publicly available in situ hybridization data. Known and novel genes were identified with expression patterns localized to defined substructures within the brain. In addition, genesmore » with unexpected patterns were identified and cluster analysis identified a set of genes with a gradient of dorsal/ventral expression not restricted to known anatomical boundaries. The genome-scale maps of gene expression obtained using voxelation will be a valuable tool for the neuroscience community.« less
Identification of pathogenic gene variants in small families with intellectually disabled siblings by exome sequencing.

PubMed

Schuurs-Hoeijmakers, Janneke H M; Vulto-van Silfhout, Anneke T; Vissers, Lisenka E L M; van de Vondervoort, Ilse I G M; van Bon, Bregje W M; de Ligt, Joep; Gilissen, Christian; Hehir-Kwa, Jayne Y; Neveling, Kornelia; del Rosario, Marisol; Hira, Gausiya; Reitano, Santina; Vitello, Aurelio; Failla, Pinella; Greco, Donatella; Fichera, Marco; Galesi, Ornella; Kleefstra, Tjitske; Greally, Marie T; Ockeloen, Charlotte W; Willemsen, Marjolein H; Bongers, Ernie M H F; Janssen, Irene M; Pfundt, Rolph; Veltman, Joris A; Romano, Corrado; Willemsen, Michèl A; van Bokhoven, Hans; Brunner, Han G; de Vries, Bert B A; de Brouwer, Arjan P M

2013-12-01

Intellectual disability (ID) is a common neurodevelopmental disorder affecting 1-3% of the general population. Mutations in more than 10% of all human genes are considered to be involved in this disorder, although the majority of these genes are still unknown. We investigated 19 small non-consanguineous families with two to five affected siblings in order to identify pathogenic gene variants in known, novel and potential ID candidate genes. Non-consanguineous families have been largely ignored in gene identification studies as small family size precludes prior mapping of the genetic defect. Using exome sequencing, we identified pathogenic mutations in three genes, DDHD2, SLC6A8, and SLC9A6, of which the latter two have previously been implicated in X-linked ID phenotypes. In addition, we identified potentially pathogenic mutations in BCORL1 on the X-chromosome and in MCM3AP, PTPRT, SYNE1, and ZNF528 on autosomes. We show that potentially pathogenic gene variants can be identified in small, non-consanguineous families with as few as two affected siblings, thus emphasising their value in the identification of syndromic and non-syndromic ID genes.
Levetiracetam attenuates hippocampal expression of synaptic plasticity-related immediate early and late response genes in amygdala-kindled rats

PubMed Central

2010-01-01

Background The amygdala-kindled rat is a model for human temporal lobe epilepsy and activity-dependent synaptic plasticity. Hippocampal RNA isolated from amygdala-kindled rats at different kindling stages was analyzed to identify kindling-induced genes. Furthermore, effects of the anti-epileptic drug levetiracetam on kindling-induced gene expression were examined. Results Cyclooxygenase-2 (Cox-2), Protocadherin-8 (Pcdh8) and TGF-beta-inducible early response gene-1 (TIEG1) were identified and verified as differentially expressed transcripts in the hippocampus of kindled rats by in situ hybridization and quantitative RT-PCR. In addition, we identified a panel of 16 additional transcripts which included Arc, Egr3/Pilot, Homer1a, Ania-3, MMP9, Narp, c-fos, NGF, BDNF, NT-3, Synaptopodin, Pim1 kinase, TNF-α, RGS2, Egr2/krox-20 and β-A activin that were differentially expressed in the hippocampus of amygdala-kindled rats. The list consists of many synaptic plasticity-related immediate early genes (IEGs) as well as some late response genes encoding transcription factors, neurotrophic factors and proteins that are known to regulate synaptic remodelling. In the hippocampus, induction of IEG expression was dependent on the afterdischarge (AD) duration. Levetiracetam, 40 mg/kg, suppressed the development of kindling measured as severity of seizures and AD duration. In addition, single animal profiling also showed that levetiracetam attenuated the observed kindling-induced IEG expression; an effect that paralleled the anti-epileptic effect of the drug on AD duration. Conclusions The present study provides mRNA expression data that suggest that levetiracetam attenuates expression of genes known to regulate synaptic remodelling. In the kindled rat, levetiracetam does so by shortening the AD duration thereby reducing the seizure-induced changes in mRNA expression in the hippocampus. PMID:20105316
Regulation of gene expression in the mammalian eye and its relevance to eye disease

PubMed Central

Scheetz, Todd E.; Kim, Kwang-Youn A.; Swiderski, Ruth E.; Philp, Alisdair R.; Braun, Terry A.; Knudtson, Kevin L.; Dorrance, Anne M.; DiBona, Gerald F.; Huang, Jian; Casavant, Thomas L.; Sheffield, Val C.; Stone, Edwin M.

2006-01-01

We used expression quantitative trait locus mapping in the laboratory rat (Rattus norvegicus) to gain a broad perspective of gene regulation in the mammalian eye and to identify genetic variation relevant to human eye disease. Of >31,000 gene probes represented on an Affymetrix expression microarray, 18,976 exhibited sufficient signal for reliable analysis and at least 2-fold variation in expression among 120 F2 rats generated from an SR/JrHsd × SHRSP intercross. Genome-wide linkage analysis with 399 genetic markers revealed significant linkage with at least one marker for 1,300 probes (α = 0.001; estimated empirical false discovery rate = 2%). Both contiguous and noncontiguous loci were found to be important in regulating mammalian eye gene expression. We investigated one locus of each type in greater detail and identified putative transcription-altering variations in both cases. We found an inserted cREL binding sequence in the 5′ flanking sequence of the Abca4 gene associated with an increased expression level of that gene, and we found a mutation of the gene encoding thyroid hormone receptor β2 associated with a decreased expression level of the gene encoding short-wavelength sensitive opsin (Opn1sw). In addition to these positional studies, we performed a pairwise analysis of gene expression to identify genes that are regulated in a coordinated manner and used this approach to validate two previously undescribed genes involved in the human disease Bardet–Biedl syndrome. These data and analytical approaches can be used to facilitate the discovery of additional genes and regulatory elements involved in human eye disease. PMID:16983098
Identification of 28 cytochrome P450 genes from the transcriptome of the marine rotifer Brachionus plicatilis and analysis of their expression.

PubMed

Kim, Hui-Su; Han, Jeonghoon; Kim, Hee-Jin; Hagiwara, Atsushi; Lee, Jae-Seong

2017-09-01

Whole transcriptomes of the rotifer Brachionus plicatilis were analyzed using an Illumina sequencer. De novo assembly was performed with 49,122,780 raw reads using Trinity software. Among the assembled 42,820 contigs, 27,437 putative open reading frame contigs were identified (average length 1235bp; N50=1707bp). Functional gene annotation with Gene Ontology and InterProScan, in addition to Kyoto Encyclopedia of Genes and Genomes pathway analysis, highlighted the metabolism of xenobiotics by cytochrome P450 (CYP). In addition, 28 CYP genes were identified, and their transcriptional responses to benzo[α]pyrene (B[α]P) were investigated. Most of the CYPs were significantly upregulated or downregulated (P<0.05) in response to B[α]P, suggesting that Bp-CYP genes play a crucial role in detoxification mechanisms in response to xenobiotics. This study sheds light on the molecular defense mechanisms of the rotifer B. plicatilis in response to exposure to various chemicals. Copyright © 2017 Elsevier Inc. All rights reserved.
Gene-set analysis based on the pharmacological profiles of drugs to identify repurposing opportunities in schizophrenia.

PubMed

de Jong, Simone; Vidler, Lewis R; Mokrab, Younes; Collier, David A; Breen, Gerome

2016-08-01

Genome-wide association studies (GWAS) have identified thousands of novel genetic associations for complex genetic disorders, leading to the identification of potential pharmacological targets for novel drug development. In schizophrenia, 108 conservatively defined loci that meet genome-wide significance have been identified and hundreds of additional sub-threshold associations harbour information on the genetic aetiology of the disorder. In the present study, we used gene-set analysis based on the known binding targets of chemical compounds to identify the 'drug pathways' most strongly associated with schizophrenia-associated genes, with the aim of identifying potential drug repositioning opportunities and clues for novel treatment paradigms, especially in multi-target drug development. We compiled 9389 gene sets (2496 with unique gene content) and interrogated gene-based p-values from the PGC2-SCZ analysis. Although no single drug exceeded experiment wide significance (corrected p<0.05), highly ranked gene-sets reaching suggestive significance including the dopamine receptor antagonists metoclopramide and trifluoperazine and the tyrosine kinase inhibitor neratinib. This is a proof of principle analysis showing the potential utility of GWAS data of schizophrenia for the direct identification of candidate drugs and molecules that show polypharmacy. © The Author(s) 2016.

Identification of T1D susceptibility genes within the MHC region by combining protein interaction networks and SNP genotyping data

PubMed Central

Brorsson, C.; Hansen, N. T.; Lage, K.; Bergholdt, R.; Brunak, S.; Pociot, F.

2009-01-01

Aim To develop novel methods for identifying new genes that contribute to the risk of developing type 1 diabetes within the Major Histocompatibility Complex (MHC) region on chromosome 6, independently of the known linkage disequilibrium (LD) between human leucocyte antigen (HLA)-DRB1, -DQA1, -DQB1 genes. Methods We have developed a novel method that combines single nucleotide polymorphism (SNP) genotyping data with protein–protein interaction (ppi) networks to identify disease-associated network modules enriched for proteins encoded from the MHC region. Approximately 2500 SNPs located in the 4 Mb MHC region were analysed in 1000 affected offspring trios generated by the Type 1 Diabetes Genetics Consortium (T1DGC). The most associated SNP in each gene was chosen and genes were mapped to ppi networks for identification of interaction partners. The association testing and resulting interacting protein modules were statistically evaluated using permutation. Results A total of 151 genes could be mapped to nodes within the protein interaction network and their interaction partners were identified. Five protein interaction modules reached statistical significance using this approach. The identified proteins are well known in the pathogenesis of T1D, but the modules also contain additional candidates that have been implicated in β-cell development and diabetic complications. Conclusions The extensive LD within the MHC region makes it important to develop new methods for analysing genotyping data for identification of additional risk genes for T1D. Combining genetic data with knowledge about functional pathways provides new insight into mechanisms underlying T1D. PMID:19143816
Transposon mutagenesis identifies genes that cooperate with mutant Pten in breast cancer progression

PubMed Central

Rangel, Roberto; Lee, Song-Choon; Hon-Kim Ban, Kenneth; Guzman-Rojas, Liliana; Mann, Michael B.; Newberg, Justin Y.; McNoe, Leslie A.; Selvanesan, Luxmanan; Ward, Jerrold M.; Rust, Alistair G.; Chin, Kuan-Yew; Black, Michael A.; Jenkins, Nancy A.; Copeland, Neal G.

2016-01-01

Triple-negative breast cancer (TNBC) has the worst prognosis of any breast cancer subtype. To better understand the genetic forces driving TNBC, we performed a transposon mutagenesis screen in a phosphatase and tensin homolog (Pten) mutant mice and identified 12 candidate trunk drivers and a much larger number of progression genes. Validation studies identified eight TNBC tumor suppressor genes, including the GATA-like transcriptional repressor TRPS1. Down-regulation of TRPS1 in TNBC cells promoted epithelial-to-mesenchymal transition (EMT) by deregulating multiple EMT pathway genes, in addition to increasing the expression of SERPINE1 and SERPINB2 and the subsequent migration, invasion, and metastasis of tumor cells. Transposon mutagenesis has thus provided a better understanding of the genetic forces driving TNBC and discovered genes with potential clinical importance in TNBC. PMID:27849608
Targeted Analysis of Whole Genome Sequence Data to Diagnose Genetic Cardiomyopathy

DOE PAGES

Golbus, Jessica R.; Puckelwartz, Megan J.; Dellefave-Castillo, Lisa; ...

2014-09-01

Background—Cardiomyopathy is highly heritable but genetically diverse. At present, genetic testing for cardiomyopathy uses targeted sequencing to simultaneously assess the coding regions of more than 50 genes. New genes are routinely added to panels to improve the diagnostic yield. With the anticipated $1000 genome, it is expected that genetic testing will shift towards comprehensive genome sequencing accompanied by targeted gene analysis. Therefore, we assessed the reliability of whole genome sequencing and targeted analysis to identify cardiomyopathy variants in 11 subjects with cardiomyopathy. Methods and Results—Whole genome sequencing with an average of 37× coverage was combined with targeted analysis focused onmore » 204 genes linked to cardiomyopathy. Genetic variants were scored using multiple prediction algorithms combined with frequency data from public databases. This pipeline yielded 1-14 potentially pathogenic variants per individual. Variants were further analyzed using clinical criteria and/or segregation analysis. Three of three previously identified primary mutations were detected by this analysis. In six subjects for whom the primary mutation was previously unknown, we identified mutations that segregated with disease, had clinical correlates, and/or had additional pathological correlation to provide evidence for causality. For two subjects with previously known primary mutations, we identified additional variants that may act as modifiers of disease severity. In total, we identified the likely pathological mutation in 9 of 11 (82%) subjects. We conclude that these pilot data demonstrate that ~30-40× coverage whole genome sequencing combined with targeted analysis is feasible and sensitive to identify rare variants in cardiomyopathy-associated genes.« less
ENU Mutagenesis in Mice Identifies Candidate Genes For Hypogonadism

PubMed Central

Weiss, Jeffrey; Hurley, Lisa A.; Harris, Rebecca M.; Finlayson, Courtney; Tong, Minghan; Fisher, Lisa A.; Moran, Jennifer L.; Beier, David R.; Mason, Christopher; Jameson, J. Larry

2012-01-01

Genome-wide mutagenesis was performed in mice to identify candidate genes for male infertility, for which the predominant causes remain idiopathic. Mice were mutagenized using N-ethyl-N-nitrosourea (ENU), bred, and screened for phenotypes associated with the male urogenital system. Fifteen heritable lines were isolated and chromosomal loci were assigned using low density genome-wide SNP arrays. Ten of the fifteen lines were pursued further using higher resolution SNP analysis to narrow the candidate gene regions. Exon sequencing of candidate genes identified mutations in mice with cystic kidneys (Bicc1), cryptorchidism (Rxfp2), restricted germ cell deficiency (Plk4), and severe germ cell deficiency (Prdm9). In two other lines with severe hypogonadism candidate sequencing failed to identify mutations, suggesting defects in genes with previously undocumented roles in gonadal function. These genomic intervals were sequenced in their entirety and a candidate mutation was identified in SnrpE in one of the two lines. The line harboring the SnrpE variant retains substantial spermatogenesis despite small testis size, an unusual phenotype. In addition to the reproductive defects, heritable phenotypes were observed in mice with ataxia (Myo5a), tremors (Pmp22), growth retardation (unknown gene), and hydrocephalus (unknown gene). These results demonstrate that the ENU screen is an effective tool for identifying potential causes of male infertility. PMID:22258617
Exome Sequencing Identifies Potentially Druggable Mutations in Nasopharyngeal Carcinoma.

PubMed

Chow, Yock Ping; Tan, Lu Ping; Chai, San Jiun; Abdul Aziz, Norazlin; Choo, Siew Woh; Lim, Paul Vey Hong; Pathmanathan, Rajadurai; Mohd Kornain, Noor Kaslina; Lum, Chee Lun; Pua, Kin Choo; Yap, Yoke Yeow; Tan, Tee Yong; Teo, Soo Hwang; Khoo, Alan Soo-Beng; Patel, Vyomesh

2017-03-03

In this study, we first performed whole exome sequencing of DNA from 10 untreated and clinically annotated fresh frozen nasopharyngeal carcinoma (NPC) biopsies and matched bloods to identify somatically mutated genes that may be amenable to targeted therapeutic strategies. We identified a total of 323 mutations which were either non-synonymous (n = 238) or synonymous (n = 85). Furthermore, our analysis revealed genes in key cancer pathways (DNA repair, cell cycle regulation, apoptosis, immune response, lipid signaling) were mutated, of which those in the lipid-signaling pathway were the most enriched. We next extended our analysis on a prioritized sub-set of 37 mutated genes plus top 5 mutated cancer genes listed in COSMIC using a custom designed HaloPlex target enrichment panel with an additional 88 NPC samples. Our analysis identified 160 additional non-synonymous mutations in 37/42 genes in 66/88 samples. Of these, 99/160 mutations within potentially druggable pathways were further selected for validation. Sanger sequencing revealed that 77/99 variants were true positives, giving an accuracy of 78%. Taken together, our study indicated that ~72% (n = 71/98) of NPC samples harbored mutations in one of the four cancer pathways (EGFR-PI3K-Akt-mTOR, NOTCH, NF-κB, DNA repair) which may be potentially useful as predictive biomarkers of response to matched targeted therapies.
Exome Sequencing Identifies Potentially Druggable Mutations in Nasopharyngeal Carcinoma

PubMed Central

Chow, Yock Ping; Tan, Lu Ping; Chai, San Jiun; Abdul Aziz, Norazlin; Choo, Siew Woh; Lim, Paul Vey Hong; Pathmanathan, Rajadurai; Mohd Kornain, Noor Kaslina; Lum, Chee Lun; Pua, Kin Choo; Yap, Yoke Yeow; Tan, Tee Yong; Teo, Soo Hwang; Khoo, Alan Soo-Beng; Patel, Vyomesh

2017-01-01

In this study, we first performed whole exome sequencing of DNA from 10 untreated and clinically annotated fresh frozen nasopharyngeal carcinoma (NPC) biopsies and matched bloods to identify somatically mutated genes that may be amenable to targeted therapeutic strategies. We identified a total of 323 mutations which were either non-synonymous (n = 238) or synonymous (n = 85). Furthermore, our analysis revealed genes in key cancer pathways (DNA repair, cell cycle regulation, apoptosis, immune response, lipid signaling) were mutated, of which those in the lipid-signaling pathway were the most enriched. We next extended our analysis on a prioritized sub-set of 37 mutated genes plus top 5 mutated cancer genes listed in COSMIC using a custom designed HaloPlex target enrichment panel with an additional 88 NPC samples. Our analysis identified 160 additional non-synonymous mutations in 37/42 genes in 66/88 samples. Of these, 99/160 mutations within potentially druggable pathways were further selected for validation. Sanger sequencing revealed that 77/99 variants were true positives, giving an accuracy of 78%. Taken together, our study indicated that ~72% (n = 71/98) of NPC samples harbored mutations in one of the four cancer pathways (EGFR-PI3K-Akt-mTOR, NOTCH, NF-κB, DNA repair) which may be potentially useful as predictive biomarkers of response to matched targeted therapies. PMID:28256603
Genomic analysis of primordial dwarfism reveals novel disease genes.

PubMed

Shaheen, Ranad; Faqeih, Eissa; Ansari, Shinu; Abdel-Salam, Ghada; Al-Hassnan, Zuhair N; Al-Shidi, Tarfa; Alomar, Rana; Sogaty, Sameera; Alkuraya, Fowzan S

2014-02-01

Primordial dwarfism (PD) is a disease in which severely impaired fetal growth persists throughout postnatal development and results in stunted adult size. The condition is highly heterogeneous clinically, but the use of certain phenotypic aspects such as head circumference and facial appearance has proven helpful in defining clinical subgroups. In this study, we present the results of clinical and genomic characterization of 16 new patients in whom a broad definition of PD was used (e.g., 3M syndrome was included). We report a novel PD syndrome with distinct facies in two unrelated patients, each with a different homozygous truncating mutation in CRIPT. Our analysis also reveals, in addition to mutations in known PD disease genes, the first instance of biallelic truncating BRCA2 mutation causing PD with normal bone marrow analysis. In addition, we have identified a novel locus for Seckel syndrome based on a consanguineous multiplex family and identified a homozygous truncating mutation in DNA2 as the likely cause. An additional novel PD disease candidate gene XRCC4 was identified by autozygome/exome analysis, and the knockout mouse phenotype is highly compatible with PD. Thus, we add a number of novel genes to the growing list of PD-linked genes, including one which we show to be linked to a novel PD syndrome with a distinct facial appearance. PD is extremely heterogeneous genetically and clinically, and genomic tools are often required to reach a molecular diagnosis.
Genomic analysis of primordial dwarfism reveals novel disease genes

PubMed Central

Shaheen, Ranad; Faqeih, Eissa; Ansari, Shinu; Abdel-Salam, Ghada; Al-Hassnan, Zuhair N.; Al-Shidi, Tarfa; Alomar, Rana; Sogaty, Sameera; Alkuraya, Fowzan S.

2014-01-01

Primordial dwarfism (PD) is a disease in which severely impaired fetal growth persists throughout postnatal development and results in stunted adult size. The condition is highly heterogeneous clinically, but the use of certain phenotypic aspects such as head circumference and facial appearance has proven helpful in defining clinical subgroups. In this study, we present the results of clinical and genomic characterization of 16 new patients in whom a broad definition of PD was used (e.g., 3M syndrome was included). We report a novel PD syndrome with distinct facies in two unrelated patients, each with a different homozygous truncating mutation in CRIPT. Our analysis also reveals, in addition to mutations in known PD disease genes, the first instance of biallelic truncating BRCA2 mutation causing PD with normal bone marrow analysis. In addition, we have identified a novel locus for Seckel syndrome based on a consanguineous multiplex family and identified a homozygous truncating mutation in DNA2 as the likely cause. An additional novel PD disease candidate gene XRCC4 was identified by autozygome/exome analysis, and the knockout mouse phenotype is highly compatible with PD. Thus, we add a number of novel genes to the growing list of PD-linked genes, including one which we show to be linked to a novel PD syndrome with a distinct facial appearance. PD is extremely heterogeneous genetically and clinically, and genomic tools are often required to reach a molecular diagnosis. PMID:24389050
Ancient gene transfer from algae to animals: Mechanisms and evolutionary significance

PubMed Central

2012-01-01

Background Horizontal gene transfer (HGT) is traditionally considered to be rare in multicellular eukaryotes such as animals. Recently, many genes of miscellaneous algal origins were discovered in choanoflagellates. Considering that choanoflagellates are the existing closest relatives of animals, we speculated that ancient HGT might have occurred in the unicellular ancestor of animals and affected the long-term evolution of animals. Results Through genome screening, phylogenetic and domain analyses, we identified 14 gene families, including 92 genes, in the tunicate Ciona intestinalis that are likely derived from miscellaneous photosynthetic eukaryotes. Almost all of these gene families are distributed in diverse animals, suggesting that they were mostly acquired by the common ancestor of animals. Their miscellaneous origins also suggest that these genes are not derived from a particular algal endosymbiont. In addition, most genes identified in our analyses are functionally related to molecule transport, cellular regulation and methylation signaling, suggesting that the acquisition of these genes might have facilitated the intercellular communication in the ancestral animal. Conclusions Our findings provide additional evidence that algal genes in aplastidic eukaryotes are not exclusively derived from historical plastids and thus important for interpreting the evolution of eukaryotic photosynthesis. Most importantly, our data represent the first evidence that more anciently acquired genes might exist in animals and that ancient HGT events have played an important role in animal evolution. PMID:22690978
Global expression analysis of gene regulatory pathways during endocrine pancreatic development.

PubMed

Gu, Guoqiang; Wells, James M; Dombkowski, David; Preffer, Fred; Aronow, Bruce; Melton, Douglas A

2004-01-01

To define genetic pathways that regulate development of the endocrine pancreas, we generated transcriptional profiles of enriched cells isolated from four biologically significant stages of endocrine pancreas development: endoderm before pancreas specification, early pancreatic progenitor cells, endocrine progenitor cells and adult islets of Langerhans. These analyses implicate new signaling pathways in endocrine pancreas development, and identified sets of known and novel genes that are temporally regulated, as well as genes that spatially define developing endocrine cells from their neighbors. The differential expression of several genes from each time point was verified by RT-PCR and in situ hybridization. Moreover, we present preliminary functional evidence suggesting that one transcription factor encoding gene (Myt1), which was identified in our screen, is expressed in endocrine progenitors and may regulate alpha, beta and delta cell development. In addition to identifying new genes that regulate endocrine cell fate, this global gene expression analysis has uncovered informative biological trends that occur during endocrine differentiation.
Partial least squares based identification of Duchenne muscular dystrophy specific genes.

PubMed

An, Hui-bo; Zheng, Hua-cheng; Zhang, Li; Ma, Lin; Liu, Zheng-yan

2013-11-01

Large-scale parallel gene expression analysis has provided a greater ease for investigating the underlying mechanisms of Duchenne muscular dystrophy (DMD). Previous studies typically implemented variance/regression analysis, which would be fundamentally flawed when unaccounted sources of variability in the arrays existed. Here we aim to identify genes that contribute to the pathology of DMD using partial least squares (PLS) based analysis. We carried out PLS-based analysis with two datasets downloaded from the Gene Expression Omnibus (GEO) database to identify genes contributing to the pathology of DMD. Except for the genes related to inflammation, muscle regeneration and extracellular matrix (ECM) modeling, we found some genes with high fold change, which have not been identified by previous studies, such as SRPX, GPNMB, SAT1, and LYZ. In addition, downregulation of the fatty acid metabolism pathway was found, which may be related to the progressive muscle wasting process. Our results provide a better understanding for the downstream mechanisms of DMD.
Assessment of gene order computing methods for Alzheimer's disease

PubMed Central

2013-01-01

Background Computational genomics of Alzheimer disease (AD), the most common form of senile dementia, is a nascent field in AD research. The field includes AD gene clustering by computing gene order which generates higher quality gene clustering patterns than most other clustering methods. However, there are few available gene order computing methods such as Genetic Algorithm (GA) and Ant Colony Optimization (ACO). Further, their performance in gene order computation using AD microarray data is not known. We thus set forth to evaluate the performances of current gene order computing methods with different distance formulas, and to identify additional features associated with gene order computation. Methods Using different distance formulas- Pearson distance and Euclidean distance, the squared Euclidean distance, and other conditions, gene orders were calculated by ACO and GA (including standard GA and improved GA) methods, respectively. The qualities of the gene orders were compared, and new features from the calculated gene orders were identified. Results Compared to the GA methods tested in this study, ACO fits the AD microarray data the best when calculating gene order. In addition, the following features were revealed: different distance formulas generated a different quality of gene order, and the commonly used Pearson distance was not the best distance formula when used with both GA and ACO methods for AD microarray data. Conclusion Compared with Pearson distance and Euclidean distance, the squared Euclidean distance generated the best quality gene order computed by GA and ACO methods. PMID:23369541
Mapping eQTLs in the Norfolk Island Genetic Isolate Identifies Candidate Genes for CVD Risk Traits

PubMed Central

Benton, Miles C.; Lea, Rod A.; Macartney-Coxson, Donia; Carless, Melanie A.; Göring, Harald H.; Bellis, Claire; Hanna, Michelle; Eccles, David; Chambers, Geoffrey K.; Curran, Joanne E.; Harper, Jacquie L.; Blangero, John; Griffiths, Lyn R.

2013-01-01

Cardiovascular disease (CVD) affects millions of people worldwide and is influenced by numerous factors, including lifestyle and genetics. Expression quantitative trait loci (eQTLs) influence gene expression and are good candidates for CVD risk. Founder-effect pedigrees can provide additional power to map genes associated with disease risk. Therefore, we identified eQTLs in the genetic isolate of Norfolk Island (NI) and tested for associations between these and CVD risk factors. We measured genome-wide transcript levels of blood lymphocytes in 330 individuals and used pedigree-based heritability analysis to identify heritable transcripts. eQTLs were identified by genome-wide association testing of these transcripts. Testing for association between CVD risk factors (i.e., blood lipids, blood pressure, and body fat indices) and eQTLs revealed 1,712 heritable transcripts (p < 0.05) with heritability values ranging from 0.18 to 0.84. From these, we identified 200 cis-acting and 70 trans-acting eQTLs (p < 1.84 × 10−7) An eQTL-centric analysis of CVD risk traits revealed multiple associations, including 12 previously associated with CVD-related traits. Trait versus eQTL regression modeling identified four CVD risk candidates (NAAA, PAPSS1, NME1, and PRDX1), all of which have known biological roles in disease. In addition, we implicated several genes previously associated with CVD risk traits, including MTHFR and FN3KRP. We have successfully identified a panel of eQTLs in the NI pedigree and used this to implicate several genes in CVD risk. Future studies are required for further assessing the functional importance of these eQTLs and whether the findings here also relate to outbred populations. PMID:24314549
Analysis of the Prefoldin Gene Family in 14 Plant Species

PubMed Central

Cao, Jun

2016-01-01

Prefoldin is a hexameric molecular chaperone complex present in all eukaryotes and archaea. The evolution of this gene family in plants is unknown. Here, I identified 140 prefoldin genes in 14 plant species. These prefoldin proteins were divided into nine groups through phylogenetic analysis. Highly conserved gene organization and motif distribution exist in each prefoldin group, implying their functional conservation. I also observed the segmental duplication of maize prefoldin gene family. Moreover, a few functional divergence sites were identified within each group pairs. Functional network analyses identified 78 co-expressed genes, and most of them were involved in carrying, binding and kinase activity. Divergent expression profiles of the maize prefoldin genes were further investigated in different tissues and development periods and under auxin and some abiotic stresses. I also found a few cis-elements responding to abiotic stress and phytohormone in the upstream sequences of the maize prefoldin genes. The results provided a foundation for exploring the characterization of the prefoldin genes in plants and will offer insights for additional functional studies. PMID:27014333
Identification of essential genes and synthetic lethal gene combinations in Escherichia coli K-12.

PubMed

Mori, Hirotada; Baba, Tomoya; Yokoyama, Katsushi; Takeuchi, Rikiya; Nomura, Wataru; Makishi, Kazuichi; Otsuka, Yuta; Dose, Hitomi; Wanner, Barry L

2015-01-01

Here we describe the systematic identification of single genes and gene pairs, whose knockout causes lethality in Escherichia coli K-12. During construction of precise single-gene knockout library of E. coli K-12, we identified 328 essential gene candidates for growth in complex (LB) medium. Upon establishment of the Keio single-gene deletion library, we undertook the development of the ASKA single-gene deletion library carrying a different antibiotic resistance. In addition, we developed tools for identification of synthetic lethal gene combinations by systematic construction of double-gene knockout mutants. We introduce these methods herein.
Gene-Trap Mutagenesis Identifies Mammalian Genes Contributing to Intoxication by Clostridium perfringens ε-Toxin

PubMed Central

Ivie, Susan E.; Fennessey, Christine M.; Sheng, Jinsong; Rubin, Donald H.; McClain, Mark S.

2011-01-01

The Clostridium perfringens ε-toxin is an extremely potent toxin associated with lethal toxemias in domesticated ruminants and may be toxic to humans. Intoxication results in fluid accumulation in various tissues, most notably in the brain and kidneys. Previous studies suggest that the toxin is a pore-forming toxin, leading to dysregulated ion homeostasis and ultimately cell death. However, mammalian host factors that likely contribute to ε-toxin-induced cytotoxicity are poorly understood. A library of insertional mutant Madin Darby canine kidney (MDCK) cells, which are highly susceptible to the lethal affects of ε-toxin, was used to select clones of cells resistant to ε-toxin-induced cytotoxicity. The genes mutated in 9 surviving resistant cell clones were identified. We focused additional experiments on one of the identified genes as a means of validating the experimental approach. Gene expression microarray analysis revealed that one of the identified genes, hepatitis A virus cellular receptor 1 (HAVCR1, KIM-1, TIM1), is more abundantly expressed in human kidney cell lines than it is expressed in human cells known to be resistant to ε-toxin. One human kidney cell line, ACHN, was found to be sensitive to the toxin and expresses a larger isoform of the HAVCR1 protein than the HAVCR1 protein expressed by other, toxin-resistant human kidney cell lines. RNA interference studies in MDCK and in ACHN cells confirmed that HAVCR1 contributes to ε-toxin-induced cytotoxicity. Additionally, ε-toxin was shown to bind to HAVCR1 in vitro. The results of this study indicate that HAVCR1 and the other genes identified through the use of gene-trap mutagenesis and RNA interference strategies represent important targets for investigation of the process by which ε-toxin induces cell death and new targets for potential therapeutic intervention. PMID:21412435
Gene-trap mutagenesis identifies mammalian genes contributing to intoxication by Clostridium perfringens ε-toxin.

PubMed

Ivie, Susan E; Fennessey, Christine M; Sheng, Jinsong; Rubin, Donald H; McClain, Mark S

2011-03-11

The Clostridium perfringens ε-toxin is an extremely potent toxin associated with lethal toxemias in domesticated ruminants and may be toxic to humans. Intoxication results in fluid accumulation in various tissues, most notably in the brain and kidneys. Previous studies suggest that the toxin is a pore-forming toxin, leading to dysregulated ion homeostasis and ultimately cell death. However, mammalian host factors that likely contribute to ε-toxin-induced cytotoxicity are poorly understood. A library of insertional mutant Madin Darby canine kidney (MDCK) cells, which are highly susceptible to the lethal affects of ε-toxin, was used to select clones of cells resistant to ε-toxin-induced cytotoxicity. The genes mutated in 9 surviving resistant cell clones were identified. We focused additional experiments on one of the identified genes as a means of validating the experimental approach. Gene expression microarray analysis revealed that one of the identified genes, hepatitis A virus cellular receptor 1 (HAVCR1, KIM-1, TIM1), is more abundantly expressed in human kidney cell lines than it is expressed in human cells known to be resistant to ε-toxin. One human kidney cell line, ACHN, was found to be sensitive to the toxin and expresses a larger isoform of the HAVCR1 protein than the HAVCR1 protein expressed by other, toxin-resistant human kidney cell lines. RNA interference studies in MDCK and in ACHN cells confirmed that HAVCR1 contributes to ε-toxin-induced cytotoxicity. Additionally, ε-toxin was shown to bind to HAVCR1 in vitro. The results of this study indicate that HAVCR1 and the other genes identified through the use of gene-trap mutagenesis and RNA interference strategies represent important targets for investigation of the process by which ε-toxin induces cell death and new targets for potential therapeutic intervention.
Identification of genes showing differential expression profile associated with growth rate in skeletal muscle tissue of Landrace weanling pig.

PubMed

Komatsu, Yuuta; Sukegawa, Shin; Yamashita, Mai; Katsuda, Naoki; Tong, Bin; Ohta, Takeshi; Kose, Hiroyuki; Yamada, Takahisa

2016-06-01

Suppression subtractive hybridization was used to identify genes showing differential expression profile associated with growth rate in skeletal muscle tissue of Landrace weanling pig. Two subtracted cDNA populations were generated from musculus longissimus muscle tissues of selected pigs with extreme expected breeding values at the age of 100 kg. Three upregulated genes (EEF1A2, TSG101 and TTN) and six downregulated genes (ATP5B, ATP5C1, COQ3, HADHA, MYH1 and MYH7) in pig with genetic propensity for higher growth rate were identified by sequence analysis of 12 differentially expressed clones selected by differential screening following the generation of the subtracted cDNA population. Real-time PCR analysis confirmed difference in expression profiles of the identified genes in musculus longissimus muscle tissues between the two Landrace weanling pig groups with divergent genetic propensity for growth rate. Further, differential expression of the identified genes except for the TTN was validated by Western blot analysis. Additionally, the eight genes other than the ATP5C1 colocalized with the same chromosomal positions as QTLs that have been previously identified for growth rate traits. Finally, the changes of expression predicted from gene function suggested association of upregulation of expression of the EEF1A2, TSG101 and TTN genes and downregulation of the ATP5B, ATP5C1, COQ3, HADHA, MYH1 and MYH7 gene expression with increased growth rate. The identified genes will provide an important insight in understanding the molecular mechanism underlying growth rate in Landrace pig breed.
MicroRNA profiling in the dentate gyrus in epileptic rats: The role of miR-187-3p.

PubMed

Zhang, Suya; Kou, Yubin; Hu, Chunmei; Han, Yan

2017-06-01

This study aimed to explore the role of aberrant miRNA expression in epilepsy and to identify more potential genes associated with epileptogenesis.The miRNA expression profile of GSE49850, which included 20 samples from the rat epileptic dentate gyrus at 7, 14, 30, and 90 days after electrical stimulation and 20 additional samples from sham time-matched controls, was downloaded from the Gene Expression Omnibus database. The significantly differentially expressed miRNAs were identified in stimulated samples at each time point compared to time-matched controls, respectively. The target genes of consistently differentially expressed miRNAs were screened from miRDB and microRNA.org databases, followed by Gene Ontology (GO) and pathway enrichment analysis and regulatory network construction. The overlapping target genes for consistently differentially expressed miRNAs were also identified from these 2 databases. Furthermore, the potential binding sites of miRNAs and their target genes were analyzed.Rno-miR-187-3p was consistently downregulated in stimulated groups compared with time-matched controls. The predicted target genes of rno-miR-187-3p were enriched in different GO terms and pathways. In addition, 7 overlapping target genes of rno-miR-187-3p were identified, including NFS1, PAQR4, CAND1, DCLK1, PRKAR2A, AKAP3, and KCNK10. These 7 overlapping target genes were determined to have a different number of matched binding sites with rno-miR-187-3p.Our study suggests that miR-187-3p may play an important role in epilepsy development and progression via regulating numerous target genes, such as NFS1, CAND1, DCLK1, AKAP3, and KCNK10. Determining the underlying mechanism of the role of miR-187-3p in epilepsy may make it a potential therapeutic option.
Genome-wide association studies in East Asians identify new loci for waist-hip ratio and waist circumference

PubMed Central

Wen, Wanqing; Kato, Norihiro; Hwang, Joo-Yeon; Guo, Xingyi; Tabara, Yasuharu; Li, Huaixing; Dorajoo, Rajkumar; Yang, Xiaobo; Tsai, Fuu-Jen; Li, Shengxu; Wu, Ying; Wu, Tangchun; Kim, Soriul; Guo, Xiuqing; Liang, Jun; Shungin, Dmitry; Adair, Linda S.; Akiyama, Koichi; Allison, Matthew; Cai, Qiuyin; Chang, Li-Ching; Chen, Chien-Hsiun; Chen, Yuan-Tsong; Cho, Yoon Shin; Choi, Bo Youl; Gao, Yutang; Go, Min Jin; Gu, Dongfeng; Han, Bok-Ghee; He, Meian; Hixson, James E.; Hu, Yanling; Huang, Tao; Isono, Masato; Jung, Keum Ji; Kang, Daehee; Kim, Young Jin; Kita, Yoshikuni; Lee, Juyoung; Lee, Nanette R.; Lee, Jeannette; Wang, Yiqin; Liu, Jian-Jun; Long, Jirong; Moon, Sanghoon; Nakamura, Yasuyuki; Nakatochi, Masahiro; Ohnaka, Keizo; Rao, Dabeeru; Shi, Jiajun; Sull, Jae Woong; Tan, Aihua; Ueshima, Hirotsugu; Wu, Chen; Xiang, Yong-Bing; Yamamoto, Ken; Yao, Jie; Ye, Xingwang; Yokota, Mitsuhiro; Zhang, Xiaomin; Zheng, Yan; Qi, Lu; Rotter, Jerome I.; Jee, Sun Ha; Lin, Dongxin; Mohlke, Karen L.; He, Jiang; Mo, Zengnan; Wu, Jer-Yuarn; Tai, E. Shyong; Lin, Xu; Miki, Tetsuro; Kim, Bong-Jo; Takeuchi, Fumihiko; Zheng, Wei; Shu, Xiao-Ou

2016-01-01

Sixty genetic loci associated with abdominal obesity, measured by waist circumference (WC) and waist-hip ratio (WHR), have been previously identified, primarily from studies conducted in European-ancestry populations. We conducted a meta-analysis of associations of abdominal obesity with approximately 2.5 million single nucleotide polymorphisms (SNPs) among 53,052 (for WC) and 48,312 (for WHR) individuals of Asian descent, and replicated 33 selected SNPs among 3,762 to 17,110 additional individuals. We identified four novel loci near the EFEMP1, ADAMTSL3 , CNPY2, and GNAS genes that were associated with WC after adjustment for body mass index (BMI); two loci near the NID2 and HLA-DRB5 genes associated with WHR after adjustment for BMI, and three loci near the CEP120, TSC22D2, and SLC22A2 genes associated with WC without adjustment for BMI. Functional enrichment analyses revealed enrichment of corticotropin-releasing hormone signaling, GNRH signaling, and/or CDK5 signaling pathways for those newly-identified loci. Our study provides additional insight on genetic contribution to abdominal obesity. PMID:26785701

Identifying potential maternal genes of Bombyx mori using digital gene expression profiling

PubMed Central

Xu, Pingzhen

2018-01-01

Maternal genes present in mature oocytes play a crucial role in the early development of silkworm. Although maternal genes have been widely studied in many other species, there has been limited research in Bombyx mori. High-throughput next generation sequencing provides a practical method for gene discovery on a genome-wide level. Herein, a transcriptome study was used to identify maternal-related genes from silkworm eggs. Unfertilized eggs from five different stages of early development were used to detect the changing situation of gene expression. The expressed genes showed different patterns over time. Seventy-six maternal genes were annotated according to homology analysis with Drosophila melanogaster. More than half of the differentially expressed maternal genes fell into four expression patterns, while the expression patterns showed a downward trend over time. The functional annotation of these material genes was mainly related to transcription factor activity, growth factor activity, nucleic acid binding, RNA binding, ATP binding, and ion binding. Additionally, twenty-two gene clusters including maternal genes were identified from 18 scaffolds. Altogether, we plotted a profile for the maternal genes of Bombyx mori using a digital gene expression profiling method. This will provide the basis for maternal-specific signature research and improve the understanding of the early development of silkworm. PMID:29462160
Deletion and Gene Expression Analyses Define the Paxilline Biosynthetic Gene Cluster in Penicillium paxilli

PubMed Central

Scott, Barry; Young, Carolyn A.; Saikia, Sanjay; McMillan, Lisa K.; Monahan, Brendon J.; Koulman, Albert; Astin, Jonathan; Eaton, Carla J.; Bryant, Andrea; Wrenn, Ruth E.; Finch, Sarah C.; Tapper, Brian A.; Parker, Emily J.; Jameson, Geoffrey B.

2013-01-01

The indole-diterpene paxilline is an abundant secondary metabolite synthesized by Penicillium paxilli. In total, 21 genes have been identified at the PAX locus of which six have been previously confirmed to have a functional role in paxilline biosynthesis. A combination of bioinformatics, gene expression and targeted gene replacement analyses were used to define the boundaries of the PAX gene cluster. Targeted gene replacement identified seven genes, paxG, paxA, paxM, paxB, paxC, paxP and paxQ that were all required for paxilline production, with one additional gene, paxD, required for regular prenylation of the indole ring post paxilline synthesis. The two putative transcription factors, PP104 and PP105, were not co-regulated with the pax genes and based on targeted gene replacement, including the double knockout, did not have a role in paxilline production. The relationship of indole dimethylallyl transferases involved in prenylation of indole-diterpenes such as paxilline or lolitrem B, can be found as two disparate clades, not supported by prenylation type (e.g., regular or reverse). This paper provides insight into the P. paxilli indole-diterpene locus and reviews the recent advances identified in paxilline biosynthesis. PMID:23949005
LNDriver: identifying driver genes by integrating mutation and expression data based on gene-gene interaction network.

PubMed

Wei, Pi-Jing; Zhang, Di; Xia, Junfeng; Zheng, Chun-Hou

2016-12-23

Cancer is a complex disease which is characterized by the accumulation of genetic alterations during the patient's lifetime. With the development of the next-generation sequencing technology, multiple omics data, such as cancer genomic, epigenomic and transcriptomic data etc., can be measured from each individual. Correspondingly, one of the key challenges is to pinpoint functional driver mutations or pathways, which contributes to tumorigenesis, from millions of functional neutral passenger mutations. In this paper, in order to identify driver genes effectively, we applied a generalized additive model to mutation profiles to filter genes with long length and constructed a new gene-gene interaction network. Then we integrated the mutation data and expression data into the gene-gene interaction network. Lastly, greedy algorithm was used to prioritize candidate driver genes from the integrated data. We named the proposed method Length-Net-Driver (LNDriver). Experiments on three TCGA datasets, i.e., head and neck squamous cell carcinoma, kidney renal clear cell carcinoma and thyroid carcinoma, demonstrated that the proposed method was effective. Also, it can identify not only frequently mutated drivers, but also rare candidate driver genes.
Identification of KCNJ11 as a functional candidate gene for bovine meat tenderness.

PubMed

Tizioto, Polyana C; Gasparin, Gustavo; Souza, Marcela M; Mudadu, Mauricio A; Coutinho, Luiz L; Mourão, Gerson B; Tholon, Patricia; Meirelles, Sarah L C; Tullio, Rymer R; Rosa, Antônio N; Alencar, Maurício M; Medeiros, Sérgio R; Siqueira, Fabiane; Feijó, Gelson L D; Nassu, Renata T; Regitano, Luciana C A

2013-12-15

The potassium inwardly rectifying channel, subfamily J, member 11 (KCNJ11) gene was investigated as a candidate for meat tenderness based on the effects reported on muscle for KCNJ11 gene knockout in rat models and its position in a quantitative trait locus (QTL) for meat tenderness in the bovine genome. Sequence variations in the KCNJ11 gene were described by sequencing six amplified fragments, covering almost the entire gene. We identified single nucleotide polymorphisms (SNP) and validated them by different approaches, taking advantage of simultaneous projects that are being developed with the same Nelore population. By sequencing the KCNJ11 in Nelore steers representing extreme phenotypes for Warner-Bratzler shear force (WBSF), it was possible to identify 22 SNPs. We validated two of the identified markers by genotyping the whole population (n = 460). Analysis of association between genotypes and WBSF values revealed a significant additive effect of a SNP at different meat aging times (P ≤ 0.05). In addition, an association between the expression levels of KCNJ11 and WBSF was found, with lower expression levels of KCNJ11 associated with more tender meat (P ≤ 0.05). The results showed that the KCNJ11 gene is a candidate mapped to a QTL for meat tenderness previously identified on BTA15 and may be useful to identify animals with genetic potential to produce tender meat. The effect of KCNJ11 observed on muscle is potentially due to changes in activity of KATP channels, which in turn influence the flow of potassium in the intracellular space, allowing establishment of the membrane potential necessary for muscle contraction.
Identification of modulators of the nuclear receptor peroxisome proliferator-activated receptor α (PPARα) in a mouse liver gene expression compendium.

PubMed

Oshida, Keiyu; Vasani, Naresh; Thomas, Russell S; Applegate, Dawn; Rosen, Mitch; Abbott, Barbara; Lau, Christopher; Guo, Grace; Aleksunes, Lauren M; Klaassen, Curtis; Corton, J Christopher

2015-01-01

The nuclear receptor family member peroxisome proliferator-activated receptor α (PPARα) is activated by therapeutic hypolipidemic drugs and environmentally-relevant chemicals to regulate genes involved in lipid transport and catabolism. Chronic activation of PPARα in rodents increases liver cancer incidence, whereas suppression of PPARα activity leads to hepatocellular steatosis. Analytical approaches were developed to identify biosets (i.e., gene expression differences between two conditions) in a genomic database in which PPARα activity was altered. A gene expression signature of 131 PPARα-dependent genes was built using microarray profiles from the livers of wild-type and PPARα-null mice after exposure to three structurally diverse PPARα activators (WY-14,643, fenofibrate and perfluorohexane sulfonate). A fold-change rank-based test (Running Fisher's test (p-value ≤ 10(-4))) was used to evaluate the similarity between the PPARα signature and a test set of 48 and 31 biosets positive or negative, respectively for PPARα activation; the test resulted in a balanced accuracy of 98%. The signature was then used to identify factors that activate or suppress PPARα in an annotated mouse liver/primary hepatocyte gene expression compendium of ~1850 biosets. In addition to the expected activation of PPARα by fibrate drugs, di(2-ethylhexyl) phthalate, and perfluorinated compounds, PPARα was activated by benzofuran, galactosamine, and TCDD and suppressed by hepatotoxins acetaminophen, lipopolysaccharide, silicon dioxide nanoparticles, and trovafloxacin. Additional factors that activate (fasting, caloric restriction) or suppress (infections) PPARα were also identified. This study 1) developed methods useful for future screening of environmental chemicals, 2) identified chemicals that activate or suppress PPARα, and 3) identified factors including diets and infections that modulate PPARα activity and would be hypothesized to affect chemical-induced PPARα activity.
Identification of Modulators of the Nuclear Receptor Peroxisome Proliferator-Activated Receptor α (PPARα) in a Mouse Liver Gene Expression Compendium

PubMed Central

Oshida, Keiyu; Vasani, Naresh; Thomas, Russell S.; Applegate, Dawn; Rosen, Mitch; Abbott, Barbara; Lau, Christopher; Guo, Grace; Aleksunes, Lauren M.; Klaassen, Curtis; Corton, J. Christopher

2015-01-01

The nuclear receptor family member peroxisome proliferator-activated receptor α (PPARα) is activated by therapeutic hypolipidemic drugs and environmentally-relevant chemicals to regulate genes involved in lipid transport and catabolism. Chronic activation of PPARα in rodents increases liver cancer incidence, whereas suppression of PPARα activity leads to hepatocellular steatosis. Analytical approaches were developed to identify biosets (i.e., gene expression differences between two conditions) in a genomic database in which PPARα activity was altered. A gene expression signature of 131 PPARα-dependent genes was built using microarray profiles from the livers of wild-type and PPARα-null mice after exposure to three structurally diverse PPARα activators (WY-14,643, fenofibrate and perfluorohexane sulfonate). A fold-change rank-based test (Running Fisher’s test (p-value ≤ 10-4)) was used to evaluate the similarity between the PPARα signature and a test set of 48 and 31 biosets positive or negative, respectively for PPARα activation; the test resulted in a balanced accuracy of 98%. The signature was then used to identify factors that activate or suppress PPARα in an annotated mouse liver/primary hepatocyte gene expression compendium of ~1850 biosets. In addition to the expected activation of PPARα by fibrate drugs, di(2-ethylhexyl) phthalate, and perfluorinated compounds, PPARα was activated by benzofuran, galactosamine, and TCDD and suppressed by hepatotoxins acetaminophen, lipopolysaccharide, silicon dioxide nanoparticles, and trovafloxacin. Additional factors that activate (fasting, caloric restriction) or suppress (infections) PPARα were also identified. This study 1) developed methods useful for future screening of environmental chemicals, 2) identified chemicals that activate or suppress PPARα, and 3) identified factors including diets and infections that modulate PPARα activity and would be hypothesized to affect chemical-induced PPARα activity. PMID:25689681
Common variants at the CHEK2 gene locus and risk of epithelial ovarian cancer

PubMed Central

Lawrenson, Kate; Iversen, Edwin S.; Tyrer, Jonathan; Weber, Rachel Palmieri; Concannon, Patrick; Hazelett, Dennis J.; Li, Qiyuan; Marks, Jeffrey R.; Berchuck, Andrew; Lee, Janet M.; Aben, Katja K.H.; Anton-Culver, Hoda; Antonenkova, Natalia; Bandera, Elisa V.; Bean, Yukie; Beckmann, Matthias W.; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A.; Brooks-Wilson, Angela; Bruinsma, Fiona; Butzow, Ralf; Campbell, Ian G.; Carty, Karen; Chang-Claude, Jenny; Chenevix-Trench, Georgia; Chen, Ann; Chen, Zhihua; Cook, Linda S.; Cramer, Daniel W.; Cunningham, Julie M.; Cybulski, Cezary; Plisiecka-Halasa, Joanna; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A.; Dörk, Thilo; du Bois, Andreas; Eccles, Diana; Easton, Douglas T.; Edwards, Robert P.; Eilber, Ursula; Ekici, Arif B.; Fasching, Peter A.; Fridley, Brooke L.; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind; Goode, Ellen L.; Goodman, Marc T.; Gronwald, Jacek; Harter, Philipp; Hasmad, Hanis Nazihah; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A.T.; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus; Hosono, Satoyo; Jakubowska, Anna; Paul, James; Jensen, Allan; Karlan, Beth Y.; Kjaer, Susanne Kruger; Kelemen, Linda E.; Kellar, Melissa; Kelley, Joseph L.; Kiemeney, Lambertus A.; Krakstad, Camilla; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D.; Lee, Alice W.; Cannioto, Rikki; Leminen, Arto; Lester, Jenny; Levine, Douglas A.; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F.A.G.; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R.; Nevanlinna, Heli; McNeish, Iain; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B.; Narod, Steven A.; Nedergaard, Lotte; Ness, Roberta B.; Noor Azmi, Mat Adenan; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Orsulic, Sandra; Pearce, Celeste L.; Pejovic, Tanja; Pelttari, Liisa M.; Permuth-Wey, Jennifer; Phelan, Catherine M.; Pike, Malcolm C.; Poole, Elizabeth M.; Ramus, Susan J.; Risch, Harvey A.; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H.; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Budzilowska, Agnieszka; Sellers, Thomas A.; Shu, Xiao-Ou; Shvetsov, Yurii B.; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Sucheston, Lara; Tangen, Ingvild L.; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J.; Timorek, Agnieszka; Tworoger, Shelley S.; Nieuwenhuysen, Els Van; Vergote, Ignace; Vierkant, Robert A.; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S.; Wicklund, Kristine G.; Wilkens, Lynne R.; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna H.; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Coetzee, Gerhard A.; Freedman, Matthew L.; Monteiro, Alvaro N.A.; Moes-Sosnowska, Joanna; Kupryjanczyk, Jolanta; Pharoah, Paul D.; Gayther, Simon A.; Schildkraut, Joellen M.

2015-01-01

Genome-wide association studies have identified 20 genomic regions associated with risk of epithelial ovarian cancer (EOC), but many additional risk variants may exist. Here, we evaluated associations between common genetic variants [single nucleotide polymorphisms (SNPs) and indels] in DNA repair genes and EOC risk. We genotyped 2896 common variants at 143 gene loci in DNA samples from 15 397 patients with invasive EOC and controls. We found evidence of associations with EOC risk for variants at FANCA, EXO1, E2F4, E2F2, CREB5 and CHEK2 genes (P ≤ 0.001). The strongest risk association was for CHEK2 SNP rs17507066 with serous EOC (P = 4.74 x 10–7). Additional genotyping and imputation of genotypes from the 1000 genomes project identified a slightly more significant association for CHEK2 SNP rs6005807 (r 2 with rs17507066 = 0.84, odds ratio (OR) 1.17, 95% CI 1.11–1.24, P = 1.1×10−7). We identified 293 variants in the region with likelihood ratios of less than 1:100 for representing the causal variant. Functional annotation identified 25 candidate SNPs that alter transcription factor binding sites within regulatory elements active in EOC precursor tissues. In The Cancer Genome Atlas dataset, CHEK2 gene expression was significantly higher in primary EOCs compared to normal fallopian tube tissues (P = 3.72×10−8). We also identified an association between genotypes of the candidate causal SNP rs12166475 (r 2 = 0.99 with rs6005807) and CHEK2 expression (P = 2.70×10-8). These data suggest that common variants at 22q12.1 are associated with risk of serous EOC and CHEK2 as a plausible target susceptibility gene. PMID:26424751
Computational identification and validation of alternative splicing in ZSF1 rat RNA-seq data, a preclinical model for type 2 diabetic nephropathy.

PubMed

Zhang, Chi; Dower, Ken; Zhang, Baohong; Martinez, Robert V; Lin, Lih-Ling; Zhao, Shanrong

2018-05-16

Obese ZSF1 rats exhibit spontaneous time-dependent diabetic nephropathy and are considered to be a highly relevant animal model of progressive human diabetic kidney disease. We previously identified gene expression changes between disease and control animals across six time points from 12 to 41 weeks. In this study, the same data were analysed at the isoform and exon levels to reveal additional disease mechanisms that may be governed by alternative splicing. Our analyses identified alternative splicing patterns in genes that may be implicated in disease pathogenesis (such as Shc1, Serpinc1, Epb4.1l5, and Il-33), which would have been overlooked in standard gene-level analysis. The alternatively spliced genes were enriched in pathways related to cell adhesion, cell-cell interactions/junctions, and cytoskeleton signalling, whereas the differentially expressed genes were enriched in pathways related to immune response, G protein-coupled receptor, and cAMP signalling. Our findings indicate that additional mechanistic insights can be gained from exon- and isoform-level data analyses over standard gene-level analysis. Considering alternative splicing is poorly conserved between rodents and humans, it is noted that this work is not translational, but the point holds true that additional insights can be gained from alternative splicing analysis of RNA-seq data.
Identification of key ancestors of modern germplasm in a breeding program of maize.

PubMed

Technow, F; Schrag, T A; Schipprack, W; Melchinger, A E

2014-12-01

Probabilities of gene origin computed from the genomic kinships matrix can accurately identify key ancestors of modern germplasms Identifying the key ancestors of modern plant breeding populations can provide valuable insights into the history of a breeding program and provide reference genomes for next generation whole genome sequencing. In an animal breeding context, a method was developed that employs probabilities of gene origin, computed from the pedigree-based additive kinship matrix, for identifying key ancestors. Because reliable and complete pedigree information is often not available in plant breeding, we replaced the additive kinship matrix with the genomic kinship matrix. As a proof-of-concept, we applied this approach to simulated data sets with known ancestries. The relative contribution of the ancestral lines to later generations could be determined with high accuracy, with and without selection. Our method was subsequently used for identifying the key ancestors of the modern Dent germplasm of the public maize breeding program of the University of Hohenheim. We found that the modern germplasm can be traced back to six or seven key ancestors, with one or two of them having a disproportionately large contribution. These results largely corroborated conjectures based on early records of the breeding program. We conclude that probabilities of gene origin computed from the genomic kinships matrix can be used for identifying key ancestors in breeding programs and estimating the proportion of genes contributed by them.
Cellulose as an extracellular matrix component present in Enterobacter sakazakii biofilms.

PubMed

Grimm, Maya; Stephan, Roger; Iversen, Carol; Manzardo, Giuseppe G G; Rattei, Thomas; Riedel, Kathrin; Ruepp, Andreas; Frishman, Dmitrij; Lehner, Angelika

2008-01-01

Cellulose was identified and characterized as an extracellular matrix component present in the biofilm of an Enterobacter sakazakii clinical isolate grown in nutrient-deficient (M9) medium. Using a bacterial artificial cloning approach in Escherichia coli and subsequent screening of transformants for fluorescence on calcofluor plates, nine genes organized in two operons were identified as putatively responsible for the biosynthesis of cellulose. In addition to the genes already described for cellulose production, two more genes were identified, putatively transcribed together with the genes from the first operon. Putative cellulose in E. sakazakii ES5 biofilm grown on glass coverslips was visualized by calcofluor staining and confocal fluorescence laser scanning microscopy. For the first time, the presence of cellulose in biofilms produced by E. sakazakii was confirmed by methylation analysis.
The human cumulus--oocyte complex gene-expression profile

PubMed Central

Assou, Said; Anahory, Tal; Pantesco, Véronique; Le Carrour, Tanguy; Pellestor, Franck; Klein, Bernard; Reyftmann, Lionel; Dechaud, Hervé; De Vos, John; Hamamah, Samir

2006-01-01

BACKGROUND The understanding of the mechanisms regulating human oocyte maturation is still rudimentary. We have identified transcripts differentially expressed between immature and mature oocytes, and cumulus cells. METHODS Using oligonucleotides microarrays, genome wide gene expression was studied in pooled immature and mature oocytes or cumulus cells from patients who underwent IVF. RESULTS In addition to known genes such as DAZL, BMP15 or GDF9, oocytes upregulated 1514 genes. We show that PTTG3 and AURKC are respectively the securin and the Aurora kinase preferentially expressed during oocyte meiosis. Strikingly, oocytes overexpressed previously unreported growth factors such as TNFSF13/APRIL, FGF9, FGF14, and IL4, and transcription factors including OTX2, SOX15 and SOX30. Conversely, cumulus cells, in addition to known genes such as LHCGR or BMPR2, overexpressed cell-tocell signaling genes including TNFSF11/RANKL, numerous complement components, semaphorins (SEMA3A, SEMA6A, SEMA6D) and CD genes such as CD200. We also identified 52 genes progressively increasing during oocyte maturation, comprising CDC25A and SOCS7. CONCLUSION The identification of genes up and down regulated during oocyte maturation greatly improves our understanding of oocyte biology and will provide new markers that signal viable and competent oocytes. Furthermore, genes found expressed in cumulus cells are potential markers of granulosa cell tumors. PMID:16571642
Incorporation of a horizontally transferred gene into an operon during cnidarian evolution.

PubMed

Dana, Catherine E; Glauber, Kristine M; Chan, Titus A; Bridge, Diane M; Steele, Robert E

2012-01-01

Genome sequencing has revealed examples of horizontally transferred genes, but we still know little about how such genes are incorporated into their host genomes. We have previously reported the identification of a gene (flp) that appears to have entered the Hydra genome through horizontal transfer. Here we provide additional evidence in support of our original hypothesis that the transfer was from a unicellular organism, and we show that the transfer occurred in an ancestor of two medusozoan cnidarian species. In addition we show that the gene is part of a bicistronic operon in the Hydra genome. These findings identify a new animal phylum in which trans-spliced leader addition has led to the formation of operons, and define the requirements for evolution of an operon in Hydra. The identification of operons in Hydra also provides a tool that can be exploited in the construction of transgenic Hydra strains.
Genetics of alcoholism.

PubMed

Edenberg, Howard J; Foroud, Tatiana

2014-01-01

Multiple lines of evidence strongly indicate that genetic factors contribute to the risk for alcohol use disorders (AUD). There is substantial heterogeneity in AUD, which complicates studies seeking to identify specific genetic factors. To identify these genetic effects, several different alcohol-related phenotypes have been analyzed, including diagnosis and quantitative measures related to AUDs. Study designs have used candidate gene analyses, genetic linkage studies, genomewide association studies (GWAS), and analyses of rare variants. Two genes that encode enzymes of alcohol metabolism have the strongest effect on AUD: aldehyde dehydrogenase 2 and alcohol dehydrogenase 1B each has strongly protective variants that reduce risk, with odds ratios approximately 0.2-0.4. A number of other genes important in AUD have been identified and replicated, including GABRA2 and alcohol dehydrogenases 1B and 4. GWAS have identified additional candidates. Rare variants are likely also to play a role; studies of these are just beginning. A multifaceted approach to gene identification, targeting both rare and common variations and assembling much larger datasets for meta-analyses, is critical for identifying the key genes and pathways important in AUD. © 2014 Elsevier B.V. All rights reserved.
Genetic Susceptibility to Vitiligo: GWAS Approaches for Identifying Vitiligo Susceptibility Genes and Loci

PubMed Central

Shen, Changbing; Gao, Jing; Sheng, Yujun; Dou, Jinfa; Zhou, Fusheng; Zheng, Xiaodong; Ko, Randy; Tang, Xianfa; Zhu, Caihong; Yin, Xianyong; Sun, Liangdan; Cui, Yong; Zhang, Xuejun

2016-01-01

Vitiligo is an autoimmune disease with a strong genetic component, characterized by areas of depigmented skin resulting from loss of epidermal melanocytes. Genetic factors are known to play key roles in vitiligo through discoveries in association studies and family studies. Previously, vitiligo susceptibility genes were mainly revealed through linkage analysis and candidate gene studies. Recently, our understanding of the genetic basis of vitiligo has been rapidly advancing through genome-wide association study (GWAS). More than 40 robust susceptible loci have been identified and confirmed to be associated with vitiligo by using GWAS. Most of these associated genes participate in important pathways involved in the pathogenesis of vitiligo. Many susceptible loci with unknown functions in the pathogenesis of vitiligo have also been identified, indicating that additional molecular mechanisms may contribute to the risk of developing vitiligo. In this review, we summarize the key loci that are of genome-wide significance, which have been shown to influence vitiligo risk. These genetic loci may help build the foundation for genetic diagnosis and personalize treatment for patients with vitiligo in the future. However, substantial additional studies, including gene-targeted and functional studies, are required to confirm the causality of the genetic variants and their biological relevance in the development of vitiligo. PMID:26870082
Transcriptome response of the foundation plant Spartina alterniflora to the Deepwater Horizon oil spill.

PubMed

Alvarez, Mariano; Ferreira de Carvalho, Julie; Salmon, Armel; Ainouche, Malika L; Cavé-Radet, Armand; El Amrani, Abdelhak; Foster, Tammy E; Moyer, Sydney; Richards, Christina L

2018-06-04

Despite the severe impacts of the Deepwater Horizon oil spill, the foundation plant species Spartina alterniflora proved resilient to heavy oiling, providing an opportunity to identify mechanisms of response to the anthropogenic stress of crude oil exposure. We assessed plants from oil-affected and unaffected populations using a custom DNA microarray to identify genomewide transcription patterns and gene expression networks that respond to crude oil exposure. In addition, we used T-DNA insertion lines of the model grass Brachypodium distachyon to assess the contribution of four novel candidate genes to crude oil response. Responses in S. alterniflora to hydrocarbon exposure across the transcriptome as well as xenobiotic specific response pathways had little overlap with those previously identified in the model plant Arabidopsis thaliana. Among T-DNA insertion lines of B. distachyon, we found additional support for two candidate genes, one (ATTPS21) involved in volatile production, and the other (SUVH5) involved in epigenetic regulation of gene expression, that may be important in the response to crude oil. The architecture of crude oil response in S. alterniflora is unique from that of the model species A. thaliana, suggesting that xenobiotic response may be highly variable across plant species. In addition, further investigations of regulatory networks may benefit from more information about epigenetic response pathways. © 2018 John Wiley & Sons Ltd.
Identification of genes expressed in the hermaphrodite germ line of C. elegans using SAGE

PubMed Central

Wang, Xin; Zhao, Yongjun; Wong, Kim; Ehlers, Peter; Kohara, Yuji; Jones, Steven J; Marra, Marco A; Holt, Robert A; Moerman, Donald G; Hansen, Dave

2009-01-01

Background Germ cells must progress through elaborate developmental stages from an undifferentiated germ cell to a fully differentiated gamete. Some of these stages include exiting mitosis and entering meiosis, progressing through the various stages of meiotic prophase, adopting either a male (sperm) or female (oocyte) fate, and completing meiosis. Additionally, many of the factors needed to drive embryogenesis are synthesized in the germ line. To increase our understanding of the genes that might be necessary for the formation and function of the germ line, we have constructed a SAGE library from hand dissected C. elegans hermaphrodite gonads. Results We found that 4699 genes, roughly 21% of all known C. elegans genes, are expressed in the adult hermaphrodite germ line. Ribosomal genes are highly expressed in the germ line; roughly four fold above their expression levels in the soma. We further found that 1063 of the germline-expressed genes have enriched expression in the germ line as compared to the soma. A comparison of these 1063 germline-enriched genes with a similar list of genes prepared using microarrays revealed an overlap of 460 genes, mutually reinforcing the two lists. Additionally, we identified 603 germline-enriched genes, supported by in situ expression data, which were not previously identified. We also found >4 fold enrichment for RNA binding proteins in the germ line as compared to the soma. Conclusion Using multiple technological platforms provides a more complete picture of global gene expression patterns. Genes involved in RNA metabolism are expressed at a significantly higher level in the germ line than the soma, suggesting a stronger reliance on RNA metabolism for control of the expression of genes in the germ line. Additionally, the number and expression level of germ line expressed genes on the X chromosome is lower than expected based on a random distribution. PMID:19426519
A cluster of bacterial genes for anaerobic benzene ring biodegradation

PubMed Central

Egland, Paul G.; Pelletier, Dale A.; Dispensa, Marilyn; Gibson, Jane; Harwood, Caroline S.

1997-01-01

A reductive benzoate pathway is the central conduit for the anaerobic biodegradation of aromatic pollutants and lignin monomers. Benzene ring reduction requires a large input of energy and this metabolic capability has, so far, been reported only in bacteria. To determine the molecular basis for this environmentally important process, we cloned and analyzed genes required for the anaerobic degradation of benzoate and related compounds from the phototrophic bacterium, Rhodopseudomonas palustris. A cluster of 24 genes was identified that includes twelve genes likely to be involved in anaerobic benzoate degradation and additional genes that convert the related compounds 4-hydroxybenzoate and cyclohexanecarboxylate to benzoyl-CoA. Genes encoding benzoyl-CoA reductase, a novel enzyme able to overcome the resonance stability of the aromatic ring, were identified by directed mutagenesis. The gene encoding the ring-cleavage enzyme, 2-ketocyclohexanecarboxyl-CoA hydrolase, was identified by assaying the enzymatic activity of the protein expressed in Escherichia coli. Physiological data and DNA sequence analyses indicate that the benzoate pathway consists of unusual enzymes for ring reduction and cleavage interposed among enzymes homologous to those catalyzing fatty acid degradation. The cloned genes should be useful as probes to identify benzoate degradation genes from other metabolically distinct groups of anaerobic bacteria, such as denitrifying bacteria and sulfate-reducing bacteria. PMID:9177244
Computational Analysis of Candidate Disease Genes and Variants for Salt-Sensitive Hypertension in Indigenous Southern Africans

PubMed Central

Tiffin, Nicki; Meintjes, Ayton; Ramesar, Rajkumar; Bajic, Vladimir B.; Rayner, Brian

2010-01-01

Multiple factors underlie susceptibility to essential hypertension, including a significant genetic and ethnic component, and environmental effects. Blood pressure response of hypertensive individuals to salt is heterogeneous, but salt sensitivity appears more prevalent in people of indigenous African origin. The underlying genetics of salt-sensitive hypertension, however, are poorly understood. In this study, computational methods including text- and data-mining have been used to select and prioritize candidate aetiological genes for salt-sensitive hypertension. Additionally, we have compared allele frequencies and copy number variation for single nucleotide polymorphisms in candidate genes between indigenous Southern African and Caucasian populations, with the aim of identifying candidate genes with significant variability between the population groups: identifying genetic variability between population groups can exploit ethnic differences in disease prevalence to aid with prioritisation of good candidate genes. Our top-ranking candidate genes include parathyroid hormone precursor (PTH) and type-1angiotensin II receptor (AGTR1). We propose that the candidate genes identified in this study warrant further investigation as potential aetiological genes for salt-sensitive hypertension. PMID:20886000
SH3BP4, a novel pigmentation gene, is inversely regulated by miR-125b and MITF

PubMed Central

Kim, Kyu-Han; Lee, Tae Ryong; Cho, Eun-Gyung

2017-01-01

Our previous work has identified miR-125b as a negative regulator of melanogenesis. However, the specific melanogenesis-related genes targeted by this miRNA had not been identified. In this study, we established a screening strategy involving three consecutive analytical approaches—analysis of target genes of miR-125b, expression correlation analysis between each target gene and representative pigmentary genes, and functional analysis of candidate genes related to melanogenesis—to discover melanogenesis-related genes targeted by miR-125b. Through these analyses, we identified SRC homology 3 domain-binding protein 4 (SH3BP4) as a novel pigmentation gene. In addition, by combining bioinformatics analysis and experimental validation, we demonstrated that SH3BP4 is a direct target of miR-125b. Finally, we found that SH3BP4 is transcriptionally regulated by microphthalmia-associated transcription factor as its direct target. These findings provide important insights into the roles of miRNAs and their targets in melanogenesis. PMID:28819321
High-Throughput Genetic Screens Identify a Large and Diverse Collection of New Sporulation Genes in Bacillus subtilis.

PubMed

Meeske, Alexander J; Rodrigues, Christopher D A; Brady, Jacqueline; Lim, Hoong Chuin; Bernhardt, Thomas G; Rudner, David Z

2016-01-01

The differentiation of the bacterium Bacillus subtilis into a dormant spore is among the most well-characterized developmental pathways in biology. Classical genetic screens performed over the past half century identified scores of factors involved in every step of this morphological process. More recently, transcriptional profiling uncovered additional sporulation-induced genes required for successful spore development. Here, we used transposon-sequencing (Tn-seq) to assess whether there were any sporulation genes left to be discovered. Our screen identified 133 out of the 148 genes with known sporulation defects. Surprisingly, we discovered 24 additional genes that had not been previously implicated in spore formation. To investigate their functions, we used fluorescence microscopy to survey early, middle, and late stages of differentiation of null mutants from the B. subtilis ordered knockout collection. This analysis identified mutants that are delayed in the initiation of sporulation, defective in membrane remodeling, and impaired in spore maturation. Several mutants had novel sporulation phenotypes. We performed in-depth characterization of two new factors that participate in cell-cell signaling pathways during sporulation. One (SpoIIT) functions in the activation of σE in the mother cell; the other (SpoIIIL) is required for σG activity in the forespore. Our analysis also revealed that as many as 36 sporulation-induced genes with no previously reported mutant phenotypes are required for timely spore maturation. Finally, we discovered a large set of transposon insertions that trigger premature initiation of sporulation. Our results highlight the power of Tn-seq for the discovery of new genes and novel pathways in sporulation and, combined with the recently completed null mutant collection, open the door for similar screens in other, less well-characterized processes.

High-Throughput Genetic Screens Identify a Large and Diverse Collection of New Sporulation Genes in Bacillus subtilis

PubMed Central

Brady, Jacqueline; Lim, Hoong Chuin; Bernhardt, Thomas G.; Rudner, David Z.

2016-01-01

The differentiation of the bacterium Bacillus subtilis into a dormant spore is among the most well-characterized developmental pathways in biology. Classical genetic screens performed over the past half century identified scores of factors involved in every step of this morphological process. More recently, transcriptional profiling uncovered additional sporulation-induced genes required for successful spore development. Here, we used transposon-sequencing (Tn-seq) to assess whether there were any sporulation genes left to be discovered. Our screen identified 133 out of the 148 genes with known sporulation defects. Surprisingly, we discovered 24 additional genes that had not been previously implicated in spore formation. To investigate their functions, we used fluorescence microscopy to survey early, middle, and late stages of differentiation of null mutants from the B. subtilis ordered knockout collection. This analysis identified mutants that are delayed in the initiation of sporulation, defective in membrane remodeling, and impaired in spore maturation. Several mutants had novel sporulation phenotypes. We performed in-depth characterization of two new factors that participate in cell–cell signaling pathways during sporulation. One (SpoIIT) functions in the activation of σE in the mother cell; the other (SpoIIIL) is required for σG activity in the forespore. Our analysis also revealed that as many as 36 sporulation-induced genes with no previously reported mutant phenotypes are required for timely spore maturation. Finally, we discovered a large set of transposon insertions that trigger premature initiation of sporulation. Our results highlight the power of Tn-seq for the discovery of new genes and novel pathways in sporulation and, combined with the recently completed null mutant collection, open the door for similar screens in other, less well-characterized processes. PMID:26735940
Identification of Enzyme Genes Using Chemical Structure Alignments of Substrate-Product Pairs.

PubMed

Moriya, Yuki; Yamada, Takuji; Okuda, Shujiro; Nakagawa, Zenichi; Kotera, Masaaki; Tokimatsu, Toshiaki; Kanehisa, Minoru; Goto, Susumu

2016-03-28

Although there are several databases that contain data on many metabolites and reactions in biochemical pathways, there is still a big gap in the numbers between experimentally identified enzymes and metabolites. It is supposed that many catalytic enzyme genes are still unknown. Although there are previous studies that estimate the number of candidate enzyme genes, these studies required some additional information aside from the structures of metabolites such as gene expression and order in the genome. In this study, we developed a novel method to identify a candidate enzyme gene of a reaction using the chemical structures of the substrate-product pair (reactant pair). The proposed method is based on a search for similar reactant pairs in a reference database and offers ortholog groups that possibly mediate the given reaction. We applied the proposed method to two experimentally validated reactions. As a result, we confirmed that the histidine transaminase was correctly identified. Although our method could not directly identify the asparagine oxo-acid transaminase, we successfully found the paralog gene most similar to the correct enzyme gene. We also applied our method to infer candidate enzyme genes in the mesaconate pathway. The advantage of our method lies in the prediction of possible genes for orphan enzyme reactions where any associated gene sequences are not determined yet. We believe that this approach will facilitate experimental identification of genes for orphan enzymes.
Genome-Wide Identification and Mapping of NBS-Encoding Resistance Genes in Solanum tuberosum Group Phureja

PubMed Central

Lozano, Roberto; Ponce, Olga; Ramirez, Manuel; Mostajo, Nelly; Orjeda, Gisella

2012-01-01

The majority of disease resistance (R) genes identified to date in plants encode a nucleotide-binding site (NBS) and leucine-rich repeat (LRR) domain containing protein. Additional domains such as coiled-coil (CC) and TOLL/interleukin-1 receptor (TIR) domains can also be present. In the recently sequenced Solanum tuberosum group phureja genome we used HMM models and manual curation to annotate 435 NBS-encoding R gene homologs and 142 NBS-derived genes that lack the NBS domain. Highly similar homologs for most previously documented Solanaceae R genes were identified. A surprising ∼41% (179) of the 435 NBS-encoding genes are pseudogenes primarily caused by premature stop codons or frameshift mutations. Alignment of 81.80% of the 577 homologs to S. tuberosum group phureja pseudomolecules revealed non-random distribution of the R-genes; 362 of 470 genes were found in high density clusters on 11 chromosomes. PMID:22493716
Coexpression landscape in ATTED-II: usage of gene list and gene network for various types of pathways.

PubMed

Obayashi, Takeshi; Kinoshita, Kengo

2010-05-01

Gene coexpression analyses are a powerful method to predict the function of genes and/or to identify genes that are functionally related to query genes. The basic idea of gene coexpression analyses is that genes with similar functions should have similar expression patterns under many different conditions. This approach is now widely used by many experimental researchers, especially in the field of plant biology. In this review, we will summarize recent successful examples obtained by using our gene coexpression database, ATTED-II. Specifically, the examples will describe the identification of new genes, such as the subunits of a complex protein, the enzymes in a metabolic pathway and transporters. In addition, we will discuss the discovery of a new intercellular signaling factor and new regulatory relationships between transcription factors and their target genes. In ATTED-II, we provide two basic views of gene coexpression, a gene list view and a gene network view, which can be used as guide gene approach and narrow-down approach, respectively. In addition, we will discuss the coexpression effectiveness for various types of gene sets.
Novel genetic associations for blood pressure identified via gene-alcohol interaction in up to 570K individuals across multiple ancestries

PubMed Central

Guo, Xiuqing; Franceschini, Nora; Cheng, Ching-Yu; Sim, Xueling; Vojinovic, Dina; Marten, Jonathan; Musani, Solomon K.; Li, Changwei; Schwander, Karen; Richard, Melissa A.; Noordam, Raymond; Aschard, Hugues; Bartz, Traci M.; Bielak, Lawrence F.; Dorajoo, Rajkumar; Fisher, Virginia; Hartwig, Fernando P.; Horimoto, Andrea R. V. R.; Lohman, Kurt K.; Manning, Alisa K.; Rankinen, Tuomo; Smith, Albert V.; Wojczynski, Mary K.; Alver, Maris; Boissel, Mathilde; Cai, Qiuyin; Divers, Jasmin; Gao, Chuan; Goel, Anuj; Harris, Sarah E.; He, Meian; Hsu, Fang-Chi; Jackson, Anne U.; Kähönen, Mika; Kasturiratne, Anuradhani; Komulainen, Pirjo; Kühnel, Brigitte; Laguzzi, Federica; Luan, Jian'an; Nolte, Ilja M.; Padmanabhan, Sandosh; Robino, Antonietta; Scott, Robert A.; Sofer, Tamar; Stančáková, Alena; Takeuchi, Fumihiko; Tayo, Bamidele O.; Varga, Tibor V.; Vitart, Veronique; Wang, Yajuan; Warren, Helen R.; Wen, Wanqing; Yanek, Lisa R.; Zhang, Weihua; Zhao, Jing Hua; Afaq, Saima; Amin, Najaf; Arking, Dan E.; Aung, Tin; Boerwinkle, Eric; Borecki, Ingrid; Broeckel, Ulrich; Brown, Morris; Brumat, Marco; Burke, Gregory L.; Chakravarti, Aravinda; Charumathi, Sabanayagam; Ida Chen, Yii-Der; Connell, John M.; Correa, Adolfo; de las Fuentes, Lisa; de Mutsert, Renée; de Silva, H. Janaka; Deng, Xuan; Ding, Jingzhong; Duan, Qing; Eaton, Charles B.; Ehret, Georg; Eppinga, Ruben N.; Faul, Jessica D.; Felix, Stephan B.; Forouhi, Nita G.; Forrester, Terrence; Franco, Oscar H.; Friedlander, Yechiel; Gandin, Ilaria; Gao, He; Ghanbari, Mohsen; Gigante, Bruna; Gu, C. Charles; Gu, Dongfeng; Hagenaars, Saskia P.; Hallmans, Göran; Harris, Tamara B.; He, Jiang; Heng, Chew-Kiat; Hirata, Makoto; Howard, Barbara V.; Ikram, M. Arfan; John, Ulrich; Katsuya, Tomohiro; Khor, Chiea Chuen; Kilpeläinen, Tuomas O.; Koh, Woon-Puay; Krieger, José E.; Kritchevsky, Stephen B.; Kubo, Michiaki; Kuusisto, Johanna; Lakka, Timo A.; Langefeld, Carl D.; Langenberg, Claudia; Launer, Lenore J.; Lehne, Benjamin; Lewis, Cora E.; Li, Yize; Lin, Shiow; Liu, Jianjun; Liu, Jingmin; Loh, Marie; Louie, Tin; Mägi, Reedik; McKenzie, Colin A.; Meitinger, Thomas; Milaneschi, Yuri; Milani, Lili; Mohlke, Karen L.; Momozawa, Yukihide; Nalls, Mike A.; Nelson, Christopher P.; Sotoodehnia, Nona; Norris, Jill M.; O'Connell, Jeff R.; Palmer, Nicholette D.; Perls, Thomas; Pedersen, Nancy L.; Peters, Annette; Peyser, Patricia A.; Poulter, Neil; Raffel, Leslie J.; Raitakari, Olli T.; Roll, Kathryn; Rose, Lynda M.; Rosendaal, Frits R.; Rotter, Jerome I.; Schmidt, Carsten O.; Schreiner, Pamela J.; Schupf, Nicole; Scott, William R.; Shi, Yuan; Sidney, Stephen; Sims, Mario; Sitlani, Colleen M.; Smith, Jennifer A.; Snieder, Harold; Starr, John M.; Strauch, Konstantin; Stringham, Heather M.; Tan, Nicholas Y. Q.; Tang, Hua; Taylor, Kent D.; Teo, Yik Ying; Tham, Yih Chung; Turner, Stephen T.; Uitterlinden, André G.; Vollenweider, Peter; Waldenberger, Melanie; Wang, Lihua; Wang, Ya Xing; Wei, Wen Bin; Williams, Christine; Yao, Jie; Yu, Caizheng; Yuan, Jian-Min; Zhao, Wei; Zonderman, Alan B.; Becker, Diane M.; Boehnke, Michael; Bowden, Donald W.; Chambers, John C.; Deary, Ian J.; Esko, Tõnu; Farrall, Martin; Franks, Paul W.; Freedman, Barry I.; Froguel, Philippe; Gasparini, Paolo; Gieger, Christian; Kamatani, Yoichiro; Kato, Norihiro; Kooner, Jaspal S.; Kutalik, Zoltán; Laakso, Markku; Laurie, Cathy C.; Leander, Karin; Lehtimäki, Terho; Study, Lifelines Cohort; Magnusson, Patrik K. E.; Oldehinkel, Albertine J.; Penninx, Brenda W. J. H.; Polasek, Ozren; Porteous, David J.; Rauramaa, Rainer; Samani, Nilesh J.; Scott, James; Shu, Xiao-Ou; van der Harst, Pim; Wagenknecht, Lynne E.; Watkins, Hugh; Weir, David R.; Wickremasinghe, Ananda R.; Wu, Tangchun; Zheng, Wei; Bouchard, Claude; Christensen, Kaare; Evans, Michele K.; Gudnason, Vilmundur; Horta, Bernardo L.; Kardia, Sharon L. R.; Liu, Yongmei; Pereira, Alexandre C.; Psaty, Bruce M.; Ridker, Paul M.; van Dam, Rob M.; Gauderman, W. James; Zhu, Xiaofeng; Mook-Kanamori, Dennis O.; Fornage, Myriam; Rotimi, Charles N.; Cupples, L. Adrienne; Kelly, Tanika N.; Fox, Ervin R.; Hayward, Caroline; van Duijn, Cornelia M.; Tai, E Shyong; Wong, Tien Yin; Kooperberg, Charles; Palmas, Walter; Morrison, Alanna C.; Caulfield, Mark J.; Munroe, Patricia B.; Rao, Dabeeru C.; Province, Michael A.; Levy, Daniel

2018-01-01

Heavy alcohol consumption is an established risk factor for hypertension; the mechanism by which alcohol consumption impact blood pressure (BP) regulation remains unknown. We hypothesized that a genome-wide association study accounting for gene-alcohol consumption interaction for BP might identify additional BP loci and contribute to the understanding of alcohol-related BP regulation. We conducted a large two-stage investigation incorporating joint testing of main genetic effects and single nucleotide variant (SNV)-alcohol consumption interactions. In Stage 1, genome-wide discovery meta-analyses in ≈131K individuals across several ancestry groups yielded 3,514 SNVs (245 loci) with suggestive evidence of association (P < 1.0 x 10−5). In Stage 2, these SNVs were tested for independent external replication in ≈440K individuals across multiple ancestries. We identified and replicated (at Bonferroni correction threshold) five novel BP loci (380 SNVs in 21 genes) and 49 previously reported BP loci (2,159 SNVs in 109 genes) in European ancestry, and in multi-ancestry meta-analyses (P < 5.0 x 10−8). For African ancestry samples, we detected 18 potentially novel BP loci (P < 5.0 x 10−8) in Stage 1 that warrant further replication. Additionally, correlated meta-analysis identified eight novel BP loci (11 genes). Several genes in these loci (e.g., PINX1, GATA4, BLK, FTO and GABBR2) have been previously reported to be associated with alcohol consumption. These findings provide insights into the role of alcohol consumption in the genetic architecture of hypertension. PMID:29912962
Novel genetic associations for blood pressure identified via gene-alcohol interaction in up to 570K individuals across multiple ancestries.

PubMed

Feitosa, Mary F; Kraja, Aldi T; Chasman, Daniel I; Sung, Yun J; Winkler, Thomas W; Ntalla, Ioanna; Guo, Xiuqing; Franceschini, Nora; Cheng, Ching-Yu; Sim, Xueling; Vojinovic, Dina; Marten, Jonathan; Musani, Solomon K; Li, Changwei; Bentley, Amy R; Brown, Michael R; Schwander, Karen; Richard, Melissa A; Noordam, Raymond; Aschard, Hugues; Bartz, Traci M; Bielak, Lawrence F; Dorajoo, Rajkumar; Fisher, Virginia; Hartwig, Fernando P; Horimoto, Andrea R V R; Lohman, Kurt K; Manning, Alisa K; Rankinen, Tuomo; Smith, Albert V; Tajuddin, Salman M; Wojczynski, Mary K; Alver, Maris; Boissel, Mathilde; Cai, Qiuyin; Campbell, Archie; Chai, Jin Fang; Chen, Xu; Divers, Jasmin; Gao, Chuan; Goel, Anuj; Hagemeijer, Yanick; Harris, Sarah E; He, Meian; Hsu, Fang-Chi; Jackson, Anne U; Kähönen, Mika; Kasturiratne, Anuradhani; Komulainen, Pirjo; Kühnel, Brigitte; Laguzzi, Federica; Luan, Jian'an; Matoba, Nana; Nolte, Ilja M; Padmanabhan, Sandosh; Riaz, Muhammad; Rueedi, Rico; Robino, Antonietta; Said, M Abdullah; Scott, Robert A; Sofer, Tamar; Stančáková, Alena; Takeuchi, Fumihiko; Tayo, Bamidele O; van der Most, Peter J; Varga, Tibor V; Vitart, Veronique; Wang, Yajuan; Ware, Erin B; Warren, Helen R; Weiss, Stefan; Wen, Wanqing; Yanek, Lisa R; Zhang, Weihua; Zhao, Jing Hua; Afaq, Saima; Amin, Najaf; Amini, Marzyeh; Arking, Dan E; Aung, Tin; Boerwinkle, Eric; Borecki, Ingrid; Broeckel, Ulrich; Brown, Morris; Brumat, Marco; Burke, Gregory L; Canouil, Mickaël; Chakravarti, Aravinda; Charumathi, Sabanayagam; Ida Chen, Yii-Der; Connell, John M; Correa, Adolfo; de Las Fuentes, Lisa; de Mutsert, Renée; de Silva, H Janaka; Deng, Xuan; Ding, Jingzhong; Duan, Qing; Eaton, Charles B; Ehret, Georg; Eppinga, Ruben N; Evangelou, Evangelos; Faul, Jessica D; Felix, Stephan B; Forouhi, Nita G; Forrester, Terrence; Franco, Oscar H; Friedlander, Yechiel; Gandin, Ilaria; Gao, He; Ghanbari, Mohsen; Gigante, Bruna; Gu, C Charles; Gu, Dongfeng; Hagenaars, Saskia P; Hallmans, Göran; Harris, Tamara B; He, Jiang; Heikkinen, Sami; Heng, Chew-Kiat; Hirata, Makoto; Howard, Barbara V; Ikram, M Arfan; John, Ulrich; Katsuya, Tomohiro; Khor, Chiea Chuen; Kilpeläinen, Tuomas O; Koh, Woon-Puay; Krieger, José E; Kritchevsky, Stephen B; Kubo, Michiaki; Kuusisto, Johanna; Lakka, Timo A; Langefeld, Carl D; Langenberg, Claudia; Launer, Lenore J; Lehne, Benjamin; Lewis, Cora E; Li, Yize; Lin, Shiow; Liu, Jianjun; Liu, Jingmin; Loh, Marie; Louie, Tin; Mägi, Reedik; McKenzie, Colin A; Meitinger, Thomas; Metspalu, Andres; Milaneschi, Yuri; Milani, Lili; Mohlke, Karen L; Momozawa, Yukihide; Nalls, Mike A; Nelson, Christopher P; Sotoodehnia, Nona; Norris, Jill M; O'Connell, Jeff R; Palmer, Nicholette D; Perls, Thomas; Pedersen, Nancy L; Peters, Annette; Peyser, Patricia A; Poulter, Neil; Raffel, Leslie J; Raitakari, Olli T; Roll, Kathryn; Rose, Lynda M; Rosendaal, Frits R; Rotter, Jerome I; Schmidt, Carsten O; Schreiner, Pamela J; Schupf, Nicole; Scott, William R; Sever, Peter S; Shi, Yuan; Sidney, Stephen; Sims, Mario; Sitlani, Colleen M; Smith, Jennifer A; Snieder, Harold; Starr, John M; Strauch, Konstantin; Stringham, Heather M; Tan, Nicholas Y Q; Tang, Hua; Taylor, Kent D; Teo, Yik Ying; Tham, Yih Chung; Turner, Stephen T; Uitterlinden, André G; Vollenweider, Peter; Waldenberger, Melanie; Wang, Lihua; Wang, Ya Xing; Wei, Wen Bin; Williams, Christine; Yao, Jie; Yu, Caizheng; Yuan, Jian-Min; Zhao, Wei; Zonderman, Alan B; Becker, Diane M; Boehnke, Michael; Bowden, Donald W; Chambers, John C; Deary, Ian J; Esko, Tõnu; Farrall, Martin; Franks, Paul W; Freedman, Barry I; Froguel, Philippe; Gasparini, Paolo; Gieger, Christian; Jonas, Jost Bruno; Kamatani, Yoichiro; Kato, Norihiro; Kooner, Jaspal S; Kutalik, Zoltán; Laakso, Markku; Laurie, Cathy C; Leander, Karin; Lehtimäki, Terho; Study, Lifelines Cohort; Magnusson, Patrik K E; Oldehinkel, Albertine J; Penninx, Brenda W J H; Polasek, Ozren; Porteous, David J; Rauramaa, Rainer; Samani, Nilesh J; Scott, James; Shu, Xiao-Ou; van der Harst, Pim; Wagenknecht, Lynne E; Wareham, Nicholas J; Watkins, Hugh; Weir, David R; Wickremasinghe, Ananda R; Wu, Tangchun; Zheng, Wei; Bouchard, Claude; Christensen, Kaare; Evans, Michele K; Gudnason, Vilmundur; Horta, Bernardo L; Kardia, Sharon L R; Liu, Yongmei; Pereira, Alexandre C; Psaty, Bruce M; Ridker, Paul M; van Dam, Rob M; Gauderman, W James; Zhu, Xiaofeng; Mook-Kanamori, Dennis O; Fornage, Myriam; Rotimi, Charles N; Cupples, L Adrienne; Kelly, Tanika N; Fox, Ervin R; Hayward, Caroline; van Duijn, Cornelia M; Tai, E Shyong; Wong, Tien Yin; Kooperberg, Charles; Palmas, Walter; Rice, Kenneth; Morrison, Alanna C; Elliott, Paul; Caulfield, Mark J; Munroe, Patricia B; Rao, Dabeeru C; Province, Michael A; Levy, Daniel

2018-01-01

Heavy alcohol consumption is an established risk factor for hypertension; the mechanism by which alcohol consumption impact blood pressure (BP) regulation remains unknown. We hypothesized that a genome-wide association study accounting for gene-alcohol consumption interaction for BP might identify additional BP loci and contribute to the understanding of alcohol-related BP regulation. We conducted a large two-stage investigation incorporating joint testing of main genetic effects and single nucleotide variant (SNV)-alcohol consumption interactions. In Stage 1, genome-wide discovery meta-analyses in ≈131K individuals across several ancestry groups yielded 3,514 SNVs (245 loci) with suggestive evidence of association (P < 1.0 x 10-5). In Stage 2, these SNVs were tested for independent external replication in ≈440K individuals across multiple ancestries. We identified and replicated (at Bonferroni correction threshold) five novel BP loci (380 SNVs in 21 genes) and 49 previously reported BP loci (2,159 SNVs in 109 genes) in European ancestry, and in multi-ancestry meta-analyses (P < 5.0 x 10-8). For African ancestry samples, we detected 18 potentially novel BP loci (P < 5.0 x 10-8) in Stage 1 that warrant further replication. Additionally, correlated meta-analysis identified eight novel BP loci (11 genes). Several genes in these loci (e.g., PINX1, GATA4, BLK, FTO and GABBR2) have been previously reported to be associated with alcohol consumption. These findings provide insights into the role of alcohol consumption in the genetic architecture of hypertension.
Genome-Wide Identification of Medicago Peptides Involved in Macronutrient Responses and Nodulation1[OPEN

PubMed Central

Dai, Xinbin; Zhuang, Zhaohong; Torres-Jerez, Ivone; Nogales, Joaquina

2017-01-01

Growing evidence indicates that small, secreted peptides (SSPs) play critical roles in legume growth and development, yet the annotation of SSP-coding genes is far from complete. Systematic reannotation of the Medicago truncatula genome identified 1,970 homologs of established SSP gene families and an additional 2,455 genes that are potentially novel SSPs, previously unreported in the literature. The expression patterns of known and putative SSP genes based on 144 RNA sequencing data sets covering various stages of macronutrient deficiencies and symbiotic interactions with rhizobia and mycorrhiza were investigated. Focusing on those known or suspected to act via receptor-mediated signaling, 240 nutrient-responsive and 365 nodulation-responsive Signaling-SSPs were identified, greatly expanding the number of SSP gene families potentially involved in acclimation to nutrient deficiencies and nodulation. Synthetic peptide applications were shown to alter root growth and nodulation phenotypes, revealing additional regulators of legume nutrient acquisition. Our results constitute a powerful resource enabling further investigations of specific SSP functions via peptide treatment and reverse genetics. PMID:29030416
Global Analysis of the Burkholderia thailandensis Quorum Sensing-Controlled Regulon

PubMed Central

Majerczyk, Charlotte; Brittnacher, Mitchell; Jacobs, Michael; Armour, Christopher D.; Radey, Mathew; Schneider, Emily; Phattarasokul, Somsak; Bunt, Richard

2014-01-01

Burkholderia thailandensis contains three acyl-homoserine lactone quorum sensing circuits and has two additional LuxR homologs. To identify B. thailandensis quorum sensing-controlled genes, we carried out transcriptome sequencing (RNA-seq) analyses of quorum sensing mutants and their parent. The analyses were grounded in the fact that we identified genes coding for factors shown previously to be regulated by quorum sensing among a larger set of quorum-controlled genes. We also found that genes coding for contact-dependent inhibition were induced by quorum sensing and confirmed that specific quorum sensing mutants had a contact-dependent inhibition defect. Additional quorum-controlled genes included those for the production of numerous secondary metabolites, an uncharacterized exopolysaccharide, and a predicted chitin-binding protein. This study provides insights into the roles of the three quorum sensing circuits in the saprophytic lifestyle of B. thailandensis, and it provides a foundation on which to build an understanding of the roles of quorum sensing in the biology of B. thailandensis and the closely related pathogenic Burkholderia pseudomallei and Burkholderia mallei. PMID:24464461
Inferring Gene Family Histories in Yeast Identifies Lineage Specific Expansions

PubMed Central

Ames, Ryan M.; Money, Daniel; Lovell, Simon C.

2014-01-01

The complement of genes found in the genome is a balance between gene gain and gene loss. Knowledge of the specific genes that are gained and lost over evolutionary time allows an understanding of the evolution of biological functions. Here we use new evolutionary models to infer gene family histories across complete yeast genomes; these models allow us to estimate the relative genome-wide rates of gene birth, death, innovation and extinction (loss of an entire family) for the first time. We show that the rates of gene family evolution vary both between gene families and between species. We are also able to identify those families that have experienced rapid lineage specific expansion/contraction and show that these families are enriched for specific functions. Moreover, we find that families with specific functions are repeatedly expanded in multiple species, suggesting the presence of common adaptations and that these family expansions/contractions are not random. Additionally, we identify potential specialisations, unique to specific species, in the functions of lineage specific expanded families. These results suggest that an important mechanism in the evolution of genome content is the presence of lineage-specific gene family changes. PMID:24921666
Exome sequencing reveals riboflavin transporter mutations as a cause of motor neuron disease.

PubMed

Johnson, Janel O; Gibbs, J Raphael; Megarbane, Andre; Urtizberea, J Andoni; Hernandez, Dena G; Foley, A Reghan; Arepalli, Sampath; Pandraud, Amelie; Simón-Sánchez, Javier; Clayton, Peter; Reilly, Mary M; Muntoni, Francesco; Abramzon, Yevgeniya; Houlden, Henry; Singleton, Andrew B

2012-09-01

Brown-Vialetto-Van Laere syndrome was first described in 1894 as a rare neurodegenerative disorder characterized by progressive sensorineural deafness in combination with childhood amyotrophic lateral sclerosis. Mutations in the gene, SLC52A3 (formerly C20orf54), one of three known riboflavin transporter genes, have recently been shown to underlie a number of severe cases of Brown-Vialetto-Van Laere syndrome; however, cases and families with this disease exist that do not appear to be caused by SLC52A3 mutations. We used a combination of linkage and exome sequencing to identify the disease causing mutation in an extended Lebanese Brown-Vialetto-Van Laere kindred, whose affected members were negative for SLC52A3 mutations. We identified a novel mutation in a second member of the riboflavin transporter gene family (gene symbol: SLC52A2) as the cause of disease in this family. The same mutation was identified in one additional subject, from 44 screened. Within this group of 44 patients, we also identified two additional cases with SLC52A3 mutations, but none with mutations in the remaining member of this gene family, SLC52A1. We believe this strongly supports the notion that defective riboflavin transport plays an important role in Brown-Vialetto-Van Laere syndrome. Initial work has indicated that patients with SLC52A3 defects respond to riboflavin treatment clinically and biochemically. Clearly, this makes an excellent candidate therapy for the SLC52A2 mutation-positive patients identified here. Initial riboflavin treatment of one of these patients shows promising results.
Exome sequencing reveals riboflavin transporter mutations as a cause of motor neuron disease

PubMed Central

Johnson, Janel O.; Gibbs, J. Raphael; Megarbane, Andre; Urtizberea, J. Andoni; Hernandez, Dena G.; Foley, A. Reghan; Arepalli, Sampath; Pandraud, Amelie; Simón-Sánchez, Javier; Clayton, Peter; Reilly, Mary M.; Muntoni, Francesco; Abramzon, Yevgeniya; Houlden, Henry

2012-01-01

Brown–Vialetto–Van Laere syndrome was first described in 1894 as a rare neurodegenerative disorder characterized by progressive sensorineural deafness in combination with childhood amyotrophic lateral sclerosis. Mutations in the gene, SLC52A3 (formerly C20orf54), one of three known riboflavin transporter genes, have recently been shown to underlie a number of severe cases of Brown–Vialetto–Van Laere syndrome; however, cases and families with this disease exist that do not appear to be caused by SLC52A3 mutations. We used a combination of linkage and exome sequencing to identify the disease causing mutation in an extended Lebanese Brown–Vialetto–Van Laere kindred, whose affected members were negative for SLC52A3 mutations. We identified a novel mutation in a second member of the riboflavin transporter gene family (gene symbol: SLC52A2) as the cause of disease in this family. The same mutation was identified in one additional subject, from 44 screened. Within this group of 44 patients, we also identified two additional cases with SLC52A3 mutations, but none with mutations in the remaining member of this gene family, SLC52A1. We believe this strongly supports the notion that defective riboflavin transport plays an important role in Brown–Vialetto–Van Laere syndrome. Initial work has indicated that patients with SLC52A3 defects respond to riboflavin treatment clinically and biochemically. Clearly, this makes an excellent candidate therapy for the SLC52A2 mutation-positive patients identified here. Initial riboflavin treatment of one of these patients shows promising results. PMID:22740598
Associations between variants of the HAL gene and milk production traits in Chinese Holstein cows.

PubMed

Wang, Haifei; Jiang, Li; Wang, Wenwen; Zhang, Shengli; Yin, Zongjun; Zhang, Qin; Liu, Jian-Feng

2014-11-25

The histidine ammonia-lyse gene (HAL) encodes the histidine ammonia-lyase, which catalyzes the first reaction of histidine catabolism. In our previous genome-wide association study in Chinese Holstein cows to identify genetic variants affecting milk production traits, a SNP (rs41647754) located 357 bp upstream of HAL, was found to be significantly associated with milk yield and milk protein yield. In addition, the HAL gene resides within the reported QTLs for milk production traits. The aims of this study were to identify genetic variants in HAL and to test the association between these variants and milk production traits. Fifteen SNPs were identified within the regions under study of the HAL gene, including three coding mutations, seven intronic mutations, one promoter region mutation, and four 3'UTR mutations. Nine of these identified SNPs were chosen for subsequent genotyping and association analyses. Our results showed that five SNP markers (ss974768522, ss974768525, ss974768531, ss974768533 and ss974768534) were significantly associated with one or more milk production traits. Haplotype analysis showed that two haplotype blocks were significantly associated with milk yield and milk protein yield, providing additional support for the association between HAL variants and milk production traits in dairy cows (P < 0.05). Our study shows evidence of significant associations between SNPs within the HAL gene and milk production traits in Chinese Holstein cows, indicating the potential role of HAL variants in these traits. These identified SNPs may serve as genetic markers used in genomic selection schemes to accelerate the genetic gains of milk production traits in dairy cattle.
The long tail of oncogenic drivers in prostate cancer.

PubMed

Armenia, Joshua; Wankowicz, Stephanie A M; Liu, David; Gao, Jianjiong; Kundra, Ritika; Reznik, Ed; Chatila, Walid K; Chakravarty, Debyani; Han, G Celine; Coleman, Ilsa; Montgomery, Bruce; Pritchard, Colin; Morrissey, Colm; Barbieri, Christopher E; Beltran, Himisha; Sboner, Andrea; Zafeiriou, Zafeiris; Miranda, Susana; Bielski, Craig M; Penson, Alexander V; Tolonen, Charlotte; Huang, Franklin W; Robinson, Dan; Wu, Yi Mi; Lonigro, Robert; Garraway, Levi A; Demichelis, Francesca; Kantoff, Philip W; Taplin, Mary-Ellen; Abida, Wassim; Taylor, Barry S; Scher, Howard I; Nelson, Peter S; de Bono, Johann S; Rubin, Mark A; Sawyers, Charles L; Chinnaiyan, Arul M; Schultz, Nikolaus; Van Allen, Eliezer M

2018-05-01

Comprehensive genomic characterization of prostate cancer has identified recurrent alterations in genes involved in androgen signaling, DNA repair, and PI3K signaling, among others. However, larger and uniform genomic analysis may identify additional recurrently mutated genes at lower frequencies. Here we aggregate and uniformly analyze exome sequencing data from 1,013 prostate cancers. We identify and validate a new class of E26 transformation-specific (ETS)-fusion-negative tumors defined by mutations in epigenetic regulators, as well as alterations in pathways not previously implicated in prostate cancer, such as the spliceosome pathway. We find that the incidence of significantly mutated genes (SMGs) follows a long-tail distribution, with many genes mutated in less than 3% of cases. We identify a total of 97 SMGs, including 70 not previously implicated in prostate cancer, such as the ubiquitin ligase CUL3 and the transcription factor SPEN. Finally, comparing primary and metastatic prostate cancer identifies a set of genomic markers that may inform risk stratification.
A global analysis of protein expression profiles in Sinorhizobium meliloti: discovery of new genes for nodule occupancy and stress adaptation.

PubMed

Djordjevic, Michael A; Chen, Han Cai; Natera, Siria; Van Noorden, Giel; Menzel, Christian; Taylor, Scott; Renard, Clotilde; Geiger, Otto; Weiller, Georg F

2003-06-01

A proteomic examination of Sinorhizobium meliloti strain 1021 was undertaken using a combination of 2-D gel electrophoresis, peptide mass fingerprinting, and bioinformatics. Our goal was to identify (i) putative symbiosis- or nutrient-stress-specific proteins, (ii) the biochemical pathways active under different conditions, (iii) potential new genes, and (iv) the extent of posttranslational modifications of S. meliloti proteins. In total, we identified the protein products of 810 genes (13.1% of the genome's coding capacity). The 810 genes generated 1,180 gene products, with chromosomal genes accounting for 78% of the gene products identified (18.8% of the chromosome's coding capacity). The activity of 53 metabolic pathways was inferred from bioinformatic analysis of proteins with assigned Enzyme Commission numbers. Of the remaining proteins that did not encode enzymes, ABC-type transporters composed 12.7% and regulatory proteins 3.4% of the total. Proteins with up to seven transmembrane domains were identified in membrane preparations. A total of 27 putative nodule-specific proteins and 35 nutrient-stress-specific proteins were identified and used as a basis to define genes and describe processes occurring in S. meliloti cells in nodules and under stress. Several nodule proteins from the plant host were present in the nodule bacteria preparations. We also identified seven potentially novel proteins not predicted from the DNA sequence. Post-translational modifications such as N-terminal processing could be inferred from the data. The posttranslational addition of UMP to the key regulator of nitrogen metabolism, PII, was demonstrated. This work demonstrates the utility of combining mass spectrometry with protein arraying or separation techniques to identify candidate genes involved in important biological processes and niche occupations that may be intransigent to other methods of gene expression profiling.
Genome-wide association meta-analysis of 78,308 individuals identifies new loci and genes influencing human intelligence.

PubMed

Sniekers, Suzanne; Stringer, Sven; Watanabe, Kyoko; Jansen, Philip R; Coleman, Jonathan R I; Krapohl, Eva; Taskesen, Erdogan; Hammerschlag, Anke R; Okbay, Aysu; Zabaneh, Delilah; Amin, Najaf; Breen, Gerome; Cesarini, David; Chabris, Christopher F; Iacono, William G; Ikram, M Arfan; Johannesson, Magnus; Koellinger, Philipp; Lee, James J; Magnusson, Patrik K E; McGue, Matt; Miller, Mike B; Ollier, William E R; Payton, Antony; Pendleton, Neil; Plomin, Robert; Rietveld, Cornelius A; Tiemeier, Henning; van Duijn, Cornelia M; Posthuma, Danielle

2017-07-01

Intelligence is associated with important economic and health-related life outcomes. Despite intelligence having substantial heritability (0.54) and a confirmed polygenic nature, initial genetic studies were mostly underpowered. Here we report a meta-analysis for intelligence of 78,308 individuals. We identify 336 associated SNPs (METAL P < 5 × 10 -8 ) in 18 genomic loci, of which 15 are new. Around half of the SNPs are located inside a gene, implicating 22 genes, of which 11 are new findings. Gene-based analyses identified an additional 30 genes (MAGMA P < 2.73 × 10 -6 ), of which all but one had not been implicated previously. We show that the identified genes are predominantly expressed in brain tissue, and pathway analysis indicates the involvement of genes regulating cell development (MAGMA competitive P = 3.5 × 10 -6 ). Despite the well-known difference in twin-based heritability for intelligence in childhood (0.45) and adulthood (0.80), we show substantial genetic correlation (r g = 0.89, LD score regression P = 5.4 × 10 -29 ). These findings provide new insight into the genetic architecture of intelligence.
The genetics of alcoholism: identifying specific genes through family studies.

PubMed

Edenberg, Howard J; Foroud, Tatiana

2006-09-01

Alcoholism is a complex disorder with both genetic and environmental risk factors. Studies in humans have begun to elucidate the genetic underpinnings of the risk for alcoholism. Here we briefly review strategies for identifying individual genes in which variations affect the risk for alcoholism and related phenotypes, in the context of one large study that has successfully identified such genes. The Collaborative Study on the Genetics of Alcoholism (COGA) is a family-based study that has collected detailed phenotypic data on individuals in families with multiple alcoholic members. A genome-wide linkage approach led to the identification of chromosomal regions containing genes that influenced alcoholism risk and related phenotypes. Subsequently, single nucleotide polymorphisms (SNPs) were genotyped in positional candidate genes located within the linked chromosomal regions, and analyzed for association with these phenotypes. Using this sequential approach, COGA has detected association with GABRA2, CHRM2 and ADH4; these associations have all been replicated by other researchers. COGA has detected association to additional genes including GABRG3, TAS2R16, SNCA, OPRK1 and PDYN, results that are awaiting confirmation. These successes demonstrate that genes contributing to the risk for alcoholism can be reliably identified using human subjects.
Linkage and association analysis of obesity traits reveals novel loci and interactions with dietary n-3 fatty acids in an Alaska Native (Yup’ik) population

PubMed Central

Vaughan, Laura Kelly; Wiener, Howard W.; Aslibekyan, Stella; Allison, David B.; Havel, Peter J.; Stanhope, Kimber L.; O’Brien, Diane M.; Hopkins, Scarlett E.; Lemas, Dominick J.; Boyer, Bert B.; Tiwari, Hemant K.

2015-01-01

Objective To identify novel genetic markers of obesity-related traits and to identify gene-diet interactions with n-3 polyunsaturated fatty acid (n-3 PUFA) intake in Yup’ik people. Material and Methods We measured body composition, plasma adipokines and ghrelin in 982 participants enrolled in the Center for Alaska Native Health Research (CANHR) Study. We conducted a genome-wide SNP linkage scan and targeted association analysis, fitting additional models to investigate putative gene-diet interactions. Finally, we performed bioinformatic analysis to uncover likely candidate genes within the identified linkage peaks. Results We observed evidence of linkage for all obesity-related traits, replicating previous results and identifying novel regions of interest for adiponectin (10q26.13-2) and thigh circumference (8q21.11-13). Bioinformatic analysis revealed DOCK1, PTPRE (10q26.13-2) and FABP4 (8q21.11-13) as putative candidate genes in the newly identified regions. Targeted SNP analysis under the linkage peaks identified associations between three SNPs and obesity-related traits: rs1007750 on chromosome 8 and thigh circumference (P=0.0005), rs878953 on chromosome 5 and thigh skinfold (P=0.0004), and rs1596854 on chromosome 11 for waist circumference (P=0.0003). Finally, we showed that n-3 PUFA modified the association between obesity related traits and two additional variants (rs2048417 on chromosome 3 for adiponectin, P for interaction=0.0006 and rs730414 on chromosome 11 for percentage body fat, P for interaction=0.0004). Conclusions This study presents evidence of novel genomic regions and gene-diet interactions that may contribute to the pathophysiology of obesity-related traits among Yup’ik people. PMID:25772781
Linkage and association analysis of obesity traits reveals novel loci and interactions with dietary n-3 fatty acids in an Alaska Native (Yup'ik) population.

PubMed

Vaughan, Laura Kelly; Wiener, Howard W; Aslibekyan, Stella; Allison, David B; Havel, Peter J; Stanhope, Kimber L; O'Brien, Diane M; Hopkins, Scarlett E; Lemas, Dominick J; Boyer, Bert B; Tiwari, Hemant K

2015-06-01

To identify novel genetic markers of obesity-related traits and to identify gene-diet interactions with n-3 polyunsaturated fatty acid (n-3 PUFA) intake in Yup'ik people. We measured body composition, plasma adipokines and ghrelin in 982 participants enrolled in the Center for Alaska Native Health Research (CANHR) Study. We conducted a genome-wide SNP linkage scan and targeted association analysis, fitting additional models to investigate putative gene-diet interactions. Finally, we performed bioinformatic analysis to uncover likely candidate genes within the identified linkage peaks. We observed evidence of linkage for all obesity-related traits, replicating previous results and identifying novel regions of interest for adiponectin (10q26.13-2) and thigh circumference (8q21.11-13). Bioinformatic analysis revealed DOCK1, PTPRE (10q26.13-2) and FABP4 (8q21.11-13) as putative candidate genes in the newly identified regions. Targeted SNP analysis under the linkage peaks identified associations between three SNPs and obesity-related traits: rs1007750 on chromosome 8 and thigh circumference (P=0.0005), rs878953 on chromosome 5 and thigh skinfold (P=0.0004), and rs1596854 on chromosome 11 for waist circumference (P=0.0003). Finally, we showed that n-3 PUFA modified the association between obesity related traits and two additional variants (rs2048417 on chromosome 3 for adiponectin, P for interaction=0.0006 and rs730414 on chromosome 11 for percentage body fat, P for interaction=0.0004). This study presents evidence of novel genomic regions and gene-diet interactions that may contribute to the pathophysiology of obesity-related traits among Yup'ik people. Copyright © 2015 Elsevier Inc. All rights reserved.
Identification of a duplication within the GDF9 gene and novel candidate genes for primary ovarian insufficiency (POI) by a customized high-resolution array comparative genomic hybridization platform.

PubMed

Norling, A; Hirschberg, A L; Rodriguez-Wallberg, K A; Iwarsson, E; Wedell, A; Barbaro, M

2014-08-01

Can high-resolution array comparative genomic hybridization (CGH) analysis of DNA samples from women with primary ovarian insufficiency (POI) improve the diagnosis of the condition and identify novel candidate genes for POI? A mutation affecting the regulatory region of growth differentiation factor 9 (GDF9) was identified for the first time together with several novel candidate genes for POI. Most patients with POI do not receive a molecular diagnosis despite a significant genetic component in the pathogenesis. We performed a case-control study. Twenty-six patients were analyzed by array CGH for identification of copy number variants. Novel changes were investigated in 95 controls and in a separate population of 28 additional patients with POI. The experimental procedures were performed during a 1-year period. DNA samples from 26 patients with POI were analyzed by a customized 1M array-CGH platform with whole genome coverage and probe enrichment targeting 78 genes in sex development. By PCR amplification and sequencing, the breakpoint of an identified partial GDF9 gene duplication was characterized. A multiplex ligation-dependent probe amplification (MLPA) probe set for specific identification of deletions/duplications affecting GDF9 was developed. An MLPA probe set for the identification of additional cases or controls carrying novel candidate regions identified by array-CGH was developed. Sequencing of three candidate genes was performed. Eleven unique copy number changes were identified in a total of 11 patients, including a tandem duplication of 475 bp, containing part of the GDF9 gene promoter region. The duplicated region contains three NOBOX-binding elements and an E-box, important for GDF9 gene regulation. This aberration is likely causative of POI. Fifty-four patients were investigated for copy number changes within GDF9, but no additional cases were found. Ten aberrations constituting novel candidate regions were detected, including a second DNAH6 deletion in a patient with POI. Other identified candidate genes were TSPYL6, SMARCC1, CSPG5 and ZFR2. This is a descriptive study and no functional experiments were performed. The study illustrates the importance of analyzing small copy number changes in addition to sequence alterations in the genetic investigation of patients with POI. Also, promoter regions should be included in the investigation. The study was supported by grants from the Swedish Research council (project no 12198 to A.W. and project no 20324 to A.L.H.), Stockholm County Council (E.I., A.W. and K.R.W.), Foundation Frimurare Barnhuset (A.N., A.W. and M.B.), Karolinska Institutet (A.N., A.L.H., E.I., A.W. and M.B.), Novo Nordic Foundation (A.W.) and Svenska Läkaresällskapet (M.B.). The funding sources had no involvement in the design or analysis of the study. The authors have no competing interests to declare. Not applicable. © The Author 2014. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology.
Molecular basis of the dopaminergic system in the cricket Gryllus bimaculatus.

PubMed

Watanabe, Takayuki; Sadamoto, Hisayo; Aonuma, Hitoshi

2013-12-01

In insects, dopamine modulates various aspects of behavior such as learning and memory, arousal and locomotion, and is also a precursor of melanin. To elucidate the molecular basis of the dopaminergic system in the field cricket Gryllus bimaculatus DeGeer, we identified genes involved in dopamine biosynthesis, signal transduction, and dopamine re-uptake in the cricket. Complementary DNA of two isoforms of tyrosine hydroxylase (TH), which convert tyrosine into L-3,4-dihydroxyphenylalanine, was isolated from the cricket brain cDNA library. In addition, four dopamine receptor genes (Dop1, Dop2, Dop3, and DopEcR) and a high-affinity dopamine transporter gene were identified. The two TH isoforms contained isoform-specific regions in the regulatory ACT domain and showed differential expression patterns in different tissues. In addition, the dopamine receptor genes had a receptor subtype-specific distribution: the Dop1, Dop2, and DopEcR genes were broadly expressed in various tissues at differential expression levels, and the Dop3 gene was restrictedly expressed in neuronal tissues and the testicles. Our findings provide a fundamental basis for understanding the dopaminergic regulation of diverse physiological processes in the cricket.

The GENCODE exome: sequencing the complete human exome

PubMed Central

Coffey, Alison J; Kokocinski, Felix; Calafato, Maria S; Scott, Carol E; Palta, Priit; Drury, Eleanor; Joyce, Christopher J; LeProust, Emily M; Harrow, Jen; Hunt, Sarah; Lehesjoki, Anna-Elina; Turner, Daniel J; Hubbard, Tim J; Palotie, Aarno

2011-01-01

Sequencing the coding regions, the exome, of the human genome is one of the major current strategies to identify low frequency and rare variants associated with human disease traits. So far, the most widely used commercial exome capture reagents have mainly targeted the consensus coding sequence (CCDS) database. We report the design of an extended set of targets for capturing the complete human exome, based on annotation from the GENCODE consortium. The extended set covers an additional 5594 genes and 10.3 Mb compared with the current CCDS-based sets. The additional regions include potential disease genes previously inaccessible to exome resequencing studies, such as 43 genes linked to ion channel activity and 70 genes linked to protein kinase activity. In total, the new GENCODE exome set developed here covers 47.9 Mb and performed well in sequence capture experiments. In the sample set used in this study, we identified over 5000 SNP variants more in the GENCODE exome target (24%) than in the CCDS-based exome sequencing. PMID:21364695
A method adapting microarray technology for signature tagged mutagenesis of Dusulfovibrio dusulfuricans G20 and Shewanella oneidensis MR-1 in anaerobic sediment survival experiments

USGS Publications Warehouse

Groh, Jennifer L.; Luo, Qingwei; Ballard , Jimmy D.; Krumholz, Lee R.

2005-01-01

Signature-tagged mutagenesis (STM) is a powerful technique that can be used to identify genes expressed by bacteria during exposure to conditions in their natural environments. To date, there have been no reports of studies in which this approach was used to study organisms of environmental, rather than pathogenic, significance. We used a mini-Tn10 transposon-bearing plasmid, pBSL180, that efficiently and randomly mutagenized Desulfovibrio desulfuricans G20 in addition to Shewanella oneidensis MR-1. Using these organisms as model sediment-dwelling anaerobic bacteria, we developed a new screening system, modified from former STM procedures, to identify genes that are critical for sediment survival. The screening system uses microarray technology to visualize tags from input and output pools, allowing us to identify those lost during sediment incubations. While the majority of data on survival genes identified will be presented in future papers, we report here on chemotaxis-related genes identified by our STM method in both bacteria in order to validate our method. This system may be applicable to the study of numerous environmental bacteria, allowing us to identify functions and roles of survival genes in various habitats.
GTA: a game theoretic approach to identifying cancer subnetwork markers.

PubMed

Farahmand, S; Goliaei, S; Ansari-Pour, N; Razaghi-Moghadam, Z

2016-03-01

The identification of genetic markers (e.g. genes, pathways and subnetworks) for cancer has been one of the most challenging research areas in recent years. A subset of these studies attempt to analyze genome-wide expression profiles to identify markers with high reliability and reusability across independent whole-transcriptome microarray datasets. Therefore, the functional relationships of genes are integrated with their expression data. However, for a more accurate representation of the functional relationships among genes, utilization of the protein-protein interaction network (PPIN) seems to be necessary. Herein, a novel game theoretic approach (GTA) is proposed for the identification of cancer subnetwork markers by integrating genome-wide expression profiles and PPIN. The GTA method was applied to three distinct whole-transcriptome breast cancer datasets to identify the subnetwork markers associated with metastasis. To evaluate the performance of our approach, the identified subnetwork markers were compared with gene-based, pathway-based and network-based markers. We show that GTA is not only capable of identifying robust metastatic markers, it also provides a higher classification performance. In addition, based on these GTA-based subnetworks, we identified a new bonafide candidate gene for breast cancer susceptibility.
Identification of Regulatory Genes Implicated in Continuous Flowering of Longan (Dimocarpus longan L.)

PubMed Central

Jia, Tianqi; Wei, Danfeng; Meng, Shan; Allan, Andrew C.; Zeng, Lihui

2014-01-01

Longan (Dimocarpus longan L.) is a tropical/subtropical fruit tree of significant economic importance in Southeast Asia. However, a lack of transcriptomic and genomic information hinders research on longan traits, such as the control of flowering. In this study, high-throughput RNA sequencing (RNA-Seq) was used to investigate differentially expressed genes between a unique longan cultivar ‘Sijimi’(S) which flowers throughout the year and a more typical cultivar ‘Lidongben’(L) which flowers only once in the season, with the aim of identifying candidate genes associated with continuous flowering. 36,527 and 40,982 unigenes were obtained by de novo assembly of the clean reads from cDNA libraries of L and S cultivars. Additionally 40,513 unigenes were assembled from combined reads of these libraries. A total of 32,475 unigenes were annotated by BLAST search to NCBI non-redundant protein (NR), Swiss-Prot, Clusters of Orthologous Groups (COGs) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Of these, almost fifteen thousand unigenes were identified as significantly differentially expressed genes (DEGs) by using Reads Per kb per Million reads (RPKM) method. A total of 6,415 DEGs were mapped to 128 KEGG pathways, and 8,743 DEGs were assigned to 54 Gene Ontology categories. After blasting the DEGs to public sequence databases, 539 potential flowering-related DEGs were identified. In addition, 107 flowering-time genes were identified in longan, their expression levels between two longan samples were compared by RPKM method, of which the expression levels of 15 were confirmed by real-time quantitative PCR. Our results suggest longan homologues of SHORT VEGETATIVE PHASE (SVP), GIGANTEA (GI), F-BOX 1 (FKF1) and EARLY FLOWERING 4 (ELF4) may be involved this flowering trait and ELF4 may be a key gene. The identification of candidate genes related to continuous flowering will provide new insight into the molecular process of regulating flowering time in woody plants. PMID:25479005
Association of variants in innate immune genes with asthma and eczema

PubMed Central

Sharma, Sunita; Poon, Audrey; Himes, Blanca E.; Lasky-Su, Jessica; Sordillo, Joanne E.; Belanger, Kathleen; Milton, Donald K.; Bracken, Michael B.; Triche, Elizabeth W.; Leaderer, Brian P.; Gold, Diane R.; Litonjua, Augusto A.

2012-01-01

Background The innate immune pathway is important in the pathogenesis of asthma and eczema. However, only a few variants in these genes have been associated with either disease. We investigate the association between polymorphisms of genes in the innate immune pathway with childhood asthma and eczema. In addition, we compare individual associations with those discovered using a multivariate approach. Methods Using a novel method, case control based association testing (C2BAT), 569 single nucleotide polymorphisms (SNPs) in 44 innate immune genes were tested for association with asthma and eczema in children from the Boston Home Allergens and Asthma Study and the Connecticut Childhood Asthma Study. The screening algorithm was used to identify the top SNPs associated with asthma and eczema. We next investigated the interaction of innate immune variants with asthma and eczema risk using Bayesian networks. Results After correction for multiple comparisons, 7 SNPs in 6 genes (CARD25, TGFB1, LY96, ACAA1, DEFB1, and IFNG) were associated with asthma (adjusted p-value<0.02), while 5 SNPs in 3 different genes (CD80, STAT4, and IRAKI) were significantly associated with eczema (adjusted p-value < 0.02). None of these SNPs were associated with both asthma and eczema. Bayesian network analysis identified 4 SNPs that were predictive of asthma and 10 SNPs that predicted eczema. Of the genes identified using Bayesian networks, only CD80 was associated with eczema in the single-SNP study. Using novel methodology that allows for screening and replication in the same population, we have identified associations of innate immune genes with asthma and eczema. Bayesian network analysis suggests that additional SNPs influence disease susceptibility via SNP interactions. Conclusion Our findings suggest that innate immune genes contribute to the pathogenesis of asthma and eczema, and that these diseases likely have different genetic determinants. PMID:22192168
Characterisation of the subtelomeric regions of Giardia lamblia genome isolate WBC6.

PubMed

Prabhu, Anjali; Morrison, Hilary G; Martinez, Charles R; Adam, Rodney D

2007-04-01

Giardia trophozoites are polyploid and have five chromosomes. The chromosome homologues demonstrate considerable size heterogeneity due to variation in the subtelomeric regions. We used clones from the genome project with telomeric sequence at one end to identify six subtelomeric regions in addition to previously identified subtelomeric regions, to study the telomeric arrangement of the chromosomes. The subtelomeric regions included two retroposons, one retroposon pseudogene, and two vsp genes, in addition to the previously identified subtelomeric regions that include ribosomal DNA repeats. The presence of vsp genes in a subtelomeric region suggests that telomeric rearrangements may contribute to the generation of vsp diversity. These studies of the subtelomeric regions of Giardia may contribute to our understanding of the factors that maintain stability, while allowing diversity in chromosome structure.
Characterization of new mutants in the early part of the yeast secretory pathway isolated by a (/sup 3/H)mannose suicide selection

DOE Office of Scientific and Technical Information (OSTI.GOV)

Newman, A.P.; Ferro-Novick, S.

We have adapted a (/sup 3/H)mannose suicide selection to identify mutations in additional genes which function in the early part of the yeast secretory pathway. Thus far this protocol has led to the identification of two new genes which are implicated in this process, as well as additional alleles of previously identified genes. The new mutants, bet1 and bet2, are temperature sensitive for growth and protein transport. Thin section analysis has revealed the accumulation of a network of endoplasmic reticulum (ER) at the restrictive temperature (37/sup 0/C). Precursors of exported proteins that accumulate in the cell at 37/sup 0/C aremore » terminally core glycosylated. These observations suggest that the transport of precursors is blocked subsequent to translocation into the ER but before entry into the Golgi apparatus. The bet1 and bet2 mutants define two new complementation groups which have the same properties as previously identified ER-accumulating mutants. This and previous findings suggest that protein exit from the ER and entry into the Golgi apparatus is a complex process requiring at least 11 genes.« less
Comprehensive analysis of GASA family members in the Malus domestica genome: identification, characterization, and their expressions in response to apple flower induction.

PubMed

Fan, Sheng; Zhang, Dong; Zhang, Lizhi; Gao, Cai; Xin, Mingzhi; Tahir, Muhammad Mobeen; Li, Youmei; Ma, Juanjuan; Han, Mingyu

2017-10-27

The plant-specific gibberellic acid stimulated Arabidopsis (GASA) gene family is critical for plant development. However, little is known about these genes, particularly in fruit tree species. We identified 15 putative Arabidopsis thaliana GASA (AtGASA) and 26 apple GASA (MdGASA) genes. The identified genes were then characterized (e.g., chromosomal location, structure, and evolutionary relationships). All of the identified A. thaliana and apple GASA proteins included a conserved GASA domain and exhibited similar characteristics. Specifically, the MdGASA expression levels in various tissues and organs were analyzed based on an online gene expression profile and by qRT-PCR. These genes were more highly expressed in the leaves, buds, and fruits compared with the seeds, roots, and seedlings. MdGASA genes were also responsive to gibberellic acid (GA 3 ) and abscisic acid treatments. Additionally, transcriptome sequencing results revealed seven potential flowering-related MdGASA genes. We analyzed the expression levels of these genes in response to flowering-related treatments (GA 3 , 6-benzylaminopurine, and sugar) and in apple varieties that differed in terms of flowering ('Nagafu No. 2' and 'Yanfu No. 6') during the flower induction period. These candidate MdGASA genes exhibited diverse expression patterns. The expression levels of six MdGASA genes were inhibited by GA 3 , while the expression of one gene was up-regulated. Additionally, there were expression-level differences induced by the 6-benzylaminopurine and sugar treatments during the flower induction stage, as well as in the different flowering varieties. This study represents the first comprehensive investigation of the A. thaliana and apple GASA gene families. Our data may provide useful clues for future studies and may support the hypotheses regarding the role of GASA proteins during the flower induction stage in fruit tree species.
A data mining paradigm for identifying key factors in biological processes using gene expression data.

PubMed

Li, Jin; Zheng, Le; Uchiyama, Akihiko; Bin, Lianghua; Mauro, Theodora M; Elias, Peter M; Pawelczyk, Tadeusz; Sakowicz-Burkiewicz, Monika; Trzeciak, Magdalena; Leung, Donald Y M; Morasso, Maria I; Yu, Peng

2018-06-13

A large volume of biological data is being generated for studying mechanisms of various biological processes. These precious data enable large-scale computational analyses to gain biological insights. However, it remains a challenge to mine the data efficiently for knowledge discovery. The heterogeneity of these data makes it difficult to consistently integrate them, slowing down the process of biological discovery. We introduce a data processing paradigm to identify key factors in biological processes via systematic collection of gene expression datasets, primary analysis of data, and evaluation of consistent signals. To demonstrate its effectiveness, our paradigm was applied to epidermal development and identified many genes that play a potential role in this process. Besides the known epidermal development genes, a substantial proportion of the identified genes are still not supported by gain- or loss-of-function studies, yielding many novel genes for future studies. Among them, we selected a top gene for loss-of-function experimental validation and confirmed its function in epidermal differentiation, proving the ability of this paradigm to identify new factors in biological processes. In addition, this paradigm revealed many key genes in cold-induced thermogenesis using data from cold-challenged tissues, demonstrating its generalizability. This paradigm can lead to fruitful results for studying molecular mechanisms in an era of explosive accumulation of publicly available biological data.
Phylogenetic analysis of IDD gene family and characterization of its expression in response to flower induction in Malus.

PubMed

Fan, Sheng; Zhang, Dong; Xing, Libo; Qi, Siyan; Du, Lisha; Wu, Haiqin; Shao, Hongxia; Li, Youmei; Ma, Juanjuan; Han, Mingyu

2017-08-01

Although INDETERMINATE DOMAIN (IDD) genes encoding specific plant transcription factors have important roles in plant growth and development, little is known about apple IDD (MdIDD) genes and their potential functions in the flower induction. In this study, we identified 20 putative IDD genes in apple and named them according to their chromosomal locations. All identified MdIDD genes shared a conserved IDD domain. A phylogenetic analysis separated MdIDDs and other plant IDD genes into four groups. Bioinformatic analysis of chemical characteristics, gene structure, and prediction of protein-protein interactions demonstrated the functional and structural diversity of MdIDD genes. To further uncover their potential functions, we performed analysis of tandem, synteny, and gene duplications, which indicated several paired homologs of IDD genes between apple and Arabidopsis. Additionally, genome duplications also promoted the expansion and evolution of the MdIDD genes. Quantitative real-time PCR revealed that all the MdIDD genes showed distinct expression levels in five different tissues (stems, leaves, buds, flowers, and fruits). Furthermore, the expression levels of candidate MdIDD genes were also investigated in response to various circumstances, including GA treatment (decreased the flowering rate), sugar treatment (increased the flowering rate), alternate-bearing conditions, and two varieties with different-flowering intensities. Parts of them were affected by exogenous treatments and showed different expression patterns. Additionally, changes in response to alternate-bearing and different-flowering varieties of apple trees indicated that they were also responsive to flower induction. Taken together, our comprehensive analysis provided valuable information for further analysis of IDD genes aiming at flower induction.
Characterization of Resistance Genes and Plasmids from Outbreaks and Illness Clusters Caused by Salmonella Resistant to Ceftriaxone in the United States, 2011–2012

PubMed Central

Folster, Jason P.; Grass, Julian E.; Bicknese, Amelia; Taylor, Julia; Friedman, Cindy R.; Whichard, Jean M.

2017-01-01

Salmonella is an important cause of foodborne illness; however, quickly identifying the source of these infections can be difficult, and source identification is a crucial step in preventing additional illnesses. Although most infections are self-limited, invasive salmonellosis may require antimicrobial treatment. Ceftriaxone, an extended-spectrum cephalosporin, is commonly used for treatment of salmonellosis. Previous studies have identified a correlation between the food animal/retail meat source of ceftriaxone-resistant Salmonella and the type of resistance gene and plasmid it carries. In this study, we examined seven outbreaks of ceftriaxone-resistant Salmonella infections, caused by serotypes Typhimurium, Newport, Heidelberg, and Infantis. All isolates were positive for a plasmid-encoded blaCMY gene. Plasmid incompatibility typing identified five IncI1 and two IncA/C plasmids. Both outbreaks containing blaCMY-IncA/C plasmids were linked to consumption of cattle products. Three of five outbreaks with blaCMY-IncI1 (ST12) plasmids were linked to a poultry source. The remaining IncI1 outbreaks were associated with ground beef (ST20) and tomatoes (ST12). Additionally, we examined isolates from five unsolved clusters of ceftriaxone-resistant Salmonella infections and used our plasmid encoded gene findings to predict the source. Overall, we identified a likely association between the source of ceftriaxone-resistant Salmonella outbreaks and the type of resistance gene/plasmid it carries. PMID:27828730
Characterization of Resistance Genes and Plasmids from Outbreaks and Illness Clusters Caused by Salmonella Resistant to Ceftriaxone in the United States, 2011-2012.

PubMed

Folster, Jason P; Grass, Julian E; Bicknese, Amelia; Taylor, Julia; Friedman, Cindy R; Whichard, Jean M

2017-03-01

Salmonella is an important cause of foodborne illness; however, quickly identifying the source of these infections can be difficult, and source identification is a crucial step in preventing additional illnesses. Although most infections are self-limited, invasive salmonellosis may require antimicrobial treatment. Ceftriaxone, an extended-spectrum cephalosporin, is commonly used for treatment of salmonellosis. Previous studies have identified a correlation between the food animal/retail meat source of ceftriaxone-resistant Salmonella and the type of resistance gene and plasmid it carries. In this study, we examined seven outbreaks of ceftriaxone-resistant Salmonella infections, caused by serotypes Typhimurium, Newport, Heidelberg, and Infantis. All isolates were positive for a plasmid-encoded bla CMY gene. Plasmid incompatibility typing identified five IncI1 and two IncA/C plasmids. Both outbreaks containing bla CMY -IncA/C plasmids were linked to consumption of cattle products. Three of five outbreaks with bla CMY -IncI1 (ST12) plasmids were linked to a poultry source. The remaining IncI1 outbreaks were associated with ground beef (ST20) and tomatoes (ST12). In addition, we examined isolates from five unsolved clusters of ceftriaxone-resistant Salmonella infections and used our plasmid-encoded gene findings to predict the source. Overall, we identified a likely association between the source of ceftriaxone-resistant Salmonella outbreaks and the type of resistance gene/plasmid it carries.
Gene Expression Profiling in the Hibernating Primate, Cheirogaleus Medius

PubMed Central

Faherty, Sheena L.; Villanueva-Cañas, José Luis; Klopfer, Peter H.; Albà, M. Mar; Yoder, Anne D.

2016-01-01

Hibernation is a complex physiological response that some mammalian species employ to evade energetic demands. Previous work in mammalian hibernators suggests that hibernation is activated not by a set of genes unique to hibernators, but by differential expression of genes that are present in all mammals. This question of universal genetic mechanisms requires further investigation and can only be tested through additional investigations of phylogenetically dispersed species. To explore this question, we use RNA-Seq to investigate gene expression dynamics as they relate to the varying physiological states experienced throughout the year in a group of primate hibernators—Madagascar’s dwarf lemurs (genus Cheirogaleus). In a novel experimental approach, we use longitudinal sampling of biological tissues as a method for capturing gene expression profiles from the same individuals throughout their annual hibernation cycle. We identify 90 candidate genes that have variable expression patterns when comparing two active states (Active 1 and Active 2) with a torpor state. These include genes that are involved in metabolic pathways, feeding behavior, and circadian rhythms, as might be expected to correlate with seasonal physiological state changes. The identified genes appear to be critical for maintaining the health of an animal that undergoes prolonged periods of metabolic depression concurrent with the hibernation phenotype. By focusing on these differentially expressed genes in dwarf lemurs, we compare gene expression patterns in previously studied mammalian hibernators. Additionally, by employing evolutionary rate analysis, we find that hibernation-related genes do not evolve under positive selection in hibernating species relative to nonhibernators. PMID:27412611
Composite selection signals can localize the trait specific genomic regions in multi-breed populations of cattle and sheep

PubMed Central

2014-01-01

Background Discerning the traits evolving under neutral conditions from those traits evolving rapidly because of various selection pressures is a great challenge. We propose a new method, composite selection signals (CSS), which unifies the multiple pieces of selection evidence from the rank distribution of its diverse constituent tests. The extreme CSS scores capture highly differentiated loci and underlying common variants hauling excess haplotype homozygosity in the samples of a target population. Results The data on high-density genotypes were analyzed for evidence of an association with either polledness or double muscling in various cohorts of cattle and sheep. In cattle, extreme CSS scores were found in the candidate regions on autosome BTA-1 and BTA-2, flanking the POLL locus and MSTN gene, for polledness and double muscling, respectively. In sheep, the regions with extreme scores were localized on autosome OAR-2 harbouring the MSTN gene for double muscling and on OAR-10 harbouring the RXFP2 gene for polledness. In comparison to the constituent tests, there was a partial agreement between the signals at the four candidate loci; however, they consistently identified additional genomic regions harbouring no known genes. Persuasively, our list of all the additional significant CSS regions contains genes that have been successfully implicated to secondary phenotypic diversity among several subpopulations in our data. For example, the method identified a strong selection signature for stature in cattle capturing selective sweeps harbouring UQCC-GDF5 and PLAG1-CHCHD7 gene regions on BTA-13 and BTA-14, respectively. Both gene pairs have been previously associated with height in humans, while PLAG1-CHCHD7 has also been reported for stature in cattle. In the additional analysis, CSS identified significant regions harbouring multiple genes for various traits under selection in European cattle including polledness, adaptation, metabolism, growth rate, stature, immunity, reproduction traits and some other candidate genes for dairy and beef production. Conclusions CSS successfully localized the candidate regions in validation datasets as well as identified previously known and novel regions for various traits experiencing selection pressure. Together, the results demonstrate the utility of CSS by its improved power, reduced false positives and high-resolution of selection signals as compared to individual constituent tests. PMID:24636660
Co-fuse: a new class discovery analysis tool to identify and prioritize recurrent fusion genes from RNA-sequencing data.

PubMed

Paisitkriangkrai, Sakrapee; Quek, Kelly; Nievergall, Eva; Jabbour, Anissa; Zannettino, Andrew; Kok, Chung Hoow

2018-06-07

Recurrent oncogenic fusion genes play a critical role in the development of various cancers and diseases and provide, in some cases, excellent therapeutic targets. To date, analysis tools that can identify and compare recurrent fusion genes across multiple samples have not been available to researchers. To address this deficiency, we developed Co-occurrence Fusion (Co-fuse), a new and easy to use software tool that enables biologists to merge RNA-seq information, allowing them to identify recurrent fusion genes, without the need for exhaustive data processing. Notably, Co-fuse is based on pattern mining and statistical analysis which enables the identification of hidden patterns of recurrent fusion genes. In this report, we show that Co-fuse can be used to identify 2 distinct groups within a set of 49 leukemic cell lines based on their recurrent fusion genes: a multiple myeloma (MM) samples-enriched cluster and an acute myeloid leukemia (AML) samples-enriched cluster. Our experimental results further demonstrate that Co-fuse can identify known driver fusion genes (e.g., IGH-MYC, IGH-WHSC1) in MM, when compared to AML samples, indicating the potential of Co-fuse to aid the discovery of yet unknown driver fusion genes through cohort comparisons. Additionally, using a 272 primary glioma sample RNA-seq dataset, Co-fuse was able to validate recurrent fusion genes, further demonstrating the power of this analysis tool to identify recurrent fusion genes. Taken together, Co-fuse is a powerful new analysis tool that can be readily applied to large RNA-seq datasets, and may lead to the discovery of new disease subgroups and potentially new driver genes, for which, targeted therapies could be developed. The Co-fuse R source code is publicly available at https://github.com/sakrapee/co-fuse .
Identifying Epigenetic Biomarkers using Maximal Relevance and Minimal Redundancy Based Feature Selection for Multi-Omics Data.

PubMed

Mallik, Saurav; Bhadra, Tapas; Maulik, Ujjwal

2017-01-01

Epigenetic Biomarker discovery is an important task in bioinformatics. In this article, we develop a new framework of identifying statistically significant epigenetic biomarkers using maximal-relevance and minimal-redundancy criterion based feature (gene) selection for multi-omics dataset. Firstly, we determine the genes that have both expression as well as methylation values, and follow normal distribution. Similarly, we identify the genes which consist of both expression and methylation values, but do not follow normal distribution. For each case, we utilize a gene-selection method that provides maximal-relevant, but variable-weighted minimum-redundant genes as top ranked genes. For statistical validation, we apply t-test on both the expression and methylation data consisting of only the normally distributed top ranked genes to determine how many of them are both differentially expressed andmethylated. Similarly, we utilize Limma package for performing non-parametric Empirical Bayes test on both expression and methylation data comprising only the non-normally distributed top ranked genes to identify how many of them are both differentially expressed and methylated. We finally report the top-ranking significant gene-markerswith biological validation. Moreover, our framework improves positive predictive rate and reduces false positive rate in marker identification. In addition, we provide a comparative analysis of our gene-selection method as well as othermethods based on classificationperformances obtained using several well-known classifiers.
iTAR: a web server for identifying target genes of transcription factors using ChIP-seq or ChIP-chip data.

PubMed

Yang, Chia-Chun; Andrews, Erik H; Chen, Min-Hsuan; Wang, Wan-Yu; Chen, Jeremy J W; Gerstein, Mark; Liu, Chun-Chi; Cheng, Chao

2016-08-12

Chromatin immunoprecipitation followed by massively parallel DNA sequencing (ChIP-seq) or microarray hybridization (ChIP-chip) has been widely used to determine the genomic occupation of transcription factors (TFs). We have previously developed a probabilistic method, called TIP (Target Identification from Profiles), to identify TF target genes using ChIP-seq/ChIP-chip data. To achieve high specificity, TIP applies a conservative method to estimate significance of target genes, with the trade-off being a relatively low sensitivity of target gene identification compared to other methods. Additionally, TIP's output does not render binding-peak locations or intensity, information highly useful for visualization and general experimental biological use, while the variability of ChIP-seq/ChIP-chip file formats has made input into TIP more difficult than desired. To improve upon these facets, here we present are fined TIP with key extensions. First, it implements a Gaussian mixture model for p-value estimation, increasing target gene identification sensitivity and more accurately capturing the shape of TF binding profile distributions. Second, it enables the incorporation of TF binding-peak data by identifying their locations in significant target gene promoter regions and quantifies their strengths. Finally, for full ease of implementation we have incorporated it into a web server ( http://syslab3.nchu.edu.tw/iTAR/ ) that enables flexibility of input file format, can be used across multiple species and genome assembly versions, and is freely available for public use. The web server additionally performs GO enrichment analysis for the identified target genes to reveal the potential function of the corresponding TF. The iTAR web server provides a user-friendly interface and supports target gene identification in seven species, ranging from yeast to human. To facilitate investigating the quality of ChIP-seq/ChIP-chip data, the web server generates the chart of the characteristic binding profiles and the density plot of normalized regulatory scores. The iTAR web server is a useful tool in identifying TF target genes from ChIP-seq/ChIP-chip data and discovering biological insights.
Transcriptional profiling of the host cell response to feline immunodeficiency virus infection.

PubMed

Ertl, Reinhard; Klein, Dieter

2014-03-19

Feline immunodeficiency virus (FIV) is a widespread pathogen of the domestic cat and an important animal model for human immunodeficiency virus (HIV) research. In contrast to HIV, only limited information is available on the transcriptional host cell response to FIV infections. This study aims to identify FIV-induced gene expression changes in feline T-cells during the early phase of the infection. Illumina RNA-sequencing (RNA-seq) was used identify differentially expressed genes (DEGs) at 24 h after FIV infection. After removal of low-quality reads, the remaining sequencing data were mapped against the cat genome and the numbers of mapping reads were counted for each gene. Regulated genes were identified through the comparison of FIV and mock-infected data sets. After statistical analysis and the removal of genes with insufficient coverage, we detected a total of 69 significantly DEGs (44 up- and 25 down-regulated genes) upon FIV infection. The results obtained by RNA-seq were validated by reverse transcription qPCR analysis for 10 genes. Out of the most distinct DEGs identified in this study, several genes are already known to interact with HIV in humans, indicating comparable effects of both viruses on the host cell gene expression and furthermore, highlighting the importance of FIV as a model system for HIV. In addition, a set of new genes not previously linked to virus infections could be identified. The provided list of virus-induced genes may represent useful information for future studies focusing on the molecular mechanisms of virus-host interactions in FIV pathogenesis.
Nearing saturation of cancer driver gene discovery.

PubMed

Hsiehchen, David; Hsieh, Antony

2018-06-15

Extensive sequencing efforts of cancer genomes such as The Cancer Genome Atlas (TCGA) have been undertaken to uncover bona fide cancer driver genes which has enhanced our understanding of cancer and revealed therapeutic targets. However, the number of driver gene mutations is bounded, indicating that there must be a point when further sequencing efforts will be excessive. We found that there was a significant positive correlation between sample size and identified driver gene mutations across 33 cancers sequenced by the TCGA, which is expected if additional sequencing is still leading to the identification of more driver genes. However, the rate of new cancer driver genes being discovered with larger samples is declining rapidly. Our analysis provides a general guide for determining which cancer types would likely benefit from additional sequencing efforts, particularly those with relatively high rates of cancer driver gene discovery. Our results argue that past strategies of indiscriminately sequencing as many specimens as possible for all cancer types is becoming inefficient. In addition, without significant investments into applying our knowledge of cancer genomes, we risk sequencing more cancer genomes for the sake of sequencing rather than meaningful patient benefit.
Gene family innovation, conservation and loss on the animal stem lineage.

PubMed

Richter, Daniel J; Fozouni, Parinaz; Eisen, Michael; King, Nicole

2018-05-31

Choanoflagellates, the closest living relatives of animals, can provide unique insights into the changes in gene content that preceded the origin of animals. However, only two choanoflagellate genomes are currently available, providing poor coverage of their diversity. We sequenced transcriptomes of 19 additional choanoflagellate species to produce a comprehensive reconstruction of the gains and losses that shaped the ancestral animal gene repertoire. We identified ~1,944 gene families that originated on the animal stem lineage, of which only 39 are conserved across all animals in our study. In addition, ~372 gene families previously thought to be animal-specific, including Notch, Delta, and homologs of the animal Toll-like receptor genes, instead evolved prior to the animal-choanoflagellate divergence. Our findings contribute to an increasingly detailed portrait of the gene families that defined the biology of the Urmetazoan and that may underpin core features of extant animals. © 2018, Richter et al.

A framework for the use of single-chemical transcriptomics data in predicting the hazards associated with complex mixtures of polycyclic aromatic hydrocarbons.

PubMed

Labib, Sarah; Williams, Andrew; Kuo, Byron; Yauk, Carole L; White, Paul A; Halappanavar, Sabina

2017-07-01

The assumption of additivity applied in the risk assessment of environmental mixtures containing carcinogenic polycyclic aromatic hydrocarbons (PAHs) was investigated using transcriptomics. MutaTMMouse were gavaged for 28 days with three doses of eight individual PAHs, two defined mixtures of PAHs, or coal tar, an environmentally ubiquitous complex mixture of PAHs. Microarrays were used to identify differentially expressed genes (DEGs) in lung tissue collected 3 days post-exposure. Cancer-related pathways perturbed by the individual or mixtures of PAHs were identified, and dose-response modeling of the DEGs was conducted to calculate gene/pathway benchmark doses (BMDs). Individual PAH-induced pathway perturbations (the median gene expression changes for all genes in a pathway relative to controls) and pathway BMDs were applied to models of additivity [i.e., concentration addition (CA), generalized concentration addition (GCA), and independent action (IA)] to generate predicted pathway-specific dose-response curves for each PAH mixture. The predicted and observed pathway dose-response curves were compared to assess the sensitivity of different additivity models. Transcriptomics-based additivity calculation showed that IA accurately predicted the pathway perturbations induced by all mixtures of PAHs. CA did not support the additivity assumption for the defined mixtures; however, GCA improved the CA predictions. Moreover, pathway BMDs derived for coal tar were comparable to BMDs derived from previously published coal tar-induced mouse lung tumor incidence data. These results suggest that in the absence of tumor incidence data, individual chemical-induced transcriptomics changes associated with cancer can be used to investigate the assumption of additivity and to predict the carcinogenic potential of a mixture.
Global transgenerational gene expression dynamics in two newly synthesized allohexaploid wheat (Triticum aestivum) lines

PubMed Central

2012-01-01

Background Alteration in gene expression resulting from allopolyploidization is a prominent feature in plants, but its spectrum and extent are not fully known. Common wheat (Triticum aestivum) was formed via allohexaploidization about 10,000 years ago, and became the most important crop plant. To gain further insights into the genome-wide transcriptional dynamics associated with the onset of common wheat formation, we conducted microarray-based genome-wide gene expression analysis on two newly synthesized allohexaploid wheat lines with chromosomal stability and a genome constitution analogous to that of the present-day common wheat. Results Multi-color GISH (genomic in situ hybridization) was used to identify individual plants from two nascent allohexaploid wheat lines between Triticum turgidum (2n = 4x = 28; genome BBAA) and Aegilops tauschii (2n = 2x = 14; genome DD), which had a stable chromosomal constitution analogous to that of common wheat (2n = 6x = 42; genome BBAADD). Genome-wide analysis of gene expression was performed for these allohexaploid lines along with their parental plants from T. turgidum and Ae. tauschii, using the Affymetrix Gene Chip Wheat Genome-Array. Comparison with the parental plants coupled with inclusion of empirical mid-parent values (MPVs) revealed that whereas the great majority of genes showed the expected parental additivity, two major patterns of alteration in gene expression in the allohexaploid lines were identified: parental dominance expression and non-additive expression. Genes involved in each of the two altered expression patterns could be classified into three distinct groups, stochastic, heritable and persistent, based on their transgenerational heritability and inter-line conservation. Strikingly, whereas both altered patterns of gene expression showed a propensity of inheritance, identity of the involved genes was highly stochastic, consistent with the involvement of diverse Gene Ontology (GO) terms. Nonetheless, those genes showing non-additive expression exhibited a significant enrichment for vesicle-function. Conclusions Our results show that two patterns of global alteration in gene expression are conditioned by allohexaploidization in wheat, that is, parental dominance expression and non-additive expression. Both altered patterns of gene expression but not the identity of the genes involved are likely to play functional roles in stabilization and establishment of the newly formed allohexaploid plants, and hence, relevant to speciation and evolution of T. aestivum. PMID:22277161
Chronic smoking and alcoholism change expression of selective genes in the human prefrontal cortex.

PubMed

Flatscher-Bader, Traute; Wilce, Peter A

2006-05-01

Alcoholism is commonly associated with chronic smoking. A number of gene expression profiles of regions within the human mesocorticolimbic system have identified potential alcohol-sensitive genes; however, the influence of smoking on these changes was not taken into account. This study addressed the impact of alcohol and smoking on the expression of 4 genes, previously identified as alcoholism-sensitive, in the human prefrontal cortex (PFC). mRNA expression of apolipoprotein D, tissue inhibitor of the metalloproteinase 3, high-affinity glial glutamate transporter and midkine, was measured in the PFC of alcoholic subjects and controls with and without smoking comorbidity using real-time polymerase chain reaction. The results show that alcohol affects transcription of some of these genes. Additionally, smoking has a marked influence on gene expression. This study emphasizes the need for careful case selection in future gene expression studies to delineate the adaptive molecular process associated with smoking and alcohol.
The Orphan Disease Networks

PubMed Central

Zhang, Minlu; Zhu, Cheng; Jacomy, Alexis; Lu, Long J.; Jegga, Anil G.

2011-01-01

The low prevalence rate of orphan diseases (OD) requires special combined efforts to improve diagnosis, prevention, and discovery of novel therapeutic strategies. To identify and investigate relationships based on shared genes or shared functional features, we have conducted a bioinformatic-based global analysis of all orphan diseases with known disease-causing mutant genes. Starting with a bipartite network of known OD and OD-causing mutant genes and using the human protein interactome, we first construct and topologically analyze three networks: the orphan disease network, the orphan disease-causing mutant gene network, and the orphan disease-causing mutant gene interactome. Our results demonstrate that in contrast to the common disease-causing mutant genes that are predominantly nonessential, a majority of orphan disease-causing mutant genes are essential. In confirmation of this finding, we found that OD-causing mutant genes are topologically important in the protein interactome and are ubiquitously expressed. Additionally, functional enrichment analysis of those genes in which mutations cause ODs shows that a majority result in premature death or are lethal in the orthologous mouse gene knockout models. To address the limitations of traditional gene-based disease networks, we also construct and analyze OD networks on the basis of shared enriched features (biological processes, cellular components, pathways, phenotypes, and literature citations). Analyzing these functionally-linked OD networks, we identified several additional OD-OD relations that are both phenotypically similar and phenotypically diverse. Surprisingly, we observed that the wiring of the gene-based and other feature-based OD networks are largely different; this suggests that the relationship between ODs cannot be fully captured by the gene-based network alone. PMID:21664998
Expression-based clustering of CAZyme-encoding genes of Aspergillus niger.

PubMed

Gruben, Birgit S; Mäkelä, Miia R; Kowalczyk, Joanna E; Zhou, Miaomiao; Benoit-Gelber, Isabelle; De Vries, Ronald P

2017-11-23

The Aspergillus niger genome contains a large repertoire of genes encoding carbohydrate active enzymes (CAZymes) that are targeted to plant polysaccharide degradation enabling A. niger to grow on a wide range of plant biomass substrates. Which genes need to be activated in certain environmental conditions depends on the composition of the available substrate. Previous studies have demonstrated the involvement of a number of transcriptional regulators in plant biomass degradation and have identified sets of target genes for each regulator. In this study, a broad transcriptional analysis was performed of the A. niger genes encoding (putative) plant polysaccharide degrading enzymes. Microarray data focusing on the initial response of A. niger to the presence of plant biomass related carbon sources were analyzed of a wild-type strain N402 that was grown on a large range of carbon sources and of the regulatory mutant strains ΔxlnR, ΔaraR, ΔamyR, ΔrhaR and ΔgalX that were grown on their specific inducing compounds. The cluster analysis of the expression data revealed several groups of co-regulated genes, which goes beyond the traditionally described co-regulated gene sets. Additional putative target genes of the selected regulators were identified, based on their expression profile. Notably, in several cases the expression profile puts questions on the function assignment of uncharacterized genes that was based on homology searches, highlighting the need for more extensive biochemical studies into the substrate specificity of enzymes encoded by these non-characterized genes. The data also revealed sets of genes that were upregulated in the regulatory mutants, suggesting interaction between the regulatory systems and a therefore even more complex overall regulatory network than has been reported so far. Expression profiling on a large number of substrates provides better insight in the complex regulatory systems that drive the conversion of plant biomass by fungi. In addition, the data provides additional evidence in favor of and against the similarity-based functions assigned to uncharacterized genes.
Identification and functional analysis of a new glyphosate resistance gene from a fungus cDNA library.

PubMed

Tao, Bo; Shao, Bai-Hui; Qiao, Yu-Xin; Wang, Xiao-Qin; Chang, Shu-Jun; Qiu, Li-Juan

2017-08-01

Glyphosate is a widely used broad spectrum herbicide; however, this limits its use once crops are planted. If glyphosate-resistant crops are grown, glyphosate can be used for weed control in crops. While several glyphosate resistance genes are used in commercial glyphosate tolerant crops, there is interest in identifying additional genes for glyphosate tolerance. This research constructed a high-quality cDNA library form the glyphosate-resistant fungus Aspergillus oryzae RIB40 to identify genes that may confer resistance to glyphosate. Using a medium containing glyphosate (120mM), we screened several clones from the library. Based on a nucleotide sequence analysis, we identified a gene of unknown function (GenBank accession number: XM_001826835.2) that encoded a hypothetical 344-amino acid protein. The gene was named MFS40. Its ORF was amplified to construct an expression vector, pGEX-4T-1-MFS40, to express the protein in Escherichia coli BL21. The gene conferred glyphosate tolerance to E. coli ER2799 cells. Copyright © 2017 Elsevier B.V. All rights reserved.
Recurrent Targeted Genes of Hepatitis B Virus in the Liver Cancer Genomes Identified by a Next-Generation Sequencing–Based Approach

PubMed Central

Ding, Dong; Lou, Xiaoyan; Hua, Dasong; Yu, Wei; Li, Lisha; Wang, Jun; Gao, Feng; Zhao, Na; Ren, Guoping; Li, Lanjuan; Lin, Biaoyang

2012-01-01

Integration of the viral DNA into host chromosomes was found in most of the hepatitis B virus (HBV)–related hepatocellular carcinomas (HCCs). Here we devised a massive anchored parallel sequencing (MAPS) method using next-generation sequencing to isolate and sequence HBV integrants. Applying MAPS to 40 pairs of HBV–related HCC tissues (cancer and adjacent tissues), we identified 296 HBV integration events corresponding to 286 unique integration sites (UISs) with precise HBV–Human DNA junctions. HBV integration favored chromosome 17 and preferentially integrated into human transcript units. HBV targeted genes were enriched in GO terms: cAMP metabolic processes, T cell differentiation and activation, TGF beta receptor pathway, ncRNA catabolic process, and dsRNA fragmentation and cellular response to dsRNA. The HBV targeted genes include 7 genes (PTPRJ, CNTN6, IL12B, MYOM1, FNDC3B, LRFN2, FN1) containing IPR003961 (Fibronectin, type III domain), 7 genes (NRG3, MASP2, NELL1, LRP1B, ADAM21, NRXN1, FN1) containing IPR013032 (EGF-like region, conserved site), and three genes (PDE7A, PDE4B, PDE11A) containing IPR002073 (3′, 5′-cyclic-nucleotide phosphodiesterase). Enriched pathways include hsa04512 (ECM-receptor interaction), hsa04510 (Focal adhesion), and hsa04012 (ErbB signaling pathway). Fewer integration events were found in cancers compared to cancer-adjacent tissues, suggesting a clonal expansion model in HCC development. Finally, we identified 8 genes that were recurrent target genes by HBV integration including fibronectin 1 (FN1) and telomerase reverse transcriptase (TERT1), two known recurrent target genes, and additional novel target genes such as SMAD family member 5 (SMAD5), phosphatase and actin regulator 4 (PHACTR4), and RNA binding protein fox-1 homolog (C. elegans) 1 (RBFOX1). Integrating analysis with recently published whole-genome sequencing analysis, we identified 14 additional recurrent HBV target genes, greatly expanding the HBV recurrent target list. This global survey of HBV integration events, together with recently published whole-genome sequencing analyses, furthered our understanding of the HBV–related HCC. PMID:23236287
Multiway real-time PCR gene expression profiling in yeast Saccharomyces cerevisiae reveals altered transcriptional response of ADH-genes to glucose stimuli.

PubMed

Ståhlberg, Anders; Elbing, Karin; Andrade-Garda, José Manuel; Sjögreen, Björn; Forootan, Amin; Kubista, Mikael

2008-04-16

The large sensitivity, high reproducibility and essentially unlimited dynamic range of real-time PCR to measure gene expression in complex samples provides the opportunity for powerful multivariate and multiway studies of biological phenomena. In multiway studies samples are characterized by their expression profiles to monitor changes over time, effect of treatment, drug dosage etc. Here we perform a multiway study of the temporal response of four yeast Saccharomyces cerevisiae strains with different glucose uptake rates upon altered metabolic conditions. We measured the expression of 18 genes as function of time after addition of glucose to four strains of yeast grown in ethanol. The data are analyzed by matrix-augmented PCA, which is a generalization of PCA for 3-way data, and the results are confirmed by hierarchical clustering and clustering by Kohonen self-organizing map. Our approach identifies gene groups that respond similarly to the change of nutrient, and genes that behave differently in mutant strains. Of particular interest is our finding that ADH4 and ADH6 show a behavior typical of glucose-induced genes, while ADH3 and ADH5 are repressed after glucose addition. Multiway real-time PCR gene expression profiling is a powerful technique which can be utilized to characterize functions of new genes by, for example, comparing their temporal response after perturbation in different genetic variants of the studied subject. The technique also identifies genes that show perturbed expression in specific strains.
Multiway real-time PCR gene expression profiling in yeast Saccharomyces cerevisiae reveals altered transcriptional response of ADH-genes to glucose stimuli

PubMed Central

Ståhlberg, Anders; Elbing, Karin; Andrade-Garda, José Manuel; Sjögreen, Björn; Forootan, Amin; Kubista, Mikael

2008-01-01

Background The large sensitivity, high reproducibility and essentially unlimited dynamic range of real-time PCR to measure gene expression in complex samples provides the opportunity for powerful multivariate and multiway studies of biological phenomena. In multiway studies samples are characterized by their expression profiles to monitor changes over time, effect of treatment, drug dosage etc. Here we perform a multiway study of the temporal response of four yeast Saccharomyces cerevisiae strains with different glucose uptake rates upon altered metabolic conditions. Results We measured the expression of 18 genes as function of time after addition of glucose to four strains of yeast grown in ethanol. The data are analyzed by matrix-augmented PCA, which is a generalization of PCA for 3-way data, and the results are confirmed by hierarchical clustering and clustering by Kohonen self-organizing map. Our approach identifies gene groups that respond similarly to the change of nutrient, and genes that behave differently in mutant strains. Of particular interest is our finding that ADH4 and ADH6 show a behavior typical of glucose-induced genes, while ADH3 and ADH5 are repressed after glucose addition. Conclusion Multiway real-time PCR gene expression profiling is a powerful technique which can be utilized to characterize functions of new genes by, for example, comparing their temporal response after perturbation in different genetic variants of the studied subject. The technique also identifies genes that show perturbed expression in specific strains. PMID:18412983
Mutant phenotypes for thousands of bacterial genes of unknown function

DOE PAGES

Price, Morgan N.; Wetmore, Kelly M.; Waters, R. Jordan; ...

2018-05-16

One-third of all protein-coding genes from bacterial genomes cannot be annotated with a function. Here, to investigate the functions of these genes, we present genome-wide mutant fitness data from 32 diverse bacteria across dozens of growth conditions. We identified mutant phenotypes for 11,779 protein-coding genes that had not been annotated with a specific function. Many genes could be associated with a specific condition because the gene affected fitness only in that condition, or with another gene in the same bacterium because they had similar mutant phenotypes. Of the poorly annotated genes, 2,316 had associations that have high confidence because theymore » are conserved in other bacteria. By combining these conserved associations with comparative genomics, we identified putative DNA repair proteins; in addition, we propose specific functions for poorly annotated enzymes and transporters and for uncharacterized protein families. Lastly, our study demonstrates the scalability of microbial genetics and its utility for improving gene annotations.« less
Mutant phenotypes for thousands of bacterial genes of unknown function

DOE Office of Scientific and Technical Information (OSTI.GOV)

Price, Morgan N.; Wetmore, Kelly M.; Waters, R. Jordan

One-third of all protein-coding genes from bacterial genomes cannot be annotated with a function. Here, to investigate the functions of these genes, we present genome-wide mutant fitness data from 32 diverse bacteria across dozens of growth conditions. We identified mutant phenotypes for 11,779 protein-coding genes that had not been annotated with a specific function. Many genes could be associated with a specific condition because the gene affected fitness only in that condition, or with another gene in the same bacterium because they had similar mutant phenotypes. Of the poorly annotated genes, 2,316 had associations that have high confidence because theymore » are conserved in other bacteria. By combining these conserved associations with comparative genomics, we identified putative DNA repair proteins; in addition, we propose specific functions for poorly annotated enzymes and transporters and for uncharacterized protein families. Lastly, our study demonstrates the scalability of microbial genetics and its utility for improving gene annotations.« less
Targeted sequencing-based analyses of candidate gene variants in ulcerative colitis-associated colorectal neoplasia.

PubMed

Chakrabarty, Sanjiban; Varghese, Vinay Koshy; Sahu, Pranoy; Jayaram, Pradyumna; Shivakumar, Bhadravathi M; Pai, Cannanore Ganesh; Satyamoorthy, Kapaettu

2017-06-27

Long-standing ulcerative colitis (UC) leading to colorectal cancer (CRC) is one of the most serious and life-threatening consequences acknowledged globally. Ulcerative colitis-associated colorectal carcinogenesis showed distinct molecular alterations when compared with sporadic colorectal carcinoma. Targeted sequencing of 409 genes in tissue samples of 18 long-standing UC subjects at high risk of colorectal carcinoma (UCHR) was performed to identify somatic driver mutations, which may be involved in the molecular changes during the transformation of non-dysplastic mucosa to high-grade dysplasia. Findings from the study are also compared with previously published genome wide and exome sequencing data in inflammatory bowel disease-associated and sporadic colorectal carcinoma. Next-generation sequencing analysis identified 1107 mutations in 275 genes in UCHR subjects. In addition to TP53 (17%) and KRAS (22%) mutations, recurrent mutations in APC (33%), ACVR2A (61%), ARID1A (44%), RAF1 (39%) and MTOR (61%) were observed in UCHR subjects. In addition, APC, FGFR3, FGFR2 and PIK3CA driver mutations were identified in UCHR subjects. Recurrent mutations in ARID1A (44%), SMARCA4 (17%), MLL2 (44%), MLL3 (67%), SETD2 (17%) and TET2 (50%) genes involved in histone modification and chromatin remodelling were identified in UCHR subjects. Our study identifies new oncogenic driver mutations which may be involved in the transition of non-dysplastic cells to dysplastic phenotype in the subjects with long-standing UC with high risk of progression into colorectal neoplasia.
Transcriptome-Wide Identification of Preferentially Expressed Genes in the Hypothalamus and Pituitary Gland

PubMed Central

St-Amand, Jonny; Yoshioka, Mayumi; Tanaka, Keitaro; Nishida, Yuichiro

2012-01-01

To identify preferentially expressed genes in the central endocrine organs of the hypothalamus and pituitary gland, we generated transcriptome-wide mRNA profiles of the hypothalamus, pituitary gland, and parietal cortex in male mice (12–15 weeks old) using serial analysis of gene expression (SAGE). Total counts of SAGE tags for the hypothalamus, pituitary gland, and parietal cortex were 165824, 126688, and 161045 tags, respectively. This represented 59244, 45151, and 55131 distinct tags, respectively. Comparison of these mRNA profiles revealed that 22 mRNA species, including three potential novel transcripts, were preferentially expressed in the hypothalamus. In addition to well-known hypothalamic transcripts, such as hypocretin, several genes involved in hormone function, intracellular transduction, metabolism, protein transport, steroidogenesis, extracellular matrix, and brain disease were identified as preferentially expressed hypothalamic transcripts. In the pituitary gland, 106 mRNA species, including 60 potential novel transcripts, were preferentially expressed. In addition to well-known pituitary genes, such as growth hormone and thyroid stimulating hormone beta, a number of genes classified to function in transport, amino acid metabolism, intracellular transduction, cell adhesion, disulfide bond formation, stress response, transcription, protein synthesis, and turnover, cell differentiation, the cell cycle, and in the cytoskeleton and extracellular matrix were also preferentially expressed. In conclusion, the current study identified not only well-known hypothalamic and pituitary transcripts but also a number of new candidates likely to be involved in endocrine homeostatic systems regulated by the hypothalamus and pituitary gland. PMID:22649398
Transcriptome-wide identification of preferentially expressed genes in the hypothalamus and pituitary gland.

PubMed

St-Amand, Jonny; Yoshioka, Mayumi; Tanaka, Keitaro; Nishida, Yuichiro

2011-01-01

To identify preferentially expressed genes in the central endocrine organs of the hypothalamus and pituitary gland, we generated transcriptome-wide mRNA profiles of the hypothalamus, pituitary gland, and parietal cortex in male mice (12-15 weeks old) using serial analysis of gene expression (SAGE). Total counts of SAGE tags for the hypothalamus, pituitary gland, and parietal cortex were 165824, 126688, and 161045 tags, respectively. This represented 59244, 45151, and 55131 distinct tags, respectively. Comparison of these mRNA profiles revealed that 22 mRNA species, including three potential novel transcripts, were preferentially expressed in the hypothalamus. In addition to well-known hypothalamic transcripts, such as hypocretin, several genes involved in hormone function, intracellular transduction, metabolism, protein transport, steroidogenesis, extracellular matrix, and brain disease were identified as preferentially expressed hypothalamic transcripts. In the pituitary gland, 106 mRNA species, including 60 potential novel transcripts, were preferentially expressed. In addition to well-known pituitary genes, such as growth hormone and thyroid stimulating hormone beta, a number of genes classified to function in transport, amino acid metabolism, intracellular transduction, cell adhesion, disulfide bond formation, stress response, transcription, protein synthesis, and turnover, cell differentiation, the cell cycle, and in the cytoskeleton and extracellular matrix were also preferentially expressed. In conclusion, the current study identified not only well-known hypothalamic and pituitary transcripts but also a number of new candidates likely to be involved in endocrine homeostatic systems regulated by the hypothalamus and pituitary gland.
Tissue-specific promoter utilisation of the kallikrein-related peptidase genes, KLK5 and KLK7, and cellular localisation of the encoded proteins suggest roles in exocrine pancreatic function.

PubMed

Dong, Ying; Matigian, Nick; Harvey, Tracey J; Samaratunga, Hemamali; Hooper, John D; Clements, Judith A

2008-02-01

Abstract Tissue kallikrein (kallikrein 1) was first identified in pancreas and is the namesake of the kallikrein-related peptidase (KLK) family. KLK1 and the other 14 members of the human KLK family are encoded by 15 serine protease genes clustered at chromosome 19q13.4. Our Northern blot analysis of 19 normal human tissues for expression of KLK4 to KLK15 identified pancreas as a common expression site for the gene cluster spanning KLK5 to KLK13, as well as for KLK15 which is located adjacent to KLK1. Consistent with previous reports detailing the ability of KLK genes to generate organ- and disease-specific transcripts, detailed molecular and in silico analyses indicated that KLK5 and KLK7 generate transcripts in pancreas variant from those in skin or ovary. Consistently, we identified in the promoters of these KLK genes motifs which conform with consensus binding sites for transcription factors conferring pancreatic expression. In addition, immunohistochemical analysis revealed predominant localisation of KLK5 and KLK7 in acinar cells of the exocrine pancreas, suggesting roles for these enzymes in digestion. Our data also support expression patterns derived from gene duplication events in the human KLK cluster. These findings suggest that, in addition to KLK1, other related KLK enzymes will function in the exocrine pancreas.
A Genome-Wide Association Analysis Reveals Epistatic Cancellation of Additive Genetic Variance for Root Length in Arabidopsis thaliana.

PubMed

Lachowiec, Jennifer; Shen, Xia; Queitsch, Christine; Carlborg, Örjan

2015-01-01

Efforts to identify loci underlying complex traits generally assume that most genetic variance is additive. Here, we examined the genetics of Arabidopsis thaliana root length and found that the genomic narrow-sense heritability for this trait in the examined population was statistically zero. The low amount of additive genetic variance that could be captured by the genome-wide genotypes likely explains why no associations to root length could be found using standard additive-model-based genome-wide association (GWA) approaches. However, as the broad-sense heritability for root length was significantly larger, and primarily due to epistasis, we also performed an epistatic GWA analysis to map loci contributing to the epistatic genetic variance. Four interacting pairs of loci were revealed, involving seven chromosomal loci that passed a standard multiple-testing corrected significance threshold. The genotype-phenotype maps for these pairs revealed epistasis that cancelled out the additive genetic variance, explaining why these loci were not detected in the additive GWA analysis. Small population sizes, such as in our experiment, increase the risk of identifying false epistatic interactions due to testing for associations with very large numbers of multi-marker genotypes in few phenotyped individuals. Therefore, we estimated the false-positive risk using a new statistical approach that suggested half of the associated pairs to be true positive associations. Our experimental evaluation of candidate genes within the seven associated loci suggests that this estimate is conservative; we identified functional candidate genes that affected root development in four loci that were part of three of the pairs. The statistical epistatic analyses were thus indispensable for confirming known, and identifying new, candidate genes for root length in this population of wild-collected A. thaliana accessions. We also illustrate how epistatic cancellation of the additive genetic variance explains the insignificant narrow-sense and significant broad-sense heritability by using a combination of careful statistical epistatic analyses and functional genetic experiments.
Gene-gene and gene-environment interactions: new insights into the prevention, detection and management of coronary artery disease.

PubMed

Lanktree, Matthew B; Hegele, Robert A

2009-02-26

Despite the recent success of genome-wide association studies (GWASs) in identifying loci consistently associated with coronary artery disease (CAD), a large proportion of the genetic components of CAD and its metabolic risk factors, including plasma lipids, type 2 diabetes and body mass index, remain unattributed. Gene-gene and gene-environment interactions might produce a meaningful improvement in quantification of the genetic determinants of CAD. Testing for gene-gene and gene-environment interactions is thus a new frontier for large-scale GWASs of CAD. There are several anecdotal examples of monogenic susceptibility to CAD in which the phenotype was worsened by an adverse environment. In addition, small-scale candidate gene association studies with functional hypotheses have identified gene-environment interactions. For future evaluation of gene-gene and gene-environment interactions to achieve the same success as the single gene associations reported in recent GWASs, it will be important to pre-specify agreed standards of study design and statistical power, environmental exposure measurement, phenomic characterization and analytical strategies. Here we discuss these issues, particularly in relation to the investigation and potential clinical utility of gene-gene and gene-environment interactions in CAD.
Application of an Efficient Gene Targeting System Linking Secondary Metabolites to their Biosynthetic Genes in Aspergillus terreus

DOE Office of Scientific and Technical Information (OSTI.GOV)

Guo, Chun-Jun; Knox, Benjamin P.; Sanchez, James F.

2013-07-19

Nonribosomal peptides (NRPs) are natural products biosynthesized by NRP synthetases. A kusA-, pyrG- mutant strain of Aspergillusterreus NIH 2624 was developed that greatly facilitated the gene targeting efficiency in this organism. Application of this tool allowed us to link four major types of NRP related secondary metabolites to their responsible genes in A. terreus. In addition, an NRP related melanin synthetase was also identified in this species.
A 16-Gene Signature Distinguishes Anaplastic Astrocytoma from Glioblastoma

PubMed Central

Rao, Soumya Alige Mahabala; Srinivasan, Sujaya; Patric, Irene Rosita Pia; Hegde, Alangar Sathyaranjandas; Chandramouli, Bangalore Ashwathnarayanara; Arimappamagan, Arivazhagan; Santosh, Vani; Kondaiah, Paturu; Rao, Manchanahalli R. Sathyanarayana; Somasundaram, Kumaravel

2014-01-01

Anaplastic astrocytoma (AA; Grade III) and glioblastoma (GBM; Grade IV) are diffusely infiltrating tumors and are called malignant astrocytomas. The treatment regimen and prognosis are distinctly different between anaplastic astrocytoma and glioblastoma patients. Although histopathology based current grading system is well accepted and largely reproducible, intratumoral histologic variations often lead to difficulties in classification of malignant astrocytoma samples. In order to obtain a more robust molecular classifier, we analysed RT-qPCR expression data of 175 differentially regulated genes across astrocytoma using Prediction Analysis of Microarrays (PAM) and found the most discriminatory 16-gene expression signature for the classification of anaplastic astrocytoma and glioblastoma. The 16-gene signature obtained in the training set was validated in the test set with diagnostic accuracy of 89%. Additionally, validation of the 16-gene signature in multiple independent cohorts revealed that the signature predicted anaplastic astrocytoma and glioblastoma samples with accuracy rates of 99%, 88%, and 92% in TCGA, GSE1993 and GSE4422 datasets, respectively. The protein-protein interaction network and pathway analysis suggested that the 16-genes of the signature identified epithelial-mesenchymal transition (EMT) pathway as the most differentially regulated pathway in glioblastoma compared to anaplastic astrocytoma. In addition to identifying 16 gene classification signature, we also demonstrated that genes involved in epithelial-mesenchymal transition may play an important role in distinguishing glioblastoma from anaplastic astrocytoma. PMID:24475040
Meta-analysis of gene expression profiles associated with histological classification and survival in 829 ovarian cancer samples.

PubMed

Fekete, Tibor; Rásó, Erzsébet; Pete, Imre; Tegze, Bálint; Liko, István; Munkácsy, Gyöngyi; Sipos, Norbert; Rigó, János; Györffy, Balázs

2012-07-01

Transcriptomic analysis of global gene expression in ovarian carcinoma can identify dysregulated genes capable to serve as molecular markers for histology subtypes and survival. The aim of our study was to validate previous candidate signatures in an independent setting and to identify single genes capable to serve as biomarkers for ovarian cancer progression. As several datasets are available in the GEO today, we were able to perform a true meta-analysis. First, 829 samples (11 datasets) were downloaded, and the predictive power of 16 previously published gene sets was assessed. Of these, eight were capable to discriminate histology subtypes, and none was capable to predict survival. To overcome the differences in previous studies, we used the 829 samples to identify new predictors. Then, we collected 64 ovarian cancer samples (median relapse-free survival 24.5 months) and performed TaqMan Real Time Polimerase Chain Reaction (RT-PCR) analysis for the best 40 genes associated with histology subtypes and survival. Over 90% of subtype-associated genes were confirmed. Overall survival was effectively predicted by hormone receptors (PGR and ESR2) and by TSPAN8. Relapse-free survival was predicted by MAPT and SNCG. In summary, we successfully validated several gene sets in a meta-analysis in large datasets of ovarian samples. Additionally, several individual genes identified were validated in a clinical cohort. Copyright © 2011 UICC.

Evolution of Prdm Genes in Animals: Insights from Comparative Genomics

PubMed Central

Vervoort, Michel; Meulemeester, David; Béhague, Julien; Kerner, Pierre

2016-01-01

Prdm genes encode transcription factors with a subtype of SET domain known as the PRDF1-RIZ (PR) homology domain and a variable number of zinc finger motifs. These genes are involved in a wide variety of functions during animal development. As most Prdm genes have been studied in vertebrates, especially in mice, little is known about the evolution of this gene family. We searched for Prdm genes in the fully sequenced genomes of 93 different species representative of all the main metazoan lineages. A total of 976 Prdm genes were identified in these species. The number of Prdm genes per species ranges from 2 to 19. To better understand how the Prdm gene family has evolved in metazoans, we performed phylogenetic analyses using this large set of identified Prdm genes. These analyses allowed us to define 14 different subfamilies of Prdm genes and to establish, through ancestral state reconstruction, that 11 of them are ancestral to bilaterian animals. Three additional subfamilies were acquired during early vertebrate evolution (Prdm5, Prdm11, and Prdm17). Several gene duplication and gene loss events were identified and mapped onto the metazoan phylogenetic tree. By studying a large number of nonmetazoan genomes, we confirmed that Prdm genes likely constitute a metazoan-specific gene family. Our data also suggest that Prdm genes originated before the diversification of animals through the association of a single ancestral SET domain encoding gene with one or several zinc finger encoding genes. PMID:26560352
Characterization and Comparative Overview of Complete Sequences of the First Plasmids of Pandoraea across Clinical and Non-clinical Strains

PubMed Central

Yong, Delicia; Tee, Kok Keng; Yin, Wai-Fong; Chan, Kok-Gan

2016-01-01

To date, information on plasmid analysis in Pandoraea spp. is scarce. To address the gap of knowledge on this, the complete sequences of eight plasmids from Pandoraea spp. namely Pandoraea faecigallinarum DSM 23572T (pPF72-1, pPF72-2), Pandoraea oxalativorans DSM 23570T (pPO70-1, pPO70-2, pPO70-3, pPO70-4), Pandoraea vervacti NS15 (pPV15) and Pandoraea apista DSM 16535T (pPA35) were studied for the first time in this study. The information on plasmid sequences in Pandoraea spp. is useful as the sequences did not match any known plasmid sequence deposited in public databases. Replication genes were not identified in some plasmids, a situation that has led to the possibility of host interaction involvement. Some plasmids were also void of par genes and intriguingly, repA gene was also not discovered in these plasmids. This further leads to the hypothesis of host-plasmid interaction. Plasmid stabilization/stability protein-encoding genes were observed in some plasmids but were not established for participating in plasmid segregation. Toxin-antitoxin systems MazEF, VapBC, RelBE, YgiT-MqsR, HigBA, and ParDE were identified across the plasmids and their presence would improve plasmid maintenance. Conjugation genes were identified portraying the conjugation ability amongst Pandoraea plasmids. Additionally, we found a shared region amongst some of the plasmids that consists of conjugation genes. The identification of genes involved in replication, segregation, toxin-antitoxin systems and conjugation, would aid the design of drugs to prevent the survival or transmission of plasmids carrying pathogenic properties. Additionally, genes conferring virulence and antibiotic resistance were identified amongst the plasmids. The observed features in the plasmids shed light on the Pandoraea spp. as opportunistic pathogens. PMID:27790203
A novel bioinformatics pipeline to discover genes related to arbuscular mycorrhizal symbiosis based on their evolutionary conservation pattern among higher plants.

PubMed

Favre, Patrick; Bapaume, Laure; Bossolini, Eligio; Delorenzi, Mauro; Falquet, Laurent; Reinhardt, Didier

2014-12-03

Genes involved in arbuscular mycorrhizal (AM) symbiosis have been identified primarily by mutant screens, followed by identification of the mutated genes (forward genetics). In addition, a number of AM-related genes has been identified by their AM-related expression patterns, and their function has subsequently been elucidated by knock-down or knock-out approaches (reverse genetics). However, genes that are members of functionally redundant gene families, or genes that have a vital function and therefore result in lethal mutant phenotypes, are difficult to identify. If such genes are constitutively expressed and therefore escape differential expression analyses, they remain elusive. The goal of this study was to systematically search for AM-related genes with a bioinformatics strategy that is insensitive to these problems. The central element of our approach is based on the fact that many AM-related genes are conserved only among AM-competent species. Our approach involves genome-wide comparisons at the proteome level of AM-competent host species with non-mycorrhizal species. Using a clustering method we first established orthologous/paralogous relationships and subsequently identified protein clusters that contain members only of the AM-competent species. Proteins of these clusters were then analyzed in an extended set of 16 plant species and ranked based on their relatedness among AM-competent monocot and dicot species, relative to non-mycorrhizal species. In addition, we combined the information on the protein-coding sequence with gene expression data and with promoter analysis. As a result we present a list of yet uncharacterized proteins that show a strongly AM-related pattern of sequence conservation, indicating that the respective genes may have been under selection for a function in AM. Among the top candidates are three genes that encode a small family of similar receptor-like kinases that are related to the S-locus receptor kinases involved in sporophytic self-incompatibility. We present a new systematic strategy of gene discovery based on conservation of the protein-coding sequence that complements classical forward and reverse genetics. This strategy can be applied to diverse other biological phenomena if species with established genome sequences fall into distinguished groups that differ in a defined functional trait of interest.
A Penalized Robust Method for Identifying Gene-Environment Interactions

PubMed Central

Shi, Xingjie; Liu, Jin; Huang, Jian; Zhou, Yong; Xie, Yang; Ma, Shuangge

2015-01-01

In high-throughput studies, an important objective is to identify gene-environment interactions associated with disease outcomes and phenotypes. Many commonly adopted methods assume specific parametric or semiparametric models, which may be subject to model mis-specification. In addition, they usually use significance level as the criterion for selecting important interactions. In this study, we adopt the rank-based estimation, which is much less sensitive to model specification than some of the existing methods and includes several commonly encountered data and models as special cases. Penalization is adopted for the identification of gene-environment interactions. It achieves simultaneous estimation and identification and does not rely on significance level. For computation feasibility, a smoothed rank estimation is further proposed. Simulation shows that under certain scenarios, for example with contaminated or heavy-tailed data, the proposed method can significantly outperform the existing alternatives with more accurate identification. We analyze a lung cancer prognosis study with gene expression measurements under the AFT (accelerated failure time) model. The proposed method identifies interactions different from those using the alternatives. Some of the identified genes have important implications. PMID:24616063
Evaluation of genome-wide association study results through development of ontology fingerprints

PubMed Central

Tsoi, Lam C.; Boehnke, Michael; Klein, Richard L.; Zheng, W. Jim

2009-01-01

Motivation: Genome-wide association (GWA) studies may identify multiple variants that are associated with a disease or trait. To narrow down candidates for further validation, quantitatively assessing how identified genes relate to a phenotype of interest is important. Results: We describe an approach to characterize genes or biological concepts (phenotypes, pathways, diseases, etc.) by ontology fingerprint—the set of Gene Ontology (GO) terms that are overrepresented among the PubMed abstracts discussing the gene or biological concept together with the enrichment p-value of these terms generated from a hypergeometric enrichment test. We then quantify the relevance of genes to the trait from a GWA study by calculating similarity scores between their ontology fingerprints using enrichment p-values. We validate this approach by correctly identifying corresponding genes for biological pathways with a 90% average area under the ROC curve (AUC). We applied this approach to rank genes identified through a GWA study that are associated with the lipid concentrations in plasma as well as to prioritize genes within linkage disequilibrium (LD) block. We found that the genes with highest scores were: ABCA1, lipoprotein lipase (LPL) and cholesterol ester transfer protein, plasma for high-density lipoprotein; low-density lipoprotein receptor, APOE and APOB for low-density lipoprotein; and LPL, APOA1 and APOB for triglyceride. In addition, we identified genes relevant to lipid metabolism from the literature even in cases where such knowledge was not reflected in current annotation of these genes. These results demonstrate that ontology fingerprints can be used effectively to prioritize genes from GWA studies for experimental validation. Contact: zhengw@musc.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:19349285
Identification of new participants in the rainbow trout (Oncorhynchus mykiss) oocyte maturation and ovulation processes using cDNA microarrays

PubMed Central

Bobe, Julien; Montfort, Jerôme; Nguyen, Thaovi; Fostier, Alexis

2006-01-01

Background The hormonal control of oocyte maturation and ovulation as well as the molecular mechanisms of nuclear maturation have been thoroughly studied in fish. In contrast, the other molecular events occurring in the ovary during post-vitellogenesis have received far less attention. Methods Nylon microarrays displaying 9152 rainbow trout cDNAs were hybridized using RNA samples originating from ovarian tissue collected during late vitellogenesis, post-vitellogenesis and oocyte maturation. Differentially expressed genes were identified using a statistical analysis. A supervised clustering analysis was performed using only differentially expressed genes in order to identify gene clusters exhibiting similar expression profiles. In addition, specific genes were selected and their preovulatory ovarian expression was analyzed using real-time PCR. Results From the statistical analysis, 310 differentially expressed genes were identified. Among those genes, 90 were up-regulated at the time of oocyte maturation while 220 exhibited an opposite pattern. After clustering analysis, 90 clones belonging to 3 gene clusters exhibiting the most remarkable expression patterns were kept for further analysis. Using real-time PCR analysis, we observed a strong up-regulation of ion and water transport genes such as aquaporin 4 (aqp4) and pendrin (slc26). In addition, a dramatic up-regulation of vasotocin (avt) gene was observed. Furthermore, angiotensin-converting-enzyme 2 (ace2), coagulation factor V (cf5), adam 22, and the chemokine cxcl14 genes exhibited a sharp up-regulation at the time of oocyte maturation. Finally, ovarian aromatase (cyp19a1) exhibited a dramatic down-regulation over the post-vitellogenic period while a down-regulation of Cytidine monophosphate-N-acetylneuraminic acid hydroxylase (cmah) was observed at the time of oocyte maturation. Conclusion We showed the over or under expression of more that 300 genes, most of them being previously unstudied or unknown in the fish preovulatory ovary. Our data confirmed the down-regulation of estrogen synthesis genes during the preovulatory period. In addition, the strong up-regulation of aqp4 and slc26 genes prior to ovulation suggests their participation in the oocyte hydration process occurring at that time. Furthermore, among the most up-regulated clones, several genes such as cxcl14, ace2, adam22, cf5 have pro-inflammatory, vasodilatory, proteolytics and coagulatory functions. The identity and expression patterns of those genes support the theory comparing ovulation to an inflammatory-like reaction. PMID:16872517
A genome-wide association study reveals candidate genes for the supernumerary nipple phenotype in sheep (Ovis aries).

PubMed

Peng, W-F; Xu, S-S; Ren, X; Lv, F-H; Xie, X-L; Zhao, Y-X; Zhang, M; Shen, Z-Q; Ren, Y-L; Gao, L; Shen, M; Kantanen, J; Li, M-H

2017-10-01

Genome-wide association studies (GWASs) have been widely applied in livestock to identify genes associated with traits of economic interest. Here, we conducted the first GWAS of the supernumerary nipple phenotype in Wadi sheep, a native Chinese sheep breed, based on Ovine Infinium HD SNP BeadChip genotypes in a total of 144 ewes (75 cases with four teats, including two normal and two supernumerary teats, and 69 control cases with two teats). We detected 63 significant SNPs at the chromosome-wise threshold. Additionally, one candidate region (chr1: 170.723-170.734 Mb) was identified by haplotype-based association tests, with one SNP (rs413490006) surrounding functional genes BBX and CD47 on chromosome 1 being commonly identified as significant by the two mentioned analyses. Moreover, Gene Ontology enrichment for the significant SNPs identified by the GWAS analysis was functionally clustered into the categories of receptor activity and synaptic membrane. In addition, pathway mapping revealed four promising pathways (Wnt, oxytocin, MAPK and axon guidance) involved in the development of the supernumerary nipple phenotype. Our results provide novel and important insights into the genetic mechanisms underlying the phenotype of supernumerary nipples in mammals, including humans. These findings may be useful for future breeding and genetics in sheep and other livestock. © 2017 Stichting International Foundation for Animal Genetics.
Genome-wide association analysis of age-at-onset in Alzheimer’s disease

PubMed Central

Kamboh, M. Ilyas; Barmada, M. Michael; Demirci, F. Yesim; Minster, Ryan L.; Carrasquillo, Minerva M.; Pankratz, V. Shane; Younkin, Steven G.; Saykin, Andrew J.; Sweet, Robert A.; Feingold, Eleanor; DeKosky, Steven T.; Lopez, Oscar L.

2011-01-01

The risk of Alzheimer’s disease (AD) is strongly determined by genetic factors and recent genome-wide association studies (GWAS) have identified several genes for the disease risk. In addition to the disease risk, age-at-onset (AAO) of AD has also strong genetic component with an estimated heritability of 42%. Identification of AAO genes may help to understand the biological mechanisms that regulate the onset of the disease. Here we report the first GWAS focused on identifying genes for the AAO of AD. We performed a genome-wide meta analysis on 3 samples comprising a total of 2,222 AD cases. A total of ~2.5 million directly genotyped or imputed SNPs were analyzed in relation to AAO of AD. As expected, the most significant associations were observed in the APOE region on chromosome 19 where several SNPs surpassed the conservative genome-wide significant threshold (P<5E-08). The most significant SNP outside the APOE region was located in the DCHS2 gene on chromosome 4q31.3 (rs1466662; P=4.95E-07). There were 19 additional significant SNPs in this region at P<1E-04 and the DCHS2 gene is expressed in the cerebral cortex and thus is a potential candidate for affecting AAO in AD. These findings need to be confirmed in additional well-powered samples. PMID:22005931
Genome-wide association analysis of age-at-onset in Alzheimer's disease.

PubMed

Kamboh, M I; Barmada, M M; Demirci, F Y; Minster, R L; Carrasquillo, M M; Pankratz, V S; Younkin, S G; Saykin, A J; Sweet, R A; Feingold, E; DeKosky, S T; Lopez, O L

2012-12-01

The risk of Alzheimer's disease (AD) is strongly determined by genetic factors and recent genome-wide association studies (GWAS) have identified several genes for the disease risk. In addition to the disease risk, age-at-onset (AAO) of AD has also strong genetic component with an estimated heritability of 42%. Identification of AAO genes may help to understand the biological mechanisms that regulate the onset of the disease. Here we report the first GWAS focused on identifying genes for the AAO of AD. We performed a genome-wide meta-analysis on three samples comprising a total of 2222 AD cases. A total of ~2.5 million directly genotyped or imputed single-nucleotide polymorphisms (SNPs) were analyzed in relation to AAO of AD. As expected, the most significant associations were observed in the apolipoprotein E (APOE) region on chromosome 19 where several SNPs surpassed the conservative genome-wide significant threshold (P<5E-08). The most significant SNP outside the APOE region was located in the DCHS2 gene on chromosome 4q31.3 (rs1466662; P=4.95E-07). There were 19 additional significant SNPs in this region at P<1E-04 and the DCHS2 gene is expressed in the cerebral cortex and thus is a potential candidate for affecting AAO in AD. These findings need to be confirmed in additional well-powered samples.
The 2p21 deletion syndrome: characterization of the transcription content.

PubMed

Parvari, Ruti; Gonen, Yael; Alshafee, Ismael; Buriakovsky, Sophia; Regev, Kfir; Hershkovitz, Eli

2005-08-01

The vast majority of small-deletion syndromes are caused by haploinsufficiency of one or several genes and are transmitted as dominant traits. We have previously identified a homozygous deletion of 179,311 bp on chromosome 2p21 as the cause of a unique syndrome, inherited in a recessive mode, consisting of cystinuria, neonatal seizures, hypotonia, severe somatic and developmental delay, facial dysmorphism, and reduced activity of all the respiratory chain enzymatic complexes that are encoded in the mitochondria. We now present the transcription content of this region: Multiple splicing variants of the genes protein phosphatase 1B (formerly 2C) magnesium-dependent, beta isoform (PPM1B), SLC3A1, and KIAA0436 (approved gene symbol PREPL) were identified and their patterns of expression analyzed. The spliced variants are predicted to have additional functions compared to the known variants and their patterns of expression fit the tissues affected by the syndrome. The first exon of an additional gene (C2orf34) is encoded in the deleted region and the gene is not expressed in the patients. In addition several transcripts with very short open reading frames are also encoded in the deletion. The identification of all transcripts encoded in the region deleted in the patients is the first step in the study of the genotype-phenotype correlation of the 2p21 patients.
De novo assembly and analysis of the Artemisia argyi transcriptome and identification of genes involved in terpenoid biosynthesis.

PubMed

Liu, Miaomiao; Zhu, Jinhang; Wu, Shengbing; Wang, Chenkai; Guo, Xingyi; Wu, Jiawen; Zhou, Meiqi

2018-04-11

Artemisia argyi Lev. et Vant. (A. argyi) is widely utilized for moxibustion in Chinese medicine, and the mechanism underlying terpenoid biosynthesis in its leaves is suggested to play an important role in its medicinal use. However, the A. argyi transcriptome has not been sequenced. Herein, we performed RNA sequencing for A. argyi leaf, root and stem tissues to identify as many as possible of the transcribed genes. In total, 99,807 unigenes were assembled by analysing the expression profiles generated from the three tissue types, and 67,446 of those unigenes were annotated in public databases. We further performed differential gene expression analysis to compare leaf tissue with the other two tissue types and identified numerous genes that were specifically expressed or up-regulated in leaf tissue. Specifically, we identified multiple genes encoding significant enzymes or transcription factors related to terpenoid synthesis. This study serves as a valuable resource for transcriptome information, as many transcribed genes related to terpenoid biosynthesis were identified in the A. argyi transcriptome, providing a functional genomic basis for additional studies on molecular mechanisms underlying the medicinal use of A. argyi.
Cracking the genomic piggy bank: identifying secrets of the pig genome.

PubMed

Mote, B E; Rothschild, M F

2006-01-01

Though researchers are uncovering valuable information about the pig genome at unprecedented speed, the porcine genome community is barely scratching the surface as to understanding interactions of the biological code. The pig genetic linkage map has nearly 5,000 loci comprised of genes, microsatellites, and amplified fragment length polymorphism markers. Likewise, the physical map is becoming denser with nearly 6,000 markers. The long awaited sequencing efforts are providing multidimensional benefits with sequence available for comparative genomics and identifying single nucleotide polymorphisms for use in linkage and trait association studies. Scientists are using exotic and commercial breeds for quantitative trait loci scans. Additionally, candidate gene studies continue to identify chromosomal regions or genes associated with economically important traits such as growth rate, leanness, feed intake, meat quality, litter size, and disease resistance. The commercial pig industry is actively incorporating these markers in marker-assisted selection along with traditional performance information to improve said traits. Researchers are utilizing novel tools including pig microarrays along with advanced bioinformatics to identify new candidate genes, understand gene function, and piece together gene networks involved in important biological processes. Advances in pig genomics and implications to the pork industry as well as human health are reviewed.
An Unbiased Systems Genetics Approach to Mapping Genetic Loci Modulating Susceptibility to Severe Streptococcal Sepsis

PubMed Central

Abdeltawab, Nourtan F.; Aziz, Ramy K.; Kansal, Rita; Rowe, Sarah L.; Su, Yin; Gardner, Lidia; Brannen, Charity; Nooh, Mohammed M.; Attia, Ramy R.; Abdelsamed, Hossam A.; Taylor, William L.; Lu, Lu; Williams, Robert W.; Kotb, Malak

2008-01-01

Striking individual differences in severity of group A streptococcal (GAS) sepsis have been noted, even among patients infected with the same bacterial strain. We had provided evidence that HLA class II allelic variation contributes significantly to differences in systemic disease severity by modulating host responses to streptococcal superantigens. Inasmuch as the bacteria produce additional virulence factors that participate in the pathogenesis of this complex disease, we sought to identify additional gene networks modulating GAS sepsis. Accordingly, we applied a systems genetics approach using a panel of advanced recombinant inbred mice. By analyzing disease phenotypes in the context of mice genotypes we identified a highly significant quantitative trait locus (QTL) on Chromosome 2 between 22 and 34 Mb that strongly predicts disease severity, accounting for 25%–30% of variance. This QTL harbors several polymorphic genes known to regulate immune responses to bacterial infections. We evaluated candidate genes within this QTL using multiple parameters that included linkage, gene ontology, variation in gene expression, cocitation networks, and biological relevance, and identified interleukin1 alpha and prostaglandin E synthases pathways as key networks involved in modulating GAS sepsis severity. The association of GAS sepsis with multiple pathways underscores the complexity of traits modulating GAS sepsis and provides a powerful approach for analyzing interactive traits affecting outcomes of other infectious diseases. PMID:18421376
Comprehensive genomic analysis identifies pathogenic variants in maturity-onset diabetes of the young (MODY) patients in South India.

PubMed

Mohan, Viswanathan; Radha, Venkatesan; Nguyen, Thong T; Stawiski, Eric W; Pahuja, Kanika Bajaj; Goldstein, Leonard D; Tom, Jennifer; Anjana, Ranjit Mohan; Kong-Beltran, Monica; Bhangale, Tushar; Jahnavi, Suresh; Chandni, Radhakrishnan; Gayathri, Vijay; George, Paul; Zhang, Na; Murugan, Sakthivel; Phalke, Sameer; Chaudhuri, Subhra; Gupta, Ravi; Zhang, Jingli; Santhosh, Sam; Stinson, Jeremy; Modrusan, Zora; Ramprasad, V L; Seshagiri, Somasekar; Peterson, Andrew S

2018-02-13

Maturity-onset diabetes of the young (MODY) is an early-onset, autosomal dominant form of non-insulin dependent diabetes. Genetic diagnosis of MODY can transform patient management. Earlier data on the genetic predisposition to MODY have come primarily from familial studies in populations of European origin. In this study, we carried out a comprehensive genomic analysis of 289 individuals from India that included 152 clinically diagnosed MODY cases to identify variants in known MODY genes. Further, we have analyzed exome data to identify putative MODY relevant variants in genes previously not implicated in MODY. Functional validation of MODY relevant variants was also performed. We found MODY 3 (HNF1A; 7.2%) to be most frequently mutated followed by MODY 12 (ABCC8; 3.3%). They together account for ~ 11% of the cases. In addition to known MODY genes, we report the identification of variants in RFX6, WFS1, AKT2, NKX6-1 that may contribute to development of MODY. Functional assessment of the NKX6-1 variants showed that they are functionally impaired. Our findings showed HNF1A and ABCC8 to be the most frequently mutated MODY genes in south India. Further we provide evidence for additional MODY relevant genes, such as NKX6-1, and these require further validation.
A functional screen for copper homeostasis genes identifies a pharmacologically tractable cellular system

PubMed Central

2014-01-01

Background Copper is essential for the survival of aerobic organisms. If copper is not properly regulated in the body however, it can be extremely cytotoxic and genetic mutations that compromise copper homeostasis result in severe clinical phenotypes. Understanding how cells maintain optimal copper levels is therefore highly relevant to human health. Results We found that addition of copper (Cu) to culture medium leads to increased respiratory growth of yeast, a phenotype which we then systematically and quantitatively measured in 5050 homozygous diploid deletion strains. Cu’s positive effect on respiratory growth was quantitatively reduced in deletion strains representing 73 different genes, the function of which identify increased iron uptake as a cause of the increase in growth rate. Conversely, these effects were enhanced in strains representing 93 genes. Many of these strains exhibited respiratory defects that were specifically rescued by supplementing the growth medium with Cu. Among the genes identified are known and direct regulators of copper homeostasis, genes required to maintain low vacuolar pH, and genes where evidence supporting a functional link with Cu has been heretofore lacking. Roughly half of the genes are conserved in man, and several of these are associated with Mendelian disorders, including the Cu-imbalance syndromes Menkes and Wilson’s disease. We additionally demonstrate that pharmacological agents, including the approved drug disulfiram, can rescue Cu-deficiencies of both environmental and genetic origin. Conclusions A functional screen in yeast has expanded the list of genes required for Cu-dependent fitness, revealing a complex cellular system with implications for human health. Respiratory fitness defects arising from perturbations in this system can be corrected with pharmacological agents that increase intracellular copper concentrations. PMID:24708151
EG-05COMBINATION OF GENE COPY GAIN AND EPIGENETIC DEREGULATION ARE ASSOCIATED WITH THE ABERRANT EXPRESSION OF A STEM CELL RELATED HOX-SIGNATURE IN GLIOBLASTOMA

PubMed Central

Kurscheid, Sebastian; Bady, Pierre; Sciuscio, Davide; Samarzija, Ivana; Shay, Tal; Vassallo, Irene; Van Criekinge, Wim; Domany, Eytan; Stupp, Roger; Delorenzi, Mauro; Hegi, Monika

2014-01-01

We previously reported a stem cell related HOX gene signature associated with resistance to chemo-radiotherapy (TMZ/RT- > TMZ) in glioblastoma. However, underlying mechanisms triggering overexpression remain mostly elusive. Interestingly, HOX genes are neither involved in the developing brain, nor expressed in normal brain, suggestive of an acquired gene expression signature during gliomagenesis. HOXA genes are located on CHR 7 that displays trisomy in most glioblastoma which strongly impacts gene expression on this chromosome, modulated by local regulatory elements. Furthermore we observed more pronounced DNA methylation across the HOXA locus as compared to non-tumoral brain (Human methylation 450K BeadChip Illumina; 59 glioblastoma, 5 non-tumoral brain sampes). CpG probes annotated for HOX-signature genes, contributing most to the variability, served as input into the analysis of DNA methylation and expression to identify key regulatory regions. The structural similarity of the observed correlation matrices between DNA methylation and gene expression in our cohort and an independent data-set from TCGA (106 glioblastoma) was remarkable (RV-coefficient, 0.84; p-value < 0.0001). We identified a CpG located in the promoter region of the HOXA10 locus exerting the strongest mean negative correlation between methylation and expression of the whole HOX-signature. Applying this analysis the same CpG emerged in the external set. We then determined the contribution of both, gene copy aberration (CNA) and methylation at the selected probe to explain expression of the HOX-signature using a linear model. Statistically significant results suggested an additive effect between gene dosage and methylation at the key CpG identified. Similarly, such an additive effect was also observed in the external data-set. Taken together, we hypothesize that overexpression of the stem-cell related HOX signature is triggered by gain of trisomy 7 and escape from compensatory DNA methylation at positions controlling the effect of enhanced gene dose on expression.
Genomic Organization, Phylogenetic Comparison and Differential Expression of the SBP-Box Family Genes in Grape

PubMed Central

Hou, Hongmin; Li, Jun; Gao, Min; Singer, Stacy D.; Wang, Hao; Mao, Linyong; Fei, Zhangjun; Wang, Xiping

2013-01-01

Background The SBP-box gene family is specific to plants and encodes a class of zinc finger-containing transcription factors with a broad range of functions. Although SBP-box genes have been identified in numerous plants including green algae, moss, silver birch, snapdragon, Arabidopsis, rice and maize, there is little information concerning SBP-box genes, or the corresponding miR156/157, function in grapevine. Methodology/Principal Findings Eighteen SBP-box gene family members were identified in Vitis vinifera, twelve of which bore sequences that were complementary to miRNA156/157. Phylogenetic reconstruction demonstrated that plant SBP-domain proteins could be classified into seven subgroups, with the V. vinifera SBP-domain proteins being more closely related to SBP-domain proteins from dicotyledonous angiosperms than those from monocotyledonous angiosperms. In addition, synteny analysis between grape and Arabidopsis demonstrated that homologs of several grape SBP genes were found in corresponding syntenic blocks of Arabidopsis. Expression analysis of the grape SBP-box genes in various organs and at different stages of fruit development in V. quinquangularis ‘Shang-24’ revealed distinct spatiotemporal patterns. While the majority of the grape SBP-box genes lacking a miR156/157 target site were expressed ubiquitously and constitutively, most genes bearing a miR156/157 target site exhibited distinct expression patterns, possibly due to the inhibitory role of the microRNA. Furthermore, microarray data mining and quantitative real-time RT-PCR analysis identified several grape SBP-box genes that are potentially involved in the defense against biotic and abiotic stresses. Conclusion The results presented here provide a further understanding of SBP-box gene function in plants, and yields additional insights into the mechanism of stress management in grape, which may have important implications for the future success of this crop. PMID:23527172
Knowledge-guided gene prioritization reveals new insights into the mechanisms of chemoresistance.

PubMed

Emad, Amin; Cairns, Junmei; Kalari, Krishna R; Wang, Liewei; Sinha, Saurabh

2017-08-11

Identification of genes whose basal mRNA expression predicts the sensitivity of tumor cells to cytotoxic treatments can play an important role in individualized cancer medicine. It enables detailed characterization of the mechanism of action of drugs. Furthermore, screening the expression of these genes in the tumor tissue may suggest the best course of chemotherapy or a combination of drugs to overcome drug resistance. We developed a computational method called ProGENI to identify genes most associated with the variation of drug response across different individuals, based on gene expression data. In contrast to existing methods, ProGENI also utilizes prior knowledge of protein-protein and genetic interactions, using random walk techniques. Analysis of two relatively new and large datasets including gene expression data on hundreds of cell lines and their cytotoxic responses to a large compendium of drugs reveals a significant improvement in prediction of drug sensitivity using genes identified by ProGENI compared to other methods. Our siRNA knockdown experiments on ProGENI-identified genes confirmed the role of many new genes in sensitivity to three chemotherapy drugs: cisplatin, docetaxel, and doxorubicin. Based on such experiments and extensive literature survey, we demonstrate that about 73% of our top predicted genes modulate drug response in selected cancer cell lines. In addition, global analysis of genes associated with groups of drugs uncovered pathways of cytotoxic response shared by each group. Our results suggest that knowledge-guided prioritization of genes using ProGENI gives new insight into mechanisms of drug resistance and identifies genes that may be targeted to overcome this phenomenon.
Integrative analysis for identification of shared markers from various functional cells/tissues for rheumatoid arthritis.

PubMed

Xia, Wei; Wu, Jian; Deng, Fei-Yan; Wu, Long-Fei; Zhang, Yong-Hong; Guo, Yu-Fan; Lei, Shu-Feng

2017-02-01

Rheumatoid arthritis (RA) is a systemic autoimmune disease. So far, it is unclear whether there exist common RA-related genes shared in different tissues/cells. In this study, we conducted an integrative analysis on multiple datasets to identify potential shared genes that are significant in multiple tissues/cells for RA. Seven microarray gene expression datasets representing various RA-related tissues/cells were downloaded from the Gene Expression Omnibus (GEO). Statistical analyses, testing both marginal and joint effects, were conducted to identify significant genes shared in various samples. Followed-up analyses were conducted on functional annotation clustering analysis, protein-protein interaction (PPI) analysis, gene-based association analysis, and ELISA validation analysis in in-house samples. We identified 18 shared significant genes, which were mainly involved in the immune response and chemokine signaling pathway. Among the 18 genes, eight genes (PPBP, PF4, HLA-F, S100A8, RNASEH2A, P2RY6, JAG2, and PCBP1) interact with known RA genes. Two genes (HLA-F and PCBP1) are significant in gene-based association analysis (P = 1.03E-31, P = 1.30E-2, respectively). Additionally, PCBP1 also showed differential protein expression levels in in-house case-control plasma samples (P = 2.60E-2). This study represented the first effort to identify shared RA markers from different functional cells or tissues. The results suggested that one of the shared genes, i.e., PCBP1, is a promising biomarker for RA.
Next-generation sequencing to solve complex inherited retinal dystrophy: A case series of multiple genes contributing to disease in extended families.

PubMed

Jones, Kaylie D; Wheaton, Dianna K; Bowne, Sara J; Sullivan, Lori S; Birch, David G; Chen, Rui; Daiger, Stephen P

2017-01-01

With recent availability of next-generation sequencing (NGS), it is becoming more common to pursue disease-targeted panel testing rather than traditional sequential gene-by-gene dideoxy sequencing. In this report, we describe using NGS to identify multiple disease-causing mutations that contribute concurrently or independently to retinal dystrophy in three relatively small families. Family members underwent comprehensive visual function evaluations, and genetic counseling including a detailed family history. A preliminary genetic inheritance pattern was assigned and updated as additional family members were tested. Family 1 (FAM1) and Family 2 (FAM2) were clinically diagnosed with retinitis pigmentosa (RP) and had a suspected autosomal dominant pedigree with non-penetrance (n.p.). Family 3 (FAM3) consisted of a large family with a diagnosis of RP and an overall dominant pedigree, but the proband had phenotypically cone-rod dystrophy. Initial genetic analysis was performed on one family member with traditional Sanger single gene sequencing and/or panel-based testing, and ultimately, retinal gene-targeted NGS was required to identify the underlying cause of disease for individuals within the three families. Results obtained in these families necessitated further genetic and clinical testing of additional family members to determine the complex genetic and phenotypic etiology of each family. Genetic testing of FAM1 (n = 4 affected; 1 n.p.) identified a dominant mutation in RP1 (p.Arg677Ter) that was present for two of the four affected individuals but absent in the proband and the presumed non-penetrant individual. Retinal gene-targeted NGS in the fourth affected family member revealed compound heterozygous mutations in USH2A (p. Cys419Phe, p.Glu767Serfs*21). Genetic testing of FAM2 (n = 3 affected; 1 n.p.) identified three retinal dystrophy genes ( PRPH2 , PRPF8 , and USH2A ) with disease-causing mutations in varying combinations among the affected family members. Genetic testing of FAM3 (n = 7 affected) identified a mutation in PRPH2 (p.Pro216Leu) tracking with disease in six of the seven affected individuals. Additional retinal gene-targeted NGS testing determined that the proband also harbored a multiple exon deletion in the CRX gene likely accounting for her cone-rod phenotype; her son harbored only the mutation in CRX , not the familial mutation in PRPH2 . Multiple genes contributing to the retinal dystrophy genotypes within a family were discovered using retinal gene-targeted NGS. Families with noted examples of phenotypic variation or apparent non-penetrant individuals may offer a clue to suspect complex inheritance. Furthermore, this finding underscores that caution should be taken when attributing a single gene disease-causing mutation (or inheritance pattern) to a family as a whole. Identification of a disease-causing mutation in a proband, even with a clear inheritance pattern in hand, may not be sufficient for targeted, known mutation analysis in other family members.

Genome-Wide Identification, Characterization and Expression Analysis of the Chalcone Synthase Family in Maize

PubMed Central

Han, Yahui; Ding, Ting; Su, Bo; Jiang, Haiyang

2016-01-01

Members of the chalcone synthase (CHS) family participate in the synthesis of a series of secondary metabolites in plants, fungi and bacteria. The metabolites play important roles in protecting land plants against various environmental stresses during the evolutionary process. Our research was conducted on comprehensive investigation of CHS genes in maize (Zea mays L.), including their phylogenetic relationships, gene structures, chromosomal locations and expression analysis. Fourteen CHS genes (ZmCHS01–14) were identified in the genome of maize, representing one of the largest numbers of CHS family members identified in one organism to date. The gene family was classified into four major classes (classes I–IV) based on their phylogenetic relationships. Most of them contained two exons and one intron. The 14 genes were unevenly located on six chromosomes. Two segmental duplication events were identified, which might contribute to the expansion of the maize CHS gene family to some extent. In addition, quantitative real-time PCR and microarray data analyses suggested that ZmCHS genes exhibited various expression patterns, indicating functional diversification of the ZmCHS genes. Our results will contribute to future studies of the complexity of the CHS gene family in maize and provide valuable information for the systematic analysis of the functions of the CHS gene family. PMID:26828478
A massive incorporation of microbial genes into the genome of Tetranychus urticae, a polyphagous arthropod herbivore.

PubMed

Wybouw, N; Van Leeuwen, T; Dermauw, W

2018-06-01

A number of horizontal gene transfers (HGTs) have been identified in the spider mite Tetranychus urticae, a chelicerate herbivore. However, the genome of this mite species has at present not been thoroughly mined for the presence of HGT genes. Here, we performed a systematic screen for HGT genes in the T. urticae genome using the h-index metric. Our results not only validated previously identified HGT genes but also uncovered 25 novel HGT genes. In addition to HGT genes with a predicted biochemical function in carbohydrate, lipid and folate metabolism, we also identified the horizontal transfer of a ketopantoate hydroxymethyltransferase and a pantoate β-alanine ligase gene. In plants and bacteria, both genes are essential for vitamin B5 biosynthesis and their presence in the mite genome strongly suggests that spider mites, similar to Bemisia tabaci and nematodes, can synthesize their own vitamin B5. We further show that HGT genes were physically embedded within the mite genome and were expressed in different life stages. By screening chelicerate genomes and transcriptomes, we were able to estimate the evolutionary histories of these HGTs during chelicerate evolution. Our study suggests that HGT has made a significant and underestimated impact on the metabolic repertoire of plant-feeding spider mites. © 2018 The Royal Entomological Society.
Genome-wide identification of 99 autophagy-related (Atg) genes in the monogonont rotifer Brachionus spp. and transcriptional modulation in response to cadmium.

PubMed

Kang, Hye-Min; Lee, Jin-Sol; Kim, Min-Sub; Lee, Young Hwan; Jung, Jee-Hyun; Hagiwara, Atsushi; Zhou, Bingsheng; Lee, Jae-Seong; Jeong, Chang-Bum

2018-05-30

Autophagy originated from the common ancestor of all life forms, and its function is highly conserved from yeast to humans. Autophagy plays a key role in various fundamental biological processes including defense, and has developed through serial interactions of multiple gene sets referred to as autophagy-related (Atg) genes. Despite their significance in metazoan life and evolution, few studies have been conducted to identify these genes in aquatic invertebrates. In this study, we identified whole Atg genes in four Brachionus rotifer spp., namely B. calyciflorus, B. koreanus, B. plicatilis, and B. rotundiformis, through searches of their entire genomes; and we annotated them according to the yeast nomenclature. Twenty-four genes orthologous to yeast genes were present in all of the Brachionus spp. while three additional gene duplicates were identified in the genome of B. koreanus, indicating that these genes had diversified during the speciation. Also, their transcriptional responses to cadmium exposure indicated regulation by cadmium-induced oxidative-stress-related signaling pathways. This study provides valuable information on 99 conserved Atg genes involved in autophagosome formation in Brachionus spp., with transcriptional modulation in response to cadmium, in the context of the role of autophagy in the damage response. Copyright © 2018 Elsevier B.V. All rights reserved.
Integrative Analysis of Response to Tamoxifen Treatment in ER-Positive Breast Cancer Using GWAS Information and Transcription Profiling.

PubMed

Hicks, Chindo; Kumar, Ranjit; Pannuti, Antonio; Miele, Lucio

2012-01-01

Variable response and resistance to tamoxifen treatment in breast cancer patients remains a major clinical problem. To determine whether genes and biological pathways containing SNPs associated with risk for breast cancer are dysregulated in response to tamoxifen treatment, we performed analysis combining information from 43 genome-wide association studies with gene expression data from 298 ER(+) breast cancer patients treated with tamoxifen and 125 ER(+) controls. We identified 95 genes which distinguished tamoxifen treated patients from controls. Additionally, we identified 54 genes which stratified tamoxifen treated patients into two distinct groups. We identified biological pathways containing SNPs associated with risk for breast cancer, which were dysregulated in response to tamoxifen treatment. Key pathways identified included the apoptosis, P53, NFkB, DNA repair and cell cycle pathways. Combining GWAS with transcription profiling provides a unified approach for associating GWAS findings with response to drug treatment and identification of potential drug targets.
Identification of olfactory receptor genes in the Japanese grenadier anchovy Coilia nasus.

PubMed

Zhu, Guoli; Wang, Liangjiang; Tang, Wenqiao; Wang, Xiaomei; Wang, Cong

2017-01-01

Olfaction is essential for fish to detect odorant elements in the environment and plays a critical role in navigating, locating food and detecting predators. Olfactory function is produced by the olfactory transduction pathway and is activated by olfactory receptors (ORs) through the binding of odorant elements. Recently, four types of olfactory receptors have been identified in vertebrate olfactory epithelium, including main odorant receptors (MORs), vomeronasal type receptors (VRs), trace-amine associated receptors (TAARs) and formyl peptide receptors (FPRs). It has been hypothesized that migratory fish, which have the ability to perform spawning migration, use olfactory cues to return to natal rivers. Therefore, obtaining OR genes from migratory fish will provide a resource for the study of molecular mechanisms that underlie fish spawning migration behaviors. Previous studies of OR genes have mainly focused on genomic data, however little information has been gained at the transcript level. In this study, we identified the OR genes of an economically important commercial fish Coilia nasus through searching for olfactory epithelium transcriptomes. A total of 142 candidate MOR, 52 V2R/OlfC, 32 TAAR and two FPR putative genes were identified. In addition, through genomic analysis we identified several MOR genes containing introns, which is unusual for vertebrate MOR genes. The transcriptome-scale mining strategy proved to be fruitful in identifying large sets of OR genes from species whose genome information is unavailable. Our findings lay the foundation for further research into the possible molecular mechanisms underlying the spawning migration behavior in C. nasus .
Confirming genes influencing risk to cleft lip with/without cleft palate in a case-parent trio study.

PubMed

Beaty, T H; Taub, M A; Scott, A F; Murray, J C; Marazita, M L; Schwender, H; Parker, M M; Hetmanski, J B; Balakrishnan, P; Mansilla, M A; Mangold, E; Ludwig, K U; Noethen, M M; Rubini, M; Elcioglu, N; Ruczinski, I

2013-07-01

A collection of 1,108 case-parent trios ascertained through an isolated, nonsyndromic cleft lip with or without cleft palate (CL/P) was used to replicate the findings from a genome-wide association study (GWAS) conducted by Beaty et al. (Nat Genet 42:525-529, 2010), where four different genes/regions were identified as influencing risk to CL/P. Tagging SNPs for 33 different genes were genotyped (1,269 SNPs). All four of the genes originally identified as showing genome-wide significance (IRF6, ABCA4 and MAF, plus the 8q24 region) were confirmed in this independent sample of trios (who were primarily of European and Southeast Asian ancestry). In addition, eight genes classified as 'second tier' hits in the original study (PAX7, THADA, COL8A1/FILIP1L, DCAF4L2, GADD45G, NTN1, RBFOX3 and FOXE1) showed evidence of linkage and association in this replication sample. Meta-analysis between the original GWAS trios and these replication trios showed PAX7, COL8A1/FILIP1L and NTN1 achieved genome-wide significance. Tests for gene-environment interaction between these 33 genes and maternal smoking found evidence for interaction with two additional genes: GRID2 and ELAVL2 among European mothers (who had a higher rate of smoking than Asian mothers). Formal tests for gene-gene interaction (epistasis) failed to show evidence of statistical interaction in any simple fashion. This study confirms that many different genes influence risk to CL/P.
Confirming genes influencing risk to cleft lip with/without cleft palate in a case-parent trio study

PubMed Central

Beaty, TH; Taub, MA; Scott, AF; Murray, JC; Marazita, ML; Schwender, H; Parker, MM; Hetmanski, JB; Balakrishnan, P; Mansilla, MA; Mangold, E; Ludwig, KU; Noethen, MM; Rubini, M; Elcioglu, N; Ruczinski, I

2013-01-01

A collection of 1,108 case-parent trios ascertained through an isolated, non-syndromic cleft lip with or without cleft palate (CL/P) was used to replicate the findings from a genome-wide association study (GWAS) conducted by Beaty et al. (2010) where four different genes/regions were identified as influencing risk to CL/P. Tagging SNPs for 33 different genes were genotyped (1,269 SNPs). All four of the genes originally identified as showing genome-wide significance (IRF6, ABCA4 and MAF, plus the 8q24 region) were confirmed in this independent sample of trios (who were primarily of European and Southeast Asian ancestry). In addition, eight genes classified as ‘second tier’ hits in the original study (PAX7, THADA, COL8A1/FILIP1L, DCAF4L2, GADD45G, NTN1, RBFOX3 and FOXE1) showed evidence of linkage and association in this replication sample. Meta-analysis between the original GWAS trios and these replication trios showed PAX7, COL8A1/FILIP1L and NTN1 achieved genome-wide significance. Tests for gene-environment interaction between these 33 genes and maternal smoking found evidence for interaction with two additional genes: GRID2 and ELAVL2 among European mothers (who had a higher rate of smoking than Asian mothers). Formal tests for gene-gene interaction (epistasis) failed to show evidence of statistical interaction in any simple fashion. This study confirms that many different genes influence risk to CL/P. PMID:23512105
Genetic assessment of additional endophenotypes from the Consortium on the Genetics of Schizophrenia Family Study.

PubMed

Greenwood, Tiffany A; Lazzeroni, Laura C; Calkins, Monica E; Freedman, Robert; Green, Michael F; Gur, Raquel E; Gur, Ruben C; Light, Gregory A; Nuechterlein, Keith H; Olincy, Ann; Radant, Allen D; Seidman, Larry J; Siever, Larry J; Silverman, Jeremy M; Stone, William S; Sugar, Catherine A; Swerdlow, Neal R; Tsuang, Debby W; Tsuang, Ming T; Turetsky, Bruce I; Braff, David L

2016-01-01

The Consortium on the Genetics of Schizophrenia Family Study (COGS-1) has previously reported our efforts to characterize the genetic architecture of 12 primary endophenotypes for schizophrenia. We now report the characterization of 13 additional measures derived from the same endophenotype test paradigms in the COGS-1 families. Nine of the measures were found to discriminate between schizophrenia patients and controls, were significantly heritable (31 to 62%), and were sufficiently independent of previously assessed endophenotypes, demonstrating utility as additional endophenotypes. Genotyping via a custom array of 1536 SNPs from 94 candidate genes identified associations for CTNNA2, ERBB4, GRID1, GRID2, GRIK3, GRIK4, GRIN2B, NOS1AP, NRG1, and RELN across multiple endophenotypes. An experiment-wide p value of 0.003 suggested that the associations across all SNPs and endophenotypes collectively exceeded chance. Linkage analyses performed using a genome-wide SNP array further identified significant or suggestive linkage for six of the candidate endophenotypes, with several genes of interest located beneath the linkage peaks (e.g., CSMD1, DISC1, DLGAP2, GRIK2, GRIN3A, and SLC6A3). While the partial convergence of the association and linkage likely reflects differences in density of gene coverage provided by the distinct genotyping platforms, it is also likely an indication of the differential contribution of rare and common variants for some genes and methodological differences in detection ability. Still, many of the genes implicated by COGS through endophenotypes have been identified by independent studies of common, rare, and de novo variation in schizophrenia, all converging on a functional genetic network related to glutamatergic neurotransmission that warrants further investigation. Copyright © 2015 Elsevier B.V. All rights reserved.
New lessons from an old gene: complex splicing and a novel cryptic exon in VHL gene cause erythrocytosis and VHL disease.

PubMed

Lenglet, Marion; Robriquet, Florence; Schwarz, Klaus; Camps, Carme; Couturier, Anne; Hoogewijs, David; Buffet, Alexandre; Knight, Samantha Jl; Gad, Sophie; Couvé, Sophie; Chesnel, Franck; Pacault, Mathilde; Lindenbaum, Pierre; Job, Sylvie; Dumont, Solenne; Besnard, Thomas; Cornec, Marine; Dreau, Helene; Pentony, Melissa; Kvikstad, Erika; Deveaux, Sophie; Burnichon, Nelly; Ferlicot, Sophie; Vilaine, Mathias; Mazzella, Jean-Michaël; Airaud, Fabrice; Garrec, Céline; Heidet, Laurence; Irtan, Sabine; Mantadakis, Elpis; Bouchireb, Karim; Debatin, Klaus-Michael; Redon, Richard; Bezieau, Stéphane; Bressac-de Paillerets, Brigitte; Teh, Bin Tean; Girodon, François; Randi, Maria-Luigia; Putti, Maria Caterina; Bours, Vincent; Van Wijk, Richard; Göthert, Joachim R; Kattamis, Antonis; Janin, Nicolas; Bento, Celeste; Taylor, Jenny C; Arlot-Bonnemains, Yannick; Richard, Stéphane; Gimenez-Roqueplo, Anne-Paule; Cario, Holger; Gardie, Betty

2018-06-11

Chuvash polycythemia is an autosomal recessive form of erythrocytosis associated with a homozygous p.Arg200Trp mutation in the von Hippel-Lindau (VHL) gene. Since this discovery, additional VHL mutations have been identified in patients with congenital erythrocytosis, in a homozygous or compound-heterozygous state. VHL is a major tumor suppressor gene, mutations in which were first described in patients presenting with von Hippel-Lindau disease, which is characterized by the development of highly vascularized tumors. Here, we identified a new VHL cryptic-exon (termed E1') deep in intron 1 that is naturally expressed in many tissues. More importantly, we identified mutations in E1' in seven families with erythrocytosis (one homozygous case and six compound-heterozygous cases with a mutation in E1' in addition to a mutation in VHL coding sequences) and in one large family with typical VHL disease but without any alteration in the other VHL exons. In this study we have shown that the mutations induced a dysregulation of the VHL splicing with excessive retention of E1' and are associated with a downregulation of VHL protein expression. In addition, we have demonstrated a pathogenic role for synonymous mutations in VHL-Exon 2 that alter splicing through E2-skipping in five families with erythrocytosis or VHL disease. In all the studied cases, the mutations differentially impact splicing, correlating with phenotype severity. This study demonstrates that cryptic-exon-retention or exon-skipping are new VHL alterations and reveals a novel complex splicing regulation of the VHL gene. These findings open new avenues for diagnosis and research into the VHL-related-hypoxia-signaling pathway. Copyright © 2018 American Society of Hematology.
Additive QTLs on three chromosomes control flowering time in woodland strawberry (Fragaria vesca L.)

PubMed Central

Samad, Samia; Kurokura, Takeshi; Koskela, Elli; Toivainen, Tuomas; Patel, Vipul; Mouhu, Katriina; Sargent, Daniel James; Hytönen, Timo

2017-01-01

Flowering time is an important trait that affects survival, reproduction and yield in both wild and cultivated plants. Therefore, many studies have focused on the identification of flowering time quantitative trait locus (QTLs) in different crops, and molecular control of this trait has been extensively investigated in model species. Here we report the mapping of QTLs for flowering time and vegetative traits in a large woodland strawberry mapping population that was phenotyped both under field conditions and in a greenhouse after flower induction in the field. The greenhouse experiment revealed additive QTLs in three linkage groups (LG), two on both LG4 and LG7, and one on LG6 that explain about half of the flowering time variance in the population. Three of the QTLs were newly identified in this study, and one co-localized with the previously characterized FvTFL1 gene. An additional strong QTL corresponding to previously mapped PFRU was detected in both field and greenhouse experiments indicating that gene(s) in this locus can control the timing of flowering in different environments in addition to the duration of flowering and axillary bud differentiation to runners and branch crowns. Several putative flowering time genes were identified in these QTL regions that await functional validation. Our results indicate that a few major QTLs may control flowering time and axillary bud differentiation in strawberries. We suggest that the identification of causal genes in the diploid strawberry may enable fine tuning of flowering time and vegetative growth in the closely related octoploid cultivated strawberry. PMID:28580150
The ergot alkaloid gene cluster in Claviceps purpurea: extension of the cluster sequence and intra species evolution.

PubMed

Haarmann, Thomas; Machado, Caroline; Lübbe, Yvonne; Correia, Telmo; Schardl, Christopher L; Panaccione, Daniel G; Tudzynski, Paul

2005-06-01

The genomic region of Claviceps purpurea strain P1 containing the ergot alkaloid gene cluster [Tudzynski, P., Hölter, K., Correia, T., Arntz, C., Grammel, N., Keller, U., 1999. Evidence for an ergot alkaloid gene cluster in Claviceps purpurea. Mol. Gen. Genet. 261, 133-141] was explored by chromosome walking, and additional genes probably involved in the ergot alkaloid biosynthesis have been identified. The putative cluster sequence (extending over 68.5kb) contains 4 different nonribosomal peptide synthetase (NRPS) genes and several putative oxidases. Northern analysis showed that most of the genes were co-regulated (repressed by high phosphate), and identified probable flanking genes by lack of co-regulation. Comparison of the cluster sequences of strain P1, an ergotamine producer, with that of strain ECC93, an ergocristine producer, showed high conservation of most of the cluster genes, but significant variation in the NRPS modules, strongly suggesting that evolution of these chemical races of C. purpurea is determined by evolution of NRPS module specificity.
Mining biological databases for candidate disease genes

NASA Astrophysics Data System (ADS)

Braun, Terry A.; Scheetz, Todd; Webster, Gregg L.; Casavant, Thomas L.

2001-07-01

The publicly-funded effort to sequence the complete nucleotide sequence of the human genome, the Human Genome Project (HGP), has currently produced more than 93% of the 3 billion nucleotides of the human genome into a preliminary `draft' format. In addition, several valuable sources of information have been developed as direct and indirect results of the HGP. These include the sequencing of model organisms (rat, mouse, fly, and others), gene discovery projects (ESTs and full-length), and new technologies such as expression analysis and resources (micro-arrays or gene chips). These resources are invaluable for the researchers identifying the functional genes of the genome that transcribe and translate into the transcriptome and proteome, both of which potentially contain orders of magnitude more complexity than the genome itself. Preliminary analyses of this data identified approximately 30,000 - 40,000 human `genes.' However, the bulk of the effort still remains -- to identify the functional and structural elements contained within the transcriptome and proteome, and to associate function in the transcriptome and proteome to genes. A fortuitous consequence of the HGP is the existence of hundreds of databases containing biological information that may contain relevant data pertaining to the identification of disease-causing genes. The task of mining these databases for information on candidate genes is a commercial application of enormous potential. We are developing a system to acquire and mine data from specific databases to aid our efforts to identify disease genes. A high speed cluster of Linux of workstations is used to analyze sequence and perform distributed sequence alignments as part of our data mining and processing. This system has been used to mine GeneMap99 sequences within specific genomic intervals to identify potential candidate disease genes associated with Bardet-Biedle Syndrome (BBS).
The genetics of anophthalmia and microphthalmia.

PubMed

Bardakjian, Tanya M; Schneider, Adele

2011-09-01

To summarize recent breakthroughs regarding the genes known to play a role in normal ocular development in humans and to elucidate the role mutations in these genes play in anophthalmia and microphthalmia. The main themes discussed within this article are the various documented genetic advances in identifying the various causes of anophthalmia and microphthalmia. In addition, the complex interplay of these genes during critical embryonic development will be addressed. The recent identification of many eye development genes has changed the ability to identify a cause of anophthalmia and microphthalmia in many individuals. Syndrome identification and the availability of genetic testing underscores the desirability of evaluation by a geneticist for all individuals with anophthalmia and microphthalmia in order to provide appropriate management, long-term guidance, and genetic counseling.
The ethylene response pathway in Arabidopsis

NASA Technical Reports Server (NTRS)

Kieber, J. J.; Evans, M. L. (Principal Investigator)

1997-01-01

The simple gas ethylene influences a diverse array of plant growth and developmental processes including germination, senescence, cell elongation, and fruit ripening. This review focuses on recent molecular genetic studies, principally in Arabidopsis, in which components of the ethylene response pathway have been identified. The isolation and characterization of two of these genes has revealed that ethylene sensing involves a protein kinase cascade. One of these genes encodes a protein with similarity to the ubiquitous Raf family of Ser/Thr protein kinases. A second gene shows similarity to the prokaryotic two-component histidine kinases and most likely encodes an ethylene receptor. Additional elements involved in ethylene signaling have only been identified genetically. The characterization of these genes and mutants will be discussed.
A Genome-Wide RNAi Screen for Modifiers of the Circadian Clock in Human Cells

PubMed Central

Zhang, Eric E.; Liu, Andrew C.; Hirota, Tsuyoshi; Miraglia, Loren J.; Welch, Genevieve; Pongsawakul, Pagkapol Y.; Liu, Xianzhong; Atwood, Ann; Huss, Jon W.; Janes, Jeff; Su, Andrew I.; Hogenesch, John B.; Kay, Steve A.

2009-01-01

Summary Two decades of research identified more than a dozen clock genes and defined a biochemical feedback mechanism of circadian oscillator function. To identify additional clock genes and modifiers, we conducted a genome-wide siRNA screen in a human cellular clock model. Knockdown of nearly a thousand genes reduced rhythm amplitude. Potent effects on period length or increased amplitude were less frequent; we found hundreds of these and confirmed them in secondary screens. Characterization of a subset of these genes demonstrated a dosage-dependent effect on oscillator function. Protein interaction network analysis showed that dozens of gene products directly or indirectly associate with known clock components. Pathway analysis revealed these genes are overrepresented for components of insulin and hedgehog signaling, the cell cycle, and the folate metabolism. Coupled with data showing many of these pathways are clock-regulated, we conclude the clock is interconnected with many aspects of cellular function. PMID:19765810
Sex Determination in Ceratopteris richardii Is Accompanied by Transcriptome Changes That Drive Epigenetic Reprogramming of the Young Gametophyte.

PubMed

Atallah, Nadia M; Vitek, Olga; Gaiti, Federico; Tanurdzic, Milos; Banks, Jo Ann

2018-05-02

The fern Ceratopteris richardii is an important model for studies of sex determination and gamete differentiation in homosporous plants. Here we use RNA-seq to de novo assemble a transcriptome and identify genes differentially expressed in young gametophytes as their sex is determined by the presence or absence of the male-inducing pheromone called antheridiogen. Of the 1,163 consensus differentially expressed genes identified, the vast majority (1,030) are up-regulated in gametophytes treated with antheridiogen. GO term enrichment analyses of these DEGs reveals that a large number of genes involved in epigenetic reprogramming of the gametophyte genome are up-regulated by the pheromone. Additional hormone response and development genes are also up-regulated by the pheromone. This C. richardii gametophyte transcriptome and gene expression dataset will prove useful for studies focusing on sex determination and differentiation in plants. Copyright © 2018, G3: Genes, Genomes, Genetics.
Fidelity and enhanced sensitivity of differential transcription profiles following linear amplification of nanogram amounts of endothelial mRNA

NASA Technical Reports Server (NTRS)

Polacek, Denise C.; Passerini, Anthony G.; Shi, Congzhu; Francesco, Nadeene M.; Manduchi, Elisabetta; Grant, Gregory R.; Powell, Steven; Bischof, Helen; Winkler, Hans; Stoeckert, Christian J Jr;

2003-01-01

Although mRNA amplification is necessary for microarray analyses from limited amounts of cells and tissues, the accuracy of transcription profiles following amplification has not been well characterized. We tested the fidelity of differential gene expression following linear amplification by T7-mediated transcription in a well-established in vitro model of cytokine [tumor necrosis factor alpha (TNFalpha)]-stimulated human endothelial cells using filter arrays of 13,824 human cDNAs. Transcriptional profiles generated from amplified antisense RNA (aRNA) (from 100 ng total RNA, approximately 1 ng mRNA) were compared with profiles generated from unamplified RNA originating from the same homogeneous pool. Amplification accurately identified TNFalpha-induced differential expression in 94% of the genes detected using unamplified samples. Furthermore, an additional 1,150 genes were identified as putatively differentially expressed using amplified RNA which remained undetected using unamplified RNA. Of genes sampled from this set, 67% were validated by quantitative real-time PCR as truly differentially expressed. Thus, in addition to demonstrating fidelity in gene expression relative to unamplified samples, linear amplification results in improved sensitivity of detection and enhances the discovery potential of high-throughput screening by microarrays.

The genetics of attention deficit/hyperactivity disorder in adults, a review

PubMed Central

Franke, B; Faraone, S V; Asherson, P; Buitelaar, J; Bau, C H D; Ramos-Quiroga, J A; Mick, E; Grevet, E H; Johansson, S; Haavik, J; Lesch, K-P; Cormand, B; Reif, A

2012-01-01

The adult form of attention deficit/hyperactivity disorder (aADHD) has a prevalence of up to 5% and is the most severe long-term outcome of this common neurodevelopmental disorder. Family studies in clinical samples suggest an increased familial liability for aADHD compared with childhood ADHD (cADHD), whereas twin studies based on self-rated symptoms in adult population samples show moderate heritability estimates of 30–40%. However, using multiple sources of information, the heritability of clinically diagnosed aADHD and cADHD is very similar. Results of candidate gene as well as genome-wide molecular genetic studies in aADHD samples implicate some of the same genes involved in ADHD in children, although in some cases different alleles and different genes may be responsible for adult versus childhood ADHD. Linkage studies have been successful in identifying loci for aADHD and led to the identification of LPHN3 and CDH13 as novel genes associated with ADHD across the lifespan. In addition, studies of rare genetic variants have identified probable causative mutations for aADHD. Use of endophenotypes based on neuropsychology and neuroimaging, as well as next-generation genome analysis and improved statistical and bioinformatic analysis methods hold the promise of identifying additional genetic variants involved in disease etiology. Large, international collaborations have paved the way for well-powered studies. Progress in identifying aADHD risk genes may provide us with tools for the prediction of disease progression in the clinic and better treatment, and ultimately may help to prevent persistence of ADHD into adulthood. PMID:22105624
BioGraph: unsupervised biomedical knowledge discovery via automated hypothesis generation

PubMed Central

2011-01-01

We present BioGraph, a data integration and data mining platform for the exploration and discovery of biomedical information. The platform offers prioritizations of putative disease genes, supported by functional hypotheses. We show that BioGraph can retrospectively confirm recently discovered disease genes and identify potential susceptibility genes, outperforming existing technologies, without requiring prior domain knowledge. Additionally, BioGraph allows for generic biomedical applications beyond gene discovery. BioGraph is accessible at http://www.biograph.be. PMID:21696594
Biogeochemical Cycling of Manganese at Hydrothermal Vents

DTIC Science & Technology

1990-01-01

from an anoxic basin) contain the gene for the large subunit of Ribulose- 1,5-bisphosphate Carboxylase Oxygenase ( RubisCO ) suggestive of autotrophy... RubisCO gene probing on the bacterial isolates obtained from the hydrothermal vent environments as part of an ongoing ONR contract. In addition, we have...to test the feasibility of using gene probes for Ribulose-l,5- bisphosphate Carboxylase Oxygenase ( RubisCO ) for identifying autotrophic Mn(II

Engineering Complex Microbial Phenotypes with Continuous Genetic Integration and Plasmid Based Multi-gene Library

DTIC Science & Technology

2013-10-09

have desirable traits. We aim to enlarge the E. coli genome using Lactobacillusplantarum genes to build cells tolerant to EtOH and BT. L. plantarum is...chemicals III. Approach Objective 1 & la: Integrated heterologous (L. plantarum ) DNA into the E. coli chromosome and selected for insertions that...developed in combination with genes identified from screening L. plantarum libraries. Additionally, we have screened heterologous libraries for
Identification of the crucial genes in the elimination and survival process of Salmonella enterica ser. Pullorum in the chicken spleen.

PubMed

Ma, T; Xu, L; Wang, H; Guo, X; Li, Z; Wan, F; Chen, J; Liu, L; Liu, X; Chang, G; Chen, G

2017-06-01

Salmonella enterica ser. Pullorum is one of the most easily re-infecting pathogens in poultry production because of its mechanism of escaping from immune elimination. We used the transcriptome method to investigate the variation in gene expression in chicken spleen resulting from the interaction between hosts and S. Pullorum in the survival process. The expression of various genes related to the maturation and activation of B cells was activated before S. Pullorum was eliminated, which might help S. Pullorum escape from the elimination process. The suppression of some genes involved in the fusion of autophagosomes and lysosomes, such as MYO6, was identified and may be regulated by the secretion systems of S. Pullorum. In addition, a large proportion of these differentially expressed genes could be localized in the identified quantitative trait loci regions associated with the antibody response to bacteria. Collectively, these identified genes provided an outline for further understanding the interaction between chicken immune cells and S. Pullorum in chicken spleen. © 2017 Stichting International Foundation for Animal Genetics.
RNAi screen in Drosophila larvae identifies histone deacetylase 3 as a positive regulator of the hsp70 heat shock gene expression during heat shock.

PubMed

Achary, Bhavana G; Campbell, Katie M; Co, Ivy S; Gilmour, David S

2014-05-01

The transcription regulation of the Drosophila hsp70 gene is a complex process that involves the regulation of multiple steps, including the establishment of paused Pol II and release of Pol II into elongation upon heat shock activation. While the major players involved in the regulation of gene expression have been studied in detail, additional factors involved in this process continue to be discovered. To identify factors involved in hsp70 expression, we developed a screen that capitalizes on a visual assessment of heat shock activation using a hsp70-beta galactosidase reporter and publicly available RNAi fly lines to deplete candidate proteins. We validated the screen by showing that the depletion of HSF, CycT, Cdk9, Nurf 301, or ELL prevented the full induction of hsp70 by heat shock. Our screen also identified the histone deacetylase HDAC3 and its associated protein SMRTER as positive regulators of hsp70 activation. Additionally, we show that HDAC3 and SMRTER contribute to hsp70 gene expression at a step subsequent to HSF-mediated activation and release of the paused Pol II that resides at the promoter prior to heat shock induction. Copyright © 2014 Elsevier B.V. All rights reserved.
Common variants at the CHEK2 gene locus and risk of epithelial ovarian cancer.

PubMed

Lawrenson, Kate; Iversen, Edwin S; Tyrer, Jonathan; Weber, Rachel Palmieri; Concannon, Patrick; Hazelett, Dennis J; Li, Qiyuan; Marks, Jeffrey R; Berchuck, Andrew; Lee, Janet M; Aben, Katja K H; Anton-Culver, Hoda; Antonenkova, Natalia; Bandera, Elisa V; Bean, Yukie; Beckmann, Matthias W; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A; Brooks-Wilson, Angela; Bruinsma, Fiona; Butzow, Ralf; Campbell, Ian G; Carty, Karen; Chang-Claude, Jenny; Chenevix-Trench, Georgia; Chen, Ann; Chen, Zhihua; Cook, Linda S; Cramer, Daniel W; Cunningham, Julie M; Cybulski, Cezary; Plisiecka-Halasa, Joanna; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A; Dörk, Thilo; du Bois, Andreas; Eccles, Diana; Easton, Douglas T; Edwards, Robert P; Eilber, Ursula; Ekici, Arif B; Fasching, Peter A; Fridley, Brooke L; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G; Glasspool, Rosalind; Goode, Ellen L; Goodman, Marc T; Gronwald, Jacek; Harter, Philipp; Hasmad, Hanis Nazihah; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A T; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus; Hosono, Satoyo; Jakubowska, Anna; Paul, James; Jensen, Allan; Karlan, Beth Y; Kjaer, Susanne Kruger; Kelemen, Linda E; Kellar, Melissa; Kelley, Joseph L; Kiemeney, Lambertus A; Krakstad, Camilla; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D; Lee, Alice W; Cannioto, Rikki; Leminen, Arto; Lester, Jenny; Levine, Douglas A; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F A G; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R; Nevanlinna, Heli; McNeish, Iain; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B; Narod, Steven A; Nedergaard, Lotte; Ness, Roberta B; Noor Azmi, Mat Adenan; Odunsi, Kunle; Olson, Sara H; Orlow, Irene; Orsulic, Sandra; Pearce, Celeste L; Pejovic, Tanja; Pelttari, Liisa M; Permuth-Wey, Jennifer; Phelan, Catherine M; Pike, Malcolm C; Poole, Elizabeth M; Ramus, Susan J; Risch, Harvey A; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H; Rudolph, Anja; Runnebaum, Ingo B; Rzepecka, Iwona K; Salvesen, Helga B; Budzilowska, Agnieszka; Sellers, Thomas A; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C; Sucheston, Lara; Tangen, Ingvild L; Teo, Soo-Hwang; Terry, Kathryn L; Thompson, Pamela J; Timorek, Agnieszka; Tworoger, Shelley S; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S; Wicklund, Kristine G; Wilkens, Lynne R; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna H; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Coetzee, Gerhard A; Freedman, Matthew L; Monteiro, Alvaro N A; Moes-Sosnowska, Joanna; Kupryjanczyk, Jolanta; Pharoah, Paul D; Gayther, Simon A; Schildkraut, Joellen M

2015-11-01

Genome-wide association studies have identified 20 genomic regions associated with risk of epithelial ovarian cancer (EOC), but many additional risk variants may exist. Here, we evaluated associations between common genetic variants [single nucleotide polymorphisms (SNPs) and indels] in DNA repair genes and EOC risk. We genotyped 2896 common variants at 143 gene loci in DNA samples from 15 397 patients with invasive EOC and controls. We found evidence of associations with EOC risk for variants at FANCA, EXO1, E2F4, E2F2, CREB5 and CHEK2 genes (P ≤ 0.001). The strongest risk association was for CHEK2 SNP rs17507066 with serous EOC (P = 4.74 x 10(-7)). Additional genotyping and imputation of genotypes from the 1000 genomes project identified a slightly more significant association for CHEK2 SNP rs6005807 (r (2) with rs17507066 = 0.84, odds ratio (OR) 1.17, 95% CI 1.11-1.24, P = 1.1×10(-7)). We identified 293 variants in the region with likelihood ratios of less than 1:100 for representing the causal variant. Functional annotation identified 25 candidate SNPs that alter transcription factor binding sites within regulatory elements active in EOC precursor tissues. In The Cancer Genome Atlas dataset, CHEK2 gene expression was significantly higher in primary EOCs compared to normal fallopian tube tissues (P = 3.72×10(-8)). We also identified an association between genotypes of the candidate causal SNP rs12166475 (r (2) = 0.99 with rs6005807) and CHEK2 expression (P = 2.70×10(-8)). These data suggest that common variants at 22q12.1 are associated with risk of serous EOC and CHEK2 as a plausible target susceptibility gene. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Meta-analysis identifies gene-by-environment interactions as demonstrated in a study of 4,965 mice.

PubMed

Kang, Eun Yong; Han, Buhm; Furlotte, Nicholas; Joo, Jong Wha J; Shih, Diana; Davis, Richard C; Lusis, Aldons J; Eskin, Eleazar

2014-01-01

Identifying environmentally-specific genetic effects is a key challenge in understanding the structure of complex traits. Model organisms play a crucial role in the identification of such gene-by-environment interactions, as a result of the unique ability to observe genetically similar individuals across multiple distinct environments. Many model organism studies examine the same traits but under varying environmental conditions. For example, knock-out or diet-controlled studies are often used to examine cholesterol in mice. These studies, when examined in aggregate, provide an opportunity to identify genomic loci exhibiting environmentally-dependent effects. However, the straightforward application of traditional methodologies to aggregate separate studies suffers from several problems. First, environmental conditions are often variable and do not fit the standard univariate model for interactions. Additionally, applying a multivariate model results in increased degrees of freedom and low statistical power. In this paper, we jointly analyze multiple studies with varying environmental conditions using a meta-analytic approach based on a random effects model to identify loci involved in gene-by-environment interactions. Our approach is motivated by the observation that methods for discovering gene-by-environment interactions are closely related to random effects models for meta-analysis. We show that interactions can be interpreted as heterogeneity and can be detected without utilizing the traditional uni- or multi-variate approaches for discovery of gene-by-environment interactions. We apply our new method to combine 17 mouse studies containing in aggregate 4,965 distinct animals. We identify 26 significant loci involved in High-density lipoprotein (HDL) cholesterol, many of which are consistent with previous findings. Several of these loci show significant evidence of involvement in gene-by-environment interactions. An additional advantage of our meta-analysis approach is that our combined study has significantly higher power and improved resolution compared to any single study thus explaining the large number of loci discovered in the combined study.
Meta-Analysis Identifies Gene-by-Environment Interactions as Demonstrated in a Study of 4,965 Mice

PubMed Central

Joo, Jong Wha J.; Shih, Diana; Davis, Richard C.; Lusis, Aldons J.; Eskin, Eleazar

2014-01-01

Identifying environmentally-specific genetic effects is a key challenge in understanding the structure of complex traits. Model organisms play a crucial role in the identification of such gene-by-environment interactions, as a result of the unique ability to observe genetically similar individuals across multiple distinct environments. Many model organism studies examine the same traits but under varying environmental conditions. For example, knock-out or diet-controlled studies are often used to examine cholesterol in mice. These studies, when examined in aggregate, provide an opportunity to identify genomic loci exhibiting environmentally-dependent effects. However, the straightforward application of traditional methodologies to aggregate separate studies suffers from several problems. First, environmental conditions are often variable and do not fit the standard univariate model for interactions. Additionally, applying a multivariate model results in increased degrees of freedom and low statistical power. In this paper, we jointly analyze multiple studies with varying environmental conditions using a meta-analytic approach based on a random effects model to identify loci involved in gene-by-environment interactions. Our approach is motivated by the observation that methods for discovering gene-by-environment interactions are closely related to random effects models for meta-analysis. We show that interactions can be interpreted as heterogeneity and can be detected without utilizing the traditional uni- or multi-variate approaches for discovery of gene-by-environment interactions. We apply our new method to combine 17 mouse studies containing in aggregate 4,965 distinct animals. We identify 26 significant loci involved in High-density lipoprotein (HDL) cholesterol, many of which are consistent with previous findings. Several of these loci show significant evidence of involvement in gene-by-environment interactions. An additional advantage of our meta-analysis approach is that our combined study has significantly higher power and improved resolution compared to any single study thus explaining the large number of loci discovered in the combined study. PMID:24415945
Genomic characterization of an extensively-drug resistance Salmonella enterica serotype Indiana strain harboring blaNDM-1 gene isolated from a chicken carcass in China.

PubMed

Wang, Wei; Peng, Zixin; Baloch, Zulqarnain; Hu, Yujie; Xu, Jin; Zhang, Wenhui; Fanning, Séamus; Li, Fengqin

2017-11-01

The objective of this study was to genetically characterize the antimicrobial resistance mechanisms of Salmonella enterica serotype Indiana C629 isolated from a chicken carcass in China in 2014. Antimicrobial susceptibility against a panel of 23 antimicrobial agents was carried out on Salmonella enterica serotype Indiana C629 and assessed according to CLSI standards. Whole-genome sequencing of this isolate was conducted to obtain the complete genome of S. Indiana. Salmonella Indiana C629 expressed an XDR phenotype being resistant to more than 20 antimicrobial agents, including imipenem and meropenem. From the analysis of the resistance mechanisms, two mutations were identified in subunit A of DNA gyrase within the quinolone resistance determining region, in addition to the acquisition of mobile efflux pumps encoding oqxA/B/R. Additionally, four beta-lactamases resistance genes (bla CTX-M-65 , bla TEM-1 , bla OXA-1 , and bla NDM-1 ), five aminoglycosides resistance genes (aac(3)-IV, aac(6')-Ib-cr, aadA2, aadA5, and aph(4)-Ia), two phenicol resistance genes (catB3 and floR), and five trimethoprim/sulfamethoxazole resistance genes (sul1/2/3 and dfrA12/17) were also identified. A total of 191 virulence genes were identified. Among them, 57 belonged to type-three secretion system (T3SS) encoding genes, 55 belonged to fimbrial adherence encoding genes, and 39 belonged to flagella-encoding genes CONCLUSIONS: This study demonstrated that multi-resistance mechanisms consistent with an XDR-phenotype, along with various virulence encoding genes of a S. Indiana strain in China These findings highlight the importance of cooperation among different sectors in order to monitor the spread of resistant pathogens among food animal, foods of animal origin and human beings that might further take measures to protect consumers' health. Copyright © 2017 Elsevier GmbH. All rights reserved.
Network-Based Integration of GWAS and Gene Expression Identifies a HOX-Centric Network Associated with Serous Ovarian Cancer Risk.

PubMed

Kar, Siddhartha P; Tyrer, Jonathan P; Li, Qiyuan; Lawrenson, Kate; Aben, Katja K H; Anton-Culver, Hoda; Antonenkova, Natalia; Chenevix-Trench, Georgia; Baker, Helen; Bandera, Elisa V; Bean, Yukie T; Beckmann, Matthias W; Berchuck, Andrew; Bisogna, Maria; Bjørge, Line; Bogdanova, Natalia; Brinton, Louise; Brooks-Wilson, Angela; Butzow, Ralf; Campbell, Ian; Carty, Karen; Chang-Claude, Jenny; Chen, Yian Ann; Chen, Zhihua; Cook, Linda S; Cramer, Daniel; Cunningham, Julie M; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas F; Edwards, Robert P; Ekici, Arif B; Fasching, Peter A; Fridley, Brooke L; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G; Glasspool, Rosalind; Goode, Ellen L; Goodman, Marc T; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A T; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus K; Hosono, Satoyo; Iversen, Edwin S; Jakubowska, Anna; Paul, James; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kjaer, Susanne K; Kelemen, Linda E; Kellar, Melissa; Kelley, Joseph; Kiemeney, Lambertus A; Krakstad, Camilla; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D; Lee, Alice W; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R; McNeish, Iain A; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B; Narod, Steven A; Nedergaard, Lotte; Ness, Roberta B; Nevanlinna, Heli; Odunsi, Kunle; Olson, Sara H; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Pearce, Celeste Leigh; Pejovic, Tanja; Pelttari, Liisa M; Permuth-Wey, Jennifer; Phelan, Catherine M; Pike, Malcolm C; Poole, Elizabeth M; Ramus, Susan J; Risch, Harvey A; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H; Rudolph, Anja; Runnebaum, Ingo B; Rzepecka, Iwona K; Salvesen, Helga B; Schildkraut, Joellen M; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C; Sucheston-Campbell, Lara E; Tangen, Ingvild L; Teo, Soo-Hwang; Terry, Kathryn L; Thompson, Pamela J; Timorek, Agnieszka; Tsai, Ya-Yu; Tworoger, Shelley S; van Altena, Anne M; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S; Wicklund, Kristine G; Wilkens, Lynne R; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Sellers, Thomas A; Monteiro, Alvaro N A; Freedman, Matthew L; Gayther, Simon A; Pharoah, Paul D P

2015-10-01

Genome-wide association studies (GWAS) have so far reported 12 loci associated with serous epithelial ovarian cancer (EOC) risk. We hypothesized that some of these loci function through nearby transcription factor (TF) genes and that putative target genes of these TFs as identified by coexpression may also be enriched for additional EOC risk associations. We selected TF genes within 1 Mb of the top signal at the 12 genome-wide significant risk loci. Mutual information, a form of correlation, was used to build networks of genes strongly coexpressed with each selected TF gene in the unified microarray dataset of 489 serous EOC tumors from The Cancer Genome Atlas. Genes represented in this dataset were subsequently ranked using a gene-level test based on results for germline SNPs from a serous EOC GWAS meta-analysis (2,196 cases/4,396 controls). Gene set enrichment analysis identified six networks centered on TF genes (HOXB2, HOXB5, HOXB6, HOXB7 at 17q21.32 and HOXD1, HOXD3 at 2q31) that were significantly enriched for genes from the risk-associated end of the ranked list (P < 0.05 and FDR < 0.05). These results were replicated (P < 0.05) using an independent association study (7,035 cases/21,693 controls). Genes underlying enrichment in the six networks were pooled into a combined network. We identified a HOX-centric network associated with serous EOC risk containing several genes with known or emerging roles in serous EOC development. Network analysis integrating large, context-specific datasets has the potential to offer mechanistic insights into cancer susceptibility and prioritize genes for experimental characterization. ©2015 American Association for Cancer Research.
Network-based integration of GWAS and gene expression identifies a HOX-centric network associated with serous ovarian cancer risk

PubMed Central

Kar, Siddhartha P.; Tyrer, Jonathan P.; Li, Qiyuan; Lawrenson, Kate; Aben, Katja K.H.; Anton-Culver, Hoda; Antonenkova, Natalia; Chenevix-Trench, Georgia; Baker, Helen; Bandera, Elisa V.; Bean, Yukie T.; Beckmann, Matthias W.; Berchuck, Andrew; Bisogna, Maria; Bjørge, Line; Bogdanova, Natalia; Brinton, Louise; Brooks-Wilson, Angela; Butzow, Ralf; Campbell, Ian; Carty, Karen; Chang-Claude, Jenny; Chen, Yian Ann; Chen, Zhihua; Cook, Linda S.; Cramer, Daniel; Cunningham, Julie M.; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A.; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas F.; Edwards, Robert P.; Ekici, Arif B.; Fasching, Peter A.; Fridley, Brooke L.; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind; Goode, Ellen L.; Goodman, Marc T.; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A.T.; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus K.; Hosono, Satoyo; Iversen, Edwin S.; Jakubowska, Anna; Paul, James; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kjaer, Susanne K.; Kelemen, Linda E.; Kellar, Melissa; Kelley, Joseph; Kiemeney, Lambertus A.; Krakstad, Camilla; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D.; Lee, Alice W.; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A.; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R.; McNeish, Iain A.; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B.; Narod, Steven A.; Nedergaard, Lotte; Ness, Roberta B.; Nevanlinna, Heli; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Pearce, Celeste Leigh; Pejovic, Tanja; Pelttari, Liisa M.; Permuth-Wey, Jennifer; Phelan, Catherine M.; Pike, Malcolm C.; Poole, Elizabeth M.; Ramus, Susan J.; Risch, Harvey A.; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H.; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Schildkraut, Joellen M.; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Sucheston-Campbell, Lara E.; Tangen, Ingvild L.; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J; Timorek, Agnieszka; Tsai, Ya-Yu; Tworoger, Shelley S.; van Altena, Anne M.; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A.; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S.; Wicklund, Kristine G.; Wilkens, Lynne R.; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Sellers, Thomas A.; Monteiro, Alvaro N. A.; Freedman, Matthew L.; Gayther, Simon A.; Pharoah, Paul D. P.

2015-01-01

Background Genome-wide association studies (GWAS) have so far reported 12 loci associated with serous epithelial ovarian cancer (EOC) risk. We hypothesized that some of these loci function through nearby transcription factor (TF) genes and that putative target genes of these TFs as identified by co-expression may also be enriched for additional EOC risk associations. Methods We selected TF genes within 1 Mb of the top signal at the 12 genome-wide significant risk loci. Mutual information, a form of correlation, was used to build networks of genes strongly co-expressed with each selected TF gene in the unified microarray data set of 489 serous EOC tumors from The Cancer Genome Atlas. Genes represented in this data set were subsequently ranked using a gene-level test based on results for germline SNPs from a serous EOC GWAS meta-analysis (2,196 cases/4,396 controls). Results Gene set enrichment analysis identified six networks centered on TF genes (HOXB2, HOXB5, HOXB6, HOXB7 at 17q21.32 and HOXD1, HOXD3 at 2q31) that were significantly enriched for genes from the risk-associated end of the ranked list (P<0.05 and FDR<0.05). These results were replicated (P<0.05) using an independent association study (7,035 cases/21,693 controls). Genes underlying enrichment in the six networks were pooled into a combined network. Conclusion We identified a HOX-centric network associated with serous EOC risk containing several genes with known or emerging roles in serous EOC development. Impact Network analysis integrating large, context-specific data sets has the potential to offer mechanistic insights into cancer susceptibility and prioritize genes for experimental characterization. PMID:26209509
Genetic and Biochemical Map for the Biosynthesis of Occidiofungin, an Antifungal Produced by Burkholderia contaminans Strain MS14 ▿†

PubMed Central

Gu, Ganyu; Smith, Leif; Liu, Aixin; Lu, Shi-En

2011-01-01

A striking feature of Burkholderia contaminans strain MS14 is the production of a glycolipopeptide named occidiofungin. Occidiofungin has a broad range of antifungal activities against plant and animal pathogens. In this study, a complete covalent structure characterization and identification of the whole genomic DNA region for the occidiofungin gene (ocf) cluster are described. Discovery of the presence of 2,4-diaminobutyric acid and 3-chloro-β-hydroxytyrosine and elucidation of the structure of a novel C18 fatty amino acid residue have been achieved. In addition, seven additional putative open reading frames (the genes from ocfI to ocfN [ocfI-N] and ORF16) were identified. Transcription of all the putative genes ocfI-N identified in the region except ORF16 was regulated by both ambR1 and ambR2. Elucidation of the structure and the ocf gene cluster provides insight into the biosynthesis of occidiofungin and promotes future aims at understanding the biosynthetic machinery. This work provides new avenues for optimizing the production and synthesis of structural analogs of occidiofungin. PMID:21742901
Genetic architecture for human aggression: A study of gene-phenotype relationship in OMIM.

PubMed

Zhang-James, Yanli; Faraone, Stephen V

2016-07-01

Genetic studies of human aggression have mainly focused on known candidate genes and pathways regulating serotonin and dopamine signaling and hormonal functions. These studies have taught us much about the genetics of human aggression, but no genetic locus has yet achieved genome-significance. We here present a review based on a paradoxical hypothesis that studies of rare, functional genetic variations can lead to a better understanding of the molecular mechanisms underlying complex multifactorial disorders such as aggression. We examined all aggression phenotypes catalogued in Online Mendelian Inheritance in Man (OMIM), an Online Catalog of Human Genes and Genetic Disorders. We identified 95 human disorders that have documented aggressive symptoms in at least one individual with a well-defined genetic variant. Altogether, we retrieved 86 causal genes. Although most of these genes had not been implicated in human aggression by previous studies, the most significantly enriched canonical pathways had been previously implicated in aggression (e.g., serotonin and dopamine signaling). Our findings provide strong evidence to support the causal role of these pathways in the pathogenesis of aggression. In addition, the novel genes and pathways we identified suggest additional mechanisms underlying the origins of human aggression. Genome-wide association studies with very large samples will be needed to determine if common variants in these genes are risk factors for aggression. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.
Identification and expression analysis of cold and freezing stress responsive genes of Brassica oleracea.

PubMed

Ahmed, Nasar Uddin; Jung, Hee-Jeong; Park, Jong-In; Cho, Yong-Gu; Hur, Yoonkang; Nou, Ill-Sup

2015-01-10

Cold and freezing stress is a major environmental constraint to the production of Brassica crops. Enhancement of tolerance by exploiting cold and freezing tolerance related genes offers the most efficient approach to address this problem. Cold-induced transcriptional profiling is a promising approach to the identification of potential genes related to cold and freezing stress tolerance. In this study, 99 highly expressed genes were identified from a whole genome microarray dataset of Brassica rapa. Blast search analysis of the Brassica oleracea database revealed the corresponding homologous genes. To validate their expression, pre-selected cold tolerant and susceptible cabbage lines were analyzed. Out of 99 BoCRGs, 43 were differentially expressed in response to varying degrees of cold and freezing stress in the contrasting cabbage lines. Among the differentially expressed genes, 18 were highly up-regulated in the tolerant lines, which is consistent with their microarray expression. Additionally, 12 BoCRGs were expressed differentially after cold stress treatment in two contrasting cabbage lines, and BoCRG54, 56, 59, 62, 70, 72 and 99 were predicted to be involved in cold regulatory pathways. Taken together, the cold-responsive genes identified in this study provide additional direction for elucidating the regulatory network of low temperature stress tolerance and developing cold and freezing stress resistant Brassica crops. Copyright © 2014 Elsevier B.V. All rights reserved.
Characterization of the OmyY1 region on the rainbow trout Y chromosome

USGS Publications Warehouse

Phillips, Ruth B.; DeKoning, Jenefer J.; Brunelli, Joseph P.; Faber-Hammond, Joshua J.; Hansen, John D.; Christensen, Kris A.; Renn, Suzy C.P.; Thorgaard, Gary H.

2013-01-01

We characterized the male-specific region on the Y chromosome of rainbow trout, which contains both sdY (the sex-determining gene) and the male-specific genetic marker, OmyY1. Several clones containing the OmyY1 marker were screened from a BAC library from a YY clonal line and found to be part of an 800 kb BAC contig. Using fluorescence in situ hybridization (FISH), these clones were localized to the end of the short arm of the Y chromosome in rainbow trout, with an additional signal on the end of the X chromosome in many cells. We sequenced a minimum tiling path of these clones using Illumina and 454 pyrosequencing. The region is rich in transposons and rDNA, but also appears to contain several single-copy protein-coding genes. Most of these genes are also found on the X chromosome; and in several cases sex-specific SNPs in these genes were identified between the male (YY) and female (XX) homozygous clonal lines. Additional genes were identified by hybridization of the BACs to the cGRASP salmonid 4x44K oligo microarray. By BLASTn evaluations using hypothetical transcripts of OmyY1-linked candidate genes as query against several EST databases, we conclude at least 12 of these candidate genes are likely functional, and expressed.
Identification of giant Mimivirus protein functions using RNA interference

PubMed Central

Sobhy, Haitham; Scola, Bernard La; Pagnier, Isabelle; Raoult, Didier; Colson, Philippe

2015-01-01

Genomic analysis of giant viruses, such as Mimivirus, has revealed that more than half of the putative genes have no known functions (ORFans). We knocked down Mimivirus genes using short interfering RNA as a proof of concept to determine the functions of giant virus ORFans. As fibers are easy to observe, we targeted a gene encoding a protein absent in a Mimivirus mutant devoid of fibers as well as three genes encoding products identified in a protein concentrate of fibers, including one ORFan and one gene of unknown function. We found that knocking down these four genes was associated with depletion or modification of the fibers. Our strategy of silencing ORFan genes in giant viruses opens a way to identify its complete gene repertoire and may clarify the role of these genes, differentiating between junk DNA and truly used genes. Using this strategy, we were able to annotate four proteins in Mimivirus and 30 homologous proteins in other giant viruses. In addition, we were able to annotate >500 proteins from cellular organisms and 100 from metagenomic databases. PMID:25972846
Genome-wide association for grain yield under rainfed conditions in historical wheat cultivars from Pakistan

PubMed Central

Ain, Qurat-ul; Rasheed, Awais; Anwar, Alia; Mahmood, Tariq; Imtiaz, Muhammad; Mahmood, Tariq; Xia, Xianchun; He, Zhonghu; Quraishi, Umar M.

2015-01-01

Genome-wide association studies (GWAS) were undertaken to identify SNP markers associated with yield and yield-related traits in 123 Pakistani historical wheat cultivars evaluated during 2011–2014 seasons under rainfed field conditions. The population was genotyped by using high-density Illumina iSelect 90K single nucleotide polymorphism (SNP) assay, and finally 14,960 high quality SNPs were used in GWAS. Population structure examined using 1000 unlinked markers identified seven subpopulations (K = 7) that were representative of different breeding programs in Pakistan, in addition to local landraces. Forty four stable marker-trait associations (MTAs) with -log p > 4 were identified for nine yield-related traits. Nine multi-trait MTAs were found on chromosomes 1AL, 1BS, 2AL, 2BS, 2BL, 4BL, 5BL, 6AL, and 6BL, and those on 5BL and 6AL were stable across two seasons. Gene annotation and syntey identified that 14 trait-associated SNPs were linked to genes having significant importance in plant development. Favorable alleles for days to heading (DH), plant height (PH), thousand grain weight (TGW), and grain yield (GY) showed minor additive effects and their frequencies were slightly higher in cultivars released after 2000. However, no selection pressure on any favorable allele was identified. These genomic regions identified have historically contributed to achieve yield gains from 2.63 million tons in 1947 to 25.7 million tons in 2015. Future breeding strategies can be devised to initiate marker assisted breeding to accumulate these favorable alleles of SNPs associated with yield-related traits to increase grain yield. Additionally, in silico identification of 454-contigs corresponding to MTAs will facilitate fine mapping and subsequent cloning of candidate genes and functional marker development. PMID:26442056
Identification of 64 Novel Genetic Loci Provides an Expanded View on the Genetic Architecture of Coronary Artery Disease.

PubMed

van der Harst, Pim; Verweij, Niek

2018-02-02

Coronary artery disease (CAD) is a complex phenotype driven by genetic and environmental factors. Ninety-seven genetic risk loci have been identified to date, but the identification of additional susceptibility loci might be important to enhance our understanding of the genetic architecture of CAD. To expand the number of genome-wide significant loci, catalog functional insights, and enhance our understanding of the genetic architecture of CAD. We performed a genome-wide association study in 34 541 CAD cases and 261 984 controls of UK Biobank resource followed by replication in 88 192 cases and 162 544 controls from CARDIoGRAMplusC4D. We identified 75 loci that replicated and were genome-wide significant ( P <5×10 -8 ) in meta-analysis, 13 of which had not been reported previously. Next, to further identify novel loci, we identified all promising ( P <0.0001) loci in the CARDIoGRAMplusC4D data and performed reciprocal replication and meta-analyses with UK Biobank. This led to the identification of 21 additional novel loci reaching genome-wide significance ( P <5×10 -8 ) in meta-analysis. Finally, we performed a genome-wide meta-analysis of all available data revealing 30 additional novel loci ( P <5×10 -8 ) without further replication. The increase in sample size by UK Biobank raised the number of reconstituted gene sets from 4.2% to 13.9% of all gene sets to be involved in CAD. For the 64 novel loci, 155 candidate causal genes were prioritized, many without an obvious connection to CAD. Fine mapping of the 161 CAD loci generated lists of credible sets of single causal variants and genes for functional follow-up. Genetic risk variants of CAD were linked to development of atrial fibrillation, heart failure, and death. We identified 64 novel genetic risk loci for CAD and performed fine mapping of all 161 risk loci to obtain a credible set of causal variants. The large expansion of reconstituted gene sets argues in favor of an expanded omnigenic model view on the genetic architecture of CAD. © 2017 The Authors.
The Rice B-Box Zinc Finger Gene Family: Genomic Identification, Characterization, Expression Profiling and Diurnal Analysis

PubMed Central

Huang, Jianyan; Zhao, Xiaobo; Weng, Xiaoyu; Wang, Lei; Xie, Weibo

2012-01-01

Background The B-box (BBX) -containing proteins are a class of zinc finger proteins that contain one or two B-box domains and play important roles in plant growth and development. The Arabidopsis BBX gene family has recently been re-identified and renamed. However, there has not been a genome-wide survey of the rice BBX (OsBBX) gene family until now. Methodology/Principal Findings In this study, we identified 30 rice BBX genes through a comprehensive bioinformatics analysis. Each gene was assigned a uniform nomenclature. We described the chromosome localizations, gene structures, protein domains, phylogenetic relationship, whole life-cycle expression profile and diurnal expression patterns of the OsBBX family members. Based on the phylogeny and domain constitution, the OsBBX gene family was classified into five subfamilies. The gene duplication analysis revealed that only chromosomal segmental duplication contributed to the expansion of the OsBBX gene family. The expression profile of the OsBBX genes was analyzed by Affymetrix GeneChip microarrays throughout the entire life-cycle of rice cultivar Zhenshan 97 (ZS97). In addition, microarray analysis was performed to obtain the expression patterns of these genes under light/dark conditions and after three phytohormone treatments. This analysis revealed that the expression patterns of the OsBBX genes could be classified into eight groups. Eight genes were regulated under the light/dark treatments, and eleven genes showed differential expression under at least one phytohormone treatment. Moreover, we verified the diurnal expression of the OsBBX genes using the data obtained from the Diurnal Project and qPCR analysis, and the results indicated that many of these genes had a diurnal expression pattern. Conclusions/Significance The combination of the genome-wide identification and the expression and diurnal analysis of the OsBBX gene family should facilitate additional functional studies of the OsBBX genes. PMID:23118960
Integrative Approach to Pain Genetics Identifies Pain Sensitivity Loci across Diseases

PubMed Central

Ruau, David; Dudley, Joel T.; Chen, Rong; Phillips, Nicholas G.; Swan, Gary E.; Lazzeroni, Laura C.; Clark, J. David

2012-01-01

Identifying human genes relevant for the processing of pain requires difficult-to-conduct and expensive large-scale clinical trials. Here, we examine a novel integrative paradigm for data-driven discovery of pain gene candidates, taking advantage of the vast amount of existing disease-related clinical literature and gene expression microarray data stored in large international repositories. First, thousands of diseases were ranked according to a disease-specific pain index (DSPI), derived from Medical Subject Heading (MESH) annotations in MEDLINE. Second, gene expression profiles of 121 of these human diseases were obtained from public sources. Third, genes with expression variation significantly correlated with DSPI across diseases were selected as candidate pain genes. Finally, selected candidate pain genes were genotyped in an independent human cohort and prospectively evaluated for significant association between variants and measures of pain sensitivity. The strongest signal was with rs4512126 (5q32, ABLIM3, P = 1.3×10−10) for the sensitivity to cold pressor pain in males, but not in females. Significant associations were also observed with rs12548828, rs7826700 and rs1075791 on 8q22.2 within NCALD (P = 1.7×10−4, 1.8×10−4, and 2.2×10−4 respectively). Our results demonstrate the utility of a novel paradigm that integrates publicly available disease-specific gene expression data with clinical data curated from MEDLINE to facilitate the discovery of pain-relevant genes. This data-derived list of pain gene candidates enables additional focused and efficient biological studies validating additional candidates. PMID:22685391
Copper homeostasis gene discovery in Drosophila melanogaster.

PubMed

Norgate, Melanie; Southon, Adam; Zou, Sige; Zhan, Ming; Sun, Yu; Batterham, Phil; Camakaris, James

2007-06-01

Recent studies have shown a high level of conservation between Drosophila melanogaster and mammalian copper homeostasis mechanisms. These studies have also demonstrated the efficiency with which this species can be used to characterize novel genes, at both the cellular and whole organism level. As a versatile and inexpensive model organism, Drosophila is also particularly useful for gene discovery applications and thus has the potential to be extremely useful in identifying novel copper homeostasis genes and putative disease genes. In order to assess the suitability of Drosophila for this purpose, three screening approaches have been investigated. These include an analysis of the global transcriptional response to copper in both adult flies and an embryonic cell line using DNA microarray analysis. Two mutagenesis-based screens were also utilized. Several candidate copper homeostasis genes have been identified through this work. In addition, the results of each screen were carefully analyzed to identify any factors influencing efficiency and sensitivity. These are discussed here with the aim of maximizing the efficiency of future screens and the most suitable approaches are outlined. Building on this information, there is great potential for the further use of Drosophila for copper homeostasis gene discovery.
DNA methylation biomarkers for head and neck squamous cell carcinoma.

PubMed

Zhou, Chongchang; Ye, Meng; Ni, Shumin; Li, Qun; Ye, Dong; Li, Jinyun; Shen, Zhishen; Deng, Hongxia

2018-06-21

DNA methylation plays an important role in the etiology and pathogenesis of head and neck squamous cell carcinoma (HNSCC). The current study aimed to identify aberrantly methylated-differentially expressed genes (DEGs) by a comprehensive bioinformatics analysis. In addition, we screened for DEGs affected by DNA methylation modification and further investigated their prognostic values for HNSCC. We included microarray data of DNA methylation (GSE25093 and GSE33202) and gene expression (GSE23036 and GSE58911) from Gene Expression Omnibus. Aberrantly methylated-DEGs were analyzed with R software. The Cancer Genome Atlas (TCGA) RNA sequencing and DNA methylation (Illumina HumanMethylation450) databases were utilized for validation. In total, 27 aberrantly methylated genes accompanied by altered expression were identified. After confirmation by The Cancer Genome Atlas (TCGA) database, 2 hypermethylated-low-expression genes (FAM135B and ZNF610) and 2 hypomethylated-high-expression genes (HOXA9 and DCC) were identified. A receiver operating characteristic (ROC) curve confirmed the diagnostic value of these four methylated genes for HNSCC. Multivariate Cox proportional hazards analysis showed that FAM135B methylation was a favorable independent prognostic biomarker for overall survival of HNSCC patients.

A Stratified Transcriptomics Analysis of Polygenic Fat and Lean Mouse Adipose Tissues Identifies Novel Candidate Obesity Genes

PubMed Central

Morton, Nicholas M.; Nelson, Yvonne B.; Michailidou, Zoi; Di Rollo, Emma M.; Ramage, Lynne; Hadoke, Patrick W. F.; Seckl, Jonathan R.; Bunger, Lutz; Horvat, Simon; Kenyon, Christopher J.; Dunbar, Donald R.

2011-01-01

Background Obesity and metabolic syndrome results from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F) mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L) strain. Results To enrich for adipose tissue obesity genes a ‘snap-shot’ pooled-sample transcriptome comparison of key fat depots and non adipose tissues (muscle, liver, kidney) was performed. Known obesity quantitative trait loci (QTL) information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1, and Np3r) as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr) that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria were characterised, revealing novel functional effects consistent with a contribution to obesity. Conclusions A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes contributing to obesity. PMID:21915269
A stratified transcriptomics analysis of polygenic fat and lean mouse adipose tissues identifies novel candidate obesity genes.

PubMed

Morton, Nicholas M; Nelson, Yvonne B; Michailidou, Zoi; Di Rollo, Emma M; Ramage, Lynne; Hadoke, Patrick W F; Seckl, Jonathan R; Bunger, Lutz; Horvat, Simon; Kenyon, Christopher J; Dunbar, Donald R

2011-01-01

Obesity and metabolic syndrome results from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F) mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L) strain. To enrich for adipose tissue obesity genes a 'snap-shot' pooled-sample transcriptome comparison of key fat depots and non adipose tissues (muscle, liver, kidney) was performed. Known obesity quantitative trait loci (QTL) information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1, and Np3r) as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr) that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria were characterised, revealing novel functional effects consistent with a contribution to obesity. A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes contributing to obesity.
Expression quantitative trait loci (eQTL) mapping in Puerto Rican children.

PubMed

Chen, Wei; Brehm, John M; Lin, Jerome; Wang, Ting; Forno, Erick; Acosta-Pérez, Edna; Boutaoui, Nadia; Canino, Glorisa; Celedón, Juan C

2015-01-01

Expression quantitative trait loci (eQTL) have been identified using tissue or cell samples from diverse human populations, thus enhancing our understanding of regulation of gene expression. However, few studies have attempted to identify eQTL in racially admixed populations such as Hispanics. We performed a systematic eQTL study to identify regulatory variants of gene expression in whole blood from 121 Puerto Rican children with (n = 63) and without (n = 58) asthma. Genome-wide genotyping was conducted using the Illumina Omni2.5M Bead Chip, and gene expression was assessed using the Illumina HT-12 microarray. After completing quality control, we performed a pair-wise genome analysis of ~15 K transcripts and ~1.3 M SNPs for both local and distal effects. This analysis was conducted under a regression framework adjusting for age, gender and principal components derived from both genotypic and mRNA data. We used a false discovery rate (FDR) approach to identify significant eQTL signals, which were next compared to top eQTL signals from existing eQTL databases. We then performed a pathway analysis for our top genes. We identified 36,720 local pairs in 3,391 unique genes and 1,851 distal pairs in 446 unique genes at FDR <0.05, corresponding to unadjusted P values lower than 1.5x10-4 and 4.5x10-9, respectively. A significant proportion of genes identified in our study overlapped with those identified in previous studies. We also found an enrichment of disease-related genes in our eQTL list. We present results from the first eQTL study in Puerto Rican children, who are members of a unique Hispanic cohort disproportionately affected with asthma, prematurity, obesity and other common diseases. Our study confirmed eQTL signals identified in other ethnic groups, while also detecting additional eQTLs unique to our study population. The identified eQTLs will help prioritize findings from future genome-wide association studies in Puerto Ricans.
Genes for hereditary sensory and autonomic neuropathies: a genotype–phenotype correlation

PubMed Central

Rotthier, Annelies; Baets, Jonathan; Vriendt, Els De; Jacobs, An; Auer-Grumbach, Michaela; Lévy, Nicolas; Bonello-Palot, Nathalie; Kilic, Sara Sebnem; Weis, Joachim; Nascimento, Andrés; Swinkels, Marielle; Kruyt, Moyo C.; Jordanova, Albena; De Jonghe, Peter

2009-01-01

Hereditary sensory and autonomic neuropathies (HSAN) are clinically and genetically heterogeneous disorders characterized by axonal atrophy and degeneration, exclusively or predominantly affecting the sensory and autonomic neurons. So far, disease-associated mutations have been identified in seven genes: two genes for autosomal dominant (SPTLC1 and RAB7) and five genes for autosomal recessive forms of HSAN (WNK1/HSN2, NTRK1, NGFB, CCT5 and IKBKAP). We performed a systematic mutation screening of the coding sequences of six of these genes on a cohort of 100 familial and isolated patients diagnosed with HSAN. In addition, we screened the functional candidate gene NGFR (p75/NTR) encoding the nerve growth factor receptor. We identified disease-causing mutations in SPTLC1, RAB7, WNK1/HSN2 and NTRK1 in 19 patients, of which three mutations have not previously been reported. The phenotypes associated with mutations in NTRK1 and WNK1/HSN2 typically consisted of congenital insensitivity to pain and anhidrosis, and early-onset ulcero-mutilating sensory neuropathy, respectively. RAB7 mutations were only found in patients with a Charcot-Marie-Tooth type 2B (CMT2B) phenotype, an axonal sensory-motor neuropathy with pronounced ulcero-mutilations. In SPTLC1, we detected a novel mutation (S331F) corresponding to a previously unknown severe and early-onset HSAN phenotype. No mutations were found in NGFB, CCT5 and NGFR. Overall disease-associated mutations were found in 19% of the studied patient group, suggesting that additional genes are associated with HSAN. Our genotype–phenotype correlation study broadens the spectrum of HSAN and provides additional insights for molecular and clinical diagnosis. PMID:19651702
Genes for hereditary sensory and autonomic neuropathies: a genotype-phenotype correlation.

PubMed

Rotthier, Annelies; Baets, Jonathan; De Vriendt, Els; Jacobs, An; Auer-Grumbach, Michaela; Lévy, Nicolas; Bonello-Palot, Nathalie; Kilic, Sara Sebnem; Weis, Joachim; Nascimento, Andrés; Swinkels, Marielle; Kruyt, Moyo C; Jordanova, Albena; De Jonghe, Peter; Timmerman, Vincent

2009-10-01

Hereditary sensory and autonomic neuropathies (HSAN) are clinically and genetically heterogeneous disorders characterized by axonal atrophy and degeneration, exclusively or predominantly affecting the sensory and autonomic neurons. So far, disease-associated mutations have been identified in seven genes: two genes for autosomal dominant (SPTLC1 and RAB7) and five genes for autosomal recessive forms of HSAN (WNK1/HSN2, NTRK1, NGFB, CCT5 and IKBKAP). We performed a systematic mutation screening of the coding sequences of six of these genes on a cohort of 100 familial and isolated patients diagnosed with HSAN. In addition, we screened the functional candidate gene NGFR (p75/NTR) encoding the nerve growth factor receptor. We identified disease-causing mutations in SPTLC1, RAB7, WNK1/HSN2 and NTRK1 in 19 patients, of which three mutations have not previously been reported. The phenotypes associated with mutations in NTRK1 and WNK1/HSN2 typically consisted of congenital insensitivity to pain and anhidrosis, and early-onset ulcero-mutilating sensory neuropathy, respectively. RAB7 mutations were only found in patients with a Charcot-Marie-Tooth type 2B (CMT2B) phenotype, an axonal sensory-motor neuropathy with pronounced ulcero-mutilations. In SPTLC1, we detected a novel mutation (S331F) corresponding to a previously unknown severe and early-onset HSAN phenotype. No mutations were found in NGFB, CCT5 and NGFR. Overall disease-associated mutations were found in 19% of the studied patient group, suggesting that additional genes are associated with HSAN. Our genotype-phenotype correlation study broadens the spectrum of HSAN and provides additional insights for molecular and clinical diagnosis.
Human Papillomavirus Genome Integration and Head and Neck Cancer.

PubMed

Pinatti, L M; Walline, H M; Carey, T E

2018-06-01

We conducted a critical review of human papillomavirus (HPV) integration into the host genome in oral/oropharyngeal cancer, reviewed the literature for HPV-induced cancers, and obtained current data for HPV-related oral and oropharyngeal cancers. In addition, we performed studies to identify HPV integration sites and the relationship of integration to viral-host fusion transcripts and whether integration is required for HPV-associated oncogenesis. Viral integration of HPV into the host genome is not required for the viral life cycle and might not be necessary for cellular transformation, yet HPV integration is frequently reported in cervical and head and neck cancer specimens. Studies of large numbers of early cervical lesions revealed frequent viral integration into gene-poor regions of the host genome with comparatively rare integration into cellular genes, suggesting that integration is a stochastic event and that site of integration may be largely a function of chance. However, more recent studies of head and neck squamous cell carcinomas (HNSCCs) suggest that integration may represent an additional oncogenic mechanism through direct effects on cancer-related gene expression and generation of hybrid viral-host fusion transcripts. In HNSCC cell lines as well as primary tumors, integration into cancer-related genes leading to gene disruption has been reported. The studies have shown that integration-induced altered gene expression may be associated with tumor recurrence. Evidence from several studies indicates that viral integration into genic regions is accompanied by local amplification, increased expression in some cases, interruption of gene expression, and likely additional oncogenic effects. Similarly, reported examples of viral integration near microRNAs suggest that altered expression of these regulatory molecules may also contribute to oncogenesis. Future work is indicated to identify the mechanisms of these events on cancer cell behavior.
Screening of Critical Genes and MicroRNAs in Blood Samples of Patients with Ruptured Intracranial Aneurysms by Bioinformatic Analysis of Gene Expression Data.

PubMed

Bo, Lijuan; Wei, Bo; Wang, Zhanfeng; Kong, Daliang; Gao, Zheng; Miao, Zhuang

2017-09-20

BACKGROUND This study aimed to identify more potential genes and miRNAs associated with the pathogenesis of intracranial aneurysms (IAs). MATERIAL AND METHODS The dataset of GSE36791 (accession number) was downloaded from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) were screened for in the blood samples from patients with ruptured IAs and controls, followed by functional and pathway enrichment analyses. In addition, gene co-expression network was constructed and significant modules were extracted from the network by WGCNA R package. Screening for miRNAs that could regulate DEGs in the modules was performed and an analysis of regulatory relationships was conducted. RESULTS A total of 304 DEGs (167 up-regulated and 137 down-regulated genes) were screened for in blood samples from patients with ruptured IAs compared with those from controls. Functional enrichment analysis showed that the up-regulated genes were mainly associated with immune response and the down-regulated DEGs were mainly concerned with the structure of ribosome and translation. Besides, six functional modules were significantly identified, including four modules enriched by up-regulated genes and two modules enriched by down-regulated genes. Thereinto, the blue, yellow, and turquoise modules of up-regulated genes were all linked with immune response. Additionally, 16 miRNAs were predicted to regulate DEGs in the three modules associated with immune response, such as hsa-miR-1304, hsa-miR-33b, hsa-miR-125b, and hsa-miR-125a-5p. CONCLUSIONS Several genes and miRNAs (such as miR-1304, miR-33b, IRS2 and KCNJ2) may take part in the pathogenesis of IAs.
Molecular and Genetic Characterization of the Drosophila Melanogaster 87e Actin Gene Region

PubMed Central

Manseau, L. J.; Ganetzky, B.; Craig, E. A.

1988-01-01

A combined molecular and genetic analysis of the 87E actin gene (Act87E) in Drosophila melanogaster was undertaken. A clone of Act87E was isolated and characterized. The Act87E transcription unit is 1.57 kb and includes a 556-base intervening sequence in the 5' leader of the gene. The protein-coding region is contiguous and encodes a protein that is >93% identical to the other Drosophila actins. By in situ hybridization with a series of deficiencies that break in 87E, Act87E was localized to a region encompassing one to three faint, polytene chromosome bands. The region between the deficiency endpoints that flank the actin gene was isolated and measures approximately 24-30 kb. The closest proximal deficiency endpoint lies 8-10 kb 5' to the actin gene; the closest distal deficiency endpoint lies 16-20 kb 3' to the actin gene. A single, recessive lethal complementation group lies between the deficiency endpoints that flank the actin gene. An EMS mutagenesis screen produced four additional members of this recessive lethal complementation group. Molecular analysis of the members of this complementation group indicated that two of the newly induced mutations have deletions of approximately 1 kb in a transcribed region 4-5 kb 3' (distal) to the actin gene. This result suggests that the recessive lethal complementation group represents a gene separate from and distal to the actin gene. The mutagenesis screen failed to identify additional recessive lethal complementation groups in the actin gene-containing region. The implications of the failure to identify recessive lethal mutations in the actin gene are discussed in reference to studies of other conserved multigene families and other muscle protein mutations. PMID:2840338
Genome-wide association study of Alzheimer's disease.

PubMed

Kamboh, M I; Demirci, F Y; Wang, X; Minster, R L; Carrasquillo, M M; Pankratz, V S; Younkin, S G; Saykin, A J; Jun, G; Baldwin, C; Logue, M W; Buros, J; Farrer, L; Pericak-Vance, M A; Haines, J L; Sweet, R A; Ganguli, M; Feingold, E; Dekosky, S T; Lopez, O L; Barmada, M M

2012-05-15

In addition to apolipoprotein E (APOE), recent large genome-wide association studies (GWASs) have identified nine other genes/loci (CR1, BIN1, CLU, PICALM, MS4A4/MS4A6E, CD2AP, CD33, EPHA1 and ABCA7) for late-onset Alzheimer's disease (LOAD). However, the genetic effect attributable to known loci is about 50%, indicating that additional risk genes for LOAD remain to be identified. In this study, we have used a new GWAS data set from the University of Pittsburgh (1291 cases and 938 controls) to examine in detail the recently implicated nine new regions with Alzheimer's disease (AD) risk, and also performed a meta-analysis utilizing the top 1% GWAS single-nucleotide polymorphisms (SNPs) with P<0.01 along with four independent data sets (2727 cases and 3336 controls) for these SNPs in an effort to identify new AD loci. The new GWAS data were generated on the Illumina Omni1-Quad chip and imputed at ~2.5 million markers. As expected, several markers in the APOE regions showed genome-wide significant associations in the Pittsburg sample. While we observed nominal significant associations (P<0.05) either within or adjacent to five genes (PICALM, BIN1, ABCA7, MS4A4/MS4A6E and EPHA1), significant signals were observed 69-180 kb outside of the remaining four genes (CD33, CLU, CD2AP and CR1). Meta-analysis on the top 1% SNPs revealed a suggestive novel association in the PPP1R3B gene (top SNP rs3848140 with P = 3.05E-07). The association of this SNP with AD risk was consistent in all five samples with a meta-analysis odds ratio of 2.43. This is a potential candidate gene for AD as this is expressed in the brain and is involved in lipid metabolism. These findings need to be confirmed in additional samples.
Genome-wide association study of Alzheimer's disease

PubMed Central

Kamboh, M I; Demirci, F Y; Wang, X; Minster, R L; Carrasquillo, M M; Pankratz, V S; Younkin, S G; Saykin, A J; Jun, G; Baldwin, C; Logue, M W; Buros, J; Farrer, L; Pericak-Vance, M A; Haines, J L; Sweet, R A; Ganguli, M; Feingold, E; DeKosky, S T; Lopez, O L; Barmada, M M

2012-01-01

In addition to apolipoprotein E (APOE), recent large genome-wide association studies (GWASs) have identified nine other genes/loci (CR1, BIN1, CLU, PICALM, MS4A4/MS4A6E, CD2AP, CD33, EPHA1 and ABCA7) for late-onset Alzheimer's disease (LOAD). However, the genetic effect attributable to known loci is about 50%, indicating that additional risk genes for LOAD remain to be identified. In this study, we have used a new GWAS data set from the University of Pittsburgh (1291 cases and 938 controls) to examine in detail the recently implicated nine new regions with Alzheimer's disease (AD) risk, and also performed a meta-analysis utilizing the top 1% GWAS single-nucleotide polymorphisms (SNPs) with P<0.01 along with four independent data sets (2727 cases and 3336 controls) for these SNPs in an effort to identify new AD loci. The new GWAS data were generated on the Illumina Omni1-Quad chip and imputed at ∼2.5 million markers. As expected, several markers in the APOE regions showed genome-wide significant associations in the Pittsburg sample. While we observed nominal significant associations (P<0.05) either within or adjacent to five genes (PICALM, BIN1, ABCA7, MS4A4/MS4A6E and EPHA1), significant signals were observed 69–180 kb outside of the remaining four genes (CD33, CLU, CD2AP and CR1). Meta-analysis on the top 1% SNPs revealed a suggestive novel association in the PPP1R3B gene (top SNP rs3848140 with P=3.05E–07). The association of this SNP with AD risk was consistent in all five samples with a meta-analysis odds ratio of 2.43. This is a potential candidate gene for AD as this is expressed in the brain and is involved in lipid metabolism. These findings need to be confirmed in additional samples. PMID:22832961
Markov Logic Networks in the Analysis of Genetic Data

PubMed Central

Sakhanenko, Nikita A.

2010-01-01

Abstract Complex, non-additive genetic interactions are common and can be critical in determining phenotypes. Genome-wide association studies (GWAS) and similar statistical studies of linkage data, however, assume additive models of gene interactions in looking for genotype-phenotype associations. These statistical methods view the compound effects of multiple genes on a phenotype as a sum of influences of each gene and often miss a substantial part of the heritable effect. Such methods do not use any biological knowledge about underlying mechanisms. Modeling approaches from the artificial intelligence (AI) field that incorporate deterministic knowledge into models to perform statistical analysis can be applied to include prior knowledge in genetic analysis. We chose to use the most general such approach, Markov Logic Networks (MLNs), for combining deterministic knowledge with statistical analysis. Using simple, logistic regression-type MLNs we can replicate the results of traditional statistical methods, but we also show that we are able to go beyond finding independent markers linked to a phenotype by using joint inference without an independence assumption. The method is applied to genetic data on yeast sporulation, a complex phenotype with gene interactions. In addition to detecting all of the previously identified loci associated with sporulation, our method identifies four loci with smaller effects. Since their effect on sporulation is small, these four loci were not detected with methods that do not account for dependence between markers due to gene interactions. We show how gene interactions can be detected using more complex models, which can be used as a general framework for incorporating systems biology with genetics. PMID:20958249
Analyzing the most frequent disease loci in targeted patient categories optimizes disease gene identification and test accuracy worldwide.

PubMed

Lebo, Roger V; Tonk, Vijay S

2015-01-21

Our genomewide studies support targeted testing the most frequent genetic diseases by patient category: (1) pregnant patients, (2) at-risk conceptuses, (3) affected children, and (4) abnormal adults. This approach not only identifies most reported disease causing sequences accurately, but also minimizes incorrectly identified additional disease causing loci. Diseases were grouped in descending order of occurrence from four data sets: (1) GeneTests 534 listed population prevalences, (2) 4129 high risk prenatal karyotypes, (3) 1265 affected patient microarrays, and (4) reanalysis of 25,452 asymptomatic patient results screened prenatally for 108 genetic diseases. These most frequent diseases are categorized by transmission: (A) autosomal recessive, (B) X-linked, (C) autosomal dominant, (D) microscopic chromosome rearrangements, (E) submicroscopic copy number changes, and (F) frequent ethnic diseases. Among affected and carrier patients worldwide, most reported mutant genes would be identified correctly according to one of four patient categories from at-risk couples with <64 tested genes to affected adults with 314 tested loci. Three clinically reported patient series confirmed this approach. First, only 54 targeted chromosomal sites would have detected all 938 microscopically visible unbalanced karyotypes among 4129 karyotyped POC, CVS, and amniocentesis samples. Second, 37 of 48 reported aneuploid regions were found among our 1265 clinical microarrays confirming the locations of 8 schizophrenia loci and 20 aneuploidies altering intellectual ability, while also identifying 9 of the most frequent deletion syndromes. Third, testing 15 frequent genes would have identified 124 couples with a 1 in 4 risk of a fetus with a recessive disease compared to the 127 couples identified by testing all 108 genes, while testing all mutations in 15 genes could have identified more couples. Testing the most frequent disease causing abnormalities in 1 of 8 reported disease loci [~1 of 84 total genes] will identify ~ 7 of 8 reported abnormal Caucasian newborn genotypes. This would eliminate ~8 to 10 of ~10 Caucasian newborn gene sequences selected as abnormal that are actually normal variants identified when testing all ~2500 diseases looking for the remaining 1 of 8 disease causing genes. This approach enables more accurate testing within available laboratory and reimbursement resources.
Genomewide association study for susceptibility genes contributing to familial Parkinson disease

PubMed Central

Pankratz, Nathan; Wilk, Jemma B.; Latourelle, Jeanne C.; DeStefano, Anita L.; Halter, Cheryl; Pugh, Elizabeth W.; Doheny, Kimberly F.; Gusella, James F.; Nichols, William C.

2009-01-01

Five genes have been identified that contribute to Mendelian forms of Parkinson disease (PD); however, mutations have been found in fewer than 5% of patients, suggesting that additional genes contribute to disease risk. Unlike previous studies that focused primarily on sporadic PD, we have performed the first genomewide association study (GWAS) in familial PD. Genotyping was performed with the Illumina HumanCNV370Duo array in 857 familial PD cases and 867 controls. A logistic model was employed to test for association under additive and recessive modes of inheritance after adjusting for gender and age. No result met genomewide significance based on a conservative Bonferroni correction. The strongest association result was with SNPs in the GAK/DGKQ region on chromosome 4 (additive model: p = 3.4 × 10−6; OR = 1.69). Consistent evidence of association was also observed to the chromosomal regions containing SNCA (additive model: p = 5.5 × 10−5; OR = 1.35) and MAPT (recessive model: p = 2.0 × 10−5; OR = 0.56). Both of these genes have been implicated previously in PD susceptibility; however, neither was identified in previous GWAS studies of PD. Meta-analysis was performed using data from a previous case–control GWAS, and yielded improved p values for several regions, including GAK/DGKQ (additive model: p = 2.5 × 10−7) and the MAPT region (recessive model: p = 9.8 × 10−6; additive model: p = 4.8 × 10−5). These data suggest the identification of new susceptibility alleles for PD in the GAK/DGKQ region, and also provide further support for the role of SNCA and MAPT in PD susceptibility. PMID:18985386
Microscopy and bioinformatic analyses of lipid metabolism implicate a sporophytic signaling network supporting pollen development in Arabidopsis.

PubMed

Wang, Yixing; Wu, Hong; Yang, Ming

2008-07-01

The Arabidopsis sporophytic tapetum undergoes a programmed degeneration process to secrete lipid and other materials to support pollen development. However, the molecular mechanism regulating the degeneration process is unknown. To gain insight into this molecular mechanism, we first determined that the most critical period for tapetal secretion to support pollen development is from the vacuolate microspore stage to the early binucleate pollen stage. We then analyzed the expression of enzymes responsible for lipid biosynthesis and degradation with available in-silico data. The genes for these enzymes that are expressed in the stamen but not in the concurrent uninucleate microspore and binucleate pollen are of particular interest, as they presumably hold the clues to unique molecular processes in the sporophytic tissues compared to the gametophytic tissue. No gene for lipid biosynthesis but a single gene encoding a patatin-like protein likely for lipid mobilization was identified based on the selection criterion. A search for genes co-expressed with this gene identified additional genes encoding typical signal transduction components such as a leucine-rich repeat receptor kinase, an extra-large G-protein, other protein kinases, and transcription factors. In addition, proteases, cell wall degradation enzymes, and other proteins were also identified. These proteins thus may be components of a signaling network leading to degradation of a broad range of cellular components. Since a broad range of degradation activities is expected to occur only in the tapetal degeneration process at this stage in the stamen, it is further hypothesized that the signaling network acts in the tapetal degeneration process.
Variants of the D{sub 5} dopamine receptor gene found in patients with schizophrenia: Identification of a nonsense mutation and multiple missense changes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sobell, J.L.; Lind, T.J.; Sommer, S.S.

To determine whether mutations in the D{sub 5} dopamine receptor (D{sub 5}DR) gene are associated with schizophrenia, the gene was examined in 78 unrelated schizophrenic individuals. After amplification by the polymerase chain reaction, products were examined by dideoxy fingerprinting (ddF), a highly sensitive screening method related to single strand conformational polymorphism analysis. All samples with unusual ddF patterns were sequenced to precisely identify the sequence change. In the 156 D{sub 5}DR alleles examined, nine sequence changes were identified. Four of the nine did not affect protein structure; of these, three were silent changes and one was a transition in themore » 3{prime} untranslated region. The remaining five sequence changes result in protein alterations: of these, one is a missense change in a non-conserved amino acid, 3 are missense changes in amino acids that are conserved in some dopamine D{sub 5} receptors and the last is a nonsense mutation. To investigate whether the nonsense mutation was associated with schizophrenia, 400 additional schizophrenic cases of western European descent and 1914 ethnically-similar controls were screened for the change. One additional schizophrenic carrier was identified and verified by direct genomic sequencing (allele frequency: .0013), but eight carriers also were found and confirmed among the non-schizophrenics (allele frequency: .0021)(p>.25). The gene was re-examined in all newly identified carriers of the nonsense mutation by direct sequencing and/or ddF in search of additional mutations. None were identified. Family studies also were conducted to investigate possible cosegregation of the mutation with other neuropsychiatric diseases, but this was not demonstrated. Thus, the mutation does not appear to be associated with an increased risk of schizophrenia nor does an initial analysis suggest cosegregation with other neuropsychiatric disorders or symptom complexes.« less
Meta-analysis identifies 29 additional ulcerative colitis risk loci, increasing the number of confirmed associations to 47.

PubMed

Anderson, Carl A; Boucher, Gabrielle; Lees, Charlie W; Franke, Andre; D'Amato, Mauro; Taylor, Kent D; Lee, James C; Goyette, Philippe; Imielinski, Marcin; Latiano, Anna; Lagacé, Caroline; Scott, Regan; Amininejad, Leila; Bumpstead, Suzannah; Baidoo, Leonard; Baldassano, Robert N; Barclay, Murray; Bayless, Theodore M; Brand, Stephan; Büning, Carsten; Colombel, Jean-Frédéric; Denson, Lee A; De Vos, Martine; Dubinsky, Marla; Edwards, Cathryn; Ellinghaus, David; Fehrmann, Rudolf S N; Floyd, James A B; Florin, Timothy; Franchimont, Denis; Franke, Lude; Georges, Michel; Glas, Jürgen; Glazer, Nicole L; Guthery, Stephen L; Haritunians, Talin; Hayward, Nicholas K; Hugot, Jean-Pierre; Jobin, Gilles; Laukens, Debby; Lawrance, Ian; Lémann, Marc; Levine, Arie; Libioulle, Cecile; Louis, Edouard; McGovern, Dermot P; Milla, Monica; Montgomery, Grant W; Morley, Katherine I; Mowat, Craig; Ng, Aylwin; Newman, William; Ophoff, Roel A; Papi, Laura; Palmieri, Orazio; Peyrin-Biroulet, Laurent; Panés, Julián; Phillips, Anne; Prescott, Natalie J; Proctor, Deborah D; Roberts, Rebecca; Russell, Richard; Rutgeerts, Paul; Sanderson, Jeremy; Sans, Miquel; Schumm, Philip; Seibold, Frank; Sharma, Yashoda; Simms, Lisa A; Seielstad, Mark; Steinhart, A Hillary; Targan, Stephan R; van den Berg, Leonard H; Vatn, Morten; Verspaget, Hein; Walters, Thomas; Wijmenga, Cisca; Wilson, David C; Westra, Harm-Jan; Xavier, Ramnik J; Zhao, Zhen Z; Ponsioen, Cyriel Y; Andersen, Vibeke; Torkvist, Leif; Gazouli, Maria; Anagnou, Nicholas P; Karlsen, Tom H; Kupcinskas, Limas; Sventoraityte, Jurgita; Mansfield, John C; Kugathasan, Subra; Silverberg, Mark S; Halfvarson, Jonas; Rotter, Jerome I; Mathew, Christopher G; Griffiths, Anne M; Gearry, Richard; Ahmad, Tariq; Brant, Steven R; Chamaillard, Mathias; Satsangi, Jack; Cho, Judy H; Schreiber, Stefan; Daly, Mark J; Barrett, Jeffrey C; Parkes, Miles; Annese, Vito; Hakonarson, Hakon; Radford-Smith, Graham; Duerr, Richard H; Vermeire, Séverine; Weersma, Rinse K; Rioux, John D

2011-03-01

Genome-wide association studies and candidate gene studies in ulcerative colitis have identified 18 susceptibility loci. We conducted a meta-analysis of six ulcerative colitis genome-wide association study datasets, comprising 6,687 cases and 19,718 controls, and followed up the top association signals in 9,628 cases and 12,917 controls. We identified 29 additional risk loci (P < 5 × 10(-8)), increasing the number of ulcerative colitis-associated loci to 47. After annotating associated regions using GRAIL, expression quantitative trait loci data and correlations with non-synonymous SNPs, we identified many candidate genes that provide potentially important insights into disease pathogenesis, including IL1R2, IL8RA-IL8RB, IL7R, IL12B, DAP, PRDM1, JAK2, IRF5, GNA12 and LSP1. The total number of confirmed inflammatory bowel disease risk loci is now 99, including a minimum of 28 shared association signals between Crohn's disease and ulcerative colitis.
Apolipoprotein gene involved in lipid metabolism

DOEpatents

Rubin, Edward [Berkeley, CA; Pennacchio, Len A [Sebastopol, CA

2007-07-03

Methods and materials for studying the effects of a newly identified human gene, APOAV, and the corresponding mouse gene apoAV. The sequences of the genes are given, and transgenic animals which either contain the gene or have the endogenous gene knocked out are described. In addition, single nucleotide polymorphisms (SNPs) in the gene are described and characterized. It is demonstrated that certain SNPs are associated with diseases involving lipids and triglycerides and other metabolic diseases. These SNPs may be used alone or with SNPs from other genes to study individual risk factors. Methods for intervention in lipid diseases, including the screening of drugs to treat lipid-related or diabetic diseases are also disclosed.
The transcriptional control machinery as well as the cell wall integrity and its regulation are involved in the detoxification of the organic solvent dimethyl sulfoxide in Saccharomyces cerevisiae.

PubMed

Zhang, Lilin; Liu, Ningning; Ma, Xiao; Jiang, Linghuo

2013-03-01

In the present study, we have identified 339 dimethyl sulfoxide (DMSO)-sensitive and nine DMSO-tolerant gene mutations in Saccharomyces cerevisiae through a functional genomics approach. Twelve of these identified DMSO-sensitive mutations are of genes involved in the general control of gene expression mediated by the SWR1 complex and the RNA polymerase II mediator complex, whereas 71 of them are of genes involved in the protein trafficking and vacuolar sorting processes. In addition, twelve of these DMSO-sensitive mutations are of genes involved in the cell wall integrity (CWI) and its regulation. DMSO-tolerant mutations are of genes mainly involved in the metabolism and the gene expression control. Therefore, the transcriptional control machinery, the CWI and its regulation as well as the protein trafficking and sorting process play critical roles in the DMSO detoxification in yeast cells. © 2012 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
Comprehensive Ex Vivo Transposon Mutagenesis Identifies Genes That Promote Growth Factor Independence and Leukemogenesis.

PubMed

Guo, Yabin; Updegraff, Barrett L; Park, Sunho; Durakoglugil, Deniz; Cruz, Victoria H; Maddux, Sarah; Hwang, Tae Hyun; O'Donnell, Kathryn A

2016-02-15

Aberrant signaling through cytokine receptors and their downstream signaling pathways is a major oncogenic mechanism underlying hematopoietic malignancies. To better understand how these pathways become pathologically activated and to potentially identify new drivers of hematopoietic cancers, we developed a high-throughput functional screening approach using ex vivo mutagenesis with the Sleeping Beauty transposon. We analyzed over 1,100 transposon-mutagenized pools of Ba/F3 cells, an IL3-dependent pro-B-cell line, which acquired cytokine independence and tumor-forming ability. Recurrent transposon insertions could be mapped to genes in the JAK/STAT and MAPK pathways, confirming the ability of this strategy to identify known oncogenic components of cytokine signaling pathways. In addition, recurrent insertions were identified in a large set of genes that have been found to be mutated in leukemia or associated with survival, but were not previously linked to the JAK/STAT or MAPK pathways nor shown to functionally contribute to leukemogenesis. Forced expression of these novel genes resulted in IL3-independent growth in vitro and tumorigenesis in vivo, validating this mutagenesis-based approach for identifying new genes that promote cytokine signaling and leukemogenesis. Therefore, our findings provide a broadly applicable approach for classifying functionally relevant genes in diverse malignancies and offer new insights into the impact of cytokine signaling on leukemia development. ©2015 American Association for Cancer Research.
Mutation screening of the LRIT3, CABP4, and GPR179 genes in Chinese patients with Schubert-Bornschein congenital stationary night blindness.

PubMed

Dan, Handong; Song, Xiusheng; Li, Jiazhang; Xing, Yiqiao; Li, Tuo

2017-01-01

Schubert-Bornschein congenital stationary night blindness (CSNB) is a rare retinal disorder that may lead to severe visual impairment in patients. The aim of this study was to detect mutations in the LRIT3, CABP4, and GPR179 genes in Chinese patients with Schubert-Bornschein CSNB. A cohort of eight unrelated Chinese probands with Schubert-Bornschein CSNB was recruited for this study. Six of these probands were assessed in our previous study, in which we screened the NYX, CACNA1F, GRM6, and TRPM1 genes for mutations but identified none. The other two patients were newly recruited and had not been screened for mutations in these genes. Genomic DNA and clinical data were collected from the eight recruited families. Variants of the LRIT3, CABP4, and GPR179 genes were identified by Sanger sequencing. All of the identified variants were also assessed in 192 control individuals. In this study, a novel compound heterozygous mutation, c.[1A>G]; [608G>T] (p.[0?]; p.[W203L]), was identified in the LRIT3 gene of a proband. These two mutations were not present in any of the 192 normal control individuals or in the other patients, and the missense mutation c.608G>T was predicted to be pathogenic. No mutations were identified in the CABP4 or GPR179 gene. These results expand the mutational spectrum of LRIT3, thus potentially enriching our understanding of the molecular basis of complete CSNB. Additional genes that potentially contribute to incomplete CSNB remain to be identified in future studies.

Identification of pathogenic genes related to rheumatoid arthritis through integrated analysis of DNA methylation and gene expression profiling.

PubMed

Zhang, Lei; Ma, Shiyun; Wang, Huailiang; Su, Hang; Su, Ke; Li, Longjie

2017-11-15

The purpose of our study was to identify new pathogenic genes used for exploring the pathogenesis of rheumatoid arthritis (RA). To screen pathogenic genes of RA, an integrated analysis was performed by using the microarray datasets in RA derived from the Gene Expression Omnibus (GEO) database. The functional annotation and potential pathways of differentially expressed genes (DEGs) were further discovered by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis. Afterwards, the integrated analysis of DNA methylation and gene expression profiling was used to screen crucial genes. In addition, we used RT-PCR and MSP to verify the expression levels and methylation status of these crucial genes in 20 synovial biopsy samples obtained from 10 RA model mice and 10 normal mice. BCL11B, CCDC88C, FCRLA and APOL6 were both up-regulated and hypomethylated in RA according to integrated analysis, RT-PCR and MSP verification. Four crucial genes (BCL11B, CCDC88C, FCRLA and APOL6) identified and analyzed in this study might be closely connected with the pathogenesis of RA. Copyright © 2017. Published by Elsevier B.V.
Evolutionary analysis of the jacalin-related lectin family genes in 11 fishes.

PubMed

Cao, Jun; Lv, Yueqing

2016-09-01

Jacalin-related lectins are a type of carbohydrate-binding proteins, which are distributed across a wide variety of organisms and involved in some important biological processes. The evolution of this gene family in fishes is unknown. Here, 47 putative jacalin genes in 11 fish species were identified and divided into 4 groups through phylogenetic analysis. Conserved gene organization and motif distribution existed in each group, suggesting their functional conservation. Some fishes have eleven jacalin genes, while others have only one or zero gene in their genomes, suggesting dynamic changes in the number of jacalin genes during the evolution of fishes. Intragenic recombination played a key role in the evolution of jacalin genes. Synteny analyses of jacalin genes in some fishes implied conserved and dynamic evolution characteristics of this gene family and related genome segments. Moreover, a few functional divergence sites were identified within each group pairs. Divergent expression profiles of the zebra fish jacalin genes were further investigated in different stresses. The results provided a foundation for exploring the characterization of the jacalin genes in fishes and will offer insights for additional functional studies. Copyright © 2016 Elsevier Ltd. All rights reserved.
Genomic analysis of the type VI secretion systems in Pseudomonas spp.: novel clusters and putative effectors uncovered.

PubMed

Barret, Matthieu; Egan, Frank; Fargier, Emilie; Morrissey, John P; O'Gara, Fergal

2011-06-01

Bacteria encode multiple protein secretion systems that are crucial for interaction with the environment and with hosts. In recent years, attention has focused on type VI secretion systems (T6SSs), which are specialized transporters widely encoded in Proteobacteria. The myriad of processes associated with these secretion systems could be explained by subclasses of T6SS, each involved in specialized functions. To assess diversity and predict function associated with different T6SSs, comparative genomic analysis of 34 Pseudomonas genomes was performed. This identified 70 T6SSs, with at least one locus in every strain, except for Pseudomonas stutzeri A1501. By comparing 11 core genes of the T6SS, it was possible to identify five main Pseudomonas phylogenetic clusters, with strains typically carrying T6SSs from more than one clade. In addition, most strains encode additional vgrG and hcp genes, which encode extracellular structural components of the secretion apparatus. Using a combination of phylogenetic and meta-analysis of transcriptome datasets it was possible to associate specific subsets of VgrG and Hcp proteins with each Pseudomonas T6SS clade. Moreover, a closer examination of the genomic context of vgrG genes in multiple strains highlights a number of additional genes associated with these regions. It is proposed that these genes may play a role in secretion or alternatively could be new T6S effectors.
Completion of the mitochondrial genome sequence of onion (Allium cepa L.) containing the CMS-S male-sterile cytoplasm and identification of an independent event of the ccmF N gene split.

PubMed

Kim, Bongju; Kim, Kyunghee; Yang, Tae-Jin; Kim, Sunggil

2016-11-01

Cytoplasmic male-sterility (CMS) conferred by the CMS-S cytoplasm has been most commonly used for onion (Allium cepa L.) F 1 hybrid seed production. We first report the complete mitochondrial genome sequence containing CMS-S cytoplasm in this study. Initially, seven contigs were de novo assembled from 150-bp paired-end raw reads produced from the total genomic DNA using the Illumina NextSeq500 platform. These contigs were connected into a single circular genome consisting of 316,363 bp (GenBank accession: KU318712) by PCR amplification. Although all 24 core protein-coding genes were present, no ribosomal protein-coding genes, except rps12, were identified in the onion mitochondrial genome. Unusual trans-splicing of the cox2 gene was verified, and the cox1 gene was identified as part of the chimeric orf725 gene, which is a candidate gene responsible for inducing CMS. In addition to orf725, two small chimeric genes were identified, but no transcripts were detected for these two open reading frames. Thirteen chloroplast-derived sequences, with sizes of 126-13,986 bp, were identified in the intergenic regions. Almost 10 % of the onion mitochondrial genome was composed of repeat sequences. The vast majority of repeats were short repeats of <100 base pairs. Interestingly, the gene encoding ccmF N was split into two genes. The ccmF N gene split is first identified outside the Brassicaceae family. The breakpoint in the onion ccmF N gene was different from that of other Brassicaceae species. This split of the ccmF N gene was also present in 30 other Allium species. The complete onion mitochondrial genome sequence reported in this study would be fundamental information for elucidation of onion CMS evolution.
Comparative genomic analysis of Helicobacter pylori from Malaysia identifies three distinct lineages suggestive of differential evolution

PubMed Central

Kumar, Narender; Mariappan, Vanitha; Baddam, Ramani; Lankapalli, Aditya K.; Shaik, Sabiha; Goh, Khean-Lee; Loke, Mun Fai; Perkins, Tim; Benghezal, Mohammed; Hasnain, Seyed E.; Vadivelu, Jamuna; Marshall, Barry J.; Ahmed, Niyaz

2015-01-01

The discordant prevalence of Helicobacter pylori and its related diseases, for a long time, fostered certain enigmatic situations observed in the countries of the southern world. Variation in H. pylori infection rates and disease outcomes among different populations in multi-ethnic Malaysia provides a unique opportunity to understand dynamics of host–pathogen interaction and genome evolution. In this study, we extensively analyzed and compared genomes of 27 Malaysian H. pylori isolates and identified three major phylogeographic lineages: hspEastAsia, hpEurope and hpSouthIndia. The analysis of the virulence genes within the core genome, however, revealed a comparable pathogenic potential of the strains. In addition, we identified four genes limited to strains of East-Asian lineage. Our analyses identified a few strain-specific genes encoding restriction modification systems and outlined 311 core genes possibly under differential evolutionary constraints, among the strains representing different ethnic groups. The cagA and vacA genes also showed variations in accordance with the host genetic background of the strains. Moreover, restriction modification genes were found to be significantly enriched in East-Asian strains. An understanding of these variations in the genome content would provide significant insights into various adaptive and host modulation strategies harnessed by H. pylori to effectively persist in a host-specific manner. PMID:25452339
Identification of somatic mutations in non-small cell lung carcinomas using whole-exome sequencing

PubMed Central

Liu, Pengyuan; Morrison, Carl; Wang, Liang; Xiong, Donghai; Vedell, Peter; Cui, Peng; Hua, Xing; Ding, Feng; Lu, Yan; James, Michael; Ebben, John D.; Xu, Haiming; Adjei, Alex A.; Head, Karen; Andrae, Jaime W.; Tschannen, Michael R.; Jacob, Howard; Pan, Jing; Zhang, Qi; Van den Bergh, Francoise; Xiao, Haijie; Lo, Ken C.; Patel, Jigar; Richmond, Todd; Watt, Mary-Anne; Albert, Thomas; Selzer, Rebecca; Anderson, Marshall; Wang, Jiang; Wang, Yian; Starnes, Sandra; Yang, Ping; You, Ming

2012-01-01

Lung cancer is the leading cause of cancer-related death, with non-small cell lung cancer (NSCLC) being the predominant form of the disease. Most lung cancer is caused by the accumulation of genomic alterations due to tobacco exposure. To uncover its mutational landscape, we performed whole-exome sequencing in 31 NSCLCs and their matched normal tissue samples. We identified both common and unique mutation spectra and pathway activation in lung adenocarcinomas and squamous cell carcinomas, two major histologies in NSCLC. In addition to identifying previously known lung cancer genes (TP53, KRAS, EGFR, CDKN2A and RB1), the analysis revealed many genes not previously implicated in this malignancy. Notably, a novel gene CSMD3 was identified as the second most frequently mutated gene (next to TP53) in lung cancer. We further demonstrated that loss of CSMD3 results in increased proliferation of airway epithelial cells. The study provides unprecedented insights into mutational processes, cellular pathways and gene networks associated with lung cancer. Of potential immediate clinical relevance, several highly mutated genes identified in our study are promising druggable targets in cancer therapy including ALK, CTNNA3, DCC, MLL3, PCDHIIX, PIK3C2B, PIK3CG and ROCK2. PMID:22510280
In Silico Detection of Sequence Variations Modifying Transcriptional Regulation

PubMed Central

Andersen, Malin C; Engström, Pär G; Lithwick, Stuart; Arenillas, David; Eriksson, Per; Lenhard, Boris; Wasserman, Wyeth W; Odeberg, Jacob

2008-01-01

Identification of functional genetic variation associated with increased susceptibility to complex diseases can elucidate genes and underlying biochemical mechanisms linked to disease onset and progression. For genes linked to genetic diseases, most identified causal mutations alter an encoded protein sequence. Technological advances for measuring RNA abundance suggest that a significant number of undiscovered causal mutations may alter the regulation of gene transcription. However, it remains a challenge to separate causal genetic variations from linked neutral variations. Here we present an in silico driven approach to identify possible genetic variation in regulatory sequences. The approach combines phylogenetic footprinting and transcription factor binding site prediction to identify variation in candidate cis-regulatory elements. The bioinformatics approach has been tested on a set of SNPs that are reported to have a regulatory function, as well as background SNPs. In the absence of additional information about an analyzed gene, the poor specificity of binding site prediction is prohibitive to its application. However, when additional data is available that can give guidance on which transcription factor is involved in the regulation of the gene, the in silico binding site prediction improves the selection of candidate regulatory polymorphisms for further analyses. The bioinformatics software generated for the analysis has been implemented as a Web-based application system entitled RAVEN (regulatory analysis of variation in enhancers). The RAVEN system is available at http://www.cisreg.ca for all researchers interested in the detection and characterization of regulatory sequence variation. PMID:18208319
Characterisation of betalain biosynthesis in Parakeelya flowers identifies the key biosynthetic gene DOD as belonging to an expanded LigB gene family that is conserved in betalain-producing species

PubMed Central

Chung, Hsiao-Hang; Schwinn, Kathy E.; Ngo, Hanh M.; Lewis, David H.; Massey, Baxter; Calcott, Kate E.; Crowhurst, Ross; Joyce, Daryl C.; Gould, Kevin S.; Davies, Kevin M.; Harrison, Dion K.

2015-01-01

Plant betalain pigments are intriguing because they are restricted to the Caryophyllales and are mutually exclusive with the more common anthocyanins. However, betalain biosynthesis is poorly understood compared to that of anthocyanins. In this study, betalain production and betalain-related genes were characterized in Parakeelya mirabilis (Montiaceae). RT-PCR and transcriptomics identified three sequences related to the key biosynthetic enzyme Dopa 4,5-dioxgenase (DOD). In addition to a LigB gene similar to that of non-Caryophyllales species (Class I genes), two other P. mirabilis LigB genes were found (DOD and DOD-like, termed Class II). PmDOD and PmDOD-like had 70% amino acid identity. Only PmDOD was implicated in betalain synthesis based on transient assays of enzyme activity and correlation of transcript abundance to spatio-temporal betalain accumulation. The role of PmDOD-like remains unknown. The striking pigment patterning of the flowers was due to distinct zones of red betacyanin and yellow betaxanthin production. The major betacyanin was the unglycosylated betanidin rather than the commonly found glycosides, an occurrence for which there are a few previous reports. The white petal zones lacked pigment but had DOD activity suggesting alternate regulation of the pathway in this tissue. DOD and DOD-like sequences were also identified in other betalain-producing species but not in examples of anthocyanin-producing Caryophyllales or non-Caryophyllales species. A Class I LigB sequence from the anthocyanin-producing Caryophyllaceae species Dianthus superbus and two DOD-like sequences from the Amaranthaceae species Beta vulgaris and Ptilotus spp. did not show DOD activity in the transient assay. The additional sequences suggests that DOD is part of a larger LigB gene family in betalain-producing Caryophyllales taxa, and the tandem genomic arrangement of two of the three B. vulgaris LigB genes suggests the involvement of duplication in the gene family evolution. PMID:26217353
Statistical inference for time course RNA-Seq data using a negative binomial mixed-effect model.

PubMed

Sun, Xiaoxiao; Dalpiaz, David; Wu, Di; S Liu, Jun; Zhong, Wenxuan; Ma, Ping

2016-08-26

Accurate identification of differentially expressed (DE) genes in time course RNA-Seq data is crucial for understanding the dynamics of transcriptional regulatory network. However, most of the available methods treat gene expressions at different time points as replicates and test the significance of the mean expression difference between treatments or conditions irrespective of time. They thus fail to identify many DE genes with different profiles across time. In this article, we propose a negative binomial mixed-effect model (NBMM) to identify DE genes in time course RNA-Seq data. In the NBMM, mean gene expression is characterized by a fixed effect, and time dependency is described by random effects. The NBMM is very flexible and can be fitted to both unreplicated and replicated time course RNA-Seq data via a penalized likelihood method. By comparing gene expression profiles over time, we further classify the DE genes into two subtypes to enhance the understanding of expression dynamics. A significance test for detecting DE genes is derived using a Kullback-Leibler distance ratio. Additionally, a significance test for gene sets is developed using a gene set score. Simulation analysis shows that the NBMM outperforms currently available methods for detecting DE genes and gene sets. Moreover, our real data analysis of fruit fly developmental time course RNA-Seq data demonstrates the NBMM identifies biologically relevant genes which are well justified by gene ontology analysis. The proposed method is powerful and efficient to detect biologically relevant DE genes and gene sets in time course RNA-Seq data.
Characterisation of the macrophage transcriptome in glomerulonephritis-susceptible and -resistant rat strains

PubMed Central

Maratou, Klio; Behmoaras, Jacques; Fewings, Chris; Srivastava, Prashant; D’Souza, Zelpha; Smith, Jennifer; Game, Laurence; Cook, Terence; Aitman, Tim

2010-01-01

Crescentic glomerulonephritis (CRGN) is a major cause of rapidly progressive renal failure for which the underlying genetic basis is unknown. WKY rats show marked susceptibility to CRGN, while Lewis rats are resistant. Glomerular injury and crescent formation are macrophage-dependent and mainly explained by seven quantitative trait loci (Crgn1-7). Here, we used microarray analysis in basal and lipopolysaccharide (LPS)-stimulated macrophages to identify genes that reside on pathways predisposing WKY rats to CRGN. We detected 97 novel positional candidates for the uncharacterised Crgn3-7. We identified 10 additional secondary effector genes with profound differences in expression between the two strains (>5-fold change, <1% False Discovery Rate) for basal and LPS-stimulated macrophages. Moreover, we identified 8 genes with differentially expressed alternatively spliced isoforms, by using an in depth analysis at probe-level that allowed us to discard false positives due to polymorphisms between the two rat strains. Pathway analysis identified several common linked pathways, enriched for differentially expressed genes, which affect macrophage activation. In summary, our results identify distinct macrophage transcriptome profiles between two rat strains that differ in susceptibility to glomerulonephritis, provide novel positional candidates for Crgn3-7, and define groups of genes that play a significant role in differential regulation of macrophage activity. PMID:21179115
Comparison of gene expression in segregating families identifies genes and genomic regions involved in a novel adaptation, zinc hyperaccumulation.

PubMed

Filatov, Victor; Dowdle, John; Smirnoff, Nicholas; Ford-Lloyd, Brian; Newbury, H John; Macnair, Mark R

2006-09-01

One of the challenges of comparative genomics is to identify specific genetic changes associated with the evolution of a novel adaptation or trait. We need to be able to disassociate the genes involved with a particular character from all the other genetic changes that take place as lineages diverge. Here we show that by comparing the transcriptional profile of segregating families with that of parent species differing in a novel trait, it is possible to narrow down substantially the list of potential target genes. In addition, by assuming synteny with a related model organism for which the complete genome sequence is available, it is possible to use the cosegregation of markers differing in transcription level to identify regions of the genome which probably contain quantitative trait loci (QTLs) for the character. This novel combination of genomics and classical genetics provides a very powerful tool to identify candidate genes. We use this methodology to investigate zinc hyperaccumulation in Arabidopsis halleri, the sister species to the model plant, Arabidopsis thaliana. We compare the transcriptional profile of A. halleri with that of its sister nonaccumulator species, Arabidopsis petraea, and between accumulator and nonaccumulator F(3)s derived from the cross between the two species. We identify eight genes which consistently show greater expression in accumulator phenotypes in both roots and shoots, including two metal transporter genes (NRAMP3 and ZIP6), and cytoplasmic aconitase, a gene involved in iron homeostasis in mammals. We also show that there appear to be two QTLs for zinc accumulation, on chromosomes 3 and 7.
Comparative transcriptome profiling of upland (VS16) and lowland (AP13) ecotypes of switchgrass.

PubMed

Ayyappan, Vasudevan; Saha, Malay C; Thimmapuram, Jyothi; Sripathi, Venkateswara R; Bhide, Ketaki P; Fiedler, Elizabeth; Hayford, Rita K; Kalavacharla, Venu Kal

2017-01-01

Transcriptomes of two switchgrass genotypes representing the upland and lowland ecotypes will be key tools in switchgrass genome annotation and biotic and abiotic stress functional genomics. Switchgrass (Panicum virgatum L.) is an important bioenergy feedstock for cellulosic ethanol production. We report genome-wide transcriptome profiling of two contrasting tetraploid switchgrass genotypes, VS16 and AP13, representing the upland and lowland ecotypes, respectively. A total of 268 million Illumina short reads (50 nt) were generated, of which, 133 million were obtained in AP13 and the rest 135 million in VS16. More than 90% of these reads were mapped to the switchgrass reference genome (V1.1). We identified 6619 and 5369 differentially expressed genes in VS16 and AP13, respectively. Gene ontology and KEGG pathway analysis identified key genes that regulate important pathways including C4 photosynthesis, photorespiration and phenylpropanoid metabolism. A series of genes (33) involved in photosynthetic pathway were up-regulated in AP13 but only two genes showed higher expression in VS16. We identified three dicarboxylate transporter homologs that were highly expressed in AP13. Additionally, genes that mediate drought, heat, and salinity tolerance were also identified. Vesicular transport proteins, syntaxin and signal recognition particles were seen to be up-regulated in VS16. Analyses of selected genes involved in biosynthesis of secondary metabolites, plant-pathogen interaction, membrane transporters, heat, drought and salinity stress responses confirmed significant variation in the relative expression reflected in RNA-Seq data between VS16 and AP13 genotypes. The phenylpropanoid pathway genes identified here are potential targets for biofuel conversion.
Copy number variants analysis in a cohort of isolated and syndromic developmental delay/intellectual disability reveals novel genomic disorders, position effects and candidate disease genes.

PubMed

Di Gregorio, E; Riberi, E; Belligni, E F; Biamino, E; Spielmann, M; Ala, U; Calcia, A; Bagnasco, I; Carli, D; Gai, G; Giordano, M; Guala, A; Keller, R; Mandrile, G; Arduino, C; Maffè, A; Naretto, V G; Sirchia, F; Sorasio, L; Ungari, S; Zonta, A; Zacchetti, G; Talarico, F; Pappi, P; Cavalieri, S; Giorgio, E; Mancini, C; Ferrero, M; Brussino, A; Savin, E; Gandione, M; Pelle, A; Giachino, D F; De Marchi, M; Restagno, G; Provero, P; Cirillo Silengo, M; Grosso, E; Buxbaum, J D; Pasini, B; De Rubeis, S; Brusco, A; Ferrero, G B

2017-10-01

Array-comparative genomic hybridization (array-CGH) is a widely used technique to detect copy number variants (CNVs) associated with developmental delay/intellectual disability (DD/ID). Identification of genomic disorders in DD/ID. We performed a comprehensive array-CGH investigation of 1,015 consecutive cases with DD/ID and combined literature mining, genetic evidence, evolutionary constraint scores, and functional information in order to assess the pathogenicity of the CNVs. We identified non-benign CNVs in 29% of patients. Amongst the pathogenic variants (11%), detected with a yield consistent with the literature, we found rare genomic disorders and CNVs spanning known disease genes. We further identified and discussed 51 cases with likely pathogenic CNVs spanning novel candidate genes, including genes encoding synaptic components and/or proteins involved in corticogenesis. Additionally, we identified two deletions spanning potential Topological Associated Domain (TAD) boundaries probably affecting the regulatory landscape. We show how phenotypic and genetic analyses of array-CGH data allow unraveling complex cases, identifying rare disease genes, and revealing unexpected position effects. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Genomic structure and expression of STM2, the chromosome 1 familial Alzheimer disease gene.

PubMed

Levy-Lahad, E; Poorkaj, P; Wang, K; Fu, Y H; Oshima, J; Mulligan, J; Schellenberg, G D

1996-06-01

Mutations in the gene STM2 result in autosomal dominant familial Alzheimer disease. To screen for mutations and to identify regulatory elements for this gene, the genomic DNA sequence and intron-exon structure were determined. Twelve exons including 10 coding exons were identified in a genomic region spanning 23,737 bp. The first 2 exons encode the 5'-untranslated region. Expression analysis of STM2 indicates that two transcripts of 2.4 and 2.8 kb are found in skeletal muscle, pancreas, and heart. In addition, a splice variant of the 2.4-kb transcript was identified that is the result of the use of an alternative splice acceptor site located in exon 10. The use of this site results in a transcript lacking a single glutamate. The promotor for this gene and the alternatively spliced exons leading to the 2.8-kb form of the gene remain to be identified. Expression of STM2 was high in skeletal muscle and pancreas, with comparatively low levels observed in brain. This expression pattern is intriguing since in Alzheimer disease, pathology and degeneration are observed only in the central nervous system.
Genomic structure and expression of STM2, the chromosome 1 familial Alzheimer disease gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Levy-Lahad, E.; Wang, Kai; Fu, Ying Hui

1996-06-01

Mutations in the gene STM2 result in autosomal dominant familial Alzheimer disease. To screen for mutations and to identify regulatory elements for this gene, the genomic DNA sequence and intron-exon structure were determined. Twelve exons including 10 coding exons were identified in a genomic region spanning 23, 737 bp. The first 2 exons encode the 5{prime}-untranslated region. Expression analysis of STM2 indicates that two transcripts of 2.4 and 2.8 kb are found in skeletal muscle, pancreas, and heart. In addition, a splice variant of the 2.4-kb transcript was identified that is the result of the use of an alternative splicemore » acceptor site located in exon 10. The use of this site results in a transcript lacking a single glutamate. The promotor for this gene and the alternatively spliced exons leading to the 2.8-kb form of the gene remain to be identified. Expression of STM2 was high in skeletal muscle and pancreas, with comparatively low levels observed in brain. This expression pattern is intriguing since in Alzheimer disease, pathology and degeneration are observed only in the central nervous system. 19 refs., 2 figs., 3 tabs.« less
Genome-wide gene-based analysis suggests an association between Neuroligin 1 (NLGN1) and post-traumatic stress disorder.

PubMed

Kilaru, V; Iyer, S V; Almli, L M; Stevens, J S; Lori, A; Jovanovic, T; Ely, T D; Bradley, B; Binder, E B; Koen, N; Stein, D J; Conneely, K N; Wingo, A P; Smith, A K; Ressler, K J

2016-05-24

Post-traumatic stress disorder (PTSD) develops in only some people following trauma exposure, but the mechanisms differentially explaining risk versus resilience remain largely unknown. PTSD is heritable but candidate gene studies and genome-wide association studies (GWAS) have identified only a modest number of genes that reliably contribute to PTSD. New gene-based methods may help identify additional genes that increase risk for PTSD development or severity. We applied gene-based testing to GWAS data from the Grady Trauma Project (GTP), a primarily African American cohort, and identified two genes (NLGN1 and ZNRD1-AS1) that associate with PTSD after multiple test correction. Although the top SNP from NLGN1 did not replicate, we observed gene-based replication of NLGN1 with PTSD in the Drakenstein Child Health Study (DCHS) cohort from Cape Town. NLGN1 has previously been associated with autism, and it encodes neuroligin 1, a protein involved in synaptogenesis, learning, and memory. Within the GTP dataset, a single nucleotide polymorphism (SNP), rs6779753, underlying the gene-based association, associated with the intermediate phenotypes of higher startle response and greater functional magnetic resonance imaging activation of the amygdala, orbitofrontal cortex, right thalamus and right fusiform gyrus in response to fearful faces. These findings support a contribution of the NLGN1 gene pathway to the neurobiological underpinnings of PTSD.
Genome-wide gene-based analysis suggests an association between Neuroligin 1 (NLGN1) and post-traumatic stress disorder

PubMed Central

Kilaru, V; Iyer, S V; Almli, L M; Stevens, J S; Lori, A; Jovanovic, T; Ely, T D; Bradley, B; Binder, E B; Koen, N; Stein, D J; Conneely, K N; Wingo, A P; Smith, A K; Ressler, K J

2016-01-01

Post-traumatic stress disorder (PTSD) develops in only some people following trauma exposure, but the mechanisms differentially explaining risk versus resilience remain largely unknown. PTSD is heritable but candidate gene studies and genome-wide association studies (GWAS) have identified only a modest number of genes that reliably contribute to PTSD. New gene-based methods may help identify additional genes that increase risk for PTSD development or severity. We applied gene-based testing to GWAS data from the Grady Trauma Project (GTP), a primarily African American cohort, and identified two genes (NLGN1 and ZNRD1-AS1) that associate with PTSD after multiple test correction. Although the top SNP from NLGN1 did not replicate, we observed gene-based replication of NLGN1 with PTSD in the Drakenstein Child Health Study (DCHS) cohort from Cape Town. NLGN1 has previously been associated with autism, and it encodes neuroligin 1, a protein involved in synaptogenesis, learning, and memory. Within the GTP dataset, a single nucleotide polymorphism (SNP), rs6779753, underlying the gene-based association, associated with the intermediate phenotypes of higher startle response and greater functional magnetic resonance imaging activation of the amygdala, orbitofrontal cortex, right thalamus and right fusiform gyrus in response to fearful faces. These findings support a contribution of the NLGN1 gene pathway to the neurobiological underpinnings of PTSD. PMID:27219346
The expanding universe of inflammatory bowel disease genetics.

PubMed

Achkar, Jean-Paul; Duerr, Richard

2008-07-01

Genetic factors play an important role in the pathogenesis of inflammatory bowel disease. In this review, we will provide an update on the rapid advances in the discovery of inflammatory bowel disease, primarily Crohn's disease, associated genes. Seven recently published Crohn's disease genome-wide association studies have confirmed prior findings related to the nucleotide-binding oligomerization domain 2 (NOD2) gene and the IBD5 locus. In addition, 10 novel loci have been identified and well replicated. Several promising associations between Crohn's disease and gene variants have been identified and replicated, the two most widely replicated being variants in the IL23R and ATG16L1 genes. These findings highlight and further support the importance of the immune system and its interactions with the intestinal microflora in the pathogenesis of inflammatory bowel disease.
Genetics Home Reference: McKusick-Kaufman syndrome

MedlinePlus

... Kaufman syndrome Additional NIH Resources (1 link) National Human Genome Research Institute: Gene Linked to Developmental Syndrome in Old Order Amish Identified by NIH Scientists Educational Resources ( ...
Genetic Determinants for Promoter Hypermethylation in the Lungs of Smokers: A Candidate Gene-Based Study

PubMed Central

Leng, Shuguang; Stidley, Christine A.; Liu, Yushi; Edlund, Christopher K.; Willink, Randall P.; Han, Younghun; Landi, Maria Teresa; Thun, Michael; Picchi, Maria A.; Bruse, Shannon E.; Crowell, Richard E.; Van Den Berg, David; Caporaso, Neil E.; Amos, Christopher I.; Siegfried, Jill M.; Tesfaigzi, Yohannes; Gilliland, Frank D.; Belinsky, Steven A.

2011-01-01

The detection of tumor suppressor gene promoter methylation in sputum-derived exfoliated cells predicts early lung cancer. Here we identified genetic determinants for this epigenetic process and examined their biological effects on gene regulation. A two-stage approach involving discovery and replication was employed to assess the association between promoter hypermethylation of a 12-gene panel and common variation in 40 genes involved in carcinogen metabolism, regulation of methylation, and DNA damage response in members of the Lovelace Smokers Cohort (n=1434). Molecular validation of three identified variants was conducted using primary bronchial epithelial cells. Association of study-wide significance (P<8.2×10−5) was identified for rs1641511, rs3730859, and rs1883264 in TP53, LIG1, and BIK, respectively. These SNPs were significantly associated with altered expression of the corresponding genes in primary bronchial epithelial cells. In addition, rs3730859 in LIG1 was also moderately associated with increased risk for lung cancer among Caucasian smokers. Together, our findings suggest that genetic variation in DNA replication and apoptosis pathways impacts the propensity for gene promoter hypermethylation in the aerodigestive tract of smokers. The incorporation of genetic biomarkers for gene promoter hypermethylation with clinical and somatic markers may improve risk assessment models for lung cancer. PMID:22139380

Identifying novel genes and chemicals related to nasopharyngeal cancer in a heterogeneous network.

PubMed

Li, Zhandong; An, Lifeng; Li, Hao; Wang, ShaoPeng; Zhou, You; Yuan, Fei; Li, Lin

2016-05-05

Nasopharyngeal cancer or nasopharyngeal carcinoma (NPC) is the most common cancer originating in the nasopharynx. The factors that induce nasopharyngeal cancer are still not clear. Additional information about the chemicals or genes related to nasopharyngeal cancer will promote a better understanding of the pathogenesis of this cancer and the factors that induce it. Thus, a computational method NPC-RGCP was proposed in this study to identify the possible relevant chemicals and genes based on the presently known chemicals and genes related to nasopharyngeal cancer. To extensively utilize the functional associations between proteins and chemicals, a heterogeneous network was constructed based on interactions of proteins and chemicals. The NPC-RGCP included two stages: the searching stage and the screening stage. The former stage is for finding new possible genes and chemicals in the heterogeneous network, while the latter stage is for screening and removing false discoveries and selecting the core genes and chemicals. As a result, five putative genes, CXCR3, IRF1, CDK1, GSTP1, and CDH2, and seven putative chemicals, iron, propionic acid, dimethyl sulfoxide, isopropanol, erythrose 4-phosphate, β-D-Fructose 6-phosphate, and flavin adenine dinucleotide, were identified by NPC-RGCP. Extensive analyses provided confirmation that the putative genes and chemicals have significant associations with nasopharyngeal cancer.
Identifying novel genes and chemicals related to nasopharyngeal cancer in a heterogeneous network

PubMed Central

Li, Zhandong; An, Lifeng; Li, Hao; Wang, ShaoPeng; Zhou, You; Yuan, Fei; Li, Lin

2016-01-01

Nasopharyngeal cancer or nasopharyngeal carcinoma (NPC) is the most common cancer originating in the nasopharynx. The factors that induce nasopharyngeal cancer are still not clear. Additional information about the chemicals or genes related to nasopharyngeal cancer will promote a better understanding of the pathogenesis of this cancer and the factors that induce it. Thus, a computational method NPC-RGCP was proposed in this study to identify the possible relevant chemicals and genes based on the presently known chemicals and genes related to nasopharyngeal cancer. To extensively utilize the functional associations between proteins and chemicals, a heterogeneous network was constructed based on interactions of proteins and chemicals. The NPC-RGCP included two stages: the searching stage and the screening stage. The former stage is for finding new possible genes and chemicals in the heterogeneous network, while the latter stage is for screening and removing false discoveries and selecting the core genes and chemicals. As a result, five putative genes, CXCR3, IRF1, CDK1, GSTP1, and CDH2, and seven putative chemicals, iron, propionic acid, dimethyl sulfoxide, isopropanol, erythrose 4-phosphate, β-D-Fructose 6-phosphate, and flavin adenine dinucleotide, were identified by NPC-RGCP. Extensive analyses provided confirmation that the putative genes and chemicals have significant associations with nasopharyngeal cancer. PMID:27149165
Functional and evolution characterization of SWEET sugar transporters in Ananas comosus.

PubMed

Guo, Chengying; Li, Huayang; Xia, Xinyao; Liu, Xiuyuan; Yang, Long

2018-02-05

Sugars will eventually be exported transporters (SWEETs) are a group of recently identified sugar transporters in plants that play important roles in diverse physiological processes. However, currently, limited information about this gene family is available in pineapple (Ananas comosus). The availability of the recently released pineapple genome sequence provides the opportunity to identify SWEET genes in a Bromeliaceae family member at the genome level. In this study, 39 pineapple SWEET genes were identified in two pineapple cultivars (18 AnfSWEET and 21 AnmSWEET) and further phylogenetically classified into five clades. A phylogenetic analysis revealed distinct evolutionary paths for the SWEET genes of the two pineapple cultivars. The MD2 cultivar might have experienced a different expansion than the F153 cultivar because two additional duplications exist, which separately gave rise to clades III and IV. A gene exon/intron structure analysis showed that the pineapple SWEET genes contained highly conserved exon/intron numbers. An analysis of public RNA-seq data and expression profiling showed that SWEET genes may be involved in fruit development and ripening processes. AnmSWEET5 and AnmSWEET11 were highly expressed in the early stages of pineapple fruit development and then decreased. The study increases the understanding of the roles of SWEET genes in pineapple. Copyright © 2018 Elsevier Inc. All rights reserved.
An improved Pearson's correlation proximity-based hierarchical clustering for mining biological association between genes.

PubMed

Booma, P M; Prabhakaran, S; Dhanalakshmi, R

2014-01-01

Microarray gene expression datasets has concerned great awareness among molecular biologist, statisticians, and computer scientists. Data mining that extracts the hidden and usual information from datasets fails to identify the most significant biological associations between genes. A search made with heuristic for standard biological process measures only the gene expression level, threshold, and response time. Heuristic search identifies and mines the best biological solution, but the association process was not efficiently addressed. To monitor higher rate of expression levels between genes, a hierarchical clustering model was proposed, where the biological association between genes is measured simultaneously using proximity measure of improved Pearson's correlation (PCPHC). Additionally, the Seed Augment algorithm adopts average linkage methods on rows and columns in order to expand a seed PCPHC model into a maximal global PCPHC (GL-PCPHC) model and to identify association between the clusters. Moreover, a GL-PCPHC applies pattern growing method to mine the PCPHC patterns. Compared to existing gene expression analysis, the PCPHC model achieves better performance. Experimental evaluations are conducted for GL-PCPHC model with standard benchmark gene expression datasets extracted from UCI repository and GenBank database in terms of execution time, size of pattern, significance level, biological association efficiency, and pattern quality.
An Improved Pearson's Correlation Proximity-Based Hierarchical Clustering for Mining Biological Association between Genes

PubMed Central

Booma, P. M.; Prabhakaran, S.; Dhanalakshmi, R.

2014-01-01

Microarray gene expression datasets has concerned great awareness among molecular biologist, statisticians, and computer scientists. Data mining that extracts the hidden and usual information from datasets fails to identify the most significant biological associations between genes. A search made with heuristic for standard biological process measures only the gene expression level, threshold, and response time. Heuristic search identifies and mines the best biological solution, but the association process was not efficiently addressed. To monitor higher rate of expression levels between genes, a hierarchical clustering model was proposed, where the biological association between genes is measured simultaneously using proximity measure of improved Pearson's correlation (PCPHC). Additionally, the Seed Augment algorithm adopts average linkage methods on rows and columns in order to expand a seed PCPHC model into a maximal global PCPHC (GL-PCPHC) model and to identify association between the clusters. Moreover, a GL-PCPHC applies pattern growing method to mine the PCPHC patterns. Compared to existing gene expression analysis, the PCPHC model achieves better performance. Experimental evaluations are conducted for GL-PCPHC model with standard benchmark gene expression datasets extracted from UCI repository and GenBank database in terms of execution time, size of pattern, significance level, biological association efficiency, and pattern quality. PMID:25136661
Identification of estrogen-responsive genes using a genome-wide analysis of promoter elements for transcription factor binding sites.

PubMed

Kamalakaran, Sitharthan; Radhakrishnan, Senthil K; Beck, William T

2005-06-03

We developed a pipeline to identify novel genes regulated by the steroid hormone-dependent transcription factor, estrogen receptor, through a systematic analysis of upstream regions of all human and mouse genes. We built a data base of putative promoter regions for 23,077 human and 19,984 mouse transcripts from National Center for Biotechnology Information annotation and 8793 human and 6785 mouse promoters from the Data Base of Transcriptional Start Sites. We used this data base of putative promoters to identify potential targets of estrogen receptor by identifying estrogen response elements (EREs) in their promoters. Our program correctly identified EREs in genes known to be regulated by estrogen in addition to several new genes whose putative promoters contained EREs. We validated six genes (KIAA1243, NRIP1, MADH9, NME3, TPD52L, and ABCG2) to be estrogen-responsive in MCF7 cells using reverse transcription PCR. To allow for extensibility of our program in identifying targets of other transcription factors, we have built a Web interface to access our data base and programs. Our Web-based program for Promoter Analysis of Genome, PAGen@UIC, allows a user to identify putative target genes for vertebrate transcription factors through the analysis of their upstream sequences. The interface allows the user to search the human and mouse promoter data bases for potential target genes containing one or more listed transcription factor binding sites (TFBSs) in their upstream elements, using either regular expression-based consensus or position weight matrices. The data base can also be searched for promoters harboring user-defined TFBSs given as a consensus or a position weight matrix. Furthermore, the user can retrieve putative promoter sequences for any given gene together with identified TFBSs located on its promoter. Orthologous promoters are also analyzed to determine conserved elements.
Caffeine exposure alters cardiac gene expression in embryonic cardiomyocytes

PubMed Central

Fang, Xiefan; Mei, Wenbin; Barbazuk, William B.; Rivkees, Scott A.

2014-01-01

Previous studies demonstrated that in utero caffeine treatment at embryonic day (E) 8.5 alters DNA methylation patterns, gene expression, and cardiac function in adult mice. To provide insight into the mechanisms, we examined cardiac gene and microRNA (miRNA) expression in cardiomyocytes shortly after exposure to physiologically relevant doses of caffeine. In HL-1 and primary embryonic cardiomyocytes, caffeine treatment for 48 h significantly altered the expression of cardiac structural genes (Myh6, Myh7, Myh7b, Tnni3), hormonal genes (Anp and BnP), cardiac transcription factors (Gata4, Mef2c, Mef2d, Nfatc1), and microRNAs (miRNAs; miR208a, miR208b, miR499). In addition, expressions of these genes were significantly altered in embryonic hearts exposed to in utero caffeine. For in utero experiments, pregnant CD-1 dams were treated with 20–60 mg/kg of caffeine, which resulted in maternal circulation levels of 37.3–65.3 μM 2 h after treatment. RNA sequencing was performed on embryonic ventricles treated with vehicle or 20 mg/kg of caffeine daily from E6.5-9.5. Differential expression (DE) analysis revealed that 124 genes and 849 transcripts were significantly altered, and differential exon usage (DEU) analysis identified 597 exons that were changed in response to prenatal caffeine exposure. Among the DE genes identified by RNA sequencing were several cardiac structural genes and genes that control DNA methylation and histone modification. Pathway analysis revealed that pathways related to cardiovascular development and diseases were significantly affected by caffeine. In addition, global cardiac DNA methylation was reduced in caffeine-treated cardiomyocytes. Collectively, these data demonstrate that caffeine exposure alters gene expression and DNA methylation in embryonic cardiomyocytes. PMID:25354728
Nitrate-induced genes in tomato roots. Array analysis reveals novel genes that may play a role in nitrogen nutrition.

PubMed

Wang, Y H; Garvin, D F; Kochian, L V

2001-09-01

A subtractive tomato (Lycopersicon esculentum) root cDNA library enriched in genes up-regulated by changes in plant mineral status was screened with labeled mRNA from roots of both nitrate-induced and mineral nutrient-deficient (-nitrogen [N], -phosphorus, -potassium [K], -sulfur, -magnesium, -calcium, -iron, -zinc, and -copper) tomato plants. A subset of cDNAs was selected from this library based on mineral nutrient-related changes in expression. Additional cDNAs were selected from a second mineral-deficient tomato root library based on sequence homology to known genes. These selection processes yielded a set of 1,280 mineral nutrition-related cDNAs that were arrayed on nylon membranes for further analysis. These high-density arrays were hybridized with mRNA from tomato plants exposed to nitrate at different time points after N was withheld for 48 h, for plants that were grown on nitrate/ammonium for 5 weeks prior to the withholding of N. One hundred-fifteen genes were found to be up-regulated by nitrate resupply. Among these genes were several previously identified as nitrate responsive, including nitrate transporters, nitrate and nitrite reductase, and metabolic enzymes such as transaldolase, transketolase, malate dehydrogenase, asparagine synthetase, and histidine decarboxylase. We also identified 14 novel nitrate-inducible genes, including: (a) water channels, (b) root phosphate and K(+) transporters, (c) genes potentially involved in transcriptional regulation, (d) stress response genes, and (e) ribosomal protein genes. In addition, both families of nitrate transporters were also found to be inducible by phosphate, K, and iron deficiencies. The identification of these novel nitrate-inducible genes is providing avenues of research that will yield new insights into the molecular basis of plant N nutrition, as well as possible networking between the regulation of N, phosphorus, and K nutrition.
Genome-wide histone state profiling of fibroblasts from the opossum, Monodelphis domestica, identifies the first marsupial-specific imprinted gene

PubMed Central

2014-01-01

Background Imprinted genes have been extensively documented in eutherian mammals and found to exhibit significant interspecific variation in the suites of genes that are imprinted and in their regulation between tissues and developmental stages. Much less is known about imprinted loci in metatherian (marsupial) mammals, wherein studies have been limited to a small number of genes previously known to be imprinted in eutherians. We describe the first ab initio search for imprinted marsupial genes, in fibroblasts from the opossum, Monodelphis domestica, based on a genome-wide ChIP-seq strategy to identify promoters that are simultaneously marked by mutually exclusive, transcriptionally opposing histone modifications. Results We identified a novel imprinted gene (Meis1) and two additional monoallelically expressed genes, one of which (Cstb) showed allele-specific, but non-imprinted expression. Imprinted vs. allele-specific expression could not be resolved for the third monoallelically expressed gene (Rpl17). Transcriptionally opposing histone modifications H3K4me3, H3K9Ac, and H3K9me3 were found at the promoters of all three genes, but differential DNA methylation was not detected at CpG islands at any of these promoters. Conclusions In generating the first genome-wide histone modification profiles for a marsupial, we identified the first gene that is imprinted in a marsupial but not in eutherian mammals. This outcome demonstrates the practicality of an ab initio discovery strategy and implicates histone modification, but not differential DNA methylation, as a conserved mechanism for marking imprinted genes in all therian mammals. Our findings suggest that marsupials use multiple epigenetic mechanisms for imprinting and support the concept that lineage-specific selective forces can produce sets of imprinted genes that differ between metatherian and eutherian lines. PMID:24484454
Large-scale gene-centric meta-analysis across 32 studies identifies multiple lipid loci

USDA-ARS?s Scientific Manuscript database

Genome-wide association studies (GWASs) have identified many SNPs underlying variations in plasma-lipid levels. We explore whether additional loci associated with plasma-lipid phenotypes, such as high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C), total cholest...
Exome chip meta-analysis identifies novel loci and East Asian-specific coding variants that contribute to lipid levels and coronary artery disease.

PubMed

Lu, Xiangfeng; Peloso, Gina M; Liu, Dajiang J; Wu, Ying; Zhang, He; Zhou, Wei; Li, Jun; Tang, Clara Sze-Man; Dorajoo, Rajkumar; Li, Huaixing; Long, Jirong; Guo, Xiuqing; Xu, Ming; Spracklen, Cassandra N; Chen, Yang; Liu, Xuezhen; Zhang, Yan; Khor, Chiea Chuen; Liu, Jianjun; Sun, Liang; Wang, Laiyuan; Gao, Yu-Tang; Hu, Yao; Yu, Kuai; Wang, Yiqin; Cheung, Chloe Yu Yan; Wang, Feijie; Huang, Jianfeng; Fan, Qiao; Cai, Qiuyin; Chen, Shufeng; Shi, Jinxiu; Yang, Xueli; Zhao, Wanting; Sheu, Wayne H-H; Cherny, Stacey Shawn; He, Meian; Feranil, Alan B; Adair, Linda S; Gordon-Larsen, Penny; Du, Shufa; Varma, Rohit; Chen, Yii-Der Ida; Shu, Xiao-Ou; Lam, Karen Siu Ling; Wong, Tien Yin; Ganesh, Santhi K; Mo, Zengnan; Hveem, Kristian; Fritsche, Lars G; Nielsen, Jonas Bille; Tse, Hung-Fat; Huo, Yong; Cheng, Ching-Yu; Chen, Y Eugene; Zheng, Wei; Tai, E Shyong; Gao, Wei; Lin, Xu; Huang, Wei; Abecasis, Goncalo; Kathiresan, Sekar; Mohlke, Karen L; Wu, Tangchun; Sham, Pak Chung; Gu, Dongfeng; Willer, Cristen J

2017-12-01

Most genome-wide association studies have been of European individuals, even though most genetic variation in humans is seen only in non-European samples. To search for novel loci associated with blood lipid levels and clarify the mechanism of action at previously identified lipid loci, we used an exome array to examine protein-coding genetic variants in 47,532 East Asian individuals. We identified 255 variants at 41 loci that reached chip-wide significance, including 3 novel loci and 14 East Asian-specific coding variant associations. After a meta-analysis including >300,000 European samples, we identified an additional nine novel loci. Sixteen genes were identified by protein-altering variants in both East Asians and Europeans, and thus are likely to be functional genes. Our data demonstrate that most of the low-frequency or rare coding variants associated with lipids are population specific, and that examining genomic data across diverse ancestries may facilitate the identification of functional genes at associated loci.
Exome chip meta-analysis identifies novel loci and East Asian-specific coding variants contributing to lipid levels and coronary artery disease

PubMed Central

Lu, Xiangfeng; Peloso, Gina M; Liu, Dajiang J.; Wu, Ying; Zhang, He; Zhou, Wei; Li, Jun; Tang, Clara Sze-man; Dorajoo, Rajkumar; Li, Huaixing; Long, Jirong; Guo, Xiuqing; Xu, Ming; Spracklen, Cassandra N.; Chen, Yang; Liu, Xuezhen; Zhang, Yan; Khor, Chiea Chuen; Liu, Jianjun; Sun, Liang; Wang, Laiyuan; Gao, Yu-Tang; Hu, Yao; Yu, Kuai; Wang, Yiqin; Cheung, Chloe Yu Yan; Wang, Feijie; Huang, Jianfeng; Fan, Qiao; Cai, Qiuyin; Chen, Shufeng; Shi, Jinxiu; Yang, Xueli; Zhao, Wanting; Sheu, Wayne H.-H.; Cherny, Stacey Shawn; He, Meian; Feranil, Alan B.; Adair, Linda S.; Gordon-Larsen, Penny; Du, Shufa; Varma, Rohit; da Chen, Yii-Der I; Shu, XiaoOu; Lam, Karen Siu Ling; Wong, Tien Yin; Ganesh, Santhi K.; Mo, Zengnan; Hveem, Kristian; Fritsche, Lars; Nielsen, Jonas Bille; Tse, Hung-fat; Huo, Yong; Cheng, Ching-Yu; Chen, Y. Eugene; Zheng, Wei; Tai, E Shyong; Gao, Wei; Lin, Xu; Huang, Wei; Abecasis, Goncalo; Consortium, GLGC; Kathiresan, Sekar; Mohlke, Karen L.; Wu, Tangchun; Sham, Pak Chung; Gu, Dongfeng; Willer, Cristen J

2017-01-01

Most genome-wide association studies have been conducted in European individuals, even though most genetic variation in humans is seen only in non-European samples. To search for novel loci associated with blood lipid levels and clarify the mechanism of action at previously identified lipid loci, we examined protein-coding genetic variants in 47,532 East Asian individuals using an exome array. We identified 255 variants at 41 loci reaching chip-wide significance, including 3 novel loci and 14 East Asian-specific coding variant associations. After meta-analysis with > 300,000 European samples, we identified an additional 9 novel loci. The same 16 genes were identified by the protein-altering variants in both East Asians and Europeans, likely pointing to the functional genes. Our data demonstrate that most of the low-frequency or rare coding variants associated with lipids are population-specific, and that examining genomic data across diverse ancestries may facilitate the identification of functional genes at associated loci. PMID:29083407
Gene: a gene-centered information resource at NCBI.

PubMed

Brown, Garth R; Hem, Vichet; Katz, Kenneth S; Ovetsky, Michael; Wallin, Craig; Ermolaeva, Olga; Tolstoy, Igor; Tatusova, Tatiana; Pruitt, Kim D; Maglott, Donna R; Murphy, Terence D

2015-01-01

The National Center for Biotechnology Information's (NCBI) Gene database (www.ncbi.nlm.nih.gov/gene) integrates gene-specific information from multiple data sources. NCBI Reference Sequence (RefSeq) genomes for viruses, prokaryotes and eukaryotes are the primary foundation for Gene records in that they form the critical association between sequence and a tracked gene upon which additional functional and descriptive content is anchored. Additional content is integrated based on the genomic location and RefSeq transcript and protein sequence data. The content of a Gene record represents the integration of curation and automated processing from RefSeq, collaborating model organism databases, consortia such as Gene Ontology, and other databases within NCBI. Records in Gene are assigned unique, tracked integers as identifiers. The content (citations, nomenclature, genomic location, gene products and their attributes, phenotypes, sequences, interactions, variation details, maps, expression, homologs, protein domains and external databases) is available via interactive browsing through NCBI's Entrez system, via NCBI's Entrez programming utilities (E-Utilities and Entrez Direct) and for bulk transfer by FTP. Published by Oxford University Press on behalf of Nucleic Acids Research 2014. This work is written by (a) US Government employee(s) and is in the public domain in the US.
The carnegie protein trap library: a versatile tool for Drosophila developmental studies.

PubMed

Buszczak, Michael; Paterno, Shelley; Lighthouse, Daniel; Bachman, Julia; Planck, Jamie; Owen, Stephenie; Skora, Andrew D; Nystul, Todd G; Ohlstein, Benjamin; Allen, Anna; Wilhelm, James E; Murphy, Terence D; Levis, Robert W; Matunis, Erika; Srivali, Nahathai; Hoskins, Roger A; Spradling, Allan C

2007-03-01

Metazoan physiology depends on intricate patterns of gene expression that remain poorly known. Using transposon mutagenesis in Drosophila, we constructed a library of 7404 protein trap and enhancer trap lines, the Carnegie collection, to facilitate gene expression mapping at single-cell resolution. By sequencing the genomic insertion sites, determining splicing patterns downstream of the enhanced green fluorescent protein (EGFP) exon, and analyzing expression patterns in the ovary and salivary gland, we found that 600-900 different genes are trapped in our collection. A core set of 244 lines trapped different identifiable protein isoforms, while insertions likely to act as GFP-enhancer traps were found in 256 additional genes. At least 8 novel genes were also identified. Our results demonstrate that the Carnegie collection will be useful as a discovery tool in diverse areas of cell and developmental biology and suggest new strategies for greatly increasing the coverage of the Drosophila proteome with protein trap insertions.
Androgen Receptor Gene Polymorphism, Aggression, and Reproduction in Tanzanian Foragers and Pastoralists

PubMed Central

Butovskaya, Marina L.; Lazebny, Oleg E.; Vasilyev, Vasiliy A.; Dronova, Daria A.; Karelin, Dmitri V.; Mabulla, Audax Z. P.; Shibalev, Dmitri V.; Shackelford, Todd K.; Fink, Bernhard; Ryskov, Alexey P.

2015-01-01

The androgen receptor (AR) gene polymorphism in humans is linked to aggression and may also be linked to reproduction. Here we report associations between AR gene polymorphism and aggression and reproduction in two small-scale societies in northern Tanzania (Africa)—the Hadza (monogamous foragers) and the Datoga (polygynous pastoralists). We secured self-reports of aggression and assessed genetic polymorphism of the number of CAG repeats for the AR gene for 210 Hadza men and 229 Datoga men (aged 17–70 years). We conducted structural equation modeling to identify links between AR gene polymorphism, aggression, and number of children born, and included age and ethnicity as covariates. Fewer AR CAG repeats predicted greater aggression, and Datoga men reported more aggression than did Hadza men. In addition, aggression mediated the identified negative relationship between CAG repeats and number of children born. PMID:26291982
Mammalian polycistronic mRNAs and disease

PubMed Central

Karginov, Timofey A.; Hejazi Pastor, Daniel Parviz; Semler, Bert L.; Gomez, Christopher M.

2016-01-01

Our understanding of gene expression has come far since the “one-gene one-polypeptide” hypothesis proposed by Beadle and Tatum. This review addresses the gradual recognition that a growing number of polycistronic genes, originally discovered in viruses, are being identified within the mammalian genome, and that these may provide new insights into disease mechanisms and treatment. We have carried out a systematic literature review identifying 13 mammalian genes for which there is evidence for polycistronic expression via translation through an Internal Ribosome Entry Site (IRES). Although the canonical mechanism of translation initiation has been studied extensively, this review highlights a process of non-canonical translation, IRES-mediated translation, that is a growing source of understanding complex inheritance, elucidation of disease mechanisms, and discovery of novel therapeutic targets. Identification of additional polycistronic genes may provide new insights into disease therapy and allow for new discoveries of translational and disease mechanisms. PMID:28012572
Microarray analysis reveals overlapping and specific transcriptional responses to different plant hormones in rice

PubMed Central

Garg, Rohini; Tyagi, Akhilesh K.; Jain, Mukesh

2012-01-01

Hormones exert pleiotropic effects on plant growth and development throughout the life cycle. Many of these effects are mediated at molecular level via altering gene expression. In this study, we investigated the exogenous effect of plant hormones, including auxin, cytokinin, abscisic acid, ethylene, salicylic acid and jasmonic acid, on the transcription of rice genes at whole genome level using microarray. Our analysis identified a total of 4171 genes involved in several biological processes, whose expression was altered significantly in the presence of different hormones. Further, 28% of these genes exhibited overlapping transcriptional responses in the presence of any two hormones, indicating crosstalk among plant hormones. In addition, we identified genes showing only a particular hormone-specific response, which can be used as hormone-specific markers. The results of this study will facilitate further studies in hormone biology in rice. PMID:22827941
Mining microarrays for metabolic meaning: nutritional regulation of hypothalamic gene expression.

PubMed

Mobbs, Charles V; Yen, Kelvin; Mastaitis, Jason; Nguyen, Ha; Watson, Elizabeth; Wurmbach, Elisa; Sealfon, Stuart C; Brooks, Andrew; Salton, Stephen R J

2004-06-01

DNA microarray analysis has been used to investigate relative changes in the level of gene expression in the CNS, including changes that are associated with disease, injury, psychiatric disorders, drug exposure or withdrawal, and memory formation. We have used oligonucleotide microarrays to identify hypothalamic genes that respond to nutritional manipulation. In addition to commonly used microarray analysis based on criteria such as fold-regulation, we have also found that simply carrying out multiple t tests then sorting by P value constitutes a highly reliable method to detect true regulation, as assessed by real-time polymerase chain reaction (PCR), even for relatively low abundance genes or relatively low magnitude of regulation. Such analyses directly suggested novel mechanisms that mediate effects of nutritional state on neuroendocrine function and are being used to identify regulated gene products that may elucidate the metabolic pathology of obese ob/ob, lean Vgf-/Vgf-, and other models with profound metabolic impairments.
The genetics of Alzheimer disease: current status and future prospects.

PubMed

Blacker, D; Tanzi, R E

1998-03-01

Four genes involved in the development of Alzheimer disease have been identified. Three fully penetrant (deterministic) genes lead to the development of Alzheimer disease in patients younger than 60 years: the amyloid beta-protein precursor on chromosome 21, presenilin 1 on chromosome 14, and presenilin 2 on chromosome 1. Together, they account for about half of this early-onset form of the disease. One genetic risk factor--apolipoprotein E-4--is associated with late-onset Alzheimer disease. It accounts for a substantial fraction of disease burden but seems to act primarily to lower the age of disease onset. In general, none of these genes can be easily adapted for use as a diagnostic or predictive test for Alzheimer disease. Research activity includes searching for additional genes, especially for late-onset disease, and elucidating the mechanism of action of all identified genes as part of a long-term effort to develop more effective therapeutic and preventive strategies.
Phylogenetics of Lophotrochozoan bHLH Genes and the Evolution of Lineage-Specific Gene Duplicates.

PubMed

Bao, Yongbo; Xu, Fei; Shimeld, Sebastian M

2017-04-01

The gain and loss of genes encoding transcription factors is of importance to understanding the evolution of gene regulatory complexity. The basic helix-loop-helix (bHLH) genes encode a large superfamily of transcription factors. We systematically classify the bHLH genes from five mollusc, two annelid and one brachiopod genomes, tracing the pattern of bHLH gene evolution across these poorly studied Phyla. In total, 56-88 bHLH genes were identified in each genome, with most identifiable as members of previously described bilaterian families, or of new families we define. Of such families only one, Mesp, appears lost by all these species. Additional duplications have also played a role in the evolution of the bHLH gene repertoire, with many new lophotrochozoan-, mollusc-, bivalve-, or gastropod-specific genes defined. Using a combination of transcriptome mining, RT-PCR, and in situ hybridization we compared the expression of several of these novel genes in tissues and embryos of the molluscs Crassostrea gigas and Patella vulgata, finding both conserved expression and evidence for neofunctionalization. We also map the positions of the genes across these genomes, identifying numerous gene linkages. Some reflect recent paralog divergence by tandem duplication, others are remnants of ancient tandem duplications dating to the lophotrochozoan or bilaterian common ancestors. These data are built into a model of the evolution of bHLH genes in molluscs, showing formidable evolutionary stasis at the family level but considerable within-family diversification by tandem gene duplication. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Lineage-Specific Evolutionary Histories and Regulation of Major Starch Metabolism Genes during Banana Ripening

PubMed Central

Jourda, Cyril; Cardi, Céline; Gibert, Olivier; Giraldo Toro, Andrès; Ricci, Julien; Mbéguié-A-Mbéguié, Didier; Yahiaoui, Nabila

2016-01-01

Starch is the most widespread and abundant storage carbohydrate in plants. It is also a major feature of cultivated bananas as it accumulates to large amounts during banana fruit development before almost complete conversion to soluble sugars during ripening. Little is known about the structure of major gene families involved in banana starch metabolism and their evolution compared to other species. To identify genes involved in banana starch metabolism and investigate their evolutionary history, we analyzed six gene families playing a crucial role in plant starch biosynthesis and degradation: the ADP-glucose pyrophosphorylases (AGPases), starch synthases (SS), starch branching enzymes (SBE), debranching enzymes (DBE), α-amylases (AMY) and β-amylases (BAM). Using comparative genomics and phylogenetic approaches, these genes were classified into families and sub-families and orthology relationships with functional genes in Eudicots and in grasses were identified. In addition to known ancestral duplications shaping starch metabolism gene families, independent evolution in banana and grasses also occurred through lineage-specific whole genome duplications for specific sub-families of AGPase, SS, SBE, and BAM genes; and through gene-scale duplications for AMY genes. In particular, banana lineage duplications yielded a set of AGPase, SBE and BAM genes that were highly or specifically expressed in banana fruits. Gene expression analysis highlighted a complex transcriptional reprogramming of starch metabolism genes during ripening of banana fruits. A differential regulation of expression between banana gene duplicates was identified for SBE and BAM genes, suggesting that part of starch metabolism regulation in the fruit evolved in the banana lineage. PMID:27994606
Construction and analysis of gene-gene dynamics influence networks based on a Boolean model.

PubMed

Mazaya, Maulida; Trinh, Hung-Cuong; Kwon, Yung-Keun

2017-12-21

Identification of novel gene-gene relations is a crucial issue to understand system-level biological phenomena. To this end, many methods based on a correlation analysis of gene expressions or structural analysis of molecular interaction networks have been proposed. They have a limitation in identifying more complicated gene-gene dynamical relations, though. To overcome this limitation, we proposed a measure to quantify a gene-gene dynamical influence (GDI) using a Boolean network model and constructed a GDI network to indicate existence of a dynamical influence for every ordered pair of genes. It represents how much a state trajectory of a target gene is changed by a knockout mutation subject to a source gene in a gene-gene molecular interaction (GMI) network. Through a topological comparison between GDI and GMI networks, we observed that the former network is denser than the latter network, which implies that there exist many gene pairs of dynamically influencing but molecularly non-interacting relations. In addition, a larger number of hub genes were generated in the GDI network. On the other hand, there was a correlation between these networks such that the degree value of a node was positively correlated to each other. We further investigated the relationships of the GDI value with structural properties and found that there are negative and positive correlations with the length of a shortest path and the number of paths, respectively. In addition, a GDI network could predict a set of genes whose steady-state expression is affected in E. coli gene-knockout experiments. More interestingly, we found that the drug-targets with side-effects have a larger number of outgoing links than the other genes in the GDI network, which implies that they are more likely to influence the dynamics of other genes. Finally, we found biological evidences showing that the gene pairs which are not molecularly interacting but dynamically influential can be considered for novel gene-gene relationships. Taken together, construction and analysis of the GDI network can be a useful approach to identify novel gene-gene relationships in terms of the dynamical influence.
Combining Human Epigenetics and Sleep Studies in Caenorhabditis elegans: A Cross-Species Approach for Finding Conserved Genes Regulating Sleep.

PubMed

Huang, Huiyan; Zhu, Yong; Eliot, Melissa N; Knopik, Valerie S; McGeary, John E; Carskadon, Mary A; Hart, Anne C

2017-06-01

We aimed to test a combined approach to identify conserved genes regulating sleep and to explore the association between DNA methylation and sleep length. We identified candidate genes associated with shorter versus longer sleep duration in college students based on DNA methylation using Illumina Infinium HumanMethylation450 BeadChip arrays. Orthologous genes in Caenorhabditis elegans were identified, and we examined whether their loss of function affected C. elegans sleep. For genes whose perturbation affected C. elegans sleep, we subsequently undertook a small pilot study to re-examine DNA methylation in an independent set of human participants with shorter versus longer sleep durations. Eighty-seven out of 485,577 CpG sites had significant differential methylation in young adults with shorter versus longer sleep duration, corresponding to 52 candidate genes. We identified 34 C. elegans orthologs, including NPY/flp-18 and flp-21, which are known to affect sleep. Loss of five additional genes alters developmentally timed C. elegans sleep (B4GALT6/bre-4, DOCK180/ced-5, GNB2L1/rack-1, PTPRN2/ida-1, ZFYVE28/lst-2). For one of these genes, ZFYVE28 (also known as hLst2), the pilot replication study again found decreased DNA methylation associated with shorter sleep duration at the same two CpG sites in the first intron of ZFYVE28. Using an approach that combines human epigenetics and C. elegans sleep studies, we identified five genes that play previously unidentified roles in C. elegans sleep. We suggest sleep duration in humans may be associated with differential DNA methylation at specific sites and that the conserved genes identified here likely play roles in C. elegans sleep and in other species. © Sleep Research Society 2017. Published by Oxford University Press on behalf of the Sleep Research Society. All rights reserved. For permissions, please e-mail journals.permissions@oup.com.
Secretome Characterization and Correlation Analysis Reveal Putative Pathogenicity Mechanisms and Identify Candidate Avirulence Genes in the Wheat Stripe Rust Fungus Puccinia striiformis f. sp. tritici.

PubMed

Xia, Chongjing; Wang, Meinan; Cornejo, Omar E; Jiwan, Derick A; See, Deven R; Chen, Xianming

2017-01-01

Stripe (yellow) rust, caused by Puccinia striiformis f. sp. tritici ( Pst ), is one of the most destructive diseases of wheat worldwide. Planting resistant cultivars is an effective way to control this disease, but race-specific resistance can be overcome quickly due to the rapid evolving Pst population. Studying the pathogenicity mechanisms is critical for understanding how Pst virulence changes and how to develop wheat cultivars with durable resistance to stripe rust. We re-sequenced 7 Pst isolates and included additional 7 previously sequenced isolates to represent balanced virulence/avirulence profiles for several avirulence loci in seretome analyses. We observed an uneven distribution of heterozygosity among the isolates. Secretome comparison of Pst with other rust fungi identified a large portion of species-specific secreted proteins, suggesting that they may have specific roles when interacting with the wheat host. Thirty-two effectors of Pst were identified from its secretome. We identified candidates for Avr genes corresponding to six Yr genes by correlating polymorphisms for effector genes to the virulence/avirulence profiles of the 14 Pst isolates. The putative AvYr76 was present in the avirulent isolates, but absent in the virulent isolates, suggesting that deleting the coding region of the candidate avirulence gene has produced races virulent to resistance gene Yr76 . We conclude that incorporating avirulence/virulence phenotypes into correlation analysis with variations in genomic structure and secretome, particularly presence/absence polymorphisms of effectors, is an efficient way to identify candidate Avr genes in Pst . The candidate effector genes provide a rich resource for further studies to determine the evolutionary history of Pst populations and the co-evolutionary arms race between Pst and wheat. The Avr candidates identified in this study will lead to cloning avirulence genes in Pst , which will enable us to understand molecular mechanisms underlying Pst -wheat interactions, to determine the effectiveness of resistance genes and further to develop durable resistance to stripe rust.
Adenoid cystic carcinoma: current therapy and potential therapeutic advances based on genomic profiling

PubMed Central

Chae, Young Kwang; Chung, Su Yun; Davis, Andrew A.; Carneiro, Benedito A.; Chandra, Sunandana; Kaplan, Jason; Kalyan, Aparna; Giles, Francis J.

2015-01-01

Adenoid cystic carcinoma (ACC) is a rare cancer with high potential for recurrence and metastasis. Efficacy of current treatment options, particularly for advanced disease, is very limited. Recent whole genome and exome sequencing has dramatically improved our understanding of ACC pathogenesis. A balanced translocation resulting in the MYB-NFIB fusion gene appears to be a fundamental signature of ACC. In addition, sequencing has identified a number of other driver genes mutated in downstream pathways common to other well-studied cancers. Overexpression of oncogenic proteins involved in cell growth, adhesion, cell cycle regulation, and angiogenesis are also present in ACC. Collectively, studies have identified genes and proteins for targeted, mechanism-based, therapies based on tumor phenotypes, as opposed to nonspecific cytotoxic agents. In addition, although few studies in ACC currently exist, immunotherapy may also hold promise. Better genetic understanding will enable treatment with novel targeted agents and initial exploration of immune-based therapies with the goal of improving outcomes for patients with ACC. PMID:26359351
Circadian expression profiles of chromatin remodeling factor genes in Arabidopsis.

PubMed

Lee, Hong Gil; Lee, Kyounghee; Jang, Kiyoung; Seo, Pil Joon

2015-01-01

The circadian clock is a biological time keeper mechanism that regulates biological rhythms to a period of approximately 24 h. The circadian clock enables organisms to anticipate environmental cycles and coordinates internal cellular physiology with external environmental cues. In plants, correct matching of the clock with the environment confers fitness advantages to plant survival and reproduction. Therefore, circadian clock components are regulated at multiple layers to fine-tune the circadian oscillation. Epigenetic regulation provides an additional layer of circadian control. However, little is known about which chromatin remodeling factors are responsible for circadian control. In this work, we analyzed circadian expression of 109 chromatin remodeling factor genes and identified 17 genes that display circadian oscillation. In addition, we also found that a candidate interacts with a core clock component, supporting that clock activity is regulated in part by chromatin modification. As an initial attempt to elucidate the relationship between chromatin modification and circadian oscillation, we identified novel regulatory candidates that provide a platform for future investigations of chromatin regulation of the circadian clock.
De novo Sequencing and Comparative Transcriptomics of Floral Development of the Distylous Species Lithospermum multiflorum

PubMed Central

Cohen, James I.

2016-01-01

Genes controlling the morphological, micromorphological, and physiological components of the breeding system distyly have been hypothesized, but many of the genes have not been investigated throughout development of the two floral morphs. To this end, the present study is an examination of comparative transcriptomes from three stages of development for the floral organs of the morphs of Lithospermum multiflorum. Transcriptomes of flowers of the two morphs, from various stages of development, were sequenced using an Illumina HiSeq 2000. The floral transcriptome of L. multiflorum was assembled, and differential gene expression (DE) was identified between morphs, throughout development. Additionally, Gene Ontology (GO) terms for DE genes were determined. Fewer genes were DE early in development compared to later in development, with more genes highly expressed in the gynoecium of the SS morph and the corolla and androecium of the LS morph. A reciprocal pattern was observed later in development, and many more genes were DE during this latter stage. During early development, DE genes appear to be involved in growth and floral development, and during later development, DE genes seem to affect physiological functions. Interestingly, many genes involved in response to stress were identified as DE between morphs. PMID:28066486
De novo Sequencing and Comparative Transcriptomics of Floral Development of the Distylous Species Lithospermum multiflorum.

PubMed

Cohen, James I

2016-01-01

Genes controlling the morphological, micromorphological, and physiological components of the breeding system distyly have been hypothesized, but many of the genes have not been investigated throughout development of the two floral morphs. To this end, the present study is an examination of comparative transcriptomes from three stages of development for the floral organs of the morphs of Lithospermum multiflorum . Transcriptomes of flowers of the two morphs, from various stages of development, were sequenced using an Illumina HiSeq 2000. The floral transcriptome of L. multiflorum was assembled, and differential gene expression (DE) was identified between morphs, throughout development. Additionally, Gene Ontology (GO) terms for DE genes were determined. Fewer genes were DE early in development compared to later in development, with more genes highly expressed in the gynoecium of the SS morph and the corolla and androecium of the LS morph. A reciprocal pattern was observed later in development, and many more genes were DE during this latter stage. During early development, DE genes appear to be involved in growth and floral development, and during later development, DE genes seem to affect physiological functions. Interestingly, many genes involved in response to stress were identified as DE between morphs.
Genome-wide analysis of the WRKY gene family in physic nut (Jatropha curcas L.).

PubMed

Xiong, Wangdan; Xu, Xueqin; Zhang, Lin; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

2013-07-25

The WRKY proteins, which contain highly conserved WRKYGQK amino acid sequences and zinc-finger-like motifs, constitute a large family of transcription factors in plants. They participate in diverse physiological and developmental processes. WRKY genes have been identified and characterized in a number of plant species. We identified a total of 58 WRKY genes (JcWRKY) in the genome of the physic nut (Jatropha curcas L.). On the basis of their conserved WRKY domain sequences, all of the JcWRKY proteins could be assigned to one of the previously defined groups, I-III. Phylogenetic analysis of JcWRKY genes with Arabidopsis and rice WRKY genes, and separately with castor bean WRKY genes, revealed no evidence of recent gene duplication in JcWRKY gene family. Analysis of transcript abundance of JcWRKY gene products were tested in different tissues under normal growth condition. In addition, 47 WRKY genes responded to at least one abiotic stress (drought, salinity, phosphate starvation and nitrogen starvation) in individual tissues (leaf, root and/or shoot cortex). Our study provides a useful reference data set as the basis for cloning and functional analysis of physic nut WRKY genes. Copyright © 2013 Elsevier B.V. All rights reserved.
Novel linkage disequilibrium clustering algorithm identifies new lupus genes on meta-analysis of GWAS datasets.

PubMed

Saeed, Mohammad

2017-05-01

Systemic lupus erythematosus (SLE) is a complex disorder. Genetic association studies of complex disorders suffer from the following three major issues: phenotypic heterogeneity, false positive (type I error), and false negative (type II error) results. Hence, genes with low to moderate effects are missed in standard analyses, especially after statistical corrections. OASIS is a novel linkage disequilibrium clustering algorithm that can potentially address false positives and negatives in genome-wide association studies (GWAS) of complex disorders such as SLE. OASIS was applied to two SLE dbGAP GWAS datasets (6077 subjects; ∼0.75 million single-nucleotide polymorphisms). OASIS identified three known SLE genes viz. IFIH1, TNIP1, and CD44, not previously reported using these GWAS datasets. In addition, 22 novel loci for SLE were identified and the 5 SLE genes previously reported using these datasets were verified. OASIS methodology was validated using single-variant replication and gene-based analysis with GATES. This led to the verification of 60% of OASIS loci. New SLE genes that OASIS identified and were further verified include TNFAIP6, DNAJB3, TTF1, GRIN2B, MON2, LATS2, SNX6, RBFOX1, NCOA3, and CHAF1B. This study presents the OASIS algorithm, software, and the meta-analyses of two publicly available SLE GWAS datasets along with the novel SLE genes. Hence, OASIS is a novel linkage disequilibrium clustering method that can be universally applied to existing GWAS datasets for the identification of new genes.
Functional genomic analysis identifies indoxyl sulfate as a major, poorly dialyzable uremic toxin in end-stage renal disease.

PubMed

Jhawar, Sachin; Singh, Prabhjot; Torres, Daniel; Ramirez-Valle, Francisco; Kassem, Hania; Banerjee, Trina; Dolgalev, Igor; Heguy, Adriana; Zavadil, Jiri; Lowenstein, Jerome

2015-01-01

Chronic renal failure is characterized by progressive renal scarring and accelerated arteriosclerotic cardiovascular disease despite what is considered to be adequate hemodialysis or peritoneal dialysis. In rodents with reduced renal mass, renal scarring has been attributed to poorly filtered, small protein-bound molecules. The best studied of these is indoxyl sulfate (IS). We have attempted to establish whether there are uremic toxins that are not effectively removed by hemodialysis. We examined plasma from patients undergoing hemodialysis, employing global gene expression in normal human renal cortical cells incubated in pre- and post- dialysis plasma as a reporter system. Responses in cells incubated with pre- and post-dialysis uremic plasma (n = 10) were compared with responses elicited by plasma from control subjects (n = 5). The effects of adding IS to control plasma and of adding probenecid to uremic plasma were examined. Plasma concentrations of IS were measured by HPLC (high pressure liquid chromatography). Gene expression in our reporter system revealed dysregulation of 1912 genes in cells incubated with pre-dialysis uremic plasma. In cells incubated in post-dialysis plasma, the expression of 537 of those genes returned to baseline but the majority of them (1375) remained dysregulated. IS concentration was markedly elevated in pre- and post-dialysis plasma. Addition of IS to control plasma simulated more than 80% of the effects of uremic plasma on gene expression; the addition of probenecid, an organic anion transport (OAT) inhibitor, to uremic plasma reversed the changes in gene expression. These findings provide evidence that hemodialysis fails to effectively clear one or more solutes that effect gene expression, in our reporter system, from the plasma of patients with uremia. The finding that gene dysregulation was simulated by the addition of IS to control plasma and inhibited by addition of an OAT inhibitor to uremic plasma identifies IS as a major, poorly dialyzable, uremic toxin. The signaling pathways initiated by IS and possibly other solutes not effectively removed by dialysis may participate in the pathogenesis of renal scarring and uremic vasculopathy.
GSNFS: Gene subnetwork biomarker identification of lung cancer expression data.

PubMed

Doungpan, Narumol; Engchuan, Worrawat; Chan, Jonathan H; Meechai, Asawin

2016-12-05

Gene expression has been used to identify disease gene biomarkers, but there are ongoing challenges. Single gene or gene-set biomarkers are inadequate to provide sufficient understanding of complex disease mechanisms and the relationship among those genes. Network-based methods have thus been considered for inferring the interaction within a group of genes to further study the disease mechanism. Recently, the Gene-Network-based Feature Set (GNFS), which is capable of handling case-control and multiclass expression for gene biomarker identification, has been proposed, partly taking into account of network topology. However, its performance relies on a greedy search for building subnetworks and thus requires further improvement. In this work, we establish a new approach named Gene Sub-Network-based Feature Selection (GSNFS) by implementing the GNFS framework with two proposed searching and scoring algorithms, namely gene-set-based (GS) search and parent-node-based (PN) search, to identify subnetworks. An additional dataset is used to validate the results. The two proposed searching algorithms of the GSNFS method for subnetwork expansion are concerned with the degree of connectivity and the scoring scheme for building subnetworks and their topology. For each iteration of expansion, the neighbour genes of a current subnetwork, whose expression data improved the overall subnetwork score, is recruited. While the GS search calculated the subnetwork score using an activity score of a current subnetwork and the gene expression values of its neighbours, the PN search uses the expression value of the corresponding parent of each neighbour gene. Four lung cancer expression datasets were used for subnetwork identification. In addition, using pathway data and protein-protein interaction as network data in order to consider the interaction among significant genes were discussed. Classification was performed to compare the performance of the identified gene subnetworks with three subnetwork identification algorithms. The two searching algorithms resulted in better classification and gene/gene-set agreement compared to the original greedy search of the GNFS method. The identified lung cancer subnetwork using the proposed searching algorithm resulted in an improvement of the cross-dataset validation and an increase in the consistency of findings between two independent datasets. The homogeneity measurement of the datasets was conducted to assess dataset compatibility in cross-dataset validation. The lung cancer dataset with higher homogeneity showed a better result when using the GS search while the dataset with low homogeneity showed a better result when using the PN search. The 10-fold cross-dataset validation on the independent lung cancer datasets showed higher classification performance of the proposed algorithms when compared with the greedy search in the original GNFS method. The proposed searching algorithms provide a higher number of genes in the subnetwork expansion step than the greedy algorithm. As a result, the performance of the subnetworks identified from the GSNFS method was improved in terms of classification performance and gene/gene-set level agreement depending on the homogeneity of the datasets used in the analysis. Some common genes obtained from the four datasets using different searching algorithms are genes known to play a role in lung cancer. The improvement of classification performance and the gene/gene-set level agreement, and the biological relevance indicated the effectiveness of the GSNFS method for gene subnetwork identification using expression data.
Let them fall where they may: congruence analysis in massive phylogenetically messy data sets.

PubMed

Leigh, Jessica W; Schliep, Klaus; Lopez, Philippe; Bapteste, Eric

2011-10-01

Interest in congruence in phylogenetic data has largely focused on issues affecting multicellular organisms, and animals in particular, in which the level of incongruence is expected to be relatively low. In addition, assessment methods developed in the past have been designed for reasonably small numbers of loci and scale poorly for larger data sets. However, there are currently over a thousand complete genome sequences available and of interest to evolutionary biologists, and these sequences are predominantly from microbial organisms, whose molecular evolution is much less frequently tree-like than that of multicellular life forms. As such, the level of incongruence in these data is expected to be high. We present a congruence method that accommodates both very large numbers of genes and high degrees of incongruence. Our method uses clustering algorithms to identify subsets of genes based on similarity of phylogenetic signal. It involves only a single phylogenetic analysis per gene, and therefore, computation time scales nearly linearly with the number of genes in the data set. We show that our method performs very well with sets of sequence alignments simulated under a wide variety of conditions. In addition, we present an analysis of core genes of prokaryotes, often assumed to have been largely vertically inherited, in which we identify two highly incongruent classes of genes. This result is consistent with the complexity hypothesis.
Systematic prediction of control proteins and their DNA binding sites

PubMed Central

Sorokin, Valeriy; Severinov, Konstantin; Gelfand, Mikhail S.

2009-01-01

We present here the results of a systematic bioinformatics analysis of control (C) proteins, a class of DNA-binding regulators that control time-delayed transcription of their own genes as well as restriction endonuclease genes in many type II restriction-modification systems. More than 290 C protein homologs were identified and DNA-binding sites for ∼70% of new and previously known C proteins were predicted by a combination of phylogenetic footprinting and motif searches in DNA upstream of C protein genes. Additional analysis revealed that a large proportion of C protein genes are translated from leaderless RNA, which may contribute to time-delayed nature of genetic switches operated by these proteins. Analysis of genetic contexts of newly identified C protein genes revealed that they are not exclusively associated with restriction-modification genes; numerous instances of associations with genes originating from mobile genetic elements were observed. These instances might be vestiges of ancient horizontal transfers and indicate that during evolution ancestral restriction-modification system genes were the sites of mobile elements insertions. PMID:19056824
Genome-wide analysis of Atlantic salmon (Salmo salar) mucin genes and their role as biomarkers

PubMed Central

Grammes, Fabian Thomas; Ytteborg, Elisabeth; Takle, Harald; Jørgensen, Sven Martin

2017-01-01

The aim of this study was to identify potential mucin genes in the Atlantic salmon genome and evaluate tissue-specific distribution and transcriptional regulation in response to aquaculture-relevant stress conditions in post-smolts. Seven secreted gel-forming mucin genes were identified based on several layers of evidence; annotation, transcription, phylogeny and domain structure. Two genes were annotated as muc2 and five genes as muc5. The muc2 genes were predominantly transcribed in the intestinal region while the different genes in the muc5 family were mainly transcribed in either skin, gill or pyloric caeca. In order to investigate transcriptional regulation of mucins during stress conditions, two controlled experiments were conducted. In the first experiment, handling stress induced mucin transcription in the gill, while transcription decreased in the skin and intestine. In the second experiment, long term intensive rearing conditions (fish biomass ~125 kg/m3) interrupted by additional confinement led to increased transcription of mucin genes in the skin at one, seven and fourteen days post-confinement. PMID:29236729
Mapping of Gene Expression Reveals CYP27A1 as a Susceptibility Gene for Sporadic ALS

PubMed Central

van Rheenen, Wouter; Franke, Lude; Jansen, Ritsert C.; van Es, Michael A.; van Vught, Paul W. J.; Blauw, Hylke M.; Groen, Ewout J. N.; Horvath, Steve; Estrada, Karol; Rivadeneira, Fernando; Hofman, Albert; Uitterlinden, Andre G.; Robberecht, Wim; Andersen, Peter M.; Melki, Judith; Meininger, Vincent; Hardiman, Orla; Landers, John E.; Brown, Robert H.; Shatunov, Aleksey; Shaw, Christopher E.; Leigh, P. Nigel; Al-Chalabi, Ammar; Ophoff, Roel A.

2012-01-01

Amyotrophic lateral sclerosis (ALS) is a progressive, neurodegenerative disease characterized by loss of upper and lower motor neurons. ALS is considered to be a complex trait and genome-wide association studies (GWAS) have implicated a few susceptibility loci. However, many more causal loci remain to be discovered. Since it has been shown that genetic variants associated with complex traits are more likely to be eQTLs than frequency-matched variants from GWAS platforms, we conducted a two-stage genome-wide screening for eQTLs associated with ALS. In addition, we applied an eQTL analysis to finemap association loci. Expression profiles using peripheral blood of 323 sporadic ALS patients and 413 controls were mapped to genome-wide genotyping data. Subsequently, data from a two-stage GWAS (3,568 patients and 10,163 controls) were used to prioritize eQTLs identified in the first stage (162 ALS, 207 controls). These prioritized eQTLs were carried forward to the second sample with both gene-expression and genotyping data (161 ALS, 206 controls). Replicated eQTL SNPs were then tested for association in the second-stage GWAS data to find SNPs associated with disease, that survived correction for multiple testing. We thus identified twelve cis eQTLs with nominally significant associations in the second-stage GWAS data. Eight SNP-transcript pairs of highest significance (lowest p = 1.27×10−51) withstood multiple-testing correction in the second stage and modulated CYP27A1 gene expression. Additionally, we show that C9orf72 appears to be the only gene in the 9p21.2 locus that is regulated in cis, showing the potential of this approach in identifying causative genes in association loci in ALS. This study has identified candidate genes for sporadic ALS, most notably CYP27A1. Mutations in CYP27A1 are causal to cerebrotendinous xanthomatosis which can present as a clinical mimic of ALS with progressive upper motor neuron loss, making it a plausible susceptibility gene for ALS. PMID:22509407
Mining Gene Expression Signature for the Detection of Pre-Malignant Melanocytes and Early Melanomas with Risk for Metastasis

PubMed Central

de Souza, Camila Ferreira; Xander, Patrícia; Monteiro, Ana Carolina; Silva, Amanda Gonçalves dos Santos; da Silva, Débora Castanheira Pereira; Mai, Sabine; Bernardo, Viviane; Lopes, José Daniel; Jasiulionis, Miriam Galvonas

2012-01-01

Background Metastatic melanoma is a highly aggressive skin cancer and currently resistant to systemic therapy. Melanomas may involve genetic, epigenetic and metabolic abnormalities. Evidence is emerging that epigenetic changes might play a significant role in tumor cell plasticity and metastatic phenotype of melanoma cells. Principal findings In this study, we developed a systematic approach to identify genes implicated in melanoma progression. To do this, we used the Affymetrix GeneChip Arrays to screen 34,000 mouse transcripts in melan-a melanocytes, 4C pre-malignant melanocytes, 4C11− non-metastatic and 4C11+ metastatic melanoma cell lines. The genome-wide association studies revealed pathways commonly over-represented in the transition from immortalized to pre-malignant stage, and under-represented in the transition from non-metastatic to metastatic stage. Additionally, the treatment of cells with 10 µM 5-aza-2′-deoxycytidine (5AzaCdR) for 48 hours allowed us to identify genes differentially re-expressed at specific stages of melan-a malignant transformation. Treatment of human primary melanocytes with the demethylating agent 5AzaCdR in combination to the histone deacetylase inhibitor Trichostatin A (TSA) revealed changes on melanocyte morphology and gene expression which could be an indicator of epigenetic flexibility in normal melanocytes. Moreover, changes on gene expression recognized by affecting the melanocyte biology (NDRG2 and VDR), phenotype of metastatic melanoma cells (HSPB1 and SERPINE1) and response to cancer therapy (CTCF, NSD1 and SRC) were found when Mel-2 and/or Mel-3-derived patient metastases were exposed to 5AzaCdR plus TSA treatment. Hierarchical clustering and network analyses in a panel of five patient-derived metastatic melanoma cells showed gene interactions that have never been described in melanomas. Significance Despite the heterogeneity observed in melanomas, this study demonstrates the utility of our murine melanoma progression model to identify molecular markers commonly perturbed in metastasis. Additionally, the novel gene expression signature identified here may be useful in the future into a model more closely related to translational research. PMID:22984562
Genome-wide analysis identifies 12 loci influencing human reproductive behavior.

PubMed

Barban, Nicola; Jansen, Rick; de Vlaming, Ronald; Vaez, Ahmad; Mandemakers, Jornt J; Tropf, Felix C; Shen, Xia; Wilson, James F; Chasman, Daniel I; Nolte, Ilja M; Tragante, Vinicius; van der Laan, Sander W; Perry, John R B; Kong, Augustine; Ahluwalia, Tarunveer S; Albrecht, Eva; Yerges-Armstrong, Laura; Atzmon, Gil; Auro, Kirsi; Ayers, Kristin; Bakshi, Andrew; Ben-Avraham, Danny; Berger, Klaus; Bergman, Aviv; Bertram, Lars; Bielak, Lawrence F; Bjornsdottir, Gyda; Bonder, Marc Jan; Broer, Linda; Bui, Minh; Barbieri, Caterina; Cavadino, Alana; Chavarro, Jorge E; Turman, Constance; Concas, Maria Pina; Cordell, Heather J; Davies, Gail; Eibich, Peter; Eriksson, Nicholas; Esko, Tõnu; Eriksson, Joel; Falahi, Fahimeh; Felix, Janine F; Fontana, Mark Alan; Franke, Lude; Gandin, Ilaria; Gaskins, Audrey J; Gieger, Christian; Gunderson, Erica P; Guo, Xiuqing; Hayward, Caroline; He, Chunyan; Hofer, Edith; Huang, Hongyan; Joshi, Peter K; Kanoni, Stavroula; Karlsson, Robert; Kiechl, Stefan; Kifley, Annette; Kluttig, Alexander; Kraft, Peter; Lagou, Vasiliki; Lecoeur, Cecile; Lahti, Jari; Li-Gao, Ruifang; Lind, Penelope A; Liu, Tian; Makalic, Enes; Mamasoula, Crysovalanto; Matteson, Lindsay; Mbarek, Hamdi; McArdle, Patrick F; McMahon, George; Meddens, S Fleur W; Mihailov, Evelin; Miller, Mike; Missmer, Stacey A; Monnereau, Claire; van der Most, Peter J; Myhre, Ronny; Nalls, Mike A; Nutile, Teresa; Kalafati, Ioanna Panagiota; Porcu, Eleonora; Prokopenko, Inga; Rajan, Kumar B; Rich-Edwards, Janet; Rietveld, Cornelius A; Robino, Antonietta; Rose, Lynda M; Rueedi, Rico; Ryan, Kathleen A; Saba, Yasaman; Schmidt, Daniel; Smith, Jennifer A; Stolk, Lisette; Streeten, Elizabeth; Tönjes, Anke; Thorleifsson, Gudmar; Ulivi, Sheila; Wedenoja, Juho; Wellmann, Juergen; Willeit, Peter; Yao, Jie; Yengo, Loic; Zhao, Jing Hua; Zhao, Wei; Zhernakova, Daria V; Amin, Najaf; Andrews, Howard; Balkau, Beverley; Barzilai, Nir; Bergmann, Sven; Biino, Ginevra; Bisgaard, Hans; Bønnelykke, Klaus; Boomsma, Dorret I; Buring, Julie E; Campbell, Harry; Cappellani, Stefania; Ciullo, Marina; Cox, Simon R; Cucca, Francesco; Toniolo, Daniela; Davey-Smith, George; Deary, Ian J; Dedoussis, George; Deloukas, Panos; van Duijn, Cornelia M; de Geus, Eco J C; Eriksson, Johan G; Evans, Denis A; Faul, Jessica D; Sala, Cinzia Felicita; Froguel, Philippe; Gasparini, Paolo; Girotto, Giorgia; Grabe, Hans-Jörgen; Greiser, Karin Halina; Groenen, Patrick J F; de Haan, Hugoline G; Haerting, Johannes; Harris, Tamara B; Heath, Andrew C; Heikkilä, Kauko; Hofman, Albert; Homuth, Georg; Holliday, Elizabeth G; Hopper, John; Hyppönen, Elina; Jacobsson, Bo; Jaddoe, Vincent W V; Johannesson, Magnus; Jugessur, Astanand; Kähönen, Mika; Kajantie, Eero; Kardia, Sharon L R; Keavney, Bernard; Kolcic, Ivana; Koponen, Päivikki; Kovacs, Peter; Kronenberg, Florian; Kutalik, Zoltan; La Bianca, Martina; Lachance, Genevieve; Iacono, William G; Lai, Sandra; Lehtimäki, Terho; Liewald, David C; Lindgren, Cecilia M; Liu, Yongmei; Luben, Robert; Lucht, Michael; Luoto, Riitta; Magnus, Per; Magnusson, Patrik K E; Martin, Nicholas G; McGue, Matt; McQuillan, Ruth; Medland, Sarah E; Meisinger, Christa; Mellström, Dan; Metspalu, Andres; Traglia, Michela; Milani, Lili; Mitchell, Paul; Montgomery, Grant W; Mook-Kanamori, Dennis; de Mutsert, Renée; Nohr, Ellen A; Ohlsson, Claes; Olsen, Jørn; Ong, Ken K; Paternoster, Lavinia; Pattie, Alison; Penninx, Brenda W J H; Perola, Markus; Peyser, Patricia A; Pirastu, Mario; Polasek, Ozren; Power, Chris; Kaprio, Jaakko; Raffel, Leslie J; Räikkönen, Katri; Raitakari, Olli; Ridker, Paul M; Ring, Susan M; Roll, Kathryn; Rudan, Igor; Ruggiero, Daniela; Rujescu, Dan; Salomaa, Veikko; Schlessinger, David; Schmidt, Helena; Schmidt, Reinhold; Schupf, Nicole; Smit, Johannes; Sorice, Rossella; Spector, Tim D; Starr, John M; Stöckl, Doris; Strauch, Konstantin; Stumvoll, Michael; Swertz, Morris A; Thorsteinsdottir, Unnur; Thurik, A Roy; Timpson, Nicholas J; Tung, Joyce Y; Uitterlinden, André G; Vaccargiu, Simona; Viikari, Jorma; Vitart, Veronique; Völzke, Henry; Vollenweider, Peter; Vuckovic, Dragana; Waage, Johannes; Wagner, Gert G; Wang, Jie Jin; Wareham, Nicholas J; Weir, David R; Willemsen, Gonneke; Willeit, Johann; Wright, Alan F; Zondervan, Krina T; Stefansson, Kari; Krueger, Robert F; Lee, James J; Benjamin, Daniel J; Cesarini, David; Koellinger, Philipp D; den Hoed, Marcel; Snieder, Harold; Mills, Melinda C

2016-12-01

The genetic architecture of human reproductive behavior-age at first birth (AFB) and number of children ever born (NEB)-has a strong relationship with fitness, human development, infertility and risk of neuropsychiatric disorders. However, very few genetic loci have been identified, and the underlying mechanisms of AFB and NEB are poorly understood. We report a large genome-wide association study of both sexes including 251,151 individuals for AFB and 343,072 individuals for NEB. We identified 12 independent loci that are significantly associated with AFB and/or NEB in a SNP-based genome-wide association study and 4 additional loci associated in a gene-based effort. These loci harbor genes that are likely to have a role, either directly or by affecting non-local gene expression, in human reproduction and infertility, thereby increasing understanding of these complex traits.
An evidence-based knowledgebase of metastasis suppressors to identify key pathways relevant to cancer metastasis

PubMed Central

Zhao, Min; Li, Zhe; Qu, Hong

2015-01-01

Metastasis suppressor genes (MS genes) are genes that play important roles in inhibiting the process of cancer metastasis without preventing growth of the primary tumor. Identification of these genes and understanding their functions are critical for investigation of cancer metastasis. Recent studies on cancer metastasis have identified many new susceptibility MS genes. However, the comprehensive illustration of diverse cellular processes regulated by metastasis suppressors during the metastasis cascade is lacking. Thus, the relationship between MS genes and cancer risk is still unclear. To unveil the cellular complexity of MS genes, we have constructed MSGene (http://MSGene.bioinfo-minzhao.org/), the first literature-based gene resource for exploring human MS genes. In total, we manually curated 194 experimentally verified MS genes and mapped to 1448 homologous genes from 17 model species. Follow-up functional analyses associated 194 human MS genes with epithelium/tissue morphogenesis and epithelia cell proliferation. In addition, pathway analysis highlights the prominent role of MS genes in activation of platelets and coagulation system in tumor metastatic cascade. Moreover, global mutation pattern of MS genes across multiple cancers may reveal common cancer metastasis mechanisms. All these results illustrate the importance of MSGene to our understanding on cell development and cancer metastasis. PMID:26486520
Exome-wide association analysis reveals novel coding sequence variants associated with lipid traits in Chinese.

PubMed

Tang, Clara S; Zhang, He; Cheung, Chloe Y Y; Xu, Ming; Ho, Jenny C Y; Zhou, Wei; Cherny, Stacey S; Zhang, Yan; Holmen, Oddgeir; Au, Ka-Wing; Yu, Haiyi; Xu, Lin; Jia, Jia; Porsch, Robert M; Sun, Lijie; Xu, Weixian; Zheng, Huiping; Wong, Lai-Yung; Mu, Yiming; Dou, Jingtao; Fong, Carol H Y; Wang, Shuyu; Hong, Xueyu; Dong, Liguang; Liao, Yanhua; Wang, Jiansong; Lam, Levina S M; Su, Xi; Yan, Hua; Yang, Min-Lee; Chen, Jin; Siu, Chung-Wah; Xie, Gaoqiang; Woo, Yu-Cho; Wu, Yangfeng; Tan, Kathryn C B; Hveem, Kristian; Cheung, Bernard M Y; Zöllner, Sebastian; Xu, Aimin; Eugene Chen, Y; Jiang, Chao Qiang; Zhang, Youyi; Lam, Tai-Hing; Ganesh, Santhi K; Huo, Yong; Sham, Pak C; Lam, Karen S L; Willer, Cristen J; Tse, Hung-Fat; Gao, Wei

2015-12-22

Blood lipids are important risk factors for coronary artery disease (CAD). Here we perform an exome-wide association study by genotyping 12,685 Chinese, using a custom Illumina HumanExome BeadChip, to identify additional loci influencing lipid levels. Single-variant association analysis on 65,671 single nucleotide polymorphisms reveals 19 loci associated with lipids at exome-wide significance (P<2.69 × 10(-7)), including three Asian-specific coding variants in known genes (CETP p.Asp459Gly, PCSK9 p.Arg93Cys and LDLR p.Arg257Trp). Furthermore, missense variants at two novel loci-PNPLA3 p.Ile148Met and PKD1L3 p.Thr429Ser-also influence levels of triglycerides and low-density lipoprotein cholesterol, respectively. Another novel gene, TEAD2, is found to be associated with high-density lipoprotein cholesterol through gene-based association analysis. Most of these newly identified coding variants show suggestive association (P<0.05) with CAD. These findings demonstrate that exome-wide genotyping on samples of non-European ancestry can identify additional population-specific possible causal variants, shedding light on novel lipid biology and CAD.

A novel dominant GJB2 (DFNA3) mutation in a Chinese family

NASA Astrophysics Data System (ADS)

Wang, Hongyang; Wu, Kaiwen; Yu, Lan; Xie, Linyi; Xiong, Wenping; Wang, Dayong; Guan, Jing; Wang, Qiuju

2017-01-01

To decipher the phenotype and genotype of a Chinese family with autosomal dominant non-syndromic hearing loss (ADNSHL) and a novel dominant missense mutation in the GJB2 gene (DFNA3), mutation screening of GJB2 was performed on the propositus from a five-generation ADNSHL family through polymerase chain reaction amplification and Sanger sequencing. The candidate variation and the co-segregation of the phenotype were verified in all ascertained family members. Targeted genes capture and next-generation sequencing (NGS) were performed to explore additional genetic variations. We identified the novel GJB2 mutation c.524C > A (p.P175H), which segregated with high frequency and was involved in progressive sensorineural hearing loss. One subject with an additional c.235delC mutation showed a more severe phenotype than did the other members with single GJB2 dominant variations. Four patients diagnosed with noise-induced hearing loss did not carry this mutation. No other pathogenic variations or modifier genes were identified by NGS. In conclusion, a novel missense mutation in GJB2 (DFNA3), affecting the second extracellular domain of the protein, was identified in a family with ADNSHL.
Harnessing pain heterogeneity and RNA transcriptome to identify blood–based pain biomarkers: a novel correlational study design and bioinformatics approach in a graded chronic constriction injury model

PubMed Central

Grace, Peter M.; Hurley, Daniel; Barratt, Daniel T.; Tsykin, Anna; Watkins, Linda R.; Rolan, Paul E.; Hutchinson, Mark R.

2017-01-01

A quantitative, peripherally accessible biomarker for neuropathic pain has great potential to improve clinical outcomes. Based on the premise that peripheral and central immunity contribute to neuropathic pain mechanisms, we hypothesized that biomarkers could be identified from the whole blood of adult male rats, by integrating graded chronic constriction injury (CCI), ipsilateral lumbar dorsal quadrant (iLDQ) and whole blood transcriptomes, and pathway analysis with pain behavior. Correlational bioinformatics identified a range of putative biomarker genes for allodynia intensity, many encoding for proteins with a recognized role in immune/nociceptive mechanisms. A selection of these genes was validated in a separate replication study. Pathway analysis of the iLDQ transcriptome identified Fcγ and Fcε signaling pathways, among others. This study is the first to employ the whole blood transcriptome to identify pain biomarker panels. The novel correlational bioinformatics, developed here, selected such putative biomarkers based on a correlation with pain behavior and formation of signaling pathways with iLDQ genes. Future studies may demonstrate the predictive ability of these biomarker genes across other models and additional variables. PMID:22697386
Genome-wide association study for Crohn's disease in the Quebec Founder Population identifies multiple validated disease loci.

PubMed

Raelson, John V; Little, Randall D; Ruether, Andreas; Fournier, Hélène; Paquin, Bruno; Van Eerdewegh, Paul; Bradley, W E C; Croteau, Pascal; Nguyen-Huu, Quynh; Segal, Jonathan; Debrus, Sophie; Allard, René; Rosenstiel, Philip; Franke, Andre; Jacobs, Gunnar; Nikolaus, Susanna; Vidal, Jean-Michel; Szego, Peter; Laplante, Nathalie; Clark, Hilary F; Paulussen, René J; Hooper, John W; Keith, Tim P; Belouchi, Abdelmajid; Schreiber, Stefan

2007-09-11

Genome-wide association (GWA) studies offer a powerful unbiased method for the identification of multiple susceptibility genes for complex diseases. Here we report the results of a GWA study for Crohn's disease (CD) using family trios from the Quebec Founder Population (QFP). Haplotype-based association analyses identified multiple regions associated with the disease that met the criteria for genome-wide significance, with many containing a gene whose function appears relevant to CD. A proportion of these were replicated in two independent German Caucasian samples, including the established CD loci NOD2 and IBD5. The recently described IL23R locus was also identified and replicated. For this region, multiple individuals with all major haplotypes in the QFP were sequenced and extensive fine mapping performed to identify risk and protective alleles. Several additional loci, including a region on 3p21 containing several plausible candidate genes, a region near JAKMIP1 on 4p16.1, and two larger regions on chromosome 17 were replicated. Together with previously published loci, the spectrum of CD genes identified to date involves biochemical networks that affect epithelial defense mechanisms, innate and adaptive immune response, and the repair or remodeling of tissue.
Integrated multi-cohort transcriptional meta-analysis of neurodegenerative diseases.

PubMed

Li, Matthew D; Burns, Terry C; Morgan, Alexander A; Khatri, Purvesh

2014-09-04

Neurodegenerative diseases share common pathologic features including neuroinflammation, mitochondrial dysfunction and protein aggregation, suggesting common underlying mechanisms of neurodegeneration. We undertook a meta-analysis of public gene expression data for neurodegenerative diseases to identify a common transcriptional signature of neurodegeneration. Using 1,270 post-mortem central nervous system tissue samples from 13 patient cohorts covering four neurodegenerative diseases, we identified 243 differentially expressed genes, which were similarly dysregulated in 15 additional patient cohorts of 205 samples including seven neurodegenerative diseases. This gene signature correlated with histologic disease severity. Metallothioneins featured prominently among differentially expressed genes, and functional pathway analysis identified specific convergent themes of dysregulation. MetaCore network analyses revealed various novel candidate hub genes (e.g. STAU2). Genes associated with M1-polarized macrophages and reactive astrocytes were strongly enriched in the meta-analysis data. Evaluation of genes enriched in neurons revealed 70 down-regulated genes, over half not previously associated with neurodegeneration. Comparison with aging brain data (3 patient cohorts, 221 samples) revealed 53 of these to be unique to neurodegenerative disease, many of which are strong candidates to be important in neuropathogenesis (e.g. NDN, NAP1L2). ENCODE ChIP-seq analysis predicted common upstream transcriptional regulators not associated with normal aging (REST, RBBP5, SIN3A, SP2, YY1, ZNF143, IKZF1). Finally, we removed genes common to neurodegeneration from disease-specific gene signatures, revealing uniquely robust immune response and JAK-STAT signaling in amyotrophic lateral sclerosis. Our results implicate pervasive bioenergetic deficits, M1-type microglial activation and gliosis as unifying themes of neurodegeneration, and identify numerous novel genes associated with neurodegenerative processes.
A Multiomics Approach to Identify Genes Associated with Childhood Asthma Risk and Morbidity.

PubMed

Forno, Erick; Wang, Ting; Yan, Qi; Brehm, John; Acosta-Perez, Edna; Colon-Semidey, Angel; Alvarez, Maria; Boutaoui, Nadia; Cloutier, Michelle M; Alcorn, John F; Canino, Glorisa; Chen, Wei; Celedón, Juan C

2017-10-01

Childhood asthma is a complex disease. In this study, we aim to identify genes associated with childhood asthma through a multiomics "vertical" approach that integrates multiple analytical steps using linear and logistic regression models. In a case-control study of childhood asthma in Puerto Ricans (n = 1,127), we used adjusted linear or logistic regression models to evaluate associations between several analytical steps of omics data, including genome-wide (GW) genotype data, GW methylation, GW expression profiling, cytokine levels, asthma-intermediate phenotypes, and asthma status. At each point, only the top genes/single-nucleotide polymorphisms/probes/cytokines were carried forward for subsequent analysis. In step 1, asthma modified the gene expression-protein level association for 1,645 genes; pathway analysis showed an enrichment of these genes in the cytokine signaling system (n = 269 genes). In steps 2-3, expression levels of 40 genes were associated with intermediate phenotypes (asthma onset age, forced expiratory volume in 1 second, exacerbations, eosinophil counts, and skin test reactivity); of those, methylation of seven genes was also associated with asthma. Of these seven candidate genes, IL5RA was also significant in analytical steps 4-8. We then measured plasma IL-5 receptor α levels, which were associated with asthma age of onset and moderate-severe exacerbations. In addition, in silico database analysis showed that several of our identified IL5RA single-nucleotide polymorphisms are associated with transcription factors related to asthma and atopy. This approach integrates several analytical steps and is able to identify biologically relevant asthma-related genes, such as IL5RA. It differs from other methods that rely on complex statistical models with various assumptions.
Gene selection and cancer type classification of diffuse large-B-cell lymphoma using a bivariate mixture model for two-species data.

PubMed

Su, Yuhua; Nielsen, Dahlia; Zhu, Lei; Richards, Kristy; Suter, Steven; Breen, Matthew; Motsinger-Reif, Alison; Osborne, Jason

2013-01-05

: A bivariate mixture model utilizing information across two species was proposed to solve the fundamental problem of identifying differentially expressed genes in microarray experiments. The model utility was illustrated using a dog and human lymphoma data set prepared by a group of scientists in the College of Veterinary Medicine at North Carolina State University. A small number of genes were identified as being differentially expressed in both species and the human genes in this cluster serve as a good predictor for classifying diffuse large-B-cell lymphoma (DLBCL) patients into two subgroups, the germinal center B-cell-like diffuse large B-cell lymphoma and the activated B-cell-like diffuse large B-cell lymphoma. The number of human genes that were observed to be significantly differentially expressed (21) from the two-species analysis was very small compared to the number of human genes (190) identified with only one-species analysis (human data). The genes may be clinically relevant/important, as this small set achieved low misclassification rates of DLBCL subtypes. Additionally, the two subgroups defined by this cluster of human genes had significantly different survival functions, indicating that the stratification based on gene-expression profiling using the proposed mixture model provided improved insight into the clinical differences between the two cancer subtypes.
Gene expression atlas of pigeonpea and its application to gain insights into genes associated with pollen fertility implicated in seed formation

PubMed Central

Pazhamala, Lekha T.; Purohit, Shilp; Saxena, Rachit K.; Garg, Vanika; Krishnamurthy, L.; Verdier, Jerome

2017-01-01

Abstract Pigeonpea (Cajanus cajan) is an important grain legume of the semi-arid tropics, mainly used for its protein rich seeds. To link the genome sequence information with agronomic traits resulting from specific developmental processes, a Cajanus cajan gene expression atlas (CcGEA) was developed using the Asha genotype. Thirty tissues/organs representing developmental stages from germination to senescence were used to generate 590.84 million paired-end RNA-Seq data. The CcGEA revealed a compendium of 28 793 genes with differential, specific, spatio-temporal and constitutive expression during various stages of development in different tissues. As an example to demonstrate the application of the CcGEA, a network of 28 flower-related genes analysed for cis-regulatory elements and splicing variants has been identified. In addition, expression analysis of these candidate genes in male sterile and male fertile genotypes suggested their critical role in normal pollen development leading to seed formation. Gene network analysis also identified two regulatory genes, a pollen-specific SF3 and a sucrose–proton symporter, that could have implications for improvement of agronomic traits such as seed production and yield. In conclusion, the CcGEA provides a valuable resource for pigeonpea to identify candidate genes involved in specific developmental processes and to understand the well-orchestrated growth and developmental process in this resilient crop. PMID:28338822
Integrated network analysis identifies fight-club nodes as a class of hubs encompassing key putative switch genes that induce major transcriptome reprogramming during grapevine development.

PubMed

Palumbo, Maria Concetta; Zenoni, Sara; Fasoli, Marianna; Massonnet, Mélanie; Farina, Lorenzo; Castiglione, Filippo; Pezzotti, Mario; Paci, Paola

2014-12-01

We developed an approach that integrates different network-based methods to analyze the correlation network arising from large-scale gene expression data. By studying grapevine (Vitis vinifera) and tomato (Solanum lycopersicum) gene expression atlases and a grapevine berry transcriptomic data set during the transition from immature to mature growth, we identified a category named "fight-club hubs" characterized by a marked negative correlation with the expression profiles of neighboring genes in the network. A special subset named "switch genes" was identified, with the additional property of many significant negative correlations outside their own group in the network. Switch genes are involved in multiple processes and include transcription factors that may be considered master regulators of the previously reported transcriptome remodeling that marks the developmental shift from immature to mature growth. All switch genes, expressed at low levels in vegetative/green tissues, showed a significant increase in mature/woody organs, suggesting a potential regulatory role during the developmental transition. Finally, our analysis of tomato gene expression data sets showed that wild-type switch genes are downregulated in ripening-deficient mutants. The identification of known master regulators of tomato fruit maturation suggests our method is suitable for the detection of key regulators of organ development in different fleshy fruit crops. © 2014 American Society of Plant Biologists. All rights reserved.
Sparse Additive Ordinary Differential Equations for Dynamic Gene Regulatory Network Modeling.

PubMed

Wu, Hulin; Lu, Tao; Xue, Hongqi; Liang, Hua

2014-04-02

The gene regulation network (GRN) is a high-dimensional complex system, which can be represented by various mathematical or statistical models. The ordinary differential equation (ODE) model is one of the popular dynamic GRN models. High-dimensional linear ODE models have been proposed to identify GRNs, but with a limitation of the linear regulation effect assumption. In this article, we propose a sparse additive ODE (SA-ODE) model, coupled with ODE estimation methods and adaptive group LASSO techniques, to model dynamic GRNs that could flexibly deal with nonlinear regulation effects. The asymptotic properties of the proposed method are established and simulation studies are performed to validate the proposed approach. An application example for identifying the nonlinear dynamic GRN of T-cell activation is used to illustrate the usefulness of the proposed method.
Identification and characterisation of the BPI/LBP/PLUNC-like gene repertoire in chickens reveals the absence of a LBP gene☆

PubMed Central

Chiang, Shih-Chieh; Veldhuizen, Edwin J.A.; Barnes, Frances A.; Craven, C. Jeremy; Haagsman, Henk P.; Bingle, Colin D.

2011-01-01

Palate, lung and nasal epithelial clone (PLUNC) proteins are structural homologues to the innate defence molecules LPS-binding protein (LBP) and bactericidal/permeability-increasing protein (BPI). PLUNCs make up the largest portion of the wider BPI/LBP/PLUNC-like protein family and are amongst the most rapidly evolving mammalian genes. In this study we systematically identified and characterised BPI/LBP/PLUNC-like protein-encoding genes in the chicken genome. We identified eleven complete genes (and a pseudogene). Five of them are clustered on a >50 kb locus on chromosome 20, immediately adjacent to BPI. In addition to BPI, we have identified presumptive orthologues LPLUNCs 2, 3, 4 and 6, and BPIL-2. We find no evidence for the existence of single domain containing proteins in birds. Strikingly our analysis also suggests that there is no LBP orthologue in chicken. This observation may in part account for the relative resistance to LPS toxicity observed in birds. Our results indicate significant differences between the avian and mammalian repertoires of BPI/LBP/PLUNC-like genes at the genomic and transcriptional levels and provide a framework for further functional analyses of this gene family in chickens. PMID:20959152
Ribosomal DNA stability is supported by many 'buffer genes'-introduction to the Yeast rDNA Stability Database.

PubMed

Kobayashi, Takehiko; Sasaki, Mariko

2017-01-01

The ribosomal RNA gene (rDNA) is the most abundant gene in yeast and other eukaryotic organisms. Due to its heavy transcription, repetitive structure and programmed replication fork pauses, the rDNA is one of the most unstable regions in the genome. Thus, the rDNA is the best region to study the mechanisms responsible for maintaining genome integrity. Recently, we screened a library of ∼4800 budding yeast gene knockout strains to identify mutants defective in the maintenance of rDNA stability. The results of this screen are summarized in the Yeast rDNA Stability (YRS) Database, in which the stability and copy number of rDNA in each mutant are presented. From this screen, we identified ∼700 genes that may contribute to the maintenance of rDNA stability. In addition, ∼50 mutants had abnormally high or low rDNA copy numbers. Moreover, some mutants with unstable rDNA displayed abnormalities in another chromosome. In this review, we introduce the YRS Database and discuss the roles of newly identified genes that contribute to rDNA maintenance and genome integrity. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Gene expression in WAT from healthy humans and monkeys correlates with FGF21-induced browning of WAT in mice.

PubMed

Schlessinger, Karni; Li, Wenyu; Tan, Yejun; Liu, Franklin; Souza, Sandra C; Tozzo, Effie; Liu, Kevin; Thompson, John R; Wang, Liangsu; Muise, Eric S

2015-09-01

Identify a gene expression signature in white adipose tissue (WAT) that reports on WAT browning and is associated with a healthy phenotype. RNA from several different adipose depots across three species were analyzed by whole transcriptome profiling, including 1) mouse subcutaneous white fat, brown fat, and white fat after in vivo treatment with FGF21; 2) human subcutaneous and omental fat from insulin-sensitive and insulin-resistant patients; and 3) rhesus monkey subcutaneous fat from healthy and dysmetabolic individuals. A "browning" signature in mice was identified by cross-referencing the FGF21-induced signature in WAT with the brown adipose tissue (BAT) vs. WAT comparison. In addition, gene expression levels in WAT from insulin-sensitive/healthy vs. insulin-resistant/dysmetabolic humans and rhesus monkeys, respectively, correlated with the gene expression levels in mouse BAT vs. WAT. A subset of 49 genes were identified that were consistently regulated or differentially expressed in the mouse and human data sets that could be used to monitor browning of WAT across species. Gene expression profiles of WATs from healthy insulin-sensitive individuals correlate with those of BAT and FGF21-induced browning of WAT. © 2015 The Obesity Society.
Personalized genomic analyses for cancer mutation discovery and interpretation

PubMed Central

Jones, Siân; Anagnostou, Valsamo; Lytle, Karli; Parpart-Li, Sonya; Nesselbush, Monica; Riley, David R.; Shukla, Manish; Chesnick, Bryan; Kadan, Maura; Papp, Eniko; Galens, Kevin G.; Murphy, Derek; Zhang, Theresa; Kann, Lisa; Sausen, Mark; Angiuoli, Samuel V.; Diaz, Luis A.; Velculescu, Victor E.

2015-01-01

Massively parallel sequencing approaches are beginning to be used clinically to characterize individual patient tumors and to select therapies based on the identified mutations. A major question in these analyses is the extent to which these methods identify clinically actionable alterations and whether the examination of the tumor tissue alone is sufficient or whether matched normal DNA should also be analyzed to accurately identify tumor-specific (somatic) alterations. To address these issues, we comprehensively evaluated 815 tumor-normal paired samples from patients of 15 tumor types. We identified genomic alterations using next-generation sequencing of whole exomes or 111 targeted genes that were validated with sensitivities >95% and >99%, respectively, and specificities >99.99%. These analyses revealed an average of 140 and 4.3 somatic mutations per exome and targeted analysis, respectively. More than 75% of cases had somatic alterations in genes associated with known therapies or current clinical trials. Analyses of matched normal DNA identified germline alterations in cancer-predisposing genes in 3% of patients with apparently sporadic cancers. In contrast, a tumor-only sequencing approach could not definitively identify germline changes in cancer-predisposing genes and led to additional false-positive findings comprising 31% and 65% of alterations identified in targeted and exome analyses, respectively, including in potentially actionable genes. These data suggest that matched tumor-normal sequencing analyses are essential for precise identification and interpretation of somatic and germline alterations and have important implications for the diagnostic and therapeutic management of cancer patients. PMID:25877891
Genomic features separating ten strains of Neorhizobium galegae with different symbiotic phenotypes.

PubMed

Österman, Janina; Mousavi, Seyed Abdollah; Koskinen, Patrik; Paulin, Lars; Lindström, Kristina

2015-05-02

The symbiotic phenotype of Neorhizobium galegae, with strains specifically fixing nitrogen with either Galega orientalis or G. officinalis, has made it a target in research on determinants of host specificity in nitrogen fixation. The genomic differences between representative strains of the two symbiovars are, however, relatively small. This introduced a need for a dataset representing a larger bacterial population in order to make better conclusions on characteristics typical for a subset of the species. In this study, we produced draft genomes of eight strains of N. galegae having different symbiotic phenotypes, both with regard to host specificity and nitrogen fixation efficiency. These genomes were analysed together with the previously published complete genomes of N. galegae strains HAMBI 540T and HAMBI 1141. The results showed that the presence of an additional rpoN sigma factor gene in the symbiosis gene region is a characteristic specific to symbiovar orientalis, required for nitrogen fixation. Also the nifQ gene was shown to be crucial for functional symbiosis in both symbiovars. Genome-wide analyses identified additional genes characteristic of strains of the same symbiovar and of strains having similar plant growth promoting properties on Galega orientalis. Many of these genes are involved in transcriptional regulation or in metabolic functions. The results of this study confirm that the only symbiosis-related gene that is present in one symbiovar of N. galegae but not in the other is an rpoN gene. The specific function of this gene remains to be determined, however. New genes that were identified as specific for strains of one symbiovar may be involved in determining host specificity, while others are defined as potential determinant genes for differences in efficiency of nitrogen fixation.
Genetics of Sputum Gene Expression in Chronic Obstructive Pulmonary Disease

PubMed Central

Qiu, Weiliang; Cho, Michael H.; Riley, John H.; Anderson, Wayne H.; Singh, Dave; Bakke, Per; Gulsvik, Amund; Litonjua, Augusto A.; Lomas, David A.; Crapo, James D.; Beaty, Terri H.; Celli, Bartolome R.; Rennard, Stephen; Tal-Singer, Ruth; Fox, Steven M.; Silverman, Edwin K.; Hersh, Craig P.

2011-01-01

Previous expression quantitative trait loci (eQTL) studies have performed genetic association studies for gene expression, but most of these studies examined lymphoblastoid cell lines from non-diseased individuals. We examined the genetics of gene expression in a relevant disease tissue from chronic obstructive pulmonary disease (COPD) patients to identify functional effects of known susceptibility genes and to find novel disease genes. By combining gene expression profiling on induced sputum samples from 131 COPD cases from the ECLIPSE Study with genomewide single nucleotide polymorphism (SNP) data, we found 4315 significant cis-eQTL SNP-probe set associations (3309 unique SNPs). The 3309 SNPs were tested for association with COPD in a genomewide association study (GWAS) dataset, which included 2940 COPD cases and 1380 controls. Adjusting for 3309 tests (p<1.5e-5), the two SNPs which were significantly associated with COPD were located in two separate genes in a known COPD locus on chromosome 15: CHRNA5 and IREB2. Detailed analysis of chromosome 15 demonstrated additional eQTLs for IREB2 mapping to that gene. eQTL SNPs for CHRNA5 mapped to multiple linkage disequilibrium (LD) bins. The eQTLs for IREB2 and CHRNA5 were not in LD. Seventy-four additional eQTL SNPs were associated with COPD at p<0.01. These were genotyped in two COPD populations, finding replicated associations with a SNP in PSORS1C1, in the HLA-C region on chromosome 6. Integrative analysis of GWAS and gene expression data from relevant tissue from diseased subjects has located potential functional variants in two known COPD genes and has identified a novel COPD susceptibility locus. PMID:21949713
Efficiently Identifying Significant Associations in Genome-wide Association Studies

PubMed Central

Eskin, Eleazar

2013-01-01

Abstract Over the past several years, genome-wide association studies (GWAS) have implicated hundreds of genes in common disease. More recently, the GWAS approach has been utilized to identify regions of the genome that harbor variation affecting gene expression or expression quantitative trait loci (eQTLs). Unlike GWAS applied to clinical traits, where only a handful of phenotypes are analyzed per study, in eQTL studies, tens of thousands of gene expression levels are measured, and the GWAS approach is applied to each gene expression level. This leads to computing billions of statistical tests and requires substantial computational resources, particularly when applying novel statistical methods such as mixed models. We introduce a novel two-stage testing procedure that identifies all of the significant associations more efficiently than testing all the single nucleotide polymorphisms (SNPs). In the first stage, a small number of informative SNPs, or proxies, across the genome are tested. Based on their observed associations, our approach locates the regions that may contain significant SNPs and only tests additional SNPs from those regions. We show through simulations and analysis of real GWAS datasets that the proposed two-stage procedure increases the computational speed by a factor of 10. Additionally, efficient implementation of our software increases the computational speed relative to the state-of-the-art testing approaches by a factor of 75. PMID:24033261
Identification and expression analyses of a novel serotonin receptor gene, 5-HT2β, in the field cricket, Gryllus bimaculatus.

PubMed

Watanabe, T; Aonuma, H

2012-01-01

Biogenic amine serotonin (5-HT) modulates various aspects of behaviors such as aggressive behavior and circadian behavior in the cricket. In our previous report, in order to elucidate the molecular basis of the cricket 5-HT system, we identified three genes involved in 5-HT biosynthesis, as well as four 5-HT receptor genes (5-HT1A, 5-HT1B, 5-HT2α, and 5-HT7) expressed in the brain of the field cricket Gryllus bimaculatus DeGeer [7]. In the present study, we identified Gryllus 5-HT2β gene, an additional 5-HT receptor gene expressed in the cricket brain, and examined its tissue-specific distribution and embryonic stage-dependent expression. Gryllus 5-HT2β gene was ubiquitously expressed in the all examined adult tissues, and was expressed during early embryonic development, as well as during later stages. This study suggests functional differences between two 5-HT2 receptors in the cricket.
Gene Signature in Sessile Serrated Polyps Identifies Colon Cancer Subtype

PubMed Central

Kanth, Priyanka; Bronner, Mary P.; Boucher, Kenneth M.; Burt, Randall W.; Neklason, Deborah W.; Hagedorn, Curt H.; Delker, Don A.

2016-01-01

Sessile serrated colon adenoma/polyps (SSA/Ps) are found during routine screening colonoscopy and may account for 20–30% of colon cancers. However, differentiating SSA/Ps from hyperplastic polyps (HP) with little risk of cancer is challenging and complementary molecular markers are needed. Additionally, the molecular mechanisms of colon cancer development from SSA/Ps are poorly understood. RNA sequencing was performed on 21 SSA/Ps, 10 HPs, 10 adenomas, 21 uninvolved colon and 20 control colon specimens. Differential expression and leave-one-out cross validation methods were used to define a unique gene signature of SSA/Ps. Our SSA/P gene signature was evaluated in colon cancer RNA-Seq data from The Cancer Genome Atlas (TCGA) to identify a subtype of colon cancers that may develop from SSA/Ps. A total of 1422 differentially expressed genes were found in SSA/Ps relative to controls. Serrated polyposis syndrome (n=12) and sporadic SSA/Ps (n=9) exhibited almost complete (96%) gene overlap. A 51-gene panel in SSA/P showed similar expression in a subset of TCGA colon cancers with high microsatellite instability (MSI-H). A smaller seven-gene panel showed high sensitivity and specificity in identifying BRAF mutant, CpG island methylator phenotype high (CIMP-H) and MLH1 silenced colon cancers. We describe a unique gene signature in SSA/Ps that identifies a subset of colon cancers likely to develop through the serrated pathway. These gene panels may be utilized for improved differentiation of SSA/Ps from HPs and provide insights into novel molecular pathways altered in colon cancer arising from the serrated pathway. PMID:27026680
Gene expression profiles in rainbow trout, Onchorynchus mykiss, exposed to a simple chemical mixture.

PubMed

Hook, Sharon E; Skillman, Ann D; Gopalan, Banu; Small, Jack A; Schultz, Irvin R

2008-03-01

Among proposed uses for microarrays in environmental toxiciology is the identification of key contributors to toxicity within a mixture. However, it remains uncertain whether the transcriptomic profiles resulting from exposure to a mixture have patterns of altered gene expression that contain identifiable contributions from each toxicant component. We exposed isogenic rainbow trout Onchorynchus mykiss, to sublethal levels of ethynylestradiol, 2,2,4,4-tetrabromodiphenyl ether, and chromium VI or to a mixture of all three toxicants Fluorescently labeled complementary DNA (cDNA) were generated and hybridized against a commercially available Salmonid array spotted with 16,000 cDNAs. Data were analyzed using analysis of variance (p<0.05) with a Benjamani-Hochberg multiple test correction (Genespring [Agilent] software package) to identify up and downregulated genes. Gene clustering patterns that can be used as "expression signatures" were determined using hierarchical cluster analysis. The gene ontology terms associated with significantly altered genes were also used to identify functional groups that were associated with toxicant exposure. Cross-ontological analytics approach was used to assign functional annotations to genes with "unknown" function. Our analysis indicates that transcriptomic profiles resulting from the mixture exposure resemble those of the individual contaminant exposures, but are not a simple additive list. However, patterns of altered genes representative of each component of the mixture are clearly discernible, and the functional classes of genes altered represent the individual components of the mixture. These findings indicate that the use of microarrays to identify transcriptomic profiles may aid in the identification of key stressors within a chemical mixture, ultimately improving environmental assessment.
Wheat differential gene expression induced by different races of Puccinia triticina.

PubMed

Neugebauer, Kerri A; Bruce, Myron; Todd, Tim; Trick, Harold N; Fellers, John P

2018-01-01

Puccinia triticina, the causal agent of wheat leaf rust, causes significant losses in wheat yield and quality each year worldwide. During leaf rust infection, the host plant recognizes numerous molecules, some of which trigger host defenses. Although P. triticina reproduces clonally, there is still variation within the population due to a high mutation frequency, host specificity, and environmental adaptation. This study explores how wheat responds on a gene expression level to different P. triticina races. Six P. triticina races were inoculated onto a susceptible wheat variety and samples were taken at six days post inoculation, just prior to pustule eruption. RNA sequence data identified 63 wheat genes differentially expressed between the six races. A time course, conducted over the first seven days post inoculation, was used to examine the expression pattern of 63 genes during infection. Forty-seven wheat genes were verified to have differential expression. Three common expression patterns were identified. In addition, two genes were associated with race specific gene expression. Differential expression of an ER molecular chaperone gene was associated with races from two different P. triticina lineages. Also, differential expression in an alanine glyoxylate aminotransferase gene was associated with races with virulence shifts for leaf rust resistance genes.

Stable Reference Gene Selection for RT-qPCR Analysis in Nonviruliferous and Viruliferous Frankliniella occidentalis.

PubMed

Yang, Chunxiao; Li, Hui; Pan, Huipeng; Ma, Yabin; Zhang, Deyong; Liu, Yong; Zhang, Zhanhong; Zheng, Changying; Chu, Dong

2015-01-01

Reverse transcriptase-quantitative polymerase chain reaction (RT-qPCR) is a reliable technique for measuring and evaluating gene expression during variable biological processes. To facilitate gene expression studies, normalization of genes of interest relative to stable reference genes is crucial. The western flower thrips Frankliniella occidentalis (Pergande) (Thysanoptera: Thripidae), the main vector of tomato spotted wilt virus (TSWV), is a destructive invasive species. In this study, the expression profiles of 11 candidate reference genes from nonviruliferous and viruliferous F. occidentalis were investigated. Five distinct algorithms, geNorm, NormFinder, BestKeeper, the ΔCt method, and RefFinder, were used to determine the performance of these genes. geNorm, NormFinder, BestKeeper, and RefFinder identified heat shock protein 70 (HSP70), heat shock protein 60 (HSP60), elongation factor 1 α, and ribosomal protein l32 (RPL32) as the most stable reference genes, and the ΔCt method identified HSP60, HSP70, RPL32, and heat shock protein 90 as the most stable reference genes. Additionally, two reference genes were sufficient for reliable normalization in nonviruliferous and viruliferous F. occidentalis. This work provides a foundation for investigating the molecular mechanisms of TSWV and F. occidentalis interactions.
DNA sequence analysis of the photosynthesis region of Rhodobacter sphaeroides 2.4.1.

PubMed

Choudhary, M; Kaplan, S

2000-02-15

This paper describes the DNA sequence of the photosynthesis region of Rhodobacter sphaeroides 2.4.1 (T). The photosynthesis gene cluster is located within a approximately 73 kb Ase I genomic DNA fragment containing the puf, puhA, cycA and puc operons. A total of 65 open reading frames (ORFs) have been identified, of which 61 showed significant similarity to genes/proteins of other organisms while only four did not reveal any significant sequence similarity to any gene/protein sequences in the database. The data were compared with the corresponding genes/ORFs from a different strain of R.sphaeroides and Rhodobacter capsulatus, a close relative of R. sphaeroides. A detailed analysis of the gene organization in the photosynthesis region revealed a similar gene order in both species with some notable differences located to the pucBAC = cycA region. In addition, photosynthesis gene regulatory protein (PpsR, FNR, IHF) binding motifs in upstream sequences of a number of photosynthesis genes have been identified and shown to differ between these two species. The difference in gene organization relative to pucBAC and cycA suggests that this region originated independently of the photosynthesis gene cluster of R.sphaeroides.
Genome-Wide Gene Expression in relation to Age in Large Laboratory Cohorts of Drosophila melanogaster

PubMed Central

Carlson, Kimberly A.; Gardner, Kylee; Pashaj, Anjeza; Carlson, Darby J.; Yu, Fang; Eudy, James D.; Zhang, Chi; Harshman, Lawrence G.

2015-01-01

Aging is a complex process characterized by a steady decline in an organism's ability to perform life-sustaining tasks. In the present study, two cages of approximately 12,000 mated Drosophila melanogaster females were used as a source of RNA from individuals sampled frequently as a function of age. A linear model for microarray data method was used for the microarray analysis to adjust for the box effect; it identified 1,581 candidate aging genes. Cluster analyses using a self-organizing map algorithm on the 1,581 significant genes identified gene expression patterns across different ages. Genes involved in immune system function and regulation, chorion assembly and function, and metabolism were all significantly differentially expressed as a function of age. The temporal pattern of data indicated that gene expression related to aging is affected relatively early in life span. In addition, the temporal variance in gene expression in immune function genes was compared to a random set of genes. There was an increase in the variance of gene expression within each cohort, which was not observed in the set of random genes. This observation is compatible with the hypothesis that D. melanogaster immune function genes lose control of gene expression as flies age. PMID:26090231
Genes uniquely expressed in human growth plate chondrocytes uncover a distinct regulatory network.

PubMed

Li, Bing; Balasubramanian, Karthika; Krakow, Deborah; Cohn, Daniel H

2017-12-20

Chondrogenesis is the earliest stage of skeletal development and is a highly dynamic process, integrating the activities and functions of transcription factors, cell signaling molecules and extracellular matrix proteins. The molecular mechanisms underlying chondrogenesis have been extensively studied and multiple key regulators of this process have been identified. However, a genome-wide overview of the gene regulatory network in chondrogenesis has not been achieved. In this study, employing RNA sequencing, we identified 332 protein coding genes and 34 long non-coding RNA (lncRNA) genes that are highly selectively expressed in human fetal growth plate chondrocytes. Among the protein coding genes, 32 genes were associated with 62 distinct human skeletal disorders and 153 genes were associated with skeletal defects in knockout mice, confirming their essential roles in skeletal formation. These gene products formed a comprehensive physical interaction network and participated in multiple cellular processes regulating skeletal development. The data also revealed 34 transcription factors and 11,334 distal enhancers that were uniquely active in chondrocytes, functioning as transcriptional regulators for the cartilage-selective genes. Our findings revealed a complex gene regulatory network controlling skeletal development whereby transcription factors, enhancers and lncRNAs participate in chondrogenesis by transcriptional regulation of key genes. Additionally, the cartilage-selective genes represent candidate genes for unsolved human skeletal disorders.
Identification of Linkages between EDCs in Personal Care Products and Breast Cancer through Data Integration Combined with Gene Network Analysis.

PubMed

Jeong, Hyeri; Kim, Jongwoon; Kim, Youngjun

2017-09-30

Approximately 1000 chemicals have been reported to possibly have endocrine disrupting effects, some of which are used in consumer products, such as personal care products (PCPs) and cosmetics. We conducted data integration combined with gene network analysis to: (i) identify causal molecular mechanisms between endocrine disrupting chemicals (EDCs) used in PCPs and breast cancer; and (ii) screen candidate EDCs associated with breast cancer. Among EDCs used in PCPs, four EDCs having correlation with breast cancer were selected, and we curated 27 common interacting genes between those EDCs and breast cancer to perform the gene network analysis. Based on the gene network analysis, ESR1, TP53, NCOA1, AKT1, and BCL6 were found to be key genes to demonstrate the molecular mechanisms of EDCs in the development of breast cancer. Using GeneMANIA, we additionally predicted 20 genes which could interact with the 27 common genes. In total, 47 genes combining the common and predicted genes were functionally grouped with the gene ontology and KEGG pathway terms. With those genes, we finally screened candidate EDCs for their potential to increase breast cancer risk. This study highlights that our approach can provide insights to understand mechanisms of breast cancer and identify potential EDCs which are in association with breast cancer.
Cross-species multiple environmental stress responses: An integrated approach to identify candidate genes for multiple stress tolerance in sorghum (Sorghum bicolor (L.) Moench) and related model species

PubMed Central

Modise, David M.; Gemeildien, Junaid; Ndimba, Bongani K.; Christoffels, Alan

2018-01-01

Background Crop response to the changing climate and unpredictable effects of global warming with adverse conditions such as drought stress has brought concerns about food security to the fore; crop yield loss is a major cause of concern in this regard. Identification of genes with multiple responses across environmental stresses is the genetic foundation that leads to crop adaptation to environmental perturbations. Methods In this paper, we introduce an integrated approach to assess candidate genes for multiple stress responses across-species. The approach combines ontology based semantic data integration with expression profiling, comparative genomics, phylogenomics, functional gene enrichment and gene enrichment network analysis to identify genes associated with plant stress phenotypes. Five different ontologies, viz., Gene Ontology (GO), Trait Ontology (TO), Plant Ontology (PO), Growth Ontology (GRO) and Environment Ontology (EO) were used to semantically integrate drought related information. Results Target genes linked to Quantitative Trait Loci (QTLs) controlling yield and stress tolerance in sorghum (Sorghum bicolor (L.) Moench) and closely related species were identified. Based on the enriched GO terms of the biological processes, 1116 sorghum genes with potential responses to 5 different stresses, such as drought (18%), salt (32%), cold (20%), heat (8%) and oxidative stress (25%) were identified to be over-expressed. Out of 169 sorghum drought responsive QTLs associated genes that were identified based on expression datasets, 56% were shown to have multiple stress responses. On the other hand, out of 168 additional genes that have been evaluated for orthologous pairs, 90% were conserved across species for drought tolerance. Over 50% of identified maize and rice genes were responsive to drought and salt stresses and were co-located within multifunctional QTLs. Among the total identified multi-stress responsive genes, 272 targets were shown to be co-localized within QTLs associated with different traits that are responsive to multiple stresses. Ontology mapping was used to validate the identified genes, while reconstruction of the phylogenetic tree was instrumental to infer the evolutionary relationship of the sorghum orthologs. The results also show specific genes responsible for various interrelated components of drought response mechanism such as drought tolerance, drought avoidance and drought escape. Conclusions We submit that this approach is novel and to our knowledge, has not been used previously in any other research; it enables us to perform cross-species queries for genes that are likely to be associated with multiple stress tolerance, as a means to identify novel targets for engineering stress resistance in sorghum and possibly, in other crop species. PMID:29590108
A genome-wide association scan for acute insulin response to glucose in Hispanic Americans: The IRAS Family Study

PubMed Central

Rich, S. S.; Goodarzi, M. O.; Palmer, N. D.; Langefeld, C. D.; Ziegler, J.; Haffner, S. M.; Bryer-Ash, M.; Norris, J. M.; Taylor, K. D.; Haritunians, T.; Rotter, J. I.; Chen, Y-D. I.; Wagenknecht, L. E.; Bowden, D. W.; Bergman, R. N.

2009-01-01

Aims/Hypothesis The goal of this study was to identify genes and regions in the human genome that are associated with the acute insulin response to glucose (AIRg), an important predictor of type 2 diabetes, in Hispanic-American participants from the Insulin Resistance Atherosclerosis Family Study (IRAS FS). Methods A two-stage genome-wide association scan (GWAS) was performed in IRAS FS Hispanic-American samples. In the first stage, 318K single nucleotide polymorphisms (SNPs) were assessed in 229 Hispanic-American DNA samples (from 34 families) from San Antonio, TX. SNPs with the most significant associations with AIRg were genotyped in the entire set of IRAS FS Hispanic-American samples (n = 1190). In chromosomal regions with evidence of association, additional SNPs were genotyped to capture variation in genes. Results No individual SNP achieved genome-wide levels of significance (P < 5 × 10-7); however, two regions — chromosomes 6p21 and 20p11 — had multiple highly-ranked SNPs that were associated with AIRg. Additional genotyping in these regions supported the initial evidence for variants contributing to variation in AIRg. One region resides in a gene desert between PXT1 and KCTD20 on 6p21 while the region on 20p11 has several viable candidate genes (ENTPD6, PYGB, GINS1 and R4-691N24.1). Conclusions/Interpretation A GWAS in Hispanic-American samples identified several candidate genes and loci that may be associated with AIRg. These associations explain a small component of variation in AIRg. The genes identified are involved in phosphorylation and ion transport and provide preliminary evidence that these processes have importance in beta cell response. PMID:19430760
A genome-wide association scan for acute insulin response to glucose in Hispanic-Americans: the Insulin Resistance Atherosclerosis Family Study (IRAS FS).

PubMed

Rich, S S; Goodarzi, M O; Palmer, N D; Langefeld, C D; Ziegler, J; Haffner, S M; Bryer-Ash, M; Norris, J M; Taylor, K D; Haritunians, T; Rotter, J I; Chen, Y-D I; Wagenknecht, L E; Bowden, D W; Bergman, R N

2009-07-01

This study sought to identify genes and regions in the human genome that are associated with the acute insulin response to glucose (AIRg), an important predictor of type 2 diabetes, in Hispanic-American participants from the Insulin Resistance Atherosclerosis Family Study (IRAS FS). A two-stage genome-wide association scan (GWAS) was performed in IRAS FS Hispanic-American samples. In the first stage, 317K single nucleotide polymorphisms (SNPs) were assessed in 229 Hispanic-American DNA samples from 34 families from San Antonio, TX, USA. SNPs with the most significant associations with AIRg were genotyped in the entire set of IRAS FS Hispanic-American samples (n = 1,190). In chromosomal regions with evidence of association, additional SNPs were genotyped to capture variation in genes. No individual SNP achieved genome-wide levels of significance (p < 5 x 10(-7)); however, two regions (chromosomes 6p21 and 20p11) had multiple highly ranked SNPs that were associated with AIRg. Additional genotyping in these regions supported the initial evidence of variants contributing to variation in AIRg. One region resides in a gene desert between PXT1 and KCTD20 on 6p21, while the region on 20p11 has several viable candidate genes (ENTPD6, PYGB, GINS1 and RP4-691N24.1). A GWAS in Hispanic-American samples identified several candidate genes and loci that may be associated with AIRg. These associations explain a small component of variation in AIRg. The genes identified are involved in phosphorylation and ion transport, and provide preliminary evidence that these processes are important in beta cell response.
A Novel Mutation in ERCC8 Gene Causing Cockayne Syndrome

PubMed Central

Taghdiri, Maryam; Dastsooz, Hassan; Fardaei, Majid; Mohammadi, Sanaz; Farazi Fard, Mohammad Ali; Faghihi, Mohammad Ali

2017-01-01

Cockayne syndrome (CS) is a rare autosomal recessive multisystem disorder characterized by impaired neurological and sensory functions, cachectic dwarfism, microcephaly, and photosensitivity. This syndrome shows a variable age of onset and rate of progression, and its phenotypic spectrum include a wide range of severity. Due to the progressive nature of this disorder, diagnosis can be more important when additional signs and symptoms appear gradually and become steadily worse over time. Therefore, mutation analysis of genes involved in CS pathogenesis can be helpful to confirm the suspected clinical diagnosis. Here, we report a novel mutation in ERCC8 gene in a 16-year-old boy who suffers from poor weight gain, short stature, microcephaly, intellectual disability, and photosensitivity. The patient was born to consanguineous family with no previous documented disease in his parents. To identify disease-causing mutation in the patient, whole exome sequencing utilizing next-generation sequencing on an Illumina HiSeq 2000 platform was performed. Results revealed a novel homozygote mutation in ERCC8 gene (NM_000082: exon 11, c.1122G>C) in our patient. Another gene (ERCC6), which is also involved in CS did not have any disease-causing mutations in the proband. The new identified mutation was then confirmed by Sanger sequencing in the proband, his parents, and extended family members, confirming co-segregation with the disease. In addition, different bioinformatics programs which included MutationTaster, I-Mutant v2.0, NNSplice, Combined Annotation Dependent Depletion, The PhastCons, Genomic Evolutationary Rate Profiling conservation score, and T-Coffee Multiple Sequence Alignment predicted the pathogenicity of the mutation. Our study identified a rare novel mutation in ERCC8 gene and help to provide accurate genetic counseling and prenatal diagnosis to minimize new affected individuals in this family. PMID:28848724
A Novel Mutation in ERCC8 Gene Causing Cockayne Syndrome.

PubMed

Taghdiri, Maryam; Dastsooz, Hassan; Fardaei, Majid; Mohammadi, Sanaz; Farazi Fard, Mohammad Ali; Faghihi, Mohammad Ali

2017-01-01

Cockayne syndrome (CS) is a rare autosomal recessive multisystem disorder characterized by impaired neurological and sensory functions, cachectic dwarfism, microcephaly, and photosensitivity. This syndrome shows a variable age of onset and rate of progression, and its phenotypic spectrum include a wide range of severity. Due to the progressive nature of this disorder, diagnosis can be more important when additional signs and symptoms appear gradually and become steadily worse over time. Therefore, mutation analysis of genes involved in CS pathogenesis can be helpful to confirm the suspected clinical diagnosis. Here, we report a novel mutation in ERCC8 gene in a 16-year-old boy who suffers from poor weight gain, short stature, microcephaly, intellectual disability, and photosensitivity. The patient was born to consanguineous family with no previous documented disease in his parents. To identify disease-causing mutation in the patient, whole exome sequencing utilizing next-generation sequencing on an Illumina HiSeq 2000 platform was performed. Results revealed a novel homozygote mutation in ERCC8 gene (NM_000082: exon 11, c.1122G>C) in our patient. Another gene ( ERCC6 ), which is also involved in CS did not have any disease-causing mutations in the proband. The new identified mutation was then confirmed by Sanger sequencing in the proband, his parents, and extended family members, confirming co-segregation with the disease. In addition, different bioinformatics programs which included MutationTaster, I-Mutant v2.0, NNSplice, Combined Annotation Dependent Depletion, The PhastCons, Genomic Evolutationary Rate Profiling conservation score, and T-Coffee Multiple Sequence Alignment predicted the pathogenicity of the mutation. Our study identified a rare novel mutation in ERCC8 gene and help to provide accurate genetic counseling and prenatal diagnosis to minimize new affected individuals in this family.
Complex phenotype of dyskeratosis congenita and mood dysregulation with novel homozygous RTEL1 and TPH1 variants.

PubMed

Ungar, Rachel A; Giri, Neelam; Pao, Maryland; Khincha, Payal P; Zhou, Weiyin; Alter, Blanche P; Savage, Sharon A

2018-06-01

Dyskeratosis congenita (DC) is an inherited bone marrow failure syndrome caused by germline mutations in telomere biology genes. Patients have extremely short telomeres for their age and a complex phenotype including oral leukoplakia, abnormal skin pigmentation, and dysplastic nails in addition to bone marrow failure, pulmonary fibrosis, stenosis of the esophagus, lacrimal ducts and urethra, developmental anomalies, and high risk of cancer. We evaluated a patient with features of DC, mood dysregulation, diabetes, and lack of pubertal development. Family history was not available but genome-wide genotyping was consistent with consanguinity. Whole exome sequencing identified 82 variants of interest in 80 genes based on the following criteria: homozygous, <0.1% minor allele frequency in public and in-house databases, nonsynonymous, and predicted deleterious by multiple in silico prediction programs. Six genes were identified likely contributory to the clinical presentation. The cause of DC is likely due to homozygous splice site variants in regulator of telomere elongation helicase 1, a known DC and telomere biology gene. A homozygous, missense variant in tryptophan hydroxylase 1 may be clinically important as this gene encodes the rate limiting step in serotonin biosynthesis, a biologic pathway connected with mood disorders. Four additional genes (SCN4A, LRP4, GDAP1L1, and SPTBN5) had rare, missense homozygous variants that we speculate may contribute to portions of the clinical phenotype. This case illustrates the value of conducting detailed clinical and genomic evaluations on rare patients in order to identify new areas of research into the functional consequences of rare variants and their contribution to human disease. © 2018 Wiley Periodicals, Inc.
Sequencing of sporadic Attention-Deficit Hyperactivity Disorder (ADHD) identifies novel and potentially pathogenic de novo variants and excludes overlap with genes associated with autism spectrum disorder.

PubMed

Kim, Daniel Seung; Burt, Amber A; Ranchalis, Jane E; Wilmot, Beth; Smith, Joshua D; Patterson, Karynne E; Coe, Bradley P; Li, Yatong K; Bamshad, Michael J; Nikolas, Molly; Eichler, Evan E; Swanson, James M; Nigg, Joel T; Nickerson, Deborah A; Jarvik, Gail P

2017-06-01

Attention-Deficit Hyperactivity Disorder (ADHD) has high heritability; however, studies of common variation account for <5% of ADHD variance. Using data from affected participants without a family history of ADHD, we sought to identify de novo variants that could account for sporadic ADHD. Considering a total of 128 families, two analyses were conducted in parallel: first, in 11 unaffected parent/affected proband trios (or quads with the addition of an unaffected sibling) we completed exome sequencing. Six de novo missense variants at highly conserved bases were identified and validated from four of the 11 families: the brain-expressed genes TBC1D9, DAGLA, QARS, CSMD2, TRPM2, and WDR83. Separately, in 117 unrelated probands with sporadic ADHD, we sequenced a panel of 26 genes implicated in intellectual disability (ID) and autism spectrum disorder (ASD) to evaluate whether variation in ASD/ID-associated genes were also present in participants with ADHD. Only one putative deleterious variant (Gln600STOP) in CHD1L was identified; this was found in a single proband. Notably, no other nonsense, splice, frameshift, or highly conserved missense variants in the 26 gene panel were identified and validated. These data suggest that de novo variant analysis in families with independently adjudicated sporadic ADHD diagnosis can identify novel genes implicated in ADHD pathogenesis. Moreover, that only one of the 128 cases (0.8%, 11 exome, and 117 MIP sequenced participants) had putative deleterious variants within our data in 26 genes related to ID and ASD suggests significant independence in the genetic pathogenesis of ADHD as compared to ASD and ID phenotypes. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Global transcriptome analysis of spore formation in Myxococcus xanthus reveals a locus necessary for cell differentiation

PubMed Central

2010-01-01

Background Myxococcus xanthus is a Gram negative bacterium that can differentiate into metabolically quiescent, environmentally resistant spores. Little is known about the mechanisms involved in differentiation in part because sporulation is normally initiated at the culmination of a complex starvation-induced developmental program and only inside multicellular fruiting bodies. To obtain a broad overview of the sporulation process and to identify novel genes necessary for differentiation, we instead performed global transcriptome analysis of an artificial chemically-induced sporulation process in which addition of glycerol to vegetatively growing liquid cultures of M. xanthus leads to rapid and synchronized differentiation of nearly all cells into myxospore-like entities. Results Our analyses identified 1 486 genes whose expression was significantly regulated at least two-fold within four hours of chemical-induced differentiation. Most of the previously identified sporulation marker genes were significantly upregulated. In contrast, most genes that are required to build starvation-induced multicellular fruiting bodies, but which are not required for sporulation per se, were not significantly regulated in our analysis. Analysis of functional gene categories significantly over-represented in the regulated genes, suggested large rearrangements in core metabolic pathways, and in genes involved in protein synthesis and fate. We used the microarray data to identify a novel operon of eight genes that, when mutated, rendered cells unable to produce viable chemical- or starvation-induced spores. Importantly, these mutants displayed no defects in building fruiting bodies, suggesting these genes are necessary for the core sporulation process. Furthermore, during the starvation-induced developmental program, these genes were expressed in fruiting bodies but not in peripheral rods, a subpopulation of developing cells which do not sporulate. Conclusions These results suggest that microarray analysis of chemical-induced spore formation is an excellent system to specifically identify genes necessary for the core sporulation process of a Gram negative model organism for differentiation. PMID:20420673
Candidate gene biodosimetry markers of exposure to external ionizing radiation in human blood: A systematic review

PubMed Central

Sima, Chao; Amundson, Sally A.; Zenhausern, Frederic

2018-01-01

Purpose To compile a list of genes that have been reported to be affected by external ionizing radiation (IR) and to assess their performance as candidate biomarkers for individual human radiation dosimetry. Methods Eligible studies were identified through extensive searches of the online databases from 1978 to 2017. Original English-language publications of microarray studies assessing radiation-induced changes in gene expression levels in human blood after external IR were included. Genes identified in at least half of the selected studies were retained for bio-statistical analysis in order to evaluate their diagnostic ability. Results 24 studies met the criteria and were included in this study. Radiation-induced expression of 10,170 unique genes was identified and the 31 genes that have been identified in at least 50% of studies (12/24 studies) were selected for diagnostic power analysis. Twenty-seven genes showed a significant Spearman’s correlation with radiation dose. Individually, TNFSF4, FDXR, MYC, ZMAT3 and GADD45A provided the best discrimination of radiation dose < 2 Gy and dose ≥ 2 Gy according to according to their maximized Youden’s index (0.67, 0.55, 0.55, 0.55 and 0.53 respectively). Moreover, 12 combinations of three genes display an area under the Receiver Operating Curve (ROC) curve (AUC) = 1 reinforcing the concept of biomarker combinations instead of looking for an ideal and unique biomarker. Conclusion Gene expression is a promising approach for radiation dosimetry assessment. A list of robust candidate biomarkers has been identified from analysis of the studies published to date, confirming for example the potential of well-known genes such as FDXR and TNFSF4 or highlighting other promising gene such as ZMAT3. However, heterogeneity in protocols and analysis methods will require additional studies to confirm these results. PMID:29879226
Transposon Mutagenesis Identified Chromosomal and Plasmid Genes Essential for Adaptation of the Marine Bacterium Dinoroseobacter shibae to Anaerobic Conditions

PubMed Central

Ebert, Matthias; Laaß, Sebastian; Burghartz, Melanie; Petersen, Jörn; Koßmehl, Sebastian; Wöhlbrand, Lars; Rabus, Ralf; Wittmann, Christoph; Jahn, Dieter

2013-01-01

Anaerobic growth and survival are integral parts of the life cycle of many marine bacteria. To identify genes essential for the anoxic life of Dinoroseobacter shibae, a transposon library was screened for strains impaired in anaerobic denitrifying growth. Transposon insertions in 35 chromosomal and 18 plasmid genes were detected. The essential contribution of plasmid genes to anaerobic growth was confirmed with plasmid-cured D. shibae strains. A combined transcriptome and proteome approach identified oxygen tension-regulated genes. Transposon insertion sites of a total of 1,527 mutants without an anaerobic growth phenotype were determined to identify anaerobically induced but not essential genes. A surprisingly small overlap of only three genes (napA, phaA, and the Na+/Pi antiporter gene Dshi_0543) between anaerobically essential and induced genes was found. Interestingly, transposon mutations in genes involved in dissimilatory and assimilatory nitrate reduction (napA, nasA) and corresponding cofactor biosynthesis (genomic moaB, moeB, and dsbC and plasmid-carried dsbD and ccmH) were found to cause anaerobic growth defects. In contrast, mutation of anaerobically induced genes encoding proteins required for the later denitrification steps (nirS, nirJ, nosD), dimethyl sulfoxide reduction (dmsA1), and fermentation (pdhB1, arcA, aceE, pta, acs) did not result in decreased anaerobic growth under the conditions tested. Additional essential components (ferredoxin, cccA) of the anaerobic electron transfer chain and central metabolism (pdhB) were identified. Another surprise was the importance of sodium gradient-dependent membrane processes and genomic rearrangements via viruses, transposons, and insertion sequence elements for anaerobic growth. These processes and the observed contributions of cell envelope restructuring (lysM, mipA, fadK), C4-dicarboxylate transport (dctM1, dctM3), and protease functions to anaerobic growth require further investigation to unravel the novel underlying adaptation strategies. PMID:23974024
Genes2Networks: connecting lists of gene symbols using mammalian protein interactions databases.

PubMed

Berger, Seth I; Posner, Jeremy M; Ma'ayan, Avi

2007-10-04

In recent years, mammalian protein-protein interaction network databases have been developed. The interactions in these databases are either extracted manually from low-throughput experimental biomedical research literature, extracted automatically from literature using techniques such as natural language processing (NLP), generated experimentally using high-throughput methods such as yeast-2-hybrid screens, or interactions are predicted using an assortment of computational approaches. Genes or proteins identified as significantly changing in proteomic experiments, or identified as susceptibility disease genes in genomic studies, can be placed in the context of protein interaction networks in order to assign these genes and proteins to pathways and protein complexes. Genes2Networks is a software system that integrates the content of ten mammalian interaction network datasets. Filtering techniques to prune low-confidence interactions were implemented. Genes2Networks is delivered as a web-based service using AJAX. The system can be used to extract relevant subnetworks created from "seed" lists of human Entrez gene symbols. The output includes a dynamic linkable three color web-based network map, with a statistical analysis report that identifies significant intermediate nodes used to connect the seed list. Genes2Networks is powerful web-based software that can help experimental biologists to interpret lists of genes and proteins such as those commonly produced through genomic and proteomic experiments, as well as lists of genes and proteins associated with disease processes. This system can be used to find relationships between genes and proteins from seed lists, and predict additional genes or proteins that may play key roles in common pathways or protein complexes.
Selective Sweep Analysis in the Genomes of the 91-R and 91-C Drosophila melanogaster Strains Reveals Few of the ‘Usual Suspects’ in Dichlorodiphenyltrichloroethane (DDT) Resistance

PubMed Central

Steele, Laura D.; Coates, Brad; Valero, M. Carmen; Sun, Weilin; Seong, Keon Mook; Muir, William M.; Clark, John M.; Pittendrigh, Barry R.

2015-01-01

Adaptation of insect phenotypes for survival after exposure to xenobiotics can result from selection at multiple loci with additive genetic effects. To the authors’ knowledge, no selective sweep analysis has been performed to identify such loci in highly dichlorodiphenyltrichloroethane (DDT) resistant insects. Here we compared a highly DDT resistant phenotype in the Drosophila melanogaster (Drosophila) 91-R strain to the DDT susceptible 91-C strain, both of common origin. Whole genome re-sequencing data from pools of individuals was generated separately for 91-R and 91-C, and mapped to the reference Drosophila genome assembly (v. 5.72). Thirteen major and three minor effect chromosome intervals with reduced nucleotide diversity (π) were identified only in the 91-R population. Estimates of Tajima's D (D) showed corresponding evidence of directional selection in these same genome regions of 91-R, however, no similar reductions in π or D estimates were detected in 91-C. An overabundance of non-synonymous proteins coding to synonymous changes were identified in putative open reading frames associated with 91-R. Except for NinaC and Cyp4g1, none of the identified genes were the ‘usual suspects’ previously observed to be associated with DDT resistance. Additionally, up-regulated ATP-binding cassette transporters have been previously associated with DDT resistance; however, here we identified a structurally altered MDR49 candidate resistance gene. The remaining fourteen genes have not previously been shown to be associated with DDT resistance. These results suggest hitherto unknown mechanisms of DDT resistance, most of which have been overlooked in previous transcriptional studies, with some genes having orthologs in mammals. PMID:25826265
Genome-Wide and Gene-Based Meta-Analyses Identify Novel Loci Influencing Blood Pressure Response to Hydrochlorothiazide.

PubMed

Salvi, Erika; Wang, Zhiying; Rizzi, Federica; Gong, Yan; McDonough, Caitrin W; Padmanabhan, Sandosh; Hiltunen, Timo P; Lanzani, Chiara; Zaninello, Roberta; Chittani, Martina; Bailey, Kent R; Sarin, Antti-Pekka; Barcella, Matteo; Melander, Olle; Chapman, Arlene B; Manunta, Paolo; Kontula, Kimmo K; Glorioso, Nicola; Cusi, Daniele; Dominiczak, Anna F; Johnson, Julie A; Barlassina, Cristina; Boerwinkle, Eric; Cooper-DeHoff, Rhonda M; Turner, Stephen T

2017-01-01

This study aimed to identify novel loci influencing the antihypertensive response to hydrochlorothiazide monotherapy. A genome-wide meta-analysis of blood pressure (BP) response to hydrochlorothiazide was performed in 1739 white hypertensives from 6 clinical trials within the International Consortium for Antihypertensive Pharmacogenomics Studies, making it the largest study to date of its kind. No signals reached genome-wide significance (P<5×10 - 8 ), and the suggestive regions (P<10 -5 ) were cross-validated in 2 black cohorts treated with hydrochlorothiazide. In addition, a gene-based analysis was performed on candidate genes with previous evidence of involvement in diuretic response, in BP regulation, or in hypertension susceptibility. Using the genome-wide meta-analysis approach, with validation in blacks, we identified 2 suggestive regulatory regions linked to gap junction protein α1 gene (GJA1) and forkhead box A1 gene (FOXA1), relevant for cardiovascular and kidney function. With the gene-based approach, we identified hydroxy-delta-5-steroid dehydrogenase, 3 β- and steroid δ-isomerase 1 gene (HSD3B1) as significantly associated with BP response (P<2.28×10 - 4 ). HSD3B1 encodes the 3β-hydroxysteroid dehydrogenase enzyme and plays a crucial role in the biosynthesis of aldosterone and endogenous ouabain. By amassing all of the available pharmacogenomic studies of BP response to hydrochlorothiazide, and using 2 different analytic approaches, we identified 3 novel loci influencing BP response to hydrochlorothiazide. The gene-based analysis, never before applied to pharmacogenomics of antihypertensive drugs to our knowledge, provided a powerful strategy to identify a locus of interest, which was not identified in the genome-wide meta-analysis because of high allelic heterogeneity. These data pave the way for future investigations on new pathways and drug targets to enhance the current understanding of personalized antihypertensive treatment. © 2016 American Heart Association, Inc.
A high-resolution genetic, physical, and comparative gene map of the doublefoot (Dbf) region of mouse chromosome 1 and the region of conserved synteny on human chromosome 2q35.

PubMed

Hayes, C; Rump, A; Cadman, M R; Harrison, M; Evans, E P; Lyon, M F; Morriss-Kay, G M; Rosenthal, A; Brown, S D

2001-12-01

The mouse doublefoot (Dbf) mutant exhibits preaxial polydactyly in association with craniofacial defects. This mutation has previously been mapped to mouse chromosome 1. We have used a positional cloning strategy, coupled with a comparative sequencing approach using available human draft sequence, to identify putative candidates for the Dbf gene in the mouse and in homologous human region. We have constructed a high-resolution genetic map of the region, localizing the mutation to a 0.4-cM (+/-0.0061) interval on mouse chromosome 1. Furthermore, we have constructed contiguous BAC/PAC clone maps across the mouse and human Dbf region. Using existing markers and additional sequence tagged sites, which we have generated, we have anchored the physical map to the genetic map. Through the comparative sequencing of these clones we have identified 35 genes within this interval, indicating that the region is gene-rich. From this we have identified several genes that are known to be differentially expressed in the developing mid-gestation mouse embryo, some in the developing embryonic limb buds. These genes include those encoding known developmental signaling molecules such as WNT proteins and IHH, and we provide evidence that these genes are candidates for the Dbf mutation.
Ancestral multipartite units in light-responsive plant promoters have structural features correlating with specific phototransduction pathways.

PubMed Central

Argüello-Astorga, G R; Herrera-Estrella, L R

1996-01-01

Regulation of plant gene transcription by light is mediated by multipartite cis-regulatory units. Previous attempts to identify structural features that are common to all light-responsive elements (LREs) have been unsuccessful. To address the question of what is needed to confer photoresponsiveness to a promoter, the upstream sequences from more than 110 light-regulated plant genes were analyzed by a new, phylogenetic-structural method. As a result, 30 distinct conserved DNA module arrays (CMAs) associated with light-responsive promoter regions were identified. Several of these CMAs have remained invariant throughout the evolutionary radiation of angiosperms and are conserved between homologous genes as well as between members of different gene families. The identified CMAs share a gene superfamily-specific core that correlates with the particular phytochrome-dependent transduction pathway that controls their expression, i.e. ACCTA(A/C)C(A/C) for the cGMP-dependent phenylpropanoid metabolism-associated genes, and GATA(A/T)GR for the Ca2+/calmodulin-dependent photosynthesis-associated nuclear genes. In addition to suggesting a general model for the functional and structural organization of LREs, the data obtained in this study indicate that angiosperm LREs probably evolved from complex cis-acting elements involved in regulatory processes other than photoregulation in gymnosperms. PMID:8938415

Plant U13 orthologues and orphan snoRNAs identified by RNomics of RNA from Arabidopsis nucleoli

PubMed Central

Kim, Sang Hyon; Spensley, Mark; Choi, Seung Kook; Calixto, Cristiane P. G.; Pendle, Ali F.; Koroleva, Olga; Shaw, Peter J.; Brown, John W. S.

2010-01-01

Small nucleolar RNAs (snoRNAs) and small Cajal body-specific RNAs (scaRNAs) are non-coding RNAs whose main function in eukaryotes is to guide the modification of nucleotides in ribosomal and spliceosomal small nuclear RNAs, respectively. Full-length sequences of Arabidopsis snoRNAs and scaRNAs have been obtained from cDNA libraries of capped and uncapped small RNAs using RNA from isolated nucleoli from Arabidopsis cell cultures. We have identified 31 novel snoRNA genes (9 box C/D and 22 box H/ACA) and 15 new variants of previously described snoRNAs. Three related capped snoRNAs with a distinct gene organization and structure were identified as orthologues of animal U13snoRNAs. In addition, eight of the novel genes had no complementarity to rRNAs or snRNAs and are therefore putative orphan snoRNAs potentially reflecting wider functions for these RNAs. The nucleolar localization of a number of the snoRNAs and the localization to nuclear bodies of two putative scaRNAs was confirmed by in situ hybridization. The majority of the novel snoRNA genes were found in new gene clusters or as part of previously described clusters. These results expand the repertoire of Arabidopsis snoRNAs to 188 snoRNA genes with 294 gene variants. PMID:20081206
Sequences associated with human iris pigmentation.

PubMed Central

Frudakis, Tony; Thomas, Matthew; Gaskin, Zach; Venkateswarlu, K; Chandra, K Suresh; Ginjupalli, Siva; Gunturi, Sitaram; Natrajan, Sivamani; Ponnuswamy, Viswanathan K; Ponnuswamy, K N

2003-01-01

To determine whether and how common polymorphisms are associated with natural distributions of iris colors, we surveyed 851 individuals of mainly European descent at 335 SNP loci in 13 pigmentation genes and 419 other SNPs distributed throughout the genome and known or thought to be informative for certain elements of population structure. We identified numerous SNPs, haplotypes, and diplotypes (diploid pairs of haplotypes) within the OCA2, MYO5A, TYRP1, AIM, DCT, and TYR genes and the CYP1A2-15q22-ter, CYP1B1-2p21, CYP2C8-10q23, CYP2C9-10q24, and MAOA-Xp11.4 regions as significantly associated with iris colors. Half of the associated SNPs were located on chromosome 15, which corresponds with results that others have previously obtained from linkage analysis. We identified 5 additional genes (ASIP, MC1R, POMC, and SILV) and one additional region (GSTT2-22q11.23) with haplotype and/or diplotypes, but not individual SNP alleles associated with iris colors. For most of the genes, multilocus gene-wise genotype sequences were more strongly associated with iris colors than were haplotypes or SNP alleles. Diplotypes for these genes explain 15% of iris color variation. Apart from representing the first comprehensive candidate gene study for variable iris pigmentation and constituting a first step toward developing a classification model for the inference of iris color from DNA, our results suggest that cryptic population structure might serve as a leverage tool for complex trait gene mapping if genomes are screened with the appropriate ancestry informative markers. PMID:14704187
Differential gene expression in whitefly Bemisia tabaci-infested tomato (Solanum lycopersicum) plants at progressing developmental stages of the insect's life cycle.

PubMed

Estrada-Hernández, María Gloria; Valenzuela-Soto, José Humberto; Ibarra-Laclette, Enrique; Délano-Frier, John Paul

2009-09-01

A suppression-subtractive-hybridization (SSH) strategy was used to identify genes whose expression was modified in response to virus-free whitefly Bemisia tabaci (Bt, biotype A) infestation in tomato (Solanum lycopersicum) plants. Thus, forward and reverse SSH gene libraries were generated at four points in the whitefly's life cycle, namely at (1) 2 days (adult feeding and oviposition: phase I); (2) 7 days (mobile crawler stage: phase II); (3) 12 days (second to third instar nymphal transition: phase III) and (4) 18 days (fourth instar nymphal stage: phase IV). The 169 genes with altered expression (up and downregulated) that were identified in the eight generated SSH libraries, together with 75 additional genes that were selected on the basis of their involvement in resistance responses against phytofagous insects and pathogens, were printed on a Nexterion(®) Slide MPX 16 to monitor their pattern of expression at the above phases. The results indicated that Bt infestation in tomato led to distinctive phase-specific expression/repression patterns of several genes associated predominantly with photosynthesis, senescence, secondary metabolism and (a)biotic stress. Most of the gene expression modifications were detected in phase III, coinciding with intense larval feeding, whereas fewer changes were detected in phases I and IV. These results complement previously reported gene expression profiles in Bt-infested tomato and Arabidopisis, and support and expand the opinion that Bt infestation leads to the downregulation of specific defense responses in addition to those controlled by jasmonic acid. Copyright © Physiologia Plantarum 2009.
Cloning of the neurodegeneration gene drop-dead and characterization of additional phenotypes of its mutation.

PubMed

Blumenthal, Edward M

2008-01-01

Mutations in the Drosophila gene drop-dead (drd) result in early adult lethality and neurodegeneration, but the molecular identity of the drd gene and its mechanism of action are not known. This paper describes the characterization of a new X-linked recessive adult-lethal mutation, originally called lot's wife (lwf(1)) but subsequently identified as an allele of drd (drd(lwf)); drd(lwf) mutants die within two weeks of eclosion. Through mapping and complementation, the drd gene has been identified as CG33968, which encodes a putative integral membrane protein of unknown function. The drd(lwf) allele is associated with a nonsense mutation that eliminates nearly 80% of the CG33968 gene product; mutations in the same gene were also found in two previously described drd alleles. Characterization of drd (lwf) flies revealed additional phenotypes of drd, most notably, defects in food processing by the digestive system and in oogenesis. Mutant flies store significantly more food in their crops and defecate less than wild-type flies, suggesting that normal transfer of ingested food from the crop into the midgut is dependent upon the DRD gene product. The defect in oogenesis results in the sterility of homozygous mutant females and is associated with a reduction in the number of vitellogenic egg chambers. The disruption in vitellogenesis is far more severe than that seen in starved flies and so is unlikely to be a secondary consequence of the digestive phenotype. This study demonstrates that mutation of the drd gene CG33968 results in a complex phenotype affecting multiple physiological systems within the fly.
Characterization of basal gene expression trends over a diurnal cycle in Xiphophorus maculatus skin, brain and liver.

PubMed

Lu, Yuan; Reyes, Jose; Walter, Sean; Gonzalez, Trevor; Medrano, Geraldo; Boswell, Mikki; Boswell, William; Savage, Markita; Walter, Ronald

2018-06-01

Evolutionarily conserved diurnal circadian mechanisms maintain oscillating patterns of gene expression based on the day-night cycle. Xiphophorus fish have been used to evaluate transcriptional responses after exposure to various light sources and it was determined that each source incites distinct genetic responses in skin tissue. However, basal expression levels of genes that show oscillating expression patterns in day-night cycle, may affect the outcomes of such experiments, since basal gene expression levels at each point in the circadian path may influence the profile of identified light responsive genes. Lack of knowledge regarding diurnal fluctuations in basal gene expression patterns may confound the understanding of genetic responses to external stimuli (e.g., light) since the dynamic nature of gene expression implies animals subjected to stimuli at different times may be at very different stages within the continuum of genetic homeostasis. We assessed basal gene expression changes over a 24-hour period in 200 select Xiphophorus gene targets known to transcriptionally respond to various types of light exposure. We identified 22 genes in skin, 36 genes in brain and 28 genes in liver that exhibit basal oscillation of expression patterns. These genes, including known circadian regulators, produced the expected expression patterns over a 24-hour cycle when compared to circadian regulatory genes identified in other species, especially human and other vertebrate animal models. Our results suggest the regulatory network governing diurnal oscillating gene expression is similar between Xiphophorus and other vertebrates for the three Xiphophorus organs tested. In addition, we were able to categorize light responsive gene sets in Xiphophorus that do, and do not, exhibit circadian based oscillating expression patterns. Copyright © 2017 Elsevier Inc. All rights reserved.
Genome-wide survey and characterization of the WRKY gene family in Populus trichocarpa.

PubMed

He, Hongsheng; Dong, Qing; Shao, Yuanhua; Jiang, Haiyang; Zhu, Suwen; Cheng, Beijiu; Xiang, Yan

2012-07-01

WRKY transcription factors participate in diverse physiological and developmental processes in plants. They have highly conserved WRKYGQK amino acid sequences in their N-termini, followed by the novel zinc-finger-like motifs, Cys₂His₂ or Cys₂HisCys. To date, numerous WRKY genes have been identified and characterized in a number of herbaceous species. Survey and characterization of WRKY genes in a ligneous species would facilitate a better understanding of the evolutionary processes and functions of this gene family. In this study, 104 poplar WRKY genes (PtWRKY) were identified in the latest poplar genome sequence. According to their structural features, the predicted members were divided into the previously defined groups I-III, as described in rice. In addition, chromosomal localization of the genes demonstrated that there might be WRKY gene hot spots in 2.3 Mb regions on chromosome 14. Furthermore, approximately 83% (86 out of 104) WRKY genes participated in gene duplication events, including 69% (29 out of 42) gene pairs which exhibited segmental duplication. Using semi-quantitative RT-PCR, the expression patterns of subgroup III genes were investigated under different stresses [cold, drought, salinity and salicylic acid (SA)]. The data revealed that these genes presented different expression levels in response to various stress conditions. Expression analysis exhibited PtWRKY76 gene induced markedly in 0.1 mM SA or 25% PEG-6000 treatment. The results presented here provide a fundamental clue for cloning specific function genes in further studies and applications. This study identified 104 poplar WRKY genes and demonstrated WRKY gene hot spots on chromosome 14. Furthermore, semi-quantitative RT-PCR showed variable stress responses in subgroup III.
Identifying the viral genes encoding envelope glycoproteins for differentiation of Cyprinid herpesvirus 3 isolates.

PubMed

Han, Jee Eun; Kim, Ji Hyung; Renault, Tristan; Choresca, Casiano; Shin, Sang Phil; Jun, Jin Woo; Park, Se Chang

2013-01-31

Cyprinid herpes virus 3 (CyHV-3) diseases have been reported around the world and are associated with high mortalities of koi (Cyprinus carpio). Although little work has been conducted on the molecular analysis of this virus, glycoprotein genes identified in the present study seem to be valuable targets for genetic comparison of this virus. Three envelope glycoprotein genes (ORF25, 65 and 116) of the CyHV-3 isolates from the USA, Israel, Japan and Korea were compared, and interestingly, sequence insertions or deletions were observed in these target regions. In addition, polymorphisms were presented in microsatellite zones from two glycoprotein genes (ORF65 and 116). In phylogenetic tree analysis, the Korean isolate was remarkably distinguished from USA, Israel, Japan isolates. These findings may be suitable for many applications including isolates differentiation and phylogeny studies.
A genomic approach to the understanding of Xylella fastidiosa pathogenicity.

PubMed

Lambais, M R; Goldman, M H; Camargo, L E; Goldman, G H

2000-10-01

Xylella fastidiosa is a fastidious, xylem-limited bacterium that causes several economically important plant diseases, including citrus variegated chlorosis (CVC). X. fastidiosa is the first plant pathogen to have its genome completely sequenced. In addition, it is probably the least previously studied of any organism for which the complete genome sequence is available. Several pathogenicity-related genes have been identified in the X. fastidiosa genome by similarity with other bacterial genes involved in pathogenesis in plants, as well as in animals. The X. fastidiosa genome encodes different classes of proteins directly or indirectly involved in cell-cell interactions, degradation of plant cell walls, iron homeostasis, anti-oxidant responses, synthesis of toxins, and regulation of pathogenicity. Neither genes encoding members of the type III protein secretion system nor avirulence-like genes have been identified in X. fastidiosa.
Antennal Transcriptome Analysis of Odorant Reception Genes in the Red Turpentine Beetle (RTB), Dendroctonus valens.

PubMed

Gu, Xiao-Cui; Zhang, Ya-Nan; Kang, Ke; Dong, Shuang-Lin; Zhang, Long-Wa

2015-01-01

The red turpentine beetle (RTB), Dendroctonus valens LeConte (Coleoptera: Curculionidae, Scolytinae), is a destructive invasive pest of conifers which has become the second most important forest pest nationwide in China. Dendroctonus valens is known to use host odors and aggregation pheromones, as well as non-host volatiles, in host location and mass-attack modulation, and thus antennal olfaction is of the utmost importance for the beetles' survival and fitness. However, information on the genes underlying olfaction has been lacking in D. valens. Here, we report the antennal transcriptome of D. valens from next-generation sequencing, with the goal of identifying the olfaction gene repertoire that is involved in D. valens odor-processing. We obtained 51 million reads that were assembled into 61,889 genes, including 39,831 contigs and 22,058 unigenes. In total, we identified 68 novel putative odorant reception genes, including 21 transcripts encoding for putative odorant binding proteins (OBP), six chemosensory proteins (CSP), four sensory neuron membrane proteins (SNMP), 22 odorant receptors (OR), four gustatory receptors (GR), three ionotropic receptors (IR), and eight ionotropic glutamate receptors. We also identified 155 odorant/xenobiotic degradation enzymes from the antennal transcriptome, putatively identified to be involved in olfaction processes including cytochrome P450s, glutathione-S-transferases, and aldehyde dehydrogenase. Predicted protein sequences were compared with counterparts in Tribolium castaneum, Megacyllene caryae, Ips typographus, Dendroctonus ponderosae, and Agrilus planipennis. The antennal transcriptome described here represents the first study of the repertoire of odor processing genes in D. valens. The genes reported here provide a significant addition to the pool of identified olfactory genes in Coleoptera, which might represent novel targets for insect management. The results from our study also will assist with evolutionary analyses of coleopteran olfaction.
Antennal Transcriptome Analysis of Odorant Reception Genes in the Red Turpentine Beetle (RTB), Dendroctonus valens

PubMed Central

Dong, Shuang-Lin; Zhang, Long-Wa

2015-01-01

Background The red turpentine beetle (RTB), Dendroctonus valens LeConte (Coleoptera: Curculionidae, Scolytinae), is a destructive invasive pest of conifers which has become the second most important forest pest nationwide in China. Dendroctonus valens is known to use host odors and aggregation pheromones, as well as non-host volatiles, in host location and mass-attack modulation, and thus antennal olfaction is of the utmost importance for the beetles’ survival and fitness. However, information on the genes underlying olfaction has been lacking in D. valens. Here, we report the antennal transcriptome of D. valens from next-generation sequencing, with the goal of identifying the olfaction gene repertoire that is involved in D. valens odor-processing. Results We obtained 51 million reads that were assembled into 61,889 genes, including 39,831 contigs and 22,058 unigenes. In total, we identified 68 novel putative odorant reception genes, including 21 transcripts encoding for putative odorant binding proteins (OBP), six chemosensory proteins (CSP), four sensory neuron membrane proteins (SNMP), 22 odorant receptors (OR), four gustatory receptors (GR), three ionotropic receptors (IR), and eight ionotropic glutamate receptors. We also identified 155 odorant/xenobiotic degradation enzymes from the antennal transcriptome, putatively identified to be involved in olfaction processes including cytochrome P450s, glutathione-S-transferases, and aldehyde dehydrogenase. Predicted protein sequences were compared with counterparts in Tribolium castaneum, Megacyllene caryae, Ips typographus, Dendroctonus ponderosae, and Agrilus planipennis. Conclusion The antennal transcriptome described here represents the first study of the repertoire of odor processing genes in D. valens. The genes reported here provide a significant addition to the pool of identified olfactory genes in Coleoptera, which might represent novel targets for insect management. The results from our study also will assist with evolutionary analyses of coleopteran olfaction. PMID:25938508
Functional genomics annotation of a statistical epistasis network associated with bladder cancer susceptibility.

PubMed

Hu, Ting; Pan, Qinxin; Andrew, Angeline S; Langer, Jillian M; Cole, Michael D; Tomlinson, Craig R; Karagas, Margaret R; Moore, Jason H

2014-04-11

Several different genetic and environmental factors have been identified as independent risk factors for bladder cancer in population-based studies. Recent studies have turned to understanding the role of gene-gene and gene-environment interactions in determining risk. We previously developed the bioinformatics framework of statistical epistasis networks (SEN) to characterize the global structure of interacting genetic factors associated with a particular disease or clinical outcome. By applying SEN to a population-based study of bladder cancer among Caucasians in New Hampshire, we were able to identify a set of connected genetic factors with strong and significant interaction effects on bladder cancer susceptibility. To support our statistical findings using networks, in the present study, we performed pathway enrichment analyses on the set of genes identified using SEN, and found that they are associated with the carcinogen benzo[a]pyrene, a component of tobacco smoke. We further carried out an mRNA expression microarray experiment to validate statistical genetic interactions, and to determine if the set of genes identified in the SEN were differentially expressed in a normal bladder cell line and a bladder cancer cell line in the presence or absence of benzo[a]pyrene. Significant nonrandom sets of genes from the SEN were found to be differentially expressed in response to benzo[a]pyrene in both the normal bladder cells and the bladder cancer cells. In addition, the patterns of gene expression were significantly different between these two cell types. The enrichment analyses and the gene expression microarray results support the idea that SEN analysis of bladder in population-based studies is able to identify biologically meaningful statistical patterns. These results bring us a step closer to a systems genetic approach to understanding cancer susceptibility that integrates population and laboratory-based studies.
Co-expression modules construction by WGCNA and identify potential prognostic markers of uveal melanoma.

PubMed

Wan, Qi; Tang, Jing; Han, Yu; Wang, Dan

2018-01-01

Uveal melanoma is an aggressive cancer which has a high percentage recurrence and with a worse prognosis. Identify the potential prognostic markers of uveal melanoma may provide information for early detection of recurrence and treatment. RNA sequence data of uveal melanoma and patient clinic traits were obtained from The Cancer Genome Atlas (TCGA) database. Co-expression modules were built by weighted gene co -expression network analysis (WGCNA) and applied to investigate the relationship underlying modules and clinic traits. Besides, functional enrichment analysis was performed on these co-expression genes from interested modules. First, using WGCNA, identified 21 co-expression modules were constructed by the 10975 genes from the 80 human uveal melanoma samples. The number of genes in these modules ranged from 42 to 5091. Found four co -expression modules significantly correlated with three clinic traits (status, recurrence and recurrence Time). Module red, and purple positively correlated with patient's life status and recurrence Time. Module green positively correlates with recurrence. The result of functional enrichment analysis showed that the module magenta was mainly enriched genetic material assemble processes, the purple module was mainly enriched in tissue homeostasis and melanosome membrane and the module red was mainly enriched metastasis of cell, suggesting its critical role in the recurrence and development of the disease. Additionally, identified the hug gene (top connectivity with other genes) in each module. The hub gene SLC17A7, NTRK2, ABTB1 and ADPRHL1 might play a vital role in recurrence of uveal melanoma. Our findings provided the framework of co-expression gene modules of uveal melanoma and identified some prognostic markers might be detection of recurrence and treatment for uveal melanoma. Copyright © 2017 Elsevier Ltd. All rights reserved.
An unbiased approach to identify genes involved in development in a turtle with temperature-dependent sex determination.

PubMed

Chojnowski, Jena L; Braun, Edward L

2012-07-15

Many reptiles exhibit temperature-dependent sex determination (TSD). The initial cue in TSD is incubation temperature, unlike genotypic sex determination (GSD) where it is determined by the presence of specific alleles (or genetic loci). We used patterns of gene expression to identify candidates for genes with a role in TSD and other developmental processes without making a priori assumptions about the identity of these genes (ortholog-based approach). We identified genes with sexually dimorphic mRNA accumulation during the temperature sensitive period of development in the Red-eared slider turtle (Trachemys scripta), a turtle with TSD. Genes with differential mRNA accumulation in response to estrogen (estradiol-17β; E(2)) exposure and developmental stages were also identified. Sequencing 767 clones from three suppression-subtractive hybridization libraries yielded a total of 581 unique sequences. Screening a macroarray with a subset of those sequences revealed a total of 26 genes that exhibited differential mRNA accumulation: 16 female biased and 10 male biased. Additional analyses revealed that C16ORF62 (an unknown gene) and MALAT1 (a long noncoding RNA) exhibited increased mRNA accumulation at the male producing temperature relative to the female producing temperature during embryonic sexual development. Finally, we identified four genes (C16ORF62, CCT3, MMP2, and NFIB) that exhibited a stage effect and five genes (C16ORF62, CCT3, MMP2, NFIB and NOTCH2) showed a response to E(2) exposure. Here we report a survey of genes identified using patterns of mRNA accumulation during embryonic development in a turtle with TSD. Many previous studies have focused on examining the turtle orthologs of genes involved in mammalian development. Although valuable, the limitations of this approach are exemplified by our identification of two genes (MALAT1 and C16ORF62) that are sexually dimorphic during embryonic development. MALAT1 is a noncoding RNA that has not been implicated in sexual differentiation in other vertebrates and C16ORF62 has an unknown function. Our results revealed genes that are candidates for having roles in turtle embryonic development, including TSD, and highlight the need to expand our search parameters beyond protein-coding genes.
Prediction of epigenetically regulated genes in breast cancer cell lines.

PubMed

Loss, Leandro A; Sadanandam, Anguraj; Durinck, Steffen; Nautiyal, Shivani; Flaucher, Diane; Carlton, Victoria E H; Moorhead, Martin; Lu, Yontao; Gray, Joe W; Faham, Malek; Spellman, Paul; Parvin, Bahram

2010-06-04

Methylation of CpG islands within the DNA promoter regions is one mechanism that leads to aberrant gene expression in cancer. In particular, the abnormal methylation of CpG islands may silence associated genes. Therefore, using high-throughput microarrays to measure CpG island methylation will lead to better understanding of tumor pathobiology and progression, while revealing potentially new biomarkers. We have examined a recently developed high-throughput technology for measuring genome-wide methylation patterns called mTACL. Here, we propose a computational pipeline for integrating gene expression and CpG island methylation profiles to identify epigenetically regulated genes for a panel of 45 breast cancer cell lines, which is widely used in the Integrative Cancer Biology Program (ICBP). The pipeline (i) reduces the dimensionality of the methylation data, (ii) associates the reduced methylation data with gene expression data, and (iii) ranks methylation-expression associations according to their epigenetic regulation. Dimensionality reduction is performed in two steps: (i) methylation sites are grouped across the genome to identify regions of interest, and (ii) methylation profiles are clustered within each region. Associations between the clustered methylation and the gene expression data sets generate candidate matches within a fixed neighborhood around each gene. Finally, the methylation-expression associations are ranked through a logistic regression, and their significance is quantified through permutation analysis. Our two-step dimensionality reduction compressed 90% of the original data, reducing 137,688 methylation sites to 14,505 clusters. Methylation-expression associations produced 18,312 correspondences, which were used to further analyze epigenetic regulation. Logistic regression was used to identify 58 genes from these correspondences that showed a statistically significant negative correlation between methylation profiles and gene expression in the panel of breast cancer cell lines. Subnetwork enrichment of these genes has identified 35 common regulators with 6 or more predicted markers. In addition to identifying epigenetically regulated genes, we show evidence of differentially expressed methylation patterns between the basal and luminal subtypes. Our results indicate that the proposed computational protocol is a viable platform for identifying epigenetically regulated genes. Our protocol has generated a list of predictors including COL1A2, TOP2A, TFF1, and VAV3, genes whose key roles in epigenetic regulation is documented in the literature. Subnetwork enrichment of these predicted markers further suggests that epigenetic regulation of individual genes occurs in a coordinated fashion and through common regulators.
Signed weighted gene co-expression network analysis of transcriptional regulation in murine embryonic stem cells

PubMed Central

Mason, Mike J; Fan, Guoping; Plath, Kathrin; Zhou, Qing; Horvath, Steve

2009-01-01

Background Recent work has revealed that a core group of transcription factors (TFs) regulates the key characteristics of embryonic stem (ES) cells: pluripotency and self-renewal. Current efforts focus on identifying genes that play important roles in maintaining pluripotency and self-renewal in ES cells and aim to understand the interactions among these genes. To that end, we investigated the use of unsigned and signed network analysis to identify pluripotency and differentiation related genes. Results We show that signed networks provide a better systems level understanding of the regulatory mechanisms of ES cells than unsigned networks, using two independent murine ES cell expression data sets. Specifically, using signed weighted gene co-expression network analysis (WGCNA), we found a pluripotency module and a differentiation module, which are not identified in unsigned networks. We confirmed the importance of these modules by incorporating genome-wide TF binding data for key ES cell regulators. Interestingly, we find that the pluripotency module is enriched with genes related to DNA damage repair and mitochondrial function in addition to transcriptional regulation. Using a connectivity measure of module membership, we not only identify known regulators of ES cells but also show that Mrpl15, Msh6, Nrf1, Nup133, Ppif, Rbpj, Sh3gl2, and Zfp39, among other genes, have important roles in maintaining ES cell pluripotency and self-renewal. We also report highly significant relationships between module membership and epigenetic modifications (histone modifications and promoter CpG methylation status), which are known to play a role in controlling gene expression during ES cell self-renewal and differentiation. Conclusion Our systems biologic re-analysis of gene expression, transcription factor binding, epigenetic and gene ontology data provides a novel integrative view of ES cell biology. PMID:19619308
Analysis of the MdMYB1 gene sequence and development of new molecular markers related to apple skin color and fruit-bearing traits.

PubMed

Yuan, Kejun; Wang, Changjun; Wang, Jianghui; Xin, Li; Zhou, Guangfang; Li, Linguang; Shen, Guangning

2014-12-01

MdMYB1, a key transcription factor determining apple skin color, coordinately regulates genes in the anthocyanin pathway. In this study, we analyzed the MdMYB1 gene and its relationship to apple skin color and fruit-bearing traits to better understand this gene and its application to apple breeding. A previously reported MdMYB1 dCAPS marker failed to identify alleles of the MdMYB1 gene in 'Fuji', a very important apple cultivar. In this study, we revealed that the polymorphic site related to the MdMYB1 dCAPS marker is heterozygous in 'Fuji'. In addition, two new polymorphic sites related to apple skin color were identified in the MdMYB1 gene, with two new molecular markers accordingly developed. Testing of these markers in 'Fuji' and its progeny revealed that they could predict apple skin color and identify alleles of the MdMYB1 gene in this cultivar. Most interestingly, the allele MdMYB1-2 in 'Gala' apple and its hybrid plants was found to be related to the fruit-bearing trait, and the molecular marker Mb2 was able to identify the MdMYB1-2 allele. Our study is apparently the first to report a relationship between the MdMYB1 allele and the fruit-bearing trait in apple. More work is needed to determine whether and how the MdMYB1 gene or a gene linked to the MdMYB1-2 allele influences the flowering trait in perennial apple trees, and whether flowering in other plants is influenced by related genes.
Identifying positive selection candidate loci for high-altitude adaptation in Andean populations

PubMed Central

2009-01-01

High-altitude environments (>2,500 m) provide scientists with a natural laboratory to study the physiological and genetic effects of low ambient oxygen tension on human populations. One approach to understanding how life at high altitude has affected human metabolism is to survey genome-wide datasets for signatures of natural selection. In this work, we report on a study to identify selection-nominated candidate genes involved in adaptation to hypoxia in one highland group, Andeans from the South American Altiplano. We analysed dense microarray genotype data using four test statistics that detect departures from neutrality. Using a candidate gene, single nucleotide polymorphism-based approach, we identified genes exhibiting preliminary evidence of recent genetic adaptation in this population. These included genes that are part of the hypoxia-inducible transcription factor (HIF) pathway, a biochemical pathway involved in oxygen homeostasis, as well as three other genomic regions previously not known to be associated with high-altitude phenotypes. In addition to identifying selection-nominated candidate genes, we also tested whether the HIF pathway shows evidence of natural selection. Our results indicate that the genes of this biochemical pathway as a group show no evidence of having evolved in response to hypoxia in Andeans. Results from particular HIF-targeted genes, however, suggest that genes in this pathway could play a role in Andean adaptation to high altitude, even if the pathway as a whole does not show higher relative rates of evolution. These data suggest a genetic role in high-altitude adaptation and provide a basis for genotype/phenotype association studies that are necessary to confirm the role of putative natural selection candidate genes and gene regions in adaptation to altitude. PMID:20038496
[Design of primers to DNA of lactic acid bacteria].

PubMed

Lashchevskiĭ, V V; Kovalenko, N K

2003-01-01

Primers LP1-LP2 to the gene 16S rRNA have been developed, which permit to differentiate lactic acid bacteria: Lactobacillus plantarum, L. delbrueckii subsp. bulgaricus and Streptococcus salivarius subsp. thermophilus. The strain-specific and species-specific differentiations are possible under different annealing temperature. Additional fragments, which are synthesized outside the framework of gene 16S rRNA reading, provide for the strain-specific type of differentiation, and the fragment F864 read in the gene 16S rRNA permits identifying L. plantarum.
Rapid evolution of avirulence genes in rice blast fungus Magnaporthe oryzae

PubMed Central

2014-01-01

Background Rice blast fungus Magnaporthe oryzae is one of the most devastating pathogens in rice. Avirulence genes in this fungus share a gene-for-gene relationship with the resistance genes in its host rice. Although numerous studies have shown that rice blast R-genes are extremely diverse and evolve rapidly in their host populations, little is known about the evolutionary patterns of the Avr-genes in the pathogens. Results Here, six well-characterized Avr-genes and seven randomly selected non-Avr control genes were used to investigate the genetic variations in 62 rice blast strains from different parts of China. Frequent presence/absence polymorphisms, high levels of nucleotide variation (~10-fold higher than non-Avr genes), high non-synonymous to synonymous substitution ratios, and frequent shared non-synonymous substitution were observed in the Avr-genes of these diversified blast strains. In addition, most Avr-genes are closely associated with diverse repeated sequences, which may partially explain the frequent presence/absence polymorphisms in Avr-genes. Conclusion The frequent deletion and gain of Avr-genes and rapid non-synonymous variations might be the primary mechanisms underlying rapid adaptive evolution of pathogens toward virulence to their host plants, and these features can be used as the indicators for identifying additional Avr-genes. The high number of nucleotide polymorphisms among Avr-gene alleles could also be used to distinguish genetic groups among different strains. PMID:24725999
Targeted Deep Resequencing Identifies Coding Variants in the PEAR1 Gene That Play a Role in Platelet Aggregation

PubMed Central

Kim, Yoonhee; Suktitipat, Bhoom; Yanek, Lisa R.; Faraday, Nauder; Wilson, Alexander F.; Becker, Diane M.; Becker, Lewis C.; Mathias, Rasika A.

2013-01-01

Platelet aggregation is heritable, and genome-wide association studies have detected strong associations with a common intronic variant of the platelet endothelial aggregation receptor1 (PEAR1) gene both in African American and European American individuals. In this study, we used a sequencing approach to identify additional exonic variants in PEAR1 that may also determine variability in platelet aggregation in the GeneSTAR Study. A 0.3 Mb targeted region on chromosome 1q23.1 including the entire PEAR1 gene was Sanger sequenced in 104 subjects (45% male, 49% African American, age = 52±13) selected on the basis of hyper- and hypo- aggregation across three different agonists (collagen, epinephrine, and adenosine diphosphate). Single-variant and multi-variant burden tests for association were performed. Of the 235 variants identified through sequencing, 61 were novel, and three of these were missense variants. More rare variants (MAF<5%) were noted in African Americans compared to European Americans (108 vs. 45). The common intronic GWAS-identified variant (rs12041331) demonstrated the most significant association signal in African Americans (p = 4.020×10−4); no association was seen for additional exonic variants in this group. In contrast, multi-variant burden tests indicated that exonic variants play a more significant role in European Americans (p = 0.0099 for the collective coding variants compared to p = 0.0565 for intronic variant rs12041331). Imputation of the individual exonic variants in the rest of the GeneSTAR European American cohort (N = 1,965) supports the results noted in the sequenced discovery sample: p = 3.56×10−4, 2.27×10−7, 5.20×10−5 for coding synonymous variant rs56260937 and collagen, epinephrine and adenosine diphosphate induced platelet aggregation, respectively. Sequencing approaches confirm that a common intronic variant has the strongest association with platelet aggregation in African Americans, and show that exonic variants play an additional role in platelet aggregation in European Americans. PMID:23704978

Genome-Wide Transcriptome Analysis Reveals Conserved and Distinct Molecular Mechanisms of Al Resistance in Buckwheat (Fagopyrum esculentum Moench) Leaves

PubMed Central

Chen, Wei Wei; Xu, Jia Meng; Jin, Jian Feng; Lou, He Qiang; Fan, Wei

2017-01-01

Being an Al-accumulating crop, buckwheat detoxifies and tolerates Al not only in roots but also in leaves. While much progress has recently been made toward Al toxicity and resistance mechanisms in roots, little is known about the molecular basis responsible for detoxification and tolerance processes in leaves. Here, we carried out transcriptome analysis of buckwheat leaves in response to Al stress (20 µM, 24 h). We obtained 33,931 unigenes with 26,300 unigenes annotated in the NCBI database, and identified 1063 upregulated and 944 downregulated genes under Al stress. Functional category analysis revealed that genes related to protein translation, processing, degradation and metabolism comprised the biological processes most affected by Al, suggesting that buckwheat leaves maintain flexibility under Al stress by rapidly reprogramming their physiology and metabolism. Analysis of genes related to transcription regulation revealed that a large proportion of chromatin-regulation genes are specifically downregulated by Al stress, whereas transcription factor genes are overwhelmingly upregulated. Furthermore, we identified 78 upregulated and 22 downregulated genes that encode transporters. Intriguingly, only a few genes were overlapped with root Al-regulated transporter genes, which include homologs of AtMATE, ALS1, STAR1, ALS3 and a divalent ion symporter. In addition, we identified a subset of genes involved in development, in which genes associated with flowering regulation were important. Based on these data, it is proposed that buckwheat leaves develop conserved and distinct mechanisms to cope with Al toxicity. PMID:28846612
Co-expression analysis identifies CRC and AP1 the regulator of Arabidopsis fatty acid biosynthesis.

PubMed

Han, Xinxin; Yin, Linlin; Xue, Hongwei

2012-07-01

Fatty acids (FAs) play crucial rules in signal transduction and plant development, however, the regulation of FA metabolism is still poorly understood. To study the relevant regulatory network, fifty-eight FA biosynthesis genes including de novo synthases, desaturases and elongases were selected as "guide genes" to construct the co-expression network. Calculation of the correlation between all Arabidopsis thaliana (L.) genes with each guide gene by Arabidopsis co-expression dating mining tools (ACT) identifies 797 candidate FA-correlated genes. Gene ontology (GO) analysis of these co-expressed genes showed they are tightly correlated to photosynthesis and carbohydrate metabolism, and function in many processes. Interestingly, 63 transcription factors (TFs) were identified as candidate FA biosynthesis regulators and 8 TF families are enriched. Two TF genes, CRC and AP1, both correlating with 8 FA guide genes, were further characterized. Analyses of the ap1 and crc mutant showed the altered total FA composition of mature seeds. The contents of palmitoleic acid, stearic acid, arachidic acid and eicosadienoic acid are decreased, whereas that of oleic acid is increased in ap1 and crc seeds, which is consistent with the qRT-PCR analysis revealing the suppressed expression of the corresponding guide genes. In addition, yeast one-hybrid analysis and electrophoretic mobility shift assay (EMSA) revealed that CRC can bind to the promoter regions of KCS7 and KCS15, indicating that CRC may directly regulate FA biosynthesis. © 2012 Institute of Botany, Chinese Academy of Sciences.
Genome-Wide Transcriptome Analysis Reveals Conserved and Distinct Molecular Mechanisms of Al Resistance in Buckwheat (Fagopyrum esculentum Moench) Leaves.

PubMed

Chen, Wei Wei; Xu, Jia Meng; Jin, Jian Feng; Lou, He Qiang; Fan, Wei; Yang, Jian Li

2017-08-27

Being an Al-accumulating crop, buckwheat detoxifies and tolerates Al not only in roots but also in leaves. While much progress has recently been made toward Al toxicity and resistance mechanisms in roots, little is known about the molecular basis responsible for detoxification and tolerance processes in leaves. Here, we carried out transcriptome analysis of buckwheat leaves in response to Al stress (20 µM, 24 h). We obtained 33,931 unigenes with 26,300 unigenes annotated in the NCBI database, and identified 1063 upregulated and 944 downregulated genes under Al stress. Functional category analysis revealed that genes related to protein translation, processing, degradation and metabolism comprised the biological processes most affected by Al, suggesting that buckwheat leaves maintain flexibility under Al stress by rapidly reprogramming their physiology and metabolism. Analysis of genes related to transcription regulation revealed that a large proportion of chromatin-regulation genes are specifically downregulated by Al stress, whereas transcription factor genes are overwhelmingly upregulated. Furthermore, we identified 78 upregulated and 22 downregulated genes that encode transporters. Intriguingly, only a few genes were overlapped with root Al-regulated transporter genes, which include homologs of AtMATE , ALS1 , STAR1 , ALS3 and a divalent ion symporter. In addition, we identified a subset of genes involved in development, in which genes associated with flowering regulation were important. Based on these data, it is proposed that buckwheat leaves develop conserved and distinct mechanisms to cope with Al toxicity.
Wheat CBF gene family: identification of polymorphisms in the CBF coding sequence.

PubMed

Mohseni, Sara; Che, Hua; Djillali, Zakia; Dumont, Estelle; Nankeu, Joseph; Danyluk, Jean

2012-12-01

Expression of cold-regulated genes needed for protection against freezing stress is mediated, in part, by the CBF transcription factor family. Previous studies with temperate cereals suggested that the CBF gene family in wheat was large, and that CBF genes were at the base of an important low temperature tolerance trait. Therefore, the goal of our study was to identify the CBF repertoire in the freezing-tolerant hexaploid wheat cultivar Norstar, and then to examine if the coding region of CBF genes in two spring cultivars contain polymorphisms that could affect the protein sequence and structure. Our analyses reveal that hexaploid wheat contains a complex CBF family consisting of at least 65 CBF genes of which 60 are known to be expressed in the cultivar Norstar. They represent 27 paralogous genes with 1-3 homeologous copies for the A, B, and D genomes. The cultivar Norstar contains two pseudogenes and at least 24 additional proteins having sequences and (or) structures that deviate from the consensus in the conserved AP2 DNA-binding and (or) C-terminal activation-domains. This suggests that in cultivars such as Norstar, low temperature tolerance may be increased through breeding of additional optimal alleles. The examination of the CBF repertoire present in the two spring cultivars, Chinese Spring and Manitou, reveals that they have additional polymorphisms affecting conserved positions in these domains. Understanding the effects of these polymorphisms will provide additional information for the selection of optimum CBF alleles in Triticeae breeding programs.
Microarray and differential display identify genes involved in jasmonate-dependent anther development.

PubMed

Mandaokar, Ajin; Kumar, V Dinesh; Amway, Matt; Browse, John

2003-07-01

Jasmonate (JA) is a signaling compound essential for anther development and pollen fertility in Arabidopsis. Mutations that block the pathway of JA synthesis result into male sterility. To understand the processes of anther and pollen maturation, we used microarray and differential display approaches to compare gene expression pattern in anthers of wild-type Arabidopsis and the male-sterile mutant, opr3. Microarray experiment revealed 25 genes that were up-regulated more than 1.8-fold in wild-type anthers as compared to mutant anthers. Experiments based on differential display identified 13 additional genes up-regulated in wild-type anthers compared to opr3 for a total of 38 differentially expressed genes. Searches of the Arabidopsis and non-redundant databases disclosed known or likely functions for 28 of the 38 genes identified, while 10 genes encode proteins of unknown function. Northern blot analysis of eight representative clones as probes confirmed low expression in opr3 anthers compared with wild-type anthers. JA responsiveness of these same genes was also investigated by northern blot analysis of anther RNA isolated from wild-type and opr3 plants, In these experiments, four genes were induced in opr3 anthers within 0.5-1 h of JA treatment while the remaining genes were up-regulated only 1-8 h after JA application. None of these genes was induced by JA in anthers of the coil mutant that is deficient in JA responsiveness. The four early-induced genes in opr3 encode lipoxygenase, a putative bHLH transcription factor, epithiospecifier protein and an unknown protein. We propose that these and other early components may be involved in JA signaling and in the initiation of developmental processes. The four late genes encode an extensin-like protein, a peptide transporter and two unknown proteins, which may represent components required later in anther and pollen maturation. Transcript profiling has provided a successful approach to identify genes involved in anther and pollen maturation in Arabidopsis.
Lessons learned from additional research analyses of unsolved clinical exome cases.

PubMed

Eldomery, Mohammad K; Coban-Akdemir, Zeynep; Harel, Tamar; Rosenfeld, Jill A; Gambin, Tomasz; Stray-Pedersen, Asbjørg; Küry, Sébastien; Mercier, Sandra; Lessel, Davor; Denecke, Jonas; Wiszniewski, Wojciech; Penney, Samantha; Liu, Pengfei; Bi, Weimin; Lalani, Seema R; Schaaf, Christian P; Wangler, Michael F; Bacino, Carlos A; Lewis, Richard Alan; Potocki, Lorraine; Graham, Brett H; Belmont, John W; Scaglia, Fernando; Orange, Jordan S; Jhangiani, Shalini N; Chiang, Theodore; Doddapaneni, Harsha; Hu, Jianhong; Muzny, Donna M; Xia, Fan; Beaudet, Arthur L; Boerwinkle, Eric; Eng, Christine M; Plon, Sharon E; Sutton, V Reid; Gibbs, Richard A; Posey, Jennifer E; Yang, Yaping; Lupski, James R

2017-03-21

Given the rarity of most single-gene Mendelian disorders, concerted efforts of data exchange between clinical and scientific communities are critical to optimize molecular diagnosis and novel disease gene discovery. We designed and implemented protocols for the study of cases for which a plausible molecular diagnosis was not achieved in a clinical genomics diagnostic laboratory (i.e. unsolved clinical exomes). Such cases were recruited to a research laboratory for further analyses, in order to potentially: (1) accelerate novel disease gene discovery; (2) increase the molecular diagnostic yield of whole exome sequencing (WES); and (3) gain insight into the genetic mechanisms of disease. Pilot project data included 74 families, consisting mostly of parent-offspring trios. Analyses performed on a research basis employed both WES from additional family members and complementary bioinformatics approaches and protocols. Analysis of all possible modes of Mendelian inheritance, focusing on both single nucleotide variants (SNV) and copy number variant (CNV) alleles, yielded a likely contributory variant in 36% (27/74) of cases. If one includes candidate genes with variants identified within a single family, a potential contributory variant was identified in a total of ~51% (38/74) of cases enrolled in this pilot study. The molecular diagnosis was achieved in 30/63 trios (47.6%). Besides this, the analysis workflow yielded evidence for pathogenic variants in disease-associated genes in 4/6 singleton cases (66.6%), 1/1 multiplex family involving three affected siblings, and 3/4 (75%) quartet families. Both the analytical pipeline and the collaborative efforts between the diagnostic and research laboratories provided insights that allowed recent disease gene discoveries (PURA, TANGO2, EMC1, GNB5, ATAD3A, and MIPEP) and increased the number of novel genes, defined in this study as genes identified in more than one family (DHX30 and EBF3). An efficient genomics pipeline in which clinical sequencing in a diagnostic laboratory is followed by the detailed reanalysis of unsolved cases in a research environment, supplemented with WES data from additional family members, and subject to adjuvant bioinformatics analyses including relaxed variant filtering parameters in informatics pipelines, can enhance the molecular diagnostic yield and provide mechanistic insights into Mendelian disorders. Implementing these approaches requires collaborative clinical molecular diagnostic and research efforts.
Novel Crohn disease locus identified by genome-wide association maps to a gene desert on 5p13.1 and modulates expression of PTGER4.

PubMed

Libioulle, Cécile; Louis, Edouard; Hansoul, Sarah; Sandor, Cynthia; Farnir, Frédéric; Franchimont, Denis; Vermeire, Séverine; Dewit, Olivier; de Vos, Martine; Dixon, Anna; Demarche, Bruno; Gut, Ivo; Heath, Simon; Foglio, Mario; Liang, Liming; Laukens, Debby; Mni, Myriam; Zelenika, Diana; Van Gossum, André; Rutgeerts, Paul; Belaiche, Jacques; Lathrop, Mark; Georges, Michel

2007-04-20

To identify novel susceptibility loci for Crohn disease (CD), we undertook a genome-wide association study with more than 300,000 SNPs characterized in 547 patients and 928 controls. We found three chromosome regions that provided evidence of disease association with p-values between 10(-6) and 10(-9). Two of these (IL23R on Chromosome 1 and CARD15 on Chromosome 16) correspond to genes previously reported to be associated with CD. In addition, a 250-kb region of Chromosome 5p13.1 was found to contain multiple markers with strongly suggestive evidence of disease association (including four markers with p < 10(-7)). We replicated the results for 5p13.1 by studying 1,266 additional CD patients, 559 additional controls, and 428 trios. Significant evidence of association (p < 4 x 10(-4)) was found in case/control comparisons with the replication data, while associated alleles were over-transmitted to affected offspring (p < 0.05), thus confirming that the 5p13.1 locus contributes to CD susceptibility. The CD-associated 250-kb region was saturated with 111 SNP markers. Haplotype analysis supports a complex locus architecture with multiple variants contributing to disease susceptibility. The novel 5p13.1 CD locus is contained within a 1.25-Mb gene desert. We present evidence that disease-associated alleles correlate with quantitative expression levels of the prostaglandin receptor EP4, PTGER4, the gene that resides closest to the associated region. Our results identify a major new susceptibility locus for CD, and suggest that genetic variants associated with disease risk at this locus could modulate cis-acting regulatory elements of PTGER4.
Novel Crohn Disease Locus Identified by Genome-Wide Association Maps to a Gene Desert on 5p13.1 and Modulates Expression of PTGER4

PubMed Central

Libioulle, Cécile; Louis, Edouard; Hansoul, Sarah; Sandor, Cynthia; Farnir, Frédéric; Franchimont, Denis; Vermeire, Séverine; Dewit, Olivier; de Vos, Martine; Dixon, Anna; Demarche, Bruno; Gut, Ivo; Heath, Simon; Foglio, Mario; Liang, Liming; Laukens, Debby; Mni, Myriam; Zelenika, Diana; Gossum, André Van; Rutgeerts, Paul; Belaiche, Jacques; Lathrop, Mark; Georges, Michel

2007-01-01

To identify novel susceptibility loci for Crohn disease (CD), we undertook a genome-wide association study with more than 300,000 SNPs characterized in 547 patients and 928 controls. We found three chromosome regions that provided evidence of disease association with p-values between 10−6 and 10−9. Two of these (IL23R on Chromosome 1 and CARD15 on Chromosome 16) correspond to genes previously reported to be associated with CD. In addition, a 250-kb region of Chromosome 5p13.1 was found to contain multiple markers with strongly suggestive evidence of disease association (including four markers with p < 10−7). We replicated the results for 5p13.1 by studying 1,266 additional CD patients, 559 additional controls, and 428 trios. Significant evidence of association (p < 4 × 10−4) was found in case/control comparisons with the replication data, while associated alleles were over-transmitted to affected offspring (p < 0.05), thus confirming that the 5p13.1 locus contributes to CD susceptibility. The CD-associated 250-kb region was saturated with 111 SNP markers. Haplotype analysis supports a complex locus architecture with multiple variants contributing to disease susceptibility. The novel 5p13.1 CD locus is contained within a 1.25-Mb gene desert. We present evidence that disease-associated alleles correlate with quantitative expression levels of the prostaglandin receptor EP4, PTGER4, the gene that resides closest to the associated region. Our results identify a major new susceptibility locus for CD, and suggest that genetic variants associated with disease risk at this locus could modulate cis-acting regulatory elements of PTGER4. PMID:17447842
Expressed sequence tag (EST) analysis of the pine wood nematode Bursaphelenchus xylophilus and B. mucronatus.

PubMed

Kikuchi, Taisei; Aikawa, Takuya; Kosaka, Hajime; Pritchard, Leighton; Ogura, Nobuo; Jones, John T

2007-09-01

Most Bursaphelenchus species feed on fungi that colonise dead or dying trees. However, Bursaphelenchus xylophilus is unique in that in addition to feeding on fungi it has the capacity to be a parasite of live pine trees. We present an analysis of over 13,000 expressed sequence tags (ESTs) from B. xylophilus and, by way of contrast, over 3000 ESTs from a closely related species that does not parasitise plants as readily; B. mucronatus. Four libraries from B. xylophilus, from a variety of life stages including fungal feeding nematodes, nematodes extracted from plants and dauer-like stage nematodes, and one library from B. mucronatus were constructed and used to generate ESTs. Contig analysis showed that the 13,327 B. xylophilus ESTs could be grouped into 2110 contigs and 4377 singletons giving a total of 6487 identified genes. Similarly the 3193 B. mucronatus ESTs yielded a total of 2219 identified genes from 425 contigs and 1794 singletons. A variety of proteins potentially important in the parasitic process of B. xylophilus and B. mucronatus, including plant and fungal cell wall degrading enzymes and a novel gene potentially encoding a expansin-like protein that may disrupt non-covalent bonds in the plant cell wall were identified in the libraries. Additionally several gene candidates potentially involved in dauer entry or maintenance were also identified in the EST dataset. The EST sequences from this study will provide a solid base for future research on the biology, pathogenicity and evolutionary history of this nematode group.
Differentially expressed genes and proteins upon drought acclimation in tolerant and sensitive genotypes of Coffea canephora

PubMed Central

Marraccini, Pierre; Vinecky, Felipe; Alves, Gabriel S.C.; Ramos, Humberto J.O.; Elbelt, Sonia; Vieira, Natalia G.; Carneiro, Fernanda A.; Sujii, Patricia S.; Alekcevetch, Jean C.; Silva, Vânia A.; DaMatta, Fábio M.; Ferrão, Maria A.G.; Leroy, Thierry; Pot, David; Vieira, Luiz G.E.; da Silva, Felipe R.; Andrade, Alan C.

2012-01-01

The aim of this study was to investigate the molecular mechanisms underlying drought acclimation in coffee plants by the identification of candidate genes (CGs) using different approaches. The first approach used the data generated during the Brazilian Coffee expressed sequence tag (EST) project to select 13 CGs by an in silico analysis (electronic northern). The second approach was based on screening macroarrays spotted with plasmid DNA (coffee ESTs) with separate hybridizations using leaf cDNA probes from drought-tolerant and susceptible clones of Coffea canephora var. Conilon, grown under different water regimes. This allowed the isolation of seven additional CGs. The third approach used two-dimensional gel electrophoresis to identify proteins displaying differential accumulation in leaves of drought-tolerant and susceptible clones of C. canephora. Six of them were characterized by MALDI-TOF-MS/MS (matrix-assisted laser desorption-time of flight-tandem mass spectrometry) and the corresponding proteins were identified. Finally, additional CGs were selected from the literature, and quantitative real-time polymerase chain reaction (qPCR) was performed to analyse the expression of all identified CGs. Altogether, >40 genes presenting differential gene expression during drought acclimation were identified, some of them showing different expression profiles between drought-tolerant and susceptible clones. Based on the obtained results, it can be concluded that factors involved a complex network of responses probably involving the abscisic signalling pathway and nitric oxide are major molecular determinants that might explain the better efficiency in controlling stomata closure and transpiration displayed by drought-tolerant clones of C. canephora. PMID:22511801
Stress, burnout and depression: A systematic review on DNA methylation mechanisms.

PubMed

Bakusic, Jelena; Schaufeli, Wilmar; Claes, Stephan; Godderis, Lode

2017-01-01

Despite that burnout presents a serious burden for modern society, there are no diagnostic criteria. Additional difficulty is the differential diagnosis with depression. Consequently, there is a need to dispose of a burnout biomarker. Epigenetic studies suggest that DNA methylation is a possible mediator linking individual response to stress and psychopathology and could be considered as a potential biomarker of stress-related mental disorders. Thus, the aim of this review is to provide an overview of DNA methylation mechanisms in stress, burnout and depression. In addition to state-of-the-art overview, the goal of this review is to provide a scientific base for burnout biomarker research. We performed a systematic literature search and identified 25 pertinent articles. Among these, 15 focused on depression, 7 on chronic stress and only 3 on work stress/burnout. Three epigenome-wide studies were identified and the majority of studies used the candidate-gene approach, assessing 12 different genes. The glucocorticoid receptor gene (NR3C1) displayed different methylation patterns in chronic stress and depression. The serotonin transporter gene (SLC6A4) methylation was similarly affected in stress, depression and burnout. Work-related stress and depressive symptoms were associated with different methylation patterns of the brain derived neurotrophic factor gene (BDNF) in the same human sample. The tyrosine hydroxylase (TH) methylation was correlated with work stress in a single study. Additional, thoroughly designed longitudinal studies are necessary for revealing the cause-effect relationship of work stress, epigenetics and burnout, including its overlap with depression. Copyright © 2016 Elsevier Inc. All rights reserved.
The biosynthetic gene cluster for the cyanogenic glucoside dhurrin in Sorghum bicolor contains its co-expressed vacuolar MATE transporter

PubMed Central

Darbani, Behrooz; Motawia, Mohammed Saddik; Olsen, Carl Erik; Nour-Eldin, Hussam H.; Møller, Birger Lindberg; Rook, Fred

2016-01-01

Genomic gene clusters for the biosynthesis of chemical defence compounds are increasingly identified in plant genomes. We previously reported the independent evolution of biosynthetic gene clusters for cyanogenic glucoside biosynthesis in three plant lineages. Here we report that the gene cluster for the cyanogenic glucoside dhurrin in Sorghum bicolor additionally contains a gene, SbMATE2, encoding a transporter of the multidrug and toxic compound extrusion (MATE) family, which is co-expressed with the biosynthetic genes. The predicted localisation of SbMATE2 to the vacuolar membrane was demonstrated experimentally by transient expression of a SbMATE2-YFP fusion protein and confocal microscopy. Transport studies in Xenopus laevis oocytes demonstrate that SbMATE2 is able to transport dhurrin. In addition, SbMATE2 was able to transport non-endogenous cyanogenic glucosides, but not the anthocyanin cyanidin 3-O-glucoside or the glucosinolate indol-3-yl-methyl glucosinolate. The genomic co-localisation of a transporter gene with the biosynthetic genes producing the transported compound is discussed in relation to the role self-toxicity of chemical defence compounds may play in the formation of gene clusters. PMID:27841372
Comparative genomic analysis of Helicobacter pylori from Malaysia identifies three distinct lineages suggestive of differential evolution.

PubMed

Kumar, Narender; Mariappan, Vanitha; Baddam, Ramani; Lankapalli, Aditya K; Shaik, Sabiha; Goh, Khean-Lee; Loke, Mun Fai; Perkins, Tim; Benghezal, Mohammed; Hasnain, Seyed E; Vadivelu, Jamuna; Marshall, Barry J; Ahmed, Niyaz

2015-01-01

The discordant prevalence of Helicobacter pylori and its related diseases, for a long time, fostered certain enigmatic situations observed in the countries of the southern world. Variation in H. pylori infection rates and disease outcomes among different populations in multi-ethnic Malaysia provides a unique opportunity to understand dynamics of host-pathogen interaction and genome evolution. In this study, we extensively analyzed and compared genomes of 27 Malaysian H. pylori isolates and identified three major phylogeographic lineages: hspEastAsia, hpEurope and hpSouthIndia. The analysis of the virulence genes within the core genome, however, revealed a comparable pathogenic potential of the strains. In addition, we identified four genes limited to strains of East-Asian lineage. Our analyses identified a few strain-specific genes encoding restriction modification systems and outlined 311 core genes possibly under differential evolutionary constraints, among the strains representing different ethnic groups. The cagA and vacA genes also showed variations in accordance with the host genetic background of the strains. Moreover, restriction modification genes were found to be significantly enriched in East-Asian strains. An understanding of these variations in the genome content would provide significant insights into various adaptive and host modulation strategies harnessed by H. pylori to effectively persist in a host-specific manner. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Genome-wide identification of long non-coding RNA genes and their association with insecticide resistance and metamorphosis in diamondback moth, Plutella xylostella.

PubMed

Liu, Feiling; Guo, Dianhao; Yuan, Zhuting; Chen, Chen; Xiao, Huamei

2017-11-20

Long non-coding RNA (lncRNA) is a class of noncoding RNA >200 bp in length that has essential roles in regulating a variety of biological processes. Here, we constructed a computational pipeline to identify lncRNA genes in the diamondback moth (Plutella xylostella), a major insect pest of cruciferous vegetables. In total, 3,324 lncRNAs corresponding to 2,475 loci were identified from 13 RNA-Seq datasets, including samples from parasitized, insecticide-resistant strains and different developmental stages. The identified P. xylostella lncRNAs had shorter transcripts and fewer exons than protein-coding genes. Seven out of nine randomly selected lncRNAs were validated by strand-specific RT-PCR. In total, 54-172 lncRNAs were specifically expressed in the insecticide resistant strains, among which one lncRNA was located adjacent to the sodium channel gene. In addition, 63-135 lncRNAs were specifically expressed in different developmental stages, among which three lncRNAs overlapped or were located adjacent to the metamorphosis-associated genes. These lncRNAs were either strongly or weakly co-expressed with their overlapping or neighboring mRNA genes. In summary, we identified thousands of lncRNAs and presented evidence that lncRNAs might have key roles in conferring insecticide resistance and regulating the metamorphosis development in P. xylostella.
Diversity and evolution of myxozoan minicollagens and nematogalectins.

PubMed

Shpirer, Erez; Chang, E Sally; Diamant, Arik; Rubinstein, Nimrod; Cartwright, Paulyn; Huchon, Dorothée

2014-09-29

Myxozoa are a diverse group of metazoan parasites with a very simple organization, which has for decades eluded their evolutionary origin. Their most prominent and characteristic feature is the polar capsule: a complex intracellular structure of the myxozoan spore, which plays a role in host infection. Striking morphological similarities have been found between myxozoan polar capsules and nematocysts, the stinging structures of cnidarians (corals, sea anemones and jellyfish) leading to the suggestion that Myxozoa and Cnidaria share a more recent common ancestry. This hypothesis has recently been supported by phylogenomic evidence and by the identification of a nematocyst specific minicollagen gene in the myxozoan Tetracapsuloides bryosalmonae. Here we searched genomes and transcriptomes of several myxozoan taxa for the presence of additional cnidarian specific genes and characterized these genes within a phylogenetic context. Illumina assemblies of transcriptome or genome data of three myxozoan species (Enteromyxum leei, Kudoa iwatai, and Sphaeromyxa zaharoni) and of the enigmatic cnidarian parasite Polypodium hydriforme (Polypodiozoa) were mined using tBlastn searches with nematocyst-specific proteins as queries. Several orthologs of nematogalectins and minicollagens were identified. Our phylogenetic analyses indicate that myxozoans possess three distinct minicollagens. We found that the cnidarian repertoire of nematogalectins is more complex than previously thought and we identified additional members of the nematogalectin family. Cnidarians were found to possess four nematogalectin/ nematogalectin-related genes, while in myxozoans only three genes could be identified. Our results demonstrate that myxozoans possess a diverse array of genes that are taxonomically restricted to Cnidaria. Characterization of these genes provide compelling evidence that polar capsules and nematocysts are homologous structures and that myxozoans are highly degenerate cnidarians. The diversity of minicollagens was higher than previously thought, with the presence of three minicollagen genes in myxozoans. Our phylogenetic results suggest that the different myxozoan sequences are the results of ancient divergences within Cnidaria and not of recent specializations of the polar capsule. For both minicollagen and nematogalectin, our results show that myxozoans possess less gene copies than their cnidarian counter parts, suggesting that the polar capsule gene repertoire was simplified with their reduced body plan.
Hordeum chilense genome, a useful tool to investigate the endosperm yellow pigment content in the Triticeae

PubMed Central

2012-01-01

Background The wild barley Hordeum chilense fulfills some requirements for being a useful tool to investigate the endosperm yellow pigment content (YPC) in the Triticeae including its diploid constitution, the availability of genetic resources (addition and deletion stocks and a high density genetic map) and, especially, its high seed YPC not silenced in tritordeums (amphiploids derived from H. chilense and wheat). Thus, the aim of this work was to test the utility of the H. chilense genome for investigating the YPC in the Triticeae. Results Twelve genes related to endosperm carotenoid content and/or YPC in grasses (Dxr, Hdr [synonym ispH], Ggpps1, Psy2, Psy3, Pds, Zds, e-Lcy, b-Lcy, Hyd3, Ccd1 and Ppo1) were identified, and mapped in H. chilense using rice genes to identify orthologs from barley, wheat, sorghum and maize. Macrocolinearity studies revealed that gene positions were in agreement in H. vulgare and H. chilense. Additionally, three main regions associated with YPC were identified in chromosomes 2Hch, 3Hch and 7Hch in H. chilense, the former being the most significant one. Conclusions The results obtained are consistent with previous findings in wheat and suggest that Ggpps1, Zds and Hyd3 on chromosome 2Hch may be considered candidate genes in wheat for further studies in YPC improvement. Considering the syntenic location of carotenoid genes in H. chilense, we have concluded that the Hch genome may constitute a valuable tool for YPC studies in the Triticeae. PMID:23122232
Searching for the molecular benchmark of physiological intestinal anastomotic healing in rats: an experimental study.

PubMed

Seifert, Gabriel J; Seifert, Michael; Kulemann, Birte; Holzner, Philipp A; Glatz, Torben; Timme, Sylvia; Sick, Olivia; Höppner, Jens; Hopt, Ulrich T; Marjanovic, Goran

2014-01-01

This investigation focuses on the physiological characteristics of gene transcription of intestinal tissue following anastomosis formation. In eight rats, end-to-end ileo-ileal anastomoses were performed (n = 2/group). The healthy intestinal tissue resected for this operation was used as a control. On days 0, 2, 4 and 8, 10-mm perianastomotic segments were resected. Control and perianastomotic segments were examined with an Affymetrix microarray chip to assess changes in gene regulation. Microarray findings were validated using real-time PCR for selected genes. In addition to screening global gene expression, we identified genes intensely regulated during healing and also subjected our data sets to an overrepresentation analysis using the Gene Ontology (GO) and Kyoto Encyclopedia for Genes and Genomes (KEGG). Compared to the control group, we observed that the number of differentially regulated genes peaked on day 2 with a total of 2,238 genes, decreasing by day 4 to 1,687 genes and to 1,407 genes by day 8. PCR validation for matrix metalloproteinases-3 and -13 showed not only identical transcription patterns but also analogous regulation intensity. When setting the cutoff of upregulation at 10-fold to identify genes likely to be relevant, the total gene count was significantly lower with 55, 45 and 37 genes on days 2, 4 and 8, respectively. A total of 947 GO subcategories were significantly overrepresented during anastomotic healing. Furthermore, 23 overrepresented KEGG pathways were identified. This study is the first of its kind that focuses explicitly on gene transcription during intestinal anastomotic healing under standardized conditions. Our work sets a foundation for further studies toward a more profound understanding of the physiology of anastomotic healing.
Evolutionary Distance of Amino Acid Sequence Orthologs across Macaque Subspecies: Identifying Candidate Genes for SIV Resistance in Chinese Rhesus Macaques

PubMed Central

Ross, Cody T.; Roodgar, Morteza; Smith, David Glenn

2015-01-01

We use the Reciprocal Smallest Distance (RSD) algorithm to identify amino acid sequence orthologs in the Chinese and Indian rhesus macaque draft sequences and estimate the evolutionary distance between such orthologs. We then use GOanna to map gene function annotations and human gene identifiers to the rhesus macaque amino acid sequences. We conclude methodologically by cross-tabulating a list of amino acid orthologs with large divergence scores with a list of genes known to be involved in SIV or HIV pathogenesis. We find that many of the amino acid sequences with large evolutionary divergence scores, as calculated by the RSD algorithm, have been shown to be related to HIV pathogenesis in previous laboratory studies. Four of the strongest candidate genes for SIVmac resistance in Chinese rhesus macaques identified in this study are CDK9, CXCL12, TRIM21, and TRIM32. Additionally, ANKRD30A, CTSZ, GORASP2, GTF2H1, IL13RA1, MUC16, NMDAR1, Notch1, NT5M, PDCD5, RAD50, and TM9SF2 were identified as possible candidates, among others. We failed to find many laboratory experiments contrasting the effects of Indian and Chinese orthologs at these sites on SIVmac pathogenesis, but future comparative studies might hold fertile ground for research into the biological mechanisms underlying innate resistance to SIVmac in Chinese rhesus macaques. PMID:25884674
Mechanisms of crystalline silica-induced pulmonary toxicity revealed by global gene expression profiling

PubMed Central

Sellamuthu, Rajendran; Umbright, Christina; Li, Shengqiao; Kashon, Michael; Joseph, Pius

2015-01-01

A proper understanding of the mechanisms underlying crystalline silica-induced pulmonary toxicity has implications in the management and potential prevention of the adverse health effects associated with silica exposure including silicosis, cancer and several auto-immune diseases. Human lung type II epithelial cells and rat lungs exposed to crystalline silica were employed as experimental models to determine global gene expression changes in order to understand the molecular mechanisms underlying silica-induced pulmonary toxicity. The differential gene expression profile induced by silica correlated with its toxicity in the A549 cells. The biological processes perturbed by silica exposure in the A549 cells and rat lungs, as identified by the bioinformatics analysis of the differentially expressed genes, demonstrated significant similarity. Functional categorization of the differentially expressed genes identified cancer, cellular movement, cellular growth and proliferation, cell death, inflammatory response, cell cycle, cellular development, and genetic disorder as top ranking biological functions perturbed by silica exposure in A549 cells and rat lungs. Results of our study, in addition to confirming several previously identified molecular targets and mechanisms involved in silica toxicity, identified novel molecular targets and mechanisms potentially involved in silica-induced pulmonary toxicity. Further investigations, including those focused on the novel molecular targets and mechanisms identified in the current study may result in better management and, possibly, reduction and/or prevention of the potential adverse health effects associated with crystalline silica exposure. PMID:22087542
Evaluation of Gene-Based Family-Based Methods to Detect Novel Genes Associated With Familial Late Onset Alzheimer Disease

PubMed Central

Fernández, Maria V.; Budde, John; Del-Aguila, Jorge L.; Ibañez, Laura; Deming, Yuetiva; Harari, Oscar; Norton, Joanne; Morris, John C.; Goate, Alison M.; Cruchaga, Carlos

2018-01-01

Gene-based tests to study the combined effect of rare variants on a particular phenotype have been widely developed for case-control studies, but their evolution and adaptation for family-based studies, especially studies of complex incomplete families, has been slower. In this study, we have performed a practical examination of all the latest gene-based methods available for family-based study designs using both simulated and real datasets. We examined the performance of several collapsing, variance-component, and transmission disequilibrium tests across eight different software packages and 22 models utilizing a cohort of 285 families (N = 1,235) with late-onset Alzheimer disease (LOAD). After a thorough examination of each of these tests, we propose a methodological approach to identify, with high confidence, genes associated with the tested phenotype and we provide recommendations to select the best software and model for family-based gene-based analyses. Additionally, in our dataset, we identified PTK2B, a GWAS candidate gene for sporadic AD, along with six novel genes (CHRD, CLCN2, HDLBP, CPAMD8, NLRP9, and MAS1L) as candidate genes for familial LOAD. PMID:29670507

Repressed expression of a gene for a basic helix-loop-helix protein causes a white flower phenotype in carnation

PubMed Central

Totsuka, Akane; Okamoto, Emi; Miyahara, Taira; Kouno, Takanobu; Cano, Emilio A.; Sasaki, Nobuhiro; Watanabe, Aiko; Tasaki, Keisuke; Nishihara, Masahiro; Ozeki, Yoshihiro

2018-01-01

In a previous study, two genes responsible for white flower phenotypes in carnation were identified. These genes encoded enzymes involved in anthocyanin synthesis, namely, flavanone 3-hydroxylase (F3H) and dihydroflavonol 4-reductase (DFR), and showed reduced expression in the white flower phenotypes. Here, we identify another candidate gene for white phenotype in carnation flowers using an RNA-seq analysis followed by RT-PCR. This candidate gene encodes a transcriptional regulatory factor of the basic helix-loop-helix (bHLH) type. In the cultivar examined here, both F3H and DFR genes produced active enzyme proteins; however, expression of DFR and of genes for enzymes involved in the downstream anthocyanin synthetic pathway from DFR was repressed in the absence of bHLH expression. Occasionally, flowers of the white flowered cultivar used here have red speckles and stripes on the white petals. We found that expression of bHLH occurred in these red petal segments and induced expression of DFR and the following downstream enzymes. Our results indicate that a member of the bHLH superfamily is another gene involved in anthocyanin synthesis in addition to structural genes encoding enzymes. PMID:29681756
Evaluation of Gene-Based Family-Based Methods to Detect Novel Genes Associated With Familial Late Onset Alzheimer Disease.

PubMed

Fernández, Maria V; Budde, John; Del-Aguila, Jorge L; Ibañez, Laura; Deming, Yuetiva; Harari, Oscar; Norton, Joanne; Morris, John C; Goate, Alison M; Cruchaga, Carlos

2018-01-01

Gene-based tests to study the combined effect of rare variants on a particular phenotype have been widely developed for case-control studies, but their evolution and adaptation for family-based studies, especially studies of complex incomplete families, has been slower. In this study, we have performed a practical examination of all the latest gene-based methods available for family-based study designs using both simulated and real datasets. We examined the performance of several collapsing, variance-component, and transmission disequilibrium tests across eight different software packages and 22 models utilizing a cohort of 285 families ( N = 1,235) with late-onset Alzheimer disease (LOAD). After a thorough examination of each of these tests, we propose a methodological approach to identify, with high confidence, genes associated with the tested phenotype and we provide recommendations to select the best software and model for family-based gene-based analyses. Additionally, in our dataset, we identified PTK2B , a GWAS candidate gene for sporadic AD, along with six novel genes ( CHRD, CLCN2, HDLBP, CPAMD8, NLRP9 , and MAS1L ) as candidate genes for familial LOAD.
A CRISPR-Based Screen Identifies Genes Essential for West-Nile-Virus-Induced Cell Death.

PubMed

Ma, Hongming; Dang, Ying; Wu, Yonggan; Jia, Gengxiang; Anaya, Edgar; Zhang, Junli; Abraham, Sojan; Choi, Jang-Gi; Shi, Guojun; Qi, Ling; Manjunath, N; Wu, Haoquan

2015-07-28

West Nile virus (WNV) causes an acute neurological infection attended by massive neuronal cell death. However, the mechanism(s) behind the virus-induced cell death is poorly understood. Using a library containing 77,406 sgRNAs targeting 20,121 genes, we performed a genome-wide screen followed by a second screen with a sub-library. Among the genes identified, seven genes, EMC2, EMC3, SEL1L, DERL2, UBE2G2, UBE2J1, and HRD1, stood out as having the strongest phenotype, whose knockout conferred strong protection against WNV-induced cell death with two different WNV strains and in three cell lines. Interestingly, knockout of these genes did not block WNV replication. Thus, these appear to be essential genes that link WNV replication to downstream cell death pathway(s). In addition, the fact that all of these genes belong to the ER-associated protein degradation (ERAD) pathway suggests that this might be the primary driver of WNV-induced cell death. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Identification of drought-responsive genes in roots of upland rice (Oryza sativa L)

PubMed Central

Rabello, Aline R; Guimarães, Cléber M; Rangel, Paulo HN; da Silva, Felipe R; Seixas, Daniela; de Souza, Emanuel; Brasileiro, Ana CM; Spehar, Carlos R; Ferreira, Márcio E; Mehta, Ângela

2008-01-01

Background Rice (Oryza sativa L.) germplasm represents an extraordinary source of genes that control traits of agronomic importance such as drought tolerance. This diversity is the basis for the development of new cultivars better adapted to water restriction conditions, in particular for upland rice, which is grown under rainfall. The analyses of subtractive cDNA libraries and differential protein expression of drought tolerant and susceptible genotypes can contribute to the understanding of the genetic control of water use efficiency in rice. Results Two subtractive libraries were constructed using cDNA of drought susceptible and tolerant genotypes submitted to stress against cDNA of well-watered plants. In silico analysis revealed 463 reads, which were grouped into 282 clusters. Several genes expressed exclusively in the tolerant or susceptible genotypes were identified. Additionally, proteome analysis of roots from stressed plants was performed and 22 proteins putatively associated to drought tolerance were identified by mass spectrometry. Conclusion Several genes and proteins involved in drought-response, as well as genes with no described homologs were identified. Genes exclusively expressed in the tolerant genotype were, in general, related to maintenance of turgor and cell integrity. In contrast, in the susceptible genotype, expression of genes involved in protection against cell damage was not detected. Several protein families identified in the proteomic analysis were not detected in the cDNA analysis. There is an indication that the mechanisms of susceptibility to drought in upland rice are similar to those of lowland varieties. PMID:18922162
Analysis of resistance genes of clinical Pannonibacter phragmitetus strain 31801 by complete genome sequencing.

PubMed

Ming, De-Song; Chen, Qing-Qing; Chen, Xiao-Tin

2018-05-14

To clarify the resistance mechanisms of Pannonibacter phragmitetus 31801, isolated from the blood of a liver abscess patient, at the genomic level, we performed whole genomic sequencing using a PacBio RS II single-molecule real-time long-read sequencer. Bioinformatic analysis of the resulting sequence was then carried out to identify any possible resistance genes. Analyses included Basic Local Alignment Search Tool searches against the Antibiotic Resistance Genes Database, ResFinder analysis of the genome sequence, and Resistance Gene Identifier analysis within the Comprehensive Antibiotic Resistance Database. Prophages, clustered regularly interspaced short palindromic repeats (CRISPR), and other putative virulence factors were also identified using PHAST, CRISPRfinder, and the Virulence Factors Database, respectively. The circular chromosome and single plasmid of P. phragmitetus 31801 contained multiple antibiotic resistance genes, including those coding for three different types of β-lactamase [NPS β-lactamase (EC 3.5.2.6), β-lactamase class C, and a metal-dependent hydrolase of β-lactamase superfamily I]. In addition, genes coding for subunits of several multidrug-resistance efflux pumps were identified, including those targeting macrolides (adeJ, cmeB), tetracycline (acrB, adeAB), fluoroquinolones (acrF, ceoB), and aminoglycosides (acrD, amrB, ceoB, mexY, smeB). However, apart from the tripartite macrolide efflux pump macAB-tolC, the genome did not appear to contain the complete complement of subunit genes required for production of most of the major multidrug-resistance efflux pumps.
Comparative Metagenomics Revealed Commonly Enriched Gene Sets in Human Gut Microbiomes

PubMed Central

Kurokawa, Ken; Itoh, Takehiko; Kuwahara, Tomomi; Oshima, Kenshiro; Toh, Hidehiro; Toyoda, Atsushi; Takami, Hideto; Morita, Hidetoshi; Sharma, Vineet K.; Srivastava, Tulika P.; Taylor, Todd D.; Noguchi, Hideki; Mori, Hiroshi; Ogura, Yoshitoshi; Ehrlich, Dusko S.; Itoh, Kikuji; Takagi, Toshihisa; Sakaki, Yoshiyuki; Hayashi, Tetsuya; Hattori, Masahira

2007-01-01

Numerous microbes inhabit the human intestine, many of which are uncharacterized or uncultivable. They form a complex microbial community that deeply affects human physiology. To identify the genomic features common to all human gut microbiomes as well as those variable among them, we performed a large-scale comparative metagenomic analysis of fecal samples from 13 healthy individuals of various ages, including unweaned infants. We found that, while the gut microbiota from unweaned infants were simple and showed a high inter-individual variation in taxonomic and gene composition, those from adults and weaned children were more complex but showed a high functional uniformity regardless of age or sex. In searching for the genes over-represented in gut microbiomes, we identified 237 gene families commonly enriched in adult-type and 136 families in infant-type microbiomes, with a small overlap. An analysis of their predicted functions revealed various strategies employed by each type of microbiota to adapt to its intestinal environment, suggesting that these gene sets encode the core functions of adult and infant-type gut microbiota. By analysing the orphan genes, 647 new gene families were identified to be exclusively present in human intestinal microbiomes. In addition, we discovered a conjugative transposon family explosively amplified in human gut microbiomes, which strongly suggests that the intestine is a ‘hot spot’ for horizontal gene transfer between microbes. PMID:17916580
Genetic Determinants Influencing Human Serum Metabolome among African Americans

PubMed Central

Yu, Bing; Zheng, Yan; Alexander, Danny; Morrison, Alanna C.; Coresh, Josef; Boerwinkle, Eric

2014-01-01

Phenotypes proximal to gene action generally reflect larger genetic effect sizes than those that are distant. The human metabolome, a result of multiple cellular and biological processes, are functional intermediate phenotypes proximal to gene action. Here, we present a genome-wide association study of 308 untargeted metabolite levels among African Americans from the Atherosclerosis Risk in Communities (ARIC) Study. Nineteen significant common variant-metabolite associations were identified, including 13 novel loci (p<1.6×10−10). These loci were associated with 7–50% of the difference in metabolite levels per allele, and the variance explained ranged from 4% to 20%. Fourteen genes were identified within the nineteen loci, and four of them contained non-synonymous substitutions in four enzyme-encoding genes (KLKB1, SIAE, CPS1, and NAT8); the other significant loci consist of eight other enzyme-encoding genes (ACE, GATM, ACY3, ACSM2B, THEM4, ADH4, UGT1A, TREH), a transporter gene (SLC6A13) and a polycystin protein gene (PKD2L1). In addition, four potential disease-associated paths were identified, including two direct longitudinal predictive relationships: NAT8 with N-acetylornithine, N-acetyl-1-methylhistidine and incident chronic kidney disease, and TREH with trehalose and incident diabetes. These results highlight the value of using endophenotypes proximal to gene function to discover new insights into biology and disease pathology. PMID:24625756
A serine–arginine-rich (SR) splicing factor modulates alternative splicing of over a thousand genes in Toxoplasma gondii

PubMed Central

Yeoh, Lee M.; Goodman, Christopher D.; Hall, Nathan E.; van Dooren, Giel G.; McFadden, Geoffrey I.; Ralph, Stuart A.

2015-01-01

Single genes are often subject to alternative splicing, which generates alternative mature mRNAs. This phenomenon is widespread in animals, and observed in over 90% of human genes. Recent data suggest it may also be common in Apicomplexa. These parasites have small genomes, and economy of DNA is evolutionarily favoured in this phylum. We investigated the mechanism of alternative splicing in Toxoplasma gondii, and have identified and localized TgSR3, a homologue of ASF/SF2 (alternative-splicing factor/splicing factor 2, a serine-arginine–rich, or SR protein) to a subnuclear compartment. In addition, we conditionally overexpressed this protein, which was deleterious to growth. qRT-PCR was used to confirm perturbation of splicing in a known alternatively-spliced gene. We performed high-throughput RNA-seq to determine the extent of splicing modulated by this protein. Current RNA-seq algorithms are poorly suited to compact parasite genomes, and hence we complemented existing tools by writing a new program, GeneGuillotine, that addresses this deficiency by segregating overlapping reads into distinct genes. In order to identify the extent of alternative splicing, we released another program, JunctionJuror, that detects changes in intron junctions. Using this program, we identified about 2000 genes that were constitutively alternatively spliced in T. gondii. Overexpressing the splice regulator TgSR3 perturbed alternative splicing in over 1000 genes. PMID:25870410
Genome-wide identification of the SWEET gene family in wheat.

PubMed

Gao, Yue; Wang, Zi Yuan; Kumar, Vikranth; Xu, Xiao Feng; Yuan, De Peng; Zhu, Xiao Feng; Li, Tian Ya; Jia, Baolei; Xuan, Yuan Hu

2018-02-05

The SWEET (sugars will eventually be exported transporter) family is a newly characterized group of sugar transporters. In plants, the key roles of SWEETs in phloem transport, nectar secretion, pollen nutrition, stress tolerance, and plant-pathogen interactions have been identified. SWEET family genes have been characterized in many plant species, but a comprehensive analysis of SWEET members has not yet been performed in wheat. Here, 59 wheat SWEETs (hereafter TaSWEETs) were identified through homology searches. Analyses of phylogenetic relationships, numbers of transmembrane helices (TMHs), gene structures, and motifs showed that TaSWEETs carrying 3-7 TMHs could be classified into four clades with 10 different types of motifs. Examination of the expression patterns of 18 SWEET genes revealed that a few are tissue-specific while most are ubiquitously expressed. In addition, the stem rust-mediated expression patterns of SWEET genes were monitored using a stem rust-susceptible cultivar, 'Little Club' (LC). The resulting data showed that the expression of five out of the 18 SWEETs tested was induced following inoculation. In conclusion, we provide the first comprehensive analysis of the wheat SWEET gene family. Information regarding the phylogenetic relationships, gene structures, and expression profiles of SWEET genes in different tissues and following stem rust disease inoculation will be useful in identifying the potential roles of SWEETs in specific developmental and pathogenic processes. Copyright © 2017 Elsevier B.V. All rights reserved.
Pancreatic cancer genomes reveal aberrations in axon guidance pathway genes

PubMed Central

Biankin, Andrew V.; Waddell, Nicola; Kassahn, Karin S.; Gingras, Marie-Claude; Muthuswamy, Lakshmi B.; Johns, Amber L.; Miller, David K.; Wilson, Peter J.; Patch, Ann-Marie; Wu, Jianmin; Chang, David K.; Cowley, Mark J.; Gardiner, Brooke B.; Song, Sarah; Harliwong, Ivon; Idrisoglu, Senel; Nourse, Craig; Nourbakhsh, Ehsan; Manning, Suzanne; Wani, Shivangi; Gongora, Milena; Pajic, Marina; Scarlett, Christopher J.; Gill, Anthony J.; Pinho, Andreia V.; Rooman, Ilse; Anderson, Matthew; Holmes, Oliver; Leonard, Conrad; Taylor, Darrin; Wood, Scott; Xu, Qinying; Nones, Katia; Fink, J. Lynn; Christ, Angelika; Bruxner, Tim; Cloonan, Nicole; Kolle, Gabriel; Newell, Felicity; Pinese, Mark; Mead, R. Scott; Humphris, Jeremy L.; Kaplan, Warren; Jones, Marc D.; Colvin, Emily K.; Nagrial, Adnan M.; Humphrey, Emily S.; Chou, Angela; Chin, Venessa T.; Chantrill, Lorraine A.; Mawson, Amanda; Samra, Jaswinder S.; Kench, James G.; Lovell, Jessica A.; Daly, Roger J.; Merrett, Neil D.; Toon, Christopher; Epari, Krishna; Nguyen, Nam Q.; Barbour, Andrew; Zeps, Nikolajs; Kakkar, Nipun; Zhao, Fengmei; Wu, Yuan Qing; Wang, Min; Muzny, Donna M.; Fisher, William E.; Brunicardi, F. Charles; Hodges, Sally E.; Reid, Jeffrey G.; Drummond, Jennifer; Chang, Kyle; Han, Yi; Lewis, Lora R.; Dinh, Huyen; Buhay, Christian J.; Beck, Timothy; Timms, Lee; Sam, Michelle; Begley, Kimberly; Brown, Andrew; Pai, Deepa; Panchal, Ami; Buchner, Nicholas; De Borja, Richard; Denroche, Robert E.; Yung, Christina K.; Serra, Stefano; Onetto, Nicole; Mukhopadhyay, Debabrata; Tsao, Ming-Sound; Shaw, Patricia A.; Petersen, Gloria M.; Gallinger, Steven; Hruban, Ralph H.; Maitra, Anirban; Iacobuzio-Donahue, Christine A.; Schulick, Richard D.; Wolfgang, Christopher L.; Morgan, Richard A.; Lawlor, Rita T.; Capelli, Paola; Corbo, Vincenzo; Scardoni, Maria; Tortora, Giampaolo; Tempero, Margaret A.; Mann, Karen M.; Jenkins, Nancy A.; Perez-Mancera, Pedro A.; Adams, David J.; Largaespada, David A.; Wessels, Lodewyk F. A.; Rust, Alistair G.; Stein, Lincoln D.; Tuveson, David A.; Copeland, Neal G.; Musgrove, Elizabeth A.; Scarpa, Aldo; Eshleman, James R.; Hudson, Thomas J.; Sutherland, Robert L.; Wheeler, David A.; Pearson, John V.; McPherson, John D.; Gibbs, Richard A.; Grimmond, Sean M.

2012-01-01

Pancreatic cancer is a highly lethal malignancy with few effective therapies. We performed exome sequencing and copy number analysis to define genomic aberrations in a prospectively accrued clinical cohort (n = 142) of early (stage I and II) sporadic pancreatic ductal adenocarcinoma. Detailed analysis of 99 informative tumours identified substantial heterogeneity with 2,016 non-silent mutations and 1,628 copy-number variations. We define 16 significantly mutated genes, reaffirming known mutations (KRAS, TP53, CDKN2A, SMAD4, MLL3, TGFBR2, ARID1A and SF3B1), and uncover novel mutated genes including additional genes involved in chromatin modification (EPC1 and ARID2), DNA damage repair (ATM) and other mechanisms (ZIM2, MAP2K4, NALCN, SLC16A4 and MAGEA6). Integrative analysis with in vitro functional data and animal models provided supportive evidence for potential roles for these genetic aberrations in carcinogenesis. Pathway-based analysis of recurrently mutated genes recapitulated clustering in core signalling pathways in pancreatic ductal adenocarcinoma, and identified new mutated genes in each pathway. We also identified frequent and diverse somatic aberrations in genes described traditionally as embryonic regulators of axon guidance, particularly SLIT/ROBO signalling, which was also evident in murine Sleeping Beauty transposon-mediated somatic mutagenesis models of pancreatic cancer, providing further supportive evidence for the potential involvement of axon guidance genes in pancreatic carcinogenesis. PMID:23103869
Pancreatic cancer genomes reveal aberrations in axon guidance pathway genes.

PubMed

Biankin, Andrew V; Waddell, Nicola; Kassahn, Karin S; Gingras, Marie-Claude; Muthuswamy, Lakshmi B; Johns, Amber L; Miller, David K; Wilson, Peter J; Patch, Ann-Marie; Wu, Jianmin; Chang, David K; Cowley, Mark J; Gardiner, Brooke B; Song, Sarah; Harliwong, Ivon; Idrisoglu, Senel; Nourse, Craig; Nourbakhsh, Ehsan; Manning, Suzanne; Wani, Shivangi; Gongora, Milena; Pajic, Marina; Scarlett, Christopher J; Gill, Anthony J; Pinho, Andreia V; Rooman, Ilse; Anderson, Matthew; Holmes, Oliver; Leonard, Conrad; Taylor, Darrin; Wood, Scott; Xu, Qinying; Nones, Katia; Fink, J Lynn; Christ, Angelika; Bruxner, Tim; Cloonan, Nicole; Kolle, Gabriel; Newell, Felicity; Pinese, Mark; Mead, R Scott; Humphris, Jeremy L; Kaplan, Warren; Jones, Marc D; Colvin, Emily K; Nagrial, Adnan M; Humphrey, Emily S; Chou, Angela; Chin, Venessa T; Chantrill, Lorraine A; Mawson, Amanda; Samra, Jaswinder S; Kench, James G; Lovell, Jessica A; Daly, Roger J; Merrett, Neil D; Toon, Christopher; Epari, Krishna; Nguyen, Nam Q; Barbour, Andrew; Zeps, Nikolajs; Kakkar, Nipun; Zhao, Fengmei; Wu, Yuan Qing; Wang, Min; Muzny, Donna M; Fisher, William E; Brunicardi, F Charles; Hodges, Sally E; Reid, Jeffrey G; Drummond, Jennifer; Chang, Kyle; Han, Yi; Lewis, Lora R; Dinh, Huyen; Buhay, Christian J; Beck, Timothy; Timms, Lee; Sam, Michelle; Begley, Kimberly; Brown, Andrew; Pai, Deepa; Panchal, Ami; Buchner, Nicholas; De Borja, Richard; Denroche, Robert E; Yung, Christina K; Serra, Stefano; Onetto, Nicole; Mukhopadhyay, Debabrata; Tsao, Ming-Sound; Shaw, Patricia A; Petersen, Gloria M; Gallinger, Steven; Hruban, Ralph H; Maitra, Anirban; Iacobuzio-Donahue, Christine A; Schulick, Richard D; Wolfgang, Christopher L; Morgan, Richard A; Lawlor, Rita T; Capelli, Paola; Corbo, Vincenzo; Scardoni, Maria; Tortora, Giampaolo; Tempero, Margaret A; Mann, Karen M; Jenkins, Nancy A; Perez-Mancera, Pedro A; Adams, David J; Largaespada, David A; Wessels, Lodewyk F A; Rust, Alistair G; Stein, Lincoln D; Tuveson, David A; Copeland, Neal G; Musgrove, Elizabeth A; Scarpa, Aldo; Eshleman, James R; Hudson, Thomas J; Sutherland, Robert L; Wheeler, David A; Pearson, John V; McPherson, John D; Gibbs, Richard A; Grimmond, Sean M

2012-11-15

Pancreatic cancer is a highly lethal malignancy with few effective therapies. We performed exome sequencing and copy number analysis to define genomic aberrations in a prospectively accrued clinical cohort (n = 142) of early (stage I and II) sporadic pancreatic ductal adenocarcinoma. Detailed analysis of 99 informative tumours identified substantial heterogeneity with 2,016 non-silent mutations and 1,628 copy-number variations. We define 16 significantly mutated genes, reaffirming known mutations (KRAS, TP53, CDKN2A, SMAD4, MLL3, TGFBR2, ARID1A and SF3B1), and uncover novel mutated genes including additional genes involved in chromatin modification (EPC1 and ARID2), DNA damage repair (ATM) and other mechanisms (ZIM2, MAP2K4, NALCN, SLC16A4 and MAGEA6). Integrative analysis with in vitro functional data and animal models provided supportive evidence for potential roles for these genetic aberrations in carcinogenesis. Pathway-based analysis of recurrently mutated genes recapitulated clustering in core signalling pathways in pancreatic ductal adenocarcinoma, and identified new mutated genes in each pathway. We also identified frequent and diverse somatic aberrations in genes described traditionally as embryonic regulators of axon guidance, particularly SLIT/ROBO signalling, which was also evident in murine Sleeping Beauty transposon-mediated somatic mutagenesis models of pancreatic cancer, providing further supportive evidence for the potential involvement of axon guidance genes in pancreatic carcinogenesis.
Integrated Network Analysis Identifies Fight-Club Nodes as a Class of Hubs Encompassing Key Putative Switch Genes That Induce Major Transcriptome Reprogramming during Grapevine Development[W][OPEN

PubMed Central

Palumbo, Maria Concetta; Zenoni, Sara; Fasoli, Marianna; Massonnet, Mélanie; Farina, Lorenzo; Castiglione, Filippo; Pezzotti, Mario; Paci, Paola

2014-01-01

We developed an approach that integrates different network-based methods to analyze the correlation network arising from large-scale gene expression data. By studying grapevine (Vitis vinifera) and tomato (Solanum lycopersicum) gene expression atlases and a grapevine berry transcriptomic data set during the transition from immature to mature growth, we identified a category named “fight-club hubs” characterized by a marked negative correlation with the expression profiles of neighboring genes in the network. A special subset named “switch genes” was identified, with the additional property of many significant negative correlations outside their own group in the network. Switch genes are involved in multiple processes and include transcription factors that may be considered master regulators of the previously reported transcriptome remodeling that marks the developmental shift from immature to mature growth. All switch genes, expressed at low levels in vegetative/green tissues, showed a significant increase in mature/woody organs, suggesting a potential regulatory role during the developmental transition. Finally, our analysis of tomato gene expression data sets showed that wild-type switch genes are downregulated in ripening-deficient mutants. The identification of known master regulators of tomato fruit maturation suggests our method is suitable for the detection of key regulators of organ development in different fleshy fruit crops. PMID:25490918
The gene for PAX7, a member of the paired-box-containing genes, is localized on human chromosome arm 1p36.

PubMed

Shapiro, D N; Sublett, J E; Li, B; Valentine, M B; Morris, S W; Noll, M

1993-09-01

The murine Pax-7 gene and the cognate human gene, formerly designated HuP1, are members of the multigene paired-box-containing class of developmental regulatory genes first identified in Drosophila. By analysis of somatic cell hybrids segregating human chromosomes, the gene encoding PAX7 was localized to human chromosome 1. Fluorescence in situ hybridization confirmed this assignment and allowed mapping of the gene to the terminal region of the short arm (1p36) of the chromosome. Additionally, these results confirm the extensive homology between human chromosome 1p and the distal segment of mouse chromosome 4, extending from bands C5 through E2.
Behavioral science and the study of gene-nutrition and gene-physical activity interactions in obesity research.

PubMed

Faith, Myles S

2008-12-01

This report summarizes emerging opportunities for behavioral science to help advance the field of gene-environment and gene-behavior interactions, based on presentations at The National Cancer Institute (NCI) Workshop, "Gene-Nutrition and Gene-Physical Activity Interactions in the Etiology of Obesity." Three opportunities are highlighted: (i) designing potent behavioral "challenges" in experiments, (ii) determining viable behavioral phenotypes for genetics studies, and (iii) identifying specific measures of the environment or environmental exposures. Additional points are underscored, including the need to incorporate novel findings from neuroimaging studies regarding motivation and drive for eating and physical activity. Advances in behavioral science theory and methods can play an important role in advancing understanding of gene-brain-behavior relationships in obesity onset.
Host genetics of response to porcine reproductive and respiratory syndrome in nursery pigs.

PubMed

Dekkers, Jack; Rowland, Raymond R R; Lunney, Joan K; Plastow, Graham

2017-09-01

PRRS is the most costly disease in the US pig industry. While vaccination, biosecurity and eradication effort have had some success, the variability and infectiousness of PRRS virus strains have hampered the effectiveness of these measures. We propose the use of genetic selection of pigs as an additional and complementary effort. Several studies have shown that host response to PRRS infection has a sizeable genetic component and recent advances in genomics provide opportunities to capitalize on these genetic differences and improve our understanding of host response to PRRS. While work is also ongoing to understand the genetic basis of host response to reproductive PRRS, the focus of this review is on research conducted on host response to PRRS in the nursery and grow-finish phase as part of the PRRS Host Genetics Consortium. Using experimental infection of large numbers of commercial nursery pigs, combined with deep phenotyping and genomics, this research has identified a major gene that is associated with host response to PRRS. Further functional genomics work identified the GBP5 gene as harboring the putative causative mutation. GBP5 is associated with innate immune response. Subsequent work has validated the effect of this genomic region on host response to a second PRRSV strain and to PRRS vaccination and co-infection of nursery pigs with PRRSV and PCV2b. A genetic marker near GBP5 is available to the industry for use in selection. Genetic differences in host response beyond GBP5 appear to be highly polygenic, i.e. controlled by many genes across the genome, each with a small effect. Such effects can by capitalized on in a selection program using genomic prediction on large numbers of genetic markers across the genome. Additional work has also identified the genetic basis of antibody response to PRRS, which could lead to the use of vaccine response as an indicator trait to select for host response to PRRS. Other genomic analyses, including gene expression analyses, have identified genes and modules of genes that are associated with differences in host response to PRRS and can be used to further understand and utilize differences in host response. Together, these results demonstrate that genetic selection can be an additional and complementary tool to combat PRRS in the swine industry. Copyright © 2017 Elsevier B.V. All rights reserved.
Genetic Analysis of Reduced γ-Tocopherol Content in Ethiopian Mustard Seeds.

PubMed

García-Navarro, Elena; Fernández-Martínez, José M; Pérez-Vich, Begoña; Velasco, Leonardo

2016-01-01

Ethiopian mustard (Brassica carinata A. Braun) line BCT-6, with reduced γ-tocopherol content in the seeds, has been previously developed. The objective of this research was to conduct a genetic analysis of seed tocopherols in this line. BCT-6 was crossed with the conventional line C-101 and the F1, F2, and BC plant generations were analyzed. Generation mean analysis using individual scaling tests indicated that reduced γ-tocopherol content fitted an additive-dominant genetic model with predominance of additive effects and absence of epistatic interactions. This was confirmed through a joint scaling test and additional testing of the goodness of fit of the model. Conversely, epistatic interactions were identified for total tocopherol content. Estimation of the minimum number of genes suggested that both γ- and total tocopherol content may be controlled by two genes. A positive correlation between total tocopherol content and the proportion of γ-tocopherol was identified in the F2 generation. Additional research on the feasibility of developing germplasm with high tocopherol content and reduced concentration of γ-tocopherol is required.
Discovery of mutations in homologous recombination genes in African-American women with breast cancer.

PubMed

Ding, Yuan Chun; Adamson, Aaron W; Steele, Linda; Bailis, Adam M; John, Esther M; Tomlinson, Gail; Neuhausen, Susan L

2018-04-01

African-American women are more likely to develop aggressive breast cancer at younger ages and experience poorer cancer prognoses than non-Hispanic Caucasians. Deficiency in repair of DNA by homologous recombination (HR) is associated with cancer development, suggesting that mutations in genes that affect this process may cause breast cancer. Inherited pathogenic mutations have been identified in genes involved in repairing DNA damage, but few studies have focused on African-Americans. We screened for germline mutations in seven HR repair pathway genes in DNA of 181 African-American women with breast cancer, evaluated the potential effects of identified missense variants using in silico prediction software, and functionally characterized a set of missense variants by yeast two-hybrid assays. We identified five likely-damaging variants, including two PALB2 truncating variants (Q151X and W1038X) and three novel missense variants (RAD51C C135R, and XRCC3 L297P and V337E) that abolish protein-protein interactions in yeast two-hybrid assays. Our results add to evidence that HR gene mutations account for a proportion of the genetic risk for developing breast cancer in African-Americans. Identifying additional mutations that diminish HR may provide a tool for better assessing breast cancer risk and improving approaches for targeted treatment.
Generation and Analysis of Expressed Sequence Tags (ESTs) from Halophyte Atriplex canescens to Explore Salt-Responsive Related Genes

PubMed Central

Li, Jingtao; Sun, Xinhua; Yu, Gang; Jia, Chengguo; Liu, Jinliang; Pan, Hongyu

2014-01-01

Little information is available on gene expression profiling of halophyte A. canescens. To elucidate the molecular mechanism for stress tolerance in A. canescens, a full-length complementary DNA library was generated from A. canescens exposed to 400 mM NaCl, and provided 343 high-quality ESTs. In an evaluation of 343 valid EST sequences in the cDNA library, 197 unigenes were assembled, among which 190 unigenes (83.1% ESTs) were identified according to their significant similarities with proteins of known functions. All the 343 EST sequences have been deposited in the dbEST GenBank under accession numbers JZ535802 to JZ536144. According to Arabidopsis MIPS functional category and GO classifications, we identified 193 unigenes of the 311 annotations EST, representing 72 non-redundant unigenes sharing similarities with genes related to the defense response. The sets of ESTs obtained provide a rich genetic resource and 17 up-regulated genes related to salt stress resistance were identified by qRT-PCR. Six of these genes may contribute crucially to earlier and later stage salt stress resistance. Additionally, among the 343 unigenes sequences, 22 simple sequence repeats (SSRs) were also identified contributing to the study of A. canescens resources. PMID:24960361
Comparative analysis of myostatin gene and promoter sequences of Qinchuan and Red Angus cattle.

PubMed

He, Y L; Wu, Y H; Quan, F S; Liu, Y G; Zhang, Y

2013-09-04

To better understand the function of the myostatin gene and its promoter region in bovine, we amplified and sequenced the myostatin gene and promoter from the blood of Qinchuan and Red Angus cattle by using polymerase chain reaction. The sequences of Qinchuan and Red Angus cattle were compared with those of other cattle breeds available in GenBank. Exon splice sites were confirmed by mRNA sequencing. Compared to the published sequence (GenBank accession No. AF320998), 69 single nucleotide polymorphisms (SNPs) were identified in the Qinchuan myostatin gene, only one of which was an insertion mutation in Qinchuan cattle. There was a 16-bp insertion in the first 705-bp intron in 3 Qinchuan cattle. A total of 7 SNPs were identified in exon 3, in which the mutation occurred in the third base of the codon and was synonymous. On comparing the Qinchuan myostatin gene sequence to that of Red Angus cattle, a total of 50 SNPs were identified in the first and third exons. In addition, there were 18 SNPs identified in the Qinchuan cattle promoter region compared with those of other cattle compared to the Red Angus cattle myostatin promoter region. breeds (GenBank accession No. AF348479), but only 14 SNPs when compared to the Red Angus cattle myostatin promoter region.
Using RNA-Seq for gene identification, polymorphism detection and transcript profiling in two alfalfa genotypes with divergent cell wall composition in stems

PubMed Central

2011-01-01

Background Alfalfa, [Medicago sativa (L.) sativa], a widely-grown perennial forage has potential for development as a cellulosic ethanol feedstock. However, the genomics of alfalfa, a non-model species, is still in its infancy. The recent advent of RNA-Seq, a massively parallel sequencing method for transcriptome analysis, provides an opportunity to expand the identification of alfalfa genes and polymorphisms, and conduct in-depth transcript profiling. Results Cell walls in stems of alfalfa genotype 708 have higher cellulose and lower lignin concentrations compared to cell walls in stems of genotype 773. Using the Illumina GA-II platform, a total of 198,861,304 expression sequence tags (ESTs, 76 bp in length) were generated from cDNA libraries derived from elongating stem (ES) and post-elongation stem (PES) internodes of 708 and 773. In addition, 341,984 ESTs were generated from ES and PES internodes of genotype 773 using the GS FLX Titanium platform. The first alfalfa (Medicago sativa) gene index (MSGI 1.0) was assembled using the Sanger ESTs available from GenBank, the GS FLX Titanium EST sequences, and the de novo assembled Illumina sequences. MSGI 1.0 contains 124,025 unique sequences including 22,729 tentative consensus sequences (TCs), 22,315 singletons and 78,981 pseudo-singletons. We identified a total of 1,294 simple sequence repeats (SSR) among the sequences in MSGI 1.0. In addition, a total of 10,826 single nucleotide polymorphisms (SNPs) were predicted between the two genotypes. Out of 55 SNPs randomly selected for experimental validation, 47 (85%) were polymorphic between the two genotypes. We also identified numerous allelic variations within each genotype. Digital gene expression analysis identified numerous candidate genes that may play a role in stem development as well as candidate genes that may contribute to the differences in cell wall composition in stems of the two genotypes. Conclusions Our results demonstrate that RNA-Seq can be successfully used for gene identification, polymorphism detection and transcript profiling in alfalfa, a non-model, allogamous, autotetraploid species. The alfalfa gene index assembled in this study, and the SNPs, SSRs and candidate genes identified can be used to improve alfalfa as a forage crop and cellulosic feedstock. PMID:21504589

MAGMA: Generalized Gene-Set Analysis of GWAS Data

PubMed Central

de Leeuw, Christiaan A.; Mooij, Joris M.; Heskes, Tom; Posthuma, Danielle

2015-01-01

By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn’s Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn’s Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn’s Disease data was found to be considerably faster as well. PMID:25885710
MAGMA: generalized gene-set analysis of GWAS data.

PubMed

de Leeuw, Christiaan A; Mooij, Joris M; Heskes, Tom; Posthuma, Danielle

2015-04-01

By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn's Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn's Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn's Disease data was found to be considerably faster as well.
Phosphate transporters in marine phytoplankton and their viruses: cross-domain commonalities in viral-host gene exchanges.

PubMed

Monier, Adam; Welsh, Rory M; Gentemann, Chelle; Weinstock, George; Sodergren, Erica; Armbrust, E Virginia; Eisen, Jonathan A; Worden, Alexandra Z

2012-01-01

Phosphate (PO(4)) is an important limiting nutrient in marine environments. Marine cyanobacteria scavenge PO(4) using the high-affinity periplasmic phosphate binding protein PstS. The pstS gene has recently been identified in genomes of cyanobacterial viruses as well. Here, we analyse genes encoding transporters in genomes from viruses that infect eukaryotic phytoplankton. We identified inorganic PO(4) transporter-encoding genes from the PHO4 superfamily in several virus genomes, along with other transporter-encoding genes. Homologues of the viral pho4 genes were also identified in genome sequences from the genera that these viruses infect. Genome sequences were available from host genera of all the phytoplankton viruses analysed except the host genus Bathycoccus. Pho4 was recovered from Bathycoccus by sequencing a targeted metagenome from an uncultured Atlantic Ocean population. Phylogenetic reconstruction showed that pho4 genes from pelagophytes, haptophytes and infecting viruses were more closely related to homologues in prasinophytes than to those in what, at the species level, are considered to be closer relatives (e.g. diatoms). We also identified PHO4 superfamily members in ocean metagenomes, including new metagenomes from the Pacific Ocean. The environmental sequences grouped with pelagophytes, haptophytes, prasinophytes and viruses as well as bacteria. The analyses suggest that multiple independent pho4 gene transfer events have occurred between marine viruses and both eukaryotic and bacterial hosts. Additionally, pho4 genes were identified in available genomes from viruses that infect marine eukaryotes but not those that infect terrestrial hosts. Commonalities in marine host-virus gene exchanges indicate that manipulation of host-PO(4) uptake is an important adaptation for viral proliferation in marine systems. Our findings suggest that PO(4) -availability may not serve as a simple bottom-up control of marine phytoplankton. © 2011 Society for Applied Microbiology and Blackwell Publishing Ltd.
Advancing epilepsy treatment through personalized genetic zebrafish models.

PubMed

Griffin, A; Krasniak, C; Baraban, S C

2016-01-01

With an increase in the number of disease causing genetic mutations identified from epilepsy cohorts, zebrafish are proving to be an attractive vertebrate model for functional analysis of these allele variants. Not only do zebrafish have conserved gene functions, but larvae harboring mutations in identified human epileptic genes show spontaneous seizure activity and mimic the convulsive behavioral movements observed in humans. With zebrafish being compatible with medium to high-throughput screening, they are also proving to be a unique and powerful system for early preclinical drug screening, including novel target identification, pharmacology, and toxicology. Additionally, with recent advances in genomic engineering technologies, it is now possible to study the precise pathophysiology of patient-specific gene mutations in zebrafish. The following sections highlight how the unique attributes of zebrafish, in combination with genetic modifications, are continuing to transform our understanding of epilepsy and help identify personalized therapeutics for specific patient cohorts. © 2016 Elsevier B.V. All rights reserved.
Genome-wide analysis identifies 12 loci influencing human reproductive behavior

PubMed Central

Barban, Nicola; Jansen, Rick; de Vlaming, Ronald; Vaez, Ahmad; Mandemakers, Jornt J.; Tropf, Felix C.; Shen, Xia; Wilson, James F.; Chasman, Daniel I.; Nolte, Ilja M.; Tragante, Vinicius; van der Laan, Sander W.; Perry, John R. B.; Kong, Augustine; Ahluwalia, Tarunveer; Albrecht, Eva; Yerges-Armstrong, Laura; Atzmon, Gil; Auro, Kirsi; Ayers, Kristin; Bakshi, Andrew; Ben-Avraham, Danny; Berger, Klaus; Bergman, Aviv; Bertram, Lars; Bielak, Lawrence F.; Bjornsdottir, Gyda; Bonder, Marc Jan; Broer, Linda; Bui, Minh; Barbieri, Caterina; Cavadino, Alana; Chavarro, Jorge E; Turman, Constance; Concas, Maria Pina; Cordell, Heather J.; Davies, Gail; Eibich, Peter; Eriksson, Nicholas; Esko, Tõnu; Eriksson, Joel; Falahi, Fahimeh; Felix, Janine F.; Fontana, Mark Alan; Franke, Lude; Gandin, Ilaria; Gaskins, Audrey J.; Gieger, Christian; Gunderson, Erica P.; Guo, Xiuqing; Hayward, Caroline; He, Chunyan; Hofer, Edith; Huang, Hongyan; Joshi, Peter K.; Kanoni, Stavroula; Karlsson, Robert; Kiechl, Stefan; Kifley, Annette; Kluttig, Alexander; Kraft, Peter; Lagou, Vasiliki; Lecoeur, Cecile; Lahti, Jari; Li-Gao, Ruifang; Lind, Penelope A.; Liu, Tian; Makalic, Enes; Mamasoula, Crysovalanto; Matteson, Lindsay; Mbarek, Hamdi; McArdle, Patrick F.; McMahon, George; Meddens, S. Fleur W.; Mihailov, Evelin; Miller, Mike; Missmer, Stacey A.; Monnereau, Claire; van der Most, Peter J.; Myhre, Ronny; Nalls, Mike A.; Nutile, Teresa; Panagiota, Kalafati Ioanna; Porcu, Eleonora; Prokopenko, Inga; Rajan, Kumar B.; Rich-Edwards, Janet; Rietveld, Cornelius A.; Robino, Antonietta; Rose, Lynda M.; Rueedi, Rico; Ryan, Kathy; Saba, Yasaman; Schmidt, Daniel; Smith, Jennifer A.; Stolk, Lisette; Streeten, Elizabeth; Tonjes, Anke; Thorleifsson, Gudmar; Ulivi, Sheila; Wedenoja, Juho; Wellmann, Juergen; Willeit, Peter; Yao, Jie; Yengo, Loic; Zhao, Jing Hua; Zhao, Wei; Zhernakova, Daria V.; Amin, Najaf; Andrews, Howard; Balkau, Beverley; Barzilai, Nir; Bergmann, Sven; Biino, Ginevra; Bisgaard, Hans; Bønnelykke, Klaus; Boomsma, Dorret I.; Buring, Julie E.; Campbell, Harry; Cappellani, Stefania; Ciullo, Marina; Cox, Simon R.; Cucca, Francesco; Daniela, Toniolo; Davey-Smith, George; Deary, Ian J.; Dedoussis, George; Deloukas, Panos; van Duijn, Cornelia M.; de Geus, Eco JC.; Eriksson, Johan G.; Evans, Denis A.; Faul, Jessica D.; Felicita, Sala Cinzia; Froguel, Philippe; Gasparini, Paolo; Girotto, Giorgia; Grabe, Hans-Jörgen; Greiser, Karin Halina; Groenen, Patrick J.F.; de Haan, Hugoline G.; Haerting, Johannes; Harris, Tamara B.; Heath, Andrew C.; Heikkilä, Kauko; Hofman, Albert; Homuth, Georg; Holliday, Elizabeth G; Hopper, John; Hypponen, Elina; Jacobsson, Bo; Jaddoe, Vincent W. V.; Johannesson, Magnus; Jugessur, Astanand; Kähönen, Mika; Kajantie, Eero; Kardia, Sharon L.R.; Keavney, Bernard; Kolcic, Ivana; Koponen, Päivikki; Kovacs, Peter; Kronenberg, Florian; Kutalik, Zoltan; La Bianca, Martina; Lachance, Genevieve; Iacono, William; Lai, Sandra; Lehtimäki, Terho; Liewald, David C; Lindgren, Cecilia; Liu, Yongmei; Luben, Robert; Lucht, Michael; Luoto, Riitta; Magnus, Per; Magnusson, Patrik K.E.; Martin, Nicholas G.; McGue, Matt; McQuillan, Ruth; Medland, Sarah E.; Meisinger, Christa; Mellström, Dan; Metspalu, Andres; Michela, Traglia; Milani, Lili; Mitchell, Paul; Montgomery, Grant W.; Mook-Kanamori, Dennis; de Mutsert, Renée; Nohr, Ellen A; Ohlsson, Claes; Olsen, Jørn; Ong, Ken K.; Paternoster, Lavinia; Pattie, Alison; Penninx, Brenda WJH; Perola, Markus; Peyser, Patricia A.; Pirastu, Mario; Polasek, Ozren; Power, Chris; Kaprio, Jaakko; Raffel, Leslie J.; Räikkönen, Katri; Raitakari, Olli; Ridker, Paul M.; Ring, Susan M.; Roll, Kathryn; Rudan, Igor; Ruggiero, Daniela; Rujescu, Dan; Salomaa, Veikko; Schlessinger, David; Schmidt, Helena; Schmidt, Reinhold; Schupf, Nicole; Smit, Johannes; Sorice, Rossella; Spector, Tim D.; Starr, John M.; Stöckl, Doris; Strauch, Konstantin; Stumvoll, Michael; Swertz, Morris A.; Thorsteinsdottir, Unnur; Thurik, A. Roy; Timpson, Nicholas J.; Tönjes, Anke; Tung, Joyce Y.; Uitterlinden, André G.; Vaccargiu, Simona; Viikari, Jorma; Vitart, Veronique; Völzke, Henry; Vollenweider, Peter; Vuckovic, Dragana; Waage, Johannes; Wagner, Gert G.; Wang, Jie Jin; Wareham, Nicholas J.; Weir, David R.; Willemsen, Gonneke; Willeit, Johann; Wright, Alan F.; Zondervan, Krina T.; Stefansson, Kari; Krueger, Robert F.; Lee, James J.; Benjamin, Daniel J.; Cesarini, David; Koellinger, Philipp D.; den Hoed, Marcel; Snieder, Harold; Mills, Melinda C.

2017-01-01

The genetic architecture of human reproductive behavior – age at first birth (AFB) and number of children ever born (NEB) – has a strong relationship with fitness, human development, infertility and risk of neuropsychiatric disorders. However, very few genetic loci have been identified and the underlying mechanisms of AFB and NEB are poorly understood. We report the largest genome-wide association study to date of both sexes including 251,151 individuals for AFB and 343,072 for NEB. We identified 12 independent loci that are significantly associated with AFB and/or NEB in a SNP-based genome-wide association study, and four additional loci in a gene-based effort. These loci harbor genes that are likely to play a role – either directly or by affecting non-local gene expression – in human reproduction and infertility, thereby increasing our understanding of these complex traits. PMID:27798627
Genome-Wide RNAi Screen Identifies Broadly-Acting Host Factors That Inhibit Arbovirus Infection

PubMed Central

Yasunaga, Ari; Hanna, Sheri L.; Li, Jianqing; Cho, Hyelim; Rose, Patrick P.; Spiridigliozzi, Anna; Gold, Beth; Diamond, Michael S.; Cherry, Sara

2014-01-01

Vector-borne viruses are an important class of emerging and re-emerging pathogens; thus, an improved understanding of the cellular factors that modulate infection in their respective vertebrate and insect hosts may aid control efforts. In particular, cell-intrinsic antiviral pathways restrict vector-borne viruses including the type I interferon response in vertebrates and the RNA interference (RNAi) pathway in insects. However, it is likely that additional cell-intrinsic mechanisms exist to limit these viruses. Since insects rely on innate immune mechanisms to inhibit virus infections, we used Drosophila as a model insect to identify cellular factors that restrict West Nile virus (WNV), a flavivirus with a broad and expanding geographical host range. Our genome-wide RNAi screen identified 50 genes that inhibited WNV infection. Further screening revealed that 17 of these genes were antiviral against additional flaviviruses, and seven of these were antiviral against other vector-borne viruses, expanding our knowledge of invertebrate cell-intrinsic immunity. Investigation of two newly identified factors that restrict diverse viruses, dXPO1 and dRUVBL1, in the Tip60 complex, demonstrated they contributed to antiviral defense at the organismal level in adult flies, in mosquito cells, and in mammalian cells. These data suggest the existence of broadly acting and functionally conserved antiviral genes and pathways that restrict virus infections in evolutionarily divergent hosts. PMID:24550726
Homozygosity mapping and sequencing identify two genes that might contribute to pointing behavior in hunting dogs.

PubMed

Akkad, Denis A; Gerding, Wanda M; Gasser, Robin B; Epplen, Jörg T

2015-01-01

The domestic dog represents an important model for studying the genetics of behavior. In spite of technological advances in genomics and phenomics, the genetic basis of most specific canine behaviors is largely unknown. Some breeds of hunting dogs exhibit a behavioral trait called "pointing" (a prolonged halt of movement to indicate the position of a game animal). Here, the genomes of pointing dogs (Large Munsterlander and Weimaraner) were compared with those of behaviorally distinct herding dogs (Berger des Pyrenées and Schapendoes). We assumed (i) that these four dog breeds initially represented inbred populations and (ii) that selective breeding for pointing behavior promotes an enrichment of the genetic trait in a homozygous state. The homozygosity mapping of 52 dogs (13 of each of the four breeds) followed by subsequent interval resequencing identified fixed genetic differences on chromosome 22 between pointers and herding dogs. In addition, we identified one non-synonomous variation in each of the coding genes SETDB2 and CYSLTR2 that might have a functional consequence. Genetic analysis of additional hunting and non-hunting dogs revealed consistent homozygosity for these two variations in six of seven pointing breeds. Based on the present findings, we propose that, together with other genetic, training and/or environmental factors, the nucleotide and associated amino acid variations identified in genes SETDB2 and CYSLTR2 contribute to pointing behavior.
Alterations in gene expression and DNA methylation during murine and human lung alveolar septation.

PubMed

Cuna, Alain; Halloran, Brian; Faye-Petersen, Ona; Kelly, David; Crossman, David K; Cui, Xiangqin; Pandit, Kusum; Kaminski, Naftali; Bhattacharya, Soumyaroop; Ahmad, Ausaf; Mariani, Thomas J; Ambalavanan, Namasivayam

2015-07-01

DNA methylation, a major epigenetic mechanism, may regulate coordinated expression of multiple genes at specific time points during alveolar septation in lung development. The objective of this study was to identify genes regulated by methylation during normal septation in mice and during disordered septation in bronchopulmonary dysplasia. In mice, newborn lungs (preseptation) and adult lungs (postseptation) were evaluated by microarray analysis of gene expression and immunoprecipitation of methylated DNA followed by sequencing (MeDIP-Seq). In humans, microarray gene expression data were integrated with genome-wide DNA methylation data from bronchopulmonary dysplasia versus preterm and term lung. Genes with reciprocal changes in expression and methylation, suggesting regulation by DNA methylation, were identified. In mice, 95 genes with inverse correlation between expression and methylation during normal septation were identified. In addition to genes known to be important in lung development (Wnt signaling, Angpt2, Sox9, etc.) and its extracellular matrix (Tnc, Eln, etc.), genes involved with immune and antioxidant defense (Stat4, Sod3, Prdx6, etc.) were also observed. In humans, 23 genes were differentially methylated with reciprocal changes in expression in bronchopulmonary dysplasia compared with preterm or term lung. Genes of interest included those involved with detoxifying enzymes (Gstm3) and transforming growth factor-β signaling (bone morphogenetic protein 7 [Bmp7]). In terms of overlap, 20 genes and three pathways methylated during mouse lung development also demonstrated changes in methylation between preterm and term human lung. Changes in methylation correspond to altered expression of a number of genes associated with lung development, suggesting that DNA methylation of these genes may regulate normal and abnormal alveolar septation.
Genome-wide physical activity interactions in adiposity - A meta-analysis of 200,452 adults.

PubMed

Graff, Mariaelisa; Scott, Robert A; Justice, Anne E; Young, Kristin L; Feitosa, Mary F; Barata, Llilda; Winkler, Thomas W; Chu, Audrey Y; Mahajan, Anubha; Hadley, David; Xue, Luting; Workalemahu, Tsegaselassie; Heard-Costa, Nancy L; den Hoed, Marcel; Ahluwalia, Tarunveer S; Qi, Qibin; Ngwa, Julius S; Renström, Frida; Quaye, Lydia; Eicher, John D; Hayes, James E; Cornelis, Marilyn; Kutalik, Zoltan; Lim, Elise; Luan, Jian'an; Huffman, Jennifer E; Zhang, Weihua; Zhao, Wei; Griffin, Paula J; Haller, Toomas; Ahmad, Shafqat; Marques-Vidal, Pedro M; Bien, Stephanie; Yengo, Loic; Teumer, Alexander; Smith, Albert Vernon; Kumari, Meena; Harder, Marie Neergaard; Justesen, Johanne Marie; Kleber, Marcus E; Hollensted, Mette; Lohman, Kurt; Rivera, Natalia V; Whitfield, John B; Zhao, Jing Hua; Stringham, Heather M; Lyytikäinen, Leo-Pekka; Huppertz, Charlotte; Willemsen, Gonneke; Peyrot, Wouter J; Wu, Ying; Kristiansson, Kati; Demirkan, Ayse; Fornage, Myriam; Hassinen, Maija; Bielak, Lawrence F; Cadby, Gemma; Tanaka, Toshiko; Mägi, Reedik; van der Most, Peter J; Jackson, Anne U; Bragg-Gresham, Jennifer L; Vitart, Veronique; Marten, Jonathan; Navarro, Pau; Bellis, Claire; Pasko, Dorota; Johansson, Åsa; Snitker, Søren; Cheng, Yu-Ching; Eriksson, Joel; Lim, Unhee; Aadahl, Mette; Adair, Linda S; Amin, Najaf; Balkau, Beverley; Auvinen, Juha; Beilby, John; Bergman, Richard N; Bergmann, Sven; Bertoni, Alain G; Blangero, John; Bonnefond, Amélie; Bonnycastle, Lori L; Borja, Judith B; Brage, Søren; Busonero, Fabio; Buyske, Steve; Campbell, Harry; Chines, Peter S; Collins, Francis S; Corre, Tanguy; Smith, George Davey; Delgado, Graciela E; Dueker, Nicole; Dörr, Marcus; Ebeling, Tapani; Eiriksdottir, Gudny; Esko, Tõnu; Faul, Jessica D; Fu, Mao; Færch, Kristine; Gieger, Christian; Gläser, Sven; Gong, Jian; Gordon-Larsen, Penny; Grallert, Harald; Grammer, Tanja B; Grarup, Niels; van Grootheest, Gerard; Harald, Kennet; Hastie, Nicholas D; Havulinna, Aki S; Hernandez, Dena; Hindorff, Lucia; Hocking, Lynne J; Holmens, Oddgeir L; Holzapfel, Christina; Hottenga, Jouke Jan; Huang, Jie; Huang, Tao; Hui, Jennie; Huth, Cornelia; Hutri-Kähönen, Nina; James, Alan L; Jansson, John-Olov; Jhun, Min A; Juonala, Markus; Kinnunen, Leena; Koistinen, Heikki A; Kolcic, Ivana; Komulainen, Pirjo; Kuusisto, Johanna; Kvaløy, Kirsti; Kähönen, Mika; Lakka, Timo A; Launer, Lenore J; Lehne, Benjamin; Lindgren, Cecilia M; Lorentzon, Mattias; Luben, Robert; Marre, Michel; Milaneschi, Yuri; Monda, Keri L; Montgomery, Grant W; De Moor, Marleen H M; Mulas, Antonella; Müller-Nurasyid, Martina; Musk, A W; Männikkö, Reija; Männistö, Satu; Narisu, Narisu; Nauck, Matthias; Nettleton, Jennifer A; Nolte, Ilja M; Oldehinkel, Albertine J; Olden, Matthias; Ong, Ken K; Padmanabhan, Sandosh; Paternoster, Lavinia; Perez, Jeremiah; Perola, Markus; Peters, Annette; Peters, Ulrike; Peyser, Patricia A; Prokopenko, Inga; Puolijoki, Hannu; Raitakari, Olli T; Rankinen, Tuomo; Rasmussen-Torvik, Laura J; Rawal, Rajesh; Ridker, Paul M; Rose, Lynda M; Rudan, Igor; Sarti, Cinzia; Sarzynski, Mark A; Savonen, Kai; Scott, William R; Sanna, Serena; Shuldiner, Alan R; Sidney, Steve; Silbernagel, Günther; Smith, Blair H; Smith, Jennifer A; Snieder, Harold; Stančáková, Alena; Sternfeld, Barbara; Swift, Amy J; Tammelin, Tuija; Tan, Sian-Tsung; Thorand, Barbara; Thuillier, Dorothée; Vandenput, Liesbeth; Vestergaard, Henrik; van Vliet-Ostaptchouk, Jana V; Vohl, Marie-Claude; Völker, Uwe; Waeber, Gérard; Walker, Mark; Wild, Sarah; Wong, Andrew; Wright, Alan F; Zillikens, M Carola; Zubair, Niha; Haiman, Christopher A; Lemarchand, Loic; Gyllensten, Ulf; Ohlsson, Claes; Hofman, Albert; Rivadeneira, Fernando; Uitterlinden, André G; Pérusse, Louis; Wilson, James F; Hayward, Caroline; Polasek, Ozren; Cucca, Francesco; Hveem, Kristian; Hartman, Catharina A; Tönjes, Anke; Bandinelli, Stefania; Palmer, Lyle J; Kardia, Sharon L R; Rauramaa, Rainer; Sørensen, Thorkild I A; Tuomilehto, Jaakko; Salomaa, Veikko; Penninx, Brenda W J H; de Geus, Eco J C; Boomsma, Dorret I; Lehtimäki, Terho; Mangino, Massimo; Laakso, Markku; Bouchard, Claude; Martin, Nicholas G; Kuh, Diana; Liu, Yongmei; Linneberg, Allan; März, Winfried; Strauch, Konstantin; Kivimäki, Mika; Harris, Tamara B; Gudnason, Vilmundur; Völzke, Henry; Qi, Lu; Järvelin, Marjo-Riitta; Chambers, John C; Kooner, Jaspal S; Froguel, Philippe; Kooperberg, Charles; Vollenweider, Peter; Hallmans, Göran; Hansen, Torben; Pedersen, Oluf; Metspalu, Andres; Wareham, Nicholas J; Langenberg, Claudia; Weir, David R; Porteous, David J; Boerwinkle, Eric; Chasman, Daniel I; Abecasis, Gonçalo R; Barroso, Inês; McCarthy, Mark I; Frayling, Timothy M; O'Connell, Jeffrey R; van Duijn, Cornelia M; Boehnke, Michael; Heid, Iris M; Mohlke, Karen L; Strachan, David P; Fox, Caroline S; Liu, Ching-Ti; Hirschhorn, Joel N; Klein, Robert J; Johnson, Andrew D; Borecki, Ingrid B; Franks, Paul W; North, Kari E; Cupples, L Adrienne; Loos, Ruth J F; Kilpeläinen, Tuomas O

2017-04-01

Physical activity (PA) may modify the genetic effects that give rise to increased risk of obesity. To identify adiposity loci whose effects are modified by PA, we performed genome-wide interaction meta-analyses of BMI and BMI-adjusted waist circumference and waist-hip ratio from up to 200,452 adults of European (n = 180,423) or other ancestry (n = 20,029). We standardized PA by categorizing it into a dichotomous variable where, on average, 23% of participants were categorized as inactive and 77% as physically active. While we replicate the interaction with PA for the strongest known obesity-risk locus in the FTO gene, of which the effect is attenuated by ~30% in physically active individuals compared to inactive individuals, we do not identify additional loci that are sensitive to PA. In additional genome-wide meta-analyses adjusting for PA and interaction with PA, we identify 11 novel adiposity loci, suggesting that accounting for PA or other environmental factors that contribute to variation in adiposity may facilitate gene discovery.
In vivo CRISPR screening identifies Ptpn2 as a cancer immunotherapy target.

PubMed

Manguso, Robert T; Pope, Hans W; Zimmer, Margaret D; Brown, Flavian D; Yates, Kathleen B; Miller, Brian C; Collins, Natalie B; Bi, Kevin; LaFleur, Martin W; Juneja, Vikram R; Weiss, Sarah A; Lo, Jennifer; Fisher, David E; Miao, Diana; Van Allen, Eliezer; Root, David E; Sharpe, Arlene H; Doench, John G; Haining, W Nicholas

2017-07-27

Immunotherapy with PD-1 checkpoint blockade is effective in only a minority of patients with cancer, suggesting that additional treatment strategies are needed. Here we use a pooled in vivo genetic screening approach using CRISPR-Cas9 genome editing in transplantable tumours in mice treated with immunotherapy to discover previously undescribed immunotherapy targets. We tested 2,368 genes expressed by melanoma cells to identify those that synergize with or cause resistance to checkpoint blockade. We recovered the known immune evasion molecules PD-L1 and CD47, and confirmed that defects in interferon-γ signalling caused resistance to immunotherapy. Tumours were sensitized to immunotherapy by deletion of genes involved in several diverse pathways, including NF-κB signalling, antigen presentation and the unfolded protein response. In addition, deletion of the protein tyrosine phosphatase PTPN2 in tumour cells increased the efficacy of immunotherapy by enhancing interferon-γ-mediated effects on antigen presentation and growth suppression. In vivo genetic screens in tumour models can identify new immunotherapy targets in unanticipated pathways.
Gene panel testing for hereditary breast cancer.

PubMed

Winship, Ingrid; Southey, Melissa C

2016-03-21

Inherited predisposition to breast cancer is explained only in part by mutations in the BRCA1 and BRCA2 genes. Most families with an apparent familial clustering of breast cancer who are investigated through Australia's network of genetic services and familial cancer centres do not have mutations in either of these genes. More recently, additional breast cancer predisposition genes, such as PALB2, have been identified. New genetic technology allows a panel of multiple genes to be tested for mutations in a single test. This enables more women and their families to have risk assessment and risk management, in a preventive approach to predictable breast cancer. Predictive testing for a known family-specific mutation in a breast cancer predisposition gene provides personalised risk assessment and evidence-based risk management. Breast cancer predisposition gene panel tests have a greater diagnostic yield than conventional testing of only the BRCA1 and BRCA2 genes. The clinical validity and utility of some of the putative breast cancer predisposition genes is not yet clear. Ethical issues warrant consideration, as multiple gene panel testing has the potential to identify secondary findings not originally sought by the test requested. Multiple gene panel tests may provide an affordable and effective way to investigate the heritability of breast cancer.
[BIOINFORMATIC SEARCH AND PHYLOGENETIC ANALYSIS OF THE CELLULOSE SYNTHASE GENES OF FLAX (LINUM USITATISSIMUM)].

PubMed

Pydiura, N A; Bayer, G Ya; Galinousky, D V; Yemets, A I; Pirko, Ya V; Podvitski, T A; Anisimova, N V; Khotyleva, L V; Kilchevsky, A V; Blume, Ya B

2015-01-01

A bioinformatic search of sequences encoding cellulose synthase genes in the flax genome, and their comparison to dicots orthologs was carried out. The analysis revealed 32 cellulose synthase gene candidates, 16 of which are highly likely to encode cellulose synthases, and the remaining 16--cellulose synthase-like proteins (Csl). Phylogenetic analysis of gene products of cellulose synthase genes allowed distinguishing 6 groups of cellulose synthase genes of different classes: CesA1/10, CesA3, CesA4, CesA5/6/2/9, CesA7 and CesA8. Paralogous sequences within classes CesA1/10 and CesA5/6/2/9 which are associated with the primary cell wall formation are characterized by a greater similarity within these classes than orthologous sequences. Whereas the genes controlling the biosynthesis of secondary cell wall cellulose form distinct clades: CesA4, CesA7, and CesA8. The analysis of 16 identified flax cellulose synthase gene candidates shows the presence of at least 12 different cellulose synthase gene variants in flax genome which are represented in all six clades of cellulose synthase genes. Thus, at this point genes of all ten known cellulose synthase classes are identify in flax genome, but their correct classification requires additional research.
Single nucleotide polymorphisms in bone turnover-related genes in Koreans: ethnic differences in linkage disequilibrium and haplotype

PubMed Central

Kim, Kyung-Seon; Kim, Ghi-Su; Hwang, Joo-Yeon; Lee, Hye-Ja; Park, Mi-Hyun; Kim, Kwang-joong; Jung, Jongsun; Cha, Hyo-Soung; Shin, Hyoung Doo; Kang, Jong-Ho; Park, Eui Kyun; Kim, Tae-Ho; Hong, Jung-Min; Koh, Jung-Min; Oh, Bermseok; Kimm, Kuchan; Kim, Shin-Yoon; Lee, Jong-Young

2007-01-01

Background Osteoporosis is defined as the loss of bone mineral density that leads to bone fragility with aging. Population-based case-control studies have identified polymorphisms in many candidate genes that have been associated with bone mass maintenance or osteoporotic fracture. To investigate single nucleotide polymorphisms (SNPs) that are associated with osteoporosis, we examined the genetic variation among Koreans by analyzing 81 genes according to their function in bone formation and resorption during bone remodeling. Methods We resequenced all the exons, splice junctions and promoter regions of candidate osteoporosis genes using 24 unrelated Korean individuals. Using the common SNPs from our study and the HapMap database, a statistical analysis of deviation in heterozygosity depicted. Results We identified 942 variants, including 888 SNPs, 43 insertion/deletion polymorphisms, and 11 microsatellite markers. Of the SNPs, 557 (63%) had been previously identified and 331 (37%) were newly discovered in the Korean population. When compared SNPs in the Korean population with those in HapMap database, 1% (or less) of SNPs in the Japanese and Chinese subpopulations and 20% of those in Caucasian and African subpopulations were significantly differentiated from the Hardy-Weinberg expectations. In addition, an analysis of the genetic diversity showed that there were no significant differences among Korean, Han Chinese and Japanese populations, but African and Caucasian populations were significantly differentiated in selected genes. Nevertheless, in the detailed analysis of genetic properties, the LD and Haplotype block patterns among the five sub-populations were substantially different from one another. Conclusion Through the resequencing of 81 osteoporosis candidate genes, 118 unknown SNPs with a minor allele frequency (MAF) > 0.05 were discovered in the Korean population. In addition, using the common SNPs between our study and HapMap, an analysis of genetic diversity and deviation in heterozygosity was performed and the polymorphisms of the above genes among the five populations were substantially differentiated from one another. Further studies of osteoporosis could utilize the polymorphisms identified in our data since they may have important implications for the selection of highly informative SNPs for future association studies. PMID:18036257
A statistical approach to identify, monitor, and manage incomplete curated data sets.

PubMed

Howe, Douglas G

2018-04-02

Many biological knowledge bases gather data through expert curation of published literature. High data volume, selective partial curation, delays in access, and publication of data prior to the ability to curate it can result in incomplete curation of published data. Knowing which data sets are incomplete and how incomplete they are remains a challenge. Awareness that a data set may be incomplete is important for proper interpretation, to avoiding flawed hypothesis generation, and can justify further exploration of published literature for additional relevant data. Computational methods to assess data set completeness are needed. One such method is presented here. In this work, a multivariate linear regression model was used to identify genes in the Zebrafish Information Network (ZFIN) Database having incomplete curated gene expression data sets. Starting with 36,655 gene records from ZFIN, data aggregation, cleansing, and filtering reduced the set to 9870 gene records suitable for training and testing the model to predict the number of expression experiments per gene. Feature engineering and selection identified the following predictive variables: the number of journal publications; the number of journal publications already attributed for gene expression annotation; the percent of journal publications already attributed for expression data; the gene symbol; and the number of transgenic constructs associated with each gene. Twenty-five percent of the gene records (2483 genes) were used to train the model. The remaining 7387 genes were used to test the model. One hundred and twenty-two and 165 of the 7387 tested genes were identified as missing expression annotations based on their residuals being outside the model lower or upper 95% confidence interval respectively. The model had precision of 0.97 and recall of 0.71 at the negative 95% confidence interval and precision of 0.76 and recall of 0.73 at the positive 95% confidence interval. This method can be used to identify data sets that are incompletely curated, as demonstrated using the gene expression data set from ZFIN. This information can help both database resources and data consumers gauge when it may be useful to look further for published data to augment the existing expertly curated information.
The TERT gene harbors multiple variants associated with pancreatic cancer susceptibility

PubMed Central

Campa, Daniele; Rizzato, Cosmeri; Stolzenberg-Solomon, Rachael; Pacetti, Paola; Vodicka, Pavel; Cleary, Sean P.; Capurso, Gabriele; Bueno-de-Mesquita, H. Bas; Werner, Jens; Gazouli, Maria; Butterbach, Katja; Ivanauskas, Audrius; Giese, Nathalia; Petersen, Gloria M.; Fogar, Paola; Wang, Zhaoming; Bassi, Claudio; Ryska, Miroslav; Theodoropoulos, George E.; Kooperberg, Charles; Li, Donghui; Greenhalf, William; Pasquali, Claudio; Hackert, Thilo; Fuchs, Charles S.; Mohelnikova-Duchonova, Beatrice; Sperti, Cosimo; Funel, Niccola; Dieffenbach, Aida Karina; Wareham, Nicholas J.; Buring, Julie; Holcátová, Ivana; Costello, Eithne; Zambon, Carlo-Federico; Kupcinskas, Juozas; Risch, Harvey A.; Kraft, Peter; Bracci, Paige M.; Pezzilli, Raffaele; Olson, Sara H.; Sesso, Howard D.; Hartge, Patricia; Strobel, Oliver; Małecka-Panas, Ewa; Visvanathan, Kala; Arslan, Alan A.; Pedrazzoli, Sergio; Souček, Pavel; Gioffreda, Domenica; Key, Timothy J.; Talar-Wojnarowska, Renata; Scarpa, Aldo; Mambrini, Andrea; Jacobs, Eric J.; Jamroziak, Krzysztof; Klein, Alison; Tavano, Francesca; Bambi, Franco; Landi, Stefano; Austin, Melissa A.; Vodickova, Ludmila; Brenner, Hermann; Chanock, Stephen J.; Fave, Gianfranco Delle; Piepoli, Ada; Cantore, Maurizio; Zheng, Wei; Wolpin, Brian M.; Amundadottir, Laufey T.; Canzian, Federico

2015-01-01

A small number of common susceptibility loci have been identified for pancreatic cancer, one of which is marked by rs401681 in the TERT – CLPTM1L gene region on chr5p15.33. Since this region is characterized by low linkage disequilibrium (LD), we sought to identify additional SNPs could be related to pancreatic cancer risk, independently of rs401681. We performed an in-depth analysis of genetic variability of the telomerase reverse transcriptase (TERT) and the telomerase RNA component (TERC) genes, in 5,550 subjects with pancreatic cancer and 7,585 controls from the PANcreatic Disease ReseArch (PANDoRA) and the PanScan consortia. We identified a significant association between a variant in TERT and pancreatic cancer risk (rs2853677, OR=0.85; 95% CI=0.80–0.90, P=8.3×10−8). Additional analysis adjusting rs2853677 for rs401681 indicated that the two SNPs are independently associated with pancreatic cancer risk, as suggested by the low LD between them (r2=0.07, D´=0.28). Three additional SNPs in TERT reached statistical significance after correction for multiple testing: rs2736100 (P=3.0×10−5), rs4583925 (P=4.0×10−5) and rs2735948 (P=5.0×10−5). In conclusion, we confirmed that the TERT locus is associated with pancreatic cancer risk, possibly through several independent variants. PMID:25940397
Expression of multidrug resistance efflux pump gene norA is iron responsive in Staphylococcus aureus.

PubMed

Deng, Xin; Sun, Fei; Ji, Quanjiang; Liang, Haihua; Missiakas, Dominique; Lan, Lefu; He, Chuan

2012-04-01

Staphylococcus aureus utilizes efflux transporter NorA to pump out a wide range of structurally dissimilar drugs, conferring low-level multidrug resistance. The regulation of norA expression has yet to be fully understood although past studies have revealed that this gene is under the control of the global transcriptional regulator MgrA and the two-component system ArlRS. To identify additional regulators of norA, we screened a transposon library in strain Newman expressing the transcriptional fusion norA-lacZ for altered β-galactosidase activity. We identify a transposon insertion in fhuB, a gene that encodes a ferric hydroxamate uptake system permease, and propose that the norA transcription is iron responsive. In agreement with this observation, addition of FeCl(3) repressed the induction of norA-lacZ, suggesting that bacterial iron uptake plays an important role in regulating norA transcription. In addition, a fur (ferric uptake regulator) deletion exhibited compromised norA transcription and reduced resistance to quinolone compared to the wild-type strain, indicating that fur functions as a positive regulator of norA. A putative Fur box identified in the promoter region of norA was confirmed by electrophoretic mobility shift and DNase I footprint assays. Finally, by employing a siderophore secretion assay, we reveal that NorA may contribute to the export of siderophores. Collectively, our experiments uncover some novel interactions between cellular iron level and norA regulation in S. aureus.
Expression of Multidrug Resistance Efflux Pump Gene norA Is Iron Responsive in Staphylococcus aureus

PubMed Central

Deng, Xin; Sun, Fei; Ji, Quanjiang; Liang, Haihua; Missiakas, Dominique; Lan, Lefu

2012-01-01

Staphylococcus aureus utilizes efflux transporter NorA to pump out a wide range of structurally dissimilar drugs, conferring low-level multidrug resistance. The regulation of norA expression has yet to be fully understood although past studies have revealed that this gene is under the control of the global transcriptional regulator MgrA and the two-component system ArlRS. To identify additional regulators of norA, we screened a transposon library in strain Newman expressing the transcriptional fusion norA-lacZ for altered β-galactosidase activity. We identify a transposon insertion in fhuB, a gene that encodes a ferric hydroxamate uptake system permease, and propose that the norA transcription is iron responsive. In agreement with this observation, addition of FeCl3 repressed the induction of norA-lacZ, suggesting that bacterial iron uptake plays an important role in regulating norA transcription. In addition, a fur (ferric uptake regulator) deletion exhibited compromised norA transcription and reduced resistance to quinolone compared to the wild-type strain, indicating that fur functions as a positive regulator of norA. A putative Fur box identified in the promoter region of norA was confirmed by electrophoretic mobility shift and DNase I footprint assays. Finally, by employing a siderophore secretion assay, we reveal that NorA may contribute to the export of siderophores. Collectively, our experiments uncover some novel interactions between cellular iron level and norA regulation in S. aureus. PMID:22267518
Characterization of the interferon genes in homozygous rainbow trout reveals two novel genes, alternate splicing and differential regulation of duplicated genes

USGS Publications Warehouse

Purcell, M.K.; Laing, K.J.; Woodson, J.C.; Thorgaard, G.H.; Hansen, J.D.

2009-01-01

The genes encoding the type I and type II interferons (IFNs) have previously been identified in rainbow trout and their proteins partially characterized. These previous studies reported a single type II IFN (rtIFN-??) and three rainbow trout type I IFN genes that are classified into either group I (rtIFN1, rtIFN2) or group II (rtIFN3). In this present study, we report the identification of a novel IFN-?? gene (rtIFN-??2) and a novel type I group II IFN (rtIFN4) in homozygous rainbow trout and predict that additional IFN genes or pseudogenes exist in the rainbow trout genome. Additionally, we provide evidence that short and long forms of rtIFN1 are actively and differentially transcribed in homozygous trout, and likely arose due to alternate splicing of the first exon. Quantitative reverse transcriptase PCR (qRT-PCR) assays were developed to systematically profile all of the rainbow trout IFN transcripts, with high specificity at an individual gene level, in na??ve fish and after stimulation with virus or viral-related molecules. Cloned PCR products were used to ensure the specificity of the qRT-PCR assays and as absolute standards to assess transcript abundance of each gene. All IFN genes were modulated in response to Infectious hematopoietic necrosis virus (IHNV), a DNA vaccine based on the IHNV glycoprotein, and poly I:C. The most inducible of the type I IFN genes, by all stimuli tested, were rtIFN3 and the short transcript form of rtIFN1. Gene expression of rtIFN-??1 and rtIFN-??2 was highly up-regulated by IHNV infection and DNA vaccination but rtIFN-??2 was induced to a greater magnitude. The specificity of the qRT-PCR assays reported here will be useful for future studies aimed at identifying which cells produce IFNs at early time points after infection. ?? 2008 Elsevier Ltd.
Biomarkers of Selenium Action in Prostate Cancer

DTIC Science & Technology

2005-01-01

secretory by conventional methods according to published literature. In addition, we have determined the similarities and differences in global gene...transition zone tissue of a 42-year-old man ac- arrays in the resulting data tables were ordered by their cording to previously described methods [4]. The pre...hundred fifteen genes identified by ELISA method . Replicating the conditions used for the SAM analysis showed significant differential expres- microarray
Differential regulation of mnp2, a new manganese peroxidase-encoding gene from the ligninolytic fungus Trametes versicolor PRL 572

Treesearch

Tomas Johansson; Per Olof Nyman; Daniel Cullen

2002-01-01

A peroxidase-encoding gene, mnp2, and its corresponding cDNA were characterized from the white-rot basidiomycete Trametes versicolor PRL 572. We used quantitative reverse transcriptase-mediated PCR to identify mnp2 transcripts in nutrient-limited stationary cultures. Although mnp2 lacks upstream metal response elements (MREs), addition of MnSO4 to cultures increased...

Genome-wide characterization of the Pectate Lyase-like (PLL) genes in Brassica rapa.

PubMed

Jiang, Jingjing; Yao, Lina; Miao, Ying; Cao, Jiashu

2013-11-01

Pectate lyases (PL) depolymerize demethylated pectin (pectate, EC 4.2.2.2) by catalyzing the eliminative cleavage of α-1,4-glycosidic linked galacturonan. Pectate Lyase-like (PLL) genes are one of the largest and most complex families in plants. However, studies on the phylogeny, gene structure, and expression of PLL genes are limited. To understand the potential functions of PLL genes in plants, we characterized their intron-exon structure, phylogenetic relationships, and protein structures, and measured their expression patterns in various tissues, specifically the reproductive tissues in Brassica rapa. Sequence alignments revealed two characteristic motifs in PLL genes. The chromosome location analysis indicated that 18 of the 46 PLL genes were located in the least fractionated sub-genome (LF) of B. rapa, while 16 were located in the medium fractionated sub-genome (MF1) and 12 in the more fractionated sub-genome (MF2). Quantitative RT-PCR analysis showed that BrPLL genes were expressed in various tissues, with most of them being expressed in flowers. Detailed qRT-PCR analysis identified 11 pollen specific PLL genes and several other genes with unique spatial expression patterns. In addition, some duplicated genes showed similar expression patterns. The phylogenetic analysis identified three PLL gene subfamilies in plants, among which subfamily II might have evolved from gene neofunctionalization or subfunctionalization. Therefore, this study opens the possibility for exploring the roles of PLL genes during plant development.
Gene Discovery in Bladder Cancer Progression using cDNA Microarrays

PubMed Central

Sanchez-Carbayo, Marta; Socci, Nicholas D.; Lozano, Juan Jose; Li, Wentian; Charytonowicz, Elizabeth; Belbin, Thomas J.; Prystowsky, Michael B.; Ortiz, Angel R.; Childs, Geoffrey; Cordon-Cardo, Carlos

2003-01-01

To identify gene expression changes along progression of bladder cancer, we compared the expression profiles of early-stage and advanced bladder tumors using cDNA microarrays containing 17,842 known genes and expressed sequence tags. The application of bootstrapping techniques to hierarchical clustering segregated early-stage and invasive transitional carcinomas into two main clusters. Multidimensional analysis confirmed these clusters and more importantly, it separated carcinoma in situ from papillary superficial lesions and subgroups within early-stage and invasive tumors displaying different overall survival. Additionally, it recognized early-stage tumors showing gene profiles similar to invasive disease. Different techniques including standard t-test, single-gene logistic regression, and support vector machine algorithms were applied to identify relevant genes involved in bladder cancer progression. Cytokeratin 20, neuropilin-2, p21, and p33ING1 were selected among the top ranked molecular targets differentially expressed and validated by immunohistochemistry using tissue microarrays (n = 173). Their expression patterns were significantly associated with pathological stage, tumor grade, and altered retinoblastoma (RB) expression. Moreover, p33ING1 expression levels were significantly associated with overall survival. Analysis of the annotation of the most significant genes revealed the relevance of critical genes and pathways during bladder cancer progression, including the overexpression of oncogenic genes such as DEK in superficial tumors or immune response genes such as Cd86 antigen in invasive disease. Gene profiling successfully classified bladder tumors based on their progression and clinical outcome. The present study has identified molecular biomarkers of potential clinical significance and critical molecular targets associated with bladder cancer progression. PMID:12875971
Quantitative real time RT-PCR study of pathogen-induced gene expression in rock bream (Oplegnathus fasciatus): internal controls for data normalization.

PubMed

Zhang, Bao-cun; Sun, Li; Xiao, Zhi-zhong; Hu, Yong-hua

2014-06-01

Rock bream Oplegnathus fasciatus is an important economic fish species. In this study, we evaluated the appropriateness of six housekeeping genes as internal controls for quantitative real-time PCR (RT-qPCR) analysis of gene expression in rock bream before and after pathogen infection. The expression of the selected genes in eight tissues infected with Vibrio alginolyticus or megalocytivirus was determined by RT-qPCR, and the PCR data were analyzed with geNorm and NormFinder algorithms. The results showed that before pathogen infection, mediator of RNA polymerase II transcription subunit 8 and β-actin were ranked as the most stable genes across the examined tissues. After bacterial or viral infection, the stabilities of the housekeeping genes varied to significant extents in tissue-dependent manners, and no single pair of genes was identified as suitable references for all tissues for either of the pathogen stimuli. In addition, for the majority of tissues, the most stable genes during bacterial infection differed from those during viral infection. Nevertheless, optimum reference genes were identified for each tissue under different conditions. Taken together, these results indicate that tissue type and the nature of the infectious agent used in the study can all influence the choice of normalization factors, and that the optimum reference genes identified in this study will provide a useful guidance for the selection of internal controls in future RT-PCR study of gene expression in rock bream. Copyright © 2014 Elsevier B.V. All rights reserved.
Meta-analysis identifies common variants associated with body mass index in East Asians

PubMed Central

Wen, Wanqing; Cho, Yoon Shin; Zheng, Wei; Dorajoo, Rajkumar; Kato, Norihiro; Qi, Lu; Chen, Chien-Hsiun; Delahanty, Ryan J.; Okada, Yukinori; Tabara, Yasuharu; Gu, Dongfeng; Zhu, Dingliang; Haiman, Christopher A.; Mo, Zengnan; Gao, Yu-Tang; Saw, Seang Mei; Go, Min Jin; Takeuchi, Fumihiko; Chang, Li-Ching; Kokubo, Yoshihiro; Liang, Jun; Hao, Mei; Marchand, Loic Le; Zhang, Yi; Hu, Yanling; Wong, Tien Yin; Long, Jirong; Han, Bok-Ghee; Kubo, Michiaki; Yamamoto, Ken; Su, Mei-Hsin; Miki, Tetsuro; Henderson, Brian E.; Song, Huaidong; Tan, Aihua; He, Jiang; Ng, Daniel P.-K.; Cai, Qiuyin; Tsunoda, Tatsuhiko; Tsai, Fuu-Jen; Iwai, Naoharu; Chen, Gary K.; Shi, Jiajun; Xu, Jianfeng; Sim, Xueling; Xiang, Yong-Bing; Maeda, Shiro; Ong, Rick T.H.; Li, Chun; Nakamura, Yusuke; Aung, Tin; Kamatani, Naoyuki; Liu, Jian Jun; Lu, Wei; Yokota, Mitsuhiro; Seielstad, Mark; Fann, Cathy S.J.; Wu, Jer-Yuarn; Lee, Jong-Young; Hu, Frank B.; Tanaka, Toshihiro; Tai, E. Shyong; Shu, Xiao Ou

2012-01-01

Multiple genetic loci associated with obesity or body mass index (BMI) have been identified through genome-wide association studies conducted predominantly in populations of European ancestry. We conducted a meta-analysis of associations between BMI and approximately 2.4 million SNPs in 27,715 East Asians, followed by in silico and de novo replication in 37,691 and 17,642 additional East Asians, respectively. We identified ten BMI-associated loci at the genome-wide significance level (P<5.0×10−8), including seven previously identified loci (FTO, SEC16B, MC4R, GIPR/QPCTL, ADCY3/RBJ, BDNF, and MAP2K5) and three novel loci in or near the CDKAL1,PCSK1, and GP2 genes. Three additional loci nearly reached the genome-wide significance threshold, including two previously identified loci in the GNPDA2 and TFAP2B genes and a new locus near PAX6, which all had P<5.0×10−7. Findings from this study may shed light on new pathways involved in obesity and demonstrate the value of conducting genetic studies in non-European populations. PMID:22344219
Gene expression atlas of pigeonpea and its application to gain insights into genes associated with pollen fertility implicated in seed formation.

PubMed

Pazhamala, Lekha T; Purohit, Shilp; Saxena, Rachit K; Garg, Vanika; Krishnamurthy, L; Verdier, Jerome; Varshney, Rajeev K

2017-04-01

Pigeonpea (Cajanus cajan) is an important grain legume of the semi-arid tropics, mainly used for its protein rich seeds. To link the genome sequence information with agronomic traits resulting from specific developmental processes, a Cajanus cajan gene expression atlas (CcGEA) was developed using the Asha genotype. Thirty tissues/organs representing developmental stages from germination to senescence were used to generate 590.84 million paired-end RNA-Seq data. The CcGEA revealed a compendium of 28 793 genes with differential, specific, spatio-temporal and constitutive expression during various stages of development in different tissues. As an example to demonstrate the application of the CcGEA, a network of 28 flower-related genes analysed for cis-regulatory elements and splicing variants has been identified. In addition, expression analysis of these candidate genes in male sterile and male fertile genotypes suggested their critical role in normal pollen development leading to seed formation. Gene network analysis also identified two regulatory genes, a pollen-specific SF3 and a sucrose-proton symporter, that could have implications for improvement of agronomic traits such as seed production and yield. In conclusion, the CcGEA provides a valuable resource for pigeonpea to identify candidate genes involved in specific developmental processes and to understand the well-orchestrated growth and developmental process in this resilient crop. © The Author 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology.
Impact of Cigarette Smoke on the Human and Mouse Lungs: A Gene-Expression Comparison Study

PubMed Central

Morissette, Mathieu C.; Lamontagne, Maxime; Bérubé, Jean-Christophe; Gaschler, Gordon; Williams, Andrew; Yauk, Carole; Couture, Christian; Laviolette, Michel; Hogg, James C.; Timens, Wim; Halappanavar, Sabina; Stampfli, Martin R.; Bossé, Yohan

2014-01-01

Cigarette smoke is well known for its adverse effects on human health, especially on the lungs. Basic research is essential to identify the mechanisms involved in the development of cigarette smoke-related diseases, but translation of new findings from pre-clinical models to the clinic remains difficult. In the present study, we aimed at comparing the gene expression signature between the lungs of human smokers and mice exposed to cigarette smoke to identify the similarities and differences. Using human and mouse whole-genome gene expression arrays, changes in gene expression, signaling pathways and biological functions were assessed. We found that genes significantly modulated by cigarette smoke in humans were enriched for genes modulated by cigarette smoke in mice, suggesting a similar response of both species. Sixteen smoking-induced genes were in common between humans and mice including six newly reported to be modulated by cigarette smoke. In addition, we identified a new conserved pulmonary response to cigarette smoke in the induction of phospholipid metabolism/degradation pathways. Finally, the majority of biological functions modulated by cigarette smoke in humans were also affected in mice. Altogether, the present study provides information on similarities and differences in lung gene expression response to cigarette smoke that exist between human and mouse. Our results foster the idea that animal models should be used to study the involvement of pathways rather than single genes in human diseases. PMID:24663285
Target gene screening and evaluation of prognostic values in non-small cell lung cancers by bioinformatics analysis.

PubMed

Piao, Junjie; Sun, Jie; Yang, Yang; Jin, Tiefeng; Chen, Liyan; Lin, Zhenhua

2018-03-20

Non-small cell lung cancer (NSCLC) is the major leading cause of cancer-related deaths worldwide. This study aims to explore molecular mechanism of NSCLC. Microarray dataset was obtained from the Gene Expression Omnibus (GEO) database, and analyzed by using GEO2R. Functional and pathway enrichment analysis were performed based on Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Then, STRING, Cytoscape and MCODE were applied to construct the Protein-protein interaction (PPI) network and screen hub genes. Following, overall survival (OS) analysis of hub genes was performed by using the Kaplan-Meier plotter online tool. Moreover, miRecords was also applied to predict the targets of the differentially expressed microRNAs (DEMs). A total of 228 DEGs were identified, and they were mainly enriched in the terms of cell adhesion molecules, leukocyte transendothelial migration and ECM-receptor interaction. A PPI network was constructed, and 16 hub genes were identified, including TEK, ANGPT1, MMP9, VWF, CDH5, EDN1, ESAM, CCNE1, CDC45, PRC1, CCNB2, AURKA, MELK, CDC20, TOP2A and PTTG1. Among the genes, expressions of 14 hub genes were associated with prognosis of NSCLC patients. Additionally, a total of 11 DEMs were also identified. Our results provide some potential underlying biomarkers for NSCLC. Further studies are required to elucidate the pathogenesis of NSCLC. Copyright © 2018 Elsevier B.V. All rights reserved.
Gene Expression Patterns Associated With Histopathology in Toxic Liver Fibrosis.

PubMed

Ippolito, Danielle L; AbdulHameed, Mohamed Diwan M; Tawa, Gregory J; Baer, Christine E; Permenter, Matthew G; McDyre, Bonna C; Dennis, William E; Boyle, Molly H; Hobbs, Cheryl A; Streicker, Michael A; Snowden, Bobbi S; Lewis, John A; Wallqvist, Anders; Stallings, Jonathan D

2016-01-01

Toxic industrial chemicals induce liver injury, which is difficult to diagnose without invasive procedures. Identifying indicators of end organ injury can complement exposure-based assays and improve predictive power. A multiplexed approach was used to experimentally evaluate a panel of 67 genes predicted to be associated with the fibrosis pathology by computationally mining DrugMatrix, a publicly available repository of gene microarray data. Five-day oral gavage studies in male Sprague Dawley rats dosed with varying concentrations of 3 fibrogenic compounds (allyl alcohol, carbon tetrachloride, and 4,4'-methylenedianiline) and 2 nonfibrogenic compounds (bromobenzene and dexamethasone) were conducted. Fibrosis was definitively diagnosed by histopathology. The 67-plex gene panel accurately diagnosed fibrosis in both microarray and multiplexed-gene expression assays. Necrosis and inflammatory infiltration were comorbid with fibrosis. ANOVA with contrasts identified that 51 of the 67 predicted genes were significantly associated with the fibrosis phenotype, with 24 of these specific to fibrosis alone. The protein product of the gene most strongly correlated with the fibrosis phenotype PCOLCE (Procollagen C-Endopeptidase Enhancer) was dose-dependently elevated in plasma from animals administered fibrogenic chemicals (P < .05). Semiquantitative global mass spectrometry analysis of the plasma identified an additional 5 protein products of the gene panel which increased after fibrogenic toxicant administration: fibronectin, ceruloplasmin, vitronectin, insulin-like growth factor binding protein, and α2-macroglobulin. These results support the data mining approach for identifying gene and/or protein panels for assessing liver injury and may suggest bridging biomarkers for molecular mediators linked to histopathology. Published by Oxford University Press on behalf of the Society of Toxicology 2015. This work is written by US Government employees and is in the public domain in the US.
IL-32 is a molecular marker of a host defense network in human tuberculosis

PubMed Central

Montoya, Dennis; Inkeles, Megan S.; Liu, Phillip T.; Realegeno, Susan; Teles, Rosane M. B.; Vaidya, Poorva; Munoz, Marcos A.; Schenk, Mirjam; Swindell, William R.; Chun, Rene; Zavala, Kathryn; Hewison, Martin; Adams, John S.; Horvath, Steve; Pellegrini, Matteo; Bloom, Barry R.; Modlin, Robert L.

2014-01-01

Tuberculosis is a leading cause of infectious disease–related death worldwide; however, only 10% of people infected with Mycobacterium tuberculosis develop disease. Factors that contribute to protection could prove to be promising targets for M. tuberculosis therapies. Analysis of peripheral blood gene expression profiles of active tuberculosis patients has identified correlates of risk for disease or pathogenesis. We sought to identify potential human candidate markers of host defense by studying gene expression profiles of macrophages, cells that, upon infection by M. tuberculosis, can mount an antimicrobial response. Weighted gene coexpression network analysis revealed an association between the cytokine interleukin-32 (IL-32) and the vitamin D antimicrobial pathway in a network of interferon-γ– and IL-15–induced “defense response” genes. IL-32 induced the vitamin D–dependent antimicrobial peptides cathelicidin and DEFB4 and to generate antimicrobial activity in vitro, dependent on the presence of adequate 25-hydroxyvitamin D. In addition, the IL-15–induced defense response macrophage gene network was integrated with ranked pairwise comparisons of gene expression from five different clinical data sets of latent compared with active tuberculosis or healthy controls and a coexpression network derived from gene expression in patients with tuberculosis undergoing chemotherapy. Together, these analyses identified eight common genes, including IL-32, as molecular markers of latent tuberculosis and the IL-15–induced gene network. As maintaining M. tuberculosis in a latent state and preventing transition to active disease may represent a form of host resistance, these results identify IL-32 as one functional marker and potential correlate of protection against active tuberculosis. PMID:25143364
Identification of pathogenicity‐related genes in Fusarium oxysporum f. sp. cepae

PubMed Central

Vágány, Viktória; Jackson, Alison C.; Harrison, Richard J.; Rainoni, Alessandro; Clarkson, John P.

2016-01-01

Summary Pathogenic isolates of Fusarium oxysporum, distinguished as formae speciales (f. spp.) on the basis of their host specificity, cause crown rots, root rots and vascular wilts on many important crops worldwide. Fusarium oxysporum f. sp. cepae (FOC) is particularly problematic to onion growers worldwide and is increasing in prevalence in the UK. We characterized 31 F. oxysporum isolates collected from UK onions using pathogenicity tests, sequencing of housekeeping genes and identification of effectors. In onion seedling and bulb tests, 21 isolates were pathogenic and 10 were non‐pathogenic. The molecular characterization of these isolates, and 21 additional isolates comprising other f. spp. and different Fusarium species, was carried out by sequencing three housekeeping genes. A concatenated tree separated the F. oxysporum isolates into six clades, but did not distinguish between pathogenic and non‐pathogenic isolates. Ten putative effectors were identified within FOC, including seven Secreted In Xylem (SIX) genes first reported in F. oxysporum f. sp. lycopersici. Two highly homologous proteins with signal peptides and RxLR motifs (CRX1/CRX2) and a gene with no previously characterized domains (C5) were also identified. The presence/absence of nine of these genes was strongly related to pathogenicity against onion and all were shown to be expressed in planta. Different SIX gene complements were identified in other f. spp., but none were identified in three other Fusarium species from onion. Although the FOC SIX genes had a high level of homology with other f. spp., there were clear differences in sequences which were unique to FOC, whereas CRX1 and C5 genes appear to be largely FOC specific. PMID:26609905
Identification of pathogenicity-related genes in Fusarium oxysporum f. sp. cepae.

PubMed

Taylor, Andrew; Vágány, Viktória; Jackson, Alison C; Harrison, Richard J; Rainoni, Alessandro; Clarkson, John P

2016-09-01

Pathogenic isolates of Fusarium oxysporum, distinguished as formae speciales (f. spp.) on the basis of their host specificity, cause crown rots, root rots and vascular wilts on many important crops worldwide. Fusarium oxysporum f. sp. cepae (FOC) is particularly problematic to onion growers worldwide and is increasing in prevalence in the UK. We characterized 31 F. oxysporum isolates collected from UK onions using pathogenicity tests, sequencing of housekeeping genes and identification of effectors. In onion seedling and bulb tests, 21 isolates were pathogenic and 10 were non-pathogenic. The molecular characterization of these isolates, and 21 additional isolates comprising other f. spp. and different Fusarium species, was carried out by sequencing three housekeeping genes. A concatenated tree separated the F. oxysporum isolates into six clades, but did not distinguish between pathogenic and non-pathogenic isolates. Ten putative effectors were identified within FOC, including seven Secreted In Xylem (SIX) genes first reported in F. oxysporum f. sp. lycopersici. Two highly homologous proteins with signal peptides and RxLR motifs (CRX1/CRX2) and a gene with no previously characterized domains (C5) were also identified. The presence/absence of nine of these genes was strongly related to pathogenicity against onion and all were shown to be expressed in planta. Different SIX gene complements were identified in other f. spp., but none were identified in three other Fusarium species from onion. Although the FOC SIX genes had a high level of homology with other f. spp., there were clear differences in sequences which were unique to FOC, whereas CRX1 and C5 genes appear to be largely FOC specific. © 2015 The Authors Molecular Plant Pathology Published by British Society for Plant Pathology and John Wiley & Sons Ltd.
Bioinformatics analysis of gene expression profiles in B cells of postmenopausal osteoporosis patients.

PubMed

Ma, Min; Luo, Shulin; Zhou, Wei; Lu, Liangyu; Cai, Junfeng; Yuan, Feng; Yin, Feng

2017-04-01

The aim of this study was to gain a better understanding of the molecular mechanisms and identify more critical genes associated with the pathogenesis of postmenopausal osteoporosis (PMOP). Microarray data of GSE13850 were download from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) were identified either in B cells from postmenopausal female nonsmokers with high bone mineral density (BMD) compared with those with low BMD (defined as DEG1 group) or in B cells from postmenopausal female smokers with high BMD compared with postmenopausal female nonsmokers with high BMD (defined as DEG2 group). Gene ontology and immune-related functional enrichment analysis of DEGs were performed. Additionally, the protein-protein interaction network of all DEGs was constructed and subnetworks of the hub genes were extracted. A total of 51 DEGs were identified in the DEG1 group, including 30 up- and 21 downregulated genes. Besides, 86 DEGs were identified in the DEG2 group, of which 46 were upregulated and 40 were downregulated. Immune enrichment analysis showed DEGs were mainly enriched in functions of CD molecules and chemokines and receptor, and the upregulated gene interleukin 4 receptor (IL-4R) was significantly enriched. Moreover, guanine nucleotide-binding protein G (GNAI2), filamin A alpha (FLNA), and transforming growth factor-β1 (TGFB1) were hub proteins in the protein-protein interaction network. IL-4R, GNAI2, FLNA, and TGFB1 may be potential target genes associated with the pathogenesis of PMOP. In particular, FLNA, and TGFB1 may be affected by smoking, a risk factor of PMOP. Copyright © 2017. Published by Elsevier B.V.
Discovery and characterization of two new stem rust resistance genes in Aegilops sharonensis.

PubMed

Yu, Guotai; Champouret, Nicolas; Steuernagel, Burkhard; Olivera, Pablo D; Simmons, Jamie; Williams, Cole; Johnson, Ryan; Moscou, Matthew J; Hernández-Pinzón, Inmaculada; Green, Phon; Sela, Hanan; Millet, Eitan; Jones, Jonathan D G; Ward, Eric R; Steffenson, Brian J; Wulff, Brande B H

2017-06-01

We identified two novel wheat stem rust resistance genes, Sr-1644-1Sh and Sr-1644-5Sh in Aegilops sharonensis that are effective against widely virulent African races of the wheat stem rust pathogen. Stem rust is one of the most important diseases of wheat in the world. When single stem rust resistance (Sr) genes are deployed in wheat, they are often rapidly overcome by the pathogen. To this end, we initiated a search for novel sources of resistance in diverse wheat relatives and identified the wild goatgrass species Aegilops sharonesis (Sharon goatgrass) as a rich reservoir of resistance to wheat stem rust. The objectives of this study were to discover and map novel Sr genes in Ae. sharonensis and to explore the possibility of identifying new Sr genes by genome-wide association study (GWAS). We developed two biparental populations between resistant and susceptible accessions of Ae. sharonensis and performed QTL and linkage analysis. In an F 6 recombinant inbred line and an F 2 population, two genes were identified that mapped to the short arm of chromosome 1S sh , designated as Sr-1644-1Sh, and the long arm of chromosome 5S sh , designated as Sr-1644-5Sh. The gene Sr-1644-1Sh confers a high level of resistance to race TTKSK (a member of the Ug99 race group), while the gene Sr-1644-5Sh conditions strong resistance to TRTTF, another widely virulent race found in Yemen. Additionally, GWAS was conducted on 125 diverse Ae. sharonensis accessions for stem rust resistance. The gene Sr-1644-1Sh was detected by GWAS, while Sr-1644-5Sh was not detected, indicating that the effectiveness of GWAS might be affected by marker density, population structure, low allele frequency and other factors.
IL-32 is a molecular marker of a host defense network in human tuberculosis.

PubMed

Montoya, Dennis; Inkeles, Megan S; Liu, Phillip T; Realegeno, Susan; Teles, Rosane M B; Vaidya, Poorva; Munoz, Marcos A; Schenk, Mirjam; Swindell, William R; Chun, Rene; Zavala, Kathryn; Hewison, Martin; Adams, John S; Horvath, Steve; Pellegrini, Matteo; Bloom, Barry R; Modlin, Robert L

2014-08-20

Tuberculosis is a leading cause of infectious disease-related death worldwide; however, only 10% of people infected with Mycobacterium tuberculosis develop disease. Factors that contribute to protection could prove to be promising targets for M. tuberculosis therapies. Analysis of peripheral blood gene expression profiles of active tuberculosis patients has identified correlates of risk for disease or pathogenesis. We sought to identify potential human candidate markers of host defense by studying gene expression profiles of macrophages, cells that, upon infection by M. tuberculosis, can mount an antimicrobial response. Weighted gene coexpression network analysis revealed an association between the cytokine interleukin-32 (IL-32) and the vitamin D antimicrobial pathway in a network of interferon-γ- and IL-15-induced "defense response" genes. IL-32 induced the vitamin D-dependent antimicrobial peptides cathelicidin and DEFB4 and to generate antimicrobial activity in vitro, dependent on the presence of adequate 25-hydroxyvitamin D. In addition, the IL-15-induced defense response macrophage gene network was integrated with ranked pairwise comparisons of gene expression from five different clinical data sets of latent compared with active tuberculosis or healthy controls and a coexpression network derived from gene expression in patients with tuberculosis undergoing chemotherapy. Together, these analyses identified eight common genes, including IL-32, as molecular markers of latent tuberculosis and the IL-15-induced gene network. As maintaining M. tuberculosis in a latent state and preventing transition to active disease may represent a form of host resistance, these results identify IL-32 as one functional marker and potential correlate of protection against active tuberculosis. Copyright © 2014, American Association for the Advancement of Science.
Microarray RNA expression analysis of cerebral white matter lesions reveals changes in multiple functional pathways.

PubMed

Simpson, Julie E; Hosny, Ola; Wharton, Stephen B; Heath, Paul R; Holden, Hazel; Fernando, Malee S; Matthews, Fiona; Forster, Gill; O'Brien, John T; Barber, Robert; Kalaria, Raj N; Brayne, Carol; Shaw, Pamela J; Lewis, Claire E; Ince, Paul G

2009-02-01

White matter lesions (WML) in brain aging are linked to dementia and depression. Ischemia contributes to their pathogenesis but other mechanisms may contribute. We used RNA microarray analysis with functional pathway grouping as an unbiased approach to investigate evidence for additional pathogenetic mechanisms. WML were identified by MRI and pathology in brains donated to the Medical Research Council Cognitive Function and Ageing Study Cognitive Function and Aging Study. RNA was extracted to compare WML with nonlesional white matter samples from cases with lesions (WM[L]), and from cases with no lesions (WM[C]) using RNA microarray and pathway analysis. Functional pathways were validated for selected genes by quantitative real-time polymerase chain reaction and immunocytochemistry. We identified 8 major pathways in which multiple genes showed altered RNA transcription (immune regulation, cell cycle, apoptosis, proteolysis, ion transport, cell structure, electron transport, metabolism) among 502 genes that were differentially expressed in WML compared to WM[C]. In WM[L], 409 genes were altered involving the same pathways. Genes selected to validate this microarray data all showed the expected changes in RNA levels and immunohistochemical expression of protein. WML represent areas with a complex molecular phenotype. From this and previous evidence, WML may arise through tissue ischemia but may also reflect the contribution of additional factors like blood-brain barrier dysfunction. Differential expression of genes in WM[L] compared to WM[C] indicate a "field effect" in the seemingly normal surrounding white matter.
Genomic characterization of a new endophytic Streptomyces kebangsaanensis identifies biosynthetic pathway gene clusters for novel phenazine antibiotic production

PubMed Central

Remali, Juwairiah; Sarmin, Nurul ‘Izzah Mohd; Ng, Chyan Leong; Tiong, John J.L.; Aizat, Wan M.; Keong, Loke Kok

2017-01-01

Background Streptomyces are well known for their capability to produce many bioactive secondary metabolites with medical and industrial importance. Here we report a novel bioactive phenazine compound, 6-((2-hydroxy-4-methoxyphenoxy) carbonyl) phenazine-1-carboxylic acid (HCPCA) extracted from Streptomyces kebangsaanensis, an endophyte isolated from the ethnomedicinal Portulaca oleracea. Methods The HCPCA chemical structure was determined using nuclear magnetic resonance spectroscopy. We conducted whole genome sequencing for the identification of the gene cluster(s) believed to be responsible for phenazine biosynthesis in order to map its corresponding pathway, in addition to bioinformatics analysis to assess the potential of S. kebangsaanensis in producing other useful secondary metabolites. Results The S. kebangsaanensis genome comprises an 8,328,719 bp linear chromosome with high GC content (71.35%) consisting of 12 rRNA operons, 81 tRNA, and 7,558 protein coding genes. We identified 24 gene clusters involved in polyketide, nonribosomal peptide, terpene, bacteriocin, and siderophore biosynthesis, as well as a gene cluster predicted to be responsible for phenazine biosynthesis. Discussion The HCPCA phenazine structure was hypothesized to derive from the combination of two biosynthetic pathways, phenazine-1,6-dicarboxylic acid and 4-methoxybenzene-1,2-diol, originated from the shikimic acid pathway. The identification of a biosynthesis pathway gene cluster for phenazine antibiotics might facilitate future genetic engineering design of new synthetic phenazine antibiotics. Additionally, these findings confirm the potential of S. kebangsaanensis for producing various antibiotics and secondary metabolites. PMID:29201559
No Association of BDNF, COMT, MAOA, SLC6A3, and SLC6A4 Genes and Depressive Symptoms in a Sample of Healthy Colombian Subjects.

PubMed

González-Giraldo, Yeimy; Camargo, Andrés; López-León, Sandra; Forero, Diego A

2015-01-01

Background. Major depressive disorder (MDD) is the second cause of years lived with disability around the world. A large number of studies have been carried out to identify genetic risk factors for MDD and related endophenotypes, mainly in populations of European and Asian descent, with conflicting results. The main aim of the current study was to analyze the possible association of five candidate genes and depressive symptoms in a Colombian sample of healthy subjects. Methods and Materials. The Spanish adaptation of the Hospital Anxiety and Depression Scale (HADS) was applied to one hundred eighty-eight healthy Colombian subjects. Five functional polymorphisms were genotyped using PCR-based assays: BDNF-Val66Met (rs6265), COMT-Val158Met (rs4680), SLC6A4-HTTLPR (rs4795541), MAOA-uVNTR, and SLC6A3-VNTR (rs28363170). Result. We did not find significant associations with scores of depressive symptoms, derived from the HADS, for any of the five candidate genes (nominal p values >0.05). In addition, we did not find evidence of significant gene-gene interactions. Conclusion. This work is one of the first studies of candidate genes for depressive symptoms in a Latin American sample. Study of additional genetic and epigenetic variants, taking into account other pathophysiological theories, will help to identify novel candidates for MDD in populations around the world.
Evaluating intra- and inter-individual variation in the human placental transcriptome.

PubMed

Hughes, David A; Kircher, Martin; He, Zhisong; Guo, Song; Fairbrother, Genevieve L; Moreno, Carlos S; Khaitovich, Philipp; Stoneking, Mark

2015-03-19

Gene expression variation is a phenotypic trait of particular interest as it represents the initial link between genotype and other phenotypes. Analyzing how such variation apportions among and within groups allows for the evaluation of how genetic and environmental factors influence such traits. It also provides opportunities to identify genes and pathways that may have been influenced by non-neutral processes. Here we use a population genetics framework and next generation sequencing to evaluate how gene expression variation is apportioned among four human groups in a natural biological tissue, the placenta. We estimate that on average, 33.2%, 58.9%, and 7.8% of the placental transcriptome is explained by variation within individuals, among individuals, and among human groups, respectively. Additionally, when technical and biological traits are included in models of gene expression they each account for roughly 2% of total gene expression variation. Notably, the variation that is significantly different among groups is enriched in biological pathways associated with immune response, cell signaling, and metabolism. Many biological traits demonstrate correlated changes in expression in numerous pathways of potential interest to clinicians and evolutionary biologists. Finally, we estimate that the majority of the human placental transcriptome exhibits expression profiles consistent with neutrality; the remainder are consistent with stabilizing selection, directional selection, or diversifying selection. We apportion placental gene expression variation into individual, population, and biological trait factors and identify how each influence the transcriptome. Additionally, we advance methods to associate expression profiles with different forms of selection.
Reranking candidate gene models with cross-species comparison for improved gene prediction

PubMed Central

Liu, Qian; Crammer, Koby; Pereira, Fernando CN; Roos, David S

2008-01-01

Background Most gene finders score candidate gene models with state-based methods, typically HMMs, by combining local properties (coding potential, splice donor and acceptor patterns, etc). Competing models with similar state-based scores may be distinguishable with additional information. In particular, functional and comparative genomics datasets may help to select among competing models of comparable probability by exploiting features likely to be associated with the correct gene models, such as conserved exon/intron structure or protein sequence features. Results We have investigated the utility of a simple post-processing step for selecting among a set of alternative gene models, using global scoring rules to rerank competing models for more accurate prediction. For each gene locus, we first generate the K best candidate gene models using the gene finder Evigan, and then rerank these models using comparisons with putative orthologous genes from closely-related species. Candidate gene models with lower scores in the original gene finder may be selected if they exhibit strong similarity to probable orthologs in coding sequence, splice site location, or signal peptide occurrence. Experiments on Drosophila melanogaster demonstrate that reranking based on cross-species comparison outperforms the best gene models identified by Evigan alone, and also outperforms the comparative gene finders GeneWise and Augustus+. Conclusion Reranking gene models with cross-species comparison improves gene prediction accuracy. This straightforward method can be readily adapted to incorporate additional lines of evidence, as it requires only a ranked source of candidate gene models. PMID:18854050
Mutagenesis Screen Identifies agtpbp1 and eps15L1 as Essential for T lymphocyte Development in Zebrafish.

PubMed

Seiler, Christoph; Gebhart, Nichole; Zhang, Yong; Shinton, Susan A; Li, Yue-sheng; Ross, Nicola L; Liu, Xingjun; Li, Qin; Bilbee, Alison N; Varshney, Gaurav K; LaFave, Matthew C; Burgess, Shawn M; Balciuniene, Jorune; Balciunas, Darius; Hardy, Richard R; Kappes, Dietmar J; Wiest, David L; Rhodes, Jennifer

2015-01-01

Genetic screens are a powerful tool to discover genes that are important in immune cell development and function. The evolutionarily conserved development of lymphoid cells paired with the genetic tractability of zebrafish make this a powerful model system for this purpose. We used a Tol2-based gene-breaking transposon to induce mutations in the zebrafish (Danio rerio, AB strain) genome, which served the dual purpose of fluorescently tagging cells and tissues that express the disrupted gene and provided a means of identifying the disrupted gene. We identified 12 lines in which hematopoietic tissues expressed green fluorescent protein (GFP) during embryonic development, as detected by microscopy. Subsequent analysis of young adult fish, using a novel approach in which single cell suspensions of whole fish were analyzed by flow cytometry, revealed that 8 of these lines also exhibited GFP expression in young adult cells. An additional 15 lines that did not have embryonic GFP+ hematopoietic tissue by microscopy, nevertheless exhibited GFP+ cells in young adults. RT-PCR analysis of purified GFP+ populations for expression of T and B cell-specific markers identified 18 lines in which T and/or B cells were fluorescently tagged at 6 weeks of age. As transposon insertion is expected to cause gene disruption, these lines can be used to assess the requirement for the disrupted genes in immune cell development. Focusing on the lines with embryonic GFP+ hematopoietic tissue, we identified three lines in which homozygous mutants exhibited impaired T cell development at 6 days of age. In two of the lines we identified the disrupted genes, agtpbp1 and eps15L1. Morpholino-mediated knockdown of these genes mimicked the T cell defects in the corresponding mutant embryos, demonstrating the previously unrecognized, essential roles of agtpbp1 and eps15L1 in T cell development.

Hundreds of variants clustered in genomic loci and biological pathways affect human height

PubMed Central

Lango Allen, Hana; Estrada, Karol; Lettre, Guillaume; Berndt, Sonja I.; Weedon, Michael N.; Rivadeneira, Fernando; Willer, Cristen J.; Jackson, Anne U.; Vedantam, Sailaja; Raychaudhuri, Soumya; Ferreira, Teresa; Wood, Andrew R.; Weyant, Robert J.; Segrè, Ayellet V.; Speliotes, Elizabeth K.; Wheeler, Eleanor; Soranzo, Nicole; Park, Ju-Hyun; Yang, Jian; Gudbjartsson, Daniel; Heard-Costa, Nancy L.; Randall, Joshua C.; Qi, Lu; Smith, Albert Vernon; Mägi, Reedik; Pastinen, Tomi; Liang, Liming; Heid, Iris M.; Luan, Jian'an; Thorleifsson, Gudmar; Winkler, Thomas W.; Goddard, Michael E.; Lo, Ken Sin; Palmer, Cameron; Workalemahu, Tsegaselassie; Aulchenko, Yurii S.; Johansson, Åsa; Zillikens, M.Carola; Feitosa, Mary F.; Esko, Tõnu; Johnson, Toby; Ketkar, Shamika; Kraft, Peter; Mangino, Massimo; Prokopenko, Inga; Absher, Devin; Albrecht, Eva; Ernst, Florian; Glazer, Nicole L.; Hayward, Caroline; Hottenga, Jouke-Jan; Jacobs, Kevin B.; Knowles, Joshua W.; Kutalik, Zoltán; Monda, Keri L.; Polasek, Ozren; Preuss, Michael; Rayner, Nigel W.; Robertson, Neil R.; Steinthorsdottir, Valgerdur; Tyrer, Jonathan P.; Voight, Benjamin F.; Wiklund, Fredrik; Xu, Jianfeng; Zhao, Jing Hua; Nyholt, Dale R.; Pellikka, Niina; Perola, Markus; Perry, John R.B.; Surakka, Ida; Tammesoo, Mari-Liis; Altmaier, Elizabeth L.; Amin, Najaf; Aspelund, Thor; Bhangale, Tushar; Boucher, Gabrielle; Chasman, Daniel I.; Chen, Constance; Coin, Lachlan; Cooper, Matthew N.; Dixon, Anna L.; Gibson, Quince; Grundberg, Elin; Hao, Ke; Junttila, M. Juhani; Kaplan, Lee M.; Kettunen, Johannes; König, Inke R.; Kwan, Tony; Lawrence, Robert W.; Levinson, Douglas F.; Lorentzon, Mattias; McKnight, Barbara; Morris, Andrew P.; Müller, Martina; Ngwa, Julius Suh; Purcell, Shaun; Rafelt, Suzanne; Salem, Rany M.; Salvi, Erika; Sanna, Serena; Shi, Jianxin; Sovio, Ulla; Thompson, John R.; Turchin, Michael C.; Vandenput, Liesbeth; Verlaan, Dominique J.; Vitart, Veronique; White, Charles C.; Ziegler, Andreas; Almgren, Peter; Balmforth, Anthony J.; Campbell, Harry; Citterio, Lorena; De Grandi, Alessandro; Dominiczak, Anna; Duan, Jubao; Elliott, Paul; Elosua, Roberto; Eriksson, Johan G.; Freimer, Nelson B.; Geus, Eco J.C.; Glorioso, Nicola; Haiqing, Shen; Hartikainen, Anna-Liisa; Havulinna, Aki S.; Hicks, Andrew A.; Hui, Jennie; Igl, Wilmar; Illig, Thomas; Jula, Antti; Kajantie, Eero; Kilpeläinen, Tuomas O.; Koiranen, Markku; Kolcic, Ivana; Koskinen, Seppo; Kovacs, Peter; Laitinen, Jaana; Liu, Jianjun; Lokki, Marja-Liisa; Marusic, Ana; Maschio, Andrea; Meitinger, Thomas; Mulas, Antonella; Paré, Guillaume; Parker, Alex N.; Peden, John F.; Petersmann, Astrid; Pichler, Irene; Pietiläinen, Kirsi H.; Pouta, Anneli; Ridderstråle, Martin; Rotter, Jerome I.; Sambrook, Jennifer G.; Sanders, Alan R.; Schmidt, Carsten Oliver; Sinisalo, Juha; Smit, Jan H.; Stringham, Heather M.; Walters, G.Bragi; Widen, Elisabeth; Wild, Sarah H.; Willemsen, Gonneke; Zagato, Laura; Zgaga, Lina; Zitting, Paavo; Alavere, Helene; Farrall, Martin; McArdle, Wendy L.; Nelis, Mari; Peters, Marjolein J.; Ripatti, Samuli; van Meurs, Joyce B.J.; Aben, Katja K.; Ardlie, Kristin G; Beckmann, Jacques S.; Beilby, John P.; Bergman, Richard N.; Bergmann, Sven; Collins, Francis S.; Cusi, Daniele; den Heijer, Martin; Eiriksdottir, Gudny; Gejman, Pablo V.; Hall, Alistair S.; Hamsten, Anders; Huikuri, Heikki V.; Iribarren, Carlos; Kähönen, Mika; Kaprio, Jaakko; Kathiresan, Sekar; Kiemeney, Lambertus; Kocher, Thomas; Launer, Lenore J.; Lehtimäki, Terho; Melander, Olle; Mosley, Tom H.; Musk, Arthur W.; Nieminen, Markku S.; O'Donnell, Christopher J.; Ohlsson, Claes; Oostra, Ben; Palmer, Lyle J.; Raitakari, Olli; Ridker, Paul M.; Rioux, John D.; Rissanen, Aila; Rivolta, Carlo; Schunkert, Heribert; Shuldiner, Alan R.; Siscovick, David S.; Stumvoll, Michael; Tönjes, Anke; Tuomilehto, Jaakko; van Ommen, Gert-Jan; Viikari, Jorma; Heath, Andrew C.; Martin, Nicholas G.; Montgomery, Grant W.; Province, Michael A.; Kayser, Manfred; Arnold, Alice M.; Atwood, Larry D.; Boerwinkle, Eric; Chanock, Stephen J.; Deloukas, Panos; Gieger, Christian; Grönberg, Henrik; Hall, Per; Hattersley, Andrew T.; Hengstenberg, Christian; Hoffman, Wolfgang; Lathrop, G.Mark; Salomaa, Veikko; Schreiber, Stefan; Uda, Manuela; Waterworth, Dawn; Wright, Alan F.; Assimes, Themistocles L.; Barroso, Inês; Hofman, Albert; Mohlke, Karen L.; Boomsma, Dorret I.; Caulfield, Mark J.; Cupples, L.Adrienne; Erdmann, Jeanette; Fox, Caroline S.; Gudnason, Vilmundur; Gyllensten, Ulf; Harris, Tamara B.; Hayes, Richard B.; Jarvelin, Marjo-Riitta; Mooser, Vincent; Munroe, Patricia B.; Ouwehand, Willem H.; Penninx, Brenda W.; Pramstaller, Peter P.; Quertermous, Thomas; Rudan, Igor; Samani, Nilesh J.; Spector, Timothy D.; Völzke, Henry; Watkins, Hugh; Wilson, James F.; Groop, Leif C.; Haritunians, Talin; Hu, Frank B.; Kaplan, Robert C.; Metspalu, Andres; North, Kari E.; Schlessinger, David; Wareham, Nicholas J.; Hunter, David J.; O'Connell, Jeffrey R.; Strachan, David P.; Wichmann, H.-Erich; Borecki, Ingrid B.; van Duijn, Cornelia M.; Schadt, Eric E.; Thorsteinsdottir, Unnur; Peltonen, Leena; Uitterlinden, André; Visscher, Peter M.; Chatterjee, Nilanjan; Loos, Ruth J.F.; Boehnke, Michael; McCarthy, Mark I.; Ingelsson, Erik; Lindgren, Cecilia M.; Abecasis, Gonçalo R.; Stefansson, Kari; Frayling, Timothy M.; Hirschhorn, Joel N

2010-01-01

Most common human traits and diseases have a polygenic pattern of inheritance: DNA sequence variants at many genetic loci influence phenotype. Genome-wide association (GWA) studies have identified >600 variants associated with human traits1, but these typically explain small fractions of phenotypic variation, raising questions about the utility of further studies. Here, using 183,727 individuals, we show that hundreds of genetic variants, in at least 180 loci, influence adult height, a highly heritable and classic polygenic trait2,3. The large number of loci reveals patterns with important implications for genetic studies of common human diseases and traits. First, the 180 loci are not random, but instead are enriched for genes that are connected in biological pathways (P=0.016), and that underlie skeletal growth defects (P<0.001). Second, the likely causal gene is often located near the most strongly associated variant: in 13 of 21 loci containing a known skeletal growth gene, that gene was closest to the associated variant. Third, at least 19 loci have multiple independently associated variants, suggesting that allelic heterogeneity is a frequent feature of polygenic traits, that comprehensive explorations of already-discovered loci should discover additional variants, and that an appreciable fraction of associated loci may have been identified. Fourth, associated variants are enriched for likely functional effects on genes, being over-represented amongst variants that alter amino acid structure of proteins and expression levels of nearby genes. Our data explain ∼10% of the phenotypic variation in height, and we estimate that unidentified common variants of similar effect sizes would increase this figure to ∼16% of phenotypic variation (∼20% of heritable variation). Although additional approaches are needed to fully dissect the genetic architecture of polygenic human traits, our findings indicate that GWA studies can identify large numbers of loci that implicate biologically relevant genes and pathways. PMID:20881960
Unique features of a global human ectoparasite identified through sequencing of the bed bug genome.

PubMed

Benoit, Joshua B; Adelman, Zach N; Reinhardt, Klaus; Dolan, Amanda; Poelchau, Monica; Jennings, Emily C; Szuter, Elise M; Hagan, Richard W; Gujar, Hemant; Shukla, Jayendra Nath; Zhu, Fang; Mohan, M; Nelson, David R; Rosendale, Andrew J; Derst, Christian; Resnik, Valentina; Wernig, Sebastian; Menegazzi, Pamela; Wegener, Christian; Peschel, Nicolai; Hendershot, Jacob M; Blenau, Wolfgang; Predel, Reinhard; Johnston, Paul R; Ioannidis, Panagiotis; Waterhouse, Robert M; Nauen, Ralf; Schorn, Corinna; Ott, Mark-Christoph; Maiwald, Frank; Johnston, J Spencer; Gondhalekar, Ameya D; Scharf, Michael E; Peterson, Brittany F; Raje, Kapil R; Hottel, Benjamin A; Armisén, David; Crumière, Antonin Jean Johan; Refki, Peter Nagui; Santos, Maria Emilia; Sghaier, Essia; Viala, Sèverine; Khila, Abderrahman; Ahn, Seung-Joon; Childers, Christopher; Lee, Chien-Yueh; Lin, Han; Hughes, Daniel S T; Duncan, Elizabeth J; Murali, Shwetha C; Qu, Jiaxin; Dugan, Shannon; Lee, Sandra L; Chao, Hsu; Dinh, Huyen; Han, Yi; Doddapaneni, Harshavardhan; Worley, Kim C; Muzny, Donna M; Wheeler, David; Panfilio, Kristen A; Vargas Jentzsch, Iris M; Vargo, Edward L; Booth, Warren; Friedrich, Markus; Weirauch, Matthew T; Anderson, Michelle A E; Jones, Jeffery W; Mittapalli, Omprakash; Zhao, Chaoyang; Zhou, Jing-Jiang; Evans, Jay D; Attardo, Geoffrey M; Robertson, Hugh M; Zdobnov, Evgeny M; Ribeiro, Jose M C; Gibbs, Richard A; Werren, John H; Palli, Subba R; Schal, Coby; Richards, Stephen

2016-02-02

The bed bug, Cimex lectularius, has re-established itself as a ubiquitous human ectoparasite throughout much of the world during the past two decades. This global resurgence is likely linked to increased international travel and commerce in addition to widespread insecticide resistance. Analyses of the C. lectularius sequenced genome (650 Mb) and 14,220 predicted protein-coding genes provide a comprehensive representation of genes that are linked to traumatic insemination, a reduced chemosensory repertoire of genes related to obligate hematophagy, host-symbiont interactions, and several mechanisms of insecticide resistance. In addition, we document the presence of multiple putative lateral gene transfer events. Genome sequencing and annotation establish a solid foundation for future research on mechanisms of insecticide resistance, human-bed bug and symbiont-bed bug associations, and unique features of bed bug biology that contribute to the unprecedented success of C. lectularius as a human ectoparasite.
Unique features of a global human ectoparasite identified through sequencing of the bed bug genome

PubMed Central

Benoit, Joshua B.; Adelman, Zach N.; Reinhardt, Klaus; Dolan, Amanda; Poelchau, Monica; Jennings, Emily C.; Szuter, Elise M.; Hagan, Richard W.; Gujar, Hemant; Shukla, Jayendra Nath; Zhu, Fang; Mohan, M.; Nelson, David R.; Rosendale, Andrew J.; Derst, Christian; Resnik, Valentina; Wernig, Sebastian; Menegazzi, Pamela; Wegener, Christian; Peschel, Nicolai; Hendershot, Jacob M.; Blenau, Wolfgang; Predel, Reinhard; Johnston, Paul R.; Ioannidis, Panagiotis; Waterhouse, Robert M.; Nauen, Ralf; Schorn, Corinna; Ott, Mark-Christoph; Maiwald, Frank; Johnston, J. Spencer; Gondhalekar, Ameya D.; Scharf, Michael E.; Peterson, Brittany F.; Raje, Kapil R.; Hottel, Benjamin A.; Armisén, David; Crumière, Antonin Jean Johan; Refki, Peter Nagui; Santos, Maria Emilia; Sghaier, Essia; Viala, Sèverine; Khila, Abderrahman; Ahn, Seung-Joon; Childers, Christopher; Lee, Chien-Yueh; Lin, Han; Hughes, Daniel S. T.; Duncan, Elizabeth J.; Murali, Shwetha C.; Qu, Jiaxin; Dugan, Shannon; Lee, Sandra L.; Chao, Hsu; Dinh, Huyen; Han, Yi; Doddapaneni, Harshavardhan; Worley, Kim C.; Muzny, Donna M.; Wheeler, David; Panfilio, Kristen A.; Vargas Jentzsch, Iris M.; Vargo, Edward L.; Booth, Warren; Friedrich, Markus; Weirauch, Matthew T.; Anderson, Michelle A. E.; Jones, Jeffery W.; Mittapalli, Omprakash; Zhao, Chaoyang; Zhou, Jing-Jiang; Evans, Jay D.; Attardo, Geoffrey M.; Robertson, Hugh M.; Zdobnov, Evgeny M.; Ribeiro, Jose M. C.; Gibbs, Richard A.; Werren, John H.; Palli, Subba R.; Schal, Coby; Richards, Stephen

2016-01-01

The bed bug, Cimex lectularius, has re-established itself as a ubiquitous human ectoparasite throughout much of the world during the past two decades. This global resurgence is likely linked to increased international travel and commerce in addition to widespread insecticide resistance. Analyses of the C. lectularius sequenced genome (650 Mb) and 14,220 predicted protein-coding genes provide a comprehensive representation of genes that are linked to traumatic insemination, a reduced chemosensory repertoire of genes related to obligate hematophagy, host–symbiont interactions, and several mechanisms of insecticide resistance. In addition, we document the presence of multiple putative lateral gene transfer events. Genome sequencing and annotation establish a solid foundation for future research on mechanisms of insecticide resistance, human–bed bug and symbiont–bed bug associations, and unique features of bed bug biology that contribute to the unprecedented success of C. lectularius as a human ectoparasite. PMID:26836814
Genome-Wide association study identifies candidate genes for Parkinson's disease in an Ashkenazi Jewish population

PubMed Central

2011-01-01

Background To date, nine Parkinson disease (PD) genome-wide association studies in North American, European and Asian populations have been published. The majority of studies have confirmed the association of the previously identified genetic risk factors, SNCA and MAPT, and two studies have identified three new PD susceptibility loci/genes (PARK16, BST1 and HLA-DRB5). In a recent meta-analysis of datasets from five of the published PD GWAS an additional 6 novel candidate genes (SYT11, ACMSD, STK39, MCCC1/LAMP3, GAK and CCDC62/HIP1R) were identified. Collectively the associations identified in these GWAS account for only a small proportion of the estimated total heritability of PD suggesting that an 'unknown' component of the genetic architecture of PD remains to be identified. Methods We applied a GWAS approach to a relatively homogeneous Ashkenazi Jewish (AJ) population from New York to search for both 'rare' and 'common' genetic variants that confer risk of PD by examining any SNPs with allele frequencies exceeding 2%. We have focused on a genetic isolate, the AJ population, as a discovery dataset since this cohort has a higher sharing of genetic background and historically experienced a significant bottleneck. We also conducted a replication study using two publicly available datasets from dbGaP. The joint analysis dataset had a combined sample size of 2,050 cases and 1,836 controls. Results We identified the top 57 SNPs showing the strongest evidence of association in the AJ dataset (p < 9.9 × 10-5). Six SNPs located within gene regions had positive signals in at least one other independent dbGaP dataset: LOC100505836 (Chr3p24), LOC153328/SLC25A48 (Chr5q31.1), UNC13B (9p13.3), SLCO3A1(15q26.1), WNT3(17q21.3) and NSF (17q21.3). We also replicated published associations for the gene regions SNCA (Chr4q21; rs3775442, p = 0.037), PARK16 (Chr1q32.1; rs823114 (NUCKS1), p = 6.12 × 10-4), BST1 (Chr4p15; rs12502586, p = 0.027), STK39 (Chr2q24.3; rs3754775, p = 0.005), and LAMP3 (Chr3; rs12493050, p = 0.005) in addition to the two most common PD susceptibility genes in the AJ population LRRK2 (Chr12q12; rs34637584, p = 1.56 × 10-4) and GBA (Chr1q21; rs2990245, p = 0.015). Conclusions We have demonstrated the utility of the AJ dataset in PD candidate gene and SNP discovery both by replication in dbGaP datasets with a larger sample size and by replicating association of previously identified PD susceptibility genes. Our GWAS study has identified candidate gene regions for PD that are implicated in neuronal signalling and the dopamine pathway. PMID:21812969
Genome-wide analysis of drought induced gene expression changes in flax (Linum usitatissimum).

PubMed

Dash, Prasanta K; Cao, Yongguo; Jailani, Abdul K; Gupta, Payal; Venglat, Prakash; Xiang, Daoquan; Rai, Rhitu; Sharma, Rinku; Thirunavukkarasu, Nepolean; Abdin, Malik Z; Yadava, Devendra K; Singh, Nagendra K; Singh, Jas; Selvaraj, Gopalan; Deyholos, Mike; Kumar, Polumetla Ananda; Datla, Raju

2014-01-01

A robust phenotypic plasticity to ward off adverse environmental conditions determines performance and productivity in crop plants. Flax (linseed), is an important cash crop produced for natural textile fiber (linen) or oilseed with many health promoting products. This crop is prone to drought stress and yield losses in many parts of the world. Despite recent advances in drought research in a number of important crops, related progress in flax is very limited. Since, response of this plant to drought stress has not been addressed at the molecular level; we conducted microarray analysis to capture transcriptome associated with induced drought in flax. This study identified 183 differentially expressed genes (DEGs) associated with diverse cellular, biophysical and metabolic programs in flax. The analysis also revealed especially the altered regulation of cellular and metabolic pathways governing photosynthesis. Additionally, comparative transcriptome analysis identified a plethora of genes that displayed differential regulation both spatially and temporally. These results revealed co-regulated expression of 26 genes in both shoot and root tissues with implications for drought stress response. Furthermore, the data also showed that more genes are upregulated in roots compared to shoots, suggesting that roots may play important and additional roles in response to drought in flax. With prolonged drought treatment, the number of DEGs increased in both tissue types. Differential expression of selected genes was confirmed by qRT-PCR, thus supporting the suggested functional association of these intrinsic genes in maintaining growth and homeostasis in response to imminent drought stress in flax. Together the present study has developed foundational and new transcriptome data sets for drought stress in flax.
Bioinformatics evidence for the transfer of mycosporine-like amino acid core (4-deoxygadusol) synthesizing gene from cyanobacteria to dinoflagellates and an attempt to mutate the same gene (YP_324358) in Anabaena variabilis PCC 7937.

PubMed

Singh, Shailendra P; Häder, Donat-P; Sinha, Rajeshwar P

2012-06-01

We have identified a homologue of 4-deoxygadusol (core of mycosporine-like amino acids) synthesizing gene (ZP_05036788) from Synechococcus sp. PCC 7335 that was found to have additional functionally unknown N-terminal domain similar to homologues from dinoflagellates based on the ClustalW analysis. Phylogenetic analysis revealed that Synechococcus sp. (ZP_05036788) makes a clade together with dinoflagellates and was closest to the Oxyrrhis marina. This study shows for the first time that N-terminal additional sequences that possess upstream plastid targeting sequence in Heterocapsa triquetra and Karlodinium micrum were already evolved in cyanobacteria, and plastid targeting sequence were evolved later in dinoflagellates after divergence from chloroplast lacking Oxyrrhis marina. Thus, MAAs synthesizing genes were transferred from cyanobacteria to dinoflagellates and possibly Synechococcus sp. PCC 7335 acted as a donor during lateral gene transfer event. In addition, we also tried to mutate 4-deoxygadusol synthesizing gene (YP_324358) of Anabaena variabilis PCC 7937 by homologous recombination, however, all approaches to get complete segregation of the mutants from the wild-type were unsuccessful, showing the essentiality of YP_324358 for A. variabilis PCC 7937. Copyright © 2012 Elsevier B.V. All rights reserved.
The uncharacterized transcription factor YdhM is the regulator of the nemA gene, encoding N-ethylmaleimide reductase.

PubMed

Umezawa, Yoshimasa; Shimada, Tomohiro; Kori, Ayako; Yamada, Kayoko; Ishihama, Akira

2008-09-01

N-ethylmaleimide (NEM) has been used as a specific reagent of Cys modification in proteins and thus is toxic for cell growth. On the Escherichia coli genome, the nemA gene coding for NEM reductase is located downstream of the gene encoding an as-yet-uncharacterized transcription factor, YdhM. Disruption of the ydhM gene results in reduction of nemA expression even in the induced state, indicating that the two genes form a single operon. After in vitro genomic SELEX screening, one of the target recognition sequences for YdhM was identified within the promoter region for this ydhM-nemA operon. Both YdhM binding in vitro to the ydhM promoter region and transcription repression in vivo of the ydhM-nemA operon by YdhM were markedly reduced by the addition of NEM. Taken together, we propose that YdhM is the repressor for the nemA gene, thus hereafter designated NemR. The repressor function of NemR was inactivated by the addition of not only NEM but also other Cys modification reagents, implying that Cys modification of NemR renders it inactive. This is an addition to the mode of controlling activity of transcription factors by alkylation with chemical agents.
Unique disease heritage of the Dutch-German Mennonite population.

PubMed

Orton, Noelle C; Innes, A Micheil; Chudley, Albert E; Bech-Hansen, N Torben

2008-04-15

The Dutch-German Mennonites are a religious isolate with foundational roots in the 16th century. A tradition of endogamy, large families, detailed genealogical records, and a unique disease history all contribute to making this a valuable population for genetic studies. Such studies in the Dutch-German Mennonite population have already contributed to the identification of the causative genes in several conditions such as the incomplete form of X-linked congenital stationary night blindness (CSNB2; previously iCSNB) and hypophosphatasia (HOPS), as well as the discovery of founder mutations within established disease genes (MYBPC1, CYP17alpha). The Dutch-German Mennonite population provides a strong resource for gene discovery and could lead to the identification of additional disease genes with relevance to the general population. In addition, further research developments should enhance delivery of clinical genetic services to this unique community. In the current review we discuss 31 genetic conditions, including 17 with identified gene mutations, within the Dutch-German Mennonite population. Copyright 2008 Wiley-Liss, Inc.
Insight into the molecular genetics of myopia

PubMed Central

Li, Jiali

2017-01-01

Myopia is the most common cause of visual impairment worldwide. Genetic and environmental factors contribute to the development of myopia. Studies on the molecular genetics of myopia are well established and have implicated the important role of genetic factors. With linkage analysis, association studies, sequencing analysis, and experimental myopia studies, many of the loci and genes associated with myopia have been identified. Thus far, there has been no systemic review of the loci and genes related to non-syndromic and syndromic myopia based on the different approaches. Such a systemic review of the molecular genetics of myopia will provide clues to identify additional plausible genes for myopia and help us to understand the molecular mechanisms underlying myopia. This paper reviews recent genetic studies on myopia, summarizes all possible reported genes and loci related to myopia, and suggests implications for future studies on the molecular genetics of myopia. PMID:29386878
Mutational Survey of the PHEX Gene in Patients with X-linked Hypophosphatemic Rickets

PubMed Central

Ichikawa, Shoji; Traxler, Elizabeth A.; Estwick, Selina A.; Curry, Leah R.; Johnson, Michelle L.; Sorenson, Andrea H.; Imel, Erik A.; Econs, Michael J.

2008-01-01

X-linked hypophosphatemic rickets (XLH) is a dominantly inherited disorder characterized by renal phosphate wasting, aberrant vitamin D metabolism, and abnormal bone mineralization. XLH is caused by inactivating mutations in PHEX (phosphate-regulating gene with homologies to endopeptidases on the X chromosome). In this study, we sequenced the PHEX gene in subjects from 26 kindreds who were clinically diagnosed with XLH. Sequencing revealed 18 different mutations, of which thirteen have not been reported previously. In addition to deletions, splice site mutations, and missense and nonsense mutations, a rare point mutation in the 3’-untranslated region (3’-UTR) was identified as a novel cause of XLH. In summary, we identified a wide spectrum of mutations in the PHEX gene. Our data, in accord with those of others, indicate that there is no single predominant PHEX mutation responsible for XLH. PMID:18625346
Experience of targeted Usher exome sequencing as a clinical test

PubMed Central

Besnard, Thomas; García-García, Gema; Baux, David; Vaché, Christel; Faugère, Valérie; Larrieu, Lise; Léonard, Susana; Millan, Jose M; Malcolm, Sue; Claustres, Mireille; Roux, Anne-Françoise

2014-01-01

We show that massively parallel targeted sequencing of 19 genes provides a new and reliable strategy for molecular diagnosis of Usher syndrome (USH) and nonsyndromic deafness, particularly appropriate for these disorders characterized by a high clinical and genetic heterogeneity and a complex structure of several of the genes involved. A series of 71 patients including Usher patients previously screened by Sanger sequencing plus newly referred patients was studied. Ninety-eight percent of the variants previously identified by Sanger sequencing were found by next-generation sequencing (NGS). NGS proved to be efficient as it offers analysis of all relevant genes which is laborious to reach with Sanger sequencing. Among the 13 newly referred Usher patients, both mutations in the same gene were identified in 77% of cases (10 patients) and one candidate pathogenic variant in two additional patients. This work can be considered as pilot for implementing NGS for genetically heterogeneous diseases in clinical service. PMID:24498627
Insight into the molecular genetics of myopia.

PubMed

Li, Jiali; Zhang, Qingjiong

2017-01-01

Myopia is the most common cause of visual impairment worldwide. Genetic and environmental factors contribute to the development of myopia. Studies on the molecular genetics of myopia are well established and have implicated the important role of genetic factors. With linkage analysis, association studies, sequencing analysis, and experimental myopia studies, many of the loci and genes associated with myopia have been identified. Thus far, there has been no systemic review of the loci and genes related to non-syndromic and syndromic myopia based on the different approaches. Such a systemic review of the molecular genetics of myopia will provide clues to identify additional plausible genes for myopia and help us to understand the molecular mechanisms underlying myopia. This paper reviews recent genetic studies on myopia, summarizes all possible reported genes and loci related to myopia, and suggests implications for future studies on the molecular genetics of myopia.
Identifying the Viral Genes Encoding Envelope Glycoproteins for Differentiation of Cyprinid herpesvirus 3 Isolates

PubMed Central

Han, Jee Eun; Kim, Ji Hyung; Renault, Tristan; Choresca, Casiano; Shin, Sang Phil; Jun, Jin Woo; Park, Se Chang

2013-01-01

Cyprinid herpes virus 3 (CyHV-3) diseases have been reported around the world and are associated with high mortalities of koi (Cyprinus carpio). Although little work has been conducted on the molecular analysis of this virus, glycoprotein genes identified in the present study seem to be valuable targets for genetic comparison of this virus. Three envelope glycoprotein genes (ORF25, 65 and 116) of the CyHV-3 isolates from the USA, Israel, Japan and Korea were compared, and interestingly, sequence insertions or deletions were observed in these target regions. In addition, polymorphisms were presented in microsatellite zones from two glycoprotein genes (ORF65 and 116). In phylogenetic tree analysis, the Korean isolate was remarkably distinguished from USA, Israel, Japan isolates. These findings may be suitable for many applications including isolates differentiation and phylogeny studies. PMID:23435236
Frequent amplification of receptor tyrosine kinase genes in welldifferentiated/ dedifferentiated liposarcoma.

PubMed

Asano, Naofumi; Yoshida, Akihiko; Mitani, Sachiyo; Kobayashi, Eisuke; Shiotani, Bunsyo; Komiyama, Motokiyo; Fujimoto, Hiroyuki; Chuman, Hirokazu; Morioka, Hideo; Matsumoto, Morio; Nakamura, Masaya; Kubo, Takashi; Kato, Mamoru; Kohno, Takashi; Kawai, Akira; Kondo, Tadashi; Ichikawa, Hitoshi

2017-02-21

Well-differentiated liposarcoma (WDLPS) and dedifferentiated liposarcoma (DDLPS) are closely related tumors commonly characterized by MDM2/CDK4 gene amplification, and lack clinically effective treatment options when inoperable. To identify novel therapeutic targets, we performed targeted genomic sequencing analysis of 19 WDLPS and 37 DDLPS tumor samples using a panel of 104 cancer-related genes (NCC oncopanel v3) developed specifically for genomic testing to select suitable molecular targeted therapies. The results of this analysis indicated that these sarcomas had very few gene mutations and a high frequency of amplifications of not only MDM2 and CDK4 but also other genes. Potential driver mutations were found in only six (11%) samples; however, gene amplification events (other than MDM2 and CDK4 amplification) were identified in 30 (54%) samples. Receptor tyrosine kinase (RTK) genes in particular were amplified in 18 (32%) samples. In addition, growth of a WDLPS cell line with IGF1R amplification was suppressed by simultaneous inhibition of CDK4 and IGF1R, using palbociclib and NVP-AEW541, respectively. Combination therapy with CDK4 and RTK inhibitors may be an effective therapeutic option for WDLPS/DDLPS patients with RTK gene amplification.
The first Taxus rhizosphere microbiome revealed by shotgun metagenomic sequencing.

PubMed

Hao, Da-Cheng; Zhang, Cai-Rong; Xiao, Pei-Gen

2018-06-01

In the present study, the shotgun high throughput metagenomic sequencing was implemented to globally capture the features of Taxus rhizosphere microbiome. Total reads could be assigned to 6925 species belonging to 113 bacteria phyla and 301 species of nine fungi phyla. For archaea and virus, 263 and 134 species were for the first time identified, respectively. More than 720,000 Unigenes were identified by clean reads assembly. The top five assigned phyla were Actinobacteria (363,941 Unigenes), Proteobacteria (182,053), Acidobacteria (44,527), Ascomycota (fungi; 18,267), and Chloroflexi (15,539). KEGG analysis predicted numerous functional genes; 7101 Unigenes belong to "Xenobiotics biodegradation and metabolism." A total of 12,040 Unigenes involved in defense mechanisms (e.g., xenobiotic metabolism) were annotated by eggNOG. Talaromyces addition could influence not only the diversity and structure of microbial communities of Taxus rhizosphere, but also the relative abundance of functional genes, including metabolic genes, antibiotic resistant genes, and genes involved in pathogen-host interaction, bacterial virulence, and bacterial secretion system. The structure and function of rhizosphere microbiome could be sensitive to non-native microbe addition, which could impact on the pollutant degradation. This study, complementary to the amplicon sequencing, more objectively reflects the native microbiome of Taxus rhizosphere and its response to environmental pressure, and lays a foundation for potential combination of phytoremediation and bioaugmentation. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Dealing with the incidental finding of secondary variants by the example of SRNS patients undergoing targeted next-generation sequencing.

PubMed

Weber, Stefanie; Büscher, Anja K; Hagmann, Henning; Liebau, Max C; Heberle, Christian; Ludwig, Michael; Rath, Sabine; Alberer, Martin; Beissert, Antje; Zenker, Martin; Hoyer, Peter F; Konrad, Martin; Klein, Hanns-Georg; Hoefele, Julia

2016-01-01

Steroid-resistant nephrotic syndrome (SRNS) is a severe cause of progressive renal disease. Genetic forms of SRNS can present with autosomal recessive or autosomal dominant inheritance. Recent studies have identified mutations in multiple podocyte genes responsible for SRNS. Improved sequencing methods (next-generation sequencing, NGS) now promise rapid mutational testing of SRNS genes. In the present study, a simultaneous screening of ten SRNS genes in 37 SRNS patients was performed by NGS. In 38 % of the patients, causative mutations in one SRNS gene were found. In 22 % of the patients, in addition to these mutations, a secondary variant in a different gene was identified. This high incidence of accumulating sequence variants was unexpected but, although they might have modifier effects, the pathogenic potential of these additional sequence variants seems unclear so far. The example of molecular diagnostics by NGS in SRNS patients shows that these new sequencing technologies might provide further insight into molecular pathogenicity in genetic disorders but will also generate results, which will be difficult to interpret and complicate genetic counseling. Although NGS promises more frequent identification of disease-causing mutations, the identification of causative mutations, the interpretation of incidental findings and possible pitfalls might pose problems, which hopefully will decrease by further experience and elucidation of molecular interactions.
Chromosomal DNA Deletions Explain Phenotypic Characteristics of Two Antigenic Variants, Phase II and RSA 514 (Crazy), of the Coxiella burnetii Nine Mile Strain†

PubMed Central

Hoover, T. A.; Culp, D. W.; Vodkin, M. H.; Williams, J. C.; Thompson, H. A.

2002-01-01

After repeated passages through embyronated eggs, the Nine Mile strain of Coxiella burnetii exhibits antigenic variation, a loss of virulence characteristics, and transition to a truncated lipopolysaccharide (LPS) structure. In two independently derived strains, Nine Mile phase II and RSA 514, these phenotypic changes were accompanied by a large chromosomal deletion (M. H. Vodkin and J. C. Williams, J. Gen. Microbiol. 132:2587-2594, 1986). In the work reported here, additional screening of a cosmid bank prepared from the wild-type strain was used to map the deletion termini of both mutant strains and to accumulate all the segments of DNA that comprise the two deletions. The corresponding DNAs were then sequenced and annotated. The Nine Mile phase II deletion was completely nested within the deletion of the RSA 514 strain. Basic alignment and homology studies indicated that a large group of LPS biosynthetic genes, arranged in an apparent O-antigen cluster, was deleted in both variants. Database homologies identified, in particular, mannose pathway genes and genes encoding sugar methylases and nucleotide sugar epimerase-dehydratase proteins. Candidate genes for addition of sugar units to the core oligosaccharide for synthesis of the rare sugar 6-deoxy-3-C-methylgulose (virenose) were identified in the deleted region. Repeats, redundancies, paralogous genes, and two regions with reduced G+C contents were found within the deletions. PMID:12438347
Reference genes for measuring mRNA expression.

PubMed

Dundas, Jitesh; Ling, Maurice

2012-12-01

The aim of this review is to find answers to some of the questions surrounding reference genes and their reliability for quantitative experiments. Reference genes are assumed to be at a constant expression level, over a range of conditions such as temperature. These genes, such as GADPH and beta-actin, are used extensively for gene expression studies using techniques like quantitative PCR. There have been several studies carried out on identifying reference genes. However, a lot of evidence indicates issues to the general suitability of these genes. Recent studies had shown that different factors, including the environment and methods, play an important role in changing the expression levels of the reference genes. Thus, we conclude that there is no reference gene that can deemed suitable for all the experimental conditions. In addition, we believe that every experiment will require the scientific evaluation and selection of the best candidate gene for use as a reference gene to obtain reliable scientific results.
Harnessing pain heterogeneity and RNA transcriptome to identify blood-based pain biomarkers: a novel correlational study design and bioinformatics approach in a graded chronic constriction injury model.

PubMed

Grace, Peter M; Hurley, Daniel; Barratt, Daniel T; Tsykin, Anna; Watkins, Linda R; Rolan, Paul E; Hutchinson, Mark R

2012-09-01

A quantitative, peripherally accessible biomarker for neuropathic pain has great potential to improve clinical outcomes. Based on the premise that peripheral and central immunity contribute to neuropathic pain mechanisms, we hypothesized that biomarkers could be identified from the whole blood of adult male rats, by integrating graded chronic constriction injury (CCI), ipsilateral lumbar dorsal quadrant (iLDQ) and whole blood transcriptomes, and pathway analysis with pain behavior. Correlational bioinformatics identified a range of putative biomarker genes for allodynia intensity, many encoding for proteins with a recognized role in immune/nociceptive mechanisms. A selection of these genes was validated in a separate replication study. Pathway analysis of the iLDQ transcriptome identified Fcγ and Fcε signaling pathways, among others. This study is the first to employ the whole blood transcriptome to identify pain biomarker panels. The novel correlational bioinformatics, developed here, selected such putative biomarkers based on a correlation with pain behavior and formation of signaling pathways with iLDQ genes. Future studies may demonstrate the predictive ability of these biomarker genes across other models and additional variables. © 2012 The Authors. Journal of Neurochemistry © 2012 International Society for Neurochemistry.
Differences in X-chromosome transcriptional activity and cholesterol metabolism between placentae from swine breeds from Asian and Western origins.

PubMed

Bischoff, Steve R; Tsai, Shengdar Q; Hardison, Nicholas E; Motsinger-Reif, Alison A; Freking, Bradley A; Nonneman, Dan J; Rohrer, Gary A; Piedrahita, Jorge A

2013-01-01

To gain insight into differences in placental physiology between two swine breeds noted for their dissimilar reproductive performance, that is, the Chinese Meishan and white composite (WC), we examined gene expression profiles of placental tissues collected at 25, 45, 65, 85, and 105 days of gestation by microarrays. Using a linear mixed model, a total of 1,595 differentially expressed genes were identified between the two pig breeds using a false-discovery rate q-value ≤0.05. Among these genes, we identified breed-specific isoforms of XIST, a long non-coding RNA responsible X-chromosome dosage compensation in females. Additionally, we explored the interaction of placental gene expression and chromosomal location by DIGMAP and identified three Sus scrofa X chromosomal bands (Xq13, Xq21, Xp11) that represent transcriptionally active clusters that differ between Meishan and WC during placental development. Also, pathway analysis identified fundamental breed differences in placental cholesterol trafficking and its synthesis. Direct measurement of cholesterol confirmed that the cholesterol content was significantly higher in the Meishan versus WC placentae. Taken together, this work identifies key metabolic pathways that differ in the placentae of two swine breeds noted for differences in reproductive prolificacy.

Identification of two integration sites in favor of transgene expression in Trichoderma reesei.

PubMed

Qin, Lina; Jiang, Xianzhang; Dong, Zhiyang; Huang, Jianzhong; Chen, Xiuzhen

2018-01-01

The ascomycete fungus Trichoderma reesei was widely used as a biotechnological workhorse for production of cellulases and recombinant proteins due to its large capacity of protein secretion. Transgenesis by random integration of a gene of interest (GOI) into the genome of T. reesei can generate series of strains that express different levels of the indicated transgene. The insertion site of the GOI plays an important role in the ultimate production of the targeted proteins. However, so far no systematic studies have been made to identify transgene integration loci for optimal expression of the GOI in T. reesei . Currently, only the locus of exocellobiohydrolases I encoding gene ( cbh1) is widely used as a promising integration site to lead to high expression level of the GOI. No additional sites associated with efficient gene expression have been characterized. To search for gene integration sites that benefit for the secreted expression of GOI, the food-and-mouth disease virus 2A protein was applied for co-expression of an Aspergillus niger lipA gene and Discosoma sp. DsRed1 gene in T. reesei, by random integration of the expression cassette into the genome. We demonstrated that the fluorescent intensity of RFP (red fluorescent protein) inside of the cell was well correlated with the secreted lipase yields, based on which, we successfully developed a high-throughput screening method to screen strains with relatively higher secreted expression of the GOI (in this study, lipase). The copy number and the insertion sites of the transgene were investigated among the selected highly expressed strains. Eventually, in addition to cbh1 gene locus, two other genome insertion loci that efficiently facilitate gene expression in T. reesei were identified. We have successfully developed a high-throughput screening method to screen strains with optimal expression of the indicated secreted proteins in T. reesei . Moreover, we identified two optimal genome loci for transgene expression, which could provide new approach to modulate gene expression levels while retaining the indicated promoter and culture conditions.
GABA metabolism pathway genes, UGA1 and GAD1, regulate replicative lifespan in Saccharomycescerevisiae

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kamei, Yuka; Tamura, Takayuki; Yoshida, Ryo

2011-04-01

Highlights: {yields}We demonstrate that two genes in the yeast GABA metabolism pathway affect aging. {yields} Deletion of the UGA1 or GAD1 genes extends replicative lifespan. {yields} Addition of GABA to wild-type cultures has no effect on lifespan. {yields} Intracellular GABA levels do not differ in longevity mutants and wild-type cells. {yields} Levels of tricarboxylic acid cycle intermediates positively correlate with lifespan. -- Abstract: Many of the genes involved in aging have been identified in organisms ranging from yeast to human. Our previous study showed that deletion of the UGA3 gene-which encodes a zinc-finger transcription factor necessary for {gamma}-aminobutyric acid (GABA)-dependentmore » induction of the UGA1 (GABA aminotransferase), UGA2 (succinate semialdehyde dehydrogenase), and UGA4 (GABA permease) genes-extends replicative lifespan in the budding yeast Saccharomycescerevisiae. Here, we found that deletion of UGA1 lengthened the lifespan, as did deletion of UGA3; in contrast, strains with UGA2 or UGA4 deletions exhibited no lifespan extension. The {Delta}uga1 strain cannot deaminate GABA to succinate semialdehyde. Deletion of GAD1, which encodes the glutamate decarboxylase that converts glutamate into GABA, also increased lifespan. Therefore, two genes in the GABA metabolism pathway, UGA1 and GAD1, were identified as aging genes. Unexpectedly, intracellular GABA levels in mutant cells (except for {Delta}uga2 cells) did not differ from those in wild-type cells. Addition of GABA to culture media, which induces transcription of the UGA structural genes, had no effect on replicative lifespan of wild-type cells. Multivariate analysis of {sup 1}H nuclear magnetic resonance spectra for the whole-cell metabolite levels demonstrated a separation between long-lived and normal-lived strains. Gas chromatography-mass spectrometry analysis of identified metabolites showed that levels of tricarboxylic acid cycle intermediates positively correlated with lifespan extension. These results strongly suggest reduced activity of the GABA-metabolizing enzymes extends lifespan by shifting carbon metabolism toward respiration, as calorie restriction does.« less
Analysis of predicted loss-of-function variants in UK Biobank identifies variants protective for disease.

PubMed

Emdin, Connor A; Khera, Amit V; Chaffin, Mark; Klarin, Derek; Natarajan, Pradeep; Aragam, Krishna; Haas, Mary; Bick, Alexander; Zekavat, Seyedeh M; Nomura, Akihiro; Ardissino, Diego; Wilson, James G; Schunkert, Heribert; McPherson, Ruth; Watkins, Hugh; Elosua, Roberto; Bown, Matthew J; Samani, Nilesh J; Baber, Usman; Erdmann, Jeanette; Gupta, Namrata; Danesh, John; Chasman, Daniel; Ridker, Paul; Denny, Joshua; Bastarache, Lisa; Lichtman, Judith H; D'Onofrio, Gail; Mattera, Jennifer; Spertus, John A; Sheu, Wayne H-H; Taylor, Kent D; Psaty, Bruce M; Rich, Stephen S; Post, Wendy; Rotter, Jerome I; Chen, Yii-Der Ida; Krumholz, Harlan; Saleheen, Danish; Gabriel, Stacey; Kathiresan, Sekar

2018-04-24

Less than 3% of protein-coding genetic variants are predicted to result in loss of protein function through the introduction of a stop codon, frameshift, or the disruption of an essential splice site; however, such predicted loss-of-function (pLOF) variants provide insight into effector transcript and direction of biological effect. In >400,000 UK Biobank participants, we conduct association analyses of 3759 pLOF variants with six metabolic traits, six cardiometabolic diseases, and twelve additional diseases. We identified 18 new low-frequency or rare (allele frequency < 5%) pLOF variant-phenotype associations. pLOF variants in the gene GPR151 protect against obesity and type 2 diabetes, in the gene IL33 against asthma and allergic disease, and in the gene IFIH1 against hypothyroidism. In the gene PDE3B, pLOF variants associate with elevated height, improved body fat distribution and protection from coronary artery disease. Our findings prioritize genes for which pharmacologic mimics of pLOF variants may lower risk for disease.
The Carnegie Protein Trap Library: A Versatile Tool for Drosophila Developmental Studies

PubMed Central

Buszczak, Michael; Paterno, Shelley; Lighthouse, Daniel; Bachman, Julia; Planck, Jamie; Owen, Stephenie; Skora, Andrew D.; Nystul, Todd G.; Ohlstein, Benjamin; Allen, Anna; Wilhelm, James E.; Murphy, Terence D.; Levis, Robert W.; Matunis, Erika; Srivali, Nahathai; Hoskins, Roger A.; Spradling, Allan C.

2007-01-01

Metazoan physiology depends on intricate patterns of gene expression that remain poorly known. Using transposon mutagenesis in Drosophila, we constructed a library of 7404 protein trap and enhancer trap lines, the Carnegie collection, to facilitate gene expression mapping at single-cell resolution. By sequencing the genomic insertion sites, determining splicing patterns downstream of the enhanced green fluorescent protein (EGFP) exon, and analyzing expression patterns in the ovary and salivary gland, we found that 600–900 different genes are trapped in our collection. A core set of 244 lines trapped different identifiable protein isoforms, while insertions likely to act as GFP-enhancer traps were found in 256 additional genes. At least 8 novel genes were also identified. Our results demonstrate that the Carnegie collection will be useful as a discovery tool in diverse areas of cell and developmental biology and suggest new strategies for greatly increasing the coverage of the Drosophila proteome with protein trap insertions. PMID:17194782
Comparative Transcriptome Analyses Reveal Core Parasitism Genes and Suggest Gene Duplication and Repurposing as Sources of Structural Novelty

PubMed Central

Yang, Zhenzhen; Wafula, Eric K.; Honaas, Loren A.; Zhang, Huiting; Das, Malay; Fernandez-Aparicio, Monica; Huang, Kan; Bandaranayake, Pradeepa C.G.; Wu, Biao; Der, Joshua P.; Clarke, Christopher R.; Ralph, Paula E.; Landherr, Lena; Altman, Naomi S.; Timko, Michael P.; Yoder, John I.; Westwood, James H.; dePamphilis, Claude W.

2015-01-01

The origin of novel traits is recognized as an important process underlying many major evolutionary radiations. We studied the genetic basis for the evolution of haustoria, the novel feeding organs of parasitic flowering plants, using comparative transcriptome sequencing in three species of Orobanchaceae. Around 180 genes are upregulated during haustorial development following host attachment in at least two species, and these are enriched in proteases, cell wall modifying enzymes, and extracellular secretion proteins. Additionally, about 100 shared genes are upregulated in response to haustorium inducing factors prior to host attachment. Collectively, we refer to these newly identified genes as putative “parasitism genes.” Most of these parasitism genes are derived from gene duplications in a common ancestor of Orobanchaceae and Mimulus guttatus, a related nonparasitic plant. Additionally, the signature of relaxed purifying selection and/or adaptive evolution at specific sites was detected in many haustorial genes, and may play an important role in parasite evolution. Comparative analysis of gene expression patterns in parasitic and nonparasitic angiosperms suggests that parasitism genes are derived primarily from root and floral tissues, but with some genes co-opted from other tissues. Gene duplication, often taking place in a nonparasitic ancestor of Orobanchaceae, followed by regulatory neofunctionalization, was an important process in the origin of parasitic haustoria. PMID:25534030
Establishing the role of rare coding variants in known Parkinson's disease risk loci.

PubMed

Jansen, Iris E; Gibbs, J Raphael; Nalls, Mike A; Price, T Ryan; Lubbe, Steven; van Rooij, Jeroen; Uitterlinden, André G; Kraaij, Robert; Williams, Nigel M; Brice, Alexis; Hardy, John; Wood, Nicholas W; Morris, Huw R; Gasser, Thomas; Singleton, Andrew B; Heutink, Peter; Sharma, Manu

2017-11-01

Many common genetic factors have been identified to contribute to Parkinson's disease (PD) susceptibility, improving our understanding of the related underlying biological mechanisms. The involvement of rarer variants in these loci has been poorly studied. Using International Parkinson's Disease Genomics Consortium data sets, we performed a comprehensive study to determine the impact of rare variants in 23 previously published genome-wide association studies (GWAS) loci in PD. We applied Prix fixe to select the putative causal genes underneath the GWAS peaks, which was based on underlying functional similarities. The Sequence Kernel Association Test was used to analyze the joint effect of rare, common, or both types of variants on PD susceptibility. All genes were tested simultaneously as a gene set and each gene individually. We observed a moderate association of common variants, confirming the involvement of the known PD risk loci within our genetic data sets. Focusing on rare variants, we identified additional association signals for LRRK2, STBD1, and SPATA19. Our study suggests an involvement of rare variants within several putatively causal genes underneath previously identified PD GWAS peaks. Copyright © 2017 Elsevier Inc. All rights reserved.
Additional targets of the Arabidopsis autonomous pathway members, FCA and FY.

PubMed

Marquardt, S; Boss, P K; Hadfield, J; Dean, C

2006-01-01

A central player in the Arabidopsis floral transition is the floral repressor FLC, the MADS-box transcriptional regulator that inhibits the activity of genes required to switch the meristem from vegetative to floral development. One of the many pathways that regulate FLC expression is the autonomous promotion pathway composed of FCA, FY, FLD, FPA, FVE, LD, and FLK. Rather than a hierarchical set of activities the autonomous promotion pathway comprises sub-pathways of genes with different biochemical functions that all share FLC as a target. One sub-pathway involves FCA and FY, which interact to regulate RNA processing of FLC. Several of the identified components (FY, FVE, and FLD) are homologous to yeast and mammalian proteins with rather generic roles in gene regulation. So why do mutations in these genes specifically show a late-flowering phenotype in Arabidopsis? One reason, found during the analysis of fy alleles, is that the mutant alleles identified in flowering screens can be hypomorphic, they still have partial function. A broader role for the autonomous promotion pathway is supported by a microarray analysis which has identified genes mis-regulated in fca mutants, and whose expression is also altered in fy mutants.
Transcriptome analysis of Spodoptera frugiperda Sf9 cells reveals putative apoptosis-related genes and a preliminary apoptosis mechanism induced by azadirachtin.

PubMed

Shu, Benshui; Zhang, Jingjing; Sethuraman, Veeran; Cui, Gaofeng; Yi, Xin; Zhong, Guohua

2017-10-16

As an important botanical pesticide, azadirachtin demonstrates broad insecticidal activity against many agricultural pests. The results of a previous study indicated the toxicity and apoptosis induction of azadirachtin in Spodoptera frugiperda Sf9 cells. However, the lack of genomic data has hindered a deeper investigation of apoptosis in Sf9 cells at a molecular level. In the present study, the complete transcriptome data for Sf9 cell line was accomplished using Illumina sequencing technology, and 97 putative apoptosis-related genes were identified through BLAST and KEGG orthologue annotations. Fragments of potential candidate apoptosis-related genes were cloned, and the mRNA expression patterns of ten identified genes regulated by azadirachtin were examined using qRT-PCR. Furthermore, Western blot analysis showed that six putative apoptosis-related proteins were upregulated after being treated with azadirachtin while the protein Bcl-2 were downregulated. These data suggested that both intrinsic and extrinsic apoptotic signal pathways comprising the identified potential apoptosis-related genes were potentially active in S. frugiperda. In addition, the preliminary results revealed that caspase-dependent or caspase-independent apoptotic pathways could function in azadirachtin-induced apoptosis in Sf9 cells.
Multi-breed and multi-trait co-association analysis of meat tenderness and other meat quality traits in three French beef cattle breeds.

PubMed

Ramayo-Caldas, Yuliaxis; Renand, Gilles; Ballester, Maria; Saintilan, Romain; Rocha, Dominique

2016-04-23

Studies to identify markers associated with beef tenderness have focused on Warner-Bratzler shear force (WBSF) but the interplay between the genes associated with WBSF has not been explored. We used the association weight matrix (AWM), a systems biology approach, to identify a set of interacting genes that are co-associated with tenderness and other meat quality traits, and shared across the Charolaise, Limousine and Blonde d'Aquitaine beef cattle breeds. Genome-wide association studies were performed using ~500K single nucleotide polymorphisms (SNPs) and 17 phenotypes measured on more than 1000 animals for each breed. First, this multi-trait approach was applied separately for each breed across 17 phenotypes and second, between- and across-breed comparisons at the AWM and functional levels were performed. Genetic heterogeneity was observed, and most of the variants that were associated with WBSF segregated within rather than across breeds. We identified 206 common candidate genes associated with WBSF across the three breeds. SNPs in these common genes explained between 28 and 30 % of the phenotypic variance for WBSF. A reduced number of common SNPs mapping to the 206 common genes were identified, suggesting that different mutations may target the same genes in a breed-specific manner. Therefore, it is likely that, depending on allele frequencies and linkage disequilibrium patterns, a SNP that is identified for one breed may not be informative for another unrelated breed. Well-known candidate genes affecting beef tenderness were identified. In addition, some of the 206 common genes are located within previously reported quantitative trait loci for WBSF in several cattle breeds. Moreover, the multi-breed co-association analysis detected new candidate genes, regulators and metabolic pathways that are likely involved in the determination of meat tenderness and other meat quality traits in beef cattle. Our results suggest that systems biology approaches that explore associations of correlated traits increase statistical power to identify candidate genes beyond the one-dimensional approach. Further studies on the 206 common genes, their pathways, regulators and interactions will expand our knowledge on the molecular basis of meat tenderness and could lead to the discovery of functional mutations useful for genomic selection in a multi-breed beef cattle context.
Leveraging Comparative Genomics to Identify and Functionally Characterize Genes Associated with Sperm Phenotypes in Python bivittatus (Burmese Python)

PubMed Central

Rutllant, Josep

2016-01-01

Comparative genomics approaches provide a means of leveraging functional genomics information from a highly annotated model organism's genome (such as the mouse genome) in order to make physiological inferences about the role of genes and proteins in a less characterized organism's genome (such as the Burmese python). We employed a comparative genomics approach to produce the functional annotation of Python bivittatus genes encoding proteins associated with sperm phenotypes. We identify 129 gene-phenotype relationships in the python which are implicated in 10 specific sperm phenotypes. Results obtained through our systematic analysis identified subsets of python genes exhibiting associations with gene ontology annotation terms. Functional annotation data was represented in a semantic scatter plot. Together, these newly annotated Python bivittatus genome resources provide a high resolution framework from which the biology relating to reptile spermatogenesis, fertility, and reproduction can be further investigated. Applications of our research include (1) production of genetic diagnostics for assessing fertility in domestic and wild reptiles; (2) enhanced assisted reproduction technology for endangered and captive reptiles; and (3) novel molecular targets for biotechnology-based approaches aimed at reducing fertility and reproduction of invasive reptiles. Additional enhancements to reptile genomic resources will further enhance their value. PMID:27200191
A genetic screen for modifiers of UFO meristem activity identifies three novel FUSED FLORAL ORGANS genes required for early flower development in Arabidopsis.

PubMed

Levin, J Z; Fletcher, J C; Chen, X; Meyerowitz, E M

1998-06-01

In a screen to identify novel genes required for early Arabidopsis flower development, we isolated four independent mutations that enhance the Ufo phenotype toward the production of filamentous structures in place of flowers. The mutants fall into three complementation groups, which we have termed FUSED FLORAL ORGANS (FFO) loci. ffo mutants have specific defects in floral organ separation and/or positioning; thus, the FFO genes identify components of a boundary formation mechanism(s) acting between developing floral organ primordia. FFO1 and FFO3 have specific functions in cauline leaf/stem separation and in first- and third-whorl floral organ separation, with FFO3 likely acting to establish and FFO1 to maintain floral organ boundaries. FFO2 acts at early floral stages to regulate floral organ number and positioning and to control organ separation within and between whorls. Plants doubly mutant for two ffo alleles display additive phenotypes, indicating that the FFO genes may act in separate pathways. Plants doubly mutant for an ffo gene and for ufo, lfy, or clv3 reveal that the FFO genes play roles related to those of UFO and LFY in floral meristem initiation and that FFO2 and FFO3 may act to control cell proliferation late in inflorescence development.
Whole Genome Shotgun Sequencing Shows Selection on Leptospira Regulatory Proteins during in vitro Culture Attenuation

PubMed Central

Lehmann, Jason S.; Corey, Victoria C.; Ricaldi, Jessica N.; Vinetz, Joseph M.; Winzeler, Elizabeth A.; Matthias, Michael A.

2016-01-01

Leptospirosis is the most common zoonotic disease worldwide with an estimated 500,000 severe cases reported annually, and case fatality rates of 12–25%, due primarily to acute kidney and lung injuries. Despite its prevalence, the molecular mechanisms underlying leptospirosis pathogenesis remain poorly understood. To identify virulence-related genes in Leptospira interrogans, we delineated cumulative genome changes that occurred during serial in vitro passage of a highly virulent strain of L. interrogans serovar Lai into a nearly avirulent isogenic derivative. Comparison of protein coding and computationally predicted noncoding RNA (ncRNA) genes between these two polyclonal strains identified 15 nonsynonymous single nucleotide variant (nsSNV) alleles that increased in frequency and 19 that decreased, whereas no changes in allelic frequency were observed among the ncRNA genes. Some of the nsSNV alleles were in six genes shown previously to be transcriptionally upregulated during exposure to in vivo-like conditions. Five of these nsSNVs were in evolutionarily conserved positions in genes related to signal transduction and metabolism. Frequency changes of minor nsSNV alleles identified in this study likely contributed to the loss of virulence during serial in vitro culture. The identification of new virulence-associated genes should spur additional experimental inquiry into their potential role in Leptospira pathogenesis. PMID:26711524
Leveraging Comparative Genomics to Identify and Functionally Characterize Genes Associated with Sperm Phenotypes in Python bivittatus (Burmese Python).

PubMed

Irizarry, Kristopher J L; Rutllant, Josep

2016-01-01

Comparative genomics approaches provide a means of leveraging functional genomics information from a highly annotated model organism's genome (such as the mouse genome) in order to make physiological inferences about the role of genes and proteins in a less characterized organism's genome (such as the Burmese python). We employed a comparative genomics approach to produce the functional annotation of Python bivittatus genes encoding proteins associated with sperm phenotypes. We identify 129 gene-phenotype relationships in the python which are implicated in 10 specific sperm phenotypes. Results obtained through our systematic analysis identified subsets of python genes exhibiting associations with gene ontology annotation terms. Functional annotation data was represented in a semantic scatter plot. Together, these newly annotated Python bivittatus genome resources provide a high resolution framework from which the biology relating to reptile spermatogenesis, fertility, and reproduction can be further investigated. Applications of our research include (1) production of genetic diagnostics for assessing fertility in domestic and wild reptiles; (2) enhanced assisted reproduction technology for endangered and captive reptiles; and (3) novel molecular targets for biotechnology-based approaches aimed at reducing fertility and reproduction of invasive reptiles. Additional enhancements to reptile genomic resources will further enhance their value.
Post-genome research on the biosynthesis of ergot alkaloids.

PubMed

Li, Shu-Ming; Unsöld, Inge A

2006-10-01

Genome sequencing provides new opportunities and challenges for identifying genes for the biosynthesis of secondary metabolites. A putative biosynthetic gene cluster of fumigaclavine C, an ergot alkaloid of the clavine type, was identified in the genome sequence of ASPERGILLUS FUMIGATUS by a bioinformatic approach. This cluster spans 22 kb of genomic DNA and comprises at least 11 open reading frames (ORFs). Seven of them are orthologous to genes from the biosynthetic gene cluster of ergot alkaloids in CLAVICEPS PURPUREA. Experimental evidence of the identified cluster was provided by heterologous expression and biochemical characterization of two ORFs, FgaPT1 and FgaPT2, in the cluster of A. FUMIGATUS, which show remarkable similarities to dimethylallyltryptophan synthase from C. PURPUREA and function as prenyltransferases. FgaPT2 converts L-tryptophan to dimethylallyltryptophan and thereby catalyzes the first step of ergot alkaloid biosynthesis, whilst FgaPT1 catalyzes the last step of the fumigaclavine C biosynthesis, i. e., the prenylation of fumigaclavine A at C-2 position of the indole nucleus. In addition to information obtained from the gene cluster of ergot alkaloids from C. PURPUREA, the identification of the biosynthetic gene cluster of fumigaclavine C in A. FUMIGATUS opens an alternative way to study the biosynthesis of ergot alkaloids in fungi.
Identification of AUXIN RESPONSE FACTOR gene family from Prunus sibirica and its expression analysis during mesocarp and kernel development.

PubMed

Niu, Jun; Bi, Quanxin; Deng, Shuya; Chen, Huiping; Yu, Haiyan; Wang, Libing; Lin, Shanzhi

2018-01-24

Auxin response factors (ARFs) in auxin signaling pathway are an important component that can regulate the transcription of auxin-responsive genes involved in almost all aspects of plant growth and development. To our knowledge, the comprehensive and systematic characterization of ARF genes has never been reported in Prunus sibirica, a novel woody biodiesel feedstock in China. In this study, we identified 14 PsARF genes with a perfect open reading frame (ORF) in P. sibirica by using its previous transcriptomic data. Conserved motif analysis showed that all identified PsARF proteins had typical DNA-binding and ARF domain, but 5 members (PsARF3, 8 10, 16 and 17) lacked the dimerization domain. Phylogenetic analysis of the ARF proteins generated from various plant species indicated that ARFs could be categorized into 4 major groups (Class I, II, III and IV), in which all identified ARFs from P. sibirica showed a closest relationship with those from P. mume. Comparison of the expression profiles of 14 PsARF genes in different developmental stages of Siberian apricot mesocarp (SAM) and kernel (SAK) reflected distinct temporal or spatial expression patterns for PsARF genes. Additionally, based on the expressed data from fruit and seed development of multiple plant species, we identified 1514 ARF-correlated genes using weighted gene co-expression network analysis (WGCNA). And the major portion of ARF-correlated gene was characterized to be involved in protein, nucleic acid and carbohydrate metabolic, transport and regulatory processes. In summary, we systematically and comprehensively analyzed the structure, expression pattern and co-expression network of ARF gene family in P. sibirica. All our findings provide theoretical foundation for the PsARF gene family and will pave the way for elucidating the precise role of PsARF genes in SAM and SAK development.
Genome-wide association study and accuracy of genomic prediction for teat number in Duroc pigs using genotyping-by-sequencing.

PubMed

Tan, Cheng; Wu, Zhenfang; Ren, Jiangli; Huang, Zhuolin; Liu, Dewu; He, Xiaoyan; Prakapenka, Dzianis; Zhang, Ran; Li, Ning; Da, Yang; Hu, Xiaoxiang

2017-03-29

The number of teats in pigs is related to a sow's ability to rear piglets to weaning age. Several studies have identified genes and genomic regions that affect teat number in swine but few common results were reported. The objective of this study was to identify genetic factors that affect teat number in pigs, evaluate the accuracy of genomic prediction, and evaluate the contribution of significant genes and genomic regions to genomic broad-sense heritability and prediction accuracy using 41,108 autosomal single nucleotide polymorphisms (SNPs) from genotyping-by-sequencing on 2936 Duroc boars. Narrow-sense heritability and dominance heritability of teat number estimated by genomic restricted maximum likelihood were 0.365 ± 0.030 and 0.035 ± 0.019, respectively. The accuracy of genomic predictions, calculated as the average correlation between the genomic best linear unbiased prediction and phenotype in a tenfold validation study, was 0.437 ± 0.064 for the model with additive and dominance effects and 0.435 ± 0.064 for the model with additive effects only. Genome-wide association studies (GWAS) using three methods of analysis identified 85 significant SNP effects for teat number on chromosomes 1, 6, 7, 10, 11, 12 and 14. The region between 102.9 and 106.0 Mb on chromosome 7, which was reported in several studies, had the most significant SNP effects in or near the PTGR2, FAM161B, LIN52, VRTN, FCF1, AREL1 and LRRC74A genes. This region accounted for 10.0% of the genomic additive heritability and 8.0% of the accuracy of prediction. The second most significant chromosome region not reported by previous GWAS was the region between 77.7 and 79.7 Mb on chromosome 11, where SNPs in the FGF14 gene had the most significant effect and accounted for 5.1% of the genomic additive heritability and 5.2% of the accuracy of prediction. The 85 significant SNPs accounted for 28.5 to 28.8% of the genomic additive heritability and 35.8 to 36.8% of the accuracy of prediction. The three methods used for the GWAS identified 85 significant SNPs with additive effects on teat number, including SNPs in a previously reported chromosomal region and SNPs in novel chromosomal regions. Most significant SNPs with larger estimated effects also had larger contributions to the total genomic heritability and accuracy of prediction than other SNPs.
Whole-exome sequencing reveals the spectrum of gene mutations and the clonal evolution patterns in paediatric acute myeloid leukaemia.

PubMed

Shiba, Norio; Yoshida, Kenichi; Shiraishi, Yuichi; Okuno, Yusuke; Yamato, Genki; Hara, Yusuke; Nagata, Yasunobu; Chiba, Kenichi; Tanaka, Hiroko; Terui, Kiminori; Kato, Motohiro; Park, Myoung-Ja; Ohki, Kentaro; Shimada, Akira; Takita, Junko; Tomizawa, Daisuke; Kudo, Kazuko; Arakawa, Hirokazu; Adachi, Souichi; Taga, Takashi; Tawa, Akio; Ito, Etsuro; Horibe, Keizo; Sanada, Masashi; Miyano, Satoru; Ogawa, Seishi; Hayashi, Yasuhide

2016-11-01

Acute myeloid leukaemia (AML) is a molecularly and clinically heterogeneous disease. Targeted sequencing efforts have identified several mutations with diagnostic and prognostic values in KIT, NPM1, CEBPA and FLT3 in both adult and paediatric AML. In addition, massively parallel sequencing enabled the discovery of recurrent mutations (i.e. IDH1/2 and DNMT3A) in adult AML. In this study, whole-exome sequencing (WES) of 22 paediatric AML patients revealed mutations in components of the cohesin complex (RAD21 and SMC3), BCORL1 and ASXL2 in addition to previously known gene mutations. We also revealed intratumoural heterogeneities in many patients, implicating multiple clonal evolution events in the development of AML. Furthermore, targeted deep sequencing in 182 paediatric AML patients identified three major categories of recurrently mutated genes: cohesion complex genes [STAG2, RAD21 and SMC3 in 17 patients (8·3%)], epigenetic regulators [ASXL1/ASXL2 in 17 patients (8·3%), BCOR/BCORL1 in 7 patients (3·4%)] and signalling molecules. We also performed WES in four patients with relapsed AML. Relapsed AML evolved from one of the subclones at the initial phase and was accompanied by many additional mutations, including common driver mutations that were absent or existed only with lower allele frequency in the diagnostic samples, indicating a multistep process causing leukaemia recurrence. © 2016 John Wiley & Sons Ltd.
A Systems Biology Approach To Identify the Combination Effects of Human Herpesvirus 8 Genes on NF-κB Activation▿

PubMed Central

Konrad, Andreas; Wies, Effi; Thurau, Mathias; Marquardt, Gaby; Naschberger, Elisabeth; Hentschel, Sonja; Jochmann, Ramona; Schulz, Thomas F.; Erfle, Holger; Brors, Benedikt; Lausen, Berthold; Neipel, Frank; Stürzl, Michael

2009-01-01

Human herpesvirus 8 (HHV-8) is the etiologic agent of Kaposi's sarcoma and primary effusion lymphoma. Activation of the cellular transcription factor nuclear factor-kappa B (NF-κB) is essential for latent persistence of HHV-8, survival of HHV-8-infected cells, and disease progression. We used reverse-transfected cell microarrays (RTCM) as an unbiased systems biology approach to systematically analyze the effects of HHV-8 genes on the NF-κB signaling pathway. All HHV-8 genes individually (n = 86) and, additionally, all K and latent genes in pairwise combinations (n = 231) were investigated. Statistical analyses of more than 14,000 transfections identified ORF75 as a novel and confirmed K13 as a known HHV-8 activator of NF-κB. K13 and ORF75 showed cooperative NF-κB activation. Small interfering RNA-mediated knockdown of ORF75 expression demonstrated that this gene contributes significantly to NF-κB activation in HHV-8-infected cells. Furthermore, our approach confirmed K10.5 as an NF-κB inhibitor and newly identified K1 as an inhibitor of both K13- and ORF75-mediated NF-κB activation. All results obtained with RTCM were confirmed with classical transfection experiments. Our work describes the first successful application of RTCM for the systematic analysis of pathofunctions of genes of an infectious agent. With this approach, ORF75 and K1 were identified as novel HHV-8 regulatory molecules on the NF-κB signal transduction pathway. The genes identified may be involved in fine-tuning of the balance between latency and lytic replication, since this depends critically on the state of NF-κB activity. PMID:19129458
Genome-Wide Association Study in African Americans with Acute Respiratory Distress Syndrome Identifies the Selectin P Ligand Gene as a Risk Factor.

PubMed

Bime, Christian; Pouladi, Nima; Sammani, Saad; Batai, Ken; Casanova, Nancy; Zhou, Tong; Kempf, Carrie L; Sun, Xiaoguang; Camp, Sara M; Wang, Ting; Kittles, Rick A; Lussier, Yves A; Jones, Tiffanie K; Reilly, John P; Meyer, Nuala J; Christie, Jason D; Karnes, Jason H; Gonzalez-Garay, Manuel; Christiani, David C; Yates, Charles R; Wurfel, Mark M; Meduri, Gianfranco U; Garcia, Joe G N

2018-06-01

Genetic factors are involved in acute respiratory distress syndrome (ARDS) susceptibility. Identification of novel candidate genes associated with increased risk and severity will improve our understanding of ARDS pathophysiology and enhance efforts to develop novel preventive and therapeutic approaches. To identify genetic susceptibility targets for ARDS. A genome-wide association study was performed on 232 African American patients with ARDS and 162 at-risk control subjects. The Identify Candidate Causal SNPs and Pathways platform was used to infer the association of known gene sets with the top prioritized intragenic SNPs. Preclinical validation of SELPLG (selectin P ligand gene) was performed using mouse models of LPS- and ventilator-induced lung injury. Exonic variation within SELPLG distinguishing patients with ARDS from sepsis control subjects was confirmed in an independent cohort. Pathway prioritization analysis identified a nonsynonymous coding SNP (rs2228315) within SELPLG, encoding P-selectin glycoprotein ligand 1, to be associated with increased susceptibility. In an independent cohort, two exonic SELPLG SNPs were significantly associated with ARDS susceptibility. Additional support for SELPLG as an ARDS candidate gene was derived from preclinical ARDS models where SELPLG gene expression in lung tissues was significantly increased in both ventilator-induced (twofold increase) and LPS-induced (5.7-fold increase) murine lung injury models compared with controls. Furthermore, Selplg -/- mice exhibited significantly reduced LPS-induced inflammatory lung injury compared with wild-type C57/B6 mice. Finally, an antibody that neutralizes P-selectin glycoprotein ligand 1 significantly attenuated LPS-induced lung inflammation. These findings identify SELPLG as a novel ARDS susceptibility gene among individuals of European and African descent.
Hereditary spastic paraplegias: identification of a novel SPG57 variant affecting TFG oligomerization and description of HSP subtypes in Sudan.

PubMed

Elsayed, Liena E O; Mohammed, Inaam N; Hamed, Ahlam A A; Elseed, Maha A; Johnson, Adam; Mairey, Mathilde; Mohamed, Hassab Elrasoul S A; Idris, Mohamed N; Salih, Mustafa A M; El-Sadig, Sarah M; Koko, Mahmoud E; Mohamed, Ashraf Y O; Raymond, Laure; Coutelier, Marie; Darios, Frédéric; Siddig, Rayan A; Ahmed, Ahmed K M A; Babai, Arwa M A; Malik, Hiba M O; Omer, Zulfa M B M; Mohamed, Eman O E; Eltahir, Hanan B; Magboul, Nasr Aldin A; Bushara, Elfatih E; Elnour, Abdelrahman; Rahim, Salah M Abdel; Alattaya, Abdelmoneim; Elbashir, Mustafa I; Ibrahim, Muntaser E; Durr, Alexandra; Audhya, Anjon; Brice, Alexis; Ahmed, Ammar E; Stevanin, Giovanni

2016-01-01

Hereditary spastic paraplegias (HSP) are the second most common type of motor neuron disease recognized worldwide. We investigated a total of 25 consanguineous families from Sudan. We used next-generation sequencing to screen 74 HSP-related genes in 23 families. Linkage analysis and candidate gene sequencing was performed in two other families. We established a genetic diagnosis in six families with autosomal recessive HSP (SPG11 in three families and TFG/SPG57, SACS and ALS2 in one family each). A heterozygous mutation in a gene involved in an autosomal dominant HSP (ATL1/SPG3A) was also identified in one additional family. Six out of seven identified variants were novel. The c.64C>T (p.(Arg22Trp)) TFG/SPG57 variant (PB1 domain) is the second identified that underlies HSP, and we demonstrated its impact on TFG oligomerization in vitro. Patients did not present with visual impairment as observed in a previously reported SPG57 family (c.316C>T (p.(Arg106Cys)) in coiled-coil domain), suggesting unique contributions of the PB1 and coiled-coil domains in TFG complex formation/function and a possible phenotype correlation to variant location. Some families manifested marked phenotypic variations implying the possibility of modifier factors complicated by high inbreeding. Finally, additional genetic heterogeneity is expected in HSP Sudanese families. The remaining families might unravel new genes or uncommon modes of inheritance.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.