uncovering candidate genes: Topics by Science.gov

Sample records for uncovering candidate genes

A Computational Network Biology Approach to Uncover Novel Genes Related to Alzheimer's Disease.

PubMed

Zanzoni, Andreas

2016-01-01

Recent advances in the fields of genetics and genomics have enabled the identification of numerous Alzheimer's disease (AD) candidate genes, although for many of them the role in AD pathophysiology has not been uncovered yet. Concomitantly, network biology studies have shown a strong link between protein network connectivity and disease. In this chapter I describe a computational approach that, by combining local and global network analysis strategies, allows the formulation of novel hypotheses on the molecular mechanisms involved in AD and prioritizes candidate genes for further functional studies.
A data-mining approach to rank candidate protein-binding partners-The case of biogenesis of lysosome-related organelles complex-1 (BLOC-1).

PubMed

Rodriguez-Fernandez, I A; Dell'Angelica, E C

2009-04-01

The study of protein-protein interactions is a powerful approach to uncovering the molecular function of gene products associated with human disease. Protein-protein interaction data are accumulating at an unprecedented pace owing to interactomics projects, although it has been recognized that a significant fraction of these data likely represents false positives. During our studies of biogenesis of lysosome-related organelles complex-1 (BLOC-1), a protein complex involved in protein trafficking and containing the products of genes mutated in Hermansky-Pudlak syndrome, we faced the problem of having too many candidate binding partners to pursue experimentally. In this work, we have explored ways of efficiently gathering high-quality information about candidate binding partners and presenting the information in a visually friendly manner. We applied the approach to rank 70 candidate binding partners of human BLOC-1 and 102 candidates of its counterpart from Drosophila melanogaster. The top candidate for human BLOC-1 was the small GTPase encoded by the RAB11A gene, which is a paralogue of the Rab38 and Rab32 proteins in mammals and the lightoid gene product in flies. Interestingly, genetic analyses in D. melanogaster uncovered a synthetic sick/lethal interaction between Rab11 and lightoid. The data-mining approach described herein can be customized to study candidate binding partners for other proteins or possibly candidates derived from other types of 'omics' data.
MMTV insertional mutagenesis identifies genes, gene families and pathways involved in mammary cancer.

PubMed

Theodorou, Vassiliki; Kimm, Melanie A; Boer, Mandy; Wessels, Lodewyk; Theelen, Wendy; Jonkers, Jos; Hilkens, John

2007-06-01

We performed a high-throughput retroviral insertional mutagenesis screen in mouse mammary tumor virus (MMTV)-induced mammary tumors and identified 33 common insertion sites, of which 17 genes were previously not known to be associated with mammary cancer and 13 had not previously been linked to cancer in general. Although members of the Wnt and fibroblast growth factors (Fgf) families were frequently tagged, our exhaustive screening for MMTV insertion sites uncovered a new repertoire of candidate breast cancer oncogenes. We validated one of these genes, Rspo3, as an oncogene by overexpression in a p53-deficient mammary epithelial cell line. The human orthologs of the candidate oncogenes were frequently deregulated in human breast cancers and associated with several tumor parameters. Computational analysis of all MMTV-tagged genes uncovered specific gene families not previously associated with cancer and showed a significant overrepresentation of protein domains and signaling pathways mainly associated with development and growth factor signaling. Comparison of all tagged genes in MMTV and Moloney murine leukemia virus-induced malignancies showed that both viruses target mostly different genes that act predominantly in distinct pathways.
Candidate Genes for Inherited Autism Susceptibility in the Lebanese Population.

PubMed

Kourtian, Silva; Soueid, Jihane; Makhoul, Nadine J; Guisso, Dikran Richard; Chahrour, Maria; Boustany, Rose-Mary N

2017-03-30

Autism spectrum disorder (ASD) is characterized by ritualistic-repetitive behaviors and impaired verbal/non-verbal communication. Many ASD susceptibility genes implicated in neuronal pathways/brain development have been identified. The Lebanese population is ideal for uncovering recessive genes because of shared ancestry and a high rate of consanguineous marriages. Aims here are to analyze for published ASD genes and uncover novel inherited ASD susceptibility genes specific to the Lebanese. We recruited 36 ASD families (ASD: 37, unaffected parents: 36, unaffected siblings: 33) and 100 unaffected Lebanese controls. Cytogenetics 2.7 M Microarrays/CytoScan™ HD arrays allowed mapping of homozygous regions of the genome. The CNTNAP2 gene was screened by Sanger sequencing. Homozygosity mapping uncovered DPP4, TRHR, and MLF1 as novel candidate susceptibility genes for ASD in the Lebanese. Sequencing of hot spot exons in CNTNAP2 led to discovery of a 5 bp insertion in 23/37 ASD patients. This mutation was present in unaffected family members and unaffected Lebanese controls. Although a slight increase in number was observed in ASD patients and family members compared to controls, there were no significant differences in allele frequencies between affecteds and controls (C/TTCTG: γ 2 value = 0.014; p = 0.904). The CNTNAP2 polymorphism identified in this population, hence, is not linked to the ASD phenotype.
Candidate Genes for Inherited Autism Susceptibility in the Lebanese Population

PubMed Central

Kourtian, Silva; Soueid, Jihane; Makhoul, Nadine J.; Guisso, Dikran Richard; Chahrour, Maria; Boustany, Rose-Mary N.

2017-01-01

Autism spectrum disorder (ASD) is characterized by ritualistic-repetitive behaviors and impaired verbal/non-verbal communication. Many ASD susceptibility genes implicated in neuronal pathways/brain development have been identified. The Lebanese population is ideal for uncovering recessive genes because of shared ancestry and a high rate of consanguineous marriages. Aims here are to analyze for published ASD genes and uncover novel inherited ASD susceptibility genes specific to the Lebanese. We recruited 36 ASD families (ASD: 37, unaffected parents: 36, unaffected siblings: 33) and 100 unaffected Lebanese controls. Cytogenetics 2.7 M Microarrays/CytoScan™ HD arrays allowed mapping of homozygous regions of the genome. The CNTNAP2 gene was screened by Sanger sequencing. Homozygosity mapping uncovered DPP4, TRHR, and MLF1 as novel candidate susceptibility genes for ASD in the Lebanese. Sequencing of hot spot exons in CNTNAP2 led to discovery of a 5 bp insertion in 23/37 ASD patients. This mutation was present in unaffected family members and unaffected Lebanese controls. Although a slight increase in number was observed in ASD patients and family members compared to controls, there were no significant differences in allele frequencies between affecteds and controls (C/TTCTG: γ2 value = 0.014; p = 0.904). The CNTNAP2 polymorphism identified in this population, hence, is not linked to the ASD phenotype. PMID:28358038
Integrative strategies to identify candidate genes in rodent models of human alcoholism.

PubMed

Treadwell, Julie A

2006-01-01

The search for genes underlying alcohol-related behaviours in rodent models of human alcoholism has been ongoing for many years with only limited success. Recently, new strategies that integrate several of the traditional approaches have provided new insights into the molecular mechanisms underlying ethanol's actions in the brain. We have used alcohol-preferring C57BL/6J (B6) and alcohol-avoiding DBA/2J (D2) genetic strains of mice in an integrative strategy combining high-throughput gene expression screening, genetic segregation analysis, and mapping to previously published quantitative trait loci to uncover candidate genes for the ethanol-preference phenotype. In our study, 2 genes, retinaldehyde binding protein 1 (Rlbp1) and syntaxin 12 (Stx12), were found to be strong candidates for ethanol preference. Such experimental approaches have the power and the potential to greatly speed up the laborious process of identifying candidate genes for the animal models of human alcoholism.
Integrative mouse and human mRNA studies using WGCNA nominates novel candidate genes involved in the pathogenesis of major depressive disorder.

PubMed

Malki, Karim; Tosto, Maria Grazia; Jumabhoy, Irfan; Lourdusamy, Anbarasu; Sluyter, Frans; Craig, Ian; Uher, Rudolf; McGuffin, Peter; Schalkwyk, Leonard C

2013-12-01

This study aims to identify novel genes associated with major depressive disorder and pharmacological treatment response using animal and human mRNA studies. Weighted gene coexpression network analysis was used to uncover genes associated with stress factors in mice and to inform mRNA probe set selection in a post-mortem study of depression. A total of 171 genes were found to be differentially regulated in response to both early and late stress protocols in a mouse study. Ten human genes, orthologous to mouse genes differentially expressed by stress, were also found to be dysregulated in depressed cases in a human post-mortem brain study from the Stanley Foundation Brain Collection. Several novel genes associated with depression were uncovered, including NOVA1 and USP9X. Moreover, we found further evidence in support of hippocampal neurogenesis and peripheral inflammation in major depressive disorder.
Knowledge Discovery in Biological Databases for Revealing Candidate Genes Linked to Complex Phenotypes.

PubMed

Hassani-Pak, Keywan; Rawlings, Christopher

2017-06-13

Genetics and "omics" studies designed to uncover genotype to phenotype relationships often identify large numbers of potential candidate genes, among which the causal genes are hidden. Scientists generally lack the time and technical expertise to review all relevant information available from the literature, from key model species and from a potentially wide range of related biological databases in a variety of data formats with variable quality and coverage. Computational tools are needed for the integration and evaluation of heterogeneous information in order to prioritise candidate genes and components of interaction networks that, if perturbed through potential interventions, have a positive impact on the biological outcome in the whole organism without producing negative side effects. Here we review several bioinformatics tools and databases that play an important role in biological knowledge discovery and candidate gene prioritization. We conclude with several key challenges that need to be addressed in order to facilitate biological knowledge discovery in the future.
Uncovering the Role of BMP Signaling in Melanocyte Development and Melanoma Tumorigenesis

DTIC Science & Technology

2014-07-01

clear that these mutations are not sufficient for melanoma formation and other genes are involved. Using genomic studies and cross -species...studies and cross -species comparisons to identify several candidates. One of these candidates, GDF6, is a BMP factor that is recurrently amplified...having to cross cell membranes. Antibodies, such as the VEGF blocker bevacizumab, epitomize this type of therapy. We are currently investigating the
Genomic analysis of Meckel–Gruber syndrome in Arabs reveals marked genetic heterogeneity and novel candidate genes

PubMed Central

Shaheen, Ranad; Faqeih, Eissa; Alshammari, Muneera J; Swaid, Abdulrahman; Al-Gazali, Lihadh; Mardawi, Elham; Ansari, Shinu; Sogaty, Sameera; Seidahmed, Mohammed Z; AlMotairi, Muhammed I; Farra, Chantal; Kurdi, Wesam; Al-Rasheed, Shatha; Alkuraya, Fowzan S

2013-01-01

Meckel–Gruber syndrome (MKS, OMIM #249000) is a multiple congenital malformation syndrome that represents the severe end of the ciliopathy phenotypic spectrum. Despite the relatively common occurrence of this syndrome among Arabs, little is known about its genetic architecture in this population. This is a series of 18 Arab families with MKS, who were evaluated clinically and studied using autozygome-guided mutation analysis and exome sequencing. We show that autozygome-guided candidate gene analysis identified the underlying mutation in the majority (n=12, 71%). Exome sequencing revealed a likely pathogenic mutation in three novel candidate MKS disease genes. These include C5orf42, Ellis–van-Creveld disease gene EVC2 and SEC8 (also known as EXOC4), which encodes an exocyst protein with an established role in ciliogenesis. This is the largest and most comprehensive genomic study on MKS in Arabs and the results, in addition to revealing genetic and allelic heterogeneity, suggest that previously reported disease genes and the novel candidates uncovered by this study account for the overwhelming majority of MKS patients in our population. PMID:23169490
Comparative analysis of protein interactome networks prioritizes candidate genes with cancer signatures.

PubMed

Li, Yongsheng; Sahni, Nidhi; Yi, Song

2016-11-29

Comprehensive understanding of human cancer mechanisms requires the identification of a thorough list of cancer-associated genes, which could serve as biomarkers for diagnoses and therapies in various types of cancer. Although substantial progress has been made in functional studies to uncover genes involved in cancer, these efforts are often time-consuming and costly. Therefore, it remains challenging to comprehensively identify cancer candidate genes. Network-based methods have accelerated this process through the analysis of complex molecular interactions in the cell. However, the extent to which various interactome networks can contribute to prediction of candidate genes responsible for cancer is still enigmatic. In this study, we evaluated different human protein-protein interactome networks and compared their application to cancer gene prioritization. Our results indicate that network analyses can increase the power to identify novel cancer genes. In particular, such predictive power can be enhanced with the use of unbiased systematic protein interaction maps for cancer gene prioritization. Functional analysis reveals that the top ranked genes from network predictions co-occur often with cancer-related terms in literature, and further, these candidate genes are indeed frequently mutated across cancers. Finally, our study suggests that integrating interactome networks with other omics datasets could provide novel insights into cancer-associated genes and underlying molecular mechanisms.
Uncovering the genetic basis of attenuation in Marek’s disease virus

USDA-ARS?s Scientific Manuscript database

While in vitro serial passage of Marek’s disease virus (MDV) is a proven method to attenuate MDV strains, the underlying genetic changes responsible for attenuation remains unknown. To identify candidate genes and mutations, a virulent MDV generated from an Md5-containing BAC clone was serially pass...
Rare variants and autoimmune disease.

PubMed

Massey, Jonathan; Eyre, Steve

2014-09-01

The study of rare variants in monogenic forms of autoimmune disease has offered insight into the aetiology of more complex pathologies. Research in complex autoimmune disease initially focused on sequencing candidate genes, with some early successes, notably in uncovering low-frequency variation associated with Type 1 diabetes mellitus. However, other early examples have proved difficult to replicate, and a recent study across six autoimmune diseases, re-sequencing 25 autoimmune disease-associated genes in large sample sizes, failed to find any associated rare variants. The study of rare and low-frequency variation in autoimmune diseases has been made accessible by the inclusion of such variants on custom genotyping arrays (e.g. Immunochip and Exome arrays). Whole-exome sequencing approaches are now also being utilised to uncover the contribution of rare coding variants to disease susceptibility, severity and treatment response. Other sequencing strategies are starting to uncover the role of regulatory rare variation. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Comparative mRNA analysis of behavioral and genetic mouse models of aggression.

PubMed

Malki, Karim; Tosto, Maria G; Pain, Oliver; Sluyter, Frans; Mineur, Yann S; Crusio, Wim E; de Boer, Sietse; Sandnabba, Kenneth N; Kesserwani, Jad; Robinson, Edward; Schalkwyk, Leonard C; Asherson, Philip

2016-04-01

Mouse models of aggression have traditionally compared strains, most notably BALB/cJ and C57BL/6. However, these strains were not designed to study aggression despite differences in aggression-related traits and distinct reactivity to stress. This study evaluated expression of genes differentially regulated in a stress (behavioral) mouse model of aggression with those from a recent genetic mouse model aggression. The study used a discovery-replication design using two independent mRNA studies from mouse brain tissue. The discovery study identified strain (BALB/cJ and C57BL/6J) × stress (chronic mild stress or control) interactions. Probe sets differentially regulated in the discovery set were intersected with those uncovered in the replication study, which evaluated differences between high and low aggressive animals from three strains specifically bred to study aggression. Network analysis was conducted on overlapping genes uncovered across both studies. A significant overlap was found with the genetic mouse study sharing 1,916 probe sets with the stress model. Fifty-one probe sets were found to be strongly dysregulated across both studies mapping to 50 known genes. Network analysis revealed two plausible pathways including one centered on the UBC gene hub which encodes ubiquitin, a protein well-known for protein degradation, and another on P38 MAPK. Findings from this study support the stress model of aggression, which showed remarkable molecular overlap with a genetic model. The study uncovered a set of candidate genes including the Erg2 gene, which has previously been implicated in different psychopathologies. The gene networks uncovered points at a Redox pathway as potentially being implicated in aggressive related behaviors. © 2016 Wiley Periodicals, Inc.
Computational discovery of pathway-level genetic vulnerabilities in non-small-cell lung cancer | Office of Cancer Genomics

Cancer.gov

Novel approaches are needed for discovery of targeted therapies for non-small-cell lung cancer (NSCLC) that are specific to certain patients. Whole genome RNAi screening of lung cancer cell lines provides an ideal source for determining candidate drug targets. Unsupervised learning algorithms uncovered patterns of differential vulnerability across lung cancer cell lines to loss of functionally related genes. Such genetic vulnerabilities represent candidate targets for therapy and are found to be involved in splicing, translation and protein folding.
Identification of Causal Genes, Networks, and Transcriptional Regulators of REM Sleep and Wake

PubMed Central

Millstein, Joshua; Winrow, Christopher J.; Kasarskis, Andrew; Owens, Joseph R.; Zhou, Lili; Summa, Keith C.; Fitzpatrick, Karrie; Zhang, Bin; Vitaterna, Martha H.; Schadt, Eric E.; Renger, John J.; Turek, Fred W.

2011-01-01

Study Objective: Sleep-wake traits are well-known to be under substantial genetic control, but the specific genes and gene networks underlying primary sleep-wake traits have largely eluded identification using conventional approaches, especially in mammals. Thus, the aim of this study was to use systems genetics and statistical approaches to uncover the genetic networks underlying 2 primary sleep traits in the mouse: 24-h duration of REM sleep and wake. Design: Genome-wide RNA expression data from 3 tissues (anterior cortex, hypothalamus, thalamus/midbrain) were used in conjunction with high-density genotyping to identify candidate causal genes and networks mediating the effects of 2 QTL regulating the 24-h duration of REM sleep and one regulating the 24-h duration of wake. Setting: Basic sleep research laboratory. Patients or Participants: Male [C57BL/6J × (BALB/cByJ × C57BL/6J*) F1] N2 mice (n = 283). Interventions: None. Measurements and Results: The genetic variation of a mouse N2 mapping cross was leveraged against sleep-state phenotypic variation as well as quantitative gene expression measurement in key brain regions using integrative genomics approaches to uncover multiple causal sleep-state regulatory genes, including several surprising novel candidates, which interact as components of networks that modulate REM sleep and wake. In particular, it was discovered that a core network module, consisting of 20 genes, involved in the regulation of REM sleep duration is conserved across the cortex, hypothalamus, and thalamus. A novel application of a formal causal inference test was also used to identify those genes directly regulating sleep via control of expression. Conclusion: Systems genetics approaches reveal novel candidate genes, complex networks and specific transcriptional regulators of REM sleep and wake duration in mammals. Citation: Millstein J; Winrow CJ; Kasarskis A; Owens JR; Zhou L; Summa KC; Fitzpatrick K; Zhang B; Vitaterna MH; Schadt EE; Renger JJ; Turek FW. Identification of causal genes, networks, and transcriptional regulators of REM sleep and wake. SLEEP 2011;34(11):1469-1477. PMID:22043117
Pan-cancer analysis of somatic copy number alterations implicates IRS4 and IGF2 in enhancer hijacking

PubMed Central

Weischenfeldt, Joachim; Dubash, Taronish; Drainas, Alexandros P.; Mardin, Balca R.; Chen, Yuanyuan; Stütz, Adrian M.; Waszak, Sebastian M.; Bosco, Graziella; Halvorsen, Ann Rita; Raeder, Benjamin; Efthymiopoulos, Theocharis; Erkek, Serap; Siegl, Christine; Brenner, Hermann; Brustugun, Odd Terje; Dieter, Sebastian M.; Northcott, Paul A.; Petersen, Iver; Pfister, Stefan M.; Schneider, Martin; Solberg, Steinar K.; Thunissen, Erik; Weichert, Wilko; Zichner, Thomas; Thomas, Roman; Peifer, Martin; Helland, Aslaug; Ball, Claudia R.; Jechlinger, Martin; Sotillo, Rocio; Glimm, Hanno; Korbel, Jan O.

2018-01-01

Extensive prior research has focused on somatic copy-number alterations (SCNAs) affecting cancer genes, yet the extent to which recurrent SCNAs exert their influence through rearranging cis-regulatory elements remains unclear. Here, we present a framework for inferring cancer-related gene overexpression resulting from cis-regulatory element reorganization (e.g., enhancer hijacking), by integrating SCNAs, gene expression data, and information on chromatin interaction domains. Analysis of 7,416 cancer genomes uncovered several pan-cancer candidate genes, including IRS4, SMARCA1 and TERT. We demonstrate that IRS4 overexpression in lung cancer associates with recurrent deletions in cis, and present evidence supporting a tumor-promoting role. We additionally pursued cancer type-specific analyses, uncovering IGF2 as a target for enhancer hijacking in colorectal cancer. IGF2-containing tandem duplications result in the de novo formation of a 3D contact domain comprising IGF2 and a lineage-specific super-enhancer, which mediates high-level gene activation. Our framework enables systematic inference of cis-regulatory element rearrangements mediating dysregulation in cancer. PMID:27869826
High-throughput, pooled sequencing identifies mutations in NUBPL and FOXRED1 in human complex I deficiency

PubMed Central

Calvo, Sarah E; Tucker, Elena J; Compton, Alison G; Kirby, Denise M; Crawford, Gabriel; Burtt, Noel P; Rivas, Manuel A; Guiducci, Candace; Bruno, Damien L; Goldberger, Olga A; Redman, Michelle C; Wiltshire, Esko; Wilson, Callum J; Altshuler, David; Gabriel, Stacey B; Daly, Mark J; Thorburn, David R; Mootha, Vamsi K

2010-01-01

Discovering the molecular basis of mitochondrial respiratory chain disease is challenging given the large number of both mitochondrial and nuclear genes involved. We report a strategy of focused candidate gene prediction, high-throughput sequencing, and experimental validation to uncover the molecular basis of mitochondrial complex I (CI) disorders. We created five pools of DNA from a cohort of 103 patients and then performed deep sequencing of 103 candidate genes to spotlight 151 rare variants predicted to impact protein function. We used confirmatory experiments to establish genetic diagnoses in 22% of previously unsolved cases, and discovered that defects in NUBPL and FOXRED1 can cause CI deficiency. Our study illustrates how large-scale sequencing, coupled with functional prediction and experimental validation, can reveal novel disease-causing mutations in individual patients. PMID:20818383
Scuba: scalable kernel-based gene prioritization.

PubMed

Zampieri, Guido; Tran, Dinh Van; Donini, Michele; Navarin, Nicolò; Aiolli, Fabio; Sperduti, Alessandro; Valle, Giorgio

2018-01-25

The uncovering of genes linked to human diseases is a pressing challenge in molecular biology and precision medicine. This task is often hindered by the large number of candidate genes and by the heterogeneity of the available information. Computational methods for the prioritization of candidate genes can help to cope with these problems. In particular, kernel-based methods are a powerful resource for the integration of heterogeneous biological knowledge, however, their practical implementation is often precluded by their limited scalability. We propose Scuba, a scalable kernel-based method for gene prioritization. It implements a novel multiple kernel learning approach, based on a semi-supervised perspective and on the optimization of the margin distribution. Scuba is optimized to cope with strongly unbalanced settings where known disease genes are few and large scale predictions are required. Importantly, it is able to efficiently deal both with a large amount of candidate genes and with an arbitrary number of data sources. As a direct consequence of scalability, Scuba integrates also a new efficient strategy to select optimal kernel parameters for each data source. We performed cross-validation experiments and simulated a realistic usage setting, showing that Scuba outperforms a wide range of state-of-the-art methods. Scuba achieves state-of-the-art performance and has enhanced scalability compared to existing kernel-based approaches for genomic data. This method can be useful to prioritize candidate genes, particularly when their number is large or when input data is highly heterogeneous. The code is freely available at https://github.com/gzampieri/Scuba .
Leveraging lung tissue transcriptome to uncover candidate causal genes in COPD genetic associations.

PubMed

Lamontagne, Maxime; Bérubé, Jean-Christophe; Obeidat, Ma'en; Cho, Michael H; Hobbs, Brian D; Sakornsakolpat, Phuwanat; de Jong, Kim; Boezen, H Marike; Nickle, David; Hao, Ke; Timens, Wim; van den Berge, Maarten; Joubert, Philippe; Laviolette, Michel; Sin, Don D; Paré, Peter D; Bossé, Yohan

2018-05-15

Causal genes of chronic obstructive pulmonary disease (COPD) remain elusive. The current study aims at integrating genome-wide association studies (GWAS) and lung expression quantitative trait loci (eQTL) data to map COPD candidate causal genes and gain biological insights into the recently discovered COPD susceptibility loci. Two complementary genomic datasets on COPD were studied. First, the lung eQTL dataset which included whole-genome gene expression and genotyping data from 1038 individuals. Second, the largest COPD GWAS to date from the International COPD Genetics Consortium (ICGC) with 13 710 cases and 38 062 controls. Methods that integrated GWAS with eQTL signals including transcriptome-wide association study (TWAS), colocalization and Mendelian randomization-based (SMR) approaches were used to map causality genes, i.e. genes with the strongest evidence of being the functional effector at specific loci. These methods were applied at the genome-wide level and at COPD risk loci derived from the GWAS literature. Replication was performed using lung data from GTEx. We collated 129 non-overlapping risk loci for COPD from the GWAS literature. At the genome-wide scale, 12 new COPD candidate genes/loci were revealed and six replicated in GTEx including CAMK2A, DMPK, MYO15A, TNFRSF10A, BTN3A2 and TRBV30. In addition, we mapped candidate causal genes for 60 out of the 129 GWAS-nominated loci and 23 of them were replicated in GTEx. Mapping candidate causal genes in lung tissue represents an important contribution to the genetics of COPD, enriches our biological interpretation of GWAS findings, and brings us closer to clinical translation of genetic associations.

Pervasive and opposing effects of Unpredictable Chronic Mild Stress (UCMS) on hippocampal gene expression in BALB/cJ and C57BL/6J mouse strains.

PubMed

Malki, Karim; Mineur, Yann S; Tosto, Maria Grazia; Campbell, James; Karia, Priya; Jumabhoy, Irfan; Sluyter, Frans; Crusio, Wim E; Schalkwyk, Leonard C

2015-04-03

BALB/cJ is a strain susceptible to stress and extremely susceptible to a defective hedonic impact in response to chronic stressors. The strain offers much promise as an animal model for the study of stress related disorders. We present a comparative hippocampal gene expression study on the effects of unpredictable chronic mild stress on BALB/cJ and C57BL/6J mice. Affymetrix MOE 430 was used to measure hippocampal gene expression from 16 animals of two different strains (BALB/cJ and C57BL/6J) of both sexes and subjected to either unpredictable chronic mild stress (UCMS) or no stress. Differences were statistically evaluated through supervised and unsupervised linear modelling and using Weighted Gene Coexpression Network Analysis (WGCNA). In order to gain further understanding into mechanisms related to stress response, we cross-validated our results with a parallel study from the GENDEP project using WGCNA in a meta-analysis design. The effects of UCMS are visible through Principal Component Analysis which highlights the stress sensitivity of the BALB/cJ strain. A number of genes and gene networks related to stress response were uncovered including the Creb1 gene. WGCNA and pathway analysis revealed a gene network centered on Nfkb1. Results from the meta-analysis revealed a highly significant gene pathway centred on the Ubiquitin C (Ubc) gene. All pathways uncovered are associated with inflammation and immune response. The study investigated the molecular mechanisms underlying the response to adverse environment in an animal model using a GxE design. Stress-related differences were visible at the genomic level through PCA analysis highlighting the high sensitivity of BALB/cJ animals to environmental stressors. Several candidate genes and gene networks reported are associated with inflammation and neurogenesis and could serve to inform candidate gene selection in human studies and provide additional insight into the pathology of Major Depressive Disorder.
Modeling 3D Facial Shape from DNA

PubMed Central

Claes, Peter; Liberton, Denise K.; Daniels, Katleen; Rosana, Kerri Matthes; Quillen, Ellen E.; Pearson, Laurel N.; McEvoy, Brian; Bauchet, Marc; Zaidi, Arslan A.; Yao, Wei; Tang, Hua; Barsh, Gregory S.; Absher, Devin M.; Puts, David A.; Rocha, Jorge; Beleza, Sandra; Pereira, Rinaldo W.; Baynam, Gareth; Suetens, Paul; Vandermeulen, Dirk; Wagner, Jennifer K.; Boster, James S.; Shriver, Mark D.

2014-01-01

Human facial diversity is substantial, complex, and largely scientifically unexplained. We used spatially dense quasi-landmarks to measure face shape in population samples with mixed West African and European ancestry from three locations (United States, Brazil, and Cape Verde). Using bootstrapped response-based imputation modeling (BRIM), we uncover the relationships between facial variation and the effects of sex, genomic ancestry, and a subset of craniofacial candidate genes. The facial effects of these variables are summarized as response-based imputed predictor (RIP) variables, which are validated using self-reported sex, genomic ancestry, and observer-based facial ratings (femininity and proportional ancestry) and judgments (sex and population group). By jointly modeling sex, genomic ancestry, and genotype, the independent effects of particular alleles on facial features can be uncovered. Results on a set of 20 genes showing significant effects on facial features provide support for this approach as a novel means to identify genes affecting normal-range facial features and for approximating the appearance of a face from genetic markers. PMID:24651127
Drosophila glucome screening identifies Ck1alpha as a regulator of mammalian glucose metabolism

PubMed Central

Ugrankar, Rupali; Berglund, Eric; Akdemir, Fatih; Tran, Christopher; Kim, Min Soo; Noh, Jungsik; Schneider, Rebekka; Ebert, Benjamin; Graff, Jonathan M.

2015-01-01

Circulating carbohydrates are an essential energy source, perturbations in which are pathognomonic of various diseases, diabetes being the most prevalent. Yet many of the genes underlying diabetes and its characteristic hyperglycaemia remain elusive. Here we use physiological and genetic interrogations in D. melanogaster to uncover the ‘glucome', the complete set of genes involved in glucose regulation in flies. Partial genomic screens of ∼1,000 genes yield ∼160 hyperglycaemia ‘flyabetes' candidates that we classify using fat body- and muscle-specific knockdown and biochemical assays. The results highlight the minor glucose fraction as a physiological indicator of metabolism in Drosophila. The hits uncovered in our screen may have conserved functions in mammalian glucose homeostasis, as heterozygous and homozygous mutants of Ck1alpha in the murine adipose lineage, develop diabetes. Our findings demonstrate that glucose has a role in fly biology and that genetic screenings carried out in flies may increase our understanding of mammalian pathophysiology. PMID:25994086
Search for 5'-leader regulatory RNA structures based on gene annotation aided by the RiboGap database.

PubMed

Naghdi, Mohammad Reza; Smail, Katia; Wang, Joy X; Wade, Fallou; Breaker, Ronald R; Perreault, Jonathan

2017-03-15

The discovery of noncoding RNAs (ncRNAs) and their importance for gene regulation led us to develop bioinformatics tools to pursue the discovery of novel ncRNAs. Finding ncRNAs de novo is challenging, first due to the difficulty of retrieving large numbers of sequences for given gene activities, and second due to exponential demands on calculation needed for comparative genomics on a large scale. Recently, several tools for the prediction of conserved RNA secondary structure were developed, but many of them are not designed to uncover new ncRNAs, or are too slow for conducting analyses on a large scale. Here we present various approaches using the database RiboGap as a primary tool for finding known ncRNAs and for uncovering simple sequence motifs with regulatory roles. This database also can be used to easily extract intergenic sequences of eubacteria and archaea to find conserved RNA structures upstream of given genes. We also show how to extend analysis further to choose the best candidate ncRNAs for experimental validation. Copyright © 2017 Elsevier Inc. All rights reserved.
Candidate gene association mapping of Sclerotinia stalk rot resistance in sunflower (Helianthus annuus L.) uncovers the importance of COI1 homologs.

PubMed

Talukder, Zahirul I; Hulke, Brent S; Qi, Lili; Scheffler, Brian E; Pegadaraju, Venkatramana; McPhee, Kevin; Gulya, Thomas J

2014-01-01

Functional markers for Sclerotinia basal stalk rot resistance in sunflower were obtained using gene-level information from the model species Arabidopsis thaliana. Sclerotinia stalk rot, caused by Sclerotinia sclerotiorum, is one of the most destructive diseases of sunflower (Helianthus annuus L.) worldwide. Markers for genes controlling resistance to S. sclerotiorum will enable efficient marker-assisted selection (MAS). We sequenced eight candidate genes homologous to Arabidopsis thaliana defense genes known to be associated with Sclerotinia disease resistance in a sunflower association mapping population evaluated for Sclerotinia stalk rot resistance. The total candidate gene sequence regions covered a concatenated length of 3,791 bp per individual. A total of 187 polymorphic sites were detected for all candidate gene sequences, 149 of which were single nucleotide polymorphisms (SNPs) and 38 were insertions/deletions. Eight SNPs in the coding regions led to changes in amino acid codons. Linkage disequilibrium decay throughout the candidate gene regions declined on average to an r (2) = 0.2 for genetic intervals of 120 bp, but extended up to 350 bp with r (2) = 0.1. A general linear model with modification to account for population structure was found the best fitting model for this population and was used for association mapping. Both HaCOI1-1 and HaCOI1-2 were found to be strongly associated with Sclerotinia stalk rot resistance and explained 7.4 % of phenotypic variation in this population. These SNP markers associated with Sclerotinia stalk rot resistance can potentially be applied to the selection of favorable genotypes, which will significantly improve the efficiency of MAS during the development of stalk rot resistant cultivars.
Differential Connectivity in Colorectal Cancer Gene Expression Network

PubMed

Izadi, Fereshteh

2018-05-30

Colorectal cancer (CRC) is one of the challenging types of cancers; thus, exploring effective biomarkers related to colorectal could lead to significant progresses toward the treatment of this disease. In the present study, CRC gene expression datasets have been reanalyzed. Mutual differentially expressed genes across 294 normal mucosa and adjacent tumoral samples were then utilized in order to build two independent transcriptional regulatory networks. By analyzing the networks topologically, genes with differential global connectivity related to cancer state were determined for which the potential transcriptional regulators including transcription factors were identified. The majority of differentially connected genes (DCGs) were up-regulated in colorectal transcriptome experiments. Moreover, a number of these genes have been experimentally validated as cancer or CRC-associated genes. The DCGs, including GART, TGFB1, ITGA2, SLC16A5, SOX9, and MMP7, were investigated across 12 cancer types. Functional enrichment analysis followed by detailed data mining exhibited that these candidate genes could be related to CRC by mediating in metastatic cascade in addition to shared pathways with 12 cancer types by triggering the inflammatory events Our study uncovered correlated alterations in gene expression related to CRC susceptibility and progression that the potent candidate biomarkers could provide a link to disease.
Identification and fine mapping of a stay-green gene (Brnye1) in pakchoi (Brassica campestris L. ssp. chinensis).

PubMed

Wang, Nan; Liu, Zhiyong; Zhang, Yun; Li, Chengyu; Feng, Hui

2018-03-01

Using bulked segregant analysis combined with next-generation sequencing, we delimited the Brnye1 gene responsible for the stay-green trait of nye in pakchoi. Sequence analysis identified Bra019346 as the candidate gene. "Stay-green" refers to a plant trait whereby leaves remain green during senescence. This trait is useful in the cultivation of pakchoi (Brassica campestris L. ssp. chinensis), which is marketed as a green leaf product. This study aimed to identify the gene responsible for the stay-green trait in pakchoi. We identified a stay-green mutant in pakchoi, which we termed "nye". Genetic analysis revealed that the stay-green trait is controlled by a single recessive gene, Brnye1. Using the BSA-seq method, a 3.0-Mb candidate region was mapped on chromosome A03, which helped us localize Brnye1 to an 81.01-kb interval between SSR markers SSRWN27 and SSRWN30 via linkage analysis in an F 2 population. We identified 12 genes in this region, 11 of which were annotated based on the Brassica rapa annotation database, and one was a functionally unknown gene. An orthologous gene of the Arabidopsis gene AtNYE1, Bra019346, was identified as the potential candidate for Brnye1. Sequence analysis revealed a 40-bp insertion in the second exon of Bra019346 in nye, which generated the TAA stop codon. A candidate gene-specific Indel marker in 1561 F 2 individuals showed perfect cosegregation with Brnye1 in the nye mutant. These results provide a foundation for uncovering the molecular mechanism of the stay-green trait in pakchoi.
Identification of genes and gene pathways associated with major depressive disorder by integrative brain analysis of rat and human prefrontal cortex transcriptomes

PubMed Central

Malki, K; Pain, O; Tosto, M G; Du Rietz, E; Carboni, L; Schalkwyk, L C

2015-01-01

Despite moderate heritability estimates, progress in uncovering the molecular substrate underpinning major depressive disorder (MDD) has been slow. In this study, we used prefrontal cortex (PFC) gene expression from a genetic rat model of MDD to inform probe set prioritization in PFC in a human post-mortem study to uncover genes and gene pathways associated with MDD. Gene expression differences between Flinders sensitive (FSL) and Flinders resistant (FRL) rat lines were statistically evaluated using the RankProd, non-parametric algorithm. Top ranking probe sets in the rat study were subsequently used to prioritize orthologous selection in a human PFC in a case–control post-mortem study on MDD from the Stanley Brain Consortium. Candidate genes in the human post-mortem study were then tested against a matched control sample using the RankProd method. A total of 1767 probe sets were differentially expressed in the PFC between FSL and FRL rat lines at (q⩽0.001). A total of 898 orthologous probe sets was found on Affymetrix's HG-U95A chip used in the human study. Correcting for the number of multiple, non-independent tests, 20 probe sets were found to be significantly dysregulated between human cases and controls at q⩽0.05. These probe sets tagged the expression profile of 18 human genes (11 upregulated and seven downregulated). Using an integrative rat–human study, a number of convergent genes that may have a role in pathogenesis of MDD were uncovered. Eighty percent of these genes were functionally associated with a key stress response signalling cascade, involving NF-κB (nuclear factor kappa-light-chain-enhancer of activated B cells), AP-1 (activator protein 1) and ERK/MAPK, which has been systematically associated with MDD, neuroplasticity and neurogenesis. PMID:25734512
Cracking the genomic piggy bank: identifying secrets of the pig genome.

PubMed

Mote, B E; Rothschild, M F

2006-01-01

Though researchers are uncovering valuable information about the pig genome at unprecedented speed, the porcine genome community is barely scratching the surface as to understanding interactions of the biological code. The pig genetic linkage map has nearly 5,000 loci comprised of genes, microsatellites, and amplified fragment length polymorphism markers. Likewise, the physical map is becoming denser with nearly 6,000 markers. The long awaited sequencing efforts are providing multidimensional benefits with sequence available for comparative genomics and identifying single nucleotide polymorphisms for use in linkage and trait association studies. Scientists are using exotic and commercial breeds for quantitative trait loci scans. Additionally, candidate gene studies continue to identify chromosomal regions or genes associated with economically important traits such as growth rate, leanness, feed intake, meat quality, litter size, and disease resistance. The commercial pig industry is actively incorporating these markers in marker-assisted selection along with traditional performance information to improve said traits. Researchers are utilizing novel tools including pig microarrays along with advanced bioinformatics to identify new candidate genes, understand gene function, and piece together gene networks involved in important biological processes. Advances in pig genomics and implications to the pork industry as well as human health are reviewed.
Massive sequencing of 70 genes reveals a myriad of missing genes or mechanisms to be uncovered in hereditary spastic paraplegias

PubMed Central

Morais, Sara; Raymond, Laure; Mairey, Mathilde; Coutinho, Paula; Brandão, Eva; Ribeiro, Paula; Loureiro, José Leal; Sequeiros, Jorge; Brice, Alexis; Alonso, Isabel; Stevanin, Giovanni

2017-01-01

Hereditary spastic paraplegias (HSP) are neurodegenerative disorders characterized by lower limb spasticity and weakness that can be complicated by other neurological or non-neurological signs. Despite a high genetic heterogeneity (>60 causative genes), 40–70% of the families remain without a molecular diagnosis. Analysis of one of the pioneer cohorts of 193 HSP families generated in the early 1990s in Portugal highlighted that SPAST and SPG11 are the most frequent diagnoses. We have now explored 98 unsolved families from this series using custom next generation sequencing panels analyzing up to 70 candidate HSP genes. We identified the likely disease-causing variant in 20 of the 98 families with KIF5A being the most frequently mutated gene. We also found 52 variants of unknown significance (VUS) in 38% of the cases. These new diagnoses resulted in 42% of solved cases in the full Portuguese cohort (81/193). Segregation of the variants was not always compatible with the presumed inheritance, indicating that the analysis of all HSP genes regardless of the inheritance mode can help to explain some cases. Our results show that there is still a large set of unknown genes responsible for HSP and most likely novel mechanisms or inheritance modes leading to the disease to be uncovered, but this will require international collaborative efforts, particularly for the analysis of VUS. PMID:28832565
Novel Genes Affecting the Interaction between the Cabbage Whitefly and Arabidopsis Uncovered by Genome-Wide Association Mapping

PubMed Central

Broekgaarden, Colette; Bucher, Johan; Bac-Molenaar, Johanna; Keurentjes, Joost J. B.; Kruijer, Willem; Voorrips, Roeland E.; Vosman, Ben

2015-01-01

Plants have evolved a variety of ways to defend themselves against biotic attackers. This has resulted in the presence of substantial variation in defense mechanisms among plants, even within a species. Genome-wide association (GWA) mapping is a useful tool to study the genetic architecture of traits, but has so far only had limited exploitation in studies of plant defense. Here, we study the genetic architecture of defense against the phloem-feeding insect cabbage whitefly (Aleyrodes proletella) in Arabidopsis thaliana. We determined whitefly performance, i.e. the survival and reproduction of whitefly females, on 360 worldwide selected natural accessions and subsequently performed GWA mapping using 214,051 SNPs. Substantial variation for whitefly adult survival and oviposition rate (number of eggs laid per female per day) was observed between the accessions. We identified 39 candidate SNPs for either whitefly adult survival or oviposition rate, all with relatively small effects, underpinning the complex architecture of defense traits. Among the corresponding candidate genes, i.e. genes in linkage disequilibrium (LD) with candidate SNPs, none have previously been identified as a gene playing a role in the interaction between plants and phloem-feeding insects. Whitefly performance on knock-out mutants of a number of candidate genes was significantly affected, validating the potential of GWA mapping for novel gene discovery in plant-insect interactions. Our results show that GWA analysis is a very useful tool to gain insight into the genetic architecture of plant defense against herbivorous insects, i.e. we identified and validated several genes affecting whitefly performance that have not previously been related to plant defense against herbivorous insects. PMID:26699853
Integrated systems analysis reveals a molecular network underlying autism spectrum disorders

PubMed Central

Li, Jingjing; Shi, Minyi; Ma, Zhihai; Zhao, Shuchun; Euskirchen, Ghia; Ziskin, Jennifer; Urban, Alexander; Hallmayer, Joachim; Snyder, Michael

2014-01-01

Autism is a complex disease whose etiology remains elusive. We integrated previously and newly generated data and developed a systems framework involving the interactome, gene expression and genome sequencing to identify a protein interaction module with members strongly enriched for autism candidate genes. Sequencing of 25 patients confirmed the involvement of this module in autism, which was subsequently validated using an independent cohort of over 500 patients. Expression of this module was dichotomized with a ubiquitously expressed subcomponent and another subcomponent preferentially expressed in the corpus callosum, which was significantly affected by our identified mutations in the network center. RNA-sequencing of the corpus callosum from patients with autism exhibited extensive gene mis-expression in this module, and our immunochemical analysis showed that the human corpus callosum is predominantly populated by oligodendrocyte cells. Analysis of functional genomic data further revealed a significant involvement of this module in the development of oligodendrocyte cells in mouse brain. Our analysis delineates a natural network involved in autism, helps uncover novel candidate genes for this disease and improves our understanding of its molecular pathology. PMID:25549968
Genes and gene networks implicated in aggression related behaviour.

PubMed

Malki, Karim; Pain, Oliver; Du Rietz, Ebba; Tosto, Maria Grazia; Paya-Cano, Jose; Sandnabba, Kenneth N; de Boer, Sietse; Schalkwyk, Leonard C; Sluyter, Frans

2014-10-01

Aggressive behaviour is a major cause of mortality and morbidity. Despite of moderate heritability estimates, progress in identifying the genetic factors underlying aggressive behaviour has been limited. There are currently three genetic mouse models of high and low aggression created using selective breeding. This is the first study to offer a global transcriptomic characterization of the prefrontal cortex across all three genetic mouse models of aggression. A systems biology approach has been applied to transcriptomic data across the three pairs of selected inbred mouse strains (Turku Aggressive (TA) and Turku Non-Aggressive (TNA), Short Attack Latency (SAL) and Long Attack Latency (LAL) mice and North Carolina Aggressive (NC900) and North Carolina Non-Aggressive (NC100)), providing novel insight into the neurobiological mechanisms and genetics underlying aggression. First, weighted gene co-expression network analysis (WGCNA) was performed to identify modules of highly correlated genes associated with aggression. Probe sets belonging to gene modules uncovered by WGCNA were carried forward for network analysis using ingenuity pathway analysis (IPA). The RankProd non-parametric algorithm was then used to statistically evaluate expression differences across the genes belonging to modules significantly associated with aggression. IPA uncovered two pathways, involving NF-kB and MAPKs. The secondary RankProd analysis yielded 14 differentially expressed genes, some of which have previously been implicated in pathways associated with aggressive behaviour, such as Adrbk2. The results highlighted plausible candidate genes and gene networks implicated in aggression-related behaviour.
Functional genome-wide siRNA screen identifies KIAA0586 as mutated in Joubert syndrome

PubMed Central

Roosing, Susanne; Hofree, Matan; Kim, Sehyun; Scott, Eric; Copeland, Brett; Romani, Marta; Silhavy, Jennifer L; Rosti, Rasim O; Schroth, Jana; Mazza, Tommaso; Miccinilli, Elide; Zaki, Maha S; Swoboda, Kathryn J; Milisa-Drautz, Joanne; Dobyns, William B; Mikati, Mohamed A; İncecik, Faruk; Azam, Matloob; Borgatti, Renato; Romaniello, Romina; Boustany, Rose-Mary; Clericuzio, Carol L; D'Arrigo, Stefano; Strømme, Petter; Boltshauser, Eugen; Stanzial, Franco; Mirabelli-Badenier, Marisol; Moroni, Isabella; Bertini, Enrico; Emma, Francesco; Steinlin, Maja; Hildebrandt, Friedhelm; Johnson, Colin A; Freilinger, Michael; Vaux, Keith K; Gabriel, Stacey B; Aza-Blanc, Pedro; Heynen-Genel, Susanne; Ideker, Trey; Dynlacht, Brian D; Lee, Ji Eun; Valente, Enza Maria; Kim, Joon; Gleeson, Joseph G

2015-01-01

Defective primary ciliogenesis or cilium stability forms the basis of human ciliopathies, including Joubert syndrome (JS), with defective cerebellar vermis development. We performed a high-content genome-wide small interfering RNA (siRNA) screen to identify genes regulating ciliogenesis as candidates for JS. We analyzed results with a supervised-learning approach, using SYSCILIA gold standard, Cildb3.0, a centriole siRNA screen and the GTex project, identifying 591 likely candidates. Intersection of this data with whole exome results from 145 individuals with unexplained JS identified six families with predominantly compound heterozygous mutations in KIAA0586. A c.428del base deletion in 0.1% of the general population was found in trans with a second mutation in an additional set of 9 of 163 unexplained JS patients. KIAA0586 is an orthologue of chick Talpid3, required for ciliogenesis and Sonic hedgehog signaling. Our results uncover a relatively high frequency cause for JS and contribute a list of candidates for future gene discoveries in ciliopathies. DOI: http://dx.doi.org/10.7554/eLife.06602.001 PMID:26026149
Convergent functional genomics of anxiety disorders: translational identification of genes, biomarkers, pathways and mechanisms.

PubMed

Le-Niculescu, H; Balaraman, Y; Patel, S D; Ayalew, M; Gupta, J; Kuczenski, R; Shekhar, A; Schork, N; Geyer, M A; Niculescu, A B

2011-05-24

Anxiety disorders are prevalent and disabling yet understudied from a genetic standpoint, compared with other major psychiatric disorders such as bipolar disorder and schizophrenia. The fact that they are more common, diverse and perceived as embedded in normal life may explain this relative oversight. In addition, as for other psychiatric disorders, there are technical challenges related to the identification and validation of candidate genes and peripheral biomarkers. Human studies, particularly genetic ones, are susceptible to the issue of being underpowered, because of genetic heterogeneity, the effect of variable environmental exposure on gene expression, and difficulty of accrual of large, well phenotyped cohorts. Animal model gene expression studies, in a genetically homogeneous and experimentally tractable setting, can avoid artifacts and provide sensitivity of detection. Subsequent translational integration of the animal model datasets with human genetic and gene expression datasets can ensure cross-validatory power and specificity for illness. We have used a pharmacogenomic mouse model (involving treatments with an anxiogenic drug--yohimbine, and an anti-anxiety drug--diazepam) as a discovery engine for identification of anxiety candidate genes as well as potential blood biomarkers. Gene expression changes in key brain regions for anxiety (prefrontal cortex, amygdala and hippocampus) and blood were analyzed using a convergent functional genomics (CFG) approach, which integrates our new data with published human and animal model data, as a translational strategy of cross-matching and prioritizing findings. Our work identifies top candidate genes (such as FOS, GABBR1, NR4A2, DRD1, ADORA2A, QKI, RGS2, PTGDS, HSPA1B, DYNLL2, CCKBR and DBP), brain-blood biomarkers (such as FOS, QKI and HSPA1B), pathways (such as cAMP signaling) and mechanisms for anxiety disorders--notably signal transduction and reactivity to environment, with a prominent role for the hippocampus. Overall, this work complements our previous similar work (on bipolar mood disorders and schizophrenia) conducted over the last decade. It concludes our programmatic first pass mapping of the genomic landscape of the triad of major psychiatric disorder domains using CFG, and permitted us to uncover the significant genetic overlap between anxiety and these other major psychiatric disorders, notably the under-appreciated overlap with schizophrenia. PDE10A, TAC1 and other genes uncovered by our work provide a molecular basis for the frequently observed clinical co-morbidity and interdependence between anxiety and other major psychiatric disorders, and suggest schizo-anxiety as a possible new nosological domain.
Somatic mutations in the transcriptional corepressor gene BCORL1 in adult acute myelogenous leukemia.

PubMed

Li, Meng; Collins, Roxane; Jiao, Yuchen; Ouillette, Peter; Bixby, Dale; Erba, Harry; Vogelstein, Bert; Kinzler, Kenneth W; Papadopoulos, Nickolas; Malek, Sami N

2011-11-24

To further our understanding of the genetic basis of acute myelogenous leukemia (AML), we determined the coding exon sequences of ∼ 18 000 protein-encoding genes in 8 patients with secondary AML. Here we report the discovery of novel somatic mutations in the transcriptional corepressor gene BCORL1 that is located on the X-chromosome. Analysis of BCORL1 in an unselected cohort of 173 AML patients identified a total of 10 mutated cases (6%) with BCORL1 mutations, whereas analysis of 19 AML cell lines uncovered 4 (21%) BCORL1 mutated cell lines. The majority (87%) of the mutations in BCORL1 were predicted to inactivate the gene product as a result of nonsense mutations, splice site mutation, or out-of-frame insertions or deletions. These results indicate that BCORL1 by genetic criteria is a novel candidate tumor suppressor gene, joining the growing list of genes recurrently mutated in AML.
GENOMIC BASIS OF AGING AND LIFE HISTORY EVOLUTION IN DROSOPHILA MELANOGASTER

PubMed Central

Remolina, Silvia C.; Chang, Peter L.; Leips, Jeff; Nuzhdin, Sergey V.; Hughes, Kimberly A.

2015-01-01

Natural diversity in aging and other life history patterns is a hallmark of organismal variation. Related species, populations, and individuals within populations show genetically based variation in life span and other aspects of age-related performance. Population differences are especially informative because these differences can be large relative to within-population variation and because they occur in organisms with otherwise similar genomes. We used experimental evolution to produce populations divergent for life span and late-age fertility and then used deep genome sequencing to detect sequence variants with nucleotide-level resolution. Several genes and genome regions showed strong signatures of selection, and the same regions were implicated in independent comparisons, suggesting that the same alleles were selected in replicate lines. Genes related to oogenesis, immunity, and protein degradation were implicated as important modifiers of late-life performance. Expression profiling and functional annotation narrowed the list of strong candidate genes to 38, most of which are novel candidates for regulating aging. Life span and early-age fecundity were negatively correlated among populations; therefore the alleles we identified also are candidate regulators of a major life-history trade-off. More generally, we argue that hitchhiking mapping can be a powerful tool for uncovering the molecular bases of quantitative genetic variation. PMID:23106705
Genomewide DNA methylation analysis in combat veterans reveals a novel locus for PTSD.

PubMed

Mehta, D; Bruenig, D; Carrillo-Roa, T; Lawford, B; Harvey, W; Morris, C P; Smith, A K; Binder, E B; Young, R McD; Voisey, J

2017-11-01

Epigenetic modifications such as DNA methylation may play a key role in the aetiology and serve as biomarkers for post-traumatic stress disorder (PTSD). We performed a genomewide analysis to identify genes whose DNA methylation levels are associated with PTSD. A total of 211 individuals comprising Australian male Vietnam War veterans (n = 96) and males from a general population belonging to the Grady Trauma Project (n = 115) were included. Genomewide DNA methylation was performed from peripheral blood using the Illumina arrays. Data analysis was performed using generalized linear regression models. Differential DNA methylation of 17 previously reported PTSD candidate genes was associated with PTSD symptom severity. Genomewide analyses revealed CpG sites spanning BRSK1, LCN8, NFG and DOCK2 genes were associated with PTSD symptom severity. We replicated the findings of DOCK2 in an independent cohort. Pathway analysis revealed that among the associated genes, genes within actin cytoskeleton and focal adhesion molecular pathways were enriched. These data highlight the role of DNA methylation as biomarkers of PTSD. The results support the role of previous candidates and uncover novel genes associated with PTSD, such as DOCK2. This study contributes to our understanding of the biological underpinnings of PTSD. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Genetics, gene expression and bioinformatics of the pituitary gland.

PubMed

Davis, Shannon W; Potok, Mary Anne; Brinkmeier, Michelle L; Carninci, Piero; Lyons, Robert H; MacDonald, James W; Fleming, Michelle T; Mortensen, Amanda H; Egashira, Noboru; Ghosh, Debashis; Steel, Karen P; Osamura, Robert Y; Hayashizaki, Yoshihide; Camper, Sally A

2009-04-01

Genetic cases of congenital pituitary hormone deficiency are common and many are caused by transcription factor defects. Mouse models with orthologous mutations are invaluable for uncovering the molecular mechanisms that lead to problems in organ development and typical patient characteristics. We are using mutant mice defective in the transcription factors PROP1 and POU1F1 for gene expression profiling to identify target genes for these critical transcription factors and candidates for cases of pituitary hormone deficiency of unknown aetiology. These studies reveal critical roles for Wnt signalling pathways, including the TCF/LEF transcription factors and interacting proteins of the groucho family, bone morphogenetic protein antagonists and targets of notch signalling. Current studies are investigating the roles of novel homeobox genes and pathways that regulate the transition from proliferation to differentiation, cell adhesion and cell migration. Pituitary adenomas are a common human health problem, yet most cases are sporadic, necessitating alternative approaches to traditional Mendelian genetic studies. Mouse models of adenoma formation offer the opportunity for gene expression profiling during progressive stages of hyperplasia, adenoma and tumorigenesis. This approach holds promise for the identification of relevant pathways and candidate genes as risk factors for adenoma formation, understanding mechanisms of progression, and identifying drug targets and clinically relevant biomarkers. Copyright 2009 S. Karger AG, Basel.
Genetics, Gene Expression and Bioinformatics of the Pituitary Gland

PubMed Central

Davis, Shannon W; Potok, Mary Anne; Brinkmeier, Michelle L; Carninci, Piero; Lyons, Robert H; MacDonald, James W.; Fleming, Michelle T; Mortensen, Amanda H; Egashira, Noboru; Ghosh, Debashis; Steel, Karen P.; Osamura, Robert Y; Hayashizaki, Yoshihide; Camper, Sally A

2011-01-01

Genetic cases of congenital pituitary hormone deficiency are common and many are caused by transcription factor defects. Mouse models with orthologous mutations are invaluable for uncovering the molecular mechanisms that lead to problems in organ development and typical patient characteristics. We are using mutant mice defective in the transcription factors PROP1 and POU1F1 for gene expression profiling to identify target genes for these critical transcription factors and candidates for cases of pituitary hormone deficiency of unknown etiology. These studies reveal critical roles for Wnt signalling pathways including the TCF/LEF transcription factors and interacting proteins of the groucho family, bone morphogenetic proteins antagonists, and targets of notch signalling. Current studies are investigating roles of novel homeobox genes and pathways that regulate the transition from proliferation to differentiation, cell adhesion and cell migration. Pituitary adenomas are a common human health problem, yet most cases are sporadic, necessitating alternative approaches to traditional Mendelian genetic studies. Mouse models of adenoma formation offer the opportunity for gene expression profiling during progressive stages of hyperplasia, adenoma and tumorigenesis. This approach holds promise for identification of relevant pathways and candidate genes as risk factors for adenoma formation, understanding mechanisms of progression, and identifying drug targets and clinically relevant biomarkers. PMID:19407506

The ecological genomic basis of salinity adaptation in Tunisian Medicago truncatula.

PubMed

Friesen, Maren L; von Wettberg, Eric J B; Badri, Mounawer; Moriuchi, Ken S; Barhoumi, Fathi; Chang, Peter L; Cuellar-Ortiz, Sonia; Cordeiro, Matilde A; Vu, Wendy T; Arraouadi, Soumaya; Djébali, Naceur; Zribi, Kais; Badri, Yazid; Porter, Stephanie S; Aouani, Mohammed Elarbi; Cook, Douglas R; Strauss, Sharon Y; Nuzhdin, Sergey V

2014-12-22

As our world becomes warmer, agriculture is increasingly impacted by rising soil salinity and understanding plant adaptation to salt stress can help enable effective crop breeding. Salt tolerance is a complex plant phenotype and we know little about the pathways utilized by naturally tolerant plants. Legumes are important species in agricultural and natural ecosystems, since they engage in symbiotic nitrogen-fixation, but are especially vulnerable to salinity stress. Our studies of the model legume Medicago truncatula in field and greenhouse settings demonstrate that Tunisian populations are locally adapted to saline soils at the metapopulation level and that saline origin genotypes are less impacted by salt than non-saline origin genotypes; these populations thus likely contain adaptively diverged alleles. Whole genome resequencing of 39 wild accessions reveals ongoing migration and candidate genomic regions that assort non-randomly with soil salinity. Consistent with natural selection acting at these sites, saline alleles are typically rare in the range-wide species' gene pool and are also typically derived relative to the sister species M. littoralis. Candidate regions for adaptation contain genes that regulate physiological acclimation to salt stress, such as abscisic acid and jasmonic acid signaling, including a novel salt-tolerance candidate orthologous to the uncharacterized gene AtCIPK21. Unexpectedly, these regions also contain biotic stress genes and flowering time pathway genes. We show that flowering time is differentiated between saline and non-saline populations and may allow salt stress escape. This work nominates multiple potential pathways of adaptation to naturally stressful environments in a model legume. These candidates point to the importance of both tolerance and avoidance in natural legume populations. We have uncovered several promising targets that could be used to breed for enhanced salt tolerance in crop legumes to enhance food security in an era of increasing soil salinization.
Identification of genes highly downregulated in pancreatic cancer through a meta-analysis of microarray datasets: implications for discovery of novel tumor-suppressor genes and therapeutic targets.

PubMed

Goonesekere, Nalin C W; Andersen, Wyatt; Smith, Alex; Wang, Xiaosheng

2018-02-01

The lack of specific symptoms at early tumor stages, together with a high biological aggressiveness of the tumor contribute to the high mortality rate for pancreatic cancer (PC), which has a 5-year survival rate of about 7%. Recent failures of targeted therapies inhibiting kinase activity in clinical trials have highlighted the need for new approaches towards combating this deadly disease. In this study, we have identified genes that are significantly downregulated in PC, through a meta-analysis of large number of microarray datasets. We have used qRT-PCR to confirm the downregulation of selected genes in a panel of PC cell lines. This study has yielded several novel candidate tumor-suppressor genes (TSGs) including GNMT, CEL, PLA2G1B and SERPINI2. We highlight the role of GNMT, a methyl transferase associated with the methylation potential of the cell, and CEL, a lipase, as potential therapeutic targets. We have uncovered genetic links to risk factors associated with PC such as smoking and obesity. Genes important for patient survival and prognosis are also discussed, and we confirm the dysregulation of metabolic pathways previously observed in PC. While many of the genes downregulated in our dataset are associated with protein products normally produced by the pancreas for excretion, we have uncovered some genes whose downregulation appear to play a more causal role in PC. These genes will assist in providing a better understanding of the disease etiology of PC, and in the search for new therapeutic targets and biomarkers.
Sporulation genes associated with sporulation efficiency in natural isolates of yeast.

PubMed

Tomar, Parul; Bhatia, Aatish; Ramdas, Shweta; Diao, Liyang; Bhanot, Gyan; Sinha, Himanshu

2013-01-01

Yeast sporulation efficiency is a quantitative trait and is known to vary among experimental populations and natural isolates. Some studies have uncovered the genetic basis of this variation and have identified the role of sporulation genes (IME1, RME1) and sporulation-associated genes (FKH2, PMS1, RAS2, RSF1, SWS2), as well as non-sporulation pathway genes (MKT1, TAO3) in maintaining this variation. However, these studies have been done mostly in experimental populations. Sporulation is a response to nutrient deprivation. Unlike laboratory strains, natural isolates have likely undergone multiple selections for quick adaptation to varying nutrient conditions. As a result, sporulation efficiency in natural isolates may have different genetic factors contributing to phenotypic variation. Using Saccharomyces cerevisiae strains in the genetically and environmentally diverse SGRP collection, we have identified genetic loci associated with sporulation efficiency variation in a set of sporulation and sporulation-associated genes. Using two independent methods for association mapping and correcting for population structure biases, our analysis identified two linked clusters containing 4 non-synonymous mutations in genes - HOS4, MCK1, SET3, and SPO74. Five regulatory polymorphisms in five genes such as MLS1 and CDC10 were also identified as putative candidates. Our results provide candidate genes contributing to phenotypic variation in the sporulation efficiency of natural isolates of yeast.
Sporulation Genes Associated with Sporulation Efficiency in Natural Isolates of Yeast

PubMed Central

Ramdas, Shweta; Diao, Liyang; Bhanot, Gyan; Sinha, Himanshu

2013-01-01

Yeast sporulation efficiency is a quantitative trait and is known to vary among experimental populations and natural isolates. Some studies have uncovered the genetic basis of this variation and have identified the role of sporulation genes (IME1, RME1) and sporulation-associated genes (FKH2, PMS1, RAS2, RSF1, SWS2), as well as non-sporulation pathway genes (MKT1, TAO3) in maintaining this variation. However, these studies have been done mostly in experimental populations. Sporulation is a response to nutrient deprivation. Unlike laboratory strains, natural isolates have likely undergone multiple selections for quick adaptation to varying nutrient conditions. As a result, sporulation efficiency in natural isolates may have different genetic factors contributing to phenotypic variation. Using Saccharomyces cerevisiae strains in the genetically and environmentally diverse SGRP collection, we have identified genetic loci associated with sporulation efficiency variation in a set of sporulation and sporulation-associated genes. Using two independent methods for association mapping and correcting for population structure biases, our analysis identified two linked clusters containing 4 non-synonymous mutations in genes – HOS4, MCK1, SET3, and SPO74. Five regulatory polymorphisms in five genes such as MLS1 and CDC10 were also identified as putative candidates. Our results provide candidate genes contributing to phenotypic variation in the sporulation efficiency of natural isolates of yeast. PMID:23874994
ICan: an integrated co-alteration network to identify ovarian cancer-related genes.

PubMed

Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan

2015-01-01

Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data.
ICan: An Integrated Co-Alteration Network to Identify Ovarian Cancer-Related Genes

PubMed Central

Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan

2015-01-01

Background Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. Results We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). Conclusion In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data. PMID:25803614
De novo assembly and annotation of the Antarctic copepod (Tigriopus kingsejongensis) transcriptome.

PubMed

Kim, Hui-Su; Lee, Bo-Young; Han, Jeonghoon; Lee, Young Hwan; Min, Gi-Sik; Kim, Sanghee; Lee, Jae-Seong

2016-08-01

The whole transcriptome of the Antarctic copepod (Tigriopus kingsejongensis) was sequenced using Illumina RNA-seq. De novo assembly was performed with 64,785,098 raw reads using Trinity, which assembled into 81,653 contigs. TransDecoder found 38,250 candidate coding contigs which showed homology to other species by BLAST analysis. Functional gene annotation was performed by Gene Ontology (GO), InterProScan, and KEGG pathway analyses. Finally, we identified a number of expressed gene catalog for T. kingsejongensis that is a useful model animal for gene information-based polar research to uncover molecular mechanisms of environmental adaptation on harsh environments. In particular, we observed highly developing lipid metabolism in T. kingsejongensis directly compared to those of the Far East Pacific coast copepod Tigriopus japonicus at the transcriptome level. Copyright © 2016 Elsevier B.V. All rights reserved.
Metagenomics uncovers gaps in amplicon-based detection of microbial diversity

DOE PAGES

Eloe-Fadrosh, Emiley A.; Ivanova, Natalia N.; Woyke, Tanja; ...

2016-02-01

Our view of microbial diversity has expanded greatly over the past 40 years, primarily through the wide application of PCR-based surveys of the small-subunit ribosomal RNA (SSU rRNA) gene. Yet significant gaps in knowledge remain due to well-recognized limitations of this method. Here in this paper, we systematically survey primer fidelity in SSU rRNA gene sequences recovered from over 6,000 assembled metagenomes sampled globally. Our findings show that approximately 10% of environmental microbial sequences might be missed from classical PCR-based SSU rRNA gene surveys, mostly members of the Candidate Phyla Radiation (CPR) and as yet uncharacterized Archaea. In conclusion, thesemore » results underscore the extent of uncharacterized microbial diversity and provide fruitful avenues for describing additional phylogenetic lineages.« less
Integration of genome-wide association studies with biological knowledge identifies six novel genes related to kidney function.

PubMed

Chasman, Daniel I; Fuchsberger, Christian; Pattaro, Cristian; Teumer, Alexander; Böger, Carsten A; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Taliun, Daniel; Li, Man; Gao, Xiaoyi; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C; O'Seaghdha, Conall M; Glazer, Nicole; Isaacs, Aaron; Liu, Ching-Ti; Smith, Albert V; O'Connell, Jeffrey R; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Johnson, Andrew D; Gierman, Hinco J; Feitosa, Mary F; Hwang, Shih-Jen; Atkinson, Elizabeth J; Lohman, Kurt; Cornelis, Marilyn C; Johansson, Asa; Tönjes, Anke; Dehghan, Abbas; Lambert, Jean-Charles; Holliday, Elizabeth G; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y; Murgia, Federico; Trompet, Stella; Imboden, Medea; Coassin, Stefan; Pistis, Giorgio; Harris, Tamara B; Launer, Lenore J; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D; Boerwinkle, Eric; Schmidt, Helena; Cavalieri, Margherita; Rao, Madhumathi; Hu, Frank; Demirkan, Ayse; Oostra, Ben A; de Andrade, Mariza; Turner, Stephen T; Ding, Jingzhong; Andrews, Jeanette S; Freedman, Barry I; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Meisinger, Christa; Gieger, Christian; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H; Wright, Alan F; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G; Rivadeneira, Fernando; Aulchenko, Yurii S; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Ketkar, Shamika; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K; Portas, Laura; Ford, Ian; Buckley, Brendan M; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Kim, Stuart K; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J Wouter; Probst-Hensch, Nicole M; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; Siscovick, David S; van Duijn, Cornelia M; Borecki, Ingrid B; Kardia, Sharon L R; Liu, Yongmei; Curhan, Gary C; Rudan, Igor; Gyllensten, Ulf; Wilson, James F; Franke, Andre; Pramstaller, Peter P; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M; Parsa, Afshin; Bochud, Murielle; Heid, Iris M; Kao, W H Linda; Fox, Caroline S; Köttgen, Anna

2012-12-15

In conducting genome-wide association studies (GWAS), analytical approaches leveraging biological information may further understanding of the pathophysiology of clinical traits. To discover novel associations with estimated glomerular filtration rate (eGFR), a measure of kidney function, we developed a strategy for integrating prior biological knowledge into the existing GWAS data for eGFR from the CKDGen Consortium. Our strategy focuses on single nucleotide polymorphism (SNPs) in genes that are connected by functional evidence, determined by literature mining and gene ontology (GO) hierarchies, to genes near previously validated eGFR associations. It then requires association thresholds consistent with multiple testing, and finally evaluates novel candidates by independent replication. Among the samples of European ancestry, we identified a genome-wide significant SNP in FBXL20 (P = 5.6 × 10(-9)) in meta-analysis of all available data, and additional SNPs at the INHBC, LRP2, PLEKHA1, SLC3A2 and SLC7A6 genes meeting multiple-testing corrected significance for replication and overall P-values of 4.5 × 10(-4)-2.2 × 10(-7). Neither the novel PLEKHA1 nor FBXL20 associations, both further supported by association with eGFR among African Americans and with transcript abundance, would have been implicated by eGFR candidate gene approaches. LRP2, encoding the megalin receptor, was identified through connection with the previously known eGFR gene DAB2 and extends understanding of the megalin system in kidney function. These findings highlight integration of existing genome-wide association data with independent biological knowledge to uncover novel candidate eGFR associations, including candidates lacking known connections to kidney-specific pathways. The strategy may also be applicable to other clinical phenotypes, although more testing will be needed to assess its potential for discovery in general.
Integration of genome-wide association studies with biological knowledge identifies six novel genes related to kidney function

PubMed Central

Chasman, Daniel I.; Fuchsberger, Christian; Pattaro, Cristian; Teumer, Alexander; Böger, Carsten A.; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Taliun, Daniel; Li, Man; Gao, Xiaoyi; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C.; O'Seaghdha, Conall M.; Glazer, Nicole; Isaacs, Aaron; Liu, Ching-Ti; Smith, Albert V.; O'Connell, Jeffrey R.; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Johnson, Andrew D.; Gierman, Hinco J.; Feitosa, Mary F.; Hwang, Shih-Jen; Atkinson, Elizabeth J.; Lohman, Kurt; Cornelis, Marilyn C.; Johansson, Åsa; Tönjes, Anke; Dehghan, Abbas; Lambert, Jean-Charles; Holliday, Elizabeth G.; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y.; Murgia, Federico; Trompet, Stella; Imboden, Medea; Coassin, Stefan; Pistis, Giorgio; Harris, Tamara B.; Launer, Lenore J.; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D.; Boerwinkle, Eric; Schmidt, Helena; Cavalieri, Margherita; Rao, Madhumathi; Hu, Frank; Demirkan, Ayse; Oostra, Ben A.; de Andrade, Mariza; Turner, Stephen T.; Ding, Jingzhong; Andrews, Jeanette S.; Freedman, Barry I.; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Meisinger, Christa; Gieger, Christian; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E.; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H.; Wright, Alan F.; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K.; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G.; Rivadeneira, Fernando; Aulchenko, Yurii S.; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Ketkar, Shamika; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K.; Portas, Laura; Ford, Ian; Buckley, Brendan M.; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Kim, Stuart K.; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J. Wouter; Probst-Hensch, Nicole M.; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R.; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; Siscovick, David S.; van Duijn, Cornelia M.; Borecki, Ingrid B.; Kardia, Sharon L.R.; Liu, Yongmei; Curhan, Gary C.; Rudan, Igor; Gyllensten, Ulf; Wilson, James F.; Franke, Andre; Pramstaller, Peter P.; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M; Parsa, Afshin; Bochud, Murielle; Heid, Iris M.; Kao, W.H. Linda; Fox, Caroline S.; Köttgen, Anna

2012-01-01

In conducting genome-wide association studies (GWAS), analytical approaches leveraging biological information may further understanding of the pathophysiology of clinical traits. To discover novel associations with estimated glomerular filtration rate (eGFR), a measure of kidney function, we developed a strategy for integrating prior biological knowledge into the existing GWAS data for eGFR from the CKDGen Consortium. Our strategy focuses on single nucleotide polymorphism (SNPs) in genes that are connected by functional evidence, determined by literature mining and gene ontology (GO) hierarchies, to genes near previously validated eGFR associations. It then requires association thresholds consistent with multiple testing, and finally evaluates novel candidates by independent replication. Among the samples of European ancestry, we identified a genome-wide significant SNP in FBXL20 (P = 5.6 × 10−9) in meta-analysis of all available data, and additional SNPs at the INHBC, LRP2, PLEKHA1, SLC3A2 and SLC7A6 genes meeting multiple-testing corrected significance for replication and overall P-values of 4.5 × 10−4–2.2 × 10−7. Neither the novel PLEKHA1 nor FBXL20 associations, both further supported by association with eGFR among African Americans and with transcript abundance, would have been implicated by eGFR candidate gene approaches. LRP2, encoding the megalin receptor, was identified through connection with the previously known eGFR gene DAB2 and extends understanding of the megalin system in kidney function. These findings highlight integration of existing genome-wide association data with independent biological knowledge to uncover novel candidate eGFR associations, including candidates lacking known connections to kidney-specific pathways. The strategy may also be applicable to other clinical phenotypes, although more testing will be needed to assess its potential for discovery in general. PMID:22962313
A Drosophila mutant of LETM1, a candidate gene for seizures in Wolf-Hirschhorn syndrome.

PubMed

McQuibban, Angus G; Joza, Nicholas; Megighian, Aram; Scorzeto, Michele; Zanini, Damiano; Reipert, Siegfried; Richter, Constance; Schweyen, Rudolf J; Nowikovsky, Karin

2010-03-15

Human Wolf-Hirschhorn syndrome (WHS) is a multigenic disorder resulting from a hemizygous deletion on chromosome 4. LETM1 is the best candidate gene for seizures, the strongest haploinsufficiency phenotype of WHS patients. Here, we identify the Drosophila gene CG4589 as the ortholog of LETM1 and name the gene DmLETM1. Using RNA interference approaches in both Drosophila melanogaster cultured cells and the adult fly, we have assayed the effects of down-regulating the LETM1 gene on mitochondrial function. We also show that DmLETM1 complements growth and mitochondrial K(+)/H(+) exchange (KHE) activity in yeast deficient for LETM1. Genetic studies allowing the conditional inactivation of LETM1 function in specific tissues demonstrate that the depletion of DmLETM1 results in roughening of the adult eye, mitochondrial swelling and developmental lethality in third-instar larvae, possibly the result of deregulated mitophagy. Neuronal specific down-regulation of DmLETM1 results in impairment of locomotor behavior in the fly and reduced synaptic neurotransmitter release. Taken together our results demonstrate the function of DmLETM1 as a mitochondrial osmoregulator through its KHE activity and uncover a pathophysiological WHS phenotype in the model organism D. melanogaster.
Back to the sea twice: identifying candidate plant genes for molecular evolution to marine life.

PubMed

Wissler, Lothar; Codoñer, Francisco M; Gu, Jenny; Reusch, Thorsten B H; Olsen, Jeanine L; Procaccini, Gabriele; Bornberg-Bauer, Erich

2011-01-12

Seagrasses are a polyphyletic group of monocotyledonous angiosperms that have adapted to a completely submerged lifestyle in marine waters. Here, we exploit two collections of expressed sequence tags (ESTs) of two wide-spread and ecologically important seagrass species, the Mediterranean seagrass Posidonia oceanica (L.) Delile and the eelgrass Zostera marina L., which have independently evolved from aquatic ancestors. This replicated, yet independent evolutionary history facilitates the identification of traits that may have evolved in parallel and are possible instrumental candidates for adaptation to a marine habitat. In our study, we provide the first quantitative perspective on molecular adaptations in two seagrass species. By constructing orthologous gene clusters shared between two seagrasses (Z. marina and P. oceanica) and eight distantly related terrestrial angiosperm species, 51 genes could be identified with detection of positive selection along the seagrass branches of the phylogenetic tree. Characterization of these positively selected genes using KEGG pathways and the Gene Ontology uncovered that these genes are mostly involved in translation, metabolism, and photosynthesis. These results provide first insights into which seagrass genes have diverged from their terrestrial counterparts via an initial aquatic stage characteristic of the order and to the derived fully-marine stage characteristic of seagrasses. We discuss how adaptive changes in these processes may have contributed to the evolution towards an aquatic and marine existence.
Network-Based Identification and Prioritization of Key Regulators of Coronary Artery Disease Loci

PubMed Central

Zhao, Yuqi; Chen, Jing; Freudenberg, Johannes M.; Meng, Qingying; Rajpal, Deepak K.; Yang, Xia

2017-01-01

Objective Recent genome-wide association studies of coronary artery disease (CAD) have revealed 58 genome-wide significant and 148 suggestive genetic loci. However, the molecular mechanisms through which they contribute to CAD and the clinical implications of these findings remain largely unknown. We aim to retrieve gene subnetworks of the 206 CAD loci and identify and prioritize candidate regulators to better understand the biological mechanisms underlying the genetic associations. Approach and Results We devised a new integrative genomics approach that incorporated (1) candidate genes from the top CAD loci, (2) the complete genetic association results from the 1000 genomes-based CAD genome-wide association studies from the Coronary Artery Disease Genome Wide Replication and Meta-Analysis Plus the Coronary Artery Disease consortium, (3) tissue-specific gene regulatory networks that depict the potential relationship and interactions between genes, and (4) tissue-specific gene expression patterns between CAD patients and controls. The networks and top-ranked regulators according to these data-driven criteria were further queried against literature, experimental evidence, and drug information to evaluate their disease relevance and potential as drug targets. Our analysis uncovered several potential novel regulators of CAD such as LUM and STAT3, which possess properties suitable as drug targets. We also revealed molecular relations and potential mechanisms through which the top CAD loci operate. Furthermore, we found that multiple CAD-relevant biological processes such as extracellular matrix, inflammatory and immune pathways, complement and coagulation cascades, and lipid metabolism interact in the CAD networks. Conclusions Our data-driven integrative genomics framework unraveled tissue-specific relations among the candidate genes of the CAD genome-wide association studies loci and prioritized novel network regulatory genes orchestrating biological processes relevant to CAD. PMID:26966275
Mapping by sequencing in cotton (Gossypium hirsutum) line MD52ne identified candidate genes for fiber strength and its related quality attributes.

PubMed

Islam, Md S; Zeng, Linghe; Thyssen, Gregory N; Delhom, Christopher D; Kim, Hee Jin; Li, Ping; Fang, David D

2016-06-01

Three QTL regions controlling three fiber quality traits were validated and further fine-mapped with 27 new single nucleotide polymorphism (SNP) markers. Transcriptome analysis suggests that receptor-like kinases found within the validated QTLs are potential candidate genes responsible for superior fiber strength in cotton line MD52ne. Fiber strength, length, maturity and fineness determine the market value of cotton fibers and the quality of spun yarn. Cotton fiber strength has been recognized as a critical quality attribute in the modern textile industry. Fine mapping along with quantitative trait loci (QTL) validation and candidate gene prediction can uncover the genetic and molecular basis of fiber quality traits. Four previously-identified QTLs (qFBS-c3, qSFI-c14, qUHML-c14 and qUHML-c24) related to fiber bundle strength, short fiber index and fiber length, respectively, were validated using an F3 population that originated from a cross of MD90ne × MD52ne. A group of 27 new SNP markers generated from mapping-by-sequencing (MBS) were placed in QTL regions to improve and validate earlier maps. Our refined QTL regions spanned 4.4, 1.8 and 3.7 Mb of physical distance in the Gossypium raimondii reference genome. We performed RNA sequencing (RNA-seq) of 15 and 20 days post-anthesis fiber cells from MD52ne and MD90ne and aligned reads to the G. raimondii genome. The QTL regions contained 21 significantly differentially expressed genes (DEGs) between the two near-isogenic parental lines. SNPs that result in non-synonymous substitutions to amino acid sequences of annotated genes were identified within these DEGs, and mapped. Taken together, transcriptome and amino acid mutation analysis indicate that receptor-like kinase pathway genes are likely candidates for superior fiber strength and length in MD52ne. MBS along with RNA-seq demonstrated a powerful strategy to elucidate candidate genes for the QTLs that control complex traits in a complex genome like tetraploid upland cotton.
Array comparative genomic hybridization and computational genome annotation in constitutional cytogenetics: suggesting candidate genes for novel submicroscopic chromosomal imbalance syndromes.

PubMed

Van Vooren, Steven; Coessens, Bert; De Moor, Bart; Moreau, Yves; Vermeesch, Joris R

2007-09-01

Genome-wide array comparative genomic hybridization screening is uncovering pathogenic submicroscopic chromosomal imbalances in patients with developmental disorders. In those patients, imbalances appear now to be scattered across the whole genome, and most patients carry different chromosomal anomalies. Screening patients with developmental disorders can be considered a forward functional genome screen. The imbalances pinpoint the location of genes that are involved in human development. Because most imbalances encompass regions harboring multiple genes, the challenge is to (1) identify those genes responsible for the specific phenotype and (2) disentangle the role of the different genes located in an imbalanced region. In this review, we discuss novel tools and relevant databases that have recently been developed to aid this gene discovery process. Identification of the functional relevance of genes will not only deepen our understanding of human development but will, in addition, aid in the data interpretation and improve genetic counseling.
Agave tequilana MADS genes show novel expression patterns in meristems, developing bulbils and floral organs.

PubMed

Delgado Sandoval, Silvia del Carmen; Abraham Juárez, María Jazmín; Simpson, June

2012-03-01

Agave tequilana is a monocarpic perennial species that flowers after 5-8 years of vegetative growth signaling the end of the plant's life cycle. When fertilization is unsuccessful, vegetative bulbils are induced on the umbels of the inflorescence near the bracteoles from newly formed meristems. Although the regulation of inflorescence and flower development has been described in detail for monocarpic annuals and polycarpic species, little is known at the molecular level for these processes in monocarpic perennials, and few studies have been carried out on bulbils. Histological samples revealed the early induction of umbel meristems soon after the initiation of the vegetative to inflorescence transition in A. tequilana. To identify candidate genes involved in the regulation of floral induction, a search for MADS-box transcription factor ESTs was conducted using an A. tequilana transcriptome database. Seven different MIKC MADS genes classified into 6 different types were identified based on previously characterized A. thaliana and O. sativa MADS genes and sequences from non-grass monocotyledons. Quantitative real-time PCR analysis of the seven candidate MADS genes in vegetative, inflorescence, bulbil and floral tissues uncovered novel patterns of expression for some of the genes in comparison with orthologous genes characterized in other species. In situ hybridization studies using two different genes showed expression in specific tissues of vegetative meristems and floral buds. Distinct MADS gene regulatory patterns in A. tequilana may be related to the specific reproductive strategies employed by this species.
Heat Shock Protein Beta-1 Modifies Anterior to Posterior Purkinje Cell Vulnerability in a Mouse Model of Niemann-Pick Type C Disease.

PubMed

Chung, Chan; Elrick, Matthew J; Dell'Orco, James M; Qin, Zhaohui S; Kalyana-Sundaram, Shanker; Chinnaiyan, Arul M; Shakkottai, Vikram G; Lieberman, Andrew P

2016-05-01

Selective neuronal vulnerability is characteristic of most degenerative disorders of the CNS, yet mechanisms underlying this phenomenon remain poorly characterized. Many forms of cerebellar degeneration exhibit an anterior-to-posterior gradient of Purkinje cell loss including Niemann-Pick type C1 (NPC) disease, a lysosomal storage disorder characterized by progressive neurological deficits that often begin in childhood. Here, we sought to identify candidate genes underlying vulnerability of Purkinje cells in anterior cerebellar lobules using data freely available in the Allen Brain Atlas. This approach led to the identification of 16 candidate neuroprotective or susceptibility genes. We demonstrate that one candidate gene, heat shock protein beta-1 (HSPB1), promoted neuronal survival in cellular models of NPC disease through a mechanism that involved inhibition of apoptosis. Additionally, we show that over-expression of wild type HSPB1 or a phosphomimetic mutant in NPC mice slowed the progression of motor impairment and diminished cerebellar Purkinje cell loss. We confirmed the modulatory effect of Hspb1 on Purkinje cell degeneration in vivo, as knockdown by Hspb1 shRNA significantly enhanced neuron loss. These results suggest that strategies to promote HSPB1 activity may slow the rate of cerebellar degeneration in NPC disease and highlight the use of bioinformatics tools to uncover pathways leading to neuronal protection in neurodegenerative disorders.
Uncovering disease mechanisms through network biology in the era of Next Generation Sequencing

NASA Astrophysics Data System (ADS)

Piñero, Janet; Berenstein, Ariel; Gonzalez-Perez, Abel; Chernomoretz, Ariel; Furlong, Laura I.

2016-04-01

Characterizing the behavior of disease genes in the context of biological networks has the potential to shed light on disease mechanisms, and to reveal both new candidate disease genes and therapeutic targets. Previous studies addressing the network properties of disease genes have produced contradictory results. Here we have explored the causes of these discrepancies and assessed the relationship between the network roles of disease genes and their tolerance to deleterious germline variants in human populations leveraging on: the abundance of interactome resources, a comprehensive catalog of disease genes and exome variation data. We found that the most salient network features of disease genes are driven by cancer genes and that genes related to different types of diseases play network roles whose centrality is inversely correlated to their tolerance to likely deleterious germline mutations. This proved to be a multiscale signature, including global, mesoscopic and local network centrality features. Cancer driver genes, the most sensitive to deleterious variants, occupy the most central positions, followed by dominant disease genes and then by recessive disease genes, which are tolerant to variants and isolated within their network modules.
Uncovering disease mechanisms through network biology in the era of Next Generation Sequencing

PubMed Central

Piñero, Janet; Berenstein, Ariel; Gonzalez-Perez, Abel; Chernomoretz, Ariel; Furlong, Laura I.

2016-01-01

Characterizing the behavior of disease genes in the context of biological networks has the potential to shed light on disease mechanisms, and to reveal both new candidate disease genes and therapeutic targets. Previous studies addressing the network properties of disease genes have produced contradictory results. Here we have explored the causes of these discrepancies and assessed the relationship between the network roles of disease genes and their tolerance to deleterious germline variants in human populations leveraging on: the abundance of interactome resources, a comprehensive catalog of disease genes and exome variation data. We found that the most salient network features of disease genes are driven by cancer genes and that genes related to different types of diseases play network roles whose centrality is inversely correlated to their tolerance to likely deleterious germline mutations. This proved to be a multiscale signature, including global, mesoscopic and local network centrality features. Cancer driver genes, the most sensitive to deleterious variants, occupy the most central positions, followed by dominant disease genes and then by recessive disease genes, which are tolerant to variants and isolated within their network modules. PMID:27080396
Somatic mutations in the transcriptional corepressor gene BCORL1 in adult acute myelogenous leukemia

PubMed Central

Li, Meng; Collins, Roxane; Jiao, Yuchen; Ouillette, Peter; Bixby, Dale; Erba, Harry; Vogelstein, Bert; Kinzler, Kenneth W.

2011-01-01

To further our understanding of the genetic basis of acute myelogenous leukemia (AML), we determined the coding exon sequences of ∼ 18 000 protein-encoding genes in 8 patients with secondary AML. Here we report the discovery of novel somatic mutations in the transcriptional corepressor gene BCORL1 that is located on the X-chromosome. Analysis of BCORL1 in an unselected cohort of 173 AML patients identified a total of 10 mutated cases (6%) with BCORL1 mutations, whereas analysis of 19 AML cell lines uncovered 4 (21%) BCORL1 mutated cell lines. The majority (87%) of the mutations in BCORL1 were predicted to inactivate the gene product as a result of nonsense mutations, splice site mutation, or out-of-frame insertions or deletions. These results indicate that BCORL1 by genetic criteria is a novel candidate tumor suppressor gene, joining the growing list of genes recurrently mutated in AML. PMID:21989985

Thirty years of Alzheimer's disease genetics: the implications of systematic meta-analyses.

PubMed

Bertram, Lars; Tanzi, Rudolph E

2008-10-01

The genetic underpinnings of Alzheimer's disease (AD) remain largely elusive despite early successes in identifying three genes that cause early-onset familial AD (those that encode amyloid precursor protein (APP) and the presenilins (PSEN1 and PSEN2)), and one genetic risk factor for late-onset AD (the gene that encodes apolipoprotein E (APOE)). A large number of studies that aimed to help uncover the remaining disease-related loci have been published in recent decades, collectively proposing or refuting the involvement of over 500 different gene candidates. Systematic meta-analyses of these studies currently highlight more than 20 loci that have modest but significant effects on AD risk. This Review discusses the putative pathogenetic roles and common biochemical pathways of some of the most genetically and biologically compelling of these potential AD risk factors.
A gene regulatory network model for floral transition of the shoot apex in maize and its dynamic modeling.

PubMed

Dong, Zhanshan; Danilevskaya, Olga; Abadie, Tabare; Messina, Carlos; Coles, Nathan; Cooper, Mark

2012-01-01

The transition from the vegetative to reproductive development is a critical event in the plant life cycle. The accurate prediction of flowering time in elite germplasm is important for decisions in maize breeding programs and best agronomic practices. The understanding of the genetic control of flowering time in maize has significantly advanced in the past decade. Through comparative genomics, mutant analysis, genetic analysis and QTL cloning, and transgenic approaches, more than 30 flowering time candidate genes in maize have been revealed and the relationships among these genes have been partially uncovered. Based on the knowledge of the flowering time candidate genes, a conceptual gene regulatory network model for the genetic control of flowering time in maize is proposed. To demonstrate the potential of the proposed gene regulatory network model, a first attempt was made to develop a dynamic gene network model to predict flowering time of maize genotypes varying for specific genes. The dynamic gene network model is composed of four genes and was built on the basis of gene expression dynamics of the two late flowering id1 and dlf1 mutants, the early flowering landrace Gaspe Flint and the temperate inbred B73. The model was evaluated against the phenotypic data of the id1 dlf1 double mutant and the ZMM4 overexpressed transgenic lines. The model provides a working example that leverages knowledge from model organisms for the utilization of maize genomic information to predict a whole plant trait phenotype, flowering time, of maize genotypes.
Structured association analysis leads to insight into Saccharomyces cerevisiae gene regulation by finding multiple contributing eQTL hotspots associated with functional gene modules.

PubMed

Curtis, Ross E; Kim, Seyoung; Woolford, John L; Xu, Wenjie; Xing, Eric P

2013-03-21

Association analysis using genome-wide expression quantitative trait locus (eQTL) data investigates the effect that genetic variation has on cellular pathways and leads to the discovery of candidate regulators. Traditional analysis of eQTL data via pairwise statistical significance tests or linear regression does not leverage the availability of the structural information of the transcriptome, such as presence of gene networks that reveal correlation and potentially regulatory relationships among the study genes. We employ a new eQTL mapping algorithm, GFlasso, which we have previously developed for sparse structured regression, to reanalyze a genome-wide yeast dataset. GFlasso fully takes into account the dependencies among expression traits to suppress false positives and to enhance the signal/noise ratio. Thus, GFlasso leverages the gene-interaction network to discover the pleiotropic effects of genetic loci that perturb the expression level of multiple (rather than individual) genes, which enables us to gain more power in detecting previously neglected signals that are marginally weak but pleiotropically significant. While eQTL hotspots in yeast have been reported previously as genomic regions controlling multiple genes, our analysis reveals additional novel eQTL hotspots and, more interestingly, uncovers groups of multiple contributing eQTL hotspots that affect the expression level of functional gene modules. To our knowledge, our study is the first to report this type of gene regulation stemming from multiple eQTL hotspots. Additionally, we report the results from in-depth bioinformatics analysis for three groups of these eQTL hotspots: ribosome biogenesis, telomere silencing, and retrotransposon biology. We suggest candidate regulators for the functional gene modules that map to each group of hotspots. Not only do we find that many of these candidate regulators contain mutations in the promoter and coding regions of the genes, in the case of the Ribi group, we provide experimental evidence suggesting that the identified candidates do regulate the target genes predicted by GFlasso. Thus, this structured association analysis of a yeast eQTL dataset via GFlasso, coupled with extensive bioinformatics analysis, discovers a novel regulation pattern between multiple eQTL hotspots and functional gene modules. Furthermore, this analysis demonstrates the potential of GFlasso as a powerful computational tool for eQTL studies that exploit the rich structural information among expression traits due to correlation, regulation, or other forms of biological dependencies.
CRISPR/Cas9: An inexpensive, efficient loss of function tool to screen human disease genes in Xenopus.

PubMed

Bhattacharya, Dipankan; Marfo, Chris A; Li, Davis; Lane, Maura; Khokha, Mustafa K

2015-12-15

Congenital malformations are the major cause of infant mortality in the US and Europe. Due to rapid advances in human genomics, we can now efficiently identify sequence variants that may cause disease in these patients. However, establishing disease causality remains a challenge. Additionally, in the case of congenital heart disease, many of the identified candidate genes are either novel to embryonic development or have no known function. Therefore, there is a pressing need to develop inexpensive and efficient technologies to screen these candidate genes for disease phenocopy in model systems and to perform functional studies to uncover their role in development. For this purpose, we sought to test F0 CRISPR based gene editing as a loss of function strategy for disease phenocopy in the frog model organism, Xenopus tropicalis. We demonstrate that the CRISPR/Cas9 system can efficiently modify both alleles in the F0 generation within a few hours post fertilization, recapitulating even early disease phenotypes that are highly similar to knockdowns from morpholino oligos (MOs) in nearly all cases tested. We find that injecting Cas9 protein is dramatically more efficacious and less toxic than cas9 mRNA. We conclude that CRISPR based F0 gene modification in X. tropicalis is efficient and cost effective and readily recapitulates disease and MO phenotypes. Copyright © 2015 Elsevier Inc. All rights reserved.
Genetics of Psychosis in Alzheimer Disease.

PubMed

DeMichele-Sweet, Mary Ann A; Sweet, Robert A

2014-03-01

Psychosis occurs in approximately half of patients with Alzheimer disease (AD with psychosis, AD+P). AD+P patients have more rapid cognitive decline, greater behavioral symptoms, and higher mortality than do AD patients without psychosis. Studies in three independent cohorts have shown that psychosis in AD aggregates in families, with estimated heritability of 29.5 - 60.8%. These findings have motivated studies to investigate and uncover the genes responsible for the development of psychosis, with the ultimate goal of identifying potential biologic mechanisms that may serve as leads to specific therapies. Linkage analyses have implicated loci on chromosomes 2, 6, 7, 8, 15, and 21 with AD+P. Association studies of APOE do not support it as a risk gene for psychosis in AD. No other candidate genes, such as neurodegenerative and monoamine genes, show conclusive evidence of association with AD+P. However, a recent genome-side association study has produced some promising leads, including among them genes that have been associated with schizophrenia. This review summarizes the current knowledge of the genetic basis of AD+P.
Validation of reference genes aiming accurate normalization of qRT-PCR data in Dendrocalamus latiflorus Munro.

PubMed

Liu, Mingying; Jiang, Jing; Han, Xiaojiao; Qiao, Guirong; Zhuo, Renying

2014-01-01

Dendrocalamus latiflorus Munro distributes widely in subtropical areas and plays vital roles as valuable natural resources. The transcriptome sequencing for D. latiflorus Munro has been performed and numerous genes especially those predicted to be unique to D. latiflorus Munro were revealed. qRT-PCR has become a feasible approach to uncover gene expression profiling, and the accuracy and reliability of the results obtained depends upon the proper selection of stable reference genes for accurate normalization. Therefore, a set of suitable internal controls should be validated for D. latiflorus Munro. In this report, twelve candidate reference genes were selected and the assessment of gene expression stability was performed in ten tissue samples and four leaf samples from seedlings and anther-regenerated plants of different ploidy. The PCR amplification efficiency was estimated, and the candidate genes were ranked according to their expression stability using three software packages: geNorm, NormFinder and Bestkeeper. GAPDH and EF1α were characterized to be the most stable genes among different tissues or in all the sample pools, while CYP showed low expression stability. RPL3 had the optimal performance among four leaf samples. The application of verified reference genes was illustrated by analyzing ferritin and laccase expression profiles among different experimental sets. The analysis revealed the biological variation in ferritin and laccase transcript expression among the tissues studied and the individual plants. geNorm, NormFinder, and BestKeeper analyses recommended different suitable reference gene(s) for normalization according to the experimental sets. GAPDH and EF1α had the highest expression stability across different tissues and RPL3 for the other sample set. This study emphasizes the importance of validating superior reference genes for qRT-PCR analysis to accurately normalize gene expression of D. latiflorus Munro.
Characterization of candidate genes in inflammatory bowel disease–associated risk loci

PubMed Central

Peloquin, Joanna M.; Sartor, R. Balfour; Newberry, Rodney D.; McGovern, Dermot P.; Yajnik, Vijay; Lira, Sergio A.

2016-01-01

GWAS have linked SNPs to risk of inflammatory bowel disease (IBD), but a systematic characterization of disease-associated genes has been lacking. Prior studies utilized microarrays that did not capture many genes encoded within risk loci or defined expression quantitative trait loci (eQTLs) using peripheral blood, which is not the target tissue in IBD. To address these gaps, we sought to characterize the expression of IBD-associated risk genes in disease-relevant tissues and in the setting of active IBD. Terminal ileal (TI) and colonic mucosal tissues were obtained from patients with Crohn’s disease or ulcerative colitis and from healthy controls. We developed a NanoString code set to profile 678 genes within IBD risk loci. A subset of patients and controls were genotyped for IBD-associated risk SNPs. Analyses included differential expression and variance analysis, weighted gene coexpression network analysis, and eQTL analysis. We identified 116 genes that discriminate between healthy TI and colon samples and uncovered patterns in variance of gene expression that highlight heterogeneity of disease. We identified 107 coexpressed gene pairs for which transcriptional regulation is either conserved or reversed in an inflammation-independent or -dependent manner. We demonstrate that on average approximately 60% of disease-associated genes are differentially expressed in inflamed tissue. Last, we identified eQTLs with either genotype-only effects on expression or an interaction effect between genotype and inflammation. Our data reinforce tissue specificity of expression in disease-associated candidate genes, highlight genes and gene pairs that are regulated in disease-relevant tissue and inflammation, and provide a foundation to advance the understanding of IBD pathogenesis. PMID:27668286
Soil environment is a key driver of adaptation in Medicago truncatula: new insights from landscape genomics.

PubMed

Guerrero, Jimena; Andrello, Marco; Burgarella, Concetta; Manel, Stephanie

2018-07-01

Spatial differences in environmental selective pressures interact with the genomes of organisms, ultimately leading to local adaptation. Landscape genomics is an emergent research area that uncovers genome-environment associations, thus allowing researchers to identify candidate loci for adaptation to specific environmental variables. In the present study, we used latent factor mixed models (LFMMs) and Moran spectral outlier detection/randomization (MSOD-MSR) to identify candidate loci for adaptation to 10 environmental variables (climatic, soil and atmospheric) among 43 515 single nucleotide polymorphisms (SNPs) from 202 accessions of the model legume Medicago truncatula. Soil variables were associated with a large number of candidate loci identified through both LFMMs and MSOD-MSR. Genes tagged by candidate loci associated with drought and salinity are involved in the response to biotic and abiotic stresses, while those tagged by candidates associated with soil nitrogen and atmospheric nitrogen, participate in the legume-rhizobia symbiosis. Candidate SNPs identified through both LFMMs and MSOD-MSR explained up to 56% of variance in flowering traits. Our findings highlight the importance of soil in driving adaptation in the system and elucidate the basis of evolutionary potential of M. truncatula to respond to global climate change and anthropogenic disruption of the nitrogen cycle. © 2018 The Authors New Phytologist © 2018 New Phytologist Trust.
Bridging the Gap between Genes and Language Deficits in Schizophrenia: An Oscillopathic Approach

PubMed Central

Murphy, Elliot; Benítez-Burraco, Antonio

2016-01-01

Schizophrenia is characterized by marked language deficits, but it is not clear how these deficits arise from the alteration of genes related to the disease. The goal of this paper is to aid the bridging of the gap between genes and schizophrenia and, ultimately, give support to the view that the abnormal presentation of language in this condition is heavily rooted in the evolutionary processes that brought about modern language. To that end we will focus on how the schizophrenic brain processes language and, particularly, on its distinctive oscillatory profile during language processing. Additionally, we will show that candidate genes for schizophrenia are overrepresented among the set of genes that are believed to be important for the evolution of the human faculty of language. These genes crucially include (and are related to) genes involved in brain rhythmicity. We will claim that this translational effort and the links we uncover may help develop an understanding of language evolution, along with the etiology of schizophrenia, its clinical/linguistic profile, and its high prevalence among modern populations. PMID:27601987
Informed walks: whispering hints to gene hunters inside networks' jungle.

PubMed

Bourdakou, Marilena M; Spyrou, George M

2017-10-11

Systemic approaches offer a different point of view on the analysis of several types of molecular associations as well as on the identification of specific gene communities in several cancer types. However, due to lack of sufficient data needed to construct networks based on experimental evidence, statistical gene co-expression networks are widely used instead. Many efforts have been made to exploit the information hidden in these networks. However, these approaches still need to capitalize comprehensively the prior knowledge encrypted into molecular pathway associations and improve their efficiency regarding the discovery of both exclusive subnetworks as candidate biomarkers and conserved subnetworks that may uncover common origins of several cancer types. In this study we present the development of the Informed Walks model based on random walks that incorporate information from molecular pathways to mine candidate genes and gene-gene links. The proposed model has been applied to TCGA (The Cancer Genome Atlas) datasets from seven different cancer types, exploring the reconstructed co-expression networks of the whole set of genes and driving to highlighted sub-networks for each cancer type. In the sequel, we elucidated the impact of each subnetwork on the indication of underlying exclusive and common molecular mechanisms as well as on the short-listing of drugs that have the potential to suppress the corresponding cancer type through a drug-repurposing pipeline. We have developed a method of gene subnetwork highlighting based on prior knowledge, capable to give fruitful insights regarding the underlying molecular mechanisms and valuable input to drug-repurposing pipelines for a variety of cancer types.
Identification of transcriptional regulators in the mouse immune system

PubMed Central

Jojic, Vladimir; Shay, Tal; Sylvia, Katelyn; Zuk, Or; Sun, Xin; Kang, Joonsoo; Regev, Aviv; Koller, Daphne

2013-01-01

The differentiation of hematopoietic stem cells into immune cells has been extensively studied in mammals, but the transcriptional circuitry controlling it is still only partially understood. Here, the Immunological Genome Project gene expression profiles across mouse immune lineages allowed us to systematically analyze these circuits. Using a computational algorithm called Ontogenet, we uncovered differentiation-stage specific regulators of mouse hematopoiesis, identifying many known hematopoietic regulators, and 175 new candidate regulators, their target genes, and the cell types in which they act. Among the novel regulators, we highlight the role of ETV5 in γδT cells differntiation. Since the transcriptional program of human and mouse cells is highly conserved1, it is likely that many lessons learned from the mouse model apply to humans. PMID:23624555
Comparative genomics of Beauveria bassiana: uncovering signatures of virulence against mosquitoes.

PubMed

Valero-Jiménez, Claudio A; Faino, Luigi; Spring In't Veld, Daphne; Smit, Sandra; Zwaan, Bas J; van Kan, Jan A L

2016-12-01

Entomopathogenic fungi such as Beauveria bassiana are promising biological agents for control of malaria mosquitoes. Indeed, infection with B. bassiana reduces the lifespan of mosquitoes in the laboratory and in the field. Natural isolates of B. bassiana show up to 10-fold differences in virulence between the most and the least virulent isolate. In this study, we sequenced the genomes of five isolates representing the extremes of low/high virulence and three RNA libraries, and applied a genome comparison approach to uncover genetic mechanisms underpinning virulence. A high-quality, near-complete genome assembly was achieved for the highly virulent isolate Bb8028, which was compared to the assemblies of the four other isolates. Whole genome analysis showed a high level of genetic diversity between the five isolates (2.85-16.8 SNPs/kb), which grouped into two distinct phylogenetic clusters. Mating type gene analysis revealed the presence of either the MAT1-1-1 or the MAT1-2-1 gene. Moreover, a putative new MAT gene (MAT1-2-8) was detected in the MAT1-2 locus. Comparative genome analysis revealed that Bb8028 contains 163 genes exclusive for this isolate. These unique genes have a tendency to cluster in the genome and to be often located near the telomeres. Among the genes unique to Bb8028 are a Non-Ribosomal Peptide Synthetase (NRPS) secondary metabolite gene cluster, a polyketide synthase (PKS) gene, and five genes with homology to bacterial toxins. A survey of candidate virulence genes for B. bassiana is presented. Our results indicate several genes and molecular processes that may underpin virulence towards mosquitoes. Thus, the genome sequences of five isolates of B. bassiana provide a better understanding of the natural variation in virulence and will offer a major resource for future research on this important biological control agent.
Ciliary dyslexia candidate genes DYX1C1 and DCDC2 are regulated by Regulatory Factor X (RFX) transcription factors through X-box promoter motifs

PubMed Central

Tammimies, Kristiina; Bieder, Andrea; Lauter, Gilbert; Sugiaman-Trapman, Debora; Torchet, Rachel; Hokkanen, Marie-Estelle; Burghoorn, Jan; Castrén, Eero; Kere, Juha; Tapia-Páez, Isabel; Swoboda, Peter

2016-01-01

DYX1C1, DCDC2, and KIAA0319 are three of the most replicated dyslexia candidate genes (DCGs). Recently, these DCGs were implicated in functions at the cilium. Here, we investigate the regulation of these DCGs by Regulatory Factor X transcription factors (RFX TFs), a gene family known for transcriptionally regulating ciliary genes. We identify conserved X-box motifs in the promoter regions of DYX1C1, DCDC2, and KIAA0319 and demonstrate their functionality, as well as the ability to recruit RFX TFs using reporter gene and electrophoretic mobility shift assays. Furthermore, we uncover a complex regulation pattern between RFX1, RFX2, and RFX3 and their significant effect on modifying the endogenous expression of DYX1C1 and DCDC2 in a human retinal pigmented epithelial cell line immortalized with hTERT (hTERT-RPE1). In addition, induction of ciliogenesis increases the expression of RFX TFs and DCGs. At the protein level, we show that endogenous DYX1C1 localizes to the base of the cilium, whereas DCDC2 localizes along the entire axoneme of the cilium, thereby validating earlier localization studies using overexpression models. Our results corroborate the emerging role of DCGs in ciliary function and characterize functional noncoding elements, X-box promoter motifs, in DCG promoter regions, which thus can be targeted for mutation screening in dyslexia and ciliopathies associated with these genes.—Tammimies, K., Bieder, A., Lauter, G., Sugiaman-Trapman, D., Torchet, R., Hokkanen, M.-E., Burghoorn, J., Castrén, E., Kere, J., Tapia-Páez, I., Swoboda, P. Ciliary dyslexia candidate genes DYX1C1 and DCDC2 are regulated by Regulatory Factor (RF) X transcription factors through X-box promoter motifs. PMID:27451412
Ciliary dyslexia candidate genes DYX1C1 and DCDC2 are regulated by Regulatory Factor X (RFX) transcription factors through X-box promoter motifs.

PubMed

Tammimies, Kristiina; Bieder, Andrea; Lauter, Gilbert; Sugiaman-Trapman, Debora; Torchet, Rachel; Hokkanen, Marie-Estelle; Burghoorn, Jan; Castrén, Eero; Kere, Juha; Tapia-Páez, Isabel; Swoboda, Peter

2016-10-01

DYX1C1, DCDC2, and KIAA0319 are three of the most replicated dyslexia candidate genes (DCGs). Recently, these DCGs were implicated in functions at the cilium. Here, we investigate the regulation of these DCGs by Regulatory Factor X transcription factors (RFX TFs), a gene family known for transcriptionally regulating ciliary genes. We identify conserved X-box motifs in the promoter regions of DYX1C1, DCDC2, and KIAA0319 and demonstrate their functionality, as well as the ability to recruit RFX TFs using reporter gene and electrophoretic mobility shift assays. Furthermore, we uncover a complex regulation pattern between RFX1, RFX2, and RFX3 and their significant effect on modifying the endogenous expression of DYX1C1 and DCDC2 in a human retinal pigmented epithelial cell line immortalized with hTERT (hTERT-RPE1). In addition, induction of ciliogenesis increases the expression of RFX TFs and DCGs. At the protein level, we show that endogenous DYX1C1 localizes to the base of the cilium, whereas DCDC2 localizes along the entire axoneme of the cilium, thereby validating earlier localization studies using overexpression models. Our results corroborate the emerging role of DCGs in ciliary function and characterize functional noncoding elements, X-box promoter motifs, in DCG promoter regions, which thus can be targeted for mutation screening in dyslexia and ciliopathies associated with these genes.-Tammimies, K., Bieder, A., Lauter, G., Sugiaman-Trapman, D., Torchet, R., Hokkanen, M.-E., Burghoorn, J., Castrén, E., Kere, J., Tapia-Páez, I., Swoboda, P. Ciliary dyslexia candidate genes DYX1C1 and DCDC2 are regulated by Regulatory Factor (RF) X transcription factors through X-box promoter motifs. © The Author(s).
Evaluation of RNA extraction methods and identification of putative reference genes for real-time quantitative polymerase chain reaction expression studies on olive (Olea europaea L.) fruits.

PubMed

Nonis, Alberto; Vezzaro, Alice; Ruperti, Benedetto

2012-07-11

Genome wide transcriptomic surveys together with targeted molecular studies are uncovering an ever increasing number of differentially expressed genes in relation to agriculturally relevant processes in olive (Olea europaea L). These data need to be supported by quantitative approaches enabling the precise estimation of transcript abundance. qPCR being the most widely adopted technique for mRNA quantification, preliminary work needs to be done to set up robust methods for extraction of fully functional RNA and for the identification of the best reference genes to obtain reliable quantification of transcripts. In this work, we have assessed different methods for their suitability for RNA extraction from olive fruits and leaves and we have evaluated thirteen potential candidate reference genes on 21 RNA samples belonging to fruit developmental/ripening series and to leaves subjected to wounding. By using two different algorithms, GAPDH2 and PP2A1 were identified as the best reference genes for olive fruit development and ripening, and their effectiveness for normalization of expression of two ripening marker genes was demonstrated.
Integrative Genome Comparison of Primary and Metastatic Melanomas

PubMed Central

Feng, Bin; Nazarian, Rosalynn M.; Bosenberg, Marcus; Wu, Min; Scott, Kenneth L.; Kwong, Lawrence N.; Xiao, Yonghong; Cordon-Cardo, Carlos; Granter, Scott R.; Ramaswamy, Sridhar; Golub, Todd; Duncan, Lyn M.; Wagner, Stephan N.; Brennan, Cameron; Chin, Lynda

2010-01-01

A cardinal feature of malignant melanoma is its metastatic propensity. An incomplete view of the genetic events driving metastatic progression has been a major barrier to rational development of effective therapeutics and prognostic diagnostics for melanoma patients. In this study, we conducted global genomic characterization of primary and metastatic melanomas to examine the genomic landscape associated with metastatic progression. In addition to uncovering three genomic subclasses of metastastic melanomas, we delineated 39 focal and recurrent regions of amplification and deletions, many of which encompassed resident genes that have not been implicated in cancer or metastasis. To identify progression-associated metastasis gene candidates, we applied a statistical approach, Integrative Genome Comparison (IGC), to define 32 genomic regions of interest that were significantly altered in metastatic relative to primary melanomas, encompassing 30 resident genes with statistically significant expression deregulation. Functional assays on a subset of these candidates, including MET, ASPM, AKAP9, IMP3, PRKCA, RPA3, and SCAP2, validated their pro-invasion activities in human melanoma cells. Validity of the IGC approach was further reinforced by tissue microarray analysis of Survivin showing significant increased protein expression in thick versus thin primary cutaneous melanomas, and a progression correlation with lymph node metastases. Together, these functional validation results and correlative analysis of human tissues support the thesis that integrated genomic and pathological analyses of staged melanomas provide a productive entry point for discovery of melanoma metastases genes. PMID:20520718
Integrated analysis of phenome, genome, and transcriptome of hybrid rice uncovered multiple heterosis-related loci for yield increase

PubMed Central

Li, Dayong; Huang, Zhiyuan; Song, Shuhui; Xin, Yeyun; Mao, Donghai; Lv, Qiming; Zhou, Ming; Tian, Dongmei; Tang, Mingfeng; Wu, Qi; Liu, Xue; Chen, Tingting; Song, Xianwei; Fu, Xiqin; Zhao, Bingran; Liang, Chengzhi; Li, Aihong; Liu, Guozhen; Li, Shigui; Hu, Songnian; Cao, Xiaofeng; Yu, Jun; Yuan, Longping; Chen, Caiyan; Zhu, Lihuang

2016-01-01

Hybrid rice is the dominant form of rice planted in China, and its use has extended worldwide since the 1970s. It offers great yield advantages and has contributed greatly to the world’s food security. However, the molecular mechanisms underlying heterosis have remained a mystery. In this study we integrated genetics and omics analyses to determine the candidate genes for yield heterosis in a model two-line rice hybrid system, Liang-you-pei 9 (LYP9) and its parents. Phenomics study revealed that the better parent heterosis (BPH) of yield in hybrid is not ascribed to BPH of all the yield components but is specific to the BPH of spikelet number per panicle (SPP) and paternal parent heterosis (PPH) of effective panicle number (EPN). Genetic analyses then identified multiple quantitative trait loci (QTLs) for these two components. Moreover, a number of differentially expressed genes and alleles in the hybrid were mapped by transcriptome profiling to the QTL regions as possible candidate genes. In parallel, a major QTL for yield heterosis, rice heterosis 8 (RH8), was found to be the DTH8/Ghd8/LHD1 gene. Based on the shared allelic heterozygosity of RH8 in many hybrid rice cultivars, a common mechanism for yield heterosis in the present commercial hybrid rice is proposed. PMID:27663737
Screening of miRNA profiles and construction of regulation networks in early and late lactation of dairy goat mammary glands.

PubMed

Ji, Zhibin; Liu, Zhaohua; Chao, Tianle; Hou, Lei; Fan, Rui; He, Rongyan; Wang, Guizhi; Wang, Jianmin

2017-09-20

In recent years, studies related to the expression profiles of miRNAs in the dairy goat mammary gland were performed, but regulatory mechanisms in the physiological environment and the dynamic homeostasis of mammary gland development and lactation are not clear. In the present study, sequencing data analysis of early and late lactation uncovered a total of 1,487 unique miRNAs, including 45 novel miRNA candidates and 1,442 known and conserved miRNAs, of which 758 miRNAs were co-expressed and 378 differentially expressed with P < 0.05. Moreover, 76 non-redundant target genes were annotated in 347 GO consortiums, with 3,143 candidate target genes grouped into 33 pathways. Additionally, 18 predicted target genes of 214 miRNAs were directly annotated in mammary gland development and used to construct regulatory networks based on GO annotation and the KEGG pathway. The expression levels of seven known miRNAs and three novel miRNAs were examined using quantitative real-time PCR. The results showed that miRNAs might play important roles in early and late lactation during dairy goat mammary gland development, which will be helpful to obtain a better understanding of the genetic control of mammary gland lactation and development.
Detection of genomic signatures of recent selection in commercial broiler chickens.

PubMed

Fu, Weixuan; Lee, William R; Abasht, Behnam

2016-08-26

Identification of the genomic signatures of recent selection may help uncover causal polymorphisms controlling traits relevant to recent decades of selective breeding in livestock. In this study, we aimed at detecting signatures of recent selection in commercial broiler chickens using genotype information from single nucleotide polymorphisms (SNPs). A total of 565 chickens from five commercial purebred lines, including three broiler sire (male) lines and two broiler dam (female) lines, were genotyped using the 60K SNP Illumina iSelect chicken array. To detect genomic signatures of recent selection, we applied two methods based on population comparison, cross-population extended haplotype homozygosity (XP-EHH) and cross-population composite likelihood ratio (XP-CLR), and further analyzed the results to find genomic regions under recent selection in multiple purebred lines. A total of 321 candidate selection regions spanning approximately 1.45 % of the chicken genome in each line were detected by consensus of results of both XP-EHH and XP-CLR methods. To minimize false discovery due to genetic drift, only 42 of the candidate selection regions that were shared by 2 or more purebred lines were considered as high-confidence selection regions in the study. Of these 42 regions, 20 were 50 kb or less while 4 regions were larger than 0.5 Mb. In total, 91 genes could be found in the 42 regions, among which 19 regions contained only 1 or 2 genes, and 9 regions were located at gene deserts. Our results provide a genome-wide scan of recent selection signatures in five purebred lines of commercial broiler chickens. We found several candidate genes for recent selection in multiple lines, such as SOX6 (Sex Determining Region Y-Box 6) and cTR (Thyroid hormone receptor beta). These genes may have been under recent selection due to their essential roles in growth, development and reproduction in chickens. Furthermore, our results suggest that in some candidate regions, the same or opposite alleles have been under recent selection in multiple lines. Most of the candidate genes in the selection regions are novel, and as such they should be of great interest for future research into the genetic architecture of traits relevant to modern broiler breeding.
Filling gaps in PPAR-alpha signaling through comparative nutrigenomics analysis.

PubMed

Cavalieri, Duccio; Calura, Enrica; Romualdi, Chiara; Marchi, Emmanuela; Radonjic, Marijana; Van Ommen, Ben; Müller, Michael

2009-12-11

The application of high-throughput genomic tools in nutrition research is a widespread practice. However, it is becoming increasingly clear that the outcome of individual expression studies is insufficient for the comprehensive understanding of such a complex field. Currently, the availability of the large amounts of expression data in public repositories has opened up new challenges on microarray data analyses. We have focused on PPARalpha, a ligand-activated transcription factor functioning as fatty acid sensor controlling the gene expression regulation of a large set of genes in various metabolic organs such as liver, small intestine or heart. The function of PPARalpha is strictly connected to the function of its target genes and, although many of these have already been identified, major elements of its physiological function remain to be uncovered. To further investigate the function of PPARalpha, we have applied a cross-species meta-analysis approach to integrate sixteen microarray datasets studying high fat diet and PPARalpha signal perturbations in different organisms. We identified 164 genes (MDEGs) that were differentially expressed in a constant way in response to a high fat diet or to perturbations in PPARs signalling. In particular, we found five genes in yeast which were highly conserved and homologous of PPARalpha targets in mammals, potential candidates to be used as models for the equivalent mammalian genes. Moreover, a screening of the MDEGs for all known transcription factor binding sites and the comparison with a human genome-wide screening of Peroxisome Proliferating Response Elements (PPRE), enabled us to identify, 20 new potential candidate genes that show, both binding site, both change in expression in the condition studied. Lastly, we found a non random localization of the differentially expressed genes in the genome. The results presented are potentially of great interest to resume the currently available expression data, exploiting the power of in silico analysis filtered by evolutionary conservation. The analysis enabled us to indicate potential gene candidates that could fill in the gaps with regards to the signalling of PPARalpha and, moreover, the non-random localization of the differentially expressed genes in the genome, suggest that epigenetic mechanisms are of importance in the regulation of the transcription operated by PPARalpha.

Global Genome and Transcriptome Analyses of Magnaporthe oryzae Epidemic Isolate 98-06 Uncover Novel Effectors and Pathogenicity-Related Genes, Revealing Gene Gain and Lose Dynamics in Genome Evolution

PubMed Central

Dong, Yanhan; Li, Ying; Zhao, Miaomiao; Jing, Maofeng; Liu, Xinyu; Liu, Muxing; Guo, Xianxian; Zhang, Xing; Chen, Yue; Liu, Yongfeng; Liu, Yanhong; Ye, Wenwu; Zhang, Haifeng; Wang, Yuanchao; Zheng, Xiaobo; Wang, Ping; Zhang, Zhengguang

2015-01-01

Genome dynamics of pathogenic organisms are driven by pathogen and host co-evolution, in which pathogen genomes are shaped to overcome stresses imposed by hosts with various genetic backgrounds through generation of a variety of isolates. This same principle applies to the rice blast pathogen Magnaporthe oryzae and the rice host; however, genetic variations among different isolates of M. oryzae remain largely unknown, particularly at genome and transcriptome levels. Here, we applied genomic and transcriptomic analytical tools to investigate M. oryzae isolate 98-06 that is the most aggressive in infection of susceptible rice cultivars. A unique 1.4 Mb of genomic sequences was found in isolate 98-06 in comparison to reference strain 70-15. Genome-wide expression profiling revealed the presence of two critical expression patterns of M. oryzae based on 64 known pathogenicity-related (PaR) genes. In addition, 134 candidate effectors with various segregation patterns were identified. Five tested proteins could suppress BAX-mediated programmed cell death in Nicotiana benthamiana leaves. Characterization of isolate-specific effector candidates Iug6 and Iug9 and PaR candidate Iug18 revealed that they have a role in fungal propagation and pathogenicity. Moreover, Iug6 and Iug9 are located exclusively in the biotrophic interfacial complex (BIC) and their overexpression leads to suppression of defense-related gene expression in rice, suggesting that they might participate in biotrophy by inhibiting the SA and ET pathways within the host. Thus, our studies identify novel effector and PaR proteins involved in pathogenicity of the highly aggressive M. oryzae field isolate 98-06, and reveal molecular and genomic dynamics in the evolution of M. oryzae and rice host interactions. PMID:25837042
Integrative Assessment of Chlorine-Induced Acute Lung Injury in Mice

PubMed Central

Pope-Varsalona, Hannah; Concel, Vincent J.; Liu, Pengyuan; Bein, Kiflai; Berndt, Annerose; Martin, Timothy M.; Ganguly, Koustav; Jang, An Soo; Brant, Kelly A.; Dopico, Richard A.; Upadhyay, Swapna; Di, Y. P. Peter; Hu, Zhen; Vuga, Louis J.; Medvedovic, Mario; Kaminski, Naftali; You, Ming; Alexander, Danny C.; McDunn, Jonathan E.; Prows, Daniel R.; Knoell, Daren L.

2012-01-01

The genetic basis for the underlying individual susceptibility to chlorine-induced acute lung injury is unknown. To uncover the genetic basis and pathophysiological processes that could provide additional homeostatic capacities during lung injury, 40 inbred murine strains were exposed to chlorine, and haplotype association mapping was performed. The identified single-nucleotide polymorphism (SNP) associations were evaluated through transcriptomic and metabolomic profiling. Using ≥ 10% allelic frequency and ≥ 10% phenotype explained as threshold criteria, promoter SNPs that could eliminate putative transcriptional factor recognition sites in candidate genes were assessed by determining transcript levels through microarray and reverse real-time PCR during chlorine exposure. The mean survival time varied by approximately 5-fold among strains, and SNP associations were identified for 13 candidate genes on chromosomes 1, 4, 5, 9, and 15. Microarrays revealed several differentially enriched pathways, including protein transport (decreased more in the sensitive C57BLKS/J lung) and protein catabolic process (increased more in the resistant C57BL/10J lung). Lung metabolomic profiling revealed 95 of the 280 metabolites measured were altered by chlorine exposure, and included alanine, which decreased more in the C57BLKS/J than in the C57BL/10J strain, and glutamine, which increased more in the C57BL/10J than in the C57BLKS/J strain. Genetic associations from haplotype mapping were strengthened by an integrated assessment using transcriptomic and metabolomic profiling. The leading candidate genes associated with increased susceptibility to acute lung injury in mice included Klf4, Sema7a, Tns1, Aacs, and a gene that encodes an amino acid carrier, Slc38a4. PMID:22447970
Genome-Wide Association Mapping Uncovers Fw1, a Dominant Gene Conferring Resistance to Fusarium Wilt in Strawberry.

PubMed

Pincot, Dominique D A; Poorten, Thomas J; Hardigan, Michael A; Harshman, Julia M; Acharya, Charlotte B; Cole, Glenn S; Gordon, Thomas R; Stueven, Michelle; Edger, Patrick P; Knapp, Steven J

2018-05-04

Fusarium wilt, a soil-borne disease caused by the fungal pathogen Fusarium oxysporum f. sp. fragariae , threatens strawberry ( Fragaria × ananassa ) production worldwide. The spread of the pathogen, coupled with disruptive changes in soil fumigation practices, have greatly increased disease pressure and the importance of developing resistant cultivars. While resistant and susceptible cultivars have been reported, a limited number of germplasm accessions have been analyzed, and contradictory conclusions have been reached in earlier studies to elucidate the underlying genetic basis of resistance. Here, we report the discovery of Fw1 , a dominant gene conferring resistance to Fusarium wilt in strawberry. The Fw1 locus was uncovered in a genome-wide association study of 565 historically and commercially important strawberry accessions genotyped with 14,408 SNP markers. Fourteen SNPs in linkage disequilibrium with Fw1 physically mapped to a 2.3 Mb segment on chromosome 2 in a diploid F. vesca reference genome. Fw1 and 11 tightly linked GWAS-significant SNPs mapped to linkage group 2C in octoploid segregating populations. The most significant SNP explained 85% of the phenotypic variability and predicted resistance in 97% of the accessions tested-broad-sense heritability was 0.96. Several disease resistance and defense-related gene homologs, including a small cluster of genes encoding nucleotide-binding leucine-rich-repeat proteins, were identified in the 0.7 Mb genomic segment predicted to harbor Fw1 DNA variants and candidate genes identified in the present study should facilitate the development of high-throughput genotyping assays for accurately predicting Fusarium wilt phenotypes and applying marker-assisted selection. Copyright © 2018 Pincot et al.
Genes uniquely expressed in human growth plate chondrocytes uncover a distinct regulatory network.

PubMed

Li, Bing; Balasubramanian, Karthika; Krakow, Deborah; Cohn, Daniel H

2017-12-20

Chondrogenesis is the earliest stage of skeletal development and is a highly dynamic process, integrating the activities and functions of transcription factors, cell signaling molecules and extracellular matrix proteins. The molecular mechanisms underlying chondrogenesis have been extensively studied and multiple key regulators of this process have been identified. However, a genome-wide overview of the gene regulatory network in chondrogenesis has not been achieved. In this study, employing RNA sequencing, we identified 332 protein coding genes and 34 long non-coding RNA (lncRNA) genes that are highly selectively expressed in human fetal growth plate chondrocytes. Among the protein coding genes, 32 genes were associated with 62 distinct human skeletal disorders and 153 genes were associated with skeletal defects in knockout mice, confirming their essential roles in skeletal formation. These gene products formed a comprehensive physical interaction network and participated in multiple cellular processes regulating skeletal development. The data also revealed 34 transcription factors and 11,334 distal enhancers that were uniquely active in chondrocytes, functioning as transcriptional regulators for the cartilage-selective genes. Our findings revealed a complex gene regulatory network controlling skeletal development whereby transcription factors, enhancers and lncRNAs participate in chondrogenesis by transcriptional regulation of key genes. Additionally, the cartilage-selective genes represent candidate genes for unsolved human skeletal disorders.
Large-Scale Discovery of Disease-Disease and Disease-Gene Associations

PubMed Central

Gligorijevic, Djordje; Stojanovic, Jelena; Djuric, Nemanja; Radosavljevic, Vladan; Grbovic, Mihajlo; Kulathinal, Rob J.; Obradovic, Zoran

2016-01-01

Data-driven phenotype analyses on Electronic Health Record (EHR) data have recently drawn benefits across many areas of clinical practice, uncovering new links in the medical sciences that can potentially affect the well-being of millions of patients. In this paper, EHR data is used to discover novel relationships between diseases by studying their comorbidities (co-occurrences in patients). A novel embedding model is designed to extract knowledge from disease comorbidities by learning from a large-scale EHR database comprising more than 35 million inpatient cases spanning nearly a decade, revealing significant improvements on disease phenotyping over current computational approaches. In addition, the use of the proposed methodology is extended to discover novel disease-gene associations by including valuable domain knowledge from genome-wide association studies. To evaluate our approach, its effectiveness is compared against a held-out set where, again, it revealed very compelling results. For selected diseases, we further identify candidate gene lists for which disease-gene associations were not studied previously. Thus, our approach provides biomedical researchers with new tools to filter genes of interest, thus, reducing costly lab studies. PMID:27578529
Direct interplay between two candidate genes in FSHD muscular dystrophy

PubMed Central

Ferri, Giulia; Huichalaf, Claudia H.; Caccia, Roberta; Gabellini, Davide

2015-01-01

Facioscapulohumeral muscular dystrophy (FSHD) is one of the most common neuromuscular disorders. The major form of the disease (FSHD1) is linked to decrease in copy number of a 3.3-kb tandem repeated macrosatellite (D4Z4), located on chromosome 4q35. D4Z4 deletion alters chromatin structure of the locus leading to aberrant expression of nearby 4q35 genes. Given the high variability in disease onset and progression, multiple factors could contribute to the pathogenesis of FSHD. Among the FSHD candidate genes are double homeobox 4 (DUX4), encoded by the most telomeric D4Z4 unit, and FSHD region gene 1 (FRG1). DUX4 is a sequence-specific transcription factor. Here, we located putative DUX4 binding sites in the human FRG1 genomic area and we show specific DUX4 association to these regions. We found also that ectopically expressed DUX4 up-regulates the endogenous human FRG1 gene in healthy muscle cells, while DUX4 knockdown leads to a decrease in FRG1 expression in FSHD muscle cells. Moreover, DUX4 binds directly and specifically to its binding site located in the human FRG1 gene and transactivates constructs containing FRG1 genomic regions. Intriguingly, the mouse Frg1 genomic area lacks DUX4 binding sites and DUX4 is unable to activate the endogenous mouse Frg1 gene providing a possible explanation for the lack of muscle phenotype in DUX4 transgenic mice. Altogether, our results demonstrate that FRG1 is a direct DUX4 transcriptional target uncovering a novel regulatory circuit contributing to FSHD. PMID:25326393
Exome Sequencing and the Management of Neurometabolic Disorders.

PubMed

Tarailo-Graovac, Maja; Shyr, Casper; Ross, Colin J; Horvath, Gabriella A; Salvarinova, Ramona; Ye, Xin C; Zhang, Lin-Hua; Bhavsar, Amit P; Lee, Jessica J Y; Drögemöller, Britt I; Abdelsayed, Mena; Alfadhel, Majid; Armstrong, Linlea; Baumgartner, Matthias R; Burda, Patricie; Connolly, Mary B; Cameron, Jessie; Demos, Michelle; Dewan, Tammie; Dionne, Janis; Evans, A Mark; Friedman, Jan M; Garber, Ian; Lewis, Suzanne; Ling, Jiqiang; Mandal, Rupasri; Mattman, Andre; McKinnon, Margaret; Michoulas, Aspasia; Metzger, Daniel; Ogunbayo, Oluseye A; Rakic, Bojana; Rozmus, Jacob; Ruben, Peter; Sayson, Bryan; Santra, Saikat; Schultz, Kirk R; Selby, Kathryn; Shekel, Paul; Sirrs, Sandra; Skrypnyk, Cristina; Superti-Furga, Andrea; Turvey, Stuart E; Van Allen, Margot I; Wishart, David; Wu, Jiang; Wu, John; Zafeiriou, Dimitrios; Kluijtmans, Leo; Wevers, Ron A; Eydoux, Patrice; Lehman, Anna M; Vallance, Hilary; Stockler-Ipsiroglu, Sylvia; Sinclair, Graham; Wasserman, Wyeth W; van Karnebeek, Clara D

2016-06-09

Whole-exome sequencing has transformed gene discovery and diagnosis in rare diseases. Translation into disease-modifying treatments is challenging, particularly for intellectual developmental disorder. However, the exception is inborn errors of metabolism, since many of these disorders are responsive to therapy that targets pathophysiological features at the molecular or cellular level. To uncover the genetic basis of potentially treatable inborn errors of metabolism, we combined deep clinical phenotyping (the comprehensive characterization of the discrete components of a patient's clinical and biochemical phenotype) with whole-exome sequencing analysis through a semiautomated bioinformatics pipeline in consecutively enrolled patients with intellectual developmental disorder and unexplained metabolic phenotypes. We performed whole-exome sequencing on samples obtained from 47 probands. Of these patients, 6 were excluded, including 1 who withdrew from the study. The remaining 41 probands had been born to predominantly nonconsanguineous parents of European descent. In 37 probands, we identified variants in 2 genes newly implicated in disease, 9 candidate genes, 22 known genes with newly identified phenotypes, and 9 genes with expected phenotypes; in most of the genes, the variants were classified as either pathogenic or probably pathogenic. Complex phenotypes of patients in five families were explained by coexisting monogenic conditions. We obtained a diagnosis in 28 of 41 probands (68%) who were evaluated. A test of a targeted intervention was performed in 18 patients (44%). Deep phenotyping and whole-exome sequencing in 41 probands with intellectual developmental disorder and unexplained metabolic abnormalities led to a diagnosis in 68%, the identification of 11 candidate genes newly implicated in neurometabolic disease, and a change in treatment beyond genetic counseling in 44%. (Funded by BC Children's Hospital Foundation and others.).
Exome Sequencing and the Management of Neurometabolic Disorders

PubMed Central

Tarailo-Graovac, M.; Shyr, C.; Ross, C.J.; Horvath, G.A.; Salvarinova, R.; Ye, X.C.; Zhang, L.-H.; Bhavsar, A.P.; Lee, J.J.Y.; Drögemöller, B.I.; Abdelsayed, M.; Alfadhel, M.; Armstrong, L.; Baumgartner, M.R.; Burda, P.; Connolly, M.B.; Cameron, J.; Demos, M.; Dewan, T.; Dionne, J.; Evans, A.M.; Friedman, J.M.; Garber, I.; Lewis, S.; Ling, J.; Mandal, R.; Mattman, A.; McKinnon, M.; Michoulas, A.; Metzger, D.; Ogunbayo, O.A.; Rakic, B.; Rozmus, J.; Ruben, P.; Sayson, B.; Santra, S.; Schultz, K.R.; Selby, K.; Shekel, P.; Sirrs, S.; Skrypnyk, C.; Superti-Furga, A.; Turvey, S.E.; Van Allen, M.I.; Wishart, D.; Wu, J.; Wu, J.; Zafeiriou, D.; Kluijtmans, L.; Wevers, R.A.; Eydoux, P.; Lehman, A.M.; Vallance, H.; Stockler-Ipsiroglu, S.; Sinclair, G.; Wasserman, W.W.; van Karnebeek, C.D.

2016-01-01

BACKGROUND Whole-exome sequencing has transformed gene discovery and diagnosis in rare diseases. Translation into disease-modifying treatments is challenging, particularly for intellectual developmental disorder. However, the exception is inborn errors of metabolism, since many of these disorders are responsive to therapy that targets pathophysiological features at the molecular or cellular level. METHODS To uncover the genetic basis of potentially treatable inborn errors of metabolism, we combined deep clinical phenotyping (the comprehensive characterization of the discrete components of a patient’s clinical and biochemical phenotype) with whole-exome sequencing analysis through a semiautomated bioinformatics pipeline in consecutively enrolled patients with intellectual developmental disorder and unexplained metabolic phenotypes. RESULTS We performed whole-exome sequencing on samples obtained from 47 probands. Of these patients, 6 were excluded, including 1 who withdrew from the study. The remaining 41 probands had been born to predominantly nonconsanguineous parents of European descent. In 37 probands, we identified variants in 2 genes newly implicated in disease, 9 candidate genes, 22 known genes with newly identified phenotypes, and 9 genes with expected phenotypes; in most of the genes, the variants were classified as either pathogenic or probably pathogenic. Complex phenotypes of patients in five families were explained by coexisting monogenic conditions. We obtained a diagnosis in 28 of 41 probands (68%) who were evaluated. A test of a targeted intervention was performed in 18 patients (44%). CONCLUSIONS Deep phenotyping and whole-exome sequencing in 41 probands with intellectual developmental disorder and unexplained metabolic abnormalities led to a diagnosis in 68%, the identification of 11 candidate genes newly implicated in neurometabolic disease, and a change in treatment beyond genetic counseling in 44%. (Funded by BC Children’s Hospital Foundation and others.) PMID:27276562
Identification of candidate genes in osteoporosis by integrated microarray analysis.

PubMed

Li, J J; Wang, B Q; Fei, Q; Yang, Y; Li, D

2016-12-01

In order to screen the altered gene expression profile in peripheral blood mononuclear cells of patients with osteoporosis, we performed an integrated analysis of the online microarray studies of osteoporosis. We searched the Gene Expression Omnibus (GEO) database for microarray studies of peripheral blood mononuclear cells in patients with osteoporosis. Subsequently, we integrated gene expression data sets from multiple microarray studies to obtain differentially expressed genes (DEGs) between patients with osteoporosis and normal controls. Gene function analysis was performed to uncover the functions of identified DEGs. A total of three microarray studies were selected for integrated analysis. In all, 1125 genes were found to be signiﬁcantly differentially expressed between osteoporosis patients and normal controls, with 373 upregulated and 752 downregulated genes. Positive regulation of the cellular amino metabolic process (gene ontology (GO): 0033240, false discovery rate (FDR) = 1.00E + 00) was significantly enriched under the GO category for biological processes, while for molecular functions, flavin adenine dinucleotide binding (GO: 0050660, FDR = 3.66E-01) and androgen receptor binding (GO: 0050681, FDR = 6.35E-01) were significantly enriched. DEGs were enriched in many osteoporosis-related signalling pathways, including those of mitogen-activated protein kinase (MAPK) and calcium. Protein-protein interaction (PPI) network analysis showed that the significant hub proteins contained ubiquitin specific peptidase 9, X-linked (Degree = 99), ubiquitin specific peptidase 19 (Degree = 57) and ubiquitin conjugating enzyme E2 B (Degree = 57). Analysis of gene function of identified differentially expressed genes may expand our understanding of fundamental mechanisms leading to osteoporosis. Moreover, significantly enriched pathways, such as MAPK and calcium, may involve in osteoporosis through osteoblastic differentiation and bone formation.Cite this article: J. J. Li, B. Q. Wang, Q. Fei, Y. Yang, D. Li. Identification of candidate genes in osteoporosis by integrated microarray analysis. Bone Joint Res 2016;5:594-601. DOI: 10.1302/2046-3758.512.BJR-2016-0073.R1. © 2016 Fei et al.
Identification of differentially expressed genes through RNA sequencing in goats (Capra hircus) at different postnatal stages

PubMed Central

Li, Qian; Lin, Sen

2017-01-01

Intramuscular fat (IMF) content and fatty acid composition of longissimus dorsi muscle (LM) change with growth, which partially determines the flavor and nutritional value of goat (Capra hircus) meat. However, unlike cattle, little information is available on the transcriptome-wide changes during different postnatal stages in small ruminants, especially goats. In this study, the sequencing reads of goat LM tissues collected from kid, youth, and adult period were mapped to the goat genome. Results showed that out of total 24 689 Unigenes, 20 435 Unigenes were annotated. Based on expected number of fragments per kilobase of transcript sequence per million base pairs sequenced (FPKM), 111 annotated differentially expressed genes (DEGs) were identified among different postnatal stages, which were subsequently assigned to 16 possible expression patterns by series-cluster analysis. Functional classification by Gene Ontology (GO) analysis was used for selecting the genes showing highest expression related to lipid metabolism. Finally, we identified the node genes for lipid metabolism regulation using co-expression analysis. In conclusion, these data may uncover candidate genes having functional roles in regulation of goat muscle development and lipid metabolism during the various growth stages in goats. PMID:28800357
Identification of differentially expressed genes through RNA sequencing in goats (Capra hircus) at different postnatal stages.

PubMed

Lin, Yaqiu; Zhu, Jiangjiang; Wang, Yong; Li, Qian; Lin, Sen

2017-01-01

Intramuscular fat (IMF) content and fatty acid composition of longissimus dorsi muscle (LM) change with growth, which partially determines the flavor and nutritional value of goat (Capra hircus) meat. However, unlike cattle, little information is available on the transcriptome-wide changes during different postnatal stages in small ruminants, especially goats. In this study, the sequencing reads of goat LM tissues collected from kid, youth, and adult period were mapped to the goat genome. Results showed that out of total 24 689 Unigenes, 20 435 Unigenes were annotated. Based on expected number of fragments per kilobase of transcript sequence per million base pairs sequenced (FPKM), 111 annotated differentially expressed genes (DEGs) were identified among different postnatal stages, which were subsequently assigned to 16 possible expression patterns by series-cluster analysis. Functional classification by Gene Ontology (GO) analysis was used for selecting the genes showing highest expression related to lipid metabolism. Finally, we identified the node genes for lipid metabolism regulation using co-expression analysis. In conclusion, these data may uncover candidate genes having functional roles in regulation of goat muscle development and lipid metabolism during the various growth stages in goats.
The Quest for the 1p36 Tumor Suppressor

PubMed Central

Bagchi, Anindya; Mills, Alea A.

2010-01-01

Genomic analyses of late-stage human cancers have uncovered deletions encompassing 1p36, thereby providing an extensive body of literature supporting the idea that a potent tumor suppressor resides in this interval. Although a number of genes have been proposed as 1p36 candidate tumor suppressors, convincing evidence that their encoded products protect from cancer has been scanty. A recent functional study identified CHD5 as a novel tumor suppressor mapping to 1p36. Here we discuss evidence supporting CHD5’s tumor suppressive role. Together, these findings suggest that strategies designed to enhance CHD5 activity could provide novel approaches for treating a broad range of human malignancies. PMID:18413720
Cross-ancestry genome-wide association analysis of corneal thickness strengthens link between complex and Mendelian eye diseases.

PubMed

Iglesias, Adriana I; Mishra, Aniket; Vitart, Veronique; Bykhovskaya, Yelena; Höhn, René; Springelkamp, Henriët; Cuellar-Partida, Gabriel; Gharahkhani, Puya; Bailey, Jessica N Cooke; Willoughby, Colin E; Li, Xiaohui; Yazar, Seyhan; Nag, Abhishek; Khawaja, Anthony P; Polašek, Ozren; Siscovick, David; Mitchell, Paul; Tham, Yih Chung; Haines, Jonathan L; Kearns, Lisa S; Hayward, Caroline; Shi, Yuan; van Leeuwen, Elisabeth M; Taylor, Kent D; Bonnemaijer, Pieter; Rotter, Jerome I; Martin, Nicholas G; Zeller, Tanja; Mills, Richard A; Staffieri, Sandra E; Jonas, Jost B; Schmidtmann, Irene; Boutin, Thibaud; Kang, Jae H; Lucas, Sionne E M; Wong, Tien Yin; Beutel, Manfred E; Wilson, James F; Uitterlinden, André G; Vithana, Eranga N; Foster, Paul J; Hysi, Pirro G; Hewitt, Alex W; Khor, Chiea Chuen; Pasquale, Louis R; Montgomery, Grant W; Klaver, Caroline C W; Aung, Tin; Pfeiffer, Norbert; Mackey, David A; Hammond, Christopher J; Cheng, Ching-Yu; Craig, Jamie E; Rabinowitz, Yaron S; Wiggs, Janey L; Burdon, Kathryn P; van Duijn, Cornelia M; MacGregor, Stuart

2018-05-14

Central corneal thickness (CCT) is a highly heritable trait associated with complex eye diseases such as keratoconus and glaucoma. We perform a genome-wide association meta-analysis of CCT and identify 19 novel regions. In addition to adding support for known connective tissue-related pathways, pathway analyses uncover previously unreported gene sets. Remarkably, >20% of the CCT-loci are near or within Mendelian disorder genes. These included FBN1, ADAMTS2 and TGFB2 which associate with connective tissue disorders (Marfan, Ehlers-Danlos and Loeys-Dietz syndromes), and the LUM-DCN-KERA gene complex involved in myopia, corneal dystrophies and cornea plana. Using index CCT-increasing variants, we find a significant inverse correlation in effect sizes between CCT and keratoconus (r = -0.62, P = 5.30 × 10 -5 ) but not between CCT and primary open-angle glaucoma (r = -0.17, P = 0.2). Our findings provide evidence for shared genetic influences between CCT and keratoconus, and implicate candidate genes acting in collagen and extracellular matrix regulation.
Genomic Inventory and Transcriptional Analysis of Medicago truncatula Transporters1[W][OA

PubMed Central

Benedito, Vagner A.; Li, Haiquan; Dai, Xinbin; Wandrey, Maren; He, Ji; Kaundal, Rakesh; Torres-Jerez, Ivone; Gomez, S. Karen; Harrison, Maria J.; Tang, Yuhong; Zhao, Patrick X.; Udvardi, Michael K.

2010-01-01

Transporters move hydrophilic substrates across hydrophobic biological membranes and play key roles in plant nutrition, metabolism, and signaling and, consequently, in plant growth, development, and responses to the environment. To initiate and support systematic characterization of transporters in the model legume Medicago truncatula, we identified 3,830 transporters and classified 2,673 of these into 113 families and 146 subfamilies. Analysis of gene expression data for 2,611 of these transporters identified 129 that are expressed in an organ-specific manner, including 50 that are nodule specific and 36 specific to mycorrhizal roots. Further analysis uncovered 196 transporters that are induced at least 5-fold during nodule development and 44 in roots during arbuscular mycorrhizal symbiosis. Among the nodule- and mycorrhiza-induced transporter genes are many candidates for known transport activities in these beneficial symbioses. The data presented here are a unique resource for the selection and functional characterization of legume transporters. PMID:20023147
Potential genetic polymorphisms predicting polycystic ovary syndrome.

PubMed

Chen, Yao; Fang, Shu-Ying

2018-05-01

Polycystic ovary syndrome (PCOS) is a heterogenous endocrine disorder with typical symptoms of oligomenorrhoea, hyperandrogenism, hirsutism, obesity, insulin resistance and increased risk of type 2 diabetes mellitus. Extensive evidence indicates that PCOS is a genetic disease and numerous biochemical pathways have been linked with its pathogenesis. A number of genes from these pathways have been investigated, which include those involved with steroid hormone biosynthesis and metabolism, action of gonadotropin and gonadal hormones, folliculogenesis, obesity and energy regulation, insulin secretion and action and many others. In this review, we summarize the historical and recent findings in genetic polymorphisms of PCOS from the relevant publications and outline some genetic polymorphisms that are potentially associated with the risk of PCOS. This information could uncover candidate genes associating with PCOS, which will be valuable for the development of novel diagnostic and treatment platforms for PCOS patients. © 2018 The authors.
Accelerated Evolution in Distinctive Species Reveals Candidate Elements for Clinically Relevant Traits, Including Mutation and Cancer Resistance.

PubMed

Ferris, Elliott; Abegglen, Lisa M; Schiffman, Joshua D; Gregg, Christopher

2018-03-06

The identity of most functional elements in the mammalian genome and the phenotypes they impact are unclear. Here, we perform a genome-wide comparative analysis of patterns of accelerated evolution in species with highly distinctive traits to discover candidate functional elements for clinically important phenotypes. We identify accelerated regions (ARs) in the elephant, hibernating bat, orca, dolphin, naked mole rat, and thirteen-lined ground squirrel lineages in mammalian conserved regions, uncovering ∼33,000 elements that bind hundreds of different regulatory proteins in humans and mice. ARs in the elephant, the largest land mammal, are uniquely enriched near elephant DNA damage response genes. The genomic hotspot for elephant ARs is the E3 ligase subunit of the Fanconi anemia complex, a master regulator of DNA repair. Additionally, ARs in the six species are associated with specific human clinical phenotypes that have apparent concordance with overt traits in each species. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
De novo transcriptomic analysis during Lentinula edodes fruiting body growth.

PubMed

Wang, Yingzhu; Zeng, Xianlu; Liu, Wenguang

2018-01-30

The fruiting body of Lentinula edodes is a popular edible mushroom, and extracts from the mycelium and the fruiting body of this species have diverse therapeutic potential. To gain insights into the molecular mechanisms underlying the fruiting body growth of L. edodes from the early bud stage (EBS), through the intermediate developing stage (IDS), to the fully developed stage (FDS), we performed de novo transcriptomic analysis using high-throughput Illumina RNA-sequencing. First, we generated three cDNA libraries representative of the three respective stages. We then obtained 38,933,148, 44,594,472, and 37,905,646 high-quality reads from the respective libraries and assembled the reads into 25,104 transcriptional contigs, containing 15,199 unigenes. We found that only 9331 of the unigenes had been annotated in the NCBI non-redundant protein database, and we functionally annotated 4758 of them through Gene Ontology (GO) analysis and 2921 of them through Clusters of Orthologous Groups of proteins (COGs) analysis. We also assigned 3995 unigenes to metabolic pathways by using the Kyoto Encyclopedia of Genes and Genomes (KEGG). We further identified 399 differentially expressed genes (DEGs) between EBS and IDS, 1428 between IDS and FDS, and 1830 between EBS and FDS, uncovering 769 DEGs in multiple metabolic and signaling pathways. Interestingly, there were a limited number of DEGs whose expression was dramatically associated with FDS. Finally, genes, whose expression was either highly up-regulated in FDS or remained at a high level during fruiting body growth, were annotated specifically in the pathways of purine metabolism, unsaturated fatty acid metabolism and meiosis, suggesting that these key molecular events were actively occurring in the fruiting body. Our work is the first high-throughput transcriptome study on the growth of L. edodes fruiting bodies, and the results uncovered candidate genes for future gene identification and utilization of this commercially and medically important mushroom. Copyright © 2017 Elsevier B.V. All rights reserved.
Cultural Competence and Cultural Identity: Using Telementoring to Form Relationships of Synergy

ERIC Educational Resources Information Center

Friedman, Audrey; Herrmann, Brian

2014-01-01

This study addresses the following research question: How does telementoring urban high school students by English teacher candidates develop candidates' cultural competence and impact mentees' cultural identity development? Mentee-mentor exchanges were analyzed to uncover how mentees used writing to develop cultural identity, how mentors'…
Interactogeneous: Disease Gene Prioritization Using Heterogeneous Networks and Full Topology Scores

PubMed Central

Gonçalves, Joana P.; Francisco, Alexandre P.; Moreau, Yves; Madeira, Sara C.

2012-01-01

Disease gene prioritization aims to suggest potential implications of genes in disease susceptibility. Often accomplished in a guilt-by-association scheme, promising candidates are sorted according to their relatedness to known disease genes. Network-based methods have been successfully exploiting this concept by capturing the interaction of genes or proteins into a score. Nonetheless, most current approaches yield at least some of the following limitations: (1) networks comprise only curated physical interactions leading to poor genome coverage and density, and bias toward a particular source; (2) scores focus on adjacencies (direct links) or the most direct paths (shortest paths) within a constrained neighborhood around the disease genes, ignoring potentially informative indirect paths; (3) global clustering is widely applied to partition the network in an unsupervised manner, attributing little importance to prior knowledge; (4) confidence weights and their contribution to edge differentiation and ranking reliability are often disregarded. We hypothesize that network-based prioritization related to local clustering on graphs and considering full topology of weighted gene association networks integrating heterogeneous sources should overcome the above challenges. We term such a strategy Interactogeneous. We conducted cross-validation tests to assess the impact of network sources, alternative path inclusion and confidence weights on the prioritization of putative genes for 29 diseases. Heat diffusion ranking proved the best prioritization method overall, increasing the gap to neighborhood and shortest paths scores mostly on single source networks. Heterogeneous associations consistently delivered superior performance over single source data across the majority of methods. Results on the contribution of confidence weights were inconclusive. Finally, the best Interactogeneous strategy, heat diffusion ranking and associations from the STRING database, was used to prioritize genes for Parkinson’s disease. This method effectively recovered known genes and uncovered interesting candidates which could be linked to pathogenic mechanisms of the disease. PMID:23185389
Pathways and genes differentially expressed in the motor cortex of patients with sporadic amyotrophic lateral sclerosis.

PubMed

Lederer, Carsten W; Torrisi, Antonietta; Pantelidou, Maria; Santama, Niovi; Cavallaro, Sebastiano

2007-01-23

Amyotrophic lateral sclerosis (ALS) is a fatal disorder caused by the progressive degeneration of motoneurons in brain and spinal cord. Despite identification of disease-linked mutations, the diversity of processes involved and the ambiguity of their relative importance in ALS pathogenesis still represent a major impediment to disease models as a basis for effective therapies. Moreover, the human motor cortex, although critical to ALS pathology and physiologically altered in most forms of the disease, has not been screened systematically for therapeutic targets. By whole-genome expression profiling and stringent significance tests we identify genes and gene groups de-regulated in the motor cortex of patients with sporadic ALS, and interpret the role of individual candidate genes in a framework of differentially expressed pathways. Our findings emphasize the importance of defense responses and cytoskeletal, mitochondrial and proteasomal dysfunction, reflect reduced neuronal maintenance and vesicle trafficking, and implicate impaired ion homeostasis and glycolysis in ALS pathogenesis. Additionally, we compared our dataset with publicly available data for the SALS spinal cord, and show a high correlation of changes linked to the diseased state in the SALS motor cortex. In an analogous comparison with data for the Alzheimer's disease hippocampus we demonstrate a low correlation of global changes and a moderate correlation for changes specifically linked to the SALS diseased state. Gene and sample numbers investigated allow pathway- and gene-based analyses by established error-correction methods, drawing a molecular portrait of the ALS motor cortex that faithfully represents many known disease features and uncovers several novel aspects of ALS pathology. Contrary to expectations for a tissue under oxidative stress, nuclear-encoded mitochondrial genes are uniformly down-regulated. Moreover, the down-regulation of mitochondrial and glycolytic genes implies a combined reduction of mitochondrial and cytoplasmic energy supply, with a possible role in the death of ALS motoneurons. Identifying candidate genes exclusively expressed in non-neuronal cells, we also highlight the importance of these cells in disease development in the motor cortex. Notably, some pathways and candidate genes identified by this study are direct or indirect targets of medication already applied to unrelated illnesses and point the way towards the rapid development of effective symptomatic ALS therapies.

Filling gaps in PPAR-alpha signaling through comparative nutrigenomics analysis

PubMed Central

2009-01-01

Background The application of high-throughput genomic tools in nutrition research is a widespread practice. However, it is becoming increasingly clear that the outcome of individual expression studies is insufficient for the comprehensive understanding of such a complex field. Currently, the availability of the large amounts of expression data in public repositories has opened up new challenges on microarray data analyses. We have focused on PPARα, a ligand-activated transcription factor functioning as fatty acid sensor controlling the gene expression regulation of a large set of genes in various metabolic organs such as liver, small intestine or heart. The function of PPARα is strictly connected to the function of its target genes and, although many of these have already been identified, major elements of its physiological function remain to be uncovered. To further investigate the function of PPARα, we have applied a cross-species meta-analysis approach to integrate sixteen microarray datasets studying high fat diet and PPARα signal perturbations in different organisms. Results We identified 164 genes (MDEGs) that were differentially expressed in a constant way in response to a high fat diet or to perturbations in PPARs signalling. In particular, we found five genes in yeast which were highly conserved and homologous of PPARα targets in mammals, potential candidates to be used as models for the equivalent mammalian genes. Moreover, a screening of the MDEGs for all known transcription factor binding sites and the comparison with a human genome-wide screening of Peroxisome Proliferating Response Elements (PPRE), enabled us to identify, 20 new potential candidate genes that show, both binding site, both change in expression in the condition studied. Lastly, we found a non random localization of the differentially expressed genes in the genome. Conclusion The results presented are potentially of great interest to resume the currently available expression data, exploiting the power of in silico analysis filtered by evolutionary conservation. The analysis enabled us to indicate potential gene candidates that could fill in the gaps with regards to the signalling of PPARα and, moreover, the non-random localization of the differentially expressed genes in the genome, suggest that epigenetic mechanisms are of importance in the regulation of the transcription operated by PPARα. PMID:20003344
Fault tolerance in protein interaction networks: stable bipartite subgraphs and redundant pathways.

PubMed

Brady, Arthur; Maxwell, Kyle; Daniels, Noah; Cowen, Lenore J

2009-01-01

As increasing amounts of high-throughput data for the yeast interactome become available, more system-wide properties are uncovered. One interesting question concerns the fault tolerance of protein interaction networks: whether there exist alternative pathways that can perform some required function if a gene essential to the main mechanism is defective, absent or suppressed. A signature pattern for redundant pathways is the BPM (between-pathway model) motif, introduced by Kelley and Ideker. Past methods proposed to search the yeast interactome for BPM motifs have had several important limitations. First, they have been driven heuristically by local greedy searches, which can lead to the inclusion of extra genes that may not belong in the motif; second, they have been validated solely by functional coherence of the putative pathways using GO enrichment, making it difficult to evaluate putative BPMs in the absence of already known biological annotation. We introduce stable bipartite subgraphs, and show they form a clean and efficient way of generating meaningful BPMs which naturally discard extra genes included by local greedy methods. We show by GO enrichment measures that our BPM set outperforms previous work, covering more known complexes and functional pathways. Perhaps most importantly, since our BPMs are initially generated by examining the genetic-interaction network only, the location of edges in the protein-protein physical interaction network can then be used to statistically validate each candidate BPM, even with sparse GO annotation (or none at all). We uncover some interesting biological examples of previously unknown putative redundant pathways in such areas as vesicle-mediated transport and DNA repair.
Fault Tolerance in Protein Interaction Networks: Stable Bipartite Subgraphs and Redundant Pathways

PubMed Central

Brady, Arthur; Maxwell, Kyle; Daniels, Noah; Cowen, Lenore J.

2009-01-01

As increasing amounts of high-throughput data for the yeast interactome become available, more system-wide properties are uncovered. One interesting question concerns the fault tolerance of protein interaction networks: whether there exist alternative pathways that can perform some required function if a gene essential to the main mechanism is defective, absent or suppressed. A signature pattern for redundant pathways is the BPM (between-pathway model) motif, introduced by Kelley and Ideker. Past methods proposed to search the yeast interactome for BPM motifs have had several important limitations. First, they have been driven heuristically by local greedy searches, which can lead to the inclusion of extra genes that may not belong in the motif; second, they have been validated solely by functional coherence of the putative pathways using GO enrichment, making it difficult to evaluate putative BPMs in the absence of already known biological annotation. We introduce stable bipartite subgraphs, and show they form a clean and efficient way of generating meaningful BPMs which naturally discard extra genes included by local greedy methods. We show by GO enrichment measures that our BPM set outperforms previous work, covering more known complexes and functional pathways. Perhaps most importantly, since our BPMs are initially generated by examining the genetic-interaction network only, the location of edges in the protein-protein physical interaction network can then be used to statistically validate each candidate BPM, even with sparse GO annotation (or none at all). We uncover some interesting biological examples of previously unknown putative redundant pathways in such areas as vesicle-mediated transport and DNA repair. PMID:19399174
Machine Learning-Assisted Network Inference Approach to Identify a New Class of Genes that Coordinate the Functionality of Cancer Networks.

PubMed

Ghanat Bari, Mehrab; Ung, Choong Yong; Zhang, Cheng; Zhu, Shizhen; Li, Hu

2017-08-01

Emerging evidence indicates the existence of a new class of cancer genes that act as "signal linkers" coordinating oncogenic signals between mutated and differentially expressed genes. While frequently mutated oncogenes and differentially expressed genes, which we term Class I cancer genes, are readily detected by most analytical tools, the new class of cancer-related genes, i.e., Class II, escape detection because they are neither mutated nor differentially expressed. Given this hypothesis, we developed a Machine Learning-Assisted Network Inference (MALANI) algorithm, which assesses all genes regardless of expression or mutational status in the context of cancer etiology. We used 8807 expression arrays, corresponding to 9 cancer types, to build more than 2 × 10 8 Support Vector Machine (SVM) models for reconstructing a cancer network. We found that ~3% of ~19,000 not differentially expressed genes are Class II cancer gene candidates. Some Class II genes that we found, such as SLC19A1 and ATAD3B, have been recently reported to associate with cancer outcomes. To our knowledge, this is the first study that utilizes both machine learning and network biology approaches to uncover Class II cancer genes in coordinating functionality in cancer networks and will illuminate our understanding of how genes are modulated in a tissue-specific network contribute to tumorigenesis and therapy development.
Comprehensive genomic profiles of small cell lung cancer

PubMed Central

George, Julie; Lim, Jing Shan; Jang, Se Jin; Cun, Yupeng; Ozretić, Luka; Kong, Gu; Leenders, Frauke; Lu, Xin; Fernández-Cuesta, Lynnette; Bosco, Graziella; Müller, Christian; Dahmen, Ilona; Jahchan, Nadine S.; Park, Kwon-Sik; Yang, Dian; Karnezis, Anthony N.; Vaka, Dedeepya; Torres, Angela; Wang, Maia Segura; Korbel, Jan O.; Menon, Roopika; Chun, Sung-Min; Kim, Deokhoon; Wilkerson, Matt; Hayes, Neil; Engelmann, David; Pützer, Brigitte; Bos, Marc; Michels, Sebastian; Vlasic, Ignacija; Seidel, Danila; Pinther, Berit; Schaub, Philipp; Becker, Christian; Altmüller, Janine; Yokota, Jun; Kohno, Takashi; Iwakawa, Reika; Tsuta, Koji; Noguchi, Masayuki; Muley, Thomas; Hoffmann, Hans; Schnabel, Philipp A.; Petersen, Iver; Chen, Yuan; Soltermann, Alex; Tischler, Verena; Choi, Chang-min; Kim, Yong-Hee; Massion, Pierre P.; Zou, Yong; Jovanovic, Dragana; Kontic, Milica; Wright, Gavin M.; Russell, Prudence A.; Solomon, Benjamin; Koch, Ina; Lindner, Michael; Muscarella, Lucia A.; la Torre, Annamaria; Field, John K.; Jakopovic, Marko; Knezevic, Jelena; Castaños-Vélez, Esmeralda; Roz, Luca; Pastorino, Ugo; Brustugun, Odd-Terje; Lund-Iversen, Marius; Thunnissen, Erik; Köhler, Jens; Schuler, Martin; Botling, Johan; Sandelin, Martin; Sanchez-Cespedes, Montserrat; Salvesen, Helga B.; Achter, Viktor; Lang, Ulrich; Bogus, Magdalena; Schneider, Peter M.; Zander, Thomas; Ansén, Sascha; Hallek, Michael; Wolf, Jürgen; Vingron, Martin; Yatabe, Yasushi; Travis, William D.; Nürnberg, Peter; Reinhardt, Christian; Perner, Sven; Heukamp, Lukas; Büttner, Reinhard; Haas, Stefan A.; Brambilla, Elisabeth; Peifer, Martin; Sage, Julien; Thomas, Roman K.

2016-01-01

We have sequenced the genomes of 110 small cell lung cancers (SCLC), one of the deadliest human cancers. In nearly all the tumours analysed we found bi-allelic inactivation of TP53 and RB1, sometimes by complex genomic rearrangements. Two tumours with wild-type RB1 had evidence of chromothripsis leading to overexpression of cyclin D1 (encoded by the CCND1 gene), revealing an alternative mechanism of Rb1 deregulation. Thus, loss of the tumour suppressors TP53 and RB1 is obligatory in SCLC. We discovered somatic genomic rearrangements of TP73 that create an oncogenic version of this gene, TP73Δex2/3. In rare cases, SCLC tumours exhibited kinase gene mutations, providing a possible therapeutic opportunity for individual patients. Finally, we observed inactivating mutations in NOTCH family genes in 25% of human SCLC. Accordingly, activation of Notch signalling in a pre-clinical SCLC mouse model strikingly reduced the number of tumours and extended the survival of the mutant mice. Furthermore, neuroendocrine gene expression was abrogated by Notch activity in SCLC cells. This first comprehensive study of somatic genome alterations in SCLC uncovers several key biological processes and identifies candidate therapeutic targets in this highly lethal form of cancer. PMID:26168399
Compound heterozygous MYO7A mutations segregating Usher syndrome type 2 in a Han family.

PubMed

Zong, Ling; Chen, Kaitian; Wu, Xuan; Liu, Min; Jiang, Hongyan

2016-11-01

Identification of rare deafness genes for inherited congenital sensorineural hearing impairment remains difficult, because a large variety of genes are implicated. In this study we applied targeted capture and next-generation sequencing to uncover the underlying gene in a three-generation Han family segregating recessive inherited hearing loss and retinitis pigmentosa. After excluding mutations in common deafness genes GJB2, SLC26A4 and the mitochondrial gene, genomic DNA of the proband of a Han family was subjected to targeted next-generation sequencing. The candidate mutations were confirmed by Sanger sequencing and subsequently analyzed with in silico tools. An unreported splice site mutation c.3924+1G > C compound with c.6028G > A in the MYO7A gene were detected to cosegregate with the phenotype in this pedigree. Both mutations, located in the evolutionarily conserved FERM domain in myosin VIIA, were predicted to be pathogenic. In this family, profound sensorineural hearing impairment and retinitis pigmentosa without vestibular disorder, constituted the typical Usher syndrome type 2. Identification of novel mutation in compound heterozygosity in MYO7A gene revealed the genetic origin of Usher syndrome type 2 in this Han family. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
A genome-wide scan for signatures of differential artificial selection in ten cattle breeds.

PubMed

Rothammer, Sophie; Seichter, Doris; Förster, Martin; Medugorac, Ivica

2013-12-21

Since the times of domestication, cattle have been continually shaped by the influence of humans. Relatively recent history, including breed formation and the still enduring enormous improvement of economically important traits, is expected to have left distinctive footprints of selection within the genome. The purpose of this study was to map genome-wide selection signatures in ten cattle breeds and thus improve the understanding of the genome response to strong artificial selection and support the identification of the underlying genetic variants of favoured phenotypes. We analysed 47,651 single nucleotide polymorphisms (SNP) using Cross Population Extended Haplotype Homozygosity (XP-EHH). We set the significance thresholds using the maximum XP-EHH values of two essentially artificially unselected breeds and found up to 229 selection signatures per breed. Through a confirmation process we verified selection for three distinct phenotypes typical for one breed (polledness in Galloway, double muscling in Blanc-Bleu Belge and red coat colour in Red Holstein cattle). Moreover, we detected six genes strongly associated with known QTL for beef or dairy traits (TG, ABCG2, DGAT1, GH1, GHR and the Casein Cluster) within selection signatures of at least one breed. A literature search for genes lying in outstanding signatures revealed further promising candidate genes. However, in concordance with previous genome-wide studies, we also detected a substantial number of signatures without any yet known gene content. These results show the power of XP-EHH analyses in cattle to discover promising candidate genes and raise the hope of identifying phenotypically important variants in the near future. The finding of plausible functional candidates in some short signatures supports this hope. For instance, MAP2K6 is the only annotated gene of two signatures detected in Galloway and Gelbvieh cattle and is already known to be associated with carcass weight, back fat thickness and marbling score in Korean beef cattle. Based on the confirmation process and literature search we deduce that XP-EHH is able to uncover numerous artificial selection targets in subpopulations of domesticated animals.
The large-scale investigation of gene expression in Leymus chinensis stigmas provides a valuable resource for understanding the mechanisms of poaceae self-incompatibility.

PubMed

Zhou, Qingyuan; Jia, Junting; Huang, Xing; Yan, Xueqing; Cheng, Liqin; Chen, Shuangyan; Li, Xiaoxia; Peng, Xianjun; Liu, Gongshe

2014-05-26

Many Poaceae species show a gametophytic self-incompatibility (GSI) system, which is controlled by at least two independent and multiallelic loci, S and Z. Until currently, the gene products for S and Z were unknown. Grass SI plant stigmas discriminate between pollen grains that land on its surface and support compatible pollen tube growth and penetration into the stigma, whereas recognizing incompatible pollen and thus inhibiting pollination behaviors. Leymus chinensis (Trin.) Tzvel. (sheepgrass) is a Poaceae SI species. A comprehensive analysis of sheepgrass stigma transcriptome may provide valuable information for understanding the mechanism of pollen-stigma interactions and grass SI. The transcript abundance profiles of mature stigmas, mature ovaries and leaves were examined using high-throughput next generation sequencing technology. A comparative transcriptomic analysis of these tissues identified 1,025 specifically or preferentially expressed genes in sheepgrass stigmas. These genes contained a significant proportion of genes predicted to function in cell-cell communication and signal transduction. We identified 111 putative transcription factors (TFs) genes and the most abundant groups were MYB, C2H2, C3H, FAR1, MADS. Comparative analysis of the sheepgrass, rice and Arabidopsis stigma-specific or preferential datasets showed broad similarities and some differences in the proportion of genes in the Gene Ontology (GO) functional categories. Potential SI candidate genes identified in other grasses were also detected in the sheepgrass stigma-specific or preferential dataset. Quantitative real-time PCR experiments validated the expression pattern of stigma preferential genes including homologous grass SI candidate genes. This study represents the first large-scale investigation of gene expression in the stigmas of an SI grass species. We uncovered many notable genes that are potentially involved in pollen-stigma interactions and SI mechanisms, including genes encoding receptor-like protein kinases (RLK), CBL (calcineurin B-like proteins) interacting protein kinases, calcium-dependent protein kinase, expansins, pectinesterase, peroxidases and various transcription factors. The availability of a pool of stigma-specific or preferential genes for L. chinensis offers an opportunity to elucidate the mechanisms of SI in Poaceae.
Linkage and association analysis of obesity traits reveals novel loci and interactions with dietary n-3 fatty acids in an Alaska Native (Yup’ik) population

PubMed Central

Vaughan, Laura Kelly; Wiener, Howard W.; Aslibekyan, Stella; Allison, David B.; Havel, Peter J.; Stanhope, Kimber L.; O’Brien, Diane M.; Hopkins, Scarlett E.; Lemas, Dominick J.; Boyer, Bert B.; Tiwari, Hemant K.

2015-01-01

Objective To identify novel genetic markers of obesity-related traits and to identify gene-diet interactions with n-3 polyunsaturated fatty acid (n-3 PUFA) intake in Yup’ik people. Material and Methods We measured body composition, plasma adipokines and ghrelin in 982 participants enrolled in the Center for Alaska Native Health Research (CANHR) Study. We conducted a genome-wide SNP linkage scan and targeted association analysis, fitting additional models to investigate putative gene-diet interactions. Finally, we performed bioinformatic analysis to uncover likely candidate genes within the identified linkage peaks. Results We observed evidence of linkage for all obesity-related traits, replicating previous results and identifying novel regions of interest for adiponectin (10q26.13-2) and thigh circumference (8q21.11-13). Bioinformatic analysis revealed DOCK1, PTPRE (10q26.13-2) and FABP4 (8q21.11-13) as putative candidate genes in the newly identified regions. Targeted SNP analysis under the linkage peaks identified associations between three SNPs and obesity-related traits: rs1007750 on chromosome 8 and thigh circumference (P=0.0005), rs878953 on chromosome 5 and thigh skinfold (P=0.0004), and rs1596854 on chromosome 11 for waist circumference (P=0.0003). Finally, we showed that n-3 PUFA modified the association between obesity related traits and two additional variants (rs2048417 on chromosome 3 for adiponectin, P for interaction=0.0006 and rs730414 on chromosome 11 for percentage body fat, P for interaction=0.0004). Conclusions This study presents evidence of novel genomic regions and gene-diet interactions that may contribute to the pathophysiology of obesity-related traits among Yup’ik people. PMID:25772781
Linkage and association analysis of obesity traits reveals novel loci and interactions with dietary n-3 fatty acids in an Alaska Native (Yup'ik) population.

PubMed

Vaughan, Laura Kelly; Wiener, Howard W; Aslibekyan, Stella; Allison, David B; Havel, Peter J; Stanhope, Kimber L; O'Brien, Diane M; Hopkins, Scarlett E; Lemas, Dominick J; Boyer, Bert B; Tiwari, Hemant K

2015-06-01

To identify novel genetic markers of obesity-related traits and to identify gene-diet interactions with n-3 polyunsaturated fatty acid (n-3 PUFA) intake in Yup'ik people. We measured body composition, plasma adipokines and ghrelin in 982 participants enrolled in the Center for Alaska Native Health Research (CANHR) Study. We conducted a genome-wide SNP linkage scan and targeted association analysis, fitting additional models to investigate putative gene-diet interactions. Finally, we performed bioinformatic analysis to uncover likely candidate genes within the identified linkage peaks. We observed evidence of linkage for all obesity-related traits, replicating previous results and identifying novel regions of interest for adiponectin (10q26.13-2) and thigh circumference (8q21.11-13). Bioinformatic analysis revealed DOCK1, PTPRE (10q26.13-2) and FABP4 (8q21.11-13) as putative candidate genes in the newly identified regions. Targeted SNP analysis under the linkage peaks identified associations between three SNPs and obesity-related traits: rs1007750 on chromosome 8 and thigh circumference (P=0.0005), rs878953 on chromosome 5 and thigh skinfold (P=0.0004), and rs1596854 on chromosome 11 for waist circumference (P=0.0003). Finally, we showed that n-3 PUFA modified the association between obesity related traits and two additional variants (rs2048417 on chromosome 3 for adiponectin, P for interaction=0.0006 and rs730414 on chromosome 11 for percentage body fat, P for interaction=0.0004). This study presents evidence of novel genomic regions and gene-diet interactions that may contribute to the pathophysiology of obesity-related traits among Yup'ik people. Copyright © 2015 Elsevier Inc. All rights reserved.
Application of Whole Exome Sequencing in Six Families with an Initial Diagnosis of Autosomal Dominant Retinitis Pigmentosa: Lessons Learned

PubMed Central

Fernandez-San Jose, Patricia; Liu, Yichuan; March, Michael; Pellegrino, Renata; Golhar, Ryan; Corton, Marta; Blanco-Kelly, Fiona; López-Molina, Maria Isabel; García-Sandoval, Blanca; Guo, Yiran; Tian, Lifeng; Liu, Xuanzhu; Guan, Liping; Zhang, Jianguo; Keating, Brendan; Xu, Xun

2015-01-01

This study aimed to identify the genetics underlying dominant forms of inherited retinal dystrophies using whole exome sequencing (WES) in six families extensively screened for known mutations or genes. Thirty-eight individuals were subjected to WES. Causative variants were searched among single nucleotide variants (SNVs) and insertion/deletion variants (indels) and whenever no potential candidate emerged, copy number variant (CNV) analysis was performed. Variants or regions harboring a candidate variant were prioritized and segregation of the variant with the disease was further assessed using Sanger sequencing in case of SNVs and indels, and quantitative PCR (qPCR) for CNVs. SNV and indel analysis led to the identification of a previously reported mutation in PRPH2. Two additional mutations linked to different forms of retinal dystrophies were identified in two families: a known frameshift deletion in RPGR, a gene responsible for X-linked retinitis pigmentosa and p.Ser163Arg in C1QTNF5 associated with Late-Onset Retinal Degeneration. A novel heterozygous deletion spanning the entire region of PRPF31 was also identified in the affected members of a fourth family, which was confirmed with qPCR. This study allowed the identification of the genetic cause of the retinal dystrophy and the establishment of a correct diagnosis in four families, including a large heterozygous deletion in PRPF31, typically considered one of the pitfalls of this method. Since all findings in this study are restricted to known genes, we propose that targeted sequencing using gene-panel is an optimal first approach for the genetic screening and that once known genetic causes are ruled out, WES might be used to uncover new genes involved in inherited retinal dystrophies. PMID:26197217
A Screen in Mice Uncovers Repression of Lipoprotein Lipase by MicroRNA-29a as a Mechanism for Lipid Distribution Away From the Liver

PubMed Central

Mattis, Aras N.; Song, Guisheng; Hitchner, Kelly; Kim, Roy Y.; Lee, Andrew Y.; Sharma, Amar D.; Malato, Yann; McManus, Michael T.; Esau, Christine C.; Koller, Erich; Koliwad, Suneil; Lim, Lee P.; Maher, Jacquelyn J.; Raffai, Robert L.; Willenbring, Holger

2015-01-01

Identification of microRNAs (miRNAs) that regulate lipid metabolism is important to advance the understanding and treatment of some of the most common human diseases. In the liver, a few key miRNAs have been reported that regulate lipid metabolism, but since many genes contribute to hepatic lipid metabolism, we hypothesized that other such miRNAs exist. To identify genes repressed by miRNAs in mature hepatocytes in vivo, we injected adult mice carrying floxed Dicer1 alleles with an adenoassociated viral vector expressing Cre recombinase specifically in hepatocytes. By inactivating Dicer in adult quiescent hepatocytes we avoided the hepatocyte injury and regeneration observed in previous mouse models of global miRNA deficiency in hepatocytes. Next, we combined gene and miRNA expression profiling to identify candidate gene/miRNA interactions involved in hepatic lipid metabolism, and validated their function in vivo using antisense oligonucleotides. A candidate gene that emerged from our screen was lipoprotein lipase (Lpl), which encodes an enzyme that facilitates cellular uptake of lipids from the circulation. Unlike in energy-dependent cells like myocytes, Lpl is normally repressed in adult hepatocytes. We identified miR-29a as the miRNA responsible for repressing Lpl in hepatocytes, and found that decreasing hepatic miR-29a levels causes lipids to accumulate in mouse livers. Conclusion Our screen suggests several new miRNAs are regulators of hepatic lipid metabolism. We show that one of these, miR-29a, contributes to physiological lipid distribution away from the liver and protects hepatocytes from steatosis. Our results, together with miR-29a’s known anti-fibrotic effect, suggest miR-29a is a therapeutic target in fatty liver disease. PMID:25131933
Temporal transcriptional logic of dynamic regulatory networks underlying nitrogen signaling and use in plants.

PubMed

Varala, Kranthi; Marshall-Colón, Amy; Cirrone, Jacopo; Brooks, Matthew D; Pasquino, Angelo V; Léran, Sophie; Mittal, Shipra; Rock, Tara M; Edwards, Molly B; Kim, Grace J; Ruffel, Sandrine; McCombie, W Richard; Shasha, Dennis; Coruzzi, Gloria M

2018-06-19

This study exploits time, the relatively unexplored fourth dimension of gene regulatory networks (GRNs), to learn the temporal transcriptional logic underlying dynamic nitrogen (N) signaling in plants. Our "just-in-time" analysis of time-series transcriptome data uncovered a temporal cascade of cis elements underlying dynamic N signaling. To infer transcription factor (TF)-target edges in a GRN, we applied a time-based machine learning method to 2,174 dynamic N-responsive genes. We experimentally determined a network precision cutoff, using TF-regulated genome-wide targets of three TF hubs (CRF4, SNZ, and CDF1), used to "prune" the network to 155 TFs and 608 targets. This network precision was reconfirmed using genome-wide TF-target regulation data for four additional TFs (TGA1, HHO5/6, and PHL1) not used in network pruning. These higher-confidence edges in the GRN were further filtered by independent TF-target binding data, used to calculate a TF "N-specificity" index. This refined GRN identifies the temporal relationship of known/validated regulators of N signaling (NLP7/8, TGA1/4, NAC4, HRS1, and LBD37/38/39) and 146 additional regulators. Six TFs-CRF4, SNZ, CDF1, HHO5/6, and PHL1-validated herein regulate a significant number of genes in the dynamic N response, targeting 54% of N-uptake/assimilation pathway genes. Phenotypically, inducible overexpression of CRF4 in planta regulates genes resulting in altered biomass, root development, and 15 NO 3 - uptake, specifically under low-N conditions. This dynamic N-signaling GRN now provides the temporal "transcriptional logic" for 155 candidate TFs to improve nitrogen use efficiency with potential agricultural applications. Broadly, these time-based approaches can uncover the temporal transcriptional logic for any biological response system in biology, agriculture, or medicine. Copyright © 2018 the Author(s). Published by PNAS.
Genome-Wide Variation Patterns Uncover the Origin and Selection in Cultivated Ginseng (Panax ginseng Meyer)

PubMed Central

Li, Ming-Rui; Shi, Feng-Xue; Li, Ya-Ling; Jiang, Peng; Jiao, Lili

2017-01-01

Abstract Chinese ginseng (Panax ginseng Meyer) is a medicinally important herb and plays crucial roles in traditional Chinese medicine. Pharmacological analyses identified diverse bioactive components from Chinese ginseng. However, basic biological attributes including domestication and selection of the ginseng plant remain under-investigated. Here, we presented a genome-wide view of the domestication and selection of cultivated ginseng based on the whole genome data. A total of 8,660 protein-coding genes were selected for genome-wide scanning of the 30 wild and cultivated ginseng accessions. In complement, the 45s rDNA, chloroplast and mitochondrial genomes were included to perform phylogenetic and population genetic analyses. The observed spatial genetic structure between northern cultivated ginseng (NCG) and southern cultivated ginseng (SCG) accessions suggested multiple independent origins of cultivated ginseng. Genome-wide scanning further demonstrated that NCG and SCG have undergone distinct selection pressures during the domestication process, with more genes identified in the NCG (97 genes) than in the SCG group (5 genes). Functional analyses revealed that these genes are involved in diverse pathways, including DNA methylation, lignin biosynthesis, and cell differentiation. These findings suggested that the SCG and NCG groups have distinct demographic histories. Candidate genes identified are useful for future molecular breeding of cultivated ginseng. PMID:28922794
Morphological Characters and Transcriptome Profiles Associated with Black Skin and Red Skin in Crimson Snapper (Lutjanus erythropterus)

PubMed Central

Zhang, Yan-Ping; Wang, Zhong-Duo; Guo, Yu-Song; Liu, Li; Yu, Juan; Zhang, Shun; Liu, Shao-Jun; Liu, Chu-Wu

2015-01-01

In this study, morphology observation and illumina sequencing were performed on two different coloration skins of crimson snapper (Lutjanus erythropterus), the black zone and the red zone. Three types of chromatophores, melanophores, iridophores and xanthophores, were organized in the skins. The main differences between the two colorations were in the amount and distribution of the three chromatophores. After comparing the two transcriptomes, 9200 unigenes with significantly different expressions (ratio change ≥ 2 and q-value ≤ 0.05) were found, of which 5972 were up-regulated in black skin and 3228 were up-regulated in red skin. Through the function annotation, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis of the differentially transcribed genes, we excavated a number of uncharacterized candidate pigment genes as well as found the conserved genes affecting pigmentation in crimson snapper. The patterns of expression of 14 pigment genes were confirmed by the Quantitative real-time PCR analysis between the two color skins. Overall, this study shows a global survey of the morphological characters and transcriptome analysis of the different coloration skins in crimson snapper, and provides valuable cellular and genetic information to uncover the mechanism of the formation of pigment patterns in snappers. PMID:26569232
Morphological Characters and Transcriptome Profiles Associated with Black Skin and Red Skin in Crimson Snapper (Lutjanus erythropterus).

PubMed

Zhang, Yan-Ping; Wang, Zhong-Duo; Guo, Yu-Song; Liu, Li; Yu, Juan; Zhang, Shun; Liu, Shao-Jun; Liu, Chu-Wu

2015-11-12

In this study, morphology observation and illumina sequencing were performed on two different coloration skins of crimson snapper (Lutjanus erythropterus), the black zone and the red zone. Three types of chromatophores, melanophores, iridophores and xanthophores, were organized in the skins. The main differences between the two colorations were in the amount and distribution of the three chromatophores. After comparing the two transcriptomes, 9200 unigenes with significantly different expressions (ratio change ≥ 2 and q-value ≤ 0.05) were found, of which 5972 were up-regulated in black skin and 3228 were up-regulated in red skin. Through the function annotation, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis of the differentially transcribed genes, we excavated a number of uncharacterized candidate pigment genes as well as found the conserved genes affecting pigmentation in crimson snapper. The patterns of expression of 14 pigment genes were confirmed by the Quantitative real-time PCR analysis between the two color skins. Overall, this study shows a global survey of the morphological characters and transcriptome analysis of the different coloration skins in crimson snapper, and provides valuable cellular and genetic information to uncover the mechanism of the formation of pigment patterns in snappers.
Transcriptional and phenotypic comparisons of Ppara knockout and siRNA knockdown mice

PubMed Central

De Souza, Angus T.; Dai, Xudong; Spencer, Andrew G.; Reppen, Tom; Menzie, Ann; Roesch, Paula L.; He, Yudong; Caguyong, Michelle J.; Bloomer, Sherri; Herweijer, Hans; Wolff, Jon A.; Hagstrom, James E.; Lewis, David L.; Linsley, Peter S.; Ulrich, Roger G.

2006-01-01

RNA interference (RNAi) has great potential as a tool for studying gene function in mammals. However, the specificity and magnitude of the in vivo response to RNAi remains to be fully characterized. A molecular and phenotypic comparison of a genetic knockout mouse and the corresponding knockdown version would help clarify the utility of the RNAi approach. Here, we used hydrodynamic delivery of small interfering RNA (siRNA) to knockdown peroxisome proliferator activated receptor alpha (Ppara), a gene that is central to the regulation of fatty acid metabolism. We found that Ppara knockdown in the liver results in a transcript profile and metabolic phenotype that is comparable to those of Ppara−/− mice. Combining the profiles from mice treated with the PPARα agonist fenofibrate, we confirmed the specificity of the RNAi response and identified candidate genes proximal to PPARα regulation. Ppara knockdown animals developed hypoglycemia and hypertriglyceridemia, phenotypes observed in Ppara−/− mice. In contrast to Ppara−/− mice, fasting was not required to uncover these phenotypes. Together, these data validate the utility of the RNAi approach and suggest that siRNA can be used as a complement to classical knockout technology in gene function studies. PMID:16945951
Comparative Transcriptome Analyses Uncover Key Candidate Genes Mediating Flight Capacity in Bactrocera dorsalis (Hendel) and Bactrocera correcta (Bezzi) (Diptera: Tephritidae).

PubMed

Guo, Shaokun; Zhao, Zihua; Liu, Lijun; Li, Zhihong; Shen, Jie

2018-01-30

Flight capacity is important for invasive pests during entry, establishment and spreading. Both Bactrocera dorsalis Hendel and Bactrocera correcta Bezzi are invasive fruit flies but their flight capacities differ. Here, a tethered flight mill test demonstrated that B. dorsalis exhibits a greater flight capacity than B. correcta . RNA-Seq was used to determine the transcriptomic differences associated with the flight capacity of two Bactrocera species. Transcriptome data showed that 6392 unigenes were differentially expressed between the two species in the larval stage, whereas in the adult stage, 4104 differentially expressed genes (DEGs) were identified in females, and 3445 DEGs were observed in males. The flight capacity appeared to be correlated with changes in the transcriptional levels of genes involved in wing formation, flight muscle structure, energy metabolism, and hormonal control. Using RNA interference (RNAi) to verify the function of one DEG, the epidermal growth factor receptor ( EGFR ), we confirmed the role of this gene in regulating wing development, and thereby flight capacity, in both species. This work reveals the flight mechanism of fruit flies and provides insight into fundamental transcriptomics for further studies on the flight performance of insects.
Analysis of transcriptome in hickory (Carya cathayensis), and uncover the dynamics in the hormonal signaling pathway during graft process.

PubMed

Qiu, Lingling; Jiang, Bo; Fang, Jia; Shen, Yike; Fang, Zhongxiang; Rm, Saravana Kumar; Yi, Keke; Shen, Chenjia; Yan, Daoliang; Zheng, Bingsong

2016-11-17

Hickory (Carya cathayensis), a woody plant with high nutritional and economic value, is widely planted in China. Due to its long juvenile phase, grafting is a useful technique for large-scale cultivation of hickory. To reveal the molecular mechanism during the graft process, we sequenced the transcriptomes of graft union in hickory. In our study, six RNA-seq libraries yielded a total of 83,676,860 clean short reads comprising 4.19 Gb of sequence data. A large number of differentially expressed genes (DEGs) at three time points during the graft process were identified. In detail, 777 DEGs in the 7 d vs 0 d (day after grafting) comparison were classified into 11 enriched Gene Ontology (GO) categories, and 262 DEGs in the 14 d vs 0 d comparison were classified into 15 enriched GO categories. Furthermore, an overview of the PPI network was constructed by these DEGs. In addition, 20 genes related to the auxin-and cytokinin-signaling pathways were identified, and some were validated by qRT-PCR analysis. Our comprehensive analysis provides basic information on the candidate genes and hormone signaling pathways involved in the graft process in hickory and other woody plants.
Comparative Transcriptome Analyses Uncover Key Candidate Genes Mediating Flight Capacity in Bactrocera dorsalis (Hendel) and Bactrocera correcta (Bezzi) (Diptera: Tephritidae)

PubMed Central

Zhao, Zihua; Liu, Lijun; Li, Zhihong; Shen, Jie

2018-01-01

Flight capacity is important for invasive pests during entry, establishment and spreading. Both Bactrocera dorsalis Hendel and Bactrocera correcta Bezzi are invasive fruit flies but their flight capacities differ. Here, a tethered flight mill test demonstrated that B. dorsalis exhibits a greater flight capacity than B. correcta. RNA-Seq was used to determine the transcriptomic differences associated with the flight capacity of two Bactrocera species. Transcriptome data showed that 6392 unigenes were differentially expressed between the two species in the larval stage, whereas in the adult stage, 4104 differentially expressed genes (DEGs) were identified in females, and 3445 DEGs were observed in males. The flight capacity appeared to be correlated with changes in the transcriptional levels of genes involved in wing formation, flight muscle structure, energy metabolism, and hormonal control. Using RNA interference (RNAi) to verify the function of one DEG, the epidermal growth factor receptor (EGFR), we confirmed the role of this gene in regulating wing development, and thereby flight capacity, in both species. This work reveals the flight mechanism of fruit flies and provides insight into fundamental transcriptomics for further studies on the flight performance of insects. PMID:29385681

Genetics of primary ovarian insufficiency: new developments and opportunities

PubMed Central

Qin, Yingying; Jiao, Xue; Simpson, Joe Leigh; Chen, Zi-Jiang

2015-01-01

BACKGROUND Primary ovarian insufficiency (POI) is characterized by marked heterogeneity, but with a significant genetic contribution. Identifying exact causative genes has been challenging, with many discoveries not replicated. It is timely to take stock of the field, outlining the progress made, framing the controversies and anticipating future directions in elucidating the genetics of POI. METHODS A search for original articles published up to May 2015 was performed using PubMed and Google Scholar, identifying studies on the genetic etiology of POI. Studies were included if chromosomal analysis, candidate gene screening and a genome-wide study were conducted. Articles identified were restricted to English language full-text papers. RESULTS Chromosomal abnormalities have long been recognized as a frequent cause of POI, with a currently estimated prevalence of 10–13%. Using the traditional karyotype methodology, monosomy X, mosaicism, X chromosome deletions and rearrangements, X-autosome translocations, and isochromosomes have been detected. Based on candidate gene studies, single gene perturbations unequivocally having a deleterious effect in at least one population include Bone morphogenetic protein 15 (BMP15), Progesterone receptor membrane component 1 (PGRMC1), and Fragile X mental retardation 1 (FMR1) premutation on the X chromosome; Growth differentiation factor 9 (GDF9), Folliculogenesis specific bHLH transcription factor (FIGLA), Newborn ovary homeobox gene (NOBOX), Nuclear receptor subfamily 5, group A, member 1 (NR5A1) and Nanos homolog 3 (NANOS3) seem likely as well, but mostly being found in no more than 1–2% of a single population studied. Whole genome approaches have utilized genome-wide association studies (GWAS) to reveal loci not predicted on the basis of a candidate gene, but it remains difficult to locate causative genes and susceptible loci were not always replicated. Cytogenomic methods (array CGH) have identified other regions of interest but studies have not shown consistent results, the resolution of arrays has varied and replication is uncommon. Whole-exome sequencing in non-syndromic POI kindreds has only recently begun, revealing mutations in the Stromal antigen 3 (STAG3), Synaptonemal complex central element 1 (SYCE1), minichromosome maintenance complex component 8 and 9 (MCM8, MCM9) and ATP-dependent DNA helicase homolog (HFM1) genes. Given the slow progress in candidate-gene analysis and relatively small sample sizes available for GWAS, family-based whole exome and whole genome sequencing appear to be the most promising approaches for detecting potential genes responsible for POI. CONCLUSION Taken together, the cytogenetic, cytogenomic (array CGH) and exome sequencing approaches have revealed a genetic causation in ∼20–25% of POI cases. Uncovering the remainder of the causative genes will be facilitated not only by whole genome approaches involving larger cohorts in multiple populations but also incorporating environmental exposures and exploring signaling pathways in intragenic and intergenic regions that point to perturbations in regulatory genes and networks. PMID:26243799
Genetics of primary ovarian insufficiency: new developments and opportunities.

PubMed

Qin, Yingying; Jiao, Xue; Simpson, Joe Leigh; Chen, Zi-Jiang

2015-01-01

Primary ovarian insufficiency (POI) is characterized by marked heterogeneity, but with a significant genetic contribution. Identifying exact causative genes has been challenging, with many discoveries not replicated. It is timely to take stock of the field, outlining the progress made, framing the controversies and anticipating future directions in elucidating the genetics of POI. A search for original articles published up to May 2015 was performed using PubMed and Google Scholar, identifying studies on the genetic etiology of POI. Studies were included if chromosomal analysis, candidate gene screening and a genome-wide study were conducted. Articles identified were restricted to English language full-text papers. Chromosomal abnormalities have long been recognized as a frequent cause of POI, with a currently estimated prevalence of 10-13%. Using the traditional karyotype methodology, monosomy X, mosaicism, X chromosome deletions and rearrangements, X-autosome translocations, and isochromosomes have been detected. Based on candidate gene studies, single gene perturbations unequivocally having a deleterious effect in at least one population include Bone morphogenetic protein 15 (BMP15), Progesterone receptor membrane component 1 (PGRMC1), and Fragile X mental retardation 1 (FMR1) premutation on the X chromosome; Growth differentiation factor 9 (GDF9), Folliculogenesis specific bHLH transcription factor (FIGLA), Newborn ovary homeobox gene (NOBOX), Nuclear receptor subfamily 5, group A, member 1 (NR5A1) and Nanos homolog 3 (NANOS3) seem likely as well, but mostly being found in no more than 1-2% of a single population studied. Whole genome approaches have utilized genome-wide association studies (GWAS) to reveal loci not predicted on the basis of a candidate gene, but it remains difficult to locate causative genes and susceptible loci were not always replicated. Cytogenomic methods (array CGH) have identified other regions of interest but studies have not shown consistent results, the resolution of arrays has varied and replication is uncommon. Whole-exome sequencing in non-syndromic POI kindreds has only recently begun, revealing mutations in the Stromal antigen 3 (STAG3), Synaptonemal complex central element 1 (SYCE1), minichromosome maintenance complex component 8 and 9 (MCM8, MCM9) and ATP-dependent DNA helicase homolog (HFM1) genes. Given the slow progress in candidate-gene analysis and relatively small sample sizes available for GWAS, family-based whole exome and whole genome sequencing appear to be the most promising approaches for detecting potential genes responsible for POI. Taken together, the cytogenetic, cytogenomic (array CGH) and exome sequencing approaches have revealed a genetic causation in ∼20-25% of POI cases. Uncovering the remainder of the causative genes will be facilitated not only by whole genome approaches involving larger cohorts in multiple populations but also incorporating environmental exposures and exploring signaling pathways in intragenic and intergenic regions that point to perturbations in regulatory genes and networks. © The Author 2015. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology.
Adrenocortical Expression Profiling of Cattle with Distinct Juvenile Temperament Types.

PubMed

Friedrich, Juliane; Brand, Bodo; Graunke, Katharina Luise; Langbein, Jan; Schwerin, Manfred; Ponsuksili, Siriluck

2017-01-01

Temperament affects ease of handling, animal welfare, and economically important production traits in cattle. The use of gene expression profiles as molecular traits provides a novel means of gaining insight into behavioural genetics. In this study, differences in adrenocortical expression profiles between 60 F 2 cows (Charolais × German Holstein) of distinct temperament types were analysed. The cows were assessed in a novel-human test at an age of 90 days. Most of the adrenal cortex transcripts which were differentially expressed (FDR <0.05) were found between temperament types of 'fearful/neophobic-alert' and all other temperament types. These transcripts belong to several biological functions like NRF2-mediated oxidative stress response, Glucocorticoid Receptor Signalling and Complement System. Overall, the present study provides new insight into transcriptional differences in the adrenal cortex between cows of distinct temperament types. Genetic regulations of such molecular traits facilitate uncovering positional and functional gene candidates for temperament type in cattle.
Comparative Analysis of Argonaute-dependent Small RNA Pathways in Drosophila

PubMed Central

Zhou, Rui; Hotta, Ikuko; Denli, Ahmet M.; Hong, Pengyu; Perrimon, Norbert; Hannon, Gregory J.

2008-01-01

Summary The specificity of RNAi pathways is determined by several classes of small RNAs, which include siRNAs, piRNAs, endo-siRNAs, and microRNAs (miRNAs). These small RNAs are invariably incorporated into large Argonaute (Ago)-containing effector complexes known as RNA-induced silencing complexes (RISCs), which they guide to silencing targets. Both genetic and biochemical strategies have yielded conserved molecular components of small RNA biogenesis and effector machineries. However, given the complexity of these pathways, there are likely to be additional components and regulators that remain to be uncovered. We have undertaken a comparative and comprehensive RNAi screen to identify genes that impact three major Ago-dependent small RNA pathways that operate in Drosophila S2 cells. We identify subsets of candidates that act positively or negatively in siRNA, endo-siRNA and miRNA pathways. Our studies indicate that many components are shared among all three Argonaute-dependent silencing pathways, though each is also impacted by discrete sets of genes. PMID:19026789
Rare variants in axonogenesis genes connect three families with sound-color synesthesia.

PubMed

Tilot, Amanda K; Kucera, Katerina S; Vino, Arianna; Asher, Julian E; Baron-Cohen, Simon; Fisher, Simon E

2018-03-20

Synesthesia is a rare nonpathological phenomenon where stimulation of one sense automatically provokes a secondary perception in another. Hypothesized to result from differences in cortical wiring during development, synesthetes show atypical structural and functional neural connectivity, but the underlying molecular mechanisms are unknown. The trait also appears to be more common among people with autism spectrum disorder and savant abilities. Previous linkage studies searching for shared loci of large effect size across multiple families have had limited success. To address the critical lack of candidate genes, we applied whole-exome sequencing to three families with sound-color (auditory-visual) synesthesia affecting multiple relatives across three or more generations. We identified rare genetic variants that fully cosegregate with synesthesia in each family, uncovering 37 genes of interest. Consistent with reports indicating genetic heterogeneity, no variants were shared across families. Gene ontology analyses highlighted six genes- COL4A1 , ITGA2 , MYO10 , ROBO3 , SLC9A6 , and SLIT2 -associated with axonogenesis and expressed during early childhood when synesthetic associations are formed. These results are consistent with neuroimaging-based hypotheses about the role of hyperconnectivity in the etiology of synesthesia and offer a potential entry point into the neurobiology that organizes our sensory experiences. Copyright © 2018 the Author(s). Published by PNAS.
Sex determination in papaya.

PubMed

Ming, Ray; Yu, Qingyi; Moore, Paul H

2007-06-01

Sex determination is an intriguing system in trioecious papaya. Over the past seven decades various hypotheses, based on the knowledge and information available at the time, have been proposed to explain the genetics of the papaya's sex determination. These include a single gene with three alleles, a group of closely linked genes, a genic balance of sex chromosome over autosomes, classical XY chromosomes, and regulatory elements of the flower development pathway. Recent advancements in genomic technology make it possible to characterize the genomic region involved in sex determination at the molecular level. High density linkage mapping validated the hypothesis that predicted recombination suppression at the sex determination locus. Physical mapping and sample sequencing of the non-recombination region led to the conclusion that sex determination is controlled by a pair of primitive sex chromosomes with a small male-specific region (MSY) of the Y chromosome. We now postulate that two sex determination genes control the sex determination pathway. One, a feminizing or stamen suppressor gene, causes stamen abortion before or at flower inception while the other, a masculinizing or carpel suppressor gene, causes carpel abortion at a later flower developmental stage. Detailed physical mapping is beginning to reveal structural details about the sex determination region and sequencing is expected to uncover candidate sex determining genes. Cloning of the sex determination genes and understanding the sex determination process could have profound application in papaya production.
A Multi-layered Quantitative In Vivo Expression Atlas of the Podocyte Unravels Kidney Disease Candidate Genes.

PubMed

Rinschen, Markus M; Gödel, Markus; Grahammer, Florian; Zschiedrich, Stefan; Helmstädter, Martin; Kretz, Oliver; Zarei, Mostafa; Braun, Daniela A; Dittrich, Sebastian; Pahmeyer, Caroline; Schroder, Patricia; Teetzen, Carolin; Gee, HeonYung; Daouk, Ghaleb; Pohl, Martin; Kuhn, Elisa; Schermer, Bernhard; Küttner, Victoria; Boerries, Melanie; Busch, Hauke; Schiffer, Mario; Bergmann, Carsten; Krüger, Marcus; Hildebrandt, Friedhelm; Dengjel, Joern; Benzing, Thomas; Huber, Tobias B

2018-05-22

Damage to and loss of glomerular podocytes has been identified as the culprit lesion in progressive kidney diseases. Here, we combine mass spectrometry-based proteomics with mRNA sequencing, bioinformatics, and hypothesis-driven studies to provide a comprehensive and quantitative map of mammalian podocytes that identifies unanticipated signaling pathways. Comparison of the in vivo datasets with proteomics data from podocyte cell cultures showed a limited value of available cell culture models. Moreover, in vivo stable isotope labeling by amino acids uncovered surprisingly rapid synthesis of mitochondrial proteins under steady-state conditions that was perturbed under autophagy-deficient, disease-susceptible conditions. Integration of acquired omics dimensions suggested FARP1 as a candidate essential for podocyte function, which could be substantiated by genetic analysis in humans and knockdown experiments in zebrafish. This work exemplifies how the integration of multi-omics datasets can identify a framework of cell-type-specific features relevant for organ health and disease. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Cancer driver-passenger distinction via sporadic human and dog cancer comparison: a proof of principle study with colorectal cancer

PubMed Central

Tang, Jie; Li, Yaping; Lyon, Kenneth; Camps, Jordi; Dalton, Stephen; Ried, Thomas; Zhao, Shaying

2014-01-01

Herein we report a proof of principle study illustrating a novel dog-human comparison strategy that addresses a central aim of cancer research, namely cancer driver–passenger distinction. We previously demonstrated that sporadic canine colorectal cancers (CRCs) share similar molecular pathogenesis mechanisms as their human counterparts. In this study, we compared the genome-wide copy number abnormalities between 29 human- and 10 canine sporadic CRCs. This led to the identification of 73 driver candidate genes (DCGs), altered in both species and with 27 from the whole genome and 46 from dog-human genomic rearrangement breakpoint (GRB) regions, as well as 38 passenger candidate genes (PCGs), altered in humans only and located in GRB regions. We noted that DCGs significantly differ from PCGs in every analysis conducted to assess their cancer relevance and biological functions. Importantly, while PCGs are not enriched in any specific functions, DCGs possess significantly enhanced functionality closely associated with cell proliferation and death regulation, as well as with epithelial cell apicobasal polarity establishment/maintenance. These observations support the notion that, in sporadic CRCs of both species, cell polarity genes not only contribute in preventing cancer cell invasion and spreading, but also likely serve as tumor suppressors by modulating cell growth. This pilot study validates our novel strategy and has uncovered four new potential cell polarity and colorectal tumor suppressor genes (RASA3, NUPL1, DENND5A, and AVL9). Expansion of this study would make more driver-passenger distinctions for cancers with large genomic amplifications or deletions, and address key questions regarding the relationship between cancer pathogenesis and epithelial cell polarity control in mammals. PMID:23416983
Cancer driver-passenger distinction via sporadic human and dog cancer comparison: a proof-of-principle study with colorectal cancer.

PubMed

Tang, J; Li, Y; Lyon, K; Camps, J; Dalton, S; Ried, T; Zhao, S

2014-02-13

Herein we report a proof-of-principle study illustrating a novel dog-human comparison strategy that addresses a central aim of cancer research, namely cancer driver-passenger distinction. We previously demonstrated that sporadic canine colorectal cancers (CRCs) share similar molecular pathogenesis mechanisms as their human counterparts. In this study, we compared the genome-wide copy number abnormalities between 29 human and 10 canine sporadic CRCs. This led to the identification of 73 driver candidate genes (DCGs), altered in both species, and with 27 from the whole genome and 46 from dog-human genomic rearrangement breakpoint (GRB) regions, as well as 38 passenger candidate genes (PCGs), altered in humans only and located in GRB regions. We noted that DCGs significantly differ from PCGs in every analysis conducted to assess their cancer relevance and biological functions. Importantly, although PCGs are not enriched in any specific functions, DCGs possess significantly enhanced functionality closely associated with cell proliferation and death regulation, as well as with epithelial cell apicobasal polarity establishment/maintenance. These observations support the notion that, in sporadic CRCs of both species, cell polarity genes not only contribute in preventing cancer cell invasion and spreading, but also likely serve as tumor suppressors by modulating cell growth. This pilot study validates our novel strategy and has uncovered four new potential cell polarity and colorectal tumor suppressor genes (RASA3, NUPL1, DENND5A and AVL9). Expansion of this study would make more driver-passenger distinctions for cancers with large genomic amplifications or deletions, and address key questions regarding the relationship between cancer pathogenesis and epithelial cell polarity control in mammals.
Characterization of canine osteosarcoma by array comparative genomic hybridization and RT-qPCR: signatures of genomic imbalance in canine osteosarcoma parallel the human counterpart.

PubMed

Angstadt, Andrea Y; Motsinger-Reif, Alison; Thomas, Rachael; Kisseberth, William C; Guillermo Couto, C; Duval, Dawn L; Nielsen, Dahlia M; Modiano, Jaime F; Breen, Matthew

2011-11-01

Osteosarcoma (OS) is the most commonly diagnosed malignant bone tumor in humans and dogs, characterized in both species by extremely complex karyotypes exhibiting high frequencies of genomic imbalance. Evaluation of genomic signatures in human OS using array comparative genomic hybridization (aCGH) has assisted in uncovering genetic mechanisms that result in disease phenotype. Previous low-resolution (10-20 Mb) aCGH analysis of canine OS identified a wide range of recurrent DNA copy number aberrations, indicating extensive genomic instability. In this study, we profiled 123 canine OS tumors by 1 Mb-resolution aCGH to generate a dataset for direct comparison with current data for human OS, concluding that several high frequency aberrations in canine and human OS are orthologous. To ensure complete coverage of gene annotation, we identified the human refseq genes that map to these orthologous aberrant dog regions and found several candidate genes warranting evaluation for OS involvement. Specifically, subsequenct FISH and qRT-PCR analysis of RUNX2, TUSC3, and PTEN indicated that expression levels correlated with genomic copy number status, showcasing RUNX2 as an OS associated gene and TUSC3 as a possible tumor suppressor candidate. Together these data demonstrate the ability of genomic comparative oncology to identify genetic abberations which may be important for OS progression. Large scale screening of genomic imbalance in canine OS further validates the use of the dog as a suitable model for human cancers, supporting the idea that dysregulation discovered in canine cancers will provide an avenue for complementary study in human counterparts. Copyright © 2011 Wiley-Liss, Inc.
Combining experimental evolution with next-generation sequencing: a powerful tool to study adaptation from standing genetic variation.

PubMed

Schlötterer, C; Kofler, R; Versace, E; Tobler, R; Franssen, S U

2015-05-01

Evolve and resequence (E&R) is a new approach to investigate the genomic responses to selection during experimental evolution. By using whole genome sequencing of pools of individuals (Pool-Seq), this method can identify selected variants in controlled and replicable experimental settings. Reviewing the current state of the field, we show that E&R can be powerful enough to identify causative genes and possibly even single-nucleotide polymorphisms. We also discuss how the experimental design and the complexity of the trait could result in a large number of false positive candidates. We suggest experimental and analytical strategies to maximize the power of E&R to uncover the genotype-phenotype link and serve as an important research tool for a broad range of evolutionary questions.
Time-Resolved Transposon Insertion Sequencing Reveals Genome-Wide Fitness Dynamics during Infection.

PubMed

Yang, Guanhua; Billings, Gabriel; Hubbard, Troy P; Park, Joseph S; Yin Leung, Ka; Liu, Qin; Davis, Brigid M; Zhang, Yuanxing; Wang, Qiyao; Waldor, Matthew K

2017-10-03

Transposon insertion sequencing (TIS) is a powerful high-throughput genetic technique that is transforming functional genomics in prokaryotes, because it enables genome-wide mapping of the determinants of fitness. However, current approaches for analyzing TIS data assume that selective pressures are constant over time and thus do not yield information regarding changes in the genetic requirements for growth in dynamic environments (e.g., during infection). Here, we describe structured analysis of TIS data collected as a time series, termed pattern analysis of conditional essentiality (PACE). From a temporal series of TIS data, PACE derives a quantitative assessment of each mutant's fitness over the course of an experiment and identifies mutants with related fitness profiles. In so doing, PACE circumvents major limitations of existing methodologies, specifically the need for artificial effect size thresholds and enumeration of bacterial population expansion. We used PACE to analyze TIS samples of Edwardsiella piscicida (a fish pathogen) collected over a 2-week infection period from a natural host (the flatfish turbot). PACE uncovered more genes that affect E. piscicida 's fitness in vivo than were detected using a cutoff at a terminal sampling point, and it identified subpopulations of mutants with distinct fitness profiles, one of which informed the design of new live vaccine candidates. Overall, PACE enables efficient mining of time series TIS data and enhances the power and sensitivity of TIS-based analyses. IMPORTANCE Transposon insertion sequencing (TIS) enables genome-wide mapping of the genetic determinants of fitness, typically based on observations at a single sampling point. Here, we move beyond analysis of endpoint TIS data to create a framework for analysis of time series TIS data, termed pattern analysis of conditional essentiality (PACE). We applied PACE to identify genes that contribute to colonization of a natural host by the fish pathogen Edwardsiella piscicida. PACE uncovered more genes that affect E. piscicida 's fitness in vivo than were detected using a terminal sampling point, and its clustering of mutants with related fitness profiles informed design of new live vaccine candidates. PACE yields insights into patterns of fitness dynamics and circumvents major limitations of existing methodologies. Finally, the PACE method should be applicable to additional "omic" time series data, including screens based on clustered regularly interspaced short palindromic repeats with Cas9 (CRISPR/Cas9). Copyright © 2017 Yang et al.
Subsurface metagenomes uncover a vast repertoire of hypervariable proteins encoded by genetic elements in uncultivated organisms and viruses

NASA Astrophysics Data System (ADS)

Paul, B. G.; Burstein, D.; Castelle, C. J.; Banfield, J. F.; Valentine, D. L.; Miller, J. F.; Ghosh, P.; Handa, S.; Arambula, D.; Czornyj, E.; Thomas, B. C.

2016-12-01

Uncultivated microorganisms primarily account for the remarkable diversity harbored in subsurface environments and represent an expansive subset of the current Tree of Life. Recent metagenomic efforts to investigate subsurface biomes have unveiled an array of bacterial and archaeal candidate phyla, whose members have minimal genomes and an apparent host-dependent existence. Still, little is known about the adaptive strategies that mediate host interactions in these organisms or their viruses. Genomic features known as diversity-generating retroelements (DGRs), which guide variability into targeted genes, were recently discovered in two single-cell genomes of uncultivated nanoarchaea, and independently in the genome of a marine virus from methane seep sediments. These prodigious drivers of protein hypervariability were first identified as the key force behind phage tail fiber diversification for binding different host receptors. Since their discovery, approximately 500 new DGRs have been found across a wide range of bacterial genomes representing various niches. We identified an unexpected 1136 distinct diversifiers from a single groundwater environment in reconstructed microbial genomes and genome fragments. The newly detected DGRs - predominantly linked to members of the candidate phyla radiation (CPR) - appear to target genes associated with cell-cell attachment, signaling, and transcription regulation. These findings suggest that targeted protein diversification may have an important role in regulating symbiotic or parasitic associations in groundwater microbiomes.
Phylogenetic analysis of IDD gene family and characterization of its expression in response to flower induction in Malus.

PubMed

Fan, Sheng; Zhang, Dong; Xing, Libo; Qi, Siyan; Du, Lisha; Wu, Haiqin; Shao, Hongxia; Li, Youmei; Ma, Juanjuan; Han, Mingyu

2017-08-01

Although INDETERMINATE DOMAIN (IDD) genes encoding specific plant transcription factors have important roles in plant growth and development, little is known about apple IDD (MdIDD) genes and their potential functions in the flower induction. In this study, we identified 20 putative IDD genes in apple and named them according to their chromosomal locations. All identified MdIDD genes shared a conserved IDD domain. A phylogenetic analysis separated MdIDDs and other plant IDD genes into four groups. Bioinformatic analysis of chemical characteristics, gene structure, and prediction of protein-protein interactions demonstrated the functional and structural diversity of MdIDD genes. To further uncover their potential functions, we performed analysis of tandem, synteny, and gene duplications, which indicated several paired homologs of IDD genes between apple and Arabidopsis. Additionally, genome duplications also promoted the expansion and evolution of the MdIDD genes. Quantitative real-time PCR revealed that all the MdIDD genes showed distinct expression levels in five different tissues (stems, leaves, buds, flowers, and fruits). Furthermore, the expression levels of candidate MdIDD genes were also investigated in response to various circumstances, including GA treatment (decreased the flowering rate), sugar treatment (increased the flowering rate), alternate-bearing conditions, and two varieties with different-flowering intensities. Parts of them were affected by exogenous treatments and showed different expression patterns. Additionally, changes in response to alternate-bearing and different-flowering varieties of apple trees indicated that they were also responsive to flower induction. Taken together, our comprehensive analysis provided valuable information for further analysis of IDD genes aiming at flower induction.
The response and recovery of the Arabidopsis thaliana transcriptome to phosphate starvation.

PubMed

Woo, Jongchan; MacPherson, Cameron Ross; Liu, Jun; Wang, Huan; Kiba, Takatoshi; Hannah, Matthew A; Wang, Xiu-Jie; Bajic, Vladimir B; Chua, Nam-Hai

2012-05-03

Over application of phosphate fertilizers in modern agriculture contaminates waterways and disrupts natural ecosystems. Nevertheless, this is a common practice among farmers, especially in developing countries as abundant fertilizers are believed to boost crop yields. The study of plant phosphate metabolism and its underlying genetic pathways is key to discovering methods of efficient fertilizer usage. The work presented here describes a genome-wide resource on the molecular dynamics underpinning the response and recovery in roots and shoots of Arabidopsis thaliana to phosphate-starvation. Genome-wide profiling by micro- and tiling-arrays (accessible from GEO: GSE34004) revealed minimal overlap between root and shoot transcriptomes suggesting two independent phosphate-starvation regulons. Novel gene expression patterns were detected for over 1000 candidates and were classified as either initial, persistent, or latent responders. Comparative analysis to AtGenExpress identified cohorts of genes co-regulated across multiple stimuli. The hormone ABA displayed a dominant role in regulating many phosphate-responsive candidates. Analysis of co-regulation enabled the determination of specific versus generic members of closely related gene families with respect to phosphate-starvation. Thus, among others, we showed that PHR1-regulated members of closely related phosphate-responsive families (PHT1;1, PHT1;7-9, SPX1-3, and PHO1;H1) display greater specificity to phosphate-starvation than their more generic counterparts. Our results uncover much larger, staged responses to phosphate-starvation than previously described. To our knowledge, this work describes the most complete genome-wide data on plant nutrient stress to-date.
Genome-Wide Variation Patterns Uncover the Origin and Selection in Cultivated Ginseng (Panax ginseng Meyer).

PubMed

Li, Ming-Rui; Shi, Feng-Xue; Li, Ya-Ling; Jiang, Peng; Jiao, Lili; Liu, Bao; Li, Lin-Feng

2017-09-01

Chinese ginseng (Panax ginseng Meyer) is a medicinally important herb and plays crucial roles in traditional Chinese medicine. Pharmacological analyses identified diverse bioactive components from Chinese ginseng. However, basic biological attributes including domestication and selection of the ginseng plant remain under-investigated. Here, we presented a genome-wide view of the domestication and selection of cultivated ginseng based on the whole genome data. A total of 8,660 protein-coding genes were selected for genome-wide scanning of the 30 wild and cultivated ginseng accessions. In complement, the 45s rDNA, chloroplast and mitochondrial genomes were included to perform phylogenetic and population genetic analyses. The observed spatial genetic structure between northern cultivated ginseng (NCG) and southern cultivated ginseng (SCG) accessions suggested multiple independent origins of cultivated ginseng. Genome-wide scanning further demonstrated that NCG and SCG have undergone distinct selection pressures during the domestication process, with more genes identified in the NCG (97 genes) than in the SCG group (5 genes). Functional analyses revealed that these genes are involved in diverse pathways, including DNA methylation, lignin biosynthesis, and cell differentiation. These findings suggested that the SCG and NCG groups have distinct demographic histories. Candidate genes identified are useful for future molecular breeding of cultivated ginseng. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Global gene expression analysis using RNA-seq uncovered a new role for SR1/CAMTA3 transcription factor in salt stress

PubMed Central

Prasad, Kasavajhala V. S. K.; Abdel-Hameed, Amira A. E.; Xing, Denghui; Reddy, Anireddy S. N.

2016-01-01

Abiotic and biotic stresses cause significant yield losses in all crops. Acquisition of stress tolerance in plants requires rapid reprogramming of gene expression. SR1/CAMTA3, a member of signal responsive transcription factors (TFs), functions both as a positive and a negative regulator of biotic stress responses and as a positive regulator of cold stress-induced gene expression. Using high throughput RNA-seq, we identified ~3000 SR1-regulated genes. Promoters of about 60% of the differentially expressed genes have a known DNA binding site for SR1, suggesting that they are likely direct targets. Gene ontology analysis of SR1-regulated genes confirmed previously known functions of SR1 and uncovered a potential role for this TF in salt stress. Our results showed that SR1 mutant is more tolerant to salt stress than the wild type and complemented line. Improved tolerance of sr1 seedlings to salt is accompanied with the induction of salt-responsive genes. Furthermore, ChIP-PCR results showed that SR1 binds to promoters of several salt-responsive genes. These results suggest that SR1 acts as a negative regulator of salt tolerance by directly repressing the expression of salt-responsive genes. Overall, this study identified SR1-regulated genes globally and uncovered a previously uncharacterized role for SR1 in salt stress response. PMID:27251464
NetMiner-an ensemble pipeline for building genome-wide and high-quality gene co-expression network using massive-scale RNA-seq samples.

PubMed

Yu, Hua; Jiao, Bingke; Lu, Lu; Wang, Pengfei; Chen, Shuangcheng; Liang, Chengzhi; Liu, Wei

2018-01-01

Accurately reconstructing gene co-expression network is of great importance for uncovering the genetic architecture underlying complex and various phenotypes. The recent availability of high-throughput RNA-seq sequencing has made genome-wide detecting and quantifying of the novel, rare and low-abundance transcripts practical. However, its potential merits in reconstructing gene co-expression network have still not been well explored. Using massive-scale RNA-seq samples, we have designed an ensemble pipeline, called NetMiner, for building genome-scale and high-quality Gene Co-expression Network (GCN) by integrating three frequently used inference algorithms. We constructed a RNA-seq-based GCN in one species of monocot rice. The quality of network obtained by our method was verified and evaluated by the curated gene functional association data sets, which obviously outperformed each single method. In addition, the powerful capability of network for associating genes with functions and agronomic traits was shown by enrichment analysis and case studies. In particular, we demonstrated the potential value of our proposed method to predict the biological roles of unknown protein-coding genes, long non-coding RNA (lncRNA) genes and circular RNA (circRNA) genes. Our results provided a valuable and highly reliable data source to select key candidate genes for subsequent experimental validation. To facilitate identification of novel genes regulating important biological processes and phenotypes in other plants or animals, we have published the source code of NetMiner, making it freely available at https://github.com/czllab/NetMiner.
Implications of monotreme and marsupial chromosome evolution on sex determination and differentiation.

PubMed

Deakin, Janine E

2017-04-01

Studies of chromosomes from monotremes and marsupials endemic to Australasia have provided important insight into the evolution of their genomes as well as uncovering fundamental differences in their sex determination/differentiation pathways. Great advances have been made this century into solving the mystery of the complicated sex chromosome system in monotremes. Monotremes possess multiple different X and Y chromosomes and a candidate sex determining gene has been identified. Even greater advancements have been made for marsupials, with reconstruction of the ancestral karyotype enabling the evolutionary history of marsupial chromosomes to be determined. Furthermore, the study of sex chromosomes in intersex marsupials has afforded insight into differences in the sexual differentiation pathway between marsupials and eutherians, together with experiments showing the insensitivity of the mammary glands, pouch and scrotum to exogenous hormones, led to the hypothesis that there is a gene (or genes) on the X chromosome responsible for the development of either pouch or scrotum. This review highlights the major advancements made towards understanding chromosome evolution and how this has impacted on our understanding of sex determination and differentiation in these interesting mammals. Copyright © 2015 Elsevier Inc. All rights reserved.
Maximizing mutagenesis with solubilized CRISPR-Cas9 ribonucleoprotein complexes.

PubMed

Burger, Alexa; Lindsay, Helen; Felker, Anastasia; Hess, Christopher; Anders, Carolin; Chiavacci, Elena; Zaugg, Jonas; Weber, Lukas M; Catena, Raul; Jinek, Martin; Robinson, Mark D; Mosimann, Christian

2016-06-01

CRISPR-Cas9 enables efficient sequence-specific mutagenesis for creating somatic or germline mutants of model organisms. Key constraints in vivo remain the expression and delivery of active Cas9-sgRNA ribonucleoprotein complexes (RNPs) with minimal toxicity, variable mutagenesis efficiencies depending on targeting sequence, and high mutation mosaicism. Here, we apply in vitro assembled, fluorescent Cas9-sgRNA RNPs in solubilizing salt solution to achieve maximal mutagenesis efficiency in zebrafish embryos. MiSeq-based sequence analysis of targeted loci in individual embryos using CrispRVariants, a customized software tool for mutagenesis quantification and visualization, reveals efficient bi-allelic mutagenesis that reaches saturation at several tested gene loci. Such virtually complete mutagenesis exposes loss-of-function phenotypes for candidate genes in somatic mutant embryos for subsequent generation of stable germline mutants. We further show that targeting of non-coding elements in gene regulatory regions using saturating mutagenesis uncovers functional control elements in transgenic reporters and endogenous genes in injected embryos. Our results establish that optimally solubilized, in vitro assembled fluorescent Cas9-sgRNA RNPs provide a reproducible reagent for direct and scalable loss-of-function studies and applications beyond zebrafish experiments that require maximal DNA cutting efficiency in vivo. © 2016. Published by The Company of Biologists Ltd.

Differential Gene Expression between Leaf and Rhizome in Atractylodes lancea: A Comparative Transcriptome Analysis

PubMed Central

Huang, Qianqian; Huang, Xiao; Deng, Juan; Liu, Hegang; Liu, Yanwen; Yu, Kun; Huang, Bisheng

2016-01-01

The rhizome of Atractylodes lancea is extensively used in the practice of Traditional Chinese Medicine because of its broad pharmacological activities. This study was designed to characterize the transcriptome profiling of the rhizome and leaf of Atractylodes lancea in an attempt to uncover the molecular mechanisms regulating rhizome formation and growth. Over 270 million clean reads were assembled into 92,366 unigenes, 58% of which are homologous with sequences in public protein databases (NR, Swiss-Prot, GO, and KEGG). Analysis of expression levels showed that genes involved in photosynthesis, stress response, and translation were the most abundant transcripts in the leaf, while transcripts involved in stress response, transcription regulation, translation, and metabolism were dominant in the rhizome. Tissue-specific gene analysis identified distinct gene families active in the leaf and rhizome. Differential gene expression analysis revealed a clear difference in gene expression pattern, identifying 1518 up-regulated genes and 3464 down-regulated genes in the rhizome compared with the leaf, including a series of genes related to signal transduction, primary and secondary metabolism. Transcription factor (TF) analysis identified 42 TF families, with 67 and 60 TFs up-regulated in the rhizome and leaf, respectively. A total of 104 unigenes were identified as candidates for regulating rhizome formation and development. These data offer an overview of the gene expression pattern of the rhizome and leaf and provide essential information for future studies on the molecular mechanisms of controlling rhizome formation and growth. The extensive transcriptome data generated in this study will be a valuable resource for further functional genomics studies of A. lancea. PMID:27066021
A proteome-scale map of the human interactome network

PubMed Central

Rolland, Thomas; Taşan, Murat; Charloteaux, Benoit; Pevzner, Samuel J.; Zhong, Quan; Sahni, Nidhi; Yi, Song; Lemmens, Irma; Fontanillo, Celia; Mosca, Roberto; Kamburov, Atanas; Ghiassian, Susan D.; Yang, Xinping; Ghamsari, Lila; Balcha, Dawit; Begg, Bridget E.; Braun, Pascal; Brehme, Marc; Broly, Martin P.; Carvunis, Anne-Ruxandra; Convery-Zupan, Dan; Corominas, Roser; Coulombe-Huntington, Jasmin; Dann, Elizabeth; Dreze, Matija; Dricot, Amélie; Fan, Changyu; Franzosa, Eric; Gebreab, Fana; Gutierrez, Bryan J.; Hardy, Madeleine F.; Jin, Mike; Kang, Shuli; Kiros, Ruth; Lin, Guan Ning; Luck, Katja; MacWilliams, Andrew; Menche, Jörg; Murray, Ryan R.; Palagi, Alexandre; Poulin, Matthew M.; Rambout, Xavier; Rasla, John; Reichert, Patrick; Romero, Viviana; Ruyssinck, Elien; Sahalie, Julie M.; Scholz, Annemarie; Shah, Akash A.; Sharma, Amitabh; Shen, Yun; Spirohn, Kerstin; Tam, Stanley; Tejeda, Alexander O.; Trigg, Shelly A.; Twizere, Jean-Claude; Vega, Kerwin; Walsh, Jennifer; Cusick, Michael E.; Xia, Yu; Barabási, Albert-László; Iakoucheva, Lilia M.; Aloy, Patrick; De Las Rivas, Javier; Tavernier, Jan; Calderwood, Michael A.; Hill, David E.; Hao, Tong; Roth, Frederick P.; Vidal, Marc

2014-01-01

SUMMARY Just as reference genome sequences revolutionized human genetics, reference maps of interactome networks will be critical to fully understand genotype-phenotype relationships. Here, we describe a systematic map of ~14,000 high-quality human binary protein-protein interactions. At equal quality, this map is ~30% larger than what is available from small-scale studies published in the literature in the last few decades. While currently available information is highly biased and only covers a relatively small portion of the proteome, our systematic map appears strikingly more homogeneous, revealing a “broader” human interactome network than currently appreciated. The map also uncovers significant inter-connectivity between known and candidate cancer gene products, providing unbiased evidence for an expanded functional cancer landscape, while demonstrating how high quality interactome models will help “connect the dots” of the genomic revolution. PMID:25416956
Rare and Common Variants Conferring Risk of Tooth Agenesis.

PubMed

Jonsson, L; Magnusson, T E; Thordarson, A; Jonsson, T; Geller, F; Feenstra, B; Melbye, M; Nohr, E A; Vucic, S; Dhamo, B; Rivadeneira, F; Ongkosuwito, E M; Wolvius, E B; Leslie, E J; Marazita, M L; Howe, B J; Moreno Uribe, L M; Alonso, I; Santos, M; Pinho, T; Jonsson, R; Audolfsson, G; Gudmundsson, L; Nawaz, M S; Olafsson, S; Gustafsson, O; Ingason, A; Unnsteinsdottir, U; Bjornsdottir, G; Walters, G B; Zervas, M; Oddsson, A; Gudbjartsson, D F; Steinberg, S; Stefansson, H; Stefansson, K

2018-05-01

We present association results from a large genome-wide association study of tooth agenesis (TA) as well as selective TA, including 1,944 subjects with congenitally missing teeth, excluding third molars, and 338,554 controls, all of European ancestry. We also tested the association of previously identified risk variants, for timing of tooth eruption and orofacial clefts, with TA. We report associations between TA and 9 novel risk variants. Five of these variants associate with selective TA, including a variant conferring risk of orofacial clefts. These results contribute to a deeper understanding of the genetic architecture of tooth development and disease. The few variants previously associated with TA were uncovered through candidate gene studies guided by mouse knockouts. Knowing the etiology and clinical features of TA is important for planning oral rehabilitation that often involves an interdisciplinary approach.
Time-Series Analyses of Transcriptomes and Proteomes Reveal Molecular Networks Underlying Oil Accumulation in Canola.

PubMed

Wan, Huafang; Cui, Yixin; Ding, Yijuan; Mei, Jiaqin; Dong, Hongli; Zhang, Wenxin; Wu, Shiqi; Liang, Ying; Zhang, Chunyu; Li, Jiana; Xiong, Qing; Qian, Wei

2016-01-01

Understanding the regulation of lipid metabolism is vital for genetic engineering of canola ( Brassica napus L.) to increase oil yield or modify oil composition. We conducted time-series analyses of transcriptomes and proteomes to uncover the molecular networks associated with oil accumulation and dynamic changes in these networks in canola. The expression levels of genes and proteins were measured at 2, 4, 6, and 8 weeks after pollination (WAP). Our results show that the biosynthesis of fatty acids is a dominant cellular process from 2 to 6 WAP, while the degradation mainly happens after 6 WAP. We found that genes in almost every node of fatty acid synthesis pathway were significantly up-regulated during oil accumulation. Moreover, significant expression changes of two genes, acetyl-CoA carboxylase and acyl-ACP desaturase, were detected on both transcriptomic and proteomic levels. We confirmed the temporal expression patterns revealed by the transcriptomic analyses using quantitative real-time PCR experiments. The gene set association analysis show that the biosynthesis of fatty acids and unsaturated fatty acids are the most significant biological processes from 2-4 WAP and 4-6 WAP, respectively, which is consistent with the results of time-series analyses. These results not only provide insight into the mechanisms underlying lipid metabolism, but also reveal novel candidate genes that are worth further investigation for their values in the genetic engineering of canola.
A genome-wide SNP scan accelerates trait-regulatory genomic loci identification in chickpea

PubMed Central

Kujur, Alice; Bajaj, Deepak; Upadhyaya, Hari D.; Das, Shouvik; Ranjan, Rajeev; Shree, Tanima; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C.L.L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.

2015-01-01

We identified 44844 high-quality SNPs by sequencing 92 diverse chickpea accessions belonging to a seed and pod trait-specific association panel using reference genome- and de novo-based GBS (genotyping-by-sequencing) assays. A GWAS (genome-wide association study) in an association panel of 211, including the 92 sequenced accessions, identified 22 major genomic loci showing significant association (explaining 23–47% phenotypic variation) with pod and seed number/plant and 100-seed weight. Eighteen trait-regulatory major genomic loci underlying 13 robust QTLs were validated and mapped on an intra-specific genetic linkage map by QTL mapping. A combinatorial approach of GWAS, QTL mapping and gene haplotype-specific LD mapping and transcript profiling uncovered one superior haplotype and favourable natural allelic variants in the upstream regulatory region of a CesA-type cellulose synthase (Ca_Kabuli_CesA3) gene regulating high pod and seed number/plant (explaining 47% phenotypic variation) in chickpea. The up-regulation of this superior gene haplotype correlated with increased transcript expression of Ca_Kabuli_CesA3 gene in the pollen and pod of high pod/seed number accession, resulting in higher cellulose accumulation for normal pollen and pollen tube growth. A rapid combinatorial genome-wide SNP genotyping-based approach has potential to dissect complex quantitative agronomic traits and delineate trait-regulatory genomic loci (candidate genes) for genetic enhancement in crop plants, including chickpea. PMID:26058368
Uncovering the mechanisms of Caenorhabditis elegans ageing from global quantification of the underlying landscape.

PubMed

Zhao, Lei; Wang, Jin

2016-11-01

Recent studies on Caenorhabditis elegans reveal that gene manipulations can extend its lifespan several fold. However, how the genes work together to determine longevity is still an open question. Here we construct a gene regulatory network for worm ageing and quantify its underlying potential and flux landscape. We found ageing and rejuvenation states can emerge as basins of attraction at certain gene expression levels. The system state can switch from one attractor to another driven by the intrinsic or external perturbations through genetics or the environment. Furthermore, we simulated gene silencing experiments and found that the silencing of longevity-promoting or lifespan-limiting genes leads to ageing or rejuvenation domination, respectively. This indicates that the difference in depths between ageing and the rejuvenation attractor is highly correlated with worm longevity. We further uncovered some key genes and regulations which have a strong influence on landscape basin stability. A dynamic landscape model is proposed to describe the whole process of ageing: the ageing attractor dominates when senescence progresses. We also uncovered the oscillation dynamics, and a similar behaviour was observed in the long-lived creature Turritopsis dohrnii Our landscape theory provides a global and physical approach to explore the underlying mechanisms of ageing. © 2016 The Author(s).
Candidate genes for obesity-susceptibility show enriched association within a large genome-wide association study for BMI.

PubMed

Vimaleswaran, Karani S; Tachmazidou, Ioanna; Zhao, Jing Hua; Hirschhorn, Joel N; Dudbridge, Frank; Loos, Ruth J F

2012-10-15

Before the advent of genome-wide association studies (GWASs), hundreds of candidate genes for obesity-susceptibility had been identified through a variety of approaches. We examined whether those obesity candidate genes are enriched for associations with body mass index (BMI) compared with non-candidate genes by using data from a large-scale GWAS. A thorough literature search identified 547 candidate genes for obesity-susceptibility based on evidence from animal studies, Mendelian syndromes, linkage studies, genetic association studies and expression studies. Genomic regions were defined to include the genes ±10 kb of flanking sequence around candidate and non-candidate genes. We used summary statistics publicly available from the discovery stage of the genome-wide meta-analysis for BMI performed by the genetic investigation of anthropometric traits consortium in 123 564 individuals. Hypergeometric, rank tail-strength and gene-set enrichment analysis tests were used to test for the enrichment of association in candidate compared with non-candidate genes. The hypergeometric test of enrichment was not significant at the 5% P-value quantile (P = 0.35), but was nominally significant at the 25% quantile (P = 0.015). The rank tail-strength and gene-set enrichment tests were nominally significant for the full set of genes and borderline significant for the subset without SNPs at P < 10(-7). Taken together, the observed evidence for enrichment suggests that the candidate gene approach retains some value. However, the degree of enrichment is small despite the extensive number of candidate genes and the large sample size. Studies that focus on candidate genes have only slightly increased chances of detecting associations, and are likely to miss many true effects in non-candidate genes, at least for obesity-related traits.
Global Gene Expression Patterns and Somatic Mutations in Sporadic Intracranial Aneurysms.

PubMed

Li, Zhili; Tan, Haibin; Shi, Yi; Huang, Guangfu; Wang, Zhenyu; Liu, Ling; Yin, Cheng; Wang, Qi

2017-04-01

High-throughput sequencing technologies can expand our understanding of the pathologic basis of intracranial aneurysms (IAs). Our study was aimed to decipher the gene expression signature and genetic factors associated with IAs. We determined the gene expression levels of 3 cases of IAs by RNA sequencing. Bioinformatics analysis was conducted to identify the differentially expressed genes (DEGs) and uncover their biological function. In addition, whole genome sequencing was performed on an additional 6 cases of IAs to detect the potential somatic alterations in DEGs. Compared with the normal arterial tissue, 1709 genes were differentially expressed in IAs arterial tissue. The most significantly up-regulated gene and down-regulated gene, H19 and HIST1H3J, may be essential for tumorigenesis of IAs. Hub protein of IKBKG in protein-protein interaction network was probably involved in the inflammation process in aneurysms. Another 2 hub proteins, ACTB and MKI67IP, as well as up-regulated genes, might be abnormally activated in aneurysms and involved in the pathogenesis of IAs. Further whole genome sequencing and filtering yielded 4 candidate somatic single nucleotide variants including MUC3B, and BLM may be involved in the pathogenesis of IAs. Even though, our results do not support the hypothesis of somatic mutations occurred in the DEGs. Two-dimensional genomic data from transcriptome and whole genome sequencing indicated that no somatic mutations occurred in DEGs. In addition, 3 DEGs (IKBKG, ACTB, and MKI67IP) and 2 mutant genes (MUC3B and BLM) were essential in IAs. Copyright © 2017 Elsevier Inc. All rights reserved.
Balloon-borne three-meter telescope for far-infrared and submillimeter astronomy

NASA Technical Reports Server (NTRS)

Fazlo, G. G.

1986-01-01

The drawbacks uncovered in the initial gimbal design are reviewed. The behavior of ball bearings are considered and two candidate gimbal designs are proposed which overcome the problems in the initial flex pivot based design.
The chromatin accessibility signature of human immune aging stems from CD8+ T cells.

PubMed

Ucar, Duygu; Márquez, Eladio J; Chung, Cheng-Han; Marches, Radu; Rossi, Robert J; Uyar, Asli; Wu, Te-Chia; George, Joshy; Stitzel, Michael L; Palucka, A Karolina; Kuchel, George A; Banchereau, Jacques

2017-10-02

Aging is linked to deficiencies in immune responses and increased systemic inflammation. To unravel the regulatory programs behind these changes, we applied systems immunology approaches and profiled chromatin accessibility and the transcriptome in PBMCs and purified monocytes, B cells, and T cells. Analysis of samples from 77 young and elderly donors revealed a novel and robust aging signature in PBMCs, with simultaneous systematic chromatin closing at promoters and enhancers associated with T cell signaling and a potentially stochastic chromatin opening mostly found at quiescent and repressed sites. Combined analyses of chromatin accessibility and the transcriptome uncovered immune molecules activated/inactivated with aging and identified the silencing of the IL7R gene and the IL-7 signaling pathway genes as potential biomarkers. This signature is borne by memory CD8 + T cells, which exhibited an aging-related loss in binding of NF-κB and STAT factors. Thus, our study provides a unique and comprehensive approach to identifying candidate biomarkers and provides mechanistic insights into aging-associated immunodeficiency. © 2017 Ucar et al.
The chromatin accessibility signature of human immune aging stems from CD8+ T cells

PubMed Central

Marches, Radu; Rossi, Robert J.; Uyar, Asli; Wu, Te-Chia; Stitzel, Michael L.; Palucka, A. Karolina

2017-01-01

Aging is linked to deficiencies in immune responses and increased systemic inflammation. To unravel the regulatory programs behind these changes, we applied systems immunology approaches and profiled chromatin accessibility and the transcriptome in PBMCs and purified monocytes, B cells, and T cells. Analysis of samples from 77 young and elderly donors revealed a novel and robust aging signature in PBMCs, with simultaneous systematic chromatin closing at promoters and enhancers associated with T cell signaling and a potentially stochastic chromatin opening mostly found at quiescent and repressed sites. Combined analyses of chromatin accessibility and the transcriptome uncovered immune molecules activated/inactivated with aging and identified the silencing of the IL7R gene and the IL-7 signaling pathway genes as potential biomarkers. This signature is borne by memory CD8+ T cells, which exhibited an aging-related loss in binding of NF-κB and STAT factors. Thus, our study provides a unique and comprehensive approach to identifying candidate biomarkers and provides mechanistic insights into aging-associated immunodeficiency. PMID:28904110
Fluorescence Reporter-Based Genome-Wide RNA Interference Screening to Identify Alternative Splicing Regulators.

PubMed

Misra, Ashish; Green, Michael R

2017-01-01

Alternative splicing is a regulated process that leads to inclusion or exclusion of particular exons in a pre-mRNA transcript, resulting in multiple protein isoforms being encoded by a single gene. With more than 90 % of human genes known to undergo alternative splicing, it represents a major source for biological diversity inside cells. Although in vitro splicing assays have revealed insights into the mechanisms regulating individual alternative splicing events, our global understanding of alternative splicing regulation is still evolving. In recent years, genome-wide RNA interference (RNAi) screening has transformed biological research by enabling genome-scale loss-of-function screens in cultured cells and model organisms. In addition to resulting in the identification of new cellular pathways and potential drug targets, these screens have also uncovered many previously unknown mechanisms regulating alternative splicing. Here, we describe a method for the identification of alternative splicing regulators using genome-wide RNAi screening, as well as assays for further validation of the identified candidates. With modifications, this method can also be adapted to study the splicing regulation of pre-mRNAs that contain two or more splice isoforms.
Strand-Specific RNA-Seq Analyses of Fruiting Body Development in Coprinopsis cinerea

DOE PAGES

Muraguchi, Hajime; Umezawa, Kiwamu; Niikura, Mai; ...

2015-10-28

We report that the basidiomycete fungus Coprinopsis cinerea is an important model system for multicellular development. Fruiting bodies of C. cinerea are typical mushrooms, which can be produced synchronously on defined media in the laboratory. To investigate the transcriptome in detail during fruiting body development, high-throughput sequencing (RNA-seq) was performed using cDNA libraries strand-specifically constructed from 13 points (stages/tissues) with two biological replicates. The reads were aligned to 14,245 predicted transcripts, and counted for forward and reverse transcripts. Differentially expressed genes (DEGs) between two adjacent points and between vegetative mycelium and each point were detected by Tag Count Comparison (TCC).more » To validate RNA-seq data, expression levels of selected genes were compared using RPKM values in RNA-seq data and qRT-PCR data, and DEGs detected in microarray data were examined in MA plots of RNA-seq data by TCC. We discuss events deduced from GO analysis of DEGs. In addition, we uncovered both transcription factor candidates and antisense transcripts that are likely to be involved in developmental regulation for fruiting.« less
Strand-Specific RNA-Seq Analyses of Fruiting Body Development in Coprinopsis cinerea

DOE Office of Scientific and Technical Information (OSTI.GOV)

Muraguchi, Hajime; Umezawa, Kiwamu; Niikura, Mai

We report that the basidiomycete fungus Coprinopsis cinerea is an important model system for multicellular development. Fruiting bodies of C. cinerea are typical mushrooms, which can be produced synchronously on defined media in the laboratory. To investigate the transcriptome in detail during fruiting body development, high-throughput sequencing (RNA-seq) was performed using cDNA libraries strand-specifically constructed from 13 points (stages/tissues) with two biological replicates. The reads were aligned to 14,245 predicted transcripts, and counted for forward and reverse transcripts. Differentially expressed genes (DEGs) between two adjacent points and between vegetative mycelium and each point were detected by Tag Count Comparison (TCC).more » To validate RNA-seq data, expression levels of selected genes were compared using RPKM values in RNA-seq data and qRT-PCR data, and DEGs detected in microarray data were examined in MA plots of RNA-seq data by TCC. We discuss events deduced from GO analysis of DEGs. In addition, we uncovered both transcription factor candidates and antisense transcripts that are likely to be involved in developmental regulation for fruiting.« less
Rare and low-frequency coding variants alter human adult height

PubMed Central

Marouli, Eirini; Graff, Mariaelisa; Medina-Gomez, Carolina; Lo, Ken Sin; Wood, Andrew R; Kjaer, Troels R; Fine, Rebecca S; Lu, Yingchang; Schurmann, Claudia; Highland, Heather M; Rüeger, Sina; Thorleifsson, Gudmar; Justice, Anne E; Lamparter, David; Stirrups, Kathleen E; Turcot, Valérie; Young, Kristin L; Winkler, Thomas W; Esko, Tõnu; Karaderi, Tugce; Locke, Adam E; Masca, Nicholas GD; Ng, Maggie CY; Mudgal, Poorva; Rivas, Manuel A; Vedantam, Sailaja; Mahajan, Anubha; Guo, Xiuqing; Abecasis, Goncalo; Aben, Katja K; Adair, Linda S; Alam, Dewan S; Albrecht, Eva; Allin, Kristine H; Allison, Matthew; Amouyel, Philippe; Appel, Emil V; Arveiler, Dominique; Asselbergs, Folkert W; Auer, Paul L; Balkau, Beverley; Banas, Bernhard; Bang, Lia E; Benn, Marianne; Bergmann, Sven; Bielak, Lawrence F; Blüher, Matthias; Boeing, Heiner; Boerwinkle, Eric; Böger, Carsten A; Bonnycastle, Lori L; Bork-Jensen, Jette; Bots, Michiel L; Bottinger, Erwin P; Bowden, Donald W; Brandslund, Ivan; Breen, Gerome; Brilliant, Murray H; Broer, Linda; Burt, Amber A; Butterworth, Adam S; Carey, David J; Caulfield, Mark J; Chambers, John C; Chasman, Daniel I; Chen, Yii-Der Ida; Chowdhury, Rajiv; Christensen, Cramer; Chu, Audrey Y; Cocca, Massimiliano; Collins, Francis S; Cook, James P; Corley, Janie; Galbany, Jordi Corominas; Cox, Amanda J; Cuellar-Partida, Gabriel; Danesh, John; Davies, Gail; de Bakker, Paul IW; de Borst, Gert J.; de Denus, Simon; de Groot, Mark CH; de Mutsert, Renée; Deary, Ian J; Dedoussis, George; Demerath, Ellen W; den Hollander, Anneke I; Dennis, Joe G; Di Angelantonio, Emanuele; Drenos, Fotios; Du, Mengmeng; Dunning, Alison M; Easton, Douglas F; Ebeling, Tapani; Edwards, Todd L; Ellinor, Patrick T; Elliott, Paul; Evangelou, Evangelos; Farmaki, Aliki-Eleni; Faul, Jessica D; Feitosa, Mary F; Feng, Shuang; Ferrannini, Ele; Ferrario, Marco M; Ferrieres, Jean; Florez, Jose C; Ford, Ian; Fornage, Myriam; Franks, Paul W; Frikke-Schmidt, Ruth; Galesloot, Tessel E; Gan, Wei; Gandin, Ilaria; Gasparini, Paolo; Giedraitis, Vilmantas; Giri, Ayush; Girotto, Giorgia; Gordon, Scott D; Gordon-Larsen, Penny; Gorski, Mathias; Grarup, Niels; Grove, Megan L.; Gudnason, Vilmundur; Gustafsson, Stefan; Hansen, Torben; Harris, Kathleen Mullan; Harris, Tamara B; Hattersley, Andrew T; Hayward, Caroline; He, Liang; Heid, Iris M; Heikkilä, Kauko; Helgeland, Øyvind; Hernesniemi, Jussi; Hewitt, Alex W; Hocking, Lynne J; Hollensted, Mette; Holmen, Oddgeir L; Hovingh, G. Kees; Howson, Joanna MM; Hoyng, Carel B; Huang, Paul L; Hveem, Kristian; Ikram, M. Arfan; Ingelsson, Erik; Jackson, Anne U; Jansson, Jan-Håkan; Jarvik, Gail P; Jensen, Gorm B; Jhun, Min A; Jia, Yucheng; Jiang, Xuejuan; Johansson, Stefan; Jørgensen, Marit E; Jørgensen, Torben; Jousilahti, Pekka; Jukema, J Wouter; Kahali, Bratati; Kahn, René S; Kähönen, Mika; Kamstrup, Pia R; Kanoni, Stavroula; Kaprio, Jaakko; Karaleftheri, Maria; Kardia, Sharon LR; Karpe, Fredrik; Kee, Frank; Keeman, Renske; Kiemeney, Lambertus A; Kitajima, Hidetoshi; Kluivers, Kirsten B; Kocher, Thomas; Komulainen, Pirjo; Kontto, Jukka; Kooner, Jaspal S; Kooperberg, Charles; Kovacs, Peter; Kriebel, Jennifer; Kuivaniemi, Helena; Küry, Sébastien; Kuusisto, Johanna; La Bianca, Martina; Laakso, Markku; Lakka, Timo A; Lange, Ethan M; Lange, Leslie A; Langefeld, Carl D; Langenberg, Claudia; Larson, Eric B; Lee, I-Te; Lehtimäki, Terho; Lewis, Cora E; Li, Huaixing; Li, Jin; Li-Gao, Ruifang; Lin, Honghuang; Lin, Li-An; Lin, Xu; Lind, Lars; Lindström, Jaana; Linneberg, Allan; Liu, Yeheng; Liu, Yongmei; Lophatananon, Artitaya; Luan, Jian'an; Lubitz, Steven A; Lyytikäinen, Leo-Pekka; Mackey, David A; Madden, Pamela AF; Manning, Alisa K; Männistö, Satu; Marenne, Gaëlle; Marten, Jonathan; Martin, Nicholas G; Mazul, Angela L; Meidtner, Karina; Metspalu, Andres; Mitchell, Paul; Mohlke, Karen L; Mook-Kanamori, Dennis O; Morgan, Anna; Morris, Andrew D; Morris, Andrew P; Müller-Nurasyid, Martina; Munroe, Patricia B; Nalls, Mike A; Nauck, Matthias; Nelson, Christopher P; Neville, Matt; Nielsen, Sune F; Nikus, Kjell; Njølstad, Pål R; Nordestgaard, Børge G; Ntalla, Ioanna; O'Connel, Jeffrey R; Oksa, Heikki; Loohuis, Loes M Olde; Ophoff, Roel A; Owen, Katharine R; Packard, Chris J; Padmanabhan, Sandosh; Palmer, Colin NA; Pasterkamp, Gerard; Patel, Aniruddh P; Pattie, Alison; Pedersen, Oluf; Peissig, Peggy L; Peloso, Gina M; Pennell, Craig E; Perola, Markus; Perry, James A; Perry, John R.B.; Person, Thomas N; Pirie, Ailith; Polasek, Ozren; Posthuma, Danielle; Raitakari, Olli T; Rasheed, Asif; Rauramaa, Rainer; Reilly, Dermot F; Reiner, Alex P; Renström, Frida; Ridker, Paul M; Rioux, John D; Robertson, Neil; Robino, Antonietta; Rolandsson, Olov; Rudan, Igor; Ruth, Katherine S; Saleheen, Danish; Salomaa, Veikko; Samani, Nilesh J; Sandow, Kevin; Sapkota, Yadav; Sattar, Naveed; Schmidt, Marjanka K; Schreiner, Pamela J; Schulze, Matthias B; Scott, Robert A; Segura-Lepe, Marcelo P; Shah, Svati; Sim, Xueling; Sivapalaratnam, Suthesh; Small, Kerrin S; Smith, Albert Vernon; Smith, Jennifer A; Southam, Lorraine; Spector, Timothy D; Speliotes, Elizabeth K; Starr, John M; Steinthorsdottir, Valgerdur; Stringham, Heather M; Stumvoll, Michael; Surendran, Praveen; Hart, Leen M ‘t; Tansey, Katherine E; Tardif, Jean-Claude; Taylor, Kent D; Teumer, Alexander; Thompson, Deborah J; Thorsteinsdottir, Unnur; Thuesen, Betina H; Tönjes, Anke; Tromp, Gerard; Trompet, Stella; Tsafantakis, Emmanouil; Tuomilehto, Jaakko; Tybjaerg-Hansen, Anne; Tyrer, Jonathan P; Uher, Rudolf; Uitterlinden, André G; Ulivi, Sheila; van der Laan, Sander W; Van Der Leij, Andries R; van Duijn, Cornelia M; van Schoor, Natasja M; van Setten, Jessica; Varbo, Anette; Varga, Tibor V; Varma, Rohit; Edwards, Digna R Velez; Vermeulen, Sita H; Vestergaard, Henrik; Vitart, Veronique; Vogt, Thomas F; Vozzi, Diego; Walker, Mark; Wang, Feijie; Wang, Carol A; Wang, Shuai; Wang, Yiqin; Wareham, Nicholas J; Warren, Helen R; Wessel, Jennifer; Willems, Sara M; Wilson, James G; Witte, Daniel R; Woods, Michael O; Wu, Ying; Yaghootkar, Hanieh; Yao, Jie; Yao, Pang; Yerges-Armstrong, Laura M; Young, Robin; Zeggini, Eleftheria; Zhan, Xiaowei; Zhang, Weihua; Zhao, Jing Hua; Zhao, Wei; Zhao, Wei; Zheng, He; Zhou, Wei; Rotter, Jerome I; Boehnke, Michael; Kathiresan, Sekar; McCarthy, Mark I; Willer, Cristen J; Stefansson, Kari; Borecki, Ingrid B; Liu, Dajiang J; North, Kari E; Heard-Costa, Nancy L; Pers, Tune H; Lindgren, Cecilia M; Oxvig, Claus; Kutalik, Zoltán; Rivadeneira, Fernando; Loos, Ruth JF; Frayling, Timothy M; Hirschhorn, Joel N; Deloukas, Panos; Lettre, Guillaume

2016-01-01

Summary Height is a highly heritable, classic polygenic trait with ∼700 common associated variants identified so far through genome-wide association studies. Here, we report 83 height-associated coding variants with lower minor allele frequencies (range of 0.1-4.8%) and effects of up to 2 cm/allele (e.g. in IHH, STC2, AR and CRISPLD2), >10 times the average effect of common variants. In functional follow-up studies, rare height-increasing alleles of STC2 (+1-2 cm/allele) compromised proteolytic inhibition of PAPP-A and increased cleavage of IGFBP-4 in vitro, resulting in higher bioavailability of insulin-like growth factors. These 83 height-associated variants overlap genes mutated in monogenic growth disorders and highlight new biological candidates (e.g. ADAMTS3, IL11RA, NOX4) and pathways (e.g. proteoglycan/glycosaminoglycan synthesis) involved in growth. Our results demonstrate that sufficiently large sample sizes can uncover rare and low-frequency variants of moderate to large effect associated with polygenic human phenotypes, and that these variants implicate relevant genes and pathways. PMID:28146470
A novel approach for predicting microRNA-disease associations by unbalanced bi-random walk on heterogeneous network.

PubMed

Luo, Jiawei; Xiao, Qiu

2017-02-01

MicroRNAs (miRNAs) play a critical role by regulating their targets in post-transcriptional level. Identification of potential miRNA-disease associations will aid in deciphering the pathogenesis of human polygenic diseases. Several computational models have been developed to uncover novel miRNA-disease associations based on the predicted target genes. However, due to the insufficient number of experimentally validated miRNA-target interactions as well as the relatively high false-positive and false-negative rates of predicted target genes, it is still challenging for these prediction models to obtain remarkable performances. The purpose of this study is to prioritize miRNA candidates for diseases. We first construct a heterogeneous network, which consists of a disease similarity network, a miRNA functional similarity network and a known miRNA-disease association network. Then, an unbalanced bi-random walk-based algorithm on the heterogeneous network (BRWH) is adopted to discover potential associations by exploiting bipartite subgraphs. Based on 5-fold cross validation, the proposed network-based method achieves AUC values ranging from 0.782 to 0.907 for the 22 human diseases and an average AUC of almost 0.846. The experiments indicated that BRWH can achieve better performances compared with several popular methods. In addition, case studies of some common diseases further demonstrated the superior performance of our proposed method on prioritizing disease-related miRNA candidates. Copyright Â© 2017 Elsevier Inc. All rights reserved.
Inferring Selective Constraint from Population Genomic Data Suggests Recent Regulatory Turnover in the Human Brain

PubMed Central

Schrider, Daniel R.; Kern, Andrew D.

2015-01-01

The comparative genomics revolution of the past decade has enabled the discovery of functional elements in the human genome via sequence comparison. While that is so, an important class of elements, those specific to humans, is entirely missed by searching for sequence conservation across species. Here we present an analysis based on variation data among human genomes that utilizes a supervised machine learning approach for the identification of human-specific purifying selection in the genome. Using only allele frequency information from the complete low-coverage 1000 Genomes Project data set in conjunction with a support vector machine trained from known functional and nonfunctional portions of the genome, we are able to accurately identify portions of the genome constrained by purifying selection. Our method identifies previously known human-specific gains or losses of function and uncovers many novel candidates. Candidate targets for gain and loss of function along the human lineage include numerous putative regulatory regions of genes essential for normal development of the central nervous system, including a significant enrichment of gain of function events near neurotransmitter receptor genes. These results are consistent with regulatory turnover being a key mechanism in the evolution of human-specific characteristics of brain development. Finally, we show that the majority of the genome is unconstrained by natural selection currently, in agreement with what has been estimated from phylogenetic methods but in sharp contrast to estimates based on transcriptomics or other high-throughput functional methods. PMID:26590212
Quantitative Profiling of Brain Lipid Raft Proteome in a Mouse Model of Fragile X Syndrome

PubMed Central

Kalinowska, Magdalena; Castillo, Catherine; Francesconi, Anna

2015-01-01

Fragile X Syndrome, a leading cause of inherited intellectual disability and autism, arises from transcriptional silencing of the FMR1 gene encoding an RNA-binding protein, Fragile X Mental Retardation Protein (FMRP). FMRP can regulate the expression of approximately 4% of brain transcripts through its role in regulation of mRNA transport, stability and translation, thus providing a molecular rationale for its potential pleiotropic effects on neuronal and brain circuitry function. Several intracellular signaling pathways are dysregulated in the absence of FMRP suggesting that cellular deficits may be broad and could result in homeostatic changes. Lipid rafts are specialized regions of the plasma membrane, enriched in cholesterol and glycosphingolipids, involved in regulation of intracellular signaling. Among transcripts targeted by FMRP, a subset encodes proteins involved in lipid biosynthesis and homeostasis, dysregulation of which could affect the integrity and function of lipid rafts. Using a quantitative mass spectrometry-based approach we analyzed the lipid raft proteome of Fmr1 knockout mice, an animal model of Fragile X syndrome, and identified candidate proteins that are differentially represented in Fmr1 knockout mice lipid rafts. Furthermore, network analysis of these candidate proteins reveals connectivity between them and predicts functional connectivity with genes encoding components of myelin sheath, axonal processes and growth cones. Our findings provide insight to aid identification of molecular and cellular dysfunctions arising from Fmr1 silencing and for uncovering shared pathologies between Fragile X syndrome and other autism spectrum disorders. PMID:25849048
Dysfunctional SEMA3E signaling underlies gonadotropin-releasing hormone neuron deficiency in Kallmann syndrome.

PubMed

Cariboni, Anna; André, Valentina; Chauvet, Sophie; Cassatella, Daniele; Davidson, Kathryn; Caramello, Alessia; Fantin, Alessandro; Bouloux, Pierre; Mann, Fanny; Ruhrberg, Christiana

2015-06-01

Individuals with an inherited deficiency in gonadotropin-releasing hormone (GnRH) have impaired sexual reproduction. Previous genetic linkage studies and sequencing of plausible gene candidates have identified mutations associated with inherited GnRH deficiency, but the small number of affected families and limited success in validating candidates have impeded genetic diagnoses for most patients. Using a combination of exome sequencing and computational modeling, we have identified a shared point mutation in semaphorin 3E (SEMA3E) in 2 brothers with Kallmann syndrome (KS), which causes inherited GnRH deficiency. Recombinant wild-type SEMA3E protected maturing GnRH neurons from cell death by triggering a plexin D1-dependent (PLXND1-dependent) activation of PI3K-mediated survival signaling. In contrast, recombinant SEMA3E carrying the KS-associated mutation did not protect GnRH neurons from death. In murine models, lack of either SEMA3E or PLXND1 increased apoptosis of GnRH neurons in the developing brain, reducing innervation of the adult median eminence by GnRH-positive neurites. GnRH neuron deficiency in male mice was accompanied by impaired testes growth, a characteristic feature of KS. Together, these results identify SEMA3E as an essential gene for GnRH neuron development, uncover a neurotrophic function for SEMA3E in the developing brain, and elucidate SEMA3E/PLXND1/PI3K signaling as a mechanism that prevents GnRH neuron deficiency.
A Reevaluation of Rice Mitochondrial Evolution Based on the Complete Sequence of Male-Fertile and Male-Sterile Mitochondrial Genomes1[C][W][OA

PubMed Central

Bentolila, Stéphane; Stefanov, Stefan

2012-01-01

Plant mitochondrial genomes have features that distinguish them radically from their animal counterparts: a high rate of rearrangement, of uptake and loss of DNA sequences, and an extremely low point mutation rate. Perhaps the most unique structural feature of plant mitochondrial DNAs is the presence of large repeated sequences involved in intramolecular and intermolecular recombination. In addition, rare recombination events can occur across shorter repeats, creating rearrangements that result in aberrant phenotypes, including pollen abortion, which is known as cytoplasmic male sterility (CMS). Using next-generation sequencing, we pyrosequenced two rice (Oryza sativa) mitochondrial genomes that belong to the indica subspecies. One genome is normal, while the other carries the wild abortive-CMS. We find that numerous rearrangements in the rice mitochondrial genome occur even between close cytotypes during rice evolution. Unlike maize (Zea mays), a closely related species also belonging to the grass family, integration of plastid sequences did not play a role in the sequence divergence between rice cytotypes. This study also uncovered an excellent candidate for the wild abortive-CMS-encoding gene; like most of the CMS-associated open reading frames that are known in other species, this candidate was created via a rearrangement, is chimeric in structure, possesses predicted transmembrane domains, and coopted the promoter of a genuine mitochondrial gene. Our data give new insights into rice mitochondrial evolution, correcting previous reports. PMID:22128137

Analysis of Over 10,000 Cases Finds No Association between Previously-Reported Candidate Polymorphisms and Ovarian Cancer Outcome

PubMed Central

White, Kristin L.; Vierkant, Robert A.; Fogarty, Zachary C.; Charbonneau, Bridget; Block, Matthew S.; Pharoah, Paul D.P.; Chenevix-Trench, Georgia; Rossing, Mary Anne; Cramer, Daniel W.; Pearce, C. Leigh; Schildkraut, Joellen M.; Menon, Usha; Kjaer, Susanne Kruger; Levine, Douglas A.; Gronwald, Jacek; Culver, Hoda Anton; Whittemore, Alice S.; Karlan, Beth Y.; Lambrechts, Diether; Wentzensen, Nicolas; Kupryjanczyk, Jolanta; Chang-Claude, Jenny; Bandera, Elisa V.; Hogdall, Estrid; Heitz, Florian; Kaye, Stanley B.; Fasching, Peter A.; Campbell, Ian; Goodman, Marc T.; Pejovic, Tanja; Bean, Yukie; Lurie, Galina; Eccles, Diana; Hein, Alexander; Beckmann, Matthias W.; Ekici, Arif B.; Paul, James; Brown, Robert; Flanagan, James; Harter, Philipp; du Bois, Andreas; Schwaab, Ira; Hogdall, Claus K.; Lundvall, Lene; Olson, Sara H.; Orlow, Irene; Paddock, Lisa E.; Rudolph, Anja; Eilber, Ursula; Dansonka-Mieszkowska, Agnieszka; Rzepecka, Iwona K.; Ziolkowska-Seta, Izabela; Brinton, Louise; Yang, Hannah; Garcia-Closas, Montserrat; Despierre, Evelyn; Lambrechts, Sandrina; Vergote, Ignace; Walsh, Christine; Lester, Jenny; Sieh, Weiva; McGuire, Valerie; Rothstein, Joseph H.; Ziogas, Argyrios; Lubiński, Jan; Cybulski, Cezary; Menkiszak, Janusz; Jensen, Allan; Gayther, Simon A.; Ramus, Susan J.; Gentry-Maharaj, Aleksandra; Berchuck, Andrew; Wu, Anna H.; Pike, Malcolm C.; Van Den Berg, David; Terry, Kathryn L.; Vitonis, Allison F.; Doherty, Jennifer A.; Johnatty, Sharon; deFazio, Anna; Song, Honglin; Tyrer, Jonathan; Sellers, Thomas A.; Phelan, Catherine M.; Kalli, Kimberly R.; Cunningham, Julie M.; Fridley, Brooke L.; Goode, Ellen L.

2013-01-01

Background Ovarian cancer is a leading cause of cancer-related death among women. In an effort to understand contributors to disease outcome, we evaluated single-nucleotide polymorphisms (SNPs) previously associated with ovarian cancer recurrence or survival, specifically in angiogenesis, inflammation, mitosis, and drug disposition genes. Methods Twenty-seven SNPs in VHL, HGF, IL18, PRKACB, ABCB1, CYP2C8, ERCC2, and ERCC1 previously associated with ovarian cancer outcome were genotyped in 10,084 invasive cases from 28 studies from the Ovarian Cancer Association Consortium with over 37,000 observed person-years and 4,478 deaths. Cox proportional hazards models were used to examine the association between candidate SNPs and ovarian cancer recurrence or survival with and without adjustment for key covariates. Results We observed no association between genotype and ovarian cancer recurrence or survival for any of the SNPs examined. Conclusions These results refute prior associations between these SNPs and ovarian cancer outcome and underscore the importance of maximally powered genetic association studies. Impact These variants should not be used in prognostic models. Alternate approaches to uncovering inherited prognostic factors, if they exist, are needed. PMID:23513043
Computational discovery of pathway-level genetic vulnerabilities in non-small-cell lung cancer

PubMed Central

Young, Jonathan H.; Peyton, Michael; Seok Kim, Hyun; McMillan, Elizabeth; Minna, John D.; White, Michael A.; Marcotte, Edward M.

2016-01-01

Motivation: Novel approaches are needed for discovery of targeted therapies for non-small-cell lung cancer (NSCLC) that are specific to certain patients. Whole genome RNAi screening of lung cancer cell lines provides an ideal source for determining candidate drug targets. Results: Unsupervised learning algorithms uncovered patterns of differential vulnerability across lung cancer cell lines to loss of functionally related genes. Such genetic vulnerabilities represent candidate targets for therapy and are found to be involved in splicing, translation and protein folding. In particular, many NSCLC cell lines were especially sensitive to the loss of components of the LSm2-8 protein complex or the CCT/TRiC chaperonin. Different vulnerabilities were also found for different cell line subgroups. Furthermore, the predicted vulnerability of a single adenocarcinoma cell line to loss of the Wnt pathway was experimentally validated with screening of small-molecule Wnt inhibitors against an extensive cell line panel. Availability and implementation: The clustering algorithm is implemented in Python and is freely available at https://bitbucket.org/youngjh/nsclc_paper. Contact: marcotte@icmb.utexas.edu or jon.young@utexas.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26755624
Computational discovery of pathway-level genetic vulnerabilities in non-small-cell lung cancer.

PubMed

Young, Jonathan H; Peyton, Michael; Seok Kim, Hyun; McMillan, Elizabeth; Minna, John D; White, Michael A; Marcotte, Edward M

2016-05-01

Novel approaches are needed for discovery of targeted therapies for non-small-cell lung cancer (NSCLC) that are specific to certain patients. Whole genome RNAi screening of lung cancer cell lines provides an ideal source for determining candidate drug targets. Unsupervised learning algorithms uncovered patterns of differential vulnerability across lung cancer cell lines to loss of functionally related genes. Such genetic vulnerabilities represent candidate targets for therapy and are found to be involved in splicing, translation and protein folding. In particular, many NSCLC cell lines were especially sensitive to the loss of components of the LSm2-8 protein complex or the CCT/TRiC chaperonin. Different vulnerabilities were also found for different cell line subgroups. Furthermore, the predicted vulnerability of a single adenocarcinoma cell line to loss of the Wnt pathway was experimentally validated with screening of small-molecule Wnt inhibitors against an extensive cell line panel. The clustering algorithm is implemented in Python and is freely available at https://bitbucket.org/youngjh/nsclc_paper marcotte@icmb.utexas.edu or jon.young@utexas.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Planetary Nebula Candidates Uncovered with the HASH Research Platform

NASA Astrophysics Data System (ADS)

Fragkou, Vasiliki; Bojičić, Ivan; Frew, David; Parker, Quentin

2017-10-01

A detailed examination of new high quality radio catalogues (e.g. Cornish) in combination with available mid-infrared (MIR) satellite imagery (e.g. Glimpse) has allowed us to find 70 new planetary nebula (PN) candidates based on existing knowledge of their typical colors and fluxes. To further examine the nature of these sources, multiple diagnostic tools have been applied to these candidates based on published data and on available imagery in the HASH (Hong Kong/ AAO/ Strasbourg Hα planetary nebula) research platform. Some candidates have previously-missed optical counterparts allowing for spectroscopic follow-up. Indeed, the single object spectroscopically observed so far has turned out to be a bona fide PN.
Analysis of the Transcriptome of Erigeron breviscapus Uncovers Putative Scutellarin and Chlorogenic Acids Biosynthetic Genes and Genetic Markers

PubMed Central

Zhang, Jia-Jin; Shu, Li-Ping; Zhang, Wei; Long, Guang-Qiang; Liu, Tao; Meng, Zheng-Gui; Chen, Jun-Wen; Yang, Sheng-Chao

2014-01-01

Background Erigeron breviscapus (Vant.) Hand-Mazz. is a famous medicinal plant. Scutellarin and chlorogenic acids are the primary active components in this herb. However, the mechanisms of biosynthesis and regulation for scutellarin and chlorogenic acids in E. breviscapus are considerably unknown. In addition, genomic information of this herb is also unavailable. Principal Findings Using Illumina sequencing on GAIIx platform, a total of 64,605,972 raw sequencing reads were generated and assembled into 73,092 non-redundant unigenes. Among them, 44,855 unigenes (61.37%) were annotated in the public databases Nr, Swiss-Prot, KEGG, and COG. The transcripts encoding the known enzymes involved in flavonoids and in chlorogenic acids biosynthesis were discovered in the Illumina dataset. Three candidate cytochrome P450 genes were discovered which might encode flavone 6-hydroase converting apigenin to scutellarein. Furthermore, 4 unigenes encoding the homologues of maize P1 (R2R3-MYB transcription factors) were defined, which might regulate the biosynthesis of scutellarin. Additionally, a total of 11,077 simple sequence repeat (SSR) were identified from 9,255 unigenes. Of SSRs, tri-nucleotide motifs were the most abundant motif. Thirty-six primer pairs for SSRs were randomly selected for validation of the amplification and polymorphism. The result revealed that 34 (94.40%) primer pairs were successfully amplified and 19 (52.78%) primer pairs exhibited polymorphisms. Conclusion Using next generation sequencing (NGS) technology, this study firstly provides abundant genomic data for E. breviscapus. The candidate genes involved in the biosynthesis and transcriptional regulation of scutellarin and chlorogenic acids were obtained in this study. Additionally, a plenty of genetic makers were generated by identification of SSRs, which is a powerful tool for molecular breeding and genetics applications in this herb. PMID:24956277
Molecular Characterization of Three Canine Models of Human Rare Bone Diseases: Caffey, van den Ende-Gupta, and Raine Syndromes.

PubMed

Hytönen, Marjo K; Arumilli, Meharji; Lappalainen, Anu K; Owczarek-Lipska, Marta; Jagannathan, Vidhya; Hundi, Sruthi; Salmela, Elina; Venta, Patrick; Sarkiala, Eva; Jokinen, Tarja; Gorgas, Daniela; Kere, Juha; Nieminen, Pekka; Drögemüller, Cord; Lohi, Hannes

2016-05-01

One to two percent of all children are born with a developmental disorder requiring pediatric hospital admissions. For many such syndromes, the molecular pathogenesis remains poorly characterized. Parallel developmental disorders in other species could provide complementary models for human rare diseases by uncovering new candidate genes, improving the understanding of the molecular mechanisms and opening possibilities for therapeutic trials. We performed various experiments, e.g. combined genome-wide association and next generation sequencing, to investigate the clinico-pathological features and genetic causes of three developmental syndromes in dogs, including craniomandibular osteopathy (CMO), a previously undescribed skeletal syndrome, and dental hypomineralization, for which we identified pathogenic variants in the canine SLC37A2 (truncating splicing enhancer variant), SCARF2 (truncating 2-bp deletion) and FAM20C (missense variant) genes, respectively. CMO is a clinical equivalent to an infantile cortical hyperostosis (Caffey disease), for which SLC37A2 is a new candidate gene. SLC37A2 is a poorly characterized member of a glucose-phosphate transporter family without previous disease associations. It is expressed in many tissues, including cells of the macrophage lineage, e.g. osteoclasts, and suggests a disease mechanism, in which an impaired glucose homeostasis in osteoclasts compromises their function in the developing bone, leading to hyperostosis. Mutations in SCARF2 and FAM20C have been associated with the human van den Ende-Gupta and Raine syndromes that include numerous features similar to the affected dogs. Given the growing interest in the molecular characterization and treatment of human rare diseases, our study presents three novel physiologically relevant models for further research and therapy approaches, while providing the molecular identity for the canine conditions.
Molecular Characterization of Three Canine Models of Human Rare Bone Diseases: Caffey, van den Ende-Gupta, and Raine Syndromes

PubMed Central

Hytönen, Marjo K.; Arumilli, Meharji; Lappalainen, Anu K.; Owczarek-Lipska, Marta; Jagannathan, Vidhya; Hundi, Sruthi; Salmela, Elina; Venta, Patrick; Sarkiala, Eva; Jokinen, Tarja; Gorgas, Daniela; Kere, Juha; Nieminen, Pekka

2016-01-01

One to two percent of all children are born with a developmental disorder requiring pediatric hospital admissions. For many such syndromes, the molecular pathogenesis remains poorly characterized. Parallel developmental disorders in other species could provide complementary models for human rare diseases by uncovering new candidate genes, improving the understanding of the molecular mechanisms and opening possibilities for therapeutic trials. We performed various experiments, e.g. combined genome-wide association and next generation sequencing, to investigate the clinico-pathological features and genetic causes of three developmental syndromes in dogs, including craniomandibular osteopathy (CMO), a previously undescribed skeletal syndrome, and dental hypomineralization, for which we identified pathogenic variants in the canine SLC37A2 (truncating splicing enhancer variant), SCARF2 (truncating 2-bp deletion) and FAM20C (missense variant) genes, respectively. CMO is a clinical equivalent to an infantile cortical hyperostosis (Caffey disease), for which SLC37A2 is a new candidate gene. SLC37A2 is a poorly characterized member of a glucose-phosphate transporter family without previous disease associations. It is expressed in many tissues, including cells of the macrophage lineage, e.g. osteoclasts, and suggests a disease mechanism, in which an impaired glucose homeostasis in osteoclasts compromises their function in the developing bone, leading to hyperostosis. Mutations in SCARF2 and FAM20C have been associated with the human van den Ende-Gupta and Raine syndromes that include numerous features similar to the affected dogs. Given the growing interest in the molecular characterization and treatment of human rare diseases, our study presents three novel physiologically relevant models for further research and therapy approaches, while providing the molecular identity for the canine conditions. PMID:27187611
Analysis of the transcriptome of Erigeron breviscapus uncovers putative scutellarin and chlorogenic acids biosynthetic genes and genetic markers.

PubMed

Jiang, Ni-Hao; Zhang, Guang-Hui; Zhang, Jia-Jin; Shu, Li-Ping; Zhang, Wei; Long, Guang-Qiang; Liu, Tao; Meng, Zheng-Gui; Chen, Jun-Wen; Yang, Sheng-Chao

2014-01-01

Erigeron breviscapus (Vant.) Hand-Mazz. is a famous medicinal plant. Scutellarin and chlorogenic acids are the primary active components in this herb. However, the mechanisms of biosynthesis and regulation for scutellarin and chlorogenic acids in E. breviscapus are considerably unknown. In addition, genomic information of this herb is also unavailable. Using Illumina sequencing on GAIIx platform, a total of 64,605,972 raw sequencing reads were generated and assembled into 73,092 non-redundant unigenes. Among them, 44,855 unigenes (61.37%) were annotated in the public databases Nr, Swiss-Prot, KEGG, and COG. The transcripts encoding the known enzymes involved in flavonoids and in chlorogenic acids biosynthesis were discovered in the Illumina dataset. Three candidate cytochrome P450 genes were discovered which might encode flavone 6-hydroase converting apigenin to scutellarein. Furthermore, 4 unigenes encoding the homologues of maize P1 (R2R3-MYB transcription factors) were defined, which might regulate the biosynthesis of scutellarin. Additionally, a total of 11,077 simple sequence repeat (SSR) were identified from 9,255 unigenes. Of SSRs, tri-nucleotide motifs were the most abundant motif. Thirty-six primer pairs for SSRs were randomly selected for validation of the amplification and polymorphism. The result revealed that 34 (94.40%) primer pairs were successfully amplified and 19 (52.78%) primer pairs exhibited polymorphisms. Using next generation sequencing (NGS) technology, this study firstly provides abundant genomic data for E. breviscapus. The candidate genes involved in the biosynthesis and transcriptional regulation of scutellarin and chlorogenic acids were obtained in this study. Additionally, a plenty of genetic makers were generated by identification of SSRs, which is a powerful tool for molecular breeding and genetics applications in this herb.
Association genetics and transcriptome analysis reveal a gibberellin-responsive pathway involved in regulating photosynthesis.

PubMed

Xie, Jianbo; Tian, Jiaxing; Du, Qingzhang; Chen, Jinhui; Li, Ying; Yang, Xiaohui; Li, Bailian; Zhang, Deqiang

2016-05-01

Gibberellins (GAs) regulate a wide range of important processes in plant growth and development, including photosynthesis. However, the mechanism by which GAs regulate photosynthesis remains to be understood. Here, we used multi-gene association to investigate the effect of genes in the GA-responsive pathway, as constructed by RNA sequencing, on photosynthesis, growth, and wood property traits, in a population of 435 Populus tomentosa By analyzing changes in the transcriptome following GA treatment, we identified many key photosynthetic genes, in agreement with the observed increase in measurements of photosynthesis. Regulatory motif enrichment analysis revealed that 37 differentially expressed genes related to photosynthesis shared two essential GA-related cis-regulatory elements, the GA response element and the pyrimidine box. Thus, we constructed a GA-responsive pathway consisting of 47 genes involved in regulating photosynthesis, including GID1, RGA, GID2, MYBGa, and 37 photosynthetic differentially expressed genes. Single nucleotide polymorphism (SNP)-based association analysis showed that 142 SNPs, representing 40 candidate genes in this pathway, were significantly associated with photosynthesis, growth, and wood property traits. Epistasis analysis uncovered interactions between 310 SNP-SNP pairs from 37 genes in this pathway, revealing possible genetic interactions. Moreover, a structural gene-gene matrix based on a time-course of transcript abundances provided a better understanding of the multi-gene pathway affecting photosynthesis. The results imply a functional role for these genes in mediating photosynthesis, growth, and wood properties, demonstrating the potential of combining transcriptome-based regulatory pathway construction and genetic association approaches to detect the complex genetic networks underlying quantitative traits. © The Author 2016. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.
A deep intronic CLRN1 (USH3A) founder mutation generates an aberrant exon and underlies severe Usher syndrome on the Arabian Peninsula.

PubMed

Khan, Arif O; Becirovic, Elvir; Betz, Christian; Neuhaus, Christine; Altmüller, Janine; Maria Riedmayr, Lisa; Motameny, Susanne; Nürnberg, Gudrun; Nürnberg, Peter; Bolz, Hanno J

2017-05-03

Deafblindness is mostly due to Usher syndrome caused by recessive mutations in the known genes. Mutation-negative patients therefore either have distinct diseases, mutations in yet unknown Usher genes or in extra-exonic parts of the known genes - to date a largely unexplored possibility. In a consanguineous Saudi family segregating Usher syndrome type 1 (USH1), NGS of genes for Usher syndrome, deafness and retinal dystrophy and subsequent whole-exome sequencing each failed to identify a mutation. Genome-wide linkage analysis revealed two small candidate regions on chromosome 3, one containing the USH3A gene CLRN1, which has never been associated with Usher syndrome in Saudi Arabia. Whole-genome sequencing (WGS) identified a homozygous deep intronic mutation, c.254-649T > G, predicted to generate a novel donor splice site. CLRN1 minigene-based analysis confirmed the splicing of an aberrant exon due to usage of this novel motif, resulting in a frameshift and a premature termination codon. We identified this mutation in an additional two of seven unrelated mutation-negative Saudi USH1 patients. Locus-specific markers indicated that c.254-649T > G CLRN1 represents a founder allele that may significantly contribute to deafblindness in this population. Our finding underlines the potential of WGS to uncover atypically localized, hidden mutations in patients who lack exonic mutations in the known disease genes.
Genome wide analysis reveals Zic3 interaction with distal regulatory elements of stage specific developmental genes in zebrafish.

PubMed

Winata, Cecilia L; Kondrychyn, Igor; Kumar, Vibhor; Srinivasan, Kandhadayar G; Orlov, Yuriy; Ravishankar, Ashwini; Prabhakar, Shyam; Stanton, Lawrence W; Korzh, Vladimir; Mathavan, Sinnakaruppan

2013-10-01

Zic3 regulates early embryonic patterning in vertebrates. Loss of Zic3 function is known to disrupt gastrulation, left-right patterning, and neurogenesis. However, molecular events downstream of this transcription factor are poorly characterized. Here we use the zebrafish as a model to study the developmental role of Zic3 in vivo, by applying a combination of two powerful genomics approaches--ChIP-seq and microarray. Besides confirming direct regulation of previously implicated Zic3 targets of the Nodal and canonical Wnt pathways, analysis of gastrula stage embryos uncovered a number of novel candidate target genes, among which were members of the non-canonical Wnt pathway and the neural pre-pattern genes. A similar analysis in zic3-expressing cells obtained by FACS at segmentation stage revealed a dramatic shift in Zic3 binding site locations and identified an entirely distinct set of target genes associated with later developmental functions such as neural development. We demonstrate cis-regulation of several of these target genes by Zic3 using in vivo enhancer assay. Analysis of Zic3 binding sites revealed a distribution biased towards distal intergenic regions, indicative of a long distance regulatory mechanism; some of these binding sites are highly conserved during evolution and act as functional enhancers. This demonstrated that Zic3 regulation of developmental genes is achieved predominantly through long distance regulatory mechanism and revealed that developmental transitions could be accompanied by dramatic changes in regulatory landscape.
Comprehensive transcriptome-based characterization of differentially expressed genes involved in microsporogenesis of radish CMS line and its maintainer.

PubMed

Xie, Yang; Zhang, Wei; Wang, Yan; Xu, Liang; Zhu, Xianwen; Muleke, Everlyne M; Liu, Liwang

2016-09-01

Microsporogenesis is an indispensable period for investigating microspore development and cytoplasmic male sterility (CMS) occurrence. Radish CMS line plays a critical role in elite F1 hybrid seed production and heterosis utilization. However, the molecular mechanisms of microspore development and CMS occurrence have not been thoroughly uncovered in radish. In this study, a comparative analysis of radish floral buds from a CMS line (NAU-WA) and its maintainer (NAU-WB) was conducted using next generation sequencing (NGS) technology. Digital gene expression (DGE) profiling revealed that 3504 genes were significantly differentially expressed between NAU-WA and NAU-WB library, among which 1910 were upregulated and 1594 were downregulated. Gene ontology (GO) analysis showed that these differentially expressed genes (DEGs) were mainly enriched in extracellular region, catalytic activity, and response to stimulus. KEGG enrichment analysis revealed that the DEGs were predominantly associated with flavonoid biosynthesis, glycolysis, and biosynthesis of secondary metabolites. Real-time quantitative PCR analysis showed that the expression profiles of 13 randomly selected DEGs were in high agreement with results from Illumina sequencing. Several candidate genes encoding ATP synthase, auxin response factor (ARF), transcription factors (TFs), chalcone synthase (CHS), and male sterility (MS) were responsible for microsporogenesis. Furthermore, a schematic diagram for functional interaction of DEGs from NAU-WA vs. NAU-WB library in radish plants was proposed. These results could provide new information on the dissection of the molecular mechanisms underlying microspore development and CMS occurrence in radish.
High degree of genetic differentiation in marine three-spined sticklebacks (Gasterosteus aculeatus).

PubMed

Defaveri, Jacquelin; Shikano, Takahito; Shimada, Yukinori; Merilä, Juha

2013-09-01

Populations of widespread marine organisms are typically characterized by a low degree of genetic differentiation in neutral genetic markers, but much less is known about differentiation in genes whose functional roles are associated with specific selection regimes. To uncover possible adaptive population divergence and heterogeneous genomic differentiation in marine three-spined sticklebacks (Gasterosteus aculeatus), we used a candidate gene-based genome-scan approach to analyse variability in 138 microsatellite loci located within/close to (<6 kb) functionally important genes in samples collected from ten geographic locations. The degree of genetic differentiation in markers classified as neutral or under balancing selection-as determined with several outlier detection methods-was low (F(ST) = 0.033 or 0.011, respectively), whereas average FST for directionally selected markers was significantly higher (F(ST) = 0.097). Clustering analyses provided support for genomic and geographic heterogeneity in selection: six genetic clusters were identified based on allele frequency differences in the directionally selected loci, whereas four were identified with the neutral loci. Allelic variation in several loci exhibited significant associations with environmental variables, supporting the conjecture that temperature and salinity, but not optic conditions, are important drivers of adaptive divergence among populations. In general, these results suggest that in spite of the high degree of physical connectivity and gene flow as inferred from neutral marker genes, marine stickleback populations are strongly genetically structured in loci associated with functionally relevant genes. © 2013 John Wiley & Sons Ltd.
Physiological Investigation and Transcriptome Analysis of Polyethylene Glycol (PEG)-Induced Dehydration Stress in Cassava.

PubMed

Fu, Lili; Ding, Zehong; Han, Bingying; Hu, Wei; Li, Yajun; Zhang, Jiaming

2016-02-25

Cassava is an important tropical and sub-tropical root crop that is adapted to drought environment. However, severe drought stress significantly influences biomass accumulation and starchy root production. The mechanism underlying drought-tolerance remains obscure in cassava. In this study, changes of physiological characters and gene transcriptome profiles were investigated under dehydration stress simulated by polyethylene glycol (PEG) treatments. Five traits, including peroxidase (POD) activity, proline content, malondialdehyde (MDA), soluble sugar and soluble protein, were all dramatically induced in response to PEG treatment. RNA-seq analysis revealed a gradient decrease of differentially expressed (DE) gene number in tissues from bottom to top of a plant, suggesting that cassava root has a quicker response and more induced/depressed DE genes than leaves in response to drought. Overall, dynamic changes of gene expression profiles in cassava root and leaves were uncovered: genes related to glycolysis, abscisic acid and ethylene biosynthesis, lipid metabolism, protein degradation, and second metabolism of flavonoids were significantly induced, while genes associated with cell cycle/organization, cell wall synthesis and degradation, DNA synthesis and chromatin structure, protein synthesis, light reaction of photosynthesis, gibberelin pathways and abiotic stress were greatly depressed. Finally, novel pathways in ABA-dependent and ABA-independent regulatory networks underlying PEG-induced dehydration response in cassava were detected, and the RNA-Seq results of a subset of fifteen genes were confirmed by real-time PCR. The findings will improve our understanding of the mechanism related to dehydration stress-tolerance in cassava and will provide useful candidate genes for breeding of cassava varieties better adapted to drought environment.
Physiological Investigation and Transcriptome Analysis of Polyethylene Glycol (PEG)-Induced Dehydration Stress in Cassava

PubMed Central

Fu, Lili; Ding, Zehong; Han, Bingying; Hu, Wei; Li, Yajun; Zhang, Jiaming

2016-01-01

Cassava is an important tropical and sub-tropical root crop that is adapted to drought environment. However, severe drought stress significantly influences biomass accumulation and starchy root production. The mechanism underlying drought-tolerance remains obscure in cassava. In this study, changes of physiological characters and gene transcriptome profiles were investigated under dehydration stress simulated by polyethylene glycol (PEG) treatments. Five traits, including peroxidase (POD) activity, proline content, malondialdehyde (MDA), soluble sugar and soluble protein, were all dramatically induced in response to PEG treatment. RNA-seq analysis revealed a gradient decrease of differentially expressed (DE) gene number in tissues from bottom to top of a plant, suggesting that cassava root has a quicker response and more induced/depressed DE genes than leaves in response to drought. Overall, dynamic changes of gene expression profiles in cassava root and leaves were uncovered: genes related to glycolysis, abscisic acid and ethylene biosynthesis, lipid metabolism, protein degradation, and second metabolism of flavonoids were significantly induced, while genes associated with cell cycle/organization, cell wall synthesis and degradation, DNA synthesis and chromatin structure, protein synthesis, light reaction of photosynthesis, gibberelin pathways and abiotic stress were greatly depressed. Finally, novel pathways in ABA-dependent and ABA-independent regulatory networks underlying PEG-induced dehydration response in cassava were detected, and the RNA-Seq results of a subset of fifteen genes were confirmed by real-time PCR. The findings will improve our understanding of the mechanism related to dehydration stress-tolerance in cassava and will provide useful candidate genes for breeding of cassava varieties better adapted to drought environment. PMID:26927071
On the role of PDZ domain-encoding genes in Drosophila border cell migration.

PubMed

Aranjuez, George; Kudlaty, Elizabeth; Longworth, Michelle S; McDonald, Jocelyn A

2012-11-01

Cells often move as collective groups during normal embryonic development and wound healing, although the mechanisms governing this type of migration are poorly understood. The Drosophila melanogaster border cells migrate as a cluster during late oogenesis and serve as a powerful in vivo genetic model for collective cell migration. To discover new genes that participate in border cell migration, 64 out of 66 genes that encode PDZ domain-containing proteins were systematically targeted by in vivo RNAi knockdown. The PDZ domain is one of the largest families of protein-protein interaction domains found in eukaryotes. Proteins that contain PDZ domains participate in a variety of biological processes, including signal transduction and establishment of epithelial apical-basal polarity. Targeting PDZ proteins effectively assesses a larger number of genes via the protein complexes and pathways through which these proteins function. par-6, a known regulator of border cell migration, was a positive hit and thus validated the approach. Knockdown of 14 PDZ domain genes disrupted migration with multiple RNAi lines. The candidate genes have diverse predicted cellular functions and are anticipated to provide new insights into the mechanisms that control border cell movement. As a test of this concept, two genes that disrupted migration were characterized in more detail: big bang and the Dlg5 homolog CG6509. We present evidence that Big bang regulates JAK/STAT signaling, whereas Dlg5/CG6509 maintains cluster cohesion. Moreover, these results demonstrate that targeting a selected class of genes by RNAi can uncover novel regulators of collective cell migration.
Ensemble positive unlabeled learning for disease gene identification.

PubMed

Yang, Peng; Li, Xiaoli; Chua, Hon-Nian; Kwoh, Chee-Keong; Ng, See-Kiong

2014-01-01

An increasing number of genes have been experimentally confirmed in recent years as causative genes to various human diseases. The newly available knowledge can be exploited by machine learning methods to discover additional unknown genes that are likely to be associated with diseases. In particular, positive unlabeled learning (PU learning) methods, which require only a positive training set P (confirmed disease genes) and an unlabeled set U (the unknown candidate genes) instead of a negative training set N, have been shown to be effective in uncovering new disease genes in the current scenario. Using only a single source of data for prediction can be susceptible to bias due to incompleteness and noise in the genomic data and a single machine learning predictor prone to bias caused by inherent limitations of individual methods. In this paper, we propose an effective PU learning framework that integrates multiple biological data sources and an ensemble of powerful machine learning classifiers for disease gene identification. Our proposed method integrates data from multiple biological sources for training PU learning classifiers. A novel ensemble-based PU learning method EPU is then used to integrate multiple PU learning classifiers to achieve accurate and robust disease gene predictions. Our evaluation experiments across six disease groups showed that EPU achieved significantly better results compared with various state-of-the-art prediction methods as well as ensemble learning classifiers. Through integrating multiple biological data sources for training and the outputs of an ensemble of PU learning classifiers for prediction, we are able to minimize the potential bias and errors in individual data sources and machine learning algorithms to achieve more accurate and robust disease gene predictions. In the future, our EPU method provides an effective framework to integrate the additional biological and computational resources for better disease gene predictions.
Integrative Transcriptomic Analysis Uncovers Novel Gene Modules That Underlie the Sulfate Response in Arabidopsis thaliana

PubMed Central

Henríquez-Valencia, Carlos; Arenas-M, Anita; Medina, Joaquín; Canales, Javier

2018-01-01

Sulfur is an essential nutrient for plant growth and development. Sulfur is a constituent of proteins, the plasma membrane and cell walls, among other important cellular components. To obtain new insights into the gene regulatory networks underlying the sulfate response, we performed an integrative meta-analysis of transcriptomic data from five different sulfate experiments available in public databases. This bioinformatic approach allowed us to identify a robust set of genes whose expression depends only on sulfate availability, indicating that those genes play an important role in the sulfate response. In relation to sulfate metabolism, the biological function of approximately 45% of these genes is currently unknown. Moreover, we found several consistent Gene Ontology terms related to biological processes that have not been extensively studied in the context of the sulfate response; these processes include cell wall organization, carbohydrate metabolism, nitrogen compound transport, and the regulation of proteolysis. Gene co-expression network analyses revealed relationships between the sulfate-responsive genes that were distributed among seven function-specific co-expression modules. The most connected genes in the sulfate co-expression network belong to a module related to the carbon response, suggesting that this biological function plays an important role in the control of the sulfate response. Temporal analyses of the network suggest that sulfate starvation generates a biphasic response, which involves that major changes in gene expression occur during both the early and late responses. Network analyses predicted that the sulfate response is regulated by a limited number of transcription factors, including MYBs, bZIPs, and NF-YAs. In conclusion, our analysis identified new candidate genes and provided new hypotheses to advance our understanding of the transcriptional regulation of sulfate metabolism in plants. PMID:29692794
Integrative Transcriptomic Analysis Uncovers Novel Gene Modules That Underlie the Sulfate Response in Arabidopsis thaliana.

PubMed

Henríquez-Valencia, Carlos; Arenas-M, Anita; Medina, Joaquín; Canales, Javier

2018-01-01

Sulfur is an essential nutrient for plant growth and development. Sulfur is a constituent of proteins, the plasma membrane and cell walls, among other important cellular components. To obtain new insights into the gene regulatory networks underlying the sulfate response, we performed an integrative meta-analysis of transcriptomic data from five different sulfate experiments available in public databases. This bioinformatic approach allowed us to identify a robust set of genes whose expression depends only on sulfate availability, indicating that those genes play an important role in the sulfate response. In relation to sulfate metabolism, the biological function of approximately 45% of these genes is currently unknown. Moreover, we found several consistent Gene Ontology terms related to biological processes that have not been extensively studied in the context of the sulfate response; these processes include cell wall organization, carbohydrate metabolism, nitrogen compound transport, and the regulation of proteolysis. Gene co-expression network analyses revealed relationships between the sulfate-responsive genes that were distributed among seven function-specific co-expression modules. The most connected genes in the sulfate co-expression network belong to a module related to the carbon response, suggesting that this biological function plays an important role in the control of the sulfate response. Temporal analyses of the network suggest that sulfate starvation generates a biphasic response, which involves that major changes in gene expression occur during both the early and late responses. Network analyses predicted that the sulfate response is regulated by a limited number of transcription factors, including MYBs, bZIPs, and NF-YAs. In conclusion, our analysis identified new candidate genes and provided new hypotheses to advance our understanding of the transcriptional regulation of sulfate metabolism in plants.
Household Energy Conservation from Elementary Science Teacher Candidates' Perspective

ERIC Educational Resources Information Center

Sahin, Elvan

2016-01-01

This study was conducted to understand the complex nature of gender differentiation in household energy consumption, and uncover the factors characterizing Turkish female university students' contribution on household energy conservation. Specifically, the study hypothesized that energy-related attributes would significantly differentiate female…

An Evolutionarily Conserved Innate Immunity Protein Interaction Network*

PubMed Central

De Arras, Lesly; Seng, Amara; Lackford, Brad; Keikhaee, Mohammad R.; Bowerman, Bruce; Freedman, Jonathan H.; Schwartz, David A.; Alper, Scott

2013-01-01

The innate immune response plays a critical role in fighting infection; however, innate immunity also can affect the pathogenesis of a variety of diseases, including sepsis, asthma, cancer, and atherosclerosis. To identify novel regulators of innate immunity, we performed comparative genomics RNA interference screens in the nematode Caenorhabditis elegans and mouse macrophages. These screens have uncovered many candidate regulators of the response to lipopolysaccharide (LPS), several of which interact physically in multiple species to form an innate immunity protein interaction network. This protein interaction network contains several proteins in the canonical LPS-responsive TLR4 pathway as well as many novel interacting proteins. Using RNAi and overexpression studies, we show that almost every gene in this network can modulate the innate immune response in mouse cell lines. We validate the importance of this network in innate immunity regulation in vivo using available mutants in C. elegans and mice. PMID:23209288
Genetic screens for mutations affecting development of Xenopus tropicalis.

PubMed

Goda, Tadahiro; Abu-Daya, Anita; Carruthers, Samantha; Clark, Matthew D; Stemple, Derek L; Zimmerman, Lyle B

2006-06-01

We present here the results of forward and reverse genetic screens for chemically-induced mutations in Xenopus tropicalis. In our forward genetic screen, we have uncovered 77 candidate phenotypes in diverse organogenesis and differentiation processes. Using a gynogenetic screen design, which minimizes time and husbandry space expenditures, we find that if a phenotype is detected in the gynogenetic F2 of a given F1 female twice, it is highly likely to be a heritable abnormality (29/29 cases). We have also demonstrated the feasibility of reverse genetic approaches for obtaining carriers of mutations in specific genes, and have directly determined an induced mutation rate by sequencing specific exons from a mutagenized population. The Xenopus system, with its well-understood embryology, fate map, and gain-of-function approaches, can now be coupled with efficient loss-of-function genetic strategies for vertebrate functional genomics and developmental genetics.
Dissecting engineered cell types and enhancing cell fate conversion via CellNet

PubMed Central

Morris, Samantha A.; Cahan, Patrick; Li, Hu; Zhao, Anna M.; San Roman, Adrianna K.; Shivdasani, Ramesh A.; Collins, James J.; Daley, George Q.

2014-01-01

SUMMARY Engineering clinically relevant cells in vitro holds promise for regenerative medicine, but most protocols fail to faithfully recapitulate target cell properties. To address this, we developed CellNet, a network biology platform that determines whether engineered cells are equivalent to their target tissues, diagnoses aberrant gene regulatory networks, and prioritizes candidate transcriptional regulators to enhance engineered conversions. Using CellNet, we improved B cell to macrophage conversion, transcriptionally and functionally, by knocking down predicted B cell regulators. Analyzing conversion of fibroblasts to induced hepatocytes (iHeps), CellNet revealed an unexpected intestinal program regulated by the master regulator Cdx2. We observed long-term functional engraftment of mouse colon by iHeps, thereby establishing their broader potential as endoderm progenitors and demonstrating direct conversion of fibroblasts into intestinal epithelium. Our studies illustrate how CellNet can be employed to improve direct conversion and to uncover unappreciated properties of engineered cells. PMID:25126792
mRNA interactome capture in mammalian cells.

PubMed

Kastelic, Nicolai; Landthaler, Markus

2017-08-15

Throughout their entire life cycle, mRNAs are associated with RNA-binding proteins (RBPs), forming ribonucleoprotein (RNP) complexes with highly dynamic compositions. Their interplay is one key to control gene regulatory mechanisms from mRNA synthesis to decay. To assay the global scope of RNA-protein interactions, we and others have published a method combining crosslinking with highly stringent oligo(dT) affinity purification to enrich proteins associated with polyadenylated RNA (poly(A)+ RNA). Identification of the poly(A)+ RNA-bound proteome (also: mRNA interactome capture) has by now been applied to a diversity of cell lines and model organisms, uncovering comprehensive repertoires of RBPs and hundreds of novel RBP candidates. In addition to determining the RBP catalog in a given biological system, mRNA interactome capture allows the examination of changes in protein-mRNA interactions in response to internal and external stimuli, altered cellular programs and disease. Copyright © 2017. Published by Elsevier Inc.
Understanding Heterogeneity in the Effects of Birth Weight on Adult Cognition and Wages

PubMed Central

Cook, C. Justin; Fletcher, Jason M.

2015-01-01

A large economics literature has shown long term impacts of birth weight on adult outcomes, including IQ and earnings that are often robust to sibling or twin fixed effects. We examine potential mechanisms underlying these effects by incorporating findings from the genetics and neuroscience literatures. We use a sample of siblings combined with an “orchids and dandelions hypothesis”, where the IQ of genetic dandelions is not affected by in utero nutrition variation but genetic orchids thrive under advantageous conditions and wilt in poor conditions. Indeed, using variation in three candidate genes related to neuroplasticity (APOE, BDNF, and COMT), we find substantial heterogeneity in the associations between birth weight and adult outcomes, where part of the population (i.e., “dandelions”) is not affected by birth weight variation. Our results help uncover why birth weight affects adult outcomes. PMID:25770970
Involvement of MicroRNAs in Lung Cancer Biology and Therapy

PubMed Central

Liu, Xi; Sempere, Lorenzo F.; Guo, Yongli; Korc, Murray; Kauppinen, Sakari; Freemantle, Sarah J.; Dmitrovsky, Ethan

2011-01-01

MicroRNAs (miRNAs) are a class of small RNAs that regulate gene expression. Expression profiles of specific miRNAs have improved cancer diagnosis and classification and even provided prognostic information in many human cancers, including lung cancer. Tumor suppressive and oncogenic miRNAs were uncovered in lung carcinogenesis. The biological functions of these miRNAs in lung cancer were recently validated in well characterized cellular, murine transgenic as well as transplantable lung cancer models and in human paired normal-malignant lung tissue banks and tissue arrays. Tumor suppressive and oncogenic miRNAs that were identified in lung cancer will be reviewed here. Emphasis is placed on highlighting those functionally validated miRNAs that are not only biomarkers of lung carcinogenesis, but also candidate pharmacologic targets. How these miRNA findings advance an understanding of lung cancer biology and could improve lung cancer therapy are discussed in this article. PMID:21420030
Understanding heterogeneity in the effects of birth weight on adult cognition and wages.

PubMed

Justin Cook, C; Fletcher, Jason M

2015-05-01

A large economics literature has shown long term impacts of birth weight on adult outcomes, including IQ and earnings that are often robust to sibling or twin fixed effects. We examine potential mechanisms underlying these effects by incorporating findings from the genetics and neuroscience literatures. We use a sample of siblings combined with an "orchids and dandelions hypothesis", where the IQ of genetic dandelions is not affected by in utero nutrition variation but genetic orchids thrive under advantageous conditions and wilt in poor conditions. Indeed, using variation in three candidate genes related to neuroplasticity (APOE, BDNF, and COMT), we find substantial heterogeneity in the associations between birth weight and adult outcomes, where part of the population (i.e., "dandelions") is not affected by birth weight variation. Our results help uncover why birth weight affects adult outcomes. Copyright © 2015 Elsevier B.V. All rights reserved.
Genome and transcriptome sequencing characterises the gene space of Macadamia integrifolia (Proteaceae).

PubMed

Nock, Catherine J; Baten, Abdul; Barkla, Bronwyn J; Furtado, Agnelo; Henry, Robert J; King, Graham J

2016-11-17

The large Gondwanan plant family Proteaceae is an early-diverging eudicot lineage renowned for its morphological, taxonomic and ecological diversity. Macadamia is the most economically important Proteaceae crop and represents an ancient rainforest-restricted lineage. The family is a focus for studies of adaptive radiation due to remarkable species diversification in Mediterranean-climate biodiversity hotspots, and numerous evolutionary transitions between biomes. Despite a long history of research, comparative analyses in the Proteaceae and macadamia breeding programs are restricted by a paucity of genetic information. To address this, we sequenced the genome and transcriptome of the widely grown Macadamia integrifolia cultivar 741. Over 95 gigabases of DNA and RNA-seq sequence data were de novo assembled and annotated. The draft assembly has a total length of 518 Mb and spans approximately 79% of the estimated genome size. Following annotation, 35,337 protein-coding genes were predicted of which over 90% were expressed in at least one of the leaf, shoot or flower tissues examined. Gene family comparisons with five other eudicot species revealed 13,689 clusters containing macadamia genes and 1005 macadamia-specific clusters, and provides evidence for linage-specific expansion of gene families involved in pathogen recognition, plant defense and monoterpene synthesis. Cyanogenesis is an important defense strategy in the Proteaceae, and a detailed analysis of macadamia gene homologues potentially involved in cyanogenic glycoside biosynthesis revealed several highly expressed candidate genes. The gene space of macadamia provides a foundation for comparative genomics, gene discovery and the acceleration of molecular-assisted breeding. This study presents the first available genomic resources for the large basal eudicot family Proteaceae, access to most macadamia genes and opportunities to uncover the genetic basis of traits of importance for adaptation and crop improvement.
Transcriptomics-based analysis using RNA-Seq of the coconut (Cocos nucifera) leaf in response to yellow decline phytoplasma infection.

PubMed

Nejat, Naghmeh; Cahill, David M; Vadamalai, Ganesan; Ziemann, Mark; Rookes, James; Naderali, Neda

2015-10-01

Invasive phytoplasmas wreak havoc on coconut palms worldwide, leading to high loss of income, food insecurity and extreme poverty of farmers in producing countries. Phytoplasmas as strictly biotrophic insect-transmitted bacterial pathogens instigate distinct changes in developmental processes and defence responses of the infected plants and manipulate plants to their own advantage; however, little is known about the cellular and molecular mechanisms underlying host-phytoplasma interactions. Further, phytoplasma-mediated transcriptional alterations in coconut palm genes have not yet been identified. This study evaluated the whole transcriptome profiles of naturally infected leaves of Cocos nucifera ecotype Malayan Red Dwarf in response to yellow decline phytoplasma from group 16SrXIV, using RNA-Seq technique. Transcriptomics-based analysis reported here identified genes involved in coconut innate immunity. The number of down-regulated genes in response to phytoplasma infection exceeded the number of genes up-regulated. Of the 39,873 differentially expressed unigenes, 21,860 unigenes were suppressed and 18,013 were induced following infection. Comparative analysis revealed that genes associated with defence signalling against biotic stimuli were significantly overexpressed in phytoplasma-infected leaves versus healthy coconut leaves. Genes involving cell rescue and defence, cellular transport, oxidative stress, hormone stimulus and metabolism, photosynthesis reduction, transcription and biosynthesis of secondary metabolites were differentially represented. Our transcriptome analysis unveiled a core set of genes associated with defence of coconut in response to phytoplasma attack, although several novel defence response candidate genes with unknown function have also been identified. This study constitutes valuable sequence resource for uncovering the resistance genes and/or susceptibility genes which can be used as genetic tools in disease resistance breeding.
The prediction of candidate genes for cervix related cancer through gene ontology and graph theoretical approach.

PubMed

Hindumathi, V; Kranthi, T; Rao, S B; Manimaran, P

2014-06-01

With rapidly changing technology, prediction of candidate genes has become an indispensable task in recent years mainly in the field of biological research. The empirical methods for candidate gene prioritization that succors to explore the potential pathway between genetic determinants and complex diseases are highly cumbersome and labor intensive. In such a scenario predicting potential targets for a disease state through in silico approaches are of researcher's interest. The prodigious availability of protein interaction data coupled with gene annotation renders an ease in the accurate determination of disease specific candidate genes. In our work we have prioritized the cervix related cancer candidate genes by employing Csaba Ortutay and his co-workers approach of identifying the candidate genes through graph theoretical centrality measures and gene ontology. With the advantage of the human protein interaction data, cervical cancer gene sets and the ontological terms, we were able to predict 15 novel candidates for cervical carcinogenesis. The disease relevance of the anticipated candidate genes was corroborated through a literature survey. Also the presence of the drugs for these candidates was detected through Therapeutic Target Database (TTD) and DrugMap Central (DMC) which affirms that they may be endowed as potential drug targets for cervical cancer.
The emerging role of genomics in the diagnosis and workup of congenital urinary tract defects: a novel deletion syndrome on chromosome 3q13.31-22.1

PubMed Central

Materna-Kiryluk, Anna; Kiryluk, Krzysztof; Burgess, Katelyn E; Bieleninik, Arkadiusz; Sanna-Cherchi, Simone; Gharavi, Ali G.; Latos-Bielenska, Anna

2014-01-01

Background Copy number variants (CNVs) are increasingly recognized as an important cause of congenital malformations and likely explain over 16% cases of CAKUT. Here, we illustrate how a molecular diagnosis of CNV can inform the clinical management of a pediatric patient presenting with CAKUT and other organ defects. Methods We describe a 14 year-old girl with a large de novo deletion of chromosome 3q13.31-22.1 that disrupts 101 known genes and manifests with CAKUT, neurodevelopmental delay, agenesis of corpus callosum (ACC), cardiac malformations, electrolyte and endocrine disorders, skeletal abnormalities and dysmorphic features. We perform extensive annotation of the deleted region to prioritize genes for specific phenotypes and to predict future disease risk. Results Our case defined new minimal chromosomal candidate regions for both CAKUT and ACC. Moreover, the presence of the CASR gene in the deleted interval predicted a diagnosis of hypocalciuric hypercalcemia, which was confirmed by serum and urine chemistries. Our gene annotation explained clinical hypothyroidism and predicted that the index case is at increased risk of thoracic aortic aneurysm, renal cell carcinoma and myeloproliferative disorder. Conclusions Extended annotation of CNV regions refines diagnosis and uncovers previously unrecognized phenotypic features. This approach enables personalized treatment and prevention strategies in patients harboring genomic deletions. PMID:24292865
Transfer RNA Derived Small RNAs Targeting Defense Responsive Genes Are Induced during Phytophthora capsici Infection in Black Pepper (Piper nigrum L.)

PubMed Central

Asha, Srinivasan; Soniya, Eppurath V.

2016-01-01

Small RNAs derived from transfer RNAs were recently assigned as potential gene regulatory candidates for various stress responses in eukaryotes. In this study, we report on the cloning and identification of tRNA derived small RNAs from black pepper plants in response to the infection of the quick wilt pathogen, Phytophthora capsici. 5′tRFs cloned from black pepper were validated as highly expressed during P. capsici infection. A high-throughput systematic analysis of the small RNAome (sRNAome) revealed the predominance of 5′tRFs in the infected leaf and root. The abundance of 5′tRFs in the sRNAome and the defense responsive genes as their potential targets indicated their regulatory role during stress response in black pepper. The 5′AlaCGC tRF mediated cleavage was experimentally mapped at the tRF binding sites on the mRNA targets of Non-expresser of pathogenesis related protein (NPR1), which was down-regulated during pathogen infection. Comparative sRNAome further demonstrated sequence conservation of 5′Ala tRFs across the angiosperm plant groups, and many important genes in the defense response were identified in silico as their potential targets. Our findings uncovered the diversity, differential expression and stress responsive functional role of tRNA-derived small RNAs during Phytophthora infection in black pepper. PMID:27313593
Transfer RNA Derived Small RNAs Targeting Defense Responsive Genes Are Induced during Phytophthora capsici Infection in Black Pepper (Piper nigrum L.).

PubMed

Asha, Srinivasan; Soniya, Eppurath V

2016-01-01

Small RNAs derived from transfer RNAs were recently assigned as potential gene regulatory candidates for various stress responses in eukaryotes. In this study, we report on the cloning and identification of tRNA derived small RNAs from black pepper plants in response to the infection of the quick wilt pathogen, Phytophthora capsici. 5'tRFs cloned from black pepper were validated as highly expressed during P. capsici infection. A high-throughput systematic analysis of the small RNAome (sRNAome) revealed the predominance of 5'tRFs in the infected leaf and root. The abundance of 5'tRFs in the sRNAome and the defense responsive genes as their potential targets indicated their regulatory role during stress response in black pepper. The 5'Ala(CGC) tRF mediated cleavage was experimentally mapped at the tRF binding sites on the mRNA targets of Non-expresser of pathogenesis related protein (NPR1), which was down-regulated during pathogen infection. Comparative sRNAome further demonstrated sequence conservation of 5'Ala tRFs across the angiosperm plant groups, and many important genes in the defense response were identified in silico as their potential targets. Our findings uncovered the diversity, differential expression and stress responsive functional role of tRNA-derived small RNAs during Phytophthora infection in black pepper.
Drosophila and mammalian models uncover a role for the myoblast fusion gene TANC1 in rhabdomyosarcoma.

PubMed

Avirneni-Vadlamudi, Usha; Galindo, Kathleen A; Endicott, Tiana R; Paulson, Vera; Cameron, Scott; Galindo, Rene L

2012-01-01

Rhabdomyosarcoma (RMS) is a malignancy of muscle myoblasts, which fail to exit the cell cycle, resist terminal differentiation, and are blocked from fusing into syncytial skeletal muscle. In some patients, RMS is caused by a translocation that generates the fusion oncoprotein PAX-FOXO1, but the underlying RMS pathogenetic mechanisms that impede differentiation and promote neoplastic transformation remain unclear. Using a Drosophila model of PAX-FOXO1-mediated transformation, we show here that mutation in the myoblast fusion gene rolling pebbles (rols) dominantly suppresses PAX-FOXO1 lethality. Further analysis indicated that PAX-FOXO1 expression caused upregulation of rols, which suggests that Rols acts downstream of PAX-FOXO1. In mammalian myoblasts, gene silencing of Tanc1, an ortholog of rols, revealed that it is essential for myoblast fusion, but is dispensable for terminal differentiation. Misexpression of PAX-FOXO1 in myoblasts upregulated Tanc1 and blocked differentiation, whereas subsequent reduction of Tanc1 expression to native levels by RNAi restored both fusion and differentiation. Furthermore, decreasing human TANC1 gene expression caused RMS cancer cells to lose their neoplastic state, undergo fusion, and form differentiated syncytial muscle. Taken together, these findings identify misregulated myoblast fusion caused by ectopic TANC1 expression as a RMS neoplasia mechanism and suggest fusion molecules as candidates for targeted RMS therapy.
Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production.

PubMed

Roth, Melissa S; Cokus, Shawn J; Gallaher, Sean D; Walter, Andreas; Lopez, David; Erickson, Erika; Endelman, Benjamin; Westcott, Daniel; Larabell, Carolyn A; Merchant, Sabeeha S; Pellegrini, Matteo; Niyogi, Krishna K

2017-05-23

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis , because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. To advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ∼58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (∼6%), and a high fraction of protein-coding sequence (∼39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (∼73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase ( BKT ), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. The high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.
Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production

DOE PAGES

Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.; ...

2017-05-08

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. Here, to advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ~58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniformmore » gene density over chromosomes, low repetitive sequence content (~6%), and a high fraction of protein-coding sequence (~39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (~73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. Finally, the high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.« less
Hindsight regulates photoreceptor axon targeting through transcriptional control of jitterbug/Filamin and multiple genes involved in axon guidance in Drosophila.

PubMed

Oliva, Carlos; Molina-Fernandez, Claudia; Maureira, Miguel; Candia, Noemi; López, Estefanía; Hassan, Bassem; Aerts, Stein; Cánovas, José; Olguín, Patricio; Sierralta, Jimena

2015-09-01

During axon targeting, a stereotyped pattern of connectivity is achieved by the integration of intrinsic genetic programs and the response to extrinsic long and short-range directional cues. How this coordination occurs is the subject of intense study. Transcription factors play a central role due to their ability to regulate the expression of multiple genes required to sense and respond to these cues during development. Here we show that the transcription factor HNT regulates layer-specific photoreceptor axon targeting in Drosophila through transcriptional control of jbug/Filamin and multiple genes involved in axon guidance and cytoskeleton organization.Using a microarray analysis we identified 235 genes whose expression levels were changed by HNT overexpression in the eye primordia. We analyzed nine candidate genes involved in cytoskeleton regulation and axon guidance, six of which displayed significantly altered gene expression levels in hnt mutant retinas. Functional analysis confirmed the role of OTK/PTK7 in photoreceptor axon targeting and uncovered Tiggrin, an integrin ligand, and Jbug/Filamin, a conserved actin- binding protein, as new factors that participate of photoreceptor axon targeting. Moreover, we provided in silico and molecular evidence that supports jbug/Filamin as a direct transcriptional target of HNT and that HNT acts partially through Jbug/Filamin in vivo to regulate axon guidance. Our work broadens the understanding of how HNT regulates the coordinated expression of a group of genes to achieve the correct connectivity pattern in the Drosophila visual system. © 2015 Wiley Periodicals, Inc. Develop Neurobiol 75: 1018-1032, 2015. © 2015 Wiley Periodicals, Inc.
Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production

DOE Office of Scientific and Technical Information (OSTI.GOV)

Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. Here, to advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ~58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniformmore » gene density over chromosomes, low repetitive sequence content (~6%), and a high fraction of protein-coding sequence (~39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (~73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. Finally, the high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.« less
Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production

PubMed Central

Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.; Walter, Andreas; Lopez, David; Erickson, Erika; Endelman, Benjamin; Westcott, Daniel; Larabell, Carolyn A.; Merchant, Sabeeha S.; Pellegrini, Matteo

2017-01-01

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. To advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ∼58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (∼6%), and a high fraction of protein-coding sequence (∼39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (∼73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. The high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production. PMID:28484037
Exaptation of Transposable Elements into Novel Cis-Regulatory Elements: Is the Evidence Always Strong?

PubMed Central

de Souza, Flávio S.J.; Franchini, Lucía F.; Rubinstein, Marcelo

2013-01-01

Transposable elements (TEs) are mobile genetic sequences that can jump around the genome from one location to another, behaving as genomic parasites. TEs have been particularly effective in colonizing mammalian genomes, and such heavy TE load is expected to have conditioned genome evolution. Indeed, studies conducted both at the gene and genome levels have uncovered TE insertions that seem to have been co-opted—or exapted—by providing transcription factor binding sites (TFBSs) that serve as promoters and enhancers, leading to the hypothesis that TE exaptation is a major factor in the evolution of gene regulation. Here, we critically review the evidence for exaptation of TE-derived sequences as TFBSs, promoters, enhancers, and silencers/insulators both at the gene and genome levels. We classify the functional impact attributed to TE insertions into four categories of increasing complexity and argue that so far very few studies have conclusively demonstrated exaptation of TEs as transcriptional regulatory regions. We also contend that many genome-wide studies dealing with TE exaptation in recent lineages of mammals are still inconclusive and that the hypothesis of rapid transcriptional regulatory rewiring mediated by TE mobilization must be taken with caution. Finally, we suggest experimental approaches that may help attributing higher-order functions to candidate exapted TEs. PMID:23486611

Dissecting the genetic architecture of F1 hybrid sterility in house mice.

PubMed

Dzur-Gejdosova, Maria; Simecek, Petr; Gregorova, Sona; Bhattacharyya, Tanmoy; Forejt, Jiri

2012-11-01

Hybrid sterility as a postzygotic reproductive isolation mechanism has been studied for over 80 years, yet the first identifications of hybrid sterility genes in Drosophila and mouse are quite recent. To study the genetic architecture of F(1) hybrid sterility between young subspecies of house mouse Mus m. domesticus and M. m. musculus, we conducted QTL analysis of a backcross between inbred strains representing these two subspecies and probed the role of individual chromosomes in hybrid sterility using the intersubspecific chromosome substitution strains. We provide direct evidence that the asymmetry in male infertility between reciprocal crosses is conferred by the middle region of M. m. musculus Chr X, thus excluding other potential candidates such as Y, imprinted genes, and mitochondrial DNA. QTL analysis identified strong hybrid sterility loci on Chr 17 and Chr X and predicted a set of interchangeable autosomal loci, a subset of which is sufficient to activate the Dobzhansky-Muller incompatibility of the strong loci. Overall, our results indicate the oligogenic nature of F(1) hybrid sterility, which should be amenable to reconstruction by proper combination of chromosome substitution strains. Such a prefabricated model system should help to uncover the gene networks and molecular mechanisms underlying hybrid sterility. © 2012 The Author(s). Evolution© 2012 The Society for the Study of Evolution.
Repositioning of drugs using open-access data portal DTome: A test case with probenecid (Review).

PubMed

Ahmed, Mohammad U; Bennett, Dylan J; Hsieh, Tze-Chen; Doonan, Barbara B; Ahmed, Saba; Wu, Joseph M

2016-01-01

The one gene-one enzyme hypothesis, first introduced by Beadle and Tatum in the 1940s and based on their genetic analysis and observation of phenotype changes in Neurospora crassa challenged by various experimental conditions, has witnessed significant advances in recent decades. Much of our understanding of the association between genes and their phenotype expression has benefited from the completion of the human genome project, and has shown continual transformation guided by the effort directed at the annotation and characterization of human genes. Similarly, the idea of one drug‑one primary disease indication that traditionally has been the benchmark for the labeling and usage of drugs has also undergone evident progressive refinements; in recent years the science and practice of pharmaceutical development has notable success in the strategy of drug repurposing. Drug repurposing is an innovative approach where, instead of de novo synthesis and discovery of new drugs with novel indications, drug candidates with the desired usage are identified by a process of re‑profiling using an open‑source database or knowledge of known or failed drugs already in existence. In the present study, the repurposing drug strategy employing open‑access data portal drug‑target interactome (DTome) is applied to the uncovering of new clinical usage for probenecid.
Novel Insights into the Transcriptome of Dirofilaria immitis

PubMed Central

Zhang, Zhihe; Hou, Rong; Wu, Xuhang; Yang, Deying; Zhang, Runhui; Zheng, Wanpeng; Nie, Huaming; Xie, Yue; Yan, Ning; Yang, Zhi; Wang, Chengdong; Luo, Li; Liu, Li; Gu, Xiaobin; Wang, Shuxian; Peng, Xuerong; Yang, Guangyou

2012-01-01

Background The heartworm Dirofilaria immitis is the causal agent of cardiopulmonary dirofilariosis in dogs and cats, and also infects a wide range of wild mammals as well as humans. One bottleneck for the design of fundamentally new intervention and management strategies against D. immitis may be the currently limited knowledge of fundamental molecular aspects of D. immitis. Methodology/Principal Findings A next-generation sequencing platform combining computational approaches was employed to assess a global view of the heartworm transcriptome. A total of 20,810 unigenes (mean length = 1,270 bp) were assembled from 22.3 million clean reads. From these, 15,698 coding sequences (CDS) were inferred, and about 85% of the unigenes had orthologs/homologs in public databases. Comparative transcriptomic study uncovered 4,157 filarial-specific genes as well as 3,795 genes potentially involved in filarial-Wolbachia symbiosis. In addition, the potential intestine transcriptome of D. immitis (1,101 genes) was mined for the first time, which might help to discover ‘hidden antigens’. Conclusions/Significance This study provides novel insights into the transcriptome of D. immitis and sheds light on its molecular processes and survival mechanisms. Furthermore, it provides a platform to discover new vaccine candidates and potential targets for new drugs against dirofilariosis. PMID:22911833
Population genomic analysis uncovers environmental stress-driven selection and adaptation of Lentinula edodes population in China.

PubMed

Xiao, Yang; Cheng, Xuanjin; Liu, Jun; Li, Chuang; Nong, Wenyan; Bian, Yinbing; Cheung, Man Kit; Kwan, Hoi Shan

2016-11-10

The elucidation of genome-wide variations could help reveal aspects of divergence, domestication, and adaptation of edible mushrooms. Here, we resequenced the whole genomes of 39 wild and 21 cultivated strains of Chinese Lentinula edodes, the shiitake mushroom. We identified three distinct genetic groups in the Chinese L. edodes population with robust differentiation. Results of phylogenetic and population structure analyses suggest that the cultivated strains and most of the wild trains of L. edodes in China possess different gene pools and two outlier strains show signatures of hybridization between groups. Eighty-four candidate genes contributing to population divergence were detected in outlier analysis, 18 of which are involved in response to environmental stresses. Gene enrichment analysis of group-specific single nucleotide polymorphisms showed that the cultivated strains were genetically diversified in biological processes related to stress response. As the formation of fruiting bodies is a stress-response process, we postulate that environment factors, such as temperature, drove the population divergence of L. edodes in China by natural or artificial selection. We also found phenotypic variations between groups and identified some wild strains that have potential to diversify the genetic pool for improving agricultural traits of L. edodes cultivars in China.
Population genomic analysis uncovers environmental stress-driven selection and adaptation of Lentinula edodes population in China

PubMed Central

Xiao, Yang; Cheng, Xuanjin; Liu, Jun; Li, Chuang; Nong, Wenyan; Bian, Yinbing; Cheung, Man Kit; Kwan, Hoi Shan

2016-01-01

The elucidation of genome-wide variations could help reveal aspects of divergence, domestication, and adaptation of edible mushrooms. Here, we resequenced the whole genomes of 39 wild and 21 cultivated strains of Chinese Lentinula edodes, the shiitake mushroom. We identified three distinct genetic groups in the Chinese L. edodes population with robust differentiation. Results of phylogenetic and population structure analyses suggest that the cultivated strains and most of the wild trains of L. edodes in China possess different gene pools and two outlier strains show signatures of hybridization between groups. Eighty-four candidate genes contributing to population divergence were detected in outlier analysis, 18 of which are involved in response to environmental stresses. Gene enrichment analysis of group-specific single nucleotide polymorphisms showed that the cultivated strains were genetically diversified in biological processes related to stress response. As the formation of fruiting bodies is a stress-response process, we postulate that environment factors, such as temperature, drove the population divergence of L. edodes in China by natural or artificial selection. We also found phenotypic variations between groups and identified some wild strains that have potential to diversify the genetic pool for improving agricultural traits of L. edodes cultivars in China. PMID:27830835
Evolutionary Proteomics Uncovers Ancient Associations of Cilia with Signaling Pathways.

PubMed

Sigg, Monika Abedin; Menchen, Tabea; Lee, Chanjae; Johnson, Jeffery; Jungnickel, Melissa K; Choksi, Semil P; Garcia, Galo; Busengdal, Henriette; Dougherty, Gerard W; Pennekamp, Petra; Werner, Claudius; Rentzsch, Fabian; Florman, Harvey M; Krogan, Nevan; Wallingford, John B; Omran, Heymut; Reiter, Jeremy F

2017-12-18

Cilia are organelles specialized for movement and signaling. To infer when during evolution signaling pathways became associated with cilia, we characterized the proteomes of cilia from sea urchins, sea anemones, and choanoflagellates. We identified 437 high-confidence ciliary candidate proteins conserved in mammals and discovered that Hedgehog and G-protein-coupled receptor pathways were linked to cilia before the origin of bilateria and transient receptor potential (TRP) channels before the origin of animals. We demonstrated that candidates not previously implicated in ciliary biology localized to cilia and further investigated ENKUR, a TRP channel-interacting protein identified in the cilia of all three organisms. ENKUR localizes to motile cilia and is required for patterning the left-right axis in vertebrates. Moreover, mutation of ENKUR causes situs inversus in humans. Thus, proteomic profiling of cilia from diverse eukaryotes defines a conserved ciliary proteome, reveals ancient connections to signaling, and uncovers a ciliary protein that underlies development and human disease. Copyright © 2017 Elsevier Inc. All rights reserved.
QTL mapping and candidate genes for resistance to Fusarium ear rot and fumonisin contamination in maize.

PubMed

Maschietto, Valentina; Colombi, Cinzia; Pirona, Raul; Pea, Giorgio; Strozzi, Francesco; Marocco, Adriano; Rossini, Laura; Lanubile, Alessandra

2017-01-21

Fusarium verticillioides is a common maize pathogen causing ear rot (FER) and contamination of the grains with the fumonisin B1 (FB1) mycotoxin. Resistance to FER and FB1 contamination are quantitative traits, affected by environmental conditions, and completely resistant maize genotypes to the pathogen are so far unknown. In order to uncover genomic regions associated to reduced FER and FB1 contamination and identify molecular markers for assisted selection, an F 2:3 population of 188 progenies was developed crossing CO441 (resistant) and CO354 (susceptible) genotypes. FER severity and FB1 contamination content were evaluated over 2 years and sowing dates (early and late) in ears artificially inoculated with F. verticillioides by the use of either side-needle or toothpick inoculation techniques. Weather conditions significantly changed in the two phenotyping seasons and FER and FB1 content distribution significantly differed in the F 3 progenies according to the year and the sowing time. Significant positive correlations (P < 0.01) were detected between FER and FB1 contamination, ranging from 0.72 to 0.81. A low positive correlation was determined between FB1 contamination and silking time (DTS). A genetic map was generated for the cross, based on 41 microsatellite markers and 342 single nucleotide polymorphisms (SNPs) derived from Genotyping-by-Sequencing (GBS). QTL analyses revealed 15 QTLs for FER, 17 QTLs for FB1 contamination and nine QTLs for DTS. Eight QTLs located on linkage group (LG) 1, 2, 3, 6, 7 and 9 were in common between FER and FB1, making possible the selection of genotypes with both low disease severity and low fumonisin contamination. Moreover, five QTLs on LGs 1, 2, 4, 5 and 9 located close to previously reported QTLs for resistance to other mycotoxigenic fungi. Finally, 24 candidate genes for resistance to F. verticillioides are proposed combining previous transcriptomic data with QTL mapping. This study identified a set of QTLs and candidate genes that could accelerate breeding for resistance of maize lines showing reduced disease severity and low mycotoxin contamination determined by F. verticillioides.
THE USE OF GENE ARRAYS TO DETERMINE TEMPORAL GENE INDUCTION IN SHEEPSHEAD MINNOWS EXPOSED TO E2

EPA Science Inventory

Gene arrays provide a means to study differential gene expression in fish exposed to environmental estrogens by providing a "snapshot" of the genes expressed at a given time. Such array data may also uncover previously unknown biochemical pathways affected by estrogenic compounds...
Genetic dissection of ethanol tolerance in the budding yeast Saccharomyces cerevisiae.

PubMed

Hu, X H; Wang, M H; Tan, T; Li, J R; Yang, H; Leach, L; Zhang, R M; Luo, Z W

2007-03-01

Uncovering genetic control of variation in ethanol tolerance in natural populations of yeast Saccharomyces cerevisiae is essential for understanding the evolution of fermentation, the dominant lifestyle of the species, and for improving efficiency of selection for strains with high ethanol tolerance, a character of great economic value for the brewing and biofuel industries. To date, as many as 251 genes have been predicted to be involved in influencing this character. Candidacy of these genes was determined from a tested phenotypic effect following gene knockout, from an induced change in gene function under an ethanol stress condition, or by mutagenesis. This article represents the first genomics approach for dissecting genetic variation in ethanol tolerance between two yeast strains with a highly divergent trait phenotype. We developed a simple but reliable experimental protocol for scoring the phenotype and a set of STR/SNP markers evenly covering the whole genome. We created a mapping population comprising 319 segregants from crossing the parental strains. On the basis of the data sets, we find that the tolerance trait has a high heritability and that additive genetic variance dominates genetic variation of the trait. Segregation at five QTL detected has explained approximately 50% of phenotypic variation; in particular, the major QTL mapped on yeast chromosome 9 has accounted for a quarter of the phenotypic variation. We integrated the QTL analysis with the predicted candidacy of ethanol resistance genes and found that only a few of these candidates fall in the QTL regions.
Integrated Genome-wide association and hypothalamus eQTL studies indicate a link between the circadian rhythm-related gene PER1 and coping behavior

PubMed Central

Ponsuksili, Siriluck; Zebunke, Manuela; Murani, Eduard; Trakooljul, Nares; Krieter, Joachim; Puppe, Birger; Schwerin, Manfred; Wimmers, Klaus

2015-01-01

Animal personality and coping styles are basic concepts for evaluating animal welfare. Struggling response of piglets in so-called backtests early in life reflects their coping strategy. Behavioral reactions of piglets in backtests have a moderate heritability, but their genetic basis largely remains unknown. Here, latency, duration and frequency of struggling attempts during one-minute backtests were repeatedly recorded of piglets at days 5, 12, 19, and 26. A genome-wide association study for backtest traits revealed 465 significant SNPs (FDR ≤ 0.05) mostly located in QTL (quantitative trait locus) regions on chromosome 3, 5, 12 and 16. In order to capture genes in these regions, 37 transcripts with significant SNPs were selected for expressionQTL analysis in the hypothalamus. Eight genes (ASGR1, CPAMD8, CTC1, FBXO39, IL19, LOC100511790, RAD51B, UBOX5) had cis- and five (RANGRF, PER1, PDZRN3, SH2D4B, LONP2) had trans-expressionQTL. In particular, for PER1, with known physiological implications for maintenance of circadian rhythms, a role in coping behavior was evidenced by confirmed association in an independent population. For CTC1 a cis-expression QTL and the consistent relationship of gene polymorphism, mRNA expression level and backtest traits promoted its link to coping style. GWAS and eQTL analyses uncovered positional and functional gene candidates for coping behavior. PMID:26537429
Integrated Genome-wide association and hypothalamus eQTL studies indicate a link between the circadian rhythm-related gene PER1 and coping behavior.

PubMed

Ponsuksili, Siriluck; Zebunke, Manuela; Murani, Eduard; Trakooljul, Nares; Krieter, Joachim; Puppe, Birger; Schwerin, Manfred; Wimmers, Klaus

2015-11-05

Animal personality and coping styles are basic concepts for evaluating animal welfare. Struggling response of piglets in so-called backtests early in life reflects their coping strategy. Behavioral reactions of piglets in backtests have a moderate heritability, but their genetic basis largely remains unknown. Here, latency, duration and frequency of struggling attempts during one-minute backtests were repeatedly recorded of piglets at days 5, 12, 19, and 26. A genome-wide association study for backtest traits revealed 465 significant SNPs (FDR ≤ 0.05) mostly located in QTL (quantitative trait locus) regions on chromosome 3, 5, 12 and 16. In order to capture genes in these regions, 37 transcripts with significant SNPs were selected for expressionQTL analysis in the hypothalamus. Eight genes (ASGR1, CPAMD8, CTC1, FBXO39, IL19, LOC100511790, RAD51B, UBOX5) had cis- and five (RANGRF, PER1, PDZRN3, SH2D4B, LONP2) had trans-expressionQTL. In particular, for PER1, with known physiological implications for maintenance of circadian rhythms, a role in coping behavior was evidenced by confirmed association in an independent population. For CTC1 a cis-expression QTL and the consistent relationship of gene polymorphism, mRNA expression level and backtest traits promoted its link to coping style. GWAS and eQTL analyses uncovered positional and functional gene candidates for coping behavior.
Population genomics of Fusarium graminearum reveals signatures of divergent evolution within a major cereal pathogen

PubMed Central

2018-01-01

The cereal pathogen Fusarium graminearum is the primary cause of Fusarium head blight (FHB) and a significant threat to food safety and crop production. To elucidate population structure and identify genomic targets of selection within major FHB pathogen populations in North America we sequenced the genomes of 60 diverse F. graminearum isolates. We also assembled the first pan-genome for F. graminearum to clarify population-level differences in gene content potentially contributing to pathogen diversity. Bayesian and phylogenomic analyses revealed genetic structure associated with isolates that produce the novel NX-2 mycotoxin, suggesting a North American population that has remained genetically distinct from other endemic and introduced cereal-infecting populations. Genome scans uncovered distinct signatures of selection within populations, focused in high diversity, frequently recombining regions. These patterns suggested selection for genomic divergence at the trichothecene toxin gene cluster and thirteen additional regions containing genes potentially involved in pathogen specialization. Gene content differences further distinguished populations, in that 121 genes showed population-specific patterns of conservation. Genes that differentiated populations had predicted functions related to pathogenesis, secondary metabolism and antagonistic interactions, though a subset had unique roles in temperature and light sensitivity. Our results indicated that F. graminearum populations are distinguished by dozens of genes with signatures of selection and an array of dispensable accessory genes, suggesting that FHB pathogen populations may be equipped with different traits to exploit the agroecosystem. These findings provide insights into the evolutionary processes and genomic features contributing to population divergence in plant pathogens, and highlight candidate genes for future functional studies of pathogen specialization across evolutionarily and ecologically diverse fungi. PMID:29584736
Genetic loci associated with coronary artery disease harbor evidence of selection and antagonistic pleiotropy.

PubMed

Byars, Sean G; Huang, Qin Qin; Gray, Lesley-Ann; Bakshi, Andrew; Ripatti, Samuli; Abraham, Gad; Stearns, Stephen C; Inouye, Michael

2017-06-01

Traditional genome-wide scans for positive selection have mainly uncovered selective sweeps associated with monogenic traits. While selection on quantitative traits is much more common, very few signals have been detected because of their polygenic nature. We searched for positive selection signals underlying coronary artery disease (CAD) in worldwide populations, using novel approaches to quantify relationships between polygenic selection signals and CAD genetic risk. We identified new candidate adaptive loci that appear to have been directly modified by disease pressures given their significant associations with CAD genetic risk. These candidates were all uniquely and consistently associated with many different male and female reproductive traits suggesting selection may have also targeted these because of their direct effects on fitness. We found that CAD loci are significantly enriched for lifetime reproductive success relative to the rest of the human genome, with evidence that the relationship between CAD and lifetime reproductive success is antagonistic. This supports the presence of antagonistic-pleiotropic tradeoffs on CAD loci and provides a novel explanation for the maintenance and high prevalence of CAD in modern humans. Lastly, we found that positive selection more often targeted CAD gene regulatory variants using HapMap3 lymphoblastoid cell lines, which further highlights the unique biological significance of candidate adaptive loci underlying CAD. Our study provides a novel approach for detecting selection on polygenic traits and evidence that modern human genomes have evolved in response to CAD-induced selection pressures and other early-life traits sharing pleiotropic links with CAD.
No Evidence That Schizophrenia Candidate Genes Are More Associated With Schizophrenia Than Noncandidate Genes.

PubMed

Johnson, Emma C; Border, Richard; Melroy-Greif, Whitney E; de Leeuw, Christiaan A; Ehringer, Marissa A; Keller, Matthew C

2017-11-15

A recent analysis of 25 historical candidate gene polymorphisms for schizophrenia in the largest genome-wide association study conducted to date suggested that these commonly studied variants were no more associated with the disorder than would be expected by chance. However, the same study identified other variants within those candidate genes that demonstrated genome-wide significant associations with schizophrenia. As such, it is possible that variants within historic schizophrenia candidate genes are associated with schizophrenia at levels above those expected by chance, even if the most-studied specific polymorphisms are not. The present study used association statistics from the largest schizophrenia genome-wide association study conducted to date as input to a gene set analysis to investigate whether variants within schizophrenia candidate genes are enriched for association with schizophrenia. As a group, variants in the most-studied candidate genes were no more associated with schizophrenia than were variants in control sets of noncandidate genes. While a small subset of candidate genes did appear to be significantly associated with schizophrenia, these genes were not particularly noteworthy given the large number of more strongly associated noncandidate genes. The history of schizophrenia research should serve as a cautionary tale to candidate gene investigators examining other phenotypes: our findings indicate that the most investigated candidate gene hypotheses of schizophrenia are not well supported by genome-wide association studies, and it is likely that this will be the case for other complex traits as well. Copyright © 2017 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
Functional Properties of Lactobacillus mucosae Strains Isolated from Brazilian Goat Milk.

PubMed

de Moraes, Georgia Maciel Dias; de Abreu, Louricélia Rodrigues; do Egito, Antônio Silvio; Salles, Hévila Oliveira; da Silva, Liana Maria Ferreira; Nero, Luís Augusto; Todorov, Svetoslav Dimitrov; Dos Santos, Karina Maria Olbrich

2017-09-01

The search for probiotic candidates among lactic acid bacteria (LAB) isolated from food may uncover new strains with promising health and technological properties. Lactobacillus mucosae strains attracted recent research attention due to their ability to adhere to intestinal mucus and to inhibit pathogens in the gastrointestinal tract, both related to a probiotic potential. Properties of interest and safety aspects of three Lb. mucosae strains (CNPC006, CNPC007, and CNPC009) isolated from goat milk were investigated employing in vitro tests. The presence of genetic factors related to bile salt hydrolase production (bsh), intestinal adhesion properties (msa, map, mub, and ef-tu), virulence, and biogenic amine production were also verified. All strains exhibited the target map, mub, and ef-tu sequences; the msa gene was detected in CNPC006 and CNPC007 strains. Some of the searched sequences for virulence factors were detected, especially in the CNPC009 strain; all strains carried the hyl gene, related to the production of hyaluronidase. Lb. mucosae CNPC007 exhibited a high survival rate in simulated gastric and enteric conditions. Besides, all strains exhibited the bsh sequence, and CNPC006 and CNPC007 were able to deconjugate salts of glycodeoxycholic acid (GDC). Regarding technological properties for dairy product applications, a relatively higher milk acidification and clotting capacity, diacetyl production, and proteolytic activity were registered for CNPC007 in comparison to the other strains. Collectively, the results aim at Lb. mucosae CNPC007 as a promising probiotic candidate for application in dairy products, deserving further studies to confirm and explore its potential.
Targeting SOD1 induces synthetic lethal killing in BLM- and CHEK2-deficient colorectal cancer cells

PubMed Central

Sajesh, Babu V.; McManus, Kirk J.

2015-01-01

Cancer is a major cause of death throughout the world, and there is a large need for better and more personalized approaches to combat the disease. Over the past decade, synthetic lethal approaches have been developed that are designed to exploit the aberrant molecular origins (i.e. defective genes) that underlie tumorigenesis. BLM and CHEK2 are two evolutionarily conserved genes that are somatically altered in a number of tumor types. Both proteins normally function in preserving genome stability through facilitating the accurate repair of DNA double strand breaks. Thus, uncovering synthetic lethal interactors of BLM and CHEK2 will identify novel candidate drug targets and lead chemical compounds. Here we identify an evolutionarily conserved synthetic lethal interaction between SOD1 and both BLM and CHEK2 in two distinct cell models. Using quantitative imaging microscopy, real-time cellular analyses, colony formation and tumor spheroid models we show that SOD1 silencing and inhibition (ATTM and LCS-1 treatments), or the induction of reactive oxygen species (2ME2 treatment) induces selective killing within BLM- and CHEK2-deficient cells relative to controls. We further show that increases in reactive oxygen species follow SOD1 silencing and inhibition that are associated with the persistence of DNA double strand breaks, and increases in apoptosis. Collectively, these data identify SOD1 as a novel candidate drug target in BLM and CHEK2 cancer contexts, and further suggest that 2ME2, ATTM and LCS-1 are lead therapeutic compounds warranting further pre-clinical study. PMID:26318585
Targeting SOD1 induces synthetic lethal killing in BLM- and CHEK2-deficient colorectal cancer cells.

PubMed

Sajesh, Babu V; McManus, Kirk J

2015-09-29

Cancer is a major cause of death throughout the world, and there is a large need for better and more personalized approaches to combat the disease. Over the past decade, synthetic lethal approaches have been developed that are designed to exploit the aberrant molecular origins (i.e. defective genes) that underlie tumorigenesis. BLM and CHEK2 are two evolutionarily conserved genes that are somatically altered in a number of tumor types. Both proteins normally function in preserving genome stability through facilitating the accurate repair of DNA double strand breaks. Thus, uncovering synthetic lethal interactors of BLM and CHEK2 will identify novel candidate drug targets and lead chemical compounds. Here we identify an evolutionarily conserved synthetic lethal interaction between SOD1 and both BLM and CHEK2 in two distinct cell models. Using quantitative imaging microscopy, real-time cellular analyses, colony formation and tumor spheroid models we show that SOD1 silencing and inhibition (ATTM and LCS-1 treatments), or the induction of reactive oxygen species (2ME2 treatment) induces selective killing within BLM- and CHEK2-deficient cells relative to controls. We further show that increases in reactive oxygen species follow SOD1 silencing and inhibition that are associated with the persistence of DNA double strand breaks, and increases in apoptosis. Collectively, these data identify SOD1 as a novel candidate drug target in BLM and CHEK2 cancer contexts, and further suggest that 2ME2, ATTM and LCS-1 are lead therapeutic compounds warranting further pre-clinical study.
Uncovering production of specialized metabolites by Streptomyces argillaceus: Activation of cryptic biosynthesis gene clusters using nutritional and genetic approaches.

PubMed

Becerril, Adriana; Álvarez, Susana; Braña, Alfredo F; Rico, Sergio; Díaz, Margarita; Santamaría, Ramón I; Salas, José A; Méndez, Carmen

2018-01-01

Sequencing of Streptomyces genomes has revealed they harbor a high number of biosynthesis gene cluster (BGC), which uncovered their enormous potentiality to encode specialized metabolites. However, these metabolites are not usually produced under standard laboratory conditions. In this manuscript we report the activation of BGCs for antimycins, carotenoids, germicidins and desferrioxamine compounds in Streptomyces argillaceus, and the identification of the encoded compounds. This was achieved by following different strategies, including changing the growth conditions, heterologous expression of the cluster and inactivating the adpAa or overexpressing the abrC3 global regulatory genes. In addition, three new carotenoid compounds have been identified.
An In-Depth Characterization of the Major Psoriasis Susceptibility Locus Identifies Candidate Susceptibility Alleles within an HLA-C Enhancer Element

PubMed Central

Clop, Alex; Bertoni, Anna; Spain, Sarah L.; Simpson, Michael A.; Pullabhatla, Venu; Tonda, Raul; Hundhausen, Christian; Di Meglio, Paola; De Jong, Pieter; Hayday, Adrian C.; Nestle, Frank O.; Barker, Jonathan N.; Bell, Robert J. A.; Capon, Francesca; Trembath, Richard C.

2013-01-01

Psoriasis is an immune-mediated skin disorder that is inherited as a complex genetic trait. Although genome-wide association scans (GWAS) have identified 36 disease susceptibility regions, more than 50% of the genetic variance can be attributed to a single Major Histocompatibility Complex (MHC) locus, known as PSORS1. Genetic studies indicate that HLA-C is the strongest PSORS1 candidate gene, since markers tagging HLA-Cw*0602 consistently generate the most significant association signals in GWAS. However, it is unclear whether HLA-Cw*0602 is itself the causal PSORS1 allele, especially as the role of SNPs that may affect its expression has not been investigated. Here, we have undertaken an in-depth molecular characterization of the PSORS1 interval, with a view to identifying regulatory variants that may contribute to disease susceptibility. By analysing high-density SNP data, we refined PSORS1 to a 179 kb region encompassing HLA-C and the neighbouring HCG27 pseudogene. We compared multiple MHC sequences spanning this refined locus and identified 144 candidate susceptibility variants, which are unique to chromosomes bearing HLA-Cw*0602. In parallel, we investigated the epigenetic profile of the critical PSORS1 interval and uncovered three enhancer elements likely to be active in T lymphocytes. Finally we showed that nine candidate susceptibility SNPs map within a HLA-C enhancer and that three of these variants co-localise with binding sites for immune-related transcription factors. These data indicate that SNPs affecting HLA-Cw*0602 expression are likely to contribute to psoriasis susceptibility and highlight the importance of integrating multiple experimental approaches in the investigation of complex genomic regions such as the MHC. PMID:23990973
Cloning of the Arabidopsis rwm1 gene for resistance to Watermelon mosaic virus points to a new function for natural virus resistance genes.

PubMed

Ouibrahim, Laurence; Mazier, Marianne; Estevan, Joan; Pagny, Gaëlle; Decroocq, Véronique; Desbiez, Cécile; Moretti, André; Gallois, Jean-Luc; Caranta, Carole

2014-09-01

Arabidopsis thaliana represents a valuable and efficient model to understand mechanisms underlying plant susceptibility to viral diseases. Here, we describe the identification and molecular cloning of a new gene responsible for recessive resistance to several isolates of Watermelon mosaic virus (WMV, genus Potyvirus) in the Arabidopsis Cvi-0 accession. rwm1 acts at an early stage of infection by impairing viral accumulation in initially infected leaf tissues. Map-based cloning delimited rwm1 on chromosome 1 in a 114-kb region containing 30 annotated genes. Positional and functional candidate gene analysis suggested that rwm1 encodes cPGK2 (At1g56190), an evolutionary conserved nucleus-encoded chloroplast phosphoglycerate kinase with a key role in cell metabolism. Comparative sequence analysis indicates that a single amino acid substitution (S78G) in the N-terminal domain of cPGK2 is involved in rwm1-mediated resistance. This mutation may have functional consequences because it targets a highly conserved residue, affects a putative phosphorylation site and occurs within a predicted nuclear localization signal. Transgenic complementation in Arabidopsis together with virus-induced gene silencing in Nicotiana benthamiana confirmed that cPGK2 corresponds to rwm1 and that the protein is required for efficient WMV infection. This work uncovers new insight into natural plant resistance mechanisms that may provide interesting opportunities for the genetic control of plant virus diseases. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.

Comprehensive analysis of transcriptome variation uncovers known and novel driver events in T-cell acute lymphoblastic leukemia.

PubMed

Atak, Zeynep Kalender; Gianfelici, Valentina; Hulselmans, Gert; De Keersmaecker, Kim; Devasia, Arun George; Geerdens, Ellen; Mentens, Nicole; Chiaretti, Sabina; Durinck, Kaat; Uyttebroeck, Anne; Vandenberghe, Peter; Wlodarska, Iwona; Cloos, Jacqueline; Foà, Robin; Speleman, Frank; Cools, Jan; Aerts, Stein

2013-01-01

RNA-seq is a promising technology to re-sequence protein coding genes for the identification of single nucleotide variants (SNV), while simultaneously obtaining information on structural variations and gene expression perturbations. We asked whether RNA-seq is suitable for the detection of driver mutations in T-cell acute lymphoblastic leukemia (T-ALL). These leukemias are caused by a combination of gene fusions, over-expression of transcription factors and cooperative point mutations in oncogenes and tumor suppressor genes. We analyzed 31 T-ALL patient samples and 18 T-ALL cell lines by high-coverage paired-end RNA-seq. First, we optimized the detection of SNVs in RNA-seq data by comparing the results with exome re-sequencing data. We identified known driver genes with recurrent protein altering variations, as well as several new candidates including H3F3A, PTK2B, and STAT5B. Next, we determined accurate gene expression levels from the RNA-seq data through normalizations and batch effect removal, and used these to classify patients into T-ALL subtypes. Finally, we detected gene fusions, of which several can explain the over-expression of key driver genes such as TLX1, PLAG1, LMO1, or NKX2-1; and others result in novel fusion transcripts encoding activated kinases (SSBP2-FER and TPM3-JAK2) or involving MLLT10. In conclusion, we present novel analysis pipelines for variant calling, variant filtering, and expression normalization on RNA-seq data, and successfully applied these for the detection of translocations, point mutations, INDELs, exon-skipping events, and expression perturbations in T-ALL.
Blood expression profiles of fragile X premutation carriers identify candidate genes involved in neurodegenerative and infertility phenotypes.

PubMed

Mateu-Huertas, Elisabet; Rodriguez-Revenga, Laia; Alvarez-Mora, Maria Isabel; Madrigal, Irene; Willemsen, Rob; Milà, Montserrat; Martí, Eulàlia; Estivill, Xavier

2014-05-01

Male premutation carriers presenting between 55 and 200 CGG repeats in the Fragile-X-associated (FMR1) gene are at risk of developing Fragile X Tremor/Ataxia Syndrome (FXTAS), and females undergo Premature Ovarian Failure (POF1). Here, we have evaluated gene expression profiles from blood in male FMR1 premutation carriers and detected a strong deregulation of genes enriched in FXTAS relevant biological pathways, including inflammation, neuronal homeostasis and viability. Gene expression profiling distinguished between control individuals, carriers with FXTAS and carriers without FXTAS, with levels of expanded FMR1 mRNA being increased in FXTAS patients. In vitro studies in a neuronal cell model indicate that expression levels of expanded FMR1 5'-UTR are relevant in modulating the transcriptome. Thus, perturbations of the transcriptome may be an interplay between the CGG expansion size and FMR1 expression levels. Several deregulated genes (DFFA, BCL2L11, BCL2L1, APP, SOD1, RNF10, HDAC5, KCNC3, ATXN7, ATXN3 and EAP1) were validated in brain samples of a FXTAS mouse model. Downregulation of EAP1, a gene involved in the female reproductive system physiology, was confirmed in female carriers. Decreased levels were detected in female carriers with POF1 compared to those without POF1, suggesting that EAP1 levels contribute to ovarian insufficiency. In summary, gene expression profiling in blood has uncovered mechanisms that may underlie different pathological aspects of the premutation. A better understanding of the transcriptome dynamics in relation with expanded FMR1 mRNA expression levels and CGG expansion size may provide mechanistic insights into the disease process and a more accurate FXTAS diagnosis to the myriad of phenotypes associated with the premutation. Copyright © 2014. Published by Elsevier Inc.
Characterization of the product of a nonribosomal peptide synthetase-like (NRPS-like) gene using the doxycycline dependent Tet-on system in Aspergillus terreus.

PubMed

Sun, Wei-Wen; Guo, Chun-Jun; Wang, Clay C C

2016-04-01

Genome sequencing of the fungus Aspergillus terreus uncovered a number of silent core structural biosynthetic genes encoding enzymes presumed to be involved in the production of cryptic secondary metabolites. There are five nonribosomal peptide synthetase (NRPS)-like genes with the predicted A-T-TE domain architecture within the A. terreus genome. Among the five genes, only the product of pgnA remains unknown. The Tet-on system is an inducible, tunable and metabolism-independent expression system originally developed for Aspergillus niger. Here we report the adoption of the Tet-on system as an effective gene activation tool in A. terreus. Application of this system in A. terreus allowed us to uncover the product of the cryptic NRPS-like gene, pgnA. Furthermore expression of pgnA in the heterologous Aspergillus nidulans host suggested that the pgnA gene alone is necessary for phenguignardic acid (1) biosynthesis. Copyright © 2016 Elsevier Inc. All rights reserved.
A Predictive Model of the Oxygen and Heme Regulatory Network in Yeast

PubMed Central

Kundaje, Anshul; Xin, Xiantong; Lan, Changgui; Lianoglou, Steve; Zhou, Mei; Zhang, Li; Leslie, Christina

2008-01-01

Deciphering gene regulatory mechanisms through the analysis of high-throughput expression data is a challenging computational problem. Previous computational studies have used large expression datasets in order to resolve fine patterns of coexpression, producing clusters or modules of potentially coregulated genes. These methods typically examine promoter sequence information, such as DNA motifs or transcription factor occupancy data, in a separate step after clustering. We needed an alternative and more integrative approach to study the oxygen regulatory network in Saccharomyces cerevisiae using a small dataset of perturbation experiments. Mechanisms of oxygen sensing and regulation underlie many physiological and pathological processes, and only a handful of oxygen regulators have been identified in previous studies. We used a new machine learning algorithm called MEDUSA to uncover detailed information about the oxygen regulatory network using genome-wide expression changes in response to perturbations in the levels of oxygen, heme, Hap1, and Co2+. MEDUSA integrates mRNA expression, promoter sequence, and ChIP-chip occupancy data to learn a model that accurately predicts the differential expression of target genes in held-out data. We used a novel margin-based score to extract significant condition-specific regulators and assemble a global map of the oxygen sensing and regulatory network. This network includes both known oxygen and heme regulators, such as Hap1, Mga2, Hap4, and Upc2, as well as many new candidate regulators. MEDUSA also identified many DNA motifs that are consistent with previous experimentally identified transcription factor binding sites. Because MEDUSA's regulatory program associates regulators to target genes through their promoter sequences, we directly tested the predicted regulators for OLE1, a gene specifically induced under hypoxia, by experimental analysis of the activity of its promoter. In each case, deletion of the candidate regulator resulted in the predicted effect on promoter activity, confirming that several novel regulators identified by MEDUSA are indeed involved in oxygen regulation. MEDUSA can reveal important information from a small dataset and generate testable hypotheses for further experimental analysis. Supplemental data are included. PMID:19008939
A side-effect free method for identifying cancer drug targets.

PubMed

Ashraf, Md Izhar; Ong, Seng-Kai; Mujawar, Shama; Pawar, Shrikant; More, Pallavi; Paul, Somnath; Lahiri, Chandrajit

2018-04-27

Identifying effective drug targets, with little or no side effects, remains an ever challenging task. A potential pitfall of failing to uncover the correct drug targets, due to side effect of pleiotropic genes, might lead the potential drugs to be illicit and withdrawn. Simplifying disease complexity, for the investigation of the mechanistic aspects and identification of effective drug targets, have been done through several approaches of protein interactome analysis. Of these, centrality measures have always gained importance in identifying candidate drug targets. Here, we put forward an integrated method of analysing a complex network of cancer and depict the importance of k-core, functional connectivity and centrality (KFC) for identifying effective drug targets. Essentially, we have extracted the proteins involved in the pathways leading to cancer from the pathway databases which enlist real experimental datasets. The interactions between these proteins were mapped to build an interactome. Integrative analyses of the interactome enabled us to unearth plausible reasons for drugs being rendered withdrawn, thereby giving future scope to pharmaceutical industries to potentially avoid them (e.g. ESR1, HDAC2, F2, PLG, PPARA, RXRA, etc). Based upon our KFC criteria, we have shortlisted ten proteins (GRB2, FYN, PIK3R1, CBL, JAK2, LCK, LYN, SYK, JAK1 and SOCS3) as effective candidates for drug development.
Genetics of obesity: can an old dog teach us new tricks?

PubMed

Yeo, Giles S H

2017-05-01

At one level, obesity is clearly a problem of simple physics, a result of eating too much and not expending enough energy. The more complex question, however, is why do some people eat more than others? Studies of human and mouse genetics over the past two decades have uncovered a number of pathways within the brain that play a key role in the control of food intake. A prime example is the leptin-melanocortin pathway, which we now know greatly contributes to mammalian appetitive behaviour. However, genetic disruption of this pathway remains rare and does not represent the major burden of the disease that is carried by those of us with 'common obesity'. In recent years, genome-wide association studies have revealed more than 100 different candidate genes linked to BMI, with most (including many components of the melanocortin pathway) acting in the central nervous system and influencing food intake. So while severe disruption of the melanocortin pathway results in severe obesity, subtle variations in these genes influence where you might sit in the normal distribution of BMI. As we now enter this 'post-genomics' world, can this new information influence our treatment and management of obese patients?
Multiple capacitors for natural genetic variation in Drosophila melanogaster.

PubMed

Takahashi, Kazuo H

2013-03-01

Cryptic genetic variation (CGV) or a standing genetic variation that is not ordinarily expressed as a phenotype is released when the robustness of organisms is impaired under environmental or genetic perturbations. Evolutionary capacitors modulate the amount of genetic variation exposed to natural selection and hidden cryptically; they have a fundamental effect on the evolvability of traits on evolutionary timescales. In this study, I have demonstrated the effects of multiple genomic regions of Drosophila melanogaster on CGV in wing shape. I examined the effects of 61 genomic deficiencies on quantitative and qualitative natural genetic variation in the wing shape of D. melanogaster. I have identified 10 genomic deficiencies that do not encompass a known candidate evolutionary capacitor, Hsp90, exposing natural CGV differently depending on the location of the deficiencies in the genome. Furthermore, five genomic deficiencies uncovered qualitative CGV in wing morphology. These findings suggest that CGV in wing shape of wild-type D. melanogaster is regulated by multiple capacitors with divergent functions. Future analysis of genes encompassed by these genomic regions would help elucidate novel capacitor genes and better understand the general features of capacitors regarding natural genetic variation. © 2012 Blackwell Publishing Ltd.
Rare and low-frequency coding variants alter human adult height.

PubMed

Marouli, Eirini; Graff, Mariaelisa; Medina-Gomez, Carolina; Lo, Ken Sin; Wood, Andrew R; Kjaer, Troels R; Fine, Rebecca S; Lu, Yingchang; Schurmann, Claudia; Highland, Heather M; Rüeger, Sina; Thorleifsson, Gudmar; Justice, Anne E; Lamparter, David; Stirrups, Kathleen E; Turcot, Valérie; Young, Kristin L; Winkler, Thomas W; Esko, Tõnu; Karaderi, Tugce; Locke, Adam E; Masca, Nicholas G D; Ng, Maggie C Y; Mudgal, Poorva; Rivas, Manuel A; Vedantam, Sailaja; Mahajan, Anubha; Guo, Xiuqing; Abecasis, Goncalo; Aben, Katja K; Adair, Linda S; Alam, Dewan S; Albrecht, Eva; Allin, Kristine H; Allison, Matthew; Amouyel, Philippe; Appel, Emil V; Arveiler, Dominique; Asselbergs, Folkert W; Auer, Paul L; Balkau, Beverley; Banas, Bernhard; Bang, Lia E; Benn, Marianne; Bergmann, Sven; Bielak, Lawrence F; Blüher, Matthias; Boeing, Heiner; Boerwinkle, Eric; Böger, Carsten A; Bonnycastle, Lori L; Bork-Jensen, Jette; Bots, Michiel L; Bottinger, Erwin P; Bowden, Donald W; Brandslund, Ivan; Breen, Gerome; Brilliant, Murray H; Broer, Linda; Burt, Amber A; Butterworth, Adam S; Carey, David J; Caulfield, Mark J; Chambers, John C; Chasman, Daniel I; Chen, Yii-Der Ida; Chowdhury, Rajiv; Christensen, Cramer; Chu, Audrey Y; Cocca, Massimiliano; Collins, Francis S; Cook, James P; Corley, Janie; Galbany, Jordi Corominas; Cox, Amanda J; Cuellar-Partida, Gabriel; Danesh, John; Davies, Gail; de Bakker, Paul I W; de Borst, Gert J; de Denus, Simon; de Groot, Mark C H; de Mutsert, Renée; Deary, Ian J; Dedoussis, George; Demerath, Ellen W; den Hollander, Anneke I; Dennis, Joe G; Di Angelantonio, Emanuele; Drenos, Fotios; Du, Mengmeng; Dunning, Alison M; Easton, Douglas F; Ebeling, Tapani; Edwards, Todd L; Ellinor, Patrick T; Elliott, Paul; Evangelou, Evangelos; Farmaki, Aliki-Eleni; Faul, Jessica D; Feitosa, Mary F; Feng, Shuang; Ferrannini, Ele; Ferrario, Marco M; Ferrieres, Jean; Florez, Jose C; Ford, Ian; Fornage, Myriam; Franks, Paul W; Frikke-Schmidt, Ruth; Galesloot, Tessel E; Gan, Wei; Gandin, Ilaria; Gasparini, Paolo; Giedraitis, Vilmantas; Giri, Ayush; Girotto, Giorgia; Gordon, Scott D; Gordon-Larsen, Penny; Gorski, Mathias; Grarup, Niels; Grove, Megan L; Gudnason, Vilmundur; Gustafsson, Stefan; Hansen, Torben; Harris, Kathleen Mullan; Harris, Tamara B; Hattersley, Andrew T; Hayward, Caroline; He, Liang; Heid, Iris M; Heikkilä, Kauko; Helgeland, Øyvind; Hernesniemi, Jussi; Hewitt, Alex W; Hocking, Lynne J; Hollensted, Mette; Holmen, Oddgeir L; Hovingh, G Kees; Howson, Joanna M M; Hoyng, Carel B; Huang, Paul L; Hveem, Kristian; Ikram, M Arfan; Ingelsson, Erik; Jackson, Anne U; Jansson, Jan-Håkan; Jarvik, Gail P; Jensen, Gorm B; Jhun, Min A; Jia, Yucheng; Jiang, Xuejuan; Johansson, Stefan; Jørgensen, Marit E; Jørgensen, Torben; Jousilahti, Pekka; Jukema, J Wouter; Kahali, Bratati; Kahn, René S; Kähönen, Mika; Kamstrup, Pia R; Kanoni, Stavroula; Kaprio, Jaakko; Karaleftheri, Maria; Kardia, Sharon L R; Karpe, Fredrik; Kee, Frank; Keeman, Renske; Kiemeney, Lambertus A; Kitajima, Hidetoshi; Kluivers, Kirsten B; Kocher, Thomas; Komulainen, Pirjo; Kontto, Jukka; Kooner, Jaspal S; Kooperberg, Charles; Kovacs, Peter; Kriebel, Jennifer; Kuivaniemi, Helena; Küry, Sébastien; Kuusisto, Johanna; La Bianca, Martina; Laakso, Markku; Lakka, Timo A; Lange, Ethan M; Lange, Leslie A; Langefeld, Carl D; Langenberg, Claudia; Larson, Eric B; Lee, I-Te; Lehtimäki, Terho; Lewis, Cora E; Li, Huaixing; Li, Jin; Li-Gao, Ruifang; Lin, Honghuang; Lin, Li-An; Lin, Xu; Lind, Lars; Lindström, Jaana; Linneberg, Allan; Liu, Yeheng; Liu, Yongmei; Lophatananon, Artitaya; Luan, Jian'an; Lubitz, Steven A; Lyytikäinen, Leo-Pekka; Mackey, David A; Madden, Pamela A F; Manning, Alisa K; Männistö, Satu; Marenne, Gaëlle; Marten, Jonathan; Martin, Nicholas G; Mazul, Angela L; Meidtner, Karina; Metspalu, Andres; Mitchell, Paul; Mohlke, Karen L; Mook-Kanamori, Dennis O; Morgan, Anna; Morris, Andrew D; Morris, Andrew P; Müller-Nurasyid, Martina; Munroe, Patricia B; Nalls, Mike A; Nauck, Matthias; Nelson, Christopher P; Neville, Matt; Nielsen, Sune F; Nikus, Kjell; Njølstad, Pål R; Nordestgaard, Børge G; Ntalla, Ioanna; O'Connel, Jeffrey R; Oksa, Heikki; Loohuis, Loes M Olde; Ophoff, Roel A; Owen, Katharine R; Packard, Chris J; Padmanabhan, Sandosh; Palmer, Colin N A; Pasterkamp, Gerard; Patel, Aniruddh P; Pattie, Alison; Pedersen, Oluf; Peissig, Peggy L; Peloso, Gina M; Pennell, Craig E; Perola, Markus; Perry, James A; Perry, John R B; Person, Thomas N; Pirie, Ailith; Polasek, Ozren; Posthuma, Danielle; Raitakari, Olli T; Rasheed, Asif; Rauramaa, Rainer; Reilly, Dermot F; Reiner, Alex P; Renström, Frida; Ridker, Paul M; Rioux, John D; Robertson, Neil; Robino, Antonietta; Rolandsson, Olov; Rudan, Igor; Ruth, Katherine S; Saleheen, Danish; Salomaa, Veikko; Samani, Nilesh J; Sandow, Kevin; Sapkota, Yadav; Sattar, Naveed; Schmidt, Marjanka K; Schreiner, Pamela J; Schulze, Matthias B; Scott, Robert A; Segura-Lepe, Marcelo P; Shah, Svati; Sim, Xueling; Sivapalaratnam, Suthesh; Small, Kerrin S; Smith, Albert Vernon; Smith, Jennifer A; Southam, Lorraine; Spector, Timothy D; Speliotes, Elizabeth K; Starr, John M; Steinthorsdottir, Valgerdur; Stringham, Heather M; Stumvoll, Michael; Surendran, Praveen; 't Hart, Leen M; Tansey, Katherine E; Tardif, Jean-Claude; Taylor, Kent D; Teumer, Alexander; Thompson, Deborah J; Thorsteinsdottir, Unnur; Thuesen, Betina H; Tönjes, Anke; Tromp, Gerard; Trompet, Stella; Tsafantakis, Emmanouil; Tuomilehto, Jaakko; Tybjaerg-Hansen, Anne; Tyrer, Jonathan P; Uher, Rudolf; Uitterlinden, André G; Ulivi, Sheila; van der Laan, Sander W; Van Der Leij, Andries R; van Duijn, Cornelia M; van Schoor, Natasja M; van Setten, Jessica; Varbo, Anette; Varga, Tibor V; Varma, Rohit; Edwards, Digna R Velez; Vermeulen, Sita H; Vestergaard, Henrik; Vitart, Veronique; Vogt, Thomas F; Vozzi, Diego; Walker, Mark; Wang, Feijie; Wang, Carol A; Wang, Shuai; Wang, Yiqin; Wareham, Nicholas J; Warren, Helen R; Wessel, Jennifer; Willems, Sara M; Wilson, James G; Witte, Daniel R; Woods, Michael O; Wu, Ying; Yaghootkar, Hanieh; Yao, Jie; Yao, Pang; Yerges-Armstrong, Laura M; Young, Robin; Zeggini, Eleftheria; Zhan, Xiaowei; Zhang, Weihua; Zhao, Jing Hua; Zhao, Wei; Zhao, Wei; Zheng, He; Zhou, Wei; Rotter, Jerome I; Boehnke, Michael; Kathiresan, Sekar; McCarthy, Mark I; Willer, Cristen J; Stefansson, Kari; Borecki, Ingrid B; Liu, Dajiang J; North, Kari E; Heard-Costa, Nancy L; Pers, Tune H; Lindgren, Cecilia M; Oxvig, Claus; Kutalik, Zoltán; Rivadeneira, Fernando; Loos, Ruth J F; Frayling, Timothy M; Hirschhorn, Joel N; Deloukas, Panos; Lettre, Guillaume

2017-02-09

Height is a highly heritable, classic polygenic trait with approximately 700 common associated variants identified through genome-wide association studies so far. Here, we report 83 height-associated coding variants with lower minor-allele frequencies (in the range of 0.1-4.8%) and effects of up to 2 centimetres per allele (such as those in IHH, STC2, AR and CRISPLD2), greater than ten times the average effect of common variants. In functional follow-up studies, rare height-increasing alleles of STC2 (giving an increase of 1-2 centimetres per allele) compromised proteolytic inhibition of PAPP-A and increased cleavage of IGFBP-4 in vitro, resulting in higher bioavailability of insulin-like growth factors. These 83 height-associated variants overlap genes that are mutated in monogenic growth disorders and highlight new biological candidates (such as ADAMTS3, IL11RA and NOX4) and pathways (such as proteoglycan and glycosaminoglycan synthesis) involved in growth. Our results demonstrate that sufficiently large sample sizes can uncover rare and low-frequency variants of moderate-to-large effect associated with polygenic human phenotypes, and that these variants implicate relevant genes and pathways.
Endophytic actinobacteria: Diversity, secondary metabolism and mechanisms to unsilence biosynthetic gene clusters.

PubMed

Dinesh, Raghavan; Srinivasan, Veeraraghavan; T E, Sheeja; Anandaraj, Muthuswamy; Srambikkal, Hamza

2017-09-01

Endophytic actinobacteria, which reside in the inner tissues of host plants, are gaining serious attention due to their capacity to produce a plethora of secondary metabolites (e.g. antibiotics) possessing a wide variety of biological activity with diverse functions. This review encompasses the recent reports on endophytic actinobacterial species diversity, in planta habitats and mechanisms underlying their mode of entry into plants. Besides, their metabolic potential, novel bioactive compounds they produce and mechanisms to unravel their hidden metabolic repertoire by activation of cryptic or silent biosynthetic gene clusters (BGCs) for eliciting novel secondary metabolite production are discussed. The study also reviews the classical conservative techniques (chemical/biological/physical elicitation, co-culturing) as well as modern microbiology tools (e.g. next generation sequencing) that are being gainfully employed to uncover the vast hidden scaffolds for novel secondary metabolites produced by these endophytes, which would subsequently herald a revolution in drug engineering. The potential role of these endophytes in the agro-environment as promising biological candidates for inhibition of phytopathogens and the way forward to thoroughly exploit this unique microbial community by inducing expression of cryptic BGCs for encoding unseen products with novel therapeutic properties are also discussed.
Computational Analysis of Candidate Disease Genes and Variants for Salt-Sensitive Hypertension in Indigenous Southern Africans

PubMed Central

Tiffin, Nicki; Meintjes, Ayton; Ramesar, Rajkumar; Bajic, Vladimir B.; Rayner, Brian

2010-01-01

Multiple factors underlie susceptibility to essential hypertension, including a significant genetic and ethnic component, and environmental effects. Blood pressure response of hypertensive individuals to salt is heterogeneous, but salt sensitivity appears more prevalent in people of indigenous African origin. The underlying genetics of salt-sensitive hypertension, however, are poorly understood. In this study, computational methods including text- and data-mining have been used to select and prioritize candidate aetiological genes for salt-sensitive hypertension. Additionally, we have compared allele frequencies and copy number variation for single nucleotide polymorphisms in candidate genes between indigenous Southern African and Caucasian populations, with the aim of identifying candidate genes with significant variability between the population groups: identifying genetic variability between population groups can exploit ethnic differences in disease prevalence to aid with prioritisation of good candidate genes. Our top-ranking candidate genes include parathyroid hormone precursor (PTH) and type-1angiotensin II receptor (AGTR1). We propose that the candidate genes identified in this study warrant further investigation as potential aetiological genes for salt-sensitive hypertension. PMID:20886000
The wheat Phs-A1 pre-harvest sprouting resistance locus delays the rate of seed dormancy loss and maps 0.3 cM distal to the PM19 genes in UK germplasm

PubMed Central

Shorinola, Oluwaseyi; Bird, Nicholas; Simmonds, James; Berry, Simon; Henriksson, Tina; Jack, Peter; Werner, Peter; Gerjets, Tanja; Scholefield, Duncan; Balcárková, Barbara; Valárik, Miroslav; Holdsworth, M. J.; Flintham, John; Uauy, Cristobal

2016-01-01

The precocious germination of cereal grains before harvest, also known as pre-harvest sprouting, is an important source of yield and quality loss in cereal production. Pre-harvest sprouting is a complex grain defect and is becoming an increasing challenge due to changing climate patterns. Resistance to sprouting is multi-genic, although a significant proportion of the sprouting variation in modern wheat cultivars is controlled by a few major quantitative trait loci, including Phs-A1 in chromosome arm 4AL. Despite its importance, little is known about the physiological basis and the gene(s) underlying this important locus. In this study, we characterized Phs-A1 and show that it confers resistance to sprouting damage by affecting the rate of dormancy loss during dry seed after-ripening. We show Phs-A1 to be effective even when seeds develop at low temperature (13 °C). Comparative analysis of syntenic Phs-A1 intervals in wheat and Brachypodium uncovered ten orthologous genes, including the Plasma Membrane 19 genes (PM19-A1 and PM19-A2) previously proposed as the main candidates for this locus. However, high-resolution fine-mapping in two bi-parental UK mapping populations delimited Phs-A1 to an interval 0.3 cM distal to the PM19 genes. This study suggests the possibility that more than one causal gene underlies this major pre-harvest sprouting locus. The information and resources reported in this study will help test this hypothesis across a wider set of germplasm and will be of importance for breeding more sprouting resilient wheat varieties. PMID:27217549
Whole Body Melanoma Transcriptome Response in Medaka

PubMed Central

Schartl, Manfred; Shen, Yingjia; Maurus, Katja; Walter, Ron; Tomlinson, Chad; Wilson, Richard K.; Postlethwait, John; Warren, Wesley C.

2015-01-01

The incidence of malignant melanoma continues to increase each year with poor prognosis for survival in many relapse cases. To reverse this trend, whole body response measures are needed to discover collaborative paths to primary and secondary malignancy. Several species of fish provide excellent melanoma models because fish and human melanocytes both appear in the epidermis, and fish and human pigment cell tumors share conserved gene expression signatures. For the first time, we have examined the whole body transcriptome response to invasive melanoma as a prelude to using transcriptome profiling to screen for drugs in a medaka (Oryzias latipes) model. We generated RNA-seq data from whole body RNA isolates for controls and melanoma fish. After testing for differential expression, 396 genes had significantly different expression (adjusted p-value <0.02) in the whole body transcriptome between melanoma and control fish; 379 of these genes were matched to human orthologs with 233 having annotated human gene symbols and 14 matched genes that contain putative deleterious variants in human melanoma at varying levels of recurrence. A detailed canonical pathway evaluation for significant enrichment showed the top scoring pathway to be antigen presentation but also included the expected melanocyte development and pigmentation signaling pathway. Results revealed a profound down-regulation of genes involved in the immune response, especially the innate immune system. We hypothesize that the developing melanoma actively suppresses the immune system responses of the body in reacting to the invasive malignancy, and that this mal-adaptive response contributes to disease progression, a result that suggests our whole-body transcriptomic approach merits further use. In these findings, we also observed novel genes not yet identified in human melanoma expression studies and uncovered known and new candidate drug targets for further testing in this malignant melanoma medaka model. PMID:26714172
Whole Body Melanoma Transcriptome Response in Medaka.

PubMed

Schartl, Manfred; Shen, Yingjia; Maurus, Katja; Walter, Ron; Tomlinson, Chad; Wilson, Richard K; Postlethwait, John; Warren, Wesley C

2015-01-01

The incidence of malignant melanoma continues to increase each year with poor prognosis for survival in many relapse cases. To reverse this trend, whole body response measures are needed to discover collaborative paths to primary and secondary malignancy. Several species of fish provide excellent melanoma models because fish and human melanocytes both appear in the epidermis, and fish and human pigment cell tumors share conserved gene expression signatures. For the first time, we have examined the whole body transcriptome response to invasive melanoma as a prelude to using transcriptome profiling to screen for drugs in a medaka (Oryzias latipes) model. We generated RNA-seq data from whole body RNA isolates for controls and melanoma fish. After testing for differential expression, 396 genes had significantly different expression (adjusted p-value <0.02) in the whole body transcriptome between melanoma and control fish; 379 of these genes were matched to human orthologs with 233 having annotated human gene symbols and 14 matched genes that contain putative deleterious variants in human melanoma at varying levels of recurrence. A detailed canonical pathway evaluation for significant enrichment showed the top scoring pathway to be antigen presentation but also included the expected melanocyte development and pigmentation signaling pathway. Results revealed a profound down-regulation of genes involved in the immune response, especially the innate immune system. We hypothesize that the developing melanoma actively suppresses the immune system responses of the body in reacting to the invasive malignancy, and that this mal-adaptive response contributes to disease progression, a result that suggests our whole-body transcriptomic approach merits further use. In these findings, we also observed novel genes not yet identified in human melanoma expression studies and uncovered known and new candidate drug targets for further testing in this malignant melanoma medaka model.
De Novo Assembly and Annotation of the Transcriptome of the Agricultural Weed Ipomoea purpurea Uncovers Gene Expression Changes Associated with Herbicide Resistance

PubMed Central

Leslie, Trent; Baucom, Regina S.

2014-01-01

Human-mediated selection can lead to rapid evolution in very short time scales, and the evolution of herbicide resistance in agricultural weeds is an excellent example of this phenomenon. The common morning glory, Ipomoea purpurea, is resistant to the herbicide glyphosate, but genetic investigations of this trait have been hampered by the lack of genomic resources for this species. Here, we present the annotated transcriptome of the common morning glory, Ipomoea purpurea, along with an examination of whole genome expression profiling to assess potential gene expression differences between three artificially selected herbicide resistant lines and three susceptible lines. The assembled Ipomoea transcriptome reported in this work contains 65,459 assembled transcripts, ~28,000 of which were functionally annotated by assignment to Gene Ontology categories. Our RNA-seq survey using this reference transcriptome identified 19 differentially expressed genes associated with resistance—one of which, a cytochrome P450, belongs to a large plant family of genes involved in xenobiotic detoxification. The differentially expressed genes also broadly implicated receptor-like kinases, which were down-regulated in the resistant lines, and other growth and defense genes, which were up-regulated in resistant lines. Interestingly, the target of glyphosate—EPSP synthase—was not overexpressed in the resistant Ipomoea lines as in other glyphosate resistant weeds. Overall, this work identifies potential candidate resistance loci for future investigations and dramatically increases genomic resources for this species. The assembled transcriptome presented herein will also provide a valuable resource to the Ipomoea community, as well as to those interested in utilizing the close relationship between the Convolvulaceae and the Solanaceae for phylogenetic and comparative genomics examinations. PMID:25155274
De novo assembly and annotation of the transcriptome of the agricultural weed Ipomoea purpurea uncovers gene expression changes associated with herbicide resistance.

PubMed

Leslie, Trent; Baucom, Regina S

2014-08-25

Human-mediated selection can lead to rapid evolution in very short time scales, and the evolution of herbicide resistance in agricultural weeds is an excellent example of this phenomenon. The common morning glory, Ipomoea purpurea, is resistant to the herbicide glyphosate, but genetic investigations of this trait have been hampered by the lack of genomic resources for this species. Here, we present the annotated transcriptome of the common morning glory, Ipomoea purpurea, along with an examination of whole genome expression profiling to assess potential gene expression differences between three artificially selected herbicide resistant lines and three susceptible lines. The assembled Ipomoea transcriptome reported in this work contains 65,459 assembled transcripts, ~28,000 of which were functionally annotated by assignment to Gene Ontology categories. Our RNA-seq survey using this reference transcriptome identified 19 differentially expressed genes associated with resistance-one of which, a cytochrome P450, belongs to a large plant family of genes involved in xenobiotic detoxification. The differentially expressed genes also broadly implicated receptor-like kinases, which were down-regulated in the resistant lines, and other growth and defense genes, which were up-regulated in resistant lines. Interestingly, the target of glyphosate-EPSP synthase-was not overexpressed in the resistant Ipomoea lines as in other glyphosate resistant weeds. Overall, this work identifies potential candidate resistance loci for future investigations and dramatically increases genomic resources for this species. The assembled transcriptome presented herein will also provide a valuable resource to the Ipomoea community, as well as to those interested in utilizing the close relationship between the Convolvulaceae and the Solanaceae for phylogenetic and comparative genomics examinations. Copyright © 2014 Leslie and Baucom.
Horizontally Acquired Biosynthesis Genes Boost Coxiella burnetii's Physiology.

PubMed

Moses, Abraham S; Millar, Jess A; Bonazzi, Matteo; Beare, Paul A; Raghavan, Rahul

2017-01-01

Coxiella burnetii , the etiologic agent of acute Q fever and chronic endocarditis, has a unique biphasic life cycle, which includes a metabolically active intracellular form that occupies a large lysosome-derived acidic vacuole. C. burnetii is the only bacterium known to thrive within such an hostile intracellular niche, and this ability is fundamental to its pathogenicity; however, very little is known about genes that facilitate Coxiella 's intracellular growth. Recent studies indicate that C. burnetii evolved from a tick-associated ancestor and that the metabolic capabilities of C. burnetii are different from that of Coxiella -like bacteria found in ticks. Horizontally acquired genes that allow C. burnetii to infect and grow within mammalian cells likely facilitated the host shift; however, because of its obligate intracellular replication, C. burnetii would have lost most genes that have been rendered redundant due to the availability of metabolites within the host cell. Based on these observations, we reasoned that horizontally derived biosynthetic genes that have been retained in the reduced genome of C. burnetii are ideal candidates to begin to uncover its intracellular metabolic requirements. Our analyses identified a large number of putative foreign-origin genes in C. burnetii , including tRNA Glu 2 that is potentially required for heme biosynthesis, and genes involved in the production of lipopolysaccharide-a virulence factor, and of critical metabolites such as fatty acids and biotin. In comparison to wild-type C. burnetii , a strain that lacks tRNA Glu 2 exhibited reduced growth, indicating its importance to Coxiella 's physiology. Additionally, by using chemical agents that block heme and biotin biosyntheses, we show that these pathways are promising targets for the development of new anti- Coxiella therapies.
RNA-Seq analysis uncovers non-coding small RNA system of Mycobacterium neoaurum in the metabolism of sterols to accumulate steroid intermediates.

PubMed

Liu, Min; Zhu, Zhan-Tao; Tao, Xin-Yi; Wang, Feng-Qing; Wei, Dong-Zhi

2016-04-25

Understanding the metabolic mechanism of sterols to produce valuable steroid intermediates in mycobacterium by a noncoding small RNA (sRNA) view is still limited. In the work, RNA-seq was implemented to investigate the noncoding transcriptome of Mycobacterium neoaurum (Mn) in the transformation process of sterols to valuable steroid intermediates, including 9α-hydroxy-4-androstene-3,17-dione (9OHAD), 1,4-androstadiene-3,17-dione (ADD), and 22-hydroxy-23, 24-bisnorchola-1,4-dien-3-one (1,4-BNA). A total of 263 sRNA candidates were predicted from the intergenic regions in Mn. Differential expression of sRNA candidates was explored in the wide type Mn with vs without sterol addition, and the steroid intermediate producing Mn strains vs wide type Mn with sterol addition, respectively. Generally, sRNA candidates were differentially expressed in various strains, but there were still some shared candidates with outstandingly upregulated or downregulated expression in these steroid producing strains. Accordingly, four regulatory networks were constructed to reveal the direct and/or indirect interactions between sRNA candidates and their target genes in four groups, including wide type Mn with vs without sterol addition, 9OHAD, ADD, and BNA producing strains vs wide type Mn with sterol addition, respectively. Based on these constructed networks, several highly focused sRNA candidates were discovered to be prevalent in the networks, which showed comprehensive regulatory roles in various cellular processes, including lipid transport and metabolism, amino acid transport and metabolism, signal transduction, cell envelope biosynthesis and ATP synthesis. To explore the functional role of sRNA candidates in Mn cells, we manipulated the overexpression of candidates 131 and 138 in strain Mn-9OHAD, which led to enhanced production of 9OHAD from 1.5- to 2.3-fold during 6 d' fermentation and a slight effect on growth rate. This study revealed the complex and important regulatory roles of noncoding small RNAs in the metabolism of sterols to produce steroid intermediates in Mn, further analysis of which will promote the better understanding about the molecular metabolism of these sRNA candidates and open a broad range of opportunities in the field.
Identification of 15 candidate structured noncoding RNA motifs in fungi by comparative genomics.

PubMed

Li, Sanshu; Breaker, Ronald R

2017-10-13

With the development of rapid and inexpensive DNA sequencing, the genome sequences of more than 100 fungal species have been made available. This dataset provides an excellent resource for comparative genomics analyses, which can be used to discover genetic elements, including noncoding RNAs (ncRNAs). Bioinformatics tools similar to those used to uncover novel ncRNAs in bacteria, likewise, should be useful for searching fungal genomic sequences, and the relative ease of genetic experiments with some model fungal species could facilitate experimental validation studies. We have adapted a bioinformatics pipeline for discovering bacterial ncRNAs to systematically analyze many fungal genomes. This comparative genomics pipeline integrates information on conserved RNA sequence and structural features with alternative splicing information to reveal fungal RNA motifs that are candidate regulatory domains, or that might have other possible functions. A total of 15 prominent classes of structured ncRNA candidates were identified, including variant HDV self-cleaving ribozyme representatives, atypical snoRNA candidates, and possible structured antisense RNA motifs. Candidate regulatory motifs were also found associated with genes for ribosomal proteins, S-adenosylmethionine decarboxylase (SDC), amidase, and HexA protein involved in Woronin body formation. We experimentally confirm that the variant HDV ribozymes undergo rapid self-cleavage, and we demonstrate that the SDC RNA motif reduces the expression of SAM decarboxylase by translational repression. Furthermore, we provide evidence that several other motifs discovered in this study are likely to be functional ncRNA elements. Systematic screening of fungal genomes using a computational discovery pipeline has revealed the existence of a variety of novel structured ncRNAs. Genome contexts and similarities to known ncRNA motifs provide strong evidence for the biological and biochemical functions of some newly found ncRNA motifs. Although initial examinations of several motifs provide evidence for their likely functions, other motifs will require more in-depth analysis to reveal their functions.
Comprehensive analysis of alternative splicing and functionality in neuronal differentiation of P19 cells.

PubMed

Suzuki, Hitoshi; Osaki, Ken; Sano, Kaori; Alam, A H M Khurshid; Nakamura, Yuichiro; Ishigaki, Yasuhito; Kawahara, Kozo; Tsukahara, Toshifumi

2011-02-18

Alternative splicing, which produces multiple mRNAs from a single gene, occurs in most human genes and contributes to protein diversity. Many alternative isoforms are expressed in a spatio-temporal manner, and function in diverse processes, including in the neural system. The purpose of the present study was to comprehensively investigate neural-splicing using P19 cells. GeneChip Exon Array analysis was performed using total RNAs purified from cells during neuronal cell differentiation. To efficiently and readily extract the alternative exon candidates, 9 filtering conditions were prepared, yielding 262 candidate exons (236 genes). Semiquantitative RT-PCR results in 30 randomly selected candidates suggested that 87% of the candidates were differentially alternatively spliced in neuronal cells compared to undifferentiated cells. Gene ontology and pathway analyses suggested that many of the candidate genes were associated with neural events. Together with 66 genes whose functions in neural cells or organs were reported previously, 47 candidate genes were found to be linked to 189 events in the gene-level profile of neural differentiation. By text-mining for the alternative isoform, distinct functions of the isoforms of 9 candidate genes indicated by the result of Exon Array were confirmed. Alternative exons were successfully extracted. Results from the informatics analyses suggested that neural events were primarily governed by genes whose expression was increased and whose transcripts were differentially alternatively spliced in the neuronal cells. In addition to known functions in neural cells or organs, the uninvestigated alternative splicing events of 11 genes among 47 candidate genes suggested that cell cycle events are also potentially important. These genes may help researchers to differentiate the roles of alternative splicing in cell differentiation and cell proliferation.
Convergence of GWA and candidate gene studies for alcoholism

PubMed Central

Olfson, Emily; Bierut, Laura Jean

2012-01-01

Background Genome-wide association (GWA) studies have led to a paradigm shift in how researchers study the genetics underlying disease. Many GWA studies are now publicly available and can be used to examine whether or not previously proposed candidate genes are supported by GWA data. This approach is particularly important for the field of alcoholism because the contribution of many candidate genes remains controversial. Methods Using the Human Genome Epidemiology (HuGE) Navigator, we selected candidate genes for alcoholism that have been frequently examined in scientific articles in the past decade. Specific candidate loci as well as all the reported SNPs in candidate genes were examined in the Study of Alcohol Addiction: Genetics and Addiction (SAGE), a GWA study comparing alcohol dependent and non-dependent subjects. Results Several commonly reported candidate loci, including rs1800497 in DRD2, rs698 in ADH1C, rs1799971 in OPRM1 and rs4680 in COMT, are not replicated in SAGE (p> .05). Among candidate loci available for analysis, only rs279858 in GABRA2 (p=0.0052, OR=1.16) demonstrated a modest association. Examination of all SNPs reported in SAGE in over 50 candidate genes revealed no SNPs with large frequency differences between cases and controls and the lowest p value of any SNP was .0006. Discussion We provide evidence that several extensively studied candidate loci do not have a strong contribution to risk of developing alcohol dependence in European and African Ancestry populations. Due to lack of coverage, we were unable to rule out the contribution of other variants and these genes and particular loci warrant further investigation. Our analysis demonstrates that publicly available GWA results can be used to better understand which if any of previously proposed candidate genes contribute to disease. Furthermore, we illustrate how examining the convergence of candidate gene and GWA studies can help elucidate the genetic architecture of alcoholism and more generally complex diseases. PMID:22978509

Mining and harnessing natural variation - a little MAGIC

USDA-ARS?s Scientific Manuscript database

As has been frequently noted, exotic germplasm ( lines unadapted to local conditions) can be sources of very beneficial genes. The trouble is that it's often difficult to identify these genes. We propose an approach in which mutations can be used to uncover useful variants of natural genes....
Uncovering Student Values for Hiring in the Software Industry

ERIC Educational Resources Information Center

Chinn, Donald; Vandegrift, Tammy

2008-01-01

This article provides an analysis of student responses to an exercise used in a computer ethics and a software engineering course to raise awareness of issues related to hiring, including issues of professional responsibility and diversity. Students from two different universities were asked to evaluate four candidates for two positions in a…
Uncovering Preservice Teachers' Beliefs about Diversity through Reflective Writing

ERIC Educational Resources Information Center

Kyles, Carli R.; Olafson, Lori

2008-01-01

This article reports findings from a mixed-method investigation of a cohort of teacher candidates who were placed in an urban and culturally diverse practicum site at an elementary school. Fifteen preservice teachers completed pre- and posttest measures related to hope, motivation for teaching, and efficacy for teaching. Throughout the semester,…
Root Cell-Specific Regulators of Phosphate-Dependent Growth1[OPEN

PubMed Central

Ding, Wona

2017-01-01

Cellular specialization in abiotic stress responses is an important regulatory feature driving plant acclimation. Our in silico approach of iterative coexpression, interaction, and enrichment analyses predicted root cell-specific regulators of phosphate starvation response networks in Arabidopsis (Arabidopsis thaliana). This included three uncharacterized genes termed Phosphate starvation-induced gene interacting Root Cell Enriched (PRCE1, PRCE2, and PRCE3). Root cell-specific enrichment of 12 candidates was confirmed in promoter-GFP lines. T-DNA insertion lines of 11 genes showed changes in phosphate status and growth responses to phosphate availability compared with the wild type. Some mutants (cbl1, cipk2, prce3, and wdd1) displayed strong biomass gain irrespective of phosphate supply, while others (cipk14, mfs1, prce1, prce2, and s6k2) were able to sustain growth under low phosphate supply better than the wild type. Notably, root or shoot phosphate accumulation did not strictly correlate with organ growth. Mutant response patterns markedly differed from those of master regulators of phosphate homeostasis, PHOSPHATE STARVATION RESPONSE1 (PHR1) and PHOSPHATE2 (PHO2), demonstrating that negative growth responses in the latter can be overcome when cell-specific regulators are targeted. RNA sequencing analysis highlighted the transcriptomic plasticity in these mutants and revealed PHR1-dependent and -independent regulatory circuits with gene coexpression profiles that were highly correlated to the quantified physiological traits. The results demonstrate how in silico prediction of cell-specific, stress-responsive genes uncovers key regulators and how their manipulation can have positive impacts on plant growth under abiotic stress. PMID:28465462
RYR2, PTDSS1 and AREG genes are implicated in a Lebanese population-based study of copy number variation in autism

PubMed Central

Soueid, Jihane; Kourtian, Silva; Makhoul, Nadine J.; Makoukji, Joelle; Haddad, Sariah; Ghanem, Simona S.; Kobeissy, Firas; Boustany, Rose-Mary

2016-01-01

Autism Spectrum Disorders (ASDs) are a group of neurodevelopmental disorders characterized by ritualistic-repetitive behaviors and impaired verbal and non-verbal communication. Objectives were to determine the contribution of genetic variation to ASDs in the Lebanese. Affymetrix Cytogenetics Whole-Genome 2.7 M and CytoScan™ HD Arrays were used to detect CNVs in 41 Lebanese autistic children and 35 non-autistic, developmentally delayed and intellectually disabled patients. 33 normal participants were used as controls. 16 de novo CNVs and 57 inherited CNVs, including recognized pathogenic 16p11.2 duplications and 2p16.3 deletions were identified. A duplication at 1q43 classified as likely pathogenic encompasses RYR2 as a potential ASD candidate gene. This previously identified CNV has been classified as both pathogenic, and, of uncertain significance. A duplication of unknown significance at 10q11.22, proposed as a modulator for phenotypic disease expression in Rett syndrome, was also identified. The novel potential autism susceptibility genes PTDSS1 and AREG were uncovered and warrant further genetic and functional analyses. Previously described and novel genetic targets in ASD were identified in Lebanese families with autism. These findings may lead to improved diagnosis of ASDs and informed genetic counseling, and may also lead to untapped therapeutic targets applicable to Lebanese and non-Lebanese patients. PMID:26742492
Genome-Wide Association Study Identifying Candidate Genes Influencing Important Agronomic Traits of Flax (Linum usitatissimum L.) Using SLAF-seq

PubMed Central

Xie, Dongwei; Dai, Zhigang; Yang, Zemao; Sun, Jian; Zhao, Debao; Yang, Xue; Zhang, Liguo; Tang, Qing; Su, Jianguang

2018-01-01

Flax (Linum usitatissimum L.) is an important cash crop, and its agronomic traits directly affect yield and quality. Molecular studies on flax remain inadequate because relatively few flax genes have been associated with agronomic traits or have been identified as having potential applications. To identify markers and candidate genes that can potentially be used for genetic improvement of crucial agronomic traits, we examined 224 specimens of core flax germplasm; specifically, phenotypic data for key traits, including plant height, technical length, number of branches, number of fruits, and 1000-grain weight were investigated under three environmental conditions before specific-locus amplified fragment sequencing (SLAF-seq) was employed to perform a genome-wide association study (GWAS) for these five agronomic traits. Subsequently, the results were used to screen single nucleotide polymorphism (SNP) loci and candidate genes that exhibited a significant correlation with the important agronomic traits. Our analyses identified a total of 42 SNP loci that showed significant correlations with the five important agronomic flax traits. Next, candidate genes were screened in the 10 kb zone of each of the 42 SNP loci. These SNP loci were then analyzed by a more stringent screening via co-identification using both a general linear model (GLM) and a mixed linear model (MLM) as well as co-occurrences in at least two of the three environments, whereby 15 final candidate genes were obtained. Based on these results, we determined that UGT and PL are candidate genes for plant height, GRAS and XTH are candidate genes for the number of branches, Contig1437 and LU0019C12 are candidate genes for the number of fruits, and PHO1 is a candidate gene for the 1000-seed weight. We propose that the identified SNP loci and corresponding candidate genes might serve as a biological basis for improving crucial agronomic flax traits. PMID:29375606
Genome-Wide Association Study Identifying Candidate Genes Influencing Important Agronomic Traits of Flax (Linum usitatissimum L.) Using SLAF-seq.

PubMed

Xie, Dongwei; Dai, Zhigang; Yang, Zemao; Sun, Jian; Zhao, Debao; Yang, Xue; Zhang, Liguo; Tang, Qing; Su, Jianguang

2017-01-01

Flax ( Linum usitatissimum L.) is an important cash crop, and its agronomic traits directly affect yield and quality. Molecular studies on flax remain inadequate because relatively few flax genes have been associated with agronomic traits or have been identified as having potential applications. To identify markers and candidate genes that can potentially be used for genetic improvement of crucial agronomic traits, we examined 224 specimens of core flax germplasm; specifically, phenotypic data for key traits, including plant height, technical length, number of branches, number of fruits, and 1000-grain weight were investigated under three environmental conditions before specific-locus amplified fragment sequencing (SLAF-seq) was employed to perform a genome-wide association study (GWAS) for these five agronomic traits. Subsequently, the results were used to screen single nucleotide polymorphism (SNP) loci and candidate genes that exhibited a significant correlation with the important agronomic traits. Our analyses identified a total of 42 SNP loci that showed significant correlations with the five important agronomic flax traits. Next, candidate genes were screened in the 10 kb zone of each of the 42 SNP loci. These SNP loci were then analyzed by a more stringent screening via co-identification using both a general linear model (GLM) and a mixed linear model (MLM) as well as co-occurrences in at least two of the three environments, whereby 15 final candidate genes were obtained. Based on these results, we determined that UGT and PL are candidate genes for plant height, GRAS and XTH are candidate genes for the number of branches, Contig1437 and LU0019C12 are candidate genes for the number of fruits, and PHO1 is a candidate gene for the 1000-seed weight. We propose that the identified SNP loci and corresponding candidate genes might serve as a biological basis for improving crucial agronomic flax traits.
A direct molecular link between the autism candidate gene RORa and the schizophrenia candidate MIR137

NASA Astrophysics Data System (ADS)

Devanna, Paolo; Vernes, Sonja C.

2014-02-01

Retinoic acid-related orphan receptor alpha gene (RORa) and the microRNA MIR137 have both recently been identified as novel candidate genes for neuropsychiatric disorders. RORa encodes a ligand-dependent orphan nuclear receptor that acts as a transcriptional regulator and miR-137 is a brain enriched small non-coding RNA that interacts with gene transcripts to control protein levels. Given the mounting evidence for RORa in autism spectrum disorders (ASD) and MIR137 in schizophrenia and ASD, we investigated if there was a functional biological relationship between these two genes. Herein, we demonstrate that miR-137 targets the 3'UTR of RORa in a site specific manner. We also provide further support for MIR137 as an autism candidate by showing that a large number of previously implicated autism genes are also putatively targeted by miR-137. This work supports the role of MIR137 as an ASD candidate and demonstrates a direct biological link between these previously unrelated autism candidate genes.
Candidate genes and molecular markers associated with heat tolerance in colonial Bentgrass.

PubMed

Jespersen, David; Belanger, Faith C; Huang, Bingru

2017-01-01

Elevated temperature is a major abiotic stress limiting the growth of cool-season grasses during the summer months. The objectives of this study were to determine the genetic variation in the expression patterns of selected genes involved in several major metabolic pathways regulating heat tolerance for two genotypes contrasting in heat tolerance to confirm their status as potential candidate genes, and to identify PCR-based markers associated with candidate genes related to heat tolerance in a colonial (Agrostis capillaris L.) x creeping bentgrass (Agrostis stolonifera L.) hybrid backcross population. Plants were subjected to heat stress in controlled-environmental growth chambers for phenotypic evaluation and determination of genetic variation in candidate gene expression. Molecular markers were developed for genes involved in protein degradation (cysteine protease), antioxidant defense (catalase and glutathione-S-transferase), energy metabolism (glyceraldehyde-3-phosphate dehydrogenase), cell expansion (expansin), and stress protection (heat shock proteins HSP26, HSP70, and HSP101). Kruskal-Wallis analysis, a commonly used non-parametric test used to compare population individuals with or without the gene marker, found the physiological traits of chlorophyll content, electrolyte leakage, normalized difference vegetative index, and turf quality were associated with all candidate gene markers with the exception of HSP101. Differential gene expression was frequently found for the tested candidate genes. The development of candidate gene markers for important heat tolerance genes may allow for the development of new cultivars with increased abiotic stress tolerance using marker-assisted selection.
Candidate genes and molecular markers associated with heat tolerance in colonial Bentgrass

PubMed Central

Jespersen, David; Belanger, Faith C.; Huang, Bingru

2017-01-01

Elevated temperature is a major abiotic stress limiting the growth of cool-season grasses during the summer months. The objectives of this study were to determine the genetic variation in the expression patterns of selected genes involved in several major metabolic pathways regulating heat tolerance for two genotypes contrasting in heat tolerance to confirm their status as potential candidate genes, and to identify PCR-based markers associated with candidate genes related to heat tolerance in a colonial (Agrostis capillaris L.) x creeping bentgrass (Agrostis stolonifera L.) hybrid backcross population. Plants were subjected to heat stress in controlled-environmental growth chambers for phenotypic evaluation and determination of genetic variation in candidate gene expression. Molecular markers were developed for genes involved in protein degradation (cysteine protease), antioxidant defense (catalase and glutathione-S-transferase), energy metabolism (glyceraldehyde-3-phosphate dehydrogenase), cell expansion (expansin), and stress protection (heat shock proteins HSP26, HSP70, and HSP101). Kruskal-Wallis analysis, a commonly used non-parametric test used to compare population individuals with or without the gene marker, found the physiological traits of chlorophyll content, electrolyte leakage, normalized difference vegetative index, and turf quality were associated with all candidate gene markers with the exception of HSP101. Differential gene expression was frequently found for the tested candidate genes. The development of candidate gene markers for important heat tolerance genes may allow for the development of new cultivars with increased abiotic stress tolerance using marker-assisted selection. PMID:28187136
Impact of environmental inputs on reverse-engineering approach to network structures.

PubMed

Wu, Jianhua; Sinfield, James L; Buchanan-Wollaston, Vicky; Feng, Jianfeng

2009-12-04

Uncovering complex network structures from a biological system is one of the main topic in system biology. The network structures can be inferred by the dynamical Bayesian network or Granger causality, but neither techniques have seriously taken into account the impact of environmental inputs. With considerations of natural rhythmic dynamics of biological data, we propose a system biology approach to reveal the impact of environmental inputs on network structures. We first represent the environmental inputs by a harmonic oscillator and combine them with Granger causality to identify environmental inputs and then uncover the causal network structures. We also generalize it to multiple harmonic oscillators to represent various exogenous influences. This system approach is extensively tested with toy models and successfully applied to a real biological network of microarray data of the flowering genes of the model plant Arabidopsis Thaliana. The aim is to identify those genes that are directly affected by the presence of the sunlight and uncover the interactive network structures associating with flowering metabolism. We demonstrate that environmental inputs are crucial for correctly inferring network structures. Harmonic causal method is proved to be a powerful technique to detect environment inputs and uncover network structures, especially when the biological data exhibit periodic oscillations.
Genetic loci associated with coronary artery disease harbor evidence of selection and antagonistic pleiotropy

PubMed Central

Byars, Sean G.; Gray, Lesley-Ann; Ripatti, Samuli; Stearns, Stephen C.; Inouye, Michael

2017-01-01

Traditional genome-wide scans for positive selection have mainly uncovered selective sweeps associated with monogenic traits. While selection on quantitative traits is much more common, very few signals have been detected because of their polygenic nature. We searched for positive selection signals underlying coronary artery disease (CAD) in worldwide populations, using novel approaches to quantify relationships between polygenic selection signals and CAD genetic risk. We identified new candidate adaptive loci that appear to have been directly modified by disease pressures given their significant associations with CAD genetic risk. These candidates were all uniquely and consistently associated with many different male and female reproductive traits suggesting selection may have also targeted these because of their direct effects on fitness. We found that CAD loci are significantly enriched for lifetime reproductive success relative to the rest of the human genome, with evidence that the relationship between CAD and lifetime reproductive success is antagonistic. This supports the presence of antagonistic-pleiotropic tradeoffs on CAD loci and provides a novel explanation for the maintenance and high prevalence of CAD in modern humans. Lastly, we found that positive selection more often targeted CAD gene regulatory variants using HapMap3 lymphoblastoid cell lines, which further highlights the unique biological significance of candidate adaptive loci underlying CAD. Our study provides a novel approach for detecting selection on polygenic traits and evidence that modern human genomes have evolved in response to CAD-induced selection pressures and other early-life traits sharing pleiotropic links with CAD. PMID:28640878
Uncovering trends in gene naming

PubMed Central

Seringhaus, Michael R; Cayting, Philip D; Gerstein, Mark B

2008-01-01

We take stock of current genetic nomenclature and attempt to organize strange and notable gene names. We categorize, for instance, those that involve a naming system transferred from another context (for example, Pavlov’s dogs). We hope this analysis provides clues to better steer gene naming in the future. PMID:18254929
A small number of candidate gene SNPs reveal continental ancestry in African Americans

PubMed Central

KODAMAN, NURI; ALDRICH, MELINDA C.; SMITH, JEFFREY R.; SIGNORELLO, LISA B.; BRADLEY, KEVIN; BREYER, JOAN; COHEN, SARAH S.; LONG, JIRONG; CAI, QIUYIN; GILES, JUSTIN; BUSH, WILLIAM S.; BLOT, WILLIAM J.; MATTHEWS, CHARLES E.; WILLIAMS, SCOTT M.

2013-01-01

SUMMARY Using genetic data from an obesity candidate gene study of self-reported African Americans and European Americans, we investigated the number of Ancestry Informative Markers (AIMs) and candidate gene SNPs necessary to infer continental ancestry. Proportions of African and European ancestry were assessed with STRUCTURE (K=2), using 276 AIMs. These reference values were compared to estimates derived using 120, 60, 30, and 15 SNP subsets randomly chosen from the 276 AIMs and from 1144 SNPs in 44 candidate genes. All subsets generated estimates of ancestry consistent with the reference estimates, with mean correlations greater than 0.99 for all subsets of AIMs, and mean correlations of 0.99±0.003; 0.98± 0.01; 0.93±0.03; and 0.81± 0.11 for subsets of 120, 60, 30, and 15 candidate gene SNPs, respectively. Among African Americans, the median absolute difference from reference African ancestry values ranged from 0.01 to 0.03 for the four AIMs subsets and from 0.03 to 0.09 for the four candidate gene SNP subsets. Furthermore, YRI/CEU Fst values provided a metric to predict the performance of candidate gene SNPs. Our results demonstrate that a small number of SNPs randomly selected from candidate genes can be used to estimate admixture proportions in African Americans reliably. PMID:23278390
Analysis of the transcriptome of Panax notoginseng root uncovers putative triterpene saponin-biosynthetic genes and genetic markers

PubMed Central

2011-01-01

Background Panax notoginseng (Burk) F.H. Chen is important medicinal plant of the Araliacease family. Triterpene saponins are the bioactive constituents in P. notoginseng. However, available genomic information regarding this plant is limited. Moreover, details of triterpene saponin biosynthesis in the Panax species are largely unknown. Results Using the 454 pyrosequencing technology, a one-quarter GS FLX titanium run resulted in 188,185 reads with an average length of 410 bases for P. notoginseng root. These reads were processed and assembled by 454 GS De Novo Assembler software into 30,852 unique sequences. A total of 70.2% of unique sequences were annotated by Basic Local Alignment Search Tool (BLAST) similarity searches against public sequence databases. The Kyoto Encyclopedia of Genes and Genomes (KEGG) assignment discovered 41 unique sequences representing 11 genes involved in triterpene saponin backbone biosynthesis in the 454-EST dataset. In particular, the transcript encoding dammarenediol synthase (DS), which is the first committed enzyme in the biosynthetic pathway of major triterpene saponins, is highly expressed in the root of four-year-old P. notoginseng. It is worth emphasizing that the candidate cytochrome P450 (Pn02132 and Pn00158) and UDP-glycosyltransferase (Pn00082) gene most likely to be involved in hydroxylation or glycosylation of aglycones for triterpene saponin biosynthesis were discovered from 174 cytochrome P450s and 242 glycosyltransferases by phylogenetic analysis, respectively. Putative transcription factors were detected in 906 unique sequences, including Myb, homeobox, WRKY, basic helix-loop-helix (bHLH), and other family proteins. Additionally, a total of 2,772 simple sequence repeat (SSR) were identified from 2,361 unique sequences, of which, di-nucleotide motifs were the most abundant motif. Conclusion This study is the first to present a large-scale EST dataset for P. notoginseng root acquired by next-generation sequencing (NGS) technology. The candidate genes involved in triterpene saponin biosynthesis, including the putative CYP450s and UGTs, were obtained in this study. Additionally, the identification of SSRs provided plenty of genetic makers for molecular breeding and genetics applications in this species. These data will provide information on gene discovery, transcriptional regulation and marker-assisted selection for P. notoginseng. The dataset establishes an important foundation for the study with the purpose of ensuring adequate drug resources for this species. PMID:22369100
EnRICH: Extraction and Ranking using Integration and Criteria Heuristics.

PubMed

Zhang, Xia; Greenlee, M Heather West; Serb, Jeanne M

2013-01-15

High throughput screening technologies enable biologists to generate candidate genes at a rate that, due to time and cost constraints, cannot be studied by experimental approaches in the laboratory. Thus, it has become increasingly important to prioritize candidate genes for experiments. To accomplish this, researchers need to apply selection requirements based on their knowledge, which necessitates qualitative integration of heterogeneous data sources and filtration using multiple criteria. A similar approach can also be applied to putative candidate gene relationships. While automation can assist in this routine and imperative procedure, flexibility of data sources and criteria must not be sacrificed. A tool that can optimize the trade-off between automation and flexibility to simultaneously filter and qualitatively integrate data is needed to prioritize candidate genes and generate composite networks from heterogeneous data sources. We developed the java application, EnRICH (Extraction and Ranking using Integration and Criteria Heuristics), in order to alleviate this need. Here we present a case study in which we used EnRICH to integrate and filter multiple candidate gene lists in order to identify potential retinal disease genes. As a result of this procedure, a candidate pool of several hundred genes was narrowed down to five candidate genes, of which four are confirmed retinal disease genes and one is associated with a retinal disease state. We developed a platform-independent tool that is able to qualitatively integrate multiple heterogeneous datasets and use different selection criteria to filter each of them, provided the datasets are tables that have distinct identifiers (required) and attributes (optional). With the flexibility to specify data sources and filtering criteria, EnRICH automatically prioritizes candidate genes or gene relationships for biologists based on their specific requirements. Here, we also demonstrate that this tool can be effectively and easily used to apply highly specific user-defined criteria and can efficiently identify high quality candidate genes from relatively sparse datasets.
Degrees of separation as a statistical tool for evaluating candidate genes.

PubMed

Nelson, Ronald M; Pettersson, Mats E

2014-12-01

Selection of candidate genes is an important step in the exploration of complex genetic architecture. The number of gene networks available is increasing and these can provide information to help with candidate gene selection. It is currently common to use the degree of connectedness in gene networks as validation in Genome Wide Association (GWA) and Quantitative Trait Locus (QTL) mapping studies. However, it can cause misleading results if not validated properly. Here we present a method and tool for validating the gene pairs from GWA studies given the context of the network they co-occur in. It ensures that proposed interactions and gene associations are not statistical artefacts inherent to the specific gene network architecture. The CandidateBacon package provides an easy and efficient method to calculate the average degree of separation (DoS) between pairs of genes to currently available gene networks. We show how these empirical estimates of average connectedness are used to validate candidate gene pairs. Validation of interacting genes by comparing their connectedness with the average connectedness in the gene network will provide support for said interactions by utilising the growing amount of gene network information available. Copyright © 2014 Elsevier Ltd. All rights reserved.
COE loss-of-function analysis reveals a genetic program underlying maintenance and regeneration of the nervous system in planarians.

PubMed

Cowles, Martis W; Omuro, Kerilyn C; Stanley, Brianna N; Quintanilla, Carlo G; Zayas, Ricardo M

2014-10-01

Members of the COE family of transcription factors are required for central nervous system (CNS) development. However, the function of COE in the post-embryonic CNS remains largely unknown. An excellent model for investigating gene function in the adult CNS is the freshwater planarian. This animal is capable of regenerating neurons from an adult pluripotent stem cell population and regaining normal function. We previously showed that planarian coe is expressed in differentiating and mature neurons and that its function is required for proper CNS regeneration. Here, we show that coe is essential to maintain nervous system architecture and patterning in intact (uninjured) planarians. We took advantage of the robust phenotype in intact animals to investigate the genetic programs coe regulates in the CNS. We compared the transcriptional profiles of control and coe RNAi planarians using RNA sequencing and identified approximately 900 differentially expressed genes in coe knockdown animals, including 397 downregulated genes that were enriched for nervous system functional annotations. Next, we validated a subset of the downregulated transcripts by analyzing their expression in coe-deficient planarians and testing if the mRNAs could be detected in coe+ cells. These experiments revealed novel candidate targets of coe in the CNS such as ion channel, neuropeptide, and neurotransmitter genes. Finally, to determine if loss of any of the validated transcripts underscores the coe knockdown phenotype, we knocked down their expression by RNAi and uncovered a set of coe-regulated genes implicated in CNS regeneration and patterning, including orthologs of sodium channel alpha-subunit and pou4. Our study broadens the knowledge of gene expression programs regulated by COE that are required for maintenance of neural subtypes and nervous system architecture in adult animals.
The Transcriptome of Compatible and Incompatible Interactions of Potato (Solanum tuberosum) with Phytophthora infestans Revealed by DeepSAGE Analysis

PubMed Central

Gyetvai, Gabor; Sønderkær, Mads; Göbel, Ulrike; Basekow, Rico; Ballvora, Agim; Imhoff, Maren; Kersten, Birgit; Nielsen, Kåre-Lehman; Gebhardt, Christiane

2012-01-01

Late blight, caused by the oomycete Phytophthora infestans, is the most important disease of potato (Solanum tuberosum). Understanding the molecular basis of resistance and susceptibility to late blight is therefore highly relevant for developing resistant cultivars, either by marker-assissted selection or by transgenic approaches. Specific P. infestans races having the Avr1 effector gene trigger a hypersensitive resistance response in potato plants carrying the R1 resistance gene (incompatible interaction) and cause disease in plants lacking R1 (compatible interaction). The transcriptomes of the compatible and incompatible interaction were captured by DeepSAGE analysis of 44 biological samples comprising five genotypes, differing only by the presence or absence of the R1 transgene, three infection time points and three biological replicates. 30.859 unique 21 base pair sequence tags were obtained, one third of which did not match any known potato transcript sequence. Two third of the tags were expressed at low frequency (<10 tag counts/million). 20.470 unitags matched to approximately twelve thousand potato transcribed genes. Tag frequencies were compared between compatible and incompatible interactions over the infection time course and between compatible and incompatible genotypes. Transcriptional changes were more numerous in compatible than in incompatible interactions. In contrast to incompatible interactions, transcriptional changes in the compatible interaction were observed predominantly for multigene families encoding defense response genes and genes functional in photosynthesis and CO2 fixation. Numerous transcriptional differences were also observed between near isogenic genotypes prior to infection with P. infestans. Our DeepSAGE transcriptome analysis uncovered novel candidate genes for plant host pathogen interactions, examples of which are discussed with respect to possible function. PMID:22328937
Interactome analysis reveals ZNF804A, a schizophrenia risk gene, as a novel component of protein translational machinery critical for embryonic neurodevelopment

PubMed Central

Zhou, Y; Dong, F; Lanz, T A; Reinhart, V; Li, M; Liu, L; Zou, J; Xi, H S; Mao, Y

2018-01-01

Recent genome-wide association studies identified over 100 genetic loci that significantly associate with schizophrenia (SZ). A top candidate gene, ZNF804A, was robustly replicated in different populations. However, its neural functions are largely unknown. Here we show in mouse that ZFP804A, the homolog of ZNF804A, is required for normal progenitor proliferation and neuronal migration. Using a yeast two-hybrid genome-wide screen, we identified novel interacting proteins of ZNF804A. Rather than transcriptional factors, genes involved in mRNA translation are highly represented in our interactome result. ZNF804A co-fractionates with translational machinery and modulates the translational efficiency as well as the mTOR pathway. The ribosomal protein RPSA interacts with ZNF804A and rescues the migration and translational defects caused by ZNF804A knockdown. RNA immunoprecipitation–RNAseq (RIP-Seq) identified transcripts bound to ZFP804A. Consistently, ZFP804A associates with many short transcripts involved in translational and mitochondrial regulation. Moreover, among the transcripts associated with ZFP804A, a SZ risk gene, neurogranin (NRGN), is one of ZFP804A targets. Interestingly, downregulation of ZFP804A decreases NRGN expression and overexpression of NRGN can ameliorate ZFP804A-mediated migration defect. To verify the downstream targets of ZNF804A, a Duolink in situ interaction assay confirmed genes from our RIP-Seq data as the ZNF804A targets. Thus, our work uncovered a novel mechanistic link of a SZ risk gene to neurodevelopment and translational control. The interactome-driven approach here is an effective way for translating genome-wide association findings into novel biological insights of human diseases. PMID:28924186

Search for sarcoidosis candidate genes by integration of data from genomic, transcriptomic and proteomic studies.

PubMed

Maver, Ales; Medica, Igor; Peterlin, Borut

2009-12-01

The search for gene candidates in multifactorial diseases such as sarcoidosis can be based on the integration of linkage association data, gene expression data, and protein profile data from genomic, transcriptomic and proteomic studies, respectively. In this study we performed a literature-based search for studies reporting such data, followed by integration of collected information. Different databases were examined--Medline, HugGE Navigator, ArrayExpress and Gene Expression Omnibus (GEO). Candidate genes were defined as genes which were reported in at least 2 different types of omics studies. Genes previously investigated in sarcoidosis were excluded from further analyses. We identified 177 genes associated with sarcoidosis as potential new candidate genes. Subsequently, 9 gene candidates identified to overlap in 2 different types of studies (genomic, transcriptomic and/or proteomic) were consistently reported in at least 3 studies: SERPINB1, FABP4, S100A8, HBEGF, IL7R, LRIG1, PTPN23, DPM2 and NUP214. These genes are involved in regulation of immune response, cellular proliferation, apoptosis, inhibition of protease activity, lipid metabolism. Exact biological functions of HBEGF, LRIG1, PTPN23, DPM2 and NUP214 remain to be completely elucidated. We propose 9 candidate genes: SERPINB1, FABP4, S100A8, HBEGF, IL7R, LRIG1, PTPN23, DPM2 and NUP214, as genes with high potential for association with sarcoidosis.
Epigenetic profiling of growth plate chondrocytes sheds insight into regulatory genetic variation influencing height.

PubMed

Guo, Michael; Liu, Zun; Willen, Jessie; Shaw, Cameron P; Richard, Daniel; Jagoda, Evelyn; Doxey, Andrew C; Hirschhorn, Joel; Capellini, Terence D

2017-12-05

GWAS have identified hundreds of height-associated loci. However, determining causal mechanisms is challenging, especially since height-relevant tissues (e.g. growth plates) are difficult to study. To uncover mechanisms by which height GWAS variants function, we performed epigenetic profiling of murine femoral growth plates. The profiled open chromatin regions recapitulate known chondrocyte and skeletal biology, are enriched at height GWAS loci, particularly near differentially expressed growth plate genes, and enriched for binding motifs of transcription factors with roles in chondrocyte biology. At specific loci, our analyses identified compelling mechanisms for GWAS variants. For example, at CHSY1 , we identified a candidate causal variant (rs9920291) overlapping an open chromatin region. Reporter assays demonstrated that rs9920291 shows allelic regulatory activity, and CRISPR/Cas9 targeting of human chondrocytes demonstrates that the region regulates CHSY1 expression. Thus, integrating biologically relevant epigenetic information (here, from growth plates) with genetic association results can identify biological mechanisms important for human growth.
On the path to genetic novelties: insights from programmed DNA elimination and RNA splicing.

PubMed

Catania, Francesco; Schmitz, Jürgen

2015-01-01

Understanding how genetic novelties arise is a central goal of evolutionary biology. To this end, programmed DNA elimination and RNA splicing deserve special consideration. While programmed DNA elimination reshapes genomes by eliminating chromatin during organismal development, RNA splicing rearranges genetic messages by removing intronic regions during transcription. Small RNAs help to mediate this class of sequence reorganization, which is not error-free. It is this imperfection that makes programmed DNA elimination and RNA splicing excellent candidates for generating evolutionary novelties. Leveraging a number of these two processes' mechanistic and evolutionary properties, which have been uncovered over the past years, we present recently proposed models and empirical evidence for how splicing can shape the structure of protein-coding genes in eukaryotes. We also chronicle a number of intriguing similarities between the processes of programmed DNA elimination and RNA splicing, and highlight the role that the variation in the population-genetic environment may play in shaping their target sequences. © 2015 Wiley Periodicals, Inc.
Genome-Wide RNAi Ionomics Screen Reveals New Genes and Regulation of Human Trace Element Metabolism

PubMed Central

Malinouski, Mikalai; Hasan, Nesrin M.; Zhang, Yan; Seravalli, Javier; Lin, Jie; Avanesov, Andrei; Lutsenko, Svetlana; Gladyshev, Vadim N.

2017-01-01

Trace elements are essential for human metabolism and dysregulation of their homeostasis is associated with numerous disorders. Here we characterize mechanisms that regulate trace elements in human cells by designing and performing a genome-wide high-throughput siRNA/ionomics screen, and examining top hits in cellular and biochemical assays. The screen reveals high stability of the ionomes, especially the zinc ionome, and yields known regulators and novel candidates. We further uncover fundamental differences in the regulation of different trace elements. Specifically, selenium levels are controlled through the selenocysteine machinery and expression of abundant selenoproteins; copper balance is affected by lipid metabolism and requires machinery involved in protein trafficking and posttranslational modifications; and the iron levels are influenced by iron import and expression of the iron/heme-containing enzymes. Our approach can be applied to a variety of disease models and/or nutritional conditions, and the generated dataset opens new directions for studies of human trace element metabolism. PMID:24522796
KLF6 and iNOS regulates apoptosis during respiratory syncytial virus infection

PubMed Central

Mgbemena, Victoria; Segovia, Jesus; Chang, Te-Hung; Bose, Santanu

2013-01-01

Human respiratory syncytial virus (RSV) is a highly pathogenic lung-tropic virus that causes severe respiratory diseases. Enzymatic activity of inducible nitric oxide (iNOS) is required for NO generation. Although NO contributes to exaggerated lung disease during RSV infection, the role of NO in apoptosis during infection is not known. In addition, host trans-activator(s) required for iNOS gene expression during RSV infection is unknown. In the current study we have uncovered the mechanism of iNOS gene induction by identifying kruppel-like factor 6 (KLF6) as a critical transcription factor required for iNOS gene expression during RSV infection. Furthermore, we have also uncovered the role of iNOS as a critical host factor regulating apoptosis during RSV infection. PMID:23831683
Tracing common origins of Genomic Islands in prokaryotes based on genome signature analyses.

PubMed

van Passel, Mark Wj

2011-09-01

Horizontal gene transfer constitutes a powerful and innovative force in evolution, but often little is known about the actual origins of transferred genes. Sequence alignments are generally of limited use in tracking the original donor, since still only a small fraction of the total genetic diversity is thought to be uncovered. Alternatively, approaches based on similarities in the genome specific relative oligonucleotide frequencies do not require alignments. Even though the exact origins of horizontally transferred genes may still not be established using these compositional analyses, it does suggest that compositionally very similar regions are likely to have had a common origin. These analyses have shown that up to a third of large acquired gene clusters that reside in the same genome are compositionally very similar, indicative of a shared origin. This brings us closer to uncovering the original donors of horizontally transferred genes, and could help in elucidating possible regulatory interactions between previously unlinked sequences.
LOD score exclusion analyses for candidate QTLs using random population samples.

PubMed

Deng, Hong-Wen

2003-11-01

While extensive analyses have been conducted to test for, no formal analyses have been conducted to test against, the importance of candidate genes as putative QTLs using random population samples. Previously, we developed an LOD score exclusion mapping approach for candidate genes for complex diseases. Here, we extend this LOD score approach for exclusion analyses of candidate genes for quantitative traits. Under this approach, specific genetic effects (as reflected by heritability) and inheritance models at candidate QTLs can be analyzed and if an LOD score is < or = -2.0, the locus can be excluded from having a heritability larger than that specified. Simulations show that this approach has high power to exclude a candidate gene from having moderate genetic effects if it is not a QTL and is robust to population admixture. Our exclusion analysis complements association analysis for candidate genes as putative QTLs in random population samples. The approach is applied to test the importance of Vitamin D receptor (VDR) gene as a potential QTL underlying the variation of bone mass, an important determinant of osteoporosis.
Association of candidate genes with drought tolerance traits in diverse perennial ryegrass accessions

Treesearch

Xiaoqing Yu; Guihua Bai; Shuwei Liu; Na Luo; Ying Wang; Douglas S. Richmond; Paula M. Pijut; Scott A. Jackson; Jianming Yu; Yiwei Jiang

2013-01-01

Drought is a major environmental stress limiting growth of perennial grasses in temperate regions. Plant drought tolerance is a complex trait that is controlled by multiple genes. Candidate gene association mapping provides a powerful tool for dissection of complex traits. Candidate gene association mapping of drought tolerance traits was conducted in 192 diverse...
Convergence of genome-wide association and candidate gene studies for alcoholism.

PubMed

Olfson, Emily; Bierut, Laura Jean

2012-12-01

Genome-wide association (GWA) studies have led to a paradigm shift in how researchers study the genetics underlying disease. Many GWA studies are now publicly available and can be used to examine whether or not previously proposed candidate genes are supported by GWA data. This approach is particularly important for the field of alcoholism because the contribution of many candidate genes remains controversial. Using the Human Genome Epidemiology (HuGE) Navigator, we selected candidate genes for alcoholism that have been frequently examined in scientific articles in the past decade. Specific candidate loci as well as all the reported single nucleotide polymorphisms (SNPs) in candidate genes were examined in the Study of Addiction: Genetics and Environment (SAGE), a GWA study comparing alcohol-dependent and nondependent subjects. Several commonly reported candidate loci, including rs1800497 in DRD2, rs698 in ADH1C, rs1799971 in OPRM1, and rs4680 in COMT, are not replicated in SAGE (p > 0.05). Among candidate loci available for analysis, only rs279858 in GABRA2 (p = 0.0052, OR = 1.16) demonstrated a modest association. Examination of all SNPs reported in SAGE in over 50 candidate genes revealed no SNPs with large frequency differences between cases and controls, and the lowest p-value of any SNP was 0.0006. We provide evidence that several extensively studied candidate loci do not have a strong contribution to risk of developing alcohol dependence in European and African ancestry populations. Owing to the lack of coverage, we were unable to rule out the contribution of other variants, and these genes and particular loci warrant further investigation. Our analysis demonstrates that publicly available GWA results can be used to better understand which if any of previously proposed candidate genes contribute to disease. Furthermore, we illustrate how examining the convergence of candidate gene and GWA studies can help elucidate the genetic architecture of alcoholism and more generally complex diseases. Copyright © 2012 by the Research Society on Alcoholism.
Understanding Plagiarism and How It Differs from Copyright Infringement

ERIC Educational Resources Information Center

Dames, K. Matthew

2007-01-01

Plagiarism has become the new piracy. Just as piracy was a few years ago, plagiarism has become the hot, new crime du jour--an act that suggests immorality and often scandal at once. What's more, plagiarism allegations feed into the society's "Candid Camera" mentality--the seemingly insatiable need to uncover wrongdoing. One of the biggest…
Defining the Human Macula Transcriptome and Candidate Retinal Disease Genes UsingEyeSAGE

PubMed Central

Rickman, Catherine Bowes; Ebright, Jessica N.; Zavodni, Zachary J.; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P.; Wistow, Graeme; Boon, Kathy; Hauser, Michael A.

2009-01-01

Purpose To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Methods Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Results Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. Conclusions The EyeSAGE database, combining three different gene-profiling platforms including the authors’ multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions. PMID:16723438
Defining the human macula transcriptome and candidate retinal disease genes using EyeSAGE.

PubMed

Bowes Rickman, Catherine; Ebright, Jessica N; Zavodni, Zachary J; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P; Wistow, Graeme; Boon, Kathy; Hauser, Michael A

2006-06-01

To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. The EyeSAGE database, combining three different gene-profiling platforms including the authors' multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions.
Cre/lox-recombinase-mediated cassette exchange for reversible site-specific genomic targeting of the disease vector, Aedes aegypti

USDA-ARS?s Scientific Manuscript database

Site-specific genome modification is an important tool for mosquito functional genomics studies that help to uncover gene functions, identify gene regulatory elements, and perform comparative gene expression studies, all of which contribute to a better understanding of mosquito biology and are thus ...
Candidate gene identification of ovulation-inducing genes by RNA sequencing with an in vivo assay in zebrafish.

PubMed

Klangnurak, Wanlada; Fukuyo, Taketo; Rezanujjaman, M D; Seki, Masahide; Sugano, Sumio; Suzuki, Yutaka; Tokumoto, Toshinobu

2018-01-01

We previously reported the microarray-based selection of three ovulation-related genes in zebrafish. We used a different selection method in this study, RNA sequencing analysis. An additional eight up-regulated candidates were found as specifically up-regulated genes in ovulation-induced samples. Changes in gene expression were confirmed by qPCR analysis. Furthermore, up-regulation prior to ovulation during natural spawning was verified in samples from natural pairing. Gene knock-out zebrafish strains of one of the candidates, the starmaker gene (stm), were established by CRISPR genome editing techniques. Unexpectedly, homozygous mutants were fertile and could spawn eggs. However, a high percentage of unfertilized eggs and abnormal embryos were produced from these homozygous females. The results suggest that the stm gene is necessary for fertilization. In this study, we selected additional ovulation-inducing candidate genes, and a novel function of the stm gene was investigated.
Modifier gene study of meconium ileus in cystic fibrosis: statistical considerations and gene mapping results

PubMed Central

Dorfman, Ruslan; Li, Weili; Sun, Lei; Lin, Fan; Wang, Yongqian; Sandford, Andrew; Paré, Peter D.; McKay, Karen; Kayserova, Hana; Piskackova, Tereza; Macek, Milan; Czerska, Kamila; Sands, Dorota; Tiddens, Harm; Margarit, Sonia; Repetto, Gabriela; Sontag, Marci K.; Accurso, Frank J.; Blackman, Scott; Cutting, Garry R.; Tsui, Lap-Chee; Corey, Mary; Durie, Peter; Zielenski, Julian; Strug, Lisa J.

2010-01-01

Cystic fibrosis (CF) is a monogenic disease due to mutations in the CFTR gene. Yet, variability in CF disease presentation is presumed to be affected by modifier genes, such as those recently demonstrated for the pulmonary aspect. Here, we conduct a modifier gene study for meconium ileus (MI), an intestinal obstruction that occurs in 16–20% of CF newborns, providing linkage and association results from large family and case–control samples. Linkage analysis of modifier traits is different than linkage analysis of primary traits on which a sample was ascertained. Here, we articulate a source of confounding unique to modifier gene studies and provide an example of how one might overcome the confounding in the context of linkage studies. Our linkage analysis provided evidence of a MI locus on chromosome 12p13.3, which was segregating in up to 80% of MI families with at least one affected offspring (HLOD = 2.9). Fine mapping of the 12p13.3 region in a large case–control sample of pancreatic insufficient Canadian CF patients with and without MI pointed to the involvement of ADIPOR2 in MI (p = 0.002). This marker was substantially out of Hardy–Weinberg equilibrium in the cases only, and provided evidence of a cohort effect. The association with rs9300298 in the ADIPOR2 gene at the 12p13.3 locus was replicated in an independent sample of CF families. A protective locus, using the phenotype of no-MI, mapped to 4q13.3 (HLOD = 3.19), with substantial heterogeneity. A candidate gene in the region, SLC4A4, provided preliminary evidence of association (p = 0.002), warranting further follow-up studies. Our linkage approach was used to direct our fine-mapping studies, which uncovered two potential modifier genes worthy of follow-up. PMID:19662435
Next-generation sequencing to identify candidate genes and develop diagnostic markers for a novel Phytophthora resistance gene, RpsHC18, in soybean.

PubMed

Zhong, Chao; Sun, Suli; Li, Yinping; Duan, Canxing; Zhu, Zhendong

2018-03-01

A novel Phytophthora sojae resistance gene RpsHC18 was identified and finely mapped on soybean chromosome 3. Two NBS-LRR candidate genes were identified and two diagnostic markers of RpsHC18 were developed. Phytophthora root rot caused by Phytophthora sojae is a destructive disease of soybean. The most effective disease-control strategy is to deploy resistant cultivars carrying Phytophthora-resistant Rps genes. The soybean cultivar Huachun 18 has a broad and distinct resistance spectrum to 12 P. sojae isolates. Quantitative trait loci sequencing (QTL-seq), based on the whole-genome resequencing (WGRS) of two extreme resistant and susceptible phenotype bulks from an F 2:3 population, was performed, and one 767-kb genomic region with ΔSNP-index ≥ 0.9 on chromosome 3 was identified as the RpsHC18 candidate region in Huachun 18. The candidate region was reduced to a 146-kb region by fine mapping. Nonsynonymous SNP and haplotype analyses were carried out in the 146-kb region among ten soybean genotypes using WGRS. Four specific nonsynonymous SNPs were identified in two nucleotide-binding sites-leucine-rich repeat (NBS-LRR) genes, RpsHC18-NBL1 and RpsHC18-NBL2, which were considered to be the candidate genes. Finally, one specific SNP marker in each candidate gene was successfully developed using a tetra-primer ARMS-PCR assay, and the two markers were verified to be specific for RpsHC18 and to effectively distinguish other known Rps genes. In this study, we applied an integrated genomic-based strategy combining WGRS with traditional genetic mapping to identify RpsHC18 candidate genes and develop diagnostic markers. These results suggest that next-generation sequencing is a precise, rapid and cost-effective way to identify candidate genes and develop diagnostic markers, and it can accelerate Rps gene cloning and marker-assisted selection for breeding of P. sojae-resistant soybean cultivars.
Database of cattle candidate genes and genetic markers for milk production and mastitis

PubMed Central

Ogorevc, J; Kunej, T; Razpet, A; Dovc, P

2009-01-01

A cattle database of candidate genes and genetic markers for milk production and mastitis has been developed to provide an integrated research tool incorporating different types of information supporting a genomic approach to study lactation, udder development and health. The database contains 943 genes and genetic markers involved in mammary gland development and function, representing candidates for further functional studies. The candidate loci were drawn on a genetic map to reveal positional overlaps. For identification of candidate loci, data from seven different research approaches were exploited: (i) gene knockouts or transgenes in mice that result in specific phenotypes associated with mammary gland (143 loci); (ii) cattle QTL for milk production (344) and mastitis related traits (71); (iii) loci with sequence variations that show specific allele-phenotype interactions associated with milk production (24) or mastitis (10) in cattle; (iv) genes with expression profiles associated with milk production (207) or mastitis (107) in cattle or mouse; (v) cattle milk protein genes that exist in different genetic variants (9); (vi) miRNAs expressed in bovine mammary gland (32) and (vii) epigenetically regulated cattle genes associated with mammary gland function (1). Fourty-four genes found by multiple independent analyses were suggested as the most promising candidates and were further in silico analysed for expression levels in lactating mammary gland, genetic variability and top biological functions in functional networks. A miRNA target search for mammary gland expressed miRNAs identified 359 putative binding sites in 3′UTRs of candidate genes. PMID:19508288
Dissecting the organ specificity of insecticide resistance candidate genes in Anopheles gambiae: known and novel candidate genes.

PubMed

Ingham, Victoria A; Jones, Christopher M; Pignatelli, Patricia; Balabanidou, Vasileia; Vontas, John; Wagstaff, Simon C; Moore, Jonathan D; Ranson, Hilary

2014-11-25

The elevated expression of enzymes with insecticide metabolism activity can lead to high levels of insecticide resistance in the malaria vector, Anopheles gambiae. In this study, adult female mosquitoes from an insecticide susceptible and resistant strain were dissected into four different body parts. RNA from each of these samples was used in microarray analysis to determine the enrichment patterns of the key detoxification gene families within the mosquito and to identify additional candidate insecticide resistance genes that may have been overlooked in previous experiments on whole organisms. A general enrichment in the transcription of genes from the four major detoxification gene families (carboxylesterases, glutathione transferases, UDP glucornyltransferases and cytochrome P450s) was observed in the midgut and malpighian tubules. Yet the subset of P450 genes that have previously been implicated in insecticide resistance in An gambiae, show a surprisingly varied profile of tissue enrichment, confirmed by qPCR and, for three candidates, by immunostaining. A stringent selection process was used to define a list of 105 genes that are significantly (p ≤0.001) over expressed in body parts from the resistant versus susceptible strain. Over half of these, including all the cytochrome P450s on this list, were identified in previous whole organism comparisons between the strains, but several new candidates were detected, notably from comparisons of the transcriptomes from dissected abdomen integuments. The use of RNA extracted from the whole organism to identify candidate insecticide resistance genes has a risk of missing candidates if key genes responsible for the phenotype have restricted expression within the body and/or are over expression only in certain tissues. However, as transcription of genes implicated in metabolic resistance to insecticides is not enriched in any one single organ, comparison of the transcriptome of individual dissected body parts cannot be recommended as a preferred means to identify new candidate insecticide resistant genes. Instead the rich data set on in vivo sites of transcription should be consulted when designing follow up qPCR validation steps, or for screening known candidates in field populations.
Meta-review of protein network regulating obesity between validated obesity candidate genes in the white adipose tissue of high-fat diet-induced obese C57BL/6J mice.

PubMed

Kim, Eunjung; Kim, Eun Jung; Seo, Seung-Won; Hur, Cheol-Goo; McGregor, Robin A; Choi, Myung-Sook

2014-01-01

Worldwide obesity and related comorbidities are increasing, but identifying new therapeutic targets remains a challenge. A plethora of microarray studies in diet-induced obesity models has provided large datasets of obesity associated genes. In this review, we describe an approach to examine the underlying molecular network regulating obesity, and we discuss interactions between obesity candidate genes. We conducted network analysis on functional protein-protein interactions associated with 25 obesity candidate genes identified in a literature-driven approach based on published microarray studies of diet-induced obesity. The obesity candidate genes were closely associated with lipid metabolism and inflammation. Peroxisome proliferator activated receptor gamma (Pparg) appeared to be a core obesity gene, and obesity candidate genes were highly interconnected, suggesting a coordinately regulated molecular network in adipose tissue. In conclusion, the current network analysis approach may help elucidate the underlying molecular network regulating obesity and identify anti-obesity targets for therapeutic intervention.
Whole exome sequencing identifies novel candidate genes that modify chronic obstructive pulmonary disease susceptibility.

PubMed

Bruse, Shannon; Moreau, Michael; Bromberg, Yana; Jang, Jun-Ho; Wang, Nan; Ha, Hongseok; Picchi, Maria; Lin, Yong; Langley, Raymond J; Qualls, Clifford; Klensney-Tait, Julia; Zabner, Joseph; Leng, Shuguang; Mao, Jenny; Belinsky, Steven A; Xing, Jinchuan; Nyunoya, Toru

2016-01-07

Chronic obstructive pulmonary disease (COPD) is characterized by an irreversible airflow limitation in response to inhalation of noxious stimuli, such as cigarette smoke. However, only 15-20 % smokers manifest COPD, suggesting a role for genetic predisposition. Although genome-wide association studies have identified common genetic variants that are associated with susceptibility to COPD, effect sizes of the identified variants are modest, as is the total heritability accounted for by these variants. In this study, an extreme phenotype exome sequencing study was combined with in vitro modeling to identify COPD candidate genes. We performed whole exome sequencing of 62 highly susceptible smokers and 30 exceptionally resistant smokers to identify rare variants that may contribute to disease risk or resistance to COPD. This was a cross-sectional case-control study without therapeutic intervention or longitudinal follow-up information. We identified candidate genes based on rare variant analyses and evaluated exonic variants to pinpoint individual genes whose function was computationally established to be significantly different between susceptible and resistant smokers. Top scoring candidate genes from these analyses were further filtered by requiring that each gene be expressed in human bronchial epithelial cells (HBECs). A total of 81 candidate genes were thus selected for in vitro functional testing in cigarette smoke extract (CSE)-exposed HBECs. Using small interfering RNA (siRNA)-mediated gene silencing experiments, we showed that silencing of several candidate genes augmented CSE-induced cytotoxicity in vitro. Our integrative analysis through both genetic and functional approaches identified two candidate genes (TACC2 and MYO1E) that augment cigarette smoke (CS)-induced cytotoxicity and, potentially, COPD susceptibility.

Genome-scale reconstruction of the metabolic network in Yersinia pestis, strain 91001

DOE Office of Scientific and Technical Information (OSTI.GOV)

Navid, A; Almaas, E

2009-01-13

The gram-negative bacterium Yersinia pestis, the aetiological agent of bubonic plague, is one the deadliest pathogens known to man. Despite its historical reputation, plague is a modern disease which annually afflicts thousands of people. Public safety considerations greatly limit clinical experimentation on this organism and thus development of theoretical tools to analyze the capabilities of this pathogen is of utmost importance. Here, we report the first genome-scale metabolic model of Yersinia pestis biovar Mediaevalis based both on its recently annotated genome, and physiological and biochemical data from literature. Our model demonstrates excellent agreement with Y. pestis known metabolic needs andmore » capabilities. Since Y. pestis is a meiotrophic organism, we have developed CryptFind, a systematic approach to identify all candidate cryptic genes responsible for known and theoretical meiotrophic phenomena. In addition to uncovering every known cryptic gene for Y. pestis, our analysis of the rhamnose fermentation pathway suggests that betB is the responsible cryptic gene. Despite all of our medical advances, we still do not have a vaccine for bubonic plague. Recent discoveries of antibiotic resistant strains of Yersinia pestis coupled with the threat of plague being used as a bioterrorism weapon compel us to develop new tools for studying the physiology of this deadly pathogen. Using our theoretical model, we can study the cell's phenotypic behavior under different circumstances and identify metabolic weaknesses which may be harnessed for the development of therapeutics. Additionally, the automatic identification of cryptic genes expands the usage of genomic data for pharmaceutical purposes.« less
Gene expression divergence between malaria vector sibling species Anopheles gambiae and An. coluzzii from rural and urban Yaoundé Cameroon

PubMed Central

Cassone, Bryan J.; Kamdem, Colince; Cheng, Changde; Tan, John C.; Hahn, Matthew W.; Costantini, Carlo; Besansky, Nora J.

2014-01-01

Divergent selection based on aquatic larval ecology is a likely factor in the recent isolation of two broadly sympatric and morphologically identical African mosquito species, the malaria vectors Anopheles gambiae and An. coluzzii. Population-based genome scans have revealed numerous candidate regions of recent positive selection, but have provided few clues as to the genetic mechanisms underlying behavioral and physiological divergence between the two species, phenotypes which themselves remain obscure. To uncover possible genetic mechanisms, we compared global transcriptional profiles of natural and experimental populations using gene-based microarrays. Larvae were sampled as second and fourth instars from natural populations in and around the city of Yaoundé, capital of Cameroon, where the two species segregate along a gradient of urbanization. Functional enrichment analysis of differentially expressed genes revealed that An. coluzzii—the species that breeds in more stable, biotically complex and potentially polluted urban water bodies—over-expresses genes implicated in detoxification and immunity relative to An. gambiae, which breeds in more ephemeral and relatively depauperate pools and puddles in suburbs and rural areas. Moreover, our data suggest that such over-expression by An. coluzzii is not a transient result of induction by xenobiotics in the larval habitat, but an inherent and presumably adaptive response to repeatedly encountered environmental stressors. Finally, we find no significant overlap between the differentially expressed loci and previously identified genomic regions of recent positive selection, suggesting that transcriptome divergence is regulated by trans-acting factors rather than cis-acting elements. PMID:24673723
Evaluating Reported Candidate Gene Associations with Polycystic Ovary Syndrome

PubMed Central

Pau, Cindy; Saxena, Richa; Welt, Corrine Kolka

2013-01-01

Objective To replicate variants in candidate genes associated with PCOS in a population of European PCOS and control subjects. Design Case-control association analysis and meta-analysis. Setting Major academic hospital Patients Women of European ancestry with PCOS (n=525) and controls (n=472), aged 18 to 45 years. Intervention Variants previously associated with PCOS in candidate gene studies were genotyped (n=39). Metabolic, reproductive and anthropomorphic parameters were examined as a function of the candidate variants. All genetic association analyses were adjusted for age, BMI and ancestry and were reported after correction for multiple testing. Main Outcome Measure Association of candidate gene variants with PCOS. Results Three variants, rs3797179 (SRD5A1), rs12473543 (POMC), and rs1501299 (ADIPOQ), were nominally associated with PCOS. However, they did not remain significant after correction for multiple testing and none of the variants replicated in a sufficiently powered meta-analysis. Variants in the FBN3 gene (rs17202517 and rs73503752) were associated with smaller waist circumferences and variant rs727428 in the SHBG gene was associated with lower SHBG levels. Conclusion Previously identified variants in candidate genes do not appear to be associated with PCOS risk. PMID:23375202
Reranking candidate gene models with cross-species comparison for improved gene prediction

PubMed Central

Liu, Qian; Crammer, Koby; Pereira, Fernando CN; Roos, David S

2008-01-01

Background Most gene finders score candidate gene models with state-based methods, typically HMMs, by combining local properties (coding potential, splice donor and acceptor patterns, etc). Competing models with similar state-based scores may be distinguishable with additional information. In particular, functional and comparative genomics datasets may help to select among competing models of comparable probability by exploiting features likely to be associated with the correct gene models, such as conserved exon/intron structure or protein sequence features. Results We have investigated the utility of a simple post-processing step for selecting among a set of alternative gene models, using global scoring rules to rerank competing models for more accurate prediction. For each gene locus, we first generate the K best candidate gene models using the gene finder Evigan, and then rerank these models using comparisons with putative orthologous genes from closely-related species. Candidate gene models with lower scores in the original gene finder may be selected if they exhibit strong similarity to probable orthologs in coding sequence, splice site location, or signal peptide occurrence. Experiments on Drosophila melanogaster demonstrate that reranking based on cross-species comparison outperforms the best gene models identified by Evigan alone, and also outperforms the comparative gene finders GeneWise and Augustus+. Conclusion Reranking gene models with cross-species comparison improves gene prediction accuracy. This straightforward method can be readily adapted to incorporate additional lines of evidence, as it requires only a ranked source of candidate gene models. PMID:18854050
APPLICATION OF GENE ARRAY TECHNOLOGY IN THE RESEARCH OF CARDIOPULMONARY TOXICITY INDUCED BY PARTICULATE MATTER (PM) AND ITS CONSTITUENTS.

EPA Science Inventory

Because of its ability to provide a "snap-shot" view of expression of large number of genes simultaneously, the microarray technology may be a useful tool to uncover new mechanisms of toxicity. This proposal will use the state-of-the-art gene microarrays and a new bioinformatic t...
Variable association of reactive intermediate genes with systemic lupus erythematosus (SLE) in populations with different African ancestry

PubMed Central

Ramos, Paula S.; Oates, James C.; Kamen, Diane L.; Williams, Adrienne H.; Gaffney, Patrick M.; Kelly, Jennifer A.; Kaufman, Kenneth M.; Kimberly, Robert P.; Niewold, Timothy B.; Jacob, Chaim O.; Tsao, Betty P.; Alarcón, Graciela S.; Brown, Elizabeth E.; Edberg, Jeffrey C.; Petri, Michelle A.; Ramsey-Goldman, Rosalind; Reveille, John D.; Vilá, Luis M.; James, Judith A.; Guthridge, Joel M.; Merrill, Joan T.; Boackle, Susan A.; Freedman, Barry I.; Scofield, R. Hal; Stevens, Anne M.; Vyse, Timothy J.; Criswell, Lindsey A.; Moser, Kathy L.; Alarcón-Riquelme, Marta E.; Langefeld, Carl D.; Harley, John B.; Gilkeson, Gary S.

2013-01-01

Objective Little is known about the genetic etiology of systemic lupus erythematosus (SLE) in individuals of African ancestry, despite its higher prevalence and greater disease severity. Overproduction of nitric oxide (NO) and reactive oxygen species are implicated in the pathogenesis and severity of SLE, making NO synthases and other reactive intermediate related genes biological candidates for disease susceptibility. This study analyzed variation in reactive intermediate genes for association with SLE in two populations with African ancestry. Methods A total of 244 SNPs from 53 regions were analyzed in non-Gullah African Americans (AA; 1432 cases and 1687 controls) and the genetically more homogeneous Gullah of the Sea Islands of South Carolina (133 cases and 112 controls) and. Single-marker, haplotype, and two-locus interaction tests were computed for these populations. Results The glutathione reductase gene GSR (rs2253409, P=0.0014, OR [95% CI]=1.26 [1.09–1.44]) was the most significant single-SNP association in AA. In the Gullah, the NADH dehydrogenase NDUFS4 (rs381575, P=0.0065, OR [95%CI]=2.10 [1.23–3.59]) and nitric oxide synthase gene NOS1 (rs561712, P=0.0072, OR [95%CI]=0.62 [0.44–0.88]) were most strongly associated with SLE. When both populations were analyzed together, GSR remained the most significant effect (rs2253409, P=0.00072, OR [95%CI]=1.26 [1.10–1.44]). Haplotype and two-locus interaction analyses also uncovered different loci in each population. Conclusion These results suggest distinct patterns of association with SLE in African-derived populations; specific loci may be more strongly associated within select population groups. PMID:23637325
Targeted Exome Sequencing of Krebs Cycle Genes Reveals Candidate Cancer-Predisposing Mutations in Pheochromocytomas and Paragangliomas.

PubMed

Remacha, Laura; Comino-Méndez, Iñaki; Richter, Susan; Contreras, Laura; Currás-Freixes, María; Pita, Guillermo; Letón, Rocío; Galarreta, Antonio; Torres-Pérez, Rafael; Honrado, Emiliano; Jiménez, Scherezade; Maestre, Lorena; Moran, Sebastian; Esteller, Manel; Satrústegui, Jorgina; Eisenhofer, Graeme; Robledo, Mercedes; Cascón, Alberto

2017-10-15

Purpose: Mutations in Krebs cycle genes are frequently found in patients with pheochromocytomas/paragangliomas. Disruption of SDH, FH or MDH2 enzymatic activities lead to accumulation of specific metabolites, which give rise to epigenetic changes in the genome that cause a characteristic hypermethylated phenotype. Tumors showing this phenotype, but no alterations in the known predisposing genes, could harbor mutations in other Krebs cycle genes. Experimental Design: We used downregulation and methylation of RBP1, as a marker of a hypermethylation phenotype, to select eleven pheochromocytomas and paragangliomas for targeted exome sequencing of a panel of Krebs cycle-related genes. Methylation profiling, metabolite assessment and additional analyses were also performed in selected cases. Results: One of the 11 tumors was found to carry a known cancer-predisposing somatic mutation in IDH1 A variant in GOT2 , c.357A>T, found in a patient with multiple tumors, was associated with higher tumor mRNA and protein expression levels, increased GOT2 enzymatic activity in lymphoblastic cells, and altered metabolite ratios both in tumors and in GOT2 knockdown HeLa cells transfected with the variant. Array methylation-based analysis uncovered a somatic epigenetic mutation in SDHC in a patient with multiple pheochromocytomas and a gastrointestinal stromal tumor. Finally, a truncating germline IDH3B mutation was found in a patient with a single paraganglioma showing an altered α-ketoglutarate/isocitrate ratio. Conclusions: This study further attests to the relevance of the Krebs cycle in the development of PCC and PGL, and points to a potential role of other metabolic enzymes involved in metabolite exchange between mitochondria and cytosol. Clin Cancer Res; 23(20); 6315-24. ©2017 AACR . ©2017 American Association for Cancer Research.
Variable association of reactive intermediate genes with systemic lupus erythematosus in populations with different African ancestry.

PubMed

Ramos, Paula S; Oates, James C; Kamen, Diane L; Williams, Adrienne H; Gaffney, Patrick M; Kelly, Jennifer A; Kaufman, Kenneth M; Kimberly, Robert P; Niewold, Timothy B; Jacob, Chaim O; Tsao, Betty P; Alarcón, Graciela S; Brown, Elizabeth E; Edberg, Jeffrey C; Petri, Michelle A; Ramsey-Goldman, Rosalind; Reveille, John D; Vilá, Luis M; James, Judith A; Guthridge, Joel M; Merrill, Joan T; Boackle, Susan A; Freedman, Barry I; Scofield, R Hal; Stevens, Anne M; Vyse, Timothy J; Criswell, Lindsey A; Moser, Kathy L; Alarcón-Riquelme, Marta E; Langefeld, Carl D; Harley, John B; Gilkeson, Gary S

2013-06-01

Little is known about the genetic etiology of systemic lupus erythematosus (SLE) in individuals of African ancestry, despite its higher prevalence and greater disease severity. Overproduction of nitric oxide (NO) and reactive oxygen species are implicated in the pathogenesis and severity of SLE, making NO synthases and other reactive intermediate-related genes biological candidates for disease susceptibility. We analyzed variation in reactive intermediate genes for association with SLE in 2 populations with African ancestry. A total of 244 single-nucleotide polymorphisms (SNP) from 53 regions were analyzed in non-Gullah African Americans (AA; 1432 cases and 1687 controls) and the genetically more homogeneous Gullah of the Sea Islands of South Carolina (133 cases and 112 controls). Single-marker, haplotype, and 2-locus interaction tests were computed for these populations. The glutathione reductase gene GSR (rs2253409; p = 0.0014, OR 1.26, 95% CI 1.09-1.44) was the most significant single SNP association in AA. In the Gullah, the NADH dehydrogenase NDUFS4 (rs381575; p = 0.0065, OR 2.10, 95% CI 1.23-3.59) and NO synthase gene NOS1 (rs561712; p = 0.0072, OR 0.62, 95% CI 0.44-0.88) were most strongly associated with SLE. When both populations were analyzed together, GSR remained the most significant effect (rs2253409; p = 0.00072, OR 1.26, 95% CI 1.10-1.44). Haplotype and 2-locus interaction analyses also uncovered different loci in each population. These results suggest distinct patterns of association with SLE in African-derived populations; specific loci may be more strongly associated within select population groups.
Candidate gene prioritization by network analysis of differential expression using machine learning approaches

PubMed Central

2010-01-01

Background Discovering novel disease genes is still challenging for diseases for which no prior knowledge - such as known disease genes or disease-related pathways - is available. Performing genetic studies frequently results in large lists of candidate genes of which only few can be followed up for further investigation. We have recently developed a computational method for constitutional genetic disorders that identifies the most promising candidate genes by replacing prior knowledge by experimental data of differential gene expression between affected and healthy individuals. To improve the performance of our prioritization strategy, we have extended our previous work by applying different machine learning approaches that identify promising candidate genes by determining whether a gene is surrounded by highly differentially expressed genes in a functional association or protein-protein interaction network. Results We have proposed three strategies scoring disease candidate genes relying on network-based machine learning approaches, such as kernel ridge regression, heat kernel, and Arnoldi kernel approximation. For comparison purposes, a local measure based on the expression of the direct neighbors is also computed. We have benchmarked these strategies on 40 publicly available knockout experiments in mice, and performance was assessed against results obtained using a standard procedure in genetics that ranks candidate genes based solely on their differential expression levels (Simple Expression Ranking). Our results showed that our four strategies could outperform this standard procedure and that the best results were obtained using the Heat Kernel Diffusion Ranking leading to an average ranking position of 8 out of 100 genes, an AUC value of 92.3% and an error reduction of 52.8% relative to the standard procedure approach which ranked the knockout gene on average at position 17 with an AUC value of 83.7%. Conclusion In this study we could identify promising candidate genes using network based machine learning approaches even if no knowledge is available about the disease or phenotype. PMID:20840752
[Stability analysis of reference gene based on real-time PCR in Artemisia annua under cadmium treatment].

PubMed

Zhou, Liang-Yun; Mo, Ge; Wang, Sheng; Tang, Jin-Fu; Yue, Hong; Huang, Lu-Qi; Shao, Ai-Juan; Guo, Lan-Ping

2014-03-01

In this study, Actin, 18S rRNA, PAL, GAPDH and CPR of Artemisia annua were selected as candidate reference genes, and their gene-specific primers for real-time PCR were designed, then geNorm, NormFinder, BestKeeper, Delta CT and RefFinder were used to evaluate their expression stability in the leaves of A. annua under treatment of different concentrations of Cd, with the purpose of finding a reliable reference gene to ensure the reliability of gene-expression analysis. The results showed that there were some significant differences among the candidate reference genes under different treatments and the order of expression stability of candidate reference gene was Actin > 18S rRNA > PAL > GAPDH > CPR. These results suggested that Actin, 18S rRNA and PAL could be used as ideal reference genes of gene expression analysis in A. annua and multiple internal control genes were adopted for results calibration. In addition, differences in expression stability of candidate reference genes in the leaves of A. annua under the same concentrations of Cd were observed, which suggested that the screening of candidate reference genes was needed even under the same treatment. To our best knowledge, this study for the first time provided the ideal reference genes under Cd treatment in the leaves of A. annua and offered reference for the gene expression analysis of A. annua under other conditions.
Antimicrobial Resistance Gene Transfer in Drug Resistant Acinetobacter Species

USDA-ARS?s Scientific Manuscript database

Abstract: Antibiotic resistance is rapidly developing into one of the most formidable challenges for healthcare providers and researchers alike. To combat the rapid evolution of resistance, it will be important to uncover different mechanisms that bacteria use to acquire drug resistance genes. Acine...
Prediction of Human Disease Genes by Human-Mouse Conserved Coexpression Analysis

PubMed Central

Grassi, Elena; Damasco, Christian; Silengo, Lorenzo; Oti, Martin; Provero, Paolo; Di Cunto, Ferdinando

2008-01-01

Background Even in the post-genomic era, the identification of candidate genes within loci associated with human genetic diseases is a very demanding task, because the critical region may typically contain hundreds of positional candidates. Since genes implicated in similar phenotypes tend to share very similar expression profiles, high throughput gene expression data may represent a very important resource to identify the best candidates for sequencing. However, so far, gene coexpression has not been used very successfully to prioritize positional candidates. Methodology/Principal Findings We show that it is possible to reliably identify disease-relevant relationships among genes from massive microarray datasets by concentrating only on genes sharing similar expression profiles in both human and mouse. Moreover, we show systematically that the integration of human-mouse conserved coexpression with a phenotype similarity map allows the efficient identification of disease genes in large genomic regions. Finally, using this approach on 850 OMIM loci characterized by an unknown molecular basis, we propose high-probability candidates for 81 genetic diseases. Conclusion Our results demonstrate that conserved coexpression, even at the human-mouse phylogenetic distance, represents a very strong criterion to predict disease-relevant relationships among human genes. PMID:18369433
SRF modulates seizure occurrence, activity induced gene transcription and hippocampal circuit reorganization in the mouse pilocarpine epilepsy model.

PubMed

Lösing, Pascal; Niturad, Cristina Elena; Harrer, Merle; Reckendorf, Christopher Meyer Zu; Schatz, Theresa; Sinske, Daniela; Lerche, Holger; Maljevic, Snezana; Knöll, Bernd

2017-07-17

A hallmark of temporal lobe epilepsy (TLE) is hippocampal neuronal demise and aberrant mossy fiber sprouting. In addition, unrestrained neuronal activity in TLE patients induces gene expression including immediate early genes (IEGs) such as Fos and Egr1.We employed the mouse pilocarpine model to analyze the transcription factor (TF) serum response factor (SRF) in epileptogenesis, seizure induced histopathology and IEG induction. SRF is a neuronal activity regulated TF stimulating IEG expression as well as nerve fiber growth and guidance. Adult conditional SRF deficient mice (Srf CaMKCreERT2 ) were more refractory to initial status epilepticus (SE) acquisition. Further, SRF deficient mice developed more spontaneous recurrent seizures (SRS). Genome-wide transcriptomic analysis uncovered a requirement of SRF for SE and SRS induced IEG induction (e.g. Fos, Egr1, Arc, Npas4, Btg2, Atf3). SRF was required for epilepsy associated neurodegeneration, mossy fiber sprouting and inflammation. We uncovered MAP kinase signaling as SRF target during epilepsy. Upon SRF ablation, seizure evoked induction of dual specific phosphatases (Dusp5 and Dusp6) was reduced. Lower expression of these negative ERK kinase regulators correlated with altered P-ERK levels in epileptic Srf mutant animals.Overall, this study uncovered an SRF contribution to several processes of epileptogenesis in the pilocarpine model.
Identification of ANKRD11 and ZNF778 as candidate genes for autism and variable cognitive impairment in the novel 16q24.3 microdeletion syndrome

PubMed Central

Willemsen, Marjolein H; Fernandez, Bridget A; Bacino, Carlos A; Gerkes, Erica; de Brouwer, Arjan PM; Pfundt, Rolph; Sikkema-Raddatz, Birgit; Scherer, Stephen W; Marshall, Christian R; Potocki, Lorraine; van Bokhoven, Hans; Kleefstra, Tjitske

2010-01-01

The clinical use of array comparative genomic hybridization in the evaluation of patients with multiple congenital anomalies and/or mental retardation has recently led to the discovery of a number of novel microdeletion and microduplication syndromes. We present four male patients with overlapping molecularly defined de novo microdeletions of 16q24.3. The clinical features observed in these patients include facial dysmorphisms comprising prominent forehead, large ears, smooth philtrum, pointed chin and wide mouth, variable cognitive impairment, autism spectrum disorder, structural anomalies of the brain, seizures and neonatal thrombocytopenia. Although deletions vary in size, the common region of overlap is only 90 kb and comprises two known genes, Ankyrin Repeat Domain 11 (ANKRD11) (MIM 611192) and Zinc Finger 778 (ZNF778), and is located approximately 10 kb distally to Cadherin 15 (CDH15) (MIM 114019). This region is not found as a copy number variation in controls. We propose that these patients represent a novel and distinctive microdeletion syndrome, characterized by autism spectrum disorder, variable cognitive impairment, facial dysmorphisms and brain abnormalities. We suggest that haploinsufficiency of ANKRD11 and/or ZNF778 contribute to this phenotype and speculate that further investigation of non-deletion patients who have features suggestive of this 16q24.3 microdeletion syndrome might uncover other mutations in one or both of these genes. PMID:19920853
A practical guide to environmental association analysis in landscape genomics.

PubMed

Rellstab, Christian; Gugerli, Felix; Eckert, Andrew J; Hancock, Angela M; Holderegger, Rolf

2015-09-01

Landscape genomics is an emerging research field that aims to identify the environmental factors that shape adaptive genetic variation and the gene variants that drive local adaptation. Its development has been facilitated by next-generation sequencing, which allows for screening thousands to millions of single nucleotide polymorphisms in many individuals and populations at reasonable costs. In parallel, data sets describing environmental factors have greatly improved and increasingly become publicly accessible. Accordingly, numerous analytical methods for environmental association studies have been developed. Environmental association analysis identifies genetic variants associated with particular environmental factors and has the potential to uncover adaptive patterns that are not discovered by traditional tests for the detection of outlier loci based on population genetic differentiation. We review methods for conducting environmental association analysis including categorical tests, logistic regressions, matrix correlations, general linear models and mixed effects models. We discuss the advantages and disadvantages of different approaches, provide a list of dedicated software packages and their specific properties, and stress the importance of incorporating neutral genetic structure in the analysis. We also touch on additional important aspects such as sampling design, environmental data preparation, pooled and reduced-representation sequencing, candidate-gene approaches, linearity of allele-environment associations and the combination of environmental association analyses with traditional outlier detection tests. We conclude by summarizing expected future directions in the field, such as the extension of statistical approaches, environmental association analysis for ecological gene annotation, and the need for replication and post hoc validation studies. © 2015 John Wiley & Sons Ltd.
Thyroid C-Cell Biology and Oncogenic Transformation

PubMed Central

Cote, Gilbert J.; Grubbs, Elizabeth G.; Hofmann, Marie-Claude

2017-01-01

The thyroid parafollicular cell, or commonly named “C-cell,” functions in serum calcium homeostasis. Elevations in serum calcium trigger release of calcitonin from the C-cell, which in turn functions to inhibit absorption of calcium by the intestine, resorption of bone by the osteoclast, and reabsorption of calcium by renal tubular cells. Oncogenic transformation of the thyroid C-cell is thought to progress through a hyperplastic process prior to malignancy with increasing levels of serum calcitonin serving as a biomarker for tumor burden. The discovery that Multiple Endocrine Neoplasia, type 2 is caused by activating mutations of the RET gene serves to highlight the RET-RAS-MAPK signaling pathway in both initiation and progression of medullary thyroid carcinoma. Thyroid C-cells are known to express RET at high levels relative to most cell types, therefore aberrant activation of this receptor is targeted primarily to the C-cell, providing one possible cause of tissue-specific oncogenesis. The role of RET signaling in normal C-cell function is unknown though calcitonin gene transcription appears to be sensitive to RET activation. Beyond RET the modeling of oncogenesis in animals and screening of human tumors for candidate gene mutations has uncovered mutation of RAS family members and inactivation of Rb1 regulatory pathway as potential mediators of C-cell transformation. A growing understanding of how RET interacts with these pathways, both in normal C-cell function and during oncogenic transformation will help in the development of novel molecular targeted therapies. PMID:26494382
Investigation of the mechanism of meiotic DNA cleavage by VMA1-derived endonuclease uncovers a meiotic alteration in chromatin structure around the target site.

PubMed

Fukuda, Tomoyuki; Ohta, Kunihiro; Ohya, Yoshikazu

2006-06-01

VMA1-derived endonuclease (VDE), a homing endonuclease in Saccharomyces cerevisiae, is encoded by the mobile intein-coding sequence within the nuclear VMA1 gene. VDE recognizes and cleaves DNA at the 31-bp VDE recognition sequence (VRS) in the VMA1 gene lacking the intein-coding sequence during meiosis to insert a copy of the intein-coding sequence at the cleaved site. The mechanism underlying the meiosis specificity of VMA1 intein-coding sequence homing remains unclear. We studied various factors that might influence the cleavage activity in vivo and found that VDE binding to the VRS can be detected only when DNA cleavage by VDE takes place, implying that meiosis-specific DNA cleavage is regulated by the accessibility of VDE to its target site. As a possible candidate for the determinant of this accessibility, we analyzed chromatin structure around the VRS and revealed that local chromatin structure near the VRS is altered during meiosis. Although the meiotic chromatin alteration exhibits correlations with DNA binding and cleavage by VDE at the VMA1 locus, such a chromatin alteration is not necessarily observed when the VRS is embedded in ectopic gene loci. This suggests that nucleosome positioning or occupancy around the VRS by itself is not the sole mechanism for the regulation of meiosis-specific DNA cleavage by VDE and that other mechanisms are involved in the regulation.
Investigation of the Mechanism of Meiotic DNA Cleavage by VMA1-Derived Endonuclease Uncovers a Meiotic Alteration in Chromatin Structure around the Target Site

PubMed Central

Fukuda, Tomoyuki; Ohta, Kunihiro; Ohya, Yoshikazu

2006-01-01

VMA1-derived endonuclease (VDE), a homing endonuclease in Saccharomyces cerevisiae, is encoded by the mobile intein-coding sequence within the nuclear VMA1 gene. VDE recognizes and cleaves DNA at the 31-bp VDE recognition sequence (VRS) in the VMA1 gene lacking the intein-coding sequence during meiosis to insert a copy of the intein-coding sequence at the cleaved site. The mechanism underlying the meiosis specificity of VMA1 intein-coding sequence homing remains unclear. We studied various factors that might influence the cleavage activity in vivo and found that VDE binding to the VRS can be detected only when DNA cleavage by VDE takes place, implying that meiosis-specific DNA cleavage is regulated by the accessibility of VDE to its target site. As a possible candidate for the determinant of this accessibility, we analyzed chromatin structure around the VRS and revealed that local chromatin structure near the VRS is altered during meiosis. Although the meiotic chromatin alteration exhibits correlations with DNA binding and cleavage by VDE at the VMA1 locus, such a chromatin alteration is not necessarily observed when the VRS is embedded in ectopic gene loci. This suggests that nucleosome positioning or occupancy around the VRS by itself is not the sole mechanism for the regulation of meiosis-specific DNA cleavage by VDE and that other mechanisms are involved in the regulation. PMID:16757746
Essential and Unexpected Role of YY1 to Promote Mesodermal Cardiac Differentiation

PubMed Central

Gregoire, Serge; Karra, Ravi; Passer, Derek; Deutsch, Marcus-Andre; Krane, Markus; Feistritzer, Rebecca; Sturzu, Anthony; Domian, Ibrahim; Saga, Yumiko; Wu, Sean M.

2013-01-01

Rational Cardiogenesis is regulated by a complex interplay between transcription factors. However, little is known about how these interactions regulate the transition from mesodermal precursors to cardiac progenitor cells (CPCs). Objective To identify novel regulators of mesodermal cardiac lineage commitment. Methods and Results We performed a bioinformatic-based transcription factor binding site analysis on upstream promoter regions of genes that are enriched in embryonic stem cell (ESC)-derived CPCs. From 32 candidate transcription factors screened, we found that YY1, a repressor of sarcomeric gene expression, is present in CPCs in vivo. Interestingly, we uncovered the ability of YY1 to transcriptionally activate Nkx2.5, a key marker of early cardiogenic commitment. YY1 regulates Nkx2.5 expression via a 2.1 kb cardiac-specific enhancer as demonstrated by in vitro luciferase-based assays and in vivo chromatin immunoprecipitation (ChIP) and genome-wide sequencing analysis. Furthermore, the ability of YY1 to activate Nkx2.5 expression depends on its cooperative interaction with Gata4 at a nearby chromatin. Cardiac mesoderm-specific loss-of-function of YY1 resulted in early embryonic lethality. This was corroborated in vitro by ESC-based assays where we show that the overexpression of YY1 enhanced the cardiogenic differentiation of ESCs into CPCs. Conclusion These results demonstrate an essential and unexpected role for YY1 to promote cardiogenesis as a transcriptional activator of Nkx2.5 and other CPC-enriched genes. PMID:23307821
Identification of Inherited Retinal Disease-Associated Genetic Variants in 11 Candidate Genes.

PubMed

Astuti, Galuh D N; van den Born, L Ingeborgh; Khan, M Imran; Hamel, Christian P; Bocquet, Béatrice; Manes, Gaël; Quinodoz, Mathieu; Ali, Manir; Toomes, Carmel; McKibbin, Martin; El-Asrag, Mohammed E; Haer-Wigman, Lonneke; Inglehearn, Chris F; Black, Graeme C M; Hoyng, Carel B; Cremers, Frans P M; Roosing, Susanne

2018-01-10

Inherited retinal diseases (IRDs) display an enormous genetic heterogeneity. Whole exome sequencing (WES) recently identified genes that were mutated in a small proportion of IRD cases. Consequently, finding a second case or family carrying pathogenic variants in the same candidate gene often is challenging. In this study, we searched for novel candidate IRD gene-associated variants in isolated IRD families, assessed their causality, and searched for novel genotype-phenotype correlations. Whole exome sequencing was performed in 11 probands affected with IRDs. Homozygosity mapping data was available for five cases. Variants with minor allele frequencies ≤ 0.5% in public databases were selected as candidate disease-causing variants. These variants were ranked based on their: (a) presence in a gene that was previously implicated in IRD; (b) minor allele frequency in the Exome Aggregation Consortium database (ExAC); (c) in silico pathogenicity assessment using the combined annotation dependent depletion (CADD) score; and (d) interaction of the corresponding protein with known IRD-associated proteins. Twelve unique variants were found in 11 different genes in 11 IRD probands. Novel autosomal recessive and dominant inheritance patterns were found for variants in Small Nuclear Ribonucleoprotein U5 Subunit 200 ( SNRNP200 ) and Zinc Finger Protein 513 ( ZNF513 ), respectively. Using our pathogenicity assessment, a variant in DEAH-Box Helicase 32 ( DHX32 ) was the top ranked novel candidate gene to be associated with IRDs, followed by eight medium and lower ranked candidate genes. The identification of candidate disease-associated sequence variants in 11 single families underscores the notion that the previously identified IRD-associated genes collectively carry > 90% of the defects implicated in IRDs. To identify multiple patients or families with variants in the same gene and thereby provide extra proof for pathogenicity, worldwide data sharing is needed.

Transcriptome Analysis of H2O2-Treated Wheat Seedlings Reveals a H2O2-Responsive Fatty Acid Desaturase Gene Participating in Powdery Mildew Resistance

PubMed Central

Tang, Lichuan; Zhao, Guangyao; Zhu, Mingzhu; Chu, Jinfang; Sun, Xiaohong; Wei, Bo; Zhang, Xiangqi; Jia, Jizeng; Mao, Long

2011-01-01

Hydrogen peroxide (H2O2) plays important roles in plant biotic and abiotic stress responses. However, the effect of H2O2 stress on the bread wheat transcriptome is still lacking. To investigate the cellular and metabolic responses triggered by H2O2, we performed an mRNA tag analysis of wheat seedlings under 10 mM H2O2 treatment for 6 hour in one powdery mildew (PM) resistant (PmA) and two susceptible (Cha and Han) lines. In total, 6,156, 6,875 and 3,276 transcripts were found to be differentially expressed in PmA, Han and Cha respectively. Among them, 260 genes exhibited consistent expression patterns in all three wheat lines and may represent a subset of basal H2O2 responsive genes that were associated with cell defense, signal transduction, photosynthesis, carbohydrate metabolism, lipid metabolism, redox homeostasis, and transport. Among genes specific to PmA, ‘transport’ activity was significantly enriched in Gene Ontology analysis. MapMan classification showed that, while both up- and down- regulations were observed for auxin, abscisic acid, and brassinolides signaling genes, the jasmonic acid and ethylene signaling pathway genes were all up-regulated, suggesting H2O2-enhanced JA/Et functions in PmA. To further study whether any of these genes were involved in wheat PM response, 19 H2O2-responsive putative defense related genes were assayed in wheat seedlings infected with Blumeria graminis f. sp. tritici (Bgt). Eight of these genes were found to be co-regulated by H2O2 and Bgt, among which a fatty acid desaturase gene TaFAD was then confirmed by virus induced gene silencing (VIGS) to be required for the PM resistance. Together, our data presents the first global picture of the wheat transcriptome under H2O2 stress and uncovers potential links between H2O2 and Bgt responses, hence providing important candidate genes for the PM resistance in wheat. PMID:22174904
Looking into flowering time in almond (Prunus dulcis (Mill) D. A. Webb): the candidate gene approach.

PubMed

Silva, C; Garcia-Mas, J; Sánchez, A M; Arús, P; Oliveira, M M

2005-03-01

Blooming time is one of the most important agronomic traits in almond. Biochemical and molecular events underlying flowering regulation must be understood before methods to stimulate late flowering can be developed. Attempts to elucidate the genetic control of this process have led to the identification of a major gene (Lb) and quantitative trait loci (QTLs) linked to observed phenotypic differences, but although this gene and these QTLs have been placed on the Prunus reference genetic map, their sequences and specific functions remain unknown. The aim of our investigation was to associate these loci with known genes using a candidate gene approach. Two almond cDNAs and eight Prunus expressed sequence tags were selected as candidate genes (CGs) since their sequences were highly identical to those of flowering regulatory genes characterized in other species. The CGs were amplified from both parental lines of the mapping population using specific primers. Sequence comparison revealed DNA polymorphisms between the parental lines, mainly of the single nucleotide type. Polymorphisms were used to develop co-dominant cleaved amplified polymorphic sequence markers or length polymorphisms based on insertion/deletion events for mapping the candidate genes on the Prunus reference map. Ten candidate genes were assigned to six linkage groups in the Prunus genome. The positions of two of these were compatible with the regions where two QTLs for blooming time were detected. One additional candidate was localized close to the position of the Evergrowing gene, which determines a non-deciduous behaviour in peach.
Bioinformatics analysis and detection of gelatinase encoded gene in Lysinibacillussphaericus

NASA Astrophysics Data System (ADS)

Repin, Rul Aisyah Mat; Mutalib, Sahilah Abdul; Shahimi, Safiyyah; Khalid, Rozida Mohd.; Ayob, Mohd. Khan; Bakar, Mohd. Faizal Abu; Isa, Mohd Noor Mat

2016-11-01

In this study, we performed bioinformatics analysis toward genome sequence of Lysinibacillussphaericus (L. sphaericus) to determine gene encoded for gelatinase. L. sphaericus was isolated from soil and gelatinase species-specific bacterium to porcine and bovine gelatin. This bacterium offers the possibility of enzymes production which is specific to both species of meat, respectively. The main focus of this research is to identify the gelatinase encoded gene within the bacteria of L. Sphaericus using bioinformatics analysis of partially sequence genome. From the research study, three candidate gene were identified which was, gelatinase candidate gene 1 (P1), NODE_71_length_93919_cov_158.931839_21 which containing 1563 base pair (bp) in size with 520 amino acids sequence; Secondly, gelatinase candidate gene 2 (P2), NODE_23_length_52851_cov_190.061386_17 which containing 1776 bp in size with 591 amino acids sequence; and Thirdly, gelatinase candidate gene 3 (P3), NODE_106_length_32943_cov_169.147919_8 containing 1701 bp in size with 566 amino acids sequence. Three pairs of oligonucleotide primers were designed and namely as, F1, R1, F2, R2, F3 and R3 were targeted short sequences of cDNA by PCR. The amplicons were reliably results in 1563 bp in size for candidate gene P1 and 1701 bp in size for candidate gene P3. Therefore, the results of bioinformatics analysis of L. Sphaericus resulting in gene encoded gelatinase were identified.
Separating the wheat from the chaff: systematic identification of functionally relevant noncoding variants in ADHD.

PubMed

Tong, J H S; Hawi, Z; Dark, C; Cummins, T D R; Johnson, B P; Newman, D P; Lau, R; Vance, A; Heussler, H S; Matthews, N; Bellgrove, M A; Pang, K C

2016-11-01

Attention deficit hyperactivity disorder (ADHD) is a highly heritable psychiatric condition with negative lifetime outcomes. Uncovering its genetic architecture should yield important insights into the neurobiology of ADHD and assist development of novel treatment strategies. Twenty years of candidate gene investigations and more recently genome-wide association studies have identified an array of potential association signals. In this context, separating the likely true from false associations ('the wheat' from 'the chaff') will be crucial for uncovering the functional biology of ADHD. Here, we defined a set of 2070 DNA variants that showed evidence of association with ADHD (or were in linkage disequilibrium). More than 97% of these variants were noncoding, and were prioritised for further exploration using two tools-genome-wide annotation of variants (GWAVA) and Combined Annotation-Dependent Depletion (CADD)-that were recently developed to rank variants based upon their likely pathogenicity. Capitalising on recent efforts such as the Encyclopaedia of DNA Elements and US National Institutes of Health Roadmap Epigenomics Projects to improve understanding of the noncoding genome, we subsequently identified 65 variants to which we assigned functional annotations, based upon their likely impact on alternative splicing, transcription factor binding and translational regulation. We propose that these 65 variants, which possess not only a high likelihood of pathogenicity but also readily testable functional hypotheses, represent a tractable shortlist for future experimental validation in ADHD. Taken together, this study brings into sharp focus the likely relevance of noncoding variants for the genetic risk associated with ADHD, and more broadly suggests a bioinformatics approach that should be relevant to other psychiatric disorders.
Association and linkage studies of candidate genes involved in GABAergic neurotransmission in lithium-responsive bipolar disorder.

PubMed Central

Duffy, A; Turecki, G; Grof, P; Cavazzoni, P; Grof, E; Joober, R; Ahrens, B; Berghöfer, A; Müller-Oerlinghausen, B; Dvoráková, M; Libigerová, E; Vojtĕchovský, M; Zvolský, P; Nilsson, A; Licht, R W; Rasmussen, N A; Schou, M; Vestergaard, P; Holzinger, A; Schumann, C; Thau, K; Robertson, C; Rouleau, G A; Alda, M

2000-01-01

OBJECTIVE: To test for genetic linkage and association with GABAergic candidate genes in lithium-responsive bipolar disorder. DESIGN: Polymorphisms located in genes that code for GABRA3, GABRA5 and GABRB3 subunits of the GABAA receptor were investigated using association and linkage strategies. PARTICIPANTS: A total of 138 patients with bipolar 1 disorder with a clear response to lithium prophylaxis, selected from specialized lithium clinics in Canada and Europe that are part of the International Group for the Study of Lithium-Treated Patients, and 108 psychiatrically healthy controls. Families of 24 probands were suitable for linkage analysis. OUTCOME MEASURES: The association between the candidate genes and patients with bipolar disorder versus that of controls and genetic linkage within families. RESULTS: There was no significant association or linkage found between lithium-responsive bipolar disorder and the GABAergic candidate genes investigated. CONCLUSIONS: This study does not support a major role for the GABAergic candidate genes tested in lithium-responsive bipolar disorder. PMID:11022400
Diversity of ARSACS mutations in French-Canadians.

PubMed

Thiffault, I; Dicaire, M J; Tetreault, M; Huang, K N; Demers-Lamarche, J; Bernard, G; Duquette, A; Larivière, R; Gehring, K; Montpetit, A; McPherson, P S; Richter, A; Montermini, L; Mercier, J; Mitchell, G A; Dupré, N; Prévost, C; Bouchard, J P; Mathieu, J; Brais, B

2013-01-01

The growing number of spastic ataxia of Charlevoix-Saguenay (SACS) gene mutations reported worldwide has broadened the clinical phenotype of autosomal recessive spastic ataxia of Charlevoix-Saguenay (ARSACS). The identification of Quebec ARSACS cases without two known SACS mutation led to the development of a multi-modal genomic strategy to uncover mutations in this large gene and explore phenotype variability. Search for SACS mutations by combining various methods on 20 cases with a classical French-Canadian ARSACS phenotype without two mutations and a group of 104 sporadic or recessive spastic ataxia cases of unknown cause. Western blot on lymphoblast protein from cases with different genotypes was probed to establish if they still expressed sacsin. A total of 12 mutations, including 7 novels, were uncovered in Quebec ARSACS cases. The screening of 104 spastic ataxia cases of unknown cause for 98 SACS mutations did not uncover carriers of two mutations. Compounds heterozygotes for one missense SACS mutation were found to minimally express sacsin. The large number of SACS mutations present even in Quebec suggests that the size of the gene alone may explain the great genotypic diversity. This study does not support an expanding ARSACS phenotype in the French-Canadian population. Most mutations lead to loss of function, though phenotypic variability in other populations may reflect partial loss of function with preservation of some sacsin expression. Our results also highlight the challenge of SACS mutation screening and the necessity to develop new generation sequencing methods to ensure low cost complete gene sequencing.
Defining a new candidate gene for amelogenesis imperfecta: from molecular genetics to biochemistry.

PubMed

Urzúa, Blanca; Ortega-Pinto, Ana; Morales-Bozo, Irene; Rojas-Alcayaga, Gonzalo; Cifuentes, Víctor

2011-02-01

Amelogenesis imperfecta is a group of genetic conditions that affect the structure and clinical appearance of tooth enamel. The types (hypoplastic, hypocalcified, and hypomature) are correlated with defects in different stages of the process of enamel synthesis. Autosomal dominant, recessive, and X-linked types have been previously described. These disorders are considered clinically and genetically heterogeneous in etiology, involving a variety of genes, such as AMELX, ENAM, DLX3, FAM83H, MMP-20, KLK4, and WDR72. The mutations identified within these causal genes explain less than half of all cases of amelogenesis imperfecta. Most of the candidate and causal genes currently identified encode proteins involved in enamel synthesis. We think it is necessary to refocus the search for candidate genes using biochemical processes. This review provides theoretical evidence that the human SLC4A4 gene (sodium bicarbonate cotransporter) may be a new candidate gene.
A public platform for the verification of the phenotypic effect of candidate genes for resistance to aflatoxin accumulation and Aspergillus flavus infection in maize.

PubMed

Warburton, Marilyn L; Williams, William Paul; Hawkins, Leigh; Bridges, Susan; Gresham, Cathy; Harper, Jonathan; Ozkan, Seval; Mylroie, J Erik; Shan, Xueyan

2011-07-01

A public candidate gene testing pipeline for resistance to aflatoxin accumulation or Aspergillus flavus infection in maize is presented here. The pipeline consists of steps for identifying, testing, and verifying the association of selected maize gene sequences with resistance under field conditions. Resources include a database of genetic and protein sequences associated with the reduction in aflatoxin contamination from previous studies; eight diverse inbred maize lines for polymorphism identification within any maize gene sequence; four Quantitative Trait Loci (QTL) mapping populations and one association mapping panel, all phenotyped for aflatoxin accumulation resistance and associated phenotypes; and capacity for Insertion/Deletion (InDel) and SNP genotyping in the population(s) for mapping. To date, ten genes have been identified as possible candidate genes and put through the candidate gene testing pipeline, and results are presented here to demonstrate the utility of the pipeline.
ENU Mutagenesis in Mice Identifies Candidate Genes For Hypogonadism

PubMed Central

Weiss, Jeffrey; Hurley, Lisa A.; Harris, Rebecca M.; Finlayson, Courtney; Tong, Minghan; Fisher, Lisa A.; Moran, Jennifer L.; Beier, David R.; Mason, Christopher; Jameson, J. Larry

2012-01-01

Genome-wide mutagenesis was performed in mice to identify candidate genes for male infertility, for which the predominant causes remain idiopathic. Mice were mutagenized using N-ethyl-N-nitrosourea (ENU), bred, and screened for phenotypes associated with the male urogenital system. Fifteen heritable lines were isolated and chromosomal loci were assigned using low density genome-wide SNP arrays. Ten of the fifteen lines were pursued further using higher resolution SNP analysis to narrow the candidate gene regions. Exon sequencing of candidate genes identified mutations in mice with cystic kidneys (Bicc1), cryptorchidism (Rxfp2), restricted germ cell deficiency (Plk4), and severe germ cell deficiency (Prdm9). In two other lines with severe hypogonadism candidate sequencing failed to identify mutations, suggesting defects in genes with previously undocumented roles in gonadal function. These genomic intervals were sequenced in their entirety and a candidate mutation was identified in SnrpE in one of the two lines. The line harboring the SnrpE variant retains substantial spermatogenesis despite small testis size, an unusual phenotype. In addition to the reproductive defects, heritable phenotypes were observed in mice with ataxia (Myo5a), tremors (Pmp22), growth retardation (unknown gene), and hydrocephalus (unknown gene). These results demonstrate that the ENU screen is an effective tool for identifying potential causes of male infertility. PMID:22258617
Transcriptional profiling identifies differentially expressed genes in developing turkey skeletal muscle

PubMed Central

2011-01-01

Background Skeletal muscle growth and development from embryo to adult consists of a series of carefully regulated changes in gene expression. Understanding these developmental changes in agriculturally important species is essential to the production of high quality meat products. For example, consumer demand for lean, inexpensive meat products has driven the turkey industry to unprecedented production through intensive genetic selection. However, achievements of increased body weight and muscle mass have been countered by an increased incidence of myopathies and meat quality defects. In a previous study, we developed and validated a turkey skeletal muscle-specific microarray as a tool for functional genomics studies. The goals of the current study were to utilize this microarray to elucidate functional pathways of genes responsible for key events in turkey skeletal muscle development and to compare differences in gene expression between two genetic lines of turkeys. To achieve these goals, skeletal muscle samples were collected at three critical stages in muscle development: 18d embryo (hyperplasia), 1d post-hatch (shift from myoblast-mediated growth to satellite cell-modulated growth by hypertrophy), and 16wk (market age) from two genetic lines: a randombred control line (RBC2) maintained without selection pressure, and a line (F) selected from the RBC2 line for increased 16wk body weight. Array hybridizations were performed in two experiments: Experiment 1 directly compared the developmental stages within genetic line, while Experiment 2 directly compared the two lines within each developmental stage. Results A total of 3474 genes were differentially expressed (false discovery rate; FDR < 0.001) by overall effect of development, while 16 genes were differentially expressed (FDR < 0.10) by overall effect of genetic line. Ingenuity Pathways Analysis was used to group annotated genes into networks, functions, and canonical pathways. The expression of 28 genes involved in extracellular matrix regulation, cell death/apoptosis, and calcium signaling/muscle function, as well as genes with miscellaneous function was confirmed by qPCR. Conclusions The current study identified gene pathways and uncovered novel genes important in turkey muscle growth and development. Future experiments will focus further on several of these candidate genes and the expression and mechanism of action of their protein products. PMID:21385442
The cld mutation: narrowing the critical chromosomal region and selecting candidate genes.

PubMed

Péterfy, Miklós; Mao, Hui Z; Doolittle, Mark H

2006-10-01

Combined lipase deficiency (cld) is a recessive, lethal mutation specific to the tw73 haplotype on mouse Chromosome 17. While the cld mutation results in lipase proteins that are inactive, aggregated, and retained in the endoplasmic reticulum (ER), it maps separately from the lipase structural genes. We have narrowed the gene critical region by about 50% using the tw18 haplotype for deletion mapping and a recombinant chromosome used originally to map cld with respect to the phenotypic marker tf. The region now extends from 22 to 25.6 Mbp on the wild-type chromosome, currently containing 149 genes and 50 expressed sequence tags (ESTs). To identify the affected gene, we have selected candidates based on their known role in associated biological processes, cellular components, and molecular functions that best fit with the predicted function of the cld gene. A secondary approach was based on differences in mRNA levels between mutant (cld/cld) and unaffected (+/cld) cells. Using both approaches, we have identified seven functional candidates with an ER localization and/or an involvement in protein maturation and folding that could explain the lipase deficiency, and six expression candidates that exhibit large differences in mRNA levels between mutant and unaffected cells. Significantly, two genes were found to be candidates with regard to both function and expression, thus emerging as the strongest candidates for cld. We discuss the implications of our mapping results and our selection of candidates with respect to other genes, deletions, and mutations occurring in the cld critical region.
Annotation of a hybrid partial genome of the coffee rust (Hemileia vastatrix) contributes to the gene repertoire catalog of the Pucciniales

PubMed Central

Cristancho, Marco A.; Botero-Rozo, David Octavio; Giraldo, William; Tabima, Javier; Riaño-Pachón, Diego Mauricio; Escobar, Carolina; Rozo, Yomara; Rivera, Luis F.; Durán, Andrés; Restrepo, Silvia; Eilam, Tamar; Anikster, Yehoshua; Gaitán, Alvaro L.

2014-01-01

Coffee leaf rust caused by the fungus Hemileia vastatrix is the most damaging disease to coffee worldwide. The pathogen has recently appeared in multiple outbreaks in coffee producing countries resulting in significant yield losses and increases in costs related to its control. New races/isolates are constantly emerging as evidenced by the presence of the fungus in plants that were previously resistant. Genomic studies are opening new avenues for the study of the evolution of pathogens, the detailed description of plant-pathogen interactions and the development of molecular techniques for the identification of individual isolates. For this purpose we sequenced 8 different H. vastatrix isolates using NGS technologies and gathered partial genome assemblies due to the large repetitive content in the coffee rust hybrid genome; 74.4% of the assembled contigs harbor repetitive sequences. A hybrid assembly of 333 Mb was built based on the 8 isolates; this assembly was used for subsequent analyses. Analysis of the conserved gene space showed that the hybrid H. vastatrix genome, though highly fragmented, had a satisfactory level of completion with 91.94% of core protein-coding orthologous genes present. RNA-Seq from urediniospores was used to guide the de novo annotation of the H. vastatrix gene complement. In total, 14,445 genes organized in 3921 families were uncovered; a considerable proportion of the predicted proteins (73.8%) were homologous to other Pucciniales species genomes. Several gene families related to the fungal lifestyle were identified, particularly 483 predicted secreted proteins that represent candidate effector genes and will provide interesting hints to decipher virulence in the coffee rust fungus. The genome sequence of Hva will serve as a template to understand the molecular mechanisms used by this fungus to attack the coffee plant, to study the diversity of this species and for the development of molecular markers to distinguish races/isolates. PMID:25400655
Like Climbing Jacob's Ladder: An Art-Based Exploration of the Comprehensive Exam Process

ERIC Educational Resources Information Center

Shields, Sara Scott

2015-01-01

The comprehensive exam process is a rite of passage in the scholarly world, and as such the movements of this process often feel like a guarded secret to graduate students. As a PhD candidate, I left the comprehensive exam process feeling both initiated and inundated. This article is an attempt to uncover the secret that is the comprehensive exam…
Genetics and Peer Relations: A Review

ERIC Educational Resources Information Center

Brendgen, Mara

2012-01-01

Researchers have become increasingly interested in uncovering how genetic factors work together with the peer environment in influencing development. This article offers an overview of the state of knowledge. It first describes the different types of gene-environment correlations (rGE) and gene-environment interactions (GxE) that are of relevance…
Genome Reduction Uncovers a Large Dispensable Genome and Adaptive Role for Copy Number Variation in Asexually Propagated Solanum tuberosum[OPEN

PubMed Central

Hardigan, Michael A.; Crisovan, Emily; Hamilton, John P.; Laimbeer, Parker; Leisner, Courtney P.; Manrique-Carpintero, Norma C.; Newton, Linsey; Pham, Gina M.; Vaillancourt, Brieanne; Zeng, Zixian; Jiang, Jiming

2016-01-01

Clonally reproducing plants have the potential to bear a significantly greater mutational load than sexually reproducing species. To investigate this possibility, we examined the breadth of genome-wide structural variation in a panel of monoploid/doubled monoploid clones generated from native populations of diploid potato (Solanum tuberosum), a highly heterozygous asexually propagated plant. As rare instances of purely homozygous clones, they provided an ideal set for determining the degree of structural variation tolerated by this species and deriving its minimal gene complement. Extensive copy number variation (CNV) was uncovered, impacting 219.8 Mb (30.2%) of the potato genome with nearly 30% of genes subject to at least partial duplication or deletion, revealing the highly heterogeneous nature of the potato genome. Dispensable genes (>7000) were associated with limited transcription and/or a recent evolutionary history, with lower deletion frequency observed in genes conserved across angiosperms. Association of CNV with plant adaptation was highlighted by enrichment in gene clusters encoding functions for environmental stress response, with gene duplication playing a part in species-specific expansions of stress-related gene families. This study revealed unique impacts of CNV in a species with asexual reproductive habits and how CNV may drive adaption through evolution of key stress pathways. PMID:26772996
Human Intellectual Disability Genes Form Conserved Functional Modules in Drosophila

PubMed Central

Oortveld, Merel A. W.; Keerthikumar, Shivakumar; Oti, Martin; Nijhof, Bonnie; Fernandes, Ana Clara; Kochinke, Korinna; Castells-Nobau, Anna; van Engelen, Eva; Ellenkamp, Thijs; Eshuis, Lilian; Galy, Anne; van Bokhoven, Hans; Habermann, Bianca; Brunner, Han G.; Zweier, Christiane; Verstreken, Patrik; Huynen, Martijn A.; Schenck, Annette

2013-01-01

Intellectual Disability (ID) disorders, defined by an IQ below 70, are genetically and phenotypically highly heterogeneous. Identification of common molecular pathways underlying these disorders is crucial for understanding the molecular basis of cognition and for the development of therapeutic intervention strategies. To systematically establish their functional connectivity, we used transgenic RNAi to target 270 ID gene orthologs in the Drosophila eye. Assessment of neuronal function in behavioral and electrophysiological assays and multiparametric morphological analysis identified phenotypes associated with knockdown of 180 ID gene orthologs. Most of these genotype-phenotype associations were novel. For example, we uncovered 16 genes that are required for basal neurotransmission and have not previously been implicated in this process in any system or organism. ID gene orthologs with morphological eye phenotypes, in contrast to genes without phenotypes, are relatively highly expressed in the human nervous system and are enriched for neuronal functions, suggesting that eye phenotyping can distinguish different classes of ID genes. Indeed, grouping genes by Drosophila phenotype uncovered 26 connected functional modules. Novel links between ID genes successfully predicted that MYCN, PIGV and UPF3B regulate synapse development. Drosophila phenotype groups show, in addition to ID, significant phenotypic similarity also in humans, indicating that functional modules are conserved. The combined data indicate that ID disorders, despite their extreme genetic diversity, are caused by disruption of a limited number of highly connected functional modules. PMID:24204314
Human intellectual disability genes form conserved functional modules in Drosophila.

PubMed

Oortveld, Merel A W; Keerthikumar, Shivakumar; Oti, Martin; Nijhof, Bonnie; Fernandes, Ana Clara; Kochinke, Korinna; Castells-Nobau, Anna; van Engelen, Eva; Ellenkamp, Thijs; Eshuis, Lilian; Galy, Anne; van Bokhoven, Hans; Habermann, Bianca; Brunner, Han G; Zweier, Christiane; Verstreken, Patrik; Huynen, Martijn A; Schenck, Annette

2013-10-01

Intellectual Disability (ID) disorders, defined by an IQ below 70, are genetically and phenotypically highly heterogeneous. Identification of common molecular pathways underlying these disorders is crucial for understanding the molecular basis of cognition and for the development of therapeutic intervention strategies. To systematically establish their functional connectivity, we used transgenic RNAi to target 270 ID gene orthologs in the Drosophila eye. Assessment of neuronal function in behavioral and electrophysiological assays and multiparametric morphological analysis identified phenotypes associated with knockdown of 180 ID gene orthologs. Most of these genotype-phenotype associations were novel. For example, we uncovered 16 genes that are required for basal neurotransmission and have not previously been implicated in this process in any system or organism. ID gene orthologs with morphological eye phenotypes, in contrast to genes without phenotypes, are relatively highly expressed in the human nervous system and are enriched for neuronal functions, suggesting that eye phenotyping can distinguish different classes of ID genes. Indeed, grouping genes by Drosophila phenotype uncovered 26 connected functional modules. Novel links between ID genes successfully predicted that MYCN, PIGV and UPF3B regulate synapse development. Drosophila phenotype groups show, in addition to ID, significant phenotypic similarity also in humans, indicating that functional modules are conserved. The combined data indicate that ID disorders, despite their extreme genetic diversity, are caused by disruption of a limited number of highly connected functional modules.
Genome diversity of tuber-bearing Solanum uncovers complex evolutionary history and targets of domestication in the cultivated potato

PubMed Central

Hardigan, Michael A.; Laimbeer, F. Parker E.; Newton, Linsey; Crisovan, Emily; Hamilton, John P.; Vaillancourt, Brieanne; Wiegert-Rininger, Krystle; Wood, Joshua C.; Douches, David S.; Farré, Eva M.; Veilleux, Richard E.; Buell, C. Robin

2017-01-01

Cultivated potatoes (Solanum tuberosum L.), domesticated from wild Solanum species native to the Andes of southern Peru, possess a diverse gene pool representing more than 100 tuber-bearing relatives (Solanum section Petota). A diversity panel of wild species, landraces, and cultivars was sequenced to assess genetic variation within tuber-bearing Solanum and the impact of domestication on genome diversity and identify key loci selected for cultivation in North and South America. Sequence diversity of diploid and tetraploid S. tuberosum exceeded any crop resequencing study to date, in part due to expanded wild introgressions following polyploidy that captured alleles outside of their geographic origin. We identified 2,622 genes as under selection, with only 14–16% shared by North American and Andean cultivars, showing that a limited gene set drove early improvement of cultivated potato, while adaptation of upland (S. tuberosum group Andigena) and lowland (S. tuberosum groups Chilotanum and Tuberosum) populations targeted distinct loci. Signatures of selection were uncovered in genes controlling carbohydrate metabolism, glycoalkaloid biosynthesis, the shikimate pathway, the cell cycle, and circadian rhythm. Reduced sexual fertility that accompanied the shift to asexual reproduction in cultivars was reflected by signatures of selection in genes regulating pollen development/gametogenesis. Exploration of haplotype diversity at potato’s maturity locus (StCDF1) revealed introgression of truncated alleles from wild species, particularly S. microdontum in long-day–adapted cultivars. This study uncovers a historic role of wild Solanum species in the diversification of long-day–adapted tetraploid potatoes, showing that extant natural populations represent an essential source of untapped adaptive potential. PMID:29087343
A computational approach to candidate gene prioritization for X-linked mental retardation using annotation-based binary filtering and motif-based linear discriminatory analysis

PubMed Central

2011-01-01

Background Several computational candidate gene selection and prioritization methods have recently been developed. These in silico selection and prioritization techniques are usually based on two central approaches - the examination of similarities to known disease genes and/or the evaluation of functional annotation of genes. Each of these approaches has its own caveats. Here we employ a previously described method of candidate gene prioritization based mainly on gene annotation, in accompaniment with a technique based on the evaluation of pertinent sequence motifs or signatures, in an attempt to refine the gene prioritization approach. We apply this approach to X-linked mental retardation (XLMR), a group of heterogeneous disorders for which some of the underlying genetics is known. Results The gene annotation-based binary filtering method yielded a ranked list of putative XLMR candidate genes with good plausibility of being associated with the development of mental retardation. In parallel, a motif finding approach based on linear discriminatory analysis (LDA) was employed to identify short sequence patterns that may discriminate XLMR from non-XLMR genes. High rates (>80%) of correct classification was achieved, suggesting that the identification of these motifs effectively captures genomic signals associated with XLMR vs. non-XLMR genes. The computational tools developed for the motif-based LDA is integrated into the freely available genomic analysis portal Galaxy (http://main.g2.bx.psu.edu/). Nine genes (APLN, ZC4H2, MAGED4, MAGED4B, RAP2C, FAM156A, FAM156B, TBL1X, and UXT) were highlighted as highly-ranked XLMR methods. Conclusions The combination of gene annotation information and sequence motif-orientated computational candidate gene prediction methods highlight an added benefit in generating a list of plausible candidate genes, as has been demonstrated for XLMR. Reviewers: This article was reviewed by Dr Barbara Bardoni (nominated by Prof Juergen Brosius); Prof Neil Smalheiser and Dr Dustin Holloway (nominated by Prof Charles DeLisi). PMID:21668950
The CanOE strategy: integrating genomic and metabolic contexts across multiple prokaryote genomes to find candidate genes for orphan enzymes.

PubMed

Smith, Adam Alexander Thil; Belda, Eugeni; Viari, Alain; Medigue, Claudine; Vallenet, David

2012-05-01

Of all biochemically characterized metabolic reactions formalized by the IUBMB, over one out of four have yet to be associated with a nucleic or protein sequence, i.e. are sequence-orphan enzymatic activities. Few bioinformatics annotation tools are able to propose candidate genes for such activities by exploiting context-dependent rather than sequence-dependent data, and none are readily accessible and propose result integration across multiple genomes. Here, we present CanOE (Candidate genes for Orphan Enzymes), a four-step bioinformatics strategy that proposes ranked candidate genes for sequence-orphan enzymatic activities (or orphan enzymes for short). The first step locates "genomic metabolons", i.e. groups of co-localized genes coding proteins catalyzing reactions linked by shared metabolites, in one genome at a time. These metabolons can be particularly helpful for aiding bioanalysts to visualize relevant metabolic data. In the second step, they are used to generate candidate associations between un-annotated genes and gene-less reactions. The third step integrates these gene-reaction associations over several genomes using gene families, and summarizes the strength of family-reaction associations by several scores. In the final step, these scores are used to rank members of gene families which are proposed for metabolic reactions. These associations are of particular interest when the metabolic reaction is a sequence-orphan enzymatic activity. Our strategy found over 60,000 genomic metabolons in more than 1,000 prokaryote organisms from the MicroScope platform, generating candidate genes for many metabolic reactions, of which more than 70 distinct orphan reactions. A computational validation of the approach is discussed. Finally, we present a case study on the anaerobic allantoin degradation pathway in Escherichia coli K-12.

Transcriptome analysis of Brassica napus pod using RNA-Seq and identification of lipid-related candidate genes.

PubMed

Xu, Hai-Ming; Kong, Xiang-Dong; Chen, Fei; Huang, Ji-Xiang; Lou, Xiang-Yang; Zhao, Jian-Yi

2015-10-24

Brassica napus is an important oilseed crop. Dissection of the genetic architecture underlying oil-related biological processes will greatly facilitates the genetic improvement of rapeseed. The differential gene expression during pod development offers a snapshot on the genes responsible for oil accumulation in. To identify candidate genes in the linkage peaks reported previously, we used RNA sequencing (RNA-Seq) technology to analyze the pod transcriptomes of German cultivar Sollux and Chinese inbred line Gaoyou. The RNA samples were collected for RNA-Seq at 5-7, 15-17 and 25-27 days after flowering (DAF). Bioinformatics analysis was performed to investigate differentially expressed genes (DEGs). Gene annotation analysis was integrated with QTL mapping and Brassica napus pod transcriptome profiling to detect potential candidate genes in oilseed. Four hundred sixty five and two thousand, one hundred fourteen candidate DEGs were identified, respectively, between two varieties at the same stages and across different periods of each variety. Then, 33 DEGs between Sollux and Gaoyou were identified as the candidate genes affecting seed oil content by combining those DEGs with the quantitative trait locus (QTL) mapping results, of which, one was found to be homologous to Arabidopsis thaliana lipid-related genes. Intervarietal DEGs of lipid pathways in QTL regions represent important candidate genes for oil-related traits. Integrated analysis of transcriptome profiling, QTL mapping and comparative genomics with other relative species leads to efficient identification of most plausible functional genes underlying oil-content related characters, offering valuable resources for bettering breeding program of Brassica napus. This study provided a comprehensive overview on the pod transcriptomes of two varieties with different oil-contents at the three developmental stages.
Prediction of gene-phenotype associations in humans, mice, and plants using phenologs.

PubMed

Woods, John O; Singh-Blom, Ulf Martin; Laurent, Jon M; McGary, Kriston L; Marcotte, Edward M

2013-06-21

Phenotypes and diseases may be related to seemingly dissimilar phenotypes in other species by means of the orthology of underlying genes. Such "orthologous phenotypes," or "phenologs," are examples of deep homology, and may be used to predict additional candidate disease genes. In this work, we develop an unsupervised algorithm for ranking phenolog-based candidate disease genes through the integration of predictions from the k nearest neighbor phenologs, comparing classifiers and weighting functions by cross-validation. We also improve upon the original method by extending the theory to paralogous phenotypes. Our algorithm makes use of additional phenotype data--from chicken, zebrafish, and E. coli, as well as new datasets for C. elegans--establishing that several types of annotations may be treated as phenotypes. We demonstrate the use of our algorithm to predict novel candidate genes for human atrial fibrillation (such as HRH2, ATP4A, ATP4B, and HOPX) and epilepsy (e.g., PAX6 and NKX2-1). We suggest gene candidates for pharmacologically-induced seizures in mouse, solely based on orthologous phenotypes from E. coli. We also explore the prediction of plant gene-phenotype associations, as for the Arabidopsis response to vernalization phenotype. We are able to rank gene predictions for a significant portion of the diseases in the Online Mendelian Inheritance in Man database. Additionally, our method suggests candidate genes for mammalian seizures based only on bacterial phenotypes and gene orthology. We demonstrate that phenotype information may come from diverse sources, including drug sensitivities, gene ontology biological processes, and in situ hybridization annotations. Finally, we offer testable candidates for a variety of human diseases, plant traits, and other classes of phenotypes across a wide array of species.
Endeavour update: a web resource for gene prioritization in multiple species

PubMed Central

Tranchevent, Léon-Charles; Barriot, Roland; Yu, Shi; Van Vooren, Steven; Van Loo, Peter; Coessens, Bert; De Moor, Bart; Aerts, Stein; Moreau, Yves

2008-01-01

Endeavour (http://www.esat.kuleuven.be/endeavourweb; this web site is free and open to all users and there is no login requirement) is a web resource for the prioritization of candidate genes. Using a training set of genes known to be involved in a biological process of interest, our approach consists of (i) inferring several models (based on various genomic data sources), (ii) applying each model to the candidate genes to rank those candidates against the profile of the known genes and (iii) merging the several rankings into a global ranking of the candidate genes. In the present article, we describe the latest developments of Endeavour. First, we provide a web-based user interface, besides our Java client, to make Endeavour more universally accessible. Second, we support multiple species: in addition to Homo sapiens, we now provide gene prioritization for three major model organisms: Mus musculus, Rattus norvegicus and Caenorhabditis elegans. Third, Endeavour makes use of additional data sources and is now including numerous databases: ontologies and annotations, protein–protein interactions, cis-regulatory information, gene expression data sets, sequence information and text-mining data. We tested the novel version of Endeavour on 32 recent disease gene associations from the literature. Additionally, we describe a number of recent independent studies that made use of Endeavour to prioritize candidate genes for obesity and Type II diabetes, cleft lip and cleft palate, and pulmonary fibrosis. PMID:18508807
Identification of downy mildew resistance gene candidates by positional cloning in maize (Zea mays subsp. mays; Poaceae)1

PubMed Central

Kim, Jae Yoon; Moon, Jun-Cheol; Kim, Hyo Chul; Shin, Seungho; Song, Kitae; Kim, Kyung-Hee; Lee, Byung-Moo

2017-01-01

Premise of the study: Positional cloning in combination with phenotyping is a general approach to identify disease-resistance gene candidates in plants; however, it requires several time-consuming steps including population or fine mapping. Therefore, in the present study, we suggest a new combined strategy to improve the identification of disease-resistance gene candidates. Methods and Results: Downy mildew (DM)–resistant maize was selected from five cultivars using a spreader row technique. Positional cloning and bioinformatics tools were used to identify the DM-resistance quantitative trait locus marker (bnlg1702) and 47 protein-coding gene annotations. Eventually, five DM-resistance gene candidates, including bZIP34, Bak1, and Ppr, were identified by quantitative reverse-transcription PCR (RT-PCR) without fine mapping of the bnlg1702 locus. Conclusions: The combined protocol with the spreader row technique, quantitative trait locus positional cloning, and quantitative RT-PCR was effective for identifying DM-resistance candidate genes. This cloning approach may be applied to other whole-genome-sequenced crops or resistance to other diseases. PMID:28224059
LOD score exclusion analyses for candidate genes using random population samples.

PubMed

Deng, H W; Li, J; Recker, R R

2001-05-01

While extensive analyses have been conducted to test for, no formal analyses have been conducted to test against, the importance of candidate genes with random population samples. We develop a LOD score approach for exclusion analyses of candidate genes with random population samples. Under this approach, specific genetic effects and inheritance models at candidate genes can be analysed and if a LOD score is < or = - 2.0, the locus can be excluded from having an effect larger than that specified. Computer simulations show that, with sample sizes often employed in association studies, this approach has high power to exclude a gene from having moderate genetic effects. In contrast to regular association analyses, population admixture will not affect the robustness of our analyses; in fact, it renders our analyses more conservative and thus any significant exclusion result is robust. Our exclusion analysis complements association analysis for candidate genes in random population samples and is parallel to the exclusion mapping analyses that may be conducted in linkage analyses with pedigrees or relative pairs. The usefulness of the approach is demonstrated by an application to test the importance of vitamin D receptor and estrogen receptor genes underlying the differential risk to osteoporotic fractures.
Identification of additive, dominant, and epistatic variation conferred by key genes in cellulose biosynthesis pathway in Populus tomentosa†

PubMed Central

Du, Qingzhang; Tian, Jiaxing; Yang, Xiaohui; Pan, Wei; Xu, Baohua; Li, Bailian; Ingvarsson, Pär K.; Zhang, Deqiang

2015-01-01

Economically important traits in many species generally show polygenic, quantitative inheritance. The components of genetic variation (additive, dominant and epistatic effects) of these traits conferred by multiple genes in shared biological pathways remain to be defined. Here, we investigated 11 full-length genes in cellulose biosynthesis, on 10 growth and wood-property traits, within a population of 460 unrelated Populus tomentosa individuals, via multi-gene association. To validate positive associations, we conducted single-marker analysis in a linkage population of 1,200 individuals. We identified 118, 121, and 43 associations (P< 0.01) corresponding to additive, dominant, and epistatic effects, respectively, with low to moderate proportions of phenotypic variance (R2). Epistatic interaction models uncovered a combination of three non-synonymous sites from three unique genes, representing a significant epistasis for diameter at breast height and stem volume. Single-marker analysis validated 61 associations (false discovery rate, Q ≤ 0.10), representing 38 SNPs from nine genes, and its average effect (R2 = 3.8%) nearly 2-fold higher than that identified with multi-gene association, suggesting that multi-gene association can capture smaller individual variants. Moreover, a structural gene–gene network based on tissue-specific transcript abundances provides a better understanding of the multi-gene pathway affecting tree growth and lignocellulose biosynthesis. Our study highlights the importance of pathway-based multiple gene associations to uncover the nature of genetic variance for quantitative traits and may drive novel progress in molecular breeding. PMID:25428896
Meta-analysis of loci associated with age at natural menopause in African-American women

PubMed Central

Chen, Christina T.L.; Liu, Ching-Ti; Chen, Gary K.; Andrews, Jeanette S.; Arnold, Alice M.; Dreyfus, Jill; Franceschini, Nora; Garcia, Melissa E.; Kerr, Kathleen F.; Li, Guo; Lohman, Kurt K.; Musani, Solomon K.; Nalls, Michael A.; Raffel, Leslie J.; Smith, Jennifer; Ambrosone, Christine B.; Bandera, Elisa V.; Bernstein, Leslie; Britton, Angela; Brzyski, Robert G.; Cappola, Anne; Carlson, Christopher S.; Couper, David; Deming, Sandra L.; Goodarzi, Mark O.; Heiss, Gerardo; John, Esther M.; Lu, Xiaoning; Le Marchand, Loic; Marciante, Kristin; Mcknight, Barbara; Millikan, Robert; Nock, Nora L.; Olshan, Andrew F.; Press, Michael F.; Vaiyda, Dhananjay; Woods, Nancy F.; Taylor, Herman A.; Zhao, Wei; Zheng, Wei; Evans, Michele K.; Harris, Tamara B.; Henderson, Brian E.; Kardia, Sharon L.R.; Kooperberg, Charles; Liu, Yongmei; Mosley, Thomas H.; Psaty, Bruce; Wellons, Melissa; Windham, Beverly G.; Zonderman, Alan B.; Cupples, L. Adrienne; Demerath, Ellen W.; Haiman, Christopher; Murabito, Joanne M.; Rajkovic, Aleksandar

2014-01-01

Age at menopause marks the end of a woman's reproductive life and its timing associates with risks for cancer, cardiovascular and bone disorders. GWAS and candidate gene studies conducted in women of European ancestry have identified 27 loci associated with age at menopause. The relevance of these loci to women of African ancestry has not been previously studied. We therefore sought to uncover additional menopause loci and investigate the relevance of European menopause loci by performing a GWAS meta-analysis in 6510 women with African ancestry derived from 11 studies across the USA. We did not identify any additional loci significantly associated with age at menopause in African Americans. We replicated the associations between six loci and age at menopause (P-value < 0.05): AMHR2, RHBLD2, PRIM1, HK3/UMC1, BRSK1/TMEM150B and MCM8. In addition, associations of 14 loci are directionally consistent with previous reports. We provide evidence that genetic variants influencing reproductive traits identified in European populations are also important in women of African ancestry residing in USA. PMID:24493794
Iris pigmentation as a quantitative trait: variation in populations of European, East Asian and South Asian ancestry and association with candidate gene polymorphisms.

PubMed

Edwards, Melissa; Cha, David; Krithika, S; Johnson, Monique; Cook, Gillian; Parra, Esteban J

2016-03-01

In this study, we present a new quantitative method to measure iris colour based on high-resolution photographs. We applied this method to analyse iris colour variation in a sample of individuals of East Asian, European and South Asian ancestry. We show that measuring iris colour using the coordinates of the CIELAB colour space uncovers a significant amount of variation that is not captured using conventional categorical classifications, such as 'brown', 'blue' or 'green'. We tested the association of a selected panel of polymorphisms with iris colour in each population group. Six markers showed significant associations with iris colour in the European sample, three in the South Asian sample and two in the East Asian sample. We also observed that the marker HERC2 rs12913832, which is the main determinant of 'blue' versus 'brown' iris colour in European populations, is also significantly associated with central heterochromia in the European sample. © 2015 The Authors. Pigment Cell & Melanoma Research Published by John Wiley & Sons Ltd.
Identification of Staphylococcal Nuclease Domain-containing 1 (SND1) as a Metadherin-interacting Protein with Metastasis-promoting Functions*

PubMed Central

Blanco, Mario Andres; Alečković, Maša; Hua, Yuling; Li, Tuo; Wei, Yong; Xu, Zhen; Cristea, Ileana M.; Kang, Yibin

2011-01-01

Metastasis is the deadliest and most poorly understood feature of malignant diseases. Recent work has shown that Metadherin (MTDH) is overexpressed in over 40% of breast cancer patients and promotes metastasis and chemoresistance in experimental models of breast cancer progression. Here we applied mass spectrometry-based screen to identify staphylococcal nuclease domain-containing 1 (SND1) as a candidate MTDH-interacting protein. After confirming the interaction between SND1 and MTDH, we tested the role of SND1 in breast cancer and found that it strongly promotes lung metastasis. SND1 was further shown to promote resistance to apoptosis and to regulate the expression of genes associated with metastasis and chemoresistance. Analyses of breast cancer clinical microarray data indicated that high expression of SND1 in primary tumors is strongly associated with reduced metastasis-free survival in multiple large scale data sets. Thus, we have uncovered SND1 as a novel MTDH-interacting protein and shown that it is a functionally and clinically significant mediator of metastasis. PMID:21478147
Phenoscape: Identifying Candidate Genes for Evolutionary Phenotypes

PubMed Central

Edmunds, Richard C.; Su, Baofeng; Balhoff, James P.; Eames, B. Frank; Dahdul, Wasila M.; Lapp, Hilmar; Lundberg, John G.; Vision, Todd J.; Dunham, Rex A.; Mabee, Paula M.; Westerfield, Monte

2016-01-01

Phenotypes resulting from mutations in genetic model organisms can help reveal candidate genes for evolutionarily important phenotypic changes in related taxa. Although testing candidate gene hypotheses experimentally in nonmodel organisms is typically difficult, ontology-driven information systems can help generate testable hypotheses about developmental processes in experimentally tractable organisms. Here, we tested candidate gene hypotheses suggested by expert use of the Phenoscape Knowledgebase, specifically looking for genes that are candidates responsible for evolutionarily interesting phenotypes in the ostariophysan fishes that bear resemblance to mutant phenotypes in zebrafish. For this, we searched ZFIN for genetic perturbations that result in either loss of basihyal element or loss of scales phenotypes, because these are the ancestral phenotypes observed in catfishes (Siluriformes). We tested the identified candidate genes by examining their endogenous expression patterns in the channel catfish, Ictalurus punctatus. The experimental results were consistent with the hypotheses that these features evolved through disruption in developmental pathways at, or upstream of, brpf1 and eda/edar for the ancestral losses of basihyal element and scales, respectively. These results demonstrate that ontological annotations of the phenotypic effects of genetic alterations in model organisms, when aggregated within a knowledgebase, can be used effectively to generate testable, and useful, hypotheses about evolutionary changes in morphology. PMID:26500251
Walking the interactome for candidate prioritization in exome sequencing studies of Mendelian diseases

DOE PAGES

Smedley, Damian; Kohler, Sebastian; Czeschik, Johanna Christina; ...

2014-07-30

Here, whole-exome sequencing (WES) has opened up previously unheard of possibilities for identifying novel disease genes in Mendelian disorders, only about half of which have been elucidated to date. However, interpretation of WES data remains challenging. As a result, we analyze protein–protein association (PPA) networks to identify candidate genes in the vicinity of genes previously implicated in a disease. The analysis, using a random-walk with restart (RWR) method, is adapted to the setting of WES by developing a composite variant-gene relevance score based on the rarity, location and predicted pathogenicity of variants and the RWR evaluation of genes harboring themore » variants. Benchmarking using known disease variants from 88 disease-gene families reveals that the correct gene is ranked among the top 10 candidates in ≥50% of cases, a figure which we confirmed using a prospective study of disease genes identified in 2012 and PPA data produced before that date. In conclusion, we implement our method in a freely available Web server, ExomeWalker, that displays a ranked list of candidates together with information on PPAs, frequency and predicted pathogenicity of the variants to allow quick and effective searches for candidates that are likely to reward closer investigation.« less
Integration of QTL and bioinformatic tools to identify candidate genes for triglycerides in mice[S

PubMed Central

Leduc, Magalie S.; Hageman, Rachael S.; Verdugo, Ricardo A.; Tsaih, Shirng-Wern; Walsh, Kenneth; Churchill, Gary A.; Paigen, Beverly

2011-01-01

To identify genetic loci influencing lipid levels, we performed quantitative trait loci (QTL) analysis between inbred mouse strains MRL/MpJ and SM/J, measuring triglyceride levels at 8 weeks of age in F2 mice fed a chow diet. We identified one significant QTL on chromosome (Chr) 15 and three suggestive QTL on Chrs 2, 7, and 17. We also carried out microarray analysis on the livers of parental strains of 282 F2 mice and used these data to find cis-regulated expression QTL. We then narrowed the list of candidate genes under significant QTL using a “toolbox” of bioinformatic resources, including haplotype analysis; parental strain comparison for gene expression differences and nonsynonymous coding single nucleotide polymorphisms (SNP); cis-regulated eQTL in livers of F2 mice; correlation between gene expression and phenotype; and conditioning of expression on the phenotype. We suggest Slc25a7 as a candidate gene for the Chr 7 QTL and, based on expression differences, five genes (Polr3 h, Cyp2d22, Cyp2d26, Tspo, and Ttll12) as candidate genes for Chr 15 QTL. This study shows how bioinformatics can be used effectively to reduce candidate gene lists for QTL related to complex traits. PMID:21622629
Walking the interactome for candidate prioritization in exome sequencing studies of Mendelian diseases

DOE Office of Scientific and Technical Information (OSTI.GOV)

Smedley, Damian; Kohler, Sebastian; Czeschik, Johanna Christina

Here, whole-exome sequencing (WES) has opened up previously unheard of possibilities for identifying novel disease genes in Mendelian disorders, only about half of which have been elucidated to date. However, interpretation of WES data remains challenging. As a result, we analyze protein–protein association (PPA) networks to identify candidate genes in the vicinity of genes previously implicated in a disease. The analysis, using a random-walk with restart (RWR) method, is adapted to the setting of WES by developing a composite variant-gene relevance score based on the rarity, location and predicted pathogenicity of variants and the RWR evaluation of genes harboring themore » variants. Benchmarking using known disease variants from 88 disease-gene families reveals that the correct gene is ranked among the top 10 candidates in ≥50% of cases, a figure which we confirmed using a prospective study of disease genes identified in 2012 and PPA data produced before that date. In conclusion, we implement our method in a freely available Web server, ExomeWalker, that displays a ranked list of candidates together with information on PPAs, frequency and predicted pathogenicity of the variants to allow quick and effective searches for candidates that are likely to reward closer investigation.« less
Identification and characterization of moonlighting long non-coding RNAs based on RNA and protein interactome.

PubMed

Cheng, Lixin; Leung, Kwong-Sak

2018-05-16

Moonlighting proteins are a class of proteins having multiple distinct functions, which play essential roles in a variety of cellular and enzymatic functioning systems. Although there have long been calls for computational algorithms for the identification of moonlighting proteins, research on approaches to identify moonlighting long non-coding RNAs (lncRNAs) has never been undertaken. Here, we introduce a novel methodology, MoonFinder, for the identification of moonlighting lncRNAs. MoonFinder is a statistical algorithm identifying moonlighting lncRNAs without a priori knowledge through the integration of protein interactome, RNA-protein interactions, and functional annotation of proteins. We identify 155 moonlighting lncRNA candidates and uncover that they are a distinct class of lncRNAs characterized by specific sequence and cellular localization features. The non-coding genes that transcript moonlighting lncRNAs tend to have shorter but more exons and the moonlighting lncRNAs have a variable localization pattern with a high chance of residing in the cytoplasmic compartment in comparison to the other lncRNAs. Moreover, moonlighting lncRNAs and moonlighting proteins are rather mutually exclusive in terms of both their direct interactions and interacting partners. Our results also shed light on how the moonlighting candidates and their interacting proteins implicated in the formation and development of cancers and other diseases. The code implementing MoonFinder is supplied as an R package in the supplementary material. lxcheng@cse.cuhk.edu.hk or ksleung@cse.cuhk.edu.hk. Supplementary data are available at Bioinformatics online.
Selective Ablation of Tumor Suppressors in Parafollicular C Cells Elicits Medullary Thyroid Carcinoma.

PubMed

Song, Hai; Lin, Chuwen; Yao, Erica; Zhang, Kuan; Li, Xiaoling; Wu, Qingzhe; Chuang, Pao-Tien

2017-03-03

Among the four different types of thyroid cancer, treatment of medullary thyroid carcinoma poses a major challenge because of its propensity of early metastasis. To further investigate the molecular mechanisms of medullary thyroid carcinoma and discover candidates for targeted therapies, we developed a new mouse model of medullary thyroid carcinoma based on our CGRP CreER mouse line. This system enables gene manipulation in parafollicular C cells in the thyroid, the purported cells of origin of medullary thyroid carcinoma. Selective inactivation of tumor suppressors, such as p53 , Rb , and Pten , in mature parafollicular C cells via an inducible Cre recombinase from CGRP CreER led to development of murine medullary thyroid carcinoma. Loss of Pten accelerated p53 / Rb -induced medullary thyroid carcinoma, indicating interactions between pathways controlled by tumor suppressors. Moreover, labeling differentiated parafollicular C cells by CGRP CreER allows us to follow their fate during malignant transformation to medullary thyroid tumor. Our findings support a model in which mutational events in differentiated parafollicular C cells result in medullary thyroid carcinoma. Through expression analysis including RNA-Seq, we uncovered major signaling pathways and networks that are perturbed following the removal of tumor suppressors. Taken together, these studies not only increase our molecular understanding of medullary thyroid carcinoma but also offer new candidates for designing targeted therapies or other treatment modalities. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles.

PubMed

Zhu, Jie; Qin, Yufang; Liu, Taigang; Wang, Jun; Zheng, Xiaoqi

2013-01-01

Identification of gene-phenotype relationships is a fundamental challenge in human health clinic. Based on the observation that genes causing the same or similar phenotypes tend to correlate with each other in the protein-protein interaction network, a lot of network-based approaches were proposed based on different underlying models. A recent comparative study showed that diffusion-based methods achieve the state-of-the-art predictive performance. In this paper, a new diffusion-based method was proposed to prioritize candidate disease genes. Diffusion profile of a disease was defined as the stationary distribution of candidate genes given a random walk with restart where similarities between phenotypes are incorporated. Then, candidate disease genes are prioritized by comparing their diffusion profiles with that of the disease. Finally, the effectiveness of our method was demonstrated through the leave-one-out cross-validation against control genes from artificial linkage intervals and randomly chosen genes. Comparative study showed that our method achieves improved performance compared to some classical diffusion-based methods. To further illustrate our method, we used our algorithm to predict new causing genes of 16 multifactorial diseases including Prostate cancer and Alzheimer's disease, and the top predictions were in good consistent with literature reports. Our study indicates that integration of multiple information sources, especially the phenotype similarity profile data, and introduction of global similarity measure between disease and gene diffusion profiles are helpful for prioritizing candidate disease genes. Programs and data are available upon request.
The genetic interacting landscape of 63 candidate genes in Major Depressive Disorder: an explorative study.

PubMed

Lekman, Magnus; Hössjer, Ola; Andrews, Peter; Källberg, Henrik; Uvehag, Daniel; Charney, Dennis; Manji, Husseini; Rush, John A; McMahon, Francis J; Moore, Jason H; Kockum, Ingrid

2014-01-01

Genetic contributions to major depressive disorder (MDD) are thought to result from multiple genes interacting with each other. Different procedures have been proposed to detect such interactions. Which approach is best for explaining the risk of developing disease is unclear. This study sought to elucidate the genetic interaction landscape in candidate genes for MDD by conducting a SNP-SNP interaction analysis using an exhaustive search through 3,704 SNP-markers in 1,732 cases and 1,783 controls provided from the GAIN MDD study. We used three different methods to detect interactions, two logistic regressions models (multiplicative and additive) and one data mining and machine learning (MDR) approach. Although none of the interaction survived correction for multiple comparisons, the results provide important information for future genetic interaction studies in complex disorders. Among the 0.5% most significant observations, none had been reported previously for risk to MDD. Within this group of interactions, less than 0.03% would have been detectable based on main effect approach or an a priori algorithm. We evaluated correlations among the three different models and conclude that all three algorithms detected the same interactions to a low degree. Although the top interactions had a surprisingly large effect size for MDD (e.g. additive dominant model Puncorrected = 9.10E-9 with attributable proportion (AP) value = 0.58 and multiplicative recessive model with Puncorrected = 6.95E-5 with odds ratio (OR estimated from β3) value = 4.99) the area under the curve (AUC) estimates were low (< 0.54). Moreover, the population attributable fraction (PAF) estimates were also low (< 0.15). We conclude that the top interactions on their own did not explain much of the genetic variance of MDD. The different statistical interaction methods we used in the present study did not identify the same pairs of interacting markers. Genetic interaction studies may uncover previously unsuspected effects that could provide novel insights into MDD risk, but much larger sample sizes are needed before this strategy can be powerfully applied.
A genome-wide association study of seed composition traits in wild soybean (Glycine soja).

PubMed

Leamy, Larry J; Zhang, Hengyou; Li, Changbao; Chen, Charles Y; Song, Bao-Hua

2017-01-05

Cultivated soybean (Glycine max) is a major agricultural crop that provides a crucial source of edible protein and oil. Decreased amounts of saturated palmitic acid and increased amounts of unsaturated oleic acid in soybean oil are considered optimal for human cardiovascular health and therefore there has considerable interest by breeders in discovering genes affecting the relative concentrations of these fatty acids. Using a genome-wide association (GWA) approach with nearly 30,000 single nucleotide polymorphisms (SNPs), we investigated the genetic basis of protein, oil and all five fatty acid levels in seeds from a sample of 570 wild soybeans (Glycine soja), the progenitor of domesticated soybean, to identify quantitative trait loci (QTLs) affecting these seed composition traits. We discovered 29 SNPs located on ten different chromosomes that are significantly associated with the seven seed composition traits in our wild soybean sample. Eight SNPs co-localized with QTLs previously uncovered in linkage or association mapping studies conducted with cultivated soybean samples, while the remaining SNPs appeared to be in novel locations. Twenty-four of the SNPs significantly associated with fatty acid variation, with the majority located on chromosomes 14 (6 SNPs) and seven (8 SNPs). Two SNPs were common for two or more fatty acids, suggesting loci with pleiotropic effects. We also identified some candidate genes that are involved in fatty acid metabolism and regulation. For each of the seven traits, most of the SNPs produced differences between the average phenotypic values of the two homozygotes of about one-half standard deviation and contributed over 3% of their total variability. This is the first GWA study conducted on seed composition traits solely in wild soybean populations, and a number of QTLs were found that have not been previously discovered. Some of these may be useful to breeders who select for increased protein/oil content or altered fatty acid ratios in the seeds. The results also provide additional insight into the genetic architecture of these traits in a large sample of wild soybean, and suggest some new candidate genes whose molecular effects on these traits need to be further studied.
In Silico Gene Prioritization by Integrating Multiple Data Sources

PubMed Central

Zhou, Yingyao; Shields, Robert; Chanda, Sumit K.; Elston, Robert C.; Li, Jing

2011-01-01

Identifying disease genes is crucial to the understanding of disease pathogenesis, and to the improvement of disease diagnosis and treatment. In recent years, many researchers have proposed approaches to prioritize candidate genes by considering the relationship of candidate genes and existing known disease genes, reflected in other data sources. In this paper, we propose an expandable framework for gene prioritization that can integrate multiple heterogeneous data sources by taking advantage of a unified graphic representation. Gene-gene relationships and gene-disease relationships are then defined based on the overall topology of each network using a diffusion kernel measure. These relationship measures are in turn normalized to derive an overall measure across all networks, which is utilized to rank all candidate genes. Based on the informativeness of available data sources with respect to each specific disease, we also propose an adaptive threshold score to select a small subset of candidate genes for further validation studies. We performed large scale cross-validation analysis on 110 disease families using three data sources. Results have shown that our approach consistently outperforms other two state of the art programs. A case study using Parkinson disease (PD) has identified four candidate genes (UBB, SEPT5, GPR37 and TH) that ranked higher than our adaptive threshold, all of which are involved in the PD pathway. In particular, a very recent study has observed a deletion of TH in a patient with PD, which supports the importance of the TH gene in PD pathogenesis. A web tool has been implemented to assist scientists in their genetic studies. PMID:21731658
Detection of genomic structural variations in Guizhou indigenous pigs and the comparison with other breeds

PubMed Central

Ran, Xueqin; Wang, Jiafu; Li, Sheng; Liu, Jianfeng

2018-01-01

Genomic structural variation (SV) is noticed for the contribution to genetic diversity and phenotypic changes. Guizhou indigenous pig (GZP) has been raised for hundreds of years with many special characteristics. The present paper aimed to uncover the influence of SV on gene polymorphism and the genetic mechanisms of phenotypic traits for GZP. Eighteen GZPs were chosen for resequencing by Illumina sequencing platform. The confident SVs of GZP were called out by both programs of pindel and softSV simultaneously and compared with the SVs deduced from the genomic data of European pig (EUP) and the native pig outside of Guizhou, China (NPOG). A total of 39,166 SVs were detected and covered 27.37 Mb of pig genome. All of 76 SVs were confirmed in GZP pig population by PCR method. The SVs numbers in NPOG and GZP were about 1.8 to 1.9 times higher than that in EUP. And a SV hotspot was found out from the 20 Mb of chromosome X of GZP, which harbored 29 genes and focused on histone modification. More than half of SVs was positioned in the intergenic regions and about one third of SVs in the introns of genes. And we found that SVs tended to locate in genes produced multi-transcripts, in which a positive correlation was found out between the numbers of SV and the gene transcripts. It illustrated that the primary mode of SVs might function on the regulation of gene expression or the transcripts splicing process. A total of 1,628 protein-coding genes were disturbed by 1,956 SVs specific in GZP, in which 93 GZP-specific SV-related genes would lose their functions due to the SV interference and gathered in reproduction ability. Interestingly, the 1,628 protein-coding genes were mainly enriched in estrogen receptor binding, steroid hormone receptor binding, retinoic acid receptor binding, oxytocin signaling pathway, mTOR signaling pathway, axon guidance and cholinergic synapse pathways. It suggested that SV might be a reason for the strong adaptability and low fecundity of GZP, and 51 candidate genes would be useful for the configuration phenotype in Xiang pig breed. PMID:29558483

Detection of genomic structural variations in Guizhou indigenous pigs and the comparison with other breeds.

PubMed

Liu, Chang; Ran, Xueqin; Wang, Jiafu; Li, Sheng; Liu, Jianfeng

2018-01-01

Genomic structural variation (SV) is noticed for the contribution to genetic diversity and phenotypic changes. Guizhou indigenous pig (GZP) has been raised for hundreds of years with many special characteristics. The present paper aimed to uncover the influence of SV on gene polymorphism and the genetic mechanisms of phenotypic traits for GZP. Eighteen GZPs were chosen for resequencing by Illumina sequencing platform. The confident SVs of GZP were called out by both programs of pindel and softSV simultaneously and compared with the SVs deduced from the genomic data of European pig (EUP) and the native pig outside of Guizhou, China (NPOG). A total of 39,166 SVs were detected and covered 27.37 Mb of pig genome. All of 76 SVs were confirmed in GZP pig population by PCR method. The SVs numbers in NPOG and GZP were about 1.8 to 1.9 times higher than that in EUP. And a SV hotspot was found out from the 20 Mb of chromosome X of GZP, which harbored 29 genes and focused on histone modification. More than half of SVs was positioned in the intergenic regions and about one third of SVs in the introns of genes. And we found that SVs tended to locate in genes produced multi-transcripts, in which a positive correlation was found out between the numbers of SV and the gene transcripts. It illustrated that the primary mode of SVs might function on the regulation of gene expression or the transcripts splicing process. A total of 1,628 protein-coding genes were disturbed by 1,956 SVs specific in GZP, in which 93 GZP-specific SV-related genes would lose their functions due to the SV interference and gathered in reproduction ability. Interestingly, the 1,628 protein-coding genes were mainly enriched in estrogen receptor binding, steroid hormone receptor binding, retinoic acid receptor binding, oxytocin signaling pathway, mTOR signaling pathway, axon guidance and cholinergic synapse pathways. It suggested that SV might be a reason for the strong adaptability and low fecundity of GZP, and 51 candidate genes would be useful for the configuration phenotype in Xiang pig breed.
The Genetic Basis for Variation in Sensitivity to Lead Toxicity in Drosophila melanogaster.

PubMed

Zhou, Shanshan; Morozova, Tatiana V; Hussain, Yasmeen N; Luoma, Sarah E; McCoy, Lenovia; Yamamoto, Akihiko; Mackay, Trudy F C; Anholt, Robert R H

2016-07-01

Lead toxicity presents a worldwide health problem, especially due to its adverse effects on cognitive development in children. However, identifying genes that give rise to individual variation in susceptibility to lead toxicity is challenging in human populations. Our goal was to use Drosophila melanogaster to identify evolutionarily conserved candidate genes associated with individual variation in susceptibility to lead exposure. To identify candidate genes associated with variation in susceptibility to lead toxicity, we measured effects of lead exposure on development time, viability and adult activity in the Drosophila melanogaster Genetic Reference Panel (DGRP) and performed genome-wide association analyses to identify candidate genes. We used mutants to assess functional causality of candidate genes and constructed a genetic network associated with variation in sensitivity to lead exposure, on which we could superimpose human orthologs. We found substantial heritabilities for all three traits and identified candidate genes associated with variation in susceptibility to lead exposure for each phenotype. The genetic architectures that determine variation in sensitivity to lead exposure are highly polygenic. Gene ontology and network analyses showed enrichment of genes associated with early development and function of the nervous system. Drosophila melanogaster presents an advantageous model to study the genetic underpinnings of variation in susceptibility to lead toxicity. Evolutionary conservation of cellular pathways that respond to toxic exposure allows predictions regarding orthologous genes and pathways across phyla. Thus, studies in the D. melanogaster model system can identify candidate susceptibility genes to guide subsequent studies in human populations. Zhou S, Morozova TV, Hussain YN, Luoma SE, McCoy L, Yamamoto A, Mackay TF, Anholt RR. 2016. The genetic basis for variation in sensitivity to lead toxicity in Drosophila melanogaster. Environ Health Perspect 124:1062-1070; http://dx.doi.org/10.1289/ehp.1510513.
Replication of type 2 diabetes candidate genes variations in three geographically unrelated Indian population groups.

PubMed

Ali, Shafat; Chopra, Rupali; Manvati, Siddharth; Singh, Yoginder Pal; Kaul, Nabodita; Behura, Anita; Mahajan, Ankit; Sehajpal, Prabodh; Gupta, Subash; Dhar, Manoj K; Chainy, Gagan B N; Bhanwer, Amarjit S; Sharma, Swarkar; Bamezai, Rameshwar N K

2013-01-01

Type 2 diabetes (T2D) is a syndrome of multiple metabolic disorders and is genetically heterogeneous. India comprises one of the largest global populations with highest number of reported type 2 diabetes cases. However, limited information about T2D associated loci is available for Indian populations. It is, therefore, pertinent to evaluate the previously associated candidates as well as identify novel genetic variations in Indian populations to understand the extent of genetic heterogeneity. We chose to do a cost effective high-throughput mass-array genotyping and studied the candidate gene variations associated with T2D in literature. In this case-control candidate genes association study, 91 SNPs from 55 candidate genes have been analyzed in three geographically independent population groups from India. We report the genetic variants in five candidate genes: TCF7L2, HHEX, ENPP1, IDE and FTO, are significantly associated (after Bonferroni correction, p<5.5E-04) with T2D susceptibility in combined population. Interestingly, SNP rs7903146 of the TCF7L2 gene passed the genome wide significance threshold (combined P value = 2.05E-08) in the studied populations. We also observed the association of rs7903146 with blood glucose (fasting and postprandial) levels, supporting the role of TCF7L2 gene in blood glucose homeostasis. Further, we noted that the moderate risk provided by the independently associated loci in combined population with Odds Ratio (OR)<1.38 increased to OR = 2.44, (95%CI = 1.67-3.59) when the risk providing genotypes of TCF7L2, HHEX, ENPP1 and FTO genes were combined, suggesting the importance of gene-gene interactions evaluation in complex disorders like T2D.
Replication of Type 2 Diabetes Candidate Genes Variations in Three Geographically Unrelated Indian Population Groups

PubMed Central

Ali, Shafat; Chopra, Rupali; Manvati, Siddharth; Mahajan, Ankit; Sehajpal, Prabodh; Gupta, Subash; Dhar, Manoj K.; Chainy, Gagan B. N.; Bhanwer, Amarjit S.; Sharma, Swarkar; Bamezai, Rameshwar N. K.

2013-01-01

Type 2 diabetes (T2D) is a syndrome of multiple metabolic disorders and is genetically heterogeneous. India comprises one of the largest global populations with highest number of reported type 2 diabetes cases. However, limited information about T2D associated loci is available for Indian populations. It is, therefore, pertinent to evaluate the previously associated candidates as well as identify novel genetic variations in Indian populations to understand the extent of genetic heterogeneity. We chose to do a cost effective high-throughput mass-array genotyping and studied the candidate gene variations associated with T2D in literature. In this case-control candidate genes association study, 91 SNPs from 55 candidate genes have been analyzed in three geographically independent population groups from India. We report the genetic variants in five candidate genes: TCF7L2, HHEX, ENPP1, IDE and FTO, are significantly associated (after Bonferroni correction, p<5.5E−04) with T2D susceptibility in combined population. Interestingly, SNP rs7903146 of the TCF7L2 gene passed the genome wide significance threshold (combined P value = 2.05E−08) in the studied populations. We also observed the association of rs7903146 with blood glucose (fasting and postprandial) levels, supporting the role of TCF7L2 gene in blood glucose homeostasis. Further, we noted that the moderate risk provided by the independently associated loci in combined population with Odds Ratio (OR)<1.38 increased to OR = 2.44, (95%CI = 1.67–3.59) when the risk providing genotypes of TCF7L2, HHEX, ENPP1 and FTO genes were combined, suggesting the importance of gene-gene interactions evaluation in complex disorders like T2D. PMID:23527042
Using Osteoclast Differentiation as a Model for Gene Discovery in an Undergraduate Cell Biology Laboratory

ERIC Educational Resources Information Center

Birnbaum, Mark J.; Picco, Jenna; Clements, Meghan; Witwicka, Hanna; Yang, Meiheng; Hoey, Margaret T.; Odgren, Paul R.

2010-01-01

A key goal of molecular/cell biology/biotechnology is to identify essential genes in virtually every physiological process to uncover basic mechanisms of cell function and to establish potential targets of drug therapy combating human disease. This article describes a semester-long, project-oriented molecular/cellular/biotechnology laboratory…
Genetic basis of interindividual susceptibility to cancer cachexia: selection of potential candidate gene polymorphisms for association studies.

PubMed

Johns, N; Tan, B H; MacMillan, M; Solheim, T S; Ross, J A; Baracos, V E; Damaraju, S; Fearon, K C H

2014-12-01

Cancer cachexia is a complex and multifactorial disease. Evolving definitions highlight the fact that a diverse range of biological processes contribute to cancer cachexia. Part of the variation in who will and who will not develop cancer cachexia may be genetically determined. As new definitions, classifications and biological targets continue to evolve, there is a need for reappraisal of the literature for future candidate association studies. This review summarizes genes identified or implicated as well as putative candidate genes contributing to cachexia, identified through diverse technology platforms and model systems to further guide association studies. A systematic search covering 1986-2012 was performed for potential candidate genes / genetic polymorphisms relating to cancer cachexia. All candidate genes were reviewed for functional polymorphisms or clinically significant polymorphisms associated with cachexia using the OMIM and GeneRIF databases. Pathway analysis software was used to reveal possible network associations between genes. Functionality of SNPs/genes was explored based on published literature, algorithms for detecting putative deleterious SNPs and interrogating the database for expression of quantitative trait loci (eQTLs). A total of 154 genes associated with cancer cachexia were identified and explored for functional polymorphisms. Of these 154 genes, 119 had a combined total of 281 polymorphisms with functional and/or clinical significance in terms of cachexia associated with them. Of these, 80 polymorphisms (in 51 genes) were replicated in more than one study with 24 polymorphisms found to influence two or more hallmarks of cachexia (i.e., inflammation, loss of fat mass and/or lean mass and reduced survival). Selection of candidate genes and polymorphisms is a key element of multigene study design. The present study provides a contemporary basis to select genes and/or polymorphisms for further association studies in cancer cachexia, and to develop their potential as susceptibility biomarkers of cachexia.
A fruit quality gene map of Prunus

PubMed Central

2009-01-01

Background Prunus fruit development, growth, ripening, and senescence includes major biochemical and sensory changes in texture, color, and flavor. The genetic dissection of these complex processes has important applications in crop improvement, to facilitate maximizing and maintaining stone fruit quality from production and processing through to marketing and consumption. Here we present an integrated fruit quality gene map of Prunus containing 133 genes putatively involved in the determination of fruit texture, pigmentation, flavor, and chilling injury resistance. Results A genetic linkage map of 211 markers was constructed for an intraspecific peach (Prunus persica) progeny population, Pop-DG, derived from a canning peach cultivar 'Dr. Davis' and a fresh market cultivar 'Georgia Belle'. The Pop-DG map covered 818 cM of the peach genome and included three morphological markers, 11 ripening candidate genes, 13 cold-responsive genes, 21 novel EST-SSRs from the ChillPeach database, 58 previously reported SSRs, 40 RAFs, 23 SRAPs, 14 IMAs, and 28 accessory markers from candidate gene amplification. The Pop-DG map was co-linear with the Prunus reference T × E map, with 39 SSR markers in common to align the maps. A further 158 markers were bin-mapped to the reference map: 59 ripening candidate genes, 50 cold-responsive genes, and 50 novel EST-SSRs from ChillPeach, with deduced locations in Pop-DG via comparative mapping. Several candidate genes and EST-SSRs co-located with previously reported major trait loci and quantitative trait loci for chilling injury symptoms in Pop-DG. Conclusion The candidate gene approach combined with bin-mapping and availability of a community-recognized reference genetic map provides an efficient means of locating genes of interest in a target genome. We highlight the co-localization of fruit quality candidate genes with previously reported fruit quality QTLs. The fruit quality gene map developed here is a valuable tool for dissecting the genetic architecture of fruit quality traits in Prunus crops. PMID:19995417
Summary of mutations underlying autosomal recessive congenital ichthyoses (ARCI) in Arabs with four novel mutations in ARCI-related genes from the United Arab Emirates.

PubMed

Bastaki, Fatma; Mohamed, Madiha; Nair, Pratibha; Saif, Fatima; Mustafa, Ethar M; Bizzari, Sami; Al-Ali, Mahmoud T; Hamzeh, Abdul Rezzak

2017-05-01

Clinical and molecular heterogeneity is a prominent characteristic of congenital ichthyoses, with the involvement of numerous causative loci. Mutations in these loci feature in autosomal recessive congenital ichthyoses (ARCIs) quite variably, with certain genes/mutations being more frequently uncovered in particular populations. In this study, we used whole exome sequencing as well as direct Sanger sequencing to uncover four novel mutations in ARCI-related genes, which were found in families from the United Arab Emirates. In silico tools such as CADD and SIFT Indel were used to predict the functional consequences of these mutations. The here-presented mutations occurred in three genes (ALOX12B, TGM1, ABCA12), and these are a mixture of missense and indel variants with damaging functional consequences on their encoded proteins. This study presents an overview of the mutations that were found in ARCI-related genes in Arabs and discusses molecular and clinical details pertaining to the above-mentioned Emirati cases and their novel mutations with special emphasis on the resulting protein changes. © 2017 The International Society of Dermatology.
Candidate Gene Identification of Feed Efficiency and Coat Color Traits in a C57BL/6J × Kunming F2 Mice Population Using Genome-Wide Association Study.

PubMed

Miao, Yuanxin; Soudy, Fathia; Xu, Zhong; Liao, Mingxing; Zhao, Shuhong; Li, Xinyun

2017-01-01

Feed efficiency (FE) is a very important trait in livestock industry. Identification of the candidate genes could be of benefit for the improvement of FE trait. Mouse is used as the model for many studies in mammals. In this study, the candidate genes related to FE and coat color were identified using C57BL/6J (C57) × Kunming (KM) F2 mouse population. GWAS results showed that 61 and 2 SNPs were genome-wise suggestive significantly associated with feed conversion ratio (FCR) and feed intake (FI) traits, respectively. Moreover, the Erbin, Msrb2, Ptf1a, and Fgf10 were considered as the candidate genes of FE. The Lpl was considered as the candidate gene of FI. Further, the coat color trait was studied. KM mice are white and C57 ones are black. The GWAS results showed that the most significant SNP was located at chromosome 7, and the closely linked gene was Tyr. Therefore, our study offered useful target genes related to FE in mice; these genes may play similar roles in FE of livestock. Also, we identified the major gene of coat color in mice, which would be useful for better understanding of natural mutation of the coat color in mice.
Defining the role of the MADS-box gene, Zea agamous like1, in maize domestication

USDA-ARS?s Scientific Manuscript database

Genomic scans for genes that show the signature of past selection have been widely applied to a number of species and have identified a large number of selection candidate genes. In cultivated maize (Zea mays ssp. mays) selection scans have identified several hundred candidate domestication genes...
TSUNAMI: an antisense method to phenocopy splicing-associated diseases in animals

PubMed Central

Sahashi, Kentaro; Hua, Yimin; Ling, Karen K.Y.; Hung, Gene; Rigo, Frank; Horev, Guy; Katsuno, Masahisa; Sobue, Gen; Ko, Chien-Ping; Bennett, C. Frank; Krainer, Adrian R.

2012-01-01

Antisense oligonucleotides (ASOs) are versatile molecules that can be designed to specifically alter splicing patterns of target pre-mRNAs. Here we exploit this feature to phenocopy a genetic disease. Spinal muscular atrophy (SMA) is a motor neuron disease caused by loss-of-function mutations in the SMN1 gene. The related SMN2 gene expresses suboptimal levels of functional SMN protein due to alternative splicing that skips exon 7; correcting this defect—e.g., with ASOs—is a promising therapeutic approach. We describe the use of ASOs that exacerbate SMN2 missplicing and phenocopy SMA in a dose-dependent manner when administered to transgenic Smn−/− mice. Intracerebroventricular ASO injection in neonatal mice recapitulates SMA-like progressive motor dysfunction, growth impairment, and shortened life span, with α-motor neuron loss and abnormal neuromuscular junctions. These SMA-like phenotypes are prevented by a therapeutic ASO that restores correct SMN2 splicing. We uncovered starvation-induced splicing changes, particularly in SMN2, which likely accelerate disease progression. These results constitute proof of principle that ASOs designed to cause sustained splicing defects can be used to induce pathogenesis and rapidly and accurately model splicing-associated diseases in animals. This approach allows the dissection of pathogenesis mechanisms, including spatial and temporal features of disease onset and progression, as well as testing of candidate therapeutics. PMID:22895255
Genome-wide association study uncovers a novel QTL allele of AtS40-3 that affects the sex ratio of cyst nematodes in Arabidopsis.

PubMed

Anwer, Muhammad Arslan; Anjam, Muhammad Shahzad; Shah, Syed Jehangir; Hasan, M Shamim; Naz, Ali A; Grundler, Florian M W; Siddique, Shahid

2018-03-24

Plant-parasitic cyst nematodes are obligate sedentary parasites that infect the roots of a broad range of host plants. Cyst nematodes are sexually dimorphic, but differentiation into male or female is strongly influenced by interactions with the host environment. Female populations typically predominate under favorable conditions, whereas male populations predominate under adverse conditions. Here, we performed a genome-wide association study (GWAS) in an Arabidopsis diversity panel to identify host loci underlying variation in susceptibility to cyst nematode infection. Three different susceptibility parameters were examined, with the aim of providing insights into the infection process, the number of females and males present in the infected plant, and the female-to-male sex ratio. GWAS results suggested that variation in sex ratio is associated with a novel quantitative trait locus allele on chromosome 4. Subsequent candidate genes and functional analyses revealed that a senescence-associated transcription factor, AtS40-3, and PPR may act in combination to influence nematode sex ratio. A detailed molecular characterization revealed that variation in nematode sex ratio was due to the disturbed common promoter of AtS40-3 and PPR genes. Additionally, single nucleotide polymorphisms in the coding sequence of AtS40-3 might contribute to the natural variation in nematode sex ratio.
Genome-Wide Mapping of Loci Explaining Variance in Scrotal Circumference in Nellore Cattle

PubMed Central

Utsunomiya, Yuri T.; Carmo, Adriana S.; Neves, Haroldo H. R.; Carvalheiro, Roberto; Matos, Márcia C.; Zavarez, Ludmilla B.; Ito, Pier K. R. K.; Pérez O'Brien, Ana M.; Sölkner, Johann; Porto-Neto, Laercio R.; Schenkel, Flávio S.; McEwan, John; Cole, John B.; da Silva, Marcos V. G. B.; Van Tassell, Curtis P.; Sonstegard, Tad S.; Garcia, José Fernando

2014-01-01

The reproductive performance of bulls has a high impact on the beef cattle industry. Scrotal circumference (SC) is the most recorded reproductive trait in beef herds, and is used as a major selection criterion to improve precocity and fertility. The characterization of genomic regions affecting SC can contribute to the identification of diagnostic markers for reproductive performance and uncover molecular mechanisms underlying complex aspects of bovine reproductive biology. In this paper, we report a genome-wide scan for chromosome segments explaining differences in SC, using data of 861 Nellore bulls (Bos indicus) genotyped for over 777,000 single nucleotide polymorphisms. Loci that excel from the genome background were identified on chromosomes 4, 6, 7, 10, 14, 18 and 21. The majority of these regions were previously found to be associated with reproductive and body size traits in cattle. The signal on chromosome 14 replicates the pleiotropic quantitative trait locus encompassing PLAG1 that affects male fertility in cattle and stature in several species. Based on intensive literature mining, SP4, MAGEL2, SH3RF2, PDE5A and SNAI2 are proposed as novel candidate genes for SC, as they affect growth and testicular size in other animal models. These findings contribute to linking reproductive phenotypes to gene functions, and may offer new insights on the molecular biology of male fertility. PMID:24558400
Genetic and Proteomic Interrogation of Lower Confidence Candidate Genes Reveals Signaling Networks in beta-Catenin-Active Cancers | Office of Cancer Genomics

Cancer.gov

Genome-scale expression studies and comprehensive loss-of-function genetic screens have focused almost exclusively on the highest confidence candidate genes. Here, we describe a strategy for characterizing the lower confidence candidates identified by such approaches.
Uncovering co-expression gene network modules regulating fruit acidity in diverse apples.

PubMed

Bai, Yang; Dougherty, Laura; Cheng, Lailiang; Zhong, Gan-Yuan; Xu, Kenong

2015-08-16

Acidity is a major contributor to fruit quality. Several organic acids are present in apple fruit, but malic acid is predominant and determines fruit acidity. The trait is largely controlled by the Malic acid (Ma) locus, underpinning which Ma1 that putatively encodes a vacuolar aluminum-activated malate transporter1 (ALMT1)-like protein is a strong candidate gene. We hypothesize that fruit acidity is governed by a gene network in which Ma1 is key member. The goal of this study is to identify the gene network and the potential mechanisms through which the network operates. Guided by Ma1, we analyzed the transcriptomes of mature fruit of contrasting acidity from six apple accessions of genotype Ma_ (MaMa or Mama) and four of mama using RNA-seq and identified 1301 fruit acidity associated genes, among which 18 were most significant acidity genes (MSAGs). Network inferring using weighted gene co-expression network analysis (WGCNA) revealed five co-expression gene network modules of significant (P < 0.001) correlation with malate. Of these, the Ma1 containing module (Turquoise) of 336 genes showed the highest correlation (0.79). We also identified 12 intramodular hub genes from each of the five modules and 18 enriched gene ontology (GO) terms and MapMan sub-bines, including two GO terms (GO:0015979 and GO:0009765) and two MapMap sub-bins (1.3.4 and 1.1.1.1) related to photosynthesis in module Turquoise. Using Lemon-Tree algorithms, we identified 12 regulator genes of probabilistic scores 35.5-81.0, including MDP0000525602 (a LLR receptor kinase), MDP0000319170 (an IQD2-like CaM binding protein) and MDP0000190273 (an EIN3-like transcription factor) of greater interest for being one of the 18 MSAGs or one of the 12 intramodular hub genes in Turquoise, and/or a regulator to the cluster containing Ma1. The most relevant finding of this study is the identification of the MSAGs, intramodular hub genes, enriched photosynthesis related processes, and regulator genes in a WGCNA module Turquoise that not only encompasses Ma1 but also shows the highest modular correlation with acidity. Overall, this study provides important insight into the Ma1-mediated gene network controlling acidity in mature apple fruit of diverse genetic background.
Targeting Mechanisms of Resistance to Taxane-Based Chemotherapy

DTIC Science & Technology

2007-09-01

gene ; monoamine oxidase A ( MAOA ) was upregulated in patients with PSA relapse (Figure 5A). Quantitative real-time PCR (qRT-PCR) was performed to...resistance and uncover mechanisms or pathways suitable for targeting with the objective of improving tumor responses to chemotherapy. Gene expression...CXCL10 but not IL8 conferring chemoresistance to prostate cancer cells. When using longer term clinical outcome, we found genes correlated with PSA
A low-pungency S3212 genotype of Capsicum frutescens caused by a mutation in the putative aminotransferase (p-AMT) gene.

PubMed

Park, Young-Jun; Nishikawa, Tomotaro; Minami, Mineo; Nemoto, Kazuhiro; Iwasaki, Tomohiro; Matsushima, Kenichi

2015-12-01

The purpose of this study was to identify the genetic mechanism underlying capsinoid biosynthesis in S3212, a low-pungency genotype of Capsicum frutescens. Screening of C. frutescens accessions for capsaicinoid and capsiate contents by high-performance liquid chromatography revealed that low-pungency S3212 contained high levels of capsiate but no capsaicin. Comparison of DNA coding sequences of pungent (T1 and Bird Eye) and low-pungency (S3212) genotypes uncovered a significant 12-bp deletion mutation in exon 7 of the p-AMT gene of S3212. In addition, p-AMT gene transcript levels in placental tissue were positively correlated with the degree of pungency. S3212, the low-pungency genotype, exhibited no significant p-AMT transcript levels, whereas T1, one of the pungent genotypes, displayed high transcript levels of this gene. We therefore conclude that the deletion mutation in the p-AMT gene is related to the loss of pungency in placental tissue and has given rise to the low-pungency S3212 C. frutescens genotype. C. frutescens S3212 represents a good natural source of capsinoids. Finally, our basic characterization of the uncovered p-AMT gene mutation should contribute to future studies of capsinoid biosynthesis in Capsicum.
A candidate gene study in low HDL-cholesterol families provides evidence for the involvement of the APOA2 gene and the APOA1C3A4 gene cluster.

PubMed

Lilja, Heidi E; Soro, Aino; Ylitalo, Kati; Nuotio, Ilpo; Viikari, Jorma S A; Salomaa, Veikko; Vartiainen, Erkki; Taskinen, Marja-Riitta; Peltonen, Leena; Pajukanta, Päivi

2002-09-01

In patients with premature coronary heart disease, the most common lipoprotein abnormality is high-density lipoprotein (HDL) deficiency. To assess the genetic background of the low HDL-cholesterol trait, we performed a candidate gene study in 25 families with low HDL, collected from the genetically isolated population of Finland. We studied 21 genes encoding essential proteins involved in the HDL metabolism by genotyping intragenic and flanking markers for these genes. We found suggestive evidence for linkage in two candidate regions: Marker D1S2844, in the apolipoprotein A-II (APOA2) region, yielded a LOD score of 2.14 and marker D11S939 flanking the apolipoprotein A-I/C-III/A-IV gene cluster (APOA1C3A4) produced a LOD score of 1.69. Interestingly, we identified potential shared haplotypes in these two regions in a subset of low HDL families. These families also contributed to the obtained positive LOD scores, whereas the rest of the families produced negative LOD scores. None of the remaining candidate regions provided any evidence for linkage. Since only a limited number of loci were tested in this candidate gene study, these LOD scores suggest significant involvement of the APOA2 gene and the APOA1C3A4 gene cluster, or loci in their immediate vicinity, in the pathogenesis of low HDL.
Combining mouse mammary gland gene expression and comparative mapping for the identification of candidate genes for QTL of milk production traits in cattle

PubMed Central

Ron, Micha; Israeli, Galit; Seroussi, Eyal; Weller, Joel I; Gregg, Jeffrey P; Shani, Moshe; Medrano, Juan F

2007-01-01

Background Many studies have found segregating quantitative trait loci (QTL) for milk production traits in different dairy cattle populations. However, even for relatively large effects with a saturated marker map the confidence interval for QTL location by linkage analysis spans tens of map units, or hundreds of genes. Combining mapping and arraying has been suggested as an approach to identify candidate genes. Thus, gene expression analysis in the mammary gland of genes positioned in the confidence interval of the QTL can bridge the gap between fine mapping and quantitative trait nucleotide (QTN) determination. Results We hybridized Affymetrix microarray (MG-U74v2), containing 12,488 murine probes, with RNA derived from mammary gland of virgin, pregnant, lactating and involuting C57BL/6J mice in a total of nine biological replicates. We combined microarray data from two additional studies that used the same design in mice with a total of 75 biological replicates. The same filtering and normalization was applied to each microarray data using GeneSpring software. Analysis of variance identified 249 differentially expressed probe sets common to the three experiments along the four developmental stages of puberty, pregnancy, lactation and involution. 212 genes were assigned to their bovine map positions through comparative mapping, and thus form a list of candidate genes for previously identified QTLs for milk production traits. A total of 82 of the genes showed mammary gland-specific expression with at least 3-fold expression over the median representing all tissues tested in GeneAtlas. Conclusion This work presents a web tool for candidate genes for QTL (cgQTL) that allows navigation between the map of bovine milk production QTL, potential candidate genes and their level of expression in mammary gland arrays and in GeneAtlas. Three out of four confirmed genes that affect QTL in livestock (ABCG2, DGAT1, GDF8, IGF2) were over expressed in the target organ. Thus, cgQTL can be used to determine priority of candidate genes for QTN analysis based on differential expression in the target organ. PMID:17584498
Single-cell and coupled GRN models of cell patterning in the Arabidopsis thaliana root stem cell niche

PubMed Central

2010-01-01

Background Recent experimental work has uncovered some of the genetic components required to maintain the Arabidopsis thaliana root stem cell niche (SCN) and its structure. Two main pathways are involved. One pathway depends on the genes SHORTROOT and SCARECROW and the other depends on the PLETHORA genes, which have been proposed to constitute the auxin readouts. Recent evidence suggests that a regulatory circuit, composed of WOX5 and CLE40, also contributes to the SCN maintenance. Yet, we still do not understand how the niche is dynamically maintained and patterned or if the uncovered molecular components are sufficient to recover the observed gene expression configurations that characterize the cell types within the root SCN. Mathematical and computational tools have proven useful in understanding the dynamics of cell differentiation. Hence, to further explore root SCN patterning, we integrated available experimental data into dynamic Gene Regulatory Network (GRN) models and addressed if these are sufficient to attain observed gene expression configurations in the root SCN in a robust and autonomous manner. Results We found that an SCN GRN model based only on experimental data did not reproduce the configurations observed within the root SCN. We developed several alternative GRN models that recover these expected stable gene configurations. Such models incorporate a few additional components and interactions in addition to those that have been uncovered. The recovered configurations are stable to perturbations, and the models are able to recover the observed gene expression profiles of almost all the mutants described so far. However, the robustness of the postulated GRNs is not as high as that of other previously studied networks. Conclusions These models are the first published approximations for a dynamic mechanism of the A. thaliana root SCN cellular pattering. Our model is useful to formally show that the data now available are not sufficient to fully reproduce root SCN organization and genetic profiles. We then highlight some experimental holes that remain to be studied and postulate some novel gene interactions. Finally, we suggest the existence of a generic dynamical motif that can be involved in both plant and animal SCN maintenance. PMID:20920363

HUBBLE'S TOP TEN GRAVITATIONAL LENSES

NASA Technical Reports Server (NTRS)

2002-01-01

The NASA Hubble Space Telescope serendipitous survey of the sky has uncovered exotic patterns, rings, arcs and crosses that are all optical mirages produced by a gravitational lens, nature's equivalent of having giant magnifying glass in space. Shown are the top 10 lens candidates uncovered in the deepest 100 Hubble fields. Hubble's sensitivity and high resolution allow it to see faint and distant lenses that cannot be detected with ground-based telescopes whose images are blurred by Earth's atmosphere. [Top Left] - HST 01248+0351 is a lensed pair on either side of the edge-on disk lensing galaxy. [Top Center] - HST 01247+0352 is another pair of bluer lensed source images around the red spherical elliptical lensing galaxy. Two much fainter images can be seen near the detection limit which might make this a quadruple system. [Top Right] - HST 15433+5352 is a very good lens candidate with a bluer lensed source in the form of an extended arc about the redder elliptical lensing galaxy. [Middle Far Left] - HST 16302+8230 could be an 'Einstein ring' and the most intriguing lens candidate. It has been nicknamed the 'the London Underground' since it resembles that logo. [Middle Near Left] - HST 14176+5226 is the first, and brightest lens system discovered in 1995 with the Hubble telescope. This lens candidate has now been confirmed spectroscopically using large ground-based telescopes. The elliptical lensing galaxy is located 7 billion light-years away, and the lensed quasar is about 11 billion light-years distant. [Middle Near Right] - HST 12531-2914 is the second quadruple lens candidate discovered with Hubble. It is similar to the first, but appears smaller and fainter. [Middle Far Right] - HST 14164+5215 is a pair of bluish lensed images symmetrically placed around a brighter, redder galaxy. [Bottom Left] - HST 16309+8230 is an edge-on disk-like galaxy (blue arc) which has been significantly distorted by the redder lensing elliptical galaxy. [Bottom Center] - HST 12368+6212 is a blue arc in the Hubble Deep Field (HDF). [Bottom Right] - HST 18078+4600 is a blue arc caused by the gravitational potential of a small group of 4 galaxies. Credit: Kavan Ratnatunga (Carnegie Mellon Univ.) and NASA
Breast Tumors with Elevated Expression of 1q Candidate Genes Confer Poor Clinical Outcome and Sensitivity to Ras/PI3K Inhibition

PubMed Central

Viveka Thangaraj, Soundara; Periasamy, Jayaprakash; Bhaskar Rao, Divya; Barnabas, Georgina D.; Raghavan, Swetha; Ganesan, Kumaresan

2013-01-01

Genomic aberrations are common in cancers and the long arm of chromosome 1 is known for its frequent amplifications in breast cancer. However, the key candidate genes of 1q, and their contribution in breast cancer pathogenesis remain unexplored. We have analyzed the gene expression profiles of 1635 breast tumor samples using meta-analysis based approach and identified clinically significant candidates from chromosome 1q. Seven candidate genes including exonuclease 1 (EXO1) are consistently over expressed in breast tumors, specifically in high grade and aggressive breast tumors with poor clinical outcome. We derived a EXO1 co-expression module from the mRNA profiles of breast tumors which comprises 1q candidate genes and their co-expressed genes. By integrative functional genomics investigation, we identified the involvement of EGFR, RAS, PI3K / AKT, MYC, E2F signaling in the regulation of these selected 1q genes in breast tumors and breast cancer cell lines. Expression of EXO1 module was found as indicative of elevated cell proliferation, genomic instability, activated RAS/AKT/MYC/E2F1 signaling pathways and loss of p53 activity in breast tumors. mRNA–drug connectivity analysis indicates inhibition of RAS/PI3K as a possible targeted therapeutic approach for the patients with activated EXO1 module in breast tumors. Thus, we identified seven 1q candidate genes strongly associated with the poor survival of breast cancer patients and identified the possibility of targeting them with EGFR/RAS/PI3K inhibitors. PMID:24147022
Identification of genes from the Treacher Collins candidate region

DOE Office of Scientific and Technical Information (OSTI.GOV)

Dixon, M.; Dixon, J.; Edwards, S.

Treacher Collins syndrome (TCOF1) is an autosomal dominant disorder of craniofacial development. The TCOF1 locus has previously been mapped to chromosome 5q32-33. The candidate gene region has been defined as being between two flanking markers, ribosomal protein S14 (RPS14) and Annexin 6 (ANX6), by analyzing recombination events in affected individuals. It is estimated that the distance between these flanking markers is 500 kb by three separate analysis methods: (1) radiation hybrid mapping; (2) genetic linkage; and (3) YAC contig analysis. A cosmid contig which spans the candidate gene region for TCOF1 has been constructed by screening the Los Alamos Nationalmore » Laboratory flow-sorted chromosome 5 cosmid library. Cosmids were obtained by using a combination of probes generated from YAC end clones, Alu-PCR fragments from YACs, and asymmetric PCR fragments from both T7 and T3 cosmid ends. Exon amplifications, the selection of genomic coding sequences based upon the presence of functional splice acceptor and donor sites, was used to identify potential exon sequences. Sequences found to be conserved between species were then used to screen cDNA libraries in order to identify candidate genes. To date, four different cDNAs have been isolated from this region and are being analyzed as potential candidate genes for TCOF1. These include the genes encoding plasma glutathione peroxidase (GPX3), heparin sulfate sulfotransferase (HSST), a gene with homology to the ETS family of proteins and one which shows no homology to any known genes. Work is also in progress to identify and characterize additional cDNAs from the candidate gene region.« less
Mapping a candidate gene (MdMYB10) for red flesh and foliage colour in apple

PubMed Central

Chagné, David; Carlisle, Charmaine M; Blond, Céline; Volz, Richard K; Whitworth, Claire J; Oraguzie, Nnadozie C; Crowhurst, Ross N; Allan, Andrew C; Espley, Richard V; Hellens, Roger P; Gardiner, Susan E

2007-01-01

Background Integrating plant genomics and classical breeding is a challenge for both plant breeders and molecular biologists. Marker-assisted selection (MAS) is a tool that can be used to accelerate the development of novel apple varieties such as cultivars that have fruit with anthocyanin through to the core. In addition, determining the inheritance of novel alleles, such as the one responsible for red flesh, adds to our understanding of allelic variation. Our goal was to map candidate anthocyanin biosynthetic and regulatory genes in a population segregating for the red flesh phenotypes. Results We have identified the Rni locus, a major genetic determinant of the red foliage and red colour in the core of apple fruit. In a population segregating for the red flesh and foliage phenotype we have determined the inheritance of the Rni locus and DNA polymorphisms of candidate anthocyanin biosynthetic and regulatory genes. Simple Sequence Repeats (SSRs) and Single Nucleotide Polymorphisms (SNPs) in the candidate genes were also located on an apple genetic map. We have shown that the MdMYB10 gene co-segregates with the Rni locus and is on Linkage Group (LG) 09 of the apple genome. Conclusion We have performed candidate gene mapping in a fruit tree crop and have provided genetic evidence that red colouration in the fruit core as well as red foliage are both controlled by a single locus named Rni. We have shown that the transcription factor MdMYB10 may be the gene underlying Rni as there were no recombinants between the marker for this gene and the red phenotype in a population of 516 individuals. Associating markers derived from candidate genes with a desirable phenotypic trait has demonstrated the application of genomic tools in a breeding programme of a horticultural crop species. PMID:17608951
Antennal transcriptome analysis of the piercing moth Oraesia emarginata (Lepidoptera: Noctuidae)

PubMed Central

Feng, Bo; Guo, Qianshuang; Zheng, Kaidi; Qin, Yuanxia; Du, Yongjun

2017-01-01

The piercing fruit moth Oraesia emarginata is an economically significant pest; however, our understanding of its olfactory mechanisms in infestation is limited. The present study conducted antennal transcriptome analysis of olfactory genes using real-time quantitative reverse transcription PCR analysis (RT-qPCR). We identified a total of 104 candidate chemosensory genes from several gene families, including 35 olfactory receptors (ORs), 41 odorant-binding proteins, 20 chemosensory proteins, 6 ionotropic receptors, and 2 sensory neuron membrane proteins. Seven candidate pheromone receptors (PRs) and 3 candidate pheromone-binding proteins (PBPs) for sex pheromone recognition were found. OemaOR29 and OemaPBP1 had the highest fragments per kb per million fragments (FPKM) values in all ORs and OBPs, respectively. Eighteen olfactory genes were upregulated in females, including 5 candidate PRs, and 20 olfactory genes were upregulated in males, including 2 candidate PRs (OemaOR29 and 4) and 2 PBPs (OemaPBP1 and 3). These genes may have roles in mediating sex-specific behaviors. Most candidate olfactory genes of sex pheromone recognition (except OemaOR29 and OemaPBP3) in O. emarginata were not clustered with those of studied noctuid species (type I pheromone). In addition, OemaOR29 was belonged to cluster PRIII, which comprise proteins that recognize type II pheromones instead of type I pheromones. The structure and function of olfactory genes that encode sex pheromones in O. emarginata might thus differ from those of other studied noctuids. The findings of the present study may help explain the molecular mechanism underlying olfaction and the evolution of olfactory genes encoding sex pheromones in O. emarginata. PMID:28614384
Elevated transcription factor specificity protein 1 in autistic brains alters the expression of autism candidate genes.

PubMed

Thanseem, Ismail; Anitha, Ayyappan; Nakamura, Kazuhiko; Suda, Shiro; Iwata, Keiko; Matsuzaki, Hideo; Ohtsubo, Masafumi; Ueki, Takatoshi; Katayama, Taiichi; Iwata, Yasuhide; Suzuki, Katsuaki; Minoshima, Shinsei; Mori, Norio

2012-03-01

Profound changes in gene expression can result from abnormalities in the concentrations of sequence-specific transcription factors like specificity protein 1 (Sp1). Specificity protein 1 binding sites have been reported in the promoter regions of several genes implicated in autism. We hypothesize that dysfunction of Sp1 could affect the expression of multiple autism candidate genes, contributing to the heterogeneity of autism. We assessed any alterations in the expression of Sp1 and that of autism candidate genes in the postmortem brain (anterior cingulate gyrus [ACG], motor cortex, and thalamus) of autism patients (n = 8) compared with healthy control subjects (n = 13). Alterations in the expression of candidate genes upon Sp1/DNA binding inhibition with mithramycin and Sp1 silencing by RNAi were studied in SK-N-SH neuronal cells. We observed elevated expression of Sp1 in ACG of autism patients (p = .010). We also observed altered expression of several autism candidate genes. GABRB3, RELN, and HTR2A showed reduced expression, whereas CD38, ITGB3, MAOA, MECP2, OXTR, and PTEN showed elevated expression in autism. In SK-N-SH cells, OXTR, PTEN, and RELN showed reduced expression upon Sp1/DNA binding inhibition and Sp1 silencing. The RNA integrity number was not available for any of the samples. Transcription factor Sp1 is dysfunctional in the ACG of autistic brain. Consequently, the expression of potential autism candidate genes regulated by Sp1, especially OXTR and PTEN, could be affected. The diverse downstream pathways mediated by the Sp1-regulated genes, along with the environmental and intracellular signal-related regulation of Sp1, could explain the complex phenotypes associated with autism.
EBF factors drive expression of multiple classes of target genes governing neuronal development.

PubMed

Green, Yangsook S; Vetter, Monica L

2011-04-30

Early B cell factor (EBF) family members are transcription factors known to have important roles in several aspects of vertebrate neurogenesis, including commitment, migration and differentiation. Knowledge of how EBF family members contribute to neurogenesis is limited by a lack of detailed understanding of genes that are transcriptionally regulated by these factors. We performed a microarray screen in Xenopus animal caps to search for targets of EBF transcriptional activity, and identified candidate targets with multiple roles, including transcription factors of several classes. We determined that, among the most upregulated candidate genes with expected neuronal functions, most require EBF activity for some or all of their expression, and most have overlapping expression with ebf genes. We also found that the candidate target genes that had the most strongly overlapping expression patterns with ebf genes were predicted to be direct transcriptional targets of EBF transcriptional activity. The identification of candidate targets that are transcription factor genes, including nscl-1, emx1 and aml1, improves our understanding of how EBF proteins participate in the hierarchy of transcription control during neuronal development, and suggests novel mechanisms by which EBF activity promotes migration and differentiation. Other candidate targets, including pcdh8 and kcnk5, expand our knowledge of the types of terminal differentiated neuronal functions that EBF proteins regulate.
Development of New Candidate Gene and EST-Based Molecular Markers for Gossypium Species

PubMed Central

Buyyarapu, Ramesh; Kantety, Ramesh V.; Yu, John Z.; Saha, Sukumar; Sharma, Govind C.

2011-01-01

New source of molecular markers accelerate the efforts in improving cotton fiber traits and aid in developing high-density integrated genetic maps. We developed new markers based on candidate genes and G. arboreum EST sequences that were used for polymorphism detection followed by genetic and physical mapping. Nineteen gene-based markers were surveyed for polymorphism detection in 26 Gossypium species. Cluster analysis generated a phylogenetic tree with four major sub-clusters for 23 species while three species branched out individually. CAP method enhanced the rate of polymorphism of candidate gene-based markers between G. hirsutum and G. barbadense. Two hundred A-genome based SSR markers were designed after datamining of G. arboreum EST sequences (Mississippi Gossypium arboreum EST-SSR: MGAES). Over 70% of MGAES markers successfully produced amplicons while 65 of them demonstrated polymorphism between the parents of G. hirsutum and G. barbadense RIL population and formed 14 linkage groups. Chromosomal localization of both candidate gene-based and MGAES markers was assisted by euploid and hypoaneuploid CS-B analysis. Gene-based and MGAES markers were highly informative as they were designed from candidate genes and fiber transcriptome with a potential to be integrated into the existing cotton genetic and physical maps. PMID:22315588
The multiple roles of TDP-43 in pre-mRNA processing and gene expression regulation.

PubMed

Buratti, Emanuele; Baralle, Francisco Ernesto

2010-01-01

Heterogeneous ribonucleoproteins (hnRNPs) are multifunctional RNA-binding proteins (RBPs) involved in many cellular processes. They participate in most gene expression pathways, from DNA replication and repair to mRNA translation. Among this class of proteins, TDP-43 (and more recently FUS/TLS) have received considerable attention due to their involvement in several neurodegenerative diseases. This finding has prompted many research groups to focus on the gene expression pathways that are regulated by these proteins. The results have uncovered a considerable complexity of TDP-43 and FUS/TLS functions due to the many independent mechanisms by which they may act to influence various cellular processes (such as DNA transcription, pre-mRNA splicing, mRNA export/import). The aim of this chapter will be to review especially some of the novel functions that have been uncovered, such as role in miRNA synthesis, regulation of transcript levels, and potential autoregulatory mechanisms in order to provide the basis for further investigations.
ketu mutant mice uncover an essential meiotic function for the ancient RNA helicase YTHDC2

PubMed Central

Jain, Devanshi; Puno, M Rhyan; Meydan, Cem; Lailler, Nathalie; Mason, Christopher E; Lima, Christopher D; Anderson, Kathryn V

2018-01-01

Mechanisms regulating mammalian meiotic progression are poorly understood. Here we identify mouse YTHDC2 as a critical component. A screen yielded a sterile mutant, ‘ketu’, caused by a Ythdc2 missense mutation. Mutant germ cells enter meiosis but proceed prematurely to aberrant metaphase and apoptosis, and display defects in transitioning from spermatogonial to meiotic gene expression programs. ketu phenocopies mutants lacking MEIOC, a YTHDC2 partner. Consistent with roles in post-transcriptional regulation, YTHDC2 is cytoplasmic, has 3′→5′ RNA helicase activity in vitro, and has similarity within its YTH domain to an N6-methyladenosine recognition pocket. Orthologs are present throughout metazoans, but are diverged in nematodes and, more dramatically, Drosophilidae, where Bgcn is descended from a Ythdc2 gene duplication. We also uncover similarity between MEIOC and Bam, a Bgcn partner unique to schizophoran flies. We propose that regulation of gene expression by YTHDC2-MEIOC is an evolutionarily ancient strategy for controlling the germline transition into meiosis. PMID:29360036
Analysis and functional annotation of expressed sequence tags (ESTs) from multiple tissues of oil palm (Elaeis guineensis Jacq.)

PubMed Central

Ho, Chai-Ling; Kwan, Yen-Yen; Choi, Mei-Chooi; Tee, Sue-Sean; Ng, Wai-Har; Lim, Kok-Ang; Lee, Yang-Ping; Ooi, Siew-Eng; Lee, Weng-Wah; Tee, Jin-Ming; Tan, Siang-Hee; Kulaveerasingam, Harikrishna; Alwee, Sharifah Shahrul Rabiah Syed; Abdullah, Meilina Ong

2007-01-01

Background Oil palm is the second largest source of edible oil which contributes to approximately 20% of the world's production of oils and fats. In order to understand the molecular biology involved in in vitro propagation, flowering, efficient utilization of nitrogen sources and root diseases, we have initiated an expressed sequence tag (EST) analysis on oil palm. Results In this study, six cDNA libraries from oil palm zygotic embryos, suspension cells, shoot apical meristems, young flowers, mature flowers and roots, were constructed. We have generated a total of 14537 expressed sequence tags (ESTs) from these libraries, from which 6464 tentative unique contigs (TUCs) and 2129 singletons were obtained. Approximately 6008 of these tentative unique genes (TUGs) have significant matches to the non-redundant protein database, from which 2361 were assigned to one or more Gene Ontology categories. Predominant transcripts and differentially expressed genes were identified in multiple oil palm tissues. Homologues of genes involved in many aspects of flower development were also identified among the EST collection, such as CONSTANS-like, AGAMOUS-like (AGL)2, AGL20, LFY-like, SQUAMOSA, SQUAMOSA binding protein (SBP) etc. Majority of them are the first representatives in oil palm, providing opportunities to explore the cause of epigenetic homeotic flowering abnormality in oil palm, given the importance of flowering in fruit production. The transcript levels of two flowering-related genes, EgSBP and EgSEP were analysed in the flower tissues of various developmental stages. Gene homologues for enzymes involved in oil biosynthesis, utilization of nitrogen sources, and scavenging of oxygen radicals, were also uncovered among the oil palm ESTs. Conclusion The EST sequences generated will allow comparative genomic studies between oil palm and other monocotyledonous and dicotyledonous plants, development of gene-targeted markers for the reference genetic map, design and fabrication of DNA array for future studies of oil palm. The outcomes of such studies will contribute to oil palm improvements through the establishment of breeding program using marker-assisted selection, development of diagnostic assays using gene targeted markers, and discovery of candidate genes related to important agronomic traits of oil palm. PMID:17953740
Identification of candidate transmission-blocking antigen genes in Theileria annulata and related vector-borne apicomplexan parasites.

PubMed

Lempereur, Laetitia; Larcombe, Stephen D; Durrani, Zeeshan; Karagenc, Tulin; Bilgic, Huseyin Bilgin; Bakirci, Serkan; Hacilarlioglu, Selin; Kinnaird, Jane; Thompson, Joanne; Weir, William; Shiels, Brian

2017-06-05

Vector-borne apicomplexan parasites are a major cause of mortality and morbidity to humans and livestock globally. The most important disease syndromes caused by these parasites are malaria, babesiosis and theileriosis. Strategies for control often target parasite stages in the mammalian host that cause disease, but this can result in reservoir infections that promote pathogen transmission and generate economic loss. Optimal control strategies should protect against clinical disease, block transmission and be applicable across related genera of parasites. We have used bioinformatics and transcriptomics to screen for transmission-blocking candidate antigens in the tick-borne apicomplexan parasite, Theileria annulata. A number of candidate antigen genes were identified which encoded amino acid domains that are conserved across vector-borne Apicomplexa (Babesia, Plasmodium and Theileria), including the Pfs48/45 6-cys domain and a novel cysteine-rich domain. Expression profiling confirmed that selected candidate genes are expressed by life cycle stages within infected ticks. Additionally, putative B cell epitopes were identified in the T. annulata gene sequences encoding the 6-cys and cysteine rich domains, in a gene encoding a putative papain-family cysteine peptidase, with similarity to the Plasmodium SERA family, and the gene encoding the T. annulata major merozoite/piroplasm surface antigen, Tams1. Candidate genes were identified that encode proteins with similarity to known transmission blocking candidates in related parasites, while one is a novel candidate conserved across vector-borne apicomplexans and has a potential role in the sexual phase of the life cycle. The results indicate that a 'One Health' approach could be utilised to develop a transmission-blocking strategy effective against vector-borne apicomplexan parasites of animals and humans.
Copy number variability in Parkinson's disease: assembling the puzzle through a systems biology approach.

PubMed

La Cognata, Valentina; Morello, Giovanna; D'Agata, Velia; Cavallaro, Sebastiano

2017-01-01

Parkinson's disease (PD), the second most common progressive neurodegenerative disorder of aging, was long believed to be a non-genetic sporadic origin syndrome. The proof that several genetic loci are responsible for rare Mendelian forms has represented a revolutionary breakthrough, enabling to reveal molecular mechanisms underlying this debilitating still incurable condition. While single nucleotide polymorphisms (SNPs) and small indels constitute the most commonly investigated DNA variations accounting for only a limited number of PD cases, larger genomic molecular rearrangements have emerged as significant PD-causing mutations, including submicroscopic Copy Number Variations (CNVs). CNVs constitute a prevalent source of genomic variations and substantially participate in each individual's genomic makeup and phenotypic outcome. However, the majority of genetic studies have focused their attention on single candidate-gene mutations or on common variants reaching a significant statistical level of acceptance. This gene-centric approach is insufficient to uncover the genetic background of polygenic multifactorial disorders like PD, and potentially masks rare individual CNVs that all together might contribute to disease development or progression. In this review, we will discuss literature and bioinformatic data describing the involvement of CNVs on PD pathobiology. We will analyze the most frequent copy number changes in familiar PD genes and provide a "systems biology" overview of rare individual rearrangements that could functionally act on commonly deregulated molecular pathways. Assessing the global genome-wide burden of CNVs in PD patients may reveal new disease-related molecular mechanisms, and open the window to a new possible genetic scenario in the unsolved PD puzzle.
Sleeping Beauty transposon mutagenesis identifies genes that cooperate with mutant Smad4 in gastric cancer development

PubMed Central

Takeda, Haruna; Rust, Alistair G.; Ward, Jerrold M.; Yew, Christopher Chin Kuan; Jenkins, Nancy A.; Copeland, Neal G.

2016-01-01

Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4+/− mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC. PMID:27006499
Sleeping Beauty transposon mutagenesis identifies genes that cooperate with mutant Smad4 in gastric cancer development.

PubMed

Takeda, Haruna; Rust, Alistair G; Ward, Jerrold M; Yew, Christopher Chin Kuan; Jenkins, Nancy A; Copeland, Neal G

2016-04-05

Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4(+/-) mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC.
Identification of Candidate Genes Responsible for Stem Pith Production Using Expression Analysis in Solid-Stemmed Wheat.

PubMed

Oiestad, A J; Martin, J M; Cook, J; Varella, A C; Giroux, M J

2017-07-01

The wheat stem sawfly (WSS) is an economically important pest of wheat in the Northern Great Plains. The primary means of WSS control is resistance associated with the single quantitative trait locus (QTL) , which controls most stem solidness variation. The goal of this study was to identify stem solidness candidate genes via RNA-seq. This study made use of 28 single nucleotide polymorphism (SNP) makers derived from expressed sequence tags (ESTs) linked to contained within a 5.13 cM region. Allele specific expression of EST markers was examined in stem tissue for solid and hollow-stemmed pairs of two spring wheat near isogenic lines (NILs) differing for the QTL. Of the 28 ESTs, 13 were located within annotated genes and 10 had detectable stem expression. Annotated genes corresponding to four of the ESTs were differentially expressed between solid and hollow-stemmed NILs and represent possible stem solidness gene candidates. Further examination of the 5.13 cM region containing the 28 EST markers identified 260 annotated genes. Twenty of the 260 linked genes were up-regulated in hollow NIL stems, while only seven genes were up-regulated in solid NIL stems. An -methyltransferase within the region of interest was identified as a candidate based on differential expression between solid and hollow-stemmed NILs and putative function. Further study of these candidate genes may lead to the identification of the gene(s) controlling stem solidness and an increased ability to select for wheat stem solidness and manage WSS. Copyright © 2017 Crop Science Society of America.
confFuse: High-Confidence Fusion Gene Detection across Tumor Entities.

PubMed

Huang, Zhiqin; Jones, David T W; Wu, Yonghe; Lichter, Peter; Zapatka, Marc

2017-01-01

Background: Fusion genes play an important role in the tumorigenesis of many cancers. Next-generation sequencing (NGS) technologies have been successfully applied in fusion gene detection for the last several years, and a number of NGS-based tools have been developed for identifying fusion genes during this period. Most fusion gene detection tools based on RNA-seq data report a large number of candidates (mostly false positives), making it hard to prioritize candidates for experimental validation and further analysis. Selection of reliable fusion genes for downstream analysis becomes very important in cancer research. We therefore developed confFuse, a scoring algorithm to reliably select high-confidence fusion genes which are likely to be biologically relevant. Results: confFuse takes multiple parameters into account in order to assign each fusion candidate a confidence score, of which score ≥8 indicates high-confidence fusion gene predictions. These parameters were manually curated based on our experience and on certain structural motifs of fusion genes. Compared with alternative tools, based on 96 published RNA-seq samples from different tumor entities, our method can significantly reduce the number of fusion candidates (301 high-confidence from 8,083 total predicted fusion genes) and keep high detection accuracy (recovery rate 85.7%). Validation of 18 novel, high-confidence fusions detected in three breast tumor samples resulted in a 100% validation rate. Conclusions: confFuse is a novel downstream filtering method that allows selection of highly reliable fusion gene candidates for further downstream analysis and experimental validations. confFuse is available at https://github.com/Zhiqin-HUANG/confFuse.
Linked genetic variants on chromosome 10 control ear morphology and body mass among dog breeds.

PubMed

Webster, Matthew T; Kamgari, Nona; Perloski, Michele; Hoeppner, Marc P; Axelsson, Erik; Hedhammar, Åke; Pielberg, Gerli; Lindblad-Toh, Kerstin

2015-06-23

The domestic dog is a rich resource for mapping the genetic components of phenotypic variation due to its unique population history involving strong artificial selection. Genome-wide association studies have revealed a number of chromosomal regions where genetic variation associates with morphological characters that typify dog breeds. A region on chromosome 10 is among those with the highest levels of genetic differentiation between dog breeds and is associated with body mass and ear morphology, a common motif of animal domestication. We characterised variation in this region to uncover haplotype structure and identify candidate functional variants. We first identified SNPs that strongly associate with body mass and ear type by comparing sequence variation in a 3 Mb region between 19 breeds with a variety of phenotypes. We next genotyped a subset of 123 candidate SNPs in 288 samples from 46 breeds to identify the variants most highly associated with phenotype and infer haplotype structure. A cluster of SNPs that associate strongly with the drop ear phenotype is located within a narrow interval downstream of the gene MSRB3, which is involved in human hearing. These SNPs are in strong genetic linkage with another set of variants that correlate with body mass within the gene HMGA2, which affects human height. In addition we find evidence that this region has been under selection during dog domestication, and identify a cluster of SNPs within MSRB3 that are highly differentiated between dogs and wolves. We characterise genetically linked variants that potentially influence ear type and body mass in dog breeds, both key traits that have been modified by selective breeding that may also be important for domestication. The finding that variants on long haplotypes have effects on more than one trait suggests that genetic linkage can be an important determinant of the phenotypic response to selection in domestic animals.
Survey of candidate genes for maize resistance to infection by Aspergillus flavus and/or aflatoxin contamination

Treesearch

Leigh Hawkins; Marilyn Warburton; Juliet Tang; John Tomashek; Dafne Alves Oliveira; Oluwaseun Ogunola; J. Smith; W. Williams

2018-01-01

Many projects have identified candidate genes for resistance to aflatoxin accumulation or Aspergillus flavus infection and growth in maize using genetic mapping, genomics, transcriptomics and/or proteomics studies. However, only a small percentage of these candidates have been validated in field conditions, and their relative contribution to...
A comparative analysis of genetic diversity of candidate genes associated with type 2 diabetes in worldwide populations.

PubMed

Gong, Xian; Zhang, Chao; Yiliyasi·Aisa, Yiliyasi·Aisa; Shi, Ying; Yang, Xue-wei; NuersimanguliAosiman, NuersimanguliAosiman; Guan, Ya-qun; Xu, Shu-hua

2016-06-20

Over the last decade, a larger number of type 2 diabetes mellitus (T2DM) susceptible candidate genes have been reported by numerous genome-wide association studies (GWAS). Understanding the genetic diversity of these candidate genes among worldwide populations not only facilitates to elucidating the genetic mechanism of T2DM, but also provides guidance to further studies of pathogenesis of T2DM in any certain population. In this study, we identified 170 genes or genomic regions associated with T2DM by searching the GWAS databases and related literatures. We next analyzed the genetic diversity of these genes (or genomic regions) among present-day human populations by curetting the 1000 Genomes Projects phase1 dataset covering 14 worldwide populations. We further compared the characteristics of T2DM genes in different populations. No significant differences of genetic diversity were observed among the 14 worldwide populations between the T2DM candidate genes and the non-T2DM genes in terms of overall pattern. However, we observed some genes, such as IL20RA, RNMTL1-NXN, NOTCH2, ADRA2A-BTBD7P2, TBC1D4, RBM38-HMGB1P1, UBE2E2, and PPARD, show considerable differentiation between populations. In particular, IL20RA (FST=0.1521) displays the greatest population difference which is mainly contributed by that between Africans and non-Africans. Moreover, we revealed genetic differences between East Asians and Europeans on some candidate genes such as DGKB-AGMO (FST=0.173) and JAZF1 (FST=0.182). Our results indicate that some T2DM susceptible candidate genes harbor highly-differentiated variants between populations. These analyses, despite preliminary, should advance our understanding of the population difference of susceptibility to T2DM and provide insightful reference that future studies can relay on.

Characterization of a new endogenous endo-ß-1,4-glucanase of Formosan subterranean termite (Coptotermes formosanus)

USDA-ARS?s Scientific Manuscript database

The present work characterized a second endogenous cellulase (endo-ß-1,4-glucanase) gene, CfEG4, uncovered in the transcriptome of Formosan subterranean termite (Coptotermes formosanus). The full-length gene was cloned and sequenced. It is similar to the CfEG3a described earlier (Zhang et al. 2009) ...
Transcriptome analysis and identification of genes associated with omega-3 fatty acid biosynthesis in Perilla frutescens (L.) var. frutescens

USDA-ARS?s Scientific Manuscript database

Background: Perilla (Perilla frutescens (L.) var frutescens) produces high levels of a-linolenic acid (ALA), an omega-3 fatty acid important to health and development. To uncover key genes involved in fatty acid (FA) and triacylglycerol (TAG) synthesis in perilla, we conducted deep sequencing of cD...
Identification of Immunity Related Genes to Study the Physalis peruviana – Fusarium oxysporum Pathosystem

PubMed Central

Enciso-Rodríguez, Felix E.; González, Carolina; Rodríguez, Edwin A.; López, Camilo E.; Landsman, David; Barrero, Luz Stella; Mariño-Ramírez, Leonardo

2013-01-01

The Cape gooseberry ( Physalis peruviana L) is an Andean exotic fruit with high nutritional value and appealing medicinal properties. However, its cultivation faces important phytosanitary problems mainly due to pathogens like Fusarium oxysporum, Cercosporaphysalidis and Alternaria spp. Here we used the Cape gooseberry foliar transcriptome to search for proteins that encode conserved domains related to plant immunity including: NBS (Nucleotide Binding Site), CC (Coiled-Coil), TIR (Toll/Interleukin-1 Receptor). We identified 74 immunity related gene candidates in P . peruviana which have the typical resistance gene (R-gene) architecture, 17 Receptor like kinase (RLKs) candidates related to PAMP-Triggered Immunity (PTI), eight (TIR-NBS-LRR, or TNL) and nine (CC–NBS-LRR, or CNL) candidates related to Effector-Triggered Immunity (ETI) genes among others. These candidate genes were categorized by molecular function (98%), biological process (85%) and cellular component (79%) using gene ontology. Some of the most interesting predicted roles were those associated with binding and transferase activity. We designed 94 primers pairs from the 74 immunity-related genes (IRGs) to amplify the corresponding genomic regions on six genotypes that included resistant and susceptible materials. From these, we selected 17 single band amplicons and sequenced them in 14 F. oxysporum resistant and susceptible genotypes. Sequence polymorphisms were analyzed through preliminary candidate gene association, which allowed the detection of one SNP at the PpIRG-63 marker revealing a nonsynonymous mutation in the predicted LRR domain suggesting functional roles for resistance. PMID:23844210
Identification of immunity related genes to study the Physalis peruviana--Fusarium oxysporum pathosystem.

PubMed

Enciso-Rodríguez, Felix E; González, Carolina; Rodríguez, Edwin A; López, Camilo E; Landsman, David; Barrero, Luz Stella; Mariño-Ramírez, Leonardo

2013-01-01

The Cape gooseberry (Physalisperuviana L) is an Andean exotic fruit with high nutritional value and appealing medicinal properties. However, its cultivation faces important phytosanitary problems mainly due to pathogens like Fusarium oxysporum, Cercosporaphysalidis and Alternaria spp. Here we used the Cape gooseberry foliar transcriptome to search for proteins that encode conserved domains related to plant immunity including: NBS (Nucleotide Binding Site), CC (Coiled-Coil), TIR (Toll/Interleukin-1 Receptor). We identified 74 immunity related gene candidates in P. peruviana which have the typical resistance gene (R-gene) architecture, 17 Receptor like kinase (RLKs) candidates related to PAMP-Triggered Immunity (PTI), eight (TIR-NBS-LRR, or TNL) and nine (CC-NBS-LRR, or CNL) candidates related to Effector-Triggered Immunity (ETI) genes among others. These candidate genes were categorized by molecular function (98%), biological process (85%) and cellular component (79%) using gene ontology. Some of the most interesting predicted roles were those associated with binding and transferase activity. We designed 94 primers pairs from the 74 immunity-related genes (IRGs) to amplify the corresponding genomic regions on six genotypes that included resistant and susceptible materials. From these, we selected 17 single band amplicons and sequenced them in 14 F. oxysporum resistant and susceptible genotypes. Sequence polymorphisms were analyzed through preliminary candidate gene association, which allowed the detection of one SNP at the PpIRG-63 marker revealing a nonsynonymous mutation in the predicted LRR domain suggesting functional roles for resistance.
Next-generation sequencing for identification of candidate genes for Fusarium wilt and sterility mosaic disease in pigeonpea (Cajanus cajan).

PubMed

Singh, Vikas K; Khan, Aamir W; Saxena, Rachit K; Kumar, Vinay; Kale, Sandip M; Sinha, Pallavi; Chitikineni, Annapurna; Pazhamala, Lekha T; Garg, Vanika; Sharma, Mamta; Sameer Kumar, Chanda Venkata; Parupalli, Swathi; Vechalapu, Suryanarayana; Patil, Suyash; Muniswamy, Sonnappa; Ghanta, Anuradha; Yamini, Kalinati Narasimhan; Dharmaraj, Pallavi Subbanna; Varshney, Rajeev K

2016-05-01

To map resistance genes for Fusarium wilt (FW) and sterility mosaic disease (SMD) in pigeonpea, sequencing-based bulked segregant analysis (Seq-BSA) was used. Resistant (R) and susceptible (S) bulks from the extreme recombinant inbred lines of ICPL 20096 × ICPL 332 were sequenced. Subsequently, SNP index was calculated between R- and S-bulks with the help of draft genome sequence and reference-guided assembly of ICPL 20096 (resistant parent). Seq-BSA has provided seven candidate SNPs for FW and SMD resistance in pigeonpea. In parallel, four additional genotypes were re-sequenced and their combined analysis with R- and S-bulks has provided a total of 8362 nonsynonymous (ns) SNPs. Of 8362 nsSNPs, 60 were found within the 2-Mb flanking regions of seven candidate SNPs identified through Seq-BSA. Haplotype analysis narrowed down to eight nsSNPs in seven genes. These eight nsSNPs were further validated by re-sequencing 11 genotypes that are resistant and susceptible to FW and SMD. This analysis revealed association of four candidate nsSNPs in four genes with FW resistance and four candidate nsSNPs in three genes with SMD resistance. Further, In silico protein analysis and expression profiling identified two most promising candidate genes namely C.cajan_01839 for SMD resistance and C.cajan_03203 for FW resistance. Identified candidate genomic regions/SNPs will be useful for genomics-assisted breeding in pigeonpea. © 2015 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
A Novel Strategy for Selection and Validation of Reference Genes in Dynamic Multidimensional Experimental Design in Yeast

PubMed Central

Cankorur-Cetinkaya, Ayca; Dereli, Elif; Eraslan, Serpil; Karabekmez, Erkan; Dikicioglu, Duygu; Kirdar, Betul

2012-01-01

Background Understanding the dynamic mechanism behind the transcriptional organization of genes in response to varying environmental conditions requires time-dependent data. The dynamic transcriptional response obtained by real-time RT-qPCR experiments could only be correctly interpreted if suitable reference genes are used in the analysis. The lack of available studies on the identification of candidate reference genes in dynamic gene expression studies necessitates the identification and the verification of a suitable gene set for the analysis of transient gene expression response. Principal Findings In this study, a candidate reference gene set for RT-qPCR analysis of dynamic transcriptional changes in Saccharomyces cerevisiae was determined using 31 different publicly available time series transcriptome datasets. Ten of the twelve candidates (TPI1, FBA1, CCW12, CDC19, ADH1, PGK1, GCN4, PDC1, RPS26A and ARF1) we identified were not previously reported as potential reference genes. Our method also identified the commonly used reference genes ACT1 and TDH3. The most stable reference genes from this pool were determined as TPI1, FBA1, CDC19 and ACT1 in response to a perturbation in the amount of available glucose and as FBA1, TDH3, CCW12 and ACT1 in response to a perturbation in the amount of available ammonium. The use of these newly proposed gene sets outperformed the use of common reference genes in the determination of dynamic transcriptional response of the target genes, HAP4 and MEP2, in response to relaxation from glucose and ammonium limitations, respectively. Conclusions A candidate reference gene set to be used in dynamic real-time RT-qPCR expression profiling in yeast was proposed for the first time in the present study. Suitable pools of stable reference genes to be used under different experimental conditions could be selected from this candidate set in order to successfully determine the expression profiles for the genes of interest. PMID:22675547
25 years ago: CCR scientists discover first gene linked to kidney cancer | Center for Cancer Research

Cancer.gov

Twenty-five years ago, NCI scientists uncovered the VHL gene, a gene whose mutation can lead to the development of kidney tumors. The discovery, the result of a decade-long partnership between CCR scientists, including W. Marston Linehan, M.D., Chief of CCR’s Urologic Oncology Branch, and families affected by the disease, paved the way for new targeted therapies that have
Schizophrenia, vitamin D, and brain development.

PubMed

Mackay-Sim, Alan; Féron, François; Eyles, Darryl; Burne, Thomas; McGrath, John

2004-01-01

Schizophrenia research is invigorated at present by the recent discovery of several plausible candidate susceptibility genes identified from genetic linkage and gene expression studies of brains from persons with schizophrenia. It is a current challenge to reconcile this gathering evidence for specific candidate susceptibility genes with the "neurodevelopmental hypothesis," which posits that schizophrenia arises from gene-environment interactions that disrupt brain development. We make the case here that schizophrenia may result not from numerous genes of small effect, but a few genes of transcriptional regulation acting during brain development. In particular we propose that low vitamin D during brain development interacts with susceptibility genes to alter the trajectory of brain development, probably by epigenetic regulation that alters gene expression throughout adult life. Vitamin D is an attractive "environmental" candidate because it appears to explain several key epidemiological features of schizophrenia. Vitamin D is an attractive "genetic" candidate because its nuclear hormone receptor regulates gene expression and nervous system development. The polygenic quality of schizophrenia, with linkage to many genes of small effect, maybe brought together via this "vitamin D hypothesis." We also discuss the possibility of a broader set of environmental and genetic factors interacting via the nuclear hormone receptors to affect the development of the brain leading to schizophrenia.
Multi-Dimensional Prioritization of Dental Caries Candidate Genes and Its Enriched Dense Network Modules

PubMed Central

Wang, Quan; Jia, Peilin; Cuenco, Karen T.; Feingold, Eleanor; Marazita, Mary L.; Wang, Lily; Zhao, Zhongming

2013-01-01

A number of genetic studies have suggested numerous susceptibility genes for dental caries over the past decade with few definite conclusions. The rapid accumulation of relevant information, along with the complex architecture of the disease, provides a challenging but also unique opportunity to review and integrate the heterogeneous data for follow-up validation and exploration. In this study, we collected and curated candidate genes from four major categories: association studies, linkage scans, gene expression analyses, and literature mining. Candidate genes were prioritized according to the magnitude of evidence related to dental caries. We then searched for dense modules enriched with the prioritized candidate genes through their protein-protein interactions (PPIs). We identified 23 modules comprising of 53 genes. Functional analyses of these 53 genes revealed three major clusters: cytokine network relevant genes, matrix metalloproteinases (MMPs) family, and transforming growth factor-beta (TGF-β) family, all of which have been previously implicated to play important roles in tooth development and carious lesions. Through our extensive data collection and an integrative application of gene prioritization and PPI network analyses, we built a dental caries-specific sub-network for the first time. Our study provided insights into the molecular mechanisms underlying dental caries. The framework we proposed in this work can be applied to other complex diseases. PMID:24146904
Uncovering Heavily Obscured AGN with WISE and NuSTAR

NASA Astrophysics Data System (ADS)

Hickox, Ryan C.; Carroll, Christopher M.; Yan, Wei; DiPompeo, Michael A.; Hainline, Kevin N.; NuSTAR Obscured AGN Team

2018-01-01

Supermassive black holes gain their mass through accretion as active galactic nuclei (AGN), but it is now clear that a large fraction of this growth is "hidden" behind large columns of gas and dust. Of particular interest are Compton-thick (CT) AGN, with columns NH > 1024 cm-2, that have been difficult to identify using optical or soft X-ray surveys. We will present two studies of heavily obscured AGN that aim to uncover more of the full population of "hidden" growing black holes: (1) Analysis of the spectral energy distributions of millions of galaxies with photometry from WISE (mid-IR), UKIDSS (near-IR), and SDSS (optical), that uncovers large populations of weak or heavily buried AGN, and (2) NuSTAR observations of a sample of candidate highly obscured AGN, selected from WISE and SDSS photometry,and confirmed using SALT and Keck spectroscopy. The NuSTAR data reveal the existence of powerful CT quasars with extremely large columns NH > 1025 cm-2, which may represent a significant fraction of previously hidden black hole growth. This work is supported by NASA grant numbers NNX16AN48G and NNX15AP24G, and the NSF through grant numbers 1515364 and 1554584.
Integrative Approach to Pain Genetics Identifies Pain Sensitivity Loci across Diseases

PubMed Central

Ruau, David; Dudley, Joel T.; Chen, Rong; Phillips, Nicholas G.; Swan, Gary E.; Lazzeroni, Laura C.; Clark, J. David

2012-01-01

Identifying human genes relevant for the processing of pain requires difficult-to-conduct and expensive large-scale clinical trials. Here, we examine a novel integrative paradigm for data-driven discovery of pain gene candidates, taking advantage of the vast amount of existing disease-related clinical literature and gene expression microarray data stored in large international repositories. First, thousands of diseases were ranked according to a disease-specific pain index (DSPI), derived from Medical Subject Heading (MESH) annotations in MEDLINE. Second, gene expression profiles of 121 of these human diseases were obtained from public sources. Third, genes with expression variation significantly correlated with DSPI across diseases were selected as candidate pain genes. Finally, selected candidate pain genes were genotyped in an independent human cohort and prospectively evaluated for significant association between variants and measures of pain sensitivity. The strongest signal was with rs4512126 (5q32, ABLIM3, P = 1.3×10−10) for the sensitivity to cold pressor pain in males, but not in females. Significant associations were also observed with rs12548828, rs7826700 and rs1075791 on 8q22.2 within NCALD (P = 1.7×10−4, 1.8×10−4, and 2.2×10−4 respectively). Our results demonstrate the utility of a novel paradigm that integrates publicly available disease-specific gene expression data with clinical data curated from MEDLINE to facilitate the discovery of pain-relevant genes. This data-derived list of pain gene candidates enables additional focused and efficient biological studies validating additional candidates. PMID:22685391
Genome-wide association studies and epistasis analyses of candidate genes related to age at menarche and age at natural menopause in a Korean population.

PubMed

Pyun, Jung-A; Kim, Sunshin; Cho, Nam H; Koh, InSong; Lee, Jong-Young; Shin, Chol; Kwack, KyuBum

2014-05-01

The aim of this study was to identify polymorphisms and gene-gene interactions that are significantly associated with age at menarche and age at menopause in a Korean population. A total of 3,452 and 1,827 women participated in studies of age at menarche and age at natural menopause, respectively. Linear regression analyses adjusted for residence area were used to perform genome-wide association studies (GWAS), candidate gene association studies, and interactions between the candidate genes for age at menarche and age at natural menopause. In GWAS, four single nucleotide polymorphisms (SNPs; rs7528241, rs1324329, rs11597068, and rs6495785) were strongly associated with age at natural menopause (lowest P = 9.66 × 10). However, GWAS of age at menarche did not reveal any strong associations. In candidate gene association studies, SNPs with P < 0.01 were selected to test their synergistic interactions. For age at natural menopause, there was a significant interaction between intronic SNPs on ADAM metallopeptidase with thrombospondin type I motif 9 (ADAMTS9) and SMAD family member 3 (SMAD3) genes (P = 9.52 × 10). For age at menarche, there were three significant interactions between three intronic SNPs on follicle-stimulating hormone receptor (FSHR) gene and one SNP located at the 3' flanking region of insulin-like growth factor 2 receptor (IGF2R) gene (lowest P = 1.95 × 10). Novel SNPs and synergistic interactions between candidate genes are significantly associated with age at menarche and age at natural menopause in a Korean population.
Implications of the Occurrence of Glitches in Pulsar Free Precession Candidates.

PubMed

Jones, D I; Ashton, G; Prix, R

2017-06-30

The timing properties of radio pulsars provide a unique probe of neutron star interiors. Recent observations have uncovered quasiperiodicities in the timing and pulse properties of some pulsars, a phenomenon that has often been attributed to free precession of the neutron star, with profound implications for the distribution of superfluidity and superconductivity in the star. We advance this program by developing consistency relations between free precession and pulsars glitches, and we show that there are difficulties in reconciling the two phenomena in some precession candidates. This indicates that the precession model used here needs to be modified or some other phenomenon is at work in producing the quasiperiodicities, or even that there is something missing in terms of our understanding of glitches.
The Effects of Selenium Supplementation on Gene Expression Related to Insulin and Lipid in Infertile Polycystic Ovary Syndrome Women Candidate for In Vitro Fertilization: a Randomized, Double-Blind, Placebo-Controlled Trial.

PubMed

Zadeh Modarres, Shahrzad; Heidar, Zahra; Foroozanfard, Fatemeh; Rahmati, Zahra; Aghadavod, Esmat; Asemi, Zatollah

2018-06-01

This study was conducted to evaluate the effects of selenium supplementation on gene expression related to insulin and lipid in infertile women with polycystic ovary syndrome (PCOS) candidate for in vitro fertilization (IVF). This randomized double-blind, placebo-controlled trial was conducted among 40 infertile women with PCOS candidate for IVF. Subjects were randomly allocated into two groups to intake either 200-μg selenium (n = 20) or placebo (n = 20) per day for 8 weeks. Gene expression levels related to insulin and lipid were quantified in lymphocytes of women with PCOS candidate for IVF with RT-PCR method. Results of RT-PCR demonstrated that after the 8-week intervention, compared with the placebo, selenium supplementation upregulated gene expression of peroxisome proliferator-activated receptor gamma (PPAR-γ) (1.06 ± 0.15-fold increase vs. 0.94 ± 0.18-fold reduction, P = 0.02) and glucose transporter 1 (GLUT-1) (1.07 ± 0.20-fold increase vs. 0.87 ± 0.18-fold reduction, P = 0.003) in lymphocytes of women with PCOS candidate for IVF. In addition, compared with the placebo, selenium supplementation downregulated gene expression of low-density lipoprotein receptor (LDLR) (0.88 ± 0.17-fold reduction vs. 1.05 ± 0.22-fold increase, P = 0.01) in lymphocytes of women with PCOS candidate for IVF. We did not observe any significant effect of selenium supplementation on gene expression levels of lipoprotein(a) [LP(a)] in lymphocytes of women with PCOS candidate for IVF. Overall, selenium supplementation for 8 weeks in lymphocytes of women with infertile PCOS candidate for IVF significantly increased gene expression levels of PPAR-γ and GLUT-1 and significantly decreased gene expression levels of LDLR, but did not affect LP(a). http://www.irct.ir : IRCT201704245623N113.
Detecting Novel and Emerging Drug Terms Using Natural Language Processing: A Social Media Corpus Study.

PubMed

Simpson, Sean S; Adams, Nikki; Brugman, Claudia M; Conners, Thomas J

2018-01-08

With the rapid development of new psychoactive substances (NPS) and changes in the use of more traditional drugs, it is increasingly difficult for researchers and public health practitioners to keep up with emerging drugs and drug terms. Substance use surveys and diagnostic tools need to be able to ask about substances using the terms that drug users themselves are likely to be using. Analyses of social media may offer new ways for researchers to uncover and track changes in drug terms in near real time. This study describes the initial results from an innovative collaboration between substance use epidemiologists and linguistic scientists employing techniques from the field of natural language processing to examine drug-related terms in a sample of tweets from the United States. The objective of this study was to assess the feasibility of using distributed word-vector embeddings trained on social media data to uncover previously unknown (to researchers) drug terms. In this pilot study, we trained a continuous bag of words (CBOW) model of distributed word-vector embeddings on a Twitter dataset collected during July 2016 (roughly 884.2 million tokens). We queried the trained word embeddings for terms with high cosine similarity (a proxy for semantic relatedness) to well-known slang terms for marijuana to produce a list of candidate terms likely to function as slang terms for this substance. This candidate list was then compared with an expert-generated list of marijuana terms to assess the accuracy and efficacy of using word-vector embeddings to search for novel drug terminology. The method described here produced a list of 200 candidate terms for the target substance (marijuana). Of these 200 candidates, 115 were determined to in fact relate to marijuana (65 terms for the substance itself, 50 terms related to paraphernalia). This included 30 terms which were used to refer to the target substance in the corpus yet did not appear on the expert-generated list and were therefore considered to be successful cases of uncovering novel drug terminology. Several of these novel terms appear to have been introduced as recently as 1 or 2 months before the corpus time slice used to train the word embeddings. Though the precision of the method described here is low enough as to still necessitate human review of any candidate term lists generated in such a manner, the fact that this process was able to detect 30 novel terms for the target substance based only on one month's worth of Twitter data is highly promising. We see this pilot study as an important proof of concept and a first step toward producing a fully automated drug term discovery system capable of tracking emerging NPS terms in real time. ©Sean S Simpson, Nikki Adams, Claudia M Brugman, Thomas J Conners. Originally published in JMIR Public Health and Surveillance (http://publichealth.jmir.org), 08.01.2018.
Detecting Novel and Emerging Drug Terms Using Natural Language Processing: A Social Media Corpus Study

PubMed Central

Simpson, Sean S; Brugman, Claudia M; Conners, Thomas J

2018-01-01

Background With the rapid development of new psychoactive substances (NPS) and changes in the use of more traditional drugs, it is increasingly difficult for researchers and public health practitioners to keep up with emerging drugs and drug terms. Substance use surveys and diagnostic tools need to be able to ask about substances using the terms that drug users themselves are likely to be using. Analyses of social media may offer new ways for researchers to uncover and track changes in drug terms in near real time. This study describes the initial results from an innovative collaboration between substance use epidemiologists and linguistic scientists employing techniques from the field of natural language processing to examine drug-related terms in a sample of tweets from the United States. Objective The objective of this study was to assess the feasibility of using distributed word-vector embeddings trained on social media data to uncover previously unknown (to researchers) drug terms. Methods In this pilot study, we trained a continuous bag of words (CBOW) model of distributed word-vector embeddings on a Twitter dataset collected during July 2016 (roughly 884.2 million tokens). We queried the trained word embeddings for terms with high cosine similarity (a proxy for semantic relatedness) to well-known slang terms for marijuana to produce a list of candidate terms likely to function as slang terms for this substance. This candidate list was then compared with an expert-generated list of marijuana terms to assess the accuracy and efficacy of using word-vector embeddings to search for novel drug terminology. Results The method described here produced a list of 200 candidate terms for the target substance (marijuana). Of these 200 candidates, 115 were determined to in fact relate to marijuana (65 terms for the substance itself, 50 terms related to paraphernalia). This included 30 terms which were used to refer to the target substance in the corpus yet did not appear on the expert-generated list and were therefore considered to be successful cases of uncovering novel drug terminology. Several of these novel terms appear to have been introduced as recently as 1 or 2 months before the corpus time slice used to train the word embeddings. Conclusions Though the precision of the method described here is low enough as to still necessitate human review of any candidate term lists generated in such a manner, the fact that this process was able to detect 30 novel terms for the target substance based only on one month’s worth of Twitter data is highly promising. We see this pilot study as an important proof of concept and a first step toward producing a fully automated drug term discovery system capable of tracking emerging NPS terms in real time. PMID:29311050
Finding gene regulatory network candidates using the gene expression knowledge base.

PubMed

Venkatesan, Aravind; Tripathi, Sushil; Sanz de Galdeano, Alejandro; Blondé, Ward; Lægreid, Astrid; Mironov, Vladimir; Kuiper, Martin

2014-12-10

Network-based approaches for the analysis of large-scale genomics data have become well established. Biological networks provide a knowledge scaffold against which the patterns and dynamics of 'omics' data can be interpreted. The background information required for the construction of such networks is often dispersed across a multitude of knowledge bases in a variety of formats. The seamless integration of this information is one of the main challenges in bioinformatics. The Semantic Web offers powerful technologies for the assembly of integrated knowledge bases that are computationally comprehensible, thereby providing a potentially powerful resource for constructing biological networks and network-based analysis. We have developed the Gene eXpression Knowledge Base (GeXKB), a semantic web technology based resource that contains integrated knowledge about gene expression regulation. To affirm the utility of GeXKB we demonstrate how this resource can be exploited for the identification of candidate regulatory network proteins. We present four use cases that were designed from a biological perspective in order to find candidate members relevant for the gastrin hormone signaling network model. We show how a combination of specific query definitions and additional selection criteria derived from gene expression data and prior knowledge concerning candidate proteins can be used to retrieve a set of proteins that constitute valid candidates for regulatory network extensions. Semantic web technologies provide the means for processing and integrating various heterogeneous information sources. The GeXKB offers biologists such an integrated knowledge resource, allowing them to address complex biological questions pertaining to gene expression. This work illustrates how GeXKB can be used in combination with gene expression results and literature information to identify new potential candidates that may be considered for extending a gene regulatory network.
The effects of polymorphisms in IL-2, IFN-γ, TGF-β2, IgL, TLR-4, MD-2, and iNOS genes on resistance to Salmonella enteritidis in indigenous chickens.

PubMed

Tohidi, Reza; Idris, Ismail Bin; Panandam, Jothi Malar; Bejo, Mohd Hair

2012-12-01

Salmonella Enteritidis is a major cause of food poisoning worldwide, and poultry products are the main source of S. Enteritidis contamination for humans. Among the numerous strategies for disease control, improving genetic resistance to S. Enteritidis has been the most effective approach. We investigated the association between S. Enteritidis burden in the caecum, spleen, and liver of young indigenous chickens and seven candidate genes, selected on the basis of their critical roles in immunological functions. The genes included those encoding interleukin 2 (IL-2), interferon-γ (IFN-γ), transforming growth factor β2 (TGF-β2), immunoglobulin light chain (IgL), toll-like receptor 4 (TLR-4), myeloid differentiation protein 2 (MD-2), and inducible nitric oxide synthase (iNOS). Two Malaysian indigenous chicken breeds were used as sustainable genetic sources of alleles that are resistant to salmonellosis. The polymerase chain reaction restriction fragment-length polymorphism technique was used to genotype the candidate genes. Three different genotypes were observed in all of the candidate genes, except for MD-2. All of the candidate genes showed the Hardy-Weinberg equilibrium for the two populations. The IL-2-MnlI polymorphism was associated with S. Enteritidis burden in the caecum and spleen. The TGF-β2-RsaI, TLR-4-Sau 96I, and iNOS-AluI polymorphisms were associated with the caecum S. Enteritidis load. The other candidate genes were not associated with S. Enteritidis load in any organ. The results indicate that the IL-2, TGF-β2, TLR-4, and iNOS genes are potential candidates for use in selection programmes for increasing genetic resistance against S. Enteritidis in Malaysian indigenous chickens.
Challenges of ligand identification for the second wave of orphan riboswitch candidates.

PubMed

Greenlee, Etienne B; Stav, Shira; Atilho, Ruben M; Brewer, Kenneth I; Harris, Kimberly A; Malkowski, Sarah N; Mirihana Arachchilage, Gayan; Perkins, Kevin R; Sherlock, Madeline E; Breaker, Ronald R

2018-03-04

Orphan riboswitch candidates are noncoding RNA motifs whose representatives are believed to function as genetic regulatory elements, but whose target ligands have yet to be identified. The study of certain orphans, particularly classes that have resisted experimental validation for many years, has led to the discovery of important biological pathways and processes once their ligands were identified. Previously, we highlighted details for four of the most common and intriguing orphan riboswitch candidates. This facilitated the validation of riboswitches for the signaling molecules c-di-AMP, ZTP, and ppGpp, the metal ion Mn 2+ , and the metabolites guanidine and PRPP. Such studies also yield useful linkages between the ligands sensed by the riboswitches and numerous biochemical pathways. In the current report, we describe the known characteristics of 30 distinct classes of orphan riboswitch candidates - some of which have remained unsolved for over a decade. We also discuss the prospects for uncovering novel biological insights via focused studies on these RNAs. Lastly, we make recommendations for experimental objectives along the path to finding ligands for these mysterious RNAs.
Chromosome biology: conflict management for replication and transcription.

PubMed

Dewar, James M; Walter, Johannes C

2013-03-04

A recent study has uncovered a new mechanism that attenuates DNA replication during periods of heightened gene expression to avoid collisions between replication and transcription. Copyright © 2013 Elsevier Ltd. All rights reserved.

A Modularity-Based Method Reveals Mixed Modules from Chemical-Gene Heterogeneous Network

PubMed Central

Song, Jianglong; Tang, Shihuan; Liu, Xi; Gao, Yibo; Yang, Hongjun; Lu, Peng

2015-01-01

For a multicomponent therapy, molecular network is essential to uncover its specific mode of action from a holistic perspective. The molecular system of a Traditional Chinese Medicine (TCM) formula can be represented by a 2-class heterogeneous network (2-HN), which typically includes chemical similarities, chemical-target interactions and gene interactions. An important premise of uncovering the molecular mechanism is to identify mixed modules from complex chemical-gene heterogeneous network of a TCM formula. We thus proposed a novel method (MixMod) based on mixed modularity to detect accurate mixed modules from 2-HNs. At first, we compared MixMod with Clauset-Newman-Moore algorithm (CNM), Markov Cluster algorithm (MCL), Infomap and Louvain on benchmark 2-HNs with known module structure. Results showed that MixMod was superior to other methods when 2-HNs had promiscuous module structure. Then these methods were tested on a real drug-target network, in which 88 disease clusters were regarded as real modules. MixMod could identify the most accurate mixed modules from the drug-target 2-HN (normalized mutual information 0.62 and classification accuracy 0.4524). In the end, MixMod was applied to the 2-HN of Buchang naoxintong capsule (BNC) and detected 49 mixed modules. By using enrichment analysis, we investigated five mixed modules that contained primary constituents of BNC intestinal absorption liquid. As a matter of fact, the findings of in vitro experiments using BNC intestinal absorption liquid were found to highly accord with previous analysis. Therefore, MixMod is an effective method to detect accurate mixed modules from chemical-gene heterogeneous networks and further uncover the molecular mechanism of multicomponent therapies, especially TCM formulae. PMID:25927435
Drought response in wheat: key genes and regulatory mechanisms controlling root system architecture and transpiration efficiency

NASA Astrophysics Data System (ADS)

Kulkarni, Manoj; Soolanayakanahally, Raju; Ogawa, Satoshi; Uga, Yusaku; Selvaraj, Michael G.; Kagale, Sateesh

2017-12-01

Abiotic stresses such as drought, heat, salinity and flooding threaten global food security. Crop genetic improvement with increased resilience to abiotic stresses is a critical component of crop breeding strategies. Wheat is an important cereal crop and a staple food source globally. Enhanced drought tolerance in wheat is critical for sustainable food production and global food security. Recent advances in drought tolerance research have uncovered many key genes and transcription regulators governing morpho-physiological traits. Genes controlling root architecture and stomatal development play an important role in soil moisture extraction and its retention, and therefore have been targets of molecular breeding strategies for improving drought tolerance. In this systematic review, we have summarized evidence of beneficial contributions of root and stomatal traits to plant adaptation to drought stress. Specifically, we discuss a few key genes such as DRO1 in rice and ERECTA in Arabidopsis and rice that were identified to be the enhancers of drought tolerance via regulation of root traits and transpiration efficiency. Additionally, we highlight several transcription factor families, such as ERF (ethylene response factors), DREB (dehydration responsive element binding), ZFP (zinc finger proteins), WRKY and MYB that were identified to be both positive and negative regulators of drought responses in wheat, rice, maize and/or Arabidopsis. The overall aim of this review was to provide an overview of candidate genes that have been tested as regulators of drought response in plants. The lack of a reference genome sequence for wheat and nontransgenic approaches for manipulation of gene functions in the past had impeded high-resolution interrogation of functional elements, including genes and QTLs, and their application in cultivar improvement. The recent developments in wheat genomics and reverse genetics, including the availability of a gold-standard reference genome sequence and advent genome editing technologies, are expected to aid in deciphering of the functional roles of genes and regulatory networks underlying adaptive phenological traits, and utilizing the outcomes of such studies in developing drought tolerance cultivars.
Drought Response in Wheat: Key Genes and Regulatory Mechanisms Controlling Root System Architecture and Transpiration Efficiency.

PubMed

Kulkarni, Manoj; Soolanayakanahally, Raju; Ogawa, Satoshi; Uga, Yusaku; Selvaraj, Michael G; Kagale, Sateesh

2017-01-01

Abiotic stresses such as, drought, heat, salinity, and flooding threaten global food security. Crop genetic improvement with increased resilience to abiotic stresses is a critical component of crop breeding strategies. Wheat is an important cereal crop and a staple food source globally. Enhanced drought tolerance in wheat is critical for sustainable food production and global food security. Recent advances in drought tolerance research have uncovered many key genes and transcription regulators governing morpho-physiological traits. Genes controlling root architecture and stomatal development play an important role in soil moisture extraction and its retention, and therefore have been targets of molecular breeding strategies for improving drought tolerance. In this systematic review, we have summarized evidence of beneficial contributions of root and stomatal traits to plant adaptation to drought stress. Specifically, we discuss a few key genes such as, DRO1 in rice and ERECTA in Arabidopsis and rice that were identified to be the enhancers of drought tolerance via regulation of root traits and transpiration efficiency. Additionally, we highlight several transcription factor families, such as, ERF (ethylene response factors), DREB (dehydration responsive element binding), ZFP (zinc finger proteins), WRKY, and MYB that were identified to be both positive and negative regulators of drought responses in wheat, rice, maize, and/or Arabidopsis. The overall aim of this review is to provide an overview of candidate genes that have been identified as regulators of drought response in plants. The lack of a reference genome sequence for wheat and non-transgenic approaches for manipulation of gene functions in wheat in the past had impeded high-resolution interrogation of functional elements, including genes and QTLs, and their application in cultivar improvement. The recent developments in wheat genomics and reverse genetics, including the availability of a gold-standard reference genome sequence and advent of genome editing technologies, are expected to aid in deciphering of the functional roles of genes and regulatory networks underlying adaptive phenological traits, and utilizing the outcomes of such studies in developing drought tolerant cultivars.
The Genetic Basis for Variation in Sensitivity to Lead Toxicity in Drosophila melanogaster

PubMed Central

Zhou, Shanshan; Morozova, Tatiana V.; Hussain, Yasmeen N.; Luoma, Sarah E.; McCoy, Lenovia; Yamamoto, Akihiko; Mackay, Trudy F.C.; Anholt, Robert R.H.

2016-01-01

Background: Lead toxicity presents a worldwide health problem, especially due to its adverse effects on cognitive development in children. However, identifying genes that give rise to individual variation in susceptibility to lead toxicity is challenging in human populations. Objectives: Our goal was to use Drosophila melanogaster to identify evolutionarily conserved candidate genes associated with individual variation in susceptibility to lead exposure. Methods: To identify candidate genes associated with variation in susceptibility to lead toxicity, we measured effects of lead exposure on development time, viability and adult activity in the Drosophila melanogaster Genetic Reference Panel (DGRP) and performed genome-wide association analyses to identify candidate genes. We used mutants to assess functional causality of candidate genes and constructed a genetic network associated with variation in sensitivity to lead exposure, on which we could superimpose human orthologs. Results: We found substantial heritabilities for all three traits and identified candidate genes associated with variation in susceptibility to lead exposure for each phenotype. The genetic architectures that determine variation in sensitivity to lead exposure are highly polygenic. Gene ontology and network analyses showed enrichment of genes associated with early development and function of the nervous system. Conclusions: Drosophila melanogaster presents an advantageous model to study the genetic underpinnings of variation in susceptibility to lead toxicity. Evolutionary conservation of cellular pathways that respond to toxic exposure allows predictions regarding orthologous genes and pathways across phyla. Thus, studies in the D. melanogaster model system can identify candidate susceptibility genes to guide subsequent studies in human populations. Citation: Zhou S, Morozova TV, Hussain YN, Luoma SE, McCoy L, Yamamoto A, Mackay TF, Anholt RR. 2016. The genetic basis for variation in sensitivity to lead toxicity in Drosophila melanogaster. Environ Health Perspect 124:1062–1070; http://dx.doi.org/10.1289/ehp.1510513 PMID:26859824
Secretome Characterization and Correlation Analysis Reveal Putative Pathogenicity Mechanisms and Identify Candidate Avirulence Genes in the Wheat Stripe Rust Fungus Puccinia striiformis f. sp. tritici.

PubMed

Xia, Chongjing; Wang, Meinan; Cornejo, Omar E; Jiwan, Derick A; See, Deven R; Chen, Xianming

2017-01-01

Stripe (yellow) rust, caused by Puccinia striiformis f. sp. tritici ( Pst ), is one of the most destructive diseases of wheat worldwide. Planting resistant cultivars is an effective way to control this disease, but race-specific resistance can be overcome quickly due to the rapid evolving Pst population. Studying the pathogenicity mechanisms is critical for understanding how Pst virulence changes and how to develop wheat cultivars with durable resistance to stripe rust. We re-sequenced 7 Pst isolates and included additional 7 previously sequenced isolates to represent balanced virulence/avirulence profiles for several avirulence loci in seretome analyses. We observed an uneven distribution of heterozygosity among the isolates. Secretome comparison of Pst with other rust fungi identified a large portion of species-specific secreted proteins, suggesting that they may have specific roles when interacting with the wheat host. Thirty-two effectors of Pst were identified from its secretome. We identified candidates for Avr genes corresponding to six Yr genes by correlating polymorphisms for effector genes to the virulence/avirulence profiles of the 14 Pst isolates. The putative AvYr76 was present in the avirulent isolates, but absent in the virulent isolates, suggesting that deleting the coding region of the candidate avirulence gene has produced races virulent to resistance gene Yr76 . We conclude that incorporating avirulence/virulence phenotypes into correlation analysis with variations in genomic structure and secretome, particularly presence/absence polymorphisms of effectors, is an efficient way to identify candidate Avr genes in Pst . The candidate effector genes provide a rich resource for further studies to determine the evolutionary history of Pst populations and the co-evolutionary arms race between Pst and wheat. The Avr candidates identified in this study will lead to cloning avirulence genes in Pst , which will enable us to understand molecular mechanisms underlying Pst -wheat interactions, to determine the effectiveness of resistance genes and further to develop durable resistance to stripe rust.
Array-based comparative genomic hybridization-guided identification of reference genes for normalization of real-time quantitative polymerase chain reaction assay data for lymphomas, histiocytic sarcomas, and osteosarcomas of dogs.

PubMed

Tsai, Pei-Chien; Breen, Matthew

2012-09-01

To identify suitable reference genes for normalization of real-time quantitative PCR (RT-qPCR) assay data for common tumors of dogs. Malignant lymph node (n = 8), appendicular osteosarcoma (9), and histiocytic sarcoma (12) samples and control samples of various nonneoplastic canine tissues. Array-based comparative genomic hybridization (aCGH) data were used to guide selection of 9 candidate reference genes. Expression stability of candidate reference genes and 4 commonly used reference genes was determined for tumor samples with RT-qPCR assays and 3 software programs. LOC611555 was the candidate reference gene with the highest expression stability among the 3 tumor types. Of the commonly used reference genes, expression stability of HPRT was high in histiocytic sarcoma samples, and expression stability of Ubi and RPL32 was high in osteosarcoma samples. Some of the candidate reference genes had higher expression stability than did the commonly used reference genes. Data for constitutively expressed genes with high expression stability are required for normalization of RT-qPCR assay results. Without such data, accurate quantification of gene expression in tumor tissue samples is difficult. Results of the present study indicated LOC611555 may be a useful RT-qPCR assay reference gene for multiple tissue types. Some commonly used reference genes may be suitable for normalization of gene expression data for tumors of dogs, such as lymphomas, osteosarcomas, or histiocytic sarcomas.
Genome-wide association links candidate genes to resistance to Plum Pox Virus in apricot (Prunus armeniaca).

PubMed

Mariette, Stéphanie; Wong Jun Tai, Fabienne; Roch, Guillaume; Barre, Aurélien; Chague, Aurélie; Decroocq, Stéphane; Groppi, Alexis; Laizet, Yec'han; Lambert, Patrick; Tricon, David; Nikolski, Macha; Audergon, Jean-Marc; Abbott, Albert G; Decroocq, Véronique

2016-01-01

In fruit tree species, many important traits have been characterized genetically by using single-family descent mapping in progenies segregating for the traits. However, most mapped loci have not been sufficiently resolved to the individual genes due to insufficient progeny sizes for high resolution mapping and the previous lack of whole-genome sequence resources of the study species. To address this problem for Plum Pox Virus (PPV) candidate resistance gene identification in Prunus species, we implemented a genome-wide association (GWA) approach in apricot. This study exploited the broad genetic diversity of the apricot (Prunus armeniaca) germplasm containing resistance to PPV, next-generation sequence-based genotyping, and the high-quality peach (Prunus persica) genome reference sequence for single nucleotide polymorphism (SNP) identification. The results of this GWA study validated previously reported PPV resistance quantitative trait loci (QTL) intervals, highlighted other potential resistance loci, and resolved each to a limited set of candidate genes for further study. This work substantiates the association genetics approach for resolution of QTL to candidate genes in apricot and suggests that this approach could simplify identification of other candidate genes for other marked trait intervals in this germplasm. © 2015 INRA, UMR 1332 BFP New Phytologist © 2015 New Phytologist Trust.
Mutational Landscape of Candidate Genes in Familial Prostate Cancer

PubMed Central

Johnson, Anna M.; Zuhlke, Kimberly A.; Plotts, Chris; McDonnell, Shannon K.; Middha, Sumit; Riska, Shaun M.; Thibodeau, Stephen N.; Douglas, Julie A.; Cooney, Kathleen A.

2014-01-01

Background Family history is a major risk factor for prostate cancer (PCa), suggesting a genetic component to the disease. However, traditional linkage and association studies have failed to fully elucidate the underlying genetic basis of familial PCa. Methods Here we use a candidate gene approach to identify potential PCa susceptibility variants in whole exome sequencing data from familial PCa cases. Six hundred ninety-seven candidate genes were identified based on function, location near a known chromosome 17 linkage signal, and/or previous association with prostate or other cancers. Single nucleotide variants (SNVs) in these candidate genes were identified in whole exome sequence data from 33 PCa cases from 11 multiplex PCa families (3 cases/family). Results Overall, 4856 candidate gene SNVs were identified, including 1052 missense and 10 nonsense variants. Twenty missense variants were shared by all 3 family members in each family in which they were observed. Additionally, 15 missense variants were shared by 2 of 3 family members and predicted to be deleterious by 5 different algorithms. Four missense variants, BLM Gln123Arg, PARP2 Arg283Gln, LRCC46 Ala295Thr and KIF2B Pro91Leu, and 1 nonsense variant, CYP3A43 Arg441Ter, showed complete co-segregation with PCa status. Twelve additional variants displayed partial co-segregation with PCa. Conclusions Forty-three nonsense and shared, missense variants were identified in our candidate genes. Further research is needed to determine the contribution of these variants to PCa susceptibility. PMID:25111073
Parkinson's disease candidate gene prioritization based on expression profile of midbrain dopaminergic neurons

PubMed Central

2010-01-01

Background Parkinson's disease is the second most common neurodegenerative disorder. The pathological hallmark of the disease is degeneration of midbrain dopaminergic neurons. Genetic association studies have linked 13 human chromosomal loci to Parkinson's disease. Identification of gene(s), as part of the etiology of Parkinson's disease, within the large number of genes residing in these loci can be achieved through several approaches, including screening methods, and considering appropriate criteria. Since several of the indentified Parkinson's disease genes are expressed in substantia nigra pars compact of the midbrain, expression within the neurons of this area could be a suitable criterion to limit the number of candidates and identify PD genes. Methods In this work we have used the combination of findings from six rodent transcriptome analysis studies on the gene expression profile of midbrain dopaminergic neurons and the PARK loci in OMIM (Online Mendelian Inheritance in Man) database, to identify new candidate genes for Parkinson's disease. Results Merging the two datasets, we identified 20 genes within PARK loci, 7 of which are located in an orphan Parkinson's disease locus and one, which had been identified as a disease gene. In addition to identifying a set of candidates for further genetic association studies, these results show that the criteria of expression in midbrain dopaminergic neurons may be used to narrow down the number of genes in PARK loci for such studies. PMID:20716345
Candidate genes for idiopathic epilepsy in four dog breeds.

PubMed

Ekenstedt, Kari J; Patterson, Edward E; Minor, Katie M; Mickelson, James R

2011-04-25

Idiopathic epilepsy (IE) is a naturally occurring and significant seizure disorder affecting all dog breeds. Because dog breeds are genetically isolated populations, it is possible that IE is attributable to common founders and is genetically homogenous within breeds. In humans, a number of mutations, the majority of which are genes encoding ion channels, neurotransmitters, or their regulatory subunits, have been discovered to cause rare, specific types of IE. It was hypothesized that there are simple genetic bases for IE in some purebred dog breeds, specifically in Vizslas, English Springer Spaniels (ESS), Greater Swiss Mountain Dogs (GSMD), and Beagles, and that the gene(s) responsible may, in some cases, be the same as those already discovered in humans. Candidate genes known to be involved in human epilepsy, along with selected additional genes in the same gene families that are involved in murine epilepsy or are expressed in neural tissue, were examined in populations of affected and unaffected dogs. Microsatellite markers in close proximity to each candidate gene were genotyped and subjected to two-point linkage in Vizslas, and association analysis in ESS, GSMD and Beagles. Most of these candidate genes were not significantly associated with IE in these four dog breeds, while a few genes remained inconclusive. Other genes not included in this study may still be causing monogenic IE in these breeds or, like many cases of human IE, the disease in dogs may be likewise polygenic.
Adaptation to climate through flowering phenology: a case study in Medicago truncatula.

PubMed

Burgarella, Concetta; Chantret, Nathalie; Gay, Laurène; Prosperi, Jean-Marie; Bonhomme, Maxime; Tiffin, Peter; Young, Nevin D; Ronfort, Joelle

2016-07-01

Local climatic conditions likely constitute an important selective pressure on genes underlying important fitness-related traits such as flowering time, and in many species, flowering phenology and climatic gradients strongly covary. To test whether climate shapes the genetic variation on flowering time genes and to identify candidate flowering genes involved in the adaptation to environmental heterogeneity, we used a large Medicago truncatula core collection to examine the association between nucleotide polymorphisms at 224 candidate genes and both climate variables and flowering phenotypes. Unlike genome-wide studies, candidate gene approaches are expected to enrich for the number of meaningful trait associations because they specifically target genes that are known to affect the trait of interest. We found that flowering time mediates adaptation to climatic conditions mainly by variation at genes located upstream in the flowering pathways, close to the environmental stimuli. Variables related to the annual precipitation regime reflected selective constraints on flowering time genes better than the other variables tested (temperature, altitude, latitude or longitude). By comparing phenotype and climate associations, we identified 12 flowering genes as the most promising candidates responsible for phenological adaptation to climate. Four of these genes were located in the known flowering time QTL region on chromosome 7. However, climate and flowering associations also highlighted largely distinct gene sets, suggesting different genetic architectures for adaptation to climate and flowering onset. © 2016 John Wiley & Sons Ltd.
Surface Proteins of Gram-Positive Pathogens: Using Crystallography to Uncover Novel Features in Drug and Vaccine Candidates

NASA Astrophysics Data System (ADS)

Baker, Edward N.; Proft, Thomas; Kang, Haejoo

Proteins displayed on the cell surfaces of pathogenic organisms are the front-line troops of bacterial attack, playing critical roles in colonization, infection and virulence. Although such proteins can often be recognized from genome sequence data, through characteristic sequence motifs, their functions are often unknown. One such group of surface proteins is attached to the cell surface of Gram-positive pathogens through the action of sortase enzymes. Some of these proteins are now known to form pili: long filamentous structures that mediate attachment to human cells. Crystallographic analyses of these and other cell surface proteins have uncovered novel features in their structure, assembly and stability, including the presence of inter- and intramolecular isopeptide crosslinks. This improved understanding of structures on the bacterial cell surface offers opportunities for the development of some new drug targets and for novel approaches to vaccine design.
Transcriptome profiling of two maize inbreds with distinct responses to Gibberella ear rot disease to identify candidate resistance genes.

PubMed

Kebede, Aida Z; Johnston, Anne; Schneiderman, Danielle; Bosnich, Whynn; Harris, Linda J

2018-02-09

Gibberella ear rot (GER) is one of the most economically important fungal diseases of maize in the temperate zone due to moldy grain contaminated with health threatening mycotoxins. To develop resistant genotypes and control the disease, understanding the host-pathogen interaction is essential. RNA-Seq-derived transcriptome profiles of fungal- and mock-inoculated developing kernel tissues of two maize inbred lines were used to identify differentially expressed transcripts and propose candidate genes mapping within GER resistance quantitative trait loci (QTL). A total of 1255 transcripts were significantly (P ≤ 0.05) up regulated due to fungal infection in both susceptible and resistant inbreds. A greater number of transcripts were up regulated in the former (1174) than the latter (497) and increased as the infection progressed from 1 to 2 days after inoculation. Focusing on differentially expressed genes located within QTL regions for GER resistance, we identified 81 genes involved in membrane transport, hormone regulation, cell wall modification, cell detoxification, and biosynthesis of pathogenesis related proteins and phytoalexins as candidate genes contributing to resistance. Applying droplet digital PCR, we validated the expression profiles of a subset of these candidate genes from QTL regions contributed by the resistant inbred on chromosomes 1, 2 and 9. By screening global gene expression profiles for differentially expressed genes mapping within resistance QTL regions, we have identified candidate genes for gibberella ear rot resistance on several maize chromosomes which could potentially lead to a better understanding of Fusarium resistance mechanisms.
An Integrative Genetics Approach to Identify Candidate Genes Regulating BMD: Combining Linkage, Gene Expression, and Association

PubMed Central

Farber, Charles R; van Nas, Atila; Ghazalpour, Anatole; Aten, Jason E; Doss, Sudheer; Sos, Brandon; Schadt, Eric E; Ingram-Drake, Leslie; Davis, Richard C; Horvath, Steve; Smith, Desmond J; Drake, Thomas A; Lusis, Aldons J

2009-01-01

Numerous quantitative trait loci (QTLs) affecting bone traits have been identified in the mouse; however, few of the underlying genes have been discovered. To improve the process of transitioning from QTL to gene, we describe an integrative genetics approach, which combines linkage analysis, expression QTL (eQTL) mapping, causality modeling, and genetic association in outbred mice. In C57BL/6J × C3H/HeJ (BXH) F2 mice, nine QTLs regulating femoral BMD were identified. To select candidate genes from within each QTL region, microarray gene expression profiles from individual F2 mice were used to identify 148 genes whose expression was correlated with BMD and regulated by local eQTLs. Many of the genes that were the most highly correlated with BMD have been previously shown to modulate bone mass or skeletal development. Candidates were further prioritized by determining whether their expression was predicted to underlie variation in BMD. Using network edge orienting (NEO), a causality modeling algorithm, 18 of the 148 candidates were predicted to be causally related to differences in BMD. To fine-map QTLs, markers in outbred MF1 mice were tested for association with BMD. Three chromosome 11 SNPs were identified that were associated with BMD within the Bmd11 QTL. Finally, our approach provides strong support for Wnt9a, Rasd1, or both underlying Bmd11. Integration of multiple genetic and genomic data sets can substantially improve the efficiency of QTL fine-mapping and candidate gene identification. PMID:18767929
Mining the bitter melon (momordica charantia l.) seed transcriptome by 454 analysis of non-normalized and normalized cDNA populations for conjugated fatty acid metabolism-related genes

USDA-ARS?s Scientific Manuscript database

Seeds of Momordica charantia (bitter melon) produce high levels of eleostearic acid, an unusual conjugated fatty acid with industrial value. Deep sequencing of non-normalized and normalized cDNAs from developing bitter melon seeds was conducted to uncover key genes required for biotechnological tran...
Relationship between the IQ of People with Prader-Willi Syndrome and that of Their Siblings: Evidence for Imprinted Gene Effects

ERIC Educational Resources Information Center

Whittington, J.; Holland, A.; Webb, T.

2009-01-01

Background: Genetic disorders occasionally provide the means to uncover potential mechanisms linking gene expression and physical or cognitive characteristics or behaviour. Prader-Willi syndrome (PWS) is one such genetic disorder in which differences between the two main genetic subtypes have been documented (e.g. higher verbal IQ in one vs.…
Gene-Environment Interaction in Externalizing Problems among Adolescents: Evidence from the Pelotas 1993 Birth Cohort Study

ERIC Educational Resources Information Center

Kieling, Christian; Hutz, Mara H.; Genro, Julia P.; Polanczyk, Guilherme V.; Anselmi, Luciana; Camey, Suzi; Hallal, Pedro C.; Barros, Fernando C.; Victora, Cesar G.; Menezes, Ana M. B.; Rohde, Luis Augusto

2013-01-01

Background: The study of gene-environment interactions (G by E) is one of the most promising strategies to uncover the origins of mental disorders. Replication of initial findings, however, is essential because there is a strong possibility of publication bias in the literature. In addition, there is a scarcity of research on the topic originated…
Identification of genes specifically or preferentially expressed in maize silk reveals similarity and diversity in transcript abundance of different dry stigmas.

PubMed

Xu, Xiao Hui; Chen, Hao; Sang, Ya Lin; Wang, Fang; Ma, Jun Ping; Gao, Xin-Qi; Zhang, Xian Sheng

2012-07-02

In plants, pollination is a critical step in reproduction. During pollination, constant communication between male pollen and the female stigma is required for pollen adhesion, germination, and tube growth. The detailed mechanisms of stigma-mediated reproductive processes, however, remain largely unknown. Maize (Zea mays L.), one of the world's most important crops, has been extensively used as a model species to study molecular mechanisms of pollen and stigma interaction. A comprehensive analysis of maize silk transcriptome may provide valuable information for investigating stigma functionality. A comparative analysis of expression profiles between maize silk and dry stigmas of other species might reveal conserved and diverse mechanisms that underlie stigma-mediated reproductive processes in various plant species. Transcript abundance profiles of mature silk, mature pollen, mature ovary, and seedling were investigated using RNA-seq. By comparing the transcriptomes of these tissues, we identified 1,427 genes specifically or preferentially expressed in maize silk. Bioinformatic analyses of these genes revealed many genes with known functions in plant reproduction as well as novel candidate genes that encode amino acid transporters, peptide and oligopeptide transporters, and cysteine-rich receptor-like kinases. In addition, comparison of gene sets specifically or preferentially expressed in stigmas of maize, rice (Oryza sativa L.), and Arabidopsis (Arabidopsis thaliana [L.] Heynh.) identified a number of homologous genes involved either in pollen adhesion, hydration, and germination or in initial growth and penetration of pollen tubes into the stigma surface. The comparison also indicated that maize shares a more similar profile and larger number of conserved genes with rice than with Arabidopsis, and that amino acid and lipid transport-related genes are distinctively overrepresented in maize. Many of the novel genes uncovered in this study are potentially involved in stigma-mediated reproductive processes, including genes encoding amino acid transporters, peptide and oligopeptide transporters, and cysteine-rich receptor-like kinases. The data also suggest that dry stigmas share similar mechanisms at early stages of pollen-stigma interaction. Compared with Arabidopsis, maize and rice appear to have more conserved functional mechanisms. Genes involved in amino acid and lipid transport may be responsible for mechanisms in the reproductive process that are unique to maize silk.
Identification of genes specifically or preferentially expressed in maize silk reveals similarity and diversity in transcript abundance of different dry stigmas

PubMed Central

2012-01-01

Background In plants, pollination is a critical step in reproduction. During pollination, constant communication between male pollen and the female stigma is required for pollen adhesion, germination, and tube growth. The detailed mechanisms of stigma-mediated reproductive processes, however, remain largely unknown. Maize (Zea mays L.), one of the world’s most important crops, has been extensively used as a model species to study molecular mechanisms of pollen and stigma interaction. A comprehensive analysis of maize silk transcriptome may provide valuable information for investigating stigma functionality. A comparative analysis of expression profiles between maize silk and dry stigmas of other species might reveal conserved and diverse mechanisms that underlie stigma-mediated reproductive processes in various plant species. Results Transcript abundance profiles of mature silk, mature pollen, mature ovary, and seedling were investigated using RNA-seq. By comparing the transcriptomes of these tissues, we identified 1,427 genes specifically or preferentially expressed in maize silk. Bioinformatic analyses of these genes revealed many genes with known functions in plant reproduction as well as novel candidate genes that encode amino acid transporters, peptide and oligopeptide transporters, and cysteine-rich receptor-like kinases. In addition, comparison of gene sets specifically or preferentially expressed in stigmas of maize, rice (Oryza sativa L.), and Arabidopsis (Arabidopsis thaliana [L.] Heynh.) identified a number of homologous genes involved either in pollen adhesion, hydration, and germination or in initial growth and penetration of pollen tubes into the stigma surface. The comparison also indicated that maize shares a more similar profile and larger number of conserved genes with rice than with Arabidopsis, and that amino acid and lipid transport-related genes are distinctively overrepresented in maize. Conclusions Many of the novel genes uncovered in this study are potentially involved in stigma-mediated reproductive processes, including genes encoding amino acid transporters, peptide and oligopeptide transporters, and cysteine-rich receptor-like kinases. The data also suggest that dry stigmas share similar mechanisms at early stages of pollen-stigma interaction. Compared with Arabidopsis, maize and rice appear to have more conserved functional mechanisms. Genes involved in amino acid and lipid transport may be responsible for mechanisms in the reproductive process that are unique to maize silk. PMID:22748054
Probing the Xenopus laevis inner ear transcriptome for biological function

PubMed Central

2012-01-01

Background The senses of hearing and balance depend upon mechanoreception, a process that originates in the inner ear and shares features across species. Amphibians have been widely used for physiological studies of mechanotransduction by sensory hair cells. In contrast, much less is known of the genetic basis of auditory and vestibular function in this class of animals. Among amphibians, the genus Xenopus is a well-characterized genetic and developmental model that offers unique opportunities for inner ear research because of the amphibian capacity for tissue and organ regeneration. For these reasons, we implemented a functional genomics approach as a means to undertake a large-scale analysis of the Xenopus laevis inner ear transcriptome through microarray analysis. Results Microarray analysis uncovered genes within the X. laevis inner ear transcriptome associated with inner ear function and impairment in other organisms, thereby supporting the inclusion of Xenopus in cross-species genetic studies of the inner ear. The use of gene categories (inner ear tissue; deafness; ion channels; ion transporters; transcription factors) facilitated the assignment of functional significance to probe set identifiers. We enhanced the biological relevance of our microarray data by using a variety of curation approaches to increase the annotation of the Affymetrix GeneChip® Xenopus laevis Genome array. In addition, annotation analysis revealed the prevalence of inner ear transcripts represented by probe set identifiers that lack functional characterization. Conclusions We identified an abundance of targets for genetic analysis of auditory and vestibular function. The orthologues to human genes with known inner ear function and the highly expressed transcripts that lack annotation are particularly interesting candidates for future analyses. We used informatics approaches to impart biologically relevant information to the Xenopus inner ear transcriptome, thereby addressing the impediment imposed by insufficient gene annotation. These findings heighten the relevance of Xenopus as a model organism for genetic investigations of inner ear organogenesis, morphogenesis, and regeneration. PMID:22676585

Information-dependent enrichment analysis reveals time-dependent transcriptional regulation of the estrogen pathway of toxicity.

PubMed

Pendse, Salil N; Maertens, Alexandra; Rosenberg, Michael; Roy, Dipanwita; Fasani, Rick A; Vantangoli, Marguerite M; Madnick, Samantha J; Boekelheide, Kim; Fornace, Albert J; Odwin, Shelly-Ann; Yager, James D; Hartung, Thomas; Andersen, Melvin E; McMullen, Patrick D

2017-04-01

The twenty-first century vision for toxicology involves a transition away from high-dose animal studies to in vitro and computational models (NRC in Toxicity testing in the 21st century: a vision and a strategy, The National Academies Press, Washington, DC, 2007). This transition requires mapping pathways of toxicity by understanding how in vitro systems respond to chemical perturbation. Uncovering transcription factors/signaling networks responsible for gene expression patterns is essential for defining pathways of toxicity, and ultimately, for determining the chemical modes of action through which a toxicant acts. Traditionally, transcription factor identification is achieved via chromatin immunoprecipitation studies and summarized by calculating which transcription factors are statistically associated with up- and downregulated genes. These lists are commonly determined via statistical or fold-change cutoffs, a procedure that is sensitive to statistical power and may not be as useful for determining transcription factor associations. To move away from an arbitrary statistical or fold-change-based cutoff, we developed, in the context of the Mapping the Human Toxome project, an enrichment paradigm called information-dependent enrichment analysis (IDEA) to guide identification of the transcription factor network. We used a test case of activation in MCF-7 cells by 17β estradiol (E2). Using this new approach, we established a time course for transcriptional and functional responses to E2. ERα and ERβ were associated with short-term transcriptional changes in response to E2. Sustained exposure led to recruitment of additional transcription factors and alteration of cell cycle machinery. TFAP2C and SOX2 were the transcription factors most highly correlated with dose. E2F7, E2F1, and Foxm1, which are involved in cell proliferation, were enriched only at 24 h. IDEA should be useful for identifying candidate pathways of toxicity. IDEA outperforms gene set enrichment analysis (GSEA) and provides similar results to weighted gene correlation network analysis, a platform that helps to identify genes not annotated to pathways.
A comprehensive catalogue of the coding and non-coding transcripts of the human inner ear

PubMed Central

Corneveaux, Jason J.; Ohmen, Jeffrey; White, Cory; Allen, April N.; Lusis, Aldons J.; Van Camp, Guy; Huentelman, Matthew J.; Friedman, Rick A.

2015-01-01

The mammalian inner ear consists of the cochlea and the vestibular labyrinth (utricle, saccule, and semicircular canals), which participate in both hearing and balance. Proper development and life-long function of these structures involves a highly complex coordinated system of spatial and temporal gene expression. The characterization of the inner ear transcriptome is likely important for the functional study of auditory and vestibular components, yet, primarily due to tissue unavailability, detailed expression catalogues of the human inner ear remain largely incomplete. We report here, for the first time, comprehensive transcriptome characterization of the adult human cochlea, ampulla, saccule and utricle of the vestibule obtained from patients without hearing abnormalities. Using RNA-Seq, we measured the expression of >50,000 predicted genes corresponding to approximately 200,000 transcripts, in the adult inner ear and compared it to 32 other human tissues. First, we identified genes preferentially expressed in the inner ear, and unique either to the vestibule or cochlea. Next, we examined expression levels of specific groups of potentially interesting RNAs, such as genes implicated in hearing loss, long non-coding RNAs, pseudogenes and transcripts subject to nonsense mediated decay (NMD). We uncover the spatial specificity of expression of these RNAs in the hearing/balance system, and reveal evidence of tissue specific NMD. Lastly, we investigated the non-syndromic deafness loci to which no gene has been mapped, and narrow the list of potential candidates for each locus. These data represent the first high-resolution transcriptome catalogue of the adult human inner ear. A comprehensive identification of coding and non-coding RNAs in the inner ear will enable pathways of auditory and vestibular function to be further defined in the study of hearing and balance. Expression data are freely accessible at https://www.tgen.org/home/research/research-divisions/neurogenomics/supplementary-data/inner-ear-transcriptome.aspx PMID:26341477
A Stratified Transcriptomics Analysis of Polygenic Fat and Lean Mouse Adipose Tissues Identifies Novel Candidate Obesity Genes

PubMed Central

Morton, Nicholas M.; Nelson, Yvonne B.; Michailidou, Zoi; Di Rollo, Emma M.; Ramage, Lynne; Hadoke, Patrick W. F.; Seckl, Jonathan R.; Bunger, Lutz; Horvat, Simon; Kenyon, Christopher J.; Dunbar, Donald R.

2011-01-01

Background Obesity and metabolic syndrome results from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F) mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L) strain. Results To enrich for adipose tissue obesity genes a ‘snap-shot’ pooled-sample transcriptome comparison of key fat depots and non adipose tissues (muscle, liver, kidney) was performed. Known obesity quantitative trait loci (QTL) information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1, and Np3r) as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr) that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria were characterised, revealing novel functional effects consistent with a contribution to obesity. Conclusions A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes contributing to obesity. PMID:21915269
A stratified transcriptomics analysis of polygenic fat and lean mouse adipose tissues identifies novel candidate obesity genes.

PubMed

Morton, Nicholas M; Nelson, Yvonne B; Michailidou, Zoi; Di Rollo, Emma M; Ramage, Lynne; Hadoke, Patrick W F; Seckl, Jonathan R; Bunger, Lutz; Horvat, Simon; Kenyon, Christopher J; Dunbar, Donald R

2011-01-01

Obesity and metabolic syndrome results from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F) mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L) strain. To enrich for adipose tissue obesity genes a 'snap-shot' pooled-sample transcriptome comparison of key fat depots and non adipose tissues (muscle, liver, kidney) was performed. Known obesity quantitative trait loci (QTL) information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1, and Np3r) as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr) that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria were characterised, revealing novel functional effects consistent with a contribution to obesity. A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes contributing to obesity.
A whole genome SNP genotyping by DNA microarray and candidate gene association study for kidney stone disease

PubMed Central

2014-01-01

Background Kidney stone disease (KSD) is a complex disorder with unknown etiology in majority of the patients. Genetic and environmental factors may cause the disease. In the present study, we used DNA microarray to genotype single nucleotide polymorphisms (SNP) and performed candidate gene association analysis to determine genetic variations associated with the disease. Methods A whole genome SNP genotyping by DNA microarray was initially conducted in 101 patients and 105 control subjects. A set of 104 candidate genes reported to be involved in KSD, gathered from public databases and candidate gene association study databases, were evaluated for their variations associated with KSD. Results Altogether 82 SNPs distributed within 22 candidate gene regions showed significant differences in SNP allele frequencies between the patient and control groups (P < 0.05). Of these, 4 genes including BGLAP, AHSG, CD44, and HAO1, encoding osteocalcin, fetuin-A, CD44-molecule and glycolate oxidase 1, respectively, were further assessed for their associations with the disease because they carried high proportion of SNPs with statistical differences of allele frequencies between the patient and control groups within the gene. The total of 26 SNPs showed significant differences of allele frequencies between the patient and control groups and haplotypes associated with disease risk were identified. The SNP rs759330 located 144 bp downstream of BGLAP where it is a predicted microRNA binding site at 3′UTR of PAQR6 – a gene encoding progestin and adipoQ receptor family member VI, was genotyped in 216 patients and 216 control subjects and found to have significant differences in its genotype and allele frequencies (P = 0.0007, OR 2.02 and P = 0.0001, OR 2.02, respectively). Conclusions Our results suggest that these candidate genes are associated with KSD and PAQR6 comes into our view as the most potent candidate since associated SNP rs759330 is located in the miRNA binding site and may affect mRNA expression level. PMID:24886237
Pivotal role of the muscle-contraction pathway in cryptorchidism and evidence for genomic connections with cardiomyopathy pathways in RASopathies.

PubMed

Cannistraci, Carlo V; Ogorevc, Jernej; Zorc, Minja; Ravasi, Timothy; Dovc, Peter; Kunej, Tanja

2013-02-14

Cryptorchidism is the most frequent congenital disorder in male children; however the genetic causes of cryptorchidism remain poorly investigated. Comparative integratomics combined with systems biology approach was employed to elucidate genetic factors and molecular pathways underlying testis descent. Literature mining was performed to collect genomic loci associated with cryptorchidism in seven mammalian species. Information regarding the collected candidate genes was stored in MySQL relational database. Genomic view of the loci was presented using Flash GViewer web tool (http://gmod.org/wiki/Flashgviewer/). DAVID Bioinformatics Resources 6.7 was used for pathway enrichment analysis. Cytoscape plug-in PiNGO 1.11 was employed for protein-network-based prediction of novel candidate genes. Relevant protein-protein interactions were confirmed and visualized using the STRING database (version 9.0). The developed cryptorchidism gene atlas includes 217 candidate loci (genes, regions involved in chromosomal mutations, and copy number variations) identified at the genomic, transcriptomic, and proteomic level. Human orthologs of the collected candidate loci were presented using a genomic map viewer. The cryptorchidism gene atlas is freely available online: http://www.integratomics-time.com/cryptorchidism/. Pathway analysis suggested the presence of twelve enriched pathways associated with the list of 179 literature-derived candidate genes. Additionally, a list of 43 network-predicted novel candidate genes was significantly associated with four enriched pathways. Joint pathway analysis of the collected and predicted candidate genes revealed the pivotal importance of the muscle-contraction pathway in cryptorchidism and evidence for genomic associations with cardiomyopathy pathways in RASopathies. The developed gene atlas represents an important resource for the scientific community researching genetics of cryptorchidism. The collected data will further facilitate development of novel genetic markers and could be of interest for functional studies in animals and human. The proposed network-based systems biology approach elucidates molecular mechanisms underlying co-presence of cryptorchidism and cardiomyopathy in RASopathies. Such approach could also aid in molecular explanation of co-presence of diverse and apparently unrelated clinical manifestations in other syndromes.
Genome-Wide Prediction of the Polymorphic Ser Gene Family in Tetrahymena thermophila Based on Motif Analysis

PubMed Central

Ponsuwanna, Patrath; Kümpornsin, Krittikorn; Chookajorn, Thanat

2014-01-01

Even though antigenic variation is employed among parasitic protozoa for host immune evasion, Tetrahymena thermophila, a free-living ciliate, can also change its surface protein antigens. These cysteine-rich glycosylphosphatidylinositol (GPI)-linked surface proteins are encoded by a family of polymorphic Ser genes. Despite the availability of T. thermophila genome, a comprehensive analysis of the Ser family is limited by its high degree of polymorphism. In order to overcome this problem, a new approach was adopted by searching for Ser candidates with common motif sequences, namely length-specific repetitive cysteine pattern and GPI anchor site. The candidate genes were phylogenetically compared with the previously identified Ser genes and classified into subtypes. Ser candidates were often found to be located as tandem arrays of the same subtypes on several chromosomal scaffolds. Certain Ser candidates located in the same chromosomal arrays were transcriptionally expressed at specific T. thermophila developmental stages. These Ser candidates selected by the motif analysis approach can form the foundation for a systematic identification of the entire Ser gene family, which will contribute to the understanding of their function and the basis of T. thermophila antigenic variation. PMID:25133747
Analysis of Craniocardiac Malformations in Xenopus using Optical Coherence Tomography

PubMed Central

Deniz, Engin; Jonas, Stephan; Hooper, Michael; N. Griffin, John; Choma, Michael A.; Khokha, Mustafa K.

2017-01-01

Birth defects affect 3% of children in the United States. Among the birth defects, congenital heart disease and craniofacial malformations are major causes of mortality and morbidity. Unfortunately, the genetic mechanisms underlying craniocardiac malformations remain largely uncharacterized. To address this, human genomic studies are identifying sequence variations in patients, resulting in numerous candidate genes. However, the molecular mechanisms of pathogenesis for most candidate genes are unknown. Therefore, there is a need for functional analyses in rapid and efficient animal models of human disease. Here, we coupled the frog Xenopus tropicalis with Optical Coherence Tomography (OCT) to create a fast and efficient system for testing craniocardiac candidate genes. OCT can image cross-sections of microscopic structures in vivo at resolutions approaching histology. Here, we identify optimal OCT imaging planes to visualize and quantitate Xenopus heart and facial structures establishing normative data. Next we evaluate known human congenital heart diseases: cardiomyopathy and heterotaxy. Finally, we examine craniofacial defects by a known human teratogen, cyclopamine. We recapitulate human phenotypes readily and quantify the functional and structural defects. Using this approach, we can quickly test human craniocardiac candidate genes for phenocopy as a critical first step towards understanding disease mechanisms of the candidate genes. PMID:28195132
Detection limit of intragenic deletions with targeted array comparative genomic hybridization

PubMed Central

2013-01-01

Background Pathogenic mutations range from single nucleotide changes to deletions or duplications that encompass a single exon to several genes. The use of gene-centric high-density array comparative genomic hybridization (aCGH) has revolutionized the detection of intragenic copy number variations. We implemented an exon-centric design of high-resolution aCGH to detect single- and multi-exon deletions and duplications in a large set of genes using the OGT 60 K and 180 K arrays. Here we describe the molecular characterization and breakpoint mapping of deletions at the smaller end of the detectable range in several genes using aCGH. Results The method initially implemented to detect single to multiple exon deletions, was able to detect deletions much smaller than anticipated. The selected deletions we describe vary in size, ranging from over 2 kb to as small as 12 base pairs. The smallest of these deletions are only detectable after careful manual review during data analysis. Suspected deletions smaller than the detection size for which the method was optimized, were rigorously followed up and confirmed with PCR-based investigations to uncover the true detection size limit of intragenic deletions with this technology. False-positive deletion calls often demonstrated single nucleotide changes or an insertion causing lower hybridization of probes demonstrating the sensitivity of aCGH. Conclusions With optimizing aCGH design and careful review process, aCGH can uncover intragenic deletions as small as dozen bases. These data provide insight that will help optimize probe coverage in array design and illustrate the true assay sensitivity. Mapping of the breakpoints confirms smaller deletions and contributes to the understanding of the mechanism behind these events. Our knowledge of the mutation spectra of several genes can be expected to change as previously unrecognized intragenic deletions are uncovered. PMID:24304607
Using Association Mapping in Teosinte (Zea Mays ssp Parviglumis) to Investigate the Function of Selection-Candidate Genes

USDA-ARS?s Scientific Manuscript database

Large-scale screens of the maize genome identified 48 genes that show the putative signature of artificial selection during maize domestication or improvement. These selection-candidate genes may act as quantitative trait loci (QTL) that control the phenotypic differences between maize and its proge...
PINTA: a web server for network-based gene prioritization from expression data

PubMed Central

Nitsch, Daniela; Tranchevent, Léon-Charles; Gonçalves, Joana P.; Vogt, Josef Korbinian; Madeira, Sara C.; Moreau, Yves

2011-01-01

PINTA (available at http://www.esat.kuleuven.be/pinta/; this web site is free and open to all users and there is no login requirement) is a web resource for the prioritization of candidate genes based on the differential expression of their neighborhood in a genome-wide protein–protein interaction network. Our strategy is meant for biological and medical researchers aiming at identifying novel disease genes using disease specific expression data. PINTA supports both candidate gene prioritization (starting from a user defined set of candidate genes) as well as genome-wide gene prioritization and is available for five species (human, mouse, rat, worm and yeast). As input data, PINTA only requires disease specific expression data, whereas various platforms (e.g. Affymetrix) are supported. As a result, PINTA computes a gene ranking and presents the results as a table that can easily be browsed and downloaded by the user. PMID:21602267
Identifying positive selection candidate loci for high-altitude adaptation in Andean populations

PubMed Central

2009-01-01

High-altitude environments (>2,500 m) provide scientists with a natural laboratory to study the physiological and genetic effects of low ambient oxygen tension on human populations. One approach to understanding how life at high altitude has affected human metabolism is to survey genome-wide datasets for signatures of natural selection. In this work, we report on a study to identify selection-nominated candidate genes involved in adaptation to hypoxia in one highland group, Andeans from the South American Altiplano. We analysed dense microarray genotype data using four test statistics that detect departures from neutrality. Using a candidate gene, single nucleotide polymorphism-based approach, we identified genes exhibiting preliminary evidence of recent genetic adaptation in this population. These included genes that are part of the hypoxia-inducible transcription factor (HIF) pathway, a biochemical pathway involved in oxygen homeostasis, as well as three other genomic regions previously not known to be associated with high-altitude phenotypes. In addition to identifying selection-nominated candidate genes, we also tested whether the HIF pathway shows evidence of natural selection. Our results indicate that the genes of this biochemical pathway as a group show no evidence of having evolved in response to hypoxia in Andeans. Results from particular HIF-targeted genes, however, suggest that genes in this pathway could play a role in Andean adaptation to high altitude, even if the pathway as a whole does not show higher relative rates of evolution. These data suggest a genetic role in high-altitude adaptation and provide a basis for genotype/phenotype association studies that are necessary to confirm the role of putative natural selection candidate genes and gene regions in adaptation to altitude. PMID:20038496
Analysis of 60 reported glioma risk SNPs replicates published GWAS findings but fails to replicate associations from published candidate-gene studies.

PubMed

Walsh, Kyle M; Anderson, Erik; Hansen, Helen M; Decker, Paul A; Kosel, Matt L; Kollmeyer, Thomas; Rice, Terri; Zheng, Shichun; Xiao, Yuanyuan; Chang, Jeffrey S; McCoy, Lucie S; Bracci, Paige M; Wiemels, Joe L; Pico, Alexander R; Smirnov, Ivan; Lachance, Daniel H; Sicotte, Hugues; Eckel-Passow, Jeanette E; Wiencke, John K; Jenkins, Robert B; Wrensch, Margaret R

2013-02-01

Genomewide association studies (GWAS) and candidate-gene studies have implicated single-nucleotide polymorphisms (SNPs) in at least 45 different genes as putative glioma risk factors. Attempts to validate these associations have yielded variable results and few genetic risk factors have been consistently replicated. We conducted a case-control study of Caucasian glioma cases and controls from the University of California San Francisco (810 cases, 512 controls) and the Mayo Clinic (852 cases, 789 controls) in an attempt to replicate previously reported genetic risk factors for glioma. Sixty SNPs selected from the literature (eight from GWAS and 52 from candidate-gene studies) were successfully genotyped on an Illumina custom genotyping panel. Eight SNPs in/near seven different genes (TERT, EGFR, CCDC26, CDKN2A, PHLDB1, RTEL1, TP53) were significantly associated with glioma risk in the combined dataset (P < 0.05), with all associations in the same direction as in previous reports. Several SNP associations showed considerable differences across histologic subtype. All eight successfully replicated associations were first identified by GWAS, although none of the putative risk SNPs from candidate-gene studies was associated in the full case-control sample (all P values > 0.05). Although several confirmed associations are located near genes long known to be involved in gliomagenesis (e.g., EGFR, CDKN2A, TP53), these associations were first discovered by the GWAS approach and are in noncoding regions. These results highlight that the deficiencies of the candidate-gene approach lay in selecting both appropriate genes and relevant SNPs within these genes. © 2012 WILEY PERIODICALS, INC.
Indel-seq: a fast-forward genetics approach for identification of trait-associated putative candidate genomic regions and its application in pigeonpea (Cajanus cajan).

PubMed

Singh, Vikas K; Khan, Aamir W; Saxena, Rachit K; Sinha, Pallavi; Kale, Sandip M; Parupalli, Swathi; Kumar, Vinay; Chitikineni, Annapurna; Vechalapu, Suryanarayana; Sameer Kumar, Chanda Venkata; Sharma, Mamta; Ghanta, Anuradha; Yamini, Kalinati Narasimhan; Muniswamy, Sonnappa; Varshney, Rajeev K

2017-07-01

Identification of candidate genomic regions associated with target traits using conventional mapping methods is challenging and time-consuming. In recent years, a number of single nucleotide polymorphism (SNP)-based mapping approaches have been developed and used for identification of candidate/putative genomic regions. However, in the majority of these studies, insertion-deletion (Indel) were largely ignored. For efficient use of Indels in mapping target traits, we propose Indel-seq approach, which is a combination of whole-genome resequencing (WGRS) and bulked segregant analysis (BSA) and relies on the Indel frequencies in extreme bulks. Deployment of Indel-seq approach for identification of candidate genomic regions associated with fusarium wilt (FW) and sterility mosaic disease (SMD) resistance in pigeonpea has identified 16 Indels affecting 26 putative candidate genes. Of these 26 affected putative candidate genes, 24 genes showed effect in the upstream/downstream of the genic region and two genes showed effect in the genes. Validation of these 16 candidate Indels in other FW- and SMD-resistant and FW- and SMD-susceptible genotypes revealed a significant association of five Indels (three for FW and two for SMD resistance). Comparative analysis of Indel-seq with other genetic mapping approaches highlighted the importance of the approach in identification of significant genomic regions associated with target traits. Therefore, the Indel-seq approach can be used for quick and precise identification of candidate genomic regions for any target traits in any crop species. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Candidate EDA targets revealed by expression profiling of primary keratinocytes from Tabby mutant mice

PubMed Central

Esibizione, Diana; Cui, Chang-Yi; Schlessinger, David

2009-01-01

EDA, the gene mutated in anhidrotic ectodermal dysplasia, encodes ectodysplasin, a TNF superfamily member that activates NF-kB mediated transcription. To identify EDA target genes, we have earlier used expression profiling to infer genes differentially expressed at various developmental time points in Tabby (Eda-deficient) compared to wild-type mouse skin. To increase the resolution to find genes whose expression may be restricted to epidermal cells, we have now extended studies to primary keratinocyte cultures established from E19 wild-type and Tabby skin. Using microarrays bearing 44,000 gene probes, we found 385 preliminary candidate genes whose expression was significantly affected by Eda loss. By comparing expression profiles to those from Eda-A1 transgenic skin, we restricted the list to 38 “candidate EDA targets”, 14 of which were already known to be expressed in hair follicles or epidermis. We confirmed expression changes for 3 selected genes, Tbx1, Bmp7, and Jag1, both in keratinocytes and in whole skin, by Q-PCR and Western blotting analyses. Thus, by the analysis of keratinocytes, novel candidate pathways downstream of EDA were detected. PMID:18848976
A comprehensive study of the genomic differentiation between temperate Dent and Flint maize.

PubMed

Unterseer, Sandra; Pophaly, Saurabh D; Peis, Regina; Westermeier, Peter; Mayer, Manfred; Seidel, Michael A; Haberer, Georg; Mayer, Klaus F X; Ordas, Bernardo; Pausch, Hubert; Tellier, Aurélien; Bauer, Eva; Schön, Chris-Carolin

2016-07-08

Dent and Flint represent two major germplasm pools exploited in maize breeding. Several traits differentiate the two pools, like cold tolerance, early vigor, and flowering time. A comparative investigation of their genomic architecture relevant for quantitative trait expression has not been reported so far. Understanding the genomic differences between germplasm pools may contribute to a better understanding of the complementarity in heterotic patterns exploited in hybrid breeding and of mechanisms involved in adaptation to different environments. We perform whole-genome screens for signatures of selection specific to temperate Dent and Flint maize by comparing high-density genotyping data of 70 American and European Dent and 66 European Flint inbred lines. We find 2.2 % and 1.4 % of the genes are under selective pressure, respectively, and identify candidate genes associated with agronomic traits known to differ between the two pools. Taking flowering time as an example for the differentiation between Dent and Flint, we investigate candidate genes involved in the flowering network by phenotypic analyses in a Dent-Flint introgression library and find that the Flint haplotypes of the candidates promote earlier flowering. Within the flowering network, the majority of Flint candidates are associated with endogenous pathways in contrast to Dent candidate genes, which are mainly involved in response to environmental factors like light and photoperiod. The diversity patterns of the candidates in a unique panel of more than 900 individuals from 38 European landraces indicate a major contribution of landraces from France, Germany, and Spain to the candidate gene diversity of the Flint elite lines. In this study, we report the investigation of pool-specific differences between temperate Dent and Flint on a genome-wide scale. The identified candidate genes represent a promising source for the functional investigation of pool-specific haplotypes in different genetic backgrounds and for the evaluation of their potential for future crop improvement like the adaptation to specific environments.
Identification of candidate genes associated with fibromyalgia susceptibility in southern Spanish women: the al-Ándalus project.

PubMed

Estévez-López, Fernando; Camiletti-Moirón, Daniel; Aparicio, Virginia A; Segura-Jiménez, Víctor; Álvarez-Gallardo, Inmaculada C; Soriano-Maldonado, Alberto; Borges-Cosic, Milkana; Acosta-Manzano, Pedro; Geenen, Rinie; Delgado-Fernández, Manuel; Martínez-González, Luis J; Ruiz, Jonatan R; Álvarez-Cubero, María J

2018-02-27

Candidate-gene studies on fibromyalgia susceptibility often include a small number of single nucleotide polymorphisms (SNPs), which is a limitation. Moreover, there is a paucity of evidence in Europe. Therefore, we compared genotype frequencies of candidate SNPs in a well-characterised sample of Spanish women with fibromyalgia and healthy non-fibromyalgia women. A total of 314 women with a diagnosis of fibromyalgia (cases) and 112 non-fibromyalgia healthy (controls) women participated in this candidate-gene study. Buccal swabs were collected for DNA extraction. Using TaqMan™ OpenArray™, we analysed 61 SNPs of 33 genes related to fibromyalgia susceptibility, symptoms, or potential mechanisms. We observed that the rs841 and rs1799971 GG genotype was more frequently observed in fibromyalgia than in controls (p = 0.04 and p = 0.02, respectively). The rs2097903 AT/TT genotypes were also more often present in the fibromyalgia participants than in their control peers (p = 0.04). There were no differences for the remaining SNPs. We identified, for the first time, associations of the rs841 (guanosine triphosphate cyclohydrolase 1 gene) and rs2097903 (catechol-O-methyltransferase gene) SNPs with higher risk of fibromyalgia susceptibility. We also confirmed that the rs1799971 SNP (opioid receptor μ1 gene) might confer genetic risk of fibromyalgia. We did not adjust for multiple comparisons, which would be too stringent and yield to non-significant differences in the genotype frequencies between cases and controls. Our findings may be biologically meaningful and informative, and should be further investigated in other populations. Of particular interest is to replicate the present study in a larger independent sample to confirm or refute our findings. On the other hand, by including 61 SNPs of 33 candidate-genes with a strong rationale (they were previously investigated in relation to fibromyalgia susceptibility, symptoms or potential mechanisms), the present research is the most comprehensive candidate-gene study on fibromyalgia susceptibility to date.
Frequent mutation of histone-modifying genes in non-Hodgkin lymphoma | Office of Cancer Genomics

Cancer.gov

In a recent Nature article, Morin et al. uncovered a novel role for chromatin modification in driving the progression of two non-Hodgkin lymphomas (NHLs), follicular lymphoma and diffuse large B-cell lymphoma. Through DNA and RNA sequencing of 117 tumor samples and 10 assorted cell lines, the authors identified and validated 109 genes with multiple mutations in these B-cell NHLs. Of the 109 genes, several genes not previously linked to lymphoma demonstrated positive selection for mutation including two genes involved in histone modification, MLL2 and MEF2B.
Fusion genes in solid tumors: an emerging target for cancer diagnosis and treatment.

PubMed

Parker, Brittany C; Zhang, Wei

2013-11-01

Studies over the past decades have uncovered fusion genes, a class of oncogenes that provide immense diagnostic and therapeutic advantages because of their tumor-specific expression. Originally associated with hemotologic cancers, fusion genes have recently been discovered in a wide array of solid tumors, including sarcomas, carcinomas, and tumors of the central nervous system. Fusion genes are attractive as both therapeutic targets and diagnostic tools due to their inherent expression in tumor tissue alone. Therefore, the discovery and elucidation of fusion genes in various cancer types may provide more effective therapies in the future for cancer patients.
Comparative molecular analyses of select pH- and osmoregulatory genes in three freshwater crayfish Cherax quadricarinatus, C. destructor and C. cainii.

PubMed

Ali, Muhammad Y; Pavasovic, Ana; Dammannagoda, Lalith K; Mather, Peter B; Prentis, Peter J

2017-01-01

Systemic acid-base balance and osmotic/ionic regulation in decapod crustaceans are in part maintained by a set of transport-related enzymes such as carbonic anhydrase (CA), Na + /K + -ATPase (NKA), H + -ATPase (HAT), Na + /K + /2Cl - cotransporter (NKCC), Na + /Cl - /HCO[Formula: see text] cotransporter (NBC), Na + /H + exchanger (NHE), Arginine kinase (AK), Sarcoplasmic Ca +2 -ATPase (SERCA) and Calreticulin (CRT). We carried out a comparative molecular analysis of these genes in three commercially important yet eco-physiologically distinct freshwater crayfish , Cherax quadricarinatus, C. destructor and C. cainii , with the aim to identify mutations in these genes and determine if observed patterns of mutations were consistent with the action of natural selection. We also conducted a tissue-specific expression analysis of these genes across seven different organs, including gills, hepatopancreas, heart, kidney, liver, nerve and testes using NGS transcriptome data. The molecular analysis of the candidate genes revealed a high level of sequence conservation across the three Cherax sp. Hyphy analysis revealed that all candidate genes showed patterns of molecular variation consistent with neutral evolution. The tissue-specific expression analysis showed that 46% of candidate genes were expressed in all tissue types examined, while approximately 10% of candidate genes were only expressed in a single tissue type. The largest number of genes was observed in nerve (84%) and gills (78%) and the lowest in testes (66%). The tissue-specific expression analysis also revealed that most of the master genes regulating pH and osmoregulation (CA, NKA, HAT, NKCC, NBC, NHE) were expressed in all tissue types indicating an important physiological role for these genes outside of osmoregulation in other tissue types. The high level of sequence conservation observed in the candidate genes may be explained by the important role of these genes as well as potentially having a number of other basic physiological functions in different tissue types.

Identification of an miRNA candidate reflects the possible significance of transcribed microsatellites in the hairpin precursors of black pepper.

PubMed

Joy, Nisha; Soniya, Eppurathu Vasudevan

2012-06-01

Plant miRNAs (18-24nt) are generated by the RNase III-type Dicer endonuclease from the endogenous hairpin precursors ('pre-miRNAs') with significant regulatory functions. The transcribed regions display a higher frequency of microsatellites, when compared to other regions of the genomic DNA. Simple sequence repeats (SSRs) resulting from replication slippage occurring in transcripts affect the expression of genes. The available experimental evidence for the incidence of SSRs in the miRNA precursors is limited. Considering the potential significance of SSRs in the miRNA genes, we carried out a preliminary analysis to verify the presence of SSRs in the pri-miRNAs of black pepper (Piper nigrum L.). We isolated a (CT) dinucleotide SSR bearing transcript using SMART strategy. The transcript was predicted to be a 'pri-miRNA candidate' with Dicer sites based on miRNA prediction tools and MFOLD structural predictions. The presence of this 'miRNA candidate' was confirmed by real-time TaqMan assays. The upstream sequence of the 'miRNA candidate' by genome walking when subjected to PlantCARE showed the presence of certain promoter elements, and the deduced amino acid showed significant similarity with NAP1 gene, which affects the transcription of many genes. Moreover the hairpin-like precursor overlapped the neighbouring NAP1 gene. In silico analysis revealed distinct putative functions for the 'miRNA candidate', of which majority were related to growth. Hence, we assume that this 'miRNA candidate' may get activated during transcription of NAP gene, thereby regulating the expression of many genes involved in developmental processes.
Novel candidate genes may be possible predisposing factors revealed by whole exome sequencing in familial esophageal squamous cell carcinoma.

PubMed

Forouzanfar, Narjes; Baranova, Ancha; Milanizadeh, Saman; Heravi-Moussavi, Alireza; Jebelli, Amir; Abbaszadegan, Mohammad Reza

2017-05-01

Esophageal squamous cell carcinoma is one of the deadliest of all the cancers. Its metastatic properties portend poor prognosis and high rate of recurrence. A more advanced method to identify new molecular biomarkers predicting disease prognosis can be whole exome sequencing. Here, we report the most effective genetic variants of the Notch signaling pathway in esophageal squamous cell carcinoma susceptibility by whole exome sequencing. We analyzed nine probands in unrelated familial esophageal squamous cell carcinoma pedigrees to identify candidate genes. Genomic DNA was extracted and whole exome sequencing performed to generate information about genetic variants in the coding regions. Bioinformatics software applications were utilized to exploit statistical algorithms to demonstrate protein structure and variants conservation. Polymorphic regions were excluded by false-positive investigations. Gene-gene interactions were analyzed for Notch signaling pathway candidates. We identified novel and damaging variants of the Notch signaling pathway through extensive pathway-oriented filtering and functional predictions, which led to the study of 27 candidate novel mutations in all nine patients. Detection of the trinucleotide repeat containing 6B gene mutation (a slice site alteration) in five of the nine probands, but not in any of the healthy samples, suggested that it may be a susceptibility factor for familial esophageal squamous cell carcinoma. Noticeably, 8 of 27 novel candidate gene mutations (e.g. epidermal growth factor, signal transducer and activator of transcription 3, MET) act in a cascade leading to cell survival and proliferation. Our results suggest that the trinucleotide repeat containing 6B mutation may be a candidate predisposing gene in esophageal squamous cell carcinoma. In addition, some of the Notch signaling pathway genetic mutations may act as key contributors to esophageal squamous cell carcinoma.
RNA-Seq and Gene Network Analysis Uncover Activation of an ABA-Dependent Signalosome During the Cork Oak Root Response to Drought

PubMed Central

Magalhães, Alexandre P.; Verde, Nuno; Reis, Francisca; Martins, Inês; Costa, Daniela; Lino-Neto, Teresa; Castro, Pedro H.; Tavares, Rui M.; Azevedo, Herlânder

2016-01-01

Quercus suber (cork oak) is a West Mediterranean species of key economic interest, being extensively explored for its ability to generate cork. Like other Mediterranean plants, Q. suber is significantly threatened by climatic changes, imposing the need to quickly understand its physiological and molecular adaptability to drought stress imposition. In the present report, we uncovered the differential transcriptome of Q. suber roots exposed to long-term drought, using an RNA-Seq approach. 454-sequencing reads were used to de novo assemble a reference transcriptome, and mapping of reads allowed the identification of 546 differentially expressed unigenes. These were enriched in both effector genes (e.g., LEA, chaperones, transporters) as well as regulatory genes, including transcription factors (TFs) belonging to various different classes, and genes associated with protein turnover. To further extend functional characterization, we identified the orthologs of differentially expressed unigenes in the model species Arabidopsis thaliana, which then allowed us to perform in silico functional inference, including gene network analysis for protein function, protein subcellular localization and gene co-expression, and in silico enrichment analysis for TFs and cis-elements. Results indicated the existence of extensive transcriptional regulatory events, including activation of ABA-responsive genes and ABF-dependent signaling. We were then able to establish that a core ABA-signaling pathway involving PP2C-SnRK2-ABF components was induced in stressed Q. suber roots, identifying a key mechanism in this species’ response to drought. PMID:26793200
A large-scale RNA interference screen identifies genes that regulate autophagy at different stages.

PubMed

Guo, Sujuan; Pridham, Kevin J; Virbasius, Ching-Man; He, Bin; Zhang, Liqing; Varmark, Hanne; Green, Michael R; Sheng, Zhi

2018-02-12

Dysregulated autophagy is central to the pathogenesis and therapeutic development of cancer. However, how autophagy is regulated in cancer is not well understood and genes that modulate cancer autophagy are not fully defined. To gain more insights into autophagy regulation in cancer, we performed a large-scale RNA interference screen in K562 human chronic myeloid leukemia cells using monodansylcadaverine staining, an autophagy-detecting approach equivalent to immunoblotting of the autophagy marker LC3B or fluorescence microscopy of GFP-LC3B. By coupling monodansylcadaverine staining with fluorescence-activated cell sorting, we successfully isolated autophagic K562 cells where we identified 336 short hairpin RNAs. After candidate validation using Cyto-ID fluorescence spectrophotometry, LC3B immunoblotting, and quantitative RT-PCR, 82 genes were identified as autophagy-regulating genes. 20 genes have been reported previously and the remaining 62 candidates are novel autophagy mediators. Bioinformatic analyses revealed that most candidate genes were involved in molecular pathways regulating autophagy, rather than directly participating in the autophagy process. Further autophagy flux assays revealed that 57 autophagy-regulating genes suppressed autophagy initiation, whereas 21 candidates promoted autophagy maturation. Our RNA interference screen identifies identified genes that regulate autophagy at different stages, which helps decode autophagy regulation in cancer and offers novel avenues to develop autophagy-related therapies for cancer.
Bioinformatics-Based Identification of Candidate Genes from QTLs Associated with Cell Wall Traits in Populus

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ranjan, Priya; Yin, Tongming; Zhang, Xinye

2009-11-01

Quantitative trait locus (QTL) studies are an integral part of plant research and are used to characterize the genetic basis of phenotypic variation observed in structured populations and inform marker-assisted breeding efforts. These QTL intervals can span large physical regions on a chromosome comprising hundreds of genes, thereby hampering candidate gene identification. Genome history, evolution, and expression evidence can be used to narrow the genes in the interval to a smaller list that is manageable for detailed downstream functional genomics characterization. Our primary motivation for the present study was to address the need for a research methodology that identifies candidatemore » genes within a broad QTL interval. Here we present a bioinformatics-based approach for subdividing candidate genes within QTL intervals into alternate groups of high probability candidates. Application of this approach in the context of studying cell wall traits, specifically lignin content and S/G ratios of stem and root in Populus plants, resulted in manageable sets of genes of both known and putative cell wall biosynthetic function. These results provide a roadmap for future experimental work leading to identification of new genes controlling cell wall recalcitrance and, ultimately, in the utility of plant biomass as an energy feedstock.« less
A public platform for the verification of the phenotypic effect of candidate genes for resistance to aflatoxin accumulation and Aspergillus flavus infection in maize

USDA-ARS?s Scientific Manuscript database

A public candidate gene testing pipeline for resistance to aflatoxin accumulation or Aspergillus flavus infection in maize is presented here. The pipeline consists of steps for identifying, testing, and verifying the association of any maize gene sequence with resistance under field conditions. Reso...
SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate

Treesearch

Gretchen H. Roffler; Stephen J. Amish; Seth Smith; Ted Cosart; Marty Kardos; Michael K. Schwartz; Gordon Luikart

2016-01-01

Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding...
A genomic scan for selection reveals candidates for genes involved in the evolution of cultivated sunflower (Helianthus annuus).

PubMed

Chapman, Mark A; Pashley, Catherine H; Wenzler, Jessica; Hvala, John; Tang, Shunxue; Knapp, Steven J; Burke, John M

2008-11-01

Genomic scans for selection are a useful tool for identifying genes underlying phenotypic transitions. In this article, we describe the results of a genome scan designed to identify candidates for genes targeted by selection during the evolution of cultivated sunflower. This work involved screening 492 loci derived from ESTs on a large panel of wild, primitive (i.e., landrace), and improved sunflower (Helianthus annuus) lines. This sampling strategy allowed us to identify candidates for selectively important genes and investigate the likely timing of selection. Thirty-six genes showed evidence of selection during either domestication or improvement based on multiple criteria, and a sequence-based test of selection on a subset of these loci confirmed this result. In view of what is known about the structure of linkage disequilibrium across the sunflower genome, these genes are themselves likely to have been targeted by selection, rather than being merely linked to the actual targets. While the selection candidates showed a broad range of putative functions, they were enriched for genes involved in amino acid synthesis and protein catabolism. Given that a similar pattern has been detected in maize (Zea mays), this finding suggests that selection on amino acid composition may be a general feature of the evolution of crop plants. In terms of genomic locations, the selection candidates were significantly clustered near quantitative trait loci (QTL) that contribute to phenotypic differences between wild and cultivated sunflower, and specific instances of QTL colocalization provide some clues as to the roles that these genes may have played during sunflower evolution.
A priori and a posteriori approaches for finding genes of evolutionary interest in non-model species: osmoregulatory genes in the kidney transcriptome of the desert rodent Dipodomys spectabilis (banner-tailed kangaroo rat).

PubMed

Marra, Nicholas J; Eo, Soo Hyung; Hale, Matthew C; Waser, Peter M; DeWoody, J Andrew

2012-12-01

One common goal in evolutionary biology is the identification of genes underlying adaptive traits of evolutionary interest. Recently next-generation sequencing techniques have greatly facilitated such evolutionary studies in species otherwise depauperate of genomic resources. Kangaroo rats (Dipodomys sp.) serve as exemplars of adaptation in that they inhabit extremely arid environments, yet require no drinking water because of ultra-efficient kidney function and osmoregulation. As a basis for identifying water conservation genes in kangaroo rats, we conducted a priori bioinformatics searches in model rodents (Mus musculus and Rattus norvegicus) to identify candidate genes with known or suspected osmoregulatory function. We then obtained 446,758 reads via 454 pyrosequencing to characterize genes expressed in the kidney of banner-tailed kangaroo rats (Dipodomys spectabilis). We also determined candidates a posteriori by identifying genes that were overexpressed in the kidney. The kangaroo rat sequences revealed nine different a priori candidate genes predicted from our Mus and Rattus searches, as well as 32 a posteriori candidate genes that were overexpressed in kidney. Mutations in two of these genes, Slc12a1 and Slc12a3, cause human renal diseases that result in the inability to concentrate urine. These genes are likely key determinants of physiological water conservation in desert rodents. Copyright © 2012 Elsevier Inc. All rights reserved.
Senataxin Mutation Reveals How R-Loops Promote Transcription by Blocking DNA Methylation at Gene Promoters.

PubMed

Grunseich, Christopher; Wang, Isabel X; Watts, Jason A; Burdick, Joshua T; Guber, Robert D; Zhu, Zhengwei; Bruzel, Alan; Lanman, Tyler; Chen, Kelian; Schindler, Alice B; Edwards, Nancy; Ray-Chaudhury, Abhik; Yao, Jianhua; Lehky, Tanya; Piszczek, Grzegorz; Crain, Barbara; Fischbeck, Kenneth H; Cheung, Vivian G

2018-02-01

R-loops are three-stranded nucleic acid structures found abundantly and yet often viewed as by-products of transcription. Studying cells from patients with a motor neuron disease (amyotrophic lateral sclerosis 4 [ALS4]) caused by a mutation in senataxin, we uncovered how R-loops promote transcription. In ALS4 patients, the senataxin mutation depletes R-loops with a consequent effect on gene expression. With fewer R-loops in ALS4 cells, the expression of BAMBI, a negative regulator of transforming growth factor β (TGF-β), is reduced; that then leads to the activation of the TGF-β pathway. We uncovered that genome-wide R-loops influence promoter methylation of over 1,200 human genes. DNA methyl-transferase 1 favors binding to double-stranded DNA over R-loops. Thus, in forming R-loops, nascent RNA blocks DNA methylation and promotes further transcription. Hence, our results show that nucleic acid structures, in addition to sequences, influence the binding and activity of regulatory proteins. Copyright © 2017 Elsevier Inc. All rights reserved.
Integrative analysis of gene expression and DNA methylation using unsupervised feature extraction for detecting candidate cancer biomarkers.

PubMed

Moon, Myungjin; Nakai, Kenta

2018-04-01

Currently, cancer biomarker discovery is one of the important research topics worldwide. In particular, detecting significant genes related to cancer is an important task for early diagnosis and treatment of cancer. Conventional studies mostly focus on genes that are differentially expressed in different states of cancer; however, noise in gene expression datasets and insufficient information in limited datasets impede precise analysis of novel candidate biomarkers. In this study, we propose an integrative analysis of gene expression and DNA methylation using normalization and unsupervised feature extractions to identify candidate biomarkers of cancer using renal cell carcinoma RNA-seq datasets. Gene expression and DNA methylation datasets are normalized by Box-Cox transformation and integrated into a one-dimensional dataset that retains the major characteristics of the original datasets by unsupervised feature extraction methods, and differentially expressed genes are selected from the integrated dataset. Use of the integrated dataset demonstrated improved performance as compared with conventional approaches that utilize gene expression or DNA methylation datasets alone. Validation based on the literature showed that a considerable number of top-ranked genes from the integrated dataset have known relationships with cancer, implying that novel candidate biomarkers can also be acquired from the proposed analysis method. Furthermore, we expect that the proposed method can be expanded for applications involving various types of multi-omics datasets.
Association analysis of single nucleotide polymorphisms in candidate genes with root traits in maize (Zea mays L.) seedlings.

PubMed

Kumar, Bharath; Abdel-Ghani, Adel H; Pace, Jordon; Reyes-Matamoros, Jenaro; Hochholdinger, Frank; Lübberstedt, Thomas

2014-07-01

Several genes involved in maize root development have been isolated. Identification of SNPs associated with root traits would enable the selection of maize lines with better root architecture that might help to improve N uptake, and consequently plant growth particularly under N deficient conditions. In the present study, an association study (AS) panel consisting of 74 maize inbred lines was screened for seedling root traits in 6, 10, and 14-day-old seedlings. Allele re-sequencing of candidate root genes Rtcl, Rth3, Rum1, and Rul1 was also carried out in the same AS panel lines. All four candidate genes displayed different levels of nucleotide diversity, haplotype diversity and linkage disequilibrium. Gene based association analyses were carried out between individual polymorphisms in candidate genes, and root traits measured in 6, 10, and 14-day-old maize seedlings. Association analyses revealed several polymorphisms within the Rtcl, Rth3, Rum1, and Rul1 genes associated with seedling root traits. Several nucleotide polymorphisms in Rtcl, Rth3, Rum1, and Rul1 were significantly (P<0.05) associated with seedling root traits in maize suggesting that all four tested genes are involved in the maize root development. Thus considerable allelic variation present in these root genes can be exploited for improving maize root characteristics. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Rapid Communication: MiR-92a as a housekeeping gene for analysis of bovine mastitis-related microRNA in milk.

PubMed

Lai, Y C; Fujikawa, T; Ando, T; Kitahara, G; Koiwa, M; Kubota, C; Miura, N

2017-06-01

Our aim was to identify a suitable microRNA housekeeping gene for real-time PCR analysis of bovine mastitis-related microRNA in milk. We identified , , and as housekeeping gene candidates on the basis of previous Solexa sequencing results. Threshold cycle (CT) values for , , and did not differ between milk from control cows and milk from mastitis-affected cows. NormFinder software identified as the most stable single housekeeping gene. We evaluated the suitability of the housekeeping gene candidates by using them to assess expression levels of the inflammation-related gene . Regardless of the housekeeping gene candidates used for normalization, relative expression levels of were significantly higher in mastitis-affected samples than in control samples. However, of all the housekeeping genes and gene combinations investigated, normalization with alone generated the difference in relative expression between mastitis-affected and control samples with the highest significance. These results suggest that is suitable for use as a housekeeping gene for analysis of bovine mastitis-related microRNA in milk.
Fractal landscape analysis of DNA walks

NASA Technical Reports Server (NTRS)

Peng, C. K.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Sciortino, F.; Simons, M.; Stanley, H. E.

1992-01-01

By mapping nucleotide sequences onto a "DNA walk", we uncovered remarkably long-range power law correlations [Nature 356 (1992) 168] that imply a new scale invariant property of DNA. We found such long-range correlations in intron-containing genes and in non-transcribed regulatory DNA sequences, but not in cDNA sequences or intron-less genes. In this paper, we present more explicit evidences to support our findings.
Revealing Alzheimer's disease genes spectrum in the whole-genome by machine learning.

PubMed

Huang, Xiaoyan; Liu, Hankui; Li, Xinming; Guan, Liping; Li, Jiankang; Tellier, Laurent Christian Asker M; Yang, Huanming; Wang, Jian; Zhang, Jianguo

2018-01-10

Alzheimer's disease (AD) is an important, progressive neurodegenerative disease, with a complex genetic architecture. A key goal of biomedical research is to seek out disease risk genes, and to elucidate the function of these risk genes in the development of disease. For this purpose, expanding the AD-associated gene set is necessary. In past research, the prediction methods for AD related genes has been limited in their exploration of the target genome regions. We here present a genome-wide method for AD candidate genes predictions. We present a machine learning approach (SVM), based upon integrating gene expression data with human brain-specific gene network data, to discover the full spectrum of AD genes across the whole genome. We classified AD candidate genes with an accuracy and the area under the receiver operating characteristic (ROC) curve of 84.56% and 94%. Our approach provides a supplement for the spectrum of AD-associated genes extracted from more than 20,000 genes in a genome wide scale. In this study, we have elucidated the whole-genome spectrum of AD, using a machine learning approach. Through this method, we expect for the candidate gene catalogue to provide a more comprehensive annotation of AD for researchers.
Highly polygenic architecture of antidepressant treatment response: Comparative analysis of SSRI and NRI treatment in an animal model of depression.

PubMed

Malki, Karim; Tosto, Maria Grazia; Mouriño-Talín, Héctor; Rodríguez-Lorenzo, Sabela; Pain, Oliver; Jumhaboy, Irfan; Liu, Tina; Parpas, Panos; Newman, Stuart; Malykh, Artem; Carboni, Lucia; Uher, Rudolf; McGuffin, Peter; Schalkwyk, Leonard C; Bryson, Kevin; Herbster, Mark

2017-04-01

Response to antidepressant (AD) treatment may be a more polygenic trait than previously hypothesized, with many genetic variants interacting in yet unclear ways. In this study we used methods that can automatically learn to detect patterns of statistical regularity from a sparsely distributed signal across hippocampal transcriptome measurements in a large-scale animal pharmacogenomic study to uncover genomic variations associated with AD. The study used four inbred mouse strains of both sexes, two drug treatments, and a control group (escitalopram, nortriptyline, and saline). Multi-class and binary classification using Machine Learning (ML) and regularization algorithms using iterative and univariate feature selection methods, including InfoGain, mRMR, ANOVA, and Chi Square, were used to uncover genomic markers associated with AD response. Relevant genes were selected based on Jaccard distance and carried forward for gene-network analysis. Linear association methods uncovered only one gene associated with drug treatment response. The implementation of ML algorithms, together with feature reduction methods, revealed a set of 204 genes associated with SSRI and 241 genes associated with NRI response. Although only 10% of genes overlapped across the two drugs, network analysis shows that both drugs modulated the CREB pathway, through different molecular mechanisms. Through careful implementation and optimisations, the algorithms detected a weak signal used to predict whether an animal was treated with nortriptyline (77%) or escitalopram (67%) on an independent testing set. The results from this study indicate that the molecular signature of AD treatment may include a much broader range of genomic markers than previously hypothesized, suggesting that response to medication may be as complex as the pathology. The search for biomarkers of antidepressant treatment response could therefore consider a higher number of genetic markers and their interactions. Through predominately different molecular targets and mechanisms of action, the two drugs modulate the same Creb1 pathway which plays a key role in neurotrophic responses and in inflammatory processes. © 2016 The Authors. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics Published by Wiley Periodicals, Inc. © 2016 The Authors. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics Published by Wiley Periodicals, Inc.
Pericytes in kidney fibrosis.

PubMed

Ren, Shuyu; Duffield, Jeremy S

2013-07-01

Pericytes and perivascular fibroblasts have emerged as poorly appreciated yet extensive populations of mesenchymal cells in the kidney that play important roles in homeostasis and responses to injury. This review will update readers on the evolving understanding of the biology of these cells. Fate mapping has identified pericytes and perivascular fibroblasts as the major source of pathological fibrillar matrix-forming cells in interstitial kidney disease. In other organs similar cells have been described and independent fate mapping indicates that pericytes or perivascular cells are myofibroblast progenitors in multiple organs. Over the last year, new insights into the function of pericytes in kidney homeostasis has been uncovered and new molecular pathways that regulate detachment and their transdifferentiation into pathological myofibroblasts, including Wingless/Int, ephrin, transforming growth factor β, platelet derived growth factor, and Hedgehog signaling pathways, have been reported. In addition provocative studies indicate that microRNAs, which regulate posttranscriptional gene expression, may also play important roles in their transdifferentiation. Pericytes and perivascular fibroblasts are the major source of pathological collagen fiber-forming cells in interstitial kidney diseases. New avenues of research into their activation and differentiation has identified new drug candidates for the treatment of interstitial kidney disease.
Genome-wide meta-analysis in alopecia areata resolves HLA associations and reveals two new susceptibility loci

PubMed Central

Huang, Hailiang; Menelaou, Androniki; Redler, Silke; Becker, Tim; Heilmann, Stefanie; Yamany, Tarek; Duvic, Madeliene; Hordinsky, Maria; Norris, David; Price, Vera H.; Mackay-Wiggan, Julian; de Jong, Annemieke; DeStefano, Gina M.; Moebus, Susanne; Böhm, Markus; Blume-Peytavi, Ulrike; Wolff, Hans; Lutz, Gerhard; Kruse, Roland; Bian, Li; Amos, Christopher I.; Lee, Annette; Gregersen, Peter K.; Blaumeiser, Bettina; Altshuler, David; Clynes, Raphael; de Bakker, Paul I. W.; Nöthen, Markus M.; Daly, Mark J.; Christiano, Angela M.

2015-01-01

Alopecia areata (AA) is a prevalent autoimmune disease with ten known susceptibility loci. Here we perform the first meta-analysis in AA by combining data from two genome-wide association studies (GWAS), and replication with supplemented ImmunoChip data for a total of 3,253 cases and 7,543 controls. The strongest region of association is the MHC, where we fine-map 4 independent effects, all implicating HLA-DR as a key etiologic driver. Outside the MHC, we identify two novel loci that exceed statistical significance, containing ACOXL/BCL2L11(BIM) (2q13); GARP (LRRC32) (11q13.5), as well as a third nominally significant region SH2B3(LNK)/ATXN2 (12q24.12). Candidate susceptibility gene expression analysis in these regions demonstrates expression in relevant immune cells and the hair follicle. We integrate our results with data from seven other autoimmune diseases and provide insight into the alignment of AA within these disorders. Our findings uncover new molecular pathways disrupted in AA, including autophagy/apoptosis, TGFß/Tregs and JAK kinase signaling, and support the causal role of aberrant immune processes in AA. PMID:25608926
The use of transgenic parasites in malaria vaccine research.

PubMed

Othman, Ahmad Syibli; Marin-Mogollon, Catherin; Salman, Ahmed M; Franke-Fayard, Blandine M; Janse, Chris J; Khan, Shahid M

2017-07-01

Transgenic malaria parasites expressing foreign genes, for example fluorescent and luminescent proteins, are used extensively to interrogate parasite biology and host-parasite interactions associated with malaria pathology. Increasingly transgenic parasites are also exploited to advance malaria vaccine development. Areas covered: We review how transgenic malaria parasites are used, in vitro and in vivo, to determine protective efficacy of different antigens and vaccination strategies and to determine immunological correlates of protection. We describe how chimeric rodent parasites expressing P. falciparum or P. vivax antigens are being used to directly evaluate and rank order human malaria vaccines before their advancement to clinical testing. In addition, we describe how transgenic human and rodent parasites are used to develop and evaluate live (genetically) attenuated vaccines. Expert commentary: Transgenic rodent and human malaria parasites are being used to both identify vaccine candidate antigens and to evaluate both sub-unit and whole organism vaccines before they are advanced into clinical testing. Transgenic parasites combined with in vivo pre-clinical testing models (e.g. mice) are used to evaluate vaccine safety, potency and the durability of protection as well as to uncover critical protective immune responses and to refine vaccination strategies.
Meta-analysis of loci associated with age at natural menopause in African-American women.

PubMed

Chen, Christina T L; Liu, Ching-Ti; Chen, Gary K; Andrews, Jeanette S; Arnold, Alice M; Dreyfus, Jill; Franceschini, Nora; Garcia, Melissa E; Kerr, Kathleen F; Li, Guo; Lohman, Kurt K; Musani, Solomon K; Nalls, Michael A; Raffel, Leslie J; Smith, Jennifer; Ambrosone, Christine B; Bandera, Elisa V; Bernstein, Leslie; Britton, Angela; Brzyski, Robert G; Cappola, Anne; Carlson, Christopher S; Couper, David; Deming, Sandra L; Goodarzi, Mark O; Heiss, Gerardo; John, Esther M; Lu, Xiaoning; Le Marchand, Loic; Marciante, Kristin; Mcknight, Barbara; Millikan, Robert; Nock, Nora L; Olshan, Andrew F; Press, Michael F; Vaiyda, Dhananjay; Woods, Nancy F; Taylor, Herman A; Zhao, Wei; Zheng, Wei; Evans, Michele K; Harris, Tamara B; Henderson, Brian E; Kardia, Sharon L R; Kooperberg, Charles; Liu, Yongmei; Mosley, Thomas H; Psaty, Bruce; Wellons, Melissa; Windham, Beverly G; Zonderman, Alan B; Cupples, L Adrienne; Demerath, Ellen W; Haiman, Christopher; Murabito, Joanne M; Rajkovic, Aleksandar

2014-06-15

Age at menopause marks the end of a woman's reproductive life and its timing associates with risks for cancer, cardiovascular and bone disorders. GWAS and candidate gene studies conducted in women of European ancestry have identified 27 loci associated with age at menopause. The relevance of these loci to women of African ancestry has not been previously studied. We therefore sought to uncover additional menopause loci and investigate the relevance of European menopause loci by performing a GWAS meta-analysis in 6510 women with African ancestry derived from 11 studies across the USA. We did not identify any additional loci significantly associated with age at menopause in African Americans. We replicated the associations between six loci and age at menopause (P-value < 0.05): AMHR2, RHBLD2, PRIM1, HK3/UMC1, BRSK1/TMEM150B and MCM8. In addition, associations of 14 loci are directionally consistent with previous reports. We provide evidence that genetic variants influencing reproductive traits identified in European populations are also important in women of African ancestry residing in USA. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Genomic variation at the tips of the adaptive radiation of Darwin's finches.

PubMed

Chaves, Jaime A; Cooper, Elizabeth A; Hendry, Andrew P; Podos, Jeffrey; De León, Luis F; Raeymaekers, Joost A M; MacMillan, W Owen; Uy, J Albert C

2016-11-01

Adaptive radiation unfolds as selection acts on the genetic variation underlying functional traits. The nature of this variation can be revealed by studying the tips of an ongoing adaptive radiation. We studied genomic variation at the tips of the Darwin's finch radiation; specifically focusing on polymorphism within, and variation among, three sympatric species of the genus Geospiza. Using restriction site-associated DNA (RAD-seq), we characterized 32 569 single-nucleotide polymorphisms (SNPs), from which 11 outlier SNPs for beak and body size were uncovered by a genomewide association study (GWAS). Principal component analysis revealed that these 11 SNPs formed four statistically linked groups. Stepwise regression then revealed that the first PC score, which included 6 of the 11 top SNPs, explained over 80% of the variation in beak size, suggesting that selection on these traits influences multiple correlated loci. The two SNPs most strongly associated with beak size were near genes associated with beak morphology across deeper branches of the radiation: delta-like 1 homologue (DLK1) and high-mobility group AT-hook 2 (HMGA2). Our results suggest that (i) key adaptive traits are associated with a small fraction of the genome (11 of 32 569 SNPs), (ii) SNPs linked to the candidate genes are dispersed throughout the genome (on several chromosomes), and (iii) micro- and macro-evolutionary variation (roots and tips of the radiation) involve some shared and some unique genomic regions. © 2016 John Wiley & Sons Ltd.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Wrighton, Kelly C.; Castelle, Cindy J.; Varaljay, Vanessa A.

Metagenomic studies recently uncovered form II/III RubisCO genes, originally thought to only occur in archaea, from uncultivated bacteria of the candidate phyla radiation (CPR). There are no isolated CPR bacteria and these organisms are predicted to have limited metabolic capacities. Here we expand the known diversity of RubisCO from CPR lineages. We report a form of RubisCO, distantly similar to the archaeal form III RubisCO, in some CPR bacteria from the Parcubacteria (OD1), WS6 and Microgenomates (OP11) phyla. In addition, we significantly expand the Peregrinibacteria (PER) II/III RubisCO diversity and report the first II/III RubisCO sequences from the Microgenomates andmore » WS6 phyla. To provide a metabolic context for these RubisCOs, we reconstructed near-complete ( > 93%) PER genomes and the first closed genome for a WS6 bacterium, for which we propose the phylum name Dojkabacteria. Genomic and bioinformatic analyses suggest that the CPR RubisCOs function in a nucleoside pathway similar to that proposed in Archaea. Detection of form II/III RubisCO and nucleoside metabolism gene transcripts from a PER supports the operation of this pathway in situ. We demonstrate that the PER form II/III RubisCO is catalytically active, fixing CO 2 to physiologically complement phototrophic growth in a bacterial photoautotrophic RubisCO deletion strain. We propose that the identification of these RubisCOs across a radiation of obligately fermentative, small-celled organisms hints at a widespread, simple metabolic platform in which ribose may be a prominent currency.« less
Comparative genomics and functional analysis of the NiaP family uncover nicotinate transporters from bacteria, plants, and mammals.

PubMed

Jeanguenin, Linda; Lara-Núñez, Aurora; Rodionov, Dmitry A; Osterman, Andrei L; Komarova, Nataliya Y; Rentsch, Doris; Gregory, Jesse F; Hanson, Andrew D

2012-03-01

The transporter(s) that mediate uptake of nicotinate and its N-methyl derivative trigonelline are not known in plants, and certain mammalian nicotinate transporters also remain unidentified. Potential candidates for these missing transporters include proteins from the ubiquitous NiaP family. In bacteria, niaP genes often belong to NAD-related regulons, and genetic evidence supports a role for Bacillus subtilis and Acinetobacter baumannii NiaP proteins in uptake of nicotinate or nicotinamide. Other bacterial niaP genes are, however, not in NAD-related regulons but cluster on the chromosome with choline-related (e.g., Ralstonia solanacearum and Burkholderia xenovorans) or thiamin-related (e.g., Thermus thermophilus) genes, implying that they might encode transporters for these compounds. Radiometric uptake assays using Lactococcus lactis cells expressing NiaP proteins showed that B. subtilis, R. solanacearum, and B. xenovorans NiaP transport nicotinate via an energy-dependent mechanism. Likewise, NiaP proteins from maize (GRMZM2G381453, GRMZM2G066801, and GRMZM2G081774), Arabidopsis (At3g13050), and mouse (SVOP) transported nicotinate; the Arabidopsis protein also transported trigonelline. In contrast, T. thermophilus NiaP transported only thiamin. None of the proteins tested transported choline or the thiazole and pyrimidine products of thiamin breakdown. The maize and Arabidopsis NiaP proteins are the first nicotinate transporters reported in plants, the Arabidopsis protein is the first trigonelline transporter, and mouse SVOP appears to represent a novel type of mammalian nicotinate transporter. More generally, these results indicate that specificity for nicotinate is conserved widely, but not absolutely, among pro- and eukaryotic NiaP family proteins.
Systematic assessment of cervical cancer initiation and progression uncovers genetic panels for deep learning-based early diagnosis and proposes novel diagnostic and prognostic biomarkers.

PubMed

Long, Nguyen Phuoc; Jung, Kyung Hee; Yoon, Sang Jun; Anh, Nguyen Hoang; Nghi, Tran Diem; Kang, Yun Pyo; Yan, Hong Hua; Min, Jung Eun; Hong, Soon-Sun; Kwon, Sung Won

2017-12-12

Although many outstanding achievements in the management of cervical cancer (CxCa) have obtained, it still imposes a major burden which has prompted scientists to discover and validate new CxCa biomarkers to improve the diagnostic and prognostic assessment of CxCa. In this study, eight different gene expression data sets containing 202 cancer, 115 cervical intraepithelial neoplasia (CIN), and 105 normal samples were utilized for an integrative systems biology assessment in a multi-stage carcinogenesis manner. Deep learning-based diagnostic models were established based on the genetic panels of intrinsic genes of cervical carcinogenesis as well as on the unbiased variable selection approach. Survival analysis was also conducted to explore the potential biomarker candidates for prognostic assessment. Our results showed that cell cycle, RNA transport, mRNA surveillance, and one carbon pool by folate were the key regulatory mechanisms involved in the initiation, progression, and metastasis of CxCa. Various genetic panels combined with machine learning algorithms successfully differentiated CxCa from CIN and normalcy in cross-study normalized data sets. In particular, the 168-gene deep learning model for the differentiation of cancer from normalcy achieved an externally validated accuracy of 97.96% (99.01% sensitivity and 95.65% specificity). Survival analysis revealed that ZNF281 and EPHB6 were the two most promising prognostic genetic markers for CxCa among others. Our findings open new opportunities to enhance current understanding of the characteristics of CxCa pathobiology. In addition, the combination of transcriptomics-based signatures and deep learning classification may become an important approach to improve CxCa diagnosis and management in clinical practice.
Identification of rare X-linked neuroligin variants by massively parallel sequencing in males with autism spectrum disorder.

PubMed

Steinberg, Karyn Meltz; Ramachandran, Dhanya; Patel, Viren C; Shetty, Amol C; Cutler, David J; Zwick, Michael E

2012-09-28

Autism spectrum disorder (ASD) is highly heritable, but the genetic risk factors for it remain largely unknown. Although structural variants with large effect sizes may explain up to 15% ASD, genome-wide association studies have failed to uncover common single nucleotide variants with large effects on phenotype. The focus within ASD genetics is now shifting to the examination of rare sequence variants of modest effect, which is most often achieved via exome selection and sequencing. This strategy has indeed identified some rare candidate variants; however, the approach does not capture the full spectrum of genetic variation that might contribute to the phenotype. We surveyed two loci with known rare variants that contribute to ASD, the X-linked neuroligin genes by performing massively parallel Illumina sequencing of the coding and noncoding regions from these genes in males from families with multiplex autism. We annotated all variant sites and functionally tested a subset to identify other rare mutations contributing to ASD susceptibility. We found seven rare variants at evolutionary conserved sites in our study population. Functional analyses of the three 3' UTR variants did not show statistically significant effects on the expression of NLGN3 and NLGN4X. In addition, we identified two NLGN3 intronic variants located within conserved transcription factor binding sites that could potentially affect gene regulation. These data demonstrate the power of massively parallel, targeted sequencing studies of affected individuals for identifying rare, potentially disease-contributing variation. However, they also point out the challenges and limitations of current methods of direct functional testing of rare variants and the difficulties of identifying alleles with modest effects.
Identification of rare X-linked neuroligin variants by massively parallel sequencing in males with autism spectrum disorder

PubMed Central

2012-01-01

Background Autism spectrum disorder (ASD) is highly heritable, but the genetic risk factors for it remain largely unknown. Although structural variants with large effect sizes may explain up to 15% ASD, genome-wide association studies have failed to uncover common single nucleotide variants with large effects on phenotype. The focus within ASD genetics is now shifting to the examination of rare sequence variants of modest effect, which is most often achieved via exome selection and sequencing. This strategy has indeed identified some rare candidate variants; however, the approach does not capture the full spectrum of genetic variation that might contribute to the phenotype. Methods We surveyed two loci with known rare variants that contribute to ASD, the X-linked neuroligin genes by performing massively parallel Illumina sequencing of the coding and noncoding regions from these genes in males from families with multiplex autism. We annotated all variant sites and functionally tested a subset to identify other rare mutations contributing to ASD susceptibility. Results We found seven rare variants at evolutionary conserved sites in our study population. Functional analyses of the three 3’ UTR variants did not show statistically significant effects on the expression of NLGN3 and NLGN4X. In addition, we identified two NLGN3 intronic variants located within conserved transcription factor binding sites that could potentially affect gene regulation. Conclusions These data demonstrate the power of massively parallel, targeted sequencing studies of affected individuals for identifying rare, potentially disease-contributing variation. However, they also point out the challenges and limitations of current methods of direct functional testing of rare variants and the difficulties of identifying alleles with modest effects. PMID:23020841
Molecular profiling of aged neural progenitors identifies Dbx2 as a candidate regulator of age-associated neurogenic decline.

PubMed

Lupo, Giuseppe; Nisi, Paola S; Esteve, Pilar; Paul, Yu-Lee; Novo, Clara Lopes; Sidders, Ben; Khan, Muhammad A; Biagioni, Stefano; Liu, Hai-Kun; Bovolenta, Paola; Cacci, Emanuele; Rugg-Gunn, Peter J

2018-06-01

Adult neurogenesis declines with aging due to the depletion and functional impairment of neural stem/progenitor cells (NSPCs). An improved understanding of the underlying mechanisms that drive age-associated neurogenic deficiency could lead to the development of strategies to alleviate cognitive impairment and facilitate neuroregeneration. An essential step towards this aim is to investigate the molecular changes that occur in NSPC aging on a genomewide scale. In this study, we compare the transcriptional, histone methylation and DNA methylation signatures of NSPCs derived from the subventricular zone (SVZ) of young adult (3 months old) and aged (18 months old) mice. Surprisingly, the transcriptional and epigenomic profiles of SVZ-derived NSPCs are largely unchanged in aged cells. Despite the global similarities, we detect robust age-dependent changes at several hundred genes and regulatory elements, thereby identifying putative regulators of neurogenic decline. Within this list, the homeobox gene Dbx2 is upregulated in vitro and in vivo, and its promoter region has altered histone and DNA methylation levels, in aged NSPCs. Using functional in vitro assays, we show that elevated Dbx2 expression in young adult NSPCs promotes age-related phenotypes, including the reduced proliferation of NSPC cultures and the altered transcript levels of age-associated regulators of NSPC proliferation and differentiation. Depleting Dbx2 in aged NSPCs caused the reverse gene expression changes. Taken together, these results provide new insights into the molecular programmes that are affected during mouse NSPC aging, and uncover a new functional role for Dbx2 in promoting age-related neurogenic decline. © 2018 The Authors. Aging Cell published by the Anatomical Society and John Wiley & Sons Ltd.
Systematic assessment of cervical cancer initiation and progression uncovers genetic panels for deep learning-based early diagnosis and proposes novel diagnostic and prognostic biomarkers

PubMed Central

Long, Nguyen Phuoc; Jung, Kyung Hee; Yoon, Sang Jun; Anh, Nguyen Hoang; Nghi, Tran Diem; Kang, Yun Pyo; Yan, Hong Hua; Min, Jung Eun; Hong, Soon-Sun; Kwon, Sung Won

2017-01-01

Although many outstanding achievements in the management of cervical cancer (CxCa) have obtained, it still imposes a major burden which has prompted scientists to discover and validate new CxCa biomarkers to improve the diagnostic and prognostic assessment of CxCa. In this study, eight different gene expression data sets containing 202 cancer, 115 cervical intraepithelial neoplasia (CIN), and 105 normal samples were utilized for an integrative systems biology assessment in a multi-stage carcinogenesis manner. Deep learning-based diagnostic models were established based on the genetic panels of intrinsic genes of cervical carcinogenesis as well as on the unbiased variable selection approach. Survival analysis was also conducted to explore the potential biomarker candidates for prognostic assessment. Our results showed that cell cycle, RNA transport, mRNA surveillance, and one carbon pool by folate were the key regulatory mechanisms involved in the initiation, progression, and metastasis of CxCa. Various genetic panels combined with machine learning algorithms successfully differentiated CxCa from CIN and normalcy in cross-study normalized data sets. In particular, the 168-gene deep learning model for the differentiation of cancer from normalcy achieved an externally validated accuracy of 97.96% (99.01% sensitivity and 95.65% specificity). Survival analysis revealed that ZNF281 and EPHB6 were the two most promising prognostic genetic markers for CxCa among others. Our findings open new opportunities to enhance current understanding of the characteristics of CxCa pathobiology. In addition, the combination of transcriptomics-based signatures and deep learning classification may become an important approach to improve CxCa diagnosis and management in clinical practice. PMID:29312619
Look beyond one's own nose: combination of information from publicly available sources reveals an association of GATA4 polymorphisms with plasma triglycerides.

PubMed

Lamina, Claudia; Coassin, Stefan; Illig, Thomas; Kronenberg, Florian

2011-12-01

GATA4iKO mice exhibit impeded triglyceride absorption from intestine and decreased plasma triglyceride levels. Data in humans are lacking. We hypothesized that triglyceride levels might also be regulated by polymorphisms in the GATA4 gene in humans. We used publicly available data from different sources to evaluate this hypothesis. Our approach is a more often applicable advance to uncover associations and their functional implications which would have been otherwise missed by standard genome-wide association studies (GWAS). We used the publicly available GWAS results from 137 SNPs in the GATA4 region for triglyceride levels. We embedded these results into the comprehensive functional genomics data provided in the UCSC Genome Browser including among others information on regulatory elements and interspecies conservation. A concise graphical presentation is proposed together with an R function for automatic data preparation. This process is presented in an educational manner using a screencast to become most useful for other researchers. We observed several polymorphisms in and around the GATA4 gene which have a significant influence on plasma triglyceride levels with the lowest p-value at SNP rs1466785 (Bonferroni-corrected p-value = 1.76e-5). The bioinformatic evaluation of this locus in publicly available functional genomics data provided converging evidence for the presence of a transcriptional regulator downstream of GATA4. The combination of different sources of data has revealed an association of GATA4 with triglyceride levels in humans. Our evaluation exemplifies how an integrative analysis including both statistical and biological perspectives can shed new light on available association data and reveals novel candidate genes, which are otherwise hidden in the noisy region below genome-wide significance. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
N-of-1-pathways MixEnrich: advancing precision medicine via single-subject analysis in discovering dynamic changes of transcriptomes.

PubMed

Li, Qike; Schissler, A Grant; Gardeux, Vincent; Achour, Ikbel; Kenost, Colleen; Berghout, Joanne; Li, Haiquan; Zhang, Hao Helen; Lussier, Yves A

2017-05-24

Transcriptome analytic tools are commonly used across patient cohorts to develop drugs and predict clinical outcomes. However, as precision medicine pursues more accurate and individualized treatment decisions, these methods are not designed to address single-patient transcriptome analyses. We previously developed and validated the N-of-1-pathways framework using two methods, Wilcoxon and Mahalanobis Distance (MD), for personal transcriptome analysis derived from a pair of samples of a single patient. Although, both methods uncover concordantly dysregulated pathways, they are not designed to detect dysregulated pathways with up- and down-regulated genes (bidirectional dysregulation) that are ubiquitous in biological systems. We developed N-of-1-pathways MixEnrich, a mixture model followed by a gene set enrichment test, to uncover bidirectional and concordantly dysregulated pathways one patient at a time. We assess its accuracy in a comprehensive simulation study and in a RNA-Seq data analysis of head and neck squamous cell carcinomas (HNSCCs). In presence of bidirectionally dysregulated genes in the pathway or in presence of high background noise, MixEnrich substantially outperforms previous single-subject transcriptome analysis methods, both in the simulation study and the HNSCCs data analysis (ROC Curves; higher true positive rates; lower false positive rates). Bidirectional and concordant dysregulated pathways uncovered by MixEnrich in each patient largely overlapped with the quasi-gold standard compared to other single-subject and cohort-based transcriptome analyses. The greater performance of MixEnrich presents an advantage over previous methods to meet the promise of providing accurate personal transcriptome analysis to support precision medicine at point of care.
Genomics of Nitrogen Cycle in Freshwater Lakes with Focus on Methylotrophic Bacteria

NASA Astrophysics Data System (ADS)

Chistoserdova, L.

2014-12-01

Data will be presented on communities of microbes active in methane oxidation in Lake Washington, Seattle. Metagenomic sequencing of sediment samples reveals dominant presence of Methylobacter, contrary to prior understanding based on cultivation of methanotrophs. Stable isotope probing of microcosms incubated with methane at varying concentrations of oxygen and nitrate uncover a dominant response by Methylobacter species and a correlation between the populations of Methylobacter and Methylotenera, both responding positively to nitrate. We also uncover a propensity of Methylobacter to act in microoxic conditions, in this case transferring carbon down a food chain represented by a variety of bacteria. Functional gene profiling detects upwards shifts in the abundances of nitrogen metabolism genes in response to nitrate, with Methylococcaceae and Methylophilaceae genes being most abundant. We test a hypothesis of cooperative behavior between Methylobacter, Methylotenera and other species using two alternative approaches: a top-down approach in which we incubate native lake sediments under different conditions and observe trajectories of community simplification, and a bottom-up approach in which we construct synthetic communities from pure cultures of bacteria and observe their behavior. We also cultivate Methylobacter as well as multiple species of Methylophilaceae and analyze their genomes. Among the Methylophilaceae, we uncover a remarkable flexibility in terms of both central carbon and nitrogen metabolic pathways. We hypothesize that this diversity may be driven by microniche conditions along the methane and oxygen countergradients, as well as by the availability of nitrogen sources. Our future plans include deciphering the mechanistic details of cooperative behavior in methane oxidation, using Lake Washington communities as a model.
Selection and Validation of Reference Genes for qRT-PCR Expression Analysis of Candidate Genes Involved in Olfactory Communication in the Butterfly Bicyclus anynana

PubMed Central

Arun, Alok; Baumlé, Véronique; Amelot, Gaël; Nieberding, Caroline M.

2015-01-01

Real-time quantitative reverse transcription PCR (qRT-PCR) is a technique widely used to quantify the transcriptional expression level of candidate genes. qRT-PCR requires the selection of one or several suitable reference genes, whose expression profiles remain stable across conditions, to normalize the qRT-PCR expression profiles of candidate genes. Although several butterfly species (Lepidoptera) have become important models in molecular evolutionary ecology, so far no study aimed at identifying reference genes for accurate data normalization for any butterfly is available. The African bush brown butterfly Bicyclus anynana has drawn considerable attention owing to its suitability as a model for evolutionary ecology, and we here provide a maiden extensive study to identify suitable reference gene in this species. We monitored the expression profile of twelve reference genes: eEF-1α, FK506, UBQL40, RpS8, RpS18, HSP, GAPDH, VATPase, ACT3, TBP, eIF2 and G6PD. We tested the stability of their expression profiles in three different tissues (wings, brains, antennae), two developmental stages (pupal and adult) and two sexes (male and female), all of which were subjected to two food treatments (food stress and control feeding ad libitum). The expression stability and ranking of twelve reference genes was assessed using two algorithm-based methods, NormFinder and geNorm. Both methods identified RpS8 as the best suitable reference gene for expression data normalization. We also showed that the use of two reference genes is sufficient to effectively normalize the qRT-PCR data under varying tissues and experimental conditions that we used in B. anynana. Finally, we tested the effect of choosing reference genes with different stability on the normalization of the transcript abundance of a candidate gene involved in olfactory communication in B. anynana, the Fatty Acyl Reductase 2, and we confirmed that using an unstable reference gene can drastically alter the expression profile of the target candidate genes. PMID:25793735
Selection and validation of reference genes for qRT-PCR expression analysis of candidate genes involved in olfactory communication in the butterfly Bicyclus anynana.

PubMed

Arun, Alok; Baumlé, Véronique; Amelot, Gaël; Nieberding, Caroline M

2015-01-01

Real-time quantitative reverse transcription PCR (qRT-PCR) is a technique widely used to quantify the transcriptional expression level of candidate genes. qRT-PCR requires the selection of one or several suitable reference genes, whose expression profiles remain stable across conditions, to normalize the qRT-PCR expression profiles of candidate genes. Although several butterfly species (Lepidoptera) have become important models in molecular evolutionary ecology, so far no study aimed at identifying reference genes for accurate data normalization for any butterfly is available. The African bush brown butterfly Bicyclus anynana has drawn considerable attention owing to its suitability as a model for evolutionary ecology, and we here provide a maiden extensive study to identify suitable reference gene in this species. We monitored the expression profile of twelve reference genes: eEF-1α, FK506, UBQL40, RpS8, RpS18, HSP, GAPDH, VATPase, ACT3, TBP, eIF2 and G6PD. We tested the stability of their expression profiles in three different tissues (wings, brains, antennae), two developmental stages (pupal and adult) and two sexes (male and female), all of which were subjected to two food treatments (food stress and control feeding ad libitum). The expression stability and ranking of twelve reference genes was assessed using two algorithm-based methods, NormFinder and geNorm. Both methods identified RpS8 as the best suitable reference gene for expression data normalization. We also showed that the use of two reference genes is sufficient to effectively normalize the qRT-PCR data under varying tissues and experimental conditions that we used in B. anynana. Finally, we tested the effect of choosing reference genes with different stability on the normalization of the transcript abundance of a candidate gene involved in olfactory communication in B. anynana, the Fatty Acyl Reductase 2, and we confirmed that using an unstable reference gene can drastically alter the expression profile of the target candidate genes.
Children’s Hospital of Pittsburgh and Diabetes Institute of the Walter Reed Health Care System Genetic Screening in Diabetes: Candidate Gene Analysis for Diabetic Retinopathy

DTIC Science & Technology

2010-05-01

Screening in Diabetes : Candidate Gene Analysis for Diabetic Retinopathy PRINCIPAL INVESTIGATOR: Robert A. Vigersky, COL MC CONTRACTING ORGANIZATION... Diabetes Institute of the Walter Reed Health Care System Genetic Screening in Diabetes : Candidate Gene Analysis for Diabetic Retinopathy 5c. PROGRAM... diabetic neuropathy, and diabetic retinopathy . This was an observational study in which the investigators obtained DNA samples from the blood of
Transcriptome profiling of low temperature-treated cassava apical shoots showed dynamic responses of tropical plant to cold stress

PubMed Central

2012-01-01

Background Cassava is an important tropical root crop adapted to a wide range of environmental stimuli such as drought and acid soils. Nevertheless, it is an extremely cold-sensitive tropical species. Thus far, there is limited information about gene regulation and signalling pathways related to the cold stress response in cassava. The development of microarray technology has accelerated the study of global transcription profiling under certain conditions. Results A 60-mer oligonucleotide microarray representing 20,840 genes was used to perform transcriptome profiling in apical shoots of cassava subjected to cold at 7°C for 0, 4 and 9 h. A total of 508 transcripts were identified as early cold-responsive genes in which 319 sequences had functional descriptions when aligned with Arabidopsis proteins. Gene ontology annotation analysis identified many cold-relevant categories, including 'Response to abiotic and biotic stimulus', 'Response to stress', 'Transcription factor activity', and 'Chloroplast'. Various stress-associated genes with a wide range of biological functions were found, such as signal transduction components (e.g., MAP kinase 4), transcription factors (TFs, e.g., RAP2.11), and reactive oxygen species (ROS) scavenging enzymes (e.g., catalase 2), as well as photosynthesis-related genes (e.g., PsaL). Seventeen major TF families including many well-studied members (e.g., AP2-EREBP) were also involved in the early response to cold stress. Meanwhile, KEGG pathway analysis uncovered many important pathways, such as 'Plant hormone signal transduction' and 'Starch and sucrose metabolism'. Furthermore, the expression changes of 32 genes under cold and other abiotic stress conditions were validated by real-time RT-PCR. Importantly, most of the tested stress-responsive genes were primarily expressed in mature leaves, stem cambia, and fibrous roots rather than apical buds and young leaves. As a response to cold stress in cassava, an increase in transcripts and enzyme activities of ROS scavenging genes and the accumulation of total soluble sugars (including sucrose and glucose) were also detected. Conclusions The dynamic expression changes reflect the integrative controlling and transcriptome regulation of the networks in the cold stress response of cassava. The biological processes involved in the signal perception and physiological response might shed light on the molecular mechanisms related to cold tolerance in tropical plants and provide useful candidate genes for genetic improvement. PMID:22321773
ProDiGe: Prioritization Of Disease Genes with multitask machine learning from positive and unlabeled examples

PubMed Central

2011-01-01

Background Elucidating the genetic basis of human diseases is a central goal of genetics and molecular biology. While traditional linkage analysis and modern high-throughput techniques often provide long lists of tens or hundreds of disease gene candidates, the identification of disease genes among the candidates remains time-consuming and expensive. Efficient computational methods are therefore needed to prioritize genes within the list of candidates, by exploiting the wealth of information available about the genes in various databases. Results We propose ProDiGe, a novel algorithm for Prioritization of Disease Genes. ProDiGe implements a novel machine learning strategy based on learning from positive and unlabeled examples, which allows to integrate various sources of information about the genes, to share information about known disease genes across diseases, and to perform genome-wide searches for new disease genes. Experiments on real data show that ProDiGe outperforms state-of-the-art methods for the prioritization of genes in human diseases. Conclusions ProDiGe implements a new machine learning paradigm for gene prioritization, which could help the identification of new disease genes. It is freely available at http://cbio.ensmp.fr/prodige. PMID:21977986
Gene Expression Profiling of Gastric Cancer

PubMed Central

Marimuthu, Arivusudar; Jacob, Harrys K.C.; Jakharia, Aniruddha; Subbannayya, Yashwanth; Keerthikumar, Shivakumar; Kashyap, Manoj Kumar; Goel, Renu; Balakrishnan, Lavanya; Dwivedi, Sutopa; Pathare, Swapnali; Dikshit, Jyoti Bajpai; Maharudraiah, Jagadeesha; Singh, Sujay; Sameer Kumar, Ghantasala S; Vijayakumar, M.; Veerendra Kumar, Kariyanakatte Veeraiah; Premalatha, Chennagiri Shrinivasamurthy; Tata, Pramila; Hariharan, Ramesh; Roa, Juan Carlos; Prasad, T.S.K; Chaerkady, Raghothama; Kumar, Rekha Vijay; Pandey, Akhilesh

2015-01-01

Gastric cancer is the second leading cause of cancer death worldwide, both in men and women. A genomewide gene expression analysis was carried out to identify differentially expressed genes in gastric adenocarcinoma tissues as compared to adjacent normal tissues. We used Agilent’s whole human genome oligonucleotide microarray platform representing ~41,000 genes to carry out gene expression analysis. Two-color microarray analysis was employed to directly compare the expression of genes between tumor and normal tissues. Through this approach, we identified several previously known candidate genes along with a number of novel candidate genes in gastric cancer. Testican-1 (SPOCK1) was one of the novel molecules that was 10-fold upregulated in tumors. Using tissue microarrays, we validated the expression of testican-1 by immunohistochemical staining. It was overexpressed in 56% (160/282) of the cases tested. Pathway analysis led to the identification of several networks in which SPOCK1 was among the topmost networks of interacting genes. By gene enrichment analysis, we identified several genes involved in cell adhesion and cell proliferation to be significantly upregulated while those corresponding to metabolic pathways were significantly downregulated. The differentially expressed genes identified in this study are candidate biomarkers for gastric adenoacarcinoma. PMID:27030788
Exploring Valid Reference Genes for Quantitative Real-time PCR Analysis in Plutella xylostella (Lepidoptera: Plutellidae)

PubMed Central

Fu, Wei; Xie, Wen; Zhang, Zhuo; Wang, Shaoli; Wu, Qingjun; Liu, Yong; Zhou, Xiaomao; Zhou, Xuguo; Zhang, Youjun

2013-01-01

Abstract: Quantitative real-time PCR (qRT-PCR), a primary tool in gene expression analysis, requires an appropriate normalization strategy to control for variation among samples. The best option is to compare the mRNA level of a target gene with that of reference gene(s) whose expression level is stable across various experimental conditions. In this study, expression profiles of eight candidate reference genes from the diamondback moth, Plutella xylostella, were evaluated under diverse experimental conditions. RefFinder, a web-based analysis tool, integrates four major computational programs including geNorm, Normfinder, BestKeeper, and the comparative ΔCt method to comprehensively rank the tested candidate genes. Elongation factor 1 (EF1) was the most suited reference gene for the biotic factors (development stage, tissue, and strain). In contrast, although appropriate reference gene(s) do exist for several abiotic factors (temperature, photoperiod, insecticide, and mechanical injury), we were not able to identify a single universal reference gene. Nevertheless, a suite of candidate reference genes were specifically recommended for selected experimental conditions. Our finding is the first step toward establishing a standardized qRT-PCR analysis of this agriculturally important insect pest. PMID:23983612
Identification of possible genetic polymorphisms involved in cancer cachexia: a systematic review.

PubMed

Tan, Benjamin H L; Ross, James A; Kaasa, Stein; Skorpen, Frank; Fearon, Kenneth C H

2011-04-01

Cancer cachexia is a polygenic and complex syndrome. Genetic variations in regulation of the inflammatory response, muscle and fat metabolic pathways, and pathways in appetite regulation are likely to contribute to the susceptibility or resistance to developing cancer cachexia. A systematic search of Medline and EmBase databases, covering 1986-2008 was performed for potential candidate genes/genetic polymorphisms relating to cancer cachexia. Related genes were then identified using pathway functional analysis software. All candidate genes were reviewed for functional polymorphisms or clinically significant polymorphisms associated with cachexia using the OMIM and GeneRIF databases. Genes with variants which had functional or clinical associations with cachexia and replicated in at least one study were entered into pathway analysis software to reveal possible network associations between genes. A total of 184 polymorphisms with functional or clinical relevance to cancer cachexia were identified in 92 candidate genes. Of these, 42 polymorphisms (in 33 genes) were replicated in more than one study with 13 polymorphisms found to influence two or more hallmarks of cachexia (i.e. inflammation, loss of fat mass and/or lean mass and reduced survival). Thirty-three genes were found to be significantly interconnected in two major networks with four genes (ADIPOQ, IL6, NFKB1 and TLR4) interlinking both networks. Selection of candidate genes and polymorphisms is a key element of multigene study design. The present study provides an initial framework to select genes/polymorphisms for further study in cancer cachexia, and to develop their potential as susceptibility biomarkers of developing cachexia.
Transcription map of Xq27: candidates for several X-linked diseases.

PubMed

Zucchi, I; Jones, J; Affer, M; Montagna, C; Redolfi, E; Susani, L; Vezzoni, P; Parvari, R; Schlessinger, D; Whyte, M P; Mumm, S

1999-04-15

Human Xq27 contains candidate regions for several disorders, yet is predicted to be a gene-poor cytogenetic band. We have developed a transcription map for the entire cytogenetic band to facilitate the identification of the relatively small number of expected candidate genes. Two approaches were taken to identify genes: (1) a group of 64 unique STSs that were generated during the physical mapping of the region were used in RT-PCR with RNA from human adult and fetal brain and (2) ESTs that have been broadly mapped to this region of the chromosome were finely mapped using a high-resolution yeast artificial chromosome contig. This combined approach identified four distinct regions of transcriptional activity within the Xq27 band. Among them is a region at the centromeric boundary that contains candidate regions for several rare developmental disorders (X-linked recessive hypoparathyroidism, thoracoabdominal syndrome, albinism-deafness syndrome, and Borjeson-Forssman-Lehman syndrome). Two transcriptionally active regions were identified in the center of Xq27 and include candidate regions for X-linked mental retardation syndrome 6, X-linked progressive cone dystrophy, X-linked retinitis pigmentosa 24, and a prostate cancer susceptibility locus. The fourth region of transcriptional activity encompasses the FMR1 (FRAXA) and FMR2 (FRAXE) genes. The analysis thus suggests clustered transcription in Xq27 and provides candidates for several heritable disorders for which the causative genes have not yet been found. Copyright 1999 Academic Press.

Understanding phylogenetic incongruence: lessons from phyllostomid bats

PubMed Central

Dávalos, Liliana M; Cirranello, Andrea L; Geisler, Jonathan H; Simmons, Nancy B

2012-01-01

All characters and trait systems in an organism share a common evolutionary history that can be estimated using phylogenetic methods. However, differential rates of change and the evolutionary mechanisms driving those rates result in pervasive phylogenetic conflict. These drivers need to be uncovered because mismatches between evolutionary processes and phylogenetic models can lead to high confidence in incorrect hypotheses. Incongruence between phylogenies derived from morphological versus molecular analyses, and between trees based on different subsets of molecular sequences has become pervasive as datasets have expanded rapidly in both characters and species. For more than a decade, evolutionary relationships among members of the New World bat family Phyllostomidae inferred from morphological and molecular data have been in conflict. Here, we develop and apply methods to minimize systematic biases, uncover the biological mechanisms underlying phylogenetic conflict, and outline data requirements for future phylogenomic and morphological data collection. We introduce new morphological data for phyllostomids and outgroups and expand previous molecular analyses to eliminate methodological sources of phylogenetic conflict such as taxonomic sampling, sparse character sampling, or use of different algorithms to estimate the phylogeny. We also evaluate the impact of biological sources of conflict: saturation in morphological changes and molecular substitutions, and other processes that result in incongruent trees, including convergent morphological and molecular evolution. Methodological sources of incongruence play some role in generating phylogenetic conflict, and are relatively easy to eliminate by matching taxa, collecting more characters, and applying the same algorithms to optimize phylogeny. The evolutionary patterns uncovered are consistent with multiple biological sources of conflict, including saturation in morphological and molecular changes, adaptive morphological convergence among nectar-feeding lineages, and incongruent gene trees. Applying methods to account for nucleotide sequence saturation reduces, but does not completely eliminate, phylogenetic conflict. We ruled out paralogy, lateral gene transfer, and poor taxon sampling and outgroup choices among the processes leading to incongruent gene trees in phyllostomid bats. Uncovering and countering the possible effects of introgression and lineage sorting of ancestral polymorphism on gene trees will require great leaps in genomic and allelic sequencing in this species-rich mammalian family. We also found evidence for adaptive molecular evolution leading to convergence in mitochondrial proteins among nectar-feeding lineages. In conclusion, the biological processes that generate phylogenetic conflict are ubiquitous, and overcoming incongruence requires better models and more data than have been collected even in well-studied organisms such as phyllostomid bats. PMID:22891620
Genetic risk factors of systemic lupus erythematosus in the Malaysian population: a minireview.

PubMed

Chai, Hwa Chia; Phipps, Maude Elvira; Chua, Kek Heng

2012-01-01

SLE is an autoimmune disease that is not uncommon in Malaysia. In contrast to Malays and Indians, the Chinese seem to be most affected. SLE is characterized by deficiency of body's immune response that leads to production of autoantibodies and failure of immune complex clearance. This minireview attempts to summarize the association of several candidate genes with risk for SLE in the Malaysian population and discuss the genetic heterogeneity that exists locally in Asians and in comparison with SLE in Caucasians. Several groups of researchers have been actively investigating genes that are associated with SLE susceptibility in the Malaysian population by screening possible reported candidate genes across the SLE patients and healthy controls. These candidate genes include MHC genes and genes encoding complement components, TNF, FcγR, T-cell receptors, and interleukins. However, most of the polymorphisms investigated in these genes did not show significant associations with susceptibility to SLE in the Malaysian scenario, except for those occurring in MHC genes and genes coding for TNF-α, IL-1β, IL-1RN, and IL-6.
Genetic Risk Factors of Systemic Lupus Erythematosus in the Malaysian Population: A Minireview

PubMed Central

Chai, Hwa Chia; Phipps, Maude Elvira; Chua, Kek Heng

2012-01-01

SLE is an autoimmune disease that is not uncommon in Malaysia. In contrast to Malays and Indians, the Chinese seem to be most affected. SLE is characterized by deficiency of body's immune response that leads to production of autoantibodies and failure of immune complex clearance. This minireview attempts to summarize the association of several candidate genes with risk for SLE in the Malaysian population and discuss the genetic heterogeneity that exists locally in Asians and in comparison with SLE in Caucasians. Several groups of researchers have been actively investigating genes that are associated with SLE susceptibility in the Malaysian population by screening possible reported candidate genes across the SLE patients and healthy controls. These candidate genes include MHC genes and genes encoding complement components, TNF, FcγR, T-cell receptors, and interleukins. However, most of the polymorphisms investigated in these genes did not show significant associations with susceptibility to SLE in the Malaysian scenario, except for those occurring in MHC genes and genes coding for TNF-α, IL-1β, IL-1RN, and IL-6. PMID:21941582
Identification of candidate genes in Populus cell wall biosynthesis using text-mining, co-expression network and comparative genomics

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yang, Xiaohan; Ye, Chuyu; Bisaria, Anjali

2011-01-01

Populus is an important bioenergy crop for bioethanol production. A greater understanding of cell wall biosynthesis processes is critical in reducing biomass recalcitrance, a major hindrance in efficient generation of ethanol from lignocellulosic biomass. Here, we report the identification of candidate cell wall biosynthesis genes through the development and application of a novel bioinformatics pipeline. As a first step, via text-mining of PubMed publications, we obtained 121 Arabidopsis genes that had the experimental evidences supporting their involvement in cell wall biosynthesis or remodeling. The 121 genes were then used as bait genes to query an Arabidopsis co-expression database and additionalmore » genes were identified as neighbors of the bait genes in the network, increasing the number of genes to 548. The 548 Arabidopsis genes were then used to re-query the Arabidopsis co-expression database and re-construct a network that captured additional network neighbors, expanding to a total of 694 genes. The 694 Arabidopsis genes were computationally divided into 22 clusters. Queries of the Populus genome using the Arabidopsis genes revealed 817 Populus orthologs. Functional analysis of gene ontology and tissue-specific gene expression indicated that these Arabidopsis and Populus genes are high likelihood candidates for functional genomics in relation to cell wall biosynthesis.« less
The Influence of Genetics on Cystic Fibrosis Phenotypes

PubMed Central

Knowles, Michael R.; Drumm, Mitchell

2012-01-01

Technological advances in genetics have made feasible and affordable large studies to identify genetic variants that cause or modify a trait. Genetic studies have been carried out to assess variants in candidate genes, as well as polymorphisms throughout the genome, for their associations with heritable clinical outcomes of cystic fibrosis (CF), such as lung disease, meconium ileus, and CF-related diabetes. The candidate gene approach has identified some predicted relationships, while genome-wide surveys have identified several genes that would not have been obvious disease-modifying candidates, such as a methionine sulfoxide transferase gene that influences intestinal obstruction, or a region on chromosome 11 proximate to genes encoding a transcription factor and an apoptosis controller that associates with lung function. These unforeseen associations thus provide novel insight into disease pathophysiology, as well as suggesting new therapeutic strategies for CF. PMID:23209180
The Candidate Cancer Gene Database: a database of cancer driver genes from forward genetic screens in mice.

PubMed

Abbott, Kenneth L; Nyre, Erik T; Abrahante, Juan; Ho, Yen-Yi; Isaksson Vogel, Rachel; Starr, Timothy K

2015-01-01

Identification of cancer driver gene mutations is crucial for advancing cancer therapeutics. Due to the overwhelming number of passenger mutations in the human tumor genome, it is difficult to pinpoint causative driver genes. Using transposon mutagenesis in mice many laboratories have conducted forward genetic screens and identified thousands of candidate driver genes that are highly relevant to human cancer. Unfortunately, this information is difficult to access and utilize because it is scattered across multiple publications using different mouse genome builds and strength metrics. To improve access to these findings and facilitate meta-analyses, we developed the Candidate Cancer Gene Database (CCGD, http://ccgd-starrlab.oit.umn.edu/). The CCGD is a manually curated database containing a unified description of all identified candidate driver genes and the genomic location of transposon common insertion sites (CISs) from all currently published transposon-based screens. To demonstrate relevance to human cancer, we performed a modified gene set enrichment analysis using KEGG pathways and show that human cancer pathways are highly enriched in the database. We also used hierarchical clustering to identify pathways enriched in blood cancers compared to solid cancers. The CCGD is a novel resource available to scientists interested in the identification of genetic drivers of cancer. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Integrated Systems Biology Analysis of Transcriptomes Reveals Candidate Genes for Acidity Control in Developing Fruits of Sweet Orange (Citrus sinensis L. Osbeck).

PubMed

Huang, Dingquan; Zhao, Yihong; Cao, Minghao; Qiao, Liang; Zheng, Zhi-Liang

2016-01-01

Organic acids, such as citrate and malate, are important contributors for the sensory traits of fleshy fruits. Although their biosynthesis has been illustrated, regulatory mechanisms of acid accumulation remain to be dissected. To provide transcriptional architecture and identify candidate genes for citrate accumulation in fruits, we have selected for transcriptome analysis four varieties of sweet orange (Citrus sinensis L. Osbeck) with varying fruit acidity, Succari (acidless), Bingtang (low acid), and Newhall and Xinhui (normal acid). Fruits of these varieties at 45 days post anthesis (DPA), which corresponds to Stage I (cell division), had similar acidity, but they displayed differential acid accumulation at 142 DPA (Stage II, cell expansion). Transcriptomes of fruits at 45 and 142 DPA were profiled using RNA sequencing and analyzed with three different algorithms (Pearson correlation, gene coexpression network and surrogate variable analysis). Our network analysis shows that the acid-correlated genes belong to three distinct network modules. Several of these candidate fruit acidity genes encode regulatory proteins involved in transport (such as AHA10), degradation (such as APD2) and transcription (such as AIL6) and act as hubs in the citrate accumulation gene networks. Taken together, our integrated systems biology analysis has provided new insights into the fruit citrate accumulation gene network and led to the identification of candidate genes likely associated with the fruit acidity control.
Integrated Systems Biology Analysis of Transcriptomes Reveals Candidate Genes for Acidity Control in Developing Fruits of Sweet Orange (Citrus sinensis L. Osbeck)

PubMed Central

Huang, Dingquan; Zhao, Yihong; Cao, Minghao; Qiao, Liang; Zheng, Zhi-Liang

2016-01-01

Organic acids, such as citrate and malate, are important contributors for the sensory traits of fleshy fruits. Although their biosynthesis has been illustrated, regulatory mechanisms of acid accumulation remain to be dissected. To provide transcriptional architecture and identify candidate genes for citrate accumulation in fruits, we have selected for transcriptome analysis four varieties of sweet orange (Citrus sinensis L. Osbeck) with varying fruit acidity, Succari (acidless), Bingtang (low acid), and Newhall and Xinhui (normal acid). Fruits of these varieties at 45 days post anthesis (DPA), which corresponds to Stage I (cell division), had similar acidity, but they displayed differential acid accumulation at 142 DPA (Stage II, cell expansion). Transcriptomes of fruits at 45 and 142 DPA were profiled using RNA sequencing and analyzed with three different algorithms (Pearson correlation, gene coexpression network and surrogate variable analysis). Our network analysis shows that the acid-correlated genes belong to three distinct network modules. Several of these candidate fruit acidity genes encode regulatory proteins involved in transport (such as AHA10), degradation (such as APD2) and transcription (such as AIL6) and act as hubs in the citrate accumulation gene networks. Taken together, our integrated systems biology analysis has provided new insights into the fruit citrate accumulation gene network and led to the identification of candidate genes likely associated with the fruit acidity control. PMID:27092171
Effector-mediated discovery of a novel resistance gene against Bremia lactucae in a nonhost lettuce species.

PubMed

Giesbers, Anne K J; Pelgrom, Alexandra J E; Visser, Richard G F; Niks, Rients E; Van den Ackerveken, Guido; Jeuken, Marieke J W

2017-11-01

Candidate effectors from lettuce downy mildew (Bremia lactucae) enable high-throughput germplasm screening for the presence of resistance (R) genes. The nonhost species Lactuca saligna comprises a source of B. lactucae R genes that has hardly been exploited in lettuce breeding. Its cross-compatibility with the host species L. sativa enables the study of inheritance of nonhost resistance (NHR). We performed transient expression of candidate RXLR effector genes from B. lactucae in a diverse Lactuca germplasm set. Responses to two candidate effectors (BLR31 and BLN08) were genetically mapped and tested for co-segregation with disease resistance. BLN08 induced a hypersensitive response (HR) in 55% of the L. saligna accessions, but responsiveness did not co-segregate with resistance to Bl:24. BLR31 triggered an HR in 5% of the L. saligna accessions, and revealed a novel R gene providing complete B. lactucae race Bl:24 resistance. Resistant hybrid plants that were BLR31 nonresponsive indicated other unlinked R genes and/or nonhost QTLs. We have identified a candidate avirulence effector of B. lactucae (BLR31) and its cognate R gene in L. saligna. Concurrently, our results suggest that R genes are not required for NHR of L. saligna. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
Functional dissection of drought-responsive gene expression patterns in Cynodon dactylon L.

PubMed

Kim, Changsoo; Lemke, Cornelia; Paterson, Andrew H

2009-05-01

Water deficit is one of the main abiotic factors that affect plant productivity in subtropical regions. To identify genes induced during the water stress response in Bermudagrass (Cynodon dactylon), cDNA macroarrays were used. The macroarray analysis identified 189 drought-responsive candidate genes from C. dactylon, of which 120 were up-regulated and 69 were down-regulated. The candidate genes were classified into seven groups by cluster analysis of expression levels across two intensities and three durations of imposed stress. Annotation using BLASTX suggested that up-regulated genes may be involved in proline biosynthesis, signal transduction pathways, protein repair systems, and removal of toxins, while down-regulated genes were mostly related to basic plant metabolism such as photosynthesis and glycolysis. The functional classification of gene ontology (GO) was consistent with the BLASTX results, also suggesting some crosstalk between abiotic and biotic stress. Comparative analysis of cis-regulatory elements from the candidate genes implicated specific elements in drought response in Bermudagrass. Although only a subset of genes was studied, Bermudagrass shared many drought-responsive genes and cis-regulatory elements with other botanical models, supporting a strategy of cross-taxon application of drought-responsive genes, regulatory cues, and physiological-genetic information.
BioGPS and MyGene.info: organizing online, gene-centric information.

PubMed

Wu, Chunlei; Macleod, Ian; Su, Andrew I

2013-01-01

Fast-evolving technologies have enabled researchers to easily generate data at genome scale, and using these technologies to compare biological states typically results in a list of candidate genes. Researchers are then faced with the daunting task of prioritizing these candidate genes for follow-up studies. There are hundreds, possibly even thousands, of web-based gene annotation resources available, but it quickly becomes impractical to manually access and review all of these sites for each gene in a candidate gene list. BioGPS (http://biogps.org) was created as a centralized gene portal for aggregating distributed gene annotation resources, emphasizing community extensibility and user customizability. BioGPS serves as a convenient tool for users to access known gene-centric resources, as well as a mechanism to discover new resources that were previously unknown to the user. This article describes updates to BioGPS made after its initial release in 2008. We summarize recent additions of features and data, as well as the robust user activity that underlies this community intelligence application. Finally, we describe MyGene.info (http://mygene.info) and related web services that provide programmatic access to BioGPS.
Fine mapping of Restorer-of-fertility in pepper (Capsicum annuum L.) identified a candidate gene encoding a pentatricopeptide repeat (PPR)-containing protein.

PubMed

Jo, Yeong Deuk; Ha, Yeaseong; Lee, Joung-Ho; Park, Minkyu; Bergsma, Alex C; Choi, Hong-Il; Goritschnig, Sandra; Kloosterman, Bjorn; van Dijk, Peter J; Choi, Doil; Kang, Byoung-Cheorl

2016-10-01

Using fine mapping techniques, the genomic region co-segregating with Restorer - of - fertility ( Rf ) in pepper was delimited to a region of 821 kb in length. A PPR gene in this region, CaPPR6 , was identified as a strong candidate for Rf based on expression pattern and characteristics of encoding sequence. Cytoplasmic-genic male sterility (CGMS) has been used for the efficient production of hybrid seeds in peppers (Capsicum annuum L.). Although the mitochondrial candidate genes that might be responsible for cytoplasmic male sterility (CMS) have been identified, the nuclear Restorer-of-fertility (Rf) gene has not been isolated. To identify the genomic region co-segregating with Rf in pepper, we performed fine mapping using an Rf-segregating population consisting of 1068 F2 individuals, based on BSA-AFLP and a comparative mapping approach. Through six cycles of chromosome walking, the co-segregating region harboring the Rf locus was delimited to be within 821 kb of sequence. Prediction of expressed genes in this region based on transcription analysis revealed four candidate genes. Among these, CaPPR6 encodes a pentatricopeptide repeat (PPR) protein with PPR motifs that are repeated 14 times. Characterization of the CaPPR6 protein sequence, based on alignment with other homologs, showed that CaPPR6 is a typical Rf-like (RFL) gene reported to have undergone diversifying selection during evolution. A marker developed from a sequence near CaPPR6 showed a higher prediction rate of the Rf phenotype than those of previously developed markers when applied to a panel of breeding lines of diverse origin. These results suggest that CaPPR6 is a strong candidate for the Rf gene in pepper.
Analysis of shared homozygosity regions in Saudi siblings with attention deficit hyperactivity disorder

PubMed Central

Al Yemni, Eman A.A.; Alnaemi, Faten M.; Abebe, Dejene; Al-Abdulaziz, Basma S.; Al Mubarak, Bashayer R.; Ghaziuddin, Mohammad; Al Tassan, Nada A.

2017-01-01

Aim Genetic and clinical complexities are common features of most psychiatric illnesses that pose a major obstacle in risk-gene identification. Attention deficit hyperactivity disorder (ADHD) is the most prevalent child-onset psychiatric illness, with high heritability. Over the past decade, numerous genetic studies utilizing various approaches, such as genome-wide association, candidate-gene association, and linkage analysis, have identified a multitude of candidate loci/genes. However, such studies have yielded diverse findings that are rarely reproduced, indicating that other genetic determinants have not been discovered yet. In this study, we carried out sib-pair analysis on seven multiplex families with ADHD from Saudi Arabia. We aimed to identify the candidate chromosomal regions and genes linked to the disease. Patients and methods A total of 41 individuals from multiplex families were analyzed for shared regions of homozygosity. Genes within these regions were prioritized according to their potential relevance to ADHD. Results We identified multiple genomic regions spanning different chromosomes to be shared among affected members of each family; these included chromosomes 3, 5, 6, 7, 8, 9, 10, 13, 17, and 18. We also found specific regions on chromosomes 8 and 17 to be shared between affected individuals from more than one family. Among the genes present in the regions reported here were involved in neurotransmission (GRM3, SIGMAR1, CHAT, and SLC18A3) and members of the HLA gene family (HLA-A, HLA-DPA1, and MICC). Conclusion The candidate regions identified in this study highlight the genetic diversity of ADHD. Upon further investigation, these loci may reveal candidate genes that enclose variants associated with ADHD. Although most ADHD studies were conducted in other populations, our study provides insight from an understudied, ethnically interesting population. PMID:28452824
Analysis of shared homozygosity regions in Saudi siblings with attention deficit hyperactivity disorder.

PubMed

Shinwari, Jameela M A; Al Yemni, Eman A A; Alnaemi, Faten M; Abebe, Dejene; Al-Abdulaziz, Basma S; Al Mubarak, Bashayer R; Ghaziuddin, Mohammad; Al Tassan, Nada A

2017-08-01

Genetic and clinical complexities are common features of most psychiatric illnesses that pose a major obstacle in risk-gene identification. Attention deficit hyperactivity disorder (ADHD) is the most prevalent child-onset psychiatric illness, with high heritability. Over the past decade, numerous genetic studies utilizing various approaches, such as genome-wide association, candidate-gene association, and linkage analysis, have identified a multitude of candidate loci/genes. However, such studies have yielded diverse findings that are rarely reproduced, indicating that other genetic determinants have not been discovered yet. In this study, we carried out sib-pair analysis on seven multiplex families with ADHD from Saudi Arabia. We aimed to identify the candidate chromosomal regions and genes linked to the disease. A total of 41 individuals from multiplex families were analyzed for shared regions of homozygosity. Genes within these regions were prioritized according to their potential relevance to ADHD. We identified multiple genomic regions spanning different chromosomes to be shared among affected members of each family; these included chromosomes 3, 5, 6, 7, 8, 9, 10, 13, 17, and 18. We also found specific regions on chromosomes 8 and 17 to be shared between affected individuals from more than one family. Among the genes present in the regions reported here were involved in neurotransmission (GRM3, SIGMAR1, CHAT, and SLC18A3) and members of the HLA gene family (HLA-A, HLA-DPA1, and MICC). The candidate regions identified in this study highlight the genetic diversity of ADHD. Upon further investigation, these loci may reveal candidate genes that enclose variants associated with ADHD. Although most ADHD studies were conducted in other populations, our study provides insight from an understudied, ethnically interesting population.
Candidate genes and adaptive radiation: insights from transcriptional adaptation to the limnetic niche among coregonine fishes (Coregonus spp., Salmonidae).

PubMed

Jeukens, Julie; Bittner, David; Knudsen, Rune; Bernatchez, Louis

2009-01-01

In the past 40 years, there has been increasing acceptance that variation in levels of gene expression represents a major source of evolutionary novelty. Gene expression divergence is therefore likely to be involved in the emergence of incipient species, namely, in a context of adaptive radiation. In the lake whitefish species complex (Coregonus clupeaformis), previous microarray experiments have led to the identification of candidate genes potentially implicated in the parallel evolution of the limnetic dwarf lake whitefish, which is highly distinct from the benthic normal lake whitefish in life history, morphology, metabolism, and behavior, and yet diverged from it only approximately 15,000 years before present. The aim of the present study was to address transcriptional divergence for six candidate genes among lake whitefish and European whitefish (Coregonus lavaretus) species pairs, as well as lake cisco (Coregonus artedi) and vendace (Coregonus albula). The main goal was to test the hypothesis that parallel phenotypic adaptation toward the use of the limnetic niche in coregonine fishes is accompanied by parallelism in candidate gene transcription as measured by quantitative real-time polymerase chain reaction. Results obtained for three candidate genes, whereby parallelism in expression was observed across all whitefish species pairs, provide strong support for the hypothesis that divergent natural selection plays an important role in the adaptive radiation of whitefish species. However, this parallelism in expression did not extend to cisco and vendace, thereby infirming transcriptional convergence between limnetic whitefish species and their limnetic congeners for these genes. As recently proposed (Lynch 2007a. The evolution of genetic networks by non-adaptive processes. Nat Rev Genet. 8:803-813), these results may suggest that convergent phenotypic evolution can result from nonadaptive shaping of genome architecture in independently evolved coregonine lineages.
Quantifying the underlying landscape and paths of cancer

PubMed Central

Li, Chunhe; Wang, Jin

2014-01-01

Cancer is a disease regulated by the underlying gene networks. The emergence of normal and cancer states as well as the transformation between them can be thought of as a result of the gene network interactions and associated changes. We developed a global potential landscape and path framework to quantify cancer and associated processes. We constructed a cancer gene regulatory network based on the experimental evidences and uncovered the underlying landscape. The resulting tristable landscape characterizes important biological states: normal, cancer and apoptosis. The landscape topography in terms of barrier heights between stable state attractors quantifies the global stability of the cancer network system. We propose two mechanisms of cancerization: one is by the changes of landscape topography through the changes in regulation strengths of the gene networks. The other is by the fluctuations that help the system to go over the critical barrier at fixed landscape topography. The kinetic paths from least action principle quantify the transition processes among normal state, cancer state and apoptosis state. The kinetic rates provide the quantification of transition speeds among normal, cancer and apoptosis attractors. By the global sensitivity analysis of the gene network parameters on the landscape topography, we uncovered some key gene regulations determining the transitions between cancer and normal states. This can be used to guide the design of new anti-cancer tactics, through cocktail strategy of targeting multiple key regulation links simultaneously, for preventing cancer occurrence or transforming the early cancer state back to normal state. PMID:25232051
Where we stand, where we are moving: Surveying computational techniques for identifying miRNA genes and uncovering their regulatory role.

PubMed

Kleftogiannis, Dimitrios; Korfiati, Aigli; Theofilatos, Konstantinos; Likothanassis, Spiros; Tsakalidis, Athanasios; Mavroudi, Seferina

2013-06-01

Traditional biology was forced to restate some of its principles when the microRNA (miRNA) genes and their regulatory role were firstly discovered. Typically, miRNAs are small non-coding RNA molecules which have the ability to bind to the 3'untraslated region (UTR) of their mRNA target genes for cleavage or translational repression. Existing experimental techniques for their identification and the prediction of the target genes share some important limitations such as low coverage, time consuming experiments and high cost reagents. Hence, many computational methods have been proposed for these tasks to overcome these limitations. Recently, many researchers emphasized on the development of computational approaches to predict the participation of miRNA genes in regulatory networks and to analyze their transcription mechanisms. All these approaches have certain advantages and disadvantages which are going to be described in the present survey. Our work is differentiated from existing review papers by updating the methodologies list and emphasizing on the computational issues that arise from the miRNA data analysis. Furthermore, in the present survey, the various miRNA data analysis steps are treated as an integrated procedure whose aims and scope is to uncover the regulatory role and mechanisms of the miRNA genes. This integrated view of the miRNA data analysis steps may be extremely useful for all researchers even if they work on just a single step. Copyright © 2013 Elsevier Inc. All rights reserved.
Methods of Combinatorial Optimization to Reveal Factors Affecting Gene Length

PubMed Central

Bolshoy, Alexander; Tatarinova, Tatiana

2012-01-01

In this paper we present a novel method for genome ranking according to gene lengths. The main outcomes described in this paper are the following: the formulation of the genome ranking problem, presentation of relevant approaches to solve it, and the demonstration of preliminary results from prokaryotic genomes ordering. Using a subset of prokaryotic genomes, we attempted to uncover factors affecting gene length. We have demonstrated that hyperthermophilic species have shorter genes as compared with mesophilic organisms, which probably means that environmental factors affect gene length. Moreover, these preliminary results show that environmental factors group together in ranking evolutionary distant species. PMID:23300345
Global Transcriptome Changes Underlying Colony Growth in the Opportunistic Human Pathogen Aspergillus fumigatus

PubMed Central

Gibbons, John G.; Beauvais, Anne; Beau, Remi; McGary, Kriston L.

2012-01-01

Aspergillus fumigatus is the most common and deadly pulmonary fungal infection worldwide. In the lung, the fungus usually forms a dense colony of filaments embedded in a polymeric extracellular matrix. To identify candidate genes involved in this biofilm (BF) growth, we used RNA-Seq to compare the transcriptomes of BF and liquid plankton (PL) growth. Sequencing and mapping of tens of millions sequence reads against the A. fumigatus transcriptome identified 3,728 differentially regulated genes in the two conditions. Although many of these genes, including the ones coding for transcription factors, stress response, the ribosome, and the translation machinery, likely reflect the different growth demands in the two conditions, our experiment also identified hundreds of candidate genes for the observed differences in morphology and pathobiology between BF and PL. We found an overrepresentation of upregulated genes in transport, secondary metabolism, and cell wall and surface functions. Furthermore, upregulated genes showed significant spatial structure across the A. fumigatus genome; they were more likely to occur in subtelomeric regions and colocalized in 27 genomic neighborhoods, many of which overlapped with known or candidate secondary metabolism gene clusters. We also identified 1,164 genes that were downregulated. This gene set was not spatially structured across the genome and was overrepresented in genes participating in primary metabolic functions, including carbon and amino acid metabolism. These results add valuable insight into the genetics of biofilm formation in A. fumigatus and other filamentous fungi and identify many relevant, in the context of biofilm biology, candidate genes for downstream functional experiments. PMID:21724936
A database analysis method identifies an endogenous trans-acting short-interfering RNA that targets the Arabidopsis ARF2, ARF3, and ARF4 genes

PubMed Central

Williams, Leor; Carles, Cristel C.; Osmont, Karen S.; Fletcher, Jennifer C.

2005-01-01

Two classes of small RNAs, microRNAs and short-interfering RNA (siRNAs), have been extensively studied in plants and animals. In Arabidopsis, the capacity to uncover previously uncharacterized small RNAs by means of conventional strategies seems to be reaching its limits. To discover new plant small RNAs, we developed a protocol to mine an Arabidopsis nonannotated, noncoding EST database. Using this approach, we identified an endogenous small RNA, trans-acting short-interfering RNA–auxin response factor (tasiR-ARF), that shares a 21- and 22-nt region of sequence similarity with members of the ARF gene family. tasiR-ARF has characteristics of both short-interfering RNA and microRNA, recently defined as tasiRNA. Accumulation of trans-acting siRNA depends on DICER-LIKE1 and RNA-DEPENDENT RNA POLYMERASE6 but not RNA-DEPENDENT RNA POLYMERASE2. We demonstrate that tasiR-ARF targets three ARF genes, ARF2, ARF3/ETT, and ARF4, and that both the tasiR-ARF precursor and its target genes are evolutionarily conserved. The identification of tasiRNA-ARF as a low-abundance, previously uncharacterized small RNA species proves our method to be a useful tool to uncover additional small regulatory RNAs. PMID:15980147

Some links on this page may take you to non-federal websites. Their policies may differ from this site.