Suzuki, Hitoshi; Osaki, Ken; Sano, Kaori; Alam, A H M Khurshid; Nakamura, Yuichiro; Ishigaki, Yasuhito; Kawahara, Kozo; Tsukahara, Toshifumi
2011-02-18
Alternative splicing, which produces multiple mRNAs from a single gene, occurs in most human genes and contributes to protein diversity. Many alternative isoforms are expressed in a spatio-temporal manner, and function in diverse processes, including in the neural system. The purpose of the present study was to comprehensively investigate neural-splicing using P19 cells. GeneChip Exon Array analysis was performed using total RNAs purified from cells during neuronal cell differentiation. To efficiently and readily extract the alternative exon candidates, 9 filtering conditions were prepared, yielding 262 candidate exons (236 genes). Semiquantitative RT-PCR results in 30 randomly selected candidates suggested that 87% of the candidates were differentially alternatively spliced in neuronal cells compared to undifferentiated cells. Gene ontology and pathway analyses suggested that many of the candidate genes were associated with neural events. Together with 66 genes whose functions in neural cells or organs were reported previously, 47 candidate genes were found to be linked to 189 events in the gene-level profile of neural differentiation. By text-mining for the alternative isoform, distinct functions of the isoforms of 9 candidate genes indicated by the result of Exon Array were confirmed. Alternative exons were successfully extracted. Results from the informatics analyses suggested that neural events were primarily governed by genes whose expression was increased and whose transcripts were differentially alternatively spliced in the neuronal cells. In addition to known functions in neural cells or organs, the uninvestigated alternative splicing events of 11 genes among 47 candidate genes suggested that cell cycle events are also potentially important. These genes may help researchers to differentiate the roles of alternative splicing in cell differentiation and cell proliferation.
Analysis of Craniocardiac Malformations in Xenopus using Optical Coherence Tomography
Deniz, Engin; Jonas, Stephan; Hooper, Michael; N. Griffin, John; Choma, Michael A.; Khokha, Mustafa K.
2017-01-01
Birth defects affect 3% of children in the United States. Among the birth defects, congenital heart disease and craniofacial malformations are major causes of mortality and morbidity. Unfortunately, the genetic mechanisms underlying craniocardiac malformations remain largely uncharacterized. To address this, human genomic studies are identifying sequence variations in patients, resulting in numerous candidate genes. However, the molecular mechanisms of pathogenesis for most candidate genes are unknown. Therefore, there is a need for functional analyses in rapid and efficient animal models of human disease. Here, we coupled the frog Xenopus tropicalis with Optical Coherence Tomography (OCT) to create a fast and efficient system for testing craniocardiac candidate genes. OCT can image cross-sections of microscopic structures in vivo at resolutions approaching histology. Here, we identify optimal OCT imaging planes to visualize and quantitate Xenopus heart and facial structures establishing normative data. Next we evaluate known human congenital heart diseases: cardiomyopathy and heterotaxy. Finally, we examine craniofacial defects by a known human teratogen, cyclopamine. We recapitulate human phenotypes readily and quantify the functional and structural defects. Using this approach, we can quickly test human craniocardiac candidate genes for phenocopy as a critical first step towards understanding disease mechanisms of the candidate genes. PMID:28195132
SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate
Gretchen H. Roffler; Stephen J. Amish; Seth Smith; Ted Cosart; Marty Kardos; Michael K. Schwartz; Gordon Luikart
2016-01-01
Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding...
The cld mutation: narrowing the critical chromosomal region and selecting candidate genes.
Péterfy, Miklós; Mao, Hui Z; Doolittle, Mark H
2006-10-01
Combined lipase deficiency (cld) is a recessive, lethal mutation specific to the tw73 haplotype on mouse Chromosome 17. While the cld mutation results in lipase proteins that are inactive, aggregated, and retained in the endoplasmic reticulum (ER), it maps separately from the lipase structural genes. We have narrowed the gene critical region by about 50% using the tw18 haplotype for deletion mapping and a recombinant chromosome used originally to map cld with respect to the phenotypic marker tf. The region now extends from 22 to 25.6 Mbp on the wild-type chromosome, currently containing 149 genes and 50 expressed sequence tags (ESTs). To identify the affected gene, we have selected candidates based on their known role in associated biological processes, cellular components, and molecular functions that best fit with the predicted function of the cld gene. A secondary approach was based on differences in mRNA levels between mutant (cld/cld) and unaffected (+/cld) cells. Using both approaches, we have identified seven functional candidates with an ER localization and/or an involvement in protein maturation and folding that could explain the lipase deficiency, and six expression candidates that exhibit large differences in mRNA levels between mutant and unaffected cells. Significantly, two genes were found to be candidates with regard to both function and expression, thus emerging as the strongest candidates for cld. We discuss the implications of our mapping results and our selection of candidates with respect to other genes, deletions, and mutations occurring in the cld critical region.
Identifying metabolic enzymes with multiple types of association evidence
Kharchenko, Peter; Chen, Lifeng; Freund, Yoav; Vitkup, Dennis; Church, George M
2006-01-01
Background Existing large-scale metabolic models of sequenced organisms commonly include enzymatic functions which can not be attributed to any gene in that organism. Existing computational strategies for identifying such missing genes rely primarily on sequence homology to known enzyme-encoding genes. Results We present a novel method for identifying genes encoding for a specific metabolic function based on a local structure of metabolic network and multiple types of functional association evidence, including clustering of genes on the chromosome, similarity of phylogenetic profiles, gene expression, protein fusion events and others. Using E. coli and S. cerevisiae metabolic networks, we illustrate predictive ability of each individual type of association evidence and show that significantly better predictions can be obtained based on the combination of all data. In this way our method is able to predict 60% of enzyme-encoding genes of E. coli metabolism within the top 10 (out of 3551) candidates for their enzymatic function, and as a top candidate within 43% of the cases. Conclusion We illustrate that a combination of genome context and other functional association evidence is effective in predicting genes encoding metabolic enzymes. Our approach does not rely on direct sequence homology to known enzyme-encoding genes, and can be used in conjunction with traditional homology-based metabolic reconstruction methods. The method can also be used to target orphan metabolic activities. PMID:16571130
EBF factors drive expression of multiple classes of target genes governing neuronal development.
Green, Yangsook S; Vetter, Monica L
2011-04-30
Early B cell factor (EBF) family members are transcription factors known to have important roles in several aspects of vertebrate neurogenesis, including commitment, migration and differentiation. Knowledge of how EBF family members contribute to neurogenesis is limited by a lack of detailed understanding of genes that are transcriptionally regulated by these factors. We performed a microarray screen in Xenopus animal caps to search for targets of EBF transcriptional activity, and identified candidate targets with multiple roles, including transcription factors of several classes. We determined that, among the most upregulated candidate genes with expected neuronal functions, most require EBF activity for some or all of their expression, and most have overlapping expression with ebf genes. We also found that the candidate target genes that had the most strongly overlapping expression patterns with ebf genes were predicted to be direct transcriptional targets of EBF transcriptional activity. The identification of candidate targets that are transcription factor genes, including nscl-1, emx1 and aml1, improves our understanding of how EBF proteins participate in the hierarchy of transcription control during neuronal development, and suggests novel mechanisms by which EBF activity promotes migration and differentiation. Other candidate targets, including pcdh8 and kcnk5, expand our knowledge of the types of terminal differentiated neuronal functions that EBF proteins regulate.
Functional and Genomic Features of Human Genes Mutated in Neuropsychiatric Disorders.
Forero, Diego A; Prada, Carlos F; Perry, George
2016-01-01
In recent years, a large number of studies around the world have led to the identification of causal genes for hereditary types of common and rare neurological and psychiatric disorders. To explore the functional and genomic features of known human genes mutated in neuropsychiatric disorders. A systematic search was used to develop a comprehensive catalog of genes mutated in neuropsychiatric disorders (NPD). Functional enrichment and protein-protein interaction analyses were carried out. A false discovery rate approach was used for correction for multiple testing. We found several functional categories that are enriched among NPD genes, such as gene ontologies, protein domains, tissue expression, signaling pathways and regulation by brain-expressed miRNAs and transcription factors. Sixty six of those NPD genes are known to be druggable. Several topographic parameters of protein-protein interaction networks and the degree of conservation between orthologous genes were identified as significant among NPD genes. These results represent one of the first analyses of enrichment of functional categories of genes known to harbor mutations for NPD. These findings could be useful for a future creation of computational tools for prioritization of novel candidate genes for NPD.
Functional and Genomic Features of Human Genes Mutated in Neuropsychiatric Disorders
Forero, Diego A.; Prada, Carlos F.; Perry, George
2016-01-01
Background: In recent years, a large number of studies around the world have led to the identification of causal genes for hereditary types of common and rare neurological and psychiatric disorders. Objective: To explore the functional and genomic features of known human genes mutated in neuropsychiatric disorders. Methods: A systematic search was used to develop a comprehensive catalog of genes mutated in neuropsychiatric disorders (NPD). Functional enrichment and protein-protein interaction analyses were carried out. A false discovery rate approach was used for correction for multiple testing. Results: We found several functional categories that are enriched among NPD genes, such as gene ontologies, protein domains, tissue expression, signaling pathways and regulation by brain-expressed miRNAs and transcription factors. Sixty six of those NPD genes are known to be druggable. Several topographic parameters of protein-protein interaction networks and the degree of conservation between orthologous genes were identified as significant among NPD genes. Conclusion: These results represent one of the first analyses of enrichment of functional categories of genes known to harbor mutations for NPD. These findings could be useful for a future creation of computational tools for prioritization of novel candidate genes for NPD. PMID:27990183
Discovering novel subsystems using comparative genomics
Ferrer, Luciana; Shearer, Alexander G.; Karp, Peter D.
2011-01-01
Motivation: Key problems for computational genomics include discovering novel pathways in genome data, and discovering functional interaction partners for genes to define new members of partially elucidated pathways. Results: We propose a novel method for the discovery of subsystems from annotated genomes. For each gene pair, a score measuring the likelihood that the two genes belong to a same subsystem is computed using genome context methods. Genes are then grouped based on these scores, and the resulting groups are filtered to keep only high-confidence groups. Since the method is based on genome context analysis, it relies solely on structural annotation of the genomes. The method can be used to discover new pathways, find missing genes from a known pathway, find new protein complexes or other kinds of functional groups and assign function to genes. We tested the accuracy of our method in Escherichia coli K-12. In one configuration of the system, we find that 31.6% of the candidate groups generated by our method match a known pathway or protein complex closely, and that we rediscover 31.2% of all known pathways and protein complexes of at least 4 genes. We believe that a significant proportion of the candidates that do not match any known group in E.coli K-12 corresponds to novel subsystems that may represent promising leads for future laboratory research. We discuss in-depth examples of these findings. Availability: Predicted subsystems are available at http://brg.ai.sri.com/pwy-discovery/journal.html. Contact: lferrer@ai.sri.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:21775308
Chasman, Daniel I; Fuchsberger, Christian; Pattaro, Cristian; Teumer, Alexander; Böger, Carsten A; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Taliun, Daniel; Li, Man; Gao, Xiaoyi; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C; O'Seaghdha, Conall M; Glazer, Nicole; Isaacs, Aaron; Liu, Ching-Ti; Smith, Albert V; O'Connell, Jeffrey R; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Johnson, Andrew D; Gierman, Hinco J; Feitosa, Mary F; Hwang, Shih-Jen; Atkinson, Elizabeth J; Lohman, Kurt; Cornelis, Marilyn C; Johansson, Asa; Tönjes, Anke; Dehghan, Abbas; Lambert, Jean-Charles; Holliday, Elizabeth G; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y; Murgia, Federico; Trompet, Stella; Imboden, Medea; Coassin, Stefan; Pistis, Giorgio; Harris, Tamara B; Launer, Lenore J; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D; Boerwinkle, Eric; Schmidt, Helena; Cavalieri, Margherita; Rao, Madhumathi; Hu, Frank; Demirkan, Ayse; Oostra, Ben A; de Andrade, Mariza; Turner, Stephen T; Ding, Jingzhong; Andrews, Jeanette S; Freedman, Barry I; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Meisinger, Christa; Gieger, Christian; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H; Wright, Alan F; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G; Rivadeneira, Fernando; Aulchenko, Yurii S; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Ketkar, Shamika; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K; Portas, Laura; Ford, Ian; Buckley, Brendan M; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Kim, Stuart K; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J Wouter; Probst-Hensch, Nicole M; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; Siscovick, David S; van Duijn, Cornelia M; Borecki, Ingrid B; Kardia, Sharon L R; Liu, Yongmei; Curhan, Gary C; Rudan, Igor; Gyllensten, Ulf; Wilson, James F; Franke, Andre; Pramstaller, Peter P; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M; Parsa, Afshin; Bochud, Murielle; Heid, Iris M; Kao, W H Linda; Fox, Caroline S; Köttgen, Anna
2012-12-15
In conducting genome-wide association studies (GWAS), analytical approaches leveraging biological information may further understanding of the pathophysiology of clinical traits. To discover novel associations with estimated glomerular filtration rate (eGFR), a measure of kidney function, we developed a strategy for integrating prior biological knowledge into the existing GWAS data for eGFR from the CKDGen Consortium. Our strategy focuses on single nucleotide polymorphism (SNPs) in genes that are connected by functional evidence, determined by literature mining and gene ontology (GO) hierarchies, to genes near previously validated eGFR associations. It then requires association thresholds consistent with multiple testing, and finally evaluates novel candidates by independent replication. Among the samples of European ancestry, we identified a genome-wide significant SNP in FBXL20 (P = 5.6 × 10(-9)) in meta-analysis of all available data, and additional SNPs at the INHBC, LRP2, PLEKHA1, SLC3A2 and SLC7A6 genes meeting multiple-testing corrected significance for replication and overall P-values of 4.5 × 10(-4)-2.2 × 10(-7). Neither the novel PLEKHA1 nor FBXL20 associations, both further supported by association with eGFR among African Americans and with transcript abundance, would have been implicated by eGFR candidate gene approaches. LRP2, encoding the megalin receptor, was identified through connection with the previously known eGFR gene DAB2 and extends understanding of the megalin system in kidney function. These findings highlight integration of existing genome-wide association data with independent biological knowledge to uncover novel candidate eGFR associations, including candidates lacking known connections to kidney-specific pathways. The strategy may also be applicable to other clinical phenotypes, although more testing will be needed to assess its potential for discovery in general.
Chasman, Daniel I.; Fuchsberger, Christian; Pattaro, Cristian; Teumer, Alexander; Böger, Carsten A.; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Taliun, Daniel; Li, Man; Gao, Xiaoyi; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C.; O'Seaghdha, Conall M.; Glazer, Nicole; Isaacs, Aaron; Liu, Ching-Ti; Smith, Albert V.; O'Connell, Jeffrey R.; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Johnson, Andrew D.; Gierman, Hinco J.; Feitosa, Mary F.; Hwang, Shih-Jen; Atkinson, Elizabeth J.; Lohman, Kurt; Cornelis, Marilyn C.; Johansson, Åsa; Tönjes, Anke; Dehghan, Abbas; Lambert, Jean-Charles; Holliday, Elizabeth G.; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y.; Murgia, Federico; Trompet, Stella; Imboden, Medea; Coassin, Stefan; Pistis, Giorgio; Harris, Tamara B.; Launer, Lenore J.; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D.; Boerwinkle, Eric; Schmidt, Helena; Cavalieri, Margherita; Rao, Madhumathi; Hu, Frank; Demirkan, Ayse; Oostra, Ben A.; de Andrade, Mariza; Turner, Stephen T.; Ding, Jingzhong; Andrews, Jeanette S.; Freedman, Barry I.; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Meisinger, Christa; Gieger, Christian; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E.; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H.; Wright, Alan F.; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K.; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G.; Rivadeneira, Fernando; Aulchenko, Yurii S.; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Ketkar, Shamika; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K.; Portas, Laura; Ford, Ian; Buckley, Brendan M.; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Kim, Stuart K.; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J. Wouter; Probst-Hensch, Nicole M.; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R.; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; Siscovick, David S.; van Duijn, Cornelia M.; Borecki, Ingrid B.; Kardia, Sharon L.R.; Liu, Yongmei; Curhan, Gary C.; Rudan, Igor; Gyllensten, Ulf; Wilson, James F.; Franke, Andre; Pramstaller, Peter P.; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M; Parsa, Afshin; Bochud, Murielle; Heid, Iris M.; Kao, W.H. Linda; Fox, Caroline S.; Köttgen, Anna
2012-01-01
In conducting genome-wide association studies (GWAS), analytical approaches leveraging biological information may further understanding of the pathophysiology of clinical traits. To discover novel associations with estimated glomerular filtration rate (eGFR), a measure of kidney function, we developed a strategy for integrating prior biological knowledge into the existing GWAS data for eGFR from the CKDGen Consortium. Our strategy focuses on single nucleotide polymorphism (SNPs) in genes that are connected by functional evidence, determined by literature mining and gene ontology (GO) hierarchies, to genes near previously validated eGFR associations. It then requires association thresholds consistent with multiple testing, and finally evaluates novel candidates by independent replication. Among the samples of European ancestry, we identified a genome-wide significant SNP in FBXL20 (P = 5.6 × 10−9) in meta-analysis of all available data, and additional SNPs at the INHBC, LRP2, PLEKHA1, SLC3A2 and SLC7A6 genes meeting multiple-testing corrected significance for replication and overall P-values of 4.5 × 10−4–2.2 × 10−7. Neither the novel PLEKHA1 nor FBXL20 associations, both further supported by association with eGFR among African Americans and with transcript abundance, would have been implicated by eGFR candidate gene approaches. LRP2, encoding the megalin receptor, was identified through connection with the previously known eGFR gene DAB2 and extends understanding of the megalin system in kidney function. These findings highlight integration of existing genome-wide association data with independent biological knowledge to uncover novel candidate eGFR associations, including candidates lacking known connections to kidney-specific pathways. The strategy may also be applicable to other clinical phenotypes, although more testing will be needed to assess its potential for discovery in general. PMID:22962313
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ranjan, Priya; Yin, Tongming; Zhang, Xinye
2009-11-01
Quantitative trait locus (QTL) studies are an integral part of plant research and are used to characterize the genetic basis of phenotypic variation observed in structured populations and inform marker-assisted breeding efforts. These QTL intervals can span large physical regions on a chromosome comprising hundreds of genes, thereby hampering candidate gene identification. Genome history, evolution, and expression evidence can be used to narrow the genes in the interval to a smaller list that is manageable for detailed downstream functional genomics characterization. Our primary motivation for the present study was to address the need for a research methodology that identifies candidatemore » genes within a broad QTL interval. Here we present a bioinformatics-based approach for subdividing candidate genes within QTL intervals into alternate groups of high probability candidates. Application of this approach in the context of studying cell wall traits, specifically lignin content and S/G ratios of stem and root in Populus plants, resulted in manageable sets of genes of both known and putative cell wall biosynthetic function. These results provide a roadmap for future experimental work leading to identification of new genes controlling cell wall recalcitrance and, ultimately, in the utility of plant biomass as an energy feedstock.« less
Prioritization of Disease Susceptibility Genes Using LSM/SVD.
Gong, Lejun; Yang, Ronggen; Yan, Qin; Sun, Xiao
2013-12-01
Understanding the role of genetics in diseases is one of the most important tasks in the postgenome era. It is generally too expensive and time consuming to perform experimental validation for all candidate genes related to disease. Computational methods play important roles for prioritizing these candidates. Herein, we propose an approach to prioritize disease genes using latent semantic mapping based on singular value decomposition. Our hypothesis is that similar functional genes are likely to cause similar diseases. Measuring the functional similarity between known disease susceptibility genes and unknown genes is to predict new disease susceptibility genes. Taking autism as an instance, the analysis results of the top ten genes prioritized demonstrate they might be autism susceptibility genes, which also indicates our approach could discover new disease susceptibility genes. The novel approach of disease gene prioritization could discover new disease susceptibility genes, and latent disease-gene relations. The prioritized results could also support the interpretive diversity and experimental views as computational evidence for disease researchers.
2011-01-01
Background Several computational candidate gene selection and prioritization methods have recently been developed. These in silico selection and prioritization techniques are usually based on two central approaches - the examination of similarities to known disease genes and/or the evaluation of functional annotation of genes. Each of these approaches has its own caveats. Here we employ a previously described method of candidate gene prioritization based mainly on gene annotation, in accompaniment with a technique based on the evaluation of pertinent sequence motifs or signatures, in an attempt to refine the gene prioritization approach. We apply this approach to X-linked mental retardation (XLMR), a group of heterogeneous disorders for which some of the underlying genetics is known. Results The gene annotation-based binary filtering method yielded a ranked list of putative XLMR candidate genes with good plausibility of being associated with the development of mental retardation. In parallel, a motif finding approach based on linear discriminatory analysis (LDA) was employed to identify short sequence patterns that may discriminate XLMR from non-XLMR genes. High rates (>80%) of correct classification was achieved, suggesting that the identification of these motifs effectively captures genomic signals associated with XLMR vs. non-XLMR genes. The computational tools developed for the motif-based LDA is integrated into the freely available genomic analysis portal Galaxy (http://main.g2.bx.psu.edu/). Nine genes (APLN, ZC4H2, MAGED4, MAGED4B, RAP2C, FAM156A, FAM156B, TBL1X, and UXT) were highlighted as highly-ranked XLMR methods. Conclusions The combination of gene annotation information and sequence motif-orientated computational candidate gene prediction methods highlight an added benefit in generating a list of plausible candidate genes, as has been demonstrated for XLMR. Reviewers: This article was reviewed by Dr Barbara Bardoni (nominated by Prof Juergen Brosius); Prof Neil Smalheiser and Dr Dustin Holloway (nominated by Prof Charles DeLisi). PMID:21668950
Gladitz, Josef; Klink, Barbara; Seifert, Michael
2018-06-11
Oligodendrogliomas are primary human brain tumors with a characteristic 1p/19q co-deletion of important prognostic relevance, but little is known about the pathology of this chromosomal mutation. We developed a network-based approach to identify novel cancer gene candidates in the region of the 1p/19q co-deletion. Gene regulatory networks were learned from gene expression and copy number data of 178 oligodendrogliomas and further used to quantify putative impacts of differentially expressed genes of the 1p/19q region on cancer-relevant pathways. We predicted 8 genes with strong impact on signaling pathways and 14 genes with strong impact on metabolic pathways widespread across the region of the 1p/19 co-deletion. Many of these candidates (e.g. ELTD1, SDHB, SEPW1, SLC17A7, SZRD1, THAP3, ZBTB17) are likely to push, whereas others (e.g. CAP1, HBXIP, KLK6, PARK7, PTAFR) might counteract oligodendroglioma development. For example, ELTD1, a functionally validated glioblastoma oncogene located on 1p, was overexpressed. Further, the known glioblastoma tumor suppressor SLC17A7 located on 19q was underexpressed. Moreover, known epigenetic alterations triggered by mutated SDHB in paragangliomas suggest that underexpressed SDHB in oligodendrogliomas may support and possibly enhance the epigenetic reprogramming induced by the IDH-mutation. We further analyzed rarely observed deletions and duplications of chromosomal arms within oligodendroglioma subcohorts identifying putative oncogenes and tumor suppressors that possibly influence the development of oligodendroglioma subgroups. Our in-depth computational study contributes to a better understanding of the pathology of the 1p/19q co-deletion and other chromosomal arm mutations. This might open opportunities for functional validations and new therapeutic strategies.
Gibbons, John G.; Beauvais, Anne; Beau, Remi; McGary, Kriston L.
2012-01-01
Aspergillus fumigatus is the most common and deadly pulmonary fungal infection worldwide. In the lung, the fungus usually forms a dense colony of filaments embedded in a polymeric extracellular matrix. To identify candidate genes involved in this biofilm (BF) growth, we used RNA-Seq to compare the transcriptomes of BF and liquid plankton (PL) growth. Sequencing and mapping of tens of millions sequence reads against the A. fumigatus transcriptome identified 3,728 differentially regulated genes in the two conditions. Although many of these genes, including the ones coding for transcription factors, stress response, the ribosome, and the translation machinery, likely reflect the different growth demands in the two conditions, our experiment also identified hundreds of candidate genes for the observed differences in morphology and pathobiology between BF and PL. We found an overrepresentation of upregulated genes in transport, secondary metabolism, and cell wall and surface functions. Furthermore, upregulated genes showed significant spatial structure across the A. fumigatus genome; they were more likely to occur in subtelomeric regions and colocalized in 27 genomic neighborhoods, many of which overlapped with known or candidate secondary metabolism gene clusters. We also identified 1,164 genes that were downregulated. This gene set was not spatially structured across the genome and was overrepresented in genes participating in primary metabolic functions, including carbon and amino acid metabolism. These results add valuable insight into the genetics of biofilm formation in A. fumigatus and other filamentous fungi and identify many relevant, in the context of biofilm biology, candidate genes for downstream functional experiments. PMID:21724936
Clark, Jo-Anna B J; Tully, Sara J; Dawn Marshall, H
2014-12-01
Hereditary hyperplastic gingivitis (HHG) is an autosomal recessive disease that presents with progressive gingival proliferation in farmed silver foxes. Hereditary gingival fibromatosis (HGF) is an analogous condition in humans that is genetically heterogeneous with several known autosomal dominant loci. For one locus the causative mutation is in the Son of sevenless homologue 1 (SOS1) gene. For the remaining loci, the molecular mechanisms are unknown but Ras pathway involvement is suspected. Here we compare sequences for the SOS1 gene, and two adjacent genes in the Ras pathway, growth receptor bound protein 2 (GRB2) and epidermal growth factor receptor (EGFR), between HHG-affected and unaffected foxes. We conclude that the known HGF causative mutation does not cause HHG in foxes, nor do the coding regions or intron-exon boundaries of these three genes contain any candidate mutations for fox gum disease. Patterns of molecular evolution among foxes and other mammals reflect high conservation and strong functional constraints for SOS1 and GRB2 but reveal a lineage-specific pattern of variability in EGFR consistent with mutational rate differences, relaxed functional constraints, and possibly positive selection.
Morton, Nicholas M.; Nelson, Yvonne B.; Michailidou, Zoi; Di Rollo, Emma M.; Ramage, Lynne; Hadoke, Patrick W. F.; Seckl, Jonathan R.; Bunger, Lutz; Horvat, Simon; Kenyon, Christopher J.; Dunbar, Donald R.
2011-01-01
Background Obesity and metabolic syndrome results from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F) mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L) strain. Results To enrich for adipose tissue obesity genes a ‘snap-shot’ pooled-sample transcriptome comparison of key fat depots and non adipose tissues (muscle, liver, kidney) was performed. Known obesity quantitative trait loci (QTL) information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1, and Np3r) as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr) that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria were characterised, revealing novel functional effects consistent with a contribution to obesity. Conclusions A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes contributing to obesity. PMID:21915269
Morton, Nicholas M; Nelson, Yvonne B; Michailidou, Zoi; Di Rollo, Emma M; Ramage, Lynne; Hadoke, Patrick W F; Seckl, Jonathan R; Bunger, Lutz; Horvat, Simon; Kenyon, Christopher J; Dunbar, Donald R
2011-01-01
Obesity and metabolic syndrome results from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F) mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L) strain. To enrich for adipose tissue obesity genes a 'snap-shot' pooled-sample transcriptome comparison of key fat depots and non adipose tissues (muscle, liver, kidney) was performed. Known obesity quantitative trait loci (QTL) information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1, and Np3r) as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr) that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria were characterised, revealing novel functional effects consistent with a contribution to obesity. A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes contributing to obesity.
Sharp, Peter; Dong, Chongmei
2014-01-01
TILLING is widely used in plant functional genomics. Mutagenesis and SNP detection is combined to allow for the isolation of mutations in genes of interest. It can also be used as a plant breeding tool, whereby variation in known or candidate genes of interest to breeding programs is generated. Here we describe a simple low-cost TILLING procedure.
Wang, Nan; Zhang, Yeting; Gedvilaite, Erika; Loh, Jui Wan; Lin, Timothy; Liu, Xiuping; Liu, Chang-Gong; Kumar, Dibyendu; Donnelly, Robert; Raymond, Kimiyo; Schuchman, Edward H; Sleat, David E; Lobel, Peter; Xing, Jinchuan
2017-11-01
Lysosomes are membrane-bound, acidic eukaryotic cellular organelles that play important roles in the degradation of macromolecules. Mutations that cause the loss of lysosomal protein function can lead to a group of disorders categorized as the lysosomal storage diseases (LSDs). Suspicion of LSD is frequently based on clinical and pathologic findings, but in some cases, the underlying genetic and biochemical defects remain unknown. Here, we performed whole-exome sequencing (WES) on 14 suspected LSD cases to evaluate the feasibility of using WES for identifying causal mutations. By examining 2,157 candidate genes potentially associated with lysosomal function, we identified eight variants in five genes as candidate disease-causing variants in four individuals. These included both known and novel mutations. Variants were corroborated by targeted sequencing and, when possible, functional assays. In addition, we identified nonsense mutations in two individuals in genes that are not known to have lysosomal function. However, mutations in these genes could have resulted in phenotypes that were diagnosed as LSDs. This study demonstrates that WES can be used to identify causal mutations in suspected LSD cases. We also demonstrate cases where a confounding clinical phenotype may potentially reflect more than one lysosomal protein defect. © 2017 Wiley Periodicals, Inc.
Marra, Nicholas J; Eo, Soo Hyung; Hale, Matthew C; Waser, Peter M; DeWoody, J Andrew
2012-12-01
One common goal in evolutionary biology is the identification of genes underlying adaptive traits of evolutionary interest. Recently next-generation sequencing techniques have greatly facilitated such evolutionary studies in species otherwise depauperate of genomic resources. Kangaroo rats (Dipodomys sp.) serve as exemplars of adaptation in that they inhabit extremely arid environments, yet require no drinking water because of ultra-efficient kidney function and osmoregulation. As a basis for identifying water conservation genes in kangaroo rats, we conducted a priori bioinformatics searches in model rodents (Mus musculus and Rattus norvegicus) to identify candidate genes with known or suspected osmoregulatory function. We then obtained 446,758 reads via 454 pyrosequencing to characterize genes expressed in the kidney of banner-tailed kangaroo rats (Dipodomys spectabilis). We also determined candidates a posteriori by identifying genes that were overexpressed in the kidney. The kangaroo rat sequences revealed nine different a priori candidate genes predicted from our Mus and Rattus searches, as well as 32 a posteriori candidate genes that were overexpressed in kidney. Mutations in two of these genes, Slc12a1 and Slc12a3, cause human renal diseases that result in the inability to concentrate urine. These genes are likely key determinants of physiological water conservation in desert rodents. Copyright © 2012 Elsevier Inc. All rights reserved.
Matimba, Alice; Li, Fang; Livshits, Alina; Cartwright, Cher S; Scully, Stephen; Fridley, Brooke L; Jenkins, Gregory; Batzler, Anthony; Wang, Liewei; Weinshilboum, Richard; Lennard, Lynne
2014-01-01
Aim We investigated candidate genes associated with thiopurine metabolism and clinical response in childhood acute lymphoblastic leukemia. Materials & methods We performed genome-wide SNP association studies of 6-thioguanine and 6-mercaptopurine cytotoxicity using lymphoblastoid cell lines. We then genotyped the top SNPs associated with lymphoblastoid cell line cytotoxicity, together with tagSNPs for genes in the ‘thiopurine pathway’ (686 total SNPs), in DNA from 589 Caucasian UK ALL97 patients. Functional validation studies were performed by siRNA knockdown in cancer cell lines. Results SNPs in the thiopurine pathway genes ABCC4, ABCC5, IMPDH1, ITPA, SLC28A3 and XDH, and SNPs located within or near ATP6AP2, FRMD4B, GNG2, KCNMA1 and NME1, were associated with clinical response and measures of thiopurine metabolism. Functional validation showed shifts in cytotoxicity for these genes. Conclusion The clinical response to thiopurines may be regulated by variation in known thiopurine pathway genes and additional novel genes outside of the thiopurine pathway. PMID:24624911
Silva, C; Garcia-Mas, J; Sánchez, A M; Arús, P; Oliveira, M M
2005-03-01
Blooming time is one of the most important agronomic traits in almond. Biochemical and molecular events underlying flowering regulation must be understood before methods to stimulate late flowering can be developed. Attempts to elucidate the genetic control of this process have led to the identification of a major gene (Lb) and quantitative trait loci (QTLs) linked to observed phenotypic differences, but although this gene and these QTLs have been placed on the Prunus reference genetic map, their sequences and specific functions remain unknown. The aim of our investigation was to associate these loci with known genes using a candidate gene approach. Two almond cDNAs and eight Prunus expressed sequence tags were selected as candidate genes (CGs) since their sequences were highly identical to those of flowering regulatory genes characterized in other species. The CGs were amplified from both parental lines of the mapping population using specific primers. Sequence comparison revealed DNA polymorphisms between the parental lines, mainly of the single nucleotide type. Polymorphisms were used to develop co-dominant cleaved amplified polymorphic sequence markers or length polymorphisms based on insertion/deletion events for mapping the candidate genes on the Prunus reference map. Ten candidate genes were assigned to six linkage groups in the Prunus genome. The positions of two of these were compatible with the regions where two QTLs for blooming time were detected. One additional candidate was localized close to the position of the Evergrowing gene, which determines a non-deciduous behaviour in peach.
Riazuddin, S; Hussain, M; Razzaq, A; Iqbal, Z; Shahzad, M; Polla, D L; Song, Y; van Beusekom, E; Khan, A A; Tomas-Roca, L; Rashid, M; Zahoor, M Y; Wissink-Lindhout, W M; Basra, M A R; Ansar, M; Agha, Z; van Heeswijk, K; Rasheed, F; Van de Vorst, M; Veltman, J A; Gilissen, C; Akram, J; Kleefstra, T; Assir, M Z; Grozeva, D; Carss, K; Raymond, F L; O'Connor, T D; Riazuddin, S A; Khan, S N; Ahmed, Z M; de Brouwer, A P M; van Bokhoven, H; Riazuddin, S
2017-01-01
Intellectual disability (ID) is a clinically and genetically heterogeneous disorder, affecting 1–3% of the general population. Although research into the genetic causes of ID has recently gained momentum, identification of pathogenic mutations that cause autosomal recessive ID (ARID) has lagged behind, predominantly due to non-availability of sizeable families. Here we present the results of exome sequencing in 121 large consanguineous Pakistani ID families. In 60 families, we identified homozygous or compound heterozygous DNA variants in a single gene, 30 affecting reported ID genes and 30 affecting novel candidate ID genes. Potential pathogenicity of these alleles was supported by co-segregation with the phenotype, low frequency in control populations and the application of stringent bioinformatics analyses. In another eight families segregation of multiple pathogenic variants was observed, affecting 19 genes that were either known or are novel candidates for ID. Transcriptome profiles of normal human brain tissues showed that the novel candidate ID genes formed a network significantly enriched for transcriptional co-expression (P<0.0001) in the frontal cortex during fetal development and in the temporal–parietal and sub-cortex during infancy through adulthood. In addition, proteins encoded by 12 novel ID genes directly interact with previously reported ID proteins in six known pathways essential for cognitive function (P<0.0001). These results suggest that disruptions of temporal parietal and sub-cortical neurogenesis during infancy are critical to the pathophysiology of ID. These findings further expand the existing repertoire of genes involved in ARID, and provide new insights into the molecular mechanisms and the transcriptome map of ID. PMID:27457812
Riazuddin, S; Hussain, M; Razzaq, A; Iqbal, Z; Shahzad, M; Polla, D L; Song, Y; van Beusekom, E; Khan, A A; Tomas-Roca, L; Rashid, M; Zahoor, M Y; Wissink-Lindhout, W M; Basra, M A R; Ansar, M; Agha, Z; van Heeswijk, K; Rasheed, F; Van de Vorst, M; Veltman, J A; Gilissen, C; Akram, J; Kleefstra, T; Assir, M Z; Grozeva, D; Carss, K; Raymond, F L; O'Connor, T D; Riazuddin, S A; Khan, S N; Ahmed, Z M; de Brouwer, A P M; van Bokhoven, H; Riazuddin, S
2017-11-01
Intellectual disability (ID) is a clinically and genetically heterogeneous disorder, affecting 1-3% of the general population. Although research into the genetic causes of ID has recently gained momentum, identification of pathogenic mutations that cause autosomal recessive ID (ARID) has lagged behind, predominantly due to non-availability of sizeable families. Here we present the results of exome sequencing in 121 large consanguineous Pakistani ID families. In 60 families, we identified homozygous or compound heterozygous DNA variants in a single gene, 30 affecting reported ID genes and 30 affecting novel candidate ID genes. Potential pathogenicity of these alleles was supported by co-segregation with the phenotype, low frequency in control populations and the application of stringent bioinformatics analyses. In another eight families segregation of multiple pathogenic variants was observed, affecting 19 genes that were either known or are novel candidates for ID. Transcriptome profiles of normal human brain tissues showed that the novel candidate ID genes formed a network significantly enriched for transcriptional co-expression (P<0.0001) in the frontal cortex during fetal development and in the temporal-parietal and sub-cortex during infancy through adulthood. In addition, proteins encoded by 12 novel ID genes directly interact with previously reported ID proteins in six known pathways essential for cognitive function (P<0.0001). These results suggest that disruptions of temporal parietal and sub-cortical neurogenesis during infancy are critical to the pathophysiology of ID. These findings further expand the existing repertoire of genes involved in ARID, and provide new insights into the molecular mechanisms and the transcriptome map of ID.
A human functional protein interaction network and its application to cancer data analysis
2010-01-01
Background One challenge facing biologists is to tease out useful information from massive data sets for further analysis. A pathway-based analysis may shed light by projecting candidate genes onto protein functional relationship networks. We are building such a pathway-based analysis system. Results We have constructed a protein functional interaction network by extending curated pathways with non-curated sources of information, including protein-protein interactions, gene coexpression, protein domain interaction, Gene Ontology (GO) annotations and text-mined protein interactions, which cover close to 50% of the human proteome. By applying this network to two glioblastoma multiforme (GBM) data sets and projecting cancer candidate genes onto the network, we found that the majority of GBM candidate genes form a cluster and are closer than expected by chance, and the majority of GBM samples have sequence-altered genes in two network modules, one mainly comprising genes whose products are localized in the cytoplasm and plasma membrane, and another comprising gene products in the nucleus. Both modules are highly enriched in known oncogenes, tumor suppressors and genes involved in signal transduction. Similar network patterns were also found in breast, colorectal and pancreatic cancers. Conclusions We have built a highly reliable functional interaction network upon expert-curated pathways and applied this network to the analysis of two genome-wide GBM and several other cancer data sets. The network patterns revealed from our results suggest common mechanisms in the cancer biology. Our system should provide a foundation for a network or pathway-based analysis platform for cancer and other diseases. PMID:20482850
EBF proteins participate in transcriptional regulation of Xenopus muscle development.
Green, Yangsook Song; Vetter, Monica L
2011-10-01
EBF proteins have diverse functions in the development of multiple lineages, including neurons, B cells and adipocytes. During Drosophila muscle development EBF proteins are expressed in muscle progenitors and are required for muscle cell differentiation, but there is no known function of EBF proteins in vertebrate muscle development. In this study, we examine the expression of ebf genes in Xenopus muscle tissue and show that EBF activity is necessary for aspects of Xenopus skeletal muscle development, including somite organization, migration of hypaxial muscle anlagen toward the ventral abdomen, and development of jaw muscle. From a microarray screen, we have identified multiple candidate targets of EBF activity with known roles in muscle development. The candidate targets we have verified are MYOD, MYF5, M-Cadherin and SEB-4. In vivo overexpression of the ebf2 and ebf3 genes leads to ectopic expression of these candidate targets, and knockdown of EBF activity causes downregulation of the endogenous expression of the candidate targets. Furthermore, we found that MYOD and MYF5 are likely to be direct targets. Finally we show that MYOD can upregulate the expression of ebf genes, indicating the presence of a positive feedback loop between EBF and MYOD that we find to be important for maintenance of MYOD expression in Xenopus. These results suggest that EBF activity is important for both stabilizing commitment and driving aspects of differentiation in Xenopus muscle cells. Copyright © 2010 Elsevier Inc. All rights reserved.
Shaw, Lindsay M.; Turner, Adrian S.; Herry, Laurence; Griffiths, Simon; Laurie, David A.
2013-01-01
Flowering time in wheat and barley is known to be modified by mutations in the Photoperiod-1 (Ppd-1) gene. Semi-dominant Ppd-1a mutations conferring an early flowering phenotype are well documented in wheat but gene sequencing has also identified candidate loss of function mutations for Ppd-A1 and Ppd-D1. By analogy to the recessive ppd-H1 mutation in barley, loss of function mutations in wheat are predicted to delay flowering under long day conditions. To test this experimentally, introgression lines were developed in the spring wheat variety ‘Paragon’. Plants lacking a Ppd-B1 gene were identified from a gamma irradiated ‘Paragon’ population. These were crossed with the other introgression lines to generate plants with candidate loss of function mutations on one, two or three genomes. Lines lacking Ppd-B1 flowered 10 to 15 days later than controls under long days. Candidate loss of function Ppd-A1 alleles delayed flowering by 1 to 5 days while candidate loss of function Ppd-D1 alleles did not affect flowering time. Loss of Ppd-A1 gave an enhanced effect, and loss of Ppd-D1 became detectable in lines where Ppd-B1 was absent, indicating effects may be buffered by functional Ppd-1 alleles on other genomes. Expression analysis revealed that delayed flowering was associated with reduced expression of the TaFT1 gene and increased expression of TaCO1. A survey of the GEDIFLUX wheat collection grown in the UK and North Western Europe between the 1940s and 1980s and the A.E. Watkins global collection of landraces from the 1920s and 1930s showed that the identified candidate loss of function mutations for Ppd-D1 were common and widespread, while the identified candidate Ppd-A1 loss of function mutation was rare in countries around the Mediterranean and in the Far East but was common in North Western Europe. This may reflect a possible benefit of the latter in northern locations. PMID:24244507
González-Plaza, Juan J; Ortiz-Martín, Inmaculada; Muñoz-Mérida, Antonio; García-López, Carmen; Sánchez-Sevilla, José F; Luque, Francisco; Trelles, Oswaldo; Bejarano, Eduardo R; De La Rosa, Raúl; Valpuesta, Victoriano; Beuzón, Carmen R
2016-01-01
Plant architecture is a critical trait in fruit crops that can significantly influence yield, pruning, planting density and harvesting. Little is known about how plant architecture is genetically determined in olive, were most of the existing varieties are traditional with an architecture poorly suited for modern growing and harvesting systems. In the present study, we have carried out microarray analysis of meristematic tissue to compare expression profiles of olive varieties displaying differences in architecture, as well as seedlings from their cross pooled on the basis of their sharing architecture-related phenotypes. The microarray used, previously developed by our group has already been applied to identify candidates genes involved in regulating juvenile to adult transition in the shoot apex of seedlings. Varieties with distinct architecture phenotypes and individuals from segregating progenies displaying opposite architecture features were used to link phenotype to expression. Here, we identify 2252 differentially expressed genes (DEGs) associated to differences in plant architecture. Microarray results were validated by quantitative RT-PCR carried out on genes with functional annotation likely related to plant architecture. Twelve of these genes were further analyzed in individual seedlings of the corresponding pool. We also examined Arabidopsis mutants in putative orthologs of these targeted candidate genes, finding altered architecture for most of them. This supports a functional conservation between species and potential biological relevance of the candidate genes identified. This study is the first to identify genes associated to plant architecture in olive, and the results obtained could be of great help in future programs aimed at selecting phenotypes adapted to modern cultivation practices in this species.
Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones
Imanishi, Tadashi; Itoh, Takeshi; Suzuki, Yutaka; O'Donovan, Claire; Fukuchi, Satoshi; Koyanagi, Kanako O; Barrero, Roberto A; Tamura, Takuro; Yamaguchi-Kabata, Yumi; Tanino, Motohiko; Yura, Kei; Miyazaki, Satoru; Ikeo, Kazuho; Homma, Keiichi; Kasprzyk, Arek; Nishikawa, Tetsuo; Hirakawa, Mika; Thierry-Mieg, Jean; Thierry-Mieg, Danielle; Ashurst, Jennifer; Jia, Libin; Nakao, Mitsuteru; Thomas, Michael A; Mulder, Nicola; Karavidopoulou, Youla; Jin, Lihua; Kim, Sangsoo; Yasuda, Tomohiro; Lenhard, Boris; Eveno, Eric; Suzuki, Yoshiyuki; Yamasaki, Chisato; Takeda, Jun-ichi; Gough, Craig; Hilton, Phillip; Fujii, Yasuyuki; Sakai, Hiroaki; Tanaka, Susumu; Amid, Clara; Bellgard, Matthew; Bonaldo, Maria de Fatima; Bono, Hidemasa; Bromberg, Susan K; Brookes, Anthony J; Bruford, Elspeth; Carninci, Piero; Chelala, Claude; Couillault, Christine; de Souza, Sandro J.; Debily, Marie-Anne; Devignes, Marie-Dominique; Dubchak, Inna; Endo, Toshinori; Estreicher, Anne; Eyras, Eduardo; Fukami-Kobayashi, Kaoru; R. Gopinath, Gopal; Graudens, Esther; Hahn, Yoonsoo; Han, Michael; Han, Ze-Guang; Hanada, Kousuke; Hanaoka, Hideki; Harada, Erimi; Hashimoto, Katsuyuki; Hinz, Ursula; Hirai, Momoki; Hishiki, Teruyoshi; Hopkinson, Ian; Imbeaud, Sandrine; Inoko, Hidetoshi; Kanapin, Alexander; Kaneko, Yayoi; Kasukawa, Takeya; Kelso, Janet; Kersey, Paul; Kikuno, Reiko; Kimura, Kouichi; Korn, Bernhard; Kuryshev, Vladimir; Makalowska, Izabela; Makino, Takashi; Mano, Shuhei; Mariage-Samson, Regine; Mashima, Jun; Matsuda, Hideo; Mewes, Hans-Werner; Minoshima, Shinsei; Nagai, Keiichi; Nagasaki, Hideki; Nagata, Naoki; Nigam, Rajni; Ogasawara, Osamu; Ohara, Osamu; Ohtsubo, Masafumi; Okada, Norihiro; Okido, Toshihisa; Oota, Satoshi; Ota, Motonori; Ota, Toshio; Otsuki, Tetsuji; Piatier-Tonneau, Dominique; Poustka, Annemarie; Ren, Shuang-Xi; Saitou, Naruya; Sakai, Katsunaga; Sakamoto, Shigetaka; Sakate, Ryuichi; Schupp, Ingo; Servant, Florence; Sherry, Stephen; Shiba, Rie; Shimizu, Nobuyoshi; Shimoyama, Mary; Simpson, Andrew J; Soares, Bento; Steward, Charles; Suwa, Makiko; Suzuki, Mami; Takahashi, Aiko; Tamiya, Gen; Tanaka, Hiroshi; Taylor, Todd; Terwilliger, Joseph D; Unneberg, Per; Veeramachaneni, Vamsi; Watanabe, Shinya; Wilming, Laurens; Yasuda, Norikazu; Yoo, Hyang-Sook; Stodolsky, Marvin; Makalowski, Wojciech; Go, Mitiko; Nakai, Kenta; Takagi, Toshihisa; Kanehisa, Minoru; Sakaki, Yoshiyuki; Quackenbush, John; Okazaki, Yasushi; Hayashizaki, Yoshihide; Hide, Winston; Chakraborty, Ranajit; Nishikawa, Ken; Sugawara, Hideaki; Tateno, Yoshio; Chen, Zhu; Oishi, Michio; Tonellato, Peter; Apweiler, Rolf; Okubo, Kousaku; Wagner, Lukas; Wiemann, Stefan; Strausberg, Robert L; Isogai, Takao; Auffray, Charles; Nomura, Nobuo; Sugano, Sumio
2004-01-01
The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA genes. In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology. PMID:15103394
Discovery of new candidate genes related to brain development using protein interaction information.
Chen, Lei; Chu, Chen; Kong, Xiangyin; Huang, Tao; Cai, Yu-Dong
2015-01-01
Human brain development is a dramatic process composed of a series of complex and fine-tuned spatiotemporal gene expressions. A good comprehension of this process can assist us in developing the potential of our brain. However, we have only limited knowledge about the genes and gene functions that are involved in this biological process. Therefore, a substantial demand remains to discover new brain development-related genes and identify their biological functions. In this study, we aimed to discover new brain-development related genes by building a computational method. We referred to a series of computational methods used to discover new disease-related genes and developed a similar method. In this method, the shortest path algorithm was executed on a weighted graph that was constructed using protein-protein interactions. New candidate genes fell on at least one of the shortest paths connecting two known genes that are related to brain development. A randomization test was then adopted to filter positive discoveries. Of the final identified genes, several have been reported to be associated with brain development, indicating the effectiveness of the method, whereas several of the others may have potential roles in brain development.
[Genetic aspects of the Stroop test].
Nánási, Tibor; Katonai, Enikő Rózsa; Sasvári-Székely, Mária; Székely, Anna
2012-12-01
Impairment of executive control functions in depression is well documented, and performance on the Stroop Test is one of the most widely used markers to measure the decline. This tool provides reliable quantitative phenotype data that can be used efficiently in candidate gene studies investigating inherited components of executive control. Aim of the present review is to summarize research on genetic factors of Stroop performance. Interestingly, only a few such candidate gene studies have been carried out to date. Twin studies show a 30-60% heritability estimate for the Stroop test, suggesting a significant genetic component. A single genome-wide association study has been carried out on Stroop performance, and it did not show any significant association with any of the tested polymorphisms after correction for multiple testing. Candidate gene studies to date pointed to the polymorphisms of several neurotransmitter systems (dopamine, serotonin, acetylcholine) and to the role of the APOE ε4 allele. Surprisingly, little is known about the genetic role of neurothrophic factors and survival factors. In conclusion, further studies are needed for clarifying the genetic background of Stroop performance, characterizing attentional functions.
TOM: a web-based integrated approach for identification of candidate disease genes.
Rossi, Simona; Masotti, Daniele; Nardini, Christine; Bonora, Elena; Romeo, Giovanni; Macii, Enrico; Benini, Luca; Volinia, Stefano
2006-07-01
The massive production of biological data by means of highly parallel devices like microarrays for gene expression has paved the way to new possible approaches in molecular genetics. Among them the possibility of inferring biological answers by querying large amounts of expression data. Based on this principle, we present here TOM, a web-based resource for the efficient extraction of candidate genes for hereditary diseases. The service requires the previous knowledge of at least another gene responsible for the disease and the linkage area, or else of two disease associated genetic intervals. The algorithm uses the information stored in public resources, including mapping, expression and functional databases. Given the queries, TOM will select and list one or more candidate genes. This approach allows the geneticist to bypass the costly and time consuming tracing of genetic markers through entire families and might improve the chance of identifying disease genes, particularly for rare diseases. We present here the tool and the results obtained on known benchmark and on hereditary predisposition to familial thyroid cancer. Our algorithm is available at http://www-micrel.deis.unibo.it/~tom/.
Stessman, Holly A. F.; Xiong, Bo; Coe, Bradley P.; Wang, Tianyun; Hoekzema, Kendra; Fenckova, Michaela; Kvarnung, Malin; Gerdts, Jennifer; Trinh, Sandy; Cosemans, Nele; Vives, Laura; Lin, Janice; Turner, Tychele N.; Santen, Gijs; Ruivenkamp, Claudia; Kriek, Marjolein; van Haeringen, Arie; Aten, Emmelien; Friend, Kathryn; Liebelt, Jan; Barnett, Christopher; Haan, Eric; Shaw, Marie; Gecz, Jozef; Anderlid, Britt-Marie; Nordgren, Ann; Lindstrand, Anna; Schwartz, Charles; Kooy, R. Frank; Vandeweyer, Geert; Helsmoortel, Celine; Romano, Corrado; Alberti, Antonino; Vinci, Mirella; Avola, Emanuela; Giusto, Stefania; Courchesne, Eric; Pramparo, Tiziano; Pierce, Karen; Nalabolu, Srinivasa; Amaral, David; Scheffer, Ingrid E.; Delatycki, Martin B.; Lockhart, Paul J.; Hormozdiari, Fereydoun; Harich, Benjamin; Castells-Nobau, Anna; Xia, Kun; Peeters, Hilde; Nordenskjöld, Magnus; Schenck, Annette; Bernier, Raphael A.; Eichler, Evan E.
2017-01-01
Gene-disruptive mutations contribute to the biology of neurodevelopmental disorders (NDDs), but most pathogenic genes are not known. We sequenced 208 candidate genes from >11,730 patients and >2,867 controls. We report 91 genes with an excess of de novo mutations or private disruptive mutations in 5.7% of patients, including 38 novel NDD genes. Drosophila functional assays of a subset bolster their involvement in NDDs. We identify 25 genes that show a bias for autism versus intellectual disability and highlight a network associated with high-functioning autism (FSIQ>100). Clinical follow-up for NAA15, KMT5B, and ASH1L reveals novel syndromic and non-syndromic forms of disease. PMID:28191889
A candidate multimodal functional genetic network for thermal adaptation
Pathak, Rachana; Prajapati, Indira; Bankston, Shannon; Thompson, Aprylle; Usher, Jaytriece; Isokpehi, Raphael D.
2014-01-01
Vertebrate ectotherms such as reptiles provide ideal organisms for the study of adaptation to environmental thermal change. Comparative genomic and exomic studies can recover markers that diverge between warm and cold adapted lineages, but the genes that are functionally related to thermal adaptation may be difficult to identify. We here used a bioinformatics genome-mining approach to predict and identify functions for suitable candidate markers for thermal adaptation in the chicken. We first established a framework of candidate functions for such markers, and then compiled the literature on genes known to adapt to the thermal environment in different lineages of vertebrates. We then identified them in the genomes of human, chicken, and the lizard Anolis carolinensis, and established a functional genetic interaction network in the chicken. Surprisingly, markers initially identified from diverse lineages of vertebrates such as human and fish were all in close functional relationship with each other and more associated than expected by chance. This indicates that the general genetic functional network for thermoregulation and/or thermal adaptation to the environment might be regulated via similar evolutionarily conserved pathways in different vertebrate lineages. We were able to identify seven functions that were statistically overrepresented in this network, corresponding to four of our originally predicted functions plus three unpredicted functions. We describe this network as multimodal: central regulator genes with the function of relaying thermal signal (1), affect genes with different cellular functions, namely (2) lipoprotein metabolism, (3) membrane channels, (4) stress response, (5) response to oxidative stress, (6) muscle contraction and relaxation, and (7) vasodilation, vasoconstriction and regulation of blood pressure. This network constitutes a novel resource for the study of thermal adaptation in the closely related nonavian reptiles and other vertebrate ectotherms. PMID:25289178
2010-01-01
Background Discovering novel disease genes is still challenging for diseases for which no prior knowledge - such as known disease genes or disease-related pathways - is available. Performing genetic studies frequently results in large lists of candidate genes of which only few can be followed up for further investigation. We have recently developed a computational method for constitutional genetic disorders that identifies the most promising candidate genes by replacing prior knowledge by experimental data of differential gene expression between affected and healthy individuals. To improve the performance of our prioritization strategy, we have extended our previous work by applying different machine learning approaches that identify promising candidate genes by determining whether a gene is surrounded by highly differentially expressed genes in a functional association or protein-protein interaction network. Results We have proposed three strategies scoring disease candidate genes relying on network-based machine learning approaches, such as kernel ridge regression, heat kernel, and Arnoldi kernel approximation. For comparison purposes, a local measure based on the expression of the direct neighbors is also computed. We have benchmarked these strategies on 40 publicly available knockout experiments in mice, and performance was assessed against results obtained using a standard procedure in genetics that ranks candidate genes based solely on their differential expression levels (Simple Expression Ranking). Our results showed that our four strategies could outperform this standard procedure and that the best results were obtained using the Heat Kernel Diffusion Ranking leading to an average ranking position of 8 out of 100 genes, an AUC value of 92.3% and an error reduction of 52.8% relative to the standard procedure approach which ranked the knockout gene on average at position 17 with an AUC value of 83.7%. Conclusion In this study we could identify promising candidate genes using network based machine learning approaches even if no knowledge is available about the disease or phenotype. PMID:20840752
Viveka Thangaraj, Soundara; Periasamy, Jayaprakash; Bhaskar Rao, Divya; Barnabas, Georgina D.; Raghavan, Swetha; Ganesan, Kumaresan
2013-01-01
Genomic aberrations are common in cancers and the long arm of chromosome 1 is known for its frequent amplifications in breast cancer. However, the key candidate genes of 1q, and their contribution in breast cancer pathogenesis remain unexplored. We have analyzed the gene expression profiles of 1635 breast tumor samples using meta-analysis based approach and identified clinically significant candidates from chromosome 1q. Seven candidate genes including exonuclease 1 (EXO1) are consistently over expressed in breast tumors, specifically in high grade and aggressive breast tumors with poor clinical outcome. We derived a EXO1 co-expression module from the mRNA profiles of breast tumors which comprises 1q candidate genes and their co-expressed genes. By integrative functional genomics investigation, we identified the involvement of EGFR, RAS, PI3K / AKT, MYC, E2F signaling in the regulation of these selected 1q genes in breast tumors and breast cancer cell lines. Expression of EXO1 module was found as indicative of elevated cell proliferation, genomic instability, activated RAS/AKT/MYC/E2F1 signaling pathways and loss of p53 activity in breast tumors. mRNA–drug connectivity analysis indicates inhibition of RAS/PI3K as a possible targeted therapeutic approach for the patients with activated EXO1 module in breast tumors. Thus, we identified seven 1q candidate genes strongly associated with the poor survival of breast cancer patients and identified the possibility of targeting them with EGFR/RAS/PI3K inhibitors. PMID:24147022
Mutational Landscape of Candidate Genes in Familial Prostate Cancer
Johnson, Anna M.; Zuhlke, Kimberly A.; Plotts, Chris; McDonnell, Shannon K.; Middha, Sumit; Riska, Shaun M.; Thibodeau, Stephen N.; Douglas, Julie A.; Cooney, Kathleen A.
2014-01-01
Background Family history is a major risk factor for prostate cancer (PCa), suggesting a genetic component to the disease. However, traditional linkage and association studies have failed to fully elucidate the underlying genetic basis of familial PCa. Methods Here we use a candidate gene approach to identify potential PCa susceptibility variants in whole exome sequencing data from familial PCa cases. Six hundred ninety-seven candidate genes were identified based on function, location near a known chromosome 17 linkage signal, and/or previous association with prostate or other cancers. Single nucleotide variants (SNVs) in these candidate genes were identified in whole exome sequence data from 33 PCa cases from 11 multiplex PCa families (3 cases/family). Results Overall, 4856 candidate gene SNVs were identified, including 1052 missense and 10 nonsense variants. Twenty missense variants were shared by all 3 family members in each family in which they were observed. Additionally, 15 missense variants were shared by 2 of 3 family members and predicted to be deleterious by 5 different algorithms. Four missense variants, BLM Gln123Arg, PARP2 Arg283Gln, LRCC46 Ala295Thr and KIF2B Pro91Leu, and 1 nonsense variant, CYP3A43 Arg441Ter, showed complete co-segregation with PCa status. Twelve additional variants displayed partial co-segregation with PCa. Conclusions Forty-three nonsense and shared, missense variants were identified in our candidate genes. Further research is needed to determine the contribution of these variants to PCa susceptibility. PMID:25111073
Ding, Fangrui; Tan, Aidi; Ju, Wenjun; Li, Xuejuan; Li, Shao; Ding, Jie
2016-01-01
Maintenance of the physiological morphologies of different types of cells and tissues is essential for the normal functioning of each system in the human body. Dynamic variations in cell and tissue morphologies depend on accurate adjustments of the cytoskeletal system. The cytoskeletal system in the glomerulus plays a key role in the normal process of kidney filtration. To enhance the understanding of the possible roles of the cytoskeleton in glomerular diseases, we constructed the Glomerular Cytoskeleton Network (GCNet), which shows the protein-protein interaction network in the glomerulus, and identified several possible key cytoskeletal components involved in glomerular diseases. In this study, genes/proteins annotated to the cytoskeleton were detected by Gene Ontology analysis, and glomerulus-enriched genes were selected from nine available glomerular expression datasets. Then, the GCNet was generated by combining these two sets of information. To predict the possible key cytoskeleton components in glomerular diseases, we then examined the common regulation of the genes in GCNet in the context of five glomerular diseases based on their transcriptomic data. As a result, twenty-one cytoskeleton components as potential candidate were highlighted for consistently down- or up-regulating in all five glomerular diseases. And then, these candidates were examined in relation to existing known glomerular diseases and genes to determine their possible functions and interactions. In addition, the mRNA levels of these candidates were also validated in a puromycin aminonucleoside(PAN) induced rat nephropathy model and were also matched with existing Diabetic Nephropathy (DN) transcriptomic data. As a result, there are 15 of 21 candidates in PAN induced nephropathy model were consistent with our predication and also 12 of 21 candidates were matched with differentially expressed genes in the DN transcriptomic data. By providing a novel interaction network and prediction, GCNet contributes to improving the understanding of normal glomerular function and will be useful for detecting target cytoskeleton molecules of interest that may be involved in glomerular diseases in future studies.
Ju, Wenjun; Li, Xuejuan; Li, Shao; Ding, Jie
2016-01-01
Maintenance of the physiological morphologies of different types of cells and tissues is essential for the normal functioning of each system in the human body. Dynamic variations in cell and tissue morphologies depend on accurate adjustments of the cytoskeletal system. The cytoskeletal system in the glomerulus plays a key role in the normal process of kidney filtration. To enhance the understanding of the possible roles of the cytoskeleton in glomerular diseases, we constructed the Glomerular Cytoskeleton Network (GCNet), which shows the protein-protein interaction network in the glomerulus, and identified several possible key cytoskeletal components involved in glomerular diseases. In this study, genes/proteins annotated to the cytoskeleton were detected by Gene Ontology analysis, and glomerulus-enriched genes were selected from nine available glomerular expression datasets. Then, the GCNet was generated by combining these two sets of information. To predict the possible key cytoskeleton components in glomerular diseases, we then examined the common regulation of the genes in GCNet in the context of five glomerular diseases based on their transcriptomic data. As a result, twenty-one cytoskeleton components as potential candidate were highlighted for consistently down- or up-regulating in all five glomerular diseases. And then, these candidates were examined in relation to existing known glomerular diseases and genes to determine their possible functions and interactions. In addition, the mRNA levels of these candidates were also validated in a puromycin aminonucleoside(PAN) induced rat nephropathy model and were also matched with existing Diabetic Nephropathy (DN) transcriptomic data. As a result, there are 15 of 21 candidates in PAN induced nephropathy model were consistent with our predication and also 12 of 21 candidates were matched with differentially expressed genes in the DN transcriptomic data. By providing a novel interaction network and prediction, GCNet contributes to improving the understanding of normal glomerular function and will be useful for detecting target cytoskeleton molecules of interest that may be involved in glomerular diseases in future studies. PMID:27227331
Badoni, Saurabh; Das, Sweta; Sayal, Yogesh K.; Gopalakrishnan, S.; Singh, Ashok K.; Rao, Atmakuri R.; Agarwal, Pinky; Parida, Swarup K.; Tyagi, Akhilesh K.
2016-01-01
We developed genome-wide 84634 ISM (intron-spanning marker) and 16510 InDel-fragment length polymorphism-based ILP (intron-length polymorphism) markers from genes physically mapped on 12 rice chromosomes. These genic markers revealed much higher amplification-efficiency (80%) and polymorphic-potential (66%) among rice accessions even by a cost-effective agarose gel-based assay. A wider level of functional molecular diversity (17–79%) and well-defined precise admixed genetic structure was assayed by 3052 genome-wide markers in a structured population of indica, japonica, aromatic and wild rice. Six major grain weight QTLs (11.9–21.6% phenotypic variation explained) were mapped on five rice chromosomes of a high-density (inter-marker distance: 0.98 cM) genetic linkage map (IR 64 x Sonasal) anchored with 2785 known/candidate gene-derived ISM and ILP markers. The designing of multiple ISM and ILP markers (2 to 4 markers/gene) in an individual gene will broaden the user-preference to select suitable primer combination for efficient assaying of functional allelic variation/diversity and realistic estimation of differential gene expression profiles among rice accessions. The genomic information generated in our study is made publicly accessible through a user-friendly web-resource, “Oryza ISM-ILP marker” database. The known/candidate gene-derived ISM and ILP markers can be enormously deployed to identify functionally relevant trait-associated molecular tags by optimal-resource expenses, leading towards genomics-assisted crop improvement in rice. PMID:27032371
Li, Zhao-Qun; Luo, Zong-Xiu; Cai, Xiao-Ming; Bian, Lei; Xin, Zhao-Jun; Liu, Yan; Chu, Bo; Chen, Zong-Mao
2017-01-01
Tea grey geometrid ( Ectropis grisescens ), a devastating chewing pest in tea plantations throughout China, produces Type-II pheromone components. Little is known about the genes encoding proteins involved in the perception of Type-II sex pheromone components. To investigate the olfaction genes involved in E . grisescens sex pheromones and plant volatiles perception, we sequenced female and male antennae transcriptomes of E . grisescens . After assembly and annotation, we identified 153 candidate chemoreception genes in E. grisescens , including 40 odorant-binding proteins (OBPs), 30 chemosensory proteins (CSPs), 59 odorant receptors (ORs), and 24 ionotropic receptors (IRs). The results of phylogenetic, qPCR, and mRNA abundance analyses suggested that three candidate pheromone-binding proteins (EgriOBP2, 3, and 25), two candidate general odorant-binding proteins (EgriOBP1 and 29), six pheromone receptors (EgriOR24, 25, 28, 31, 37, and 44), and EgriCSP8 may be involved in the detection of Type-II sex pheromone components. Functional investigation by heterologous expression in Xenopus oocytes revealed that EgriOR31 was robustly tuned to the E . grisescens sex pheromone component (Z,Z,Z)-3,6,9-octadecatriene and weakly to the other sex pheromone component (Z,Z)-3,9-6,7-epoxyoctadecadiene. Our results represent a systematic functional analysis of the molecular mechanism of olfaction perception in E . grisescens with an emphasis on gene encoding proteins involved in perception of Type-II sex pheromones, and provide information that will be relevant to other Lepidoptera species.
Li, Zhao-Qun; Luo, Zong-Xiu; Cai, Xiao-Ming; Bian, Lei; Xin, Zhao-Jun; Liu, Yan; Chu, Bo; Chen, Zong-Mao
2017-01-01
Tea grey geometrid (Ectropis grisescens), a devastating chewing pest in tea plantations throughout China, produces Type-II pheromone components. Little is known about the genes encoding proteins involved in the perception of Type-II sex pheromone components. To investigate the olfaction genes involved in E. grisescens sex pheromones and plant volatiles perception, we sequenced female and male antennae transcriptomes of E. grisescens. After assembly and annotation, we identified 153 candidate chemoreception genes in E. grisescens, including 40 odorant-binding proteins (OBPs), 30 chemosensory proteins (CSPs), 59 odorant receptors (ORs), and 24 ionotropic receptors (IRs). The results of phylogenetic, qPCR, and mRNA abundance analyses suggested that three candidate pheromone-binding proteins (EgriOBP2, 3, and 25), two candidate general odorant-binding proteins (EgriOBP1 and 29), six pheromone receptors (EgriOR24, 25, 28, 31, 37, and 44), and EgriCSP8 may be involved in the detection of Type-II sex pheromone components. Functional investigation by heterologous expression in Xenopus oocytes revealed that EgriOR31 was robustly tuned to the E. grisescens sex pheromone component (Z,Z,Z)-3,6,9-octadecatriene and weakly to the other sex pheromone component (Z,Z)-3,9-6,7-epoxyoctadecadiene. Our results represent a systematic functional analysis of the molecular mechanism of olfaction perception in E. grisescens with an emphasis on gene encoding proteins involved in perception of Type-II sex pheromones, and provide information that will be relevant to other Lepidoptera species. PMID:29209233
Schultink, Alex; Cheng, Kun; Park, Yong Bum; Cosgrove, Daniel J.; Pauly, Markus
2013-01-01
Xyloglucan (XyG) is the dominant hemicellulose present in the primary cell walls of dicotyledonous plants. Unlike Arabidopsis (Arabidopsis thaliana) XyG, which contains galactosyl and fucosyl substituents, tomato (Solanum lycopersicum) XyG contains arabinofuranosyl residues. To investigate the biological function of these differing substituents, we used a functional complementation approach. Candidate glycosyltransferases were identified from tomato by using comparative genomics with known XyG galactosyltransferase genes from Arabidopsis. These candidate genes were expressed in an Arabidopsis mutant lacking XyG galactosylation, and two of them resulted in the production of arabinosylated XyG, a structure not previously found in this plant species. These genes may therefore encode XyG arabinofuranosyltransferases. Moreover, the addition of arabinofuranosyl residues to the XyG of this Arabidopsis mutant rescued a growth and cell wall biomechanics phenotype, demonstrating that the function of XyG in plant growth, development, and mechanics has considerable flexibility in terms of the specific residues in the side chains. These experiments also highlight the potential of reengineering the sugar substituents on plant wall polysaccharides without compromising growth or viability. PMID:23893172
Identification of genes from the Treacher Collins candidate region
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dixon, M.; Dixon, J.; Edwards, S.
Treacher Collins syndrome (TCOF1) is an autosomal dominant disorder of craniofacial development. The TCOF1 locus has previously been mapped to chromosome 5q32-33. The candidate gene region has been defined as being between two flanking markers, ribosomal protein S14 (RPS14) and Annexin 6 (ANX6), by analyzing recombination events in affected individuals. It is estimated that the distance between these flanking markers is 500 kb by three separate analysis methods: (1) radiation hybrid mapping; (2) genetic linkage; and (3) YAC contig analysis. A cosmid contig which spans the candidate gene region for TCOF1 has been constructed by screening the Los Alamos Nationalmore » Laboratory flow-sorted chromosome 5 cosmid library. Cosmids were obtained by using a combination of probes generated from YAC end clones, Alu-PCR fragments from YACs, and asymmetric PCR fragments from both T7 and T3 cosmid ends. Exon amplifications, the selection of genomic coding sequences based upon the presence of functional splice acceptor and donor sites, was used to identify potential exon sequences. Sequences found to be conserved between species were then used to screen cDNA libraries in order to identify candidate genes. To date, four different cDNAs have been isolated from this region and are being analyzed as potential candidate genes for TCOF1. These include the genes encoding plasma glutathione peroxidase (GPX3), heparin sulfate sulfotransferase (HSST), a gene with homology to the ETS family of proteins and one which shows no homology to any known genes. Work is also in progress to identify and characterize additional cDNAs from the candidate gene region.« less
González-Plaza, Juan J.; Ortiz-Martín, Inmaculada; Muñoz-Mérida, Antonio; García-López, Carmen; Sánchez-Sevilla, José F.; Luque, Francisco; Trelles, Oswaldo; Bejarano, Eduardo R.; De La Rosa, Raúl; Valpuesta, Victoriano; Beuzón, Carmen R.
2016-01-01
Plant architecture is a critical trait in fruit crops that can significantly influence yield, pruning, planting density and harvesting. Little is known about how plant architecture is genetically determined in olive, were most of the existing varieties are traditional with an architecture poorly suited for modern growing and harvesting systems. In the present study, we have carried out microarray analysis of meristematic tissue to compare expression profiles of olive varieties displaying differences in architecture, as well as seedlings from their cross pooled on the basis of their sharing architecture-related phenotypes. The microarray used, previously developed by our group has already been applied to identify candidates genes involved in regulating juvenile to adult transition in the shoot apex of seedlings. Varieties with distinct architecture phenotypes and individuals from segregating progenies displaying opposite architecture features were used to link phenotype to expression. Here, we identify 2252 differentially expressed genes (DEGs) associated to differences in plant architecture. Microarray results were validated by quantitative RT-PCR carried out on genes with functional annotation likely related to plant architecture. Twelve of these genes were further analyzed in individual seedlings of the corresponding pool. We also examined Arabidopsis mutants in putative orthologs of these targeted candidate genes, finding altered architecture for most of them. This supports a functional conservation between species and potential biological relevance of the candidate genes identified. This study is the first to identify genes associated to plant architecture in olive, and the results obtained could be of great help in future programs aimed at selecting phenotypes adapted to modern cultivation practices in this species. PMID:26973682
Proteomic analysis of isolated chlamydomonas centrioles reveals orthologs of ciliary-disease genes.
Keller, Lani C; Romijn, Edwin P; Zamora, Ivan; Yates, John R; Marshall, Wallace F
2005-06-21
The centriole is one of the most enigmatic organelles in the cell. Centrioles are cylindrical, microtubule-based barrels found in the core of the centrosome. Centrioles also act as basal bodies during interphase to nucleate the assembly of cilia and flagella. There are currently only a handful of known centriole proteins. We used mass-spectrometry-based MudPIT (multidimensional protein identification technology) to identify the protein composition of basal bodies (centrioles) isolated from the green alga Chlamydomonas reinhardtii. This analysis detected the majority of known centriole proteins, including centrin, epsilon tubulin, and the cartwheel protein BLD10p. By combining proteomic data with information about gene expression and comparative genomics, we identified 45 cross-validated centriole candidate proteins in two classes. Members of the first class of proteins (BUG1-BUG27) are encoded by genes whose expression correlates with flagellar assembly and which therefore may play a role in ciliogenesis-related functions of basal bodies. Members of the second class (POC1-POC18) are implicated by comparative-genomics and -proteomics studies to be conserved components of the centriole. We confirmed centriolar localization for the human homologs of four candidate proteins. Three of the cross-validated centriole candidate proteins are encoded by orthologs of genes (OFD1, NPHP-4, and PACRG) implicated in mammalian ciliary function and disease, suggesting that oral-facial-digital syndrome and nephronophthisis may involve a dysfunction of centrioles and/or basal bodies. By analyzing isolated Chlamydomonas basal bodies, we have been able to obtain the first reported proteomic analysis of the centriole.
Machine Learning Helps Identify CHRONO as a Circadian Clock Component
Venkataraman, Anand; Ramanathan, Chidambaram; Kavakli, Ibrahim H.; Hughes, Michael E.; Baggs, Julie E.; Growe, Jacqueline; Liu, Andrew C.; Kim, Junhyong; Hogenesch, John B.
2014-01-01
Over the last decades, researchers have characterized a set of “clock genes” that drive daily rhythms in physiology and behavior. This arduous work has yielded results with far-reaching consequences in metabolic, psychiatric, and neoplastic disorders. Recent attempts to expand our understanding of circadian regulation have moved beyond the mutagenesis screens that identified the first clock components, employing higher throughput genomic and proteomic techniques. In order to further accelerate clock gene discovery, we utilized a computer-assisted approach to identify and prioritize candidate clock components. We used a simple form of probabilistic machine learning to integrate biologically relevant, genome-scale data and ranked genes on their similarity to known clock components. We then used a secondary experimental screen to characterize the top candidates. We found that several physically interact with known clock components in a mammalian two-hybrid screen and modulate in vitro cellular rhythms in an immortalized mouse fibroblast line (NIH 3T3). One candidate, Gene Model 129, interacts with BMAL1 and functionally represses the key driver of molecular rhythms, the BMAL1/CLOCK transcriptional complex. Given these results, we have renamed the gene CHRONO (computationally highlighted repressor of the network oscillator). Bi-molecular fluorescence complementation and co-immunoprecipitation demonstrate that CHRONO represses by abrogating the binding of BMAL1 to its transcriptional co-activator CBP. Most importantly, CHRONO knockout mice display a prolonged free-running circadian period similar to, or more drastic than, six other clock components. We conclude that CHRONO is a functional clock component providing a new layer of control on circadian molecular dynamics. PMID:24737000
Candidate Chemosensory Genes in the Stemborer Sesamia nonagrioides
Glaser, Nicolas; Gallot, Aurore; Legeai, Fabrice; Montagné, Nicolas; Poivet, Erwan; Harry, Myriam; Calatayud, Paul-André; Jacquin-Joly, Emmanuelle
2013-01-01
The stemborer Sesamia nonagrioides is an important pest of maize in the Mediterranean Basin. Like other moths, this noctuid uses its chemosensory system to efficiently interact with its environment. However, very little is known on the molecular mechanisms that underlie chemosensation in this species. Here, we used next-generation sequencing (454 and Illumina) on different tissues from adult and larvae, including chemosensory organs and female ovipositors, to describe the chemosensory transcriptome of S. nonagrioides and identify key molecular components of the pheromone production and detection systems. We identified a total of 68 candidate chemosensory genes in this species, including 31 candidate binding-proteins and 23 chemosensory receptors. In particular, we retrieved the three co-receptors Orco, IR25a and IR8a necessary for chemosensory receptor functioning. Focusing on the pheromonal communication system, we identified a new pheromone-binding protein in this species, four candidate pheromone receptors and 12 carboxylesterases as candidate acetate degrading enzymes. In addition, we identified enzymes putatively involved in S. nonagrioides pheromone biosynthesis, including a ∆11-desaturase and different acetyltransferases and reductases. RNAseq analyses and RT-PCR were combined to profile gene expression in different tissues. This study constitutes the first large scale description of chemosensory genes in S. nonagrioides. PMID:23781142
SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate
Roffler, Gretchen H.; Amish, Stephen J.; Smith, Seth; Cosart, Ted F.; Kardos, Marty; Schwartz, Michael K.; Luikart, Gordon
2016-01-01
Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding and nearby 5′ and 3′ untranslated regions of chosen candidate genes. Targeted sequences were taken from bighorn sheep (Ovis canadensis) exon capture data and directly from the domestic sheep genome (Ovis aries v. 3; oviAri3). The bighorn sheep sequences used in the Dall's sheep (Ovis dalli dalli) exon capture aligned to 2350 genes on the oviAri3 genome with an average of 2 exons each. We developed a microfluidic qPCR-based SNP chip to genotype 476 Dall's sheep from locations across their range and test for patterns of selection. Using multiple corroborating approaches (lositan and bayescan), we detected 28 SNP loci potentially under selection. We additionally identified candidate loci significantly associated with latitude, longitude, precipitation and temperature, suggesting local environmental adaptation. The three methods demonstrated consistent support for natural selection on nine genes with immune and disease-regulating functions (e.g. Ovar-DRA, APC, BATF2, MAGEB18), cell regulation signalling pathways (e.g. KRIT1, PI3K, ORRC3), and respiratory health (CYSLTR1). Characterizing adaptive allele distributions from novel genetic techniques will facilitate investigation of the influence of environmental variation on local adaptation of a northern alpine ungulate throughout its range. This research demonstrated the utility of exon capture for gene-targeted SNP discovery and subsequent SNP chip genotyping using low-quality samples in a nonmodel species.
Luo, Xiongjian; Huang, Liang; Han, Leng; Luo, Zhenwu; Hu, Fang; Tieu, Roger; Gan, Lin
2014-01-01
Schizophrenia is a common mental disorder with high heritability and strong genetic heterogeneity. Common disease-common variants hypothesis predicts that schizophrenia is attributable in part to common genetic variants. However, recent studies have clearly demonstrated that copy number variations (CNVs) also play pivotal roles in schizophrenia susceptibility and explain a proportion of missing heritability. Though numerous CNVs have been identified, many of the regions affected by CNVs show poor overlapping among different studies, and it is not known whether the genes disrupted by CNVs contribute to the risk of schizophrenia. By using cumulative scoring, we systematically prioritized the genes affected by CNVs in schizophrenia. We identified 8 top genes that are frequently disrupted by CNVs, including NRXN1, CHRNA7, BCL9, CYFIP1, GJA8, NDE1, SNAP29, and GJA5. Integration of genes affected by CNVs with known schizophrenia susceptibility genes (from previous genetic linkage and association studies) reveals that many genes disrupted by CNVs are also associated with schizophrenia. Further protein-protein interaction (PPI) analysis indicates that protein products of genes affected by CNVs frequently interact with known schizophrenia-associated proteins. Finally, systematic integration of CNVs prioritization data with genetic association and PPI data identifies key schizophrenia candidate genes. Our results provide a global overview of genes impacted by CNVs in schizophrenia and reveal a densely interconnected molecular network of de novo CNVs in schizophrenia. Though the prioritized top genes represent promising schizophrenia risk genes, further work with different prioritization methods and independent samples is needed to confirm these findings. Nevertheless, the identified key candidate genes may have important roles in the pathogenesis of schizophrenia, and further functional characterization of these genes may provide pivotal targets for future therapeutics and diagnostics. PMID:24664977
Database of cattle candidate genes and genetic markers for milk production and mastitis
Ogorevc, J; Kunej, T; Razpet, A; Dovc, P
2009-01-01
A cattle database of candidate genes and genetic markers for milk production and mastitis has been developed to provide an integrated research tool incorporating different types of information supporting a genomic approach to study lactation, udder development and health. The database contains 943 genes and genetic markers involved in mammary gland development and function, representing candidates for further functional studies. The candidate loci were drawn on a genetic map to reveal positional overlaps. For identification of candidate loci, data from seven different research approaches were exploited: (i) gene knockouts or transgenes in mice that result in specific phenotypes associated with mammary gland (143 loci); (ii) cattle QTL for milk production (344) and mastitis related traits (71); (iii) loci with sequence variations that show specific allele-phenotype interactions associated with milk production (24) or mastitis (10) in cattle; (iv) genes with expression profiles associated with milk production (207) or mastitis (107) in cattle or mouse; (v) cattle milk protein genes that exist in different genetic variants (9); (vi) miRNAs expressed in bovine mammary gland (32) and (vii) epigenetically regulated cattle genes associated with mammary gland function (1). Fourty-four genes found by multiple independent analyses were suggested as the most promising candidates and were further in silico analysed for expression levels in lactating mammary gland, genetic variability and top biological functions in functional networks. A miRNA target search for mammary gland expressed miRNAs identified 359 putative binding sites in 3′UTRs of candidate genes. PMID:19508288
Chapman, Mark A; Pashley, Catherine H; Wenzler, Jessica; Hvala, John; Tang, Shunxue; Knapp, Steven J; Burke, John M
2008-11-01
Genomic scans for selection are a useful tool for identifying genes underlying phenotypic transitions. In this article, we describe the results of a genome scan designed to identify candidates for genes targeted by selection during the evolution of cultivated sunflower. This work involved screening 492 loci derived from ESTs on a large panel of wild, primitive (i.e., landrace), and improved sunflower (Helianthus annuus) lines. This sampling strategy allowed us to identify candidates for selectively important genes and investigate the likely timing of selection. Thirty-six genes showed evidence of selection during either domestication or improvement based on multiple criteria, and a sequence-based test of selection on a subset of these loci confirmed this result. In view of what is known about the structure of linkage disequilibrium across the sunflower genome, these genes are themselves likely to have been targeted by selection, rather than being merely linked to the actual targets. While the selection candidates showed a broad range of putative functions, they were enriched for genes involved in amino acid synthesis and protein catabolism. Given that a similar pattern has been detected in maize (Zea mays), this finding suggests that selection on amino acid composition may be a general feature of the evolution of crop plants. In terms of genomic locations, the selection candidates were significantly clustered near quantitative trait loci (QTL) that contribute to phenotypic differences between wild and cultivated sunflower, and specific instances of QTL colocalization provide some clues as to the roles that these genes may have played during sunflower evolution.
Zang, Wen; Eckstein, Peter E; Colin, Mark; Voth, Doug; Himmelbach, Axel; Beier, Sebastian; Stein, Nils; Scoles, Graham J; Beattie, Aaron D
2015-07-01
The candidate gene for the barley Un8 true loose smut resistance gene encodes a deduced protein containing two tandem protein kinase domains. In North America, durable resistance against all known isolates of barley true loose smut, caused by the basidiomycete pathogen Ustilago nuda (Jens.) Rostr. (U. nuda), is under the control of the Un8 resistance gene. Previous genetic studies mapped Un8 to the long arm of chromosome 5 (1HL). Here, a population of 4625 lines segregating for Un8 was used to delimit the Un8 gene to a 0.108 cM interval on chromosome arm 1HL, and assign it to fingerprinted contig 546 of the barley physical map. The minimal tilling path was identified for the Un8 locus using two flanking markers and consisted of two overlapping bacterial artificial chromosomes. One gene located close to a marker co-segregating with Un8 showed high sequence identity to a disease resistance gene containing two kinase domains. Sequence of the candidate gene from the parents of the segregating population, and in an additional 19 barley lines representing a broader spectrum of diversity, showed there was no intron in alleles present in either resistant or susceptible lines, and fifteen amino acid variations unique to the deduced protein sequence in resistant lines differentiated it from the deduced protein sequences in susceptible lines. Some of these variations were present within putative functional domains which may cause a loss of function in the deduced protein sequences within susceptible lines.
Genomic Heterogeneity of Osteosarcoma - Shift from Single Candidates to Functional Modules
Maugg, Doris; Eckstein, Gertrud; Baumhoer, Daniel; Nathrath, Michaela; Korsching, Eberhard
2015-01-01
Osteosarcoma (OS), a bone tumor, exhibit a complex karyotype. On the genomic level a highly variable degree of alterations in nearly all chromosomal regions and between individual tumors is observable. This hampers the identification of common drivers in OS biology. To identify the common molecular mechanisms involved in the maintenance of OS, we follow the hypothesis that all the copy number-associated differences between the patients are intercepted on the level of the functional modules. The implementation is based on a network approach utilizing copy number associated genes in OS, paired expression data and protein interaction data. The resulting functional modules of tightly connected genes were interpreted regarding their biological functions in OS and their potential prognostic significance. We identified an osteosarcoma network assembling well-known and lesser-known candidates. The derived network shows a significant connectivity and modularity suggesting that the genes affected by the heterogeneous genetic alterations share the same biological context. The network modules participate in several critical aspects of cancer biology like DNA damage response, cell growth, and cell motility which is in line with the hypothesis of specifically deregulated but functional modules in cancer. Further, we could deduce genes with possible prognostic significance in OS for further investigation (e.g. EZR, CDKN2A, MAP3K5). Several of those module genes were located on chromosome 6q. The given systems biological approach provides evidence that heterogeneity on the genomic and expression level is ordered by the biological system on the level of the functional modules. Different genomic aberrations are pointing to the same cellular network vicinity to form vital, but already neoplastically altered, functional modules maintaining OS. This observation, exemplarily now shown for OS, has been under discussion already for a longer time, but often in a hypothetical manner, and can here be exemplified for OS. PMID:25848766
Detection of gene communities in multi-networks reveals cancer drivers
NASA Astrophysics Data System (ADS)
Cantini, Laura; Medico, Enzo; Fortunato, Santo; Caselle, Michele
2015-12-01
We propose a new multi-network-based strategy to integrate different layers of genomic information and use them in a coordinate way to identify driving cancer genes. The multi-networks that we consider combine transcription factor co-targeting, microRNA co-targeting, protein-protein interaction and gene co-expression networks. The rationale behind this choice is that gene co-expression and protein-protein interactions require a tight coregulation of the partners and that such a fine tuned regulation can be obtained only combining both the transcriptional and post-transcriptional layers of regulation. To extract the relevant biological information from the multi-network we studied its partition into communities. To this end we applied a consensus clustering algorithm based on state of art community detection methods. Even if our procedure is valid in principle for any pathology in this work we concentrate on gastric, lung, pancreas and colorectal cancer and identified from the enrichment analysis of the multi-network communities a set of candidate driver cancer genes. Some of them were already known oncogenes while a few are new. The combination of the different layers of information allowed us to extract from the multi-network indications on the regulatory pattern and functional role of both the already known and the new candidate driver genes.
Dardick, Chris; Callahan, Ann; Horn, Renate; Ruiz, Karina B; Zhebentyayeva, Tetyana; Hollender, Courtney; Whitaker, Michael; Abbott, Albert; Scorza, Ralph
2013-08-01
Trees are capable of tremendous architectural plasticity, allowing them to maximize their light exposure under highly competitive environments. One key component of tree architecture is the branch angle, yet little is known about the molecular basis for the spatial patterning of branches in trees. Here, we report the identification of a candidate gene for the br mutation in Prunus persica (peach) associated with vertically oriented growth of branches, referred to as 'pillar' or 'broomy'. Ppa010082, annotated as hypothetical protein in the peach genome sequence, was identified as a candidate gene for br using a next generation sequence-based mapping approach. Sequence similarity searches identified rice TAC1 (tiller angle control 1) as a putative ortholog, and we thus named it PpeTAC1. In monocots, TAC1 is known to lead to less compact growth by increasing the tiller angle. In Arabidopsis, an attac1 mutant showed more vertical branch growth angles, suggesting that the gene functions universally to promote the horizontal growth of branches. TAC1 genes belong to a gene family (here named IGT for a shared conserved motif) found in all plant genomes, consisting of two clades: one containing TAC1-like genes; the other containing LAZY1, which contains an EAR motif, and promotes vertical shoot growth in Oryza sativa (rice) and Arabidopsis through influencing polar auxin transport. The data suggest that IGT genes are ancient, and play conserved roles in determining shoot growth angles in plants. Understanding how IGT genes modulate branch angles will provide insights into how different architectural growth habits evolved in terrestrial plants. © 2013 The Authors The Plant Journal © 2013 John Wiley & Sons Ltd.
Franke, Lude; Bakel, Harm van; Fokkens, Like; de Jong, Edwin D.; Egmont-Petersen, Michael; Wijmenga, Cisca
2006-01-01
Most common genetic disorders have a complex inheritance and may result from variants in many genes, each contributing only weak effects to the disease. Pinpointing these disease genes within the myriad of susceptibility loci identified in linkage studies is difficult because these loci may contain hundreds of genes. However, in any disorder, most of the disease genes will be involved in only a few different molecular pathways. If we know something about the relationships between the genes, we can assess whether some genes (which may reside in different loci) functionally interact with each other, indicating a joint basis for the disease etiology. There are various repositories of information on pathway relationships. To consolidate this information, we developed a functional human gene network that integrates information on genes and the functional relationships between genes, based on data from the Kyoto Encyclopedia of Genes and Genomes, the Biomolecular Interaction Network Database, Reactome, the Human Protein Reference Database, the Gene Ontology database, predicted protein-protein interactions, human yeast two-hybrid interactions, and microarray coexpressions. We applied this network to interrelate positional candidate genes from different disease loci and then tested 96 heritable disorders for which the Online Mendelian Inheritance in Man database reported at least three disease genes. Artificial susceptibility loci, each containing 100 genes, were constructed around each disease gene, and we used the network to rank these genes on the basis of their functional interactions. By following up the top five genes per artificial locus, we were able to detect at least one known disease gene in 54% of the loci studied, representing a 2.8-fold increase over random selection. This suggests that our method can significantly reduce the cost and effort of pinpointing true disease genes in analyses of disorders for which numerous loci have been reported but for which most of the genes are unknown. PMID:16685651
Reyes-Gibby, Cielito C; Yuan, Christine; Wang, Jian; Yeung, Sai-Ching J; Shete, Sanjay
2015-06-05
Addictions to alcohol and tobacco, known risk factors for cancer, are complex heritable disorders. Addictive behaviors have a bidirectional relationship with pain. We hypothesize that the associations between alcohol, smoking, and opioid addiction observed in cancer patients have a genetic basis. Therefore, using bioinformatics tools, we explored the underlying genetic basis and identified new candidate genes and common biological pathways for smoking, alcohol, and opioid addiction. Literature search showed 56 genes associated with alcohol, smoking and opioid addiction. Using Core Analysis function in Ingenuity Pathway Analysis software, we found that ERK1/2 was strongly interconnected across all three addiction networks. Genes involved in immune signaling pathways were shown across all three networks. Connect function from IPA My Pathway toolbox showed that DRD2 is the gene common to both the list of genetic variations associated with all three addiction phenotypes and the components of the brain neuronal signaling network involved in substance addiction. The top canonical pathways associated with the 56 genes were: 1) calcium signaling, 2) GPCR signaling, 3) cAMP-mediated signaling, 4) GABA receptor signaling, and 5) G-alpha i signaling. Cancer patients are often prescribed opioids for cancer pain thus increasing their risk for opioid abuse and addiction. Our findings provide candidate genes and biological pathways underlying addiction phenotypes, which may be future targets for treatment of addiction. Further study of the variations of the candidate genes could allow physicians to make more informed decisions when treating cancer pain with opioid analgesics.
Endeavour update: a web resource for gene prioritization in multiple species
Tranchevent, Léon-Charles; Barriot, Roland; Yu, Shi; Van Vooren, Steven; Van Loo, Peter; Coessens, Bert; De Moor, Bart; Aerts, Stein; Moreau, Yves
2008-01-01
Endeavour (http://www.esat.kuleuven.be/endeavourweb; this web site is free and open to all users and there is no login requirement) is a web resource for the prioritization of candidate genes. Using a training set of genes known to be involved in a biological process of interest, our approach consists of (i) inferring several models (based on various genomic data sources), (ii) applying each model to the candidate genes to rank those candidates against the profile of the known genes and (iii) merging the several rankings into a global ranking of the candidate genes. In the present article, we describe the latest developments of Endeavour. First, we provide a web-based user interface, besides our Java client, to make Endeavour more universally accessible. Second, we support multiple species: in addition to Homo sapiens, we now provide gene prioritization for three major model organisms: Mus musculus, Rattus norvegicus and Caenorhabditis elegans. Third, Endeavour makes use of additional data sources and is now including numerous databases: ontologies and annotations, protein–protein interactions, cis-regulatory information, gene expression data sets, sequence information and text-mining data. We tested the novel version of Endeavour on 32 recent disease gene associations from the literature. Additionally, we describe a number of recent independent studies that made use of Endeavour to prioritize candidate genes for obesity and Type II diabetes, cleft lip and cleft palate, and pulmonary fibrosis. PMID:18508807
Uddin, Raihan; Singh, Shiva M.
2017-01-01
As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in “learning and memory” related functions and pathways. Subsequent differential network analysis of this “learning and memory” module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known function of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken together, they provide a new insight and generate new hypotheses into the molecular mechanisms responsible for age associated learning impairment, including spatial learning. PMID:29066959
Uddin, Raihan; Singh, Shiva M
2017-01-01
As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in "learning and memory" related functions and pathways. Subsequent differential network analysis of this "learning and memory" module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known function of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken together, they provide a new insight and generate new hypotheses into the molecular mechanisms responsible for age associated learning impairment, including spatial learning.
Filling gaps in PPAR-alpha signaling through comparative nutrigenomics analysis.
Cavalieri, Duccio; Calura, Enrica; Romualdi, Chiara; Marchi, Emmanuela; Radonjic, Marijana; Van Ommen, Ben; Müller, Michael
2009-12-11
The application of high-throughput genomic tools in nutrition research is a widespread practice. However, it is becoming increasingly clear that the outcome of individual expression studies is insufficient for the comprehensive understanding of such a complex field. Currently, the availability of the large amounts of expression data in public repositories has opened up new challenges on microarray data analyses. We have focused on PPARalpha, a ligand-activated transcription factor functioning as fatty acid sensor controlling the gene expression regulation of a large set of genes in various metabolic organs such as liver, small intestine or heart. The function of PPARalpha is strictly connected to the function of its target genes and, although many of these have already been identified, major elements of its physiological function remain to be uncovered. To further investigate the function of PPARalpha, we have applied a cross-species meta-analysis approach to integrate sixteen microarray datasets studying high fat diet and PPARalpha signal perturbations in different organisms. We identified 164 genes (MDEGs) that were differentially expressed in a constant way in response to a high fat diet or to perturbations in PPARs signalling. In particular, we found five genes in yeast which were highly conserved and homologous of PPARalpha targets in mammals, potential candidates to be used as models for the equivalent mammalian genes. Moreover, a screening of the MDEGs for all known transcription factor binding sites and the comparison with a human genome-wide screening of Peroxisome Proliferating Response Elements (PPRE), enabled us to identify, 20 new potential candidate genes that show, both binding site, both change in expression in the condition studied. Lastly, we found a non random localization of the differentially expressed genes in the genome. The results presented are potentially of great interest to resume the currently available expression data, exploiting the power of in silico analysis filtered by evolutionary conservation. The analysis enabled us to indicate potential gene candidates that could fill in the gaps with regards to the signalling of PPARalpha and, moreover, the non-random localization of the differentially expressed genes in the genome, suggest that epigenetic mechanisms are of importance in the regulation of the transcription operated by PPARalpha.
Bhattacharya, Dipankan; Marfo, Chris A; Li, Davis; Lane, Maura; Khokha, Mustafa K
2015-12-15
Congenital malformations are the major cause of infant mortality in the US and Europe. Due to rapid advances in human genomics, we can now efficiently identify sequence variants that may cause disease in these patients. However, establishing disease causality remains a challenge. Additionally, in the case of congenital heart disease, many of the identified candidate genes are either novel to embryonic development or have no known function. Therefore, there is a pressing need to develop inexpensive and efficient technologies to screen these candidate genes for disease phenocopy in model systems and to perform functional studies to uncover their role in development. For this purpose, we sought to test F0 CRISPR based gene editing as a loss of function strategy for disease phenocopy in the frog model organism, Xenopus tropicalis. We demonstrate that the CRISPR/Cas9 system can efficiently modify both alleles in the F0 generation within a few hours post fertilization, recapitulating even early disease phenotypes that are highly similar to knockdowns from morpholino oligos (MOs) in nearly all cases tested. We find that injecting Cas9 protein is dramatically more efficacious and less toxic than cas9 mRNA. We conclude that CRISPR based F0 gene modification in X. tropicalis is efficient and cost effective and readily recapitulates disease and MO phenotypes. Copyright © 2015 Elsevier Inc. All rights reserved.
FUN-L: gene prioritization for RNAi screens.
Lees, Jonathan G; Hériché, Jean-Karim; Morilla, Ian; Fernández, José M; Adler, Priit; Krallinger, Martin; Vilo, Jaak; Valencia, Alfonso; Ellenberg, Jan; Ranea, Juan A; Orengo, Christine
2015-06-15
Most biological processes remain only partially characterized with many components still to be identified. Given that a whole genome can usually not be tested in a functional assay, identifying the genes most likely to be of interest is of critical importance to avoid wasting resources. Given a set of known functionally related genes and using a state-of-the-art approach to data integration and mining, our Functional Lists (FUN-L) method provides a ranked list of candidate genes for testing. Validation of predictions from FUN-L with independent RNAi screens confirms that FUN-L-produced lists are enriched in genes with the expected phenotypes. In this article, we describe a website front end to FUN-L. The website is freely available to use at http://funl.org © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Nigam, Deepti; Sawant, Samir V
2013-01-01
Technological development led to an increased interest in systems biological approaches in plants to characterize developmental mechanism and candidate genes relevant to specific tissue or cell morphology. AUX-IAA proteins are important plant-specific putative transcription factors. There are several reports on physiological response of this family in Arabidopsis but in cotton fiber the transcriptional network through which AUX-IAA regulated its target genes is still unknown. in-silico modelling of cotton fiber development specific gene expression data (108 microarrays and 22,737 genes) using Algorithm for the Reconstruction of Accurate Cellular Networks (ARACNe) reveals 3690 putative AUX-IAA target genes of which 139 genes were known to be AUX-IAA co-regulated within Arabidopsis. Further AUX-IAA targeted gene regulatory network (GRN) had substantial impact on the transcriptional dynamics of cotton fiber, as showed by, altered TF networks, and Gene Ontology (GO) biological processes and metabolic pathway associated with its target genes. Analysis of the AUX-IAA-correlated gene network reveals multiple functions for AUX-IAA target genes such as unidimensional cell growth, cellular nitrogen compound metabolic process, nucleosome organization, DNA-protein complex and process related to cell wall. These candidate networks/pathways have a variety of profound impacts on such cellular functions as stress response, cell proliferation, and cell differentiation. While these functions are fairly broad, their underlying TF networks may provide a global view of AUX-IAA regulated gene expression and a GRN that guides future studies in understanding role of AUX-IAA box protein and its targets regulating fiber development. PMID:24497725
Sharma, Amitabh; Gulbahce, Natali; Pevzner, Samuel J.; Menche, Jörg; Ladenvall, Claes; Folkersen, Lasse; Eriksson, Per; Orho-Melander, Marju; Barabási, Albert-László
2013-01-01
Genome wide association studies (GWAS) identify susceptibility loci for complex traits, but do not identify particular genes of interest. Integration of functional and network information may help in overcoming this limitation and identifying new susceptibility loci. Using GWAS and comorbidity data, we present a network-based approach to predict candidate genes for lipid and lipoprotein traits. We apply a prediction pipeline incorporating interactome, co-expression, and comorbidity data to Global Lipids Genetics Consortium (GLGC) GWAS for four traits of interest, identifying phenotypically coherent modules. These modules provide insights regarding gene involvement in complex phenotypes with multiple susceptibility alleles and low effect sizes. To experimentally test our predictions, we selected four candidate genes and genotyped representative SNPs in the Malmö Diet and Cancer Cardiovascular Cohort. We found significant associations with LDL-C and total-cholesterol levels for a synonymous SNP (rs234706) in the cystathionine beta-synthase (CBS) gene (p = 1 × 10−5 and adjusted-p = 0.013, respectively). Further, liver samples taken from 206 patients revealed that patients with the minor allele of rs234706 had significant dysregulation of CBS (p = 0.04). Despite the known biological role of CBS in lipid metabolism, SNPs within the locus have not yet been identified in GWAS of lipoprotein traits. Thus, the GWAS-based Comorbidity Module (GCM) approach identifies candidate genes missed by GWAS studies, serving as a broadly applicable tool for the investigation of other complex disease phenotypes. PMID:23882023
Talukder, Zahirul I; Hulke, Brent S; Qi, Lili; Scheffler, Brian E; Pegadaraju, Venkatramana; McPhee, Kevin; Gulya, Thomas J
2014-01-01
Functional markers for Sclerotinia basal stalk rot resistance in sunflower were obtained using gene-level information from the model species Arabidopsis thaliana. Sclerotinia stalk rot, caused by Sclerotinia sclerotiorum, is one of the most destructive diseases of sunflower (Helianthus annuus L.) worldwide. Markers for genes controlling resistance to S. sclerotiorum will enable efficient marker-assisted selection (MAS). We sequenced eight candidate genes homologous to Arabidopsis thaliana defense genes known to be associated with Sclerotinia disease resistance in a sunflower association mapping population evaluated for Sclerotinia stalk rot resistance. The total candidate gene sequence regions covered a concatenated length of 3,791 bp per individual. A total of 187 polymorphic sites were detected for all candidate gene sequences, 149 of which were single nucleotide polymorphisms (SNPs) and 38 were insertions/deletions. Eight SNPs in the coding regions led to changes in amino acid codons. Linkage disequilibrium decay throughout the candidate gene regions declined on average to an r (2) = 0.2 for genetic intervals of 120 bp, but extended up to 350 bp with r (2) = 0.1. A general linear model with modification to account for population structure was found the best fitting model for this population and was used for association mapping. Both HaCOI1-1 and HaCOI1-2 were found to be strongly associated with Sclerotinia stalk rot resistance and explained 7.4 % of phenotypic variation in this population. These SNP markers associated with Sclerotinia stalk rot resistance can potentially be applied to the selection of favorable genotypes, which will significantly improve the efficiency of MAS during the development of stalk rot resistant cultivars.
A comprehensive study of the genomic differentiation between temperate Dent and Flint maize.
Unterseer, Sandra; Pophaly, Saurabh D; Peis, Regina; Westermeier, Peter; Mayer, Manfred; Seidel, Michael A; Haberer, Georg; Mayer, Klaus F X; Ordas, Bernardo; Pausch, Hubert; Tellier, Aurélien; Bauer, Eva; Schön, Chris-Carolin
2016-07-08
Dent and Flint represent two major germplasm pools exploited in maize breeding. Several traits differentiate the two pools, like cold tolerance, early vigor, and flowering time. A comparative investigation of their genomic architecture relevant for quantitative trait expression has not been reported so far. Understanding the genomic differences between germplasm pools may contribute to a better understanding of the complementarity in heterotic patterns exploited in hybrid breeding and of mechanisms involved in adaptation to different environments. We perform whole-genome screens for signatures of selection specific to temperate Dent and Flint maize by comparing high-density genotyping data of 70 American and European Dent and 66 European Flint inbred lines. We find 2.2 % and 1.4 % of the genes are under selective pressure, respectively, and identify candidate genes associated with agronomic traits known to differ between the two pools. Taking flowering time as an example for the differentiation between Dent and Flint, we investigate candidate genes involved in the flowering network by phenotypic analyses in a Dent-Flint introgression library and find that the Flint haplotypes of the candidates promote earlier flowering. Within the flowering network, the majority of Flint candidates are associated with endogenous pathways in contrast to Dent candidate genes, which are mainly involved in response to environmental factors like light and photoperiod. The diversity patterns of the candidates in a unique panel of more than 900 individuals from 38 European landraces indicate a major contribution of landraces from France, Germany, and Spain to the candidate gene diversity of the Flint elite lines. In this study, we report the investigation of pool-specific differences between temperate Dent and Flint on a genome-wide scale. The identified candidate genes represent a promising source for the functional investigation of pool-specific haplotypes in different genetic backgrounds and for the evaluation of their potential for future crop improvement like the adaptation to specific environments.
Chauvet, Cristina; Ménard, Annie; Deng, Alan Y
2015-09-01
Multiple quantitative trait loci (QTLs) for blood pressure (BP) have been detected in rat models of human polygenic hypertension. They influence BP physiologically via epistatic modules. Little is known about the causal genes and virtually nothing is known on modularized mechanisms governing their regulatory connections. Two genes responsible for two individual BP QTLs on rat Chromosome 18 have been identified that belong to the same epistatic module. Treacher Collins-Franceschetti syndrome 1 (Tcof1) gene is the only function candidate for C18QTL3. Haloacid dehalogenase like hydrolase domain containing 2 (Hdhd2), although a gene of previously unknown function, is C18QTL4, and encodes a newly identified phosphatase. The current work has provided the premier evidence that Hdhd2/C18QTL4 and Tcof1/C18QTL3 may be involved in polygenic hypertension. Hdhd2/C18QTL4 can regulate the function of Tcof1/C18QTL3 via de-phosphorylation, and, for the first time, furbishes a molecular mechanism in support of a genetically epistatic hierarchy between two BP QTLs, and thus authenticates the epistasis-common pathway paradigm. The pathway initiated by Hdhd2/C18QTL4 upstream of Tcof1/C18QTL3 reveals novel mechanistic insights into BP modulations. Their discovery might yield innovative therapeutic targets and diagnostic tools predicated on a novel BP cause and mechanism that is determined by a regulatory hierarchy. Optimizing the de-phosphorylation capability and its downstream target could be antihypertensive. The conceptual paradigm of an order and regulatory hierarchy may help unravel genetic and molecular relationships among certain human BP QTLs.
Genomic approaches for the elucidation of genes and gene networks underlying cardiovascular traits.
Adriaens, M E; Bezzina, C R
2018-06-22
Genome-wide association studies have shed light on the association between natural genetic variation and cardiovascular traits. However, linking a cardiovascular trait associated locus to a candidate gene or set of candidate genes for prioritization for follow-up mechanistic studies is all but straightforward. Genomic technologies based on next-generation sequencing technology nowadays offer multiple opportunities to dissect gene regulatory networks underlying genetic cardiovascular trait associations, thereby aiding in the identification of candidate genes at unprecedented scale. RNA sequencing in particular becomes a powerful tool when combined with genotyping to identify loci that modulate transcript abundance, known as expression quantitative trait loci (eQTL), or loci modulating transcript splicing known as splicing quantitative trait loci (sQTL). Additionally, the allele-specific resolution of RNA-sequencing technology enables estimation of allelic imbalance, a state where the two alleles of a gene are expressed at a ratio differing from the expected 1:1 ratio. When multiple high-throughput approaches are combined with deep phenotyping in a single study, a comprehensive elucidation of the relationship between genotype and phenotype comes into view, an approach known as systems genetics. In this review, we cover key applications of systems genetics in the broad cardiovascular field.
Transcript Analysis and Regulative Events during Flower Development in Olive (Olea europaea L.).
Alagna, Fiammetta; Cirilli, Marco; Galla, Giulio; Carbone, Fabrizio; Daddiego, Loretta; Facella, Paolo; Lopez, Loredana; Colao, Chiara; Mariotti, Roberto; Cultrera, Nicolò; Rossi, Martina; Barcaccia, Gianni; Baldoni, Luciana; Muleo, Rosario; Perrotta, Gaetano
2016-01-01
The identification and characterization of transcripts involved in flower organ development, plant reproduction and metabolism represent key steps in plant phenotypic and physiological pathways, and may generate high-quality transcript variants useful for the development of functional markers. This study was aimed at obtaining an extensive characterization of the olive flower transcripts, by providing sound information on the candidate MADS-box genes related to the ABC model of flower development and on the putative genetic and molecular determinants of ovary abortion and pollen-pistil interaction. The overall sequence data, obtained by pyrosequencing of four cDNA libraries from flowers at different developmental stages of three olive varieties with distinct reproductive features (Leccino, Frantoio and Dolce Agogia), included approximately 465,000 ESTs, which gave rise to more than 14,600 contigs and approximately 92,000 singletons. As many as 56,700 unigenes were successfully annotated and provided gene ontology insights into the structural organization and putative molecular function of sequenced transcripts and deduced proteins in the context of their corresponding biological processes. Differentially expressed genes with potential regulatory roles in biosynthetic pathways and metabolic networks during flower development were identified. The gene expression studies allowed us to select the candidate genes that play well-known molecular functions in a number of biosynthetic pathways and specific biological processes that affect olive reproduction. A sound understanding of gene functions and regulatory networks that characterize the olive flower is provided.
Transcript Analysis and Regulative Events during Flower Development in Olive (Olea europaea L.)
Alagna, Fiammetta; Cirilli, Marco; Galla, Giulio; Carbone, Fabrizio; Daddiego, Loretta; Facella, Paolo; Lopez, Loredana; Colao, Chiara; Mariotti, Roberto; Cultrera, Nicolò; Rossi, Martina; Barcaccia, Gianni; Baldoni, Luciana; Muleo, Rosario; Perrotta, Gaetano
2016-01-01
The identification and characterization of transcripts involved in flower organ development, plant reproduction and metabolism represent key steps in plant phenotypic and physiological pathways, and may generate high-quality transcript variants useful for the development of functional markers. This study was aimed at obtaining an extensive characterization of the olive flower transcripts, by providing sound information on the candidate MADS-box genes related to the ABC model of flower development and on the putative genetic and molecular determinants of ovary abortion and pollen-pistil interaction. The overall sequence data, obtained by pyrosequencing of four cDNA libraries from flowers at different developmental stages of three olive varieties with distinct reproductive features (Leccino, Frantoio and Dolce Agogia), included approximately 465,000 ESTs, which gave rise to more than 14,600 contigs and approximately 92,000 singletons. As many as 56,700 unigenes were successfully annotated and provided gene ontology insights into the structural organization and putative molecular function of sequenced transcripts and deduced proteins in the context of their corresponding biological processes. Differentially expressed genes with potential regulatory roles in biosynthetic pathways and metabolic networks during flower development were identified. The gene expression studies allowed us to select the candidate genes that play well-known molecular functions in a number of biosynthetic pathways and specific biological processes that affect olive reproduction. A sound understanding of gene functions and regulatory networks that characterize the olive flower is provided. PMID:27077738
Juul, Malene; Bertl, Johanna; Guo, Qianyun; Nielsen, Morten Muhlig; Świtnicki, Michał; Hornshøj, Henrik; Madsen, Tobias; Hobolth, Asger; Pedersen, Jakob Skou
2017-01-01
Non-coding mutations may drive cancer development. Statistical detection of non-coding driver regions is challenged by a varying mutation rate and uncertainty of functional impact. Here, we develop a statistically founded non-coding driver-detection method, ncdDetect, which includes sample-specific mutational signatures, long-range mutation rate variation, and position-specific impact measures. Using ncdDetect, we screened non-coding regulatory regions of protein-coding genes across a pan-cancer set of whole-genomes (n = 505), which top-ranked known drivers and identified new candidates. For individual candidates, presence of non-coding mutations associates with altered expression or decreased patient survival across an independent pan-cancer sample set (n = 5454). This includes an antigen-presenting gene (CD1A), where 5’UTR mutations correlate significantly with decreased survival in melanoma. Additionally, mutations in a base-excision-repair gene (SMUG1) correlate with a C-to-T mutational-signature. Overall, we find that a rich model of mutational heterogeneity facilitates non-coding driver identification and integrative analysis points to candidates of potential clinical relevance. DOI: http://dx.doi.org/10.7554/eLife.21778.001 PMID:28362259
Tammimies, Kristiina; Bieder, Andrea; Lauter, Gilbert; Sugiaman-Trapman, Debora; Torchet, Rachel; Hokkanen, Marie-Estelle; Burghoorn, Jan; Castrén, Eero; Kere, Juha; Tapia-Páez, Isabel; Swoboda, Peter
2016-01-01
DYX1C1, DCDC2, and KIAA0319 are three of the most replicated dyslexia candidate genes (DCGs). Recently, these DCGs were implicated in functions at the cilium. Here, we investigate the regulation of these DCGs by Regulatory Factor X transcription factors (RFX TFs), a gene family known for transcriptionally regulating ciliary genes. We identify conserved X-box motifs in the promoter regions of DYX1C1, DCDC2, and KIAA0319 and demonstrate their functionality, as well as the ability to recruit RFX TFs using reporter gene and electrophoretic mobility shift assays. Furthermore, we uncover a complex regulation pattern between RFX1, RFX2, and RFX3 and their significant effect on modifying the endogenous expression of DYX1C1 and DCDC2 in a human retinal pigmented epithelial cell line immortalized with hTERT (hTERT-RPE1). In addition, induction of ciliogenesis increases the expression of RFX TFs and DCGs. At the protein level, we show that endogenous DYX1C1 localizes to the base of the cilium, whereas DCDC2 localizes along the entire axoneme of the cilium, thereby validating earlier localization studies using overexpression models. Our results corroborate the emerging role of DCGs in ciliary function and characterize functional noncoding elements, X-box promoter motifs, in DCG promoter regions, which thus can be targeted for mutation screening in dyslexia and ciliopathies associated with these genes.—Tammimies, K., Bieder, A., Lauter, G., Sugiaman-Trapman, D., Torchet, R., Hokkanen, M.-E., Burghoorn, J., Castrén, E., Kere, J., Tapia-Páez, I., Swoboda, P. Ciliary dyslexia candidate genes DYX1C1 and DCDC2 are regulated by Regulatory Factor (RF) X transcription factors through X-box promoter motifs. PMID:27451412
Tammimies, Kristiina; Bieder, Andrea; Lauter, Gilbert; Sugiaman-Trapman, Debora; Torchet, Rachel; Hokkanen, Marie-Estelle; Burghoorn, Jan; Castrén, Eero; Kere, Juha; Tapia-Páez, Isabel; Swoboda, Peter
2016-10-01
DYX1C1, DCDC2, and KIAA0319 are three of the most replicated dyslexia candidate genes (DCGs). Recently, these DCGs were implicated in functions at the cilium. Here, we investigate the regulation of these DCGs by Regulatory Factor X transcription factors (RFX TFs), a gene family known for transcriptionally regulating ciliary genes. We identify conserved X-box motifs in the promoter regions of DYX1C1, DCDC2, and KIAA0319 and demonstrate their functionality, as well as the ability to recruit RFX TFs using reporter gene and electrophoretic mobility shift assays. Furthermore, we uncover a complex regulation pattern between RFX1, RFX2, and RFX3 and their significant effect on modifying the endogenous expression of DYX1C1 and DCDC2 in a human retinal pigmented epithelial cell line immortalized with hTERT (hTERT-RPE1). In addition, induction of ciliogenesis increases the expression of RFX TFs and DCGs. At the protein level, we show that endogenous DYX1C1 localizes to the base of the cilium, whereas DCDC2 localizes along the entire axoneme of the cilium, thereby validating earlier localization studies using overexpression models. Our results corroborate the emerging role of DCGs in ciliary function and characterize functional noncoding elements, X-box promoter motifs, in DCG promoter regions, which thus can be targeted for mutation screening in dyslexia and ciliopathies associated with these genes.-Tammimies, K., Bieder, A., Lauter, G., Sugiaman-Trapman, D., Torchet, R., Hokkanen, M.-E., Burghoorn, J., Castrén, E., Kere, J., Tapia-Páez, I., Swoboda, P. Ciliary dyslexia candidate genes DYX1C1 and DCDC2 are regulated by Regulatory Factor (RF) X transcription factors through X-box promoter motifs. © The Author(s).
Simm, Franziska; Griesbeck, Anne; Choukair, Daniela; Weiß, Birgit; Paramasivam, Nagarajan; Klammt, Jürgen; Schlesner, Matthias; Wiemann, Stefan; Martinez, Cristina; Hoffmann, Georg F; Pfäffle, Roland W; Bettendorf, Markus; Rappold, Gudrun A
2017-10-26
PurposeCombined pituitary hormone deficiency (CPHD) is characterized by a malformed or underdeveloped pituitary gland resulting in an impaired pituitary hormone secretion. Several transcription factors have been described in its etiology, but defects in known genes account for only a small proportion of cases.MethodsTo identify novel genetic causes for congenital hypopituitarism, we performed exome-sequencing studies on 10 patients with CPHD and their unaffected parents. Two candidate genes were sequenced in further 200 patients. Genotype data of known hypopituitary genes are reviewed.ResultsWe discovered 51 likely damaging variants in 38 genes; 12 of the 51 variants represent de novo events (24%); 11 of the 38 genes (29%) were present in the E12.5/E14.5 pituitary transcriptome. Targeted sequencing of two candidate genes, SLC20A1 and SLC15A4, of the solute carrier membrane transport protein family in 200 additional patients demonstrated two further variants predicted as damaging. We also found combinations of de novo (SLC20A1/SLC15A4) and transmitted variants (GLI2/LHX3) in the same individuals, leading to the full-blown CPHD phenotype.ConclusionThese data expand the pituitary target genes repertoire for diagnostics and further functional studies. Exome sequencing has identified a combination of rare variants in different genes that might explain incomplete penetrance in CPHD.Genetics in Medicine advance online publication, 26 October 2017; doi:10.1038/gim.2017.165.
Molecular evolution of candidate male reproductive genes in the brown algal model Ectocarpus.
Lipinska, Agnieszka P; Van Damme, Els J M; De Clerck, Olivier
2016-01-05
Evolutionary studies of genes that mediate recognition between sperm and egg contribute to our understanding of reproductive isolation and speciation. Surface receptors involved in fertilization are targets of sexual selection, reinforcement, and other evolutionary forces including positive selection. This observation was made across different lineages of the eukaryotic tree from land plants to mammals, and is particularly evident in free-spawning animals. Here we use the brown algal model species Ectocarpus (Phaeophyceae) to investigate the evolution of candidate gamete recognition proteins in a distant major phylogenetic group of eukaryotes. Male gamete specific genes were identified by comparing transcriptome data covering different stages of the Ectocarpus life cycle and screened for characteristics expected from gamete recognition receptors. Selected genes were sequenced in a representative number of strains from distant geographical locations and varying stages of reproductive isolation, to search for signatures of adaptive evolution. One of the genes (Esi0130_0068) showed evidence of selective pressure. Interestingly, that gene displayed domain similarities to the receptor for egg jelly (REJ) protein involved in sperm-egg recognition in sea urchins. We have identified a male gamete specific gene with similarity to known gamete recognition receptors and signatures of adaptation. Altogether, this gene could contribute to gamete interaction during reproduction as well as reproductive isolation in Ectocarpus and is therefore a good candidate for further functional evaluation.
NCG 4.0: the network of cancer genes in the era of massive mutational screenings of cancer genomes
An, Omer; Pendino, Vera; D’Antonio, Matteo; Ratti, Emanuele; Gentilini, Marco; Ciccarelli, Francesca D.
2014-01-01
NCG 4.0 is the latest update of the Network of Cancer Genes, a web-based repository of systems-level properties of cancer genes. In its current version, the database collects information on 537 known (i.e. experimentally supported) and 1463 candidate (i.e. inferred using statistical methods) cancer genes. Candidate cancer genes derive from the manual revision of 67 original publications describing the mutational screening of 3460 human exomes and genomes in 23 different cancer types. For all 2000 cancer genes, duplicability, evolutionary origin, expression, functional annotation, interaction network with other human proteins and with microRNAs are reported. In addition to providing a substantial update of cancer-related information, NCG 4.0 also introduces two new features. The first is the annotation of possible false-positive cancer drivers, defined as candidate cancer genes inferred from large-scale screenings whose association with cancer is likely to be spurious. The second is the description of the systems-level properties of 64 human microRNAs that are causally involved in cancer progression (oncomiRs). Owing to the manual revision of all information, NCG 4.0 constitutes a complete and reliable resource on human coding and non-coding genes whose deregulation drives cancer onset and/or progression. NCG 4.0 can also be downloaded as a free application for Android smart phones. Database URL: http://bio.ieo.eu/ncg/ PMID:24608173
Ingham, Victoria A; Jones, Christopher M; Pignatelli, Patricia; Balabanidou, Vasileia; Vontas, John; Wagstaff, Simon C; Moore, Jonathan D; Ranson, Hilary
2014-11-25
The elevated expression of enzymes with insecticide metabolism activity can lead to high levels of insecticide resistance in the malaria vector, Anopheles gambiae. In this study, adult female mosquitoes from an insecticide susceptible and resistant strain were dissected into four different body parts. RNA from each of these samples was used in microarray analysis to determine the enrichment patterns of the key detoxification gene families within the mosquito and to identify additional candidate insecticide resistance genes that may have been overlooked in previous experiments on whole organisms. A general enrichment in the transcription of genes from the four major detoxification gene families (carboxylesterases, glutathione transferases, UDP glucornyltransferases and cytochrome P450s) was observed in the midgut and malpighian tubules. Yet the subset of P450 genes that have previously been implicated in insecticide resistance in An gambiae, show a surprisingly varied profile of tissue enrichment, confirmed by qPCR and, for three candidates, by immunostaining. A stringent selection process was used to define a list of 105 genes that are significantly (p ≤0.001) over expressed in body parts from the resistant versus susceptible strain. Over half of these, including all the cytochrome P450s on this list, were identified in previous whole organism comparisons between the strains, but several new candidates were detected, notably from comparisons of the transcriptomes from dissected abdomen integuments. The use of RNA extracted from the whole organism to identify candidate insecticide resistance genes has a risk of missing candidates if key genes responsible for the phenotype have restricted expression within the body and/or are over expression only in certain tissues. However, as transcription of genes implicated in metabolic resistance to insecticides is not enriched in any one single organ, comparison of the transcriptome of individual dissected body parts cannot be recommended as a preferred means to identify new candidate insecticide resistant genes. Instead the rich data set on in vivo sites of transcription should be consulted when designing follow up qPCR validation steps, or for screening known candidates in field populations.
Ballester, M; Castelló, A; Peiró, R; Argente, M J; Santacreu, M A; Folch, J M
2013-06-01
Suppressive subtractive hybridization libraries from oviduct at 62 h post-mating of two lines of rabbits divergently selected for uterine capacity were generated to identify differentially expressed genes. A total of 438 singletons and 126 contigs were obtained by cluster assembly and sequence alignment of 704 expressed sequence tags (ESTs), of which 54% showed homology to known proteins of the non-redundant NCBI databases. Differential screening by dot blot validated 71 ESTs, of which 47 showed similarity to known genes. Transcripts of genes were functionally annotated in the molecular function and the biological process gene ontology categories using the BLAST2GO software and were assigned to reproductive developmental process, immune response, amino acid metabolism and degradation, response to stress and apoptosis terms. Finally, three interesting genes, PGR, HSD17B4 and ERO1L, were identified as overexpressed in the low line using RT-qPCR. Our study provides a list of candidate genes that can be useful to understanding the molecular mechanisms underlying the phenotypic differences observed in early embryo survival and development traits. © 2012 The Authors, Animal Genetics © 2012 Stichting International Foundation for Animal Genetics.
Schrider, Daniel R.; Kern, Andrew D.
2015-01-01
The comparative genomics revolution of the past decade has enabled the discovery of functional elements in the human genome via sequence comparison. While that is so, an important class of elements, those specific to humans, is entirely missed by searching for sequence conservation across species. Here we present an analysis based on variation data among human genomes that utilizes a supervised machine learning approach for the identification of human-specific purifying selection in the genome. Using only allele frequency information from the complete low-coverage 1000 Genomes Project data set in conjunction with a support vector machine trained from known functional and nonfunctional portions of the genome, we are able to accurately identify portions of the genome constrained by purifying selection. Our method identifies previously known human-specific gains or losses of function and uncovers many novel candidates. Candidate targets for gain and loss of function along the human lineage include numerous putative regulatory regions of genes essential for normal development of the central nervous system, including a significant enrichment of gain of function events near neurotransmitter receptor genes. These results are consistent with regulatory turnover being a key mechanism in the evolution of human-specific characteristics of brain development. Finally, we show that the majority of the genome is unconstrained by natural selection currently, in agreement with what has been estimated from phylogenetic methods but in sharp contrast to estimates based on transcriptomics or other high-throughput functional methods. PMID:26590212
Shaheen, Ranad; Faqeih, Eissa; Alshammari, Muneera J; Swaid, Abdulrahman; Al-Gazali, Lihadh; Mardawi, Elham; Ansari, Shinu; Sogaty, Sameera; Seidahmed, Mohammed Z; AlMotairi, Muhammed I; Farra, Chantal; Kurdi, Wesam; Al-Rasheed, Shatha; Alkuraya, Fowzan S
2013-01-01
Meckel–Gruber syndrome (MKS, OMIM #249000) is a multiple congenital malformation syndrome that represents the severe end of the ciliopathy phenotypic spectrum. Despite the relatively common occurrence of this syndrome among Arabs, little is known about its genetic architecture in this population. This is a series of 18 Arab families with MKS, who were evaluated clinically and studied using autozygome-guided mutation analysis and exome sequencing. We show that autozygome-guided candidate gene analysis identified the underlying mutation in the majority (n=12, 71%). Exome sequencing revealed a likely pathogenic mutation in three novel candidate MKS disease genes. These include C5orf42, Ellis–van-Creveld disease gene EVC2 and SEC8 (also known as EXOC4), which encodes an exocyst protein with an established role in ciliogenesis. This is the largest and most comprehensive genomic study on MKS in Arabs and the results, in addition to revealing genetic and allelic heterogeneity, suggest that previously reported disease genes and the novel candidates uncovered by this study account for the overwhelming majority of MKS patients in our population. PMID:23169490
Molecular and comparative genetics of mental retardation.
Inlow, Jennifer K; Restifo, Linda L
2004-01-01
Affecting 1-3% of the population, mental retardation (MR) poses significant challenges for clinicians and scientists. Understanding the biology of MR is complicated by the extraordinary heterogeneity of genetic MR disorders. Detailed analyses of >1000 Online Mendelian Inheritance in Man (OMIM) database entries and literature searches through September 2003 revealed 282 molecularly identified MR genes. We estimate that hundreds more MR genes remain to be identified. A novel test, in which we distributed unmapped MR disorders proportionately across the autosomes, failed to eliminate the well-known X-chromosome overrepresentation of MR genes and candidate genes. This evidence argues against ascertainment bias as the main cause of the skewed distribution. On the basis of a synthesis of clinical and laboratory data, we developed a biological functions classification scheme for MR genes. Metabolic pathways, signaling pathways, and transcription are the most common functions, but numerous other aspects of neuronal and glial biology are controlled by MR genes as well. Using protein sequence and domain-organization comparisons, we found a striking conservation of MR genes and genetic pathways across the approximately 700 million years that separate Homo sapiens and Drosophila melanogaster. Eighty-seven percent have one or more fruit fly homologs and 76% have at least one candidate functional ortholog. We propose that D. melanogaster can be used in a systematic manner to study MR and possibly to develop bioassays for therapeutic drug discovery. We selected 42 Drosophila orthologs as most likely to reveal molecular and cellular mechanisms of nervous system development or plasticity relevant to MR. PMID:15020472
Adaptation to climate through flowering phenology: a case study in Medicago truncatula.
Burgarella, Concetta; Chantret, Nathalie; Gay, Laurène; Prosperi, Jean-Marie; Bonhomme, Maxime; Tiffin, Peter; Young, Nevin D; Ronfort, Joelle
2016-07-01
Local climatic conditions likely constitute an important selective pressure on genes underlying important fitness-related traits such as flowering time, and in many species, flowering phenology and climatic gradients strongly covary. To test whether climate shapes the genetic variation on flowering time genes and to identify candidate flowering genes involved in the adaptation to environmental heterogeneity, we used a large Medicago truncatula core collection to examine the association between nucleotide polymorphisms at 224 candidate genes and both climate variables and flowering phenotypes. Unlike genome-wide studies, candidate gene approaches are expected to enrich for the number of meaningful trait associations because they specifically target genes that are known to affect the trait of interest. We found that flowering time mediates adaptation to climatic conditions mainly by variation at genes located upstream in the flowering pathways, close to the environmental stimuli. Variables related to the annual precipitation regime reflected selective constraints on flowering time genes better than the other variables tested (temperature, altitude, latitude or longitude). By comparing phenotype and climate associations, we identified 12 flowering genes as the most promising candidates responsible for phenological adaptation to climate. Four of these genes were located in the known flowering time QTL region on chromosome 7. However, climate and flowering associations also highlighted largely distinct gene sets, suggesting different genetic architectures for adaptation to climate and flowering onset. © 2016 John Wiley & Sons Ltd.
[Strategies of elucidation of biosynthetic pathways of natural products].
Zou, Li-Qiu; Kuang, Xue-Jun; Sun, Chao; Chen, Shi-Lin
2016-11-01
Elucidation of the biosynthetic pathways of natural products is not only the major goal of herb genomics, but also the solid foundation of synthetic biology of natural products. Here, this paper reviewed recent advance in this field and put forward strategies to elucidate the biosynthetic pathway of natural products. Firstly, a proposed biosynthetic pathway should be set up based on well-known knowledge about chemical reactions and information on the identified compounds, as well as studies with isotope tracer. Secondly, candidate genes possibly involved in the biosynthetic pathway were screened out by co-expression analysis and/or gene cluster mining. Lastly, all the candidate genes were heterologously expressed in the host and then the enzyme involved in the biosynthetic pathway was characterized by activity assay. Sometimes, the function of the enzyme in the original plant could be further studied by RNAi or VIGS technology. Understanding the biosynthetic pathways of natural products will contribute to supply of new leading compounds by synthetic biology and provide "functional marker" for herbal molecular breeding, thus but boosting the development of traditional Chinese medicine agriculture. Copyright© by the Chinese Pharmaceutical Association.
Jonczyk, Magda S; Simon, Michelle; Kumar, Saumya; Fernandes, Vitor E; Sylvius, Nicolas; Mallon, Ann-Marie; Denny, Paul; Andrew, Peter W
2014-01-01
Streptococcus pneumoniae is an important human pathogen responsible for high mortality and morbidity worldwide. The susceptibility to pneumococcal infections is controlled by as yet unknown genetic factors. To elucidate these factors could help to develop new medical treatments and tools to identify those most at risk. In recent years genome wide association studies (GWAS) in mice and humans have proved successful in identification of causal genes involved in many complex diseases for example diabetes, systemic lupus or cholesterol metabolism. In this study a GWAS approach was used to map genetic loci associated with susceptibility to pneumococcal infection in 26 inbred mouse strains. As a result four candidate QTLs were identified on chromosomes 7, 13, 18 and 19. Interestingly, the QTL on chromosome 7 was located within S. pneumoniae resistance QTL (Spir1) identified previously in a linkage study of BALB/cOlaHsd and CBA/CaOlaHsd F2 intercrosses. We showed that only a limited number of genes encoded within the QTLs carried phenotype-associated polymorphisms (22 genes out of several hundred located within the QTLs). These candidate genes are known to regulate TGFβ signalling, smooth muscle and immune cells functions. Interestingly, our pulmonary histopathology and gene expression data demonstrated, lung vasculature plays an important role in resistance to pneumococcal infection. Therefore we concluded that the cumulative effect of these candidate genes on vasculature and immune cells functions as contributory factors in the observed differences in susceptibility to pneumococcal infection. We also propose that TGFβ-mediated regulation of fibroblast differentiation plays an important role in development of invasive pneumococcal disease. Gene expression data submitted to the NCBI Gene Expression Omnibus Accession No: GSE49533 SNP data submitted to NCBI dbSNP Short Genetic Variation http://www.ncbi.nlm.nih.gov/projects/SNP/snp_viewTable.cgi?handle=MUSPNEUMONIA.
Bigham, Abigail; Bauchet, Marc; Pinto, Dalila; Mao, Xianyun; Akey, Joshua M; Mei, Rui; Scherer, Stephen W; Julian, Colleen G; Wilson, Megan J; López Herráez, David; Brutsaert, Tom; Parra, Esteban J; Moore, Lorna G; Shriver, Mark D
2010-09-09
High-altitude hypoxia (reduced inspired oxygen tension due to decreased barometric pressure) exerts severe physiological stress on the human body. Two high-altitude regions where humans have lived for millennia are the Andean Altiplano and the Tibetan Plateau. Populations living in these regions exhibit unique circulatory, respiratory, and hematological adaptations to life at high altitude. Although these responses have been well characterized physiologically, their underlying genetic basis remains unknown. We performed a genome scan to identify genes showing evidence of adaptation to hypoxia. We looked across each chromosome to identify genomic regions with previously unknown function with respect to altitude phenotypes. In addition, groups of genes functioning in oxygen metabolism and sensing were examined to test the hypothesis that particular pathways have been involved in genetic adaptation to altitude. Applying four population genetic statistics commonly used for detecting signatures of natural selection, we identified selection-nominated candidate genes and gene regions in these two populations (Andeans and Tibetans) separately. The Tibetan and Andean patterns of genetic adaptation are largely distinct from one another, with both populations showing evidence of positive natural selection in different genes or gene regions. Interestingly, one gene previously known to be important in cellular oxygen sensing, EGLN1 (also known as PHD2), shows evidence of positive selection in both Tibetans and Andeans. However, the pattern of variation for this gene differs between the two populations. Our results indicate that several key HIF-regulatory and targeted genes are responsible for adaptation to high altitude in Andeans and Tibetans, and several different chromosomal regions are implicated in the putative response to selection. These data suggest a genetic role in high-altitude adaption and provide a basis for future genotype/phenotype association studies necessary to confirm the role of selection-nominated candidate genes and gene regions in adaptation to altitude.
Bigham, Abigail; Bauchet, Marc; Pinto, Dalila; Mao, Xianyun; Akey, Joshua M.; Mei, Rui; Scherer, Stephen W.; Julian, Colleen G.; Wilson, Megan J.; López Herráez, David; Brutsaert, Tom; Parra, Esteban J.; Moore, Lorna G.; Shriver, Mark D.
2010-01-01
High-altitude hypoxia (reduced inspired oxygen tension due to decreased barometric pressure) exerts severe physiological stress on the human body. Two high-altitude regions where humans have lived for millennia are the Andean Altiplano and the Tibetan Plateau. Populations living in these regions exhibit unique circulatory, respiratory, and hematological adaptations to life at high altitude. Although these responses have been well characterized physiologically, their underlying genetic basis remains unknown. We performed a genome scan to identify genes showing evidence of adaptation to hypoxia. We looked across each chromosome to identify genomic regions with previously unknown function with respect to altitude phenotypes. In addition, groups of genes functioning in oxygen metabolism and sensing were examined to test the hypothesis that particular pathways have been involved in genetic adaptation to altitude. Applying four population genetic statistics commonly used for detecting signatures of natural selection, we identified selection-nominated candidate genes and gene regions in these two populations (Andeans and Tibetans) separately. The Tibetan and Andean patterns of genetic adaptation are largely distinct from one another, with both populations showing evidence of positive natural selection in different genes or gene regions. Interestingly, one gene previously known to be important in cellular oxygen sensing, EGLN1 (also known as PHD2), shows evidence of positive selection in both Tibetans and Andeans. However, the pattern of variation for this gene differs between the two populations. Our results indicate that several key HIF-regulatory and targeted genes are responsible for adaptation to high altitude in Andeans and Tibetans, and several different chromosomal regions are implicated in the putative response to selection. These data suggest a genetic role in high-altitude adaption and provide a basis for future genotype/phenotype association studies necessary to confirm the role of selection-nominated candidate genes and gene regions in adaptation to altitude. PMID:20838600
Woldesemayat, Adugna Abdi; Van Heusden, Peter; Ndimba, Bongani K; Christoffels, Alan
2017-12-22
Drought is the most disastrous abiotic stress that severely affects agricultural productivity worldwide. Understanding the biological basis of drought-regulated traits, requires identification and an in-depth characterization of genetic determinants using model organisms and high-throughput technologies. However, studies on drought tolerance have generally been limited to traditional candidate gene approach that targets only a single gene in a pathway that is related to a trait. In this study, we used sorghum, one of the model crops that is well adapted to arid regions, to mine genes and define determinants for drought tolerance using drought expression libraries and RNA-seq data. We provide an integrated and comparative in silico candidate gene identification, characterization and annotation approach, with an emphasis on genes playing a prominent role in conferring drought tolerance in sorghum. A total of 470 non-redundant functionally annotated drought responsive genes (DRGs) were identified using experimental data from drought responses by employing pairwise sequence similarity searches, pathway and interpro-domain analysis, expression profiling and orthology relation. Comparison of the genomic locations between these genes and sorghum quantitative trait loci (QTLs) showed that 40% of these genes were co-localized with QTLs known for drought tolerance. The genome reannotation conducted using the Program to Assemble Spliced Alignment (PASA), resulted in 9.6% of existing single gene models being updated. In addition, 210 putative novel genes were identified using AUGUSTUS and PASA based analysis on expression dataset. Among these, 50% were single exonic, 69.5% represented drought responsive and 5.7% were complete gene structure models. Analysis of biochemical metabolism revealed 14 metabolic pathways that are related to drought tolerance and also had a strong biological network, among categories of genes involved. Identification of these pathways, signifies the interplay of biochemical reactions that make up the metabolic network, constituting fundamental interface for sorghum defence mechanism against drought stress. This study suggests untapped natural variability in sorghum that could be used for developing drought tolerance. The data presented here, may be regarded as an initial reference point in functional and comparative genomics in the Gramineae family.
Rare copy number variants in patients with congenital conotruncal heart defects.
Xie, Hongbo M; Werner, Petra; Stambolian, Dwight; Bailey-Wilson, Joan E; Hakonarson, Hakon; White, Peter S; Taylor, Deanne M; Goldmuntz, Elizabeth
2017-03-01
Previous studies using different cardiac phenotypes, technologies and designs suggest a burden of large, rare or de novo copy number variants (CNVs) in subjects with congenital heart defects. We sought to identify disease-related CNVs, candidate genes, and functional pathways in a large number of cases with conotruncal and related defects that carried no known genetic syndrome. Cases and control samples were divided into two cohorts and genotyped to assess each subject's CNV content. Analyses were performed to ascertain differences in overall CNV prevalence and to identify enrichment of specific genes and functional pathways in conotruncal cases relative to healthy controls. Only findings present in both cohorts are presented. From 973 total conotruncal cases, a burden of rare CNVs was detected in both cohorts. Candidate genes from rare CNVs found in both cohorts were identified based on their association with cardiac development or disease, and/or their reported disruption in published studies. Functional and pathway analyses revealed significant enrichment of terms involved in either heart or early embryonic development. Our study tested one of the largest cohorts specifically with cardiac conotruncal and related defects. These results confirm and extend previous findings that CNVs contribute to disease risk for congenital heart defects in general and conotruncal defects in particular. As disease heterogeneity renders identification of single recurrent genes or loci difficult, functional pathway and gene regulation network analyses appear to be more informative. Birth Defects Research 109:271-295, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Figueroa, Melania; Upadhyaya, Narayana M; Sperschneider, Jana; Park, Robert F; Szabo, Les J; Steffenson, Brian; Ellis, Jeff G; Dodds, Peter N
2016-01-01
The recent resurgence of wheat stem rust caused by new virulent races of Puccinia graminis f. sp. tritici (Pgt) poses a threat to food security. These concerns have catalyzed an extensive global effort toward controlling this disease. Substantial research and breeding programs target the identification and introduction of new stem rust resistance (Sr) genes in cultivars for genetic protection against the disease. Such resistance genes typically encode immune receptor proteins that recognize specific components of the pathogen, known as avirulence (Avr) proteins. A significant drawback to deploying cultivars with single Sr genes is that they are often overcome by evolution of the pathogen to escape recognition through alterations in Avr genes. Thus, a key element in achieving durable rust control is the deployment of multiple effective Sr genes in combination, either through conventional breeding or transgenic approaches, to minimize the risk of resistance breakdown. In this situation, evolution of pathogen virulence would require changes in multiple Avr genes in order to bypass recognition. However, choosing the optimal Sr gene combinations to deploy is a challenge that requires detailed knowledge of the pathogen Avr genes with which they interact and the virulence phenotypes of Pgt existing in nature. Identifying specific Avr genes from Pgt will provide screening tools to enhance pathogen virulence monitoring, assess heterozygosity and propensity for mutation in pathogen populations, and confirm individual Sr gene functions in crop varieties carrying multiple effective resistance genes. Toward this goal, much progress has been made in assembling a high quality reference genome sequence for Pgt, as well as a Pan-genome encompassing variation between multiple field isolates with diverse virulence spectra. In turn this has allowed prediction of Pgt effector gene candidates based on known features of Avr genes in other plant pathogens, including the related flax rust fungus. Upregulation of gene expression in haustoria and evidence for diversifying selection are two useful parameters to identify candidate Avr genes. Recently, we have also applied machine learning approaches to agnostically predict candidate effectors. Here, we review progress in stem rust pathogenomics and approaches currently underway to identify Avr genes recognized by wheat Sr genes.
Bruse, Shannon; Moreau, Michael; Bromberg, Yana; Jang, Jun-Ho; Wang, Nan; Ha, Hongseok; Picchi, Maria; Lin, Yong; Langley, Raymond J; Qualls, Clifford; Klensney-Tait, Julia; Zabner, Joseph; Leng, Shuguang; Mao, Jenny; Belinsky, Steven A; Xing, Jinchuan; Nyunoya, Toru
2016-01-07
Chronic obstructive pulmonary disease (COPD) is characterized by an irreversible airflow limitation in response to inhalation of noxious stimuli, such as cigarette smoke. However, only 15-20 % smokers manifest COPD, suggesting a role for genetic predisposition. Although genome-wide association studies have identified common genetic variants that are associated with susceptibility to COPD, effect sizes of the identified variants are modest, as is the total heritability accounted for by these variants. In this study, an extreme phenotype exome sequencing study was combined with in vitro modeling to identify COPD candidate genes. We performed whole exome sequencing of 62 highly susceptible smokers and 30 exceptionally resistant smokers to identify rare variants that may contribute to disease risk or resistance to COPD. This was a cross-sectional case-control study without therapeutic intervention or longitudinal follow-up information. We identified candidate genes based on rare variant analyses and evaluated exonic variants to pinpoint individual genes whose function was computationally established to be significantly different between susceptible and resistant smokers. Top scoring candidate genes from these analyses were further filtered by requiring that each gene be expressed in human bronchial epithelial cells (HBECs). A total of 81 candidate genes were thus selected for in vitro functional testing in cigarette smoke extract (CSE)-exposed HBECs. Using small interfering RNA (siRNA)-mediated gene silencing experiments, we showed that silencing of several candidate genes augmented CSE-induced cytotoxicity in vitro. Our integrative analysis through both genetic and functional approaches identified two candidate genes (TACC2 and MYO1E) that augment cigarette smoke (CS)-induced cytotoxicity and, potentially, COPD susceptibility.
Valentine, M C; Linabery, A M; Chasnoff, S; Hughes, A E O; Mallaney, C; Sanchez, N; Giacalone, J; Heerema, N A; Hilden, J M; Spector, L G; Ross, J A; Druley, T E
2014-01-01
Infant leukemia (IL) is a rare sporadic cancer with a grim prognosis. Although most cases are accompanied by MLL rearrangements and harbor very few somatic mutations, less is known about the genetics of the cases without MLL translocations. We performed the largest exome-sequencing study to date on matched non-cancer DNA from pairs of mothers and IL patients to characterize congenital variation that may contribute to early leukemogenesis. Using the COSMIC database to define acute leukemia-associated candidate genes, we find a significant enrichment of rare, potentially functional congenital variation in IL patients compared with randomly selected genes within the same patients and unaffected pediatric controls. IL acute myeloid leukemia (AML) patients had more overall variation than IL acute lymphocytic leukemia (ALL) patients, but less of that variation was inherited from mothers. Of our candidate genes, we found that MLL3 was a compound heterozygote in every infant who developed AML and 50% of infants who developed ALL. These data suggest a model by which known genetic mechanisms for leukemogenesis could be disrupted without an abundance of somatic mutation or chromosomal rearrangements. This model would be consistent with existing models for the establishment of leukemia clones in utero and the high rate of IL concordance in monozygotic twins. PMID:24301523
Takeda, Haruna; Rust, Alistair G.; Ward, Jerrold M.; Yew, Christopher Chin Kuan; Jenkins, Nancy A.; Copeland, Neal G.
2016-01-01
Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4+/− mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC. PMID:27006499
Takeda, Haruna; Rust, Alistair G; Ward, Jerrold M; Yew, Christopher Chin Kuan; Jenkins, Nancy A; Copeland, Neal G
2016-04-05
Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4(+/-) mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC.
Sykes, Timothy; Yates, Steven; Nagy, Istvan; Asp, Torben; Small, Ian
2017-01-01
Perennial ryegrass (Lolium perenne L.) is widely used for forage production in both permanent and temporary grassland systems. To increase yields in perennial ryegrass, recent breeding efforts have been focused on strategies to more efficiently exploit heterosis by hybrid breeding. Cytoplasmic male sterility (CMS) is a widely applied mechanism to control pollination for commercial hybrid seed production and although CMS systems have been identified in perennial ryegrass, they are yet to be fully characterized. Here, we present a bioinformatics pipeline for efficient identification of candidate restorer of fertility (Rf) genes for CMS. From a high-quality draft of the perennial ryegrass genome, 373 pentatricopeptide repeat (PPR) genes were identified and classified, further identifying 25 restorer of fertility-like PPR (RFL) genes through a combination of DNA sequence clustering and comparison to known Rf genes. This extensive gene family was targeted as the majority of Rf genes in higher plants are RFL genes. These RFL genes were further investigated by phylogenetic analyses, identifying three groups of perennial ryegrass RFLs. These three groups likely represent genomic regions of active RFL generation and identify the probable location of perennial ryegrass PPR-Rf genes. This pipeline allows for the identification of candidate PPR-Rf genes from genomic sequence data and can be used in any plant species. Functional markers for PPR-Rf genes will facilitate map-based cloning of Rf genes and enable the use of CMS as an efficient tool to control pollination for hybrid crop production. PMID:26951780
Santos, Maria CLG; Hart, P Suzanne; Ramaswami, Mukundhan; Kanno, Cláudia M; Hart, Thomas C; Line, Sergio RP
2007-01-01
Amelogenesis imperfecta (AI) is a genetically heterogeneous group of diseases that result in defective development of tooth enamel. Mutations in several enamel proteins and proteinases have been associated with AI. The object of this study was to evaluate evidence of etiology for the six major candidate gene loci in two Brazilian families with AI. Genomic DNA was obtained from family members and all exons and exon-intron boundaries of the ENAM, AMBN, AMELX, MMP20, KLK4 and Amelotin gene were amplified and sequenced. Each family was also evaluated for linkage to chromosome regions known to contain genes important in enamel development. The present study indicates that the AI in these two families is not caused by any of the known loci for AI or any of the major candidate genes proposed in the literature. These findings indicate extensive genetic heterogeneity for non-syndromic AI. PMID:17266769
Santos, Maria C L G; Hart, P Suzanne; Ramaswami, Mukundhan; Kanno, Cláudia M; Hart, Thomas C; Line, Sergio R P
2007-01-31
Amelogenesis imperfecta (AI) is a genetically heterogeneous group of diseases that result in defective development of tooth enamel. Mutations in several enamel proteins and proteinases have been associated with AI. The object of this study was to evaluate evidence of etiology for the six major candidate gene loci in two Brazilian families with AI. Genomic DNA was obtained from family members and all exons and exon-intron boundaries of the ENAM, AMBN, AMELX, MMP20, KLK4 and Amelotin gene were amplified and sequenced. Each family was also evaluated for linkage to chromosome regions known to contain genes important in enamel development. The present study indicates that the AI in these two families is not caused by any of the known loci for AI or any of the major candidate genes proposed in the literature. These findings indicate extensive genetic heterogeneity for non-syndromic AI.
Bogdanova, Vera S.; Zaytseva, Olga O.; Mglinets, Anatoliy V.; Shatskaya, Natalia V.; Kosterin, Oleg E.; Vasiliev, Gennadiy V.
2015-01-01
In crosses of wild and cultivated peas (Pisum sativum L.), nuclear-cytoplasmic incompatibility frequently occurs manifested as decreased pollen fertility, male gametophyte lethality, sporophyte lethality. High-throughput sequencing of plastid genomes of one cultivated and four wild pea accessions differing in cross-compatibility was performed. Candidate genes for involvement in the nuclear-plastid conflict were searched in the reconstructed plastid genomes. In the annotated Medicago truncatula genome, nuclear candidate genes were searched in the portion syntenic to the pea chromosome region known to harbor a locus involved in the conflict. In the plastid genomes, a substantial variability of the accD locus represented by nucleotide substitutions and indels was found to correspond to the pattern of cross-compatibility among the accessions analyzed. Amino acid substitutions in the polypeptides encoded by the alleles of a nuclear locus, designated as Bccp3, with a complementary function to accD, fitted the compatibility pattern. The accD locus in the plastid genome encoding beta subunit of the carboxyltransferase of acetyl-coA carboxylase and the nuclear locus Bccp3 encoding biotin carboxyl carrier protein of the same multi-subunit enzyme were nominated as candidate genes for main contribution to nuclear-cytoplasmic incompatibility in peas. Existence of another nuclear locus involved in the accD-mediated conflict is hypothesized. PMID:25789472
Rai, Amit; Nakaya, Taiki; Shimizu, Yohei; Rai, Megha; Nakamura, Michimi; Suzuki, Hideyuki; Saito, Kazuki; Yamazaki, Mami
2018-05-29
Lithospermum officinale is a valuable source of bioactive metabolites with medicinal and industrial values. However, little is known about genes involved in the biosynthesis of these metabolites, primarily due to the lack of genome or transcriptome resources. This study presents the first effort to establish and characterize de novo transcriptome assembly resource for L. officinale and expression analysis for three of its tissues, namely leaf, stem, and root. Using over 4Gbps of RNA-sequencing datasets, we obtained de novo transcriptome assembly of L. officinale , consisting of 77,047 unigenes with assembly N50 value as 1524 bps. Based on transcriptome annotation and functional classification, 52,766 unigenes were assigned with putative genes functions, gene ontology terms, and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. KEGG pathway and gene ontology enrichment analysis using highly expressed unigenes across three tissues and targeted metabolome analysis showed active secondary metabolic processes enriched specifically in the root of L. officinale . Using co-expression analysis, we also identified 20 and 48 unigenes representing different enzymes of lithospermic/chlorogenic acid and shikonin biosynthesis pathways, respectively. We further identified 15 candidate unigenes annotated as cytochrome P450 with the highest expression in the root of L. officinale as novel genes with a role in key biochemical reactions toward shikonin biosynthesis. Thus, through this study, we not only generated a high-quality genomic resource for L. officinale but also propose candidate genes to be involved in shikonin biosynthesis pathways for further functional characterization. Georg Thieme Verlag KG Stuttgart · New York.
Jąkalski, Marcin; Takeshita, Kazutaka; Deblieck, Mathieu; Koyanagi, Kanako O; Makałowska, Izabela; Watanabe, Hidemi; Makałowski, Wojciech
2016-08-04
Retroposition, one of the processes of copying the genetic material, is an important RNA-mediated mechanism leading to the emergence of new genes. Because the transcription controlling segments are usually not copied to the new location in this mechanism, the duplicated gene copies (retrocopies) become pseudogenized. However, few can still survive, e.g. by recruiting novel regulatory elements from the region of insertion. Subsequently, these duplicated genes can contribute to the formation of lineage-specific traits and phenotypic diversity. Despite the numerous studies of the functional retrocopies (retrogenes) in animals and plants, very little is known about their presence in green algae, including morphologically diverse species. The current availability of the genomes of both uni- and multicellular algae provides a good opportunity to conduct a genome-wide investigation in order to fill the knowledge gap in retroposition phenomenon in this lineage. Here we present a comparative genomic analysis of uni- and multicellular algae, Chlamydomonas reinhardtii and Volvox carteri, respectively, to explore their retrogene complements. By adopting a computational approach, we identified 141 retrogene candidates in total in both genomes, with their fraction being significantly higher in the multicellular Volvox. Majority of the retrogene candidates showed signatures of functional constraints, thus indicating their functionality. Detailed analyses of the identified retrogene candidates, their parental genes, and homologs of both, revealed that most of the retrogene candidates were derived from ancient retroposition events in the common ancestor of the two algae and that the parental genes were subsequently lost from the respective lineages, making many retrogenes 'orphan'. We revealed that the genomes of the green algae have maintained many possibly functional retrogenes in spite of experiencing various molecular evolutionary events during a long evolutionary time after the retroposition events. Our first report about the retrogene set in the green algae provides a good foundation for any future investigation of the repertoire of retrogenes and facilitates the assessment of the evolutionary impact of retroposition on diverse morphological traits in this lineage. This article was reviewed by William Martin and Piotr Zielenkiewicz.
Alangari, Abdullah A; Alsultan, Abdulrahman; Osman, Mohamed Elfaki; Anazi, Shamsa; Alkuraya, Fowzan S
2013-11-01
Patients with autosomal recessive cyclic neutropenia have no known causative genetic defect yet. Autozygosity mapping on two branches of an extended multiplex consanguineous family presenting with cyclic neutropenia or severe congenital neutropenia to look for candidate gene, followed by candidate gene selection and sequencing. A single autozygous interval on Chr17:33,901,938-45,675,414 that is exclusively shared by the affected members was identified. This interval spans 11.8 Mb and contains 30 genes. Review of these genes highlighted G6PC3 as the most likely candidate given its known role in neutrophil biology. Direct sequencing revealed a novel homozygous mutation (NM_138387.3, c.974T > G, p.Leu325Arg). Two of our patients had associated congenital defects that are known to occur in patients with G6PC3 mutations, including congenital heart disease and intermittent thrombocytopenia. Biallelic G6PC3 defects should be considered in patients with autosomal recessive cyclic neutropenia, especially those with typical associated congenital defects.
Filling gaps in PPAR-alpha signaling through comparative nutrigenomics analysis
2009-01-01
Background The application of high-throughput genomic tools in nutrition research is a widespread practice. However, it is becoming increasingly clear that the outcome of individual expression studies is insufficient for the comprehensive understanding of such a complex field. Currently, the availability of the large amounts of expression data in public repositories has opened up new challenges on microarray data analyses. We have focused on PPARα, a ligand-activated transcription factor functioning as fatty acid sensor controlling the gene expression regulation of a large set of genes in various metabolic organs such as liver, small intestine or heart. The function of PPARα is strictly connected to the function of its target genes and, although many of these have already been identified, major elements of its physiological function remain to be uncovered. To further investigate the function of PPARα, we have applied a cross-species meta-analysis approach to integrate sixteen microarray datasets studying high fat diet and PPARα signal perturbations in different organisms. Results We identified 164 genes (MDEGs) that were differentially expressed in a constant way in response to a high fat diet or to perturbations in PPARs signalling. In particular, we found five genes in yeast which were highly conserved and homologous of PPARα targets in mammals, potential candidates to be used as models for the equivalent mammalian genes. Moreover, a screening of the MDEGs for all known transcription factor binding sites and the comparison with a human genome-wide screening of Peroxisome Proliferating Response Elements (PPRE), enabled us to identify, 20 new potential candidate genes that show, both binding site, both change in expression in the condition studied. Lastly, we found a non random localization of the differentially expressed genes in the genome. Conclusion The results presented are potentially of great interest to resume the currently available expression data, exploiting the power of in silico analysis filtered by evolutionary conservation. The analysis enabled us to indicate potential gene candidates that could fill in the gaps with regards to the signalling of PPARα and, moreover, the non-random localization of the differentially expressed genes in the genome, suggest that epigenetic mechanisms are of importance in the regulation of the transcription operated by PPARα. PMID:20003344
Utility and Limitations of Using Gene Expression Data to Identify Functional Associations
Peng, Cheng; Shiu, Shin-Han
2016-01-01
Gene co-expression has been widely used to hypothesize gene function through guilt-by association. However, it is not clear to what degree co-expression is informative, whether it can be applied to genes involved in different biological processes, and how the type of dataset impacts inferences about gene functions. Here our goal is to assess the utility and limitations of using co-expression as a criterion to recover functional associations between genes. By determining the percentage of gene pairs in a metabolic pathway with significant expression correlation, we found that many genes in the same pathway do not have similar transcript profiles and the choice of dataset, annotation quality, gene function, expression similarity measure, and clustering approach significantly impacts the ability to recover functional associations between genes using Arabidopsis thaliana as an example. Some datasets are more informative in capturing coordinated expression profiles and larger data sets are not always better. In addition, to recover the maximum number of known pathways and identify candidate genes with similar functions, it is important to explore rather exhaustively multiple dataset combinations, similarity measures, clustering algorithms and parameters. Finally, we validated the biological relevance of co-expression cluster memberships with an independent phenomics dataset and found that genes that consistently cluster with leucine degradation genes tend to have similar leucine levels in mutants. This study provides a framework for obtaining gene functional associations by maximizing the information that can be obtained from gene expression datasets. PMID:27935950
Torrezan, Giovana T; de Almeida, Fernanda G Dos Santos R; Figueiredo, Márcia C P; Barros, Bruna D de Figueiredo; de Paula, Cláudia A A; Valieris, Renan; de Souza, Jorge E S; Ramalho, Rodrigo F; da Silva, Felipe C C; Ferreira, Elisa N; de Nóbrega, Amanda F; Felicio, Paula S; Achatz, Maria I; de Souza, Sandro J; Palmero, Edenir I; Carraro, Dirce M
2018-01-01
Pathogenic variants in known breast cancer (BC) predisposing genes explain only about 30% of Hereditary Breast Cancer (HBC) cases, whereas the underlying genetic factors for most families remain unknown. Here, we used whole-exome sequencing (WES) to identify genetic variants associated to HBC in 17 patients of Brazil with familial BC and negative for causal variants in major BC risk genes ( BRCA1/2, TP53 , and CHEK2 c.1100delC). First, we searched for rare variants in 27 known HBC genes and identified two patients harboring truncating pathogenic variants in ATM and BARD1 . For the remaining 15 negative patients, we found a substantial vast number of rare genetic variants. Thus, for selecting the most promising variants we used functional-based variant prioritization, followed by NGS validation, analysis in a control group, cosegregation analysis in one family and comparison with previous WES studies, shrinking our list to 23 novel BC candidate genes, which were evaluated in an independent cohort of 42 high-risk BC patients. Rare and possibly damaging variants were identified in 12 candidate genes in this cohort, including variants in DNA repair genes ( ERCC1 and SXL4 ) and other cancer-related genes ( NOTCH2, ERBB2, MST1R , and RAF1 ). Overall, this is the first WES study applied for identifying novel genes associated to HBC in Brazilian patients, in which we provide a set of putative BC predisposing genes. We also underpin the value of using WES for assessing the complex landscape of HBC susceptibility, especially in less characterized populations.
Zhang, Nan; Han, Zhentai; Sun, Guiling; Hoffman, Angela; Wilson, Iain W; Yang, Yanfang; Gao, Qian; Wu, Jianqiang; Xie, Dan; Dai, Jungui; Qiu, Deyou
2014-01-17
Taxol is a well-known effective anticancer compound. Due to the inability to synthesize sufficient quantities of taxol to satisfy commercial demand, a biotechnological approach for a large-scale cell or cell-free system for its production is highly desirable. Several important genes in taxol biosynthesis are currently still unknown and have been shown to be difficult to isolate directly from Taxus, including the gene encoding taxoid 9α-hydroxylase. Ginkgo biloba suspension cells exhibit taxoid hydroxylation activity and provides an alternate means of identifying genes encoding enzymes with taxoid 9α-hydroxylation activity. Through analysis of high throughput RNA sequencing data from G. biloba, we identified two candidate genes with high similarity to Taxus CYP450s. Using in vitro cell-free protein synthesis assays and LC-MS analysis, we show that one candidate that belongs to the CYP716B, a subfamily whose biochemical functions have not been previously studied, possessed 9α-hydroxylation activity. This work will aid future identification of the taxoid 9α-hydroxylase gene from Taxus sp. Copyright © 2013 Elsevier Inc. All rights reserved.
A Drosophila model for toxicogenomics: Genetic variation in susceptibility to heavy metal exposure
Luoma, Sarah E.; St. Armour, Genevieve E.; Thakkar, Esha
2017-01-01
The genetic factors that give rise to variation in susceptibility to environmental toxins remain largely unexplored. Studies on genetic variation in susceptibility to environmental toxins are challenging in human populations, due to the variety of clinical symptoms and difficulty in determining which symptoms causally result from toxic exposure; uncontrolled environments, often with exposure to multiple toxicants; and difficulty in relating phenotypic effect size to toxic dose, especially when symptoms become manifest with a substantial time lag. Drosophila melanogaster is a powerful model that enables genome-wide studies for the identification of allelic variants that contribute to variation in susceptibility to environmental toxins, since the genetic background, environmental rearing conditions and toxic exposure can be precisely controlled. Here, we used extreme QTL mapping in an outbred population derived from the D. melanogaster Genetic Reference Panel to identify alleles associated with resistance to lead and/or cadmium, two ubiquitous environmental toxins that present serious health risks. We identified single nucleotide polymorphisms (SNPs) associated with variation in resistance to both heavy metals as well as SNPs associated with resistance specific to each of them. The effects of these SNPs were largely sex-specific. We applied mutational and RNAi analyses to 33 candidate genes and functionally validated 28 of them. We constructed networks of candidate genes as blueprints for orthologous networks of human genes. The latter not only provided functional contexts for known human targets of heavy metal toxicity, but also implicated novel candidate susceptibility genes. These studies validate Drosophila as a translational toxicogenomics gene discovery system. PMID:28732062
Genetic study of intracranial aneurysms.
Yan, Junxia; Hitomi, Toshiaki; Takenaka, Katsunobu; Kato, Masayasu; Kobayashi, Hatasu; Okuda, Hiroko; Harada, Kouji H; Koizumi, Akio
2015-03-01
Rupture of intracranial aneurysms (IAs) causes subarachnoid hemorrhage, leading to immediate death or severe disability. Identification of the genetic factors involved is critical for disease prevention and treatment. We aimed to identify the susceptibility genes for IAs. Exome sequencing was performed in 12 families with histories of multiple cases of IA (number of cases per family ≥3), with a total of 42 cases. Various filtering strategies were used to select the candidate variants. Replicate association studies of several candidate variants were performed in probands of 24 additional IA families and 426 sporadic IA cases. Functional analysis for the mutations was conducted. After sequencing and filtering, 78 variants were selected for the following reasons: allele frequencies of variants in 42 patients was significantly (P<0.05) larger than expected; variants were completely shared by all patients with IA within ≥1 family; variants predicted damage to the structure or function of the protein by PolyPhen-2 (Polymorphism Phenotyping V2) and SIFT (Sorting Intolerance From Tolerant). We selected 10 variants from 9 genes (GPR63, ADAMST15, MLL2, IL10RA, PAFAH2, THBD, IL11RA, FILIP1L, and ZNF222) to form 78 candidate variants by considering commonness in families, known disease genes, or ontology association with angiogenesis. Replicate association studies revealed that only p.E133Q in ADAMTS15 was aggregated in the familial IA cases (odds ratio, 5.96; 95% confidence interval, 2.40-14.82; P=0.0001; significant after the Bonferroni correction [P=0.05/78=0.0006]). Silencing ADAMTS15 and overexpression of ADAMTS15 p.E133Q accelerated endothelial cell migration, suggesting that ADAMTS15 may have antiangiogenic activity. ADAMTS15 is a candidate gene for IAs. © 2015 American Heart Association, Inc.
USDA-ARS?s Scientific Manuscript database
Development rate has important implications for many aspects of an individual's biology. In rainbow trout (Oncorhynchus mykiss), a major QTL for embryonic development rate has been detected on chromosome 5, but at present, few candidate genes have been mapped to this region. This paucity of known ge...
Quaggiotti, Silvia; Barcaccia, Gianni; Schiavon, Michela; Nicolé, Silvia; Galla, Giulio; Rossignolo, Virginia; Soattin, Marica; Malagoli, Mario
2007-11-01
In this research a differential display based on the detection of cDNA-AFLP markers was used to identify candidate genes potentially involved in the regulation of the response to chromium in four different willow species (Salix alba, Salix eleagnos, Salix fragilis and Salix matsudana) chosen on the basis of their suitability in phytoremediation techniques. Our approach enabled the assay of a large set of mRNA-related fragments and increased the reliability of amplification-based transcriptome analysis. The vast majority of transcript-derived fragments were shared among samples within species and thus attributable to constitutively expressed genes. However, a number of differentially expressed mRNAs were scored in each species and a total of 68 transcripts displaying an altered expression in response to Cr were isolated and sequenced. Public database querying revealed that 44.1% and 4.4% of the cloned ESTs score significant similarity with genes encoding proteins having known or putative function, or with genes coding for unknown proteins, respectively, whereas the remaining 51.5% did not retrieve any homology. Semi-quantitative RT-PCR analysis of seven candidate genes fully confirmed the expression patterns obtained by cDNA-AFLP. Our results indicate the existence of common mechanisms of gene regulation in response to Cr, pathogen attack and senescence-mediated programmed cell death, and suggest a role for the genes isolated in the cross-talk of the signaling pathways governing the adaptation to biotic and abiotic stresses.
Exploring digenic inheritance in arrhythmogenic cardiomyopathy.
König, Eva; Volpato, Claudia Béu; Motta, Benedetta Maria; Blankenburg, Hagen; Picard, Anne; Pramstaller, Peter; Casella, Michela; Rauhe, Werner; Pompilio, Giulio; Meraviglia, Viviana; Domingues, Francisco S; Sommariva, Elena; Rossini, Alessandra
2017-12-08
Arrhythmogenic cardiomyopathy (ACM) is an inherited genetic disorder, characterized by the substitution of heart muscle with fibro-fatty tissue and severe ventricular arrhythmias, often leading to heart failure and sudden cardiac death. ACM is considered a monogenic disorder, but the low penetrance of mutations identified in patients suggests the involvement of additional genetic or environmental factors. We used whole exome sequencing to investigate digenic inheritance in two ACM families where previous diagnostic tests have revealed a PKP2 mutation in all affected and some healthy individuals. In family members with PKP2 mutations we determined all genes that harbor variants in affected but not in healthy carriers or vice versa. We computationally prioritized the most likely candidates, focusing on known ACM genes and genes related to PKP2 through protein interactions, functional relationships, or shared biological processes. We identified four candidate genes in family 1, namely DAG1, DAB2IP, CTBP2 and TCF25, and eleven candidate genes in family 2. The most promising gene in the second family is TTN, a gene previously associated with ACM, in which the affected individual harbors two rare deleterious-predicted missense variants, one of which is located in the protein's only serine kinase domain. In this study we report genes that might act as digenic players in ACM pathogenesis, on the basis of co-segregation with PKP2 mutations. Validation in larger cohorts is still required to prove the utility of this model.
Zhou, Bin; Irwanto, Astrid; Guo, Yun-Miao; Bei, Jin-Xin; Wu, Qiao; Chen, Ge; Zhang, Tai-Ping; Lei, Jin-Jv; Feng, Qi-Sheng; Chen, Li-Zhen; Liu, Jianjun; Zhao, Yu-Pei
2012-08-01
Pancreatic ductal adenocarcinoma (PDAC) is one of the most malignant cancers with more than 94% mortality rate mainly due to the widespread metastases. To find out the somatically mutated genes related to the metastasis of PDAC, we analyzed the matched tumor and normal tissue samples from a patient diagnosed with liver metastatic PDAC using intensive exome capture-sequencing analysis (> 170× coverage). Searching for the somatic mutations that drive the clonal expansion of metastasis, we identified 12 genes with higher allele frequencies (AFs) of functional mutations in the metastatic tumor, including known genes KRAS and TP53 for metastasis. Of the 10 candidate genes, 6 (ADRB1, DCLK1, KCNH2, NOP14, SIGLEC1, and ZC3H7A), together with KRAS and TP53, were clustered into a single network (p value = 1 × 10(-22)) that is related to cancer development. Moreover, these candidate genes showed abnormal expression in PDAC tissues and functional impacts on the migration, proliferation, and colony formation abilities of pancreatic cancer cell lines. Furthermore, through digital PCR analysis, we revealed potential genomic mechanisms for the KRAS and TP53 mutations in the metastatic tumor. Taken together, our study shows the possibility for such personalized genomic profiling to provide new biological insight into the metastasis of PDAC.
Johns, N; Tan, B H; MacMillan, M; Solheim, T S; Ross, J A; Baracos, V E; Damaraju, S; Fearon, K C H
2014-12-01
Cancer cachexia is a complex and multifactorial disease. Evolving definitions highlight the fact that a diverse range of biological processes contribute to cancer cachexia. Part of the variation in who will and who will not develop cancer cachexia may be genetically determined. As new definitions, classifications and biological targets continue to evolve, there is a need for reappraisal of the literature for future candidate association studies. This review summarizes genes identified or implicated as well as putative candidate genes contributing to cachexia, identified through diverse technology platforms and model systems to further guide association studies. A systematic search covering 1986-2012 was performed for potential candidate genes / genetic polymorphisms relating to cancer cachexia. All candidate genes were reviewed for functional polymorphisms or clinically significant polymorphisms associated with cachexia using the OMIM and GeneRIF databases. Pathway analysis software was used to reveal possible network associations between genes. Functionality of SNPs/genes was explored based on published literature, algorithms for detecting putative deleterious SNPs and interrogating the database for expression of quantitative trait loci (eQTLs). A total of 154 genes associated with cancer cachexia were identified and explored for functional polymorphisms. Of these 154 genes, 119 had a combined total of 281 polymorphisms with functional and/or clinical significance in terms of cachexia associated with them. Of these, 80 polymorphisms (in 51 genes) were replicated in more than one study with 24 polymorphisms found to influence two or more hallmarks of cachexia (i.e., inflammation, loss of fat mass and/or lean mass and reduced survival). Selection of candidate genes and polymorphisms is a key element of multigene study design. The present study provides a contemporary basis to select genes and/or polymorphisms for further association studies in cancer cachexia, and to develop their potential as susceptibility biomarkers of cachexia.
Panigrahi, Priyabrata; Jere, Abhay; Anamika, Krishanpal
2018-01-01
Gene fusion is a chromosomal rearrangement event which plays a significant role in cancer due to the oncogenic potential of the chimeric protein generated through fusions. At present many databases are available in public domain which provides detailed information about known gene fusion events and their functional role. Existing gene fusion detection tools, based on analysis of transcriptomics data usually report a large number of fusion genes as potential candidates, which could be either known or novel or false positives. Manual annotation of these putative genes is indeed time-consuming. We have developed a web platform FusionHub, which acts as integrated search engine interfacing various fusion gene databases and simplifies large scale annotation of fusion genes in a seamless way. In addition, FusionHub provides three ways of visualizing fusion events: circular view, domain architecture view and network view. Design of potential siRNA molecules through ensemble method is another utility integrated in FusionHub that could aid in siRNA-based targeted therapy. FusionHub is freely available at https://fusionhub.persistent.co.in.
A Consensus Network of Gene Regulatory Factors in the Human Frontal Lobe
Berto, Stefano; Perdomo-Sabogal, Alvaro; Gerighausen, Daniel; Qin, Jing; Nowick, Katja
2016-01-01
Cognitive abilities, such as memory, learning, language, problem solving, and planning, involve the frontal lobe and other brain areas. Not much is known yet about the molecular basis of cognitive abilities, but it seems clear that cognitive abilities are determined by the interplay of many genes. One approach for analyzing the genetic networks involved in cognitive functions is to study the coexpression networks of genes with known importance for proper cognitive functions, such as genes that have been associated with cognitive disorders like intellectual disability (ID) or autism spectrum disorders (ASD). Because many of these genes are gene regulatory factors (GRFs) we aimed to provide insights into the gene regulatory networks active in the human frontal lobe. Using genome wide human frontal lobe expression data from 10 independent data sets, we first derived 10 individual coexpression networks for all GRFs including their potential target genes. We observed a high level of variability among these 10 independently derived networks, pointing out that relying on results from a single study can only provide limited biological insights. To instead focus on the most confident information from these 10 networks we developed a method for integrating such independently derived networks into a consensus network. This consensus network revealed robust GRF interactions that are conserved across the frontal lobes of different healthy human individuals. Within this network, we detected a strong central module that is enriched for 166 GRFs known to be involved in brain development and/or cognitive disorders. Interestingly, several hubs of the consensus network encode for GRFs that have not yet been associated with brain functions. Their central role in the network suggests them as excellent new candidates for playing an essential role in the regulatory network of the human frontal lobe, which should be investigated in future studies. PMID:27014338
Regulation of neural macroRNAs by the transcriptional repressor REST
Johnson, Rory; Teh, Christina Hui-Leng; Jia, Hui; Vanisri, Ravi Raj; Pandey, Tridansh; Lu, Zhong-Hao; Buckley, Noel J.; Stanton, Lawrence W.; Lipovich, Leonard
2009-01-01
The essential transcriptional repressor REST (repressor element 1-silencing transcription factor) plays central roles in development and human disease by regulating a large cohort of neural genes. These have conventionally fallen into the class of known, protein-coding genes; recently, however, several noncoding microRNA genes were identified as REST targets. Given the widespread transcription of messenger RNA-like, noncoding RNAs (“macroRNAs”), some of which are functional and implicated in disease in mammalian genomes, we sought to determine whether this class of noncoding RNAs can also be regulated by REST. By applying a new, unbiased target gene annotation pipeline to computationally discovered REST binding sites, we find that 23% of mammalian REST genomic binding sites are within 10 kb of a macroRNA gene. These putative target genes were overlooked by previous studies. Focusing on a set of 18 candidate macroRNA targets from mouse, we experimentally demonstrate that two are regulated by REST in neural stem cells. Flanking protein-coding genes are, at most, weakly repressed, suggesting specific targeting of the macroRNAs by REST. Similar to the majority of known REST target genes, both of these macroRNAs are induced during nervous system development and have neurally restricted expression profiles in adult mouse. We observe a similar phenomenon in human: the DiGeorge syndrome-associated noncoding RNA, DGCR5, is repressed by REST through a proximal upstream binding site. Therefore neural macroRNAs represent an additional component of the REST regulatory network. These macroRNAs are new candidates for understanding the role of REST in neuronal development, neurodegeneration, and cancer. PMID:19050060
Regulation of neural macroRNAs by the transcriptional repressor REST.
Johnson, Rory; Teh, Christina Hui-Leng; Jia, Hui; Vanisri, Ravi Raj; Pandey, Tridansh; Lu, Zhong-Hao; Buckley, Noel J; Stanton, Lawrence W; Lipovich, Leonard
2009-01-01
The essential transcriptional repressor REST (repressor element 1-silencing transcription factor) plays central roles in development and human disease by regulating a large cohort of neural genes. These have conventionally fallen into the class of known, protein-coding genes; recently, however, several noncoding microRNA genes were identified as REST targets. Given the widespread transcription of messenger RNA-like, noncoding RNAs ("macroRNAs"), some of which are functional and implicated in disease in mammalian genomes, we sought to determine whether this class of noncoding RNAs can also be regulated by REST. By applying a new, unbiased target gene annotation pipeline to computationally discovered REST binding sites, we find that 23% of mammalian REST genomic binding sites are within 10 kb of a macroRNA gene. These putative target genes were overlooked by previous studies. Focusing on a set of 18 candidate macroRNA targets from mouse, we experimentally demonstrate that two are regulated by REST in neural stem cells. Flanking protein-coding genes are, at most, weakly repressed, suggesting specific targeting of the macroRNAs by REST. Similar to the majority of known REST target genes, both of these macroRNAs are induced during nervous system development and have neurally restricted expression profiles in adult mouse. We observe a similar phenomenon in human: the DiGeorge syndrome-associated noncoding RNA, DGCR5, is repressed by REST through a proximal upstream binding site. Therefore neural macroRNAs represent an additional component of the REST regulatory network. These macroRNAs are new candidates for understanding the role of REST in neuronal development, neurodegeneration, and cancer.
Yang, Mei; Zhu, Lingping; Pan, Cheng; Xu, Liming; Liu, Yanling; Ke, Weidong; Yang, Pingfang
2015-08-17
Rhizome is the storage organ of lotus derived from modified stems. The development of rhizome is a complex process and depends on the balanced expression of the genes that is controlled by environmental and endogenous factors. However, little is known about the mechanism that regulates rhizome girth enlargement. In this study, using RNA-seq, transcriptomic analyses were performed at three rhizome developmental stages-the stolon, middle swelling and later swelling stage -in the cultivars 'ZO' (temperate lotus with enlarged rhizome) and 'RL' (tropical lotus with stolon). About 348 million high-quality reads were generated, and 88.5% of the data were mapped to the reference genome. Of 26783 genes identified, 24069 genes were previously predicted in the reference, and 2714 genes were novel transcripts. Moreover, 8821 genes were differentially expressed between the cultivars at the three stages. Functional analysis identified that these genes were significantly enriched in pathways carbohydrate metabolism and plant hormone signal transduction. Twenty-two genes involved in photoperiod pathway, starch metabolism and hormone signal transduction were candidate genes inducing rhizome girth enlargement. Comparative transcriptomic analysis detected several differentially expressed genes and potential candidate genes required for rhizome girth enlargement, which lay a foundation for future studies on molecular mechanisms underlying rhizome formation.
Yang, Mei; Zhu, Lingping; Pan, Cheng; Xu, Liming; Liu, Yanling; Ke, Weidong; Yang, Pingfang
2015-01-01
Rhizome is the storage organ of lotus derived from modified stems. The development of rhizome is a complex process and depends on the balanced expression of the genes that is controlled by environmental and endogenous factors. However, little is known about the mechanism that regulates rhizome girth enlargement. In this study, using RNA-seq, transcriptomic analyses were performed at three rhizome developmental stages—the stolon, middle swelling and later swelling stage —in the cultivars ‘ZO’ (temperate lotus with enlarged rhizome) and ‘RL’ (tropical lotus with stolon). About 348 million high-quality reads were generated, and 88.5% of the data were mapped to the reference genome. Of 26783 genes identified, 24069 genes were previously predicted in the reference, and 2714 genes were novel transcripts. Moreover, 8821 genes were differentially expressed between the cultivars at the three stages. Functional analysis identified that these genes were significantly enriched in pathways carbohydrate metabolism and plant hormone signal transduction. Twenty-two genes involved in photoperiod pathway, starch metabolism and hormone signal transduction were candidate genes inducing rhizome girth enlargement. Comparative transcriptomic analysis detected several differentially expressed genes and potential candidate genes required for rhizome girth enlargement, which lay a foundation for future studies on molecular mechanisms underlying rhizome formation. PMID:26279185
Genomic Signatures Reveal New Evidences for Selection of Important Traits in Domestic Cattle
Xu, Lingyang; Bickhart, Derek M.; Cole, John B.; Schroeder, Steven G.; Song, Jiuzhou; Tassell, Curtis P. Van; Sonstegard, Tad S.; Liu, George E.
2015-01-01
We investigated diverse genomic selections using high-density single nucleotide polymorphism data of five distinct cattle breeds. Based on allele frequency differences, we detected hundreds of candidate regions under positive selection across Holstein, Angus, Charolais, Brahman, and N'Dama. In addition to well-known genes such as KIT, MC1R, ASIP, GHR, LCORL, NCAPG, WIF1, and ABCA12, we found evidence for a variety of novel and less-known genes under selection in cattle, such as LAP3, SAR1B, LRIG3, FGF5, and NUDCD3. Selective sweeps near LAP3 were then validated by next-generation sequencing. Genome-wide association analysis involving 26,362 Holsteins confirmed that LAP3 and SAR1B were related to milk production traits, suggesting that our candidate regions were likely functional. In addition, haplotype network analyses further revealed distinct selective pressures and evolution patterns across these five cattle breeds. Our results provided a glimpse into diverse genomic selection during cattle domestication, breed formation, and recent genetic improvement. These findings will facilitate genome-assisted breeding to improve animal production and health. PMID:25431480
Towards an informative mutant phenotype for every bacterial gene
Deutschbauer, Adam; Price, Morgan N.; Wetmore, Kelly M.; ...
2014-08-11
Mutant phenotypes provide strong clues to the functions of the underlying genes and could allow annotation of the millions of sequenced yet uncharacterized bacterial genes. However, it is not known how many genes have a phenotype under laboratory conditions, how many phenotypes are biologically interpretable for predicting gene function, and what experimental conditions are optimal to maximize the number of genes with a phenotype. To address these issues, we measured the mutant fitness of 1,586 genes of the ethanol-producing bacterium Zymomonas mobilis ZM4 across 492 diverse experiments and found statistically significant phenotypes for 89% of all assayed genes. Thus, inmore » Z. mobilis, most genes have a functional consequence under laboratory conditions. We demonstrate that 41% of Z. mobilis genes have both a strong phenotype and a similar fitness pattern (cofitness) to another gene, and are therefore good candidates for functional annotation using mutant fitness. Among 502 poorly characterized Z. mobilis genes, we identified a significant cofitness relationship for 174. For 57 of these genes without a specific functional annotation, we found additional evidence to support the biological significance of these gene-gene associations, and in 33 instances, we were able to predict specific physiological or biochemical roles for the poorly characterized genes. Last, we identified a set of 79 diverse mutant fitness experiments in Z. mobilis that are nearly as biologically informative as the entire set of 492 experiments. Therefore, our work provides a blueprint for the functional annotation of diverse bacteria using mutant fitness.« less
ERIC Educational Resources Information Center
Hessl, David; Tassone, Flora; Cordeiro, Lisa; Koldewyn, Kami; McCormick, Carolyn; Green, Cherie; Wegelin, Jacob; Yuhas, Jennifer; Hagerman, Randi J.
2008-01-01
Although fragile X syndrome (FXS) is a single gene disorder with a well-described phenotype, it is not known why some individuals develop more significant maladaptive behaviors such as aggression or autistic symptoms. Here, we studied two candidate genes known to affect mood and aggression, the serotonin transporter (5-HTTLPR) and monoamine…
Quillé, Marie-Lise; Hirchaud, Edouard; Baron, Daniel; Benech, Caroline; Guihot, Jeanne; Placet, Morgane; Mignen, Olivier; Férec, Claude; Houlgatte, Rémi; Friocourt, Gaëlle
2011-01-01
Genetic investigations of X-linked intellectual disabilities have implicated the ARX (Aristaless-related homeobox) gene in a wide spectrum of disorders extending from phenotypes characterised by severe neuronal migration defects such as lissencephaly, to mild or moderate forms of mental retardation without apparent brain abnormalities but with associated features of dystonia and epilepsy. Analysis of Arx spatio-temporal localisation profile in mouse revealed expression in telencephalic structures, mainly restricted to populations of GABAergic neurons at all stages of development. Furthermore, studies of the effects of ARX loss of function in humans and animal models revealed varying defects, suggesting multiple roles of this gene during brain development. However, to date, little is known about how ARX functions as a transcription factor and the nature of its targets. To better understand its role, we combined chromatin immunoprecipitation and mRNA expression with microarray analysis and identified a total of 1006 gene promoters bound by Arx in transfected neuroblastoma (N2a) cells and in mouse embryonic brain. Approximately 24% of Arx-bound genes were found to show expression changes following Arx overexpression or knock-down. Several of the Arx target genes we identified are known to be important for a variety of functions in brain development and some of them suggest new functions for Arx. Overall, these results identified multiple new candidate targets for Arx and should help to better understand the pathophysiological mechanisms of intellectual disability and epilepsy associated with ARX mutations. PMID:21966449
DOE Office of Scientific and Technical Information (OSTI.GOV)
Deutschbauer, Adam; Price, Morgan N.; Wetmore, Kelly M.
Mutant phenotypes provide strong clues to the functions of the underlying genes and could allow annotation of the millions of sequenced yet uncharacterized bacterial genes. However, it is not known how many genes have a phenotype under laboratory conditions, how many phenotypes are biologically interpretable for predicting gene function, and what experimental conditions are optimal to maximize the number of genes with a phenotype. To address these issues, we measured the mutant fitness of 1,586 genes of the ethanol-producing bacterium Zymomonas mobilis ZM4 across 492 diverse experiments and found statistically significant phenotypes for 89% of all assayed genes. Thus, inmore » Z. mobilis, most genes have a functional consequence under laboratory conditions. We demonstrate that 41% of Z. mobilis genes have both a strong phenotype and a similar fitness pattern (cofitness) to another gene, and are therefore good candidates for functional annotation using mutant fitness. Among 502 poorly characterized Z. mobilis genes, we identified a significant cofitness relationship for 174. For 57 of these genes without a specific functional annotation, we found additional evidence to support the biological significance of these gene-gene associations, and in 33 instances, we were able to predict specific physiological or biochemical roles for the poorly characterized genes. Last, we identified a set of 79 diverse mutant fitness experiments in Z. mobilis that are nearly as biologically informative as the entire set of 492 experiments. Therefore, our work provides a blueprint for the functional annotation of diverse bacteria using mutant fitness.« less
A genome-wide scan for signatures of differential artificial selection in ten cattle breeds.
Rothammer, Sophie; Seichter, Doris; Förster, Martin; Medugorac, Ivica
2013-12-21
Since the times of domestication, cattle have been continually shaped by the influence of humans. Relatively recent history, including breed formation and the still enduring enormous improvement of economically important traits, is expected to have left distinctive footprints of selection within the genome. The purpose of this study was to map genome-wide selection signatures in ten cattle breeds and thus improve the understanding of the genome response to strong artificial selection and support the identification of the underlying genetic variants of favoured phenotypes. We analysed 47,651 single nucleotide polymorphisms (SNP) using Cross Population Extended Haplotype Homozygosity (XP-EHH). We set the significance thresholds using the maximum XP-EHH values of two essentially artificially unselected breeds and found up to 229 selection signatures per breed. Through a confirmation process we verified selection for three distinct phenotypes typical for one breed (polledness in Galloway, double muscling in Blanc-Bleu Belge and red coat colour in Red Holstein cattle). Moreover, we detected six genes strongly associated with known QTL for beef or dairy traits (TG, ABCG2, DGAT1, GH1, GHR and the Casein Cluster) within selection signatures of at least one breed. A literature search for genes lying in outstanding signatures revealed further promising candidate genes. However, in concordance with previous genome-wide studies, we also detected a substantial number of signatures without any yet known gene content. These results show the power of XP-EHH analyses in cattle to discover promising candidate genes and raise the hope of identifying phenotypically important variants in the near future. The finding of plausible functional candidates in some short signatures supports this hope. For instance, MAP2K6 is the only annotated gene of two signatures detected in Galloway and Gelbvieh cattle and is already known to be associated with carcass weight, back fat thickness and marbling score in Korean beef cattle. Based on the confirmation process and literature search we deduce that XP-EHH is able to uncover numerous artificial selection targets in subpopulations of domesticated animals.
Zagrobelny, Mika; Scheibye-Alsing, Karsten; Jensen, Niels Bjerg; Møller, Birger Lindberg; Gorodkin, Jan; Bak, Søren
2009-12-02
An essential driving component in the co-evolution of plants and insects is the ability to produce and handle bioactive compounds. Plants produce bioactive natural products for defense, but some insects detoxify and/or sequester the compounds, opening up for new niches with fewer competitors. To study the molecular mechanism behind the co-adaption in plant-insect interactions, we have investigated the interactions between Lotus corniculatus and Zygaena filipendulae. They both contain cyanogenic glucosides which liberate toxic hydrogen cyanide upon breakdown. Moths belonging to the Zygaena family are the only insects known, able to carry out both de novo biosynthesis and sequestration of the same cyanogenic glucosides as those from their feed plants. The biosynthetic pathway for cyanogenic glucoside biosynthesis in Z. filipendulae proceeds using the same intermediates as in the well known pathway from plants, but none of the enzymes responsible have been identified. A genomics strategy founded on 454 pyrosequencing of the Z. filipendulae transcriptome was undertaken to identify some of these enzymes in Z. filipendulae. Comparisons of the Z. filipendulae transcriptome with the sequenced genomes of Bombyx mori, Drosophila melanogaster, Tribolium castaneum, Apis mellifera and Anopheles gambiae indicate a high coverage of the Z. filipendulae transcriptome. 11% of the Z. filipendulae transcriptome sequences were assigned to Gene Ontology categories. Candidate genes for enzymes functioning in the biosynthesis of cyanogenic glucosides (cytochrome P450 and family 1 glycosyltransferases) were identified based on sequence length, number of copies and presence/absence of close homologs in D. melanogaster, B. mori and the cyanogenic butterfly Heliconius. Examination of biased codon usage, GC content and selection on gene candidates support the notion of cyanogenesis as an "old" trait within Ditrysia, as well as its origins being convergent between plants and insects. Pyrosequencing is an attractive approach to gain access to genes in the biosynthesis of bio-active natural products from insects and other organisms, for which the genome sequence is not known. Based on analysis of the Z. filipendulae transcriptome, promising gene candidates for biosynthesis of cyanogenic glucosides was identified, and the suitability of Z. filipendulae as a model system for cyanogenesis in insects is evident.
GraphTeams: a method for discovering spatial gene clusters in Hi-C sequencing data.
Schulz, Tizian; Stoye, Jens; Doerr, Daniel
2018-05-08
Hi-C sequencing offers novel, cost-effective means to study the spatial conformation of chromosomes. We use data obtained from Hi-C experiments to provide new evidence for the existence of spatial gene clusters. These are sets of genes with associated functionality that exhibit close proximity to each other in the spatial conformation of chromosomes across several related species. We present the first gene cluster model capable of handling spatial data. Our model generalizes a popular computational model for gene cluster prediction, called δ-teams, from sequences to graphs. Following previous lines of research, we subsequently extend our model to allow for several vertices being associated with the same label. The model, called δ-teams with families, is particular suitable for our application as it enables handling of gene duplicates. We develop algorithmic solutions for both models. We implemented the algorithm for discovering δ-teams with families and integrated it into a fully automated workflow for discovering gene clusters in Hi-C data, called GraphTeams. We applied it to human and mouse data to find intra- and interchromosomal gene cluster candidates. The results include intrachromosomal clusters that seem to exhibit a closer proximity in space than on their chromosomal DNA sequence. We further discovered interchromosomal gene clusters that contain genes from different chromosomes within the human genome, but are located on a single chromosome in mouse. By identifying δ-teams with families, we provide a flexible model to discover gene cluster candidates in Hi-C data. Our analysis of Hi-C data from human and mouse reveals several known gene clusters (thus validating our approach), but also few sparsely studied or possibly unknown gene cluster candidates that could be the source of further experimental investigations.
[Molecular genetics of functional articulation disorder in children].
Zhao, Yun-Jing; Ma, Hong-Wei
2012-04-01
Genetic factors are an important cause of functional articulation disorder in children. This article reviews some genes and chromosome regions associated with a genetic susceptibility to functional articulation disorders. The forkhead box P2 (FOXP2) gene on chromosome 7 is introduced in details including its structure, expression and function. The relationship between the FOXP2 gene and developmental apraxia of speech is discussed. As a transcription factor, FOXP2 gene regulates the expression of many genes. CNTNAP2 as an important target gene of FOXP2 is a key gene influencing language development. Functional articulation disorder may be developed to dyslexia, therefore some candidate regions and genes related to dyslexia, such as 3p12-13, 15q11-21, 6p22 and 1p34-36, are also introduced. ROBO1 gene in 3p12.3, ZNF280D gene, TCF12 gene, EKN1 gene in 15q21, and KIAA0319 gene in 6p22 have been candidate genes for the study of functional articulation disorder.
Ross, Cody T.; Roodgar, Morteza; Smith, David Glenn
2015-01-01
We use the Reciprocal Smallest Distance (RSD) algorithm to identify amino acid sequence orthologs in the Chinese and Indian rhesus macaque draft sequences and estimate the evolutionary distance between such orthologs. We then use GOanna to map gene function annotations and human gene identifiers to the rhesus macaque amino acid sequences. We conclude methodologically by cross-tabulating a list of amino acid orthologs with large divergence scores with a list of genes known to be involved in SIV or HIV pathogenesis. We find that many of the amino acid sequences with large evolutionary divergence scores, as calculated by the RSD algorithm, have been shown to be related to HIV pathogenesis in previous laboratory studies. Four of the strongest candidate genes for SIVmac resistance in Chinese rhesus macaques identified in this study are CDK9, CXCL12, TRIM21, and TRIM32. Additionally, ANKRD30A, CTSZ, GORASP2, GTF2H1, IL13RA1, MUC16, NMDAR1, Notch1, NT5M, PDCD5, RAD50, and TM9SF2 were identified as possible candidates, among others. We failed to find many laboratory experiments contrasting the effects of Indian and Chinese orthologs at these sites on SIVmac pathogenesis, but future comparative studies might hold fertile ground for research into the biological mechanisms underlying innate resistance to SIVmac in Chinese rhesus macaques. PMID:25884674
Wang, Zhepeng; Meng, Guohua; Bai, Yun; Liu, Ruifang; Du, Yu; Su, Lihong
2017-09-12
In birds, blue-green eggshell color (BGEC) is caused by biliverdin, a bile pigment derived from the degradation of heme and secreted in the eggshell by the shell gland. Functionally, BGEC might promote the paternal investment of males in the nest and eggs. However, little is known about its formation mechanisms. Jinding ducks (Anas platyrhynchos) are an ideal breed for research into the mechanisms, in which major birds lay BGEC eggs with minor individuals laying white eggs. Using this breed, this study aimed to provide insight into the mechanisms via comparative transcriptome analysis. Blue-shelled ducks (BSD) and white-shelled ducks (WSD) were selected from two populations, forming 4 groups (3 ducks/group): BSD1 and WSD1 from population 1 and BSD2 and WSD2 from population 2. Twelve libraries from shell glands were sequenced using the Illumina RNA-seq platform, generating an average of 41 million clean reads per library, of which 55.9% were mapped to the duck reference genome and assembled into 31,542 transcripts. Expression levels of 11,698 genes were successfully compared between all pairs of 4 groups. Of these, 464 candidate genes were differentially expressed between cross-phenotype groups, but not for between same-phenotype groups. Gene Ontology (GO) annotation showed that 390 candidate genes were annotated with 2234 GO terms. No candidate genes were directly involved in biosynthesis or transport of biliverdin. However, the integral components of membrane, metal ion transport, cholesterol biosynthesis, signal transduction, skeletal system development, and chemotaxis were significantly (P < 0.05) overrepresented by candidate genes. This study identified 464 candidate genes associated with duck BGEC, providing valuable information for a better understanding of the mechanisms underlying this trait. Given the involvement of membrane cholesterol contents, ions and ATP levels in modulating the transport activity of bile pigment transporters, the data suggest a potential association between duck BGEC and the transport activity of the related transporters.
Wu, Mengmeng; Zeng, Wanwen; Liu, Wenqiang; Lv, Hairong; Chen, Ting; Jiang, Rui
2018-06-03
Genome-wide association studies (GWAS) have successfully discovered a number of disease-associated genetic variants in the past decade, providing an unprecedented opportunity for deciphering genetic basis of human inherited diseases. However, it is still a challenging task to extract biological knowledge from the GWAS data, due to such issues as missing heritability and weak interpretability. Indeed, the fact that the majority of discovered loci fall into noncoding regions without clear links to genes has been preventing the characterization of their functions and appealing for a sophisticated approach to bridge genetic and genomic studies. Towards this problem, network-based prioritization of candidate genes, which performs integrated analysis of gene networks with GWAS data, has emerged as a promising direction and attracted much attention. However, most existing methods overlook the sparse and noisy properties of gene networks and thus may lead to suboptimal performance. Motivated by this understanding, we proposed a novel method called REGENT for integrating multiple gene networks with GWAS data to prioritize candidate genes for complex diseases. We leveraged a technique called the network representation learning to embed a gene network into a compact and robust feature space, and then designed a hierarchical statistical model to integrate features of multiple gene networks with GWAS data for the effective inference of genes associated with a disease of interest. We applied our method to six complex diseases and demonstrated the superior performance of REGENT over existing approaches in recovering known disease-associated genes. We further conducted a pathway analysis and showed that the ability of REGENT to discover disease-associated pathways. We expect to see applications of our method to a broad spectrum of diseases for post-GWAS analysis. REGENT is freely available at https://github.com/wmmthu/REGENT. Copyright © 2018 Elsevier Inc. All rights reserved.
1997-07-01
minimum region of allelic loss on chromosome 17p 13.3, between polymorphic markers D17S5 and D17S28, in genomic DNA from breast and ovarian tumors (Figure 1...encode proteins of 443 and 227 amino acids, with no known functional motifs. Comparison of genomic and cDNA sequences showed that the genes overlap...is tissue specific (Figure 4). When zoo blots comprised of EcoRI fragments of genomic DNA from various species were probed with the unique exon 1 of
Hériché, Jean-Karim; Lees, Jon G.; Morilla, Ian; Walter, Thomas; Petrova, Boryana; Roberti, M. Julia; Hossain, M. Julius; Adler, Priit; Fernández, José M.; Krallinger, Martin; Haering, Christian H.; Vilo, Jaak; Valencia, Alfonso; Ranea, Juan A.; Orengo, Christine; Ellenberg, Jan
2014-01-01
The advent of genome-wide RNA interference (RNAi)–based screens puts us in the position to identify genes for all functions human cells carry out. However, for many functions, assay complexity and cost make genome-scale knockdown experiments impossible. Methods to predict genes required for cell functions are therefore needed to focus RNAi screens from the whole genome on the most likely candidates. Although different bioinformatics tools for gene function prediction exist, they lack experimental validation and are therefore rarely used by experimentalists. To address this, we developed an effective computational gene selection strategy that represents public data about genes as graphs and then analyzes these graphs using kernels on graph nodes to predict functional relationships. To demonstrate its performance, we predicted human genes required for a poorly understood cellular function—mitotic chromosome condensation—and experimentally validated the top 100 candidates with a focused RNAi screen by automated microscopy. Quantitative analysis of the images demonstrated that the candidates were indeed strongly enriched in condensation genes, including the discovery of several new factors. By combining bioinformatics prediction with experimental validation, our study shows that kernels on graph nodes are powerful tools to integrate public biological data and predict genes involved in cellular functions of interest. PMID:24943848
Ramayo-Caldas, Yuliaxis; Renand, Gilles; Ballester, Maria; Saintilan, Romain; Rocha, Dominique
2016-04-23
Studies to identify markers associated with beef tenderness have focused on Warner-Bratzler shear force (WBSF) but the interplay between the genes associated with WBSF has not been explored. We used the association weight matrix (AWM), a systems biology approach, to identify a set of interacting genes that are co-associated with tenderness and other meat quality traits, and shared across the Charolaise, Limousine and Blonde d'Aquitaine beef cattle breeds. Genome-wide association studies were performed using ~500K single nucleotide polymorphisms (SNPs) and 17 phenotypes measured on more than 1000 animals for each breed. First, this multi-trait approach was applied separately for each breed across 17 phenotypes and second, between- and across-breed comparisons at the AWM and functional levels were performed. Genetic heterogeneity was observed, and most of the variants that were associated with WBSF segregated within rather than across breeds. We identified 206 common candidate genes associated with WBSF across the three breeds. SNPs in these common genes explained between 28 and 30 % of the phenotypic variance for WBSF. A reduced number of common SNPs mapping to the 206 common genes were identified, suggesting that different mutations may target the same genes in a breed-specific manner. Therefore, it is likely that, depending on allele frequencies and linkage disequilibrium patterns, a SNP that is identified for one breed may not be informative for another unrelated breed. Well-known candidate genes affecting beef tenderness were identified. In addition, some of the 206 common genes are located within previously reported quantitative trait loci for WBSF in several cattle breeds. Moreover, the multi-breed co-association analysis detected new candidate genes, regulators and metabolic pathways that are likely involved in the determination of meat tenderness and other meat quality traits in beef cattle. Our results suggest that systems biology approaches that explore associations of correlated traits increase statistical power to identify candidate genes beyond the one-dimensional approach. Further studies on the 206 common genes, their pathways, regulators and interactions will expand our knowledge on the molecular basis of meat tenderness and could lead to the discovery of functional mutations useful for genomic selection in a multi-breed beef cattle context.
Alu Elements as Novel Regulators of Gene Expression in Type 1 Diabetes Susceptibility Genes?
Kaur, Simranjeet; Pociot, Flemming
2015-07-13
Despite numerous studies implicating Alu repeat elements in various diseases, there is sparse information available with respect to the potential functional and biological roles of the repeat elements in Type 1 diabetes (T1D). Therefore, we performed a genome-wide sequence analysis of T1D candidate genes to identify embedded Alu elements within these genes. We observed significant enrichment of Alu elements within the T1D genes (p-value < 10e-16), which highlights their importance in T1D. Functional annotation of T1D genes harboring Alus revealed significant enrichment for immune-mediated processes (p-value < 10e-6). We also identified eight T1D genes harboring inverted Alus (IRAlus) within their 3' untranslated regions (UTRs) that are known to regulate the expression of host mRNAs by generating double stranded RNA duplexes. Our in silico analysis predicted the formation of duplex structures by IRAlus within the 3'UTRs of T1D genes. We propose that IRAlus might be involved in regulating the expression levels of the host T1D genes.
Gene Expression Changes in the Motor Cortex Mediating Motor Skill Learning
Cheung, Vincent C. K.; DeBoer, Caroline; Hanson, Elizabeth; Tunesi, Marta; D'Onofrio, Mara; Arisi, Ivan; Brandi, Rossella; Cattaneo, Antonino; Goosens, Ki A.
2013-01-01
The primary motor cortex (M1) supports motor skill learning, yet little is known about the genes that contribute to motor cortical plasticity. Such knowledge could identify candidate molecules whose targeting might enable a new understanding of motor cortical functions, and provide new drug targets for the treatment of diseases which impair motor function, such as ischemic stroke. Here, we assess changes in the motor-cortical transcriptome across different stages of motor skill acquisition. Adult rats were trained on a gradually acquired appetitive reach and grasp task that required different strategies for successful pellet retrieval, or a sham version of the task in which the rats received pellet reward without needing to develop the reach and grasp skill. Tissue was harvested from the forelimb motor-cortical area either before training commenced, prior to the initial rise in task performance, or at peak performance. Differential classes of gene expression were observed at the time point immediately preceding motor task improvement. Functional clustering revealed that gene expression changes were related to the synapse, development, intracellular signaling, and the fibroblast growth factor (FGF) family, with many modulated genes known to regulate synaptic plasticity, synaptogenesis, and cytoskeletal dynamics. The modulated expression of synaptic genes likely reflects ongoing network reorganization from commencement of training till the point of task improvement, suggesting that motor performance improves only after sufficient modifications in the cortical circuitry have accumulated. The regulated FGF-related genes may together contribute to M1 remodeling through their roles in synaptic growth and maturation. PMID:23637843
St-Amand, Jonny; Yoshioka, Mayumi; Tanaka, Keitaro; Nishida, Yuichiro
2012-01-01
To identify preferentially expressed genes in the central endocrine organs of the hypothalamus and pituitary gland, we generated transcriptome-wide mRNA profiles of the hypothalamus, pituitary gland, and parietal cortex in male mice (12–15 weeks old) using serial analysis of gene expression (SAGE). Total counts of SAGE tags for the hypothalamus, pituitary gland, and parietal cortex were 165824, 126688, and 161045 tags, respectively. This represented 59244, 45151, and 55131 distinct tags, respectively. Comparison of these mRNA profiles revealed that 22 mRNA species, including three potential novel transcripts, were preferentially expressed in the hypothalamus. In addition to well-known hypothalamic transcripts, such as hypocretin, several genes involved in hormone function, intracellular transduction, metabolism, protein transport, steroidogenesis, extracellular matrix, and brain disease were identified as preferentially expressed hypothalamic transcripts. In the pituitary gland, 106 mRNA species, including 60 potential novel transcripts, were preferentially expressed. In addition to well-known pituitary genes, such as growth hormone and thyroid stimulating hormone beta, a number of genes classified to function in transport, amino acid metabolism, intracellular transduction, cell adhesion, disulfide bond formation, stress response, transcription, protein synthesis, and turnover, cell differentiation, the cell cycle, and in the cytoskeleton and extracellular matrix were also preferentially expressed. In conclusion, the current study identified not only well-known hypothalamic and pituitary transcripts but also a number of new candidates likely to be involved in endocrine homeostatic systems regulated by the hypothalamus and pituitary gland. PMID:22649398
St-Amand, Jonny; Yoshioka, Mayumi; Tanaka, Keitaro; Nishida, Yuichiro
2011-01-01
To identify preferentially expressed genes in the central endocrine organs of the hypothalamus and pituitary gland, we generated transcriptome-wide mRNA profiles of the hypothalamus, pituitary gland, and parietal cortex in male mice (12-15 weeks old) using serial analysis of gene expression (SAGE). Total counts of SAGE tags for the hypothalamus, pituitary gland, and parietal cortex were 165824, 126688, and 161045 tags, respectively. This represented 59244, 45151, and 55131 distinct tags, respectively. Comparison of these mRNA profiles revealed that 22 mRNA species, including three potential novel transcripts, were preferentially expressed in the hypothalamus. In addition to well-known hypothalamic transcripts, such as hypocretin, several genes involved in hormone function, intracellular transduction, metabolism, protein transport, steroidogenesis, extracellular matrix, and brain disease were identified as preferentially expressed hypothalamic transcripts. In the pituitary gland, 106 mRNA species, including 60 potential novel transcripts, were preferentially expressed. In addition to well-known pituitary genes, such as growth hormone and thyroid stimulating hormone beta, a number of genes classified to function in transport, amino acid metabolism, intracellular transduction, cell adhesion, disulfide bond formation, stress response, transcription, protein synthesis, and turnover, cell differentiation, the cell cycle, and in the cytoskeleton and extracellular matrix were also preferentially expressed. In conclusion, the current study identified not only well-known hypothalamic and pituitary transcripts but also a number of new candidates likely to be involved in endocrine homeostatic systems regulated by the hypothalamus and pituitary gland.
Comprehensive genomic analysis of patients with disorders of cerebral cortical development.
Wiszniewski, Wojciech; Gawlinski, Pawel; Gambin, Tomasz; Bekiesinska-Figatowska, Monika; Obersztyn, Ewa; Antczak-Marach, Dorota; Akdemir, Zeynep Hande Coban; Harel, Tamar; Karaca, Ender; Jurek, Marta; Sobecka, Katarzyna; Nowakowska, Beata; Kruk, Malgorzata; Terczynska, Iwona; Goszczanska-Ciuchta, Alicja; Rudzka-Dybala, Mariola; Jamroz, Ewa; Pyrkosz, Antoni; Jakubiuk-Tomaszuk, Anna; Iwanowski, Piotr; Gieruszczak-Bialek, Dorota; Piotrowicz, Malgorzata; Sasiadek, Maria; Kochanowska, Iwona; Gurda, Barbara; Steinborn, Barbara; Dawidziuk, Mateusz; Castaneda, Jennifer; Wlasienko, Pawel; Bezniakow, Natalia; Jhangiani, Shalini N; Hoffman-Zacharska, Dorota; Bal, Jerzy; Szczepanik, Elzbieta; Boerwinkle, Eric; Gibbs, Richard A; Lupski, James R
2018-04-30
Malformations of cortical development (MCDs) manifest with structural brain anomalies that lead to neurologic sequelae, including epilepsy, cerebral palsy, developmental delay, and intellectual disability. To investigate the underlying genetic architecture of patients with disorders of cerebral cortical development, a cohort of 54 patients demonstrating neuroradiologic signs of MCDs was investigated. Individual genomes were interrogated for single-nucleotide variants (SNV) and copy number variants (CNV) with whole-exome sequencing and chromosomal microarray studies. Variation affecting known MCDs-associated genes was found in 16/54 cases, including 11 patients with SNV, 2 patients with CNV, and 3 patients with both CNV and SNV, at distinct loci. Diagnostic pathogenic SNV and potentially damaging variants of unknown significance (VUS) were identified in two groups of seven individuals each. We demonstrated that de novo variants are important among patients with MCDs as they were identified in 10/16 individuals with a molecular diagnosis. Three patients showed changes in known MCDs genes and a clinical phenotype beyond the usual characteristics observed, i.e., phenotypic expansion, for a particular known disease gene clinical entity. We also discovered 2 likely candidate genes, CDH4, and ASTN1, with human and animal studies supporting their roles in brain development, and 5 potential candidate genes. Our findings emphasize genetic heterogeneity of MCDs disorders and postulate potential novel candidate genes involved in cerebral cortical development.
Bull, James C.; Ryabov, Eugene V.; Prince, Gill; Mead, Andrew; Zhang, Cunjin; Baxter, Laura A.; Pell, Judith K.; Osborne, Juliet L.; Chandler, Dave
2012-01-01
Honeybees, Apis mellifera, show age-related division of labor in which young adults perform maintenance (“housekeeping”) tasks inside the colony before switching to outside foraging at approximately 23 days old. Disease resistance is an important feature of honeybee biology, but little is known about the interaction of pathogens and age-related division of labor. We tested a hypothesis that older forager bees and younger “house” bees differ in susceptibility to infection. We coupled an infection bioassay with a functional analysis of gene expression in individual bees using a whole genome microarray. Forager bees treated with the entomopathogenic fungus Metarhizium anisopliae s.l. survived for significantly longer than house bees. This was concomitant with substantial differences in gene expression including genes associated with immune function. In house bees, infection was associated with differential expression of 35 candidate immune genes contrasted with differential expression of only two candidate immune genes in forager bees. For control bees (i.e. not treated with M. anisopliae) the development from the house to the forager stage was associated with differential expression of 49 candidate immune genes, including up-regulation of the antimicrobial peptide gene abaecin, plus major components of the Toll pathway, serine proteases, and serpins. We infer that reduced pathogen susceptibility in forager bees was associated with age-related activation of specific immune system pathways. Our findings contrast with the view that the immunocompetence in social insects declines with the onset of foraging as a result of a trade-off in the allocation of resources for foraging. The up-regulation of immune-related genes in young adult bees in response to M. anisopliae infection was an indicator of disease susceptibility; this also challenges previous research in social insects, in which an elevated immune status has been used as a marker of increased disease resistance and fitness without considering the effects of age-related development. PMID:23300441
Wang, Yinliang; Chen, Qi; Zhao, Hanbo; Ren, Bingzhong
2016-01-01
The leaf beetle Ambrostoma quadriimpressum (Coleoptera: Chrysomelidae) is a predominant forest pest that causes substantial damage to the lumber industry and city management. However, no effective and environmentally friendly chemical method has been discovered to control this pest. Until recently, the molecular basis of the olfactory system in A. quadriimpressum was completely unknown. In this study, antennae and leg transcriptomes were analyzed and compared using deep sequencing data to identify the olfactory genes in A. quadriimpressum. Moreover, the expression profiles of both male and female candidate olfactory genes were analyzed and validated by bioinformatics, motif analysis, homology analysis, semi-quantitative RT-PCR and RT-qPCR experiments in antennal and non-olfactory organs to explore the candidate olfactory genes that might play key roles in the life cycle of A. quadriimpressum. As a result, approximately 102.9 million and 97.3 million clean reads were obtained from the libraries created from the antennas and legs, respectively. Annotation led to 34344 Unigenes, which were matched to known proteins. Annotation data revealed that the number of genes in antenna with binding functions and receptor activity was greater than that of legs. Furthermore, many pathway genes were differentially expressed in the two organs. Sixteen candidate odorant binding proteins (OBPs), 10 chemosensory proteins (CSPs), 34 odorant receptors (ORs), 20 inotropic receptors [1] and 2 sensory neuron membrane proteins (SNMPs) and their isoforms were identified. Additionally, 15 OBPs, 9 CSPs, 18 ORs, 6 IRs and 2 SNMPs were predicted to be complete ORFs. Using RT-PCR, RT-qPCR and homology analysis, AquaOBP1/2/4/7/C1/C6, AquaCSP3/9, AquaOR8/9/10/14/15/18/20/26/29/33, AquaIR8a/13/25a showed olfactory-specific expression, indicating that these genes might play a key role in olfaction-related behaviors in A. quadriimpressum such as foraging and seeking. AquaOBP4/C5, AquaOBP4/C5, AquaCSP7/9/10, AquaOR17/24/32 and AquaIR4 were highly expressed in the antenna of males, suggesting that these genes were related to sex-specific behaviors, and expression trends that were male specific were observed for most candidate olfactory genes, which supported the existence of a female-produced sex pheromone in A. quadriimpressum. All of these results could provide valuable information and guidance for future functional studies on these genes and provide better molecular knowledge regarding the olfactory system in A. quadriimpressum.
New York esophageal squamous cell carcinoma-1 and cancer immunotherapy.
Esfandiary, Ali; Ghafouri-Fard, Soudeh
2015-01-01
New York esophageal squamous cell carcinoma 1 (NY-ESO-1) is a known cancer testis gene with exceptional immunogenicity and prevalent expression in many cancer types. These characteristics have made it an appropriate vaccine candidate with the potential application against various malignancies. This article reviews recent knowledge about the NY-ESO-1 biology, function, immunogenicity and expression in cancers as well as and the results of clinical trials with this antigen.
Chambers, Alan H; Pillet, Jeremy; Plotto, Anne; Bai, Jinhe; Whitaker, Vance M; Folta, Kevin M
2014-04-17
There is interest in improving the flavor of commercial strawberry (Fragaria × ananassa) varieties. Fruit flavor is shaped by combinations of sugars, acids and volatile compounds. Many efforts seek to use genomics-based strategies to identify genes controlling flavor, and then designing durable molecular markers to follow these genes in breeding populations. In this report, fruit from two cultivars, varying for presence-absence of volatile compounds, along with segregating progeny, were analyzed using GC/MS and RNAseq. Expression data were bulked in silico according to presence/absence of a given volatile compound, in this case γ-decalactone, a compound conferring a peach flavor note to fruits. Computationally sorting reads in segregating progeny based on γ-decalactone presence eliminated transcripts not directly relevant to the volatile, revealing transcripts possibly imparting quantitative contributions. One candidate encodes an omega-6 fatty acid desaturase, an enzyme known to participate in lactone production in fungi, noted here as FaFAD1. This candidate was induced by ripening, was detected in certain harvests, and correlated with γ-decalactone presence. The FaFAD1 gene is present in every genotype where γ-decalactone has been detected, and it was invariably missing in non-producers. A functional, PCR-based molecular marker was developed that cosegregates with the phenotype in F1 and BC1 populations, as well as in many other cultivars and wild Fragaria accessions. Genetic, genomic and analytical chemistry techniques were combined to identify FaFAD1, a gene likely controlling a key flavor volatile in strawberry. The same data may now be re-sorted based on presence/absence of any other volatile to identify other flavor-affecting candidates, leading to rapid generation of gene-specific markers.
2014-01-01
Background There is interest in improving the flavor of commercial strawberry (Fragaria × ananassa) varieties. Fruit flavor is shaped by combinations of sugars, acids and volatile compounds. Many efforts seek to use genomics-based strategies to identify genes controlling flavor, and then designing durable molecular markers to follow these genes in breeding populations. In this report, fruit from two cultivars, varying for presence-absence of volatile compounds, along with segregating progeny, were analyzed using GC/MS and RNAseq. Expression data were bulked in silico according to presence/absence of a given volatile compound, in this case γ-decalactone, a compound conferring a peach flavor note to fruits. Results Computationally sorting reads in segregating progeny based on γ-decalactone presence eliminated transcripts not directly relevant to the volatile, revealing transcripts possibly imparting quantitative contributions. One candidate encodes an omega-6 fatty acid desaturase, an enzyme known to participate in lactone production in fungi, noted here as FaFAD1. This candidate was induced by ripening, was detected in certain harvests, and correlated with γ-decalactone presence. The FaFAD1 gene is present in every genotype where γ-decalactone has been detected, and it was invariably missing in non-producers. A functional, PCR-based molecular marker was developed that cosegregates with the phenotype in F1 and BC1 populations, as well as in many other cultivars and wild Fragaria accessions. Conclusions Genetic, genomic and analytical chemistry techniques were combined to identify FaFAD1, a gene likely controlling a key flavor volatile in strawberry. The same data may now be re-sorted based on presence/absence of any other volatile to identify other flavor-affecting candidates, leading to rapid generation of gene-specific markers. PMID:24742080
Identification of candidate genes affecting Δ9-tetrahydrocannabinol biosynthesis in Cannabis sativa
Marks, M. David; Tian, Li; Wenger, Jonathan P.; Omburo, Stephanie N.; Soto-Fuentes, Wilfredo; He, Ji; Gang, David R.; Weiblen, George D.; Dixon, Richard A.
2009-01-01
RNA isolated from the glands of a Δ9-tetrahydrocannabinolic acid (THCA)-producing strain of Cannabis sativa was used to generate a cDNA library containing over 100 000 expressed sequence tags (ESTs). Sequencing of over 2000 clones from the library resulted in the identification of over 1000 unigenes. Candidate genes for almost every step in the biochemical pathways leading from primary metabolites to THCA were identified. Quantitative PCR analysis suggested that many of the pathway genes are preferentially expressed in the glands. Hexanoyl-CoA, one of the metabolites required for THCA synthesis, could be made via either de novo fatty acids synthesis or via the breakdown of existing lipids. qPCR analysis supported the de novo pathway. Many of the ESTs encode transcription factors and two putative MYB genes were identified that were preferentially expressed in glands. Given the similarity of the Cannabis MYB genes to those in other species with known functions, these Cannabis MYBs may play roles in regulating gland development and THCA synthesis. Three candidates for the polyketide synthase (PKS) gene responsible for the first committed step in the pathway to THCA were characterized in more detail. One of these was identical to a previously reported chalcone synthase (CHS) and was found to have CHS activity. All three could use malonyl-CoA and hexanoyl-CoA as substrates, including the CHS, but reaction conditions were not identified that allowed for the production of olivetolic acid (the proposed product of the PKS activity needed for THCA synthesis). One of the PKS candidates was highly and specifically expressed in glands (relative to whole leaves) and, on the basis of these expression data, it is proposed to be the most likely PKS responsible for olivetolic acid synthesis in Cannabis glands. PMID:19581347
Nambeesan, Savithri U; Mandel, Jennifer R; Bowers, John E; Marek, Laura F; Ebert, Daniel; Corbi, Jonathan; Rieseberg, Loren H; Knapp, Steven J; Burke, John M
2015-03-11
Shoot branching is an important determinant of plant architecture and influences various aspects of growth and development. Selection on branching has also played an important role in the domestication of crop plants, including sunflower (Helianthus annuus L.). Here, we describe an investigation of the genetic basis of variation in branching in sunflower via association mapping in a diverse collection of cultivated sunflower lines. Detailed phenotypic analyses revealed extensive variation in the extent and type of branching within the focal population. After correcting for population structure and kinship, association analyses were performed using a genome-wide collection of SNPs to identify genomic regions that influence a variety of branching-related traits. This work resulted in the identification of multiple previously unidentified genomic regions that contribute to variation in branching. Genomic regions that were associated with apical and mid-apical branching were generally distinct from those associated with basal and mid-basal branching. Homologs of known branching genes from other study systems (i.e., Arabidopsis, rice, pea, and petunia) were also identified from the draft assembly of the sunflower genome and their map positions were compared to those of associations identified herein. Numerous candidate branching genes were found to map in close proximity to significant branching associations. In sunflower, variation in branching is genetically complex and overall branching patterns (i.e., apical vs. basal) were found to be influenced by distinct genomic regions. Moreover, numerous candidate branching genes mapped in close proximity to significant branching associations. Although the sunflower genome exhibits localized islands of elevated linkage disequilibrium (LD), these non-random associations are known to decay rapidly elsewhere. The subset of candidate genes that co-localized with significant associations in regions of low LD represents the most promising target for future functional analyses.
Schweizer, Rena M; Robinson, Jacqueline; Harrigan, Ryan; Silva, Pedro; Galverni, Marco; Musiani, Marco; Green, Richard E; Novembre, John; Wayne, Robert K
2016-01-01
In an era of ever-increasing amounts of whole-genome sequence data for individuals and populations, the utility of traditional single nucleotide polymorphisms (SNPs) array-based genome scans is uncertain. We previously performed a SNP array-based genome scan to identify candidate genes under selection in six distinct grey wolf (Canis lupus) ecotypes. Using this information, we designed a targeted capture array for 1040 genes, including all exons and flanking regions, as well as 5000 1-kb nongenic neutral regions, and resequenced these regions in 107 wolves. Selection tests revealed striking patterns of variation within candidate genes relative to noncandidate regions and identified potentially functional variants related to local adaptation. We found 27% and 47% of candidate genes from the previous SNP array study had functional changes that were outliers in sweed and bayenv analyses, respectively. This result verifies the use of genomewide SNP surveys to tag genes that contain functional variants between populations. We highlight nonsynonymous variants in APOB, LIPG and USH2A that occur in functional domains of these proteins, and that demonstrate high correlation with precipitation seasonality and vegetation. We find Arctic and High Arctic wolf ecotypes have higher numbers of genes under selection, which highlight their conservation value and heightened threat due to climate change. This study demonstrates that combining genomewide genotyping arrays with large-scale resequencing and environmental data provides a powerful approach to discern candidate functional variants in natural populations. © 2015 John Wiley & Sons Ltd.
Takashima, Eizo; Williams, Marni; Eiglmeier, Karin; Pain, Adrien; Guelbeogo, Wamdaogo M.; Gneme, Awa; Brito-Fravallo, Emma; Holm, Inge; Lavazec, Catherine; Sagnon, N’Fale; Baxter, Richard H.; Riehle, Michelle M.; Vernick, Kenneth D.
2015-01-01
Nucleotide variation patterns across species are shaped by the processes of natural selection, including exposure to environmental pathogens. We examined patterns of genetic variation in two sister species, Anopheles gambiae and Anopheles coluzzii, both efficient natural vectors of human malaria in West Africa. We used the differentiation signature displayed by a known coordinate selective sweep of immune genes APL1 and TEP1 in A. coluzzii to design a population genetic screen trained on the sweep, classified a panel of 26 potential immune genes for concordance with the signature, and functionally tested their immune phenotypes. The screen results were strongly predictive for genes with protective immune phenotypes: genes meeting the screen criteria were significantly more likely to display a functional phenotype against malaria infection than genes not meeting the criteria (p = 0.0005). Thus, an evolution-based screen can efficiently prioritize candidate genes for labor-intensive downstream functional testing, and safely allow the elimination of genes not meeting the screen criteria. The suite of immune genes with characteristics similar to the APL1-TEP1 selective sweep appears to be more widespread in the A. coluzzii genome than previously recognized. The immune gene differentiation may be a consequence of adaptation of A. coluzzii to new pathogens encountered in its niche expansion during the separation from A. gambiae, although the role, if any of natural selection by Plasmodium is unknown. Application of the screen allowed identification of new functional immune factors, and assignment of new functions to known factors. We describe biochemical binding interactions between immune proteins that underlie functional activity for malaria infection, which highlights the interplay between pathogen specificity and the structure of immune complexes. We also find that most malaria-protective immune factors display phenotypes for either human or rodent malaria, with broad specificity a rarity. PMID:26633695
Integrative Functional Genomics for Systems Genetics in GeneWeaver.org.
Bubier, Jason A; Langston, Michael A; Baker, Erich J; Chesler, Elissa J
2017-01-01
The abundance of existing functional genomics studies permits an integrative approach to interpreting and resolving the results of diverse systems genetics studies. However, a major challenge lies in assembling and harmonizing heterogeneous data sets across species for facile comparison to the positional candidate genes and coexpression networks that come from systems genetic studies. GeneWeaver is an online database and suite of tools at www.geneweaver.org that allows for fast aggregation and analysis of gene set-centric data. GeneWeaver contains curated experimental data together with resource-level data such as GO annotations, MP annotations, and KEGG pathways, along with persistent stores of user entered data sets. These can be entered directly into GeneWeaver or transferred from widely used resources such as GeneNetwork.org. Data are analyzed using statistical tools and advanced graph algorithms to discover new relations, prioritize candidate genes, and generate function hypotheses. Here we use GeneWeaver to find genes common to multiple gene sets, prioritize candidate genes from a quantitative trait locus, and characterize a set of differentially expressed genes. Coupling a large multispecies repository curated and empirical functional genomics data to fast computational tools allows for the rapid integrative analysis of heterogeneous data for interpreting and extrapolating systems genetics results.
Liu, Bin; Jin, Min; Zeng, Pan
2015-10-01
The identification of gene-phenotype relationships is very important for the treatment of human diseases. Studies have shown that genes causing the same or similar phenotypes tend to interact with each other in a protein-protein interaction (PPI) network. Thus, many identification methods based on the PPI network model have achieved good results. However, in the PPI network, some interactions between the proteins encoded by candidate gene and the proteins encoded by known disease genes are very weak. Therefore, some studies have combined the PPI network with other genomic information and reported good predictive performances. However, we believe that the results could be further improved. In this paper, we propose a new method that uses the semantic similarity between the candidate gene and known disease genes to set the initial probability vector of a random walk with a restart algorithm in a human PPI network. The effectiveness of our method was demonstrated by leave-one-out cross-validation, and the experimental results indicated that our method outperformed other methods. Additionally, our method can predict new causative genes of multifactor diseases, including Parkinson's disease, breast cancer and obesity. The top predictions were good and consistent with the findings in the literature, which further illustrates the effectiveness of our method. Copyright © 2015 Elsevier Inc. All rights reserved.
Revealing the Strong Functional Association of adipor2 and cdh13 with adipoq: A Gene Network Study.
Bag, Susmita; Anbarasu, Anand
2015-04-01
In the present study, we have analyzed functional gene interactions of adiponectin gene (adipoq). The key role of adipoq is in regulating energy homeostasis and it functions as a novel signaling molecule for adipose tissue. Modules of highly inter-connected genes in disease-specific adipoq network are derived by integrating gene function and protein interaction data. Among twenty genes in adipoq web, adipoq is effectively conjoined with two genes: Adiponectin receptor 2 (adipor2) and cadherin 13 (cdh13). The functional analysis is done via ontological briefing and candidate disease identification. We observed that the highly efficient-interlinked genes connected with adipoq are adipor2 and cdh13. Interestingly, the ontological aspect of adipor2 and cdh13 in the adipoq network reveal the fact that adipoq and adipor2 are involved mostly in glucose and lipid metabolic processes. The gene cdh13 indulge in cell adhesion process with adipoq and adipor2. Our computational gene web analysis also predicts potential candidate disease recognition, thus indicating the involvement of adipoq, adipor2, and cdh13 with not only with obesity but also with breast cancer, leukemia, renal cancer, lung cancer, and cervical cancer. The current study provides researchers a comprehensible layout of adipoq network, its functional strategies and candidate disease approach associated with adipoq network.
Genome-scale expression studies and comprehensive loss-of-function genetic screens have focused almost exclusively on the highest confidence candidate genes. Here, we describe a strategy for characterizing the lower confidence candidates identified by such approaches.
Identification of possible genetic polymorphisms involved in cancer cachexia: a systematic review.
Tan, Benjamin H L; Ross, James A; Kaasa, Stein; Skorpen, Frank; Fearon, Kenneth C H
2011-04-01
Cancer cachexia is a polygenic and complex syndrome. Genetic variations in regulation of the inflammatory response, muscle and fat metabolic pathways, and pathways in appetite regulation are likely to contribute to the susceptibility or resistance to developing cancer cachexia. A systematic search of Medline and EmBase databases, covering 1986-2008 was performed for potential candidate genes/genetic polymorphisms relating to cancer cachexia. Related genes were then identified using pathway functional analysis software. All candidate genes were reviewed for functional polymorphisms or clinically significant polymorphisms associated with cachexia using the OMIM and GeneRIF databases. Genes with variants which had functional or clinical associations with cachexia and replicated in at least one study were entered into pathway analysis software to reveal possible network associations between genes. A total of 184 polymorphisms with functional or clinical relevance to cancer cachexia were identified in 92 candidate genes. Of these, 42 polymorphisms (in 33 genes) were replicated in more than one study with 13 polymorphisms found to influence two or more hallmarks of cachexia (i.e. inflammation, loss of fat mass and/or lean mass and reduced survival). Thirty-three genes were found to be significantly interconnected in two major networks with four genes (ADIPOQ, IL6, NFKB1 and TLR4) interlinking both networks. Selection of candidate genes and polymorphisms is a key element of multigene study design. The present study provides an initial framework to select genes/polymorphisms for further study in cancer cachexia, and to develop their potential as susceptibility biomarkers of developing cachexia.
Di Gregorio, E; Riberi, E; Belligni, E F; Biamino, E; Spielmann, M; Ala, U; Calcia, A; Bagnasco, I; Carli, D; Gai, G; Giordano, M; Guala, A; Keller, R; Mandrile, G; Arduino, C; Maffè, A; Naretto, V G; Sirchia, F; Sorasio, L; Ungari, S; Zonta, A; Zacchetti, G; Talarico, F; Pappi, P; Cavalieri, S; Giorgio, E; Mancini, C; Ferrero, M; Brussino, A; Savin, E; Gandione, M; Pelle, A; Giachino, D F; De Marchi, M; Restagno, G; Provero, P; Cirillo Silengo, M; Grosso, E; Buxbaum, J D; Pasini, B; De Rubeis, S; Brusco, A; Ferrero, G B
2017-10-01
Array-comparative genomic hybridization (array-CGH) is a widely used technique to detect copy number variants (CNVs) associated with developmental delay/intellectual disability (DD/ID). Identification of genomic disorders in DD/ID. We performed a comprehensive array-CGH investigation of 1,015 consecutive cases with DD/ID and combined literature mining, genetic evidence, evolutionary constraint scores, and functional information in order to assess the pathogenicity of the CNVs. We identified non-benign CNVs in 29% of patients. Amongst the pathogenic variants (11%), detected with a yield consistent with the literature, we found rare genomic disorders and CNVs spanning known disease genes. We further identified and discussed 51 cases with likely pathogenic CNVs spanning novel candidate genes, including genes encoding synaptic components and/or proteins involved in corticogenesis. Additionally, we identified two deletions spanning potential Topological Associated Domain (TAD) boundaries probably affecting the regulatory landscape. We show how phenotypic and genetic analyses of array-CGH data allow unraveling complex cases, identifying rare disease genes, and revealing unexpected position effects. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Burt, Andrew J; William, H Manilal; Perry, Gregory; Khanal, Raja; Pauls, K Peter; Kelly, James D; Navabi, Alireza
2015-01-01
Anthracnose, caused by Colletotrichum lindemuthianum, is an important fungal disease of common bean (Phaseolus vulgaris). Alleles at the Co-4 locus confer resistance to a number of races of C. lindemuthianum. A population of 94 F4:5 recombinant inbred lines of a cross between resistant black bean genotype B09197 and susceptible navy bean cultivar Nautica was used to identify markers associated with resistance in bean chromosome 8 (Pv08) where Co-4 is localized. Three SCAR markers with known linkage to Co-4 and a panel of single nucleotide markers were used for genotyping. A refined physical region on Pv08 with significant association with anthracnose resistance identified by markers was used in BLAST searches with the genomic sequence of common bean accession G19833. Thirty two unique annotated candidate genes were identified that spanned a physical region of 936.46 kb. A majority of the annotated genes identified had functional similarity to leucine rich repeats/receptor like kinase domains. Three annotated genes had similarity to 1, 3-β-glucanase domains. There were sequence similarities between some of the annotated genes found in the study and the genes associated with phosphoinositide-specific phosphilipases C associated with Co-x and the COK-4 loci found in previous studies. It is possible that the Co-4 locus is structured as a group of genes with functional domains dominated by protein tyrosine kinase along with leucine rich repeats/nucleotide binding site, phosphilipases C as well as β-glucanases.
Burt, Andrew J.; William, H. Manilal; Perry, Gregory; Khanal, Raja; Pauls, K. Peter; Kelly, James D.; Navabi, Alireza
2015-01-01
Anthracnose, caused by Colletotrichum lindemuthianum, is an important fungal disease of common bean (Phaseolus vulgaris). Alleles at the Co–4 locus confer resistance to a number of races of C. lindemuthianum. A population of 94 F4:5 recombinant inbred lines of a cross between resistant black bean genotype B09197 and susceptible navy bean cultivar Nautica was used to identify markers associated with resistance in bean chromosome 8 (Pv08) where Co–4 is localized. Three SCAR markers with known linkage to Co–4 and a panel of single nucleotide markers were used for genotyping. A refined physical region on Pv08 with significant association with anthracnose resistance identified by markers was used in BLAST searches with the genomic sequence of common bean accession G19833. Thirty two unique annotated candidate genes were identified that spanned a physical region of 936.46 kb. A majority of the annotated genes identified had functional similarity to leucine rich repeats/receptor like kinase domains. Three annotated genes had similarity to 1, 3-β-glucanase domains. There were sequence similarities between some of the annotated genes found in the study and the genes associated with phosphoinositide-specific phosphilipases C associated with Co-x and the COK–4 loci found in previous studies. It is possible that the Co–4 locus is structured as a group of genes with functional domains dominated by protein tyrosine kinase along with leucine rich repeats/nucleotide binding site, phosphilipases C as well as β-glucanases. PMID:26431031
Exploiting induced variation to dissect quantitative traits in barley.
Druka, Arnis; Franckowiak, Jerome; Lundqvist, Udda; Bonar, Nicola; Alexander, Jill; Guzy-Wrobelska, Justyna; Ramsay, Luke; Druka, Ilze; Grant, Iain; Macaulay, Malcolm; Vendramin, Vera; Shahinnia, Fahimeh; Radovic, Slobodanka; Houston, Kelly; Harrap, David; Cardle, Linda; Marshall, David; Morgante, Michele; Stein, Nils; Waugh, Robbie
2010-04-01
The identification of genes underlying complex quantitative traits such as grain yield by means of conventional genetic analysis (positional cloning) requires the development of several large mapping populations. However, it is possible that phenotypically related, but more extreme, allelic variants generated by mutational studies could provide a means for more efficient cloning of QTLs (quantitative trait loci). In barley (Hordeum vulgare), with the development of high-throughput genome analysis tools, efficient genome-wide identification of genetic loci harbouring mutant alleles has recently become possible. Genotypic data from NILs (near-isogenic lines) that carry induced or natural variants of genes that control aspects of plant development can be compared with the location of QTLs to potentially identify candidate genes for development--related traits such as grain yield. As yield itself can be divided into a number of allometric component traits such as tillers per plant, kernels per spike and kernel size, mutant alleles that both affect these traits and are located within the confidence intervals for major yield QTLs may represent extreme variants of the underlying genes. In addition, the development of detailed comparative genomic models based on the alignment of a high-density barley gene map with the rice and sorghum physical maps, has enabled an informed prioritization of 'known function' genes as candidates for both QTLs and induced mutant genes.
Sun, Celi; Molineros, Julio E; Looger, Loren L; Zhou, Xu-Jie; Kim, Kwangwoo; Okada, Yukinori; Ma, Jianyang; Qi, Yuan-Yuan; Kim-Howard, Xana; Motghare, Prasenjeet; Bhattarai, Krishna; Adler, Adam; Bang, So-Young; Lee, Hye-Soon; Kim, Tae-Hwan; Kang, Young Mo; Suh, Chang-Hee; Chung, Won Tae; Park, Yong-Beom; Choe, Jung-Yoon; Shim, Seung Cheol; Kochi, Yuta; Suzuki, Akari; Kubo, Michiaki; Sumida, Takayuki; Yamamoto, Kazuhiko; Lee, Shin-Seok; Kim, Young Jin; Han, Bok-Ghee; Dozmorov, Mikhail; Kaufman, Kenneth M; Wren, Jonathan D; Harley, John B; Shen, Nan; Chua, Kek Heng; Zhang, Hong; Bae, Sang-Cheol; Nath, Swapan K
2016-03-01
Systemic lupus erythematosus (SLE) has a strong but incompletely understood genetic architecture. We conducted an association study with replication in 4,478 SLE cases and 12,656 controls from six East Asian cohorts to identify new SLE susceptibility loci and better localize known loci. We identified ten new loci and confirmed 20 known loci with genome-wide significance. Among the new loci, the most significant locus was GTF2IRD1-GTF2I at 7q11.23 (rs73366469, Pmeta = 3.75 × 10(-117), odds ratio (OR) = 2.38), followed by DEF6, IL12B, TCF7, TERT, CD226, PCNXL3, RASGRP1, SYNGR1 and SIGLEC6. We identified the most likely functional variants at each locus by analyzing epigenetic marks and gene expression data. Ten candidate variants are known to alter gene expression in cis or in trans. Enrichment analysis highlights the importance of these loci in B cell and T cell biology. The new loci, together with previously known loci, increase the explained heritability of SLE to 24%. The new loci share functional and ontological characteristics with previously reported loci and are possible drug targets for SLE therapeutics.
Tavtigian, Sean V; Byrnes, Graham B; Goldgar, David E; Thomas, Alun
2008-11-01
Many individually rare missense substitutions are encountered during deep resequencing of candidate susceptibility genes and clinical mutation screening of known susceptibility genes. BRCA1 and BRCA2 are among the most resequenced of all genes, and clinical mutation screening of these genes provides an extensive data set for analysis of rare missense substitutions. Align-GVGD is a mathematically simple missense substitution analysis algorithm, based on the Grantham difference, which has already contributed to classification of missense substitutions in BRCA1, BRCA2, and CHEK2. However, the distribution of genetic risk as a function of Align-GVGD's output variables Grantham variation (GV) and Grantham deviation (GD) has not been well characterized. Here, we used data from the Myriad Genetic Laboratories database of nearly 70,000 full-sequence tests plus two risk estimates, one approximating the odds ratio and the other reflecting strength of selection, to display the distribution of risk in the GV-GD plane as a series of surfaces. We abstracted contours from the surfaces and used the contours to define a sequence of missense substitution grades ordered from greatest risk to least risk. The grades were validated internally using a third, personal and family history-based, measure of risk. The Align-GVGD grades defined here are applicable to both the genetic epidemiology problem of classifying rare missense substitutions observed in known susceptibility genes and the molecular epidemiology problem of analyzing rare missense substitutions observed during case-control mutation screening studies of candidate susceptibility genes. (c) 2008 Wiley-Liss, Inc.
Posnien, Nico; Koniszewski, Nikolaus Dieter Bernhard; Hein, Hendrikje Jeannette; Bucher, Gregor
2011-12-01
Several highly conserved genes play a role in anterior neural plate patterning of vertebrates and in head and brain patterning of insects. However, head involution in Drosophila has impeded a systematic identification of genes required for insect head formation. Therefore, we use the red flour beetle Tribolium castaneum in order to comprehensively test the function of orthologs of vertebrate neural plate patterning genes for a function in insect head development. RNAi analysis reveals that most of these genes are indeed required for insect head capsule patterning, and we also identified several genes that had not been implicated in this process before. Furthermore, we show that Tc-six3/optix acts upstream of Tc-wingless, Tc-orthodenticle1, and Tc-eyeless to control anterior median development. Finally, we demonstrate that Tc-six3/optix is the first gene known to be required for the embryonic formation of the central complex, a midline-spanning brain part connected to the neuroendocrine pars intercerebralis. These functions are very likely conserved among bilaterians since vertebrate six3 is required for neuroendocrine and median brain development with certain mutations leading to holoprosencephaly.
Hein, Hendrikje Jeannette; Bucher, Gregor
2011-01-01
Several highly conserved genes play a role in anterior neural plate patterning of vertebrates and in head and brain patterning of insects. However, head involution in Drosophila has impeded a systematic identification of genes required for insect head formation. Therefore, we use the red flour beetle Tribolium castaneum in order to comprehensively test the function of orthologs of vertebrate neural plate patterning genes for a function in insect head development. RNAi analysis reveals that most of these genes are indeed required for insect head capsule patterning, and we also identified several genes that had not been implicated in this process before. Furthermore, we show that Tc-six3/optix acts upstream of Tc-wingless, Tc-orthodenticle1, and Tc-eyeless to control anterior median development. Finally, we demonstrate that Tc-six3/optix is the first gene known to be required for the embryonic formation of the central complex, a midline-spanning brain part connected to the neuroendocrine pars intercerebralis. These functions are very likely conserved among bilaterians since vertebrate six3 is required for neuroendocrine and median brain development with certain mutations leading to holoprosencephaly. PMID:22216011
Fu, Hsu-Yuan; Lu, Yen-Hsu; Yi, Hsiu-Ping; Yang, Chii-Shen
2013-04-05
Microbial sensory rhodopsins are known to mediate phototaxis, and all of the known sensory rhodopsins execute this function with a specific cognate transducer that has two-transmembrane (2-TM) regions. In the genome of Haloarcula marismortui, a total of six rhodopsin genes were annotated, and we previously showed three of them to be the ion type and suggested the other three as sensory type, even though the candidate transducer gene, htr, for HmSRI was missing the 2-TM region that is found in all of the other known transducers. Here we showed this htr gene featured a preceding 2-TM region when the alternative start codon GTG located 291 nucleotides upstream of the original annotated open reading frame (ORF) was introduced and it is named as htrI in this study. Overexpression of HmHtrI exhibited it existed as a membrane protein and several biophysical assays confirmed it functionally interacted with HmSRI. Together with our previous reverse-transcriptase-PCR results and phototaxis measurements, the new ORF of original predicted soluble htr gene product was a membrane protein with a 2-TM region, HmHtrI; and it serves as the cognate transducer for HmSRI. HmHtrI therefore is the first transducer for the sensory rhodopsin adopted start codon other than ATG. Copyright © 2013 Elsevier B.V. All rights reserved.
Reveal genes functionally associated with ACADS by a network study.
Chen, Yulong; Su, Zhiguang
2015-09-15
Establishing a systematic network is aimed at finding essential human gene-gene/gene-disease pathway by means of network inter-connecting patterns and functional annotation analysis. In the present study, we have analyzed functional gene interactions of short-chain acyl-coenzyme A dehydrogenase gene (ACADS). ACADS plays a vital role in free fatty acid β-oxidation and regulates energy homeostasis. Modules of highly inter-connected genes in disease-specific ACADS network are derived by integrating gene function and protein interaction data. Among the 8 genes in ACADS web retrieved from both STRING and GeneMANIA, ACADS is effectively conjoined with 4 genes including HAHDA, HADHB, ECHS1 and ACAT1. The functional analysis is done via ontological briefing and candidate disease identification. We observed that the highly efficient-interlinked genes connected with ACADS are HAHDA, HADHB, ECHS1 and ACAT1. Interestingly, the ontological aspect of genes in the ACADS network reveals that ACADS, HAHDA and HADHB play equally vital roles in fatty acid metabolism. The gene ACAT1 together with ACADS indulges in ketone metabolism. Our computational gene web analysis also predicts potential candidate disease recognition, thus indicating the involvement of ACADS, HAHDA, HADHB, ECHS1 and ACAT1 not only with lipid metabolism but also with infant death syndrome, skeletal myopathy, acute hepatic encephalopathy, Reye-like syndrome, episodic ketosis, and metabolic acidosis. The current study presents a comprehensible layout of ACADS network, its functional strategies and candidate disease approach associated with ACADS network. Copyright © 2015 Elsevier B.V. All rights reserved.
Transcriptional Regulatory Network Analysis of MYB Transcription Factor Family Genes in Rice.
Smita, Shuchi; Katiyar, Amit; Chinnusamy, Viswanathan; Pandey, Dev M; Bansal, Kailash C
2015-01-01
MYB transcription factor (TF) is one of the largest TF families and regulates defense responses to various stresses, hormone signaling as well as many metabolic and developmental processes in plants. Understanding these regulatory hierarchies of gene expression networks in response to developmental and environmental cues is a major challenge due to the complex interactions between the genetic elements. Correlation analyses are useful to unravel co-regulated gene pairs governing biological process as well as identification of new candidate hub genes in response to these complex processes. High throughput expression profiling data are highly useful for construction of co-expression networks. In the present study, we utilized transcriptome data for comprehensive regulatory network studies of MYB TFs by "top-down" and "guide-gene" approaches. More than 50% of OsMYBs were strongly correlated under 50 experimental conditions with 51 hub genes via "top-down" approach. Further, clusters were identified using Markov Clustering (MCL). To maximize the clustering performance, parameter evaluation of the MCL inflation score (I) was performed in terms of enriched GO categories by measuring F-score. Comparison of co-expressed cluster and clads analyzed from phylogenetic analysis signifies their evolutionarily conserved co-regulatory role. We utilized compendium of known interaction and biological role with Gene Ontology enrichment analysis to hypothesize function of coexpressed OsMYBs. In the other part, the transcriptional regulatory network analysis by "guide-gene" approach revealed 40 putative targets of 26 OsMYB TF hubs with high correlation value utilizing 815 microarray data. The putative targets with MYB-binding cis-elements enrichment in their promoter region, functional co-occurrence as well as nuclear localization supports our finding. Specially, enrichment of MYB binding regions involved in drought-inducibility implying their regulatory role in drought response in rice. Thus, the co-regulatory network analysis facilitated the identification of complex OsMYB regulatory networks, and candidate target regulon genes of selected guide MYB genes. The results contribute to the candidate gene screening, and experimentally testable hypotheses for potential regulatory MYB TFs, and their targets under stress conditions.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Weigel, D.
2003-03-11
OAK-B135 Results obtained during this funding period: (1) Phylogenetic footprinting of AG regulatory sequences Sequences necessary and sufficient for AGAMOUS (AG) expression in the center of Arabidopsis flowers are located in the second intron, which is about 3 kb in size. This intron contains binding sites for two transcription factors, LEAFY (LFY) and WUSCHEL (WUS), which are direct activators of AG. We used the new method of phylogenetic shadowing to identify new regulatory elements. Among 29 Brassicaceae, several other motifs, but not the LFY and WUS binding sites previously identified, are largely invariant. Using reporter gene analyses, we tested sixmore » of these motifs and found that they are all functionally important for activity of AG regulatory sequences in A. thaliana. (2) Repression of AG by MADS box genes A candidate for repressing AG in the shoot apical meristem has been the MADS box gene FUL, since it is expressed in the shoot apical meristem and since an activated version (FUL:VP16) leads to ectopic AG expression in the shoot apical meristem. However, there is no ectopic AG expression in full single mutants. We therefore started to generate VP16 fusions of several other MADS box genes expressed in the shoot apical meristem, to determine which of these might be candidates for FUL redundant genes. We found that AGL6:VP16 has a similar phenotype as FUL:VP16, suggesting that AGL6 and FUL interact. We are now testing this hypothesis. (3) Two candidate AG regulators, WOW and ULA Because the phylogenetic footprinting project has identified several new candidate regulatory motifs, of which at least one (the CCAATCA motif) has rather strong effects, we had decided to put the analysis of WOW and ULA on hold, and to focus on using the newly identified motifs as tools. We conduct ed yeast one-hybrid screen with two of the conserved motifs, and identified several classes of transcription factors that can interact with them. One of these is encoded by the PAN gene, previously known to be expressed in a domain that overlaps the AG domain, but not known before to regulate AG. (4) New genetic modifiers of AG This part of the project was concluded in the previous funding period.« less
Kahrizi, Kimia; Musante, Luciana; Fattahi, Zohreh; Hosseini, Masoumeh; Maqsoud, Fariba; Farajollahi, Reza; Wienker, Thomas F.; Ropers, H. Hilger; Najmabadi, Hossein
2015-01-01
Cognitive impairment or intellectual disability (ID) is a widespread neurodevelopmental disorder characterized by low IQ (below 70). ID is genetically heterogeneous and is estimated to affect 1–3% of the world’s population. In affected children from consanguineous families, autosomal recessive inheritance is common, and identifying the underlying genetic cause is an important issue in clinical genetics. In the framework of a larger project, aimed at identifying candidate genes for autosomal recessive intellectual disorder (ARID), we recently carried out single nucleotide polymorphism-based genome-wide linkage analysis in several families from Ardabil province in Iran. The identification of homozygosity-by-descent loci in these families, in combination with whole exome sequencing, led us to identify possible causative homozygous changes in two families. In the first family, a missense variant was found in GRM1 gene, while in the second family, a frameshift alteration was identified in TRMT1, both of which were found to co-segregate with the disease. GRM1, a known causal gene for autosomal recessive spinocerebellar ataxia (SCAR13, MIM#614831), encodes the metabotropic glutamate receptor1 (mGluR1). This gene plays an important role in synaptic plasticity and cerebellar development. Conversely, the TRMT1 gene encodes a tRNA methyltransferase that dimethylates a single guanine residue at position 26 of most tRNAs using S-adenosyl methionine as the methyl group donor. We recently presented TRMT1 as a candidate gene for ARID in a consanguineous Iranian family (Najmabadi et al., 2011). We believe that this second Iranian family with a biallelic loss-of-function mutation in TRMT1 gene supports the idea that this gene likely has function in development of the disorder. PMID:26308914
Davarniya, Behzad; Hu, Hao; Kahrizi, Kimia; Musante, Luciana; Fattahi, Zohreh; Hosseini, Masoumeh; Maqsoud, Fariba; Farajollahi, Reza; Wienker, Thomas F; Ropers, H Hilger; Najmabadi, Hossein
2015-01-01
Cognitive impairment or intellectual disability (ID) is a widespread neurodevelopmental disorder characterized by low IQ (below 70). ID is genetically heterogeneous and is estimated to affect 1-3% of the world's population. In affected children from consanguineous families, autosomal recessive inheritance is common, and identifying the underlying genetic cause is an important issue in clinical genetics. In the framework of a larger project, aimed at identifying candidate genes for autosomal recessive intellectual disorder (ARID), we recently carried out single nucleotide polymorphism-based genome-wide linkage analysis in several families from Ardabil province in Iran. The identification of homozygosity-by-descent loci in these families, in combination with whole exome sequencing, led us to identify possible causative homozygous changes in two families. In the first family, a missense variant was found in GRM1 gene, while in the second family, a frameshift alteration was identified in TRMT1, both of which were found to co-segregate with the disease. GRM1, a known causal gene for autosomal recessive spinocerebellar ataxia (SCAR13, MIM#614831), encodes the metabotropic glutamate receptor1 (mGluR1). This gene plays an important role in synaptic plasticity and cerebellar development. Conversely, the TRMT1 gene encodes a tRNA methyltransferase that dimethylates a single guanine residue at position 26 of most tRNAs using S-adenosyl methionine as the methyl group donor. We recently presented TRMT1 as a candidate gene for ARID in a consanguineous Iranian family (Najmabadi et al., 2011). We believe that this second Iranian family with a biallelic loss-of-function mutation in TRMT1 gene supports the idea that this gene likely has function in development of the disorder.
Baumgartner, Desiree; Kopf, Matthias; Klähn, Stephan; Steglich, Claudia; Hess, Wolfgang R
2016-11-28
Despite their versatile functions in multimeric protein complexes, in the modification of enzymatic activities, intercellular communication or regulatory processes, proteins shorter than 80 amino acids (μ-proteins) are a systematically underestimated class of gene products in bacteria. Photosynthetic cyanobacteria provide a paradigm for small protein functions due to extensive work on the photosynthetic apparatus that led to the functional characterization of 19 small proteins of less than 50 amino acids. In analogy, previously unstudied small ORFs with similar degrees of conservation might encode small proteins of high relevance also in other functional contexts. Here we used comparative transcriptomic information available for two model cyanobacteria, Synechocystis sp. PCC 6803 and Synechocystis sp. PCC 6714 for the prediction of small ORFs. We found 293 transcriptional units containing candidate small ORFs ≤80 codons in Synechocystis sp. PCC 6803, also including the known mRNAs encoding small proteins of the photosynthetic apparatus. From these transcriptional units, 146 are shared between the two strains, 42 are shared with the higher plant Arabidopsis thaliana and 25 with E. coli. To verify the existence of the respective μ-proteins in vivo, we selected five genes as examples to which a FLAG tag sequence was added and re-introduced them into Synechocystis sp. PCC 6803. These were the previously annotated gene ssr1169, two newly defined genes norf1 and norf4, as well as nsiR6 (nitrogen stress-induced RNA 6) and hliR1(high light-inducible RNA 1) , which originally were considered non-coding. Upon activation of expression via the Cu 2+. responsive petE promoter or from the native promoters, all five proteins were detected in Western blot experiments. The distribution and conservation of these five genes as well as their regulation of expression and the physico-chemical properties of the encoded proteins underline the likely great bandwidth of small protein functions in bacteria and makes them attractive candidates for functional studies.
Inflammatory Bowel Diseases: the genetic revolution.
Jung, C; Hugot, J-P
2009-06-01
The genetic component of Inflammatory Bowel Diseases is among the best known for complex genetic disorders. If the functional candidate gene approach was rarely fruitful in the past, genome-wide scans allowed finding several susceptibility genes for Crohn disease including NOD2, IL23R, ATG16L1, IRGM, TNFSF15, a region close to PTGER4, PTPN2, PTPN22, NKX2-3 and many others. Only one gene, ECM1, has been reported for ulcerative colitis alone. We now need to further explore these new genes before to understand their biological role. However they clearly demonstrate the importance of innate immunity and autophagy for Crohn's disease and of the TH-17 differentiation for ulcerative colitis, Crohn's disease and other inflammatory disorders. Copyright 2009 Elsevier Masson SAS. All rights reserved.
NASA Astrophysics Data System (ADS)
Devanna, Paolo; Vernes, Sonja C.
2014-02-01
Retinoic acid-related orphan receptor alpha gene (RORa) and the microRNA MIR137 have both recently been identified as novel candidate genes for neuropsychiatric disorders. RORa encodes a ligand-dependent orphan nuclear receptor that acts as a transcriptional regulator and miR-137 is a brain enriched small non-coding RNA that interacts with gene transcripts to control protein levels. Given the mounting evidence for RORa in autism spectrum disorders (ASD) and MIR137 in schizophrenia and ASD, we investigated if there was a functional biological relationship between these two genes. Herein, we demonstrate that miR-137 targets the 3'UTR of RORa in a site specific manner. We also provide further support for MIR137 as an autism candidate by showing that a large number of previously implicated autism genes are also putatively targeted by miR-137. This work supports the role of MIR137 as an ASD candidate and demonstrates a direct biological link between these previously unrelated autism candidate genes.
Arafa, Ramadan A.; Rakha, Mohamed T.; Kamel, Said M.
2017-01-01
Tomato late blight caused by Phytophthora infestans (Mont.) de Bary, also known as the Irish famine pathogen, is one of the most destructive plant diseases. Wild relatives of tomato possess useful resistance genes against this disease, and could therefore be used in breeding to improve cultivated varieties. In the genome of a wild relative of tomato, Solanum habrochaites accession LA1777, we identified a new quantitative trait locus for resistance against blight caused by an aggressive Egyptian isolate of P. infestans. Using double-digest restriction site–associated DNA sequencing (ddRAD-Seq) technology, we determined 6,514 genome-wide SNP genotypes of an F2 population derived from an interspecific cross. Subsequent association analysis of genotypes and phenotypes of the mapping population revealed that a 6.8 Mb genome region on chromosome 6 was a candidate locus for disease resistance. Whole-genome resequencing analysis revealed that 298 genes in this region potentially had functional differences between the parental lines. Among of them, two genes with missense mutations, Solyc06g071810.1 and Solyc06g083640.3, were considered to be potential candidates for disease resistance. SNP and SSR markers linking to this region can be used in marker-assisted selection in future breeding programs for late blight disease, including introgression of new genetic loci from wild species. In addition, the approach developed in this study provides a model for identification of other genes for attractive agronomical traits. PMID:29253902
Li, Yuanjun; Gou, Junbo; Chen, Fangfang; Li, Changfu; Zhang, Yansheng
2016-01-01
Xanthium strumarium L. is a traditional Chinese herb belonging to the Asteraceae family. The major bioactive components of this plant are sesquiterpene lactones (STLs), which include the xanthanolides. To date, the biogenesis of xanthanolides, especially their downstream pathway, remains largely unknown. In X. strumarium, xanthanolides primarily accumulate in its glandular trichomes. To identify putative gene candidates involved in the biosynthesis of xanthanolides, three X. strumarium transcriptomes, which were derived from the young leaves of two different cultivars and the purified glandular trichomes from one of the cultivars, were constructed in this study. In total, 157 million clean reads were generated and assembled into 91,861 unigenes, of which 59,858 unigenes were successfully annotated. All the genes coding for known enzymes in the upstream pathway to the biosynthesis of xanthanolides were present in the X. strumarium transcriptomes. From a comparative analysis of the X. strumarium transcriptomes, this study identified a number of gene candidates that are putatively involved in the downstream pathway to the synthesis of xanthanolides, such as four unigenes encoding CYP71 P450s, 50 unigenes for dehydrogenases, and 27 genes for acetyltransferases. The possible functions of these four CYP71 candidates are extensively discussed. In addition, 116 transcription factors that are highly expressed in X. strumarium glandular trichomes were also identified. Their possible regulatory roles in the biosynthesis of STLs are discussed. The global transcriptomic data for X. strumarium should provide a valuable resource for further research into the biosynthesis of xanthanolides.
Du, Qingzhang; Gong, Chenrui; Pan, Wei; Zhang, Deqiang
2013-02-01
Gene-derived simple sequence repeats (genic SSRs), also known as functional markers, are often preferred over random genomic markers because they represent variation in gene coding and/or regulatory regions. We characterized 544 genic SSR loci derived from 138 candidate genes involved in wood formation, distributed throughout the genome of Populus tomentosa, a key ecological and cultivated wood production species. Of these SSRs, three-quarters were located in the promoter or intron regions, and dinucleotide (59.7%) and trinucleotide repeat motifs (26.5%) predominated. By screening 15 wild P. tomentosa ecotypes, we identified 188 polymorphic genic SSRs with 861 alleles, 2-7 alleles for each marker. Transferability analysis of 30 random genic SSRs, testing whether these SSRs work in 26 genotypes of five genus Populus sections (outgroup, Salix matsudana), showed that 72% of the SSRs could be amplified in Turanga and 100% could be amplified in Leuce. Based on genotyping of these 26 genotypes, a neighbour-joining analysis showed the expected six phylogenetic groupings. In silico analysis of SSR variation in 220 sequences that are homologous between P. tomentosa and Populus trichocarpa suggested that genic SSR variations between relatives were predominantly affected by repeat motif variations or flanking sequence mutations. Inheritance tests and single-marker associations demonstrated the power of genic SSRs in family-based linkage mapping and candidate gene-based association studies, as well as marker-assisted selection and comparative genomic studies of P. tomentosa and related species.
Systematic analysis of copy number variation associated with congenital diaphragmatic hernia.
Zhu, Qihui; High, Frances A; Zhang, Chengsheng; Cerveira, Eliza; Russell, Meaghan K; Longoni, Mauro; Joy, Maliackal P; Ryan, Mallory; Mil-Homens, Adam; Bellfy, Lauren; Coletti, Caroline M; Bhayani, Pooja; Hila, Regis; Wilson, Jay M; Donahoe, Patricia K; Lee, Charles
2018-05-15
Congenital diaphragmatic hernia (CDH), characterized by malformation of the diaphragm and hypoplasia of the lungs, is one of the most common and severe birth defects, and is associated with high morbidity and mortality rates. There is growing evidence demonstrating that genetic factors contribute to CDH, although the pathogenesis remains largely elusive. Single-nucleotide polymorphisms have been studied in recent whole-exome sequencing efforts, but larger copy number variants (CNVs) have not yet been studied on a large scale in a case control study. To capture CNVs within CDH candidate regions, we developed and tested a targeted array comparative genomic hybridization platform to identify CNVs within 140 regions in 196 patients and 987 healthy controls, and identified six significant CNVs that were either unique to patients or enriched in patients compared with controls. These CDH-associated CNVs reveal high-priority candidate genes including HLX , LHX1 , and HNF1B We also discuss CNVs that are present in only one patient in the cohort but have additional evidence of pathogenicity, including extremely rare large and/or de novo CNVs. The candidate genes within these predicted disease-causing CNVs form functional networks with other known CDH genes and play putative roles in DNA binding/transcription regulation and embryonic development. These data substantiate the importance of CNVs in the etiology of CDH, identify CDH candidate genes and pathways, and highlight the importance of ongoing analysis of CNVs in the study of CDH and other structural birth defects. Copyright © 2018 the Author(s). Published by PNAS.
Genome-Wide Specific Selection in Three Domestic Sheep Breeds.
Wang, Huihua; Zhang, Li; Cao, Jiaxve; Wu, Mingming; Ma, Xiaomeng; Liu, Zhen; Liu, Ruizao; Zhao, Fuping; Wei, Caihong; Du, Lixin
2015-01-01
Commercial sheep raised for mutton grow faster than traditional Chinese sheep breeds. Here, we aimed to evaluate genetic selection among three different types of sheep breed: two well-known commercial mutton breeds and one indigenous Chinese breed. We first combined locus-specific branch lengths and di statistical methods to detect candidate regions targeted by selection in the three different populations. The results showed that the genetic distances reached at least medium divergence for each pairwise combination. We found these two methods were highly correlated, and identified many growth-related candidate genes undergoing artificial selection. For production traits, APOBR and FTO are associated with body mass index. For meat traits, ALDOA, STK32B and FAM190A are related to marbling. For reproduction traits, CCNB2 and SLC8A3 affect oocyte development. We also found two well-known genes, GHR (which affects meat production and quality) and EDAR (associated with hair thickness) were associated with German mutton merino sheep. Furthermore, four genes (POL, RPL7, MSL1 and SHISA9) were associated with pre-weaning gain in our previous genome-wide association study. Our results indicated that combine locus-specific branch lengths and di statistical approaches can reduce the searching ranges for specific selection. And we got many credible candidate genes which not only confirm the results of previous reports, but also provide a suite of novel candidate genes in defined breeds to guide hybridization breeding.
Sivley, R Michael; Sheehan, Jonathan H; Kropski, Jonathan A; Cogan, Joy; Blackwell, Timothy S; Phillips, John A; Bush, William S; Meiler, Jens; Capra, John A
2018-01-23
Next-generation sequencing of individuals with genetic diseases often detects candidate rare variants in numerous genes, but determining which are causal remains challenging. We hypothesized that the spatial distribution of missense variants in protein structures contains information about function and pathogenicity that can help prioritize variants of unknown significance (VUS) and elucidate the structural mechanisms leading to disease. To illustrate this approach in a clinical application, we analyzed 13 candidate missense variants in regulator of telomere elongation helicase 1 (RTEL1) identified in patients with Familial Interstitial Pneumonia (FIP). We curated pathogenic and neutral RTEL1 variants from the literature and public databases. We then used homology modeling to construct a 3D structural model of RTEL1 and mapped known variants into this structure. We next developed a pathogenicity prediction algorithm based on proximity to known disease causing and neutral variants and evaluated its performance with leave-one-out cross-validation. We further validated our predictions with segregation analyses, telomere lengths, and mutagenesis data from the homologous XPD protein. Our algorithm for classifying RTEL1 VUS based on spatial proximity to pathogenic and neutral variation accurately distinguished 7 known pathogenic from 29 neutral variants (ROC AUC = 0.85) in the N-terminal domains of RTEL1. Pathogenic proximity scores were also significantly correlated with effects on ATPase activity (Pearson r = -0.65, p = 0.0004) in XPD, a related helicase. Applying the algorithm to 13 VUS identified from sequencing of RTEL1 from patients predicted five out of six disease-segregating VUS to be pathogenic. We provide structural hypotheses regarding how these mutations may disrupt RTEL1 ATPase and helicase function. Spatial analysis of missense variation accurately classified candidate VUS in RTEL1 and suggests how such variants cause disease. Incorporating spatial proximity analyses into other pathogenicity prediction tools may improve accuracy for other genes and genetic diseases.
A genome-wide scan for signatures of directional selection in domesticated pigs.
Moon, Sunjin; Kim, Tae-Hun; Lee, Kyung-Tai; Kwak, Woori; Lee, Taeheon; Lee, Si-Woo; Kim, Myung-Jick; Cho, Kyuho; Kim, Namshin; Chung, Won-Hyong; Sung, Samsun; Park, Taesung; Cho, Seoae; Groenen, Martien Am; Nielsen, Rasmus; Kim, Yuseob; Kim, Heebal
2015-02-25
Animal domestication involved drastic phenotypic changes driven by strong artificial selection and also resulted in new populations of breeds, established by humans. This study aims to identify genes that show evidence of recent artificial selection during pig domestication. Whole-genome resequencing of 30 individual pigs from domesticated breeds, Landrace and Yorkshire, and 10 Asian wild boars at ~16-fold coverage was performed resulting in over 4.3 million SNPs for 19,990 genes. We constructed a comprehensive genome map of directional selection by detecting selective sweeps using an F ST-based approach that detects directional selection in lineages leading to the domesticated breeds and using a haplotype-based test that detects ongoing selective sweeps within the breeds. We show that candidate genes under selection are significantly enriched for loci implicated in quantitative traits important to pig reproduction and production. The candidate gene with the strongest signals of directional selection belongs to group III of the metabolomics glutamate receptors, known to affect brain functions associated with eating behavior, suggesting that loci under strong selection include loci involved in behaviorial traits in domesticated pigs including tameness. We show that a significant proportion of selection signatures coincide with loci that were previously inferred to affect phenotypic variation in pigs. We further identify functional enrichment related to behavior, such as signal transduction and neuronal activities, for those targets of selection during domestication in pigs.
Aschenbrenner, Anna-Katharina; Kwon, Moonhyuk; Conrad, Jürgen; Ro, Dae-Kyun; Spring, Otmar
2016-04-01
Sunflower is known to produce a variety of bisabolene-type sesquiterpenes and accumulates these substances in trichomes of leaves, stems and flowering parts. A bioinformatics approach was used to identify the enzyme responsible for the initial step in the biosynthesis of these compounds from its precursor farnesyl pyrophosphate. Based on sequence similarity with a known bisabolene synthases from Arabidopsis thaliana AtTPS12, candidate genes of Helianthus were searched in EST-database and used to design specific primers. PCR experiments identified two candidates in the RNA pool of linear glandular trichomes of sunflower. Their sequences contained the typical motifs of sesquiterpene synthases and their expression in yeast functionally characterized them as bisabolene synthases. Spectroscopic analysis identified the stereochemistry of the product of both enzymes as (Z)-γ-bisabolene. The origin of the two sunflower bisabolene synthase genes from the transcripts of linear trichomes indicates that they may be involved in the synthesis of sesquiterpenes produced in these trichomes. Comparison of the amino acid sequences of the sunflower bisabolene synthases showed high similarity with sesquiterpene synthases from other Asteracean species and indicated putative evolutionary origin from a β-farnesene synthase. Copyright © 2016 Elsevier Ltd. All rights reserved.
Identification of candidate regions for a novel Usher syndrome type II locus.
Ben Rebeh, Imen; Benzina, Zeineb; Dhouib, Houria; Hadjamor, Imen; Amyere, Mustapha; Ayadi, Leila; Turki, Khalil; Hammami, Bouthaina; Kmiha, Noureddine; Kammoun, Hassen; Hakim, Bochra; Charfedine, Ilhem; Vikkula, Miikka; Ghorbel, Abdelmonem; Ayadi, Hammadi; Masmoudi, Saber
2008-09-19
Chronic diseases affecting the inner ear and the retina cause severe impairments to our communication systems. In more than half of the cases, Usher syndrome (USH) is the origin of these double defects. Patients with USH type II (USH2) have retinitis pigmentosa (RP) that develops during puberty, moderate to severe hearing impairment with downsloping pure-tone audiogram, and normal vestibular function. Four loci and three genes are known for USH2. In this study, we proposed to localize the gene responsible for USH2 in a consanguineous family of Tunisian origin. Affected members underwent detailed ocular and audiologic characterization. One Tunisian family with USH2 and 45 healthy controls unrelated to the family were recruited. Two affected and six unaffected family members attended our study. DNA samples of eight family members were genotyped with polymorphic markers. Two-point and multipoint LOD scores were calculated using Genehunter software v2.1. Sequencing was used to investigate candidate genes. Haplotype analysis showed no significant linkage to any known USH gene or locus. A genome-wide screen, using microsatellite markers, was performed, allowing the identification of three homozygous regions in chromosomes 2, 4, and 15. We further confirmed and refined these three regions using microsatellite and single-nucleotide polymorphisms. With recessive mode of inheritance, the highest multipoint LOD score of 1.765 was identified for the candidate regions on chromosomes 4 and 15. The chromosome 15 locus is large (55 Mb), underscoring the limited number of meioses in the consanguineous pedigree. Moreover, the linked, homozygous chromosome 15q alleles, unlike those of the chromosome 2 and 4 loci, are infrequent in the local population. Thus, the data strongly suggest that the novel locus for USH2 is likely to reside on 15q. Our data provide a basis for the localization and the identification of a novel gene implicated in USH2, most likely localized on 15q.
Smedley, Damian; Kohler, Sebastian; Czeschik, Johanna Christina; ...
2014-07-30
Here, whole-exome sequencing (WES) has opened up previously unheard of possibilities for identifying novel disease genes in Mendelian disorders, only about half of which have been elucidated to date. However, interpretation of WES data remains challenging. As a result, we analyze protein–protein association (PPA) networks to identify candidate genes in the vicinity of genes previously implicated in a disease. The analysis, using a random-walk with restart (RWR) method, is adapted to the setting of WES by developing a composite variant-gene relevance score based on the rarity, location and predicted pathogenicity of variants and the RWR evaluation of genes harboring themore » variants. Benchmarking using known disease variants from 88 disease-gene families reveals that the correct gene is ranked among the top 10 candidates in ≥50% of cases, a figure which we confirmed using a prospective study of disease genes identified in 2012 and PPA data produced before that date. In conclusion, we implement our method in a freely available Web server, ExomeWalker, that displays a ranked list of candidates together with information on PPAs, frequency and predicted pathogenicity of the variants to allow quick and effective searches for candidates that are likely to reward closer investigation.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Smedley, Damian; Kohler, Sebastian; Czeschik, Johanna Christina
Here, whole-exome sequencing (WES) has opened up previously unheard of possibilities for identifying novel disease genes in Mendelian disorders, only about half of which have been elucidated to date. However, interpretation of WES data remains challenging. As a result, we analyze protein–protein association (PPA) networks to identify candidate genes in the vicinity of genes previously implicated in a disease. The analysis, using a random-walk with restart (RWR) method, is adapted to the setting of WES by developing a composite variant-gene relevance score based on the rarity, location and predicted pathogenicity of variants and the RWR evaluation of genes harboring themore » variants. Benchmarking using known disease variants from 88 disease-gene families reveals that the correct gene is ranked among the top 10 candidates in ≥50% of cases, a figure which we confirmed using a prospective study of disease genes identified in 2012 and PPA data produced before that date. In conclusion, we implement our method in a freely available Web server, ExomeWalker, that displays a ranked list of candidates together with information on PPAs, frequency and predicted pathogenicity of the variants to allow quick and effective searches for candidates that are likely to reward closer investigation.« less
X-exome sequencing of 405 unresolved families identifies seven novel intellectual disability genes.
Hu, H; Haas, S A; Chelly, J; Van Esch, H; Raynaud, M; de Brouwer, A P M; Weinert, S; Froyen, G; Frints, S G M; Laumonnier, F; Zemojtel, T; Love, M I; Richard, H; Emde, A-K; Bienek, M; Jensen, C; Hambrock, M; Fischer, U; Langnick, C; Feldkamp, M; Wissink-Lindhout, W; Lebrun, N; Castelnau, L; Rucci, J; Montjean, R; Dorseuil, O; Billuart, P; Stuhlmann, T; Shaw, M; Corbett, M A; Gardner, A; Willis-Owen, S; Tan, C; Friend, K L; Belet, S; van Roozendaal, K E P; Jimenez-Pocquet, M; Moizard, M-P; Ronce, N; Sun, R; O'Keeffe, S; Chenna, R; van Bömmel, A; Göke, J; Hackett, A; Field, M; Christie, L; Boyle, J; Haan, E; Nelson, J; Turner, G; Baynam, G; Gillessen-Kaesbach, G; Müller, U; Steinberger, D; Budny, B; Badura-Stronka, M; Latos-Bieleńska, A; Ousager, L B; Wieacker, P; Rodríguez Criado, G; Bondeson, M-L; Annerén, G; Dufke, A; Cohen, M; Van Maldergem, L; Vincent-Delorme, C; Echenne, B; Simon-Bouy, B; Kleefstra, T; Willemsen, M; Fryns, J-P; Devriendt, K; Ullmann, R; Vingron, M; Wrogemann, K; Wienker, T F; Tzschach, A; van Bokhoven, H; Gecz, J; Jentsch, T J; Chen, W; Ropers, H-H; Kalscheuer, V M
2016-01-01
X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or loci are yet to be identified. Here, we have investigated 405 unresolved families with XLID. We employed massively parallel sequencing of all X-chromosome exons in the index males. The majority of these males were previously tested negative for copy number variations and for mutations in a subset of known XLID genes by Sanger sequencing. In total, 745 X-chromosomal genes were screened. After stringent filtering, a total of 1297 non-recurrent exonic variants remained for prioritization. Co-segregation analysis of potential clinically relevant changes revealed that 80 families (20%) carried pathogenic variants in established XLID genes. In 19 families, we detected likely causative protein truncating and missense variants in 7 novel and validated XLID genes (CLCN4, CNKSR2, FRMPD4, KLHL15, LAS1L, RLIM and USP27X) and potentially deleterious variants in 2 novel candidate XLID genes (CDK16 and TAF1). We show that the CLCN4 and CNKSR2 variants impair protein functions as indicated by electrophysiological studies and altered differentiation of cultured primary neurons from Clcn4(-/-) mice or after mRNA knock-down. The newly identified and candidate XLID proteins belong to pathways and networks with established roles in cognitive function and intellectual disability in particular. We suggest that systematic sequencing of all X-chromosomal genes in a cohort of patients with genetic evidence for X-chromosome locus involvement may resolve up to 58% of Fragile X-negative cases.
X-exome sequencing of 405 unresolved families identifies seven novel intellectual disability genes
Hu, H; Haas, S A; Chelly, J; Van Esch, H; Raynaud, M; de Brouwer, A P M; Weinert, S; Froyen, G; Frints, S G M; Laumonnier, F; Zemojtel, T; Love, M I; Richard, H; Emde, A-K; Bienek, M; Jensen, C; Hambrock, M; Fischer, U; Langnick, C; Feldkamp, M; Wissink-Lindhout, W; Lebrun, N; Castelnau, L; Rucci, J; Montjean, R; Dorseuil, O; Billuart, P; Stuhlmann, T; Shaw, M; Corbett, M A; Gardner, A; Willis-Owen, S; Tan, C; Friend, K L; Belet, S; van Roozendaal, K E P; Jimenez-Pocquet, M; Moizard, M-P; Ronce, N; Sun, R; O'Keeffe, S; Chenna, R; van Bömmel, A; Göke, J; Hackett, A; Field, M; Christie, L; Boyle, J; Haan, E; Nelson, J; Turner, G; Baynam, G; Gillessen-Kaesbach, G; Müller, U; Steinberger, D; Budny, B; Badura-Stronka, M; Latos-Bieleńska, A; Ousager, L B; Wieacker, P; Rodríguez Criado, G; Bondeson, M-L; Annerén, G; Dufke, A; Cohen, M; Van Maldergem, L; Vincent-Delorme, C; Echenne, B; Simon-Bouy, B; Kleefstra, T; Willemsen, M; Fryns, J-P; Devriendt, K; Ullmann, R; Vingron, M; Wrogemann, K; Wienker, T F; Tzschach, A; van Bokhoven, H; Gecz, J; Jentsch, T J; Chen, W; Ropers, H-H; Kalscheuer, V M
2016-01-01
X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or loci are yet to be identified. Here, we have investigated 405 unresolved families with XLID. We employed massively parallel sequencing of all X-chromosome exons in the index males. The majority of these males were previously tested negative for copy number variations and for mutations in a subset of known XLID genes by Sanger sequencing. In total, 745 X-chromosomal genes were screened. After stringent filtering, a total of 1297 non-recurrent exonic variants remained for prioritization. Co-segregation analysis of potential clinically relevant changes revealed that 80 families (20%) carried pathogenic variants in established XLID genes. In 19 families, we detected likely causative protein truncating and missense variants in 7 novel and validated XLID genes (CLCN4, CNKSR2, FRMPD4, KLHL15, LAS1L, RLIM and USP27X) and potentially deleterious variants in 2 novel candidate XLID genes (CDK16 and TAF1). We show that the CLCN4 and CNKSR2 variants impair protein functions as indicated by electrophysiological studies and altered differentiation of cultured primary neurons from Clcn4−/− mice or after mRNA knock-down. The newly identified and candidate XLID proteins belong to pathways and networks with established roles in cognitive function and intellectual disability in particular. We suggest that systematic sequencing of all X-chromosomal genes in a cohort of patients with genetic evidence for X-chromosome locus involvement may resolve up to 58% of Fragile X-negative cases. PMID:25644381
Advances in asthma and allergy genetics in 2007.
Vercelli, Donata
2008-08-01
This review discusses the main advances in the genetics of asthma and allergy published in the Journal in 2007. The association studies discussed herein addressed 3 main topics: the effect of the environment and gene-environment interactions on asthma/allergy susceptibility, the contribution of T(H)2 immunity gene variants to allergic inflammation, and the role of filaggrin mutations in atopic dermatitis and associated phenotypes. Other articles revealed novel, potentially important candidate genes or confirmed known ones. Collectively, the works published in 2007 reiterate that allergy and asthma are typical complex diseases; that is, they are disorders in which intricate interactions among environmental and genetic factors modify disease susceptibility by altering the fundamental structural and functional properties of target organs at critical developmental windows.
Chen, Shuowen; Khan, Muhammad J.; Loor, Juan J.
2013-01-01
Characterization and biological roles of the peroxisome proliferator-activated receptor (PPAR) isotypes are well known in monogastrics, but not in ruminants. However, a wealth of information has accumulated in little more than a decade on ruminant PPARs including isotype tissue distribution, response to synthetic and natural agonists, gene targets, and factors affecting their expression. Functional characterization demonstrated that, as in monogastrics, the PPAR isotypes control expression of genes involved in lipid metabolism, anti-inflammatory response, development, and growth. Contrary to mouse, however, the PPARγ gene network appears to controls milk fat synthesis in lactating ruminants. As in monogastrics, PPAR isotypes in ruminants are activated by long-chain fatty acids, therefore, making them ideal candidates for fine-tuning metabolism in this species via nutrients. In this regard, using information accumulated in ruminants and monogastrics, we propose a model of PPAR isotype-driven biological functions encompassing key tissues during the peripartal period in dairy cattle. PMID:23737762
Jiang, Li; Edwards, Stefan M; Thomsen, Bo; Workman, Christopher T; Guldbrandtsen, Bernt; Sørensen, Peter
2014-09-24
Prioritizing genetic variants is a challenge because disease susceptibility loci are often located in genes of unknown function or the relationship with the corresponding phenotype is unclear. A global data-mining exercise on the biomedical literature can establish the phenotypic profile of genes with respect to their connection to disease phenotypes. The importance of protein-protein interaction networks in the genetic heterogeneity of common diseases or complex traits is becoming increasingly recognized. Thus, the development of a network-based approach combined with phenotypic profiling would be useful for disease gene prioritization. We developed a random-set scoring model and implemented it to quantify phenotype relevance in a network-based disease gene-prioritization approach. We validated our approach based on different gene phenotypic profiles, which were generated from PubMed abstracts, OMIM, and GeneRIF records. We also investigated the validity of several vocabulary filters and different likelihood thresholds for predicted protein-protein interactions in terms of their effect on the network-based gene-prioritization approach, which relies on text-mining of the phenotype data. Our method demonstrated good precision and sensitivity compared with those of two alternative complex-based prioritization approaches. We then conducted a global ranking of all human genes according to their relevance to a range of human diseases. The resulting accurate ranking of known causal genes supported the reliability of our approach. Moreover, these data suggest many promising novel candidate genes for human disorders that have a complex mode of inheritance. We have implemented and validated a network-based approach to prioritize genes for human diseases based on their phenotypic profile. We have devised a powerful and transparent tool to identify and rank candidate genes. Our global gene prioritization provides a unique resource for the biological interpretation of data from genome-wide association studies, and will help in the understanding of how the associated genetic variants influence disease or quantitative phenotypes.
Evolutionary transgenomics: prospects and challenges.
Correa, Raul; Baum, David A
2015-01-01
Many advances in our understanding of the genetic basis of species differences have arisen from transformation experiments, which allow us to study the effect of genes from one species (the donor) when placed in the genetic background of another species (the recipient). Such interspecies transformation experiments are usually focused on candidate genes - genes that, based on work in model systems, are suspected to be responsible for certain phenotypic differences between the donor and recipient species. We suggest that the high efficiency of transformation in a few plant species, most notably Arabidopsis thaliana, combined with the small size of typical plant genes and their cis-regulatory regions allow implementation of a screening strategy that does not depend upon a priori candidate gene identification. This approach, transgenomics, entails moving many large genomic inserts of a donor species into the wild type background of a recipient species and then screening for dominant phenotypic effects. As a proof of concept, we recently conducted a transgenomic screen that analyzed more than 1100 random, large genomic inserts of the Alabama gladecress Leavenworthia alabamica for dominant phenotypic effects in the A. thaliana background. This screen identified one insert that shortens fruit and decreases A. thaliana fertility. In this paper we discuss the principles of transgenomic screens and suggest methods to help minimize the frequencies of false positive and false negative results. We argue that, because transgenomics avoids committing in advance to candidate genes it has the potential to help us identify truly novel genes or cryptic functions of known genes. Given the valuable knowledge that is likely to be gained, we believe the time is ripe for the plant evolutionary community to invest in transgenomic screens, at least in the mustard family Brassicaceae where many species are amenable to efficient transformation.
Daware, Anurag; Das, Sweta; Srivastava, Rishi; Badoni, Saurabh; Singh, Ashok K.; Agarwal, Pinky; Parida, Swarup K.; Tyagi, Akhilesh K.
2016-01-01
Development and use of genome-wide informative simple sequence repeat (SSR) markers and novel integrated genomic strategies are vital to drive genomics-assisted breeding applications and for efficient dissection of quantitative trait loci (QTLs) underlying complex traits in rice. The present study developed 6244 genome-wide informative SSR markers exhibiting in silico fragment length polymorphism based on repeat-unit variations among genomic sequences of 11 indica, japonica, aus, and wild rice accessions. These markers were mapped on diverse coding and non-coding sequence components of known cloned/candidate genes annotated from 12 chromosomes and revealed a much higher amplification (97%) and polymorphic potential (88%) along with wider genetic/functional diversity level (16–74% with a mean 53%) especially among accessions belonging to indica cultivar group, suggesting their utility in large-scale genomics-assisted breeding applications in rice. A high-density 3791 SSR markers-anchored genetic linkage map (IR 64 × Sonasal) spanning 2060 cM total map-length with an average inter-marker distance of 0.54 cM was generated. This reference genetic map identified six major genomic regions harboring robust QTLs (31% combined phenotypic variation explained with a 5.7–8.7 LOD) governing grain weight on six rice chromosomes. One strong grain weight major QTL region (OsqGW5.1) was narrowed-down by integrating traditional QTL mapping with high-resolution QTL region-specific integrated SSR and single nucleotide polymorphism markers-based QTL-seq analysis and differential expression profiling. This led us to delineate two natural allelic variants in two known cis-regulatory elements (RAV1AAT and CARGCW8GAT) of glycosyl hydrolase and serine carboxypeptidase genes exhibiting pronounced seed-specific differential regulation in low (Sonasal) and high (IR 64) grain weight mapping parental accessions. Our genome-wide SSR marker resource (polymorphic within/between diverse cultivar groups) and integrated genomic strategy can efficiently scan functionally relevant potential molecular tags (markers, candidate genes and alleles) regulating complex agronomic traits (grain weight) and expedite marker-assisted genetic enhancement in rice. PMID:27833617
Genomic Locus Modulating IOP in the BXD RI Mouse Strains
King, Rebecca; Li, Ying; Wang, Jiaxing; Struebing, Felix L.; Geisert, Eldon E.
2018-01-01
Intraocular pressure (IOP) is the primary risk factor for developing glaucoma, yet little is known about the contribution of genomic background to IOP regulation. The present study leverages an array of systems genetics tools to study genomic factors modulating normal IOP in the mouse. The BXD recombinant inbred (RI) strain set was used to identify genomic loci modulating IOP. We measured the IOP in a total of 506 eyes from 38 different strains. Strain averages were subjected to conventional quantitative trait analysis by means of composite interval mapping. Candidate genes were defined, and immunohistochemistry and quantitative PCR (qPCR) were used for validation. Of the 38 BXD strains examined the mean IOP ranged from a low of 13.2mmHg to a high of 17.1mmHg. The means for each strain were used to calculate a genome wide interval map. One significant quantitative trait locus (QTL) was found on Chr.8 (96 to 103 Mb). Within this 7 Mb region only 4 annotated genes were found: Gm15679, Cdh8, Cdh11 and Gm8730. Only two genes (Cdh8 and Cdh11) were candidates for modulating IOP based on the presence of non-synonymous SNPs. Further examination using SIFT (Sorting Intolerant From Tolerant) analysis revealed that the SNPs in Cdh8 (Cadherin 8) were predicted to not change protein function; while the SNPs in Cdh11 (Cadherin 11) would not be tolerated, affecting protein function. Furthermore, immunohistochemistry demonstrated that CDH11 is expressed in the trabecular meshwork of the mouse. We have examined the genomic regulation of IOP in the BXD RI strain set and found one significant QTL on Chr. 8. Within this QTL, there is one good candidate gene, Cdh11. PMID:29496776
Klangnurak, Wanlada; Fukuyo, Taketo; Rezanujjaman, M D; Seki, Masahide; Sugano, Sumio; Suzuki, Yutaka; Tokumoto, Toshinobu
2018-01-01
We previously reported the microarray-based selection of three ovulation-related genes in zebrafish. We used a different selection method in this study, RNA sequencing analysis. An additional eight up-regulated candidates were found as specifically up-regulated genes in ovulation-induced samples. Changes in gene expression were confirmed by qPCR analysis. Furthermore, up-regulation prior to ovulation during natural spawning was verified in samples from natural pairing. Gene knock-out zebrafish strains of one of the candidates, the starmaker gene (stm), were established by CRISPR genome editing techniques. Unexpectedly, homozygous mutants were fertile and could spawn eggs. However, a high percentage of unfertilized eggs and abnormal embryos were produced from these homozygous females. The results suggest that the stm gene is necessary for fertilization. In this study, we selected additional ovulation-inducing candidate genes, and a novel function of the stm gene was investigated.
ICan: an integrated co-alteration network to identify ovarian cancer-related genes.
Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan
2015-01-01
Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data.
ICan: An Integrated Co-Alteration Network to Identify Ovarian Cancer-Related Genes
Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan
2015-01-01
Background Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. Results We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). Conclusion In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data. PMID:25803614
Identification and characterization of nuclear genes involved in photosynthesis in Populus
2014-01-01
Background The gap between the real and potential photosynthetic rate under field conditions suggests that photosynthesis could potentially be improved. Nuclear genes provide possible targets for improving photosynthetic efficiency. Hence, genome-wide identification and characterization of the nuclear genes affecting photosynthetic traits in woody plants would provide key insights on genetic regulation of photosynthesis and identify candidate processes for improvement of photosynthesis. Results Using microarray and bulked segregant analysis strategies, we identified differentially expressed nuclear genes for photosynthesis traits in a segregating population of poplar. We identified 515 differentially expressed genes in this population (FC ≥ 2 or FC ≤ 0.5, P < 0.05), 163 up-regulated and 352 down-regulated. Real-time PCR expression analysis confirmed the microarray data. Singular Enrichment Analysis identified 48 significantly enriched GO terms for molecular functions (28), biological processes (18) and cell components (2). Furthermore, we selected six candidate genes for functional examination by a single-marker association approach, which demonstrated that 20 SNPs in five candidate genes significantly associated with photosynthetic traits, and the phenotypic variance explained by each SNP ranged from 2.3% to 12.6%. This revealed that regulation of photosynthesis by the nuclear genome mainly involves transport, metabolism and response to stimulus functions. Conclusions This study provides new genome-scale strategies for the discovery of potential candidate genes affecting photosynthesis in Populus, and for identification of the functions of genes involved in regulation of photosynthesis. This work also suggests that improving photosynthetic efficiency under field conditions will require the consideration of multiple factors, such as stress responses. PMID:24673936
Li, Yongsheng; Sahni, Nidhi; Yi, Song
2016-11-29
Comprehensive understanding of human cancer mechanisms requires the identification of a thorough list of cancer-associated genes, which could serve as biomarkers for diagnoses and therapies in various types of cancer. Although substantial progress has been made in functional studies to uncover genes involved in cancer, these efforts are often time-consuming and costly. Therefore, it remains challenging to comprehensively identify cancer candidate genes. Network-based methods have accelerated this process through the analysis of complex molecular interactions in the cell. However, the extent to which various interactome networks can contribute to prediction of candidate genes responsible for cancer is still enigmatic. In this study, we evaluated different human protein-protein interactome networks and compared their application to cancer gene prioritization. Our results indicate that network analyses can increase the power to identify novel cancer genes. In particular, such predictive power can be enhanced with the use of unbiased systematic protein interaction maps for cancer gene prioritization. Functional analysis reveals that the top ranked genes from network predictions co-occur often with cancer-related terms in literature, and further, these candidate genes are indeed frequently mutated across cancers. Finally, our study suggests that integrating interactome networks with other omics datasets could provide novel insights into cancer-associated genes and underlying molecular mechanisms.
Trubiroha, A; Gillotay, P; Giusti, N; Gacquer, D; Libert, F; Lefort, A; Haerlingen, B; De Deken, X; Opitz, R; Costagliola, S
2018-04-04
The foregut endoderm gives rise to several organs including liver, pancreas, lung and thyroid with important roles in human physiology. Understanding which genes and signalling pathways regulate their development is crucial for understanding developmental disorders as well as diseases in adulthood. We exploited unique advantages of the zebrafish model to develop a rapid and scalable CRISPR/Cas-based mutagenesis strategy aiming at the identification of genes involved in morphogenesis and function of the thyroid. Core elements of the mutagenesis assay comprise bi-allelic gene invalidation in somatic mutants, a non-invasive monitoring of thyroid development in live transgenic fish, complementary analyses of thyroid function in fixed specimens and quantitative analyses of mutagenesis efficiency by Illumina sequencing of individual fish. We successfully validated our mutagenesis-phenotyping strategy in experiments targeting genes with known functions in early thyroid morphogenesis (pax2a, nkx2.4b) and thyroid functional differentiation (duox, duoxa, tshr). We also demonstrate that duox and duoxa crispants phenocopy thyroid phenotypes previously observed in human patients with bi-allelic DUOX2 and DUOXA2 mutations. The proposed combination of efficient mutagenesis protocols, rapid non-invasive phenotyping and sensitive genotyping holds great potential to systematically characterize the function of larger candidate gene panels during thyroid development and is applicable to other organs and tissues.
A current view of Alzheimer's disease.
Hooli, Basavaraj V; Tanzi, Rudolph E
2009-07-08
Several genes that influence susceptibility to Alzheimer's disease (AD) have been known for over two decades. Recent advances have elucidated novel candidate genes and the pathogenetic mechanisms underlying neurodegeneration in AD. Here, we summarize what we have learned from studies of the known AD genes with regard to the causes of AD and emerging therapies. We also review key recent discoveries that have enhanced our understanding of the etiology and pathogenesis of this devastating disease, based on new investigations into the genes and molecular mechanisms underlying AD.
USDA-ARS?s Scientific Manuscript database
Large-scale screens of the maize genome identified 48 genes that show the putative signature of artificial selection during maize domestication or improvement. These selection-candidate genes may act as quantitative trait loci (QTL) that control the phenotypic differences between maize and its proge...
Metagenomic gene annotation by a homology-independent approach
DOE Office of Scientific and Technical Information (OSTI.GOV)
Froula, Jeff; Zhang, Tao; Salmeen, Annette
2011-06-02
Fully understanding the genetic potential of a microbial community requires functional annotation of all the genes it encodes. The recently developed deep metagenome sequencing approach has enabled rapid identification of millions of genes from a complex microbial community without cultivation. Current homology-based gene annotation fails to detect distantly-related or structural homologs. Furthermore, homology searches with millions of genes are very computational intensive. To overcome these limitations, we developed rhModeller, a homology-independent software pipeline to efficiently annotate genes from metagenomic sequencing projects. Using cellulases and carbonic anhydrases as two independent test cases, we demonstrated that rhModeller is much faster than HMMERmore » but with comparable accuracy, at 94.5percent and 99.9percent accuracy, respectively. More importantly, rhModeller has the ability to detect novel proteins that do not share significant homology to any known protein families. As {approx}50percent of the 2 million genes derived from the cow rumen metagenome failed to be annotated based on sequence homology, we tested whether rhModeller could be used to annotate these genes. Preliminary results suggest that rhModeller is robust in the presence of missense and frameshift mutations, two common errors in metagenomic genes. Applying the pipeline to the cow rumen genes identified 4,990 novel cellulases candidates and 8,196 novel carbonic anhydrase candidates.In summary, we expect rhModeller to dramatically increase the speed and quality of metagnomic gene annotation.« less
Nitric oxide signaling in the development and evolution of language and cognitive circuits.
Funk, Owen H; Kwan, Kenneth Y
2014-09-01
The neocortex underlies not only remarkable motor and sensory capabilities, but also some of our most distinctly human cognitive functions. The emergence of these higher functions during evolution was accompanied by structural changes in the neocortex, including the acquisition of areal specializations such as Broca's speech and language area. The study of these evolutionary mechanisms, which likely involve species-dependent gene expression and function, represents a substantial challenge. These species differences, however, may represent valuable opportunities to understand the molecular underpinnings of neocortical evolution. Here, we discuss nitric oxide signaling as a candidate mechanism in the assembly of neocortical circuits underlying language and higher cognitive functions. This hypothesis was based on the highly specific mid-fetal pattern of nitric oxide synthase 1 (NOS1, previously nNOS) expression in the pyramidal (projection) neurons of two human neocortical areas respectively involved in speech and language, and higher cognition; the frontal operculum (FOp) and the anterior cingulate cortex (ACC). This expression is transiently present during mid-gestation, suggesting that NOS1 may be involved in the development of these areas and the assembly of their neural circuits. As no other gene product is known to exhibit such exquisite spatiotemporal expression, NOS1 represents a remarkable candidate for these functions. Copyright © 2014 Elsevier Ireland Ltd and the Japan Neuroscience Society. All rights reserved.
Zhong, Chao; Sun, Suli; Li, Yinping; Duan, Canxing; Zhu, Zhendong
2018-03-01
A novel Phytophthora sojae resistance gene RpsHC18 was identified and finely mapped on soybean chromosome 3. Two NBS-LRR candidate genes were identified and two diagnostic markers of RpsHC18 were developed. Phytophthora root rot caused by Phytophthora sojae is a destructive disease of soybean. The most effective disease-control strategy is to deploy resistant cultivars carrying Phytophthora-resistant Rps genes. The soybean cultivar Huachun 18 has a broad and distinct resistance spectrum to 12 P. sojae isolates. Quantitative trait loci sequencing (QTL-seq), based on the whole-genome resequencing (WGRS) of two extreme resistant and susceptible phenotype bulks from an F 2:3 population, was performed, and one 767-kb genomic region with ΔSNP-index ≥ 0.9 on chromosome 3 was identified as the RpsHC18 candidate region in Huachun 18. The candidate region was reduced to a 146-kb region by fine mapping. Nonsynonymous SNP and haplotype analyses were carried out in the 146-kb region among ten soybean genotypes using WGRS. Four specific nonsynonymous SNPs were identified in two nucleotide-binding sites-leucine-rich repeat (NBS-LRR) genes, RpsHC18-NBL1 and RpsHC18-NBL2, which were considered to be the candidate genes. Finally, one specific SNP marker in each candidate gene was successfully developed using a tetra-primer ARMS-PCR assay, and the two markers were verified to be specific for RpsHC18 and to effectively distinguish other known Rps genes. In this study, we applied an integrated genomic-based strategy combining WGRS with traditional genetic mapping to identify RpsHC18 candidate genes and develop diagnostic markers. These results suggest that next-generation sequencing is a precise, rapid and cost-effective way to identify candidate genes and develop diagnostic markers, and it can accelerate Rps gene cloning and marker-assisted selection for breeding of P. sojae-resistant soybean cultivars.
Identification of Inherited Retinal Disease-Associated Genetic Variants in 11 Candidate Genes.
Astuti, Galuh D N; van den Born, L Ingeborgh; Khan, M Imran; Hamel, Christian P; Bocquet, Béatrice; Manes, Gaël; Quinodoz, Mathieu; Ali, Manir; Toomes, Carmel; McKibbin, Martin; El-Asrag, Mohammed E; Haer-Wigman, Lonneke; Inglehearn, Chris F; Black, Graeme C M; Hoyng, Carel B; Cremers, Frans P M; Roosing, Susanne
2018-01-10
Inherited retinal diseases (IRDs) display an enormous genetic heterogeneity. Whole exome sequencing (WES) recently identified genes that were mutated in a small proportion of IRD cases. Consequently, finding a second case or family carrying pathogenic variants in the same candidate gene often is challenging. In this study, we searched for novel candidate IRD gene-associated variants in isolated IRD families, assessed their causality, and searched for novel genotype-phenotype correlations. Whole exome sequencing was performed in 11 probands affected with IRDs. Homozygosity mapping data was available for five cases. Variants with minor allele frequencies ≤ 0.5% in public databases were selected as candidate disease-causing variants. These variants were ranked based on their: (a) presence in a gene that was previously implicated in IRD; (b) minor allele frequency in the Exome Aggregation Consortium database (ExAC); (c) in silico pathogenicity assessment using the combined annotation dependent depletion (CADD) score; and (d) interaction of the corresponding protein with known IRD-associated proteins. Twelve unique variants were found in 11 different genes in 11 IRD probands. Novel autosomal recessive and dominant inheritance patterns were found for variants in Small Nuclear Ribonucleoprotein U5 Subunit 200 ( SNRNP200 ) and Zinc Finger Protein 513 ( ZNF513 ), respectively. Using our pathogenicity assessment, a variant in DEAH-Box Helicase 32 ( DHX32 ) was the top ranked novel candidate gene to be associated with IRDs, followed by eight medium and lower ranked candidate genes. The identification of candidate disease-associated sequence variants in 11 single families underscores the notion that the previously identified IRD-associated genes collectively carry > 90% of the defects implicated in IRDs. To identify multiple patients or families with variants in the same gene and thereby provide extra proof for pathogenicity, worldwide data sharing is needed.
Hochfeld, Lara M; Anhalt, Thomas; Reinbold, Céline S; Herrera-Rivero, Marisol; Fricker, Nadine; Nöthen, Markus M; Heilmann-Heimbach, Stefanie
2017-02-22
Human hair follicle (HF) cycling is characterised by the tight orchestration and regulation of signalling cascades. Research shows that micro(mi)RNAs are potent regulators of these pathways. However, knowledge of the expression of miRNAs and their target genes and pathways in the human HF is limited. The objective of this study was to improve understanding of the role of miRNAs and their regulatory interactions in the human HF. Expression levels of ten candidate miRNAs with reported functions in hair biology were assessed in HFs from 25 healthy male donors. MiRNA expression levels were correlated with mRNA-expression levels from the same samples. Identified target genes were tested for enrichment in biological pathways and accumulation in protein-protein interaction (PPI) networks. Expression in the human HF was confirmed for seven of the ten candidate miRNAs, and numerous target genes for miR-24, miR-31, and miR-106a were identified. While the latter include several genes with known functions in hair biology (e.g., ITGB1, SOX9), the majority have not been previously implicated (e.g., PHF1). Target genes were enriched in pathways of interest to hair biology, such as integrin and GnRH signalling, and the respective gene products showed accumulation in PPIs. Further investigation of miRNA expression in the human HF, and the identification of novel miRNA target genes and pathways via the systematic integration of miRNA and mRNA expression data, may facilitate the delineation of tissue-specific regulatory interactions, and improve our understanding of both normal hair growth and the pathobiology of hair loss disorders.
Lata, Charu; Mishra, Awdhesh Kumar; Muthamilarasan, Mehanathan; Bonthala, Venkata Suresh; Khan, Yusuf; Prasad, Manoj
2014-01-01
The APETALA2/ethylene-responsive element binding factor (AP2/ERF) family is one of the largest transcription factor (TF) families in plants that includes four major sub-families, namely AP2, DREB (dehydration responsive element binding), ERF (ethylene responsive factors) and RAV (Related to ABI3/VP). AP2/ERFs are known to play significant roles in various plant processes including growth and development and biotic and abiotic stress responses. Considering this, a comprehensive genome-wide study was conducted in foxtail millet (Setaria italica L.). A total of 171 AP2/ERF genes were identified by systematic sequence analysis and were physically mapped onto nine chromosomes. Phylogenetic analysis grouped AP2/ERF genes into six classes (I to VI). Duplication analysis revealed that 12 (∼7%) SiAP2/ERF genes were tandem repeated and 22 (∼13%) were segmentally duplicated. Comparative physical mapping between foxtail millet AP2/ERF genes and its orthologs of sorghum (18 genes), maize (14 genes), rice (9 genes) and Brachypodium (6 genes) showed the evolutionary insights of AP2/ERF gene family and also the decrease in orthology with increase in phylogenetic distance. The evolutionary significance in terms of gene-duplication and divergence was analyzed by estimating synonymous and non-synonymous substitution rates. Expression profiling of candidate AP2/ERF genes against drought, salt and phytohormones revealed insights into their precise and/or overlapping expression patterns which could be responsible for their functional divergence in foxtail millet. The study showed that the genes SiAP2/ERF-069, SiAP2/ERF-103 and SiAP2/ERF-120 may be considered as potential candidate genes for further functional validation as well for utilization in crop improvement programs for stress resistance since these genes were up-regulated under drought and salinity stresses in ABA dependent manner. Altogether the present study provides new insights into evolution, divergence and systematic functional analysis of AP2/ERF gene family at genome level in foxtail millet which may be utilized for improving stress adaptation and tolerance in millets, cereals and bioenergy grasses. PMID:25409524
Lata, Charu; Mishra, Awdhesh Kumar; Muthamilarasan, Mehanathan; Bonthala, Venkata Suresh; Khan, Yusuf; Prasad, Manoj
2014-01-01
The APETALA2/ethylene-responsive element binding factor (AP2/ERF) family is one of the largest transcription factor (TF) families in plants that includes four major sub-families, namely AP2, DREB (dehydration responsive element binding), ERF (ethylene responsive factors) and RAV (Related to ABI3/VP). AP2/ERFs are known to play significant roles in various plant processes including growth and development and biotic and abiotic stress responses. Considering this, a comprehensive genome-wide study was conducted in foxtail millet (Setaria italica L.). A total of 171 AP2/ERF genes were identified by systematic sequence analysis and were physically mapped onto nine chromosomes. Phylogenetic analysis grouped AP2/ERF genes into six classes (I to VI). Duplication analysis revealed that 12 (∼7%) SiAP2/ERF genes were tandem repeated and 22 (∼13%) were segmentally duplicated. Comparative physical mapping between foxtail millet AP2/ERF genes and its orthologs of sorghum (18 genes), maize (14 genes), rice (9 genes) and Brachypodium (6 genes) showed the evolutionary insights of AP2/ERF gene family and also the decrease in orthology with increase in phylogenetic distance. The evolutionary significance in terms of gene-duplication and divergence was analyzed by estimating synonymous and non-synonymous substitution rates. Expression profiling of candidate AP2/ERF genes against drought, salt and phytohormones revealed insights into their precise and/or overlapping expression patterns which could be responsible for their functional divergence in foxtail millet. The study showed that the genes SiAP2/ERF-069, SiAP2/ERF-103 and SiAP2/ERF-120 may be considered as potential candidate genes for further functional validation as well for utilization in crop improvement programs for stress resistance since these genes were up-regulated under drought and salinity stresses in ABA dependent manner. Altogether the present study provides new insights into evolution, divergence and systematic functional analysis of AP2/ERF gene family at genome level in foxtail millet which may be utilized for improving stress adaptation and tolerance in millets, cereals and bioenergy grasses.
Language Impairments in ASD Resulting from a Failed Domestication of the Human Brain
Benítez-Burraco, Antonio; Lattanzi, Wanda; Murphy, Elliot
2016-01-01
Autism spectrum disorders (ASD) are pervasive neurodevelopmental disorders entailing social and cognitive deficits, including marked problems with language. Numerous genes have been associated with ASD, but it is unclear how language deficits arise from gene mutation or dysregulation. It is also unclear why ASD shows such high prevalence within human populations. Interestingly, the emergence of a modern faculty of language has been hypothesized to be linked to changes in the human brain/skull, but also to the process of self-domestication of the human species. It is our intention to show that people with ASD exhibit less marked domesticated traits at the morphological, physiological, and behavioral levels. We also discuss many ASD candidates represented among the genes known to be involved in the “domestication syndrome” (the constellation of traits exhibited by domesticated mammals, which seemingly results from the hypofunction of the neural crest) and among the set of genes involved in language function closely connected to them. Moreover, many of these genes show altered expression profiles in the brain of autists. In addition, some candidates for domestication and language-readiness show the same expression profile in people with ASD and chimps in different brain areas involved in language processing. Similarities regarding the brain oscillatory behavior of these areas can be expected too. We conclude that ASD may represent an abnormal ontogenetic itinerary for the human faculty of language resulting in part from changes in genes important for the “domestication syndrome” and, ultimately, from the normal functioning of the neural crest. PMID:27621700
Fujimoto, Akihiro; Okada, Yukinori; Boroevich, Keith A; Tsunoda, Tatsuhiko; Taniguchi, Hiroaki; Nakagawa, Hidewaki
2016-05-26
Protein tertiary structure determines molecular function, interaction, and stability of the protein, therefore distribution of mutation in the tertiary structure can facilitate the identification of new driver genes in cancer. To analyze mutation distribution in protein tertiary structures, we applied a novel three dimensional permutation test to the mutation positions. We analyzed somatic mutation datasets of 21 types of cancers obtained from exome sequencing conducted by the TCGA project. Of the 3,622 genes that had ≥3 mutations in the regions with tertiary structure data, 106 genes showed significant skew in mutation distribution. Known tumor suppressors and oncogenes were significantly enriched in these identified cancer gene sets. Physical distances between mutations in known oncogenes were significantly smaller than those of tumor suppressors. Twenty-three genes were detected in multiple cancers. Candidate genes with significant skew of the 3D mutation distribution included kinases (MAPK1, EPHA5, ERBB3, and ERBB4), an apoptosis related gene (APP), an RNA splicing factor (SF1), a miRNA processing factor (DICER1), an E3 ubiquitin ligase (CUL1) and transcription factors (KLF5 and EEF1B2). Our study suggests that systematic analysis of mutation distribution in the tertiary protein structure can help identify cancer driver genes.
Fujimoto, Akihiro; Okada, Yukinori; Boroevich, Keith A.; Tsunoda, Tatsuhiko; Taniguchi, Hiroaki; Nakagawa, Hidewaki
2016-01-01
Protein tertiary structure determines molecular function, interaction, and stability of the protein, therefore distribution of mutation in the tertiary structure can facilitate the identification of new driver genes in cancer. To analyze mutation distribution in protein tertiary structures, we applied a novel three dimensional permutation test to the mutation positions. We analyzed somatic mutation datasets of 21 types of cancers obtained from exome sequencing conducted by the TCGA project. Of the 3,622 genes that had ≥3 mutations in the regions with tertiary structure data, 106 genes showed significant skew in mutation distribution. Known tumor suppressors and oncogenes were significantly enriched in these identified cancer gene sets. Physical distances between mutations in known oncogenes were significantly smaller than those of tumor suppressors. Twenty-three genes were detected in multiple cancers. Candidate genes with significant skew of the 3D mutation distribution included kinases (MAPK1, EPHA5, ERBB3, and ERBB4), an apoptosis related gene (APP), an RNA splicing factor (SF1), a miRNA processing factor (DICER1), an E3 ubiquitin ligase (CUL1) and transcription factors (KLF5 and EEF1B2). Our study suggests that systematic analysis of mutation distribution in the tertiary protein structure can help identify cancer driver genes. PMID:27225414
Genome-wide Association Study Identifies Candidate Genes for Male Fertility Traits in Humans
Kosova, Gülüm; Scott, Nicole M.; Niederberger, Craig; Prins, Gail S.; Ober, Carole
2012-01-01
Despite the fact that hundreds of genes are known to affect fertility in animal models, relatively little is known about genes that influence natural fertility in humans. To broadly survey genes contributing to variation in male fertility, we conducted a genome-wide association study (GWAS) of two fertility traits (family size and birth rate) in 269 married men who are members of a founder population of European descent that proscribes contraception and has large family sizes. Associations between ∼250,000 autosomal SNPs and the fertility traits were examined. A total of 41 SNPs with p ≤ 1 × 10−4 for either trait were taken forward to a validation study of 123 ethnically diverse men from Chicago who had previously undergone semen analyses. Nine (22%) of the SNPs associated with reduced fertility in the GWAS were also associated with one or more of the ten measures of reduced sperm quantity and/or function, yielding 27 associations with p values < 0.05 and seven with p values < 0.01 in the validation study. On the basis of 5,000 permutations of our data, the probabilities of observing this many or more small p values were 0.0014 and 5.6 × 10−4, respectively. Among the nine associated loci, outstanding candidates for male fertility genes include USP8, an essential deubiquitinating enzyme that has a role in acrosome assembly; UBD and EPSTI1, which have potential roles in innate immunity; and LRRC32, which encodes a latent transforming growth factor β (TGF-β) receptor on regulatory T cells. We suggest that mutations in these genes that are more severe may account for some of the unexplained infertility (or subfertility) in the general population. PMID:22633400
Automatic annotation of protein motif function with Gene Ontology terms.
Lu, Xinghua; Zhai, Chengxiang; Gopalakrishnan, Vanathi; Buchanan, Bruce G
2004-09-02
Conserved protein sequence motifs are short stretches of amino acid sequence patterns that potentially encode the function of proteins. Several sequence pattern searching algorithms and programs exist foridentifying candidate protein motifs at the whole genome level. However, a much needed and important task is to determine the functions of the newly identified protein motifs. The Gene Ontology (GO) project is an endeavor to annotate the function of genes or protein sequences with terms from a dynamic, controlled vocabulary and these annotations serve well as a knowledge base. This paper presents methods to mine the GO knowledge base and use the association between the GO terms assigned to a sequence and the motifs matched by the same sequence as evidence for predicting the functions of novel protein motifs automatically. The task of assigning GO terms to protein motifs is viewed as both a binary classification and information retrieval problem, where PROSITE motifs are used as samples for mode training and functional prediction. The mutual information of a motif and aGO term association is found to be a very useful feature. We take advantage of the known motifs to train a logistic regression classifier, which allows us to combine mutual information with other frequency-based features and obtain a probability of correct association. The trained logistic regression model has intuitively meaningful and logically plausible parameter values, and performs very well empirically according to our evaluation criteria. In this research, different methods for automatic annotation of protein motifs have been investigated. Empirical result demonstrated that the methods have a great potential for detecting and augmenting information about the functions of newly discovered candidate protein motifs.
Huang, Kristen M; Geunes-Boyer, Scarlett; Wu, Sufen; Dutra, Amalia; Favor, Jack; Stambolian, Dwight
2004-05-01
Xcat mice display X-linked congenital cataracts and are a mouse model for the human X-linked cataract disease Nance Horan syndrome (NHS). The genetic defect in Xcat mice and NHS patients is not known. We isolated and sequenced a BAC contig representing a portion of the Xcat critical region. We combined our sequencing data with the most recent mouse sequence assemblies from both Celera and public databases. The sequence of the 2.2-Mb Xcat critical region was then analyzed for potential Xcat candidate genes. The coding regions of the seven known genes within this area (Rai2, Rbbp7, Ctps2, Calb3, Grpr, Reps2, and Syap1) were sequenced in Xcat mice and no mutations were detected. The expression of Rai2 was quantitatively identical in wild-type and Xcat mutant eyes. These results indicate that the Xcat mutation is within a novel, undiscovered gene.
2009-01-01
Background Stripe rust, caused by Puccinia striiformis f. sp. tritici (Pst), is one of the most destructive diseases of wheat (Triticum aestivum L.) worldwide. In spite of its agricultural importance, the genomics and genetics of the pathogen are poorly characterized. Pst transcripts from urediniospores and germinated urediniospores have been examined previously, but little is known about genes expressed during host infection. Some genes involved in virulence in other rust fungi have been found to be specifically expressed in haustoria. Therefore, the objective of this study was to generate a cDNA library to characterize genes expressed in haustoria of Pst. Results A total of 5,126 EST sequences of high quality were generated from haustoria of Pst, from which 287 contigs and 847 singletons were derived. Approximately 10% and 26% of the 1,134 unique sequences were homologous to proteins with known functions and hypothetical proteins, respectively. The remaining 64% of the unique sequences had no significant similarities in GenBank. Fifteen genes were predicted to be proteins secreted from Pst haustoria. Analysis of ten genes, including six secreted protein genes, using quantitative RT-PCR revealed changes in transcript levels in different developmental and infection stages of the pathogen. Conclusions The haustorial cDNA library was useful in identifying genes of the stripe rust fungus expressed during the infection process. From the library, we identified 15 genes encoding putative secreted proteins and six genes induced during the infection process. These genes are candidates for further studies to determine their functions in wheat-Pst interactions. PMID:20028560
Huang, Huiyan; Zhu, Yong; Eliot, Melissa N; Knopik, Valerie S; McGeary, John E; Carskadon, Mary A; Hart, Anne C
2017-06-01
We aimed to test a combined approach to identify conserved genes regulating sleep and to explore the association between DNA methylation and sleep length. We identified candidate genes associated with shorter versus longer sleep duration in college students based on DNA methylation using Illumina Infinium HumanMethylation450 BeadChip arrays. Orthologous genes in Caenorhabditis elegans were identified, and we examined whether their loss of function affected C. elegans sleep. For genes whose perturbation affected C. elegans sleep, we subsequently undertook a small pilot study to re-examine DNA methylation in an independent set of human participants with shorter versus longer sleep durations. Eighty-seven out of 485,577 CpG sites had significant differential methylation in young adults with shorter versus longer sleep duration, corresponding to 52 candidate genes. We identified 34 C. elegans orthologs, including NPY/flp-18 and flp-21, which are known to affect sleep. Loss of five additional genes alters developmentally timed C. elegans sleep (B4GALT6/bre-4, DOCK180/ced-5, GNB2L1/rack-1, PTPRN2/ida-1, ZFYVE28/lst-2). For one of these genes, ZFYVE28 (also known as hLst2), the pilot replication study again found decreased DNA methylation associated with shorter sleep duration at the same two CpG sites in the first intron of ZFYVE28. Using an approach that combines human epigenetics and C. elegans sleep studies, we identified five genes that play previously unidentified roles in C. elegans sleep. We suggest sleep duration in humans may be associated with differential DNA methylation at specific sites and that the conserved genes identified here likely play roles in C. elegans sleep and in other species. © Sleep Research Society 2017. Published by Oxford University Press on behalf of the Sleep Research Society. All rights reserved. For permissions, please e-mail journals.permissions@oup.com.
Whole-Exome Sequencing Identifies Novel Variants for Tooth Agenesis.
Dinckan, N; Du, R; Petty, L E; Coban-Akdemir, Z; Jhangiani, S N; Paine, I; Baugh, E H; Erdem, A P; Kayserili, H; Doddapaneni, H; Hu, J; Muzny, D M; Boerwinkle, E; Gibbs, R A; Lupski, J R; Uyguner, Z O; Below, J E; Letra, A
2018-01-01
Tooth agenesis is a common craniofacial abnormality in humans and represents failure to develop 1 or more permanent teeth. Tooth agenesis is complex, and variations in about a dozen genes have been reported as contributing to the etiology. Here, we combined whole-exome sequencing, array-based genotyping, and linkage analysis to identify putative pathogenic variants in candidate disease genes for tooth agenesis in 10 multiplex Turkish families. Novel homozygous and heterozygous variants in LRP6, DKK1, LAMA3, and COL17A1 genes, as well as known variants in WNT10A, were identified as likely pathogenic in isolated tooth agenesis. Novel variants in KREMEN1 were identified as likely pathogenic in 2 families with suspected syndromic tooth agenesis. Variants in more than 1 gene were identified segregating with tooth agenesis in 2 families, suggesting oligogenic inheritance. Structural modeling of missense variants suggests deleterious effects to the encoded proteins. Functional analysis of an indel variant (c.3607+3_6del) in LRP6 suggested that the predicted resulting mRNA is subject to nonsense-mediated decay. Our results support a major role for WNT pathways genes in the etiology of tooth agenesis while revealing new candidate genes. Moreover, oligogenic cosegregation was suggestive for complex inheritance and potentially complex gene product interactions during development, contributing to improved understanding of the genetic etiology of familial tooth agenesis.
Stankiewicz, Adrian M; Goscik, Joanna; Dyr, Wanda; Juszczak, Grzegorz R; Ryglewicz, Danuta; Swiergiel, Artur H; Wieczorek, Marek; Stefanski, Roman
2015-12-01
Animal models provide opportunity to study neurobiological aspects of human alcoholism. Changes in gene expression have been implicated in mediating brain functions, including reward system and addiction. The current study aimed to identify genes that may underlie differential ethanol preference in Warsaw High Preferring (WHP) and Warsaw Low Preferring (WLP) rats. Microarray analysis comparing gene expression in nucleus accumbens (NAc), hippocampus (HP) and medial prefrontal cortex (mPFC) was performed in male WHP and WLP rats bred for differences in ethanol preference. Differential and stable between biological repeats expression of 345, 254 and 129 transcripts in NAc, HP and mPFC was detected. Identified genes and processes included known mediators of ethanol response (Mx2, Fam111a, Itpr1, Gabra4, Agtr1a, LTP/LTD, renin-angiotensin signaling pathway), toxicity (Sult1c2a, Ces1, inflammatory response), as well as genes involved in regulation of important addiction-related brain systems such as dopamine, tachykinin or acetylcholine (Gng7, Tac4, Slc5a7). The identified candidate genes may underlie differential ethanol preference in an animal model of alcoholism. Names of genes are written in italics, while names of proteins are written in standard font. Names of human genes/proteins are written in all capital letters. Names of rodent genes/proteins are written in capital letter followed by small letters. Copyright © 2015 Elsevier Inc. All rights reserved.
Tang, Xin; Liu, Huawei; Chen, Quanmei; Wang, Xin; Xiong, Ying; Zhao, Ping
2016-01-01
The solute carrier 6 (SLC6) gene family, initially known as the neurotransmitter transporters, plays vital roles in the regulation of neurotransmitter signaling, nutrient absorption and motor behavior. In this study, a total of 16 candidate genes were identified as SLC6 family gene homologs in the silkworm (Bombyx mori) genome. Spatio-temporal expression patterns of silkworm SLC6 gene transcripts indicated that these genes were highly and specifically expressed in midgut, brain and gonads; moreover, these genes were expressed primarily at the feeding stage or adult stage. Levels of expression for most midgut-specific and midgut-enriched gene transcripts were down-regulated after starvation but up-regulated after re-feeding. In addition, we observed that expression levels of these genes except for BmSLC6-15 and BmGT1 were markedly up-regulated by a juvenile hormone analog. Moreover, brain-enriched genes showed differential expression patterns during wandering and mating processes, suggesting that these genes may be involved in modulating wandering and mating behaviors. Our results improve our understanding of the expression patterns and potential physiological functions of the SLC6 gene family, and provide valuable information for the comprehensive functional analysis of the SLC6 gene family. PMID:27706106
Tang, Xin; Liu, Huawei; Chen, Quanmei; Wang, Xin; Xiong, Ying; Zhao, Ping
2016-10-03
The solute carrier 6 (SLC6) gene family, initially known as the neurotransmitter transporters, plays vital roles in the regulation of neurotransmitter signaling, nutrient absorption and motor behavior. In this study, a total of 16 candidate genes were identified as SLC6 family gene homologs in the silkworm (Bombyx mori) genome. Spatio-temporal expression patterns of silkworm SLC6 gene transcripts indicated that these genes were highly and specifically expressed in midgut, brain and gonads; moreover, these genes were expressed primarily at the feeding stage or adult stage. Levels of expression for most midgut-specific and midgut-enriched gene transcripts were down-regulated after starvation but up-regulated after re-feeding. In addition, we observed that expression levels of these genes except for BmSLC6-15 and BmGT1 were markedly up-regulated by a juvenile hormone analog. Moreover, brain-enriched genes showed differential expression patterns during wandering and mating processes, suggesting that these genes may be involved in modulating wandering and mating behaviors. Our results improve our understanding of the expression patterns and potential physiological functions of the SLC6 gene family, and provide valuable information for the comprehensive functional analysis of the SLC6 gene family.
The Genetic Basis for Variation in Sensitivity to Lead Toxicity in Drosophila melanogaster.
Zhou, Shanshan; Morozova, Tatiana V; Hussain, Yasmeen N; Luoma, Sarah E; McCoy, Lenovia; Yamamoto, Akihiko; Mackay, Trudy F C; Anholt, Robert R H
2016-07-01
Lead toxicity presents a worldwide health problem, especially due to its adverse effects on cognitive development in children. However, identifying genes that give rise to individual variation in susceptibility to lead toxicity is challenging in human populations. Our goal was to use Drosophila melanogaster to identify evolutionarily conserved candidate genes associated with individual variation in susceptibility to lead exposure. To identify candidate genes associated with variation in susceptibility to lead toxicity, we measured effects of lead exposure on development time, viability and adult activity in the Drosophila melanogaster Genetic Reference Panel (DGRP) and performed genome-wide association analyses to identify candidate genes. We used mutants to assess functional causality of candidate genes and constructed a genetic network associated with variation in sensitivity to lead exposure, on which we could superimpose human orthologs. We found substantial heritabilities for all three traits and identified candidate genes associated with variation in susceptibility to lead exposure for each phenotype. The genetic architectures that determine variation in sensitivity to lead exposure are highly polygenic. Gene ontology and network analyses showed enrichment of genes associated with early development and function of the nervous system. Drosophila melanogaster presents an advantageous model to study the genetic underpinnings of variation in susceptibility to lead toxicity. Evolutionary conservation of cellular pathways that respond to toxic exposure allows predictions regarding orthologous genes and pathways across phyla. Thus, studies in the D. melanogaster model system can identify candidate susceptibility genes to guide subsequent studies in human populations. Zhou S, Morozova TV, Hussain YN, Luoma SE, McCoy L, Yamamoto A, Mackay TF, Anholt RR. 2016. The genetic basis for variation in sensitivity to lead toxicity in Drosophila melanogaster. Environ Health Perspect 124:1062-1070; http://dx.doi.org/10.1289/ehp.1510513.
Kim, Eunjung; Kim, Eun Jung; Seo, Seung-Won; Hur, Cheol-Goo; McGregor, Robin A; Choi, Myung-Sook
2014-01-01
Worldwide obesity and related comorbidities are increasing, but identifying new therapeutic targets remains a challenge. A plethora of microarray studies in diet-induced obesity models has provided large datasets of obesity associated genes. In this review, we describe an approach to examine the underlying molecular network regulating obesity, and we discuss interactions between obesity candidate genes. We conducted network analysis on functional protein-protein interactions associated with 25 obesity candidate genes identified in a literature-driven approach based on published microarray studies of diet-induced obesity. The obesity candidate genes were closely associated with lipid metabolism and inflammation. Peroxisome proliferator activated receptor gamma (Pparg) appeared to be a core obesity gene, and obesity candidate genes were highly interconnected, suggesting a coordinately regulated molecular network in adipose tissue. In conclusion, the current network analysis approach may help elucidate the underlying molecular network regulating obesity and identify anti-obesity targets for therapeutic intervention.
X-linked intellectual disability update 2017.
Neri, Giovanni; Schwartz, Charles E; Lubs, Herbert A; Stevenson, Roger E
2018-04-25
The X-chromosome comprises only about 5% of the human genome but accounts for about 15% of the genes currently known to be associated with intellectual disability. The early progress in identifying the X-linked intellectual disability (XLID)-associated genes through linkage analysis and candidate gene sequencing has been accelerated with the use of high-throughput technologies. In the 10 years since the last update, the number of genes associated with XLID has increased by 96% from 72 to 141 and duplications of all 141 XLID genes have been described, primarily through the application of high-resolution microarrays and next generation sequencing. The progress in identifying genetic and genomic alterations associated with XLID has not been matched with insights that improve the clinician's ability to form differential diagnoses, that bring into view the possibility of curative therapies for patients, or that inform scientists of the impact of the genetic alterations on cell organization and function. © 2018 Wiley Periodicals, Inc.
Pappa, Irene; Szekely, Eszter; Mileva-Seitz, Viara R; Luijk, Maartje P C M; Bakermans-Kranenburg, Marian J; van IJzendoorn, Marinus H; Tiemeier, Henning
2015-01-01
Although the environmental influences on infant attachment disorganization and security are well-studied, little is known about their heritability. Candidate gene studies have shown small, often non-replicable effects. In this study, we gathered the largest sample (N = 657) of ethnically homogenous, 14-month-old children with both observed attachment and genome-wide data. First, we used a Genome-Wide Association Study (GWAS) approach to identify single nucleotide polymorphisms (SNPs) associated with attachment disorganization and security. Second, we annotated them into genes (Versatile Gene-based Association Study) and functional pathways. Our analyses provide evidence of novel genes (HDAC1, ZNF675, BSCD1) and pathways (synaptic transmission, cation transport) associated with attachment disorganization. Similar analyses identified a novel gene (BECN1) but no distinct pathways associated with attachment security. The results of this first extensive, exploratory study on the molecular-genetic basis of infant attachment await replication in large, independent samples.
Comparative genomics of defense systems in archaea and bacteria
Makarova, Kira S.; Wolf, Yuri I.; Koonin, Eugene V.
2013-01-01
Our knowledge of prokaryotic defense systems has vastly expanded as the result of comparative genomic analysis, followed by experimental validation. This expansion is both quantitative, including the discovery of diverse new examples of known types of defense systems, such as restriction-modification or toxin-antitoxin systems, and qualitative, including the discovery of fundamentally new defense mechanisms, such as the CRISPR-Cas immunity system. Large-scale statistical analysis reveals that the distribution of different defense systems in bacterial and archaeal taxa is non-uniform, with four groups of organisms distinguishable with respect to the overall abundance and the balance between specific types of defense systems. The genes encoding defense system components in bacterial and archaea typically cluster in defense islands. In addition to genes encoding known defense systems, these islands contain numerous uncharacterized genes, which are candidates for new types of defense systems. The tight association of the genes encoding immunity systems and dormancy- or cell death-inducing defense systems in prokaryotic genomes suggests that these two major types of defense are functionally coupled, providing for effective protection at the population level. PMID:23470997
The Pea Photoperiod Response Gene STERILE NODES Is an Ortholog of LUX ARRHYTHMO1[W][OPEN
Liew, Lim Chee; Hecht, Valérie; Sussmilch, Frances C.; Weller, James L.
2014-01-01
The STERILE NODES (SN) locus in pea (Pisum sativum) was one of the first photoperiod response genes to be described and provided early evidence for the genetic control of long-distance signaling in flowering-time regulation. Lines homozygous for recessive sn mutations are early flowering and photoperiod insensitive, with an increased ability to promote flowering across a graft union in short-day conditions. Here, we show that SN controls developmental regulation of genes in the FT family and rhythmic regulation of genes related to circadian clock function. Using a positional and functional candidate approach, we identify SN as the pea ortholog of LUX ARRHYTHMO, a GARP transcription factor from Arabidopsis (Arabidopsis thaliana) with an important role in circadian clock function. In addition to induced mutants, sequence analysis demonstrates the presence of at least three other independent, naturally occurring loss-of-function mutations among known sn cultivars. Examination of genetic and regulatory interactions between SN and two other circadian clock genes, HIGH RESPONSE TO PHOTOPERIOD (HR) and DIE NEUTRALIS (DNE), suggests a complex relationship in which HR regulates expression of SN and the role of DNE and HR in control of flowering is dependent on SN. These results extend previous work to show that pea orthologs of all three Arabidopsis evening complex genes regulate clock function and photoperiod-responsive flowering and suggest that the function of these genes may be widely conserved. PMID:24706549
Enciso-Rodríguez, Felix E.; González, Carolina; Rodríguez, Edwin A.; López, Camilo E.; Landsman, David; Barrero, Luz Stella; Mariño-Ramírez, Leonardo
2013-01-01
The Cape gooseberry ( Physalis peruviana L) is an Andean exotic fruit with high nutritional value and appealing medicinal properties. However, its cultivation faces important phytosanitary problems mainly due to pathogens like Fusarium oxysporum, Cercosporaphysalidis and Alternaria spp. Here we used the Cape gooseberry foliar transcriptome to search for proteins that encode conserved domains related to plant immunity including: NBS (Nucleotide Binding Site), CC (Coiled-Coil), TIR (Toll/Interleukin-1 Receptor). We identified 74 immunity related gene candidates in P . peruviana which have the typical resistance gene (R-gene) architecture, 17 Receptor like kinase (RLKs) candidates related to PAMP-Triggered Immunity (PTI), eight (TIR-NBS-LRR, or TNL) and nine (CC–NBS-LRR, or CNL) candidates related to Effector-Triggered Immunity (ETI) genes among others. These candidate genes were categorized by molecular function (98%), biological process (85%) and cellular component (79%) using gene ontology. Some of the most interesting predicted roles were those associated with binding and transferase activity. We designed 94 primers pairs from the 74 immunity-related genes (IRGs) to amplify the corresponding genomic regions on six genotypes that included resistant and susceptible materials. From these, we selected 17 single band amplicons and sequenced them in 14 F. oxysporum resistant and susceptible genotypes. Sequence polymorphisms were analyzed through preliminary candidate gene association, which allowed the detection of one SNP at the PpIRG-63 marker revealing a nonsynonymous mutation in the predicted LRR domain suggesting functional roles for resistance. PMID:23844210
Enciso-Rodríguez, Felix E; González, Carolina; Rodríguez, Edwin A; López, Camilo E; Landsman, David; Barrero, Luz Stella; Mariño-Ramírez, Leonardo
2013-01-01
The Cape gooseberry (Physalisperuviana L) is an Andean exotic fruit with high nutritional value and appealing medicinal properties. However, its cultivation faces important phytosanitary problems mainly due to pathogens like Fusarium oxysporum, Cercosporaphysalidis and Alternaria spp. Here we used the Cape gooseberry foliar transcriptome to search for proteins that encode conserved domains related to plant immunity including: NBS (Nucleotide Binding Site), CC (Coiled-Coil), TIR (Toll/Interleukin-1 Receptor). We identified 74 immunity related gene candidates in P. peruviana which have the typical resistance gene (R-gene) architecture, 17 Receptor like kinase (RLKs) candidates related to PAMP-Triggered Immunity (PTI), eight (TIR-NBS-LRR, or TNL) and nine (CC-NBS-LRR, or CNL) candidates related to Effector-Triggered Immunity (ETI) genes among others. These candidate genes were categorized by molecular function (98%), biological process (85%) and cellular component (79%) using gene ontology. Some of the most interesting predicted roles were those associated with binding and transferase activity. We designed 94 primers pairs from the 74 immunity-related genes (IRGs) to amplify the corresponding genomic regions on six genotypes that included resistant and susceptible materials. From these, we selected 17 single band amplicons and sequenced them in 14 F. oxysporum resistant and susceptible genotypes. Sequence polymorphisms were analyzed through preliminary candidate gene association, which allowed the detection of one SNP at the PpIRG-63 marker revealing a nonsynonymous mutation in the predicted LRR domain suggesting functional roles for resistance.
Expressed sequence tags from the flower pathogen Claviceps purpurea.
Oeser, Birgitt; Beaussart, François; Haarmann, Thomas; Lorenz, Nicole; Nathues, Eva; Rolke, Yvonne; Scheffer, Jan; Weiner, January; Tudzynski, Paul
2009-09-01
SUMMARY The ascomycete Claviceps purpurea (ergot) is a biotrophic flower pathogen of rye and other grasses. The deleterious toxic effects of infected rye seeds on humans and grazing animals have been known since the Middle Ages. To gain further insight into the molecular basis of this disease, we generated about 10 000 expressed sequence tags (ESTs)-about 25% originating from axenic fungal culture and about 75% from tissues collected 6-20 days after infection of rye spikes. The pattern of axenic vs. in planta gene expression was compared. About 200 putative plant genes were identified within the in planta library. A high percentage of these were predicted to function in plant defence against the ergot fungus and other pathogens, for example pathogenesis-related proteins. Potential fungal pathogenicity and virulence genes were found via comparison with the pathogen-host interaction database (PHI-base; http://www.phi-base.org) and with genes known to be highly expressed in the haustoria of the bean rust fungus. Comparative analysis of Claviceps and two other fungal flower pathogens (necrotrophic Fusarium graminearum and biotrophic Ustilago maydis) highlighted similarities and differences in their lifestyles, for example all three fungi have signalling components and cell wall-degrading enzymes in their arsenal. In summary, the analysis of axenic and in planta ESTs yielded a collection of candidate genes to be evaluated for functional roles in this plant-microbe interaction.
Trégouët, David-Alexandre; Morange, Pierre-Emmanuel
2018-02-01
Venous thromboembolism (VTE) has a strong genetic component. This review summarizes what is known at the seventeen genes that are now well established to harbour VTE-associated genetic variants. In addition, it discusses additional candidate genes that deserve further validation before being claimed as VTE associated genes. Finally, several research strategies are briefly described to identify other molecular determinants of the disease. © 2017 John Wiley & Sons Ltd.
Genome-wide survey and expression analysis of F-box genes in chickpea.
Gupta, Shefali; Garg, Vanika; Kant, Chandra; Bhatia, Sabhyata
2015-02-13
The F-box genes constitute one of the largest gene families in plants involved in degradation of cellular proteins. F-box proteins can recognize a wide array of substrates and regulate many important biological processes such as embryogenesis, floral development, plant growth and development, biotic and abiotic stress, hormonal responses and senescence, among others. However, little is known about the F-box genes in the important legume crop, chickpea. The available draft genome sequence of chickpea allowed us to conduct a genome-wide survey of the F-box gene family in chickpea. A total of 285 F-box genes were identified in chickpea which were classified based on their C-terminal domain structures into 10 subfamilies. Thirteen putative novel motifs were also identified in F-box proteins with no known functional domain at their C-termini. The F-box genes were physically mapped on the 8 chickpea chromosomes and duplication events were investigated which revealed that the F-box gene family expanded largely due to tandem duplications. Phylogenetic analysis classified the chickpea F-box genes into 9 clusters. Also, maximum syntenic relationship was observed with soybean followed by Medicago truncatula, Lotus japonicus and Arabidopsis. Digital expression analysis of F-box genes in various chickpea tissues as well as under abiotic stress conditions utilizing the available chickpea transcriptome data revealed differential expression patterns with several F-box genes specifically expressing in each tissue, few of which were validated by using quantitative real-time PCR. The genome-wide analysis of chickpea F-box genes provides new opportunities for characterization of candidate F-box genes and elucidation of their function in growth, development and stress responses for utilization in chickpea improvement.
Butler, Merlin G; McGuire, Austen; Manzardo, Ann M
2015-04-01
Obesity is a growing public health concern now reaching epidemic status worldwide for children and adults due to multiple problems impacting on energy intake and expenditure with influences on human reproduction and infertility. A positive family history and genetic factors are known to play a role in obesity by influencing eating behavior, weight and level of physical activity and also contributing to human reproduction and infertility. Recent advances in genetic technology have led to discoveries of new susceptibility genes for obesity and causation of infertility. The goal of our study was to provide an update of clinically relevant candidate and known genes for obesity and infertility using high resolution chromosome ideograms with gene symbols and tabular form. We used computer-based internet websites including PubMed to search for combinations of key words such as obesity, body mass index, infertility, reproduction, azoospermia, endometriosis, diminished ovarian reserve, estrogen along with genetics, gene mutations or variants to identify evidence for development of a master list of recognized obesity genes in humans and those involved with infertility and reproduction. Gene symbols for known and candidate genes for obesity were plotted on high resolution chromosome ideograms at the 850 band level. Both infertility and obesity genes were listed separately in alphabetical order in tabular form and those highlighted when involved with both conditions. By searching the medical literature and computer generated websites for key words, we found documented evidence for 370 genes playing a role in obesity and 153 genes for human reproduction or infertility. The obesity genes primarily affected common pathways in lipid metabolism, deposition or transport, eating behavior and food selection, physical activity or energy expenditure. Twenty-one of the obesity genes were also associated with human infertility and reproduction. Gene symbols were plotted on high resolution ideograms and their name, precise chromosome band location and description were summarized in tabular form. Meaningful correlations in the obesity phenotype and associated human infertility and reproduction are represented with the location of genes on chromosome ideograms along with description of the gene and position in tabular form. These high resolution chromosome ideograms and tables will be useful in genetic awareness and counseling, diagnosis and treatment to improve clinical outcomes.
A Network Approach to Rare Disease Modeling
NASA Astrophysics Data System (ADS)
Ghiassian, Susan; Rabello, Sabrina; Sharma, Amitabh; Wiest, Olaf; Barabasi, Albert-Laszlo
2011-03-01
Network approaches have been widely used to better understand different areas of natural and social sciences. Network Science had a particularly great impact on the study of biological systems. In this project, using biological networks, candidate drugs as a potential treatment of rare diseases were identified. Developing new drugs for more than 2000 rare diseases (as defined by ORPHANET) is too expensive and beyond expectation. Disease proteins do not function in isolation but in cooperation with other interacting proteins. Research on FDA approved drugs have shown that most of the drugs do not target the disease protein but a protein which is 2 or 3 steps away from the disease protein in the Protein-Protein Interaction (PPI) network. We identified the already known drug targets in the disease gene's PPI subnetwork (up to the 3rd neighborhood) and among them those in the same sub cellular compartment and higher coexpression coefficient with the disease gene are expected to be stronger candidates. Out of 2177 rare diseases, 1092 were found not to have any drug target. Using the above method, we have found the strongest candidates among the rest in order to further experimental validations.
Esibizione, Diana; Cui, Chang-Yi; Schlessinger, David
2009-01-01
EDA, the gene mutated in anhidrotic ectodermal dysplasia, encodes ectodysplasin, a TNF superfamily member that activates NF-kB mediated transcription. To identify EDA target genes, we have earlier used expression profiling to infer genes differentially expressed at various developmental time points in Tabby (Eda-deficient) compared to wild-type mouse skin. To increase the resolution to find genes whose expression may be restricted to epidermal cells, we have now extended studies to primary keratinocyte cultures established from E19 wild-type and Tabby skin. Using microarrays bearing 44,000 gene probes, we found 385 preliminary candidate genes whose expression was significantly affected by Eda loss. By comparing expression profiles to those from Eda-A1 transgenic skin, we restricted the list to 38 “candidate EDA targets”, 14 of which were already known to be expressed in hair follicles or epidermis. We confirmed expression changes for 3 selected genes, Tbx1, Bmp7, and Jag1, both in keratinocytes and in whole skin, by Q-PCR and Western blotting analyses. Thus, by the analysis of keratinocytes, novel candidate pathways downstream of EDA were detected. PMID:18848976
Cross-talk of the biotrophic pathogen Claviceps purpurea and its host Secale cereale.
Oeser, Birgitt; Kind, Sabine; Schurack, Selma; Schmutzer, Thomas; Tudzynski, Paul; Hinsch, Janine
2017-04-04
The economically important Ergot fungus Claviceps purpurea is an interesting biotrophic model system because of its strict organ specificity (grass ovaries) and the lack of any detectable plant defense reactions. Though several virulence factors were identified, the exact infection mechanisms are unknown, e.g. how the fungus masks its attack and if the host detects the infection at all. We present a first dual transcriptome analysis using an RNA-Seq approach. We studied both, fungal and plant gene expression in young ovaries infected by the wild-type and two virulence-attenuated mutants. We can show that the plant recognizes the fungus, since defense related genes are upregulated, especially several phytohormone genes. We present a survey of in planta expressed fungal genes, among them several confirmed virulence genes. Interestingly, the set of most highly expressed genes includes a high proportion of genes encoding putative effectors, small secreted proteins which might be involved in masking the fungal attack or interfering with host defense reactions. As known from several other phytopathogens, the C. purpurea genome contains more than 400 of such genes, many of them clustered and probably highly redundant. Since the lack of effective defense reactions in spite of recognition of the fungus could very well be achieved by effectors, we started a functional analysis of some of the most highly expressed candidates. However, the redundancy of the system made the identification of a drastic effect of a single gene most unlikely. We can show that at least one candidate accumulates in the plant apoplast. Deletion of some candidates led to a reduced virulence of C. purpurea on rye, indicating a role of the respective proteins during the infection process. We show for the first time that- despite the absence of effective plant defense reactions- the biotrophic pathogen C. purpurea is detected by its host. This points to a role of effectors in modulation of the effective plant response. Indeed, several putative effector genes are among the highest expressed genes in planta.
Marjonen, Heidi; Sierra, Alejandra; Nyman, Anna; Rogojin, Vladimir; Gröhn, Olli; Linden, Anni-Maija; Hautaniemi, Sampsa; Kaminen-Ahola, Nina
2015-01-01
The adverse effects of alcohol consumption during pregnancy are known, but the molecular events that lead to the phenotypic characteristics are unclear. To unravel the molecular mechanisms, we have used a mouse model of gestational ethanol exposure, which is based on maternal ad libitum ingestion of 10% (v/v) ethanol for the first 8 days of gestation (GD 0.5-8.5). Early neurulation takes place by the end of this period, which is equivalent to the developmental stage early in the fourth week post-fertilization in human. During this exposure period, dynamic epigenetic reprogramming takes place and the embryo is vulnerable to the effects of environmental factors. Thus, we hypothesize that early ethanol exposure disrupts the epigenetic reprogramming of the embryo, which leads to alterations in gene regulation and life-long changes in brain structure and function. Genome-wide analysis of gene expression in the mouse hippocampus revealed altered expression of 23 genes and three miRNAs in ethanol-exposed, adolescent offspring at postnatal day (P) 28. We confirmed this result by using two other tissues, where three candidate genes are known to express actively. Interestingly, we found a similar trend of upregulated gene expression in bone marrow and main olfactory epithelium. In addition, we observed altered DNA methylation in the CpG islands upstream of the candidate genes in the hippocampus. Our MRI study revealed asymmetry of brain structures in ethanol-exposed adult offspring (P60): we detected ethanol-induced enlargement of the left hippocampus and decreased volume of the left olfactory bulb. Our study indicates that ethanol exposure in early gestation can cause changes in DNA methylation, gene expression, and brain structure of offspring. Furthermore, the results support our hypothesis of early epigenetic origin of alcohol-induced disorders: changes in gene regulation may have already taken place in embryonic stem cells and therefore can be seen in different tissue types later in life. PMID:25970770
Quilter, C.R.; Karcanias, A.C.; Bagga, M.R.; Duncan, S.; Murray, A.; Conway, G.S.; Sargent, C.A.; Affara, N.A.
2013-01-01
BACKGROUND Premature ovarian failure (POF) is a heterogeneous disease defined as amenorrhoea for >6 months before age 40, with an FSH serum level >40 mIU/ml (menopausal levels). While there is a strong genetic association with POF, familial studies have also indicated that idiopathic POF may also be genetically linked. Conventional cytogenetic analyses have identified regions of the X chromosome that are strongly associated with ovarian function, as well as several POF candidate genes. Cryptic chromosome abnormalities that have been missed might be detected by array comparative genomic hybridization. METHODS In this study, samples from 42 idiopathic POF patients were subjected to a complete end-to-end X/Y chromosome tiling path array to achieve a detailed copy number variation (CNV) analysis of X chromosome involvement in POF. The arrays also contained a 1 Mb autosomal tiling path as a reference control. Quantitative PCR for selected genes contained within the CNVs was used to confirm the majority of the changes detected. The expression pattern of some of these genes in human tissue RNA was examined by reverse transcription (RT)–PCR. RESULTS A number of CNVs were identified on both Xp and Xq, with several being shared among the POF cases. Some CNVs fall within known polymorphic CNV regions, and others span previously identified POF candidate regions and genes. CONCLUSIONS The new data reported in this study reveal further discrete X chromosome intervals not previously associated with the disease and therefore implicate new clusters of candidate genes. Further studies will be required to elucidate their involvement in POF. PMID:20570974
Li, Yuanjun; Gou, Junbo; Chen, Fangfang; Li, Changfu; Zhang, Yansheng
2016-01-01
Xanthium strumarium L. is a traditional Chinese herb belonging to the Asteraceae family. The major bioactive components of this plant are sesquiterpene lactones (STLs), which include the xanthanolides. To date, the biogenesis of xanthanolides, especially their downstream pathway, remains largely unknown. In X. strumarium, xanthanolides primarily accumulate in its glandular trichomes. To identify putative gene candidates involved in the biosynthesis of xanthanolides, three X. strumarium transcriptomes, which were derived from the young leaves of two different cultivars and the purified glandular trichomes from one of the cultivars, were constructed in this study. In total, 157 million clean reads were generated and assembled into 91,861 unigenes, of which 59,858 unigenes were successfully annotated. All the genes coding for known enzymes in the upstream pathway to the biosynthesis of xanthanolides were present in the X. strumarium transcriptomes. From a comparative analysis of the X. strumarium transcriptomes, this study identified a number of gene candidates that are putatively involved in the downstream pathway to the synthesis of xanthanolides, such as four unigenes encoding CYP71 P450s, 50 unigenes for dehydrogenases, and 27 genes for acetyltransferases. The possible functions of these four CYP71 candidates are extensively discussed. In addition, 116 transcription factors that are highly expressed in X. strumarium glandular trichomes were also identified. Their possible regulatory roles in the biosynthesis of STLs are discussed. The global transcriptomic data for X. strumarium should provide a valuable resource for further research into the biosynthesis of xanthanolides. PMID:27625674
In Silico Gene Prioritization by Integrating Multiple Data Sources
Zhou, Yingyao; Shields, Robert; Chanda, Sumit K.; Elston, Robert C.; Li, Jing
2011-01-01
Identifying disease genes is crucial to the understanding of disease pathogenesis, and to the improvement of disease diagnosis and treatment. In recent years, many researchers have proposed approaches to prioritize candidate genes by considering the relationship of candidate genes and existing known disease genes, reflected in other data sources. In this paper, we propose an expandable framework for gene prioritization that can integrate multiple heterogeneous data sources by taking advantage of a unified graphic representation. Gene-gene relationships and gene-disease relationships are then defined based on the overall topology of each network using a diffusion kernel measure. These relationship measures are in turn normalized to derive an overall measure across all networks, which is utilized to rank all candidate genes. Based on the informativeness of available data sources with respect to each specific disease, we also propose an adaptive threshold score to select a small subset of candidate genes for further validation studies. We performed large scale cross-validation analysis on 110 disease families using three data sources. Results have shown that our approach consistently outperforms other two state of the art programs. A case study using Parkinson disease (PD) has identified four candidate genes (UBB, SEPT5, GPR37 and TH) that ranked higher than our adaptive threshold, all of which are involved in the PD pathway. In particular, a very recent study has observed a deletion of TH in a patient with PD, which supports the importance of the TH gene in PD pathogenesis. A web tool has been implemented to assist scientists in their genetic studies. PMID:21731658
Kohno, Takashi; Otsuka, Ayaka; Girard, Luc; Sato, Masanori; Iwakawa, Reika; Ogiwara, Hideaki; Sanchez-Cespedes, Montse; Minna, John D.; Yokota, Jun
2010-01-01
A total of 176 genes homozygously deleted in human lung cancer were identified by DNA array-based whole genome scanning of 52 lung cancer cell lines and subsequent genomic PCR in 74 cell lines, including the 52 cell lines scanned. One or more exons of these genes were homozygously deleted in one (1%) to 20 (27%) cell lines. These genes included known tumor suppressor genes, e.g., CDKN2A/p16, RB1, and SMAD4, and candidate tumor suppressor genes whose hemizygous or homozygous deletions were reported in several types of human cancers, such as FHIT, KEAP1, and LRP1B/LRP-DIP. CDKN2A/p16 and p14ARF located in 9p21 were most frequently deleted (20/74, 27%). The PTPRD gene was most frequently deleted (8/74, 11%) among genes mapping to regions other than 9p21. Somatic mutations, including a nonsense mutation, of the PTPRD gene were detected in 8/74 (11%) of cell lines and 4/95 (4%) of surgical specimens of lung cancer. Reduced PTPRD expression was observed in the majority (>80%) of cell lines and surgical specimens of lung cancer. Therefore, PTPRD is a candidate tumor suppressor gene in lung cancer. Microarray-based expression profiling of 19 lung cancer cell lines also indicated that some of the 176 genes, such as KANK and ADAMTS1, are preferentially inactivated by epigenetic alterations. Genetic/epigenetic as well as functional studies of these 176 genes will increase our understanding of molecular mechanisms behind lung carcinogenesis. PMID:20073072
Distal 10q monosomy: new evidence for a neurobehavioral condition?
Plaisancié, Julie; Bouneau, Laurence; Cances, Claude; Garnier, Christelle; Benesteau, Jacques; Leonard, Samantha; Bourrouillou, Georges; Calvas, Patrick; Vigouroux, Adeline; Julia, Sophie; Bieth, Eric
2014-01-01
Pure distal monosomy of the long arm of chromosome 10 is a rare cytogenetic abnormality. The location and size of the deletions described in this region are variable. Nevertheless, the patients share characteristic facial appearance, variable cognitive impairment and neurobehavioral manifestations. A Minimal Critical Region corresponding to a 600 kb Smallest Region of deletion Overlap (SRO) has been proposed. In this report, we describe four patients with a distal 10q26 deletion, who displayed attention-deficit/hyperactivity disorders (ADHD). One of them had a marked behavioral profile and relatively preserved cognitive functions. Interestingly, the SRO was not included in the deleted segment of this patient suggesting that this deletion could contain candidate genes involved in the control of neurobehavioral functions. One of these candidates was the CALY gene, known for its association with ADHD patients and whose expression level was shown to be correlated with neurobehavioral disturbances in varying animal models. This report emphasizes the importance of the behavioral problems as a cardinal feature of the 10q microdeletion syndrome. Haploinsufficiency of CALY could play a crucial role in the development of the behavioral troubles within these patients. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
Maver, Ales; Medica, Igor; Peterlin, Borut
2009-12-01
The search for gene candidates in multifactorial diseases such as sarcoidosis can be based on the integration of linkage association data, gene expression data, and protein profile data from genomic, transcriptomic and proteomic studies, respectively. In this study we performed a literature-based search for studies reporting such data, followed by integration of collected information. Different databases were examined--Medline, HugGE Navigator, ArrayExpress and Gene Expression Omnibus (GEO). Candidate genes were defined as genes which were reported in at least 2 different types of omics studies. Genes previously investigated in sarcoidosis were excluded from further analyses. We identified 177 genes associated with sarcoidosis as potential new candidate genes. Subsequently, 9 gene candidates identified to overlap in 2 different types of studies (genomic, transcriptomic and/or proteomic) were consistently reported in at least 3 studies: SERPINB1, FABP4, S100A8, HBEGF, IL7R, LRIG1, PTPN23, DPM2 and NUP214. These genes are involved in regulation of immune response, cellular proliferation, apoptosis, inhibition of protease activity, lipid metabolism. Exact biological functions of HBEGF, LRIG1, PTPN23, DPM2 and NUP214 remain to be completely elucidated. We propose 9 candidate genes: SERPINB1, FABP4, S100A8, HBEGF, IL7R, LRIG1, PTPN23, DPM2 and NUP214, as genes with high potential for association with sarcoidosis.
Novel Approaches to Breast Cancer Prevention and Inhibition of Metastases
2013-10-01
allow a functional characterization of human candidate breast cancer genes. The transgenic RNAi library is covering the whole Drosophila genome ...W81XWH-12-1-0093 / Penninger 15. SUBJECT TERMS Genome wide functional genetics, haploid stem cells, Drosophila cancer modeling...With the advent of modern genomics hundreds of candidate genes have been associated with breast cancer both in GWAS studies as well as by cancer genome
Rare variants in SOS2 and LZTR1 are associated with Noonan syndrome.
Yamamoto, Guilherme Lopes; Aguena, Meire; Gos, Monika; Hung, Christina; Pilch, Jacek; Fahiminiya, Somayyeh; Abramowicz, Anna; Cristian, Ingrid; Buscarilli, Michelle; Naslavsky, Michel Satya; Malaquias, Alexsandra C; Zatz, Mayana; Bodamer, Olaf; Majewski, Jacek; Jorge, Alexander A L; Pereira, Alexandre C; Kim, Chong Ae; Passos-Bueno, Maria Rita; Bertola, Débora Romeo
2015-06-01
Noonan syndrome is an autosomal dominant, multisystemic disorder caused by dysregulation of the RAS/mitogen activated protein kinase (MAPK) pathway. Heterozygous, pathogenic variants in 11 known genes account for approximately 80% of cases. The identification of novel genes associated with Noonan syndrome has become increasingly challenging, since they might be responsible for very small fractions of the cases. A cohort of 50 Brazilian probands negative for pathogenic variants in the known genes associated with Noonan syndrome was tested through whole-exome sequencing along with the relatives in the familial cases. Families from the USA and Poland with mutations in the newly identified genes were included subsequently. We identified rare, segregating or de novo missense variants in SOS2 and LZTR1 in 4% and 8%, respectively, of the 50 Brazilian probands. SOS2 and LZTR1 variants were also found to segregate in one American and one Polish family. Notably, SOS2 variants were identified in patients with marked ectodermal involvement, similar to patients with SOS1 mutations. We identified two novel genes, SOS2 and LZTR1, associated with Noonan syndrome, thereby expanding the molecular spectrum of RASopathies. Mutations in these genes are responsible for approximately 3% of all patients with Noonan syndrome. While SOS2 is a natural candidate, because of its homology with SOS1, the functional role of LZTR1 in the RAS/MAPK pathway is not known, and it could not have been identified without the large pedigrees. Additional functional studies are needed to elucidate the role of LZTR1 in RAS/MAPK signalling and in the pathogenesis of Noonan syndrome. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Butler, Merlin G; Rafi, Syed K; McGuire, Austen; Manzardo, Ann M
2016-01-01
To provide an update of currently recognized clinically relevant candidate and known genes for human reproduction and related infertility plotted on high resolution chromosome ideograms (850 band level) and represented alphabetically in tabular form. Descriptive authoritative computer-based website and peer-reviewed medical literature searches used pertinent keywords representing human reproduction and related infertility along with genetics and gene mutations. A master list of genes associated with human reproduction and related infertility was generated with a visual representation of gene locations on high resolution chromosome ideograms. GeneAnalytics pathway analysis was carried out on the resulting list of genes to assess underlying genetic architecture for infertility. Advances in genetic technology have led to the discovery of genes responsible for reproduction and related infertility. Genes identified (N=371) in our search primarily impact ovarian steroidogenesis through sex hormone biology, germ cell production, genito-urinary or gonadal development and function, and related peptide production, receptors and regulatory factors. The location of gene symbols plotted on high resolution chromosome ideograms forms a conceptualized image of the distribution of human reproduction genes. The updated master list can be used to promote better awareness of genetics of reproduction and related infertility and advance discoveries on genetic causes and disease mechanisms. Copyright © 2015 Elsevier B.V. All rights reserved.
de Miguel, Marina; Cabezas, José-Antonio; de María, Nuria; Sánchez-Gómez, David; Guevara, María-Ángeles; Vélez, María-Dolores; Sáez-Laguna, Enrique; Díaz, Luis-Manuel; Mancha, Jose-Antonio; Barbero, María-Carmen; Collada, Carmen; Díaz-Sala, Carmen; Aranda, Ismael; Cervera, María-Teresa
2014-06-12
Understanding molecular mechanisms that control photosynthesis and water use efficiency in response to drought is crucial for plant species from dry areas. This study aimed to identify QTL for these traits in a Mediterranean conifer and tested their stability under drought. High density linkage maps for Pinus pinaster were used in the detection of QTL for photosynthesis and water use efficiency at three water irrigation regimes. A total of 28 significant and 27 suggestive QTL were found. QTL detected for photochemical traits accounted for the higher percentage of phenotypic variance. Functional annotation of genes within the QTL suggested 58 candidate genes for the analyzed traits. Allele association analysis in selected candidate genes showed three SNPs located in a MYB transcription factor that were significantly associated with efficiency of energy capture by open PSII reaction centers and specific leaf area. The integration of QTL mapping of functional traits, genome annotation and allele association yielded several candidate genes involved with molecular control of photosynthesis and water use efficiency in response to drought in a conifer species. The results obtained highlight the importance of maintaining the integrity of the photochemical machinery in P. pinaster drought response.
Vischi Winck, Flavia; Arvidsson, Samuel; Riaño-Pachón, Diego Mauricio; Hempel, Sabrina; Koseska, Aneta; Nikoloski, Zoran; Urbina Gomez, David Alejandro; Rupprecht, Jens; Mueller-Roeber, Bernd
2013-01-01
The unicellular green alga Chlamydomonas reinhardtii is a long-established model organism for studies on photosynthesis and carbon metabolism-related physiology. Under conditions of air-level carbon dioxide concentration [CO2], a carbon concentrating mechanism (CCM) is induced to facilitate cellular carbon uptake. CCM increases the availability of carbon dioxide at the site of cellular carbon fixation. To improve our understanding of the transcriptional control of the CCM, we employed FAIRE-seq (formaldehyde-assisted Isolation of Regulatory Elements, followed by deep sequencing) to determine nucleosome-depleted chromatin regions of algal cells subjected to carbon deprivation. Our FAIRE data recapitulated the positions of known regulatory elements in the promoter of the periplasmic carbonic anhydrase (Cah1) gene, which is upregulated during CCM induction, and revealed new candidate regulatory elements at a genome-wide scale. In addition, time series expression patterns of 130 transcription factor (TF) and transcription regulator (TR) genes were obtained for cells cultured under photoautotrophic condition and subjected to a shift from high to low [CO2]. Groups of co-expressed genes were identified and a putative directed gene-regulatory network underlying the CCM was reconstructed from the gene expression data using the recently developed IOTA (inner composition alignment) method. Among the candidate regulatory genes, two members of the MYB-related TF family, Lcr1 (Low-CO 2 response regulator 1) and Lcr2 (Low-CO 2 response regulator 2), may play an important role in down-regulating the expression of a particular set of TF and TR genes in response to low [CO2]. The results obtained provide new insights into the transcriptional control of the CCM and revealed more than 60 new candidate regulatory genes. Deep sequencing of nucleosome-depleted genomic regions indicated the presence of new, previously unknown regulatory elements in the C. reinhardtii genome. Our work can serve as a basis for future functional studies of transcriptional regulator genes and genomic regulatory elements in Chlamydomonas. PMID:24224019
Positional cloning of disease genes on chromosome 16
DOE Office of Scientific and Technical Information (OSTI.GOV)
Doggett, N.; Bruening, M.; Callen, D.
1996-04-01
The project seeks to elucidate the molecular basis of an important genetic disease (Batten`s disease) by molecular cloning of the affected gene by utilizing an overlapping clone map of chromosome 16. Batten disease (also known as juvenile neuronal ceroid lipofuscinosis) is a recessively inherited neurodegenerative disorder of childhood characterized by progressive loss of vision, seizures, and psychomoter disturbances. The Batten disease gene was genetically mapped to the chromosome region 16p 12.1 in close linkage with the genetic markers D16S299 and D16S298. Exon amplification of a cosmid containing D16S298 yielded a candidate gene that was disrupted by a 1 kb genomicmore » deletion in all patients containing the most common haplotype for the disease. Two separate deletions and a point mutation altering a splice site in three unrelated families have confirmed the gene as the Batten disease gene. The disease gene encodes a novel 438 amino acid membrane binding protein of unknown function.« less
Probing the Xenopus laevis inner ear transcriptome for biological function
2012-01-01
Background The senses of hearing and balance depend upon mechanoreception, a process that originates in the inner ear and shares features across species. Amphibians have been widely used for physiological studies of mechanotransduction by sensory hair cells. In contrast, much less is known of the genetic basis of auditory and vestibular function in this class of animals. Among amphibians, the genus Xenopus is a well-characterized genetic and developmental model that offers unique opportunities for inner ear research because of the amphibian capacity for tissue and organ regeneration. For these reasons, we implemented a functional genomics approach as a means to undertake a large-scale analysis of the Xenopus laevis inner ear transcriptome through microarray analysis. Results Microarray analysis uncovered genes within the X. laevis inner ear transcriptome associated with inner ear function and impairment in other organisms, thereby supporting the inclusion of Xenopus in cross-species genetic studies of the inner ear. The use of gene categories (inner ear tissue; deafness; ion channels; ion transporters; transcription factors) facilitated the assignment of functional significance to probe set identifiers. We enhanced the biological relevance of our microarray data by using a variety of curation approaches to increase the annotation of the Affymetrix GeneChip® Xenopus laevis Genome array. In addition, annotation analysis revealed the prevalence of inner ear transcripts represented by probe set identifiers that lack functional characterization. Conclusions We identified an abundance of targets for genetic analysis of auditory and vestibular function. The orthologues to human genes with known inner ear function and the highly expressed transcripts that lack annotation are particularly interesting candidates for future analyses. We used informatics approaches to impart biologically relevant information to the Xenopus inner ear transcriptome, thereby addressing the impediment imposed by insufficient gene annotation. These findings heighten the relevance of Xenopus as a model organism for genetic investigations of inner ear organogenesis, morphogenesis, and regeneration. PMID:22676585
Toulis, Vasileios; Garanto, Alejandro; Marfany, Gemma
2016-01-01
Ubiquitination is a dynamic and reversible posttranslational modification. Much effort has been devoted to characterize the function of ubiquitin pathway genes in the cell context, but much less is known on their functional role in the development and maintenance of organs and tissues in the organism. In fact, several ubiquitin ligases and deubiquitinating enzymes (DUBs) are implicated in human pathological disorders, from cancer to neurodegeneration. The aim of our work is to explore the relevance of DUBs in retinal function in health and disease, particularly since some genes related to the ubiquitin or SUMO pathways cause retinal dystrophies, a group of rare diseases that affect 1:3000 individuals worldwide. We propose zebrafish as an extremely useful and informative genetic model to characterize the function of any particular gene in the retina, and thus complement the expression data from mouse. A preliminary characterization of gene expression in mouse retinas (RT-PCR and in situ hybridization) was performed to select particularly interesting genes, and we later replicated the experiments in zebrafish. As a proof of concept, we selected ups45 to be knocked down by morpholino injection in zebrafish embryos. Morphant phenotypic analysis showed moderate to severe eye morphological defects, with a defective formation of the retinal structures, therefore supporting the relevance of DUBs in the formation and differentiation of the vertebrate retina, and suggesting that genes encoding ubiquitin pathway enzymes are good candidates for causing hereditary retinal dystrophies.
Spontaneous preterm birth and single nucleotide gene polymorphisms: a recent update.
Sheikh, Ishfaq A; Ahmad, Ejaz; Jamal, Mohammad S; Rehan, Mohd; Assidi, Mourad; Tayubi, Iftikhar A; AlBasri, Samera F; Bajouh, Osama S; Turki, Rola F; Abuzenadah, Adel M; Damanhouri, Ghazi A; Beg, Mohd A; Al-Qahtani, Mohammed
2016-10-17
Preterm birth (PTB), birth at <37 weeks of gestation, is a significant global public health problem. World-wide, about 15 million babies are born preterm each year resulting in more than a million deaths of children. Preterm neonates are more prone to problems and need intensive care hospitalization. Health issues may persist through early adulthood and even be carried on to the next generation. Majority (70 %) of PTBs are spontaneous with about a half without any apparent cause and the other half associated with a number of risk factors. Genetic factors are one of the significant risks for PTB. The focus of this review is on single nucleotide gene polymorphisms (SNPs) that are reported to be associated with PTB. A comprehensive evaluation of studies on SNPs known to confer potential risk of PTB was done by performing a targeted PubMed search for the years 2007-2015 and systematically reviewing all relevant studies. Evaluation of 92 studies identified 119 candidate genes with SNPs that had potential association with PTB. The genes were associated with functions of a wide spectrum of tissue and cell types such as endocrine, tissue remodeling, vascular, metabolic, and immune and inflammatory systems. A number of potential functional candidate gene variants have been reported that predispose women for PTB. Understanding the complex genomic landscape of PTB needs high-throughput genome sequencing methods such as whole-exome sequencing and whole-genome sequencing approaches that will significantly enhance the understanding of PTB. Identification of high risk women, avoidance of possible risk factors, and provision of personalized health care are important to manage PTB.
Babben, Steve; Perovic, Dragan; Koch, Michael; Ordon, Frank
2015-01-01
Recent declines in costs accelerated sequencing of many species with large genomes, including hexaploid wheat (Triticum aestivum L.). Although the draft sequence of bread wheat is known, it is still one of the major challenges to developlocus specific primers suitable to be used in marker assisted selection procedures, due to the high homology of the three genomes. In this study we describe an efficient approach for the development of locus specific primers comprising four steps, i.e. (i) identification of genomic and coding sequences (CDS) of candidate genes, (ii) intron- and exon-structure reconstruction, (iii) identification of wheat A, B and D sub-genome sequences and primer development based on sequence differences between the three sub-genomes, and (iv); testing of primers for functionality, correct size and localisation. This approach was applied to single, low and high copy genes involved in frost tolerance in wheat. In summary for 27 of these genes for which sequences were derived from Triticum aestivum, Triticum monococcum and Hordeum vulgare, a set of 119 primer pairs was developed and after testing on Nulli-tetrasomic (NT) lines, a set of 65 primer pairs (54.6%), corresponding to 19 candidate genes, turned out to be specific. Out of these a set of 35 fragments was selected for validation via Sanger's amplicon re-sequencing. All fragments, with the exception of one, could be assigned to the original reference sequence. The approach presented here showed a much higher specificity in primer development in comparison to techniques used so far in bread wheat and can be applied to other polyploid species with a known draft sequence. PMID:26565976
Candidate genes for idiopathic epilepsy in four dog breeds.
Ekenstedt, Kari J; Patterson, Edward E; Minor, Katie M; Mickelson, James R
2011-04-25
Idiopathic epilepsy (IE) is a naturally occurring and significant seizure disorder affecting all dog breeds. Because dog breeds are genetically isolated populations, it is possible that IE is attributable to common founders and is genetically homogenous within breeds. In humans, a number of mutations, the majority of which are genes encoding ion channels, neurotransmitters, or their regulatory subunits, have been discovered to cause rare, specific types of IE. It was hypothesized that there are simple genetic bases for IE in some purebred dog breeds, specifically in Vizslas, English Springer Spaniels (ESS), Greater Swiss Mountain Dogs (GSMD), and Beagles, and that the gene(s) responsible may, in some cases, be the same as those already discovered in humans. Candidate genes known to be involved in human epilepsy, along with selected additional genes in the same gene families that are involved in murine epilepsy or are expressed in neural tissue, were examined in populations of affected and unaffected dogs. Microsatellite markers in close proximity to each candidate gene were genotyped and subjected to two-point linkage in Vizslas, and association analysis in ESS, GSMD and Beagles. Most of these candidate genes were not significantly associated with IE in these four dog breeds, while a few genes remained inconclusive. Other genes not included in this study may still be causing monogenic IE in these breeds or, like many cases of human IE, the disease in dogs may be likewise polygenic.
Functional dissection of drought-responsive gene expression patterns in Cynodon dactylon L.
Kim, Changsoo; Lemke, Cornelia; Paterson, Andrew H
2009-05-01
Water deficit is one of the main abiotic factors that affect plant productivity in subtropical regions. To identify genes induced during the water stress response in Bermudagrass (Cynodon dactylon), cDNA macroarrays were used. The macroarray analysis identified 189 drought-responsive candidate genes from C. dactylon, of which 120 were up-regulated and 69 were down-regulated. The candidate genes were classified into seven groups by cluster analysis of expression levels across two intensities and three durations of imposed stress. Annotation using BLASTX suggested that up-regulated genes may be involved in proline biosynthesis, signal transduction pathways, protein repair systems, and removal of toxins, while down-regulated genes were mostly related to basic plant metabolism such as photosynthesis and glycolysis. The functional classification of gene ontology (GO) was consistent with the BLASTX results, also suggesting some crosstalk between abiotic and biotic stress. Comparative analysis of cis-regulatory elements from the candidate genes implicated specific elements in drought response in Bermudagrass. Although only a subset of genes was studied, Bermudagrass shared many drought-responsive genes and cis-regulatory elements with other botanical models, supporting a strategy of cross-taxon application of drought-responsive genes, regulatory cues, and physiological-genetic information.
2011-01-01
Background Elucidating the genetic basis of human diseases is a central goal of genetics and molecular biology. While traditional linkage analysis and modern high-throughput techniques often provide long lists of tens or hundreds of disease gene candidates, the identification of disease genes among the candidates remains time-consuming and expensive. Efficient computational methods are therefore needed to prioritize genes within the list of candidates, by exploiting the wealth of information available about the genes in various databases. Results We propose ProDiGe, a novel algorithm for Prioritization of Disease Genes. ProDiGe implements a novel machine learning strategy based on learning from positive and unlabeled examples, which allows to integrate various sources of information about the genes, to share information about known disease genes across diseases, and to perform genome-wide searches for new disease genes. Experiments on real data show that ProDiGe outperforms state-of-the-art methods for the prioritization of genes in human diseases. Conclusions ProDiGe implements a new machine learning paradigm for gene prioritization, which could help the identification of new disease genes. It is freely available at http://cbio.ensmp.fr/prodige. PMID:21977986
Gene Expression Profiling of Gastric Cancer
Marimuthu, Arivusudar; Jacob, Harrys K.C.; Jakharia, Aniruddha; Subbannayya, Yashwanth; Keerthikumar, Shivakumar; Kashyap, Manoj Kumar; Goel, Renu; Balakrishnan, Lavanya; Dwivedi, Sutopa; Pathare, Swapnali; Dikshit, Jyoti Bajpai; Maharudraiah, Jagadeesha; Singh, Sujay; Sameer Kumar, Ghantasala S; Vijayakumar, M.; Veerendra Kumar, Kariyanakatte Veeraiah; Premalatha, Chennagiri Shrinivasamurthy; Tata, Pramila; Hariharan, Ramesh; Roa, Juan Carlos; Prasad, T.S.K; Chaerkady, Raghothama; Kumar, Rekha Vijay; Pandey, Akhilesh
2015-01-01
Gastric cancer is the second leading cause of cancer death worldwide, both in men and women. A genomewide gene expression analysis was carried out to identify differentially expressed genes in gastric adenocarcinoma tissues as compared to adjacent normal tissues. We used Agilent’s whole human genome oligonucleotide microarray platform representing ~41,000 genes to carry out gene expression analysis. Two-color microarray analysis was employed to directly compare the expression of genes between tumor and normal tissues. Through this approach, we identified several previously known candidate genes along with a number of novel candidate genes in gastric cancer. Testican-1 (SPOCK1) was one of the novel molecules that was 10-fold upregulated in tumors. Using tissue microarrays, we validated the expression of testican-1 by immunohistochemical staining. It was overexpressed in 56% (160/282) of the cases tested. Pathway analysis led to the identification of several networks in which SPOCK1 was among the topmost networks of interacting genes. By gene enrichment analysis, we identified several genes involved in cell adhesion and cell proliferation to be significantly upregulated while those corresponding to metabolic pathways were significantly downregulated. The differentially expressed genes identified in this study are candidate biomarkers for gastric adenoacarcinoma. PMID:27030788
Evaluating Reported Candidate Gene Associations with Polycystic Ovary Syndrome
Pau, Cindy; Saxena, Richa; Welt, Corrine Kolka
2013-01-01
Objective To replicate variants in candidate genes associated with PCOS in a population of European PCOS and control subjects. Design Case-control association analysis and meta-analysis. Setting Major academic hospital Patients Women of European ancestry with PCOS (n=525) and controls (n=472), aged 18 to 45 years. Intervention Variants previously associated with PCOS in candidate gene studies were genotyped (n=39). Metabolic, reproductive and anthropomorphic parameters were examined as a function of the candidate variants. All genetic association analyses were adjusted for age, BMI and ancestry and were reported after correction for multiple testing. Main Outcome Measure Association of candidate gene variants with PCOS. Results Three variants, rs3797179 (SRD5A1), rs12473543 (POMC), and rs1501299 (ADIPOQ), were nominally associated with PCOS. However, they did not remain significant after correction for multiple testing and none of the variants replicated in a sufficiently powered meta-analysis. Variants in the FBN3 gene (rs17202517 and rs73503752) were associated with smaller waist circumferences and variant rs727428 in the SHBG gene was associated with lower SHBG levels. Conclusion Previously identified variants in candidate genes do not appear to be associated with PCOS risk. PMID:23375202
Kang, Yuan; Dong, Xinran; Zhou, Qiongjie; Zhang, Ying; Cheng, Yan; Hu, Rong; Su, Cuihong; Jin, Hong; Liu, Xiaohui; Ma, Duan; Tian, Weidong; Li, Xiaotian
2012-03-01
This study aimed to identify candidate protein biomarkers from maternal serum for Down syndrome (DS) by integrated proteomic and bioinformatics analysis. A pregnancy DS group of 18 women and a control group with the same number were prepared, and the maternal serum proteins were analyzed by isobaric tags for relative and absolute quantitation and mass spectrometry, to identify DS differentially expressed maternal serum proteins (DS-DEMSPs). Comprehensive bioinformatics analysis was then employed to analyze DS-DEMSPs both in this paper and seven related publications. Down syndrome differentially expressed maternal serum proteins from different studies are significantly enriched with common Gene Ontology functions, Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, transcription factor binding sites, and Pfam protein domains, However, the DS-DEMSPs are less functionally related to known DS-related genes. These evidences suggest that common molecular mechanisms induced by secondary effects may be present upon DS carrying. A simple scoring scheme revealed Alpha-2-macroglobulin, Apolipoprotein A1, Apolipoprotein E, Complement C1s subcomponent, Complement component 5, Complement component 8, alpha polypeptide, Complement component 8, beta polypeptide and Fibronectin as potential DS biomarkers. The integration of proteomics and bioinformatics studies provides a novel approach to develop new prenatal screening methods for noninvasive yet accurate diagnosis of DS. Copyright © 2012 John Wiley & Sons, Ltd.
Racial disparity in pathophysiologic pathways of preterm birth based on genetic variants
Menon, Ramkumar; Pearce, Brad; Velez, Digna R; Merialdi, Mario; Williams, Scott M; Fortunato, Stephen J; Thorsen, Poul
2009-01-01
Objective To study pathophysiologic pathways in spontaneous preterm birth and possibly the racial disparity associating with maternal and fetal genetic variations, using bioinformatics tools. Methods A large scale candidate gene association study was performed on 1442 SNPs in 130 genes in a case (preterm birth < 36 weeks) control study (term birth > 37 weeks). Both maternal and fetal DNA from Caucasians (172 cases and 198 controls) and 279 African-Americans (82 cases and 197 controls) were used. A single locus association (genotypic) analysis followed by hierarchical clustering was performed, where clustering was based on p values for significant associations within each race. Using Ingenuity Pathway Analysis (IPA) software, known pathophysiologic pathways in both races were determined. Results From all SNPs entered into the analysis, the IPA mapped genes to specific disease functions. Gene variants in Caucasians were implicated in disease functions shared with other known disorders; specifically, dermatopathy, inflammation, and hematological disorders. This may reflect abnormal cervical ripening and decidual hemorrhage. In African-Americans inflammatory pathways were the most prevalent. In Caucasians, maternal gene variants showed the most prominent role in disease functions, whereas in African Americans it was fetal variants. The IPA software was used to generate molecular interaction maps that differed between races and also between maternal and fetal genetic variants. Conclusion Differences at the genetic level revealed distinct disease functions and operational pathways in African Americans and Caucasians in spontaneous preterm birth. Differences in maternal and fetal contributions in pregnancy outcome are also different between African Americans and Caucasians. These results present a set of explicit testable hypotheses regarding genetic associations with preterm birth in African Americans and Caucasians PMID:19527514
Deciphering the Developmental Dynamics of the Mouse Liver Transcriptome
Gunewardena, Sumedha S.; Yoo, Byunggil; Peng, Lai; Lu, Hong; Zhong, Xiaobo; Klaassen, Curtis D.; Cui, Julia Yue
2015-01-01
During development, liver undergoes a rapid transition from a hematopoietic organ to a major organ for drug metabolism and nutrient homeostasis. However, little is known on a transcriptome level of the genes and RNA-splicing variants that are differentially regulated with age, and which up-stream regulators orchestrate age-specific biological functions in liver. We used RNA-Seq to interrogate the developmental dynamics of the liver transcriptome in mice at 12 ages from late embryonic stage (2-days before birth) to maturity (60-days after birth). Among 21,889 unique NCBI RefSeq-annotated genes, 9,641 were significantly expressed in at least one age, 7,289 were differently regulated with age, and 859 had multiple (> = 2) RNA splicing-variants. Factor analysis showed that the dynamics of hepatic genes fall into six distinct groups based on their temporal expression. The average expression of cytokines, ion channels, kinases, phosphatases, transcription regulators and translation regulators decreased with age, whereas the average expression of peptidases, enzymes and transmembrane receptors increased with age. The average expression of growth factors peak between Day-3 and Day-10, and decrease thereafter. We identified critical biological functions, upstream regulators, and putative transcription modules that seem to govern age-specific gene expression. We also observed differential ontogenic expression of known splicing variants of certain genes, and 1,455 novel splicing isoform candidates. In conclusion, the hepatic ontogeny of the transcriptome ontogeny has unveiled critical networks and up-stream regulators that orchestrate age-specific biological functions in liver, and suggest that age contributes to the complexity of the alternative splicing landscape of the hepatic transcriptome. PMID:26496202
Deciphering the Developmental Dynamics of the Mouse Liver Transcriptome.
Gunewardena, Sumedha S; Yoo, Byunggil; Peng, Lai; Lu, Hong; Zhong, Xiaobo; Klaassen, Curtis D; Cui, Julia Yue
2015-01-01
During development, liver undergoes a rapid transition from a hematopoietic organ to a major organ for drug metabolism and nutrient homeostasis. However, little is known on a transcriptome level of the genes and RNA-splicing variants that are differentially regulated with age, and which up-stream regulators orchestrate age-specific biological functions in liver. We used RNA-Seq to interrogate the developmental dynamics of the liver transcriptome in mice at 12 ages from late embryonic stage (2-days before birth) to maturity (60-days after birth). Among 21,889 unique NCBI RefSeq-annotated genes, 9,641 were significantly expressed in at least one age, 7,289 were differently regulated with age, and 859 had multiple (> = 2) RNA splicing-variants. Factor analysis showed that the dynamics of hepatic genes fall into six distinct groups based on their temporal expression. The average expression of cytokines, ion channels, kinases, phosphatases, transcription regulators and translation regulators decreased with age, whereas the average expression of peptidases, enzymes and transmembrane receptors increased with age. The average expression of growth factors peak between Day-3 and Day-10, and decrease thereafter. We identified critical biological functions, upstream regulators, and putative transcription modules that seem to govern age-specific gene expression. We also observed differential ontogenic expression of known splicing variants of certain genes, and 1,455 novel splicing isoform candidates. In conclusion, the hepatic ontogeny of the transcriptome ontogeny has unveiled critical networks and up-stream regulators that orchestrate age-specific biological functions in liver, and suggest that age contributes to the complexity of the alternative splicing landscape of the hepatic transcriptome.
Satapathy, Lopamudra; Singh, Dharmendra; Ranjan, Prashant; Kumar, Dhananjay; Kumar, Manish; Prabhu, Kumble Vinod; Mukhopadhyay, Kunal
2014-12-01
WRKY, a plant-specific transcription factor family, has important roles in pathogen defense, abiotic cues and phytohormone signaling, yet little is known about their roles and molecular mechanism of function in response to rust diseases in wheat. We identified 100 TaWRKY sequences using wheat Expressed Sequence Tag database of which 22 WRKY sequences were novel. Identified proteins were characterized based on their zinc finger motifs and phylogenetic analysis clustered them into six clades consisting of class IIc and class III WRKY proteins. Functional annotation revealed major functions in metabolic and cellular processes in control plants; whereas response to stimuli, signaling and defense in pathogen inoculated plants, their major molecular function being binding to DNA. Tag-based expression analysis of the identified genes revealed differential expression between mock and Puccinia triticina inoculated wheat near isogenic lines. Gene expression was also performed with six rust-related microarray experiments at Gene Expression Omnibus database. TaWRKY10, 15, 17 and 56 were common in both tag-based and microarray-based differential expression analysis and could be representing rust specific WRKY genes. The obtained results will bestow insight into the functional characterization of WRKY transcription factors responsive to leaf rust pathogenesis that can be used as candidate genes in molecular breeding programs to improve biotic stress tolerance in wheat.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, Xiaohan; Ye, Chuyu; Bisaria, Anjali
2011-01-01
Populus is an important bioenergy crop for bioethanol production. A greater understanding of cell wall biosynthesis processes is critical in reducing biomass recalcitrance, a major hindrance in efficient generation of ethanol from lignocellulosic biomass. Here, we report the identification of candidate cell wall biosynthesis genes through the development and application of a novel bioinformatics pipeline. As a first step, via text-mining of PubMed publications, we obtained 121 Arabidopsis genes that had the experimental evidences supporting their involvement in cell wall biosynthesis or remodeling. The 121 genes were then used as bait genes to query an Arabidopsis co-expression database and additionalmore » genes were identified as neighbors of the bait genes in the network, increasing the number of genes to 548. The 548 Arabidopsis genes were then used to re-query the Arabidopsis co-expression database and re-construct a network that captured additional network neighbors, expanding to a total of 694 genes. The 694 Arabidopsis genes were computationally divided into 22 clusters. Queries of the Populus genome using the Arabidopsis genes revealed 817 Populus orthologs. Functional analysis of gene ontology and tissue-specific gene expression indicated that these Arabidopsis and Populus genes are high likelihood candidates for functional genomics in relation to cell wall biosynthesis.« less
Thiel, Heike; Varrelmann, Mark
2009-08-01
Beet necrotic yellow vein virus (BNYVV) induces the most important disease threatening sugar beet. The growth of partially resistant hybrids carrying monogenic dominant resistance genes stabilize yield but are unable to entirely prevent virus infection and replication. P25 is responsible for symptom development and previous studies have shown that recently occurring resistance-breaking isolates possess increased P25 variability. To better understand the viral pathogenicity factor's interplay with plant proteins and to possibly unravel the molecular basis of sugar beet antivirus resistance, P25 was applied in a yeast two-hybrid screen of a resistant sugar beet cDNA library. This screen identified candidate proteins recognized as orthologues from other plant species which are known to be expressed following pathogen infection and involved in plant defense response. Most of the candidates potentially related to host-pathogen interactions were involved in the ubiquitylation process and plants response to stress, and were part of cell and metabolism components. The interaction of several candidate genes with P25 was confirmed in Nicotiana benthamiana leaf cells by transient agrobacterium-mediated expression applying bimolecular fluorescence complementation assay. The putative functions of several of the candidates identified support previous findings and present first targets for understanding the BNYVV pathogenicity and antivirus resistance mechanism.
Paulo, Paula; Maia, Sofia; Pinto, Carla; Pinto, Pedro; Monteiro, Augusta; Peixoto, Ana; Teixeira, Manuel R
2018-04-01
Considering that mutations in known prostate cancer (PrCa) predisposition genes, including those responsible for hereditary breast/ovarian cancer and Lynch syndromes, explain less than 5% of early-onset/familial PrCa, we have sequenced 94 genes associated with cancer predisposition using next generation sequencing (NGS) in a series of 121 PrCa patients. We found monoallelic truncating/functionally deleterious mutations in seven genes, including ATM and CHEK2, which have previously been associated with PrCa predisposition, and five new candidate PrCa associated genes involved in cancer predisposing recessive disorders, namely RAD51C, FANCD2, FANCI, CEP57 and RECQL4. Furthermore, using in silico pathogenicity prediction of missense variants among 18 genes associated with breast/ovarian cancer and/or Lynch syndrome, followed by KASP genotyping in 710 healthy controls, we identified "likely pathogenic" missense variants in ATM, BRIP1, CHEK2 and TP53. In conclusion, this study has identified putative PrCa predisposing germline mutations in 14.9% of early-onset/familial PrCa patients. Further data will be necessary to confirm the genetic heterogeneity of inherited PrCa predisposition hinted in this study.
Jiang, Bo; Zhang, Yong; She, Chang; Zhao, Jiaju; Zhou, Kailong; Zuo, Zhicheng; Zhou, Xiaozhong; Wang, Peiji; Dong, Qirong
2017-09-01
It is well known that moderate to high doses of ionizing radiation have a toxic effect on the organism. However, there are few experimental studies on the mechanisms of LDR ionizing radiation on nerve regeneration after peripheral nerve injury. We established the rats' peripheral nerve injury model via repaired Peripheral nerve injury nerve, vascular endothelial growth factor a and Growth associated protein-43 were detected from different treatment groups. We performed transcriptome sequencing focusing on investigating the differentially expressed genes and gene functions between the control group and 1Gy group. Sequencing was done by using high-throughput RNA-sequencing (RNA-seq) technologies. The results showed the 1Gy group to be the most effective promoting repair. RNA-sequencing identified 619 differently expressed genes between control and treated groups. A Gene Ontology analysis of the differentially expressed genes revealed enrichment in the functional pathways. Among them, candidate genes associated with nerve repair were identified. Pathways involved in cell-substrate adhesion, vascular smooth muscle contraction and cell adhesion molecule signaling may be involved in recovery from peripheral nerve injury. Copyright © 2017. Published by Elsevier B.V.
Exploring the Transcriptome of Ciliated Cells Using In Silico Dissection of Human Tissues
Ivliev, Alexander E.; 't Hoen, Peter A. C.; van Roon-Mom, Willeke M. C.; Peters, Dorien J. M.; Sergeeva, Marina G.
2012-01-01
Cilia are cell organelles that play important roles in cell motility, sensory and developmental functions and are involved in a range of human diseases, known as ciliopathies. Here, we search for novel human genes related to cilia using a strategy that exploits the previously reported tendency of cell type-specific genes to be coexpressed in the transcriptome of complex tissues. Gene coexpression networks were constructed using the noise-resistant WGCNA algorithm in 12 publicly available microarray datasets from human tissues rich in motile cilia: airways, fallopian tubes and brain. A cilia-related coexpression module was detected in 10 out of the 12 datasets. A consensus analysis of this module's gene composition recapitulated 297 known and predicted 74 novel cilia-related genes. 82% of the novel candidates were supported by tissue-specificity expression data from GEO and/or proteomic data from the Human Protein Atlas. The novel findings included a set of genes (DCDC2, DYX1C1, KIAA0319) related to a neurological disease dyslexia suggesting their potential involvement in ciliary functions. Furthermore, we searched for differences in gene composition of the ciliary module between the tissues. A multidrug-and-toxin extrusion transporter MATE2 (SLC47A2) was found as a brain-specific central gene in the ciliary module. We confirm the localization of MATE2 in cilia by immunofluorescence staining using MDCK cells as a model. While MATE2 has previously gained attention as a pharmacologically relevant transporter, its potential relation to cilia is suggested for the first time. Taken together, our large-scale analysis of gene coexpression networks identifies novel genes related to human cell cilia. PMID:22558177
Bassuk, Alexander G.; Muthuswamy, Lakshmi B.; Boland, Riley; Smith, Tiffany L.; Hulstrand, Alissa M.; Northrup, Hope; Hakeman, Matthew; Dierdorff, Jason M.; Yung, Christina K.; Long, Abby; Brouillette, Rachel B.; Au, Kit Sing; Gurnett, Christina; Houston, Douglas W.; Cornell, Robert A.; Manak, J. Robert
2013-01-01
Neural tube defects (NTDs) are common birth defects of complex etiology. Family and population-based studies have confirmed a genetic component to NTDs. However, despite more than three decades of research, the genes involved in human NTDs remain largely unknown. We tested the hypothesis that rare copy number variants (CNVs), especially de novo germline CNVs, are a significant risk factor for NTDs. We used array-based comparative genomic hybridization (aCGH) to identify rare CNVs in 128 Caucasian and 61 Hispanic patients with non-syndromic lumbar-sacral myelomeningocele. We also performed aCGH analysis on the parents of affected individuals with rare CNVs where parental DNA was available (42 sets). Among the eight de novo CNVs that we identified, three generated copy number changes of entire genes. One large heterozygous deletion removed 27 genes, including PAX3, a known spina bifida-associated gene. A second CNV altered genes (PGPD8, ZC3H6) for which little is known regarding function or expression. A third heterozygous deletion removed GPC5 and part of GPC6, genes encoding glypicans. Glypicans are proteoglycans that modulate the activity of morphogens such as Sonic Hedgehog (SHH) and bone morphogenetic proteins (BMPs), both of which have been implicated in NTDs. Additionally, glypicans function in the planar cell polarity (PCP) pathway, and several PCP genes have been associated with NTDs. Here, we show that GPC5 orthologs are expressed in the neural tube, and that inhibiting their expression in frog and fish embryos results in NTDs. These results implicate GPC5 as a gene required for normal neural tube development. PMID:23223018
Abruzzi, Katharine C; Zadina, Abigail; Luo, Weifei; Wiyanto, Evelyn; Rahman, Reazur; Guo, Fang; Shafer, Orie; Rosbash, Michael
2017-02-01
Locomotor activity rhythms are controlled by a network of ~150 circadian neurons within the adult Drosophila brain. They are subdivided based on their anatomical locations and properties. We profiled transcripts "around the clock" from three key groups of circadian neurons with different functions. We also profiled a non-circadian outgroup, dopaminergic (TH) neurons. They have cycling transcripts but fewer than clock neurons as well as low expression and poor cycling of clock gene transcripts. This suggests that TH neurons do not have a canonical circadian clock and that their gene expression cycling is driven by brain systemic cues. The three circadian groups are surprisingly diverse in their cycling transcripts and overall gene expression patterns, which include known and putative novel neuropeptides. Even the overall phase distributions of cycling transcripts are distinct, indicating that different regulatory principles govern transcript oscillations. This surprising cell-type diversity parallels the functional heterogeneity of the different neurons.
BioGPS and MyGene.info: organizing online, gene-centric information.
Wu, Chunlei; Macleod, Ian; Su, Andrew I
2013-01-01
Fast-evolving technologies have enabled researchers to easily generate data at genome scale, and using these technologies to compare biological states typically results in a list of candidate genes. Researchers are then faced with the daunting task of prioritizing these candidate genes for follow-up studies. There are hundreds, possibly even thousands, of web-based gene annotation resources available, but it quickly becomes impractical to manually access and review all of these sites for each gene in a candidate gene list. BioGPS (http://biogps.org) was created as a centralized gene portal for aggregating distributed gene annotation resources, emphasizing community extensibility and user customizability. BioGPS serves as a convenient tool for users to access known gene-centric resources, as well as a mechanism to discover new resources that were previously unknown to the user. This article describes updates to BioGPS made after its initial release in 2008. We summarize recent additions of features and data, as well as the robust user activity that underlies this community intelligence application. Finally, we describe MyGene.info (http://mygene.info) and related web services that provide programmatic access to BioGPS.
Lempereur, Laetitia; Larcombe, Stephen D; Durrani, Zeeshan; Karagenc, Tulin; Bilgic, Huseyin Bilgin; Bakirci, Serkan; Hacilarlioglu, Selin; Kinnaird, Jane; Thompson, Joanne; Weir, William; Shiels, Brian
2017-06-05
Vector-borne apicomplexan parasites are a major cause of mortality and morbidity to humans and livestock globally. The most important disease syndromes caused by these parasites are malaria, babesiosis and theileriosis. Strategies for control often target parasite stages in the mammalian host that cause disease, but this can result in reservoir infections that promote pathogen transmission and generate economic loss. Optimal control strategies should protect against clinical disease, block transmission and be applicable across related genera of parasites. We have used bioinformatics and transcriptomics to screen for transmission-blocking candidate antigens in the tick-borne apicomplexan parasite, Theileria annulata. A number of candidate antigen genes were identified which encoded amino acid domains that are conserved across vector-borne Apicomplexa (Babesia, Plasmodium and Theileria), including the Pfs48/45 6-cys domain and a novel cysteine-rich domain. Expression profiling confirmed that selected candidate genes are expressed by life cycle stages within infected ticks. Additionally, putative B cell epitopes were identified in the T. annulata gene sequences encoding the 6-cys and cysteine rich domains, in a gene encoding a putative papain-family cysteine peptidase, with similarity to the Plasmodium SERA family, and the gene encoding the T. annulata major merozoite/piroplasm surface antigen, Tams1. Candidate genes were identified that encode proteins with similarity to known transmission blocking candidates in related parasites, while one is a novel candidate conserved across vector-borne apicomplexans and has a potential role in the sexual phase of the life cycle. The results indicate that a 'One Health' approach could be utilised to develop a transmission-blocking strategy effective against vector-borne apicomplexan parasites of animals and humans.
Jayaswall, Kuldip; Mahajan, Pallavi; Singh, Gagandeep; Parmar, Rajni; Seth, Romit; Raina, Aparnashree; Swarnkar, Mohit Kumar; Singh, Anil Kumar; Shankar, Ravi; Sharma, Ram Kumar
2016-01-01
To unravel the molecular mechanism of defense against blister blight (BB) disease caused by an obligate biotrophic fungus, Exobasidium vexans, transcriptome of BB interaction with resistance and susceptible tea genotypes was analysed through RNA-seq using Illumina GAIIx at four different stages during ~20-day disease cycle. Approximately 69 million high quality reads were assembled de novo, yielding 37,790 unique transcripts with more than 55% being functionally annotated. Differentially expressed, 149 defense related transcripts/genes, namely defense related enzymes, resistance genes, multidrug resistant transporters, transcription factors, retrotransposons, metacaspases and chaperons were observed in RG, suggesting their role in defending against BB. Being present in the major hub, putative master regulators among these candidates were identified from predetermined protein-protein interaction network of Arabidopsis thaliana. Further, confirmation of abundant expression of well-known RPM1, RPS2 and RPP13 in quantitative Real Time PCR indicates salicylic acid and jasmonic acid, possibly induce synthesis of antimicrobial compounds, required to overcome the virulence of E. vexans. Compendiously, the current study provides a comprehensive gene expression and insights into the molecular mechanism of tea defense against BB to serve as a resource for unravelling the possible regulatory mechanism of immunity against various biotic stresses in tea and other crops. PMID:27465480
NASA Astrophysics Data System (ADS)
Jayaswall, Kuldip; Mahajan, Pallavi; Singh, Gagandeep; Parmar, Rajni; Seth, Romit; Raina, Aparnashree; Swarnkar, Mohit Kumar; Singh, Anil Kumar; Shankar, Ravi; Sharma, Ram Kumar
2016-07-01
To unravel the molecular mechanism of defense against blister blight (BB) disease caused by an obligate biotrophic fungus, Exobasidium vexans, transcriptome of BB interaction with resistance and susceptible tea genotypes was analysed through RNA-seq using Illumina GAIIx at four different stages during ~20-day disease cycle. Approximately 69 million high quality reads were assembled de novo, yielding 37,790 unique transcripts with more than 55% being functionally annotated. Differentially expressed, 149 defense related transcripts/genes, namely defense related enzymes, resistance genes, multidrug resistant transporters, transcription factors, retrotransposons, metacaspases and chaperons were observed in RG, suggesting their role in defending against BB. Being present in the major hub, putative master regulators among these candidates were identified from predetermined protein-protein interaction network of Arabidopsis thaliana. Further, confirmation of abundant expression of well-known RPM1, RPS2 and RPP13 in quantitative Real Time PCR indicates salicylic acid and jasmonic acid, possibly induce synthesis of antimicrobial compounds, required to overcome the virulence of E. vexans. Compendiously, the current study provides a comprehensive gene expression and insights into the molecular mechanism of tea defense against BB to serve as a resource for unravelling the possible regulatory mechanism of immunity against various biotic stresses in tea and other crops.
Large-scale gene-centric analysis identifies novel variants for coronary artery disease.
2011-09-01
Coronary artery disease (CAD) has a significant genetic contribution that is incompletely characterized. To complement genome-wide association (GWA) studies, we conducted a large and systematic candidate gene study of CAD susceptibility, including analysis of many uncommon and functional variants. We examined 49,094 genetic variants in ∼2,100 genes of cardiovascular relevance, using a customised gene array in 15,596 CAD cases and 34,992 controls (11,202 cases and 30,733 controls of European descent; 4,394 cases and 4,259 controls of South Asian origin). We attempted to replicate putative novel associations in an additional 17,121 CAD cases and 40,473 controls. Potential mechanisms through which the novel variants could affect CAD risk were explored through association tests with vascular risk factors and gene expression. We confirmed associations of several previously known CAD susceptibility loci (eg, 9p21.3:p<10(-33); LPA:p<10(-19); 1p13.3:p<10(-17)) as well as three recently discovered loci (COL4A1/COL4A2, ZC3HC1, CYP17A1:p<5×10(-7)). However, we found essentially null results for most previously suggested CAD candidate genes. In our replication study of 24 promising common variants, we identified novel associations of variants in or near LIPA, IL5, TRIB1, and ABCG5/ABCG8, with per-allele odds ratios for CAD risk with each of the novel variants ranging from 1.06-1.09. Associations with variants at LIPA, TRIB1, and ABCG5/ABCG8 were supported by gene expression data or effects on lipid levels. Apart from the previously reported variants in LPA, none of the other ∼4,500 low frequency and functional variants showed a strong effect. Associations in South Asians did not differ appreciably from those in Europeans, except for 9p21.3 (per-allele odds ratio: 1.14 versus 1.27 respectively; P for heterogeneity = 0.003). This large-scale gene-centric analysis has identified several novel genes for CAD that relate to diverse biochemical and cellular functions and clarified the literature with regard to many previously suggested genes.
Investigating Gene Function in Cereal Rust Fungi by Plant-Mediated Virus-Induced Gene Silencing.
Panwar, Vinay; Bakkeren, Guus
2017-01-01
Cereal rust fungi are destructive pathogens, threatening grain production worldwide. Targeted breeding for resistance utilizing host resistance genes has been effective. However, breakdown of resistance occurs frequently and continued efforts are needed to understand how these fungi overcome resistance and to expand the range of available resistance genes. Whole genome sequencing, transcriptomic and proteomic studies followed by genome-wide computational and comparative analyses have identified large repertoire of genes in rust fungi among which are candidates predicted to code for pathogenicity and virulence factors. Some of these genes represent defence triggering avirulence effectors. However, functions of most genes still needs to be assessed to understand the biology of these obligate biotrophic pathogens. Since genetic manipulations such as gene deletion and genetic transformation are not yet feasible in rust fungi, performing functional gene studies is challenging. Recently, Host-induced gene silencing (HIGS) has emerged as a useful tool to characterize gene function in rust fungi while infecting and growing in host plants. We utilized Barley stripe mosaic virus-mediated virus induced gene silencing (BSMV-VIGS) to induce HIGS of candidate rust fungal genes in the wheat host to determine their role in plant-fungal interactions. Here, we describe the methods for using BSMV-VIGS in wheat for functional genomics study in cereal rust fungi.
Mukherjee, Shubhabrata; Russell, Joshua C; Carr, Daniel T; Burgess, Jeremy D; Allen, Mariet; Serie, Daniel J; Boehme, Kevin L; Kauwe, John S K; Naj, Adam C; Fardo, David W; Dickson, Dennis W; Montine, Thomas J; Ertekin-Taner, Nilufer; Kaeberlein, Matt R; Crane, Paul K
2017-10-01
We sought to determine whether a systems biology approach may identify novel late-onset Alzheimer's disease (LOAD) loci. We performed gene-wide association analyses and integrated results with human protein-protein interaction data using network analyses. We performed functional validation on novel genes using a transgenic Caenorhabditis elegans Aβ proteotoxicity model and evaluated novel genes using brain expression data from people with LOAD and other neurodegenerative conditions. We identified 13 novel candidate LOAD genes outside chromosome 19. Of those, RNA interference knockdowns of the C. elegans orthologs of UBC, NDUFS3, EGR1, and ATP5H were associated with Aβ toxicity, and NDUFS3, SLC25A11, ATP5H, and APP were differentially expressed in the temporal cortex. Network analyses identified novel LOAD candidate genes. We demonstrated a functional role for four of these in a C. elegans model and found enrichment of differentially expressed genes in the temporal cortex. Copyright © 2017 the Alzheimer's Association. Published by Elsevier Inc. All rights reserved.
NDRC: A Disease-Causing Genes Prioritized Method Based on Network Diffusion and Rank Concordance.
Fang, Minghong; Hu, Xiaohua; Wang, Yan; Zhao, Junmin; Shen, Xianjun; He, Tingting
2015-07-01
Disease-causing genes prioritization is very important to understand disease mechanisms and biomedical applications, such as design of drugs. Previous studies have shown that promising candidate genes are mostly ranked according to their relatedness to known disease genes or closely related disease genes. Therefore, a dangling gene (isolated gene) with no edges in the network can not be effectively prioritized. These approaches tend to prioritize those genes that are highly connected in the PPI network while perform poorly when they are applied to loosely connected disease genes. To address these problems, we propose a new disease-causing genes prioritization method that based on network diffusion and rank concordance (NDRC). The method is evaluated by leave-one-out cross validation on 1931 diseases in which at least one gene is known to be involved, and it is able to rank the true causal gene first in 849 of all 2542 cases. The experimental results suggest that NDRC significantly outperforms other existing methods such as RWR, VAVIEN, DADA and PRINCE on identifying loosely connected disease genes and successfully put dangling genes as potential candidate disease genes. Furthermore, we apply NDRC method to study three representative diseases, Meckel syndrome 1, Protein C deficiency and Peroxisome biogenesis disorder 1A (Zellweger). Our study has also found that certain complex disease-causing genes can be divided into several modules that are closely associated with different disease phenotype.
Retrieval of Enterobacteriaceae drug targets using singular value decomposition.
Silvério-Machado, Rita; Couto, Bráulio R G M; Dos Santos, Marcos A
2015-04-15
The identification of potential drug target proteins in bacteria is important in pharmaceutical research for the development of new antibiotics to combat bacterial agents that cause diseases. A new model that combines the singular value decomposition (SVD) technique with biological filters composed of a set of protein properties associated with bacterial drug targets and similarity to protein-coding essential genes of Escherichia coli (strain K12) has been created to predict potential antibiotic drug targets in the Enterobacteriaceae family. This model identified 99 potential drug target proteins in the studied family, which exhibit eight different functions and are protein-coding essential genes or similar to protein-coding essential genes of E.coli (strain K12), indicating that the disruption of the activities of these proteins is critical for cells. Proteins from bacteria with described drug resistance were found among the retrieved candidates. These candidates have no similarity to the human proteome, therefore exhibiting the advantage of causing no adverse effects or at least no known adverse effects on humans. rita_silverio@hotmail.com. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Park, Dong-Soo; Lee, Sang-Kyu; Lee, Jong-Hee; Song, Min-Young; Song, Song-Yi; Kwak, Do-Yeon; Yeo, Un-Sang; Jeon, Nam-Soo; Park, Soo-Kwon; Yi, Gihwan; Song, You-Chun; Nam, Min-Hee; Ku, Yeon-Chung; Jeon, Jong-Seong
2007-08-01
The development of rice varieties (Oryza sativa L.) that are resistant to the brown planthopper (BPH; Nilaparvata lugens Stål) is an important objective in current breeding programs. In this study, we generated 132 BC(5)F(5) near-isogenic rice lines (NILs) by five backcrosses of Samgangbyeo, a BPH resistant indica variety carrying the Bph1 locus, with Nagdongbyeo, a BPH susceptible japonica variety. To identify genes that confer BPH resistance, we employed representational difference analysis (RDA) to detect transcripts that were exclusively expressed in one of our BPH resistant NIL, SNBC61, during insect feeding. The chromosomal mapping of the RDA clones that we subsequently isolated revealed that they are located in close proximity either to known quantitative trait loci or to an introgressed SSR marker from the BPH resistant donor parent Samgangbyeo. Genomic DNA gel-blot analysis further revealed that loci of all RDA clones in SNBC61 correspond to the alleles of Samgangbyeo. Most of the RDA clones were found to be exclusively expressed in SNBC61 and could be assigned to functional groups involved in plant defense. These RDA clones therefore represent candidate defense genes for BPH resistance.
Acland, Gregory M.
2014-01-01
Considerable clinical and molecular variations have been known in retinal blinding diseases in man and also in dogs. Different forms of retinal diseases occur in specific breed(s) caused by mutations segregating within each isolated breeding population. While molecular studies to find genes and mutations underlying retinal diseases in dogs have benefited largely from the phenotypic and genetic uniformity within a breed, within- and across-breed variations have often played a key role in elucidating the molecular basis. The increasing knowledge of phenotypic, allelic, and genetic heterogeneities in canine retinal degeneration has shown that the overall picture is rather more complicated than initially thought. Over the past 20 years, various approaches have been developed and tested to search for genes and mutations underlying genetic traits in dogs, depending on the availability of genetic tools and sample resources. Candidate gene, linkage analysis, and genome-wide association studies have so far identified 24 mutations in 18 genes underlying retinal diseases in at least 58 dog breeds. Many of these genes have been associated with retinal diseases in humans, thus providing opportunities to study the role in pathogenesis and in normal vision. Application in therapeutic interventions such as gene therapy has proven successful initially in a naturally occurring dog model followed by trials in human patients. Other genes whose human homologs have not been associated with retinal diseases are potential candidates to explain equivalent human diseases and contribute to the understanding of their function in vision. PMID:22065099
Miyadera, Keiko; Acland, Gregory M; Aguirre, Gustavo D
2012-02-01
Considerable clinical and molecular variations have been known in retinal blinding diseases in man and also in dogs. Different forms of retinal diseases occur in specific breed(s) caused by mutations segregating within each isolated breeding population. While molecular studies to find genes and mutations underlying retinal diseases in dogs have benefited largely from the phenotypic and genetic uniformity within a breed, within- and across-breed variations have often played a key role in elucidating the molecular basis. The increasing knowledge of phenotypic, allelic, and genetic heterogeneities in canine retinal degeneration has shown that the overall picture is rather more complicated than initially thought. Over the past 20 years, various approaches have been developed and tested to search for genes and mutations underlying genetic traits in dogs, depending on the availability of genetic tools and sample resources. Candidate gene, linkage analysis, and genome-wide association studies have so far identified 24 mutations in 18 genes underlying retinal diseases in at least 58 dog breeds. Many of these genes have been associated with retinal diseases in humans, thus providing opportunities to study the role in pathogenesis and in normal vision. Application in therapeutic interventions such as gene therapy has proven successful initially in a naturally occurring dog model followed by trials in human patients. Other genes whose human homologs have not been associated with retinal diseases are potential candidates to explain equivalent human diseases and contribute to the understanding of their function in vision.
Ultra Deep Sequencing of Listeria monocytogenes sRNA Transcriptome Revealed New Antisense RNAs
Behrens, Sebastian; Widder, Stefanie; Mannala, Gopala Krishna; Qing, Xiaoxing; Madhugiri, Ramakanth; Kefer, Nathalie; Mraheil, Mobarak Abu; Rattei, Thomas; Hain, Torsten
2014-01-01
Listeria monocytogenes, a gram-positive pathogen, and causative agent of listeriosis, has become a widely used model organism for intracellular infections. Recent studies have identified small non-coding RNAs (sRNAs) as important factors for regulating gene expression and pathogenicity of L. monocytogenes. Increased speed and reduced costs of high throughput sequencing (HTS) techniques have made RNA sequencing (RNA-Seq) the state-of-the-art method to study bacterial transcriptomes. We created a large transcriptome dataset of L. monocytogenes containing a total of 21 million reads, using the SOLiD sequencing technology. The dataset contained cDNA sequences generated from L. monocytogenes RNA collected under intracellular and extracellular condition and additionally was size fractioned into three different size ranges from <40 nt, 40–150 nt and >150 nt. We report here, the identification of nine new sRNAs candidates of L. monocytogenes and a reevaluation of known sRNAs of L. monocytogenes EGD-e. Automatic comparison to known sRNAs revealed a high recovery rate of 55%, which was increased to 90% by manual revision of the data. Moreover, thorough classification of known sRNAs shed further light on their possible biological functions. Interestingly among the newly identified sRNA candidates are antisense RNAs (asRNAs) associated to the housekeeping genes purA, fumC and pgi and potentially their regulation, emphasizing the significance of sRNAs for metabolic adaptation in L. monocytogenes. PMID:24498259
Current Understanding of Usher Syndrome Type II
Yang, Jun; Wang, Le; Song, Hongman; Sokolov, Maxim
2012-01-01
Usher syndrome is the most common deafness-blindness caused by genetic mutations. To date, three genes have been identified underlying the most prevalent form of Usher syndrome, the type II form (USH2). The proteins encoded by these genes are demonstrated to form a complex in vivo. This complex is localized mainly at the periciliary membrane complex in photoreceptors and the ankle-link of the stereocilia in hair cells. Many proteins have been found to interact with USH2 proteins in vitro, suggesting that they are potential additional components of this USH2 complex and that the genes encoding these proteins may be the candidate USH2 genes. However, further investigations are critical to establish their existence in the USH2 complex in vivo. Based on the predicted functional domains in USH2 proteins, their cellular localizations in photoreceptors and hair cells, the observed phenotypes in USH2 mutant mice, and the known knowledge about diseases similar to USH2, putative biological functions of the USH2 complex have been proposed. Finally, therapeutic approaches for this group of diseases are now being actively explored. PMID:22201796
Cheng, Yulin; Wu, Kuan; Yao, Juanni; Li, Shumin; Wang, Xiaojie; Huang, Lili; Kang, Zhensheng
2017-05-01
During the infection of host plants, pathogens can deliver virulence-associated 'effector' proteins to promote plant susceptibility. However, little is known about effector function in the obligate biotrophic pathogen Puccinia striiformis f. sp. tritici (Pst) that is an important fungal pathogen in wheat production worldwide. Here, they report their findings on an in planta highly induced candidate effector from Pst, PSTha5a23. The PSTha5a23 gene is unique to Pst and shows a low level of intra-species polymorphism. It has a functional N-terminal signal peptide and is translocated to the host cytoplasm after infection. Overexpression of PSTha5a23 in Nicotiana benthamiana was found to suppress the programmed cell death triggered by BAX, PAMP-INF1 and two resistance-related mitogen-activated protein kinases (MKK1 and NPK1). Overexpression of PSTha5a23 in wheat also suppressed pattern-triggered immunity (PTI)-associated callose deposition. In addition, silencing of PSTha5a23 did not change Pst virulence phenotypes; however, overexpression of PSTha5a23 significantly enhanced Pst virulence in wheat. These results indicate that the Pst candidate effector PSTha5a23 plays an important role in plant defense suppression and rust pathogenicity, and also highlight the utility of gene overexpression in plants as a tool for studying effectors from obligate biotrophic pathogens. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.
Voltage-gated proton channel in a dinoflagellate
Smith, Susan M. E.; Morgan, Deri; Musset, Boris; Cherny, Vladimir V.; Place, Allen R.; Hastings, J. Woodland; DeCoursey, Thomas E.
2011-01-01
Fogel and Hastings first hypothesized the existence of voltage-gated proton channels in 1972 in bioluminescent dinoflagellates, where they were thought to trigger the flash by activating luciferase. Proton channel genes were subsequently identified in human, mouse, and Ciona intestinalis, but their existence in dinoflagellates remained unconfirmed. We identified a candidate proton channel gene from a Karlodinium veneficum cDNA library based on homology with known proton channel genes. K. veneficum is a predatory, nonbioluminescent dinoflagellate that produces toxins responsible for fish kills worldwide. Patch clamp studies on the heterologously expressed gene confirm that it codes for a genuine voltage-gated proton channel, kHV1: it is proton-specific and activated by depolarization, its gH–V relationship shifts with changes in external or internal pH, and mutation of the selectivity filter (which we identify as Asp51) results in loss of proton-specific conduction. Indirect evidence suggests that kHV1 is monomeric, unlike other proton channels. Furthermore, kHV1 differs from all known proton channels in activating well negative to the Nernst potential for protons, EH. This unique voltage dependence makes the dinoflagellate proton channel ideally suited to mediate the proton influx postulated to trigger bioluminescence. In contrast to vertebrate proton channels, whose main function is acid extrusion, we propose that proton channels in dinoflagellates have fundamentally different functions of signaling and excitability. PMID:22006335
The Genetic Basis for Variation in Sensitivity to Lead Toxicity in Drosophila melanogaster
Zhou, Shanshan; Morozova, Tatiana V.; Hussain, Yasmeen N.; Luoma, Sarah E.; McCoy, Lenovia; Yamamoto, Akihiko; Mackay, Trudy F.C.; Anholt, Robert R.H.
2016-01-01
Background: Lead toxicity presents a worldwide health problem, especially due to its adverse effects on cognitive development in children. However, identifying genes that give rise to individual variation in susceptibility to lead toxicity is challenging in human populations. Objectives: Our goal was to use Drosophila melanogaster to identify evolutionarily conserved candidate genes associated with individual variation in susceptibility to lead exposure. Methods: To identify candidate genes associated with variation in susceptibility to lead toxicity, we measured effects of lead exposure on development time, viability and adult activity in the Drosophila melanogaster Genetic Reference Panel (DGRP) and performed genome-wide association analyses to identify candidate genes. We used mutants to assess functional causality of candidate genes and constructed a genetic network associated with variation in sensitivity to lead exposure, on which we could superimpose human orthologs. Results: We found substantial heritabilities for all three traits and identified candidate genes associated with variation in susceptibility to lead exposure for each phenotype. The genetic architectures that determine variation in sensitivity to lead exposure are highly polygenic. Gene ontology and network analyses showed enrichment of genes associated with early development and function of the nervous system. Conclusions: Drosophila melanogaster presents an advantageous model to study the genetic underpinnings of variation in susceptibility to lead toxicity. Evolutionary conservation of cellular pathways that respond to toxic exposure allows predictions regarding orthologous genes and pathways across phyla. Thus, studies in the D. melanogaster model system can identify candidate susceptibility genes to guide subsequent studies in human populations. Citation: Zhou S, Morozova TV, Hussain YN, Luoma SE, McCoy L, Yamamoto A, Mackay TF, Anholt RR. 2016. The genetic basis for variation in sensitivity to lead toxicity in Drosophila melanogaster. Environ Health Perspect 124:1062–1070; http://dx.doi.org/10.1289/ehp.1510513 PMID:26859824
Moon, Myungjin; Nakai, Kenta
2018-04-01
Currently, cancer biomarker discovery is one of the important research topics worldwide. In particular, detecting significant genes related to cancer is an important task for early diagnosis and treatment of cancer. Conventional studies mostly focus on genes that are differentially expressed in different states of cancer; however, noise in gene expression datasets and insufficient information in limited datasets impede precise analysis of novel candidate biomarkers. In this study, we propose an integrative analysis of gene expression and DNA methylation using normalization and unsupervised feature extractions to identify candidate biomarkers of cancer using renal cell carcinoma RNA-seq datasets. Gene expression and DNA methylation datasets are normalized by Box-Cox transformation and integrated into a one-dimensional dataset that retains the major characteristics of the original datasets by unsupervised feature extraction methods, and differentially expressed genes are selected from the integrated dataset. Use of the integrated dataset demonstrated improved performance as compared with conventional approaches that utilize gene expression or DNA methylation datasets alone. Validation based on the literature showed that a considerable number of top-ranked genes from the integrated dataset have known relationships with cancer, implying that novel candidate biomarkers can also be acquired from the proposed analysis method. Furthermore, we expect that the proposed method can be expanded for applications involving various types of multi-omics datasets.
Joy, Nisha; Soniya, Eppurathu Vasudevan
2012-06-01
Plant miRNAs (18-24nt) are generated by the RNase III-type Dicer endonuclease from the endogenous hairpin precursors ('pre-miRNAs') with significant regulatory functions. The transcribed regions display a higher frequency of microsatellites, when compared to other regions of the genomic DNA. Simple sequence repeats (SSRs) resulting from replication slippage occurring in transcripts affect the expression of genes. The available experimental evidence for the incidence of SSRs in the miRNA precursors is limited. Considering the potential significance of SSRs in the miRNA genes, we carried out a preliminary analysis to verify the presence of SSRs in the pri-miRNAs of black pepper (Piper nigrum L.). We isolated a (CT) dinucleotide SSR bearing transcript using SMART strategy. The transcript was predicted to be a 'pri-miRNA candidate' with Dicer sites based on miRNA prediction tools and MFOLD structural predictions. The presence of this 'miRNA candidate' was confirmed by real-time TaqMan assays. The upstream sequence of the 'miRNA candidate' by genome walking when subjected to PlantCARE showed the presence of certain promoter elements, and the deduced amino acid showed significant similarity with NAP1 gene, which affects the transcription of many genes. Moreover the hairpin-like precursor overlapped the neighbouring NAP1 gene. In silico analysis revealed distinct putative functions for the 'miRNA candidate', of which majority were related to growth. Hence, we assume that this 'miRNA candidate' may get activated during transcription of NAP gene, thereby regulating the expression of many genes involved in developmental processes.
De novo sequencing and analysis of the transcriptome of Panax ginseng in the leaf-expansion period.
Liu, Shichao; Wang, Siming; Liu, Meichen; Yang, Fei; Zhang, Hui; Liu, Shiyang; Wang, Qun; Zhao, Yu
2016-08-01
Panax ginseng, a traditional Chinese medicine, is used worldwide for its variety of health benefits and its treatment efficacy. However, it is difficult to cultivate due to its vulnerability to environmental stresses. The present study provided the first report, to the best of our knowledge, of transcriptome analysis of ginseng at the leaf‑expansion stage. Using the Illumina sequencing platform, >40,000,000 high‑quality paired‑end reads were obtained and assembled into 100,533 unique sequences. When the sequences were searched against the publicly available National Center for Biotechnology Information protein database using The Basic Local Alignment Search Tool, 61,599 sequences exhibited similarity to known proteins. Functional annotation and classification, including use of the Gene Ontology, Clusters of Orthologous Groups, and Kyoto Encyclopedia of Genes and Genomes databases, revealed that the activated genes in ginseng were predominantly ribonuclease‑like storage genes, environmental stress genes, pathogenesis-related genes and other antioxidant genes. A number of candidate genes in environmental stress‑associated pathways were also identified. These novel data provide useful information on the growth and development stages of ginseng, and serve as an important public information platform for further understanding of the molecular mechanisms and functional genomics of ginseng.
Li, Min; Li, Qi; Ganegoda, Gamage Upeksha; Wang, JianXin; Wu, FangXiang; Pan, Yi
2014-11-01
Identification of disease-causing genes among a large number of candidates is a fundamental challenge in human disease studies. However, it is still time-consuming and laborious to determine the real disease-causing genes by biological experiments. With the advances of the high-throughput techniques, a large number of protein-protein interactions have been produced. Therefore, to address this issue, several methods based on protein interaction network have been proposed. In this paper, we propose a shortest path-based algorithm, named SPranker, to prioritize disease-causing genes in protein interaction networks. Considering the fact that diseases with similar phenotypes are generally caused by functionally related genes, we further propose an improved algorithm SPGOranker by integrating the semantic similarity of GO annotations. SPGOranker not only considers the topological similarity between protein pairs in a protein interaction network but also takes their functional similarity into account. The proposed algorithms SPranker and SPGOranker were applied to 1598 known orphan disease-causing genes from 172 orphan diseases and compared with three state-of-the-art approaches, ICN, VS and RWR. The experimental results show that SPranker and SPGOranker outperform ICN, VS, and RWR for the prioritization of orphan disease-causing genes. Importantly, for the case study of severe combined immunodeficiency, SPranker and SPGOranker predict several novel causal genes.
Dennenmoser, Stefan; Vamosi, Steven M; Nolte, Arne W; Rogers, Sean M
2017-01-01
Understanding the genomic basis of adaptive divergence in the presence of gene flow remains a major challenge in evolutionary biology. In prickly sculpin (Cottus asper), an abundant euryhaline fish in northwestern North America, high genetic connectivity among brackish-water (estuarine) and freshwater (tributary) habitats of coastal rivers does not preclude the build-up of neutral genetic differentiation and emergence of different life history strategies. Because these two habitats present different osmotic niches, we predicted high genetic differentiation at known teleost candidate genes underlying salinity tolerance and osmoregulation. We applied whole-genome sequencing of pooled DNA samples (Pool-Seq) to explore adaptive divergence between two estuarine and two tributary habitats. Paired-end sequence reads were mapped against genomic contigs of European Cottus, and the gene content of candidate regions was explored based on comparisons with the threespine stickleback genome. Genes showing signals of repeated differentiation among brackish-water and freshwater habitats included functions such as ion transport and structural permeability in freshwater gills, which suggests that local adaptation to different osmotic niches might contribute to genomic divergence among habitats. Overall, the presence of both repeated and unique signatures of differentiation across many loci scattered throughout the genome is consistent with polygenic adaptation from standing genetic variation and locally variable selection pressures in the early stages of life history divergence. © 2016 John Wiley & Sons Ltd.
Cheng, Ting-Yuan David; Makar, Karen W; Neuhouser, Marian L; Miller, Joshua W; Song, Xiaoling; Brown, Elissa C; Beresford, Shirley A A; Zheng, Yingye; Poole, Elizabeth M; Galbraith, Rachel L; Duggan, David J; Habermann, Nina; Bailey, Lynn B; Maneval, David R; Caudill, Marie A; Toriola, Adetunji T; Green, Ralph; Ulrich, Cornelia M
2015-10-15
Investigations of folate-mediated one-carbon metabolism (FOCM) genes and gene-nutrient interactions with respect to colorectal cancer (CRC) risk are limited to candidate polymorphisms and dietary folate. This study comprehensively investigated associations between genetic variants in FOCM and CRC risk and whether the FOCM nutrient status modified these associations. Two hundred eighty-eight candidate and tagging single-nucleotide polymorphisms (SNPs) in 30 FOCM genes were genotyped for 821 incident CRC case-control matched pairs in the Women's Health Initiative Observational Study cohort. FOCM biomarkers (red blood cell [RBC] folate, plasma folate, pyridoxal-5'-phosphate [PLP], vitamin B12, and homocysteine) and self-reported alcohol consumption were measured at the baseline. Conditional logistic regression was implemented; effect modification was examined on the basis of known enzyme-nutrient relations. Statistically significant associations were observed between CRC risk and functionally defined candidate SNPs of methylenetetrahydrofolate dehydrogenase 1 (MTHFD1; K134R), 5-methyltetrahydrofolate-homocysteine methyltransferase reductase (MTRR; P450R), and PR domain containing 2 with ZNF domain (PRDM2; S450N) and a literature candidate SNP of thymidylate synthase (TYMS; g.676789A>T; nominal P < .05). In addition, suggestive associations were noted for tagging SNPs in cystathionine-β-synthase (CBS), dihydrofolate reductase (DHFR), DNA (cytosine-5-)-methyltransferase 3β (DNMT3B), methionine adenosyltransferase I α (MAT1A), MTHFD1, and MTRR (nominal P < .05; adjusted P, not significant). Significant interactions between nutrient biomarkers and candidate polymorphisms were observed for 1) plasma/RBC folate and folate hydrolase 1 (FOLH1), paraoxonase 1 (PON1), transcobalamin II (TCN2), DNMT1, and DNMT3B; 2) plasma PLP and TYMS TS3; 3) plasma B12 and betaine-homocysteine S-methyltransferase 2 (BHMT2); and 4) homocysteine and methylenetetrahydrofolate reductase (MTHFR) and alanyl-transfer RNA synthetase (AARS). Genetic variants in FOCM genes are associated with CRC risk among postmenopausal women. FOCM nutrients continue to emerge as effect modifiers of genetic influences on CRC risk. © 2015 American Cancer Society.
A glycogene mutation map for discovery of diseases of glycosylation
Hansen, Lars; Lind-Thomsen, Allan; Joshi, Hiren J; Pedersen, Nis Borbye; Have, Christian Theil; Kong, Yun; Wang, Shengjun; Sparso, Thomas; Grarup, Niels; Vester-Christensen, Malene Bech; Schjoldager, Katrine; Freeze, Hudson H; Hansen, Torben; Pedersen, Oluf; Henrissat, Bernard; Mandel, Ulla; Clausen, Henrik; Wandall, Hans H; Bennett, Eric P
2015-01-01
Glycosylation of proteins and lipids involves over 200 known glycosyltransferases (GTs), and deleterious defects in many of the genes encoding these enzymes cause disorders collectively classified as congenital disorders of glycosylation (CDGs). Most known CDGs are caused by defects in glycogenes that affect glycosylation globally. Many GTs are members of homologous isoenzyme families and deficiencies in individual isoenzymes may not affect glycosylation globally. In line with this, there appears to be an underrepresentation of disease-causing glycogenes among these larger isoenzyme homologous families. However, genome-wide association studies have identified such isoenzyme genes as candidates for different diseases, but validation is not straightforward without biomarkers. Large-scale whole-exome sequencing (WES) provides access to mutations in, for example, GT genes in populations, which can be used to predict and/or analyze functional deleterious mutations. Here, we constructed a draft of a functional mutational map of glycogenes, GlyMAP, from WES of a rather homogenous population of 2000 Danes. We cataloged all missense mutations and used prediction algorithms, manual inspection and in case of carbohydrate-active enzymes family GT27 experimental analysis of mutations to map deleterious mutations. GlyMAP (http://glymap.glycomics.ku.dk) provides a first global view of the genetic stability of the glycogenome and should serve as a tool for discovery of novel CDGs. PMID:25267602
Winter, Jean M; Curry, Natasha L; Gildea, Derek M; Williams, Kendra A; Lee, Minnkyong; Hu, Ying; Crawford, Nigel P S
2018-06-11
It is well known that development of prostate cancer (PC) can be attributed to somatic mutations of the genome, acquired within proto-oncogenes or tumor-suppressor genes. What is less well understood is how germline variation contributes to disease aggressiveness in PC patients. To map germline modifiers of aggressive neuroendocrine PC, we generated a genetically diverse F2 intercross population using the transgenic TRAMP mouse model and the wild-derived WSB/EiJ (WSB) strain. The relevance of germline modifiers of aggressive PC identified in these mice was extensively correlated in human PC datasets and functionally validated in cell lines. Aggressive PC traits were quantified in a population of 30 week old (TRAMP x WSB) F2 mice (n = 307). Correlation of germline genotype with aggressive disease phenotype revealed seven modifier loci that were significantly associated with aggressive disease. RNA-seq were analyzed using cis-eQTL and trait correlation analyses to identify candidate genes within each of these loci. Analysis of 92 (TRAMP x WSB) F2 prostates revealed 25 candidate genes that harbored both a significant cis-eQTL and mRNA expression correlations with an aggressive PC trait. We further delineated these candidate genes based on their clinical relevance, by interrogating human PC GWAS and PC tumor gene expression datasets. We identified four genes (CCDC115, DNAJC10, RNF149, and STYXL1), which encompassed all of the following characteristics: 1) one or more germline variants associated with aggressive PC traits; 2) differential mRNA levels associated with aggressive PC traits; and 3) differential mRNA expression between normal and tumor tissue. Functional validation studies of these four genes using the human LNCaP prostate adenocarcinoma cell line revealed ectopic overexpression of CCDC115 can significantly impede cell growth in vitro and tumor growth in vivo. Furthermore, CCDC115 human prostate tumor expression was associated with better survival outcomes. We have demonstrated how modifier locus mapping in mouse models of PC, coupled with in silico analyses of human PC datasets, can reveal novel germline modifier genes of aggressive PC. We have also characterized CCDC115 as being associated with less aggressive PC in humans, placing it as a potential prognostic marker of aggressive PC.
Truong, Anh Duc; Rengaraj, Deivendran; Hong, Yeojin; Hoang, Cong Thanh; Hong, Yeong Ho; Lillehoj, Hyun S
2017-05-01
The JAK-STAT signaling pathway plays a key role in cytokine and growth factor activation and is involved in several cellular functions and diseases. The main objective of this study was to investigate the expression of candidate JAK-STAT pathway genes and their regulators and interactors in the intestinal mucosal layer of two genetically disparate chicken lines [Marek's disease (MD)-resistant line 6.3 and MD-susceptible line 7.2] induced with necrotic enteritis (NE). Through RNA-sequencing, we investigated 116 JAK-STAT signaling pathway-related genes that were significant and differentially expressed between the intestinal mucosa of the two lines compared with respective uninfected controls. About 15 JAK-STAT pathway genes were further verified by qRT-PCR, and the results were in agreement with our sequencing data. All the identified 116 genes were annotated through Gene Ontology and mapped to the KEGG chicken JAK-STAT signaling pathway. To the best of our knowledge, this is the first study to represent the transcriptional analysis of a large number of candidate genes, regulators, and potential interactors in the JAK-STAT pathway of the two chicken lines induced with NE. Several key genes of the interactome, namely, STAT1/3/4, STAT5B, JAK1-3, TYK2, AKT1/3, SOCS1-5, PIAS1/2/4, PTPN6/11, and PIK3, were determined to be differentially expressed in the two lines. Moreover, we detected 68 known miRNAs variably targeting JAK-STAT pathway genes and differentially expressed in the two lines induced with NE. The RNA-sequencing and bioinformatics analyses in this study provided an abundance of data that will be useful for future studies on JAK-STAT pathways associated with the functions of two genetically disparate chicken lines induced with NE. Copyright © 2017 Elsevier B.V. All rights reserved.
Grouping and characterization of putative glycosyltransferase genes from Panax ginseng Meyer.
Khorolragchaa, Altanzul; Kim, Yu-Jin; Rahimi, Shadi; Sukweenadhi, Johan; Jang, Moon-Gi; Yang, Deok-Chun
2014-02-15
Glycosyltransferases are members of the multigene family of plants that can transfer single or multiple activated sugars to a range of plant molecules, resulting in the glycosylation of plant compounds. Although the activities of many glycosyltransferases and their products have been recognized for a long time, only in recent years were some glycosyltransferase genes identified and few have been functionally characterized in detail. Korean ginseng (Panax ginseng Meyer), belonging to Araliaceae, has been well known as a popular mysterious medicinal herb in East Asia for over 2,000 years. A total of 704 glycosyltransferase unique sequences have been found from a ginseng expressed sequence tag (EST) library, and these sequences encode enzymes responsible for the secondary metabolite biosynthesis. Finally, twelve UDP glycosyltransferases (UGTs) were selected as the candidates most likely to be involved in triterpenoid synthesis. In this study, we classified the candidate P. ginseng UGTs (PgUGTs) into proper families and groups, which resulted in eight UGT families and six UGT groups. We also investigated those gene candidates encoding for glycosyltransferases by analysis of gene expression in methyl jasmonate (MeJA)-treated ginseng adventitious roots and different tissues from four-year-old ginseng using quantitative reverse transcriptase-polymerase chain reaction (RT-PCR). For organ-specific expression, most of PgUGT transcription levels were higher in leaves and roots compared with flower buds and stems. The transcription of PgUGTs in adventitious roots treated with MeJA increased as compared with the control. PgUGT1 and PgUGT2, which belong to the UGT71 family genes expressed in MeJA-treated adventitious roots, were especially sensitive, showing 33.32 and 38.88-fold expression increases upon 24h post-treatments, respectively. © 2013 Elsevier B.V. All rights reserved.
Mapping eQTLs in the Norfolk Island Genetic Isolate Identifies Candidate Genes for CVD Risk Traits
Benton, Miles C.; Lea, Rod A.; Macartney-Coxson, Donia; Carless, Melanie A.; Göring, Harald H.; Bellis, Claire; Hanna, Michelle; Eccles, David; Chambers, Geoffrey K.; Curran, Joanne E.; Harper, Jacquie L.; Blangero, John; Griffiths, Lyn R.
2013-01-01
Cardiovascular disease (CVD) affects millions of people worldwide and is influenced by numerous factors, including lifestyle and genetics. Expression quantitative trait loci (eQTLs) influence gene expression and are good candidates for CVD risk. Founder-effect pedigrees can provide additional power to map genes associated with disease risk. Therefore, we identified eQTLs in the genetic isolate of Norfolk Island (NI) and tested for associations between these and CVD risk factors. We measured genome-wide transcript levels of blood lymphocytes in 330 individuals and used pedigree-based heritability analysis to identify heritable transcripts. eQTLs were identified by genome-wide association testing of these transcripts. Testing for association between CVD risk factors (i.e., blood lipids, blood pressure, and body fat indices) and eQTLs revealed 1,712 heritable transcripts (p < 0.05) with heritability values ranging from 0.18 to 0.84. From these, we identified 200 cis-acting and 70 trans-acting eQTLs (p < 1.84 × 10−7) An eQTL-centric analysis of CVD risk traits revealed multiple associations, including 12 previously associated with CVD-related traits. Trait versus eQTL regression modeling identified four CVD risk candidates (NAAA, PAPSS1, NME1, and PRDX1), all of which have known biological roles in disease. In addition, we implicated several genes previously associated with CVD risk traits, including MTHFR and FN3KRP. We have successfully identified a panel of eQTLs in the NI pedigree and used this to implicate several genes in CVD risk. Future studies are required for further assessing the functional importance of these eQTLs and whether the findings here also relate to outbred populations. PMID:24314549
Profiling deleterious non-synonymous SNPs of smoker's gene CYP1A1.
Ramesh, A Sai; Khan, Imran; Farhan, Md; Thiagarajan, Padma
2013-01-01
CYP1A1 gene belongs to the cytochrome P450 family and is known better as smokers' gene due to its hyperactivation as a consequence of long term smoking. The expression of CYP1A1 induces polycyclic aromatic hydrocarbon production in the lungs, which when over expressed, is known to cause smoking related diseases, such as cardiovascular pathologies, cancer, and diabetes. Single nucleotide polymorphisms (SNPs) are the simplest form of genetic variations that occur at a higher frequency, and are denoted as synonymous and non-synonymous SNPs on the basis of their effects on the amino acids. This study adopts a systematic in silico approach to predict the deleterious SNPs that are associated with disease conditions. It is inferred that four SNPs are highly deleterious, among which the SNP with rs17861094 is commonly predicted to be harmful by all tools. Hydrophobic (isoleucine) to hydrophilic (serine) amino acid variation was observed in the candidate gene. Hence, this investigation aims to characterize a candidate gene from 159 SNPs of CYP1A1.
Variation in umami perception and in candidate genes for the umami receptor in mice and humans1234
Shirosaki, Shinya; Ohkuri, Tadahiro; Sanematsu, Keisuke; Islam, AA Shahidul; Ogiwara, Yoko; Kawai, Misako; Yoshida, Ryusuke; Ninomiya, Yuzo
2009-01-01
The unique taste induced by monosodium glutamate is referred to as umami taste. The umami taste is also elicited by the purine nucleotides inosine 5′-monophosphate and guanosine 5′-monophosphate. There is evidence that a heterodimeric G protein–coupled receptor, which consists of the T1R1 (taste receptor type 1, member 1, Tas1r1) and the T1R3 (taste receptor type 1, member 3, Tas1r3) proteins, functions as an umami taste receptor for rodents and humans. Splice variants of metabotropic glutamate receptors, mGluR1 (glutamate receptor, metabotropic 1, Grm1) and mGluR4 (glutamate receptor, metabotropic 4, Grm4), also have been proposed as taste receptors for glutamate. The taste sensitivity to umami substances varies in inbred mouse strains and in individual humans. However, little is known about the relation of umami taste sensitivity to variations in candidate umami receptor genes in rodents or in humans. In this article, we summarize current knowledge of the diversity of umami perception in mice and humans. Furthermore, we combine previously published data and new information from the single nucleotide polymorphism databases regarding variation in the mouse and human candidate umami receptor genes: mouse Tas1r1 (TAS1R1 for human), mouse Tas1r3 (TAS1R3 for human), mouse Grm1 (GRM1 for human), and mouse Grm4 (GRM4 for human). Finally, we discuss prospective associations between variation of these genes and umami taste perception in both species. PMID:19625681
Computational Predictions Provide Insights into the Biology of TAL Effector Target Sites
Grau, Jan; Wolf, Annett; Reschke, Maik; Bonas, Ulla; Posch, Stefan; Boch, Jens
2013-01-01
Transcription activator-like (TAL) effectors are injected into host plant cells by Xanthomonas bacteria to function as transcriptional activators for the benefit of the pathogen. The DNA binding domain of TAL effectors is composed of conserved amino acid repeat structures containing repeat-variable diresidues (RVDs) that determine DNA binding specificity. In this paper, we present TALgetter, a new approach for predicting TAL effector target sites based on a statistical model. In contrast to previous approaches, the parameters of TALgetter are estimated from training data computationally. We demonstrate that TALgetter successfully predicts known TAL effector target sites and often yields a greater number of predictions that are consistent with up-regulation in gene expression microarrays than an existing approach, Target Finder of the TALE-NT suite. We study the binding specificities estimated by TALgetter and approve that different RVDs are differently important for transcriptional activation. In subsequent studies, the predictions of TALgetter indicate a previously unreported positional preference of TAL effector target sites relative to the transcription start site. In addition, several TAL effectors are predicted to bind to the TATA-box, which might constitute one general mode of transcriptional activation by TAL effectors. Scrutinizing the predicted target sites of TALgetter, we propose several novel TAL effector virulence targets in rice and sweet orange. TAL-mediated induction of the candidates is supported by gene expression microarrays. Validity of these targets is also supported by functional analogy to known TAL effector targets, by an over-representation of TAL effector targets with similar function, or by a biological function related to pathogen infection. Hence, these predicted TAL effector virulence targets are promising candidates for studying the virulence function of TAL effectors. TALgetter is implemented as part of the open-source Java library Jstacs, and is freely available as a web-application and a command line program. PMID:23526890
Fan, Sheng; Zhang, Dong; Xing, Libo; Qi, Siyan; Du, Lisha; Wu, Haiqin; Shao, Hongxia; Li, Youmei; Ma, Juanjuan; Han, Mingyu
2017-08-01
Although INDETERMINATE DOMAIN (IDD) genes encoding specific plant transcription factors have important roles in plant growth and development, little is known about apple IDD (MdIDD) genes and their potential functions in the flower induction. In this study, we identified 20 putative IDD genes in apple and named them according to their chromosomal locations. All identified MdIDD genes shared a conserved IDD domain. A phylogenetic analysis separated MdIDDs and other plant IDD genes into four groups. Bioinformatic analysis of chemical characteristics, gene structure, and prediction of protein-protein interactions demonstrated the functional and structural diversity of MdIDD genes. To further uncover their potential functions, we performed analysis of tandem, synteny, and gene duplications, which indicated several paired homologs of IDD genes between apple and Arabidopsis. Additionally, genome duplications also promoted the expansion and evolution of the MdIDD genes. Quantitative real-time PCR revealed that all the MdIDD genes showed distinct expression levels in five different tissues (stems, leaves, buds, flowers, and fruits). Furthermore, the expression levels of candidate MdIDD genes were also investigated in response to various circumstances, including GA treatment (decreased the flowering rate), sugar treatment (increased the flowering rate), alternate-bearing conditions, and two varieties with different-flowering intensities. Parts of them were affected by exogenous treatments and showed different expression patterns. Additionally, changes in response to alternate-bearing and different-flowering varieties of apple trees indicated that they were also responsive to flower induction. Taken together, our comprehensive analysis provided valuable information for further analysis of IDD genes aiming at flower induction.
Bae, Joon Seol; Kim, Jason Yongha; Park, Byung-Lae; Kim, Jeong-Hyun; Kim, Bomi; Park, Chul Soo; Kim, Bong-Jo; Lee, Cheol-Soon; Lee, Migyung; Choi, Woo Hyuk; Shin, Tae-Min; Hwang, Jaeuk; Shin, Hyoung Doo; Woo, Sung-Il
2014-10-01
Located on 6q15 and 1p36.11, cannabinoid receptor 1 (CNR1) and cannabinoid receptor 2 (CNR2) genes are considered to be a positional and functional candidate gene for the development of mental disorders such as schizophrenia because CNR1 is known as a regulator of dopamine signaling in the hippocampus and the cerebral cortex. However, few genetic studies have been carried out to investigate an association of CNR1 and CNR2 polymorphisms and the risk of schizophrenia. In this study, although the result indicates that CNR1 and CNR2 variations are unlikely to influence schizophrenia susceptibility in a Korean population, the findings would provide meaningful information for further genetic studies.
Jazayeri, Roshanak; Hu, Hao; Fattahi, Zohreh; Musante, Luciana; Abedini, Seyedeh Sedigheh; Hosseini, Masoumeh; Wienker, Thomas F; Ropers, Hans Hilger; Najmabadi, Hossein; Kahrizi, Kimia
2015-10-01
Intellectual disability (ID) is a neuro-developmental disorder which causes considerable socio-economic problems. Some ID individuals are also affected by ataxia, and the condition includes different mutations affecting several genes. We used whole exome sequencing (WES) in combination with homozygosity mapping (HM) to identify the genetic defects in five consanguineous families among our cohort study, with two affected children with ID and ataxia as major clinical symptoms. We identified three novel candidate genes, RIPPLY1, MRPL10, SNX14, and a new mutation in known gene SURF1. All are autosomal genes, except RIPPLY1, which is located on the X chromosome. Two are housekeeping genes, implicated in transcription and translation regulation and intracellular trafficking, and two encode mitochondrial proteins. The pathogenesis of these variants was evaluated by mutation classification, bioinformatic methods, review of medical and biological relevance, co-segregation studies in the particular family, and a normal population study. Linkage analysis and exome sequencing of a small number of affected family members is a powerful new technique which can be used to decrease the number of candidate genes in heterogenic disorders such as ID, and may even identify the responsible gene(s).
Systematic Characterization and Prediction of Human Hypertension Genes.
Li, Yan-Hui; Zhang, Gai-Gai; Wang, Nanping
2017-02-01
Hypertension is a major cardiovascular risk factor and accounts for a large part of cardiovascular mortality. In this work, we analyzed the properties of hypertension genes and found that when compared with genes not yet known to be involved in hypertension regulation, known hypertension genes display distinguishing features: (1) hypertension genes tend to be located at network center; (2) hypertension genes tend to interact with each other; and (3) hypertension genes tend to enrich in certain biological processes and show certain phenotypes. Based on these features, we developed a machine-learning algorithm to predict new hypertension genes. One hundred and seventy-seven candidates were predicted with a posterior probability >0.9. Evidence supporting 17 of the predictions has been found. © 2016 American Heart Association, Inc.
Reranking candidate gene models with cross-species comparison for improved gene prediction
Liu, Qian; Crammer, Koby; Pereira, Fernando CN; Roos, David S
2008-01-01
Background Most gene finders score candidate gene models with state-based methods, typically HMMs, by combining local properties (coding potential, splice donor and acceptor patterns, etc). Competing models with similar state-based scores may be distinguishable with additional information. In particular, functional and comparative genomics datasets may help to select among competing models of comparable probability by exploiting features likely to be associated with the correct gene models, such as conserved exon/intron structure or protein sequence features. Results We have investigated the utility of a simple post-processing step for selecting among a set of alternative gene models, using global scoring rules to rerank competing models for more accurate prediction. For each gene locus, we first generate the K best candidate gene models using the gene finder Evigan, and then rerank these models using comparisons with putative orthologous genes from closely-related species. Candidate gene models with lower scores in the original gene finder may be selected if they exhibit strong similarity to probable orthologs in coding sequence, splice site location, or signal peptide occurrence. Experiments on Drosophila melanogaster demonstrate that reranking based on cross-species comparison outperforms the best gene models identified by Evigan alone, and also outperforms the comparative gene finders GeneWise and Augustus+. Conclusion Reranking gene models with cross-species comparison improves gene prediction accuracy. This straightforward method can be readily adapted to incorporate additional lines of evidence, as it requires only a ranked source of candidate gene models. PMID:18854050
Exome Sequence Analysis of 14 Families With High Myopia.
Kloss, Bethany A; Tompson, Stuart W; Whisenhunt, Kristina N; Quow, Krystina L; Huang, Samuel J; Pavelec, Derek M; Rosenberg, Thomas; Young, Terri L
2017-04-01
To identify causal gene mutations in 14 families with autosomal dominant (AD) high myopia using exome sequencing. Select individuals from 14 large Caucasian families with high myopia were exome sequenced. Gene variants were filtered to identify potential pathogenic changes. Sanger sequencing was used to confirm variants in original DNA, and to test for disease cosegregation in additional family members. Candidate genes and chromosomal loci previously associated with myopic refractive error and its endophenotypes were comprehensively screened. In 14 high myopia families, we identified 73 rare and 31 novel gene variants as candidates for pathogenicity. In seven of these families, two of the novel and eight of the rare variants were within known myopia loci. A total of 104 heterozygous nonsynonymous rare variants in 104 genes were identified in 10 out of 14 probands. Each variant cosegregated with affection status. No rare variants were identified in genes known to cause myopia or in genes closest to published genome-wide association study association signals for refractive error or its endophenotypes. Whole exome sequencing was performed to determine gene variants implicated in the pathogenesis of AD high myopia. This study provides new genes for consideration in the pathogenesis of high myopia, and may aid in the development of genetic profiling of those at greatest risk for attendant ocular morbidities of this disorder.
ENU Mutagenesis in Mice Identifies Candidate Genes For Hypogonadism
Weiss, Jeffrey; Hurley, Lisa A.; Harris, Rebecca M.; Finlayson, Courtney; Tong, Minghan; Fisher, Lisa A.; Moran, Jennifer L.; Beier, David R.; Mason, Christopher; Jameson, J. Larry
2012-01-01
Genome-wide mutagenesis was performed in mice to identify candidate genes for male infertility, for which the predominant causes remain idiopathic. Mice were mutagenized using N-ethyl-N-nitrosourea (ENU), bred, and screened for phenotypes associated with the male urogenital system. Fifteen heritable lines were isolated and chromosomal loci were assigned using low density genome-wide SNP arrays. Ten of the fifteen lines were pursued further using higher resolution SNP analysis to narrow the candidate gene regions. Exon sequencing of candidate genes identified mutations in mice with cystic kidneys (Bicc1), cryptorchidism (Rxfp2), restricted germ cell deficiency (Plk4), and severe germ cell deficiency (Prdm9). In two other lines with severe hypogonadism candidate sequencing failed to identify mutations, suggesting defects in genes with previously undocumented roles in gonadal function. These genomic intervals were sequenced in their entirety and a candidate mutation was identified in SnrpE in one of the two lines. The line harboring the SnrpE variant retains substantial spermatogenesis despite small testis size, an unusual phenotype. In addition to the reproductive defects, heritable phenotypes were observed in mice with ataxia (Myo5a), tremors (Pmp22), growth retardation (unknown gene), and hydrocephalus (unknown gene). These results demonstrate that the ENU screen is an effective tool for identifying potential causes of male infertility. PMID:22258617
A novel screen for genes associated with pheromone-induced sterility
Camiletti, Alison L.; Percival-Smith, Anthony; Croft, Justin R.; Thompson, Graham J.
2016-01-01
For honey bee and other social insect colonies the ‘queen substance’ regulates colony reproduction rendering workers functionally sterile. The evolution of worker reproductive altruism is explained by inclusive fitness theory, but little is known of the genes involved or how they regulate the phenotypic expression of altruism. We previously showed that application of honeybee queen pheromone to virgin fruit flies suppresses fecundity. Here we exploit this finding to identify genes associated with the perception of an ovary-inhibiting social pheromone. Mutational and RNAi approaches in Drosophila reveal that the olfactory co-factor Orco together with receptors Or49b, Or56a and Or98a are potentially involved in the perception of queen pheromone and the suppression of fecundity. One of these, Or98a, is known to mediate female fly mating behaviour, and its predicted ligand is structurally similar to a methyl component of the queen pheromone. Our novel approach to finding genes associated with pheromone-induced sterility implies conserved reproductive regulation between social and pre-social orders, and further helps to identify candidate orthologues from the pheromone-responsive pathway that may regulate honeybee worker sterility. PMID:27786267
Xu, Li-Hua; Chang, Yu-Mei; Liu, Chun-Lei; Liang, Li-Qun; Liu, Jin-Liang; Chi, Bing-Jie
2011-03-01
In this study, 26 candidate genes were quantified and normalized in the brain cDNA of common carp (Cyprinus carpio) at 23°C and 6°C using double-standard curve method of real-time quantitative PCR. The results showed that five candidates up-regulated in the samples at 6°C (P<0.01) and quantified 2.11, 13.9, 2.52, 7.38, and 1.83 times more than in the samples at 23°C, respectively. Gene function searching indicated that the protein products of these five candidates were elongation of very long chain fatty acids protein, Acyl-CoA desaturase, Transcription initiation factor IIB, Myo-inositol- 1-phosphate synthase, and Blood-brain barrier HT7 antigen individually. Moreover, seven down-regulated candidates were also identified in the same samples at 6°C (P>0.05), and their expression levels were decreased by 21.8%, 25.9%, 16.6%, 23.7%, 15.8%, 16.3%, and 42.5%, respectively, in comparison with the samples at 23°C. These seven down-regulated candidates mainly participated in the inhibition of glycolysis, improvement of cell apoptosis, and intervention of synapse remodeling based on the results of function searching. The five cold-induced genes identified in this study will be used as important elements for fish with cold sensitive through transgenic technology in future.
Luo, Weiwei; Cao, Xiaojuan; Xu, Xiuwen; Huang, Songqian; Liu, Chuanshu; Tomljanovic, Tea
2016-01-01
Dojo loach, Misgurnus anguillicaudatus is a freshwater fish species of the loach family Cobitidae, using its posterior intestine as an accessory air-breathing organ. Little is known about the molecular regulatory mechanisms in the formation of intestinal air-breathing function of M. anguillicaudatus. Here high-throughput sequencing of mRNAs was performed from six developmental stages of posterior intestine of M. anguillicaudatus: 4-Dph (days post hatch) group, 8-Dph group, 12-Dph group, 20-Dph group, 40-Dph group and Oyd (one-year-old) group. These six libraries were assembled into 81300 unigenes. Totally 40757 unigenes were annotated. Subsequently, 35291 differentially expressed genes (DEGs) were scanned among different developmental stages and clustered into 20 gene expression profiles. Finally, 15 key pathways and 25 key genes were mined, providing potential targets for candidate gene selection involved in formation of intestinal air-breathing function in M. anguillicaudatus. This is the first report of developmental transcriptome of posterior intestine in M. anguillicaudatus, offering a substantial contribution to the sequence resources for this species and providing a deep insight into the formation mechanism of its intestinal air-breathing function. This report demonstrates that M. anguillicaudatus is a good model for studies to identify and characterize the molecular basis of accessory air-breathing organ development in fish. PMID:27545457
Improving information retrieval in functional analysis.
Rodriguez, Juan C; González, Germán A; Fresno, Cristóbal; Llera, Andrea S; Fernández, Elmer A
2016-12-01
Transcriptome analysis is essential to understand the mechanisms regulating key biological processes and functions. The first step usually consists of identifying candidate genes; to find out which pathways are affected by those genes, however, functional analysis (FA) is mandatory. The most frequently used strategies for this purpose are Gene Set and Singular Enrichment Analysis (GSEA and SEA) over Gene Ontology. Several statistical methods have been developed and compared in terms of computational efficiency and/or statistical appropriateness. However, whether their results are similar or complementary, the sensitivity to parameter settings, or possible bias in the analyzed terms has not been addressed so far. Here, two GSEA and four SEA methods and their parameter combinations were evaluated in six datasets by comparing two breast cancer subtypes with well-known differences in genetic background and patient outcomes. We show that GSEA and SEA lead to different results depending on the chosen statistic, model and/or parameters. Both approaches provide complementary results from a biological perspective. Hence, an Integrative Functional Analysis (IFA) tool is proposed to improve information retrieval in FA. It provides a common gene expression analytic framework that grants a comprehensive and coherent analysis. Only a minimal user parameter setting is required, since the best SEA/GSEA alternatives are integrated. IFA utility was demonstrated by evaluating four prostate cancer and the TCGA breast cancer microarray datasets, which showed its biological generalization capabilities. Copyright © 2016 Elsevier Ltd. All rights reserved.
Drug Target Prediction and Repositioning Using an Integrated Network-Based Approach
Emig, Dorothea; Ivliev, Alexander; Pustovalova, Olga; Lancashire, Lee; Bureeva, Svetlana; Nikolsky, Yuri; Bessarabova, Marina
2013-01-01
The discovery of novel drug targets is a significant challenge in drug development. Although the human genome comprises approximately 30,000 genes, proteins encoded by fewer than 400 are used as drug targets in the treatment of diseases. Therefore, novel drug targets are extremely valuable as the source for first in class drugs. On the other hand, many of the currently known drug targets are functionally pleiotropic and involved in multiple pathologies. Several of them are exploited for treating multiple diseases, which highlights the need for methods to reliably reposition drug targets to new indications. Network-based methods have been successfully applied to prioritize novel disease-associated genes. In recent years, several such algorithms have been developed, some focusing on local network properties only, and others taking the complete network topology into account. Common to all approaches is the understanding that novel disease-associated candidates are in close overall proximity to known disease genes. However, the relevance of these methods to the prediction of novel drug targets has not yet been assessed. Here, we present a network-based approach for the prediction of drug targets for a given disease. The method allows both repositioning drug targets known for other diseases to the given disease and the prediction of unexploited drug targets which are not used for treatment of any disease. Our approach takes as input a disease gene expression signature and a high-quality interaction network and outputs a prioritized list of drug targets. We demonstrate the high performance of our method and highlight the usefulness of the predictions in three case studies. We present novel drug targets for scleroderma and different types of cancer with their underlying biological processes. Furthermore, we demonstrate the ability of our method to identify non-suspected repositioning candidates using diabetes type 1 as an example. PMID:23593264
Multifaceted Genomic Risk for Brain Function in Schizophrenia
Chen, Jiayu; Calhoun, Vince D.; Pearlson, Godfrey D.; Ehrlich, Stefan; Turner, Jessica A.; Ho, Beng-Choon; Wassink, Thomas H.; Michael, Andrew M; Liu, Jingyu
2012-01-01
Recently, deriving candidate endophenotypes from brain imaging data has become a valuable approach to study genetic influences on schizophrenia (SZ), whose pathophysiology remains unclear. In this work we utilized a multivariate approach, parallel independent component analysis, to identify genomic risk components associated with brain function abnormalities in SZ. 5157 candidate single nucleotide polymorphisms (SNPs) were derived from genome-wide array based on their possible connections with SZ and further investigated for their associations with brain activations captured with functional magnetic resonance imaging (fMRI) during a sensorimotor task. Using data from 92 SZ patients and 116 healthy controls, we detected a significant correlation (r= 0.29; p= 2.41×10−5) between one fMRI component and one SNP component, both of which significantly differentiated patients from controls. The fMRI component mainly consisted of precentral and postcentral gyri, the major activated regions in the motor task. On average, higher activation in these regions was observed in participants with higher loadings of the linked SNP component, predominantly contributed to by 253 SNPs. 138 identified SNPs were from known coding regions of 100 unique genes. 31 identified SNPs did not differ between groups, but moderately correlated with some other group-discriminating SNPs, indicating interactions among alleles contributing towards elevated SZ susceptibility. The genes associated with the identified SNPs participated in four neurotransmitter pathways: GABA receptor signaling, dopamine receptor signaling, neuregulin signaling and glutamate receptor signaling. In summary, our work provides further evidence for the complexity of genomic risk to the functional brain abnormality in SZ and suggests a pathological role of interactions between SNPs, genes and multiple neurotransmitter pathways. PMID:22440650
Genetic Basis of Melanin Pigmentation in Butterfly Wings
Zhang, Linlin; Martin, Arnaud; Perry, Michael W.; van der Burg, Karin R. L.; Matsuoka, Yuji; Monteiro, Antónia; Reed, Robert D.
2017-01-01
Despite the variety, prominence, and adaptive significance of butterfly wing patterns, surprisingly little is known about the genetic basis of wing color diversity. Even though there is intense interest in wing pattern evolution and development, the technical challenge of genetically manipulating butterflies has slowed efforts to functionally characterize color pattern development genes. To identify candidate wing pigmentation genes, we used RNA sequencing to characterize transcription across multiple stages of butterfly wing development, and between different color pattern elements, in the painted lady butterfly Vanessa cardui. This allowed us to pinpoint genes specifically associated with red and black pigment patterns. To test the functions of a subset of genes associated with presumptive melanin pigmentation, we used clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 genome editing in four different butterfly genera. pale, Ddc, and yellow knockouts displayed reduction of melanin pigmentation, consistent with previous findings in other insects. Interestingly, however, yellow-d, ebony, and black knockouts revealed that these genes have localized effects on tuning the color of red, brown, and ochre pattern elements. These results point to previously undescribed mechanisms for modulating the color of specific wing pattern elements in butterflies, and provide an expanded portrait of the insect melanin pathway. PMID:28193726
Shim, Hongseok; Kim, Ji Hyun; Kim, Chan Yeong; Hwang, Sohyun; Kim, Hyojin; Yang, Sunmo; Lee, Ji Eun; Lee, Insuk
2016-11-16
Whole exome sequencing (WES) accelerates disease gene discovery using rare genetic variants, but further statistical and functional evidence is required to avoid false-discovery. To complement variant-driven disease gene discovery, here we present function-driven disease gene discovery in zebrafish (Danio rerio), a promising human disease model owing to its high anatomical and genomic similarity to humans. To facilitate zebrafish-based function-driven disease gene discovery, we developed a genome-scale co-functional network of zebrafish genes, DanioNet (www.inetbio.org/danionet), which was constructed by Bayesian integration of genomics big data. Rigorous statistical assessment confirmed the high prediction capacity of DanioNet for a wide variety of human diseases. To demonstrate the feasibility of the function-driven disease gene discovery using DanioNet, we predicted genes for ciliopathies and performed experimental validation for eight candidate genes. We also validated the existence of heterozygous rare variants in the candidate genes of individuals with ciliopathies yet not in controls derived from the UK10K consortium, suggesting that these variants are potentially involved in enhancing the risk of ciliopathies. These results showed that an integrated genomics big data for a model animal of diseases can expand our opportunity for harnessing WES data in disease gene discovery. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Wakefield, Laura; Gadoury, David M; Seem, Robert C; Milgroom, Michael G; Sun, Qi; Cadle-Davidson, Lance
2011-07-01
Asexual sporulation (conidiation) is coordinately regulated in the grape powdery mildew pathogen Erysiphe necator but nothing is known about its genetic regulation. We hypothesized that genes required for conidiation in other fungi would be upregulated at conidiophore initiation or full conidiation (relative to preconidiation vegetative growth and development of mature ascocarps), and that the obligate biotrophic lifestyle of E. necator would necessitate some novel gene regulation. cDNA amplified fragment length polymorphism analysis with 45 selective primer combinations produced ≈1,600 transcript-derived fragments (TDFs), of which 620 (39%) showed differential expression. TDF sequences were annotated using BLAST analysis of GenBank and of a reference transcriptome for E. necator developed by 454-FLX pyrosequencing of a normalized cDNA library. One-fourth of the differentially expressed, annotated sequences had similarity to fungal genes of unknown function. The remaining genes had annotated function in metabolism, signaling, transcription, transport, and protein fate. As expected, a portion of orthologs known in other fungi to be involved in developmental regulation was upregulated immediately prior to or during conidiation; particularly noteworthy were several genes associated with the light-dependent VeA regulatory system, G-protein signaling (Pth11 and a kelch repeat), and nuclear transport (importin-β and Ran). This work represents the first investigation into differential gene expression during morphogenesis in E. necator and identifies candidate genes and hypotheses for characterization in powdery mildews. Our results indicate that, although control of conidiation in powdery mildews may share some basic elements with established systems, there are significant points of divergence as well, perhaps due, in part, to the obligate biotrophic lifestyle of powdery mildews.
Functional profiles of orphan membrane transporters in the life cycle of the malaria parasite
Kenthirapalan, Sanketha; Waters, Andrew P.; Matuschewski, Kai; Kooij, Taco W. A.
2016-01-01
Assigning function to orphan membrane transport proteins and prioritizing candidates for detailed biochemical characterization remain fundamental challenges and are particularly important for medically relevant pathogens, such as malaria parasites. Here we present a comprehensive genetic analysis of 35 orphan transport proteins of Plasmodium berghei during its life cycle in mice and Anopheles mosquitoes. Six genes, including four candidate aminophospholipid transporters, are refractory to gene deletion, indicative of essential functions. We generate and phenotypically characterize 29 mutant strains with deletions of individual transporter genes. Whereas seven genes appear to be dispensable under the experimental conditions tested, deletion of any of the 22 other genes leads to specific defects in life cycle progression in vivo and/or host transition. Our study provides growing support for a potential link between heavy metal homeostasis and host switching and reveals potential targets for rational design of new intervention strategies against malaria. PMID:26796412
High-resolution phylogenetic microbial community profiling
DOE Office of Scientific and Technical Information (OSTI.GOV)
Singer, Esther; Coleman-Derr, Devin; Bowman, Brett
2014-03-17
The representation of bacterial and archaeal genome sequences is strongly biased towards cultivated organisms, which belong to merely four phylogenetic groups. Functional information and inter-phylum level relationships are still largely underexplored for candidate phyla, which are often referred to as microbial dark matter. Furthermore, a large portion of the 16S rRNA gene records in the GenBank database are labeled as environmental samples and unclassified, which is in part due to low read accuracy, potential chimeric sequences produced during PCR amplifications and the low resolution of short amplicons. In order to improve the phylogenetic classification of novel species and advance ourmore » knowledge of the ecosystem function of uncultivated microorganisms, high-throughput full length 16S rRNA gene sequencing methodologies with reduced biases are needed. We evaluated the performance of PacBio single-molecule real-time (SMRT) sequencing in high-resolution phylogenetic microbial community profiling. For this purpose, we compared PacBio and Illumina metagenomic shotgun and 16S rRNA gene sequencing of a mock community as well as of an environmental sample from Sakinaw Lake, British Columbia. Sakinaw Lake is known to contain a large age of microbial species from candidate phyla. Sequencing results show that community structure based on PacBio shotgun and 16S rRNA gene sequences is highly similar in both the mock and the environmental communities. Resolution power and community representation accuracy from SMRT sequencing data appeared to be independent of GC content of microbial genomes and was higher when compared to Illumina-based metagenome shotgun and 16S rRNA gene (iTag) sequences, e.g. full-length sequencing resolved all 23 OTUs in the mock community, while iTags did not resolve closely related species. SMRT sequencing hence offers various potential benefits when characterizing uncharted microbial communities.« less
Winchester, Catherine L; Ohzeki, Hiromitsu; Vouyiouklis, Demetrius A; Thompson, Rhiannon; Penninger, Josef M; Yamagami, Keiji; Norrie, John D; Hunter, Robert; Pratt, Judith A; Morris, Brian J
2012-11-15
Schizophrenia is a debilitating psychiatric disease with a strong genetic contribution, potentially linked to altered glutamatergic function in brain regions such as the prefrontal cortex (PFC). Here, we report converging evidence to support a functional candidate gene for schizophrenia. In post-mortem PFC from patients with schizophrenia, we detected decreased expression of MKK7/MAP2K7-a kinase activated by glutamatergic activity. While mice lacking one copy of the Map2k7 gene were overtly normal in a variety of behavioural tests, these mice showed a schizophrenia-like cognitive phenotype of impaired working memory. Additional support for MAP2K7 as a candidate gene came from a genetic association study. A substantial effect size (odds ratios: ~1.9) was observed for a common variant in a cohort of case and control samples collected in the Glasgow area and also in a replication cohort of samples of Northern European descent (most significant P-value: 3 × 10(-4)). While some caution is warranted until these association data are further replicated, these results are the first to implicate the candidate gene MAP2K7 in genetic risk for schizophrenia. Complete sequencing of all MAP2K7 exons did not reveal any non-synonymous mutations. However, the MAP2K7 haplotype appeared to have functional effects, in that it influenced the level of expression of MAP2K7 mRNA in human PFC. Taken together, the results imply that reduced function of the MAP2K7-c-Jun N-terminal kinase (JNK) signalling cascade may underlie some of the neurochemical changes and core symptoms in schizophrenia.
Identifying positive selection candidate loci for high-altitude adaptation in Andean populations
2009-01-01
High-altitude environments (>2,500 m) provide scientists with a natural laboratory to study the physiological and genetic effects of low ambient oxygen tension on human populations. One approach to understanding how life at high altitude has affected human metabolism is to survey genome-wide datasets for signatures of natural selection. In this work, we report on a study to identify selection-nominated candidate genes involved in adaptation to hypoxia in one highland group, Andeans from the South American Altiplano. We analysed dense microarray genotype data using four test statistics that detect departures from neutrality. Using a candidate gene, single nucleotide polymorphism-based approach, we identified genes exhibiting preliminary evidence of recent genetic adaptation in this population. These included genes that are part of the hypoxia-inducible transcription factor (HIF) pathway, a biochemical pathway involved in oxygen homeostasis, as well as three other genomic regions previously not known to be associated with high-altitude phenotypes. In addition to identifying selection-nominated candidate genes, we also tested whether the HIF pathway shows evidence of natural selection. Our results indicate that the genes of this biochemical pathway as a group show no evidence of having evolved in response to hypoxia in Andeans. Results from particular HIF-targeted genes, however, suggest that genes in this pathway could play a role in Andean adaptation to high altitude, even if the pathway as a whole does not show higher relative rates of evolution. These data suggest a genetic role in high-altitude adaptation and provide a basis for genotype/phenotype association studies that are necessary to confirm the role of putative natural selection candidate genes and gene regions in adaptation to altitude. PMID:20038496
Walsh, Kyle M; Anderson, Erik; Hansen, Helen M; Decker, Paul A; Kosel, Matt L; Kollmeyer, Thomas; Rice, Terri; Zheng, Shichun; Xiao, Yuanyuan; Chang, Jeffrey S; McCoy, Lucie S; Bracci, Paige M; Wiemels, Joe L; Pico, Alexander R; Smirnov, Ivan; Lachance, Daniel H; Sicotte, Hugues; Eckel-Passow, Jeanette E; Wiencke, John K; Jenkins, Robert B; Wrensch, Margaret R
2013-02-01
Genomewide association studies (GWAS) and candidate-gene studies have implicated single-nucleotide polymorphisms (SNPs) in at least 45 different genes as putative glioma risk factors. Attempts to validate these associations have yielded variable results and few genetic risk factors have been consistently replicated. We conducted a case-control study of Caucasian glioma cases and controls from the University of California San Francisco (810 cases, 512 controls) and the Mayo Clinic (852 cases, 789 controls) in an attempt to replicate previously reported genetic risk factors for glioma. Sixty SNPs selected from the literature (eight from GWAS and 52 from candidate-gene studies) were successfully genotyped on an Illumina custom genotyping panel. Eight SNPs in/near seven different genes (TERT, EGFR, CCDC26, CDKN2A, PHLDB1, RTEL1, TP53) were significantly associated with glioma risk in the combined dataset (P < 0.05), with all associations in the same direction as in previous reports. Several SNP associations showed considerable differences across histologic subtype. All eight successfully replicated associations were first identified by GWAS, although none of the putative risk SNPs from candidate-gene studies was associated in the full case-control sample (all P values > 0.05). Although several confirmed associations are located near genes long known to be involved in gliomagenesis (e.g., EGFR, CDKN2A, TP53), these associations were first discovered by the GWAS approach and are in noncoding regions. These results highlight that the deficiencies of the candidate-gene approach lay in selecting both appropriate genes and relevant SNPs within these genes. © 2012 WILEY PERIODICALS, INC.
Candidate genes for cooperation and aggression in the social wasp Polistes dominula.
Manfredini, Fabio; Brown, Mark J F; Toth, Amy L
2018-05-01
Cooperation and aggression are ubiquitous in social groups, and the genetic mechanisms underlying these behaviours are of great interest for understanding how social group formation is regulated and how it evolves. In this study, we used a candidate gene approach to investigate the patterns of expression of key genes for cooperation and aggression in the brain of a primitively eusocial wasp, Polistes dominula, during colony founding, when multiple foundresses can join the same nest and establish subtle hierarchies of dominance. We used a comparative approach to select candidate genes for cooperation and aggression looking at two previously published studies on global gene expression in wasps and ants. We tested the expression of these genes in P. dominula wasps that were either displaying aggressive behaviour (dominant and single foundresses) or cooperation (subordinate foundresses and workers) towards nestmates. One gene in particular, the egg yolk protein vitellogenin, known for its reproductive role in insects, displayed patterns of expression that strongly matched wasp social rank. We characterize the genomic context of vitellogenin by building a head co-expression gene network for P. dominula, and we discuss a potential role for vitellogenin as a mediator of social interactions in wasps.
Using Zebrafish to Test the Genetic Basis of Human Craniofacial Diseases.
Machado, R Grecco; Eames, B Frank
2017-10-01
Genome-wide association studies (GWASs) opened an innovative and productive avenue to investigate the molecular basis of human craniofacial disease. However, GWASs identify candidate genes only; they do not prove that any particular one is the functional villain underlying disease or just an unlucky genomic bystander. Genetic manipulation of animal models is the best approach to reveal which genetic loci identified from human GWASs are functionally related to specific diseases. The purpose of this review is to discuss the potential of zebrafish to resolve which candidate genetic loci are mechanistic drivers of craniofacial diseases. Many anatomic, embryonic, and genetic features of craniofacial development are conserved among zebrafish and mammals, making zebrafish a good model of craniofacial diseases. Also, the ability to manipulate gene function in zebrafish was greatly expanded over the past 20 y, enabling systems such as Gateway Tol2 and CRISPR-Cas9 to test gain- and loss-of-function alleles identified from human GWASs in coding and noncoding regions of DNA. With the optimization of genetic editing methods, large numbers of candidate genes can be efficiently interrogated. Finding the functional villains that underlie diseases will permit new treatments and prevention strategies and will increase understanding of how gene pathways operate during normal development.
Evidence that breast cancer risk at the 2q35 locus is mediated through IGFBP5 regulation.
Ghoussaini, Maya; Edwards, Stacey L; Michailidou, Kyriaki; Nord, Silje; Cowper-Sal Lari, Richard; Desai, Kinjal; Kar, Siddhartha; Hillman, Kristine M; Kaufmann, Susanne; Glubb, Dylan M; Beesley, Jonathan; Dennis, Joe; Bolla, Manjeet K; Wang, Qin; Dicks, Ed; Guo, Qi; Schmidt, Marjanka K; Shah, Mitul; Luben, Robert; Brown, Judith; Czene, Kamila; Darabi, Hatef; Eriksson, Mikael; Klevebring, Daniel; Bojesen, Stig E; Nordestgaard, Børge G; Nielsen, Sune F; Flyger, Henrik; Lambrechts, Diether; Thienpont, Bernard; Neven, Patrick; Wildiers, Hans; Broeks, Annegien; Van't Veer, Laura J; Th Rutgers, Emiel J; Couch, Fergus J; Olson, Janet E; Hallberg, Emily; Vachon, Celine; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Peto, Julian; Dos-Santos-Silva, Isabel; Gibson, Lorna; Nevanlinna, Heli; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Hall, Per; Li, Jingmei; Liu, Jianjun; Humphreys, Keith; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K; Noh, Dong-Young; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Yatabe, Yasushi; Guénel, Pascal; Truong, Thérèse; Menegaux, Florence; Sanchez, Marie; Burwinkel, Barbara; Marme, Frederik; Schneeweiss, Andreas; Sohn, Christof; Wu, Anna H; Tseng, Chiu-Chen; Van Den Berg, David; Stram, Daniel O; Benitez, Javier; Zamora, M Pilar; Perez, Jose Ignacio Arias; Menéndez, Primitiva; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Cai, Qiuyin; Cox, Angela; Cross, Simon S; Reed, Malcolm W R; Andrulis, Irene L; Knight, Julia A; Glendon, Gord; Tchatchou, Sandrine; Sawyer, Elinor J; Tomlinson, Ian; Kerin, Michael J; Miller, Nicola; Haiman, Christopher A; Henderson, Brian E; Schumacher, Fredrick; Le Marchand, Loic; Lindblom, Annika; Margolin, Sara; Teo, Soo Hwang; Yip, Cheng Har; Lee, Daphne S C; Wong, Tien Y; Hooning, Maartje J; Martens, John W M; Collée, J Margriet; van Deurzen, Carolien H M; Hopper, John L; Southey, Melissa C; Tsimiklis, Helen; Kapuscinski, Miroslav K; Shen, Chen-Yang; Wu, Pei-Ei; Yu, Jyh-Cherng; Chen, Shou-Tung; Alnæs, Grethe Grenaker; Borresen-Dale, Anne-Lise; Giles, Graham G; Milne, Roger L; McLean, Catriona; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Hartman, Mikael; Miao, Hui; Buhari, Shaik Ahmad Bin Syed; Teo, Yik Ying; Fasching, Peter A; Haeberle, Lothar; Ekici, Arif B; Beckmann, Matthias W; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Schoemaker, Minouk J; García-Closas, Montserrat; Figueroa, Jonine; Chanock, Stephen J; Lissowska, Jolanta; Simard, Jacques; Goldberg, Mark S; Labrèche, France; Dumont, Martine; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Brauch, Hiltrud; Brüning, Thomas; Koto, Yon-Dschun; Radice, Paolo; Peterlongo, Paolo; Bonanni, Bernardo; Volorio, Sara; Dörk, Thilo; Bogdanova, Natalia V; Helbig, Sonja; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Devilee, Peter; Tollenaar, Robert A E M; Seynaeve, Caroline; Van Asperen, Christi J; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Slager, Susan; Toland, Amanda E; Ambrosone, Christine B; Yannoukakos, Drakoulis; Sangrajrang, Suleeporn; Gaborieau, Valerie; Brennan, Paul; McKay, James; Hamann, Ute; Torres, Diana; Zheng, Wei; Long, Jirong; Anton-Culver, Hoda; Neuhausen, Susan L; Luccarini, Craig; Baynes, Caroline; Ahmed, Shahana; Maranian, Mel; Healey, Catherine S; González-Neira, Anna; Pita, Guillermo; Alonso, M Rosario; Alvarez, Nuria; Herrero, Daniel; Tessier, Daniel C; Vincent, Daniel; Bacot, Francois; de Santiago, Ines; Carroll, Jason; Caldas, Carlos; Brown, Melissa A; Lupien, Mathieu; Kristensen, Vessela N; Pharoah, Paul D P; Chenevix-Trench, Georgia; French, Juliet D; Easton, Douglas F; Dunning, Alison M
2014-09-23
GWAS have identified a breast cancer susceptibility locus on 2q35. Here we report the fine mapping of this locus using data from 101,943 subjects from 50 case-control studies. We genotype 276 SNPs using the 'iCOGS' genotyping array and impute genotypes for a further 1,284 using 1000 Genomes Project data. All but two, strongly correlated SNPs (rs4442975 G/T and rs6721996 G/A) are excluded as candidate causal variants at odds against >100:1. The best functional candidate, rs4442975, is associated with oestrogen receptor positive (ER+) disease with an odds ratio (OR) in Europeans of 0.85 (95% confidence interval=0.84-0.87; P=1.7 × 10(-43)) per t-allele. This SNP flanks a transcriptional enhancer that physically interacts with the promoter of IGFBP5 (encoding insulin-like growth factor-binding protein 5) and displays allele-specific gene expression, FOXA1 binding and chromatin looping. Evidence suggests that the g-allele confers increased breast cancer susceptibility through relative downregulation of IGFBP5, a gene with known roles in breast cell biology.
Lu, Qing; Niu, Xiaojun; Zhang, Mengchen; Wang, Caihong; Xu, Qun; Feng, Yue; Yang, Yaolong; Wang, Shan; Yuan, Xiaoping; Yu, Hanyong; Wang, Yiping; Chen, Xiaoping; Liang, Xuanqiang; Wei, Xinghua
2018-01-01
Seed dormancy is an important agronomic trait affecting grain yield and quality because of pre-harvest germination and is influenced by both environmental and genetic factors. However, our knowledge of the factors controlling seed dormancy remains limited. To better reveal the molecular mechanism underlying this trait, a genome-wide association study was conducted in an indica-only population consisting of 453 accessions genotyped using 5,291 SNPs. Nine known and new significant SNPs were identified on eight chromosomes. These lead SNPs explained 34.9% of the phenotypic variation, and four of them were designed as dCAPS markers in the hope of accelerating molecular breeding. Moreover, a total of 212 candidate genes was predicted and eight candidate genes showed plant tissue-specific expression in expression profile data from different public bioinformatics databases. In particular, LOC_Os03g10110, which had a maize homolog involved in embryo development, was identified as a candidate regulator for further biological function investigations. Additionally, a polymorphism information content ratio method was used to screen improvement footprints and 27 selective sweeps were identified, most of which harbored domestication-related genes. Further studies suggested that three significant SNPs were adjacent to the candidate selection signals, supporting the accuracy of our genome-wide association study (GWAS) results. These findings show that genome-wide screening for selective sweeps can be used to identify new improvement-related DNA regions, although the phenotypes are unknown. This study enhances our knowledge of the genetic variation in seed dormancy, and the new dormancy-associated SNPs will provide real benefits in molecular breeding. PMID:29354150
Genome-wide association study identifies candidate genes for male fertility traits in humans.
Kosova, Gülüm; Scott, Nicole M; Niederberger, Craig; Prins, Gail S; Ober, Carole
2012-06-08
Despite the fact that hundreds of genes are known to affect fertility in animal models, relatively little is known about genes that influence natural fertility in humans. To broadly survey genes contributing to variation in male fertility, we conducted a genome-wide association study (GWAS) of two fertility traits (family size and birth rate) in 269 married men who are members of a founder population of European descent that proscribes contraception and has large family sizes. Associations between ∼250,000 autosomal SNPs and the fertility traits were examined. A total of 41 SNPs with p ≤ 1 × 10(-4) for either trait were taken forward to a validation study of 123 ethnically diverse men from Chicago who had previously undergone semen analyses. Nine (22%) of the SNPs associated with reduced fertility in the GWAS were also associated with one or more of the ten measures of reduced sperm quantity and/or function, yielding 27 associations with p values < 0.05 and seven with p values < 0.01 in the validation study. On the basis of 5,000 permutations of our data, the probabilities of observing this many or more small p values were 0.0014 and 5.6 × 10(-4), respectively. Among the nine associated loci, outstanding candidates for male fertility genes include USP8, an essential deubiquitinating enzyme that has a role in acrosome assembly; UBD and EPSTI1, which have potential roles in innate immunity; and LRRC32, which encodes a latent transforming growth factor β (TGF-β) receptor on regulatory T cells. We suggest that mutations in these genes that are more severe may account for some of the unexplained infertility (or subfertility) in the general population. Copyright © 2012 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Zwaenepoel, Arthur; Diels, Tim; Amar, David; Van Parys, Thomas; Shamir, Ron; Van de Peer, Yves; Tzfadia, Oren
2018-01-01
Recent times have seen an enormous growth of "omics" data, of which high-throughput gene expression data are arguably the most important from a functional perspective. Despite huge improvements in computational techniques for the functional classification of gene sequences, common similarity-based methods often fall short of providing full and reliable functional information. Recently, the combination of comparative genomics with approaches in functional genomics has received considerable interest for gene function analysis, leveraging both gene expression based guilt-by-association methods and annotation efforts in closely related model organisms. Besides the identification of missing genes in pathways, these methods also typically enable the discovery of biological regulators (i.e., transcription factors or signaling genes). A previously built guilt-by-association method is MORPH, which was proven to be an efficient algorithm that performs particularly well in identifying and prioritizing missing genes in plant metabolic pathways. Here, we present MorphDB, a resource where MORPH-based candidate genes for large-scale functional annotations (Gene Ontology, MapMan bins) are integrated across multiple plant species. Besides a gene centric query utility, we present a comparative network approach that enables researchers to efficiently browse MORPH predictions across functional gene sets and species, facilitating efficient gene discovery and candidate gene prioritization. MorphDB is available at http://bioinformatics.psb.ugent.be/webtools/morphdb/morphDB/index/. We also provide a toolkit, named "MORPH bulk" (https://github.com/arzwa/morph-bulk), for running MORPH in bulk mode on novel data sets, enabling researchers to apply MORPH to their own species of interest.
From the ultrasonic to the infrared: molecular evolution and the sensory biology of bats
Jones, Gareth; Teeling, Emma C.; Rossiter, Stephen J.
2013-01-01
Great advances have been made recently in understanding the genetic basis of the sensory biology of bats. Research has focused on the molecular evolution of candidate sensory genes, genes with known functions [e.g., olfactory receptor (OR) genes] and genes identified from mutations associated with sensory deficits (e.g., blindness and deafness). For example, the FoxP2 gene, underpinning vocal behavior and sensorimotor coordination, has undergone diversification in bats, while several genes associated with audition show parallel amino acid substitutions in unrelated lineages of echolocating bats and, in some cases, in echolocating dolphins, representing a classic case of convergent molecular evolution. Vision genes encoding the photopigments rhodopsin and the long-wave sensitive opsin are functional in bats, while that encoding the short-wave sensitive opsin has lost functionality in rhinolophoid bats using high-duty cycle laryngeal echolocation, suggesting a sensory trade-off between investment in vision and echolocation. In terms of olfaction, bats appear to have a distinctive OR repertoire compared with other mammals, and a gene involved in signal transduction in the vomeronasal system has become non-functional in most bat species. Bitter taste receptors appear to have undergone a “birth-and death” evolution involving extensive gene duplication and loss, unlike genes coding for sweet and umami tastes that show conservation across most lineages but loss in vampire bats. Common vampire bats have also undergone adaptations for thermoperception, via alternative splicing resulting in the evolution of a novel heat-sensitive channel. The future for understanding the molecular basis of sensory biology is promising, with great potential for comparative genomic analyses, studies on gene regulation and expression, exploration of the role of alternative splicing in the generation of proteomic diversity, and linking genetic mechanisms to behavioral consequences. PMID:23755015
Groten, Karin; Pahari, Nabin T; Xu, Shuqing; Miloradovic van Doorn, Maja; Baldwin, Ian T
2015-01-01
Most land plants live in a symbiotic association with arbuscular mycorrhizal fungi (AMF) that belong to the phylum Glomeromycota. Although a number of plant genes involved in the plant-AMF interactions have been identified by analyzing mutants, the ability to rapidly manipulate gene expression to study the potential functions of new candidate genes remains unrealized. We analyzed changes in gene expression of wild tobacco roots (Nicotiana attenuata) after infection with mycorrhizal fungi (Rhizophagus irregularis) by serial analysis of gene expression (SuperSAGE) combined with next generation sequencing, and established a virus-induced gene-silencing protocol to study the function of candidate genes in the interaction. From 92,434 SuperSAGE Tag sequences, 32,808 (35%) matched with our in-house Nicotiana attenuata transcriptome database and 3,698 (4%) matched to Rhizophagus genes. In total, 11,194 Tags showed a significant change in expression (p<0.05, >2-fold change) after infection. When comparing the functions of highly up-regulated annotated Tags in this study with those of two previous large-scale gene expression studies, 18 gene functions were found to be up-regulated in all three studies mainly playing roles related to phytohormone metabolism, catabolism and defense. To validate the function of identified candidate genes, we used the technique of virus-induced gene silencing (VIGS) to silence the expression of three putative N. attenuata genes: germin-like protein, indole-3-acetic acid-amido synthetase GH3.9 and, as a proof-of-principle, calcium and calmodulin-dependent protein kinase (CCaMK). The silencing of the three plant genes in roots was successful, but only CCaMK silencing had a significant effect on the interaction with R. irregularis. Interestingly, when a highly activated inoculum was used for plant inoculation, the effect of CCaMK silencing on fungal colonization was masked, probably due to trans-complementation. This study demonstrates that large-scale gene expression studies across different species induce of a core set of genes of similar functions. However, additional factors seem to influence the overall pattern of gene expression, resulting in high variability among independent studies with different hosts. We conclude that VIGS is a powerful tool with which to investigate the function of genes involved in plant-AMF interactions but that inoculum strength can strongly influence the outcome of the interaction.
Kumar, Hirdesh; Frischknecht, Friedrich; Mair, Gunnar R; Gomes, James
2015-12-01
Genetically attenuated parasites (GAPs) that lack genes essential for the liver stage of the malaria parasite, and therefore cause developmental arrest, have been developed as live vaccines in rodent malaria models and recently been tested in humans. The genes targeted for deletion were often identified by trial and error. Here we present a systematic gene - protein and transcript - expression analyses of several Plasmodium species with the aim to identify candidate genes for the generation of novel GAPs. With a lack of liver stage expression data for human malaria parasites, we used data available for liver stage development of Plasmodium yoelii, a rodent malaria model, to identify proteins expressed in the liver stage but absent from blood stage parasites. An orthology-based search was then employed to identify orthologous proteins in the human malaria parasite Plasmodium falciparum resulting in a total of 310 genes expressed in the liver stage but lacking evidence of protein expression in blood stage parasites. Among these 310 possible GAP candidates, we further studied Plasmodium liver stage proteins by phyletic distribution and functional domain analyses and shortlisted twenty GAP-candidates; these are: fabB/F, fabI, arp, 3 genes encoding subunits of the PDH complex, dnaJ, urm1, rS5, ancp, mcp, arh, gk, lisp2, valS, palm, and four conserved Plasmodium proteins of unknown function. Parasites lacking one or several of these genes might yield new attenuated malaria parasites for experimental vaccination studies. Copyright © 2015 Elsevier B.V. All rights reserved.
Meta-analysis and genome-wide interpretation of genetic susceptibility to drug addiction
2011-01-01
Background Classical genetic studies provide strong evidence for heritable contributions to susceptibility to developing dependence on addictive substances. Candidate gene and genome-wide association studies (GWAS) have sought genes, chromosomal regions and allelic variants likely to contribute to susceptibility to drug addiction. Results Here, we performed a meta-analysis of addiction candidate gene association studies and GWAS to investigate possible functional mechanisms associated with addiction susceptibility. From meta-data retrieved from 212 publications on candidate gene association studies and 5 GWAS reports, we linked a total of 843 haplotypes to addiction susceptibility. We mapped the SNPs in these haplotypes to functional and regulatory elements in the genome and estimated the magnitude of the contributions of different molecular mechanisms to their effects on addiction susceptibility. In addition to SNPs in coding regions, these data suggest that haplotypes in gene regulatory regions may also contribute to addiction susceptibility. When we compared the lists of genes identified by association studies and those identified by molecular biological studies of drug-regulated genes, we observed significantly higher participation in the same gene interaction networks than expected by chance, despite little overlap between the two gene lists. Conclusions These results appear to offer new insights into the genetic factors underlying drug addiction. PMID:21999673
Genetics of human neural tube defects
Greene, Nicholas D.E.; Stanier, Philip; Copp, Andrew J.
2009-01-01
Neural tube defects (NTDs) are common, severe congenital malformations whose causation involves multiple genes and environmental factors. Although more than 200 genes are known to cause NTDs in mice, there has been rather limited progress in delineating the molecular basis underlying most human NTDs. Numerous genetic studies have been carried out to investigate candidate genes in cohorts of patients, with particular reference to those that participate in folate one-carbon metabolism. Although the homocysteine remethylation gene MTHFR has emerged as a risk factor in some human populations, few other consistent findings have resulted from this approach. Similarly, attention focused on the human homologues of mouse NTD genes has contributed only limited positive findings to date, although an emerging association between genes of the non-canonical Wnt (planar cell polarity) pathway and NTDs provides candidates for future studies. Priorities for the next phase of this research include: (i) larger studies that are sufficiently powered to detect significant associations with relatively minor risk factors; (ii) analysis of multiple candidate genes in groups of well-genotyped individuals to detect possible gene–gene interactions; (iii) use of high throughput genomic technology to evaluate the role of copy number variants and to detect ‘private’ and regulatory mutations, neither of which have been studied to date; (iv) detailed analysis of patient samples stratified by phenotype to enable, for example, hypothesis-driven testing of candidates genes in groups of NTDs with specific defects of folate metabolism, or in groups of fetuses with well-defined phenotypes such as craniorachischisis. PMID:19808787
2011-01-01
Abstract Background Bupleurum chinense DC. is a widely used traditional Chinese medicinal plant. Saikosaponins are the major bioactive constituents of B. chinense, but relatively little is known about saikosaponin biosynthesis. The 454 pyrosequencing technology provides a promising opportunity for finding novel genes that participate in plant metabolism. Consequently, this technology may help to identify the candidate genes involved in the saikosaponin biosynthetic pathway. Results One-quarter of the 454 pyrosequencing runs produced a total of 195, 088 high-quality reads, with an average read length of 356 bases (NCBI SRA accession SRA039388). A de novo assembly generated 24, 037 unique sequences (22, 748 contigs and 1, 289 singletons), 12, 649 (52.6%) of which were annotated against three public protein databases using a basic local alignment search tool (E-value ≤1e-10). All unique sequences were compared with NCBI expressed sequence tags (ESTs) (237) and encoding sequences (44) from the Bupleurum genus, and with a Sanger-sequenced EST dataset (3, 111). The 23, 173 (96.4%) unique sequences obtained in the present study represent novel Bupleurum genes. The ESTs of genes related to saikosaponin biosynthesis were found to encode known enzymes that catalyze the formation of the saikosaponin backbone; 246 cytochrome P450 (P450s) and 102 glycosyltransferases (GTs) unique sequences were also found in the 454 dataset. Full length cDNAs of 7 P450s and 7 uridine diphosphate GTs (UGTs) were verified by reverse transcriptase polymerase chain reaction or by cloning using 5' and/or 3' rapid amplification of cDNA ends. Two P450s and three UGTs were identified as the most likely candidates involved in saikosaponin biosynthesis. This finding was based on the coordinate up-regulation of their expression with β-AS in methyl jasmonate-treated adventitious roots and on their similar expression patterns with β-AS in various B. chinense tissues. Conclusions A collection of high-quality ESTs for B. chinense obtained by 454 pyrosequencing is provided here for the first time. These data should aid further research on the functional genomics of B. chinense and other Bupleurum species. The candidate genes for enzymes involved in saikosaponin biosynthesis, especially the P450s and UGTs, that were revealed provide a substantial foundation for follow-up research on the metabolism and regulation of the saikosaponins. PMID:22047182
Identification of genetic elements in metabolism by high-throughput mouse phenotyping.
Rozman, Jan; Rathkolb, Birgit; Oestereicher, Manuela A; Schütt, Christine; Ravindranath, Aakash Chavan; Leuchtenberger, Stefanie; Sharma, Sapna; Kistler, Martin; Willershäuser, Monja; Brommage, Robert; Meehan, Terrence F; Mason, Jeremy; Haselimashhadi, Hamed; Hough, Tertius; Mallon, Ann-Marie; Wells, Sara; Santos, Luis; Lelliott, Christopher J; White, Jacqueline K; Sorg, Tania; Champy, Marie-France; Bower, Lynette R; Reynolds, Corey L; Flenniken, Ann M; Murray, Stephen A; Nutter, Lauryl M J; Svenson, Karen L; West, David; Tocchini-Valentini, Glauco P; Beaudet, Arthur L; Bosch, Fatima; Braun, Robert B; Dobbie, Michael S; Gao, Xiang; Herault, Yann; Moshiri, Ala; Moore, Bret A; Kent Lloyd, K C; McKerlie, Colin; Masuya, Hiroshi; Tanaka, Nobuhiko; Flicek, Paul; Parkinson, Helen E; Sedlacek, Radislav; Seong, Je Kyung; Wang, Chi-Kuang Leo; Moore, Mark; Brown, Steve D; Tschöp, Matthias H; Wurst, Wolfgang; Klingenspor, Martin; Wolf, Eckhard; Beckers, Johannes; Machicao, Fausto; Peter, Andreas; Staiger, Harald; Häring, Hans-Ulrich; Grallert, Harald; Campillos, Monica; Maier, Holger; Fuchs, Helmut; Gailus-Durner, Valerie; Werner, Thomas; Hrabe de Angelis, Martin
2018-01-18
Metabolic diseases are a worldwide problem but the underlying genetic factors and their relevance to metabolic disease remain incompletely understood. Genome-wide research is needed to characterize so-far unannotated mammalian metabolic genes. Here, we generate and analyze metabolic phenotypic data of 2016 knockout mouse strains under the aegis of the International Mouse Phenotyping Consortium (IMPC) and find 974 gene knockouts with strong metabolic phenotypes. 429 of those had no previous link to metabolism and 51 genes remain functionally completely unannotated. We compared human orthologues of these uncharacterized genes in five GWAS consortia and indeed 23 candidate genes are associated with metabolic disease. We further identify common regulatory elements in promoters of candidate genes. As each regulatory element is composed of several transcription factor binding sites, our data reveal an extensive metabolic phenotype-associated network of co-regulated genes. Our systematic mouse phenotype analysis thus paves the way for full functional annotation of the genome.
Hwang, Sohyun; Rhee, Seung Y; Marcotte, Edward M; Lee, Insuk
2012-01-01
AraNet is a functional gene network for the reference plant Arabidopsis and has been constructed in order to identify new genes associated with plant traits. It is highly predictive for diverse biological pathways and can be used to prioritize genes for functional screens. Moreover, AraNet provides a web-based tool with which plant biologists can efficiently discover novel functions of Arabidopsis genes (http://www.functionalnet.org/aranet/). This protocol explains how to conduct network-based prediction of gene functions using AraNet and how to interpret the prediction results. Functional discovery in plant biology is facilitated by combining candidate prioritization by AraNet with focused experimental tests. PMID:21886106
Integrated computational biology analysis to evaluate target genes for chronic myelogenous leukemia.
Zheng, Yu; Wang, Yu-Ping; Cao, Hongbao; Chen, Qiusheng; Zhang, Xi
2018-06-05
Although hundreds of genes have been linked to chronic myelogenous leukemia (CML), many of the results lack reproducibility. In the present study, data across multiple modalities were integrated to evaluate 579 CML candidate genes, including literature‑based CML‑gene relation data, Gene Expression Omnibus RNA expression data and pathway‑based gene‑gene interaction data. The expression data included samples from 76 patients with CML and 73 healthy controls. For each target gene, four metrics were proposed and tested with case/control classification. The effectiveness of the four metrics presented was demonstrated by the high classification accuracy (94.63%; P<2x10‑4). Cross metric analysis suggested nine top candidate genes for CML: Epidermal growth factor receptor, tumor protein p53, catenin β 1, janus kinase 2, tumor necrosis factor, abelson murine leukemia viral oncogene homolog 1, vascular endothelial growth factor A, B‑cell lymphoma 2 and proto‑oncogene tyrosine‑protein kinase. In addition, 145 CML candidate pathways enriched with 485 out of 579 genes were identified (P<8.2x10‑11; q=0.005). In conclusion, weighted genetic networks generated using computational biology may be complementary to biological experiments for the evaluation of known or novel CML target genes.
Hook, Paul W; McClymont, Sarah A; Cannon, Gabrielle H; Law, William D; Morton, A Jennifer; Goff, Loyal A; McCallion, Andrew S
2018-03-01
Genetic variation modulating risk of sporadic Parkinson disease (PD) has been primarily explored through genome-wide association studies (GWASs). However, like many other common genetic diseases, the impacted genes remain largely unknown. Here, we used single-cell RNA-seq to characterize dopaminergic (DA) neuron populations in the mouse brain at embryonic and early postnatal time points. These data facilitated unbiased identification of DA neuron subpopulations through their unique transcriptional profiles, including a postnatal neuroblast population and substantia nigra (SN) DA neurons. We use these population-specific data to develop a scoring system to prioritize candidate genes in all 49 GWAS intervals implicated in PD risk, including genes with known PD associations and many with extensive supporting literature. As proof of principle, we confirm that the nigrostriatal pathway is compromised in Cplx1-null mice. Ultimately, this systematic approach establishes biologically pertinent candidates and testable hypotheses for sporadic PD, informing a new era of PD genetic research. Copyright © 2018 American Society of Human Genetics. All rights reserved.
An integrative, translational approach to understanding rare and orphan genetically based diseases
Hoehndorf, Robert; Schofield, Paul N.; Gkoutos, Georgios V.
2013-01-01
PhenomeNet is an approach for integrating phenotypes across species and identifying candidate genes for genetic diseases based on the similarity between a disease and animal model phenotypes. In contrast to ‘guilt-by-association’ approaches, PhenomeNet relies exclusively on the comparison of phenotypes to suggest candidate genes, and can, therefore, be applied to study the molecular basis of rare and orphan diseases for which the molecular basis is unknown. In addition to disease phenotypes from the Online Mendelian Inheritance in Man (OMIM) database, we have now integrated the clinical signs from Orphanet into PhenomeNet. We demonstrate that our approach can efficiently identify known candidate genes for genetic diseases in Orphanet and OMIM. Furthermore, we find evidence that mutations in the HIP1 gene might cause Bassoe syndrome, a rare disorder with unknown genetic aetiology. Our results demonstrate that integration and computational analysis of human disease and animal model phenotypes using PhenomeNet has the potential to reveal novel insights into the pathobiology underlying genetic diseases. PMID:23853703
Kon, M; Suzuki, E; Dung, V C; Hasegawa, Y; Mitsui, T; Muroya, K; Ueoka, K; Igarashi, N; Nagasaki, K; Oto, Y; Hamajima, T; Yoshino, K; Igarashi, M; Kato-Fukui, Y; Nakabayashi, K; Hayashi, K; Hata, K; Matsubara, Y; Moriya, K; Ogata, T; Nonomura, K; Fukami, M
2015-03-01
What percentage of cases with non-syndromic hypospadias can be ascribed to mutations in known causative/candidate/susceptibility genes or submicroscopic copy-number variations (CNVs) in the genome? Monogenic and digenic mutations in known causative genes and cryptic CNVs account for >10% of cases with non-syndromic hypospadias. While known susceptibility polymorphisms appear to play a minor role in the development of this condition, further studies are required to validate this observation. Fifteen causative, three candidate, and 14 susceptible genes, and a few submicroscopic CNVs have been implicated in non-syndromic hypospadias. Systematic mutation screening and genome-wide copy-number analysis of 62 patients. The study group consisted of 57 Japanese and five Vietnamese patients with non-syndromic hypospadias. Systematic mutation screening was performed for 25 known causative/candidate/susceptibility genes using a next-generation sequencer. Functional consequences of nucleotide alterations were assessed by in silico assays. The frequencies of polymorphisms in the patient group were compared with those in the male general population. CNVs were analyzed by array-based comparative genomic hybridization and characterized by fluorescence in situ hybridization. Seven of 62 patients with anterior or posterior hypospadias carried putative pathogenic mutations, such as hemizygous mutations in AR, a heterozygous mutation in BNC2, and homozygous mutations in SRD5A2 and HSD3B2. Two of the seven patients had mutations in multiple genes. We did not find any rare polymorphisms that were abundant specifically in the patient group. One patient carried mosaic dicentric Y chromosome. The patient group consisted solely of Japanese and Vietnamese individuals and clinical and hormonal information of the patients remained rather fragmentary. In addition, mutation analysis focused on protein-altering substitutions. Our data provide evidence that pathogenic mutations can underlie both mild and severe hypospadias and that HSD3B2 mutations cause non-syndromic hypospadias as a sole clinical manifestation. Most importantly, this is the first report documenting possible oligogenicity of non-syndromic hypospadias. This study was funded by the Grant-in-Aid from the Ministry of Education, Culture, Sports, Science and Technology; by the Grant-in-Aid from the Japan Society for the Promotion of Science; by the Grants from the Ministry of Health, Labour and Welfare, from the National Center for Child Health and Development and from the Takeda Foundation. The authors have no competing interests to disclose. Not applicable. © The Author 2015. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Chen, Minhui; Wang, Jiying; Wang, Yanping; Wu, Ying; Fu, Jinluan; Liu, Jian-Feng
2018-05-18
Currently, genome-wide scans for positive selection signatures in commercial breed have been investigated. However, few studies have focused on selection footprints of indigenous breeds. Laiwu pig is an invaluable Chinese indigenous pig breed with extremely high proportion of intramuscular fat (IMF), and an excellent model to detect footprint as the result of natural and artificial selection for fat deposition in muscle. In this study, based on GeneSeek Genomic profiler Porcine HD data, three complementary methods, F ST , iHS (integrated haplotype homozygosity score) and CLR (composite likelihood ratio), were implemented to detect selection signatures in the whole genome of Laiwu pigs. Totally, 175 candidate selected regions were obtained by at least two of the three methods, which covered 43.75 Mb genomic regions and corresponded to 1.79% of the genome sequence. Gene annotation of the selected regions revealed a list of functionally important genes for feed intake and fat deposition, reproduction, and immune response. Especially, in accordance to the phenotypic features of Laiwu pigs, among the candidate genes, we identified several genes, NPY1R, NPY5R, PIK3R1 and JAKMIP1, involved in the actions of two sets of neurons, which are central regulators in maintaining the balance between food intake and energy expenditure. Our results identified a number of regions showing signatures of selection, as well as a list of functionally candidate genes with potential effect on phenotypic traits, especially fat deposition in muscle. Our findings provide insights into the mechanisms of artificial selection of fat deposition and further facilitate follow-up functional studies.
NASA Technical Reports Server (NTRS)
Maes, Olivier C.; Xu, Suying; Hada, Megumi; Wu, Honglu; Wang, Eugenia
2007-01-01
Exposure to ionizing radiation causes DNA damage to cells, and provokes a plethora of cellular responses controlled by unique gene-directed signaling pathways. MicroRNAs (miRNAs) are small (22-nucleotide), non-coding RNAs which functionally silence gene expression by either degrading the messages or inhibiting translation. Here we investigate radiation-dependent changes in these negative regulators by comparing the expression patterns of all 462 known human miRNAs in fibroblasts, after exposure to low (0.1 Gy) or high (2 Gy) doses of X-rays at 30 min, 2, 6 and 24 hrs post-treatment. The expression patterns of microRNAs after low and high doses of radiation show a similar qualitative down-regulation trend at early (0.5 hr) and late (24 hr) time points, with a quantitatively steeper slope following the 2 Gy exposures. Interestingly, an interruption of this downward trend is observed after the 2 Gy exposure, i.e. a significant up-regulation of microRNAs at 2 hrs, then reverting to the downward trend by 6 hrs; this interruption at the intermediate time point was not observed with the 0.1 Gy exposure. At the early time point (0.5 hr), candidate gene targets of selected down-regulated microRNAs, common to both 0.1 and 2 Gy exposures, were those functioning in chromatin remodeling. Candidate target genes of unique up-regulated microRNAs seen at a 2 hr intermediate time point, after the 2 Gy exposure only, are those involved in cell death signaling. Finally, putative target genes of down-regulated microRNAs seen at the late (24 hr) time point after either doses of radiation are those involved in the up-regulation of DNA repair, cell signaling and homeostasis. Thus we hypothesize that after radiation exposure, microRNAs acting as hub negative regulators for unique signaling pathways needed to be down-regulated so as to de-repress their target genes for the proper cellular responses, including DNA repair and cell maintenance. The unique microRNAs up-regulated at 2 hr after 2 Gy suggest the cellular response to functionally suppress the apoptotic death signaling reflex after exposure to high dose radiation. Further analyses with transcriptome and global proteomic profiling will validate the reciprocal expression of signature microRNAs selected in our radiation-exposed cells, and their candidate target gene families, and test our hypothesis that unique radiation-specific microRNAs are keys in governing signaling responses for damage control of this environmental hazard.
Sysol, Justin R.; Abbasi, Taimur; Patel, Amit R.; Lang, Roberto M.; Gupta, Akash; Garcia, Joe G. N.; Gordeuk, Victor R.; Machado, Roberto F.
2016-01-01
Background Diastolic dysfunction is common in sickle cell disease (SCD), and is associated with an increased risk of mortality. However, the molecular pathogenesis underlying this development is poorly understood. The aim of this study was to identify a gene expression profile that is associated with diastolic function in SCD, potentially elucidating molecular mechanisms behind diastolic dysfunction development. Methods Diastolic function was measured via echocardiography in 65 patients with SCD from two independent study populations. Gene expression microarray data was compared with diastolic function in both study cohorts. Candidate genes that associated in both analyses were tested for validation in a murine SCD model. Lastly, genotyping array data from the replication cohort was used to derive cis-expression quantitative trait loci (cis-eQTLs) and genetic associations within the candidate gene regions. Results Transcriptome data from both patient cohorts implicated 7 genes associated with diastolic function, and mouse SCD myocardial expression validated 3 of these genes. Genetic associations and eQTLs were detected in 2 of the 3 genes, FUCA2 and IL18. Conclusions FUCA2 and IL18 are associated with diastolic function in SCD patients, and may be involved in the pathogenesis of the disease. Genetic polymorphisms within the FUCA2 and IL18 gene regions are also associated with diastolic function in SCD, likely by affecting expression levels of the genes. PMID:27636371
Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters
DOE Office of Scientific and Technical Information (OSTI.GOV)
Santini, Simona; Boore, Jeffrey L.; Meyer, Axel
2003-12-31
Due to their high degree of conservation, comparisons of DNA sequences among evolutionarily distantly-related genomes permit to identify functional regions in noncoding DNA. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involvedmore » in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either A or A clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.« less
Simon, Matthew J; Murchison, Charles; Iliff, Jeffrey J
2018-02-01
Astrocytes play a critical role in regulating the interface between the cerebral vasculature and the central nervous system. Contributing to this is the astrocytic endfoot domain, a specialized structure that ensheathes the entirety of the vasculature and mediates signaling between endothelial cells, pericytes, and neurons. The astrocytic endfoot has been implicated as a critical element of the glymphatic pathway, and changes in protein expression profiles in this cellular domain are linked to Alzheimer's disease pathology. Despite this, basic physiological properties of this structure remain poorly understood including the developmental timing of its formation, and the protein components that localize there to mediate its functions. Here we use human transcriptome data from male and female subjects across several developmental stages and brain regions to characterize the gene expression profile of the dystrophin-associated complex (DAC), a known structural component of the astrocytic endfoot that supports perivascular localization of the astroglial water channel aquaporin-4. Transcriptomic profiling is also used to define genes exhibiting parallel expression profiles to DAC elements, generating a pool of candidate genes that encode gene products that may contribute to the physiological function of the perivascular astrocytic endfoot domain. We found that several genes encoding transporter proteins are transcriptionally associated with DAC genes. © 2017 Wiley Periodicals, Inc.
"Orphan" retrogenes in the human genome.
Ciomborowska, Joanna; Rosikiewicz, Wojciech; Szklarczyk, Damian; Makałowski, Wojciech; Makałowska, Izabela
2013-02-01
Gene duplicates generated via retroposition were long thought to be pseudogenized and consequently decayed. However, a significant number of these genes escaped their evolutionary destiny and evolved into functional genes. Despite multiple studies, the number of functional retrogenes in human and other genomes remains unclear. We performed a comparative analysis of human, chicken, and worm genomes to identify "orphan" retrogenes, that is, retrogenes that have replaced their progenitors. We located 25 such candidates in the human genome. All of these genes were previously known, and the majority has been intensively studied. Despite this, they have never been recognized as retrogenes. Analysis revealed that the phenomenon of replacing parental genes with their retrocopies has been taking place over the entire span of animal evolution. This process was often species specific and contributed to interspecies differences. Surprisingly, these retrogenes, which should evolve in a more relaxed mode, are subject to a very strong purifying selection, which is, on average, two and a half times stronger than other human genes. Also, for retrogenes, they do not show a typical overall tendency for a testis-specific expression. Notably, seven of them are associated with human diseases. Recognizing them as "orphan" retrocopies, which have different regulatory machinery than their parents, is important for any disease studies in model organisms, especially when discoveries made in one species are transferred to humans.
Csilléry, Katalin; Lalagüe, Hadrien; Vendramin, Giovanni G; González-Martínez, Santiago C; Fady, Bruno; Oddou-Muratorio, Sylvie
2014-10-01
Detecting signatures of selection in tree populations threatened by climate change is currently a major research priority. Here, we investigated the signature of local adaptation over a short spatial scale using 96 European beech (Fagus sylvatica L.) individuals originating from two pairs of populations on the northern and southern slopes of Mont Ventoux (south-eastern France). We performed both single and multilocus analysis of selection based on 53 climate-related candidate genes containing 546 SNPs. FST outlier methods at the SNP level revealed a weak signal of selection, with three marginally significant outliers in the northern populations. At the gene level, considering haplotypes as alleles, two additional marginally significant outliers were detected, one on each slope. To account for the uncertainty of haplotype inference, we averaged the Bayes factors over many possible phase reconstructions. Epistatic selection offers a realistic multilocus model of selection in natural populations. Here, we used a test suggested by Ohta based on the decomposition of the variance of linkage disequilibrium. Overall populations, 0.23% of the SNP pairs (haplotypes) showed evidence of epistatic selection, with nearly 80% of them being within genes. One of the between gene epistatic selection signals arose between an FST outlier and a nonsynonymous mutation in a drought response gene. Additionally, we identified haplotypes containing selectively advantageous allele combinations which were unique to high or low elevations and northern or southern populations. Several haplotypes contained nonsynonymous mutations situated in genes with known functional importance for adaptation to climatic factors. © 2014 John Wiley & Sons Ltd.
Gene Expression Profiling of Soft and Firm Atlantic Salmon Fillet
Larsson, Thomas; Mørkøre, Turid; Kolstad, Kari; Østbye, Tone-Kari; Afanasyev, Sergey; Krasnov, Aleksei
2012-01-01
Texture of salmon fillets is an important quality trait for consumer acceptance as well as for the suitability for processing. In the present work we measured fillet firmness in a population of farmed Atlantic salmon with known pedigree and investigated the relationship between this trait and gene expression. Transcriptomic analyses performed with a 21 K oligonucleotide microarray revealed strong correlations between firmness and a large number of genes. Highly similar expression profiles were observed in several functional groups. Positive regression was found between firmness and genes encoding proteasome components (41 genes) and mitochondrial proteins (129 genes), proteins involved in stress responses (12 genes), and lipid metabolism (30 genes). Coefficients of determination (R2) were in the range of 0.64–0.74. A weaker though highly significant negative regression was seen in sugar metabolism (26 genes, R2 = 0.66) and myofiber proteins (42 genes, R2 = 0.54). Among individual genes that showed a strong association with firmness, there were extracellular matrix proteins (negative correlation), immune genes, and intracellular proteases (positive correlation). Several genes can be regarded as candidate markers of flesh quality (coiled-coil transcriptional coactivator b, AMP deaminase 3, and oligopeptide transporter 15) though their functional roles are unclear. To conclude, fillet firmness of Atlantic salmon depends largely on metabolic properties of the skeletal muscle; where aerobic metabolism using lipids as fuel, and the rapid removal of damaged proteins, appear to play a major role. PMID:22745718
Gene expression profiling of soft and firm Atlantic salmon fillet.
Larsson, Thomas; Mørkøre, Turid; Kolstad, Kari; Østbye, Tone-Kari; Afanasyev, Sergey; Krasnov, Aleksei
2012-01-01
Texture of salmon fillets is an important quality trait for consumer acceptance as well as for the suitability for processing. In the present work we measured fillet firmness in a population of farmed Atlantic salmon with known pedigree and investigated the relationship between this trait and gene expression. Transcriptomic analyses performed with a 21 K oligonucleotide microarray revealed strong correlations between firmness and a large number of genes. Highly similar expression profiles were observed in several functional groups. Positive regression was found between firmness and genes encoding proteasome components (41 genes) and mitochondrial proteins (129 genes), proteins involved in stress responses (12 genes), and lipid metabolism (30 genes). Coefficients of determination (R(2)) were in the range of 0.64-0.74. A weaker though highly significant negative regression was seen in sugar metabolism (26 genes, R(2) = 0.66) and myofiber proteins (42 genes, R(2) = 0.54). Among individual genes that showed a strong association with firmness, there were extracellular matrix proteins (negative correlation), immune genes, and intracellular proteases (positive correlation). Several genes can be regarded as candidate markers of flesh quality (coiled-coil transcriptional coactivator b, AMP deaminase 3, and oligopeptide transporter 15) though their functional roles are unclear. To conclude, fillet firmness of Atlantic salmon depends largely on metabolic properties of the skeletal muscle; where aerobic metabolism using lipids as fuel, and the rapid removal of damaged proteins, appear to play a major role.
Prioritizing chronic obstructive pulmonary disease (COPD) candidate genes in COPD-related networks
Zhang, Yihua; Li, Wan; Feng, Yuyan; Guo, Shanshan; Zhao, Xilei; Wang, Yahui; He, Yuehan; He, Weiming; Chen, Lina
2017-01-01
Chronic obstructive pulmonary disease (COPD) is a multi-factor disease, which could be caused by many factors, including disturbances of metabolism and protein-protein interactions (PPIs). In this paper, a weighted COPD-related metabolic network and a weighted COPD-related PPI network were constructed base on COPD disease genes and functional information. Candidate genes in these weighted COPD-related networks were prioritized by making use of a gene prioritization method, respectively. Literature review and functional enrichment analysis of the top 100 genes in these two networks suggested the correlation of COPD and these genes. The performance of our gene prioritization method was superior to that of ToppGene and ToppNet for genes from the COPD-related metabolic network or the COPD-related PPI network after assessing using leave-one-out cross-validation, literature validation and functional enrichment analysis. The top-ranked genes prioritized from COPD-related metabolic and PPI networks could promote the better understanding about the molecular mechanism of this disease from different perspectives. The top 100 genes in COPD-related metabolic network or COPD-related PPI network might be potential markers for the diagnosis and treatment of COPD. PMID:29262568
Prioritizing chronic obstructive pulmonary disease (COPD) candidate genes in COPD-related networks.
Zhang, Yihua; Li, Wan; Feng, Yuyan; Guo, Shanshan; Zhao, Xilei; Wang, Yahui; He, Yuehan; He, Weiming; Chen, Lina
2017-11-28
Chronic obstructive pulmonary disease (COPD) is a multi-factor disease, which could be caused by many factors, including disturbances of metabolism and protein-protein interactions (PPIs). In this paper, a weighted COPD-related metabolic network and a weighted COPD-related PPI network were constructed base on COPD disease genes and functional information. Candidate genes in these weighted COPD-related networks were prioritized by making use of a gene prioritization method, respectively. Literature review and functional enrichment analysis of the top 100 genes in these two networks suggested the correlation of COPD and these genes. The performance of our gene prioritization method was superior to that of ToppGene and ToppNet for genes from the COPD-related metabolic network or the COPD-related PPI network after assessing using leave-one-out cross-validation, literature validation and functional enrichment analysis. The top-ranked genes prioritized from COPD-related metabolic and PPI networks could promote the better understanding about the molecular mechanism of this disease from different perspectives. The top 100 genes in COPD-related metabolic network or COPD-related PPI network might be potential markers for the diagnosis and treatment of COPD.
Hayes, C; Rump, A; Cadman, M R; Harrison, M; Evans, E P; Lyon, M F; Morriss-Kay, G M; Rosenthal, A; Brown, S D
2001-12-01
The mouse doublefoot (Dbf) mutant exhibits preaxial polydactyly in association with craniofacial defects. This mutation has previously been mapped to mouse chromosome 1. We have used a positional cloning strategy, coupled with a comparative sequencing approach using available human draft sequence, to identify putative candidates for the Dbf gene in the mouse and in homologous human region. We have constructed a high-resolution genetic map of the region, localizing the mutation to a 0.4-cM (+/-0.0061) interval on mouse chromosome 1. Furthermore, we have constructed contiguous BAC/PAC clone maps across the mouse and human Dbf region. Using existing markers and additional sequence tagged sites, which we have generated, we have anchored the physical map to the genetic map. Through the comparative sequencing of these clones we have identified 35 genes within this interval, indicating that the region is gene-rich. From this we have identified several genes that are known to be differentially expressed in the developing mid-gestation mouse embryo, some in the developing embryonic limb buds. These genes include those encoding known developmental signaling molecules such as WNT proteins and IHH, and we provide evidence that these genes are candidates for the Dbf mutation.
A review of selected genes with known effects on performance and health of cattle
USDA-ARS?s Scientific Manuscript database
There are genetic conditions that influence production in dairy and beef cattle. The objective of this review was to describe relevant genetic conditions that have been associated with productivity in cattle. Genes or genomic regions that have been identified as a candidate for the condition will be...
USDA-ARS?s Scientific Manuscript database
Cotton productivity is affected by water deficit, and little is known about the molecular basis of drought tolerance in cotton. In this study, microarray analysis was conducted to identify drought-responsive genes in the third topmost leaves of the field-grown drought-tolerant cotton (Gossypium hirs...
Makita, Yuko; Kobayashi, Norio; Mochizuki, Yoshiki; Yoshida, Yuko; Asano, Satomi; Heida, Naohiko; Deshpande, Mrinalini; Bhatia, Rinki; Matsushima, Akihiro; Ishii, Manabu; Kawaguchi, Shuji; Iida, Kei; Hanada, Kosuke; Kuromori, Takashi; Seki, Motoaki; Shinozaki, Kazuo; Toyoda, Tetsuro
2009-07-01
Molecular breeding of crops is an efficient way to upgrade plant functions useful to mankind. A key step is forward genetics or positional cloning to identify the genes that confer useful functions. In order to accelerate the whole research process, we have developed an integrated database system powered by an intelligent data-retrieval engine termed PosMed-plus (Positional Medline for plant upgrading science), allowing us to prioritize highly promising candidate genes in a given chromosomal interval(s) of Arabidopsis thaliana and rice, Oryza sativa. By inferentially integrating cross-species information resources including genomes, transcriptomes, proteomes, localizomes, phenomes and literature, the system compares a user's query, such as phenotypic or functional keywords, with the literature associated with the relevant genes located within the interval. By utilizing orthologous and paralogous correspondences, PosMed-plus efficiently integrates cross-species information to facilitate the ranking of rice candidate genes based on evidence from other model species such as Arabidopsis. PosMed-plus is a plant science version of the PosMed system widely used by mammalian researchers, and provides both a powerful integrative search function and a rich integrative display of the integrated databases. PosMed-plus is the first cross-species integrated database that inferentially prioritizes candidate genes for forward genetics approaches in plant science, and will be expanded for wider use in plant upgrading in many species.
Karaesmen, Ezgi; Rizvi, Abbas A.; Preus, Leah M.; McCarthy, Philip L.; Pasquini, Marcelo C.; Onel, Kenan; Zhu, Xiaochun; Spellman, Stephen; Haiman, Christopher A.; Stram, Daniel O.; Pooler, Loreall; Sheng, Xin; Zhu, Qianqian; Yan, Li; Liu, Qian; Hu, Qiang; Webb, Amy; Brock, Guy; Clay-Gilmour, Alyssa I.; Battaglia, Sebastiano; Tritchler, David; Liu, Song; Hahn, Theresa
2017-01-01
Multiple candidate gene-association studies of non-HLA single-nucleotide polymorphisms (SNPs) and outcomes after blood or marrow transplant (BMT) have been conducted. We identified 70 publications reporting 45 SNPs in 36 genes significantly associated with disease-related mortality, progression-free survival, transplant-related mortality, and/or overall survival after BMT. Replication and validation of these SNP associations were performed using DISCOVeRY-BMT (Determining the Influence of Susceptibility COnveying Variants Related to one-Year mortality after BMT), a well-powered genome-wide association study consisting of 2 cohorts, totaling 2888 BMT recipients with acute myeloid leukemia, acute lymphoblastic leukemia, or myelodysplastic syndrome, and their HLA-matched unrelated donors, reported to the Center for International Blood and Marrow Transplant Research. Gene-based tests were used to assess the aggregate effect of SNPs on outcome. None of the previously reported significant SNPs replicated at P < .05 in DISCOVeRY-BMT. Validation analyses showed association with one previously reported donor SNP at P < .05 and survival; more associations would be anticipated by chance alone. No gene-based tests were significant at P < .05. Functional annotation with publicly available data shows these candidate SNPs most likely do not have biochemical function; only 13% of candidate SNPs correlate with gene expression or are predicted to impact transcription factor binding. Of these, half do not impact the candidate gene of interest; the other half correlate with expression of multiple genes. These findings emphasize the peril of pursing candidate approaches and the importance of adequately powered tests of unbiased genome-wide associations with BMT clinical outcomes given the ultimate goal of improving patient outcomes. PMID:28811306
Chen, Junhui; Meng, Yuhuan; Zhou, Jinghui; Zhuo, Min; Ling, Fei; Zhang, Yu; Du, Hongli; Wang, Xiaoning
2013-01-01
Type 2 Diabetes Mellitus (T2DM) and obesity have become increasingly prevalent in recent years. Recent studies have focused on identifying causal variations or candidate genes for obesity and T2DM via analysis of expression quantitative trait loci (eQTL) within a single tissue. T2DM and obesity are affected by comprehensive sets of genes in multiple tissues. In the current study, gene expression levels in multiple human tissues from GEO datasets were analyzed, and 21 candidate genes displaying high percentages of differential expression were filtered out. Specifically, DENND1B, LYN, MRPL30, POC1B, PRKCB, RP4-655J12.3, HIBADH, and TMBIM4 were identified from the T2DM-control study, and BCAT1, BMP2K, CSRNP2, MYNN, NCKAP5L, SAP30BP, SLC35B4, SP1, BAP1, GRB14, HSP90AB1, ITGA5, and TOMM5 were identified from the obesity-control study. The majority of these genes are known to be involved in T2DM and obesity. Therefore, analysis of gene expression in various tissues using GEO datasets may be an effective and feasible method to determine novel or causal genes associated with T2DM and obesity.
Exome-wide DNA capture and next generation sequencing in domestic and wild species.
Cosart, Ted; Beja-Pereira, Albano; Chen, Shanyuan; Ng, Sarah B; Shendure, Jay; Luikart, Gordon
2011-07-05
Gene-targeted and genome-wide markers are crucial to advance evolutionary biology, agriculture, and biodiversity conservation by improving our understanding of genetic processes underlying adaptation and speciation. Unfortunately, for eukaryotic species with large genomes it remains costly to obtain genome sequences and to develop genome resources such as genome-wide SNPs. A method is needed to allow gene-targeted, next-generation sequencing that is flexible enough to include any gene or number of genes, unlike transcriptome sequencing. Such a method would allow sequencing of many individuals, avoiding ascertainment bias in subsequent population genetic analyses.We demonstrate the usefulness of a recent technology, exon capture, for genome-wide, gene-targeted marker discovery in species with no genome resources. We use coding gene sequences from the domestic cow genome sequence (Bos taurus) to capture (enrich for), and subsequently sequence, thousands of exons of B. taurus, B. indicus, and Bison bison (wild bison). Our capture array has probes for 16,131 exons in 2,570 genes, including 203 candidate genes with known function and of interest for their association with disease and other fitness traits. We successfully sequenced and mapped exon sequences from across the 29 autosomes and X chromosome in the B. taurus genome sequence. Exon capture and high-throughput sequencing identified thousands of putative SNPs spread evenly across all reference chromosomes, in all three individuals, including hundreds of SNPs in our targeted candidate genes. This study shows exon capture can be customized for SNP discovery in many individuals and for non-model species without genomic resources. Our captured exome subset was small enough for affordable next-generation sequencing, and successfully captured exons from a divergent wild species using the domestic cow genome as reference.
Ultsch, Alfred; Kringel, Dario; Kalso, Eija; Mogil, Jeffrey S; Lötsch, Jörn
2016-12-01
The increasing availability of "big data" enables novel research approaches to chronic pain while also requiring novel techniques for data mining and knowledge discovery. We used machine learning to combine the knowledge about n = 535 genes identified empirically as relevant to pain with the knowledge about the functions of thousands of genes. Starting from an accepted description of chronic pain as displaying systemic features described by the terms "learning" and "neuronal plasticity," a functional genomics analysis proposed that among the functions of the 535 "pain genes," the biological processes "learning or memory" (P = 8.6 × 10) and "nervous system development" (P = 2.4 × 10) are statistically significantly overrepresented as compared with the annotations to these processes expected by chance. After establishing that the hypothesized biological processes were among important functional genomics features of pain, a subset of n = 34 pain genes were found to be annotated with both Gene Ontology terms. Published empirical evidence supporting their involvement in chronic pain was identified for almost all these genes, including 1 gene identified in March 2016 as being involved in pain. By contrast, such evidence was virtually absent in a randomly selected set of 34 other human genes. Hence, the present computational functional genomics-based method can be used for candidate gene selection, providing an alternative to established methods.
[Linkage analysis of a family with familial hypertriglyceridemia].
Tang, Xin; Lin, Ying; Liu, Bing; Ma, Shi; Yang, Yang; Yang, Zheng-lin
2009-10-01
To perform linkage analysis and mutation screening in a Chinese family with familial hpertriglyceridemia (FHTG). Thirty-two family members including 12 hypertriglyceridemia patients participated in the study. Genotyping and haplotype analysis for 22 subjects were performed using short tandem repeat (STR) microsatellite polymorphism markers on 16 candidate genes and/or loci related to lipid metabolism. Two of the sixteen known candidate genes, APOA2 and USF1 were screened for mutation by direct DNA sequencing. No linkage was found between the candidate genes/loci of APOA5, LIPI, RP1, APOC2, ABC1, LMF1, APOA1-APOC3-APOA4, LPL, APOB, CETP, LCAT, LDLR, APOE and the phenotype in this family. The two-point Lod scores (theta =0) were all less than-1.0 for all the markers tested. Linkage analysis suggested linkage to chromosome 1q23.3-24.2 between the disease phenotype and STR marker D1S194 with a two-point maximum Lod score of 2.44 at theta =0. Fine mapping indicated that the disease gene was localized to a 5.87 cM interval between D1S104 and D1S196. No disease-causing mutation was detected in the APOA2 and USF1 genes. The above mentioned candidate genes were excluded as the disease causing genes for this family. The results implied that there might be a novel gene/locus for FHTG on chromosome 1q23.3-1q24.2.
Scuba: scalable kernel-based gene prioritization.
Zampieri, Guido; Tran, Dinh Van; Donini, Michele; Navarin, Nicolò; Aiolli, Fabio; Sperduti, Alessandro; Valle, Giorgio
2018-01-25
The uncovering of genes linked to human diseases is a pressing challenge in molecular biology and precision medicine. This task is often hindered by the large number of candidate genes and by the heterogeneity of the available information. Computational methods for the prioritization of candidate genes can help to cope with these problems. In particular, kernel-based methods are a powerful resource for the integration of heterogeneous biological knowledge, however, their practical implementation is often precluded by their limited scalability. We propose Scuba, a scalable kernel-based method for gene prioritization. It implements a novel multiple kernel learning approach, based on a semi-supervised perspective and on the optimization of the margin distribution. Scuba is optimized to cope with strongly unbalanced settings where known disease genes are few and large scale predictions are required. Importantly, it is able to efficiently deal both with a large amount of candidate genes and with an arbitrary number of data sources. As a direct consequence of scalability, Scuba integrates also a new efficient strategy to select optimal kernel parameters for each data source. We performed cross-validation experiments and simulated a realistic usage setting, showing that Scuba outperforms a wide range of state-of-the-art methods. Scuba achieves state-of-the-art performance and has enhanced scalability compared to existing kernel-based approaches for genomic data. This method can be useful to prioritize candidate genes, particularly when their number is large or when input data is highly heterogeneous. The code is freely available at https://github.com/gzampieri/Scuba .
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liang, Ying; Gao, Yajun; Jones, Alan M.
The three-member family of Arabidopsis extra-large G proteins (XLG1-3) defines the prototype of an atypical Ga subunit in the heterotrimeric G protein complex. Some recent evidence indicate that XLG subunits operate along with its Gbg dimer in root morphology, stress responsiveness, and cytokinin induced development, however downstream targets of activated XLG proteins in the stress pathways are rarely known. In order to assemble a set of candidate XLG-targeted proteins, a yeast two-hybrid complementation-based screen was performed using XLG protein baits to query interactions between XLG and partner protein found in glucose-treated seedlings, roots, and Arabidopsis cells in culture. Seventy twomore » interactors were identified and >60% of a test set displayed in vivo interaction with XLG proteins. Gene co-expression analysis shows that >70% of the interactors are positively correlated with the corresponding XLG partners. Gene Ontology enrichment for all the candidates indicates stress responses and posits a molecular mechanism involving a specific set of transcription factor partners to XLG. Genes encoding two of these transcription factors, SZF1 and 2, require XLG proteins for full NaCl-induced expression. Furthermore, the subcellular localization of the XLG proteins in the nucleus, endosome, and plasma membrane is dependent on the specific interacting partner.« less
Liang, Ying; Gao, Yajun; Jones, Alan M.
2017-06-13
The three-member family of Arabidopsis extra-large G proteins (XLG1-3) defines the prototype of an atypical Ga subunit in the heterotrimeric G protein complex. Some recent evidence indicate that XLG subunits operate along with its Gbg dimer in root morphology, stress responsiveness, and cytokinin induced development, however downstream targets of activated XLG proteins in the stress pathways are rarely known. In order to assemble a set of candidate XLG-targeted proteins, a yeast two-hybrid complementation-based screen was performed using XLG protein baits to query interactions between XLG and partner protein found in glucose-treated seedlings, roots, and Arabidopsis cells in culture. Seventy twomore » interactors were identified and >60% of a test set displayed in vivo interaction with XLG proteins. Gene co-expression analysis shows that >70% of the interactors are positively correlated with the corresponding XLG partners. Gene Ontology enrichment for all the candidates indicates stress responses and posits a molecular mechanism involving a specific set of transcription factor partners to XLG. Genes encoding two of these transcription factors, SZF1 and 2, require XLG proteins for full NaCl-induced expression. Furthermore, the subcellular localization of the XLG proteins in the nucleus, endosome, and plasma membrane is dependent on the specific interacting partner.« less
Transcriptomes That Confer to Plant Defense against Powdery Mildew Disease in Lagerstroemia indica
Shi, Weibing; Rinehart, Timothy
2015-01-01
Transcriptome analysis was conducted in two popular Lagerstroemia cultivars: “Natchez” (NAT), a white flower and powdery mildew resistant interspecific hybrid and “Carolina Beauty” (CAB), a red flower and powdery mildew susceptible L. indica cultivar. RNA-seq reads were generated from Erysiphe australiana infected leaves and de novo assembled. A total of 37,035 unigenes from 224,443 assembled contigs in both genotypes were identified. Approximately 85% of these unigenes have known function. Of them, 475 KEGG genes were found significantly different between the two genotypes. Five of the top ten differentially expressed genes (DEGs) involved in the biosynthesis of secondary metabolites (plant defense) and four in flavonoid biosynthesis pathway (antioxidant activities or flower coloration). Furthermore, 5 of the 12 assembled unigenes in benzoxazinoid biosynthesis and 7 of 11 in flavonoid biosynthesis showed higher transcript abundance in NAT. The relative abundance of transcripts for 16 candidate DEGs (9 from CAB and 7 from NAT) detected by qRT-PCR showed general agreement with the abundances of the assembled transcripts in NAT. This study provided the first transcriptome analyses in L. indica. The differential transcript abundance between two genotypes indicates that it is possible to identify candidate genes that are associated with the plant defenses or flower coloration. PMID:26247009
A model system to study the lignification process in Eucalyptus globulus.
Araújo, Pedro; Cesarino, Igor; Mayer, Juliana Lischka Sampaio; Ferrari, Ilse Fernanda; Kiyota, Eduardo; Sawaya, Alexandra Christine Helena Frankland; Paes Leme, Adriana Franco; Mazzafera, Paulo
2014-09-01
Recalcitrance of plant biomass is closely related to the presence of the phenolic heteropolymer lignin in secondary cell walls, which has a negative effect on forage digestibility, biomass-to-biofuels conversion and chemical pulping. The genus Eucalyptus is the main source of wood for pulp and paper industry. However, when compared to model plants such as Arabidopsis thaliana and poplar, relatively little is known about lignin biosynthesis in Eucalyptus and only a few genes were functionally characterized. An efficient, fast and inexpensive in vitro system was developed to study lignification in Eucalyptus globulus and to evaluate the potential role of candidate genes in this biological process. Seedlings were grown in four different conditions, in the presence or absence of light and with or without sucrose in the growth medium, and several aspects of lignin metabolism were evaluated. Our results showed that light and, to a lesser extent, sucrose induced lignin biosynthesis, which was followed by changes in S/G ratio, lignin oligomers accumulation and gene expression. In addition, higher total peroxidase activity and differential isoperoxidase profile were observed when seedlings were grown in the presence of light and sucrose. Peptide sequencing allowed the identification of differentially expressed peroxidases, which can be considered potential candidate class III peroxidases involved in lignin polymerization in E. globulus. © 2014 Scandinavian Plant Physiology Society.
Elucidating the genetic basis of antioxidant status in lettuce (Lactuca sativa)
Damerum, Annabelle; Selmes, Stacey L; Biggi, Gaia F; Clarkson, Graham JJ; Rothwell, Steve D; Truco, Maria José; Michelmore, Richard W; Hancock, Robert D; Shellcock, Connie; Chapman, Mark A; Taylor, Gail
2015-01-01
A diet rich in phytonutrients from fruit and vegetables has been acknowledged to afford protection against a range of human diseases, but many of the most popular vegetables are low in phytonutrients. Wild relatives of crops may contain allelic variation for genes determining the concentrations of these beneficial phytonutrients, and therefore understanding the genetic basis of this variation is important for breeding efforts to enhance nutritional quality. In this study, lettuce recombinant inbred lines, generated from a cross between wild and cultivated lettuce (Lactuca serriola and Lactuca sativa, respectively), were analysed for antioxidant (AO) potential and important phytonutrients including carotenoids, chlorophyll and phenolic compounds. When grown in two environments, 96 quantitative trait loci (QTL) were identified for these nutritional traits: 4 for AO potential, 2 for carotenoid content, 3 for total chlorophyll content and 87 for individual phenolic compounds (two per compound on average). Most often, the L. serriola alleles conferred an increase in total AOs and metabolites. Candidate genes underlying these QTL were identified by BLASTn searches; in several cases, these had functions suggesting involvement in phytonutrient biosynthetic pathways. Analysis of a QTL on linkage group 3, which accounted for >30% of the variation in AO potential, revealed several candidate genes encoding multiple MYB transcription factors which regulate flavonoid biosynthesis and flavanone 3-hydroxylase, an enzyme involved in the biosynthesis of the flavonoids quercetin and kaempferol, which are known to have powerful AO activity. Follow-up quantitative RT-PCR of these candidates revealed that 5 out of 10 genes investigated were significantly differentially expressed between the wild and cultivated parents, providing further evidence of their potential involvement in determining the contrasting phenotypes. These results offer exciting opportunities to improve the nutritional content and health benefits of lettuce through marker-assisted breeding. PMID:26640696
Elucidating the genetic basis of antioxidant status in lettuce (Lactuca sativa).
Damerum, Annabelle; Selmes, Stacey L; Biggi, Gaia F; Clarkson, Graham Jj; Rothwell, Steve D; Truco, Maria José; Michelmore, Richard W; Hancock, Robert D; Shellcock, Connie; Chapman, Mark A; Taylor, Gail
2015-01-01
A diet rich in phytonutrients from fruit and vegetables has been acknowledged to afford protection against a range of human diseases, but many of the most popular vegetables are low in phytonutrients. Wild relatives of crops may contain allelic variation for genes determining the concentrations of these beneficial phytonutrients, and therefore understanding the genetic basis of this variation is important for breeding efforts to enhance nutritional quality. In this study, lettuce recombinant inbred lines, generated from a cross between wild and cultivated lettuce (Lactuca serriola and Lactuca sativa, respectively), were analysed for antioxidant (AO) potential and important phytonutrients including carotenoids, chlorophyll and phenolic compounds. When grown in two environments, 96 quantitative trait loci (QTL) were identified for these nutritional traits: 4 for AO potential, 2 for carotenoid content, 3 for total chlorophyll content and 87 for individual phenolic compounds (two per compound on average). Most often, the L. serriola alleles conferred an increase in total AOs and metabolites. Candidate genes underlying these QTL were identified by BLASTn searches; in several cases, these had functions suggesting involvement in phytonutrient biosynthetic pathways. Analysis of a QTL on linkage group 3, which accounted for >30% of the variation in AO potential, revealed several candidate genes encoding multiple MYB transcription factors which regulate flavonoid biosynthesis and flavanone 3-hydroxylase, an enzyme involved in the biosynthesis of the flavonoids quercetin and kaempferol, which are known to have powerful AO activity. Follow-up quantitative RT-PCR of these candidates revealed that 5 out of 10 genes investigated were significantly differentially expressed between the wild and cultivated parents, providing further evidence of their potential involvement in determining the contrasting phenotypes. These results offer exciting opportunities to improve the nutritional content and health benefits of lettuce through marker-assisted breeding.
Fine mapping of the Darier's disease locus on chromosome 12q.
Richard, G; Wright, A R; Harris, S; Doyle, S Z; Korge, B; Mazzanti, C; Tanaka, T; Harth, W; McBride, O W; Compton, J G; Bale, S J; DiGiovanna, J J
1994-11-01
Darier's disease (DD) is an autosomal dominant genodermatosis characterized by epidermal acantholysis and dyskeratosis. We have performed genetic linkage studies in 10 families with DD (34 affected) by analyzing 14 polymorphic microsatellite markers. Our results confirm recent reports mapping the DD gene to chromosome 12q23-q24.1. Haplotype analysis of recombinant chromosomes in our families, along with previously reported data, narrow the location of the DD gene to a 5 cM interval flanked by the loci D12S354 and D12S84/D12S105. This localization allowed exclusion of two known genes, PLA2A and PAH, as candidate loci for DD. Three other gene loci (PPP1C, PMCH, PMCA1), mapping in 12q21-q24, remain potential candidates.
The genomic signature of dog domestication reveals adaptation to a starch-rich diet.
Axelsson, Erik; Ratnakumar, Abhirami; Arendt, Maja-Louise; Maqbool, Khurram; Webster, Matthew T; Perloski, Michele; Liberg, Olof; Arnemo, Jon M; Hedhammar, Ake; Lindblad-Toh, Kerstin
2013-03-21
The domestication of dogs was an important episode in the development of human civilization. The precise timing and location of this event is debated and little is known about the genetic changes that accompanied the transformation of ancient wolves into domestic dogs. Here we conduct whole-genome resequencing of dogs and wolves to identify 3.8 million genetic variants used to identify 36 genomic regions that probably represent targets for selection during dog domestication. Nineteen of these regions contain genes important in brain function, eight of which belong to nervous system development pathways and potentially underlie behavioural changes central to dog domestication. Ten genes with key roles in starch digestion and fat metabolism also show signals of selection. We identify candidate mutations in key genes and provide functional support for an increased starch digestion in dogs relative to wolves. Our results indicate that novel adaptations allowing the early ancestors of modern dogs to thrive on a diet rich in starch, relative to the carnivorous diet of wolves, constituted a crucial step in the early domestication of dogs.
Analysis of the gene coding for the BRCA2-interacting protein PALB2 in hereditary prostate cancer.
Tischkowitz, Marc; Sabbaghian, Nelly; Ray, Anna M; Lange, Ethan M; Foulkes, William D; Cooney, Kathleen A
2008-05-01
The genetic basis of susceptibility to prostate cancer (PRCA) remains elusive. Mutations in BRCA2 have been associated with increased prostate cancer risk and account for around 2% of young onset (<56 years) prostate cancer cases. PALB2 is a recently identified breast cancer susceptibility gene whose protein is closely associated with BRCA2 and is essential for BRCA2 anchorage to nuclear structures. This functional relationship made PALB2 a candidate PRCA susceptibility gene. We sequenced PALB2 in probands from 95 PRCA families, 77 of which had two or more cases of early onset PRCA (age at diagnosis <55 years), and the remaining 18 had one case of early onset PRCA and five or more total cases of PRCA. Two previously unreported variants, K18R and V925L were identified, neither of which is in a known PALB2 functional domain and both of which are unlikely to be pathogenic. No truncating mutations were identified. These results indicate that deleterious PALB2 mutations are unlikely to play a significant role in hereditary prostate cancer.
Identification of 15 candidate structured noncoding RNA motifs in fungi by comparative genomics.
Li, Sanshu; Breaker, Ronald R
2017-10-13
With the development of rapid and inexpensive DNA sequencing, the genome sequences of more than 100 fungal species have been made available. This dataset provides an excellent resource for comparative genomics analyses, which can be used to discover genetic elements, including noncoding RNAs (ncRNAs). Bioinformatics tools similar to those used to uncover novel ncRNAs in bacteria, likewise, should be useful for searching fungal genomic sequences, and the relative ease of genetic experiments with some model fungal species could facilitate experimental validation studies. We have adapted a bioinformatics pipeline for discovering bacterial ncRNAs to systematically analyze many fungal genomes. This comparative genomics pipeline integrates information on conserved RNA sequence and structural features with alternative splicing information to reveal fungal RNA motifs that are candidate regulatory domains, or that might have other possible functions. A total of 15 prominent classes of structured ncRNA candidates were identified, including variant HDV self-cleaving ribozyme representatives, atypical snoRNA candidates, and possible structured antisense RNA motifs. Candidate regulatory motifs were also found associated with genes for ribosomal proteins, S-adenosylmethionine decarboxylase (SDC), amidase, and HexA protein involved in Woronin body formation. We experimentally confirm that the variant HDV ribozymes undergo rapid self-cleavage, and we demonstrate that the SDC RNA motif reduces the expression of SAM decarboxylase by translational repression. Furthermore, we provide evidence that several other motifs discovered in this study are likely to be functional ncRNA elements. Systematic screening of fungal genomes using a computational discovery pipeline has revealed the existence of a variety of novel structured ncRNAs. Genome contexts and similarities to known ncRNA motifs provide strong evidence for the biological and biochemical functions of some newly found ncRNA motifs. Although initial examinations of several motifs provide evidence for their likely functions, other motifs will require more in-depth analysis to reveal their functions.
Zhang, Q; Baldwin, V J; Acland, G M; Parshall, C J; Haskel, J; Aguirre, G D; Ray, K
1999-01-01
Photoreceptor dysplasia (pd) is one of a group of at least six distinct autosomal and one X-linked retinal disorders identified in dogs which are collectively known as progressive retinal atrophy (PRA). It is an early onset retinal disease identified in miniature schnauzer dogs, and pedigree analysis and breeding studies have established autosomal recessive inheritance of the disease. Using a gene-based approach, a number of retina-expressed genes, including some members of the phototransduction pathway, have been causally implicated in retinal diseases of humans and other animals. Here we examined seven such potential candidate genes (opsin, RDS/peripherin, ROM1, rod cGMP-gated cation channel alpha-subunit, and three subunits of transducin) for their causal association with the pd locus by testing segregation of intragenic markers with the disease locus, or, in the absence of informative polymorphisms, sequencing of the coding regions of the genes. Based on these results, we have conclusively excluded four photoreceptor-specific genes as candidates for pd by linkage analysis. For three other photoreceptor-specific genes, we did not find any mutation in the coding sequences of the genes and have excluded them provisionally. Formal exclusion would require investigation of the levels of expression of the candidate genes in pd-affected dogs relative to age-matched controls. At present we are building suitable informative pedigrees for the disease locus with a sufficient number of meiosis to be useful for genomewide screening. This should identify markers linked to the disease locus and eventually permit progress toward the identification of the photoreceptor dysplasia gene and the disease-causing mutation.
Chen, Yongsheng; Liu, Hongjun; Ali, Farhad; Scott, M Paul; Ji, Qing; Frei, Ursula Karoline; Lübberstedt, Thomas
2012-10-01
Brown midrib mutants in maize are known to be associated with reduced lignin content and increased cell wall digestibility, which leads to better forage quality and higher efficiency of cellulosic biomass conversion into ethanol. Four well known brown midrib (bm) mutants, named bm1-4, were identified several decades ago. Additional recessive brown midrib mutants have been identified by allelism tests and designated as bm5 and bm6. In this study, we determined that bm6 increases cell wall digestibility and decreases plant height. bm6 was confirmed onto the short arm of chromosome 2 by a small mapping set with 181 plants from a F(2) segregating population, derived from crossing B73 and a bm6 mutant line. Subsequently, 960 brown midrib individuals were selected from the same but larger F(2) population for genetic and physical mapping. With newly developed markers in the target region, the bm6 gene was assigned to a 180 kb interval flanked by markers SSR_308337 and SSR_488638. In this region, ten gene models are predicted in the maize B73 sequence. Analysis of these ten genes as well as genes in the syntenic rice region revealed that four of them are promising candidate genes for bm6. Our study will facilitate isolation of the underlying gene of bm6 and advance our understanding of brown midrib gene functions.
NCI-60 Whole Exome Sequencing and Pharmacological CellMiner Analyses
Reinhold, William C.; Varma, Sudhir; Sousa, Fabricio; Sunshine, Margot; Abaan, Ogan D.; Davis, Sean R.; Reinhold, Spencer W.; Kohn, Kurt W.; Morris, Joel; Meltzer, Paul S.; Doroshow, James H.; Pommier, Yves
2014-01-01
Exome sequencing provides unprecedented insights into cancer biology and pharmacological response. Here we assess these two parameters for the NCI-60, which is among the richest genomic and pharmacological publicly available cancer cell line databases. Homozygous genetic variants that putatively affect protein function were identified in 1,199 genes (approximately 6% of all genes). Variants that are either enriched or depleted compared to non-cancerous genomes, and thus may be influential in cancer progression and differential drug response were identified for 2,546 genes. Potential gene knockouts are made available. Assessment of cell line response to 19,940 compounds, including 110 FDA-approved drugs, reveals ≈80-fold range in resistance versus sensitivity response across cell lines. 103,422 gene variants were significantly correlated with at least one compound (at p<0.0002). These include genes of known pharmacological importance such as IGF1R, BRAF, RAD52, MTOR, STAT2 and TSC2 as well as a large number of candidate genes such as NOM1, TLL2, and XDH. We introduce two new web-based CellMiner applications that enable exploration of variant-to-compound relationships for a broad range of researchers, especially those without bioinformatics support. The first tool, “Genetic variant versus drug visualization”, provides a visualization of significant correlations between drug activity-gene variant combinations. Examples are given for the known vemurafenib-BRAF, and novel ifosfamide-RAD52 pairings. The second, “Genetic variant summation” allows an assessment of cumulative genetic variations for up to 150 combined genes together; and is designed to identify the variant burden for molecular pathways or functional grouping of genes. An example of its use is provided for the EGFR-ERBB2 pathway gene variant data and the identification of correlated EGFR, ERBB2, MTOR, BRAF, MEK and ERK inhibitors. The new tools are implemented as an updated web-based CellMiner version, for which the present publication serves as a compendium. PMID:25032700
Curtis, Ross E; Kim, Seyoung; Woolford, John L; Xu, Wenjie; Xing, Eric P
2013-03-21
Association analysis using genome-wide expression quantitative trait locus (eQTL) data investigates the effect that genetic variation has on cellular pathways and leads to the discovery of candidate regulators. Traditional analysis of eQTL data via pairwise statistical significance tests or linear regression does not leverage the availability of the structural information of the transcriptome, such as presence of gene networks that reveal correlation and potentially regulatory relationships among the study genes. We employ a new eQTL mapping algorithm, GFlasso, which we have previously developed for sparse structured regression, to reanalyze a genome-wide yeast dataset. GFlasso fully takes into account the dependencies among expression traits to suppress false positives and to enhance the signal/noise ratio. Thus, GFlasso leverages the gene-interaction network to discover the pleiotropic effects of genetic loci that perturb the expression level of multiple (rather than individual) genes, which enables us to gain more power in detecting previously neglected signals that are marginally weak but pleiotropically significant. While eQTL hotspots in yeast have been reported previously as genomic regions controlling multiple genes, our analysis reveals additional novel eQTL hotspots and, more interestingly, uncovers groups of multiple contributing eQTL hotspots that affect the expression level of functional gene modules. To our knowledge, our study is the first to report this type of gene regulation stemming from multiple eQTL hotspots. Additionally, we report the results from in-depth bioinformatics analysis for three groups of these eQTL hotspots: ribosome biogenesis, telomere silencing, and retrotransposon biology. We suggest candidate regulators for the functional gene modules that map to each group of hotspots. Not only do we find that many of these candidate regulators contain mutations in the promoter and coding regions of the genes, in the case of the Ribi group, we provide experimental evidence suggesting that the identified candidates do regulate the target genes predicted by GFlasso. Thus, this structured association analysis of a yeast eQTL dataset via GFlasso, coupled with extensive bioinformatics analysis, discovers a novel regulation pattern between multiple eQTL hotspots and functional gene modules. Furthermore, this analysis demonstrates the potential of GFlasso as a powerful computational tool for eQTL studies that exploit the rich structural information among expression traits due to correlation, regulation, or other forms of biological dependencies.
Variation in Telangiectasia Predisposing Genes Is Associated With Overall Radiation Toxicity
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tanteles, George A.; Department of Cancer Studies and Molecular Medicine, University Hospitals of Leicester, Leicester Royal Infirmary, Leicester; Murray, Robert J.S.
2012-11-15
Purpose: In patients receiving radiotherapy for breast cancer where the heart is within the radiation field, cutaneous telangiectasiae could be a marker of potential radiation-induced heart disease. We hypothesized that single nucleotide polymorphisms (SNPs) in genes known to cause heritable telangiectasia-associated disorders could predispose to such late, normal tissue vascular damage. Methods and Materials: The relationship between cutaneous telangiectasia as a late normal tissue radiation injury phenotype in 633 breast cancer patients treated with radiotherapy was examined. Patients were clinically assessed for the presence of cutaneous telangiectasia and genotyped at nine SNPs in three candidate genes. Candidate SNPs were withinmore » the endoglin (ENG) and activin A receptor, type II-like 1 (ACVRL1) genes, mutations in which cause hereditary hemorrhagic telangiectasia and the ataxia-telangiectasia mutated (ATM) gene associated with ataxia-telangiectasia. Results: A total of 121 (19.1%) patients exhibited a degree of cutaneous telangiectasiae on clinical examination. Regression was used to examine the associations between the presence of telangiectasiae in patients who underwent breast-conserving surgery, controlling for the effects of boost and known brassiere size (n=388), and individual geno- or haplotypes. Inheritance of ACVRL1 SNPs marginally contributed to the risk of cutaneous telangiectasiae. Haplotypic analysis revealed a stronger association between inheritance of a ATM haplotype and the presence of cutaneous telangiectasiae, fibrosis and overall toxicity. No significant association was observed between telangiectasiae and the coinheritance of the candidate ENG SNPs. Conclusions: Genetic variation in the ATM gene influences reaction to radiotherapy through both vascular damage and increased fibrosis. The predisposing variation in the ATM gene will need to be better defined to optimize it as a predictive marker for assessing radiotherapy late effects.« less
Horizontal gene transfer in silkworm, Bombyx mori.
Zhu, Bo; Lou, Miao-Miao; Xie, Guan-Lin; Zhang, Guo-Qing; Zhou, Xue-Ping; Li, Bin; Jin, Gu-Lei
2011-05-19
The domesticated silkworm, Bombyx mori, is the model insect for the order Lepidoptera, has economically important values, and has gained some representative behavioral characteristics compared to its wild ancestor. The genome of B. mori has been fully sequenced while function analysis of BmChi-h and BmSuc1 genes revealed that horizontal gene transfer (HGT) maybe bestow a clear selective advantage to B. mori. However, the role of HGT in the evolutionary history of B. mori is largely unexplored. In this study, we compare the whole genome of B. mori with those of 382 prokaryotic and eukaryotic species to investigate the potential HGTs. Ten candidate HGT events were defined in B. mori by comprehensive sequence analysis using Maximum Likelihood and Bayesian method combining with EST checking. Phylogenetic analysis of the candidate HGT genes suggested that one HGT was plant-to- B. mori transfer while nine were bacteria-to- B. mori transfer. Furthermore, functional analysis based on expression, coexpression and related literature searching revealed that several HGT candidate genes have added important characters, such as resistance to pathogen, to B. mori. Results from this study clearly demonstrated that HGTs play an important role in the evolution of B. mori although the number of HGT events in B. mori is in general smaller than those of microbes and other insects. In particular, interdomain HGTs in B. mori may give rise to functional, persistent, and possibly evolutionarily significant new genes.
Fine-mapping of qGW4.05, a major QTL for kernel weight and size in maize.
Chen, Lin; Li, Yong-xiang; Li, Chunhui; Wu, Xun; Qin, Weiwei; Li, Xin; Jiao, Fuchao; Zhang, Xiaojing; Zhang, Dengfeng; Shi, Yunsu; Song, Yanchun; Li, Yu; Wang, Tianyu
2016-04-12
Kernel weight and size are important components of grain yield in cereals. Although some information is available concerning the map positions of quantitative trait loci (QTL) for kernel weight and size in maize, little is known about the molecular mechanisms of these QTLs. qGW4.05 is a major QTL that is associated with kernel weight and size in maize. We combined linkage analysis and association mapping to fine-map and identify candidate gene(s) at qGW4.05. QTL qGW4.05 was fine-mapped to a 279.6-kb interval in a segregating population derived from a cross of Huangzaosi with LV28. By combining the results of regional association mapping and linkage analysis, we identified GRMZM2G039934 as a candidate gene responsible for qGW4.05. Candidate gene-based association mapping was conducted using a panel of 184 inbred lines with variable kernel weights and kernel sizes. Six polymorphic sites in the gene GRMZM2G039934 were significantly associated with kernel weight and kernel size. The results of linkage analysis and association mapping revealed that GRMZM2G039934 is the most likely candidate gene for qGW4.05. These results will improve our understanding of the genetic architecture and molecular mechanisms underlying kernel development in maize.
The Influence of Genetics on Cystic Fibrosis Phenotypes
Knowles, Michael R.; Drumm, Mitchell
2012-01-01
Technological advances in genetics have made feasible and affordable large studies to identify genetic variants that cause or modify a trait. Genetic studies have been carried out to assess variants in candidate genes, as well as polymorphisms throughout the genome, for their associations with heritable clinical outcomes of cystic fibrosis (CF), such as lung disease, meconium ileus, and CF-related diabetes. The candidate gene approach has identified some predicted relationships, while genome-wide surveys have identified several genes that would not have been obvious disease-modifying candidates, such as a methionine sulfoxide transferase gene that influences intestinal obstruction, or a region on chromosome 11 proximate to genes encoding a transcription factor and an apoptosis controller that associates with lung function. These unforeseen associations thus provide novel insight into disease pathophysiology, as well as suggesting new therapeutic strategies for CF. PMID:23209180
Identification of the gene for Nance-Horan syndrome (NHS).
Brooks, S P; Ebenezer, N D; Poopalasundaram, S; Lehmann, O J; Moore, A T; Hardcastle, A J
2004-10-01
The disease intervals for Nance-Horan syndrome (NHS [MIM 302350]) and X linked congenital cataract (CXN) overlap on Xp22. To identify the gene or genes responsible for these diseases. Families with NHS were ascertained. The refined locus for CXN was used to focus the search for candidate genes, which were screened by polymerase chain reaction and direct sequencing of potential exons and intron-exon splice sites. Genomic structures and homologies were determined using bioinformatics. Expression studies were undertaken using specific exonic primers to amplify human fetal cDNA and mouse RNA. A novel gene NHS, with no known function, was identified as causative for NHS. Protein truncating mutations were detected in all three NHS pedigrees, but no mutation was identified in a CXN family, raising the possibility that NHS and CXN may not be allelic. The NHS gene forms a new gene family with a closely related novel gene NHS-Like1 (NHSL1). NHS and NHSL1 lie in paralogous duplicated chromosomal intervals on Xp22 and 6q24, and NHSL1 is more broadly expressed than NHS in human fetal tissues. This study reports the independent identification of the gene causative for Nance-Horan syndrome and extends the number of mutations identified.
Large-Scale Gene-Centric Analysis Identifies Novel Variants for Coronary Artery Disease
2011-01-01
Coronary artery disease (CAD) has a significant genetic contribution that is incompletely characterized. To complement genome-wide association (GWA) studies, we conducted a large and systematic candidate gene study of CAD susceptibility, including analysis of many uncommon and functional variants. We examined 49,094 genetic variants in ∼2,100 genes of cardiovascular relevance, using a customised gene array in 15,596 CAD cases and 34,992 controls (11,202 cases and 30,733 controls of European descent; 4,394 cases and 4,259 controls of South Asian origin). We attempted to replicate putative novel associations in an additional 17,121 CAD cases and 40,473 controls. Potential mechanisms through which the novel variants could affect CAD risk were explored through association tests with vascular risk factors and gene expression. We confirmed associations of several previously known CAD susceptibility loci (eg, 9p21.3:p<10−33; LPA:p<10−19; 1p13.3:p<10−17) as well as three recently discovered loci (COL4A1/COL4A2, ZC3HC1, CYP17A1:p<5×10−7). However, we found essentially null results for most previously suggested CAD candidate genes. In our replication study of 24 promising common variants, we identified novel associations of variants in or near LIPA, IL5, TRIB1, and ABCG5/ABCG8, with per-allele odds ratios for CAD risk with each of the novel variants ranging from 1.06–1.09. Associations with variants at LIPA, TRIB1, and ABCG5/ABCG8 were supported by gene expression data or effects on lipid levels. Apart from the previously reported variants in LPA, none of the other ∼4,500 low frequency and functional variants showed a strong effect. Associations in South Asians did not differ appreciably from those in Europeans, except for 9p21.3 (per-allele odds ratio: 1.14 versus 1.27 respectively; P for heterogeneity = 0.003). This large-scale gene-centric analysis has identified several novel genes for CAD that relate to diverse biochemical and cellular functions and clarified the literature with regard to many previously suggested genes. PMID:21966275
Satapathy, Lopamudra; Kumar, Dhananjay; Kumar, Manish; Mukhopadhyay, Kunal
2018-01-01
WRKY, a plant-specific transcription factor family, plays vital roles in pathogen defense, abiotic stress, and phytohormone signalling. Little is known about the roles and function of WRKY transcription factors in response to rust diseases in wheat. In the present study, three TaWRKY genes encoding complete protein sequences were cloned. They belonged to class II and III WRKY based on the number of WRKY domains and the pattern of zinc finger structures. Twenty-two DNA-protein binding docking complexes predicted stable interactions of WRKY domain with W-box. Quantitative real-time-PCR using wheat near-isogenic lines with or without Lr28 gene revealed differential up- or down-regulation in response to biotic and abiotic stress treatments which could be responsible for their functional divergence in wheat. TaWRKY62 was found to be induced upon treatment with JA, MJ, and SA and reduced after ABA treatments. Maximum induction of six out of seven genes occurred at 48 h post inoculation due to pathogen inoculation. Hence, TaWRKY (49, 50 , 52 , 55 , 57, and 62 ) can be considered as potential candidate genes for further functional validation as well as for crop improvement programs for stress resistance. The results of the present study will enhance knowledge towards understanding the molecular basis of mode of action of WRKY transcription factor genes in wheat and their role during leaf rust pathogenesis in particular.
Xu, Hai-Ming; Kong, Xiang-Dong; Chen, Fei; Huang, Ji-Xiang; Lou, Xiang-Yang; Zhao, Jian-Yi
2015-10-24
Brassica napus is an important oilseed crop. Dissection of the genetic architecture underlying oil-related biological processes will greatly facilitates the genetic improvement of rapeseed. The differential gene expression during pod development offers a snapshot on the genes responsible for oil accumulation in. To identify candidate genes in the linkage peaks reported previously, we used RNA sequencing (RNA-Seq) technology to analyze the pod transcriptomes of German cultivar Sollux and Chinese inbred line Gaoyou. The RNA samples were collected for RNA-Seq at 5-7, 15-17 and 25-27 days after flowering (DAF). Bioinformatics analysis was performed to investigate differentially expressed genes (DEGs). Gene annotation analysis was integrated with QTL mapping and Brassica napus pod transcriptome profiling to detect potential candidate genes in oilseed. Four hundred sixty five and two thousand, one hundred fourteen candidate DEGs were identified, respectively, between two varieties at the same stages and across different periods of each variety. Then, 33 DEGs between Sollux and Gaoyou were identified as the candidate genes affecting seed oil content by combining those DEGs with the quantitative trait locus (QTL) mapping results, of which, one was found to be homologous to Arabidopsis thaliana lipid-related genes. Intervarietal DEGs of lipid pathways in QTL regions represent important candidate genes for oil-related traits. Integrated analysis of transcriptome profiling, QTL mapping and comparative genomics with other relative species leads to efficient identification of most plausible functional genes underlying oil-content related characters, offering valuable resources for bettering breeding program of Brassica napus. This study provided a comprehensive overview on the pod transcriptomes of two varieties with different oil-contents at the three developmental stages.
Prediction of gene-phenotype associations in humans, mice, and plants using phenologs.
Woods, John O; Singh-Blom, Ulf Martin; Laurent, Jon M; McGary, Kriston L; Marcotte, Edward M
2013-06-21
Phenotypes and diseases may be related to seemingly dissimilar phenotypes in other species by means of the orthology of underlying genes. Such "orthologous phenotypes," or "phenologs," are examples of deep homology, and may be used to predict additional candidate disease genes. In this work, we develop an unsupervised algorithm for ranking phenolog-based candidate disease genes through the integration of predictions from the k nearest neighbor phenologs, comparing classifiers and weighting functions by cross-validation. We also improve upon the original method by extending the theory to paralogous phenotypes. Our algorithm makes use of additional phenotype data--from chicken, zebrafish, and E. coli, as well as new datasets for C. elegans--establishing that several types of annotations may be treated as phenotypes. We demonstrate the use of our algorithm to predict novel candidate genes for human atrial fibrillation (such as HRH2, ATP4A, ATP4B, and HOPX) and epilepsy (e.g., PAX6 and NKX2-1). We suggest gene candidates for pharmacologically-induced seizures in mouse, solely based on orthologous phenotypes from E. coli. We also explore the prediction of plant gene-phenotype associations, as for the Arabidopsis response to vernalization phenotype. We are able to rank gene predictions for a significant portion of the diseases in the Online Mendelian Inheritance in Man database. Additionally, our method suggests candidate genes for mammalian seizures based only on bacterial phenotypes and gene orthology. We demonstrate that phenotype information may come from diverse sources, including drug sensitivities, gene ontology biological processes, and in situ hybridization annotations. Finally, we offer testable candidates for a variety of human diseases, plant traits, and other classes of phenotypes across a wide array of species.
Zangen, David; Kaufman, Yotam; Zeligson, Sharon; Perlberg, Shira; Fridman, Hila; Kanaan, Moein; Abdulhadi-Atwan, Maha; Abu Libdeh, Abdulsalam; Gussow, Ayal; Kisslov, Irit; Carmel, Liran; Renbaum, Paul; Levy-Lahad, Ephrat
2011-10-07
XX female gonadal dysgenesis (XX-GD) is a rare, genetically heterogeneous disorder characterized by lack of spontaneous pubertal development, primary amenorrhea, uterine hypoplasia, and hypergonadotropic hypogonadism as a result of streak gonads. Most cases are unexplained but thought to be autosomal recessive. We elucidated the genetic basis of XX-GD in a highly consanguineous Palestinian family by using homozygosity mapping and candidate-gene and whole-exome sequencing. Affected females were homozygous for a 3 bp deletion (NM_016556.2, c.600_602del) in the PSMC3IP gene, leading to deletion of a glutamic acid residue (p.Glu201del) in the highly conserved C-terminal acidic domain. Proteasome 26S subunit, ATPase, 3-Interacting Protein (PSMC3IP)/Tat Binding Protein Interacting Protein (TBPIP) is a nuclear, tissue-specific protein with multiple functions. It is critical for meiotic recombination as indicated by the known role of its yeast ortholog, Hop2. Through the C terminus (not present in yeast), PSMC3IP also coactivates ligand-driven transcription mediated by estrogen, androgen, glucocorticoid, progesterone, and thyroid nuclear receptors. In cell lines, the p.Glu201del mutation abolished PSMC3IP activation of estrogen-driven transcription. Impaired estrogenic signaling can lead to ovarian dysgenesis both by affecting the size of the follicular pool created during fetal development and by failing to counteract follicular atresia during puberty. PSMC3IP joins previous genes known to be mutated in XX-GD, the FSH receptor, and BMP15, highlighting the importance of hormonal signaling in ovarian development and maintenance and suggesting a common pathway perturbed in isolated XX-GD. By analogy to other XX-GD genes, PSMC3IP is also a candidate gene for premature ovarian failure, and its role in folliculogenesis should be further investigated. Copyright © 2011 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Fewings, Eleanor; Larionov, Alexey; Redman, James; Goldgraben, Mae A; Scarth, James; Richardson, Susan; Brewer, Carole; Davidson, Rosemarie; Ellis, Ian; Evans, D Gareth; Halliday, Dorothy; Izatt, Louise; Marks, Peter; McConnell, Vivienne; Verbist, Louis; Mayes, Rebecca; Clark, Graeme R; Hadfield, James; Chin, Suet-Feung; Teixeira, Manuel R; Giger, Olivier T; Hardwick, Richard; di Pietro, Massimiliano; O'Donovan, Maria; Pharoah, Paul; Caldas, Carlos; Fitzgerald, Rebecca C; Tischkowitz, Marc
2018-04-26
Germline pathogenic variants in the E-cadherin gene (CDH1) are strongly associated with the development of hereditary diffuse gastric cancer. There is a paucity of data to guide risk assessment and management of families with hereditary diffuse gastric cancer that do not carry a CDH1 pathogenic variant, making it difficult to make informed decisions about surveillance and risk-reducing surgery. We aimed to identify new candidate genes associated with predisposition to hereditary diffuse gastric cancer in affected families without pathogenic CDH1 variants. We did whole-exome sequencing on DNA extracted from the blood of 39 individuals (28 individuals diagnosed with hereditary diffuse gastric cancer and 11 unaffected first-degree relatives) in 22 families without pathogenic CDH1 variants. Genes with loss-of-function variants were prioritised using gene-interaction analysis to identify clusters of genes that could be involved in predisposition to hereditary diffuse gastric cancer. Protein-affecting germline variants were identified in probands from six families with hereditary diffuse gastric cancer; variants were found in genes known to predispose to cancer and in lesser-studied DNA repair genes. A frameshift deletion in PALB2 was found in one member of a family with a history of gastric and breast cancer. Two different MSH2 variants were identified in two unrelated affected individuals, including one frameshift insertion and one previously described start-codon loss. One family had a unique combination of variants in the DNA repair genes ATR and NBN. Two variants in the DNA repair gene RECQL5 were identified in two unrelated families: one missense variant and a splice-acceptor variant. The results of this study suggest a role for the known cancer predisposition gene PALB2 in families with hereditary diffuse gastric cancer and no detected pathogenic CDH1 variants. We also identified new candidate genes associated with disease risk in these families. UK Medical Research Council (Sackler programme), European Research Council under the European Union's Seventh Framework Programme (2007-13), National Institute for Health Research Cambridge Biomedical Research Centre, Experimental Cancer Medicine Centres, and Cancer Research UK. Copyright © 2018 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY 4.0 license. Published by Elsevier Ltd.. All rights reserved.
Wang, Quan; Jia, Peilin; Cuenco, Karen T.; Feingold, Eleanor; Marazita, Mary L.; Wang, Lily; Zhao, Zhongming
2013-01-01
A number of genetic studies have suggested numerous susceptibility genes for dental caries over the past decade with few definite conclusions. The rapid accumulation of relevant information, along with the complex architecture of the disease, provides a challenging but also unique opportunity to review and integrate the heterogeneous data for follow-up validation and exploration. In this study, we collected and curated candidate genes from four major categories: association studies, linkage scans, gene expression analyses, and literature mining. Candidate genes were prioritized according to the magnitude of evidence related to dental caries. We then searched for dense modules enriched with the prioritized candidate genes through their protein-protein interactions (PPIs). We identified 23 modules comprising of 53 genes. Functional analyses of these 53 genes revealed three major clusters: cytokine network relevant genes, matrix metalloproteinases (MMPs) family, and transforming growth factor-beta (TGF-β) family, all of which have been previously implicated to play important roles in tooth development and carious lesions. Through our extensive data collection and an integrative application of gene prioritization and PPI network analyses, we built a dental caries-specific sub-network for the first time. Our study provided insights into the molecular mechanisms underlying dental caries. The framework we proposed in this work can be applied to other complex diseases. PMID:24146904
Liu, Lei; Ang, Keng Pee; Elliott, J A K; Kent, Matthew Peter; Lien, Sigbjørn; MacDonald, Danielle; Boulding, Elizabeth Grace
2017-03-01
Comparative genome scans can be used to identify chromosome regions, but not traits, that are putatively under selection. Identification of targeted traits may be more likely in recently domesticated populations under strong artificial selection for increased production. We used a North American Atlantic salmon 6K SNP dataset to locate genome regions of an aquaculture strain (Saint John River) that were highly diverged from that of its putative wild founder population (Tobique River). First, admixed individuals with partial European ancestry were detected using STRUCTURE and removed from the dataset. Outlier loci were then identified as those showing extreme differentiation between the aquaculture population and the founder population. All Arlequin methods identified an overlapping subset of 17 outlier loci, three of which were also identified by BayeScan. Many outlier loci were near candidate genes and some were near published quantitative trait loci (QTLs) for growth, appetite, maturity, or disease resistance. Parallel comparisons using a wild, nonfounder population (Stewiacke River) yielded only one overlapping outlier locus as well as a known maturity QTL. We conclude that genome scans comparing a recently domesticated strain with its wild founder population can facilitate identification of candidate genes for traits known to have been under strong artificial selection.
Mining biological databases for candidate disease genes
NASA Astrophysics Data System (ADS)
Braun, Terry A.; Scheetz, Todd; Webster, Gregg L.; Casavant, Thomas L.
2001-07-01
The publicly-funded effort to sequence the complete nucleotide sequence of the human genome, the Human Genome Project (HGP), has currently produced more than 93% of the 3 billion nucleotides of the human genome into a preliminary `draft' format. In addition, several valuable sources of information have been developed as direct and indirect results of the HGP. These include the sequencing of model organisms (rat, mouse, fly, and others), gene discovery projects (ESTs and full-length), and new technologies such as expression analysis and resources (micro-arrays or gene chips). These resources are invaluable for the researchers identifying the functional genes of the genome that transcribe and translate into the transcriptome and proteome, both of which potentially contain orders of magnitude more complexity than the genome itself. Preliminary analyses of this data identified approximately 30,000 - 40,000 human `genes.' However, the bulk of the effort still remains -- to identify the functional and structural elements contained within the transcriptome and proteome, and to associate function in the transcriptome and proteome to genes. A fortuitous consequence of the HGP is the existence of hundreds of databases containing biological information that may contain relevant data pertaining to the identification of disease-causing genes. The task of mining these databases for information on candidate genes is a commercial application of enormous potential. We are developing a system to acquire and mine data from specific databases to aid our efforts to identify disease genes. A high speed cluster of Linux of workstations is used to analyze sequence and perform distributed sequence alignments as part of our data mining and processing. This system has been used to mine GeneMap99 sequences within specific genomic intervals to identify potential candidate disease genes associated with Bardet-Biedle Syndrome (BBS).
Silvar, Cristina; Perovic, Dragan; Nussbaumer, Thomas; Spannagl, Manuel; Usadel, Björn; Casas, Ana; Igartua, Ernesto; Ordon, Frank
2013-01-01
Three quantitative trait loci (QTL) conferring broad spectrum resistance to powdery mildew, caused by the fungus Blumeria graminis f. sp. hordei, were previously identified on chromosomes 7HS, 7HL and 6HL in the Spanish barley landrace-derived lines SBCC097 and SBCC145. In the present work, a genome-wide putative linear gene index of barley (Genome Zipper) and the first draft of the physical, genetic and functional sequence of the barley genome were used to go one step further in the shortening and explicit demarcation on the barley genome of these regions conferring resistance to powdery mildew as well as in the identification of candidate genes. First, a comparative analysis of the target regions to the barley Genome Zippers of chromosomes 7H and 6H allowed the development of 25 new gene-based molecular markers, which slightly better delimit the QTL intervals. These new markers provided the framework for anchoring of genetic and physical maps, figuring out the outline of the barley genome at the target regions in SBCC097 and SBCC145. The outermost flanking markers of QTLs on 7HS, 7HL and 6HL defined a physical area of 4 Mb, 3.7 Mb and 3.2 Mb, respectively. In total, 21, 10 and 16 genes on 7HS, 7HL and 6HL, respectively, could be interpreted as potential candidates to explain the resistance to powdery mildew, as they encode proteins of related functions with respect to the known pathogen defense-related processes. The majority of these were annotated as belonging to the NBS-LRR class or protein kinase family. PMID:23826271
Ndeve, Arsenio Daniel; Huynh, Bao-Lam; Matthews, William Charles; Roberts, Philip Alan
2018-01-01
Cowpea is one of the most important food and forage legumes in drier regions of the tropics and subtropics. However, cowpea yield worldwide is markedly below the known potential due to abiotic and biotic stresses, including parasitism by root-knot nematodes (Meloidogyne spp., RKN). Two resistance genes with dominant effect, Rk and Rk2, have been reported to provide resistance against RKN in cowpea. Despite their description and use in breeding for resistance to RKN and particularly genetic mapping of the Rk locus, the exact genes conferring resistance to RKN remain unknown. In the present work, QTL mapping using recombinant inbred line (RIL) population 524B x IT84S-2049 segregating for a newly mapped locus and analysis of the transcriptome changes in two cowpea near-isogenic lines (NIL) were used to identify candidate genes for Rk and the newly mapped locus. A major QTL, designated QRk-vu9.1, associated with resistance to Meloidogyne javanica reproduction, was detected and mapped on linkage group LG9 at position 13.37 cM using egg production data. Transcriptome analysis on resistant and susceptible NILs 3 and 9 days after inoculation revealed up-regulation of 109 and 98 genes and down-regulation of 110 and 89 genes, respectively, out of 19,922 unique genes mapped to the common bean reference genome. Among the differentially expressed genes, four and nine genes were found within the QRk-vu9.1 and QRk-vu11.1 QTL intervals, respectively. Six of these genes belong to the TIR-NBS-LRR family of resistance genes and three were upregulated at one or more time-points. Quantitative RT-PCR validated gene expression to be positively correlated with RNA-seq expression pattern for eight genes. Future functional analysis of these cowpea genes will enhance our understanding of Rk-mediated resistance and identify the specific gene responsible for the resistance. PMID:29300744
Santos, Jansen Rodrigo Pereira; Ndeve, Arsenio Daniel; Huynh, Bao-Lam; Matthews, William Charles; Roberts, Philip Alan
2018-01-01
Cowpea is one of the most important food and forage legumes in drier regions of the tropics and subtropics. However, cowpea yield worldwide is markedly below the known potential due to abiotic and biotic stresses, including parasitism by root-knot nematodes (Meloidogyne spp., RKN). Two resistance genes with dominant effect, Rk and Rk2, have been reported to provide resistance against RKN in cowpea. Despite their description and use in breeding for resistance to RKN and particularly genetic mapping of the Rk locus, the exact genes conferring resistance to RKN remain unknown. In the present work, QTL mapping using recombinant inbred line (RIL) population 524B x IT84S-2049 segregating for a newly mapped locus and analysis of the transcriptome changes in two cowpea near-isogenic lines (NIL) were used to identify candidate genes for Rk and the newly mapped locus. A major QTL, designated QRk-vu9.1, associated with resistance to Meloidogyne javanica reproduction, was detected and mapped on linkage group LG9 at position 13.37 cM using egg production data. Transcriptome analysis on resistant and susceptible NILs 3 and 9 days after inoculation revealed up-regulation of 109 and 98 genes and down-regulation of 110 and 89 genes, respectively, out of 19,922 unique genes mapped to the common bean reference genome. Among the differentially expressed genes, four and nine genes were found within the QRk-vu9.1 and QRk-vu11.1 QTL intervals, respectively. Six of these genes belong to the TIR-NBS-LRR family of resistance genes and three were upregulated at one or more time-points. Quantitative RT-PCR validated gene expression to be positively correlated with RNA-seq expression pattern for eight genes. Future functional analysis of these cowpea genes will enhance our understanding of Rk-mediated resistance and identify the specific gene responsible for the resistance.
The contribution of mouse models to the understanding of constitutional thrombocytopenia.
Léon, Catherine; Dupuis, Arnaud; Gachet, Christian; Lanza, François
2016-08-01
Constitutional thrombocytopenias result from platelet production abnormalities of hereditary origin. Long misdiagnosed and poorly studied, knowledge about these rare diseases has increased considerably over the last twenty years due to improved technology for the identification of mutations, as well as an improvement in obtaining megakaryocyte culture from patient hematopoietic stem cells. Simultaneously, the manipulation of mouse genes (transgenesis, total or conditional inactivation, introduction of point mutations, random chemical mutagenesis) have helped to generate disease models that have contributed greatly to deciphering patient clinical and laboratory features. Most of the thrombocytopenias for which the mutated genes have been identified now have a murine model counterpart. This review focuses on the contribution that these mouse models have brought to the understanding of hereditary thrombocytopenias with respect to what was known in humans. Animal models have either i) provided novel information on the molecular and cellular pathways that were missing from the patient studies; ii) improved our understanding of the mechanisms of thrombocytopoiesis; iii) been instrumental in structure-function studies of the mutated gene products; and iv) been an invaluable tool as preclinical models to test new drugs or develop gene therapies. At present, the genetic determinants of thrombocytopenia remain unknown in almost half of all cases. Currently available high-speed sequencing techniques will identify new candidate genes, which will in turn allow the generation of murine models to confirm and further study the abnormal phenotype. In a complementary manner, programs of random mutagenesis in mice should also identify new candidate genes involved in thrombocytopenia. Copyright© Ferrata Storti Foundation.
Kim, Jaemin; Lee, Taeheon; Kim, Tae-Hun; Lee, Kyung-Tai; Kim, Heebal
2012-12-19
Traditional candidate gene approach has been widely used for the study of complex diseases including obesity. However, this approach is largely limited by its dependence on existing knowledge of presumed biology of the phenotype under investigation. Our combined strategy of comparative genomics and chromosomal heritability estimate analysis of obesity traits, subscapular skinfold thickness and back-fat thickness in Korean cohorts and pig (Sus scrofa), may overcome the limitations of candidate gene analysis and allow us to better understand genetic predisposition to human obesity. We found common genes including FTO, the fat mass and obesity associated gene, identified from significant SNPs by association studies of each trait. These common genes were related to blood pressure and arterial stiffness (P = 1.65E-05) and type 2 diabetes (P = 0.00578). Through the estimation of variance of genetic component (heritability) for each chromosome by SNPs, we observed a significant positive correlation (r = 0.479) between genetic contributions of human and pig to obesity traits. Furthermore, we noted that human chromosome 2 (syntenic to pig chromosomes 3 and 15) was most important in explaining the phenotypic variance for obesity. Obesity genetics still awaits further discovery. Navigating syntenic regions suggests obesity candidate genes on chromosome 2 that are previously known to be associated with obesity-related diseases: MRPL33, PARD3B, ERBB4, STK39, and ZNF385B.
Genetic architecture for human aggression: A study of gene-phenotype relationship in OMIM.
Zhang-James, Yanli; Faraone, Stephen V
2016-07-01
Genetic studies of human aggression have mainly focused on known candidate genes and pathways regulating serotonin and dopamine signaling and hormonal functions. These studies have taught us much about the genetics of human aggression, but no genetic locus has yet achieved genome-significance. We here present a review based on a paradoxical hypothesis that studies of rare, functional genetic variations can lead to a better understanding of the molecular mechanisms underlying complex multifactorial disorders such as aggression. We examined all aggression phenotypes catalogued in Online Mendelian Inheritance in Man (OMIM), an Online Catalog of Human Genes and Genetic Disorders. We identified 95 human disorders that have documented aggressive symptoms in at least one individual with a well-defined genetic variant. Altogether, we retrieved 86 causal genes. Although most of these genes had not been implicated in human aggression by previous studies, the most significantly enriched canonical pathways had been previously implicated in aggression (e.g., serotonin and dopamine signaling). Our findings provide strong evidence to support the causal role of these pathways in the pathogenesis of aggression. In addition, the novel genes and pathways we identified suggest additional mechanisms underlying the origins of human aggression. Genome-wide association studies with very large samples will be needed to determine if common variants in these genes are risk factors for aggression. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.
Šmajs, David; Zobaníková, Marie; Strouhal, Michal; Čejková, Darina; Dugan-Rocha, Shannon; Pospíšilová, Petra; Norris, Steven J.; Albert, Tom; Qin, Xiang; Hallsworth-Pepin, Kym; Buhay, Christian; Muzny, Donna M.; Chen, Lei; Gibbs, Richard A.; Weinstock, George M.
2011-01-01
Treponema paraluiscuniculi is the causative agent of rabbit venereal spirochetosis. It is not infectious to humans, although its genome structure is very closely related to other pathogenic Treponema species including Treponema pallidum subspecies pallidum, the etiological agent of syphilis. In this study, the genome sequence of Treponema paraluiscuniculi, strain Cuniculi A, was determined by a combination of several high-throughput sequencing strategies. Whereas the overall size (1,133,390 bp), arrangement, and gene content of the Cuniculi A genome closely resembled those of the T. pallidum genome, the T. paraluiscuniculi genome contained a markedly higher number of pseudogenes and gene fragments (51). In addition to pseudogenes, 33 divergent genes were also found in the T. paraluiscuniculi genome. A set of 32 (out of 84) affected genes encoded proteins of known or predicted function in the Nichols genome. These proteins included virulence factors, gene regulators and components of DNA repair and recombination. The majority (52 or 61.9%) of the Cuniculi A pseudogenes and divergent genes were of unknown function. Our results indicate that T. paraluiscuniculi has evolved from a T. pallidum-like ancestor and adapted to a specialized host-associated niche (rabbits) during loss of infectivity to humans. The genes that are inactivated or altered in T. paraluiscuniculi are candidates for virulence factors important in the infectivity and pathogenesis of T. pallidum subspecies. PMID:21655244
Zhang, Ningbo; Li, Ruimin; Shen, Wei; Jiao, Shuzhen; Zhang, Junxiang; Xu, Weirong
2018-04-27
The major latex protein/ripening-related protein (MLP/RRP) subfamily is known to be involved in a wide range of biological processes of plant development and various stress responses. However, the biological function of MLP/RRP proteins is still far from being clear and identification of them may provide important clues for understanding their roles. Here, we report a genome-wide evolutionary characterization and gene expression analysis of the MLP family in European Vitis species. A total of 14 members, was found in the grape genome, all of which are located on chromosome 1, where are predominantly arranged in tandem clusters. We have noticed, most surprisingly, promoter-sharing by several non-identical but highly similar gene members to a greater extent than expected by chance. Synteny analysis between the grape and Arabidopsis thaliana genomes suggested that 3 grape MLP genes arose before the divergence of the two species. Phylogenetic analysis provided further insights into the evolutionary relationship between the genes, as well as their putative functions, and tissue-specific expression analysis suggested distinct biological roles for different members. Our expression data suggested a couple of candidate genes involved in abiotic stresses and phytohormone responses. The present work provides new insight into the evolution and regulation of Vitis MLP genes, which represent targets for future studies and inclusion in tolerance-related molecular breeding programs.
Cangül, Hakan; Demir, Korcan; Babayiğit, H Ömür; Abacı, Ayhan; Böber, Ece
2015-09-01
Congenital hypothyroidism (CH) occurs with a prevalence of approximately 1:4000 live births. Defects of thyroid hormone synthesis account for 15-20% of these cases. Thyroid peroxidase (TPO) gene is the most common cause for dyshormonogenesis. So far, more than 60 mutations in the TPO gene have been described, resulting in a variable decrease in TPO bioactivity. We present an 8-day-old male with mild CH who was identified to have a G to A transition in the fifth codon of the TPO gene (c.13G>A; p.Ala5Thr). The unaffected family members were heterozygous carriers of the mutation, whereas 400 healthy individuals of the same ethnic background did not have the mutation. Mutation analysis of 11 known causative CH genes and 4 of our own strong candidate genes with next-generation sequencing revealed no mutations in the patient nor in any other family members. The results of in silico functional analyses indicated partial loss-of-function (LOF) in the resulting enzyme molecule due to mutation. The patient's clinical finding s were consistent with the effect of this partial LOF of the mutation. In conclusion, we strongly believe that A5T alteration in the TPO gene is actually pathogenic and suggest that it should be classified as a mutation.
Chen, Xiaowei Sylvia; Reader, Rose H; Hoischen, Alexander; Veltman, Joris A; Simpson, Nuala H; Francks, Clyde; Newbury, Dianne F; Fisher, Simon E
2017-04-25
A significant proportion of children have unexplained problems acquiring proficient linguistic skills despite adequate intelligence and opportunity. Developmental language disorders are highly heritable with substantial societal impact. Molecular studies have begun to identify candidate loci, but much of the underlying genetic architecture remains undetermined. We performed whole-exome sequencing of 43 unrelated probands affected by severe specific language impairment, followed by independent validations with Sanger sequencing, and analyses of segregation patterns in parents and siblings, to shed new light on aetiology. By first focusing on a pre-defined set of known candidates from the literature, we identified potentially pathogenic variants in genes already implicated in diverse language-related syndromes, including ERC1, GRIN2A, and SRPX2. Complementary analyses suggested novel putative candidates carrying validated variants which were predicted to have functional effects, such as OXR1, SCN9A and KMT2D. We also searched for potential "multiple-hit" cases; one proband carried a rare AUTS2 variant in combination with a rare inherited haplotype affecting STARD9, while another carried a novel nonsynonymous variant in SEMA6D together with a rare stop-gain in SYNPR. On broadening scope to all rare and novel variants throughout the exomes, we identified biological themes that were enriched for such variants, including microtubule transport and cytoskeletal regulation.
Chen, Xiaowei Sylvia; Reader, Rose H.; Hoischen, Alexander; Veltman, Joris A.; Simpson, Nuala H.; Francks, Clyde; Newbury, Dianne F.; Fisher, Simon E.
2017-01-01
A significant proportion of children have unexplained problems acquiring proficient linguistic skills despite adequate intelligence and opportunity. Developmental language disorders are highly heritable with substantial societal impact. Molecular studies have begun to identify candidate loci, but much of the underlying genetic architecture remains undetermined. We performed whole-exome sequencing of 43 unrelated probands affected by severe specific language impairment, followed by independent validations with Sanger sequencing, and analyses of segregation patterns in parents and siblings, to shed new light on aetiology. By first focusing on a pre-defined set of known candidates from the literature, we identified potentially pathogenic variants in genes already implicated in diverse language-related syndromes, including ERC1, GRIN2A, and SRPX2. Complementary analyses suggested novel putative candidates carrying validated variants which were predicted to have functional effects, such as OXR1, SCN9A and KMT2D. We also searched for potential “multiple-hit” cases; one proband carried a rare AUTS2 variant in combination with a rare inherited haplotype affecting STARD9, while another carried a novel nonsynonymous variant in SEMA6D together with a rare stop-gain in SYNPR. On broadening scope to all rare and novel variants throughout the exomes, we identified biological themes that were enriched for such variants, including microtubule transport and cytoskeletal regulation. PMID:28440294
Ponsuwanna, Patrath; Kümpornsin, Krittikorn; Chookajorn, Thanat
2014-01-01
Even though antigenic variation is employed among parasitic protozoa for host immune evasion, Tetrahymena thermophila, a free-living ciliate, can also change its surface protein antigens. These cysteine-rich glycosylphosphatidylinositol (GPI)-linked surface proteins are encoded by a family of polymorphic Ser genes. Despite the availability of T. thermophila genome, a comprehensive analysis of the Ser family is limited by its high degree of polymorphism. In order to overcome this problem, a new approach was adopted by searching for Ser candidates with common motif sequences, namely length-specific repetitive cysteine pattern and GPI anchor site. The candidate genes were phylogenetically compared with the previously identified Ser genes and classified into subtypes. Ser candidates were often found to be located as tandem arrays of the same subtypes on several chromosomal scaffolds. Certain Ser candidates located in the same chromosomal arrays were transcriptionally expressed at specific T. thermophila developmental stages. These Ser candidates selected by the motif analysis approach can form the foundation for a systematic identification of the entire Ser gene family, which will contribute to the understanding of their function and the basis of T. thermophila antigenic variation. PMID:25133747
Miraoui, Hichem; Dwyer, Andrew A.; Sykiotis, Gerasimos P.; Plummer, Lacey; Chung, Wilson; Feng, Bihua; Beenken, Andrew; Clarke, Jeff; Pers, Tune H.; Dworzynski, Piotr; Keefe, Kimberley; Niedziela, Marek; Raivio, Taneli; Crowley, William F.; Seminara, Stephanie B.; Quinton, Richard; Hughes, Virginia A.; Kumanov, Philip; Young, Jacques; Yialamas, Maria A.; Hall, Janet E.; Van Vliet, Guy; Chanoine, Jean-Pierre; Rubenstein, John; Mohammadi, Moosa; Tsai, Pei-San; Sidis, Yisrael; Lage, Kasper; Pitteloud, Nelly
2013-01-01
Congenital hypogonadotropic hypogonadism (CHH) and its anosmia-associated form (Kallmann syndrome [KS]) are genetically heterogeneous. Among the >15 genes implicated in these conditions, mutations in FGF8 and FGFR1 account for ∼12% of cases; notably, KAL1 and HS6ST1 are also involved in FGFR1 signaling and can be mutated in CHH. We therefore hypothesized that mutations in genes encoding a broader range of modulators of the FGFR1 pathway might contribute to the genetics of CHH as causal or modifier mutations. Thus, we aimed to (1) investigate whether CHH individuals harbor mutations in members of the so-called “FGF8 synexpression” group and (2) validate the ability of a bioinformatics algorithm on the basis of protein-protein interactome data (interactome-based affiliation scoring [IBAS]) to identify high-quality candidate genes. On the basis of sequence homology, expression, and structural and functional data, seven genes were selected and sequenced in 386 unrelated CHH individuals and 155 controls. Except for FGF18 and SPRY2, all other genes were found to be mutated in CHH individuals: FGF17 (n = 3 individuals), IL17RD (n = 8), DUSP6 (n = 5), SPRY4 (n = 14), and FLRT3 (n = 3). Independently, IBAS predicted FGF17 and IL17RD as the two top candidates in the entire proteome on the basis of a statistical test of their protein-protein interaction patterns to proteins known to be altered in CHH. Most of the FGF17 and IL17RD mutations altered protein function in vitro. IL17RD mutations were found only in KS individuals and were strongly linked to hearing loss (6/8 individuals). Mutations in genes encoding components of the FGF pathway are associated with complex modes of CHH inheritance and act primarily as contributors to an oligogenic genetic architecture underlying CHH. PMID:23643382
Genetic Basis of Melanin Pigmentation in Butterfly Wings.
Zhang, Linlin; Martin, Arnaud; Perry, Michael W; van der Burg, Karin R L; Matsuoka, Yuji; Monteiro, Antónia; Reed, Robert D
2017-04-01
Despite the variety, prominence, and adaptive significance of butterfly wing patterns, surprisingly little is known about the genetic basis of wing color diversity. Even though there is intense interest in wing pattern evolution and development, the technical challenge of genetically manipulating butterflies has slowed efforts to functionally characterize color pattern development genes. To identify candidate wing pigmentation genes, we used RNA sequencing to characterize transcription across multiple stages of butterfly wing development, and between different color pattern elements, in the painted lady butterfly Vanessa cardui This allowed us to pinpoint genes specifically associated with red and black pigment patterns. To test the functions of a subset of genes associated with presumptive melanin pigmentation, we used clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 genome editing in four different butterfly genera. pale , Ddc , and yellow knockouts displayed reduction of melanin pigmentation, consistent with previous findings in other insects. Interestingly, however, yellow-d , ebony , and black knockouts revealed that these genes have localized effects on tuning the color of red, brown, and ochre pattern elements. These results point to previously undescribed mechanisms for modulating the color of specific wing pattern elements in butterflies, and provide an expanded portrait of the insect melanin pathway. Copyright © 2017 by the Genetics Society of America.
Radhakrishna, Uppala; Albayrak, Samet; Alpay-Savasan, Zeynep; Zeb, Amna; Turkoglu, Onur; Sobolewski, Paul; Bahado-Singh, Ray O
2016-01-01
Congenital heart defect (CHD) is the most common cause of death from congenital anomaly. Among several candidate epigenetic mechanisms, DNA methylation may play an important role in the etiology of CHDs. We conducted a genome-wide DNA methylation analysis using an Illumina Infinium 450k human methylation assay in a cohort of 24 newborns who had aortic valve stenosis (AVS), with gestational-age matched controls. The study identified significantly-altered CpG methylation at 59 sites in 52 genes in AVS subjects as compared to controls (either hypermethylated or demethylated). Gene Ontology analysis identified biological processes and functions for these genes including positive regulation of receptor-mediated endocytosis. Consistent with prior clinical data, the molecular function categories as determined using DAVID identified low-density lipoprotein receptor binding, lipoprotein receptor binding and identical protein binding to be over-represented in the AVS group. A significant epigenetic change in the APOA5 and PCSK9 genes known to be involved in AVS was also observed. A large number CpG methylation sites individually demonstrated good to excellent diagnostic accuracy for the prediction of AVS status, thus raising possibility of molecular screening markers for this disorder. Using epigenetic analysis we were able to identify genes significantly involved in the pathogenesis of AVS.
Radhakrishna, Uppala; Albayrak, Samet; Alpay-Savasan, Zeynep; Zeb, Amna; Turkoglu, Onur; Sobolewski, Paul; Bahado-Singh, Ray O.
2016-01-01
Congenital heart defect (CHD) is the most common cause of death from congenital anomaly. Among several candidate epigenetic mechanisms, DNA methylation may play an important role in the etiology of CHDs. We conducted a genome-wide DNA methylation analysis using an Illumina Infinium 450k human methylation assay in a cohort of 24 newborns who had aortic valve stenosis (AVS), with gestational-age matched controls. The study identified significantly-altered CpG methylation at 59 sites in 52 genes in AVS subjects as compared to controls (either hypermethylated or demethylated). Gene Ontology analysis identified biological processes and functions for these genes including positive regulation of receptor-mediated endocytosis. Consistent with prior clinical data, the molecular function categories as determined using DAVID identified low-density lipoprotein receptor binding, lipoprotein receptor binding and identical protein binding to be over-represented in the AVS group. A significant epigenetic change in the APOA5 and PCSK9 genes known to be involved in AVS was also observed. A large number CpG methylation sites individually demonstrated good to excellent diagnostic accuracy for the prediction of AVS status, thus raising possibility of molecular screening markers for this disorder. Using epigenetic analysis we were able to identify genes significantly involved in the pathogenesis of AVS. PMID:27152866
Shen, Changbing; Gao, Jing; Sheng, Yujun; Dou, Jinfa; Zhou, Fusheng; Zheng, Xiaodong; Ko, Randy; Tang, Xianfa; Zhu, Caihong; Yin, Xianyong; Sun, Liangdan; Cui, Yong; Zhang, Xuejun
2016-01-01
Vitiligo is an autoimmune disease with a strong genetic component, characterized by areas of depigmented skin resulting from loss of epidermal melanocytes. Genetic factors are known to play key roles in vitiligo through discoveries in association studies and family studies. Previously, vitiligo susceptibility genes were mainly revealed through linkage analysis and candidate gene studies. Recently, our understanding of the genetic basis of vitiligo has been rapidly advancing through genome-wide association study (GWAS). More than 40 robust susceptible loci have been identified and confirmed to be associated with vitiligo by using GWAS. Most of these associated genes participate in important pathways involved in the pathogenesis of vitiligo. Many susceptible loci with unknown functions in the pathogenesis of vitiligo have also been identified, indicating that additional molecular mechanisms may contribute to the risk of developing vitiligo. In this review, we summarize the key loci that are of genome-wide significance, which have been shown to influence vitiligo risk. These genetic loci may help build the foundation for genetic diagnosis and personalize treatment for patients with vitiligo in the future. However, substantial additional studies, including gene-targeted and functional studies, are required to confirm the causality of the genetic variants and their biological relevance in the development of vitiligo. PMID:26870082
Saik, Olga V; Demenkov, Pavel S; Ivanisenko, Timofey V; Bragina, Elena Yu; Freidin, Maxim B; Goncharova, Irina A; Dosenko, Victor E; Zolotareva, Olga I; Hofestaedt, Ralf; Lavrik, Inna N; Rogaev, Evgeny I; Ivanisenko, Vladimir A
2018-02-13
Hypertension and bronchial asthma are a major issue for people's health. As of 2014, approximately one billion adults, or ~ 22% of the world population, have had hypertension. As of 2011, 235-330 million people globally have been affected by asthma and approximately 250,000-345,000 people have died each year from the disease. The development of the effective treatment therapies against these diseases is complicated by their comorbidity features. This is often a major problem in diagnosis and their treatment. Hence, in this study the bioinformatical methodology for the analysis of the comorbidity of these two diseases have been developed. As such, the search for candidate genes related to the comorbid conditions of asthma and hypertension can help in elucidating the molecular mechanisms underlying the comorbid condition of these two diseases, and can also be useful for genotyping and identifying new drug targets. Using ANDSystem, the reconstruction and analysis of gene networks associated with asthma and hypertension was carried out. The gene network of asthma included 755 genes/proteins and 62,603 interactions, while the gene network of hypertension - 713 genes/proteins and 45,479 interactions. Two hundred and five genes/proteins and 9638 interactions were shared between asthma and hypertension. An approach for ranking genes implicated in the comorbid condition of two diseases was proposed. The approach is based on nine criteria for ranking genes by their importance, including standard methods of gene prioritization (Endeavor, ToppGene) as well as original criteria that take into account the characteristics of an associative gene network and the presence of known polymorphisms in the analysed genes. According to the proposed approach, the genes IL10, TLR4, and CAT had the highest priority in the development of comorbidity of these two diseases. Additionally, it was revealed that the list of top genes is enriched with apoptotic genes and genes involved in biological processes related to the functioning of central nervous system. The application of methods of reconstruction and analysis of gene networks is a productive tool for studying the molecular mechanisms of comorbid conditions. The method put forth to rank genes by their importance to the comorbid condition of asthma and hypertension was employed that resulted in prediction of 10 genes, playing the key role in the development of the comorbid condition. The results can be utilised to plan experiments for identification of novel candidate genes along with searching for novel pharmacological targets.
Keilwagen, Jens; Lehnert, Heike; Berner, Thomas; Budahn, Holger; Nothnagel, Thomas; Ulrich, Detlef; Dunemann, Frank
2017-01-01
Terpenes are an important group of secondary metabolites in carrots influencing taste and flavor, and some of them might also play a role as bioactive substances with an impact on human physiology and health. Understanding the genetic and molecular basis of terpene synthases (TPS) involved in the biosynthesis of volatile terpenoids will provide insights for improving breeding strategies aimed at quality traits and for developing specific carrot chemotypes possibly useful for pharmaceutical applications. Hence, a combination of terpene metabolite profiling, genotyping-by-sequencing (GBS), and genome-wide association study (GWAS) was used in this work to get insights into the genetic control of terpene biosynthesis in carrots and to identify several TPS candidate genes that might be involved in the production of specific monoterpenes. In a panel of 85 carrot cultivars and accessions, metabolite profiling was used to identify 31 terpenoid volatile organic compounds (VOCs) in carrot leaves and roots, and a GBS approach was used to provide dense genome-wide marker coverage (>168,000 SNPs). Based on this data, a total of 30 quantitative trait loci (QTLs) was identified for 15 terpenoid volatiles. Most QTLs were detected for the monoterpene compounds ocimene, sabinene, β-pinene, borneol and bornyl acetate. We identified four genomic regions on three different carrot chromosomes by GWAS which are both associated with high significance (LOD ≥ 5.91) to distinct monoterpenes and to TPS candidate genes, which have been identified by homology-based gene prediction utilizing RNA-seq data. In total, 65 TPS candidate gene models in carrot were identified and assigned to known plant TPS subfamilies with the exception of TPS-d and TPS-h. TPS-b was identified as largest subfamily with 32 TPS candidate genes. PMID:29170675
Wang, Meng; Wu, Kai; Lu, Changhong; Kong, Xiangyin
2015-01-01
Prostate cancer is a type of cancer that occurs in the male prostate, a gland in the male reproductive system. Because prostate cancer cells may spread to other parts of the body and can influence human reproduction, understanding the mechanisms underlying this disease is critical for designing effective treatments. The identification of as many genes and chemicals related to prostate cancer as possible will enhance our understanding of this disease. In this study, we proposed a computational method to identify new candidate genes and chemicals based on currently known genes and chemicals related to prostate cancer by applying a shortest path approach in a hybrid network. The hybrid network was constructed according to information concerning chemical-chemical interactions, chemical-protein interactions, and protein-protein interactions. Many of the obtained genes and chemicals are associated with prostate cancer. PMID:26504486
Winata, Cecilia L; Kondrychyn, Igor; Kumar, Vibhor; Srinivasan, Kandhadayar G; Orlov, Yuriy; Ravishankar, Ashwini; Prabhakar, Shyam; Stanton, Lawrence W; Korzh, Vladimir; Mathavan, Sinnakaruppan
2013-10-01
Zic3 regulates early embryonic patterning in vertebrates. Loss of Zic3 function is known to disrupt gastrulation, left-right patterning, and neurogenesis. However, molecular events downstream of this transcription factor are poorly characterized. Here we use the zebrafish as a model to study the developmental role of Zic3 in vivo, by applying a combination of two powerful genomics approaches--ChIP-seq and microarray. Besides confirming direct regulation of previously implicated Zic3 targets of the Nodal and canonical Wnt pathways, analysis of gastrula stage embryos uncovered a number of novel candidate target genes, among which were members of the non-canonical Wnt pathway and the neural pre-pattern genes. A similar analysis in zic3-expressing cells obtained by FACS at segmentation stage revealed a dramatic shift in Zic3 binding site locations and identified an entirely distinct set of target genes associated with later developmental functions such as neural development. We demonstrate cis-regulation of several of these target genes by Zic3 using in vivo enhancer assay. Analysis of Zic3 binding sites revealed a distribution biased towards distal intergenic regions, indicative of a long distance regulatory mechanism; some of these binding sites are highly conserved during evolution and act as functional enhancers. This demonstrated that Zic3 regulation of developmental genes is achieved predominantly through long distance regulatory mechanism and revealed that developmental transitions could be accompanied by dramatic changes in regulatory landscape.
Sokhi, Upneet K.; Bacolod, Manny D.; Dasgupta, Santanu; Emdad, Luni; Das, Swadesh K.; Dumur, Catherine I.; Miles, Michael F.; Sarkar, Devanand; Fisher, Paul B.
2013-01-01
Human Polynucleotide Phosphorylase (hPNPaseold-35 or PNPT1) is an evolutionarily conserved 3′→5′ exoribonuclease implicated in the regulation of numerous physiological processes including maintenance of mitochondrial homeostasis, mtRNA import and aging-associated inflammation. From an RNase perspective, little is known about the RNA or miRNA species it targets for degradation or whose expression it regulates; except for c-myc and miR-221. To further elucidate the functional implications of hPNPaseold-35 in cellular physiology, we knocked-down and overexpressed hPNPaseold-35 in human melanoma cells and performed gene expression analyses to identify differentially expressed transcripts. Ingenuity Pathway Analysis indicated that knockdown of hPNPaseold-35 resulted in significant gene expression changes associated with mitochondrial dysfunction and cholesterol biosynthesis; whereas overexpression of hPNPaseold-35 caused global changes in cell-cycle related functions. Additionally, comparative gene expression analyses between our hPNPaseold-35 knockdown and overexpression datasets allowed us to identify 77 potential “direct” and 61 potential “indirect” targets of hPNPaseold-35 which formed correlated networks enriched for cell-cycle and wound healing functional association, respectively. These results provide a comprehensive database of genes responsive to hPNPaseold-35 expression levels; along with the identification new potential candidate genes offering fresh insight into cellular pathways regulated by PNPT1 and which may be used in the future for possible therapeutic intervention in mitochondrial- or inflammation-associated disease phenotypes. PMID:24143183
High degree of genetic differentiation in marine three-spined sticklebacks (Gasterosteus aculeatus).
Defaveri, Jacquelin; Shikano, Takahito; Shimada, Yukinori; Merilä, Juha
2013-09-01
Populations of widespread marine organisms are typically characterized by a low degree of genetic differentiation in neutral genetic markers, but much less is known about differentiation in genes whose functional roles are associated with specific selection regimes. To uncover possible adaptive population divergence and heterogeneous genomic differentiation in marine three-spined sticklebacks (Gasterosteus aculeatus), we used a candidate gene-based genome-scan approach to analyse variability in 138 microsatellite loci located within/close to (<6 kb) functionally important genes in samples collected from ten geographic locations. The degree of genetic differentiation in markers classified as neutral or under balancing selection-as determined with several outlier detection methods-was low (F(ST) = 0.033 or 0.011, respectively), whereas average FST for directionally selected markers was significantly higher (F(ST) = 0.097). Clustering analyses provided support for genomic and geographic heterogeneity in selection: six genetic clusters were identified based on allele frequency differences in the directionally selected loci, whereas four were identified with the neutral loci. Allelic variation in several loci exhibited significant associations with environmental variables, supporting the conjecture that temperature and salinity, but not optic conditions, are important drivers of adaptive divergence among populations. In general, these results suggest that in spite of the high degree of physical connectivity and gene flow as inferred from neutral marker genes, marine stickleback populations are strongly genetically structured in loci associated with functionally relevant genes. © 2013 John Wiley & Sons Ltd.
Cellular and synaptic network defects in autism
Peça, João; Feng, Guoping
2012-01-01
Many candidate genes are now thought to confer susceptibility to autism spectrum disorder (ASD). Here we review four interrelated complexes, each composed of multiple families of genes that functionally coalesce on common cellular pathways. We illustrate a common thread in the organization of glutamatergic synapses and suggest a link between genes involved in Tuberous Sclerosis Complex, Fragile X syndrome, Angelman syndrome and several synaptic ASD candidate genes. When viewed in this context, progress in deciphering the molecular architecture of cellular protein-protein interactions together with the unraveling of synaptic dysfunction in neural networks may prove pivotal to advancing our understanding of ASDs. PMID:22440525
Geffroy, V; Sévignac, M; De Oliveira, J C; Fouilloux, G; Skroch, P; Thoquet, P; Gepts, P; Langin, T; Dron, M
2000-03-01
Anthracnose, one of the most important diseases of common bean (Phaseolus vulgaris), is caused by the fungus Colletotrichum lindemuthianum. A "candidate gene" approach was used to map anthracnose resistance quantitative trait loci (QTL). Candidate genes included genes for both pathogen recognition (resistance genes and resistance gene analogs [RGAs]) and general plant defense (defense response genes). Two strains of C. lindemuthianum, identified in a world collection of 177 strains, displayed a reproducible and differential aggressiveness toward BAT93 and JaloEEP558, two parental lines of P. vulgaris representing the two major gene pools of this crop. A reliable test was developed to score partial resistance in aerial organs of the plant (stem, leaf, petiole) under controlled growth chamber conditions. BAT93 was more resistant than JaloEEP558 regardless of the organ or strain tested. With a recombinant inbred line (RIL) population derived from a cross between these two parental lines, 10 QTL were located on a genetic map harboring 143 markers, including known defense response genes, anthracnose-specific resistance genes, and RGAs. Eight of the QTL displayed isolate specificity. Two were co-localized with known defense genes (phenylalanine ammonia-lyase and hydroxyproline-rich glycoprotein) and three with anthracnose-specific resistance genes and/or RGAs. Interestingly, two QTL, with different allelic contribution, mapped on linkage group B4 in a 5.0 cM interval containing Andean and Mesoamerican specific resistance genes against C. lindemuthianum and 11 polymorphic fragments revealed with a RGA probe. The possible relationship between genes underlying specific and partial resistance is discussed.
Chakrabarti, B; Dudbridge, F; Kent, L; Wheelwright, S; Hill-Cawthorne, G; Allison, C; Banerjee-Basu, S; Baron-Cohen, S
2009-06-01
Genetic studies of autism spectrum conditions (ASC) have mostly focused on the "low functioning" severe clinical subgroup, treating it as a rare disorder. However, ASC is now thought to be relatively common ( approximately 1%), and representing one end of a quasi-normal distribution of autistic traits in the general population. Here we report a study of common genetic variation in candidate genes associated with autistic traits and Asperger syndrome (AS). We tested single nucleotide polymorphisms in 68 candidate genes in three functional groups (sex steroid synthesis/transport, neural connectivity, and social-emotional responsivity) in two experiments. These were (a) an association study of relevant behavioral traits (the Empathy Quotient (EQ), the Autism Spectrum Quotient (AQ)) in a population sample (n=349); and (b) a case-control association study on a sample of people with AS, a "high-functioning" subgroup of ASC (n=174). 27 genes showed a nominally significant association with autistic traits and/or ASC diagnosis. Of these, 19 genes showed nominally significant association with AQ/EQ. In the sex steroid group, this included ESR2 and CYP11B1. In the neural connectivity group, this included HOXA1, NTRK1, and NLGN4X. In the socio-responsivity behavior group, this included MAOB, AVPR1B, and WFS1. Fourteen genes showed nominally significant association with AS. In the sex steroid group, this included CYP17A1 and CYP19A1. In the socio-emotional behavior group, this included OXT. Six genes were nominally associated in both experiments, providing a partial replication. Eleven genes survived family wise error rate (FWER) correction using permutations across both experiments, which is greater than would be expected by chance. CYP11B1 and NTRK1 emerged as significantly associated genes in both experiments, after FWER correction (P<0.05). This is the first candidate-gene association study of AS and of autistic traits. The most promising candidate genes require independent replication and fine mapping.
A transposon-based genetic screen in mice identifies genes altered in colorectal cancer.
Starr, Timothy K; Allaei, Raha; Silverstein, Kevin A T; Staggs, Rodney A; Sarver, Aaron L; Bergemann, Tracy L; Gupta, Mihir; O'Sullivan, M Gerard; Matise, Ilze; Dupuy, Adam J; Collier, Lara S; Powers, Scott; Oberg, Ann L; Asmann, Yan W; Thibodeau, Stephen N; Tessarollo, Lino; Copeland, Neal G; Jenkins, Nancy A; Cormier, Robert T; Largaespada, David A
2009-03-27
Human colorectal cancers (CRCs) display a large number of genetic and epigenetic alterations, some of which are causally involved in tumorigenesis (drivers) and others that have little functional impact (passengers). To help distinguish between these two classes of alterations, we used a transposon-based genetic screen in mice to identify candidate genes for CRC. Mice harboring mutagenic Sleeping Beauty (SB) transposons were crossed with mice expressing SB transposase in gastrointestinal tract epithelium. Most of the offspring developed intestinal lesions, including intraepithelial neoplasia, adenomas, and adenocarcinomas. Analysis of over 16,000 transposon insertions identified 77 candidate CRC genes, 60 of which are mutated and/or dysregulated in human CRC and thus are most likely to drive tumorigenesis. These genes include APC, PTEN, and SMAD4. The screen also identified 17 candidate genes that had not previously been implicated in CRC, including POLI, PTPRK, and RSPO2.
Danso, Dominik; Schmeisser, Christel; Chow, Jennifer; Zimmermann, Wolfgang; Wei, Ren; Leggewie, Christian; Li, Xiangzhen; Hazen, Terry; Streit, Wolfgang R
2018-04-15
Polyethylene terephthalate (PET) is one of the most important synthetic polymers used today. Unfortunately, the polymers accumulate in nature and to date no highly active enzymes are known that can degrade it at high velocity. Enzymes involved in PET degradation are mainly α- and β-hydrolases, like cutinases and related enzymes (EC 3.1.1). Currently, only a small number of such enzymes are well characterized. In this work, a search algorithm was developed that identified 504 possible PET hydrolase candidate genes from various databases. A further global search that comprised more than 16 Gb of sequence information within 108 marine and 25 terrestrial metagenomes obtained from the Integrated Microbial Genome (IMG) database detected 349 putative PET hydrolases. Heterologous expression of four such candidate enzymes verified the function of these enzymes and confirmed the usefulness of the developed search algorithm. In this way, two novel and thermostable enzymes with high potential for downstream application were partially characterized. Clustering of 504 novel enzyme candidates based on amino acid similarities indicated that PET hydrolases mainly occur in the phyla of Actinobacteria , Proteobacteria , and Bacteroidetes Within the Proteobacteria , the Betaproteobacteria , Deltaproteobacteria , and Gammaproteobacteria were the main hosts. Remarkably enough, in the marine environment, bacteria affiliated with the phylum Bacteroidetes appear to be the main hosts of PET hydrolase genes, rather than Actinobacteria or Proteobacteria , as observed for the terrestrial metagenomes. Our data further imply that PET hydrolases are truly rare enzymes. The highest occurrence of 1.5 hits/Mb was observed in sequences from a sample site containing crude oil. IMPORTANCE Polyethylene terephthalate (PET) accumulates in our environment without significant microbial conversion. Although a few PET hydrolases are already known, it is still unknown how frequently they appear and with which main bacterial phyla they are affiliated. In this study, deep sequence mining of protein databases and metagenomes demonstrated that PET hydrolases indeed occur at very low frequencies in the environment. Furthermore, it was possible to link them to phyla that were previously not known to harbor such enzymes. This work contributes novel knowledge on the phylogenetic relationships, the recent evolution, and the global distribution of PET hydrolases. Finally, we describe the biochemical traits of four novel PET hydrolases. Copyright © 2018 Danso et al.
Danso, Dominik; Schmeisser, Christel; Chow, Jennifer; Wei, Ren; Leggewie, Christian; Li, Xiangzhen
2018-01-01
ABSTRACT Polyethylene terephthalate (PET) is one of the most important synthetic polymers used today. Unfortunately, the polymers accumulate in nature and to date no highly active enzymes are known that can degrade it at high velocity. Enzymes involved in PET degradation are mainly α- and β-hydrolases, like cutinases and related enzymes (EC 3.1.1). Currently, only a small number of such enzymes are well characterized. In this work, a search algorithm was developed that identified 504 possible PET hydrolase candidate genes from various databases. A further global search that comprised more than 16 Gb of sequence information within 108 marine and 25 terrestrial metagenomes obtained from the Integrated Microbial Genome (IMG) database detected 349 putative PET hydrolases. Heterologous expression of four such candidate enzymes verified the function of these enzymes and confirmed the usefulness of the developed search algorithm. In this way, two novel and thermostable enzymes with high potential for downstream application were partially characterized. Clustering of 504 novel enzyme candidates based on amino acid similarities indicated that PET hydrolases mainly occur in the phyla of Actinobacteria, Proteobacteria, and Bacteroidetes. Within the Proteobacteria, the Betaproteobacteria, Deltaproteobacteria, and Gammaproteobacteria were the main hosts. Remarkably enough, in the marine environment, bacteria affiliated with the phylum Bacteroidetes appear to be the main hosts of PET hydrolase genes, rather than Actinobacteria or Proteobacteria, as observed for the terrestrial metagenomes. Our data further imply that PET hydrolases are truly rare enzymes. The highest occurrence of 1.5 hits/Mb was observed in sequences from a sample site containing crude oil. IMPORTANCE Polyethylene terephthalate (PET) accumulates in our environment without significant microbial conversion. Although a few PET hydrolases are already known, it is still unknown how frequently they appear and with which main bacterial phyla they are affiliated. In this study, deep sequence mining of protein databases and metagenomes demonstrated that PET hydrolases indeed occur at very low frequencies in the environment. Furthermore, it was possible to link them to phyla that were previously not known to harbor such enzymes. This work contributes novel knowledge on the phylogenetic relationships, the recent evolution, and the global distribution of PET hydrolases. Finally, we describe the biochemical traits of four novel PET hydrolases. PMID:29427431
Lachowiec, Jennifer; Shen, Xia; Queitsch, Christine; Carlborg, Örjan
2015-01-01
Efforts to identify loci underlying complex traits generally assume that most genetic variance is additive. Here, we examined the genetics of Arabidopsis thaliana root length and found that the genomic narrow-sense heritability for this trait in the examined population was statistically zero. The low amount of additive genetic variance that could be captured by the genome-wide genotypes likely explains why no associations to root length could be found using standard additive-model-based genome-wide association (GWA) approaches. However, as the broad-sense heritability for root length was significantly larger, and primarily due to epistasis, we also performed an epistatic GWA analysis to map loci contributing to the epistatic genetic variance. Four interacting pairs of loci were revealed, involving seven chromosomal loci that passed a standard multiple-testing corrected significance threshold. The genotype-phenotype maps for these pairs revealed epistasis that cancelled out the additive genetic variance, explaining why these loci were not detected in the additive GWA analysis. Small population sizes, such as in our experiment, increase the risk of identifying false epistatic interactions due to testing for associations with very large numbers of multi-marker genotypes in few phenotyped individuals. Therefore, we estimated the false-positive risk using a new statistical approach that suggested half of the associated pairs to be true positive associations. Our experimental evaluation of candidate genes within the seven associated loci suggests that this estimate is conservative; we identified functional candidate genes that affected root development in four loci that were part of three of the pairs. The statistical epistatic analyses were thus indispensable for confirming known, and identifying new, candidate genes for root length in this population of wild-collected A. thaliana accessions. We also illustrate how epistatic cancellation of the additive genetic variance explains the insignificant narrow-sense and significant broad-sense heritability by using a combination of careful statistical epistatic analyses and functional genetic experiments.
Identifying candidate driver genes by integrative ovarian cancer genomics data
NASA Astrophysics Data System (ADS)
Lu, Xinguo; Lu, Jibo
2017-08-01
Integrative analysis of molecular mechanics underlying cancer can distinguish interactions that cannot be revealed based on one kind of data for the appropriate diagnosis and treatment of cancer patients. Tumor samples exhibit heterogeneity in omics data, such as somatic mutations, Copy Number Variations CNVs), gene expression profiles and so on. In this paper we combined gene co-expression modules and mutation modulators separately in tumor patients to obtain the candidate driver genes for resistant and sensitive tumor from the heterogeneous data. The final list of modulators identified are well known in biological processes associated with ovarian cancer, such as CCL17, CACTIN, CCL16, CCL22, APOB, KDF1, CCL11, HNF1B, LRG1, MED1 and so on, which can help to facilitate the discovery of biomarkers, molecular diagnostics, and drug discovery.
Lawrenson, Kate; Li, Qiyuan; Kar, Siddhartha; Seo, Ji-Heui; Tyrer, Jonathan; Spindler, Tassja J; Lee, Janet; Chen, Yibu; Karst, Alison; Drapkin, Ronny; Aben, Katja K H; Anton-Culver, Hoda; Antonenkova, Natalia; Baker, Helen; Bandera, Elisa V; Bean, Yukie; Beckmann, Matthias W; Berchuck, Andrew; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A; Brooks-Wilson, Angela; Bruinsma, Fiona; Butzow, Ralf; Campbell, Ian G; Carty, Karen; Chang-Claude, Jenny; Chenevix-Trench, Georgia; Chen, Anne; Chen, Zhihua; Cook, Linda S; Cramer, Daniel W; Cunningham, Julie M; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas T; Edwards, Robert P; Eilber, Ursula; Ekici, Arif B; Fasching, Peter A; Fridley, Brooke L; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G; Glasspool, Rosalind; Goode, Ellen L; Goodman, Marc T; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hasmad, Hanis Nazihah; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A T; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus; Hosono, Satoyo; Iversen, Edwin S; Jakubowska, Anna; James, Paul; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kruger Kjaer, Susanne; Kelemen, Linda E; Kellar, Melissa; Kelley, Joseph L; Kiemeney, Lambertus A; Krakstad, Camilla; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D; Lee, Alice W; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F A G; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R; Nevanlinna, Heli; McNeish, Ian; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B; Narod, Steven A; Nedergaard, Lotte; Ness, Roberta B; Azmi, Mat Adenan Noor; Odunsi, Kunle; Olson, Sara H; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Pearce, Celeste L; Pejovic, Tanja; Pelttari, Liisa M; Permuth-Wey, Jennifer; Phelan, Catherine M; Pike, Malcolm C; Poole, Elizabeth M; Ramus, Susan J; Risch, Harvey A; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H; Rudolph, Anja; Runnebaum, Ingo B; Rzepecka, Iwona K; Salvesen, Helga B; Schildkraut, Joellen M; Schwaab, Ira; Sellers, Thomas A; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C; Sucheston, Lara; Tangen, Ingvild L; Teo, Soo-Hwang; Terry, Kathryn L; Thompson, Pamela J; Timorek, Agnieszka; Tsai, Ya-Yu; Tworoger, Shelley S; van Altena, Anne M; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S; Wicklund, Kristine G; Wilkens, Lynne R; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna H; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Monteiro, Alvaro; Pharoah, Paul D; Gayther, Simon A; Freedman, Matthew L
2015-09-22
Genome-wide association studies have reported 11 regions conferring risk of high-grade serous epithelial ovarian cancer (HGSOC). Expression quantitative trait locus (eQTL) analyses can identify candidate susceptibility genes at risk loci. Here we evaluate cis-eQTL associations at 47 regions associated with HGSOC risk (P≤10(-5)). For three cis-eQTL associations (P<1.4 × 10(-3), FDR<0.05) at 1p36 (CDC42), 1p34 (CDCA8) and 2q31 (HOXD9), we evaluate the functional role of each candidate by perturbing expression of each gene in HGSOC precursor cells. Overexpression of HOXD9 increases anchorage-independent growth, shortens population-doubling time and reduces contact inhibition. Chromosome conformation capture identifies an interaction between rs2857532 and the HOXD9 promoter, suggesting this SNP is a leading causal variant. Transcriptomic profiling after HOXD9 overexpression reveals enrichment of HGSOC risk variants within HOXD9 target genes (P=6 × 10(-10) for risk variants (P<10(-4)) within 10 kb of a HOXD9 target gene in ovarian cells), suggesting a broader role for this network in genetic susceptibility to HGSOC.
Lawrenson, Kate; Li, Qiyuan; Kar, Siddhartha; Seo, Ji-Heui; Tyrer, Jonathan; Spindler, Tassja J.; Lee, Janet; Chen, Yibu; Karst, Alison; Drapkin, Ronny; Aben, Katja K. H.; Anton-Culver, Hoda; Antonenkova, Natalia; Bowtell, David; Webb, Penelope M.; deFazio, Anna; Baker, Helen; Bandera, Elisa V.; Bean, Yukie; Beckmann, Matthias W.; Berchuck, Andrew; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A.; Brooks-Wilson, Angela; Bruinsma, Fiona; Butzow, Ralf; Campbell, Ian G.; Carty, Karen; Chang-Claude, Jenny; Chenevix-Trench, Georgia; Chen, Anne; Chen, Zhihua; Cook, Linda S.; Cramer, Daniel W.; Cunningham, Julie M.; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A.; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas T.; Edwards, Robert P.; Eilber, Ursula; Ekici, Arif B.; Fasching, Peter A.; Fridley, Brooke L.; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind; Goode, Ellen L.; Goodman, Marc T.; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hasmad, Hanis Nazihah; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A. T.; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus; Hosono, Satoyo; Iversen, Edwin S.; Jakubowska, Anna; James, Paul; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y.; Kruger Kjaer, Susanne; Kelemen, Linda E.; Kellar, Melissa; Kelley, Joseph L.; Kiemeney, Lambertus A.; Krakstad, Camilla; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D.; Lee, Alice W.; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A.; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F. A. G.; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R.; Nevanlinna, Heli; McNeish, Ian; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B.; Narod, Steven A.; Nedergaard, Lotte; Ness, Roberta B.; Azmi, Mat Adenan Noor; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Pearce, Celeste L.; Pejovic, Tanja; Pelttari, Liisa M.; Permuth-Wey, Jennifer; Phelan, Catherine M.; Pike, Malcolm C.; Poole, Elizabeth M.; Ramus, Susan J.; Risch, Harvey A.; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H.; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Schildkraut, Joellen M.; Schwaab, Ira; Sellers, Thomas A.; Shu, Xiao-Ou; Shvetsov, Yurii B.; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Sucheston, Lara; Tangen, Ingvild L.; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J.; Timorek, Agnieszka; Tsai, Ya-Yu; Tworoger, Shelley S.; van Altena, Anne M.; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A.; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S.; Wicklund, Kristine G.; Wilkens, Lynne R.; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna H.; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Monteiro, Alvaro; Pharoah, Paul D.; Gayther, Simon A.; Freedman, Matthew L.
2015-01-01
Genome-wide association studies have reported 11 regions conferring risk of high-grade serous epithelial ovarian cancer (HGSOC). Expression quantitative trait locus (eQTL) analyses can identify candidate susceptibility genes at risk loci. Here we evaluate cis-eQTL associations at 47 regions associated with HGSOC risk (P≤10−5). For three cis-eQTL associations (P<1.4 × 10−3, FDR<0.05) at 1p36 (CDC42), 1p34 (CDCA8) and 2q31 (HOXD9), we evaluate the functional role of each candidate by perturbing expression of each gene in HGSOC precursor cells. Overexpression of HOXD9 increases anchorage-independent growth, shortens population-doubling time and reduces contact inhibition. Chromosome conformation capture identifies an interaction between rs2857532 and the HOXD9 promoter, suggesting this SNP is a leading causal variant. Transcriptomic profiling after HOXD9 overexpression reveals enrichment of HGSOC risk variants within HOXD9 target genes (P=6 × 10−10 for risk variants (P<10−4) within 10 kb of a HOXD9 target gene in ovarian cells), suggesting a broader role for this network in genetic susceptibility to HGSOC. PMID:26391404
Novel mutations in GALNT3 causing hyperphosphatemic familial tumoral calcinosis.
Yancovitch, Alan; Hershkovitz, Dov; Indelman, Margareta; Galloway, Peter; Whiteford, Margo; Sprecher, Eli; Kılıç, Esra
2011-09-01
Hyperphosphatemic familial tumoral calcinosis (HFTC) is known to be caused by mutations in at least three genes: FGF23, GALNT3 and KL. Two families with two affected members suffering from HFTC were scrutinized for mutations in these candidate genes. We identified in both families homozygous missense mutations affecting highly conserved amino acids in GALNT3. One of the mutations is a novel mutation, whereas the second mutation was reported before in a compound heterozygous state. Our data expand the spectrum of known mutations in GALNT3 and contribute to a better understanding of the phenotypic manifestations of mutations in this gene.
Host adaptation to viruses relies on few genes with different cross-resistance properties
Martins, Nelson E.; Faria, Vítor G.; Nolte, Viola; Schlötterer, Christian; Teixeira, Luis; Sucena, Élio; Magalhães, Sara
2014-01-01
Host adaptation to one parasite may affect its response to others. However, the genetics of these direct and correlated responses remains poorly studied. The overlap between these responses is instrumental for the understanding of host evolution in multiparasite environments. We determined the genetic and phenotypic changes underlying adaptation of Drosophila melanogaster to Drosophila C virus (DCV). Within 20 generations, flies selected with DCV showed increased survival after DCV infection, but also after cricket paralysis virus (CrPV) and flock house virus (FHV) infection. Whole-genome sequencing identified two regions of significant differentiation among treatments, from which candidate genes were functionally tested with RNAi. Three genes were validated—pastrel, a known DCV-response gene, and two other loci, Ubc-E2H and CG8492. Knockdown of Ubc-E2H and pastrel also led to increased sensitivity to CrPV, whereas knockdown of CG8492 increased susceptibility to FHV infection. Therefore, Drosophila adaptation to DCV relies on few major genes, each with different cross-resistance properties, conferring host resistance to several parasites. PMID:24711428
Oiestad, A J; Martin, J M; Cook, J; Varella, A C; Giroux, M J
2017-07-01
The wheat stem sawfly (WSS) is an economically important pest of wheat in the Northern Great Plains. The primary means of WSS control is resistance associated with the single quantitative trait locus (QTL) , which controls most stem solidness variation. The goal of this study was to identify stem solidness candidate genes via RNA-seq. This study made use of 28 single nucleotide polymorphism (SNP) makers derived from expressed sequence tags (ESTs) linked to contained within a 5.13 cM region. Allele specific expression of EST markers was examined in stem tissue for solid and hollow-stemmed pairs of two spring wheat near isogenic lines (NILs) differing for the QTL. Of the 28 ESTs, 13 were located within annotated genes and 10 had detectable stem expression. Annotated genes corresponding to four of the ESTs were differentially expressed between solid and hollow-stemmed NILs and represent possible stem solidness gene candidates. Further examination of the 5.13 cM region containing the 28 EST markers identified 260 annotated genes. Twenty of the 260 linked genes were up-regulated in hollow NIL stems, while only seven genes were up-regulated in solid NIL stems. An -methyltransferase within the region of interest was identified as a candidate based on differential expression between solid and hollow-stemmed NILs and putative function. Further study of these candidate genes may lead to the identification of the gene(s) controlling stem solidness and an increased ability to select for wheat stem solidness and manage WSS. Copyright © 2017 Crop Science Society of America.
Vadigepalli, Rajanikanth; Chakravarthula, Praveen; Zak, Daniel E; Schwaber, James S; Gonye, Gregory E
2003-01-01
We have developed a bioinformatics tool named PAINT that automates the promoter analysis of a given set of genes for the presence of transcription factor binding sites. Based on coincidence of regulatory sites, this tool produces an interaction matrix that represents a candidate transcriptional regulatory network. This tool currently consists of (1) a database of promoter sequences of known or predicted genes in the Ensembl annotated mouse genome database, (2) various modules that can retrieve and process the promoter sequences for binding sites of known transcription factors, and (3) modules for visualization and analysis of the resulting set of candidate network connections. This information provides a substantially pruned list of genes and transcription factors that can be examined in detail in further experimental studies on gene regulation. Also, the candidate network can be incorporated into network identification methods in the form of constraints on feasible structures in order to render the algorithms tractable for large-scale systems. The tool can also produce output in various formats suitable for use in external visualization and analysis software. In this manuscript, PAINT is demonstrated in two case studies involving analysis of differentially regulated genes chosen from two microarray data sets. The first set is from a neuroblastoma N1E-115 cell differentiation experiment, and the second set is from neuroblastoma N1E-115 cells at different time intervals following exposure to neuropeptide angiotensin II. PAINT is available for use as an agent in BioSPICE simulation and analysis framework (www.biospice.org), and can also be accessed via a WWW interface at www.dbi.tju.edu/dbi/tools/paint/.
Challis, Richard J.; Hepworth, Jo; Mouchel, Céline; Waites, Richard; Leyser, Ottoline
2013-01-01
Strigolactones (SLs) are carotenoid-derived phytohormones with diverse roles. They are secreted from roots as attractants for arbuscular mycorrhizal fungi and have a wide range of endogenous functions, such as regulation of root and shoot system architecture. To date, six genes associated with SL synthesis and signaling have been molecularly identified using the shoot-branching mutants more axillary growth (max) of Arabidopsis (Arabidopsis thaliana) and dwarf (d) of rice (Oryza sativa). Here, we present a phylogenetic analysis of the MAX/D genes to clarify the relationships of each gene with its wider family and to allow the correlation of events in the evolution of the genes with the evolution of SL function. Our analysis suggests that the notion of a distinct SL pathway is inappropriate. Instead, there may be a diversity of SL-like compounds, the response to which requires a D14/D14-like protein. This ancestral system could have been refined toward distinct ligand-specific pathways channeled through MAX2, the most downstream known component of SL signaling. MAX2 is tightly conserved among land plants and is more diverged from its nearest sister clade than any other SL-related gene, suggesting a pivotal role in the evolution of SL signaling. By contrast, the evidence suggests much greater flexibility upstream of MAX2. The MAX1 gene is a particularly strong candidate for contributing to diversification of inputs upstream of MAX2. Our functional analysis of the MAX1 family demonstrates the early origin of its catalytic function and both redundancy and functional diversification associated with its duplication in angiosperm lineages. PMID:23424248
Gsg1, Trnp1, and Tmem215 Mark Subpopulations of Bipolar Interneurons in the Mouse Retina
Park, Ko Uoon; Randazzo, Grace; Jones, Kenneth L.; Brzezinski, Joseph A.
2017-01-01
Purpose How retinal bipolar cell interneurons are specified and assigned to specialized subtypes is only partially understood. In part, this is due to a lack of early pan- and subtype-specific bipolar cell markers. To discover these factors, we identified genes that were upregulated in Blimp1 (Prdm1) mutant retinas, which exhibit precocious bipolar cell development. Methods Postnatal day (P)2 retinas from Blimp1 conditional knock-out (CKO) mice and controls were processed for RNA sequencing. Genes that increased at least 45% and were statistically different between conditions were considered candidate bipolar-specific factors. Candidates were further evaluated by RT-PCR, in situ hybridization, and immunohistochemistry. Knock-in Tmem215-LacZ mice were used to better trace retinal expression. Results A comparison between Blimp1 CKO and control RNA-seq datasets revealed approximately 40 significantly upregulated genes. We characterized the expression of three genes that have no known function in the retina, Gsg1 (germ cell associated gene), Trnp1 (TMF-regulated nuclear protein), and Tmem215 (a predicted transmembrane protein). Germ cell associated gene appeared restricted to a small subset of cone bipolars while Trnp1 was seen in all ON type bipolar cells. Using Tmem215-LacZ heterozygous knock-in mice, we observed that β-galactosidase expression started early in bipolar cell development. In adults, Tmem215 was expressed by a subset of ON and OFF cone bipolar cells. Conclusions We have identified Gsg1, Tmem215, and Trnp1 as novel bipolar subtype-specific genes. The spatial and temporal pattern of their expression is consistent with a role in controlling bipolar subtype fate choice, differentiation, or physiology. PMID:28199486
Functional genomics indicate that schizophrenia may be an adult vascular-ischemic disorder
Moises, H W; Wollschläger, D; Binder, H
2015-01-01
In search for the elusive schizophrenia pathway, candidate genes for the disorder from a discovery sample were localized within the energy-delivering and ischemia protection pathway. To test the adult vascular-ischemic (AVIH) and the competing neurodevelopmental hypothesis (NDH), functional genomic analyses of practically all available schizophrenia-associated genes from candidate gene, genome-wide association and postmortem expression studies were performed. Our results indicate a significant overrepresentation of genes involved in vascular function (P<0.001), vasoregulation (that is, perivascular (P<0.001) and shear stress (P<0.01), cerebral ischemia (P<0.001), neurodevelopment (P<0.001) and postischemic repair (P<0.001) among schizophrenia-associated genes from genetic association studies. These findings support both the NDH and the AVIH. The genes from postmortem studies showed an upregulation of vascular-ischemic genes (P=0.020) combined with downregulated synaptic (P=0.005) genes, and ND/repair (P=0.003) genes. Evidence for the AVIH and the NDH is critically discussed. We conclude that schizophrenia is probably a mild adult vascular-ischemic and postischemic repair disorder. Adult postischemic repair involves ND genes for adult neurogenesis, synaptic plasticity, glutamate and increased long-term potentiation of excitatory neurotransmission (i-LTP). Schizophrenia might be caused by the cerebral analog of microvascular angina. PMID:26261884
Functional genomics indicate that schizophrenia may be an adult vascular-ischemic disorder.
Moises, H W; Wollschläger, D; Binder, H
2015-08-11
In search for the elusive schizophrenia pathway, candidate genes for the disorder from a discovery sample were localized within the energy-delivering and ischemia protection pathway. To test the adult vascular-ischemic (AVIH) and the competing neurodevelopmental hypothesis (NDH), functional genomic analyses of practically all available schizophrenia-associated genes from candidate gene, genome-wide association and postmortem expression studies were performed. Our results indicate a significant overrepresentation of genes involved in vascular function (P < 0.001), vasoregulation (that is, perivascular (P < 0.001) and shear stress (P < 0.01), cerebral ischemia (P < 0.001), neurodevelopment (P < 0.001) and postischemic repair (P < 0.001) among schizophrenia-associated genes from genetic association studies. These findings support both the NDH and the AVIH. The genes from postmortem studies showed an upregulation of vascular-ischemic genes (P = 0.020) combined with downregulated synaptic (P = 0.005) genes, and ND/repair (P = 0.003) genes. Evidence for the AVIH and the NDH is critically discussed. We conclude that schizophrenia is probably a mild adult vascular-ischemic and postischemic repair disorder. Adult postischemic repair involves ND genes for adult neurogenesis, synaptic plasticity, glutamate and increased long-term potentiation of excitatory neurotransmission (i-LTP). Schizophrenia might be caused by the cerebral analog of microvascular angina.
Antennal transcriptome analysis of the piercing moth Oraesia emarginata (Lepidoptera: Noctuidae)
Feng, Bo; Guo, Qianshuang; Zheng, Kaidi; Qin, Yuanxia; Du, Yongjun
2017-01-01
The piercing fruit moth Oraesia emarginata is an economically significant pest; however, our understanding of its olfactory mechanisms in infestation is limited. The present study conducted antennal transcriptome analysis of olfactory genes using real-time quantitative reverse transcription PCR analysis (RT-qPCR). We identified a total of 104 candidate chemosensory genes from several gene families, including 35 olfactory receptors (ORs), 41 odorant-binding proteins, 20 chemosensory proteins, 6 ionotropic receptors, and 2 sensory neuron membrane proteins. Seven candidate pheromone receptors (PRs) and 3 candidate pheromone-binding proteins (PBPs) for sex pheromone recognition were found. OemaOR29 and OemaPBP1 had the highest fragments per kb per million fragments (FPKM) values in all ORs and OBPs, respectively. Eighteen olfactory genes were upregulated in females, including 5 candidate PRs, and 20 olfactory genes were upregulated in males, including 2 candidate PRs (OemaOR29 and 4) and 2 PBPs (OemaPBP1 and 3). These genes may have roles in mediating sex-specific behaviors. Most candidate olfactory genes of sex pheromone recognition (except OemaOR29 and OemaPBP3) in O. emarginata were not clustered with those of studied noctuid species (type I pheromone). In addition, OemaOR29 was belonged to cluster PRIII, which comprise proteins that recognize type II pheromones instead of type I pheromones. The structure and function of olfactory genes that encode sex pheromones in O. emarginata might thus differ from those of other studied noctuids. The findings of the present study may help explain the molecular mechanism underlying olfaction and the evolution of olfactory genes encoding sex pheromones in O. emarginata. PMID:28614384
Revealing Alzheimer's disease genes spectrum in the whole-genome by machine learning.
Huang, Xiaoyan; Liu, Hankui; Li, Xinming; Guan, Liping; Li, Jiankang; Tellier, Laurent Christian Asker M; Yang, Huanming; Wang, Jian; Zhang, Jianguo
2018-01-10
Alzheimer's disease (AD) is an important, progressive neurodegenerative disease, with a complex genetic architecture. A key goal of biomedical research is to seek out disease risk genes, and to elucidate the function of these risk genes in the development of disease. For this purpose, expanding the AD-associated gene set is necessary. In past research, the prediction methods for AD related genes has been limited in their exploration of the target genome regions. We here present a genome-wide method for AD candidate genes predictions. We present a machine learning approach (SVM), based upon integrating gene expression data with human brain-specific gene network data, to discover the full spectrum of AD genes across the whole genome. We classified AD candidate genes with an accuracy and the area under the receiver operating characteristic (ROC) curve of 84.56% and 94%. Our approach provides a supplement for the spectrum of AD-associated genes extracted from more than 20,000 genes in a genome wide scale. In this study, we have elucidated the whole-genome spectrum of AD, using a machine learning approach. Through this method, we expect for the candidate gene catalogue to provide a more comprehensive annotation of AD for researchers.
A role for genetic susceptibility in sporadic focal segmental glomerulosclerosis
Yu, Haiyang; Artomov, Mykyta; Brähler, Sebastian; Stander, M. Christine; Shamsan, Ghaidan; Sampson, Matthew G.; White, J. Michael; Kretzler, Matthias; Jain, Sanjay; Winkler, Cheryl A.; Mitra, Robi D.; Daly, Mark J.; Shaw, Andrey S.
2016-01-01
Focal segmental glomerulosclerosis (FSGS) is a syndrome that involves kidney podocyte dysfunction and causes chronic kidney disease. Multiple factors including chemical toxicity, inflammation, and infection underlie FSGS; however, highly penetrant disease genes have been identified in a small fraction of patients with a family history of FSGS. Variants of apolipoprotein L1 (APOL1) have been linked to FSGS in African Americans with HIV or hypertension, supporting the proposal that genetic factors enhance FSGS susceptibility. Here, we used sequencing to investigate whether genetics plays a role in the majority of FSGS cases that are identified as primary or sporadic FSGS and have no known cause. Given the limited number of biopsy-proven cases with ethnically matched controls, we devised an analytic strategy to identify and rank potential candidate genes and used an animal model for validation. Nine candidate FSGS susceptibility genes were identified in our patient cohort, and three were validated using a high-throughput mouse method that we developed. Specifically, we introduced a podocyte-specific, doxycycline-inducible transactivator into a murine embryonic stem cell line with an FSGS-susceptible genetic background that allows shRNA-mediated targeting of candidate genes in the adult kidney. Our analysis supports a broader role for genetic susceptibility of both sporadic and familial cases of FSGS and provides a tool to rapidly evaluate candidate FSGS-associated genes. PMID:26901816
A Computational Network Biology Approach to Uncover Novel Genes Related to Alzheimer's Disease.
Zanzoni, Andreas
2016-01-01
Recent advances in the fields of genetics and genomics have enabled the identification of numerous Alzheimer's disease (AD) candidate genes, although for many of them the role in AD pathophysiology has not been uncovered yet. Concomitantly, network biology studies have shown a strong link between protein network connectivity and disease. In this chapter I describe a computational approach that, by combining local and global network analysis strategies, allows the formulation of novel hypotheses on the molecular mechanisms involved in AD and prioritizes candidate genes for further functional studies.
Evidence that breast cancer risk at the 2q35 locus is mediated through IGFBP5 regulation
Ghoussaini, Maya; Edwards, Stacey L.; Michailidou, Kyriaki; Nord, Silje; Cowper-Sal·lari, Richard; Desai, Kinjal; Kar, Siddhartha; Hillman, Kristine M.; Kaufmann, Susanne; Glubb, Dylan M.; Beesley, Jonathan; Dennis, Joe; Bolla, Manjeet K.; Wang, Qin; Dicks, Ed; Guo, Qi; Schmidt, Marjanka K.; Shah, Mitul; Luben, Robert; Brown, Judith; Czene, Kamila; Darabi, Hatef; Eriksson, Mikael; Klevebring, Daniel; Bojesen, Stig E.; Nordestgaard, Børge G.; Nielsen, Sune F.; Flyger, Henrik; Lambrechts, Diether; Thienpont, Bernard; Neven, Patrick; Wildiers, Hans; Broeks, Annegien; Van’t Veer, Laura J.; Th Rutgers, Emiel J.; Couch, Fergus J.; Olson, Janet E.; Hallberg, Emily; Vachon, Celine; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Peto, Julian; dos-Santos-Silva, Isabel; Gibson, Lorna; Nevanlinna, Heli; Muranen, Taru A.; Aittomäki, Kristiina; Blomqvist, Carl; Hall, Per; Li, Jingmei; Liu, Jianjun; Humphreys, Keith; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K.; Noh, Dong-Young; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Yatabe, Yasushi; Guénel, Pascal; Truong, Thérèse; Menegaux, Florence; Sanchez, Marie; Burwinkel, Barbara; Marme, Frederik; Schneeweiss, Andreas; Sohn, Christof; Wu, Anna H.; Tseng, Chiu-chen; Van Den Berg, David; Stram, Daniel O.; Benitez, Javier; Zamora, M. Pilar; Perez, Jose Ignacio Arias; Menéndez, Primitiva; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Cai, Qiuyin; Cox, Angela; Cross, Simon S.; Reed, Malcolm W. R.; Andrulis, Irene L.; Knight, Julia A.; Glendon, Gord; Tchatchou, Sandrine; Sawyer, Elinor J.; Tomlinson, Ian; Kerin, Michael J.; Miller, Nicola; Haiman, Christopher A.; Henderson, Brian E.; Schumacher, Fredrick; Le Marchand, Loic; Lindblom, Annika; Margolin, Sara; TEO, Soo Hwang; YIP, Cheng Har; Lee, Daphne S. C.; Wong, Tien Y.; Hooning, Maartje J.; Martens, John W. M.; Collée, J. Margriet; van Deurzen, Carolien H. M.; Hopper, John L.; Southey, Melissa C.; Tsimiklis, Helen; Kapuscinski, Miroslav K.; Shen, Chen-Yang; Wu, Pei-Ei; Yu, Jyh-Cherng; Chen, Shou-Tung; Alnæs, Grethe Grenaker; Borresen-Dale, Anne-Lise; Giles, Graham G.; Milne, Roger L.; McLean, Catriona; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Hartman, Mikael; Miao, Hui; Buhari, Shaik Ahmad Bin Syed; Teo, Yik Ying; Fasching, Peter A.; Haeberle, Lothar; Ekici, Arif B.; Beckmann, Matthias W.; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Schoemaker, Minouk J.; García-Closas, Montserrat; Figueroa, Jonine; Chanock, Stephen J.; Lissowska, Jolanta; Simard, Jacques; Goldberg, Mark S.; Labrèche, France; Dumont, Martine; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Brauch, Hiltrud; Brüning, Thomas; Koto, Yon-Dschun; Radice, Paolo; Peterlongo, Paolo; Bonanni, Bernardo; Volorio, Sara; Dörk, Thilo; Bogdanova, Natalia V.; Helbig, Sonja; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M.; Devilee, Peter; Tollenaar, Robert A. E. M.; Seynaeve, Caroline; Van Asperen, Christi J.; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Slager, Susan; Toland, Amanda E.; Ambrosone, Christine B.; Yannoukakos, Drakoulis; Sangrajrang, Suleeporn; Gaborieau, Valerie; Brennan, Paul; McKay, James; Hamann, Ute; Torres, Diana; Zheng, Wei; Long, Jirong; Anton-Culver, Hoda; Neuhausen, Susan L.; Luccarini, Craig; Baynes, Caroline; Ahmed, Shahana; Maranian, Mel; Healey, Catherine S.; González-Neira, Anna; Pita, Guillermo; Alonso, M. Rosario; Álvarez, Nuria; Herrero, Daniel; Tessier, Daniel C.; Vincent, Daniel; Bacot, Francois; de Santiago, Ines; Carroll, Jason; Caldas, Carlos; Brown, Melissa A.; Lupien, Mathieu; Kristensen, Vessela N.; Pharoah, Paul D P; Chenevix-Trench, Georgia; French, Juliet D; Easton, Douglas F.; Dunning, Alison M.; Chenevix-Trench, Georgia; Webb, Penny; Bowtell, David; De Fazio, Anna
2014-01-01
GWAS have identified a breast cancer susceptibility locus on 2q35. Here we report the fine mapping of this locus using data from 101,943 subjects from 50 case-control studies. We genotype 276 SNPs using the ‘iCOGS’ genotyping array and impute genotypes for a further 1,284 using 1000 Genomes Project data. All but two, strongly correlated SNPs (rs4442975 G/T and rs6721996 G/A) are excluded as candidate causal variants at odds against >100:1. The best functional candidate, rs4442975, is associated with oestrogen receptor positive (ER+) disease with an odds ratio (OR) in Europeans of 0.85 (95% confidence interval=0.84−0.87; P=1.7 × 10−43) per t-allele. This SNP flanks a transcriptional enhancer that physically interacts with the promoter of IGFBP5 (encoding insulin-like growth factor-binding protein 5) and displays allele-specific gene expression, FOXA1 binding and chromatin looping. Evidence suggests that the g-allele confers increased breast cancer susceptibility through relative downregulation of IGFBP5, a gene with known roles in breast cell biology. PMID:25248036
Cellular dissection of psoriasis for transcriptome analyses and the post-GWAS era
2014-01-01
Background Genome-scale studies of psoriasis have been used to identify genes of potential relevance to disease mechanisms. For many identified genes, however, the cell type mediating disease activity is uncertain, which has limited our ability to design gene functional studies based on genomic findings. Methods We identified differentially expressed genes (DEGs) with altered expression in psoriasis lesions (n = 216 patients), as well as candidate genes near susceptibility loci from psoriasis GWAS studies. These gene sets were characterized based upon their expression across 10 cell types present in psoriasis lesions. Susceptibility-associated variation at intergenic (non-coding) loci was evaluated to identify sites of allele-specific transcription factor binding. Results Half of DEGs showed highest expression in skin cells, although the dominant cell type differed between psoriasis-increased DEGs (keratinocytes, 35%) and psoriasis-decreased DEGs (fibroblasts, 33%). In contrast, psoriasis GWAS candidates tended to have highest expression in immune cells (71%), with a significant fraction showing maximal expression in neutrophils (24%, P < 0.001). By identifying candidate cell types for genes near susceptibility loci, we could identify and prioritize SNPs at which susceptibility variants are predicted to influence transcription factor binding. This led to the identification of potentially causal (non-coding) SNPs for which susceptibility variants influence binding of AP-1, NF-κB, IRF1, STAT3 and STAT4. Conclusions These findings underscore the role of innate immunity in psoriasis and highlight neutrophils as a cell type linked with pathogenetic mechanisms. Assignment of candidate cell types to genes emerging from GWAS studies provides a first step towards functional analysis, and we have proposed an approach for generating hypotheses to explain GWAS hits at intergenic loci. PMID:24885462
Diopere, Eveline; Hellemans, Bart; Volckaert, Filip A M; Maes, Gregory E
2013-03-01
Genomic methodologies applied in evolutionary and fisheries research have been of great benefit to understand the marine ecosystem and the management of natural resources. Although single nucleotide polymorphisms (SNPs) are attractive for the study of local adaptation, spatial stock management and traceability, and investigating the effects of fisheries-induced selection, they have rarely been exploited in non-model organisms. This is partly due to difficulties in finding and validating SNPs in species with limited or no genomic resources. Complementary to random genome-scan approaches, a targeted candidate gene approach has the potential to unveil pre-selected functional diversity and provides more in depth information on the action of selection at specific genes. For example genes can be under selective pressure due to climate change and sustained periods of heavy fishing pressure. In this study, we applied a candidate gene approach in sole (Solea solea L.), an important member of the demersal ecosystem. As consumption flatfish it is heavy exploited and has experienced associated life-history changes over the last 60years. To discover novel genetic polymorphisms in or around genes linked to important life history traits in sole, we screened a total of 76 candidate genes related to growth and maturation using a targeted resequencing approach. We identified in total 86 putative SNPs in 22 genes and validated 29 SNPs using a multiplex single-base extension genotyping assay. We found 22 informative SNPs, of which two represent non-synonymous mutations, potentially of functional relevance. These novel markers should be rapidly and broadly applicable in analyses of natural sole populations, as a measure of the evolutionary signature of overfishing and for initiatives on marker assisted selection. Copyright © 2012 Elsevier B.V. All rights reserved.
Transcriptomic analysis provides insight into high-altitude acclimation in domestic goats.
Tang, Qianzi; Huang, Wenyao; Guan, Jiuqiang; Jin, Long; Che, Tiandong; Fu, Yuhua; Hu, Yaodong; Tian, Shilin; Wang, Dawei; Jiang, Zhi; Li, Xuewei; Li, Mingzhou
2015-08-10
Domestic goats are distributed in a wide range of habitats and have acclimated to their local environmental conditions. To investigate the gene expression changes of goats that are induced by high altitude stress, we performed RNA-seq on 27 samples from the three hypoxia-sensitive tissues (heart, lung, and skeletal muscle) in three indigenous populations from distinct altitudes (600 m, 2000 m, and 3000 m). We generated 129Gb of high-quality sequencing data (~4Gb per sample) and catalogued the expression profiles of 12,421 annotated hircine genes in each sample. The analysis showed global similarities and differences of high-altitude transcriptomes among populations and tissues as well as revealed that the heart underwent the most high-altitude induced expression changes. We identified numerous differentially expressed genes that exhibited distinct expression patterns, and nonsynonymous single nucleotide variant-containing genes that were highly differentiated between the high- and low-altitude populations. These genes have known or potential roles in hypoxia response and were enriched in functional gene categories potentially responsible for high-altitude stress. Therefore, they are appealing candidates for further investigation of the gene expression and associated regulatory mechanisms related to high-altitude acclimation. Copyright © 2015 Elsevier B.V. All rights reserved.
Ye, Jianqiu; Yang, Hai; Shi, Haitao; Wei, Yunxie; Tie, Weiwei; Ding, Zehong; Yan, Yan; Luo, Ying; Xia, Zhiqiang; Wang, Wenquan; Peng, Ming; Li, Kaimian; Zhang, He; Hu, Wei
2017-11-02
Mitogen-activated protein kinase kinase kinases (MAPKKKs), an important unit of MAPK cascade, play crucial roles in plant development and response to various stresses. However, little is known concerning the MAPKKK family in the important subtropical and tropical crop cassava. In this study, 62 MAPKKK genes were identified in the cassava genome, and were classified into 3 subfamilies based on phylogenetic analysis. Most of MAPKKKs in the same subfamily shared similar gene structures and conserved motifs. The comprehensive transcriptome analysis showed that MAPKKK genes participated in tissue development and response to drought stress. Comparative expression profiles revealed that many MAPKKK genes were activated in cultivated varieties SC124 and Arg7 and the function of MeMAPKKKs in drought resistance may be different between SC124/Arg7 and W14. Expression analyses of the 7 selected MeMAPKKK genes showed that most of them were significantly upregulated by osmotic, salt and ABA treatments, whereas slightly induced by H 2 O 2 and cold stresses. Taken together, this study identified candidate MeMAPKKK genes for genetic improvement of abiotic stress resistance and provided new insights into MAPKKK -mediated cassava resistance to drought stress.
Farlora, Rodolfo; Araya-Garay, José; Gallardo-Escárate, Cristian
2014-06-01
Understanding the molecular underpinnings involved in the reproduction of the salmon louse is critical for designing novel strategies of pest management for this ectoparasite. However, genomic information on sex-related genes is still limited. In the present work, sex-specific gene transcription was revealed in the salmon louse Caligus rogercresseyi using high-throughput Illumina sequencing. A total of 30,191,914 and 32,292,250 high quality reads were generated for females and males, and these were de novo assembled into 32,173 and 38,177 contigs, respectively. Gene ontology analysis showed a pattern of higher expression in the female as compared to the male transcriptome. Based on our sequence analysis and known sex-related proteins, several genes putatively involved in sex differentiation, including Dmrt3, FOXL2, VASA, and FEM1, and other potentially significant candidate genes in C. rogercresseyi, were identified for the first time. In addition, the occurrence of SNPs in several differentially expressed contigs annotating for sex-related genes was found. This transcriptome dataset provides a useful resource for future functional analyses, opening new opportunities for sea lice pest control. Copyright © 2014 Elsevier B.V. All rights reserved.
Sprangers, Mirjam A.G.; Thong, Melissa S.Y.; Bartels, Meike; Barsevick, Andrea; Ordoñana, Juan; Shi, Qiuling; Wang, Xin Shelley; Klepstad, Pål; Wierenga, Eddy A.; Singh, Jasvinder A.; Sloan, Jeff A.
2014-01-01
Background There is compelling evidence of a genetic foundation of patient-reported QOL. Given the rapid development of substantial scientific advances in this area of research, the current paper updates and extends reviews published in 2010. Objectives The objective is to provide an updated overview of the biological pathways, candidate genes and molecular markers involved in fatigue, pain, negative (depressed mood) and positive (well-being/happiness) emotional functioning, social functioning, and overall QOL. Methods We followed a purposeful search algorithm of existing literature to capture empirical papers investigating the relationship between biological pathways and molecular markers and the identified QOL domains. Results Multiple major pathways are involved in each QOL domain. The inflammatory pathway has the strongest evidence as a controlling mechanism underlying fatigue. Inflammation and neurotransmission are key processes involved in pain perception and the COMT gene is associated with multiple sorts of pain. The neurotransmitter and neuroplasticity theories have the strongest evidence for their relationship with depression. Oxytocin-related genes and genes involved in the serotonergic and dopaminergic pathways play a role in social functioning. Inflammatory pathways, via cytokines, also play an important role in overall QOL. Conclusions Whereas the current findings need future experiments and replication efforts, they will provide researchers supportive background information when embarking on studies relating candidate genes and/or molecular markers to QOL domains. The ultimate goal of this area of research is to enhance patients’ QOL. PMID:24604075
Horizontal gene transfer in silkworm, Bombyx mori
2011-01-01
Background The domesticated silkworm, Bombyx mori, is the model insect for the order Lepidoptera, has economically important values, and has gained some representative behavioral characteristics compared to its wild ancestor. The genome of B. mori has been fully sequenced while function analysis of BmChi-h and BmSuc1 genes revealed that horizontal gene transfer (HGT) maybe bestow a clear selective advantage to B. mori. However, the role of HGT in the evolutionary history of B. mori is largely unexplored. In this study, we compare the whole genome of B. mori with those of 382 prokaryotic and eukaryotic species to investigate the potential HGTs. Results Ten candidate HGT events were defined in B. mori by comprehensive sequence analysis using Maximum Likelihood and Bayesian method combining with EST checking. Phylogenetic analysis of the candidate HGT genes suggested that one HGT was plant-to- B. mori transfer while nine were bacteria-to- B. mori transfer. Furthermore, functional analysis based on expression, coexpression and related literature searching revealed that several HGT candidate genes have added important characters, such as resistance to pathogen, to B. mori. Conclusions Results from this study clearly demonstrated that HGTs play an important role in the evolution of B. mori although the number of HGT events in B. mori is in general smaller than those of microbes and other insects. In particular, interdomain HGTs in B. mori may give rise to functional, persistent, and possibly evolutionarily significant new genes. PMID:21595916
Sprangers, Mirjam A G; Thong, Melissa S Y; Bartels, Meike; Barsevick, Andrea; Ordoñana, Juan; Shi, Qiuling; Wang, Xin Shelley; Klepstad, Pål; Wierenga, Eddy A; Singh, Jasvinder A; Sloan, Jeff A
2014-09-01
There is compelling evidence of a genetic foundation of patient-reported quality of life (QOL). Given the rapid development of substantial scientific advances in this area of research, the current paper updates and extends reviews published in 2010. The objective was to provide an updated overview of the biological pathways, candidate genes, and molecular markers involved in fatigue, pain, negative (depressed mood) and positive (well-being/happiness) emotional functioning, social functioning, and overall QOL. We followed a purposeful search algorithm of existing literature to capture empirical papers investigating the relationship between biological pathways and molecular markers and the identified QOL domains. Multiple major pathways are involved in each QOL domain. The inflammatory pathway has the strongest evidence as a controlling mechanism underlying fatigue. Inflammation and neurotransmission are key processes involved in pain perception, and the catechol-O-methyltransferase (COMT) gene is associated with multiple sorts of pain. The neurotransmitter and neuroplasticity theories have the strongest evidence for their relationship with depression. Oxytocin-related genes and genes involved in the serotonergic and dopaminergic pathways play a role in social functioning. Inflammatory pathways, via cytokines, also play an important role in overall QOL. Whereas the current findings need future experiments and replication efforts, they will provide researchers supportive background information when embarking on studies relating candidate genes and/or molecular markers to QOL domains. The ultimate goal of this area of research is to enhance patients' QOL.
Exome Sequencing in Suspected Monogenic Dyslipidemias
Stitziel, Nathan O.; Peloso, Gina M.; Abifadel, Marianne; Cefalu, Angelo B.; Fouchier, Sigrid; Motazacker, M. Mahdi; Tada, Hayato; Larach, Daniel B.; Awan, Zuhier; Haller, Jorge F.; Pullinger, Clive R.; Varret, Mathilde; Rabès, Jean-Pierre; Noto, Davide; Tarugi, Patrizia; Kawashiri, Masa-aki; Nohara, Atsushi; Yamagishi, Masakazu; Risman, Marjorie; Deo, Rahul; Ruel, Isabelle; Shendure, Jay; Nickerson, Deborah A.; Wilson, James G.; Rich, Stephen S.; Gupta, Namrata; Farlow, Deborah N.; Neale, Benjamin M.; Daly, Mark J.; Kane, John P.; Freeman, Mason W.; Genest, Jacques; Rader, Daniel J.; Mabuchi, Hiroshi; Kastelein, John J.P.; Hovingh, G. Kees; Averna, Maurizio R.; Gabriel, Stacey; Boileau, Catherine; Kathiresan, Sekar
2015-01-01
Background Exome sequencing is a promising tool for gene mapping in Mendelian disorders. We utilized this technique in an attempt to identify novel genes underlying monogenic dyslipidemias. Methods and Results We performed exome sequencing on 213 selected family members from 41 kindreds with suspected Mendelian inheritance of extreme levels of low-density lipoprotein (LDL) cholesterol (after candidate gene sequencing excluded known genetic causes for high LDL cholesterol families) or high-density lipoprotein (HDL) cholesterol. We used standard analytic approaches to identify candidate variants and also assigned a polygenic score to each individual in order to account for their burden of common genetic variants known to influence lipid levels. In nine families, we identified likely pathogenic variants in known lipid genes (ABCA1, APOB, APOE, LDLR, LIPA, and PCSK9); however, we were unable to identify obvious genetic etiologies in the remaining 32 families despite follow-up analyses. We identified three factors that limited novel gene discovery: (1) imperfect sequencing coverage across the exome hid potentially causal variants; (2) large numbers of shared rare alleles within families obfuscated causal variant identification; and (3) individuals from 15% of families carried a significant burden of common lipid-related alleles, suggesting complex inheritance can masquerade as monogenic disease. Conclusions We identified the genetic basis of disease in nine of 41 families; however, none of these represented novel gene discoveries. Our results highlight the promise and limitations of exome sequencing as a discovery technique in suspected monogenic dyslipidemias. Considering the confounders identified may inform the design of future exome sequencing studies. PMID:25632026
Fault tolerance in protein interaction networks: stable bipartite subgraphs and redundant pathways.
Brady, Arthur; Maxwell, Kyle; Daniels, Noah; Cowen, Lenore J
2009-01-01
As increasing amounts of high-throughput data for the yeast interactome become available, more system-wide properties are uncovered. One interesting question concerns the fault tolerance of protein interaction networks: whether there exist alternative pathways that can perform some required function if a gene essential to the main mechanism is defective, absent or suppressed. A signature pattern for redundant pathways is the BPM (between-pathway model) motif, introduced by Kelley and Ideker. Past methods proposed to search the yeast interactome for BPM motifs have had several important limitations. First, they have been driven heuristically by local greedy searches, which can lead to the inclusion of extra genes that may not belong in the motif; second, they have been validated solely by functional coherence of the putative pathways using GO enrichment, making it difficult to evaluate putative BPMs in the absence of already known biological annotation. We introduce stable bipartite subgraphs, and show they form a clean and efficient way of generating meaningful BPMs which naturally discard extra genes included by local greedy methods. We show by GO enrichment measures that our BPM set outperforms previous work, covering more known complexes and functional pathways. Perhaps most importantly, since our BPMs are initially generated by examining the genetic-interaction network only, the location of edges in the protein-protein physical interaction network can then be used to statistically validate each candidate BPM, even with sparse GO annotation (or none at all). We uncover some interesting biological examples of previously unknown putative redundant pathways in such areas as vesicle-mediated transport and DNA repair.
Fault Tolerance in Protein Interaction Networks: Stable Bipartite Subgraphs and Redundant Pathways
Brady, Arthur; Maxwell, Kyle; Daniels, Noah; Cowen, Lenore J.
2009-01-01
As increasing amounts of high-throughput data for the yeast interactome become available, more system-wide properties are uncovered. One interesting question concerns the fault tolerance of protein interaction networks: whether there exist alternative pathways that can perform some required function if a gene essential to the main mechanism is defective, absent or suppressed. A signature pattern for redundant pathways is the BPM (between-pathway model) motif, introduced by Kelley and Ideker. Past methods proposed to search the yeast interactome for BPM motifs have had several important limitations. First, they have been driven heuristically by local greedy searches, which can lead to the inclusion of extra genes that may not belong in the motif; second, they have been validated solely by functional coherence of the putative pathways using GO enrichment, making it difficult to evaluate putative BPMs in the absence of already known biological annotation. We introduce stable bipartite subgraphs, and show they form a clean and efficient way of generating meaningful BPMs which naturally discard extra genes included by local greedy methods. We show by GO enrichment measures that our BPM set outperforms previous work, covering more known complexes and functional pathways. Perhaps most importantly, since our BPMs are initially generated by examining the genetic-interaction network only, the location of edges in the protein-protein physical interaction network can then be used to statistically validate each candidate BPM, even with sparse GO annotation (or none at all). We uncover some interesting biological examples of previously unknown putative redundant pathways in such areas as vesicle-mediated transport and DNA repair. PMID:19399174
2010-01-01
Background With its genome sequence and other experimental attributes, Populus trichocarpa has become the model species for genomic studies of wood development. Wood is derived from secondary growth of tree stems, and begins with the development of a ring of vascular cambium in the young developing stem. The terminal region of the developing shoot provides a steep developmental gradient from primary to secondary growth that facilitates identification of genes that play specialized functions during each of these phases of growth. Results Using a genomic microarray representing the majority of the transcriptome, we profiled gene expression in stem segments that spanned primary to secondary growth. We found 3,016 genes that were differentially expressed during stem development (Q-value ≤ 0.05; >2-fold expression variation), and 15% of these genes encode proteins with no significant identities to known genes. We identified all gene family members putatively involved in secondary growth for carbohydrate active enzymes, tubulins, actins, actin depolymerizing factors, fasciclin-like AGPs, and vascular development-associated transcription factors. Almost 70% of expressed transcription factors were upregulated during the transition to secondary growth. The primary shoot elongation region of the stem contained specific carbohydrate active enzyme and expansin family members that are likely to function in primary cell wall synthesis and modification. Genes involved in plant defense and protective functions were also dominant in the primary growth region. Conclusion Our results describe the global patterns of gene expression that occur during the transition from primary to secondary stem growth. We were able to identify three major patterns of gene expression and over-represented gene ontology categories during stem development. The new regulatory factors and cell wall biogenesis genes that we identified provide candidate genes for further functional characterization, as well as new tools for molecular breeding and biotechnology aimed at improvement of tree growth rate, crown form, and wood quality. PMID:20199690
A novel, extremely alkaliphilic and cold-active esterase from Antarctic desert soil.
Hu, Xiao Ping; Heath, Caroline; Taylor, Mark Paul; Tuffin, Marla; Cowan, Don
2012-01-01
A novel, cold-active and highly alkaliphilic esterase was isolated from an Antarctic desert soil metagenomic library by functional screening. The 1,044 bp gene sequence contained several conserved regions common to lipases/esterases, but lacked clear classification based on sequence analysis alone. Moderate (<40%) amino acid sequence similarity to known esterases was apparent (the closest neighbour being a hypothetical protein from Chitinophaga pinensis), despite phylogenetic distance to many of the lipolytic "families". The enzyme functionally demonstrated activity towards shorter chain p-nitrophenyl esters with the optimal activity recorded towards p-nitrophenyl propionate (C3). The enzyme possessed an apparent T(opt) at 20°C and a pH optimum at pH 11. Esterases possessing such extreme alkaliphily are rare and so this enzyme represents an intriguing novel locus in protein sequence space. A metagenomic approach has been shown, in this case, to yield an enzyme with quite different sequential/structural properties to known lipases. It serves as an excellent candidate for analysis of the molecular mechanisms responsible for both cold and alkaline activity and novel structure-function relationships of esterase activity.
Characterizations of 9p21 candidate genes in familial melanoma
DOE Office of Scientific and Technical Information (OSTI.GOV)
Walker, G.J.; Flores, J.F.; Glendening, J.M.
We have previously collected and characterized 16 melanoma families for the inheritance of a familial melanoma predisposition gene on 9p21. Clear evidence for genetic linkage has been detected in 8 of these families with the 9p21 markers D9S126 and 1FNA, while linkage of the remaining families to this region is less certain. A candidate for the 9p21 familial melanoma gene, the cyclin kinase inhibitor gene p16 (also known as the multiple tumor suppressor 1 (MTS1) gene), has been recently indentified. Notably, a nonsense mutation within the p16 gene has been detected in the lymphoblastoid cell line DNA from a dysplasticmore » nevus syndrome (DNS), or familial melanoma, patient. The p16 gene is also known to be frequently deleted or mutated in a variety of tumor cell lines (including melanoma) and resides within a region that has been defined as harboring the 9p21 melanoma predisposition locus. This region is delineated on the distal side by the marker D9S736 (which resides just distal to the p16 gene) and extends in a proximal direction to the marker D9S171. Overall, the entire distance between these two loci is estimated at 3-5Mb. Preliminary analysis of our two largest 9p21-linked melanoma kindreds (by direct sequencing of PCR products) has not yet revealed mutations within the coding region of the p16 gene. Others have reported that 8/11 unrelated 9p21-linked melanoma families do not appear to carry p16 mutations; thus the possibility exists that p16 is not a melanoma susceptibility gene per se, although it appears to play some role in melanoma tumor progression. Our melanoma kindred DNAs are currently being analyzed by SSCP using primers that amplify exons of other candidate genes from the 9p21 region implicated in familial melanoma. These novel genes reside within a distinct critical region of homozygous loss in melanoma which is located >2 Mb from the p16 gene on 9p21.« less
Tian, Yunhong; Tian, Yunming; Luo, Xiaojun; Zhou, Tao; Huang, Zuoping; Liu, Ying; Qiu, Yihan; Hou, Bing; Sun, Dan; Deng, Hongyu; Qian, Shen; Yao, Kaitai
2014-09-03
MicroRNAs (miRNAs) are a new class of endogenous regulators of a broad range of physiological processes, which act by regulating gene expression post-transcriptionally. The brassica vegetable, broccoli (Brassica oleracea var. italica), is very popular with a wide range of consumers, but environmental stresses such as salinity are a problem worldwide in restricting its growth and yield. Little is known about the role of miRNAs in the response of broccoli to salt stress. In this study, broccoli subjected to salt stress and broccoli grown under control conditions were analyzed by high-throughput sequencing. Differential miRNA expression was confirmed by real-time reverse transcription polymerase chain reaction (RT-PCR). The prediction of miRNA targets was undertaken using the Kyoto Encyclopedia of Genes and Genomes (KEGG) Orthology (KO) database and Gene Ontology (GO)-enrichment analyses. Two libraries of small (or short) RNAs (sRNAs) were constructed and sequenced by high-throughput Solexa sequencing. A total of 24,511,963 and 21,034,728 clean reads, representing 9,861,236 (40.23%) and 8,574,665 (40.76%) unique reads, were obtained for control and salt-stressed broccoli, respectively. Furthermore, 42 putative known and 39 putative candidate miRNAs that were differentially expressed between control and salt-stressed broccoli were revealed by their read counts and confirmed by the use of stem-loop real-time RT-PCR. Amongst these, the putative conserved miRNAs, miR393 and miR855, and two putative candidate miRNAs, miR3 and miR34, were the most strongly down-regulated when broccoli was salt-stressed, whereas the putative conserved miRNA, miR396a, and the putative candidate miRNA, miR37, were the most up-regulated. Finally, analysis of the predicted gene targets of miRNAs using the GO and KO databases indicated that a range of metabolic and other cellular functions known to be associated with salt stress were up-regulated in broccoli treated with salt. A comprehensive study of broccoli miRNA in relation to salt stress has been performed. We report significant data on the miRNA profile of broccoli that will underpin further studies on stress responses in broccoli and related species. The differential regulation of miRNAs between control and salt-stressed broccoli indicates that miRNAs play an integral role in the regulation of responses to salt stress.
Candidate genetic modifiers for breast and ovarian cancer risk in BRCA1 and BRCA2 mutation carriers
Peterlongo, Paolo; Chang-Claude, Jenny; Moysich, Kirsten B.; Rudolph, Anja; Schmutzler, Rita K.; Simard, Jacques; Soucy, Penny; Eeles, Rosalind A.; Easton, Douglas F.; Hamann, Ute; Wilkening, Stefan; Chen, Bowang; Rookus, Matti A.; Schmidt, Marjanka K; van der Baan, Frederieke H.; Spurdle, Amanda B.; Walker, Logan C.; Lose, Felicity; Maia, Ana-Teresa; Montagna, Marco; Matricardi, Laura; Lubinski, Jan; Jakubowska, Anna; Gómez Garcia, Encarna B.; Olopade, Olufunmilayo I.; Nussbaum, Robert L.; Nathanson, Katherine L.; Domchek, Susan M.; Rebbeck, Timothy R.; Arun, Banu K.; Karlan, Beth Y.; Orsulic, Sandra; Lester, Jenny; Chung, Wendy K.; Miron, Alex; Southey, Melissa C.; Goldgar, David E.; Buys, Saundra S.; Janavicius, Ramunas; Dorfling, Cecilia M.; van Rensburg, Elizabeth J.; Ding, Yuan Chun; Neuhausen, Susan L.; Hansen, Thomas V. O.; Gerdes, Anne-Marie; Ejlertsen, Bent; Jønson, Lars; Osorio, Ana; Martínez-Bouzas, Cristina; Benitez, Javier; Conway, Edye E.; Blazer, Kathleen R.; Weitzel, Jeffrey N.; Manoukian, Siranoush; Peissel, Bernard; Zaffaroni, Daniela; Scuvera, Giulietta; Barile, Monica; Ficarazzi, Filomena; Mariette, Frederique; Fortuzzi, Stefano; Viel, Alessandra; Giannini, Giuseppe; Papi, Laura; Martayan, Aline; Tibiletti, Maria Grazia; Radice, Paolo; Vratimos, Athanassios; Fostira, Florentia; Garber, Judy E.; Donaldson, Alan; Brewer, Carole; Foo, Claire; Evans, D. Gareth R.; Frost, Debra; Eccles, Diana; Brady, Angela; Cook, Jackie; Tischkowitz, Marc; Adlard, Julian; Barwell, Julian; Walker, Lisa; Izatt, Louise; Side, Lucy E.; Kennedy, M. John; Rogers, Mark T.; Porteous, Mary E.; Morrison, Patrick J.; Platte, Radka; Davidson, Rosemarie; Hodgson, Shirley V.; Ellis, Steve; Cole, Trevor; Godwin, Andrew K.; Claes, Kathleen; Van Maerken, Tom; Meindl, Alfons; Gehrig, Andrea; Sutter, Christian; Engel, Christoph; Niederacher, Dieter; Steinemann, Doris; Plendl, Hansjoerg; Kast, Karin; Rhiem, Kerstin; Ditsch, Nina; Arnold, Norbert; Varon-Mateeva, Raymonda; Wappenschmidt, Barbara; Wang-Gohrke, Shan; Bressac-de Paillerets, Brigitte; Buecher, Bruno; Delnatte, Capucine; Houdayer, Claude; Stoppa-Lyonnet, Dominique; Damiola, Francesca; Coupier, Isabelle; Barjhoux, Laure; Venat-Bouvet, Laurence; Golmard, Lisa; Boutry-Kryza, Nadia; Sinilnikova, Olga M.; Caron, Olivier; Pujol, Pascal; Mazoyer, Sylvie; Belotti, Muriel; Piedmonte, Marion; Friedlander, Michael L.; Rodriguez, Gustavo C.; Copeland, Larry J; de la Hoya, Miguel; Segura, Pedro Perez; Nevanlinna, Heli; Aittomäki, Kristiina; van Os, Theo A.M.; Meijers-Heijboer, Hanne E.J.; van der Hout, Annemarie H.; Vreeswijk, Maaike P.G.; Hoogerbrugge, Nicoline; Ausems, Margreet G.E.M.; van Doorn, Helena C.; Collée, J. Margriet; Olah, Edith; Diez, Orland; Blanco, Ignacio; Lazaro, Conxi; Brunet, Joan; Feliubadalo, Lidia; Cybulski, Cezary; Gronwald, Jacek; Durda, Katarzyna; Jaworska-Bieniek, Katarzyna; Sukiennicki, Grzegorz; Arason, Adalgeir; Chiquette, Jocelyne; Teixeira, Manuel R.; Olswold, Curtis; Couch, Fergus J.; Lindor, Noralane M.; Wang, Xianshu; Szabo, Csilla I.; Offit, Kenneth; Corines, Marina; Jacobs, Lauren; Robson, Mark E.; Zhang, Liying; Joseph, Vijai; Berger, Andreas; Singer, Christian F.; Rappaport, Christine; Kaulich, Daphne Geschwantler; Pfeiler, Georg; Tea, Muy-Kheng M.; Phelan, Catherine M.; Greene, Mark H.; Mai, Phuong L.; Rennert, Gad; Mulligan, Anna Marie; Glendon, Gord; Tchatchou, Sandrine; Andrulis, Irene L.; Toland, Amanda Ewart; Bojesen, Anders; Pedersen, Inge Sokilde; Thomassen, Mads; Jensen, Uffe Birk; Laitman, Yael; Rantala, Johanna; von Wachenfeldt, Anna; Ehrencrona, Hans; Askmalm, Marie Stenmark; Borg, Åke; Kuchenbaecker, Karoline B.; McGuffog, Lesley; Barrowdale, Daniel; Healey, Sue; Lee, Andrew; Pharoah, Paul D.P.; Chenevix-Trench, Georgia; Antoniou, Antonis C.; Friedman, Eitan
2014-01-01
Background BRCA1 and BRCA2 mutation carriers are at substantially increased risk for developing breast and ovarian cancer. The incomplete penetrance coupled with the variable age at diagnosis in carriers of the same mutation suggests the existence of genetic and non-genetic modifying factors. In this study we evaluated the putative role of variants in many candidate modifier genes. Methods Genotyping data from 15,252 BRCA1 and 8,211 BRCA2 mutation carriers, for known variants (n=3,248) located within or around 445 candidate genes, were available through the iCOGS custom-designed array. Breast and ovarian cancer association analysis was performed within a retrospective cohort approach. Results The observed p-values of association ranged between 0.005-1.000. None of the variants was significantly associated with breast or ovarian cancer risk in either BRCA1 or BRCA2 mutation carriers, after multiple testing adjustments. Conclusion There is little evidence that any of the evaluated candidate variants act as modifiers of breast and/or ovarian cancer risk in BRCA1 or BRCA2 mutation carriers. Impact Genome-wide association studies have been more successful at identifying genetic modifiers of BRCA1/2 penetrance than candidate gene studies. PMID:25336561
Candidate genetic modifiers for breast and ovarian cancer risk in BRCA1 and BRCA2 mutation carriers.
Peterlongo, Paolo; Chang-Claude, Jenny; Moysich, Kirsten B; Rudolph, Anja; Schmutzler, Rita K; Simard, Jacques; Soucy, Penny; Eeles, Rosalind A; Easton, Douglas F; Hamann, Ute; Wilkening, Stefan; Chen, Bowang; Rookus, Matti A; Schmidt, Marjanka K; van der Baan, Frederieke H; Spurdle, Amanda B; Walker, Logan C; Lose, Felicity; Maia, Ana-Teresa; Montagna, Marco; Matricardi, Laura; Lubinski, Jan; Jakubowska, Anna; Gómez Garcia, Encarna B; Olopade, Olufunmilayo I; Nussbaum, Robert L; Nathanson, Katherine L; Domchek, Susan M; Rebbeck, Timothy R; Arun, Banu K; Karlan, Beth Y; Orsulic, Sandra; Lester, Jenny; Chung, Wendy K; Miron, Alex; Southey, Melissa C; Goldgar, David E; Buys, Saundra S; Janavicius, Ramunas; Dorfling, Cecilia M; van Rensburg, Elizabeth J; Ding, Yuan Chun; Neuhausen, Susan L; Hansen, Thomas V O; Gerdes, Anne-Marie; Ejlertsen, Bent; Jønson, Lars; Osorio, Ana; Martínez-Bouzas, Cristina; Benitez, Javier; Conway, Edye E; Blazer, Kathleen R; Weitzel, Jeffrey N; Manoukian, Siranoush; Peissel, Bernard; Zaffaroni, Daniela; Scuvera, Giulietta; Barile, Monica; Ficarazzi, Filomena; Mariette, Frederique; Fortuzzi, Stefano; Viel, Alessandra; Giannini, Giuseppe; Papi, Laura; Martayan, Aline; Tibiletti, Maria Grazia; Radice, Paolo; Vratimos, Athanassios; Fostira, Florentia; Garber, Judy E; Donaldson, Alan; Brewer, Carole; Foo, Claire; Evans, D Gareth R; Frost, Debra; Eccles, Diana; Brady, Angela; Cook, Jackie; Tischkowitz, Marc; Adlard, Julian; Barwell, Julian; Walker, Lisa; Izatt, Louise; Side, Lucy E; Kennedy, M John; Rogers, Mark T; Porteous, Mary E; Morrison, Patrick J; Platte, Radka; Davidson, Rosemarie; Hodgson, Shirley V; Ellis, Steve; Cole, Trevor; Godwin, Andrew K; Claes, Kathleen; Van Maerken, Tom; Meindl, Alfons; Gehrig, Andrea; Sutter, Christian; Engel, Christoph; Niederacher, Dieter; Steinemann, Doris; Plendl, Hansjoerg; Kast, Karin; Rhiem, Kerstin; Ditsch, Nina; Arnold, Norbert; Varon-Mateeva, Raymonda; Wappenschmidt, Barbara; Wang-Gohrke, Shan; Bressac-de Paillerets, Brigitte; Buecher, Bruno; Delnatte, Capucine; Houdayer, Claude; Stoppa-Lyonnet, Dominique; Damiola, Francesca; Coupier, Isabelle; Barjhoux, Laure; Venat-Bouvet, Laurence; Golmard, Lisa; Boutry-Kryza, Nadia; Sinilnikova, Olga M; Caron, Olivier; Pujol, Pascal; Mazoyer, Sylvie; Belotti, Muriel; Piedmonte, Marion; Friedlander, Michael L; Rodriguez, Gustavo C; Copeland, Larry J; de la Hoya, Miguel; Segura, Pedro Perez; Nevanlinna, Heli; Aittomäki, Kristiina; van Os, Theo A M; Meijers-Heijboer, Hanne E J; van der Hout, Annemarie H; Vreeswijk, Maaike P G; Hoogerbrugge, Nicoline; Ausems, Margreet G E M; van Doorn, Helena C; Collée, J Margriet; Olah, Edith; Diez, Orland; Blanco, Ignacio; Lazaro, Conxi; Brunet, Joan; Feliubadalo, Lidia; Cybulski, Cezary; Gronwald, Jacek; Durda, Katarzyna; Jaworska-Bieniek, Katarzyna; Sukiennicki, Grzegorz; Arason, Adalgeir; Chiquette, Jocelyne; Teixeira, Manuel R; Olswold, Curtis; Couch, Fergus J; Lindor, Noralane M; Wang, Xianshu; Szabo, Csilla I; Offit, Kenneth; Corines, Marina; Jacobs, Lauren; Robson, Mark E; Zhang, Liying; Joseph, Vijai; Berger, Andreas; Singer, Christian F; Rappaport, Christine; Kaulich, Daphne Geschwantler; Pfeiler, Georg; Tea, Muy-Kheng M; Phelan, Catherine M; Greene, Mark H; Mai, Phuong L; Rennert, Gad; Mulligan, Anna Marie; Glendon, Gord; Tchatchou, Sandrine; Andrulis, Irene L; Toland, Amanda Ewart; Bojesen, Anders; Pedersen, Inge Sokilde; Thomassen, Mads; Jensen, Uffe Birk; Laitman, Yael; Rantala, Johanna; von Wachenfeldt, Anna; Ehrencrona, Hans; Askmalm, Marie Stenmark; Borg, Åke; Kuchenbaecker, Karoline B; McGuffog, Lesley; Barrowdale, Daniel; Healey, Sue; Lee, Andrew; Pharoah, Paul D P; Chenevix-Trench, Georgia; Antoniou, Antonis C; Friedman, Eitan
2015-01-01
BRCA1 and BRCA2 mutation carriers are at substantially increased risk for developing breast and ovarian cancer. The incomplete penetrance coupled with the variable age at diagnosis in carriers of the same mutation suggests the existence of genetic and nongenetic modifying factors. In this study, we evaluated the putative role of variants in many candidate modifier genes. Genotyping data from 15,252 BRCA1 and 8,211 BRCA2 mutation carriers, for known variants (n = 3,248) located within or around 445 candidate genes, were available through the iCOGS custom-designed array. Breast and ovarian cancer association analysis was performed within a retrospective cohort approach. The observed P values of association ranged between 0.005 and 1.000. None of the variants was significantly associated with breast or ovarian cancer risk in either BRCA1 or BRCA2 mutation carriers, after multiple testing adjustments. There is little evidence that any of the evaluated candidate variants act as modifiers of breast and/or ovarian cancer risk in BRCA1 or BRCA2 mutation carriers. Genome-wide association studies have been more successful at identifying genetic modifiers of BRCA1/2 penetrance than candidate gene studies. ©2014 American Association for Cancer Research.
Pathak, Bhakti R; Breed, Ananya A; Apte, Snehal; Acharya, Kshitish; Mahale, Smita D
2016-01-01
Cysteine-rich secretory protein 3 (CRISP-3) is upregulated in prostate cancer as compared to the normal prostate tissue. Higher expression of CRISP-3 has been linked to poor prognosis and hence it has been thought to act as a prognostic marker for prostate cancer. It is proposed to have a role in innate immunity but its role in prostate cancer is still unknown. In order to understand its function, its expression was stably knocked down in LNCaP cells. CRISP-3 knockdown did not affect cell viability but resulted in reduced invasiveness. Global gene expression changes upon CRISP-3 knockdown were identified by microarray analysis. Microarray data were quantitatively validated by evaluating the expression of seven candidate genes in three independent stable clones. Functional annotation of the differentially expressed genes identified cell adhesion, cell motility, and ion transport to be affected among other biological processes. Prostate-specific antigen (PSA, also known as Kallikrein 3) was the top most downregulated gene whose expression was also validated at protein level. Interestingly, expression of Annexin A1 (ANXA1), a known anti-inflammatory protein, was upregulated upon CRISP-3 knockdown. Re-introduction of CRISP-3 into the knockdown clone reversed the effect on invasiveness and also led to increased PSA expression. These results suggest that overexpression of CRISP-3 in prostate tumor may maintain higher PSA expression and lower ANXA1 expression. Our data also indicate that poor prognosis associated with higher CRISP-3 expression could be due to its role in cell invasion.
Iacob, Eli; Light, Alan R.; Donaldson, Gary W.; Okifuji, Akiko; Hughen, Ronald W.; White, Andrea T.; Light, Kathleen C.
2015-01-01
Objective To determine if independent candidate genes can be grouped into meaningful biological factors and if these factors are associated with the diagnosis of chronic fatigue syndrome (CFS) and fibromyalgia (FMS) while controlling for co-morbid depression, sex, and age. Methods We included leukocyte mRNA gene expression from a total of 261 individuals including healthy controls (n=61), patients with FMS only (n=15), CFS only (n=33), co-morbid CFS and FMS (n=79), and medication-resistant (n=42) or medication-responsive (n=31) depression. We used Exploratory Factor Analysis (EFA) on 34 candidate genes to determine factor scores and regression analysis to examine if these factors were associated with specific diagnoses. Results EFA resulted in four independent factors with minimal overlap of genes between factors explaining 51% of the variance. We labeled these factors by function as: 1) Purinergic and cellular modulators; 2) Neuronal growth and immune function; 3) Nociception and stress mediators; 4) Energy and mitochondrial function. Regression analysis predicting these biological factors using FMS, CFS, depression severity, age, and sex revealed that greater expression in Factors 1 and 3 was positively associated with CFS and negatively associated with depression severity (QIDS score), but not associated with FMS. Conclusion Expression of candidate genes can be grouped into meaningful clusters, and CFS and depression are associated with the same 2 clusters but in opposite directions when controlling for co-morbid FMS. Given high co-morbid disease and interrelationships between biomarkers, EFA may help determine patient subgroups in this population based on gene expression. PMID:26097208
Herszberg, B; McCue, M E; Larcher, T; Mata, X; Vaiman, A; Chaffaux, S; Chérel, Y; Valberg, S J; Mickelson, J R; Guérin, G
2009-02-01
Glycogen storage diseases or glycogenoses are inherited diseases caused by abnormalities of enzymes that regulate the synthesis or degradation of glycogen. Deleterious mutations in many genes of the glyco(geno)lytic or the glycogenesis pathways can potentially cause a glycogenosis, and currently mutations in fourteen different genes are known to cause animal or human glycogenoses, resulting in myopathies and/or hepatic disorders. The genetic bases of two forms of glycogenosis are currently known in horses. A fatal neonatal polysystemic type IV glycogenosis, inherited recessively in affected Quarter Horse foals, is due to a mutation in the glycogen branching enzyme gene (GBE1). A second type of glycogenosis, termed polysaccharide storage myopathy (PSSM), is observed in adult Quarter Horses and other breeds. A severe form of PSSM also occurs in draught horses. A mutation in the skeletal muscle glycogen synthase gene (GYS1) was recently reported to be highly associated with PSSM in Quarter Horses and Belgian draught horses. This GYS1 point mutation appears to cause a gain-of-function of the enzyme and to result in the accumulation of a glycogen-like, less-branched polysaccharide in skeletal muscle. It is inherited as a dominant trait. The aim of this work was to test for possible associations between genetic polymorphisms in four candidate genes of the glycogen pathway or the GYS1 mutation in Cob Normand draught horses diagnosed with PSSM by muscle biopsy.
2013-01-01
Background Identification of single nucleotide polymorphisms (SNPs) for specific genes involved in reproduction might improve reliability of genomic estimates for these low-heritability traits. Semen from 550 Holstein bulls of high (≥ 1.7; n = 288) or low (≤ −2; n = 262) daughter pregnancy rate (DPR) was genotyped for 434 candidate SNPs using the Sequenom MassARRAY® system. Three types of SNPs were evaluated: SNPs previously reported to be associated with reproductive traits or physically close to genetic markers for reproduction, SNPs in genes that are well known to be involved in reproductive processes, and SNPs in genes that are differentially expressed between physiological conditions in a variety of tissues associated in reproductive function. Eleven reproduction and production traits were analyzed. Results A total of 40 SNPs were associated (P < 0.05) with DPR. Among these were genes involved in the endocrine system, cell signaling, immune function and inhibition of apoptosis. A total of 10 genes were regulated by estradiol. In addition, 22 SNPs were associated with heifer conception rate, 33 with cow conception rate, 36 with productive life, 34 with net merit, 23 with milk yield, 19 with fat yield, 13 with fat percent, 19 with protein yield, 22 with protein percent, and 13 with somatic cell score. The allele substitution effect for SNPs associated with heifer conception rate, cow conception rate, productive life and net merit were in the same direction as for DPR. Allele substitution effects for several SNPs associated with production traits were in the opposite direction as DPR. Nonetheless, there were 29 SNPs associated with DPR that were not negatively associated with production traits. Conclusion SNPs in a total of 40 genes associated with DPR were identified as well as SNPs for other traits. It might be feasible to include these SNPs into genomic tests of reproduction and other traits. The genes associated with DPR are likely to be important for understanding the physiology of reproduction. Given the large number of SNPs associated with DPR that were not negatively associated with production traits, it should be possible to select for DPR without compromising production. PMID:23759029
Tollenaere, C; Jacquet, S; Ivanova, S; Loiseau, A; Duplantier, J-M; Streiff, R; Brouat, C
2013-01-01
Genome scans using amplified fragment length polymorphism (AFLP) markers became popular in nonmodel species within the last 10 years, but few studies have tried to characterize the anonymous outliers identified. This study follows on from an AFLP genome scan in the black rat (Rattus rattus), the reservoir of plague (Yersinia pestis infection) in Madagascar. We successfully sequenced 17 of the 22 markers previously shown to be potentially affected by plague-mediated selection and associated with a plague resistance phenotype. Searching these sequences in the genome of the closely related species Rattus norvegicus assigned them to 14 genomic regions, revealing a random distribution of outliers in the genome (no clustering). We compared these results with those of an in silico AFLP study of the R. norvegicus genome, which showed that outlier sequences could not have been inferred by this method in R. rattus (only four of the 15 sequences were predicted). However, in silico analysis allowed the prediction of AFLP markers distribution and the estimation of homoplasy rates, confirming its potential utility for designing AFLP studies in nonmodel species. The 14 genomic regions surrounding AFLP outliers (less than 300 kb from the marker) contained 75 genes encoding proteins of known function, including nine involved in immune function and pathogen defence. We identified the two interleukin 1 genes (Il1a and Il1b) that share homology with an antigen of Y. pestis, as the best candidates for genes subject to plague-mediated natural selection. At least six other genes known to be involved in proinflammatory pathways may also be affected by plague-mediated selection. © 2012 Blackwell Publishing Ltd.
Turyagyenda, Laban F.; Kizito, Elizabeth B.; Ferguson, Morag; Baguma, Yona; Agaba, Morris; Harvey, Jagger J. W.; Osiru, David S. O.
2013-01-01
Cassava is an important root crop to resource-poor farmers in marginal areas, where its production faces drought stress constraints. Given the difficulties associated with cassava breeding, a molecular understanding of drought tolerance in cassava will help in the identification of markers for use in marker-assisted selection and genes for transgenic improvement of drought tolerance. This study was carried out to identify candidate drought-tolerance genes and expression-based markers of drought stress in cassava. One drought-tolerant (improved variety) and one drought-susceptible (farmer-preferred) cassava landrace were grown in the glasshouse under well-watered and water-stressed conditions. Their morphological, physiological and molecular responses to drought were characterized. Morphological and physiological measurements indicate that the tolerance of the improved variety is based on drought avoidance, through reduction of water loss via partial stomatal closure. Ten genes that have previously been biologically validated as conferring or being associated with drought tolerance in other plant species were confirmed as being drought responsive in cassava. Four genes (MeALDH, MeZFP, MeMSD and MeRD28) were identified as candidate cassava drought-tolerance genes, as they were exclusively up-regulated in the drought-tolerant genotype to comparable levels known to confer drought tolerance in other species. Based on these genes, we hypothesize that the basis of the tolerance at the cellular level is probably through mitigation of the oxidative burst and osmotic adjustment. This study provides an initial characterization of the molecular response of cassava to drought stress resembling field conditions. The drought-responsive genes can now be used as expression-based markers of drought stress tolerance in cassava, and the candidate tolerance genes tested in the context of breeding (as possible quantitative trait loci) and engineering drought tolerance in transgenics. PMID:23519782
Generation of transgenic mouse model using PTTG as an oncogene.
Kakar, Sham S; Kakar, Cohin
2015-01-01
The close physiological similarity between the mouse and human has provided tools to understanding the biological function of particular genes in vivo by introduction or deletion of a gene of interest. Using a mouse as a model has provided a wealth of resources, knowledge, and technology, helping scientists to understand the biological functions, translocation, trafficking, and interaction of a candidate gene with other intracellular molecules, transcriptional regulation, posttranslational modification, and discovery of novel signaling pathways for a particular gene. Most importantly, the generation of the mouse model for a specific human disease has provided a powerful tool to understand the etiology of a disease and discovery of novel therapeutics. This chapter describes in detail the step-by-step generation of the transgenic mouse model, which can be helpful in guiding new investigators in developing successful models. For practical purposes, we will describe the generation of a mouse model using pituitary tumor transforming gene (PTTG) as the candidate gene of interest.
Schrank, Bertold; Götz, Rudolf; Gunnersen, Jennifer M.; Ure, Janice M.; Toyka, Klaus V.; Smith, Austin G.; Sendtner, Michael
1997-01-01
Proximal spinal muscular atrophy is an autosomal recessive human disease of spinal motor neurons leading to muscular weakness with onset predominantly in infancy and childhood. With an estimated heterozygote frequency of 1/40 it is the most common monogenic disorder lethal to infants; milder forms represent the second most common pediatric neuromuscular disorder. Two candidate genes—survival motor neuron (SMN) and neuronal apoptosis inhibitory protein have been identified on chromosome 5q13 by positional cloning. However, the functional impact of these genes and the mechanism leading to a degeneration of motor neurons remain to be defined. To analyze the role of the SMN gene product in vivo we generated SMN-deficient mice. In contrast to the human genome, which contains two copies, the mouse genome contains only one SMN gene. Mice with homozygous SMN disruption display massive cell death during early embryonic development, indicating that the SMN gene product is necessary for cellular survival and function. PMID:9275227
2014-01-01
Background Coconut (Cocos nucifera L.) is one of the world’s most versatile, economically important tropical crops. Little is known about the physiological and molecular basis of coconut pulp (endosperm) development and only a few coconut genes and gene product sequences are available in public databases. This study identified genes that were differentially expressed during development of coconut pulp and functionally annotated these identified genes using bioinformatics analysis. Results Pulp from three different coconut developmental stages was collected. Four suppression subtractive hybridization (SSH) libraries were constructed (forward and reverse libraries A and B between stages 1 and 2, and C and D between stages 2 and 3), and identified sequences were computationally annotated using Blast2GO software. A total of 1272 clones were obtained for analysis from four SSH libraries with 63% showing similarity to known proteins. Pairwise comparing of stage-specific gene ontology ids from libraries B-D, A-C, B-C and A-D showed that 32 genes were continuously upregulated and seven downregulated; 28 were transiently upregulated and 23 downregulated. KEGG (Kyoto Encyclopedia of Genes and Genomes) analysis showed that 1-acyl-sn-glycerol-3-phosphate acyltransferase (LPAAT), phospholipase D, acetyl-CoA carboxylase carboxyltransferase beta subunit, 3-hydroxyisobutyryl-CoA hydrolase-like and pyruvate dehydrogenase E1 β subunit were associated with fatty acid biosynthesis or metabolism. Triose phosphate isomerase, cellulose synthase and glucan 1,3-β-glucosidase were related to carbohydrate metabolism, and phosphoenolpyruvate carboxylase was related to both fatty acid and carbohydrate metabolism. Of 737 unigenes, 103 encoded enzymes were involved in fatty acid and carbohydrate biosynthesis and metabolism, and a number of transcription factors and other interesting genes with stage-specific expression were confirmed by real-time PCR, with validation of the SSH results as high as 66.6%. Based on determination of coconut endosperm fatty acids content by gas chromatography–mass spectrometry, a number of candidate genes in fatty acid anabolism were selected for further study. Conclusion Functional annotation of genes differentially expressed in coconut pulp development helped determine the molecular basis of coconut endosperm development. The SSH method identified genes related to fatty acids, carbohydrate and secondary metabolites. The results will be important for understanding gene functions and regulatory networks in coconut fruit. PMID:25084812
Liang, Yuanxue; Yuan, Yijun; Liu, Tao; Mao, Wei; Zheng, Yusheng; Li, Dongdong
2014-08-02
Coconut (Cocos nucifera L.) is one of the world's most versatile, economically important tropical crops. Little is known about the physiological and molecular basis of coconut pulp (endosperm) development and only a few coconut genes and gene product sequences are available in public databases. This study identified genes that were differentially expressed during development of coconut pulp and functionally annotated these identified genes using bioinformatics analysis. Pulp from three different coconut developmental stages was collected. Four suppression subtractive hybridization (SSH) libraries were constructed (forward and reverse libraries A and B between stages 1 and 2, and C and D between stages 2 and 3), and identified sequences were computationally annotated using Blast2GO software. A total of 1272 clones were obtained for analysis from four SSH libraries with 63% showing similarity to known proteins. Pairwise comparing of stage-specific gene ontology ids from libraries B-D, A-C, B-C and A-D showed that 32 genes were continuously upregulated and seven downregulated; 28 were transiently upregulated and 23 downregulated. KEGG (Kyoto Encyclopedia of Genes and Genomes) analysis showed that 1-acyl-sn-glycerol-3-phosphate acyltransferase (LPAAT), phospholipase D, acetyl-CoA carboxylase carboxyltransferase beta subunit, 3-hydroxyisobutyryl-CoA hydrolase-like and pyruvate dehydrogenase E1 β subunit were associated with fatty acid biosynthesis or metabolism. Triose phosphate isomerase, cellulose synthase and glucan 1,3-β-glucosidase were related to carbohydrate metabolism, and phosphoenolpyruvate carboxylase was related to both fatty acid and carbohydrate metabolism. Of 737 unigenes, 103 encoded enzymes were involved in fatty acid and carbohydrate biosynthesis and metabolism, and a number of transcription factors and other interesting genes with stage-specific expression were confirmed by real-time PCR, with validation of the SSH results as high as 66.6%. Based on determination of coconut endosperm fatty acids content by gas chromatography-mass spectrometry, a number of candidate genes in fatty acid anabolism were selected for further study. Functional annotation of genes differentially expressed in coconut pulp development helped determine the molecular basis of coconut endosperm development. The SSH method identified genes related to fatty acids, carbohydrate and secondary metabolites. The results will be important for understanding gene functions and regulatory networks in coconut fruit.
Fernandez-San Jose, Patricia; Liu, Yichuan; March, Michael; Pellegrino, Renata; Golhar, Ryan; Corton, Marta; Blanco-Kelly, Fiona; López-Molina, Maria Isabel; García-Sandoval, Blanca; Guo, Yiran; Tian, Lifeng; Liu, Xuanzhu; Guan, Liping; Zhang, Jianguo; Keating, Brendan; Xu, Xun
2015-01-01
This study aimed to identify the genetics underlying dominant forms of inherited retinal dystrophies using whole exome sequencing (WES) in six families extensively screened for known mutations or genes. Thirty-eight individuals were subjected to WES. Causative variants were searched among single nucleotide variants (SNVs) and insertion/deletion variants (indels) and whenever no potential candidate emerged, copy number variant (CNV) analysis was performed. Variants or regions harboring a candidate variant were prioritized and segregation of the variant with the disease was further assessed using Sanger sequencing in case of SNVs and indels, and quantitative PCR (qPCR) for CNVs. SNV and indel analysis led to the identification of a previously reported mutation in PRPH2. Two additional mutations linked to different forms of retinal dystrophies were identified in two families: a known frameshift deletion in RPGR, a gene responsible for X-linked retinitis pigmentosa and p.Ser163Arg in C1QTNF5 associated with Late-Onset Retinal Degeneration. A novel heterozygous deletion spanning the entire region of PRPF31 was also identified in the affected members of a fourth family, which was confirmed with qPCR. This study allowed the identification of the genetic cause of the retinal dystrophy and the establishment of a correct diagnosis in four families, including a large heterozygous deletion in PRPF31, typically considered one of the pitfalls of this method. Since all findings in this study are restricted to known genes, we propose that targeted sequencing using gene-panel is an optimal first approach for the genetic screening and that once known genetic causes are ruled out, WES might be used to uncover new genes involved in inherited retinal dystrophies. PMID:26197217
DOE Office of Scientific and Technical Information (OSTI.GOV)
Goodman, A.B.
1994-09-15
Among relatives of Ashkenazi schizophrenic probands the rate of amyotrophic lateral sclerosis was 3/1,000, compared to expected population rates of approximately 2/100,000. Relative risk of bleeding disorders, including hematologic cancers, was increased more than three-fold compared to controls. Co-occurrence of motor neuron disease and blood dyscrasias, accompanied by psychosis, has long been recognized. A virally-mediated autoimmune pathogenesis has been proposed. However, the familial co-occurrence of these three disease entities raises the possibility that the disease constellation be considered as a manifestation of a common underlying genetic defect. Such expansion of the spectrum of affectation might enhance the power of bothmore » candidate gene and linkage studies. Based on these findings, the loci suggested as candidate regions in schizophrenia include a potential hot spot on chromosome 21q21-q22, involving the superoxide dismutase and amyloid precursor protein genes. Alternatively, genes on other chromosomes involved in the expression, transcription, or regulation of these genes, or associated with the illnesses of high frequency in these pedigrees are suggested. Candidates include the choroid plexus transport protein, transthyretin at 18q11.2-q12.1; the t(14;18)(q22;21) characterizing B-cell lymphoma-2, the most common form of hematologic cancer; and the 14q24 locus of early onset Alzheimer`s disease, c-Fos, transforming growth factor beta 3, and heat shock protein A2. Expression of hematologic cancers and the suggested candidate genes are known to involve retinoid pathways, and retinoid disregulation has been proposed as a cause of schizophrenia. 67 refs., 2 figs., 1 tab.« less
Rossouw, Debra; Næs, Tormod; Bauer, Florian F
2008-01-01
Background 'Omics' tools provide novel opportunities for system-wide analysis of complex cellular functions. Secondary metabolism is an example of a complex network of biochemical pathways, which, although well mapped from a biochemical point of view, is not well understood with regards to its physiological roles and genetic and biochemical regulation. Many of the metabolites produced by this network such as higher alcohols and esters are significant aroma impact compounds in fermentation products, and different yeast strains are known to produce highly divergent aroma profiles. Here, we investigated whether we can predict the impact of specific genes of known or unknown function on this metabolic network by combining whole transcriptome and partial exo-metabolome analysis. Results For this purpose, the gene expression levels of five different industrial wine yeast strains that produce divergent aroma profiles were established at three different time points of alcoholic fermentation in synthetic wine must. A matrix of gene expression data was generated and integrated with the concentrations of volatile aroma compounds measured at the same time points. This relatively unbiased approach to the study of volatile aroma compounds enabled us to identify candidate genes for aroma profile modification. Five of these genes, namely YMR210W, BAT1, AAD10, AAD14 and ACS1 were selected for overexpression in commercial wine yeast, VIN13. Analysis of the data show a statistically significant correlation between the changes in the exo-metabome of the overexpressing strains and the changes that were predicted based on the unbiased alignment of transcriptomic and exo-metabolomic data. Conclusion The data suggest that a comparative transcriptomics and metabolomics approach can be used to identify the metabolic impacts of the expression of individual genes in complex systems, and the amenability of transcriptomic data to direct applications of biotechnological relevance. PMID:18990252
Wu, Shuanghua; Lei, Jianjun; Chen, Guoju; Chen, Hancai; Cao, Bihao; Chen, Changming
2017-01-01
Chinese kale, a vegetable of the cruciferous family, is a popular crop in southern China and Southeast Asia due to its high glucosinolate content and nutritional qualities. However, there is little research on the molecular genetics and genes involved in glucosinolate metabolism and its regulation in Chinese kale. In this study, we sequenced and characterized the transcriptomes and expression profiles of genes expressed in 11 tissues of Chinese kale. A total of 216 million 150-bp clean reads were generated using RNA-sequencing technology. From the sequences, 98,180 unigenes were assembled for the whole plant, and 49,582~98,423 unigenes were assembled for each tissue. Blast analysis indicated that a total of 80,688 (82.18%) unigenes exhibited similarity to known proteins. The functional annotation and classification tools used in this study suggested that genes principally expressed in Chinese kale, were mostly involved in fundamental processes, such as cellular and molecular functions, the signal transduction, and biosynthesis of secondary metabolites. The expression levels of all unigenes were analyzed in various tissues of Chinese kale. A large number of candidate genes involved in glucosinolate metabolism and its regulation were identified, and the expression patterns of these genes were analyzed. We found that most of the genes involved in glucosinolate biosynthesis were highly expressed in the root, petiole, and in senescent leaves. The expression patterns of ten glucosinolate biosynthetic genes from RNA-seq were validated by quantitative RT-PCR in different tissues. These results provided an initial and global overview of Chinese kale gene functions and expression activities in different tissues. PMID:28228764
Valentin-Kahan, Adrián; García-Tejedor, Gabriela B; Robello, Carlos; Trujillo-Cenóz, Omar; Russo, Raúl E; Alvarez-Valin, Fernando
2017-01-01
Slider turtles are the only known amniotes with self-repair mechanisms of the spinal cord that lead to substantial functional recovery. Their strategic phylogenetic position makes them a relevant model to investigate the peculiar genetic programs that allow anatomical reconnection in some vertebrate groups but are absent in others. Here, we analyze the gene expression profile of the response to spinal cord injury (SCI) in the turtle Trachemys scripta elegans . We found that this response comprises more than 1000 genes affecting diverse functions: reaction to ischemic insult, extracellular matrix re-organization, cell proliferation and death, immune response, and inflammation. Genes related to synapses and cholesterol biosynthesis are down-regulated. The analysis of the evolutionary distribution of these genes shows that almost all are present in most vertebrates. Additionally, we failed to find genes that were exclusive of regenerating taxa. The comparison of expression patterns among species shows that the response to SCI in the turtle is more similar to that of mice and non-regenerative Xenopus than to Xenopus during its regenerative stage. This observation, along with the lack of conserved "regeneration genes" and the current accepted phylogenetic placement of turtles (sister group of crocodilians and birds), indicates that the ability of spinal cord self-repair of turtles does not represent the retention of an ancestral vertebrate character. Instead, our results suggest that turtles developed this capability from a non-regenerative ancestor (i.e., a lineage specific innovation) that was achieved by re-organizing gene expression patterns on an essentially non-regenerative genetic background. Among the genes activated by SCI exclusively in turtles, those related to anoxia tolerance, extracellular matrix remodeling, and axonal regrowth are good candidates to underlie functional recovery.
Fitzgerald, Timothy L; Powell, Jonathan J; Stiller, Jiri; Weese, Terri L; Abe, Tomoko; Zhao, Guangyao; Jia, Jizeng; McIntyre, C Lynne; Li, Zhongyi; Manners, John M; Kazan, Kemal
2015-01-01
Reverse genetic techniques harnessing mutational approaches are powerful tools that can provide substantial insight into gene function in plants. However, as compared to diploid species, reverse genetic analyses in polyploid plants such as bread wheat can present substantial challenges associated with high levels of sequence and functional similarity amongst homoeologous loci. We previously developed a high-throughput method to identify deletions of genes within a physically mutagenized wheat population. Here we describe our efforts to combine multiple homoeologous deletions of three candidate disease susceptibility genes (TaWRKY11, TaPFT1 and TaPLDß1). We were able to produce lines featuring homozygous deletions at two of the three homoeoloci for all genes, but this was dependent on the individual mutants used in crossing. Intriguingly, despite extensive efforts, viable lines possessing homozygous deletions at all three homoeoloci could not be produced for any of the candidate genes. To investigate deletion size as a possible reason for this phenomenon, we developed an amplicon sequencing approach based on synteny to Brachypodium distachyon to assess the size of the deletions removing one candidate gene (TaPFT1) in our mutants. These analyses revealed that genomic deletions removing the locus are relatively large, resulting in the loss of multiple additional genes. The implications of this work for the use of heavy ion mutagenesis for reverse genetic analyses in wheat are discussed.
Fitzgerald, Timothy L.; Powell, Jonathan J.; Stiller, Jiri; Weese, Terri L.; Abe, Tomoko; Zhao, Guangyao; Jia, Jizeng; McIntyre, C. Lynne; Li, Zhongyi; Manners, John M.; Kazan, Kemal
2015-01-01
Reverse genetic techniques harnessing mutational approaches are powerful tools that can provide substantial insight into gene function in plants. However, as compared to diploid species, reverse genetic analyses in polyploid plants such as bread wheat can present substantial challenges associated with high levels of sequence and functional similarity amongst homoeologous loci. We previously developed a high-throughput method to identify deletions of genes within a physically mutagenized wheat population. Here we describe our efforts to combine multiple homoeologous deletions of three candidate disease susceptibility genes (TaWRKY11, TaPFT1 and TaPLDß1). We were able to produce lines featuring homozygous deletions at two of the three homoeoloci for all genes, but this was dependent on the individual mutants used in crossing. Intriguingly, despite extensive efforts, viable lines possessing homozygous deletions at all three homoeoloci could not be produced for any of the candidate genes. To investigate deletion size as a possible reason for this phenomenon, we developed an amplicon sequencing approach based on synteny to Brachypodium distachyon to assess the size of the deletions removing one candidate gene (TaPFT1) in our mutants. These analyses revealed that genomic deletions removing the locus are relatively large, resulting in the loss of multiple additional genes. The implications of this work for the use of heavy ion mutagenesis for reverse genetic analyses in wheat are discussed. PMID:25719507
Peñagaricano, Francisco; Zorrilla, Pilar; Naya, Hugo; Robello, Carlos; Urioste, Jorge I
2012-02-01
The white coat colour of sheep is an important economic trait. For unknown reasons, some animals are born with, and others develop with time, black skin spots that can also produce pigmented fibres. The presence of pigmented fibres in the white wool significantly decreases the fibre quality. The aim of this work was to study gene expression in black spots (with and without pigmented fibres) and white skin by microarray techniques, in order to identify the possible genes involved in the development of this trait. Five unrelated Corriedale sheep were used and, for each animal, the three possible comparisons (three different hybridisations) between the three samples of interest were performed. Differential gene expression patterns were analysed using different t-test approaches. Most of the major genes with well-known roles in skin pigmentation, e.g. ASIP, MC1R and C-KIT, showed no significant difference in the gene expression between white skin and black spots. On the other hand, many of the differentially expressed genes (raw P-value < 0.005) detected in this study, e.g. C-FOS, KLF4 and UFC1, fulfil biological functions that are plausible to be involved in the formation of black spots. The gene expression of C-FOS and KLF4, transcription factors involved in the cellular response to external factors such as ultraviolet light, was validated by quantitative polymerase chain reaction (PCR). This exploratory study provides a list of candidate genes that could be associated with the development of black skin spots that should be studied in more detail. Characterisation of these genes will enable us to discern the molecular mechanisms involved in the development of this feature and, hence, increase our understanding of melanocyte biology and skin pigmentation. In sheep, understanding this phenomenon is a first step towards developing molecular tools to assist in the selection against the presence of pigmented fibres in white wool.
2014-01-01
Background Discerning the traits evolving under neutral conditions from those traits evolving rapidly because of various selection pressures is a great challenge. We propose a new method, composite selection signals (CSS), which unifies the multiple pieces of selection evidence from the rank distribution of its diverse constituent tests. The extreme CSS scores capture highly differentiated loci and underlying common variants hauling excess haplotype homozygosity in the samples of a target population. Results The data on high-density genotypes were analyzed for evidence of an association with either polledness or double muscling in various cohorts of cattle and sheep. In cattle, extreme CSS scores were found in the candidate regions on autosome BTA-1 and BTA-2, flanking the POLL locus and MSTN gene, for polledness and double muscling, respectively. In sheep, the regions with extreme scores were localized on autosome OAR-2 harbouring the MSTN gene for double muscling and on OAR-10 harbouring the RXFP2 gene for polledness. In comparison to the constituent tests, there was a partial agreement between the signals at the four candidate loci; however, they consistently identified additional genomic regions harbouring no known genes. Persuasively, our list of all the additional significant CSS regions contains genes that have been successfully implicated to secondary phenotypic diversity among several subpopulations in our data. For example, the method identified a strong selection signature for stature in cattle capturing selective sweeps harbouring UQCC-GDF5 and PLAG1-CHCHD7 gene regions on BTA-13 and BTA-14, respectively. Both gene pairs have been previously associated with height in humans, while PLAG1-CHCHD7 has also been reported for stature in cattle. In the additional analysis, CSS identified significant regions harbouring multiple genes for various traits under selection in European cattle including polledness, adaptation, metabolism, growth rate, stature, immunity, reproduction traits and some other candidate genes for dairy and beef production. Conclusions CSS successfully localized the candidate regions in validation datasets as well as identified previously known and novel regions for various traits experiencing selection pressure. Together, the results demonstrate the utility of CSS by its improved power, reduced false positives and high-resolution of selection signals as compared to individual constituent tests. PMID:24636660
Jiang, Xin; Xue, Yang; Zhou, Hongzhi; Li, Shouhong; Zhang, Zongmin; Hou, Rui; Ding, Yuxiang; Hu, Kaijin
2015-10-01
Reference genes are commonly used as a reliable approach to normalize the results of quantitative polymerase chain reaction (qPCR), and to reduce errors in the relative quantification of gene expression. Suitable reference genes belonging to numerous functional classes have been identified for various types of species and tissue. However, little is currently known regarding the most suitable reference genes for bone, specifically for the sheep mandibular condyle. Sheep are important for the study of human bone diseases, particularly for temporomandibular diseases. The present study aimed to identify a set of reference genes suitable for the normalization of qPCR data from the mandibular condyle of sheep. A total of 12 reference genes belonging to various functional classes were selected, and the expression stability of the reference genes was determined in both the normal and fractured area of the sheep mandibular condyle. RefFinder, which integrates the following currently available computational algorithms: geNorm, NormFinder, BestKeeper, and the comparative ΔCt method, was used to compare and rank the candidate reference genes. The results obtained from the four methods demonstrated a similar trend: RPL19, ACTB, and PGK1 were the most stably expressed reference genes in the sheep mandibular condyle. As determined by RefFinder comprehensive analysis, the results of the present study suggested that RPL19 is the most suitable reference gene for studies associated with the sheep mandibular condyle. In addition, ACTB and PGK1 may be considered suitable alternatives.
Genetic dissection and validation of candidate genes for flag leaf size in rice (Oryza sativa L.).
Tang, Xinxin; Gong, Rong; Sun, Wenqiang; Zhang, Chaopu; Yu, Sibin
2018-04-01
Two major loci with functional candidate genes were identified and validated affecting flag leaf size, which offer desirable genes to improve leaf architecture and photosynthetic capacity in rice. Leaf size is a major determinant of plant architecture and yield potential in crops. However, the genetic and molecular mechanisms regulating leaf size remain largely elusive. In this study, quantitative trait loci (QTLs) for flag leaf length and flag leaf width in rice were detected with high-density single nucleotide polymorphism genotyping of a chromosomal segment substitution line (CSSL) population, in which each line carries one or a few chromosomal segments from the japonica cultivar Nipponbare in a common background of the indica variety Zhenshan 97. In total, 14 QTLs for flag leaf length and nine QTLs for flag leaf width were identified in the CSSL population. Among them, qFW4-2 for flag leaf width was mapped to a 37-kb interval, with the most likely candidate gene being the previously characterized NAL1. Another major QTL for both flag leaf width and length was delimited by substitution mapping to a small region of 13.5 kb that contains a single gene, Ghd7.1. Mutants of Ghd7.1 generated using CRISPR/CAS9 approach showed reduced leaf size. Allelic variation analyses also validated Ghd7.1 as a functional candidate gene for leaf size, photosynthetic capacity and other yield-related traits. These results provide useful genetic information for the improvement of leaf size and yield in rice breeding programs.
Tohidi, Reza; Idris, Ismail Bin; Panandam, Jothi Malar; Bejo, Mohd Hair
2012-12-01
Salmonella Enteritidis is a major cause of food poisoning worldwide, and poultry products are the main source of S. Enteritidis contamination for humans. Among the numerous strategies for disease control, improving genetic resistance to S. Enteritidis has been the most effective approach. We investigated the association between S. Enteritidis burden in the caecum, spleen, and liver of young indigenous chickens and seven candidate genes, selected on the basis of their critical roles in immunological functions. The genes included those encoding interleukin 2 (IL-2), interferon-γ (IFN-γ), transforming growth factor β2 (TGF-β2), immunoglobulin light chain (IgL), toll-like receptor 4 (TLR-4), myeloid differentiation protein 2 (MD-2), and inducible nitric oxide synthase (iNOS). Two Malaysian indigenous chicken breeds were used as sustainable genetic sources of alleles that are resistant to salmonellosis. The polymerase chain reaction restriction fragment-length polymorphism technique was used to genotype the candidate genes. Three different genotypes were observed in all of the candidate genes, except for MD-2. All of the candidate genes showed the Hardy-Weinberg equilibrium for the two populations. The IL-2-MnlI polymorphism was associated with S. Enteritidis burden in the caecum and spleen. The TGF-β2-RsaI, TLR-4-Sau 96I, and iNOS-AluI polymorphisms were associated with the caecum S. Enteritidis load. The other candidate genes were not associated with S. Enteritidis load in any organ. The results indicate that the IL-2, TGF-β2, TLR-4, and iNOS genes are potential candidates for use in selection programmes for increasing genetic resistance against S. Enteritidis in Malaysian indigenous chickens.
Rao, Yan; Dong, Sufang; Li, Zuhua; Yang, Guohua; Peng, Chunyan; Yan, Ming; Zheng, Fang
2017-01-01
To identify the potential candidate genes for a large Chinese family with autosomal dominant congenital cataract (ADCC) and nystagmus, and investigate the possible molecular mechanism underlying the role of the candidate genes in cataractogenesis. We combined the linkage analysis and direct sequencing for the candidate genes in the linkage regions to identify the causative mutation. The molecular and bio-functional properties of the proteins encoded by the candidate genes was further explored with biophysical and biochemical studies of the recombinant wild-type and mutant proteins. We identified a c. C749T (p.Q227X) transversion in exon 6 of CRYBB1 , a cataract-causative gene. This nonsense mutation changes a phylogenetically conserved glutamine to a stop codon and is predicted to truncate the C-terminus of the wild-type protein by 26 amino acids. Comparison of the biophysical and biochemical properties of the recombinant full-length and truncated βB1-crystallins revealed that the mutation led to the insolubility and the phase separation phenomenon of the truncated protein with a changed conformation. Meanwhile, the thermal stability of the truncated βB1-crystallin was significantly decreased, and the mutation diminished the chaperoning ability of αA-crystallin with the mutant under heating stress. Our findings highlight the importance of the C-terminus in βB1-crystallin in maintaining the crystalline function and stability, and provide a novel insight into the molecular mechanism underlying the pathogenesis of human autosomal dominant congenital cataract.
Emergence and Evolution of Hominidae-Specific Coding and Noncoding Genomic Sequences
Saber, Morteza Mahmoudi; Adeyemi Babarinde, Isaac; Hettiarachchi, Nilmini; Saitou, Naruya
2016-01-01
Family Hominidae, which includes humans and great apes, is recognized for unique complex social behavior and intellectual abilities. Despite the increasing genome data, however, the genomic origin of its phenotypic uniqueness has remained elusive. Clade-specific genes and highly conserved noncoding sequences (HCNSs) are among the high-potential evolutionary candidates involved in driving clade-specific characters and phenotypes. On this premise, we analyzed whole genome sequences along with gene orthology data retrieved from major DNA databases to find Hominidae-specific (HS) genes and HCNSs. We discovered that Down syndrome critical region 4 (DSCR4) is the only experimentally verified gene uniquely present in Hominidae. DSCR4 has no structural homology to any known protein and was inferred to have emerged in several steps through LTR/ERV1, LTR/ERVL retrotransposition, and transversion. Using the genomic distance as neutral evolution threshold, we identified 1,658 HS HCNSs. Polymorphism coverage and derived allele frequency analysis of HS HCNSs showed that these HCNSs are under purifying selection, indicating that they may harbor important functions. They are overrepresented in promoters/untranslated regions, in close proximity of genes involved in sensory perception of sound and developmental process, and also showed a significantly lower nucleosome occupancy probability. Interestingly, many ancestral sequences of the HS HCNSs showed very high evolutionary rates. This suggests that new functions emerged through some kind of positive selection, and then purifying selection started to operate to keep these functions. PMID:27289096
Zhu, Zhou; Ihle, Nathan T; Rejto, Paul A; Zarrinkar, Patrick P
2016-06-13
Genome-scale functional genomic screens across large cell line panels provide a rich resource for discovering tumor vulnerabilities that can lead to the next generation of targeted therapies. Their data analysis typically has focused on identifying genes whose knockdown enhances response in various pre-defined genetic contexts, which are limited by biological complexities as well as the incompleteness of our knowledge. We thus introduce a complementary data mining strategy to identify genes with exceptional sensitivity in subsets, or outlier groups, of cell lines, allowing an unbiased analysis without any a priori assumption about the underlying biology of dependency. Genes with outlier features are strongly and specifically enriched with those known to be associated with cancer and relevant biological processes, despite no a priori knowledge being used to drive the analysis. Identification of exceptional responders (outliers) may not lead only to new candidates for therapeutic intervention, but also tumor indications and response biomarkers for companion precision medicine strategies. Several tumor suppressors have an outlier sensitivity pattern, supporting and generalizing the notion that tumor suppressors can play context-dependent oncogenic roles. The novel application of outlier analysis described here demonstrates a systematic and data-driven analytical strategy to decipher large-scale functional genomic data for oncology target and precision medicine discoveries.
Functional polymorphisms associated with human muscle size and strength.
Thompson, Paul D; Moyna, Niall; Seip, Richard; Price, Thomas; Clarkson, Priscilla; Angelopoulos, Theodore; Gordon, Paul; Pescatello, Linda; Visich, Paul; Zoeller, Robert; Devaney, Joseph M; Gordish, Heather; Bilbie, Stephen; Hoffman, Eric P
2004-07-01
Skeletal muscle is critically important to human performance and health, but little is known of the genetic factors influencing muscle size, strength, and its response to exercise training. The Functional single nucleotide polymorphisms (SNP) Associated with Muscle Size and Strength, or FAMuSS, Study is a multicenter, NIH-funded program to examine the influence of gene polymorphisms on skeletal muscle size and strength before and after resistance exercise training. One thousand men and women, age 18 - 40 yr, will train their nondominant arm for 12 wk. Skeletal muscle size (magnetic resonance imaging) and isometric and dynamic strength will be measured before and after training. Individuals whose baseline values or response to training deviate > or = 1.5 SD will be defined as outliers and examined for genetic variants. Initially candidate genes previously associated with muscle performance will be examined, but the study will ultimately attempt to identify genes associated with muscle performance. FAMuSS should help identify genetic factors associated with muscle performance and the response to exercise training. Such insight should contribute to our ability to predict the individual response to exercise training but may also contribute to understanding better muscle physiology, to identifying individuals who are susceptible to muscle loss with environmental challenge, and to developing pharmacologic agents capable of preserving muscle size and function.
2009-01-01
Background Soybeans grown in the upper Midwestern United States often suffer from iron deficiency chlorosis, which results in yield loss at the end of the season. To better understand the effect of iron availability on soybean yield, we identified genes in two near isogenic lines with changes in expression patterns when plants were grown in iron sufficient and iron deficient conditions. Results Transcriptional profiles of soybean (Glycine max, L. Merr) near isogenic lines Clark (PI548553, iron efficient) and IsoClark (PI547430, iron inefficient) grown under Fe-sufficient and Fe-limited conditions were analyzed and compared using the Affymetrix® GeneChip® Soybean Genome Array. There were 835 candidate genes in the Clark (PI548553) genotype and 200 candidate genes in the IsoClark (PI547430) genotype putatively involved in soybean's iron stress response. Of these candidate genes, fifty-eight genes in the Clark genotype were identified with a genetic location within known iron efficiency QTL and 21 in the IsoClark genotype. The arrays also identified 170 single feature polymorphisms (SFPs) specific to either Clark or IsoClark. A sliding window analysis of the microarray data and the 7X genome assembly coupled with an iterative model of the data showed the candidate genes are clustered in the genome. An analysis of 5' untranslated regions in the promoter of candidate genes identified 11 conserved motifs in 248 differentially expressed genes, all from the Clark genotype, representing 129 clusters identified earlier, confirming the cluster analysis results. Conclusion These analyses have identified the first genes with expression patterns that are affected by iron stress and are located within QTL specific to iron deficiency stress. The genetic location and promoter motif analysis results support the hypothesis that the differentially expressed genes are co-regulated. The combined results of all analyses lead us to postulate iron inefficiency in soybean is a result of a mutation in a transcription factor(s), which controls the expression of genes required in inducing an iron stress response. PMID:19678937
Saenko, S V; Jerónimo, M A; Beldade, P
2012-06-01
Melanism, the overall darkening of the body, is a widespread form of animal adaptation to particular environments, and includes bookcase examples of evolution by natural selection, such as industrial melanism in the peppered moth. The major components of the melanin biosynthesis pathway have been characterized in model insects, but little is known about the genetic basis of life-stage specific melanism such as cases described in some lepidopteran species. Here, we investigate two melanic mutations of Bicyclus anynana butterflies, called Chocolate and melanine, that exclusively affect pigmentation of the larval and adult stages, respectively. Our analysis of Mendelian segregation patterns reveals that the larval and adult melanic phenotypes are due to alleles at different, independently segregating loci. Our linkage mapping analysis excludes the pigmentation candidate gene black as the melanine locus, and implicates a gene encoding a putative pyridoxal phosphate-dependant cysteine sulfinic acid decarboxylase as the Chocolate locus. We show variation in coding sequence and in expression levels for this candidate larval melanism locus. This is the first study that suggests a biological function for this gene in insects. Our findings open up exciting opportunities to study the role of this locus in the evolution of adaptive variation in pigmentation, and the uncoupling of regulation of pigment biosynthesis across developmental stages with different ecologies and pressures on body coloration.
Saenko, S V; Jerónimo, M A; Beldade, P
2012-01-01
Melanism, the overall darkening of the body, is a widespread form of animal adaptation to particular environments, and includes bookcase examples of evolution by natural selection, such as industrial melanism in the peppered moth. The major components of the melanin biosynthesis pathway have been characterized in model insects, but little is known about the genetic basis of life-stage specific melanism such as cases described in some lepidopteran species. Here, we investigate two melanic mutations of Bicyclus anynana butterflies, called Chocolate and melanine, that exclusively affect pigmentation of the larval and adult stages, respectively. Our analysis of Mendelian segregation patterns reveals that the larval and adult melanic phenotypes are due to alleles at different, independently segregating loci. Our linkage mapping analysis excludes the pigmentation candidate gene black as the melanine locus, and implicates a gene encoding a putative pyridoxal phosphate-dependant cysteine sulfinic acid decarboxylase as the Chocolate locus. We show variation in coding sequence and in expression levels for this candidate larval melanism locus. This is the first study that suggests a biological function for this gene in insects. Our findings open up exciting opportunities to study the role of this locus in the evolution of adaptive variation in pigmentation, and the uncoupling of regulation of pigment biosynthesis across developmental stages with different ecologies and pressures on body coloration. PMID:22234245
Genome-wide association study of rust traits in orchardgrass using SLAF-seq technology.
Zeng, Bing; Yan, Haidong; Liu, Xinchun; Zang, Wenjing; Zhang, Ailing; Zhou, Sifan; Huang, Linkai; Liu, Jinping
2017-01-01
While orchardgrass ( Dactylis glomerata L.) is a well-known perennial forage species, rust diseases cause serious reductions in the yield and quality of orchardgrass; however, genetic mechanisms of rust resistance are not well understood in orchardgrass. In this study, a genome-wide association study (GWAS) was performed using specific-locus amplified fragment sequencing (SLAF-seq) technology in orchardgrass. A total of 2,334,889 SLAF tags were generated to produce 2,309,777 SNPs. ADMIXTURE analysis revealed unstructured subpopulations for 33 accessions, indicating that this orchardgrass population could be used for association analysis. Linkage disequilibrium (LD) analysis revealed an average r 2 of 0.4 across all SNP pairs, indicating a high extent of LD in these samples. Through GWAS, a total of 4,604 SNPs were found to be significantly ( P < 0.01) associated with the rust trait. The bulk analysis discovered a number of 5,211 SNPs related to rust trait. Two candidate genes, including cytochrome P450, and prolamin were implicated in disease resistance through prediction of functional genes surrounding each high-quality SNP ( P < 0.01) associated with rust traits based on GWAS analysis and bulk analysis. The large number of SNPs associated with rust traits and these two candidate genes may provide the basis for further research on rust resistance mechanisms and marker-assisted selection (MAS) for rust-resistant lineages.
Escribano, Julio; Coca-Prados, Miguel
2002-08-28
The ciliary body is largely known for its major roles in the regulation of aqueous humor secretion, intraocular pressure, and accommodation of the lens. In this review article we applied bioinformatics to re-examine hundreds of expressed sequence tags (ESTs) previously isolated by subtractive hybridization from a human ciliary body library [1]. The DNA sequences of these clones have been recently added to the web site of NEIBank. DNA sequence comparisons of subtracted ESTs were performed against all entries in the last available release of the non-redundant database containing GenBank, EMBL, DDBJ and PDB sequences using the BlastN program accessed through NCBI's BLAST services on the internet (NCBI). Sequences were also compared and mapped using the Blast search program provided through the Internet by the Human Genome Project (UCSC). A total number of 284 independent ESTs were classified in 17 functional groups. Analysis of their relationships allowed to define the expression of five major groups of known genes: (i) protein synthesis, folding, secretion and degradation (20%); (ii) energy supply and biosynthesis (12%); (iii) contractility and cytoskeleton structure (6%); (iv) cellular signaling and cell cycle regulation (7%); and (v) nerve cell related tasks (2%), including neuropeptide processing and putative non-visual phototransduction and circadian rhythm control. The largest group contain unidentified sequences, a total of 105 sequences, accounting for 37% of ESTs. The unidentified sequences show similarity to genomic non-coding regions, or genes of unknown function. The most highly represented EST, correspond to myocilin, a gene involved in glaucoma. The data also confirms the secretory functions of the ciliary epithelium, and its high metabolism; the presence of a neuroendocrine peptidergic system presumably involved in the regulation of the intraocular pressure and/or aqueous humor secretion. Additional genes may be related to a non-visual phototransduction cascade and/or to circadian rhythms. Overall this initial group of subtracted ESTs can lead to uncover novel physiological functions of the ciliary body in normal and in disease, as well as novel candidate genes for ocular diseases.
Electing a candidate: a speculative history of the bacterial phylum OP10.
Dunfield, Peter F; Tamas, Ivica; Lee, Kevin C; Morgan, Xochitl C; McDonald, Ian R; Stott, Matthew B
2012-12-01
In 1998, a cultivation-independent survey of the microbial community in Obsidian Pool, Yellowstone National Park, detected 12 new phyla within the Domain Bacteria. These were dubbed 'candidate divisions' OP1 to OP12. Since that time the OP10 candidate division has been commonly detected in various environments, usually as part of the rare biosphere, but occasionally as a predominant community component. Based on 16S rRNA gene phylogeny, OP10 comprises at least 12 class-level subdivisions. However, despite this broad ecological and evolutionary diversity, all OP10 bacteria have eluded cultivation until recently. In 2011, two reference species of OP10 were taxonomically validated, removing the phylum from its 'candidate' status. Construction of a highly resolved phylogeny based on 29 universally conserved genes verifies its standing as a unique bacterial phylum. In the following paper we summarize what is known and what is suspected about the newest described bacterial phylum, the Armatimonadetes. © 2012 Society for Applied Microbiology and Blackwell Publishing Ltd.
Wei, Pi-Jing; Zhang, Di; Xia, Junfeng; Zheng, Chun-Hou
2016-12-23
Cancer is a complex disease which is characterized by the accumulation of genetic alterations during the patient's lifetime. With the development of the next-generation sequencing technology, multiple omics data, such as cancer genomic, epigenomic and transcriptomic data etc., can be measured from each individual. Correspondingly, one of the key challenges is to pinpoint functional driver mutations or pathways, which contributes to tumorigenesis, from millions of functional neutral passenger mutations. In this paper, in order to identify driver genes effectively, we applied a generalized additive model to mutation profiles to filter genes with long length and constructed a new gene-gene interaction network. Then we integrated the mutation data and expression data into the gene-gene interaction network. Lastly, greedy algorithm was used to prioritize candidate driver genes from the integrated data. We named the proposed method Length-Net-Driver (LNDriver). Experiments on three TCGA datasets, i.e., head and neck squamous cell carcinoma, kidney renal clear cell carcinoma and thyroid carcinoma, demonstrated that the proposed method was effective. Also, it can identify not only frequently mutated drivers, but also rare candidate driver genes.
Theodorou, Vassiliki; Kimm, Melanie A; Boer, Mandy; Wessels, Lodewyk; Theelen, Wendy; Jonkers, Jos; Hilkens, John
2007-06-01
We performed a high-throughput retroviral insertional mutagenesis screen in mouse mammary tumor virus (MMTV)-induced mammary tumors and identified 33 common insertion sites, of which 17 genes were previously not known to be associated with mammary cancer and 13 had not previously been linked to cancer in general. Although members of the Wnt and fibroblast growth factors (Fgf) families were frequently tagged, our exhaustive screening for MMTV insertion sites uncovered a new repertoire of candidate breast cancer oncogenes. We validated one of these genes, Rspo3, as an oncogene by overexpression in a p53-deficient mammary epithelial cell line. The human orthologs of the candidate oncogenes were frequently deregulated in human breast cancers and associated with several tumor parameters. Computational analysis of all MMTV-tagged genes uncovered specific gene families not previously associated with cancer and showed a significant overrepresentation of protein domains and signaling pathways mainly associated with development and growth factor signaling. Comparison of all tagged genes in MMTV and Moloney murine leukemia virus-induced malignancies showed that both viruses target mostly different genes that act predominantly in distinct pathways.
Bagheri, Hani; Badduke, Chansonette; Qiao, Ying; Colnaghi, Rita; Abramowicz, Iga; Alcantara, Diana; Dunham, Christopher; Wen, Jiadi; Wildin, Robert S.; Nowaczyk, Malgorzata J.M.; Eichmeyer, Jennifer; Lehman, Anna; Maranda, Bruno; Martell, Sally; Shan, Xianghong; Lewis, Suzanne M.E.; O’Driscoll, Mark; Gregory-Evans, Cheryl Y.
2016-01-01
The 2p15p16.1 microdeletion syndrome has a core phenotype consisting of intellectual disability, microcephaly, hypotonia, delayed growth, common craniofacial features, and digital anomalies. So far, more than 20 cases of 2p15p16.1 microdeletion syndrome have been reported in the literature; however, the size of the deletions and their breakpoints vary, making it difficult to identify the candidate genes. Recent reports pointed to 4 genes (XPO1, USP34, BCL11A, and REL) that were included, alone or in combination, in the smallest deletions causing the syndrome. Here, we describe 8 new patients with the 2p15p16.1 deletion and review all published cases to date. We demonstrate functional deficits for the above 4 candidate genes using patients’ lymphoblast cell lines (LCLs) and knockdown of their orthologs in zebrafish. All genes were dosage sensitive on the basis of reduced protein expression in LCLs. In addition, deletion of XPO1, a nuclear exporter, cosegregated with nuclear accumulation of one of its cargo molecules (rpS5) in patients’ LCLs. Other pathways associated with these genes (e.g., NF-κB and Wnt signaling as well as the DNA damage response) were not impaired in patients’ LCLs. Knockdown of xpo1a, rel, bcl11aa, and bcl11ab resulted in abnormal zebrafish embryonic development including microcephaly, dysmorphic body, hindered growth, and small fins as well as structural brain abnormalities. Our multifaceted analysis strongly implicates XPO1, REL, and BCL11A as candidate genes for 2p15p16.1 microdeletion syndrome. PMID:27699255
Chandran, Anil Kumar Nalini; Lee, Gang-Seob; Yoo, Yo-Han; Yoon, Ung-Han; Ahn, Byung-Ohg; Yun, Doh-Won; Kim, Jin-Hyun; Choi, Hong-Kyu; An, GynHeung; Kim, Tae-Ho; Jung, Ki-Hong
2016-12-01
Rice is one of the most important food crops for humans. To improve the agronomical traits of rice, the functions of more than 1,000 rice genes have been recently characterized and summarized. The completed, map-based sequence of the rice genome has significantly accelerated the functional characterization of rice genes, but progress remains limited in assigning functions to all predicted non-transposable element (non-TE) genes, estimated to number 37,000-41,000. The International Rice Functional Genomics Consortium (IRFGC) has generated a huge number of gene-indexed mutants by using mutagens such as T-DNA, Tos17 and Ds/dSpm. These mutants have been identified by 246,566 flanking sequence tags (FSTs) and cover 65 % (25,275 of 38,869) of the non-TE genes in rice, while the mutation ratio of TE genes is 25.7 %. In addition, almost 80 % of highly expressed non-TE genes have insertion mutations, indicating that highly expressed genes in rice chromosomes are more likely to have mutations by mutagens such as T-DNA, Ds, dSpm and Tos17. The functions of around 2.5 % of rice genes have been characterized, and studies have mainly focused on transcriptional and post-transcriptional regulation. Slow progress in characterizing the function of rice genes is mainly due to a lack of clues to guide functional studies or functional redundancy. These limitations can be partially solved by a well-categorized functional classification of FST genes. To create this classification, we used the diverse overviews installed in the MapMan toolkit. Gene Ontology (GO) assignment to FST genes supplemented the limitation of MapMan overviews. The functions of 863 of 1,022 known genes can be evaluated by current FST lines, indicating that FST genes are useful resources for functional genomic studies. We assigned 16,169 out of 29,624 FST genes to 34 MapMan classes, including major three categories such as DNA, RNA and protein. To demonstrate the MapMan application on FST genes, transcriptome analysis was done from a rice mutant of 1-deoxy-D-xylulose 5-phosphate reductoisomerase (DXR) gene with FST. Mapping of 756 down-regulated genes in dxr mutants and their annotation in terms of various MapMan overviews revealed candidate genes downstream of DXR-mediating light signaling pathway in diverse functional classes such as the methyl-D-erythritol 4-phosphatepathway (MEP) pathway overview, photosynthesis, secondary metabolism and regulatory overview. This report provides a useful guide for systematic phenomics and further applications to enhance the key agronomic traits of rice.
Rodriguez-Fernandez, I A; Dell'Angelica, E C
2009-04-01
The study of protein-protein interactions is a powerful approach to uncovering the molecular function of gene products associated with human disease. Protein-protein interaction data are accumulating at an unprecedented pace owing to interactomics projects, although it has been recognized that a significant fraction of these data likely represents false positives. During our studies of biogenesis of lysosome-related organelles complex-1 (BLOC-1), a protein complex involved in protein trafficking and containing the products of genes mutated in Hermansky-Pudlak syndrome, we faced the problem of having too many candidate binding partners to pursue experimentally. In this work, we have explored ways of efficiently gathering high-quality information about candidate binding partners and presenting the information in a visually friendly manner. We applied the approach to rank 70 candidate binding partners of human BLOC-1 and 102 candidates of its counterpart from Drosophila melanogaster. The top candidate for human BLOC-1 was the small GTPase encoded by the RAB11A gene, which is a paralogue of the Rab38 and Rab32 proteins in mammals and the lightoid gene product in flies. Interestingly, genetic analyses in D. melanogaster uncovered a synthetic sick/lethal interaction between Rab11 and lightoid. The data-mining approach described herein can be customized to study candidate binding partners for other proteins or possibly candidates derived from other types of 'omics' data.
Lalucque, Hervé; Malagnac, Fabienne; Green, Kimberly; Gautier, Valérie; Grognet, Pierre; Chan Ho Tong, Laetitia; Scott, Barry; Silar, Philippe
2017-01-15
Filamentous ascomycetes produce complex multicellular structures during sexual reproduction. Little is known about the genetic pathways enabling the construction of such structures. Here, with a combination of classical and reverse genetic methods, as well as genetic mosaic and graft analyses, we identify and provide evidence for key roles for two genes during the formation of perithecia, the sexual fruiting bodies, of the filamentous fungus Podospora anserina. Data indicate that the proteins coded by these two genes function cell-non-autonomously and that their activity depends upon conserved cysteines, making them good candidate for being involved in the transmission of a reactive oxygen species (ROS) signal generated by the PaNox1 NADPH oxidase inside the maturing fruiting body towards the PaMpk1 MAP kinase, which is located inside the underlying mycelium, in which nutrients are stored. These data provide important new insights to our understanding of how fungi build multicellular structures. Copyright © 2016 Elsevier Inc. All rights reserved.
Germline Mutations and Polymorphisms in the Origins of Cancers in Women
Hirshfield, Kim M.; Rebbeck, Timothy R.; Levine, Arnold J.
2010-01-01
Several female malignancies including breast, ovarian, and endometrial cancers can be characterized based on known somatic and germline mutations. Initiation and propagation of tumors reflect underlying genomic alterations such as mutations, polymorphisms, and copy number variations found in genes of multiple cellular pathways. The contributions of any single genetic variation or mutation in a population depend on its frequency and penetrance as well as tissue-specific functionality. Genome wide association studies, fluorescence in situ hybridization, comparative genomic hybridization, and candidate gene studies have enumerated genetic contributors to cancers in women. These include p53, BRCA1, BRCA2, STK11, PTEN, CHEK2, ATM, BRIP1, PALB2, FGFR2, TGFB1, MDM2, MDM4 as well as several other chromosomal loci. Based on the heterogeneity within a specific tumor type, a combination of genomic alterations defines the cancer subtype, biologic behavior, and in some cases, response to therapeutics. Consideration of tumor heterogeneity is therefore important in the critical analysis of gene associations in cancer. PMID:20111735
2013-01-01
Background Scant genomic information from non-avian reptile sex chromosomes is available, and for only a few lizards, several snakes and one turtle species, and it represents only a small fraction of the total sex chromosome sequences in these species. Results We report a 352 kb of contiguous sequence from the sex chromosome of a squamate reptile, Pogona vitticeps, with a ZZ/ZW sex microchromosome system. This contig contains five protein coding genes (oprd1, rcc1, znf91, znf131, znf180), and major families of repetitive sequences with a high number of copies of LTR and non-LTR retrotransposons, including the CR1 and Bov-B LINEs. The two genes, oprd1 and rcc1 are part of a homologous syntenic block, which is conserved among amniotes. While oprd1 and rcc1 have no known function in sex determination or differentiation in amniotes, this homologous syntenic block in mammals and chicken also contains R-spondin 1 (rspo1), the ovarian differentiating gene in mammals. In order to explore the probability that rspo1 is sex determining in dragon lizards, genomic BAC and cDNA clones were mapped using fluorescence in situ hybridisation. Their location on an autosomal microchromosome pair, not on the ZW sex microchromosomes, eliminates rspo1 as a candidate sex determining gene in P. vitticeps. Conclusion Our study has characterized the largest contiguous stretch of physically mapped sex chromosome sequence (352 kb) from a ZZ/ZW lizard species. Although this region represents only a small fraction of the sex chromosomes of P. vitticeps, it has revealed several features typically associated with sex chromosomes including the accumulation of large blocks of repetitive sequences. PMID:24344927
Longhi, Sara; Moretto, Marco; Viola, Roberto; Velasco, Riccardo; Costa, Fabrizio
2012-02-01
Fruit ripening is a complex physiological process in plants whereby cell wall programmed changes occur mainly to promote seed dispersal. Cell wall modification also directly regulates the textural properties, a fundamental aspect of fruit quality. In this study, two full-sib populations of apple, with 'Fuji' as the common maternal parent, crossed with 'Delearly' and 'Pink Lady', were used to understand the control of fruit texture by QTL mapping and in silico gene mining. Texture was dissected with a novel high resolution phenomics strategy, simultaneously profiling both mechanical and acoustic fruit texture components. In 'Fuji × Delearly' nine linkage groups were associated with QTLs accounting from 15.6% to 49% of the total variance, and a highly significant QTL cluster for both textural components was mapped on chromosome 10 and co-located with Md-PG1, a polygalacturonase gene that, in apple, is known to be involved in cell wall metabolism processes. In addition, other candidate genes related to Md-NOR and Md-RIN transcription factors, Md-Pel (pectate lyase), and Md-ACS1 were mapped within statistical intervals. In 'Fuji × Pink Lady', a smaller set of linkage groups associated with the QTLs identified for fruit texture (15.9-34.6% variance) was observed. The analysis of the phenotypic variance over a two-dimensional PCA plot highlighted a transgressive segregation for this progeny, revealing two QTL sets distinctively related to both mechanical and acoustic texture components. The mining of the apple genome allowed the discovery of the gene inventory underlying each QTL, and functional profile assessment unravelled specific gene expression patterns of these candidate genes.
Yıldırım, Kubilay; Uylaş, Senem
2016-12-01
Boron (B) is an essential nutrient for normal growth of plants. Despite its low abundance in soils, it could be highly toxic to plants in especially arid and semi-arid environments. Poplars are known to be tolerant species to B toxicity and accumulation. However, physiological and gene regulation responses of these trees to B toxicity have not been investigated yet. Here, B accumulation and tolerance level of black poplar clones were firstly tested in the current study. Rooted cutting of these clones were treated with elevated B toxicity to select the most B accumulator and tolerant genotype. Then we carried out a microarray based transcriptome experiment on the leaves and roots of this genotype to find out transcriptional networks, genes and molecular mechanisms behind B toxicity tolerance. The results of the study indicated that black poplar is quite suitable for phytoremediation of B pollution. It could resist 15 ppm soil B content and >1500 ppm B accumulation in leaves, which are highly toxic concentrations for almost all agricultural plants. Transcriptomics results of study revealed totally 1625 and 1419 altered probe sets under 15 ppm B toxicity in leaf and root tissues, respectively. The highest induction were recorded for the probes sets annotated to tyrosine aminotransferase, ATP binding cassette transporters, glutathione S transferases and metallochaperone proteins. Strong up regulation of these genes attributed to internal excretion of B into the cell vacuole and existence of B detoxification processes in black poplar. Many other candidate genes functional in signalling, gene regulation, antioxidation, B uptake and transport processes were also identified in this hyper B accumulator plant for the first time with the current study. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Identification of the gene for Nance-Horan syndrome (NHS)
Brooks, S; Ebenezer, N; Poopalasundaram, S; Lehmann, O; Moore, A; Hardcastle, A
2004-01-01
Background: The disease intervals for Nance-Horan syndrome (NHS [MIM 302350]) and X linked congenital cataract (CXN) overlap on Xp22. Objective: To identify the gene or genes responsible for these diseases. Methods: Families with NHS were ascertained. The refined locus for CXN was used to focus the search for candidate genes, which were screened by polymerase chain reaction and direct sequencing of potential exons and intron-exon splice sites. Genomic structures and homologies were determined using bioinformatics. Expression studies were undertaken using specific exonic primers to amplify human fetal cDNA and mouse RNA. Results: A novel gene NHS, with no known function, was identified as causative for NHS. Protein truncating mutations were detected in all three NHS pedigrees, but no mutation was identified in a CXN family, raising the possibility that NHS and CXN may not be allelic. The NHS gene forms a new gene family with a closely related novel gene NHS-Like1 (NHSL1). NHS and NHSL1 lie in paralogous duplicated chromosomal intervals on Xp22 and 6q24, and NHSL1 is more broadly expressed than NHS in human fetal tissues. Conclusions: This study reports the independent identification of the gene causative for Nance-Horan syndrome and extends the number of mutations identified. PMID:15466011
Valentini, Giorgio; Paccanaro, Alberto; Caniza, Horacio; Romero, Alfonso E; Re, Matteo
2014-06-01
In the context of "network medicine", gene prioritization methods represent one of the main tools to discover candidate disease genes by exploiting the large amount of data covering different types of functional relationships between genes. Several works proposed to integrate multiple sources of data to improve disease gene prioritization, but to our knowledge no systematic studies focused on the quantitative evaluation of the impact of network integration on gene prioritization. In this paper, we aim at providing an extensive analysis of gene-disease associations not limited to genetic disorders, and a systematic comparison of different network integration methods for gene prioritization. We collected nine different functional networks representing different functional relationships between genes, and we combined them through both unweighted and weighted network integration methods. We then prioritized genes with respect to each of the considered 708 medical subject headings (MeSH) diseases by applying classical guilt-by-association, random walk and random walk with restart algorithms, and the recently proposed kernelized score functions. The results obtained with classical random walk algorithms and the best single network achieved an average area under the curve (AUC) across the 708 MeSH diseases of about 0.82, while kernelized score functions and network integration boosted the average AUC to about 0.89. Weighted integration, by exploiting the different "informativeness" embedded in different functional networks, outperforms unweighted integration at 0.01 significance level, according to the Wilcoxon signed rank sum test. For each MeSH disease we provide the top-ranked unannotated candidate genes, available for further bio-medical investigation. Network integration is necessary to boost the performances of gene prioritization methods. Moreover the methods based on kernelized score functions can further enhance disease gene ranking results, by adopting both local and global learning strategies, able to exploit the overall topology of the network. Copyright © 2014 The Authors. Published by Elsevier B.V. All rights reserved.
Zhang, R L; Samuelson, D A; Zhang, Z G; Reddy, V N; Shastry, B S
1991-08-01
The congenital hereditary cataracts and microphthalmia in the miniature schnauzer dog are inherited by an autosomal recessive mode. To understand the genetic basis of these diseases, the authors purified and analyzed leukocyte deoxyribonucleic acid (DNA) from affected and normal animals using a candidate gene approach. Because the genes that encode the lens-specific proteins, specifically, alpha, beta, and gamma crystallins and the membrane protein (MP26), are known to maintain the structure and function of the lens, the authors used complimentary DNA (cDNA) fragments that corresponded to the above genes to search for the mutations at their loci in the affected animals. They found no evidence of the gene deletion and rearrangement in any of the five loci. In addition, the hybridizable sequences of the dog DNA to the specific probes for the human chromosome 4 and 18 loci, which are reported to be involved in the abnormality of the human eye, seem to be unaffected. These data support the notion that the hereditary cataracts and microphthalmia in the dog may be associated with genes other than those reported for several animal systems.
Baines, John F.; Roller, Julia; Saminadin-Peter, Sarah S.; Parsch, John; Jiggins, Francis M.
2009-01-01
Background Bacterial and fungal infections induce a potent immune response in Drosophila melanogaster, but it is unclear whether viral infections induce an antiviral immune response. Using microarrays, we examined the changes in gene expression in Drosophila that occur in response to infection with the sigma virus, a negative-stranded RNA virus (Rhabdoviridae) that occurs in wild populations of D. melanogaster. Principal Findings We detected many changes in gene expression in infected flies, but found no evidence for the activation of the Toll, IMD or Jak-STAT pathways, which control immune responses against bacteria and fungi. We identified a number of functional categories of genes, including serine proteases, ribosomal proteins and chorion proteins that were overrepresented among the differentially expressed genes. We also found that the sigma virus alters the expression of many more genes in males than in females. Conclusions These data suggest that either Drosophila do not mount an immune response against the sigma virus, or that the immune response is not controlled by known immune pathways. If the latter is true, the genes that we identified as differentially expressed after infection are promising candidates for controlling the host's response to the sigma virus. PMID:19718442
Carpenter, Jennifer; Hutter, Stephan; Baines, John F; Roller, Julia; Saminadin-Peter, Sarah S; Parsch, John; Jiggins, Francis M
2009-08-31
Bacterial and fungal infections induce a potent immune response in Drosophila melanogaster, but it is unclear whether viral infections induce an antiviral immune response. Using microarrays, we examined the changes in gene expression in Drosophila that occur in response to infection with the sigma virus, a negative-stranded RNA virus (Rhabdoviridae) that occurs in wild populations of D. melanogaster. We detected many changes in gene expression in infected flies, but found no evidence for the activation of the Toll, IMD or Jak-STAT pathways, which control immune responses against bacteria and fungi. We identified a number of functional categories of genes, including serine proteases, ribosomal proteins and chorion proteins that were overrepresented among the differentially expressed genes. We also found that the sigma virus alters the expression of many more genes in males than in females. These data suggest that either Drosophila do not mount an immune response against the sigma virus, or that the immune response is not controlled by known immune pathways. If the latter is true, the genes that we identified as differentially expressed after infection are promising candidates for controlling the host's response to the sigma virus.
Ectodermal dysplasias: a new clinical-genetic classification
Priolo, M.; Lagana, C.
2001-01-01
The ectodermal dysplasias (EDs) are a large and complex nosological group of diseases, first described by Thurnam in 1848. In the last 10 years more than 170 different pathological clinical conditions have been recognised and defined as EDs, all sharing in common anomalies of the hair, teeth, nails, and sweat glands. Many are associated with anomalies in other organs and systems and, in some conditions, with mental retardation. The anomalies affecting the epidermis and epidermal appendages are extremely variable and clinical overlap is present among the majority of EDs. Most EDs are defined by particular clinical signs (for example, eyelid adhesion in AEC syndrome, ectrodactyly in EEC). To date, few causative genes have been identified for these diseases. We recently reviewed genes known to be responsible for EDs in light of their molecular and biological function and proposed a new approach to EDs, integrating both molecular-genetic data and corresponding clinical findings. Based on our previous report, we now propose a clinical-genetic classification of EDs, expand it to other entities in which no causative genes have been identified based on the phenotype, and speculate on possible candidate genes suggested by associated "non-ectodermal" features. Keywords: ectodermal dysplasia; clinical-functional correlation; epithelial-mesenchymal interaction; ectodermal structural proteins PMID:11546825
Genomic anatomy of the Tyrp1 (brown) deletion complex
Smyth, Ian M.; Wilming, Laurens; Lee, Angela W.; Taylor, Martin S.; Gautier, Phillipe; Barlow, Karen; Wallis, Justine; Martin, Sancha; Glithero, Rebecca; Phillimore, Ben; Pelan, Sarah; Andrew, Rob; Holt, Karen; Taylor, Ruth; McLaren, Stuart; Burton, John; Bailey, Jonathon; Sims, Sarah; Squares, Jan; Plumb, Bob; Joy, Ann; Gibson, Richard; Gilbert, James; Hart, Elizabeth; Laird, Gavin; Loveland, Jane; Mudge, Jonathan; Steward, Charlie; Swarbreck, David; Harrow, Jennifer; North, Philip; Leaves, Nicholas; Greystrong, John; Coppola, Maria; Manjunath, Shilpa; Campbell, Mark; Smith, Mark; Strachan, Gregory; Tofts, Calli; Boal, Esther; Cobley, Victoria; Hunter, Giselle; Kimberley, Christopher; Thomas, Daniel; Cave-Berry, Lee; Weston, Paul; Botcherby, Marc R. M.; White, Sharon; Edgar, Ruth; Cross, Sally H.; Irvani, Marjan; Hummerich, Holger; Simpson, Eleanor H.; Johnson, Dabney; Hunsicker, Patricia R.; Little, Peter F. R.; Hubbard, Tim; Campbell, R. Duncan; Rogers, Jane; Jackson, Ian J.
2006-01-01
Chromosome deletions in the mouse have proven invaluable in the dissection of gene function. The brown deletion complex comprises >28 independent genome rearrangements, which have been used to identify several functional loci on chromosome 4 required for normal embryonic and postnatal development. We have constructed a 172-bacterial artificial chromosome contig that spans this 22-megabase (Mb) interval and have produced a contiguous, finished, and manually annotated sequence from these clones. The deletion complex is strikingly gene-poor, containing only 52 protein-coding genes (of which only 39 are supported by human homologues) and has several further notable genomic features, including several segments of >1 Mb, apparently devoid of a coding sequence. We have used sequence polymorphisms to finely map the deletion breakpoints and identify strong candidate genes for the known phenotypes that map to this region, including three lethal loci (l4Rn1, l4Rn2, and l4Rn3) and the fitness mutant brown-associated fitness (baf). We have also characterized misexpression of the basonuclin homologue, Bnc2, associated with the inversion-mediated coat color mutant white-based brown (Bw). This study provides a molecular insight into the basis of several characterized mouse mutants, which will allow further dissection of this region by targeted or chemical mutagenesis. PMID:16505357
Cavaiuolo, Marina; Cocetta, Giacomo; Spadafora, Natasha Damiana; Müller, Carsten T.; Rogers, Hilary J.
2017-01-01
Diplotaxis tenuifolia L. is of important economic value in the fresh-cut industry for its nutraceutical and sensorial properties. However, information on the molecular mechanisms conferring tolerance of harvested leaves to pre- and postharvest stresses during processing and shelf-life have never been investigated. Here, we provide the first transcriptomic resource of rocket by de novo RNA sequencing assembly, functional annotation and stress-induced expression analysis of 33874 transcripts. Transcriptomic changes in leaves subjected to commercially-relevant pre-harvest (salinity, heat and nitrogen starvation) and postharvest stresses (cold, dehydration, dark, wounding) known to affect quality and shelf-life were analysed 24h after stress treatment, a timing relevant to subsequent processing of salad leaves. Transcription factors and genes involved in plant growth regulator signaling, autophagy, senescence and glucosinolate metabolism were the most affected by the stresses. Hundreds of genes with unknown function but uniquely expressed under stress were identified, providing candidates to investigate stress responses in rocket. Dehydration and wounding had the greatest effect on the transcriptome and different stresses elicited changes in the expression of genes related to overlapping groups of hormones. These data will allow development of approaches targeted at improving stress tolerance, quality and shelf-life of rocket with direct applications in the fresh-cut industries. PMID:28558066
Cavaiuolo, Marina; Cocetta, Giacomo; Spadafora, Natasha Damiana; Müller, Carsten T; Rogers, Hilary J; Ferrante, Antonio
2017-01-01
Diplotaxis tenuifolia L. is of important economic value in the fresh-cut industry for its nutraceutical and sensorial properties. However, information on the molecular mechanisms conferring tolerance of harvested leaves to pre- and postharvest stresses during processing and shelf-life have never been investigated. Here, we provide the first transcriptomic resource of rocket by de novo RNA sequencing assembly, functional annotation and stress-induced expression analysis of 33874 transcripts. Transcriptomic changes in leaves subjected to commercially-relevant pre-harvest (salinity, heat and nitrogen starvation) and postharvest stresses (cold, dehydration, dark, wounding) known to affect quality and shelf-life were analysed 24h after stress treatment, a timing relevant to subsequent processing of salad leaves. Transcription factors and genes involved in plant growth regulator signaling, autophagy, senescence and glucosinolate metabolism were the most affected by the stresses. Hundreds of genes with unknown function but uniquely expressed under stress were identified, providing candidates to investigate stress responses in rocket. Dehydration and wounding had the greatest effect on the transcriptome and different stresses elicited changes in the expression of genes related to overlapping groups of hormones. These data will allow development of approaches targeted at improving stress tolerance, quality and shelf-life of rocket with direct applications in the fresh-cut industries.