Science.gov

Sample records for genome-wide renal gene

  1. Genome-Wide Association Study to Identify Genes Related to Renal Mercury Concentrations in Mice

    PubMed Central

    Alkaissi, Hammoudi; Ekstrand, Jimmy; Jawad, Aksa; Nielsen, Jesper Bo; Havarinasab, Said; Soderkvist, Peter; Hultman, Per

    2016-01-01

    Background: Following human mercury (Hg) exposure, the metal accumulates in considerable concentrations in kidney, liver, and brain. Although the toxicokinetics of Hg have been studied extensively, factors responsible for interindividual variation in humans are largely unknown. Differences in accumulation of renal Hg between inbred mouse strains suggest a genetic interstrain variation regulating retention or/and excretion of Hg. A.SW, DBA/2 and BALB/C mouse strains accumulate higher amounts of Hg than B10.S. Objectives: We aimed to find candidate genes associated with regulation of renal Hg concentrations. Methods: A.SW, B10.S and their F1 and F2 offspring were exposed for 6 weeks to 2.0 mg Hg/L drinking water. Genotyping with microsatellites was conducted on 84 F2 mice for genome-wide scanning with ion pair reverse-phase high-performance liquid chromatography (IP RP HPLC). Quantitative trait loci (QTL) were established. Denaturing HPLC was used to detect single nucleotide polymorphisms for haplotyping and fine mapping in 184 and 32 F2 mice, respectively. Candidate genes (Pprc1, Btrc and Nfkb2) verified by fine mapping and QTL were further investigated by real-time polymerase chain reaction. Genes enhanced by Pprc1 (Nrf1 and Nrf2) were included for gene expression analysis. Results: Renal Hg concentrations differed significantly between A.SW and B10.S mice and between males and females within each strain. QTL analysis showed a peak logarithm of odds ratio score 5.78 on chromosome 19 (p = 0.002). Haplotype and fine mapping associated the Hg accumulation with Pprc1, which encodes PGC-1-related coactivator (PRC), a coactivator for proteins involved in detoxification. Pprc1 and two genes coactivated by Pprc1 (Nrf1 and Nrf2) had significantly lower gene expression in the A.SW strain than in the B10.S strain. Conclusions: This study supports Pprc1 as a key regulator for renal Hg excretion. Citation: Alkaissi H, Ekstrand J, Jawad A, Nielsen JB, Havarinasab S, Soderkvist P

  2. Genome-wide analysis of differentially expressed genes and splicing isoforms in clear cell renal cell carcinoma.

    PubMed

    Valletti, Alessio; Gigante, Margherita; Palumbo, Orazio; Carella, Massimo; Divella, Chiara; Sbisà, Elisabetta; Tullo, Apollonia; Picardi, Ernesto; D'Erchia, Anna Maria; Battaglia, Michele; Gesualdo, Loreto; Pesole, Graziano; Ranieri, Elena

    2013-01-01

    Clear cell renal cell carcinoma (ccRCC) is the most common malignant renal epithelial tumor and also the most deadly. To identify molecular changes occurring in ccRCC, in the present study we performed a genome wide analysis of its entire complement of mRNAs. Gene and exon-level analyses were carried out by means of the Affymetrix Exon Array platform. To achieve a reliable detection of differentially expressed cassette exons we implemented a novel methodology that considered contiguous combinations of exon triplets and candidate differentially expressed cassette exons were identified when the expression level was significantly different only in the central exon of the triplet. More detailed analyses were performed for selected genes using quantitative RT-PCR and confocal laser scanning microscopy. Our analysis detected over 2,000 differentially expressed genes, and about 250 genes alternatively spliced and showed differential inclusion of specific cassette exons comparing tumor and non-tumoral tissues. We demonstrated the presence in ccRCC of an altered expression of the PTP4A3, LAMA4, KCNJ1 and TCF21 genes (at both transcript and protein level). Furthermore, we confirmed, at the mRNA level, the involvement of CAV2 and SFRP genes that have previously been identified. At exon level, among potential candidates we validated a differentially included cassette exon in DAB2 gene with a significant increase of DAB2 p96 splice variant as compared to the p67 isoform. Based on the results obtained, and their robustness according to both statistical analysis and literature surveys, we believe that a combination of gene/isoform expression signature may remarkably contribute, after suitable validation, to a more effective and reliable definition of molecular biomarkers for ccRCC early diagnosis, prognosis and prediction of therapeutic response. PMID:24194935

  3. Genome-Wide Analysis of Differentially Expressed Genes and Splicing Isoforms in Clear Cell Renal Cell Carcinoma

    PubMed Central

    Palumbo, Orazio; Carella, Massimo; Divella, Chiara; Sbisà, Elisabetta; Tullo, Apollonia; Picardi, Ernesto; D’Erchia, Anna Maria; Battaglia, Michele; Gesualdo, Loreto; Pesole, Graziano; Ranieri, Elena

    2013-01-01

    Clear cell renal cell carcinoma (ccRCC) is the most common malignant renal epithelial tumor and also the most deadly. To identify molecular changes occurring in ccRCC, in the present study we performed a genome wide analysis of its entire complement of mRNAs. Gene and exon-level analyses were carried out by means of the Affymetrix Exon Array platform. To achieve a reliable detection of differentially expressed cassette exons we implemented a novel methodology that considered contiguous combinations of exon triplets and candidate differentially expressed cassette exons were identified when the expression level was significantly different only in the central exon of the triplet. More detailed analyses were performed for selected genes using quantitative RT-PCR and confocal laser scanning microscopy. Our analysis detected over 2,000 differentially expressed genes, and about 250 genes alternatively spliced and showed differential inclusion of specific cassette exons comparing tumor and non-tumoral tissues. We demonstrated the presence in ccRCC of an altered expression of the PTP4A3, LAMA4, KCNJ1 and TCF21 genes (at both transcript and protein level). Furthermore, we confirmed, at the mRNA level, the involvement of CAV2 and SFRP genes that have previously been identified. At exon level, among potential candidates we validated a differentially included cassette exon in DAB2 gene with a significant increase of DAB2 p96 splice variant as compared to the p67 isoform. Based on the results obtained, and their robustness according to both statistical analysis and literature surveys, we believe that a combination of gene/isoform expression signature may remarkably contribute, after suitable validation, to a more effective and reliable definition of molecular biomarkers for ccRCC early diagnosis, prognosis and prediction of therapeutic response. PMID:24194935

  4. Genome-wide analysis of murine renal distal convoluted tubular cells for the target genes of mineralocorticoid receptor

    SciTech Connect

    Ueda, Kohei; Fujiki, Katsunori; Shirahige, Katsuhiko; Gomez-Sanchez, Celso E.; Fujita, Toshiro; Nangaku, Masaomi; Nagase, Miki

    2014-02-28

    Highlights: • We define a target gene of MR as that with MR-binding to the adjacent region of DNA. • We use ChIP-seq analysis in combination with microarray. • We, for the first time, explore the genome-wide binding profile of MR. • We reveal 5 genes as the direct target genes of MR in the renal epithelial cell-line. - Abstract: Background and objective: Mineralocorticoid receptor (MR) is a member of nuclear receptor family proteins and contributes to fluid homeostasis in the kidney. Although aldosterone-MR pathway induces several gene expressions in the kidney, it is often unclear whether the gene expressions are accompanied by direct regulations of MR through its binding to the regulatory region of each gene. The purpose of this study is to identify the direct target genes of MR in a murine distal convoluted tubular epithelial cell-line (mDCT). Methods: We analyzed the DNA samples of mDCT cells overexpressing 3xFLAG-hMR after treatment with 10{sup −7} M aldosterone for 1 h by chromatin immunoprecipitation with deep-sequence (ChIP-seq) and mRNA of the cell-line with treatment of 10{sup −7} M aldosterone for 3 h by microarray. Results: 3xFLAG-hMR overexpressed in mDCT cells accumulated in the nucleus in response to 10{sup −9} M aldosterone. Twenty-five genes were indicated as the candidate target genes of MR by ChIP-seq and microarray analyses. Five genes, Sgk1, Fkbp5, Rasl12, Tns1 and Tsc22d3 (Gilz), were validated as the direct target genes of MR by quantitative RT-qPCR and ChIP-qPCR. MR binding regions adjacent to Ctgf and Serpine1 were also validated. Conclusions: We, for the first time, captured the genome-wide distribution of MR in mDCT cells and, furthermore, identified five MR target genes in the cell-line. These results will contribute to further studies on the mechanisms of kidney diseases.

  5. Genome-wide screening of copy number alterations and LOH events in renal cell carcinomas and integration with gene expression profile

    PubMed Central

    Cifola, Ingrid; Spinelli, Roberta; Beltrame, Luca; Peano, Clelia; Fasoli, Ester; Ferrero, Stefano; Bosari, Silvano; Signorini, Stefano; Rocco, Francesco; Perego, Roberto; Proserpio, Vanessa; Raimondo, Francesca; Mocarelli, Paolo; Battaglia, Cristina

    2008-01-01

    Background Clear cell renal carcinoma (RCC) is the most common and invasive adult renal cancer. For the purpose of identifying RCC biomarkers, we investigated chromosomal regions and individual genes modulated in RCC pathology. We applied the dual strategy of assessing and integrating genomic and transcriptomic data, today considered the most effective approach for understanding genetic mechanisms of cancer and the most sensitive for identifying cancer-related genes. Results We performed the first integrated analysis of DNA and RNA profiles of RCC samples using Affymetrix technology. Using 100K SNP mapping arrays, we assembled a genome-wide map of DNA copy number alterations and LOH areas. We thus confirmed the typical genetic signature of RCC but also identified other amplified regions (e.g. on chr. 4, 11, 12), deleted regions (chr. 1, 9, 22) and LOH areas (chr. 1, 2, 9, 13). Simultaneously, using HG-U133 Plus 2.0 arrays, we identified differentially expressed genes (DEGs) in tumor vs. normal samples. Combining genomic and transcriptomic data, we identified 71 DEGs in aberrant chromosomal regions and observed, in amplified regions, a predominance of up-regulated genes (27 of 37 DEGs) and a trend to clustering. Functional annotation of these genes revealed some already implicated in RCC pathology and other cancers, as well as others that may be novel tumor biomarkers. Conclusion By combining genomic and transcriptomic profiles from a collection of RCC samples, we identified specific genomic regions with concordant alterations in DNA and RNA profiles and focused on regions with increased DNA copy number. Since the transcriptional modulation of up-regulated genes in amplified regions may be attributed to the genomic alterations characteristic of RCC, these genes may encode novel RCC biomarkers actively involved in tumor initiation and progression and useful in clinical applications. PMID:18194544

  6. Gene Fusion: A Genome Wide Survey

    NASA Technical Reports Server (NTRS)

    Liang, Ping; Riley, Monica

    2001-01-01

    As a well known fact, organisms form larger and complex multimodular (composite or chimeric) and mostly multi-functional proteins through gene fusion of two or more individual genes which have independent evolution histories and functions. We call each of these components a module. The existence of multimodular proteins may improves the efficiency in gene regulation and in cellular functions, and thus may give the host organism advantages in adaptation to environments. Analysis of all gene fusions in present-day organisms should allow us to examine the patterns of gene fusion in context with cellular functions, to trace back the evolution processes from the ancient smaller and uni-functional proteins to the present-day larger and complex multi-functional proteins, and to estimate the minimal number of ancestor proteins that existed in the last common ancestor for all life on earth. Although many multimodular proteins have been experimentally known, identification of gene fusion events systematically at genome scale had not been possible until recently when large number of completed genome sequences have been becoming available. In addition, technical difficulties for such analysis also exist due to the complexity of this biological and evolutionary process. We report from this study a new strategy to computationally identify multimodular proteins using completed genome sequences and the results surveyed from 22 organisms with the data from over 40 organisms to be presented during the meeting. Additional information is contained in the original extended abstract.

  7. Genome-wide copy number variation analysis of a Branchio-Oto-Renal syndrome cohort identifies a recombination hotspot and implicates new candidate genes

    PubMed Central

    Brophy, Patrick D.; Alasti, Fatemeh; Darbro, Benjamin W.; Clarke, Jason; Nishimura, Carla; Cobb, Bryan; Smith, Richard J.; Manak, J. Robert

    2013-01-01

    Branchio-oto-renal (BOR) syndrome is an autosomal dominant disorder characterized by branchial arch anomalies, hearing loss and renal dysmorphology. Although haploinsufficiency of EYA1 and SIX1 are known to cause BOR, copy number variation analysis has only been performed on a limited number of BOR patients. In this study, we used high-resolution array-based comparative genomic hybridization (aCGH) on 32 BOR probands negative for coding-sequence and splice-site mutations in known BOR-causing genes to identify potential disease-causing genomic rearrangements. Of the >1,000 rare and novel copy number variants (CNVs) we identified, four were heterozygous deletions of EYA1 and several downstream genes that had nearly identical breakpoints associated with retroviral sequence blocks, suggesting that non-allelic homologous recombination seeded by this recombination hotspot is important in the pathogenesis of BOR. A different heterozygous deletion removing the last exon of EYA1 was identified in an additional proband. Thus in total 5 probands (14%) had deletions of all or part of EYA1. Using a novel disease-gene prioritization strategy that includes network analysis of genes associated with other deletions suggests that SHARPIN (Sipl1), FGF3 and the HOXA gene cluster may contribute to the pathogenesis of BOR. PMID:23851940

  8. Genome-wide methylation analysis of tubulocystic and papillary renal cell carcinomas.

    PubMed

    Korabecna, M; Geryk, J; Hora, M; Steiner, P; Seda, O; Tesar, V

    2016-01-01

    Tubulocystic renal cell carcinoma (TRCC) represents a rare tumor with incidence lower than 1 % of all renal carcinomas. This study was undertaken to contribute to characterization of molecular signatures associated with TRCC and to compare them with the features of papillary renal cell carcinoma (PRCC) at the level of genome wide methylation analysis.We performed methylated DNA immunoprecipitation (MeDIP) coupled with microarray analysis (Roche NimbleGen). Using the CHARM package, we compared the levels of gene methylation between paired samples of tumors and control renal tissues of each examined individual. We found significant global demethylation in all tumor samples in comparison with adjacent kidney tissues of normal histological appearance but no significant differences in gene methylation between the both compared tumor entities. Therefore we focused on characterization of differentially methylated regions between both tumors and control tissues. We found 42 differentially methylated genes.Hypermethylated genes for protocadherins (PCDHG) and genes coding for products associated with functions of plasma membrane were evaluated as significantly overrepresented among hypermethylated genes detected in both types of renal cell carcinomas.In our pilot study, we provide the first evidence that identical features in the process of carcinogenesis leading to TRCC and/or to PRCC may be found at the gene methylation level.

  9. Genome-wide identification of long noncoding RNAs in rat models of cardiovascular and renal disease.

    PubMed

    Gopalakrishnan, Kathirvel; Kumarasamy, Sivarajan; Mell, Blair; Joe, Bina

    2015-01-01

    Long noncoding RNAs (lncRNAs) are an emerging class of genomic regulatory molecules reported in various species. In the rat, which is one of the major mammalian model organisms, discovery of lncRNAs on a genome-wide scale is lagging. Renal lncRNA sequencing and lncRNA transcriptome analysis were conducted in 3 rat strains that are widely used in cardiovascular and renal research: the Dahl salt-sensitive rat, the spontaneously hypertensive rat, and the Dahl salt-resistant rat. Through the RNA sequencing approach, 3273 transcripts were identified as rat lncRNAs. A majority of lncRNAs were without predicted target genes. Differential expression of 273 and 749 lncRNAs was detected between Dahl salt-sensitive versus Dahl salt-resistant and Dahl salt-sensitive versus spontaneously hypertensive rat comparisons, respectively. To couple the observed differential expression of lncRNAs with the status of mRNAs, an mRNA transcriptome analysis was conducted. Several cis mRNA genes were coregulated with lncRNAs. Of these, the protein expression status of 4 target genes, Asb3, Chac2, Pex11b, and Sp5, were differentially expressed between the relevant strain comparisons, thereby suggesting that the differentially expressed lncRNAs associated with these genes are candidate genetic determinants of blood pressure. This study serves as a first-generation catalog of rat lncRNAs and illustrates the prioritization of lncRNAs as candidates for complex polygenic traits. PMID:25385761

  10. Meta-analysis of genome-wide linkage scans for renal function traits

    PubMed Central

    Rao, Madhumathi; Mottl, Amy K.; Cole, Shelley A.; Umans, Jason G.; Freedman, Barry I.; Bowden, Donald W.; Langefeld, Carl D.; Fox, Caroline S.; Yang, Qiong; Cupples, Adrienne; Iyengar, Sudha K.; Hunt, Steven C.

    2012-01-01

    Background. Several genome scans have explored the linkage of chronic kidney disease phenotypes to chromosomic regions with disparate results. Genome scan meta-analysis (GSMA) is a quantitative method to synthesize linkage results from independent studies and assess their concordance. Methods. We searched PubMed to identify genome linkage analyses of renal function traits in humans, such as estimated glomerular filtration rate (GFR), albuminuria, serum creatinine concentration and creatinine clearance. We contacted authors for numerical data and extracted information from individual studies. We applied the GSMA nonparametric approach to combine results across 14 linkage studies for GFR, 11 linkage studies for albumin creatinine ratio, 11 linkage studies for serum creatinine and 4 linkage studies for creatinine clearance. Results. No chromosomal region reached genome-wide statistical significance in the main analysis which included all scans under each phenotype; however, regions on Chromosomes 7, 10 and 16 reached suggestive significance for linkage to two or more phenotypes. Subgroup analyses by disease status or ethnicity did not yield additional information. Conclusions. While heterogeneity across populations, methodologies and study designs likely explain this lack of agreement, it is possible that linkage scan methodologies lack the resolution for investigating complex traits. Combining family-based linkage studies with genome-wide association studies may be a powerful approach to detect private mutations contributing to complex renal phenotypes. PMID:21622988

  11. Genome-wide microarray analysis of gene expression profiling in major depression and antidepressant therapy.

    PubMed

    Lin, Eugene; Tsai, Shih-Jen

    2016-01-01

    Major depressive disorder (MDD) is a serious health concern worldwide. Currently there are no predictive tests for the effectiveness of any particular antidepressant in an individual patient. Thus, doctors must prescribe antidepressants based on educated guesses. With the recent advent of scientific research, genome-wide gene expression microarray studies are widely utilized to analyze hundreds of thousands of biomarkers by high-throughput technologies. In addition to the candidate-gene approach, the genome-wide approach has recently been employed to investigate the determinants of MDD as well as antidepressant response to therapy. In this review, we mainly focused on gene expression studies with genome-wide approaches using RNA derived from peripheral blood cells. Furthermore, we reviewed their limitations and future directions with respect to the genome-wide gene expression profiling in MDD pathogenesis as well as in antidepressant therapy.

  12. Genome-wide consequences of deleting any single gene

    PubMed Central

    Teng, Xinchen; Dayhoff-Brannigan, Margaret; Cheng, Wen-Chih; Gilbert, Catherine E.; Sing, Cierra N.; Diny, Nicola L.; Wheelan, Sarah J.; Dunham, Maitreya J.; Boeke, Jef D.; Pineda, Fernando J.; Hardwick, J. Marie

    2013-01-01

    Summary Loss or duplication of chromosome segments can lead to further genomic changes associated with cancer. However, it is not known if only a select subset of genes is responsible for driving further changes. To determine if perturbation of any given gene in a genome suffices to drive subsequent genetic changes, we analyzed the yeast knockout collection for secondary mutations of functional consequence. Unlike wild type, most gene knockout strains were found to have one additional mutant gene affecting nutrient responses and/or heat-stress-induced cell death. Moreover, independent knockouts of the same gene often evolved mutations in the same secondary gene. Genome sequencing identified acquired mutations in several human tumor suppressor homologs. Thus, mutation of any single gene may cause a genomic imbalance with consequences sufficient to drive adaptive genetic changes. This complicates genetic analyses, but is a logical consequence of losing a functional unit originally acquired under pressure during evolution. PMID:24211263

  13. Genome-wide analysis of gestational gene-environment interactions in the developing kidney

    PubMed Central

    Yan, Lei; Yao, Xiao; Bachvarov, Dimcho; Saifudeen, Zubaida

    2014-01-01

    The G protein-coupled bradykinin B2 receptor (Bdkrb2) plays an important role in regulation of blood pressure under conditions of excess salt intake. Our previous work has shown that Bdkrb2 also plays a developmental role since Bdkrb2−/− embryos, but not their wild-type or heterozygous littermates, are prone to renal dysgenesis in response to gestational high salt intake. Although impaired terminal differentiation and apoptosis are consistent findings in the Bdkrb2−/− mutant kidneys, the developmental pathways downstream of gene-environment interactions leading to the renal phenotype remain unknown. Here, we performed genome-wide transcriptional profiling on embryonic kidneys from salt-stressed Bdkrb2+/+ and Bdkrb2−/− embryos. The results reveal significant alterations in key pathways regulating Wnt signaling, apoptosis, embryonic development, and cell-matrix interactions. In silico analysis reveal that nearly 12% of differentially regulated genes harbor one or more Pax2 DNA-binding sites in their promoter region. Further analysis shows that metanephric kidneys of salt-stressed Bdkrb2−/− have a significant downregulation of Pax2 gene expression. This was corroborated in Bdkrb2−/−;Pax2GFP+/tg mice, demonstrating that Pax2 transcriptional activity is significantly repressed by gestational salt-Bdkrb2 interactions. We conclude that gestational gene (Bdkrb2) and environment (salt) interactions cooperate to impact gene expression programs in the developing kidney. Suppression of Pax2 likely contributes to the defects in epithelial survival, growth, and differentiation in salt-stressed BdkrB2−/− mice. PMID:25005792

  14. Genome-wide identification of Tribolium dorsoventral patterning genes.

    PubMed

    Stappert, Dominik; Frey, Nadine; von Levetzow, Cornelia; Roth, Siegfried

    2016-07-01

    The gene regulatory network controlling dorsoventral axis formation in insects has undergone drastic evolutionary changes. In Drosophila, a stable long-range gradient of Toll signalling specifies ventral cell fates and restricts BMP signalling to the dorsal half of the embryo. In Tribolium, however, Toll signalling is transient and only indirectly controls BMP signalling. In order to gain unbiased insights into the Tribolium network, we performed comparative transcriptome analyses of embryos with various dorsoventral pattering defects produced by parental RNAi for Toll and BMP signalling components. We also included embryos lacking the mesoderm (produced by Tc-twist RNAi) and characterized similarities and differences between Drosophila and Tribolium twist loss-of-function phenotypes. Using stringent conditions, we identified over 750 differentially expressed genes and analysed a subset with altered expression in more than one knockdown condition. We found new genes with localized expression and showed that conserved genes frequently possess earlier and stronger phenotypes than their Drosophila orthologues. For example, the leucine-rich repeat (LRR) protein Tartan, which has only a minor influence on nervous system development in Drosophila, is essential for early neurogenesis in Tribolium and the Tc-zinc-finger homeodomain protein 1 (Tc-zfh1), the orthologue of which plays a minor role in Drosophila muscle development, is essential for maintaining early Tc-twist expression, indicating an important function for mesoderm specification. PMID:27287803

  15. A genome-wide association search for type 2 diabetes genes in African Americans.

    PubMed

    Palmer, Nicholette D; McDonough, Caitrin W; Hicks, Pamela J; Roh, Bong H; Wing, Maria R; An, S Sandy; Hester, Jessica M; Cooke, Jessica N; Bostrom, Meredith A; Rudock, Megan E; Talbert, Matthew E; Lewis, Joshua P; Ferrara, Assiamira; Lu, Lingyi; Ziegler, Julie T; Sale, Michele M; Divers, Jasmin; Shriner, Daniel; Adeyemo, Adebowale; Rotimi, Charles N; Ng, Maggie C Y; Langefeld, Carl D; Freedman, Barry I; Bowden, Donald W; Voight, Benjamin F; Scott, Laura J; Steinthorsdottir, Valgerdur; Morris, Andrew P; Dina, Christian; Welch, Ryan P; Zeggini, Eleftheria; Huth, Cornelia; Aulchenko, Yurii S; Thorleifsson, Gudmar; McCulloch, Laura J; Ferreira, Teresa; Grallert, Harald; Amin, Najaf; Wu, Guanming; Willer, Cristen J; Raychaudhuri, Soumya; McCarroll, Steve A; Langenberg, Claudia; Hofmann, Oliver M; Dupuis, Josée; Qi, Lu; Segrè, Ayellet V; van Hoek, Mandy; Navarro, Pau; Ardlie, Kristin; Balkau, Beverley; Benediktsson, Rafn; Bennett, Amanda J; Blagieva, Roza; Boerwinkle, Eric; Bonnycastle, Lori L; Boström, Kristina Bengtsson; Bravenboer, Bert; Bumpstead, Suzannah; Burtt, Noël P; Charpentier, Guillaume; Chines, Peter S; Cornelis, Marilyn; Couper, David J; Crawford, Gabe; Doney, Alex S F; Elliott, Katherine S; Elliott, Amanda L; Erdos, Michael R; Fox, Caroline S; Franklin, Christopher S; Ganser, Martha; Gieger, Christian; Grarup, Niels; Green, Todd; Griffin, Simon; Groves, Christopher J; Guiducci, Candace; Hadjadj, Samy; Hassanali, Neelam; Herder, Christian; Isomaa, Bo; Jackson, Anne U; Johnson, Paul R V; Jørgensen, Torben; Kao, Wen H L; Klopp, Norman; Kong, Augustine; Kraft, Peter; Kuusisto, Johanna; Lauritzen, Torsten; Li, Man; Lieverse, Aloysius; Lindgren, Cecilia M; Lyssenko, Valeriya; Marre, Michel; Meitinger, Thomas; Midthjell, Kristian; Morken, Mario A; Narisu, Narisu; Nilsson, Peter; Owen, Katharine R; Payne, Felicity; Perry, John R B; Petersen, Ann-Kristin; Platou, Carl; Proença, Christine; Prokopenko, Inga; Rathmann, Wolfgang; Rayner, N William; Robertson, Neil R; Rocheleau, Ghislain; Roden, Michael; Sampson, Michael J; Saxena, Richa; Shields, Beverley M; Shrader, Peter; Sigurdsson, Gunnar; Sparsø, Thomas; Strassburger, Klaus; Stringham, Heather M; Sun, Qi; Swift, Amy J; Thorand, Barbara; Tichet, Jean; Tuomi, Tiinamaija; van Dam, Rob M; van Haeften, Timon W; van Herpt, Thijs; van Vliet-Ostaptchouk, Jana V; Walters, G Bragi; Weedon, Michael N; Wijmenga, Cisca; Witteman, Jacqueline; Bergman, Richard N; Cauchi, Stephane; Collins, Francis S; Gloyn, Anna L; Gyllensten, Ulf; Hansen, Torben; Hide, Winston A; Hitman, Graham A; Hofman, Albert; Hunter, David J; Hveem, Kristian; Laakso, Markku; Mohlke, Karen L; Morris, Andrew D; Palmer, Colin N A; Pramstaller, Peter P; Rudan, Igor; Sijbrands, Eric; Stein, Lincoln D; Tuomilehto, Jaakko; Uitterlinden, Andre; Walker, Mark; Wareham, Nicholas J; Watanabe, Richard M; Abecasis, Goncalo R; Boehm, Bernhard O; Campbell, Harry; Daly, Mark J; Hattersley, Andrew T; Hu, Frank B; Meigs, James B; Pankow, James S; Pedersen, Oluf; Wichmann, H-Erich; Barroso, Inês; Florez, Jose C; Frayling, Timothy M; Groop, Leif; Sladek, Rob; Thorsteinsdottir, Unnur; Wilson, James F; Illig, Thomas; Froguel, Philippe; van Duijn, Cornelia M; Stefansson, Kari; Altshuler, David; Boehnke, Michael; McCarthy, Mark I; Soranzo, Nicole; Wheeler, Eleanor; Glazer, Nicole L; Bouatia-Naji, Nabila; Mägi, Reedik; Randall, Joshua; Johnson, Toby; Elliott, Paul; Rybin, Denis; Henneman, Peter; Dehghan, Abbas; Hottenga, Jouke Jan; Song, Kijoung; Goel, Anuj; Egan, Josephine M; Lajunen, Taina; Doney, Alex; Kanoni, Stavroula; Cavalcanti-Proença, Christine; Kumari, Meena; Timpson, Nicholas J; Zabena, Carina; Ingelsson, Erik; An, Ping; O'Connell, Jeffrey; Luan, Jian'an; Elliott, Amanda; McCarroll, Steven A; Roccasecca, Rosa Maria; Pattou, François; Sethupathy, Praveen; Ariyurek, Yavuz; Barter, Philip; Beilby, John P; Ben-Shlomo, Yoav; Bergmann, Sven; Bochud, Murielle; Bonnefond, Amélie; Borch-Johnsen, Knut; Böttcher, Yvonne; Brunner, Eric; Bumpstead, Suzannah J; Chen, Yii-Der Ida; Chines, Peter; Clarke, Robert; Coin, Lachlan J M; Cooper, Matthew N; Crisponi, Laura; Day, Ian N M; de Geus, Eco J C; Delplanque, Jerome; Fedson, Annette C; Fischer-Rosinsky, Antje; Forouhi, Nita G; Frants, Rune; Franzosi, Maria Grazia; Galan, Pilar; Goodarzi, Mark O; Graessler, Jürgen; Grundy, Scott; Gwilliam, Rhian; Hallmans, Göran; Hammond, Naomi; Han, Xijing; Hartikainen, Anna-Liisa; Hayward, Caroline; Heath, Simon C; Hercberg, Serge; Hicks, Andrew A; Hillman, David R; Hingorani, Aroon D; Hui, Jennie; Hung, Joe; Jula, Antti; Kaakinen, Marika; Kaprio, Jaakko; Kesaniemi, Y Antero; Kivimaki, Mika; Knight, Beatrice; Koskinen, Seppo; Kovacs, Peter; Kyvik, Kirsten Ohm; Lathrop, G Mark; Lawlor, Debbie A; Le Bacquer, Olivier; Lecoeur, Cécile; Li, Yun; Mahley, Robert; Mangino, Massimo; Manning, Alisa K; Martínez-Larrad, María Teresa; McAteer, Jarred B; McPherson, Ruth; Meisinger, Christa; Melzer, David; Meyre, David; Mitchell, Braxton D; Mukherjee, Sutapa; Naitza, Silvia; Neville, Matthew J; Oostra, Ben A; Orrù, Marco; Pakyz, Ruth; Paolisso, Giuseppe; Pattaro, Cristian; Pearson, Daniel; Peden, John F; Pedersen, Nancy L; Perola, Markus; Pfeiffer, Andreas F H; Pichler, Irene; Polasek, Ozren; Posthuma, Danielle; Potter, Simon C; Pouta, Anneli; Province, Michael A; Psaty, Bruce M; Rayner, Nigel W; Rice, Kenneth; Ripatti, Samuli; Rivadeneira, Fernando; Rolandsson, Olov; Sandbaek, Annelli; Sandhu, Manjinder; Sanna, Serena; Sayer, Avan Aihie; Scheet, Paul; Seedorf, Udo; Sharp, Stephen J; Shields, Beverley; Sijbrands, Eric J G; Silveira, Angela; Simpson, Laila; Singleton, Andrew; Smith, Nicholas L; Sovio, Ulla; Swift, Amy; Syddall, Holly; Syvänen, Ann-Christine; Tanaka, Toshiko; Tönjes, Anke; Uitterlinden, André G; van Dijk, Ko Willems; Varma, Dhiraj; Visvikis-Siest, Sophie; Vitart, Veronique; Vogelzangs, Nicole; Waeber, Gérard; Wagner, Peter J; Walley, Andrew; Ward, Kim L; Watkins, Hugh; Wild, Sarah H; Willemsen, Gonneke; Witteman, Jaqueline C M; Yarnell, John W G; Zelenika, Diana; Zethelius, Björn; Zhai, Guangju; Zhao, Jing Hua; Zillikens, M Carola; Borecki, Ingrid B; Loos, Ruth J F; Meneton, Pierre; Magnusson, Patrik K E; Nathan, David M; Williams, Gordon H; Silander, Kaisa; Salomaa, Veikko; Smith, George Davey; Bornstein, Stefan R; Schwarz, Peter; Spranger, Joachim; Karpe, Fredrik; Shuldiner, Alan R; Cooper, Cyrus; Dedoussis, George V; Serrano-Ríos, Manuel; Lind, Lars; Palmer, Lyle J; Franks, Paul W; Ebrahim, Shah; Marmot, Michael; Kao, W H Linda; Pramstaller, Peter Paul; Wright, Alan F; Stumvoll, Michael; Hamsten, Anders; Buchanan, Thomas A; Valle, Timo T; Rotter, Jerome I; Siscovick, David S; Penninx, Brenda W J H; Boomsma, Dorret I; Deloukas, Panos; Spector, Timothy D; Ferrucci, Luigi; Cao, Antonio; Scuteri, Angelo; Schlessinger, David; Uda, Manuela; Ruokonen, Aimo; Jarvelin, Marjo-Riitta; Waterworth, Dawn M; Vollenweider, Peter; Peltonen, Leena; Mooser, Vincent; Sladek, Robert

    2012-01-01

    African Americans are disproportionately affected by type 2 diabetes (T2DM) yet few studies have examined T2DM using genome-wide association approaches in this ethnicity. The aim of this study was to identify genes associated with T2DM in the African American population. We performed a Genome Wide Association Study (GWAS) using the Affymetrix 6.0 array in 965 African-American cases with T2DM and end-stage renal disease (T2DM-ESRD) and 1029 population-based controls. The most significant SNPs (n = 550 independent loci) were genotyped in a replication cohort and 122 SNPs (n = 98 independent loci) were further tested through genotyping three additional validation cohorts followed by meta-analysis in all five cohorts totaling 3,132 cases and 3,317 controls. Twelve SNPs had evidence of association in the GWAS (P<0.0071), were directionally consistent in the Replication cohort and were associated with T2DM in subjects without nephropathy (P<0.05). Meta-analysis in all cases and controls revealed a single SNP reaching genome-wide significance (P<2.5×10(-8)). SNP rs7560163 (P = 7.0×10(-9), OR (95% CI) = 0.75 (0.67-0.84)) is located intergenically between RND3 and RBM43. Four additional loci (rs7542900, rs4659485, rs2722769 and rs7107217) were associated with T2DM (P<0.05) and reached more nominal levels of significance (P<2.5×10(-5)) in the overall analysis and may represent novel loci that contribute to T2DM. We have identified novel T2DM-susceptibility variants in the African-American population. Notably, T2DM risk was associated with the major allele and implies an interesting genetic architecture in this population. These results suggest that multiple loci underlie T2DM susceptibility in the African-American population and that these loci are distinct from those identified in other ethnic populations. PMID:22238593

  16. A Genome-Wide Association Search for Type 2 Diabetes Genes in African Americans

    PubMed Central

    Palmer, Nicholette D.; McDonough, Caitrin W.; Hicks, Pamela J.; Roh, Bong H.; Wing, Maria R.; An, S. Sandy; Hester, Jessica M.; Cooke, Jessica N.; Bostrom, Meredith A.; Rudock, Megan E.; Talbert, Matthew E.; Lewis, Joshua P.; Ferrara, Assiamira; Lu, Lingyi; Ziegler, Julie T.; Sale, Michele M.; Divers, Jasmin; Shriner, Daniel; Adeyemo, Adebowale; Rotimi, Charles N.; Ng, Maggie C. Y.; Langefeld, Carl D.; Freedman, Barry I.; Bowden, Donald W.

    2012-01-01

    African Americans are disproportionately affected by type 2 diabetes (T2DM) yet few studies have examined T2DM using genome-wide association approaches in this ethnicity. The aim of this study was to identify genes associated with T2DM in the African American population. We performed a Genome Wide Association Study (GWAS) using the Affymetrix 6.0 array in 965 African-American cases with T2DM and end-stage renal disease (T2DM-ESRD) and 1029 population-based controls. The most significant SNPs (n = 550 independent loci) were genotyped in a replication cohort and 122 SNPs (n = 98 independent loci) were further tested through genotyping three additional validation cohorts followed by meta-analysis in all five cohorts totaling 3,132 cases and 3,317 controls. Twelve SNPs had evidence of association in the GWAS (P<0.0071), were directionally consistent in the Replication cohort and were associated with T2DM in subjects without nephropathy (P<0.05). Meta-analysis in all cases and controls revealed a single SNP reaching genome-wide significance (P<2.5×10−8). SNP rs7560163 (P = 7.0×10−9, OR (95% CI) = 0.75 (0.67–0.84)) is located intergenically between RND3 and RBM43. Four additional loci (rs7542900, rs4659485, rs2722769 and rs7107217) were associated with T2DM (P<0.05) and reached more nominal levels of significance (P<2.5×10−5) in the overall analysis and may represent novel loci that contribute to T2DM. We have identified novel T2DM-susceptibility variants in the African-American population. Notably, T2DM risk was associated with the major allele and implies an interesting genetic architecture in this population. These results suggest that multiple loci underlie T2DM susceptibility in the African-American population and that these loci are distinct from those identified in other ethnic populations. PMID:22238593

  17. Genome-wide analysis of homeobox gene family in legumes: identification, gene duplication and expression profiling.

    PubMed

    Bhattacharjee, Annapurna; Ghangal, Rajesh; Garg, Rohini; Jain, Mukesh

    2015-01-01

    Homeobox genes encode transcription factors that are known to play a major role in different aspects of plant growth and development. In the present study, we identified homeobox genes belonging to 14 different classes in five legume species, including chickpea, soybean, Medicago, Lotus and pigeonpea. The characteristic differences within homeodomain sequences among various classes of homeobox gene family were quite evident. Genome-wide expression analysis using publicly available datasets (RNA-seq and microarray) indicated that homeobox genes are differentially expressed in various tissues/developmental stages and under stress conditions in different legumes. We validated the differential expression of selected chickpea homeobox genes via quantitative reverse transcription polymerase chain reaction. Genome duplication analysis in soybean indicated that segmental duplication has significantly contributed in the expansion of homeobox gene family. The Ka/Ks ratio of duplicated homeobox genes in soybean showed that several members of this family have undergone purifying selection. Moreover, expression profiling indicated that duplicated genes might have been retained due to sub-functionalization. The genome-wide identification and comprehensive gene expression profiling of homeobox gene family members in legumes will provide opportunities for functional analysis to unravel their exact role in plant growth and development.

  18. Genome-wide analysis and identification of genes related to expansin gene family in indica rice.

    PubMed

    Hemalatha, N; Rajesh, M K; Narayanan, N K

    2011-01-01

    In this study, we carried out genome-wide analyses to explore expansin gene family in the genome of indica rice. Reference nucleotides were chosen as query sequences for searches in the indica rice genome database. Clones having genomic sequences similar to expansin were taken and converted to amino acid sequences. Putative sequences were subjected to PROSITE and Pfam databases, and 21 signature-sequences-related expansin gene family was obtained. The presence of transmembrane domains was also predicted for all 21 expansin proteins. A phylogenetic tree was generated from the alignments of the proteins sequences to examine the phylogenetic relationship of indica rice expansin proteins.

  19. Gene-Environment Interactions in Genome-Wide Association Studies: Current Approaches and New Directions

    PubMed Central

    Winham, Stacey J; Biernacka, Joanna M.

    2013-01-01

    Background Complex psychiatric traits have long been thought to be the result of a combination of genetic and environmental factors, and gene-environment interactions are thought to play a crucial role in behavioral phenotypes and the susceptibility and progression of psychiatric disorders. Candidate gene studies to investigate hypothesized gene-environment interactions are now fairly common in human genetic research, and with the shift towards genome-wide association studies, genome-wide gene-environment interaction studies are beginning to emerge. Methods We summarize the basic ideas behind gene-environment interaction, and provide an overview of possible study designs and traditional analysis methods in the context of genome-wide analysis. We then discuss novel approaches beyond the traditional strategy of analyzing the interaction between the environmental factor and each polymorphism individually. Results Two-step filtering approaches that reduce the number of polymorphisms tested for interactions can substantially increase the power of genome-wide gene-environment studies. New analytical methods including data-mining approaches, and gene-level and pathway-level analyses, also have the capacity to improve our understanding of how complex genetic and environmental factors interact to influence psychological and psychiatric traits. Such methods, however, have not yet been utilized much in behavioral and mental health research. Conclusions Although methods to investigate gene-environment interactions are available, there is a need for further development and extension of these methods to identify gene-environment interactions in the context of genome-wide association studies. These novel approaches need to be applied in studies of psychology and psychiatry. PMID:23808649

  20. Genome-wide analysis of homeobox genes from Mesobuthus martensii reveals Hox gene duplication in scorpions.

    PubMed

    Di, Zhiyong; Yu, Yao; Wu, Yingliang; Hao, Pei; He, Yawen; Zhao, Huabin; Li, Yixue; Zhao, Guoping; Li, Xuan; Li, Wenxin; Cao, Zhijian

    2015-06-01

    Homeobox genes belong to a large gene group, which encodes the famous DNA-binding homeodomain that plays a key role in development and cellular differentiation during embryogenesis in animals. Here, one hundred forty-nine homeobox genes were identified from the Asian scorpion, Mesobuthus martensii (Chelicerata: Arachnida: Scorpiones: Buthidae) based on our newly assembled genome sequence with approximately 248 × coverage. The identified homeobox genes were categorized into eight classes including 82 families: 67 ANTP class genes, 33 PRD genes, 11 LIM genes, five POU genes, six SINE genes, 14 TALE genes, five CUT genes, two ZF genes and six unclassified genes. Transcriptome data confirmed that more than half of the genes were expressed in adults. The homeobox gene diversity of the eight classes is similar to the previously analyzed Mandibulata arthropods. Interestingly, it is hypothesized that the scorpion M. martensii may have two Hox clusters. The first complete genome-wide analysis of homeobox genes in Chelicerata not only reveals the repertoire of scorpion, arachnid and chelicerate homeobox genes, but also shows some insights into the evolution of arthropod homeobox genes.

  1. Reverse Engineering of Genome-wide Gene Regulatory Networks from Gene Expression Data.

    PubMed

    Liu, Zhi-Ping

    2015-02-01

    Transcriptional regulation plays vital roles in many fundamental biological processes. Reverse engineering of genome-wide regulatory networks from high-throughput transcriptomic data provides a promising way to characterize the global scenario of regulatory relationships between regulators and their targets. In this review, we summarize and categorize the main frameworks and methods currently available for inferring transcriptional regulatory networks from microarray gene expression profiling data. We overview each of strategies and introduce representative methods respectively. Their assumptions, advantages, shortcomings, and possible improvements and extensions are also clarified and commented.

  2. Genome-wide Association Study and Meta-Analysis Identify ISL1 as Genome-wide Significant Susceptibility Gene for Bladder Exstrophy

    PubMed Central

    Draaken, Markus; Knapp, Michael; Pennimpede, Tracie; Schmidt, Johanna M.; Ebert, Anne-Karolin; Rösch, Wolfgang; Stein, Raimund; Utsch, Boris; Hirsch, Karin; Boemers, Thomas M.; Mangold, Elisabeth; Heilmann, Stefanie; Ludwig, Kerstin U.; Jenetzky, Ekkehart; Zwink, Nadine; Moebus, Susanne; Herrmann, Bernhard G.; Mattheisen, Manuel; Nöthen, Markus M.

    2015-01-01

    The bladder exstrophy-epispadias complex (BEEC) represents the severe end of the uro-rectal malformation spectrum, and is thought to result from aberrant embryonic morphogenesis of the cloacal membrane and the urorectal septum. The most common form of BEEC is isolated classic bladder exstrophy (CBE). To identify susceptibility loci for CBE, we performed a genome-wide association study (GWAS) of 110 CBE patients and 1,177 controls of European origin. Here, an association was found with a region of approximately 220kb on chromosome 5q11.1. This region harbors the ISL1 (ISL LIM homeobox 1) gene. Multiple markers in this region showed evidence for association with CBE, including 84 markers with genome-wide significance. We then performed a meta-analysis using data from a previous GWAS by our group of 98 CBE patients and 526 controls of European origin. This meta-analysis also implicated the 5q11.1 locus in CBE risk. A total of 138 markers at this locus reached genome-wide significance in the meta-analysis, and the most significant marker (rs9291768) achieved a P value of 2.13 × 10−12. No other locus in the meta-analysis achieved genome-wide significance. We then performed murine expression analyses to follow up this finding. Here, Isl1 expression was detected in the genital region within the critical time frame for human CBE development. Genital regions with Isl1 expression included the peri-cloacal mesenchyme and the urorectal septum. The present study identified the first genome-wide significant locus for CBE at chromosomal region 5q11.1, and provides strong evidence for the hypothesis that ISL1 is the responsible candidate gene in this region. PMID:25763902

  3. Genome-wide association study and meta-analysis identify ISL1 as genome-wide significant susceptibility gene for bladder exstrophy.

    PubMed

    Draaken, Markus; Knapp, Michael; Pennimpede, Tracie; Schmidt, Johanna M; Ebert, Anne-Karolin; Rösch, Wolfgang; Stein, Raimund; Utsch, Boris; Hirsch, Karin; Boemers, Thomas M; Mangold, Elisabeth; Heilmann, Stefanie; Ludwig, Kerstin U; Jenetzky, Ekkehart; Zwink, Nadine; Moebus, Susanne; Herrmann, Bernhard G; Mattheisen, Manuel; Nöthen, Markus M; Ludwig, Michael; Reutter, Heiko

    2015-03-01

    The bladder exstrophy-epispadias complex (BEEC) represents the severe end of the uro-rectal malformation spectrum, and is thought to result from aberrant embryonic morphogenesis of the cloacal membrane and the urorectal septum. The most common form of BEEC is isolated classic bladder exstrophy (CBE). To identify susceptibility loci for CBE, we performed a genome-wide association study (GWAS) of 110 CBE patients and 1,177 controls of European origin. Here, an association was found with a region of approximately 220kb on chromosome 5q11.1. This region harbors the ISL1 (ISL LIM homeobox 1) gene. Multiple markers in this region showed evidence for association with CBE, including 84 markers with genome-wide significance. We then performed a meta-analysis using data from a previous GWAS by our group of 98 CBE patients and 526 controls of European origin. This meta-analysis also implicated the 5q11.1 locus in CBE risk. A total of 138 markers at this locus reached genome-wide significance in the meta-analysis, and the most significant marker (rs9291768) achieved a P value of 2.13 × 10-12. No other locus in the meta-analysis achieved genome-wide significance. We then performed murine expression analyses to follow up this finding. Here, Isl1 expression was detected in the genital region within the critical time frame for human CBE development. Genital regions with Isl1 expression included the peri-cloacal mesenchyme and the urorectal septum. The present study identified the first genome-wide significant locus for CBE at chromosomal region 5q11.1, and provides strong evidence for the hypothesis that ISL1 is the responsible candidate gene in this region.

  4. The First Pilot Genome-Wide Gene-Environment Study of Depression in the Japanese Population.

    PubMed

    Otowa, Takeshi; Kawamura, Yoshiya; Tsutsumi, Akizumi; Kawakami, Norito; Kan, Chiemi; Shimada, Takafumi; Umekage, Tadashi; Kasai, Kiyoto; Tokunaga, Katsushi; Sasaki, Tsukasa

    2016-01-01

    Stressful events have been identified as a risk factor for depression. Although gene-environment (G × E) interaction in a limited number of candidate genes has been explored, no genome-wide search has been reported. The aim of the present study is to identify genes that influence the association of stressful events with depression. Therefore, we performed a genome-wide G × E interaction analysis in the Japanese population. A genome-wide screen with 320 subjects was performed using the Affymetrix Genome-Wide Human Array 6.0. Stressful life events were assessed using the Social Readjustment Rating Scale (SRRS) and depression symptoms were assessed with self-rating questionnaires using the Center for Epidemiologic Studies Depression (CES-D) scale. The p values for interactions between single nucleotide polymorphisms (SNPs) and stressful events were calculated using the linear regression model adjusted for sex and age. After quality control of genotype data, a total of 534,848 SNPs on autosomal chromosomes were further analyzed. Although none surpassed the level of the genome-wide significance, a marginal significant association of interaction between SRRS and rs10510057 with depression were found (p = 4.5 × 10-8). The SNP is located on 10q26 near Regulators of G-protein signaling 10 (RGS10), which encodes a regulatory molecule involved in stress response. When we investigated a similar G × E interaction between depression (K6 scale) and work-related stress in an independent sample (n = 439), a significant G × E effect on depression was observed (p = 0.015). Our findings suggest that rs10510057, interacting with stressors, may be involved in depression risk. Incorporating G × E interaction into GWAS can contribute to find susceptibility locus that are potentially missed by conventional GWAS. PMID:27529621

  5. The First Pilot Genome-Wide Gene-Environment Study of Depression in the Japanese Population

    PubMed Central

    Otowa, Takeshi; Kawamura, Yoshiya; Tsutsumi, Akizumi; Kawakami, Norito; Kan, Chiemi; Shimada, Takafumi; Umekage, Tadashi; Kasai, Kiyoto; Tokunaga, Katsushi; Sasaki, Tsukasa

    2016-01-01

    Stressful events have been identified as a risk factor for depression. Although gene–environment (G × E) interaction in a limited number of candidate genes has been explored, no genome-wide search has been reported. The aim of the present study is to identify genes that influence the association of stressful events with depression. Therefore, we performed a genome-wide G × E interaction analysis in the Japanese population. A genome-wide screen with 320 subjects was performed using the Affymetrix Genome-Wide Human Array 6.0. Stressful life events were assessed using the Social Readjustment Rating Scale (SRRS) and depression symptoms were assessed with self-rating questionnaires using the Center for Epidemiologic Studies Depression (CES-D) scale. The p values for interactions between single nucleotide polymorphisms (SNPs) and stressful events were calculated using the linear regression model adjusted for sex and age. After quality control of genotype data, a total of 534,848 SNPs on autosomal chromosomes were further analyzed. Although none surpassed the level of the genome-wide significance, a marginal significant association of interaction between SRRS and rs10510057 with depression were found (p = 4.5 × 10−8). The SNP is located on 10q26 near Regulators of G-protein signaling 10 (RGS10), which encodes a regulatory molecule involved in stress response. When we investigated a similar G × E interaction between depression (K6 scale) and work-related stress in an independent sample (n = 439), a significant G × E effect on depression was observed (p = 0.015). Our findings suggest that rs10510057, interacting with stressors, may be involved in depression risk. Incorporating G × E interaction into GWAS can contribute to find susceptibility locus that are potentially missed by conventional GWAS. PMID:27529621

  6. INTEGRATIVE GENOME-WIDE ANALYSIS OF THE DETERMINANTS OF RNA SPLICING IN KIDNEY RENAL CLEAR CELL CARCINOMA

    PubMed Central

    LEHMANN, KJONG-VAN; KAHLES, ANDRÉ; KANDOTH, CYRIAC; LEE, WILLIAM; SCHULTZ, NIKOLAUS; STEGLE, OLIVER; RÄTSCH, GUNNAR

    2014-01-01

    We present a genome-wide analysis of splicing patterns of 282 kidney renal clear cell carcinoma patients in which we integrate data from whole-exome sequencing of tumor and normal samples, RNA-seq and copy number variation. We proposed a scoring mechanism to compare splicing patterns in tumor samples to normal samples in order to rank and detect tumor-specific isoforms that have a potential for new biomarkers. We identified a subset of genes that show introns only observable in tumor but not in normal samples, ENCODE and GEUVADIS samples. In order to improve our understanding of the underlying genetic mechanisms of splicing variation we performed a large-scale association analysis to find links between somatic or germline variants with alternative splicing events. We identified 915 cis- and trans-splicing quantitative trait loci (sQTL) associated with changes in splicing patterns. Some of these sQTL have previously been associated with being susceptibility loci for cancer and other diseases. Our analysis also allowed us to identify the function of several COSMIC variants showing significant association with changes in alternative splicing. This demonstrates the potential significance of variants affecting alternative splicing events and yields insights into the mechanisms related to an array of disease phenotypes. PMID:25592567

  7. Integrative genome-wide analysis of the determinants of RNA splicing in kidney renal clear cell carcinoma.

    PubMed

    Lehmann, Kjong-Van; Kahles, André; Kandoth, Cyriac; Lee, William; Schultz, Nikolaus; Stegle, Oliver; Rätsch, Gunnar

    2015-01-01

    We present a genome-wide analysis of splicing patterns of 282 kidney renal clear cell carcinoma patients in which we integrate data from whole-exome sequencing of tumor and normal samples, RNA-seq and copy number variation. We proposed a scoring mechanism to compare splicing patterns in tumor samples to normal samples in order to rank and detect tumor-specific isoforms that have a potential for new biomarkers. We identified a subset of genes that show introns only observable in tumor but not in normal samples, ENCODE and GEUVADIS samples. In order to improve our understanding of the underlying genetic mechanisms of splicing variation we performed a large-scale association analysis to find links between somatic or germline variants with alternative splicing events. We identified 915 cis- and trans-splicing quantitative trait loci (sQTL) associated with changes in splicing patterns. Some of these sQTL have previously been associated with being susceptibility loci for cancer and other diseases. Our analysis also allowed us to identify the function of several COSMIC variants showing significant association with changes in alternative splicing. This demonstrates the potential significance of variants affecting alternative splicing events and yields insights into the mechanisms related to an array of disease phenotypes.

  8. Common genes underlying asthma and COPD? Genome-wide analysis on the Dutch hypothesis

    PubMed Central

    Smolonska, Joanna; Koppelman, Gerard H.; Wijmenga, Cisca; Vonk, Judith M.; Zanen, Pieter; Bruinenberg, Marcel; Curjuric, Ivan; Imboden, Medea; Thun, Gian-Andri; Franke, Lude; Probst-Hensch, Nicole M.; Nürnberg, Peter; Riemersma, Roland A.; van Schayck, Onno; Loth, Daan W.; Bruselle, Guy G.; Stricker, Bruno H; Hofman, Albert; Uitterlinden, André G.; Lahousse, Lies; London, Stephanie J.; Loehr, Laura R.; Manichaikul, Ani; Barr, R. Graham; Donohue, Kathleen M.; Rich, Stephen S.; Pare, Peter; Bossé, Yohan; Hao, Ke; van den Berge, Maarten; Groen, Harry J.M.; Lammers, Jan-Willem J.; Mali, Willem; Boezen, H. Marike; Postma, Dirkje S.

    2014-01-01

    Asthma and chronic obstructive pulmonary disease (COPD) are thought to share a genetic background (“Dutch hypothesis”). We investigated whether asthma and COPD have common underlying genetic factors, performing genome-wide association studies for both asthma and COPD and combining the results in meta-analyses. Three loci showed potential involvement in both diseases: chr2p24.3, chr5q23.1 and chr13q14.2, containing DDX1, COMMD10 (both participating in the NFκβ pathway) and GNG5P5, respectively. SNP rs9534578 in GNG5P5 reached genome-wide significance after first stage replication (p=9.96·*10−9). The second stage replication in seven independent cohorts provided no significant replication. eQTL analysis in blood and lung on the top 20 associated SNPs identified two SNPs in COMMD10 influencing gene expression. Inflammatory processes differ in asthma and COPD and are mediated by NFκβ, which could be driven by the same underlying genes, COMMD10 and DDX1. None of the SNPs reached genome-wide significance. Our eQTL studies support a functional role of two COMMD10 SNPs, since they influence gene expression in both blood cells and lung tissue. Our findings either suggest that there is no common genetic component in asthma and COPD or, alternatively, different environmental factors, like lifestyle and occupation in different countries and continents may have obscured the genetic common contribution. PMID:24993907

  9. Genome-Wide Detection of Gene Extinction in Early Mammalian Evolution

    PubMed Central

    Kuraku, Shigehiro; Kuratani, Shigeru

    2011-01-01

    Detecting gene losses is a novel aspect of evolutionary genomics that has been made feasible by whole-genome sequencing. However, research to date has concentrated on elucidating evolutionary patterns of genomic components shared between species, rather than identifying disparities between genomes. In this study, we searched for gene losses in the lineage leading to eutherian mammals. First, as a pilot analysis, we selected five gene families (Wnt, Fgf, Tbx, TGFβ, and Frizzled) for molecular phylogenetic analyses, and identified mammalian lineage-specific losses of Wnt11b, Tbx6L/VegT/tbx16, Nodal-related, ADMP1, ADMP2, Sizzled, and Crescent. Second, automated genome-wide phylogenetic screening was implemented based on this pilot analysis. As a result, we detected 147 chicken genes without eutherian orthologs, which resulted from 141 gene loss events. Our inventory contained a group of regulatory genes governing early embryonic axis formation, such as Noggins, and multiple members of the opsin and prolactin-releasing hormone receptor (“PRLHR”) gene families. Our findings highlight the potential of genome-wide gene phylogeny (“phylome”) analysis in detecting possible rearrangement of gene networks and the importance of identifying losses of ancestral genomic components in analyzing the molecular basis underlying phenotypic evolution. PMID:22094861

  10. Genome-wide detection of gene extinction in early mammalian evolution.

    PubMed

    Kuraku, Shigehiro; Kuratani, Shigeru

    2011-01-01

    Detecting gene losses is a novel aspect of evolutionary genomics that has been made feasible by whole-genome sequencing. However, research to date has concentrated on elucidating evolutionary patterns of genomic components shared between species, rather than identifying disparities between genomes. In this study, we searched for gene losses in the lineage leading to eutherian mammals. First, as a pilot analysis, we selected five gene families (Wnt, Fgf, Tbx, TGFβ, and Frizzled) for molecular phylogenetic analyses, and identified mammalian lineage-specific losses of Wnt11b, Tbx6L/VegT/tbx16, Nodal-related, ADMP1, ADMP2, Sizzled, and Crescent. Second, automated genome-wide phylogenetic screening was implemented based on this pilot analysis. As a result, we detected 147 chicken genes without eutherian orthologs, which resulted from 141 gene loss events. Our inventory contained a group of regulatory genes governing early embryonic axis formation, such as Noggins, and multiple members of the opsin and prolactin-releasing hormone receptor ("PRLHR") gene families. Our findings highlight the potential of genome-wide gene phylogeny ("phylome") analysis in detecting possible rearrangement of gene networks and the importance of identifying losses of ancestral genomic components in analyzing the molecular basis underlying phenotypic evolution. PMID:22094861

  11. Assessing the Genome-Wide Effect of Promoter Region Tandem Repeat Natural Variation on Gene Expression

    PubMed Central

    Elmore, Martha H.; Gibbons, John G.; Rokas, Antonis

    2012-01-01

    Copy number polymorphisms of nucleotide tandem repeat (TR) regions, such as microsatellites and minisatellites, are mutationally reversible and highly abundant in eukaryotic genomes. Studies linking TR polymorphism to phenotypic variation have led some to suggest that TR variation modulates and majorly contributes to phenotypic variation; however, studies in which the authors assess the genome-wide impact of TR variation on phenotype are lacking. To address this question, we quantified relationships between polymorphism levels in 143 genome-wide promoter region TRs across 16 isolates of the filamentous fungus Aspergillus flavus and its ecotype Aspergillus oryzae with expression levels of their downstream genes. We found that only 4.3% of relationships tested were significant; these findings were consistent with models in which TRs act as “tuning,” “volume,” or “optimality” “knobs” of phenotype but not with “switch” models. Furthermore, the promoter regions of differentially expressed genes between A. oryzae and A. flavus did not show TR enrichment, suggesting that genome-wide differences in molecular phenotype between the two species are not significantly associated with TRs. Although in some cases TR polymorphisms do contribute to transcript abundance variation, these results argue that at least in this case, TRs might not be major modulators of variation in phenotype. PMID:23275886

  12. Genome-wide signatures of male-mediated migration shaping the Indian gene pool.

    PubMed

    ArunKumar, GaneshPrasad; Tatarinova, Tatiana V; Duty, Jeff; Rollo, Debra; Syama, Adhikarla; Arun, Varatharajan Santhakumari; Kavitha, Valampuri John; Triska, Petr; Greenspan, Bennett; Wells, R Spencer; Pitchappan, Ramasamy

    2015-09-01

    Multiple questions relating to contributions of cultural and demographical factors in the process of human geographical dispersal remain largely unanswered. India, a land of early human settlement and the resulting diversity is a good place to look for some of the answers. In this study, we explored the genetic structure of India using a diverse panel of 78 males genotyped using the GenoChip. Their genome-wide single-nucleotide polymorphism (SNP) diversity was examined in the context of various covariates that influence Indian gene pool. Admixture analysis of genome-wide SNP data showed high proportion of the Southwest Asian component in all of the Indian samples. Hierarchical clustering based on admixture proportions revealed seven distinct clusters correlating to geographical and linguistic affiliations. Convex hull overlay of Y-chromosomal haplogroups on the genome-wide SNP principal component analysis brought out distinct non-overlapping polygons of F*-M89, H*-M69, L1-M27, O2a-M95 and O3a3c1-M117, suggesting a male-mediated migration and expansion of the Indian gene pool. Lack of similar correlation with mitochondrial DNA clades indicated a shared genetic ancestry of females. We suggest that ancient male-mediated migratory events and settlement in various regional niches led to the present day scenario and peopling of India.

  13. Genome-wide signatures of male-mediated migration shaping the Indian gene pool.

    PubMed

    ArunKumar, GaneshPrasad; Tatarinova, Tatiana V; Duty, Jeff; Rollo, Debra; Syama, Adhikarla; Arun, Varatharajan Santhakumari; Kavitha, Valampuri John; Triska, Petr; Greenspan, Bennett; Wells, R Spencer; Pitchappan, Ramasamy

    2015-09-01

    Multiple questions relating to contributions of cultural and demographical factors in the process of human geographical dispersal remain largely unanswered. India, a land of early human settlement and the resulting diversity is a good place to look for some of the answers. In this study, we explored the genetic structure of India using a diverse panel of 78 males genotyped using the GenoChip. Their genome-wide single-nucleotide polymorphism (SNP) diversity was examined in the context of various covariates that influence Indian gene pool. Admixture analysis of genome-wide SNP data showed high proportion of the Southwest Asian component in all of the Indian samples. Hierarchical clustering based on admixture proportions revealed seven distinct clusters correlating to geographical and linguistic affiliations. Convex hull overlay of Y-chromosomal haplogroups on the genome-wide SNP principal component analysis brought out distinct non-overlapping polygons of F*-M89, H*-M69, L1-M27, O2a-M95 and O3a3c1-M117, suggesting a male-mediated migration and expansion of the Indian gene pool. Lack of similar correlation with mitochondrial DNA clades indicated a shared genetic ancestry of females. We suggest that ancient male-mediated migratory events and settlement in various regional niches led to the present day scenario and peopling of India. PMID:25994871

  14. Genome-wide evidence for speciation with gene flow in Heliconius butterflies.

    PubMed

    Martin, Simon H; Dasmahapatra, Kanchon K; Nadeau, Nicola J; Salazar, Camilo; Walters, James R; Simpson, Fraser; Blaxter, Mark; Manica, Andrea; Mallet, James; Jiggins, Chris D

    2013-11-01

    Most speciation events probably occur gradually, without complete and immediate reproductive isolation, but the full extent of gene flow between diverging species has rarely been characterized on a genome-wide scale. Documenting the extent and timing of admixture between diverging species can clarify the role of geographic isolation in speciation. Here we use new methodology to quantify admixture at different stages of divergence in Heliconius butterflies, based on whole-genome sequences of 31 individuals. Comparisons between sympatric and allopatric populations of H. melpomene, H. cydno, and H. timareta revealed a genome-wide trend of increased shared variation in sympatry, indicative of pervasive interspecific gene flow. Up to 40% of 100-kb genomic windows clustered by geography rather than by species, demonstrating that a very substantial fraction of the genome has been shared between sympatric species. Analyses of genetic variation shared over different time intervals suggested that admixture between these species has continued since early in speciation. Alleles shared between species during recent time intervals displayed higher levels of linkage disequilibrium than those shared over longer time intervals, suggesting that this admixture took place at multiple points during divergence and is probably ongoing. The signal of admixture was significantly reduced around loci controlling divergent wing patterns, as well as throughout the Z chromosome, consistent with strong selection for Müllerian mimicry and with known Z-linked hybrid incompatibility. Overall these results show that species divergence can occur in the face of persistent and genome-wide admixture over long periods of time.

  15. Genome-wide landscape of liver X receptor chromatin binding and gene regulation in human macrophages

    PubMed Central

    2012-01-01

    Background The liver X receptors (LXRs) are oxysterol sensing nuclear receptors with multiple effects on metabolism and immune cells. However, the complete genome-wide cistrome of LXR in cells of human origin has not yet been provided. Results We performed ChIP-seq in phorbol myristate acetate-differentiated THP-1 cells (macrophage-type) after stimulation with the potent synthetic LXR ligand T0901317 (T09). Microarray gene expression analysis was performed in the same cellular model. We identified 1357 genome-wide LXR locations (FDR < 1%), of which 526 were observed after T09 treatment. De novo analysis of LXR binding sequences identified a DR4-type element as the major motif. On mRNA level T09 up-regulated 1258 genes and repressed 455 genes. Our results show that LXR actions are focused on 112 genomic regions that contain up to 11 T09 target genes per region under the control of highly stringent LXR binding sites with individual constellations for each region. We could confirm that LXR controls lipid metabolism and transport and observed a strong association with apoptosis-related functions. Conclusions This first report on genome-wide binding of LXR in a human cell line provides new insights into the transcriptional network of LXR and its target genes with their link to physiological processes, such as apoptosis. The gene expression microarray and sequence data have been submitted collectively to the NCBI Gene Expression Omnibus http://www.ncbi.nlm.nih.gov/geo under accession number GSE28319. PMID:22292898

  16. Genome-Wide Expression Profiles Identify Potential Targets for Gene by Environment Interactions in Asthma Severity

    PubMed Central

    Sordillo, Joanne E; Kelly, Roxanne; Bunyavanich, Supinda; McGeachie, Michael; Qiu, Weiliang; Croteau-Chonka, Damien C.; Soto-Quiros, Manuel; Avila, Lydiana; Celedón, Juan C.; Brehm, John M.; Weiss, Scott T; Gold, Diane R; Litonjua, Augusto A

    2015-01-01

    Background Gene by environment interaction (G × E) studies utilizing GWAS data are often underpowered after adjustment for multiple comparisons. Differential gene expression, in response to the exposure of interest, may capture the most biologically relevant genes at the genome-wide level. Methods We used differential genome-wide expression profiles from the Home Allergens and Asthma Birth cohort in response to Der f 1 allergen (sensitized vs. non-sensitized) to inform a G × E study of dust mite exposure and asthma severity. Polymorphisms in differentially expressed genes were identified in GWAS data from CAMP, a clinical trial in childhood asthmatics. Home dust mite allergen (< or ≥ 10µg/g dust) was assessed at baseline, and (≥ 1) severe asthma exacerbation (emergency room (ER) visit or hospitalization for asthma in the first trial year) served as the disease severity outcome. The Genetics of Asthma in Costa Rica (GACRS) study, and a Puerto Rico/Connecticut asthma cohortwere used for replication. Results IL-9, IL-5 and PRG2 expression was up-regulated in Der f 1 stimulated PBMCs from dust mite sensitized individuals (adj. p value <0.04). IL-9 polymorphisms (rs11741137, rs2069885, rs1859430) showed evidence for interaction with dust mite in CAMP (p=0.02 to 0.03), with replication in GACRS (p=0.04). Subjects with the dominant genotype for these IL-9 polymorphisms were more likely to report a severe asthma exacerbation if exposed to elevated dust mite. Conclusions Genome-wide differential gene expression in response to dust mite allergen identified IL-9, a biologically plausible gene target that may interact with environmental dust mite to increase severe asthma exacerbations in children. PMID:25913104

  17. Selecting causal genes from genome-wide association studies via functionally-coherent subnetworks

    PubMed Central

    Taşan, Murat; Musso, Gabriel; Hao, Tong; Vidal, Marc; MacRae, Calum A.; Roth, Frederick P.

    2015-01-01

    While genome-wide association (GWA) studies have linked thousands of loci to human diseases, the causal genes and variants at these loci generally remain unknown. Although investigators typically focus on genes closest to the associated polymorphisms, the causal gene is often more distal. Relying on the literature to help prioritize additional candidate genes at associated loci can draw attention away from less-characterized causal genes. Here we describe a strategy that uses genome-scale ‘co-function’ networks to identify sets of mutually functionally related genes spanning multiple GWA loci. Using associations from ~100 GWA studies covering ten cancer types, this approach outperforms the common alternative strategy in ranking known cancer genes. The strategy’s power grows with more GWA loci, offering an increasing opportunity to elucidate causes of complex human disease. PMID:25532137

  18. Genome-Wide Significant Association between Alcohol Dependence and a Variant in the ADH Gene Cluster

    PubMed Central

    Frank, Josef; Cichon, Sven; Treutlein, Jens; Ridinger, Monika; Mattheisen, Manuel; Hoffmann, Per; Herms, Stefan; Wodarz, Norbert; Soyka, Michael; Zill, Peter; Maier, Wolfgang; Mössner, Rainald; Gaebel, Wolfgang; Dahmen, Norbert; Scherbaum, Norbert; Schmäl, Christine; Steffens, Michael; Lucae, Susanne; Ising, Marcus; Müller-Myhsok, Bertram; Nöthen, Markus M; Mann, Karl; Kiefer, Falk; Rietschel, Marcella

    2011-01-01

    Alcohol dependence (AD) is an important contributory factor to the global burden of disease. The etiology of AD involves both environmental and genetic factors, and the disorder has a heritability of around 50%. The aim of the present study was to identify susceptibility genes for AD by performing a genome-wide association study (GWAS). The sample comprised 1,333 male in-patients with severe DSM-IV AD and 2,168 controls. These included 487 patients and 1,358 controls from a previous GWAS study by our group. All individuals were of German descent. Single marker tests and a polygenic score based analysis to assess the combined contribution of multiple markers with small effects were performed. The SNP rs1789891, which is located between the ADH1B and ADH1C genes, achieved genome-wide significance (p=1.27E–8; OR=1.46). Other markers from this region were also associated with AD, and conditional analyses indicated that these made a partially independent contribution. The SNP rs1789891 is in complete linkage disequilibrium with the functional Arg272Gln variant (p=1.24E–7, OR=1.31) of the ADH1C gene, which has been reported to modify the rate of ethanol oxidation to acetaldehyde in vitro. A polygenic score based approach produced a significant result (p=9.66E–9). This is the first GWAS of AD to provide genome-wide significant support for the role of the ADH gene cluster and to suggest a polygenic component to the etiology of AD. The latter result suggests that many more AD susceptibility genes still await identification. PMID:22004471

  19. Genome-wide significant association between alcohol dependence and a variant in the ADH gene cluster.

    PubMed

    Frank, Josef; Cichon, Sven; Treutlein, Jens; Ridinger, Monika; Mattheisen, Manuel; Hoffmann, Per; Herms, Stefan; Wodarz, Norbert; Soyka, Michael; Zill, Peter; Maier, Wolfgang; Mössner, Rainald; Gaebel, Wolfgang; Dahmen, Norbert; Scherbaum, Norbert; Schmäl, Christine; Steffens, Michael; Lucae, Susanne; Ising, Marcus; Müller-Myhsok, Bertram; Nöthen, Markus M; Mann, Karl; Kiefer, Falk; Rietschel, Marcella

    2012-01-01

    Alcohol dependence (AD) is an important contributory factor to the global burden of disease. The etiology of AD involves both environmental and genetic factors, and the disorder has a heritability of around 50%. The aim of the present study was to identify susceptibility genes for AD by performing a genome-wide association study (GWAS). The sample comprised 1333 male in-patients with severe AD according to the Diagnostic and Statistical Manual of Mental Disorders, 4th edition, and 2168 controls. These included 487 patients and 1358 controls from a previous GWAS study by our group. All individuals were of German descent. Single-marker tests and a polygenic score-based analysis to assess the combined contribution of multiple markers with small effects were performed. The single nucleotide polymorphism (SNP) rs1789891, which is located between the ADH1B and ADH1C genes, achieved genome-wide significance [P = 1.27E-8, odds ratio (OR) = 1.46]. Other markers from this region were also associated with AD, and conditional analyses indicated that these made a partially independent contribution. The SNP rs1789891 is in complete linkage disequilibrium with the functional Arg272Gln variant (P = 1.24E-7, OR = 1.31) of the ADH1C gene, which has been reported to modify the rate of ethanol oxidation to acetaldehyde in vitro. A polygenic score-based approach produced a significant result (P = 9.66E-9). This is the first GWAS of AD to provide genome-wide significant support for the role of the ADH gene cluster and to suggest a polygenic component to the etiology of AD. The latter result may indicate that many more AD susceptibility genes still await identification.

  20. Chronic periodontitis genome-wide association studies: gene-centric and gene set enrichment analyses.

    PubMed

    Rhodin, K; Divaris, K; North, K E; Barros, S P; Moss, K; Beck, J D; Offenbacher, S

    2014-09-01

    Recent genome-wide association studies (GWAS) of chronic periodontitis (CP) offer rich data sources for the investigation of candidate genes, functional elements, and pathways. We used GWAS data of CP (n = 4,504) and periodontal pathogen colonization (n = 1,020) from a cohort of adult Americans of European descent participating in the Atherosclerosis Risk in Communities study and employed a MAGENTA approach (i.e., meta-analysis gene set enrichment of variant associations) to obtain gene-centric and gene set association results corrected for gene size, number of single-nucleotide polymorphisms, and local linkage disequilibrium characteristics based on the human genome build 18 (National Center for Biotechnology Information build 36). We used the Gene Ontology, Ingenuity, KEGG, Panther, Reactome, and Biocarta databases for gene set enrichment analyses. Six genes showed evidence of statistically significant association: 4 with severe CP (NIN, p = 1.6 × 10(-7); ABHD12B, p = 3.6 × 10(-7); WHAMM, p = 1.7 × 10(-6); AP3B2, p = 2.2 × 10(-6)) and 2 with high periodontal pathogen colonization (red complex-KCNK1, p = 3.4 × 10(-7); Porphyromonas gingivalis-DAB2IP, p = 1.0 × 10(-6)). Top-ranked genes for moderate CP were HGD (p = 1.4 × 10(-5)), ZNF675 (p = 1.5 × 10(-5)), TNFRSF10C (p = 2.0 × 10(-5)), and EMR1 (p = 2.0 × 10(-5)). Loci containing NIN, EMR1, KCNK1, and DAB2IP had showed suggestive evidence of association in the earlier single-nucleotide polymorphism-based analyses, whereas WHAMM and AP2B2 emerged as novel candidates. The top gene sets included severe CP ("endoplasmic reticulum membrane," "cytochrome P450," "microsome," and "oxidation reduction") and moderate CP ("regulation of gene expression," "zinc ion binding," "BMP signaling pathway," and "ruffle"). Gene-centric analyses offer a promising avenue for efficient interrogation of large-scale GWAS data. These results highlight genes in previously identified loci and new candidate genes and pathways

  1. Genome-wide selective sweeps and gene-specific sweeps in natural bacterial populations

    PubMed Central

    Bendall, Matthew L; Stevens, Sarah LR; Chan, Leong-Keat; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Froula, Jeff; Kang, Dongwan; Tringe, Susannah G; Bertilsson, Stefan; Moran, Mary A; Shade, Ashley; Newton, Ryan J; McMahon, Katherine D; Malmstrom, Rex R

    2016-01-01

    Multiple models describe the formation and evolution of distinct microbial phylogenetic groups. These evolutionary models make different predictions regarding how adaptive alleles spread through populations and how genetic diversity is maintained. Processes predicted by competing evolutionary models, for example, genome-wide selective sweeps vs gene-specific sweeps, could be captured in natural populations using time-series metagenomics if the approach were applied over a sufficiently long time frame. Direct observations of either process would help resolve how distinct microbial groups evolve. Here, from a 9-year metagenomic study of a freshwater lake (2005–2013), we explore changes in single-nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in 30 bacterial populations. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied by >1000-fold among populations. SNP allele frequencies also changed dramatically over time within some populations. Interestingly, nearly all SNP variants were slowly purged over several years from one population of green sulfur bacteria, while at the same time multiple genes either swept through or were lost from this population. These patterns were consistent with a genome-wide selective sweep in progress, a process predicted by the ‘ecotype model' of speciation but not previously observed in nature. In contrast, other populations contained large, SNP-free genomic regions that appear to have swept independently through the populations prior to the study without purging diversity elsewhere in the genome. Evidence for both genome-wide and gene-specific sweeps suggests that different models of bacterial speciation may apply to different populations coexisting in the same environment. PMID:26744812

  2. Genome-wide scan of healthy human connectome discovers SPON1 gene variant influencing dementia severity

    PubMed Central

    Jahanshad, Neda; Rajagopalan, Priya; Hua, Xue; Hibar, Derrek P.; Nir, Talia M.; Toga, Arthur W.; Jack, Clifford R.; Saykin, Andrew J.; Green, Robert C.; Weiner, Michael W.; Medland, Sarah E.; Montgomery, Grant W.; Hansell, Narelle K.; McMahon, Katie L.; de Zubicaray, Greig I.; Martin, Nicholas G.; Wright, Margaret J.; Thompson, Paul M.; Weiner, Michael; Aisen, Paul; Weiner, Michael; Aisen, Paul; Petersen, Ronald; Jack, Clifford R.; Jagust, William; Trojanowski, John Q.; Toga, Arthur W.; Beckett, Laurel; Green, Robert C.; Saykin, Andrew J.; Morris, John; Liu, Enchi; Green, Robert C.; Montine, Tom; Petersen, Ronald; Aisen, Paul; Gamst, Anthony; Thomas, Ronald G.; Donohue, Michael; Walter, Sarah; Gessert, Devon; Sather, Tamie; Beckett, Laurel; Harvey, Danielle; Gamst, Anthony; Donohue, Michael; Kornak, John; Jack, Clifford R.; Dale, Anders; Bernstein, Matthew; Felmlee, Joel; Fox, Nick; Thompson, Paul; Schuff, Norbert; Alexander, Gene; DeCarli, Charles; Jagust, William; Bandy, Dan; Koeppe, Robert A.; Foster, Norm; Reiman, Eric M.; Chen, Kewei; Mathis, Chet; Morris, John; Cairns, Nigel J.; Taylor-Reinwald, Lisa; Trojanowki, J.Q.; Shaw, Les; Lee, Virginia M.Y.; Korecka, Magdalena; Toga, Arthur W.; Crawford, Karen; Neu, Scott; Saykin, Andrew J.; Foroud, Tatiana M.; Potkin, Steven; Shen, Li; Khachaturian, Zaven; Frank, Richard; Snyder, Peter J.; Molchan, Susan; Kaye, Jeffrey; Quinn, Joseph; Lind, Betty; Dolen, Sara; Schneider, Lon S.; Pawluczyk, Sonia; Spann, Bryan M.; Brewer, James; Vanderswag, Helen; Heidebrink, Judith L.; Lord, Joanne L.; Petersen, Ronald; Johnson, Kris; Doody, Rachelle S.; Villanueva-Meyer, Javier; Chowdhury, Munir; Stern, Yaakov; Honig, Lawrence S.; Bell, Karen L.; Morris, John C.; Ances, Beau; Carroll, Maria; Leon, Sue; Mintun, Mark A.; Schneider, Stacy; Marson, Daniel; Griffith, Randall; Clark, David; Grossman, Hillel; Mitsis, Effie; Romirowsky, Aliza; deToledo-Morrell, Leyla; Shah, Raj C.; Duara, Ranjan; Varon, Daniel; Roberts, Peggy; Albert, Marilyn; Onyike, Chiadi; Kielb, Stephanie; Rusinek, Henry; de Leon, Mony J.; Glodzik, Lidia; De Santi, Susan; Doraiswamy, P. Murali; Petrella, Jeffrey R.; Coleman, R. Edward; Arnold, Steven E.; Karlawish, Jason H.; Wolk, David; Smith, Charles D.; Jicha, Greg; Hardy, Peter; Lopez, Oscar L.; Oakley, MaryAnn; Simpson, Donna M.; Porsteinsson, Anton P.; Goldstein, Bonnie S.; Martin, Kim; Makino, Kelly M.; Ismail, M. Saleem; Brand, Connie; Mulnard, Ruth A.; Thai, Gaby; Mc-Adams-Ortiz, Catherine; Womack, Kyle; Mathews, Dana; Quiceno, Mary; Diaz-Arrastia, Ramon; King, Richard; Weiner, Myron; Martin-Cook, Kristen; DeVous, Michael; Levey, Allan I.; Lah, James J.; Cellar, Janet S.; Burns, Jeffrey M.; Anderson, Heather S.; Swerdlow, Russell H.; Apostolova, Liana; Lu, Po H.; Bartzokis, George; Silverman, Daniel H.S.; Graff-Radford, Neill R.; Parfitt, Francine; Johnson, Heather; Farlow, Martin R.; Hake, Ann Marie; Matthews, Brandy R.; Herring, Scott; van Dyck, Christopher H.; Carson, Richard E.; MacAvoy, Martha G.; Chertkow, Howard; Bergman, Howard; Hosein, Chris; Black, Sandra; Stefanovic, Bojana; Caldwell, Curtis; Hsiung, Ging-Yuek Robin; Feldman, Howard; Mudge, Benita; Assaly, Michele; Kertesz, Andrew; Rogers, John; Trost, Dick; Bernick, Charles; Munic, Donna; Kerwin, Diana; Mesulam, Marek-Marsel; Lipowski, Kristina; Wu, Chuang-Kuo; Johnson, Nancy; Sadowsky, Carl; Martinez, Walter; Villena, Teresa; Turner, Raymond Scott; Johnson, Kathleen; Reynolds, Brigid; Sperling, Reisa A.; Johnson, Keith A.; Marshall, Gad; Frey, Meghan; Yesavage, Jerome; Taylor, Joy L.; Lane, Barton; Rosen, Allyson; Tinklenberg, Jared; Sabbagh, Marwan; Belden, Christine; Jacobson, Sandra; Kowall, Neil; Killiany, Ronald; Budson, Andrew E.; Norbash, Alexander; Johnson, Patricia Lynn; Obisesan, Thomas O.; Wolday, Saba; Bwayo, Salome K.; Lerner, Alan; Hudson, Leon; Ogrocki, Paula; Fletcher, Evan; Carmichael, Owen; Olichney, John; DeCarli, Charles; Kittur, Smita; Borrie, Michael; Lee, T.-Y.; Bartha, Rob; Johnson, Sterling; Asthana, Sanjay; Carlsson, Cynthia M.; Potkin, Steven G.; Preda, Adrian; Nguyen, Dana; Tariot, Pierre; Fleisher, Adam; Reeder, Stephanie; Bates, Vernice; Capote, Horacio; Rainka, Michelle; Scharre, Douglas W.; Kataki, Maria; Zimmerman, Earl A.; Celmins, Dzintra; Brown, Alice D.; Pearlson, Godfrey D.; Blank, Karen; Anderson, Karen; Saykin, Andrew J.; Santulli, Robert B.; Schwartz, Eben S.; Sink, Kaycee M.; Williamson, Jeff D.; Garg, Pradeep; Watkins, Franklin; Ott, Brian R.; Querfurth, Henry; Tremont, Geoffrey; Salloway, Stephen; Malloy, Paul; Correia, Stephen; Rosen, Howard J.; Miller, Bruce L.; Mintzer, Jacobo; Longmire, Crystal Flynn; Spicer, Kenneth; Finger, Elizabeth; Rachinsky, Irina; Rogers, John; Kertesz, Andrew; Drost, Dick

    2013-01-01

    Aberrant connectivity is implicated in many neurological and psychiatric disorders, including Alzheimer’s disease and schizophrenia. However, other than a few disease-associated candidate genes, we know little about the degree to which genetics play a role in the brain networks; we know even less about specific genes that influence brain connections. Twin and family-based studies can generate estimates of overall genetic influences on a trait, but genome-wide association scans (GWASs) can screen the genome for specific variants influencing the brain or risk for disease. To identify the heritability of various brain connections, we scanned healthy young adult twins with high-field, high-angular resolution diffusion MRI. We adapted GWASs to screen the brain’s connectivity pattern, allowing us to discover genetic variants that affect the human brain’s wiring. The association of connectivity with the SPON1 variant at rs2618516 on chromosome 11 (11p15.2) reached connectome-wide, genome-wide significance after stringent statistical corrections were enforced, and it was replicated in an independent subsample. rs2618516 was shown to affect brain structure in an elderly population with varying degrees of dementia. Older people who carried the connectivity variant had significantly milder clinical dementia scores and lower risk of Alzheimer’s disease. As a posthoc analysis, we conducted GWASs on several organizational and topological network measures derived from the matrices to discover variants in and around genes associated with autism (MACROD2), development (NEDD4), and mental retardation (UBE2A) significantly associated with connectivity. Connectome-wide, genome-wide screening offers substantial promise to discover genes affecting brain connectivity and risk for brain diseases. PMID:23471985

  3. Genome-wide scan of healthy human connectome discovers SPON1 gene variant influencing dementia severity.

    PubMed

    Jahanshad, Neda; Rajagopalan, Priya; Hua, Xue; Hibar, Derrek P; Nir, Talia M; Toga, Arthur W; Jack, Clifford R; Saykin, Andrew J; Green, Robert C; Weiner, Michael W; Medland, Sarah E; Montgomery, Grant W; Hansell, Narelle K; McMahon, Katie L; de Zubicaray, Greig I; Martin, Nicholas G; Wright, Margaret J; Thompson, Paul M

    2013-03-19

    Aberrant connectivity is implicated in many neurological and psychiatric disorders, including Alzheimer's disease and schizophrenia. However, other than a few disease-associated candidate genes, we know little about the degree to which genetics play a role in the brain networks; we know even less about specific genes that influence brain connections. Twin and family-based studies can generate estimates of overall genetic influences on a trait, but genome-wide association scans (GWASs) can screen the genome for specific variants influencing the brain or risk for disease. To identify the heritability of various brain connections, we scanned healthy young adult twins with high-field, high-angular resolution diffusion MRI. We adapted GWASs to screen the brain's connectivity pattern, allowing us to discover genetic variants that affect the human brain's wiring. The association of connectivity with the SPON1 variant at rs2618516 on chromosome 11 (11p15.2) reached connectome-wide, genome-wide significance after stringent statistical corrections were enforced, and it was replicated in an independent subsample. rs2618516 was shown to affect brain structure in an elderly population with varying degrees of dementia. Older people who carried the connectivity variant had significantly milder clinical dementia scores and lower risk of Alzheimer's disease. As a posthoc analysis, we conducted GWASs on several organizational and topological network measures derived from the matrices to discover variants in and around genes associated with autism (MACROD2), development (NEDD4), and mental retardation (UBE2A) significantly associated with connectivity. Connectome-wide, genome-wide screening offers substantial promise to discover genes affecting brain connectivity and risk for brain diseases.

  4. Rapid and efficient genome-wide characterization of Xanthomonas TAL effector genes

    PubMed Central

    Yu, Yan-Hua; Lu, Ye; He, Yong-Qiang; Huang, Sheng; Tang, Ji-Liang

    2015-01-01

    Xanthomonas TALE transcriptional activators act as virulence or avirulence factors by activating host disease susceptibility or resistance genes. Their specificity is determined by a tandem repeat domain. Some Xanthomonas pathogens contain 10–30 TALEs per strain. Although TALEs play critical roles in pathogenesis, their studies have so far been limited to a few examples, due to their highly repetitive gene structure and extreme similarity among different members, which constrict sequencing and assembling. To facilitate TALE studies, we developed an efficient and rapid pipeline for genome-wide cloning of tal genes as many as possible from a strain. Here, we report the pipeline and its use to identify all 18 tal genes from a newly isolated strain of the rice pathogen Xathomonas oryzae. Target prediction revealed a number of potential rice targets including several notable genes such as genes encoding SWEET, WRKY, Hen1, and BAK1 proteins, which provide candidates for further experimental functional analysis of the TALEs. PMID:26271455

  5. Rapid and efficient genome-wide characterization of Xanthomonas TAL effector genes.

    PubMed

    Yu, Yan-Hua; Lu, Ye; He, Yong-Qiang; Huang, Sheng; Tang, Ji-Liang

    2015-01-01

    Xanthomonas TALE transcriptional activators act as virulence or avirulence factors by activating host disease susceptibility or resistance genes. Their specificity is determined by a tandem repeat domain. Some Xanthomonas pathogens contain 10-30 TALEs per strain. Although TALEs play critical roles in pathogenesis, their studies have so far been limited to a few examples, due to their highly repetitive gene structure and extreme similarity among different members, which constrict sequencing and assembling. To facilitate TALE studies, we developed an efficient and rapid pipeline for genome-wide cloning of tal genes as many as possible from a strain. Here, we report the pipeline and its use to identify all 18 tal genes from a newly isolated strain of the rice pathogen Xathomonas oryzae. Target prediction revealed a number of potential rice targets including several notable genes such as genes encoding SWEET, WRKY, Hen1, and BAK1 proteins, which provide candidates for further experimental functional analysis of the TALEs. PMID:26271455

  6. European genome-wide association study identifies SLC14A1 as a new urinary bladder cancer susceptibility gene

    PubMed Central

    Rafnar, Thorunn; Vermeulen, Sita H.; Sulem, Patrick; Thorleifsson, Gudmar; Aben, Katja K.; Witjes, J. Alfred; Grotenhuis, Anne J.; Verhaegh, Gerald W.; Hulsbergen-van de Kaa, Christina A.; Besenbacher, Soren; Gudbjartsson, Daniel; Stacey, Simon N.; Gudmundsson, Julius; Johannsdottir, Hrefna; Bjarnason, Hjordis; Zanon, Carlo; Helgadottir, Hafdis; Jonasson, Jon Gunnlaugur; Tryggvadottir, Laufey; Jonsson, Eirikur; Geirsson, Gudmundur; Nikulasson, Sigfus; Petursdottir, Vigdis; Bishop, D. Timothy; Chung-Sak, Sei; Choudhury, Ananya; Elliott, Faye; Barrett, Jennifer H.; Knowles, Margaret A.; de Verdier, Petra J.; Ryk, Charlotta; Lindblom, Annika; Rudnai, Peter; Gurzau, Eugene; Koppova, Kvetoslava; Vineis, Paolo; Polidoro, Silvia; Guarrera, Simonetta; Sacerdote, Carlotta; Panadero, Angeles; Sanz-Velez, José I.; Sanchez, Manuel; Valdivia, Gabriel; Garcia-Prats, Maria D.; Hengstler, Jan G.; Selinski, Silvia; Gerullis, Holger; Ovsiannikov, Daniel; Khezri, Abdolaziz; Aminsharifi, Alireza; Malekzadeh, Mahyar; van den Berg, Leonard H.; Ophoff, Roel A.; Veldink, Jan H.; Zeegers, Maurice P.; Kellen, Eliane; Fostinelli, Jacopo; Andreoli, Daniele; Arici, Cecilia; Porru, Stefano; Buntinx, Frank; Ghaderi, Abbas; Golka, Klaus; Mayordomo, José I.; Matullo, Giuseppe; Kumar, Rajiv; Steineck, Gunnar; Kiltie, Anne E.; Kong, Augustine; Thorsteinsdottir, Unnur; Stefansson, Kari; Kiemeney, Lambertus A.

    2011-01-01

    Three genome-wide association studies in Europe and the USA have reported eight urinary bladder cancer (UBC) susceptibility loci. Using extended case and control series and 1000 Genomes imputations of 5 340 737 single-nucleotide polymorphisms (SNPs), we searched for additional loci in the European GWAS. The discovery sample set consisted of 1631 cases and 3822 controls from the Netherlands and 603 cases and 37 781 controls from Iceland. For follow-up, we used 3790 cases and 7507 controls from 13 sample sets of European and Iranian ancestry. Based on the discovery analysis, we followed up signals in the urea transporter (UT) gene SLC14A. The strongest signal at this locus was represented by a SNP in intron 3, rs17674580, that reached genome-wide significance in the overall analysis of the discovery and follow-up groups: odds ratio = 1.17, P = 7.6 × 10−11. SLC14A1 codes for UTs that define the Kidd blood group and are crucial for the maintenance of a constant urea concentration gradient in the renal medulla and, through this, the kidney's ability to concentrate urine. It is speculated that rs17674580, or other sequence variants in LD with it, indirectly modifies UBC risk by affecting urine production. If confirmed, this would support the ‘urogenous contact hypothesis’ that urine production and voiding frequency modify the risk of UBC. PMID:21750109

  7. European genome-wide association study identifies SLC14A1 as a new urinary bladder cancer susceptibility gene.

    PubMed

    Rafnar, Thorunn; Vermeulen, Sita H; Sulem, Patrick; Thorleifsson, Gudmar; Aben, Katja K; Witjes, J Alfred; Grotenhuis, Anne J; Verhaegh, Gerald W; Hulsbergen-van de Kaa, Christina A; Besenbacher, Soren; Gudbjartsson, Daniel; Stacey, Simon N; Gudmundsson, Julius; Johannsdottir, Hrefna; Bjarnason, Hjordis; Zanon, Carlo; Helgadottir, Hafdis; Jonasson, Jon Gunnlaugur; Tryggvadottir, Laufey; Jonsson, Eirikur; Geirsson, Gudmundur; Nikulasson, Sigfus; Petursdottir, Vigdis; Bishop, D Timothy; Chung-Sak, Sei; Choudhury, Ananya; Elliott, Faye; Barrett, Jennifer H; Knowles, Margaret A; de Verdier, Petra J; Ryk, Charlotta; Lindblom, Annika; Rudnai, Peter; Gurzau, Eugene; Koppova, Kvetoslava; Vineis, Paolo; Polidoro, Silvia; Guarrera, Simonetta; Sacerdote, Carlotta; Panadero, Angeles; Sanz-Velez, José I; Sanchez, Manuel; Valdivia, Gabriel; Garcia-Prats, Maria D; Hengstler, Jan G; Selinski, Silvia; Gerullis, Holger; Ovsiannikov, Daniel; Khezri, Abdolaziz; Aminsharifi, Alireza; Malekzadeh, Mahyar; van den Berg, Leonard H; Ophoff, Roel A; Veldink, Jan H; Zeegers, Maurice P; Kellen, Eliane; Fostinelli, Jacopo; Andreoli, Daniele; Arici, Cecilia; Porru, Stefano; Buntinx, Frank; Ghaderi, Abbas; Golka, Klaus; Mayordomo, José I; Matullo, Giuseppe; Kumar, Rajiv; Steineck, Gunnar; Kiltie, Anne E; Kong, Augustine; Thorsteinsdottir, Unnur; Stefansson, Kari; Kiemeney, Lambertus A

    2011-11-01

    Three genome-wide association studies in Europe and the USA have reported eight urinary bladder cancer (UBC) susceptibility loci. Using extended case and control series and 1000 Genomes imputations of 5 340 737 single-nucleotide polymorphisms (SNPs), we searched for additional loci in the European GWAS. The discovery sample set consisted of 1631 cases and 3822 controls from the Netherlands and 603 cases and 37 781 controls from Iceland. For follow-up, we used 3790 cases and 7507 controls from 13 sample sets of European and Iranian ancestry. Based on the discovery analysis, we followed up signals in the urea transporter (UT) gene SLC14A. The strongest signal at this locus was represented by a SNP in intron 3, rs17674580, that reached genome-wide significance in the overall analysis of the discovery and follow-up groups: odds ratio = 1.17, P = 7.6 × 10(-11). SLC14A1 codes for UTs that define the Kidd blood group and are crucial for the maintenance of a constant urea concentration gradient in the renal medulla and, through this, the kidney's ability to concentrate urine. It is speculated that rs17674580, or other sequence variants in LD with it, indirectly modifies UBC risk by affecting urine production. If confirmed, this would support the 'urogenous contact hypothesis' that urine production and voiding frequency modify the risk of UBC.

  8. Genome-wide association studies and gene expression profiles of rheumatoid arthritis

    PubMed Central

    Xiao, X.; Hao, J.; Wen, Y.; Wang, W.; Guo, X.

    2016-01-01

    Objectives The molecular mechanism of rheumatoid arthritis (RA) remains elusive. We conducted a protein-protein interaction network-based integrative analysis of genome-wide association studies (GWAS) and gene expression profiles of RA. Methods We first performed a dense search of RA-associated gene modules by integrating a large GWAS meta-analysis dataset (containing 5539 RA patients and 20 169 healthy controls), protein interaction network and gene expression profiles of RA synovium and peripheral blood mononuclear cells (PBMCs). Gene ontology (GO) enrichment analysis was conducted by DAVID. The protein association networks of gene modules were generated by STRING. Results For RA synovium, the top-ranked gene module is HLA-A, containing TAP2, HLA-A, HLA-C, TAPBP and LILRB1 genes. For RA PBMCs, the top-ranked gene module is GRB7, consisting of HLA-DRB5, HLA-DRA, GRB7, CD63 and KIT genes. Functional enrichment analysis identified three significant GO terms for RA synovium, including antigen processing and presentation of peptide antigen via major histocompatibility complex class I (false discovery rate (FDR) = 4.86 × 10 – 4), antigen processing and presentation of peptide antigen (FDR = 2.33 × 10 – 3) and eukaryotic translation initiation factor 4F complex (FDR = 2.52 × 10 – 2). Conclusion This study reported several RA-associated gene modules and their functional association networks. Cite this article: X. Xiao, J. Hao, Y. Wen, W. Wang, X. Guo, F. Zhang. Genome-wide association studies and gene expression profiles of rheumatoid arthritis: an analysis. Bone Joint Res 2016;5:314–319. DOI: 10.1302/2046-3758.57.2000502. PMID:27445359

  9. Comparison of genome-wide selection strategies to identify furfural tolerance genes in Escherichia coli.

    PubMed

    Glebes, Tirzah Y; Sandoval, Nicholas R; Gillis, Jacob H; Gill, Ryan T

    2015-01-01

    Engineering both feedstock and product tolerance is important for transitioning towards next-generation biofuels derived from renewable sources. Tolerance to chemical inhibitors typically results in complex phenotypes, for which multiple genetic changes must often be made to confer tolerance. Here, we performed a genome-wide search for furfural-tolerant alleles using the TRackable Multiplex Recombineering (TRMR) method (Warner et al. (2010), Nature Biotechnology), which uses chromosomally integrated mutations directed towards increased or decreased expression of virtually every gene in Escherichia coli. We employed various growth selection strategies to assess the role of selection design towards growth enrichments. We also compared genes with increased fitness from our TRMR selection to those from a previously reported genome-wide identification study of furfural tolerance genes using a plasmid-based genomic library approach (Glebes et al. (2014) PLOS ONE). In several cases, growth improvements were observed for the chromosomally integrated promoter/RBS mutations but not for the plasmid-based overexpression constructs. Through this assessment, four novel tolerance genes, ahpC, yhjH, rna, and dicA, were identified and confirmed for their effect on improving growth in the presence of furfural.

  10. Genome-Wide Epigenetic Regulation of Gene Transcription in Maize Seeds

    PubMed Central

    Chai, Zhenguang; Guo, Wenzhu; Chen, Rumei; Wang, Lei; Zhao, Jun; Lang, Zhihong; Fan, Yunliu; Zhao, Jiuran; Zhang, Chunyi

    2015-01-01

    Background Epigenetic regulation is well recognized for its importance in gene expression in organisms. DNA methylation, an important epigenetic mark, has received enormous attention in recent years as it’s a key player in many biological processes. It remains unclear how DNA methylation contributes to gene transcription regulation in maize seeds. Here, we take advantage of recent technologies to examine the genome-wide association of DNA methylation with transcription of four types of DNA sequences, including protein-coding genes, pseudogenes, transposable elements, and repeats in maize embryo and endosperm, respectively. Results The methylation in CG, CHG and CHH contexts plays different roles in the control of gene expression. Methylation around the transcription start sites and transcription stop regions of protein-coding genes is negatively correlated, but in gene bodies positively correlated, to gene expression level. The upstream regions of protein-coding genes are enriched with 24-nt siRNAs and contain high levels of CHH methylation, which is correlated to gene expression level. The analysis of sequence content within CG, CHG, or CHH contexts reveals that only CHH methylation is affected by its local sequences, which is different from Arabidopsis. Conclusions In summary, we conclude that methylation-regulated transcription varies with the types of DNA sequences, sequence contexts or parts of a specific gene in maize seeds and differs from that in other plant species. Our study helps people better understand from a genome-wide viewpoint that how transcriptional expression is controlled by DNA methylation, one of the important factors influencing transcription, and how the methylation is associated with small RNAs. PMID:26469520

  11. Pathway-based analysis using reduced gene subsets in genome-wide association studies

    PubMed Central

    2011-01-01

    Background Single Nucleotide Polymorphism (SNP) analysis only captures a small proportion of associated genetic variants in Genome-Wide Association Studies (GWAS) partly due to small marginal effects. Pathway level analysis incorporating prior biological information offers another way to analyze GWAS's of complex diseases, and promises to reveal the mechanisms leading to complex diseases. Biologically defined pathways are typically comprised of numerous genes. If only a subset of genes in the pathways is associated with disease then a joint analysis including all individual genes would result in a loss of power. To address this issue, we propose a pathway-based method that allows us to test for joint effects by using a pre-selected gene subset. In the proposed approach, each gene is considered as the basic unit, which reduces the number of genetic variants considered and hence reduces the degrees of freedom in the joint analysis. The proposed approach also can be used to investigate the joint effect of several genes in a candidate gene study. Results We applied this new method to a published GWAS of psoriasis and identified 6 biologically plausible pathways, after adjustment for multiple testing. The pathways identified in our analysis overlap with those reported in previous studies. Further, using simulations across a range of gene numbers and effect sizes, we demonstrate that the proposed approach enjoys higher power than several other approaches to detect associated pathways. Conclusions The proposed method could increase the power to discover susceptibility pathways and to identify associated genes using GWAS. In our analysis of genome-wide psoriasis data, we have identified a number of relevant pathways for psoriasis. PMID:21226955

  12. Genome-wide evidence for speciation with gene flow in Heliconius butterflies

    PubMed Central

    Martin, Simon H.; Dasmahapatra, Kanchon K.; Nadeau, Nicola J.; Salazar, Camilo; Walters, James R.; Simpson, Fraser; Blaxter, Mark; Manica, Andrea; Mallet, James; Jiggins, Chris D.

    2013-01-01

    Most speciation events probably occur gradually, without complete and immediate reproductive isolation, but the full extent of gene flow between diverging species has rarely been characterized on a genome-wide scale. Documenting the extent and timing of admixture between diverging species can clarify the role of geographic isolation in speciation. Here we use new methodology to quantify admixture at different stages of divergence in Heliconius butterflies, based on whole-genome sequences of 31 individuals. Comparisons between sympatric and allopatric populations of H. melpomene, H. cydno, and H. timareta revealed a genome-wide trend of increased shared variation in sympatry, indicative of pervasive interspecific gene flow. Up to 40% of 100-kb genomic windows clustered by geography rather than by species, demonstrating that a very substantial fraction of the genome has been shared between sympatric species. Analyses of genetic variation shared over different time intervals suggested that admixture between these species has continued since early in speciation. Alleles shared between species during recent time intervals displayed higher levels of linkage disequilibrium than those shared over longer time intervals, suggesting that this admixture took place at multiple points during divergence and is probably ongoing. The signal of admixture was significantly reduced around loci controlling divergent wing patterns, as well as throughout the Z chromosome, consistent with strong selection for Müllerian mimicry and with known Z-linked hybrid incompatibility. Overall these results show that species divergence can occur in the face of persistent and genome-wide admixture over long periods of time. PMID:24045163

  13. Genome-wide gene expression changes in an industrial clavulanic acid overproduction strain of Streptomyces clavuligerus.

    PubMed

    Medema, Marnix H; Alam, Mohammad T; Heijne, Wilbert H M; van den Berg, Marco A; Müller, Ulrike; Trefzer, Axel; Bovenberg, Roel A L; Breitling, Rainer; Takano, Eriko

    2011-03-01

    To increase production of the important pharmaceutical compound clavulanic acid, a β-lactamase inhibitor, both random mutagenesis approaches and rational engineering of Streptomyces clavuligerus strains have been extensively applied. Here, for the first time, we compared genome-wide gene expression of an industrial S. clavuligerus strain, obtained through iterative mutagenesis, with that of the wild-type strain. Intriguingly, we found that the majority of the changes contributed not to a complex rewiring of primary metabolism but consisted of a simple upregulation of various antibiotic biosynthesis gene clusters. A few additional transcriptional changes in primary metabolism at key points seem to divert metabolic fluxes to the biosynthetic precursors for clavulanic acid. In general, the observed changes largely coincide with genes that have been targeted by rational engineering in recent years, yet the presence of a number of previously unexplored genes clearly demonstrates that functional genomic analysis can provide new leads for strain improvement in biotechnology.

  14. Genome-wide identification and expression analysis of MAPK and MAPKK gene family in Malus domestica.

    PubMed

    Zhang, Shizhong; Xu, Ruirui; Luo, Xiaocui; Jiang, Zesheng; Shu, Huairui

    2013-12-01

    MAPK signal transduction modules play crucial roles in regulating many biological processes in plants, which are composed of three classes of hierarchically organized protein kinases, namely MAPKKKs, MAPKKs, and MAPKs. Although genome-wide analysis of this family has been carried out in some species, little is known about MAPK and MAPKK genes in apple (Malus domestica). In this study, a total of 26 putative apple MAPK genes (MdMPKs) and 9 putative apple MAPKK genes (MdMKKs) have been identified and located within the apple genome. Phylogenetic analysis revealed that MdMAPKs and MdMAPKKs could be divided into 4 subfamilies (groups A, B, C and D), respectively. The predicted MdMAPKs and MdMAPKKs were distributed across 13 out of 17 chromosomes with different densities. In addition, analysis of exon-intron junctions and of intron phase inside the predicted coding region of each candidate gene has revealed high levels of conservation within and between phylogenetic groups. According to the microarray and expressed sequence tag (EST) analysis, the different expression patterns indicate that they may play different roles during fruit development and rootstock-scion interaction process. Moreover, MAPK and MAPKK genes were performed expression profile analyses in different tissues (root, stem, leaf, flower and fruit), and all of the selected genes were expressed in at least one of the tissues tested, indicating that the MAPKs and MAPKKs are involved in various aspects of physiological and developmental processes of apple. To our knowledge, this is the first report of a genome-wide analysis of the apple MAPK and MAPKK gene family. This study provides valuable information for understanding the classification and putative functions of the MAPK signal in apple.

  15. Methods for Investigating Gene-Environment Interactions in Candidate Pathway and Genome-Wide Association Studies

    PubMed Central

    Thomas, Duncan

    2010-01-01

    Despite the considerable enthusiasm about the yield of novel and replicated discoveries of genetic associations from the new generation of genome-wide association studies (GWAS), the proportion of the heritability of most complex diseases that have been studied to date remains small. Some of this “dark matter” could be due to gene-environment (G×E) interactions or more complex pathways involving multiple genes and exposures. We review the basic epidemiologic study design and statistical analysis approaches to studying G×E interactions individually and then consider more comprehensive approaches to studying entire pathways or GWAS data. In addition to the usual issues in genetic association studies, particular care is needed in exposure assessment and very large sample sizes are required. Although hypothesis-driven pathway-based and “agnostic” GWAS approaches are generally viewed as opposite poles, we suggest that the two can be usefully married using hierarchical modeling strategies that exploit external pathway knowledge in mining genome-wide data. PMID:20070199

  16. Genome-wide detection of genes targeted by non-Ig somatic hypermutation in lymphoma.

    PubMed

    Jiang, Yanwen; Soong, T David; Wang, Ling; Melnick, Ari M; Elemento, Olivier

    2012-01-01

    The processes of somatic hypermutation (SHM) and class switch recombination introduced by activation-induced cytosine deaminase (AICDA) at the Immunoglobulin (Ig) loci are key steps for creating a pool of diversified antibodies in germinal center B cells (GCBs). Unfortunately, AICDA can also accidentally introduce mutations at bystander loci, particularly within the 5' regulatory regions of proto-oncogenes relevant to diffuse large B cell lymphomas (DLBCL). Since current methods for genomewide sequencing such as Exon Capture and RNAseq only target mutations in coding regions, to date non-Ig promoter SHMs have been studied only in a handful genes. We designed a novel approach integrating bioinformatics tools with next generation sequencing technology to identify regulatory loci targeted by SHM genome-wide. We observed increased numbers of SHM associated sequence variant hotspots in lymphoma cells as compared to primary normal germinal center B cells. Many of these SHM hotspots map to genes that have not been reported before as mutated, including BACH2, BTG2, CXCR4, CIITA, EBF1, PIM2, and TCL1A, etc., all of which have potential roles in B cell survival, differentiation, and malignant transformation. In addition, using BCL6 and BACH2 as examples, we demonstrated that SHM sites identified in these 5' regulatory regions greatly altered their transcription activities in a reporter assay. Our approach provides a first cost-efficient, genome-wide method to identify regulatory mutations and non-Ig SHM hotspots. PMID:22808135

  17. Genome-Wide Characterization and Expression Profiles of the Superoxide Dismutase Gene Family in Gossypium.

    PubMed

    Zhang, Jingbo; Li, Bo; Yang, Yang; Hu, Wenran; Chen, Fangyuan; Xie, Lixia; Fan, Ling

    2016-01-01

    Superoxide dismutase (SOD) as a group of significant and ubiquitous enzymes plays a critical function in plant growth and development. Previously this gene family has been investigated in Arabidopsis and rice; it has not yet been characterized in cotton. In our study, it was the first time for us to perform a genome-wide analysis of SOD gene family in cotton. Our results showed that 10 genes of SOD gene family were identified in Gossypium arboreum and Gossypium raimondii, including 6 Cu-Zn-SODs, 2 Fe-SODs, and 2 Mn-SODs. The chromosomal distribution analysis revealed that SOD genes are distributed across 7 chromosomes in Gossypium arboreum and 8 chromosomes in Gossypium raimondii. Segmental duplication is predominant duplication event and major contributor for expansion of SOD gene family. Gene structure and protein structure analysis showed that SOD genes have conserved exon/intron arrangement and motif composition. Microarray-based expression analysis revealed that SOD genes have important function in abiotic stress. Moreover, the tissue-specific expression profile reveals the functional divergence of SOD genes in different organs development of cotton. Taken together, this study has imparted new insights into the putative functions of SOD gene family in cotton. Findings of the present investigation could help in understanding the role of SOD gene family in various aspects of the life cycle of cotton. PMID:27660755

  18. Genome-wide identification and evolution of HECT genes in soybean.

    PubMed

    Meng, Xianwen; Wang, Chen; Rahman, Siddiq Ur; Wang, Yaxu; Wang, Ailan; Tao, Shiheng

    2015-04-16

    Proteins containing domains homologous to the E6-associated protein (E6-AP) carboxyl terminus (HECT) are an important class of E3 ubiquitin ligases involved in the ubiquitin proteasome pathway. HECT-type E3s play crucial roles in plant growth and development. However, current understanding of plant HECT genes and their evolution is very limited. In this study, we performed a genome-wide analysis of the HECT domain-containing genes in soybean. Using high-quality genome sequences, we identified 19 soybean HECT genes. The predicted HECT genes were distributed unevenly across 15 of 20 chromosomes. Nineteen of these genes were inferred to be segmentally duplicated gene pairs, suggesting that in soybean, segmental duplications have made a significant contribution to the expansion of the HECT gene family. Phylogenetic analysis showed that these HECT genes can be divided into seven groups, among which gene structure and domain architecture was relatively well-conserved. The Ka/Ks ratios show that after the duplication events, duplicated HECT genes underwent purifying selection. Moreover, expression analysis reveals that 15 of the HECT genes in soybean are differentially expressed in 14 tissues, and are often highly expressed in the flowers and roots. In summary, this work provides useful information on which further functional studies of soybean HECT genes can be based.

  19. Genome-Wide Characterization and Expression Profiles of the Superoxide Dismutase Gene Family in Gossypium

    PubMed Central

    Zhang, Jingbo; Li, Bo; Yang, Yang; Hu, Wenran; Chen, Fangyuan; Xie, Lixia

    2016-01-01

    Superoxide dismutase (SOD) as a group of significant and ubiquitous enzymes plays a critical function in plant growth and development. Previously this gene family has been investigated in Arabidopsis and rice; it has not yet been characterized in cotton. In our study, it was the first time for us to perform a genome-wide analysis of SOD gene family in cotton. Our results showed that 10 genes of SOD gene family were identified in Gossypium arboreum and Gossypium raimondii, including 6 Cu-Zn-SODs, 2 Fe-SODs, and 2 Mn-SODs. The chromosomal distribution analysis revealed that SOD genes are distributed across 7 chromosomes in Gossypium arboreum and 8 chromosomes in Gossypium raimondii. Segmental duplication is predominant duplication event and major contributor for expansion of SOD gene family. Gene structure and protein structure analysis showed that SOD genes have conserved exon/intron arrangement and motif composition. Microarray-based expression analysis revealed that SOD genes have important function in abiotic stress. Moreover, the tissue-specific expression profile reveals the functional divergence of SOD genes in different organs development of cotton. Taken together, this study has imparted new insights into the putative functions of SOD gene family in cotton. Findings of the present investigation could help in understanding the role of SOD gene family in various aspects of the life cycle of cotton.

  20. Genome-Wide Characterization and Expression Profiles of the Superoxide Dismutase Gene Family in Gossypium

    PubMed Central

    Zhang, Jingbo; Li, Bo; Yang, Yang; Hu, Wenran; Chen, Fangyuan; Xie, Lixia

    2016-01-01

    Superoxide dismutase (SOD) as a group of significant and ubiquitous enzymes plays a critical function in plant growth and development. Previously this gene family has been investigated in Arabidopsis and rice; it has not yet been characterized in cotton. In our study, it was the first time for us to perform a genome-wide analysis of SOD gene family in cotton. Our results showed that 10 genes of SOD gene family were identified in Gossypium arboreum and Gossypium raimondii, including 6 Cu-Zn-SODs, 2 Fe-SODs, and 2 Mn-SODs. The chromosomal distribution analysis revealed that SOD genes are distributed across 7 chromosomes in Gossypium arboreum and 8 chromosomes in Gossypium raimondii. Segmental duplication is predominant duplication event and major contributor for expansion of SOD gene family. Gene structure and protein structure analysis showed that SOD genes have conserved exon/intron arrangement and motif composition. Microarray-based expression analysis revealed that SOD genes have important function in abiotic stress. Moreover, the tissue-specific expression profile reveals the functional divergence of SOD genes in different organs development of cotton. Taken together, this study has imparted new insights into the putative functions of SOD gene family in cotton. Findings of the present investigation could help in understanding the role of SOD gene family in various aspects of the life cycle of cotton. PMID:27660755

  1. Cytokine Gene Polymorphisms and Human Autoimmune Disease in the Era of Genome-Wide Association Studies

    PubMed Central

    2012-01-01

    Cytokine (receptor) genes have traditionally attracted great interest as plausible genetic risk factors for autoimmune disease. Since 2007, the implementation of genome-wide association studies has facilitated the robust identification of allelic variants in more than 35 cytokine loci as susceptibility factors for a wide variety of over 15 autoimmune disorders. In this review, we catalog the gene loci of interleukin, chemokine, and tumor necrosis factor receptor superfamily and ligands that have emerged as autoimmune risk factors. We examine recent progress made in the clarification of the functional mechanisms by which polymorphisms in the genes coding for interleukin-2 receptor alpha (IL2RA), IL7R, and IL23R may alter risk for autoimmune disease, and discuss opposite autoimmune risk alleles found, among others, at the IL10 locus. PMID:22191464

  2. 141 Gene Discovery and Data Sharing in Genome Wide Association Analyses: lessons form AIDS genetic restriction genes

    PubMed Central

    O'Brien, Stephen J; Svitin, Anton; Malov, Sergey; Cherkazov, Nikolay; Dobrynin, Pavel; Geerts, Paul; Troyer, Jennifer; Hendrickson-Lambert, Sher; Sezgin, Efe; Hutcheson, Holli

    2014-01-01

    As genome wide association studies plus whole genome sequence analyses for complex human disease determinants are expanding, it seems useful to develop strategies to facilitate large data sharing, rapid replication and validation of provocative statistical associations that straddle the threshold for genome wide significance. At this conference, we shall announce GWATCH, (Genome Wide Association Tracks Chromosome Highway) a web based data release platform that can freely display and inspect unabridged genome tracked association data without compromising privacy or Informed Consent constrictions, allowing for rapid discovery and replication opportunities. We illustrate the utility with HIV-AIDS resistance genes screened in combined large multicenter cohort studies GWAS (MACS, HGDS, MHGS, ALLIVE, LSOCA HOMER) developed and studied over the last decades.

  3. Genome-wide analysis of chromatin packing in Arabidopsis thaliana at single-gene resolution

    PubMed Central

    Liu, Chang; Wang, Congmao; Wang, George; Becker, Claude; Zaidem, Maricris; Weigel, Detlef

    2016-01-01

    The three-dimensional packing of the genome plays an important role in regulating gene expression. We have used Hi-C, a genome-wide chromatin conformation capture (3C) method, to analyze Arabidopsis thaliana chromosomes dissected into subkilobase segments, which is required for gene-level resolution in this species with a gene-dense genome. We found that the repressive H3K27me3 histone mark is overrepresented in the promoter regions of genes that are in conformational linkage over long distances. In line with the globally dispersed distribution of RNA polymerase II in A. thaliana nuclear space, actively transcribed genes do not show a strong tendency to associate with each other. In general, there are often contacts between 5′ and 3′ ends of genes, forming local chromatin loops. Such self-loop structures of genes are more likely to occur in more highly expressed genes, although they can also be found in silent genes. Silent genes with local chromatin loops are highly enriched for the histone variant H3.3 at their 5′ and 3′ ends but depleted of repressive marks such as heterochromatic histone modifications and DNA methylation in flanking regions. Our results suggest that, different from animals, a major theme of genome folding in A. thaliana is the formation of structural units that correspond to gene bodies. PMID:27225844

  4. Genome-wide analysis suggests divergent evolution of lipid phosphotases/phosphotransferase genes in plants.

    PubMed

    Wang, Peng; Chen, Zhenxi; Kasimu, Rena; Chen, Yinhua; Zhang, Xiaoxiao; Gai, Jiangtao

    2016-08-01

    Genes of the LPPT (lipid phosphatase/phosphotransferase) family play important roles in lipid phosphorous transfer and triacylglycerol accumulation in plants. To provide overviews of the plant LPPT family and their overall relationships, here we carried out genome-wide identifications and analyses of plant LPPT family members. A total of 643 putative LPPT genes were identified from 48 sequenced plant genomes, among which 205 genes from 14 plants were chosen for further analyses. Plant LPPT genes belonged to three distinctive groups, namely the LPT (lipid phosphotransfease), LPP (lipid phosphatase), and pLPP (plastidic lipid phosphotransfease) groups. Genes of the LPT group could be further partitioned into three groups, two of which were only identified in terrestrial plants. Genes in the LPP and pLPP groups experienced duplications in early stages of plant evolution. Among 17 Zea mays LPPT genes, divergence of temporal-spatial expression patterns was revealed based on microarray data analysis. Peptide sequences of plant LPPT genes harbored different conserved motifs. A test of Branch Model versus One-ratio Model did not support significant selective pressures acting on different groups of LPPT genes, although quite different nonsynonymous evolutionary rates and selective pressures were observed. The complete picture of the plant LPPT family provided here should facilitate further investigations of plant LPPT genes and offer a better understanding of lipid biosynthesis in plants. PMID:27501416

  5. Genome-wide identification of genes essential for the survival of Streptococcus pneumoniae in human saliva.

    PubMed

    Verhagen, Lilly M; de Jonge, Marien I; Burghout, Peter; Schraa, Kiki; Spagnuolo, Lorenza; Mennens, Svenja; Eleveld, Marc J; van der Gaast-de Jongh, Christa E; Zomer, Aldert; Hermans, Peter W M; Bootsma, Hester J

    2014-01-01

    Since Streptococcus pneumoniae transmits through droplet spread, this respiratory tract pathogen may be able to survive in saliva. Here, we show that saliva supports survival of clinically relevant S. pneumoniae strains for more than 24 h in a capsule-independent manner. Moreover, saliva induced growth of S. pneumoniae in growth-permissive conditions, suggesting that S. pneumoniae is well adapted for uptake of nutrients from this bodily fluid. By using Tn-seq, a method for genome-wide negative selection screening, we identified 147 genes potentially required for growth and survival of S. pneumoniae in saliva, among which genes predicted to be involved in cell envelope biosynthesis, cell transport, amino acid metabolism, and stress response predominated. The Tn-seq findings were validated by testing a panel of directed gene deletion mutants for their ability to survive in saliva under two testing conditions: at room temperature without CO2, representing transmission, and at 37 °C with CO2, representing in-host carriage. These validation experiments confirmed that the plsX gene and the amiACDEF and aroDEBC operons, involved in respectively fatty acid metabolism, oligopeptide transport, and biosynthesis of aromatic amino acids play an important role in the growth and survival of S. pneumoniae in saliva at 37 °C. In conclusion, this study shows that S. pneumoniae is well-adapted for growth and survival in human saliva and provides a genome-wide list of genes potentially involved in adaptation. This notion supports earlier evidence that S. pneumoniae can use human saliva as a vector for transmission.

  6. Genome-wide chromatin and gene expression profiling during memory formation and maintenance in adult mice

    PubMed Central

    Centeno, Tonatiuh Pena; Shomroni, Orr; Hennion, Magali; Halder, Rashi; Vidal, Ramon; Rahman, Raza-Ur; Bonn, Stefan

    2016-01-01

    Recent evidence suggests that the formation and maintenance of memory requires epigenetic changes. In an effort to understand the spatio-temporal extent of learning and memory-related epigenetic changes we have charted genome-wide histone and DNA methylation profiles, in two different brain regions, two cell types, and three time-points, before and after learning. In this data descriptor we provide detailed information on data generation, give insights into the rationale of experiments, highlight necessary steps to assess data quality, offer guidelines for future use of the data and supply ready-to-use code to replicate the analysis results. The data provides a blueprint of the gene regulatory network underlying short- and long-term memory formation and maintenance. This ‘healthy’ gene regulatory network of learning can now be compared to changes in neurological or psychiatric diseases, providing mechanistic insights into brain disorders and highlighting potential therapeutic avenues. PMID:27727234

  7. Genome-wide profiling of DNA methylation and gene expression in Crassostrea gigas male gametes

    PubMed Central

    Olson, Claire E.; Roberts, Steven B.

    2014-01-01

    DNA methylation patterns and functions are variable across invertebrate taxa. In order to provide a better understanding of DNA methylation in the Pacific oyster (Crassostrea gigas), we characterized the genome-wide DNA methylation profile in male gamete cells using whole-genome bisulfite sequencing. RNA-Seq analysis was performed to examine the relationship between DNA methylation and transcript expression. Methylation status of over 7.6 million CpG dinucleotides was described with a majority of methylated regions occurring among intragenic regions. Overall, 15% of the CpG dinucleotides were determined to be methylated and the mitochondrial genome lacked DNA methylation. Integrative analysis of DNA methylation and RNA-Seq data revealed a positive association between methylation status, both in gene bodies and putative promoter regions, and expression. This study provides a comprehensive characterization of the distribution of DNA methylation in the oyster male gamete tissue and suggests that DNA methylation is involved in gene regulatory activity. PMID:24987376

  8. Integrated genome-wide analysis of genomic changes and gene regulation in human adrenocortical tissue samples

    PubMed Central

    Gara, Sudheer Kumar; Wang, Yonghong; Patel, Dhaval; Liu-Chittenden, Yi; Jain, Meenu; Boufraqech, Myriem; Zhang, Lisa; Meltzer, Paul S.; Kebebew, Electron

    2015-01-01

    To gain insight into the pathogenesis of adrenocortical carcinoma (ACC) and whether there is progression from normal-to-adenoma-to-carcinoma, we performed genome-wide gene expression, gene methylation, microRNA expression and comparative genomic hybridization (CGH) analysis in human adrenocortical tissue (normal, adrenocortical adenomas and ACC) samples. A pairwise comparison of normal, adrenocortical adenomas and ACC gene expression profiles with more than four-fold expression differences and an adjusted P-value < 0.05 revealed no major differences in normal versus adrenocortical adenoma whereas there are 808 and 1085, respectively, dysregulated genes between ACC versus adrenocortical adenoma and ACC versus normal. The majority of the dysregulated genes in ACC were downregulated. By integrating the CGH, gene methylation and expression profiles of potential miRNAs with the gene expression of dysregulated genes, we found that there are higher alterations in ACC versus normal compared to ACC versus adrenocortical adenoma. Importantly, we identified several novel molecular pathways that are associated with dysregulated genes and further experimentally validated that oncostatin m signaling induces caspase 3 dependent apoptosis and suppresses cell proliferation. Finally, we propose that there is higher number of genomic changes from normal-to-adenoma-to-carcinoma and identified oncostatin m signaling as a plausible druggable pathway for therapeutics. PMID:26446994

  9. Integrated genome-wide analysis of genomic changes and gene regulation in human adrenocortical tissue samples.

    PubMed

    Gara, Sudheer Kumar; Wang, Yonghong; Patel, Dhaval; Liu-Chittenden, Yi; Jain, Meenu; Boufraqech, Myriem; Zhang, Lisa; Meltzer, Paul S; Kebebew, Electron

    2015-10-30

    To gain insight into the pathogenesis of adrenocortical carcinoma (ACC) and whether there is progression from normal-to-adenoma-to-carcinoma, we performed genome-wide gene expression, gene methylation, microRNA expression and comparative genomic hybridization (CGH) analysis in human adrenocortical tissue (normal, adrenocortical adenomas and ACC) samples. A pairwise comparison of normal, adrenocortical adenomas and ACC gene expression profiles with more than four-fold expression differences and an adjusted P-value < 0.05 revealed no major differences in normal versus adrenocortical adenoma whereas there are 808 and 1085, respectively, dysregulated genes between ACC versus adrenocortical adenoma and ACC versus normal. The majority of the dysregulated genes in ACC were downregulated. By integrating the CGH, gene methylation and expression profiles of potential miRNAs with the gene expression of dysregulated genes, we found that there are higher alterations in ACC versus normal compared to ACC versus adrenocortical adenoma. Importantly, we identified several novel molecular pathways that are associated with dysregulated genes and further experimentally validated that oncostatin m signaling induces caspase 3 dependent apoptosis and suppresses cell proliferation. Finally, we propose that there is higher number of genomic changes from normal-to-adenoma-to-carcinoma and identified oncostatin m signaling as a plausible druggable pathway for therapeutics.

  10. A Genome-Wide Screen Indicates Correlation between Differentiation and Expression of Metabolism Related Genes

    PubMed Central

    Shende, Akhilesh; Singh, Anupama; Meena, Anil; Ghosal, Ritika; Ranganathan, Madhav; Bandyopadhyay, Amitabha

    2013-01-01

    Differentiated tissues may be considered as materials with distinct properties. The differentiation program of a given tissue ensures that it acquires material properties commensurate with its function. It may be hypothesized that some of these properties are acquired through production of tissue-specific metabolites synthesized by metabolic enzymes. To establish correlation between metabolism and organogenesis we have carried out a genome-wide expression study of metabolism related genes by RNA in-situ hybridization. 23% of the metabolism related genes studied are expressed in a tissue-restricted but not tissue-exclusive manner. We have conducted the screen on whole mount chicken (Gallus gallus) embryos from four distinct developmental stages to correlate dynamic changes in expression patterns of metabolic enzymes with spatio-temporally unique developmental events. Our data strongly suggests that unique combinations of metabolism related genes, and not specific metabolic pathways, are upregulated during differentiation. Further, expression of metabolism related genes in well established signaling centers that regulate different aspects of morphogenesis indicates developmental roles of some of the metabolism related genes. The database of tissue-restricted expression patterns of metabolism related genes, generated in this study, should serve as a resource for systematic identification of these genes with tissue-specific functions during development. Finally, comprehensive understanding of differentiation is not possible unless the downstream genes of a differentiation cascade are identified. We propose, metabolic enzymes constitute a significant portion of these downstream target genes. Thus our study should help elucidate different aspects of tissue differentiation. PMID:23717462

  11. Genome-Wide Identification and Characterization of WRKY Gene Family in Peanut

    PubMed Central

    Song, Hui; Wang, Pengfei; Lin, Jer-Young; Zhao, Chuanzhi; Bi, Yuping; Wang, Xingjun

    2016-01-01

    WRKY, an important transcription factor family, is widely distributed in the plant kingdom. Many reports focused on analysis of phylogenetic relationship and biological function of WRKY protein at the whole genome level in different plant species. However, little is known about WRKY proteins in the genome of Arachis species and their response to salicylic acid (SA) and jasmonic acid (JA) treatment. In this study, we identified 77 and 75 WRKY proteins from the two wild ancestral diploid genomes of cultivated tetraploid peanut, Arachis duranensis and Arachis ipaënsis, using bioinformatics approaches. Most peanut WRKY coding genes were located on A. duranensis chromosome A6 and A. ipaënsis chromosome B3, while the least number of WRKY genes was found in chromosome 9. The WRKY orthologous gene pairs in A. duranensis and A. ipaënsis chromosomes were highly syntenic. Our analysis indicated that segmental duplication events played a major role in AdWRKY and AiWRKY genes, and strong purifying selection was observed in gene duplication pairs. Furthermore, we translate the knowledge gained from the genome-wide analysis result of wild ancestral peanut to cultivated peanut to reveal that gene activities of specific cultivated peanut WRKY gene were changed due to SA and JA treatment. Peanut WRKY7, 8 and 13 genes were down-regulated, whereas WRKY1 and 12 genes were up-regulated with SA and JA treatment. These results could provide valuable information for peanut improvement. PMID:27200012

  12. Genome-wide identification and phylogenetic analysis of the SBP-box gene family in melons.

    PubMed

    Ma, Y; Guo, J W; Bade, R; Men, Z H; Hasi, A

    2014-10-27

    The SBP-box gene family is specific to plants and encodes a class of zinc finger-containing transcription factors with a broad range of functions. Although SBP-box genes have been identified in numerous plants, including green algae, moss, silver birch, snapdragon, Arabidopsis, rice, and maize, there is little information concerning SBP-box genes, or the corresponding miR156/157, function in melon. Using the highly conserved sequence of the Arabidopsis thaliana SBP-box domain protein as a probe of information sequence, the genome-wide protein database of melon was explored to obtain 13 SBP-box protein sequences, which were further divided into 4 groups, based on phylogenetic analysis. A further analysis centered on the melon SBP-box genetic family's phylogenetic evolution, sequence similarities, gene structure, and miR156 target sequence was also conducted. Analysis of all the expression patterns of melon SBP-box family genes showed that the SBP-box genes were detected in 7 kinds of tissue, and fruit had the highest expression level. CmSBP11 tends to present its specific expression in melon fruit and root. CmSBP09 expression was the highest in flower. Overall, the molecular evolution and expression pattern of the melon SBP-box gene family, revealed by these results, suggest its function differentiation that followed gene duplication.

  13. A genome-wide analysis of the expansin genes in Malus × Domestica.

    PubMed

    Zhang, Shizhong; Xu, Ruirui; Gao, Zheng; Chen, Changtian; Jiang, Zesheng; Shu, Huairui

    2014-04-01

    Expansins were first identified as cell wall-loosening proteins; they are involved in regulating cell expansion, fruits softening and many other physiological processes. However, our knowledge about the expansin family members and their evolutionary relationships in fruit trees, such as apple, is limited. In this study, we identified 41 members of the expansin gene family in the genome of apple (Malus × Domestica L. Borkh). Phylogenetic analysis revealed that expansin genes in apple could be divided into four subfamilies according to their gene structures and protein motifs. By phylogenetic analysis of the expansins in five plants (Arabidopsis, rice, poplar, grape and apple), the expansins were divided into 17 subgroups. Our gene duplication analysis revealed that whole-genome and chromosomal-segment duplications contributed to the expansion of Mdexpansins. The microarray and expressed sequence tag (EST) data showed that 34 Mdexpansin genes could be divided into five groups by the EST analysis; they may also play different roles during fruit development. An expression model for MdEXPA16 and MdEXPA20 showed their potential role in developing fruit. Overall, our study provides useful data and novel insights into the functions and regulatory mechanisms of the expansin genes in apple, as well as their evolution and divergence. As the first step towards genome-wide analysis of the expansin genes in apple, our results have established a solid foundation for future studies on the function of the expansin genes in fruit development.

  14. Genome-wide analysis reveals gene expression and metabolic network dynamics during embryo development in Arabidopsis.

    PubMed

    Xiang, Daoquan; Venglat, Prakash; Tibiche, Chabane; Yang, Hui; Risseeuw, Eddy; Cao, Yongguo; Babic, Vivijan; Cloutier, Mathieu; Keller, Wilf; Wang, Edwin; Selvaraj, Gopalan; Datla, Raju

    2011-05-01

    Embryogenesis is central to the life cycle of most plant species. Despite its importance, because of the difficulty associated with embryo isolation, global gene expression programs involved in plant embryogenesis, especially the early events following fertilization, are largely unknown. To address this gap, we have developed methods to isolate whole live Arabidopsis (Arabidopsis thaliana) embryos as young as zygote and performed genome-wide profiling of gene expression. These studies revealed insights into patterns of gene expression relating to: maternal and paternal contributions to zygote development, chromosomal level clustering of temporal expression in embryogenesis, and embryo-specific functions. Functional analysis of some of the modulated transcription factor encoding genes from our data sets confirmed that they are critical for embryogenesis. Furthermore, we constructed stage-specific metabolic networks mapped with differentially regulated genes by combining the microarray data with the available Kyoto Encyclopedia of Genes and Genomes metabolic data sets. Comparative analysis of these networks revealed the network-associated structural and topological features, pathway interactions, and gene expression with reference to the metabolic activities during embryogenesis. Together, these studies have generated comprehensive gene expression data sets for embryo development in Arabidopsis and may serve as an important foundational resource for other seed plants. PMID:21402797

  15. Integration of gene ontology pathways with North American Rheumatoid Arthritis Consortium genome-wide association data via linear modeling.

    PubMed

    Lebrec, Jérémie Jp; Huizinga, Tom Wj; Toes, René Em; Houwing-Duistermaat, Jeanine J; van Houwelingen, Hans C

    2009-01-01

    We describe an empirical Bayesian linear model for integration of functional gene annotation data with genome-wide association data. Using case-control study data from the North American Rheumatoid Arthritis Consortium and gene annotation data from the Gene Ontology, we illustrate how the method can be used to prioritize candidate genes for further investigation.

  16. Integration of gene ontology pathways with North American Rheumatoid Arthritis Consortium genome-wide association data via linear modeling

    PubMed Central

    2009-01-01

    We describe an empirical Bayesian linear model for integration of functional gene annotation data with genome-wide association data. Using case-control study data from the North American Rheumatoid Arthritis Consortium and gene annotation data from the Gene Ontology, we illustrate how the method can be used to prioritize candidate genes for further investigation. PMID:20018091

  17. Powerful Cocktail Methods for Detecting Genome-wide Gene-Environment Interaction

    PubMed Central

    Hsu, Li; Jiao, Shuo; Dai, James Y.; Hutter, Carolyn; Peters, Ulrike; Kooperberg, Charles

    2013-01-01

    Identifying gene and environment interaction (GxE) can provide insights into biological networks of complex diseases, identify novel genes that act synergistically with environmental factors, and inform risk prediction. However, despite the fact that hundreds of novel disease-associated loci have been identified from genome-wide association studies (GWAS), few GxEs have been discovered. One reason is that most studies are underpowered for detecting these interactions. Several new methods have been proposed to improve power for GxE analysis, but performance varies with scenario. In this article we present a module-based approach to integrating various methods that exploits each method’s most appealing aspects. There are three modules in our approach: 1) a screening module for prioritizing SNPs; 2) a multiple comparison module for testing GxE; and 3) a GxE testing module. We combine all three of these modules and develop two novel “cocktail” methods. We demonstrate that the proposed cocktail methods maintain the type I error, and that the power tracks well with the best existing methods, despite that the best methods may be different under various scenarios and interaction models. For GWAS, where the true interaction models are unknown, methods like our “cocktail” methods that are powerful under a wide range of situations are particularly appealing. Broadly speaking, the modular approach is conceptually straightforward and computationally simple. It builds on common test statistics and is easily implemented without additional computational efforts. It also allows for an easy incorporation of new methods as they are developed. Our work provides a comprehensive and powerful tool for devising effective strategies for genome-wide detection of gene-environment interactions. PMID:22714933

  18. Genome-wide modulation of gene transcription in ovarian carcinoma cells by a new mithramycin analogue.

    PubMed

    Vizcaíno, Carolina; Núñez, Luz-Elena; Morís, Francisco; Portugal, José

    2014-01-01

    Ovarian cancer has a poor prognosis due to intrinsic or acquired resistance to some cytotoxic drugs, raising the interest in new DNA-binding agents such as mithramycin analogues as potential chemotherapeutic agents in gynecological cancer. Using a genome-wide approach, we have analyzed gene expression in A2780 human ovarian carcinoma cells treated with the novel mithramycin analogue DIG-MSK (demycarosyl-3D-β-D-digitoxosyl-mithramycin SK) that binds to C+G-rich DNA sequences. Nanomolar concentrations of DIG-MSK abrogated the expression of genes involved in a variety of cell processes including transcription regulation and tumor development, which resulted in cell death. Some of those genes have been associated with cell proliferation and poor prognosis in ovarian cancer. Sp1 transcription factor regulated most of the genes that were down-regulated by the drug, as well as the up-regulation of other genes mainly involved in response to cell stress. The effect of DIG-MSK in the control of gene expression by other transcription factors was also explored. Some of them, such as CREB, E2F and EGR1, also recognize C/G-rich regions in gene promoters, which encompass potential DIG-MSK binding sites. DIG-MSK affected several biological processes and molecular functions related to transcription and its cellular regulation in A2780 cells, including transcription factor activity. This new compound might be a promising drug for the treatment of ovarian cancer.

  19. Genome-wide differential gene expression in immortalized DF-1 chicken embryo fibroblast cell line

    PubMed Central

    2011-01-01

    Background When compared to primary chicken embryo fibroblast (CEF) cells, the immortal DF-1 CEF line exhibits enhanced growth rates and susceptibility to oxidative stress. Although genes responsible for cell cycle regulation and antioxidant functions have been identified, the genome-wide transcription profile of immortal DF-1 CEF cells has not been previously reported. Global gene expression in primary CEF and DF-1 cells was performed using a 4X44K chicken oligo microarray. Results A total of 3876 differentially expressed genes were identified with a 2 fold level cutoff that included 1706 up-regulated and 2170 down-regulated genes in DF-1 cells. Network and functional analyses using Ingenuity Pathways Analysis (IPA, Ingenuity® Systems, http://www.ingenuity.com) revealed that 902 of 3876 differentially expressed genes were classified into a number of functional groups including cellular growth and proliferation, cell cycle, cellular movement, cancer, genetic disorders, and cell death. Also, the top 5 gene networks with intermolecular connections were identified. Bioinformatic analyses suggested that DF-1 cells were characterized by enhanced molecular mechanisms for cell cycle progression and proliferation, suppressing cell death pathways, altered cellular morphogenesis, and accelerated capacity for molecule transport. Key molecules for these functions include E2F1, BRCA1, SRC, CASP3, and the peroxidases. Conclusions The global gene expression profiles provide insight into the cellular mechanisms that regulate the unique characteristics observed in immortal DF-1 CEF cells. PMID:22111699

  20. Candidate Essential Genes in Burkholderia cenocepacia J2315 Identified by Genome-Wide TraDIS

    PubMed Central

    Wong, Yee-Chin; Abd El Ghany, Moataz; Naeem, Raeece; Lee, Kok-Wei; Tan, Yung-Chie; Pain, Arnab; Nathan, Sheila

    2016-01-01

    Burkholderia cenocepacia infection often leads to fatal cepacia syndrome in cystic fibrosis patients. However, antibiotic therapy rarely results in complete eradication of the pathogen due to its intrinsic resistance to many clinically available antibiotics. Recent attention has turned to the identification of essential genes as the proteins encoded by these genes may serve as potential targets for development of novel antimicrobials. In this study, we utilized TraDIS (Transposon Directed Insertion-site Sequencing) as a genome-wide screening tool to facilitate the identification of B. cenocepacia genes essential for its growth and viability. A transposon mutant pool consisting of approximately 500,000 mutants was successfully constructed, with more than 400,000 unique transposon insertion sites identified by computational analysis of TraDIS datasets. The saturated library allowed for the identification of 383 genes that were predicted to be essential in B. cenocepacia. We extended the application of TraDIS to identify conditionally essential genes required for in vitro growth and revealed an additional repertoire of 439 genes to be crucial for B. cenocepacia growth under nutrient-depleted conditions. The library of B. cenocepacia mutants can subsequently be subjected to various biologically related conditions to facilitate the discovery of genes involved in niche adaptation as well as pathogenicity and virulence. PMID:27597847

  1. Candidate Essential Genes in Burkholderia cenocepacia J2315 Identified by Genome-Wide TraDIS

    PubMed Central

    Wong, Yee-Chin; Abd El Ghany, Moataz; Naeem, Raeece; Lee, Kok-Wei; Tan, Yung-Chie; Pain, Arnab; Nathan, Sheila

    2016-01-01

    Burkholderia cenocepacia infection often leads to fatal cepacia syndrome in cystic fibrosis patients. However, antibiotic therapy rarely results in complete eradication of the pathogen due to its intrinsic resistance to many clinically available antibiotics. Recent attention has turned to the identification of essential genes as the proteins encoded by these genes may serve as potential targets for development of novel antimicrobials. In this study, we utilized TraDIS (Transposon Directed Insertion-site Sequencing) as a genome-wide screening tool to facilitate the identification of B. cenocepacia genes essential for its growth and viability. A transposon mutant pool consisting of approximately 500,000 mutants was successfully constructed, with more than 400,000 unique transposon insertion sites identified by computational analysis of TraDIS datasets. The saturated library allowed for the identification of 383 genes that were predicted to be essential in B. cenocepacia. We extended the application of TraDIS to identify conditionally essential genes required for in vitro growth and revealed an additional repertoire of 439 genes to be crucial for B. cenocepacia growth under nutrient-depleted conditions. The library of B. cenocepacia mutants can subsequently be subjected to various biologically related conditions to facilitate the discovery of genes involved in niche adaptation as well as pathogenicity and virulence.

  2. Candidate Essential Genes in Burkholderia cenocepacia J2315 Identified by Genome-Wide TraDIS.

    PubMed

    Wong, Yee-Chin; Abd El Ghany, Moataz; Naeem, Raeece; Lee, Kok-Wei; Tan, Yung-Chie; Pain, Arnab; Nathan, Sheila

    2016-01-01

    Burkholderia cenocepacia infection often leads to fatal cepacia syndrome in cystic fibrosis patients. However, antibiotic therapy rarely results in complete eradication of the pathogen due to its intrinsic resistance to many clinically available antibiotics. Recent attention has turned to the identification of essential genes as the proteins encoded by these genes may serve as potential targets for development of novel antimicrobials. In this study, we utilized TraDIS (Transposon Directed Insertion-site Sequencing) as a genome-wide screening tool to facilitate the identification of B. cenocepacia genes essential for its growth and viability. A transposon mutant pool consisting of approximately 500,000 mutants was successfully constructed, with more than 400,000 unique transposon insertion sites identified by computational analysis of TraDIS datasets. The saturated library allowed for the identification of 383 genes that were predicted to be essential in B. cenocepacia. We extended the application of TraDIS to identify conditionally essential genes required for in vitro growth and revealed an additional repertoire of 439 genes to be crucial for B. cenocepacia growth under nutrient-depleted conditions. The library of B. cenocepacia mutants can subsequently be subjected to various biologically related conditions to facilitate the discovery of genes involved in niche adaptation as well as pathogenicity and virulence. PMID:27597847

  3. A Genome-Wide Regulatory Framework Identifies Maize Pericarp Color1 Controlled Genes[C][W

    PubMed Central

    Morohashi, Kengo; Casas, María Isabel; Ferreyra, Lorena Falcone; Mejía-Guerra, María Katherine; Pourcel, Lucille; Yilmaz, Alper; Feller, Antje; Carvalho, Bruna; Emiliani, Julia; Rodriguez, Eduardo; Pellegrinet, Silvina; McMullen, Michael; Casati, Paula; Grotewold, Erich

    2012-01-01

    Pericarp Color1 (P1) encodes an R2R3-MYB transcription factor responsible for the accumulation of insecticidal flavones in maize (Zea mays) silks and red phlobaphene pigments in pericarps and other floral tissues, which makes P1 an important visual marker. Using genome-wide expression analyses (RNA sequencing) in pericarps and silks of plants with contrasting P1 alleles combined with chromatin immunoprecipitation coupled with high-throughput sequencing, we show here that the regulatory functions of P1 are much broader than the activation of genes corresponding to enzymes in a branch of flavonoid biosynthesis. P1 modulates the expression of several thousand genes, and ∼1500 of them were identified as putative direct targets of P1. Among them, we identified F2H1, corresponding to a P450 enzyme that converts naringenin into 2-hydroxynaringenin, a key branch point in the P1-controlled pathway and the first step in the formation of insecticidal C-glycosyl flavones. Unexpectedly, the binding of P1 to gene regulatory regions can result in both gene activation and repression. Our results indicate that P1 is the major regulator for a set of genes involved in flavonoid biosynthesis and a minor modulator of the expression of a much larger gene set that includes genes involved in primary metabolism and production of other specialized compounds. PMID:22822204

  4. Genome-wide analysis and expression profiling of the phospholipase D gene family in Gossypium arboreum.

    PubMed

    Tang, Kai; Dong, Chunjuan; Liu, Jinyuan

    2016-02-01

    The plant phospholipase D (PLD) plays versatile functions in multiple aspects of plant growth, development, and stress responses. However, until now, our knowledge concerning the PLD gene family members and their expression patterns in cotton has been limited. In this study, we performed for the first time the genome-wide analysis and expression profiling of PLD gene family in Gossypium arboretum, and finally, a total of 19 non-redundant PLD genes (GaPLDs) were identified. Based on the phylogenetic analysis, they were divided into six well-supported clades (α, β/γ, δ, ε, ζ and φ). Most of the GaPLD genes within the same clade showed the similar exon-intron organization and highly conserved motif structures. Additionally, the chromosomal distribution pattern revealed that GaPLD genes were unevenly distributed across 10 of the 13 cotton chromosomes. Segmental duplication is the major contributor to the expansion of GaPLD gene family and estimated to have occurred from 19.61 to 20.44 million years ago when a recent large-scale genome duplication occurred in cotton. Moreover, the expression profiling provides the functional divergence of GaPLD genes in cotton and provides some new light on the molecular mechanisms of GaPLDα1 and GaPLDδ2 in fiber development. PMID:26718354

  5. A genome-wide identification of genes undergoing recombination and positive selection in Neisseria.

    PubMed

    Yu, Dong; Jin, Yuan; Yin, Zhiqiu; Ren, Hongguang; Zhou, Wei; Liang, Long; Yue, Junjie

    2014-01-01

    Currently, there is particular interest in the molecular mechanisms of adaptive evolution in bacteria. Neisseria is a genus of gram negative bacteria, and there has recently been considerable focus on its two human pathogenic species N. meningitidis and N. gonorrhoeae. Until now, no genome-wide studies have attempted to scan for the genes related to adaptive evolution. For this reason, we selected 18 Neisseria genomes (14 N. meningitidis, 3 N. gonorrhoeae and 1 commensal N. lactamics) to conduct a comparative genome analysis to obtain a comprehensive understanding of the roles of natural selection and homologous recombination throughout the history of adaptive evolution. Among the 1012 core orthologous genes, we identified 635 genes with recombination signals and 10 genes that showed significant evidence of positive selection. Further functional analyses revealed that no functional bias was found in the recombined genes. Positively selected genes are prone to DNA processing and iron uptake, which are essential for the fundamental life cycle. Overall, the results indicate that both recombination and positive selection play crucial roles in the adaptive evolution of Neisseria genomes. The positively selected genes and the corresponding amino acid sites provide us with valuable targets for further research into the detailed mechanisms of adaptive evolution in Neisseria.

  6. Genome-wide identification of the expansin gene family in tobacco (Nicotiana tabacum).

    PubMed

    Ding, Anming; Marowa, Prince; Kong, Yingzhen

    2016-10-01

    Expansins are pH-dependent cell wall loosening proteins which form a large family in plants. They have been shown to be involved in various developmental processes and been implicated in enabling plants' ability to absorb nutrients from the soil as well as conferring biotic and abiotic stress resistances. It is therefore clear that they can be potential targets in genetic engineering for crop improvement. Tobacco (Nicotiana tabacum) is a major crop species as well as a model organism. Considering that only a few tobacco expansins have been studied, a genome-wide analysis of the tobacco expansin gene family is necessary. In this study, we identified 52 expansins in tobacco, which were classified into four subfamilies: 36 NtEXPAs, 6 NtEXPBs, 3 NtEXLAs and 7 NtEXLBs. Compared to other species, the NtEXLB subfamily size was relatively larger. Phylogenetic analysis showed that the 52 tobacco expansins were divided into 13 subgroups. Gene structure analysis revealed that genes within subfamilies/subgroups exhibited similar characteristics such as gene structure and protein motif arrangement. Whole-genome duplication and tandem duplication events may have played important roles in the expanding of tobacco expansins. Cis-Acting element analysis revealed that each expansin gene was regulated or several expansin genes were co-regulated by both internal and environmental factors. 35 of these genes were identified as being expressed according to a microarray analysis. In contrast to most NtEXPAs which had higher expression levels in young organs, NtEXLAs and NtEXLBs were preferentially expressed in mature or senescent tissues, suggesting that they might play different roles in different organs or at different developmental stages. As the first step towards genome-wide analysis of the tobacco expansin gene family, our work provides solid background information related to structure, evolution and expression as well as regulatory cis-acting elements of the tobacco expansins. This

  7. Genome-wide identification of the expansin gene family in tobacco (Nicotiana tabacum).

    PubMed

    Ding, Anming; Marowa, Prince; Kong, Yingzhen

    2016-10-01

    Expansins are pH-dependent cell wall loosening proteins which form a large family in plants. They have been shown to be involved in various developmental processes and been implicated in enabling plants' ability to absorb nutrients from the soil as well as conferring biotic and abiotic stress resistances. It is therefore clear that they can be potential targets in genetic engineering for crop improvement. Tobacco (Nicotiana tabacum) is a major crop species as well as a model organism. Considering that only a few tobacco expansins have been studied, a genome-wide analysis of the tobacco expansin gene family is necessary. In this study, we identified 52 expansins in tobacco, which were classified into four subfamilies: 36 NtEXPAs, 6 NtEXPBs, 3 NtEXLAs and 7 NtEXLBs. Compared to other species, the NtEXLB subfamily size was relatively larger. Phylogenetic analysis showed that the 52 tobacco expansins were divided into 13 subgroups. Gene structure analysis revealed that genes within subfamilies/subgroups exhibited similar characteristics such as gene structure and protein motif arrangement. Whole-genome duplication and tandem duplication events may have played important roles in the expanding of tobacco expansins. Cis-Acting element analysis revealed that each expansin gene was regulated or several expansin genes were co-regulated by both internal and environmental factors. 35 of these genes were identified as being expressed according to a microarray analysis. In contrast to most NtEXPAs which had higher expression levels in young organs, NtEXLAs and NtEXLBs were preferentially expressed in mature or senescent tissues, suggesting that they might play different roles in different organs or at different developmental stages. As the first step towards genome-wide analysis of the tobacco expansin gene family, our work provides solid background information related to structure, evolution and expression as well as regulatory cis-acting elements of the tobacco expansins. This

  8. Genome-Wide Identification, Characterization and Expression Analysis of the TCP Gene Family in Prunus mume

    PubMed Central

    Zhou, Yuzhen; Xu, Zongda; Zhao, Kai; Yang, Weiru; Cheng, Tangren; Wang, Jia; Zhang, Qixiang

    2016-01-01

    TCP proteins, belonging to a plant-specific transcription factors family, are known to have great functions in plant development, especially flower and leaf development. However, there is little information about this gene family in Prunus mume, which is widely cultivated in China as an ornamental and fruit tree. Here a genome-wide analysis of TCP genes was performed to explore their evolution in P. mume. Nineteen PmTCPs were identified and three of them contained putative miR319 target sites. Phylogenetic and comprehensive bioinformatics analyses of these genes revealed that different types of TCP genes had undergone different evolutionary processes and the genes in the same clade had similar chromosomal location, gene structure, and conserved domains. Expression analysis of these PmTCPs indicated that there were diverse expression patterns among different clades. Most TCP genes were predominantly expressed in flower, leaf, and stem, and showed high expression levels in the different stages of flower bud differentiation, especially in petal formation stage and gametophyte development. Genes in TCP-P subfamily had main roles in both flower development and gametophyte development. The CIN genes in double petal cultivars might have key roles in the formation of petal, while they were correlated with gametophyte development in the single petal cultivar. The CYC/TB1 type genes were highly detected in the formation of petal and pistil. The less-complex flower types of P. mume might result from the fact that there were only two CYC type genes present in P. mume and a lack of CYC2 genes to control the identity of flower types. These results lay the foundation for further study on the functions of TCP genes during flower development.

  9. Genome-Wide Identification, Characterization and Expression Analysis of the TCP Gene Family in Prunus mume

    PubMed Central

    Zhou, Yuzhen; Xu, Zongda; Zhao, Kai; Yang, Weiru; Cheng, Tangren; Wang, Jia; Zhang, Qixiang

    2016-01-01

    TCP proteins, belonging to a plant-specific transcription factors family, are known to have great functions in plant development, especially flower and leaf development. However, there is little information about this gene family in Prunus mume, which is widely cultivated in China as an ornamental and fruit tree. Here a genome-wide analysis of TCP genes was performed to explore their evolution in P. mume. Nineteen PmTCPs were identified and three of them contained putative miR319 target sites. Phylogenetic and comprehensive bioinformatics analyses of these genes revealed that different types of TCP genes had undergone different evolutionary processes and the genes in the same clade had similar chromosomal location, gene structure, and conserved domains. Expression analysis of these PmTCPs indicated that there were diverse expression patterns among different clades. Most TCP genes were predominantly expressed in flower, leaf, and stem, and showed high expression levels in the different stages of flower bud differentiation, especially in petal formation stage and gametophyte development. Genes in TCP-P subfamily had main roles in both flower development and gametophyte development. The CIN genes in double petal cultivars might have key roles in the formation of petal, while they were correlated with gametophyte development in the single petal cultivar. The CYC/TB1 type genes were highly detected in the formation of petal and pistil. The less-complex flower types of P. mume might result from the fact that there were only two CYC type genes present in P. mume and a lack of CYC2 genes to control the identity of flower types. These results lay the foundation for further study on the functions of TCP genes during flower development. PMID:27630648

  10. Genome-Wide Identification, Characterization and Expression Analysis of the TCP Gene Family in Prunus mume.

    PubMed

    Zhou, Yuzhen; Xu, Zongda; Zhao, Kai; Yang, Weiru; Cheng, Tangren; Wang, Jia; Zhang, Qixiang

    2016-01-01

    TCP proteins, belonging to a plant-specific transcription factors family, are known to have great functions in plant development, especially flower and leaf development. However, there is little information about this gene family in Prunus mume, which is widely cultivated in China as an ornamental and fruit tree. Here a genome-wide analysis of TCP genes was performed to explore their evolution in P. mume. Nineteen PmTCPs were identified and three of them contained putative miR319 target sites. Phylogenetic and comprehensive bioinformatics analyses of these genes revealed that different types of TCP genes had undergone different evolutionary processes and the genes in the same clade had similar chromosomal location, gene structure, and conserved domains. Expression analysis of these PmTCPs indicated that there were diverse expression patterns among different clades. Most TCP genes were predominantly expressed in flower, leaf, and stem, and showed high expression levels in the different stages of flower bud differentiation, especially in petal formation stage and gametophyte development. Genes in TCP-P subfamily had main roles in both flower development and gametophyte development. The CIN genes in double petal cultivars might have key roles in the formation of petal, while they were correlated with gametophyte development in the single petal cultivar. The CYC/TB1 type genes were highly detected in the formation of petal and pistil. The less-complex flower types of P. mume might result from the fact that there were only two CYC type genes present in P. mume and a lack of CYC2 genes to control the identity of flower types. These results lay the foundation for further study on the functions of TCP genes during flower development. PMID:27630648

  11. Genome-wide age-related changes in DNA methylation and gene expression in human PBMCs.

    PubMed

    Steegenga, Wilma T; Boekschoten, Mark V; Lute, Carolien; Hooiveld, Guido J; de Groot, Philip J; Morris, Tiffany J; Teschendorff, Andrew E; Butcher, Lee M; Beck, Stephan; Müller, Michael

    2014-06-01

    Aging is a progressive process that results in the accumulation of intra- and extracellular alterations that in turn contribute to a reduction in health. Age-related changes in DNA methylation have been reported before and may be responsible for aging-induced changes in gene expression, although a causal relationship has yet to be shown. Using genome-wide assays, we analyzed age-induced changes in DNA methylation and their effect on gene expression with and without transient induction with the synthetic transcription modulating agent WY14,643. To demonstrate feasibility of the approach, we isolated peripheral blood mononucleated cells (PBMCs) from five young and five old healthy male volunteers and cultured them with or without WY14,643. Infinium 450K BeadChip and Affymetrix Human Gene 1.1 ST expression array analysis revealed significant differential methylation of at least 5 % (ΔYO > 5 %) at 10,625 CpG sites between young and old subjects, but only a subset of the associated genes were also differentially expressed. Age-related differential methylation of previously reported epigenetic biomarkers of aging including ELOVL2, FHL2, PENK, and KLF14 was confirmed in our study, but these genes did not display an age-related change in gene expression in PBMCs. Bioinformatic analysis revealed that differentially methylated genes that lack an age-related expression change predominantly represent genes involved in carcinogenesis and developmental processes, and expression of most of these genes were silenced in PBMCs. No changes in DNA methylation were found in genes displaying transiently induced changes in gene expression. In conclusion, aging-induced differential methylation often targets developmental genes and occurs mostly without change in gene expression.

  12. Genome-wide age-related changes in DNA methylation and gene expression in human PBMCs.

    PubMed

    Steegenga, Wilma T; Boekschoten, Mark V; Lute, Carolien; Hooiveld, Guido J; de Groot, Philip J; Morris, Tiffany J; Teschendorff, Andrew E; Butcher, Lee M; Beck, Stephan; Müller, Michael

    2014-06-01

    Aging is a progressive process that results in the accumulation of intra- and extracellular alterations that in turn contribute to a reduction in health. Age-related changes in DNA methylation have been reported before and may be responsible for aging-induced changes in gene expression, although a causal relationship has yet to be shown. Using genome-wide assays, we analyzed age-induced changes in DNA methylation and their effect on gene expression with and without transient induction with the synthetic transcription modulating agent WY14,643. To demonstrate feasibility of the approach, we isolated peripheral blood mononucleated cells (PBMCs) from five young and five old healthy male volunteers and cultured them with or without WY14,643. Infinium 450K BeadChip and Affymetrix Human Gene 1.1 ST expression array analysis revealed significant differential methylation of at least 5 % (ΔYO > 5 %) at 10,625 CpG sites between young and old subjects, but only a subset of the associated genes were also differentially expressed. Age-related differential methylation of previously reported epigenetic biomarkers of aging including ELOVL2, FHL2, PENK, and KLF14 was confirmed in our study, but these genes did not display an age-related change in gene expression in PBMCs. Bioinformatic analysis revealed that differentially methylated genes that lack an age-related expression change predominantly represent genes involved in carcinogenesis and developmental processes, and expression of most of these genes were silenced in PBMCs. No changes in DNA methylation were found in genes displaying transiently induced changes in gene expression. In conclusion, aging-induced differential methylation often targets developmental genes and occurs mostly without change in gene expression. PMID:24789080

  13. Multiple type 2 diabetes susceptibility genes following genome-wide association scan in UK samples

    PubMed Central

    Zeggini, Eleftheria; Weedon, Michael N.; Lindgren, Cecilia M.; Frayling, Timothy M.; Elliott, Katherine S.; Lango, Hana; Timpson, Nicholas J.; Perry, John R.B.; Rayner, Nigel W.; Freathy, Rachel M.; Barrett, Jeffrey C.; Shields, Beverley; Morris, Andrew P.; Ellard, Sian; Groves, Christopher J.; Harries, Lorna W.; Marchini, Jonathan L.; Owen, Katharine R.; Knight, Beatrice; Cardon, Lon R.; Walker, Mark; Hitman, Graham A.; Morris, Andrew D.; Doney, Alex S.F.; McCarthy, Mark I.; Hattersley, Andrew T.

    2013-01-01

    The molecular mechanisms involved in the development of type 2 diabetes are poorly understood. Starting from genome-wide genotype data for 1,924 diabetic cases and 2,938 population controls generated by the Wellcome Trust Case Control Consortium, we set out to detect replicated diabetes association signals through analysis of 3,757 additional cases and 5,346 controls, and by integration of our findings with equivalent data from other international consortia. We detected diabetes susceptibility loci in and around the genes CDKAL1, CDKN2A/CDKN2B and IGF2BP2 and confirmed the recently described associations at HHEX/IDE and SLC30A8. Our findings provide insights into the genetic architecture of type 2 diabetes, emphasizing the contribution of multiple variants of modest effect. The regions identified underscore the importance of pathways influencing pancreatic beta cell development and function in the etiology of type 2 diabetes. PMID:17463249

  14. [A novel method of the genome-wide prediction for the target genes and its application].

    PubMed

    Zhang, Jing-Jing; Feng, Jing; Zhu, Ying-Guo; Li, Yang-Sheng

    2006-10-01

    Based on the protein databases of several model species, this study developed a new method of the Genome-wide prediction for the target genes, using Hidden Markov model by Perl programming. The advantages of this method are high throughput, high quality and easy prediction, especially in the case of multi-domains proteins families. By this method, we predicted the PPR and TPR proteins families in whole genome of several model species. There were 536 PPR proteins and 199 TPR proteins in Oryza sativa ssp. japonica, 519 PPR proteins and 177 TPR proteins in Oryza sativa L. ssp. indica, 735 PPR proteins and 292 TPR proteins in Arabidopsis thaliana, 6 PPR proteins and 32 TPR proteins in Cyanidioschyzon merolae. Synechococcus and Thermophilic archaebacterium did not have PPR proteins. By contrast, 10 TPR proteins were found in Synechococcus and 4 TPR proteins were found in Thermophilic archaebacterium. Moreover, of these results, some further bioinformatics analyses were conducted.

  15. An Integrated Genome-Wide Systems Genetics Screen for Breast Cancer Metastasis Susceptibility Genes.

    PubMed

    Bai, Ling; Yang, Howard H; Hu, Ying; Shukla, Anjali; Ha, Ngoc-Han; Doran, Anthony; Faraji, Farhoud; Goldberger, Natalie; Lee, Maxwell P; Keane, Thomas; Hunter, Kent W

    2016-04-01

    Metastasis remains the primary cause of patient morbidity and mortality in solid tumors and is due to the action of a large number of tumor-autonomous and non-autonomous factors. Here we report the results of a genome-wide integrated strategy to identify novel metastasis susceptibility candidate genes and molecular pathways in breast cancer metastasis. This analysis implicates a number of transcriptional regulators and suggests cell-mediated immunity is an important determinant. Moreover, the analysis identified novel or FDA-approved drugs as potentially useful for anti-metastatic therapy. Further explorations implementing this strategy may therefore provide a variety of information for clinical applications in the control and treatment of advanced neoplastic disease. PMID:27074153

  16. Neuroinformatics for genome-wide 3D gene expression mapping in the mouse brain.

    PubMed

    Ng, Lydia; Pathak, Sayan D; Kuan, Chihchau; Lau, Chris; Dong, Hongwei; Sodt, Andrew; Dang, Chinh; Avants, Brian; Yushkevich, Paul; Gee, James C; Haynor, David; Lein, Ed; Jones, Allan; Hawrylycz, Mike

    2007-01-01

    Large scale gene expression studies in the mammalian brain offer the promise of understanding the topology, networks and ultimately the function of its complex anatomy, opening previously unexplored avenues in neuroscience. High-throughput methods permit genome-wide searches to discover genes that are uniquely expressed in brain circuits and regions that control behavior. Previous gene expression mapping studies in model organisms have employed situ hybridization (ISH), a technique that uses labeled nucleic acid probes to bind to specific mRNA transcripts in tissue sections. A key requirement for this effort is the development of fast and robust algorithms for anatomically mapping and quantifying gene expression for ISH. We describe a neuroinformatics pipeline for automatically mapping expression profiles of ISH data and its use to produce the first genomic scale 3-D mapping of gene expression in a mammalian brain. The pipeline is fully automated and adaptable to other organisms and tissues. Our automated study of over 20,000 genes indicates that at least 78.8 percent are expressed at some level in the adult C56BL/6J mouse brain. In addition to providing a platform for genomic scale search, high-resolution images and visualization tools for expression analysis are available at the Allen Brain Atlas web site (http://www.brain-map.org).

  17. Genome-wide identification and characterization of aquaporin gene family in moso bamboo (Phyllostachys edulis).

    PubMed

    Sun, Huayu; Li, Lichao; Lou, Yongfeng; Zhao, Hansheng; Gao, Zhimin

    2016-05-01

    Aquaporins (AQPs) are known to play a major role in maintaining water and hydraulic conductivity balance in the plant system. Numerous studies have showed AQPs execute multi-function throughout plant growth and development, including water transport, nitrogen, carbon, and micronutrient acquisition etc. However, little information on AQPs is known in bamboo. In this study, we present the first genome-wide identification and characterization of AQP genes in moso bamboo (Phyllostachys edulis) using bioinformatics. In total, 26 AQP genes were identified by homologous analysis, which were divided into four groups (PIPs, TIPs, NIPs, and SIPs) based on the phylogenetic analysis. All the genes were located on 26 different scaffolds respectively on basis of the gene mapped to bamboo genome. Evolutionary analysis indicated that Ph. edulis was more close to Oryza sativa than Zea mays in the genetic relationship. Besides, qRT-PCR was used to analyze gene expression profiles, which revealed that AQP genes were expressed constitutively in all the detected tissues, and were all responsive to the environmental cues such as drought, water, and NaCl stresses. This data suggested that AQPs may play fundamental roles in maintaining normal growth and development of bamboo, which would contribute to better understanding for the complex regulation mechanism involved in the fast-growing process of bamboo. Furthermore, the result could provide valuable information for further research on bamboo functional genomics. PMID:26993482

  18. Detecting Gene-Environment Interactions for a Quantitative Trait in a Genome-Wide Association Study.

    PubMed

    Zhang, Pingye; Lewinger, Juan Pablo; Conti, David; Morrison, John L; Gauderman, W James

    2016-07-01

    A genome-wide association study (GWAS) typically is focused on detecting marginal genetic effects. However, many complex traits are likely to be the result of the interplay of genes and environmental factors. These SNPs may have a weak marginal effect and thus unlikely to be detected from a scan of marginal effects, but may be detectable in a gene-environment (G × E) interaction analysis. However, a genome-wide interaction scan (GWIS) using a standard test of G × E interaction is known to have low power, particularly when one corrects for testing multiple SNPs. Two 2-step methods for GWIS have been previously proposed, aimed at improving efficiency by prioritizing SNPs most likely to be involved in a G × E interaction using a screening step. For a quantitative trait, these include a method that screens on marginal effects [Kooperberg and Leblanc, 2008] and a method that screens on variance heterogeneity by genotype [Paré et al., 2010] In this paper, we show that the Paré et al. approach has an inflated false-positive rate in the presence of an environmental marginal effect, and we propose an alternative that remains valid. We also propose a novel 2-step approach that combines the two screening approaches, and provide simulations demonstrating that the new method can outperform other GWIS approaches. Application of this method to a G × Hispanic-ethnicity scan for childhood lung function reveals a SNP near the MARCO locus that was not identified by previous marginal-effect scans. PMID:27230133

  19. Genome-wide comparison of AP2/ERF superfamily genes between Gossypium arboreum and G. raimondii.

    PubMed

    Lei, Z P; He, D H; Xing, H Y; Tang, B S; Lu, B X

    2016-01-01

    The APETALA2/ethylene response factor (AP2/ERF) transcription factor superfamily is known to regulate diverse processes of plant development and stress responses. We conducted a genome-wide analysis of the AP2/ERF gene in Gossypium arboreum and G. raimondii. Using RPSBLAST and HMMsearch, a total of 271 and 269 AP2/ERF genes were identified in the G. arboreum and G. raimondii genomes, respectively. A phylogenetic analysis classified diploid Gossypium spp AP2/ERF genes into 4 families and 16 subfamilies. Orthologous genes predominated the terminal branch of the phylogenetic tree. Physical mapping showed at least 30% of AP2/ERF genes clustered together. A high level of intra- and inter-species collinearity involving AP2/ERF genes was observed, indicating common (before species divergence) or parallel (after species divergence) segmental duplications, along with tandem duplications, resulting in the species-specific expansion of AP2/ERF genes in diploid Gossypium species. Motif analyses of the AP2/ERF proteins revealed that motif arrangements were highly diverse among subfamilies, but shared by orthologous gene pairs. An examination of nucleotide divergence of AP2/ERF coding regions identified small and non-significant sequence differences among orthologs. Expression profiling of AP2/ERF orthologous gene pairs showed similar abundance levels of orthologous copies between G. arboreum and G. raimondii. Thus, cotton species possess abundant and diverse AP2/ERF genes, resulting from tandem and segmental duplications. Protein and nucleotide sequence and mRNA expression analyses revealed symmetrical evolution, indicating that most AP2/ ERF genes may not have undergone significant biochemical and morphological divergence between sister species. Our study provides detailed insights into the evolutionary characteristics and functional importance of AP2/ERF genes, and could aid in the genetic improvement of agriculturally significant crops in this genus. PMID:27525884

  20. Genome-Wide Survey and Developmental Expression Mapping of Zebrafish SET Domain-Containing Genes

    PubMed Central

    Zhou, Ting; Hu, Ming; Fu, Chun-Tang; Zhang, Yong; Jin, Yi; Chen, Yi; Chen, Sai-Juan; Huang, Qiu-Hua; Liu, Ting Xi; Chen, Zhu

    2008-01-01

    SET domain-containing proteins represent an evolutionarily conserved family of epigenetic regulators, which are responsible for most histone lysine methylation. Since some of these genes have been revealed to be essential for embryonic development, we propose that the zebrafish, a vertebrate model organism possessing many advantages for developmental studies, can be utilized to study the biological functions of these genes and the related epigenetic mechanisms during early development. To this end, we have performed a genome-wide survey of zebrafish SET domain genes. 58 genes total have been identified. Although gene duplication events give rise to several lineage-specific paralogs, clear reciprocal orthologous relationship reveals high conservation between zebrafish and human SET domain genes. These data were further subject to an evolutionary analysis ranging from yeast to human, leading to the identification of putative clusters of orthologous groups (COGs) of this gene family. By means of whole-mount mRNA in situ hybridization strategy, we have also carried out a developmental expression mapping of these genes. A group of maternal SET domain genes, which are implicated in the programming of histone modification states in early development, have been identified and predicted to be responsible for all known sites of SET domain-mediated histone methylation. Furthermore, some genes show specific expression patterns in certain tissues at certain stages, suggesting the involvement of epigenetic mechanisms in the development of these systems. These results provide a global view of zebrafish SET domain histone methyltransferases in evolutionary and developmental dimensions and pave the way for using zebrafish to systematically study the roles of these genes during development. PMID:18231586

  1. Genome-wide analysis of SAUR gene family in Solanaceae species.

    PubMed

    Wu, Jian; Liu, Songyu; He, Yanjun; Guan, Xiaoyan; Zhu, Xiangfei; Cheng, Lin; Wang, Jie; Lu, Gang

    2012-11-01

    The plant hormone auxin plays a vital role in regulating many aspects of plant growth and development. Small auxin up-regulated RNAs (SAURs) are primary auxin response genes hypothesized to be involved in auxin signaling pathway, but their functions remain unclear. Here, a genome-wide search for SAUR gene homologues in Solanaceae species identified 99 and 134 members of SAUR gene family from tomato and potato, respectively. Phylogenetic analysis indicated that the SAUR proteins from Arabidopsis, rice, sorghum, tomato and potato were divided into four major groups with 16 subgroups. Among them, 25 histidine-rich SAURs genes with metal-binding characteristics were found in Arabidopsis, sorghum and Solanaceae species, but not in rice. Using tomato as a model, a comprehensive overview of SAUR gene family is presented, including the gene structures, phylogeny and chromosome locations. Quantitative real-time PCR analysis indicated that 11 randomly selected SlSAUR genes in tomato could be expressed at least in one of the tomato organs/tissues tested. However, different SlSAUR genes displayed distinctive expression levels. SlSAUR16 and SlSAUR71 exhibited highly tissue-specific expression patterns. Almost all of the detected SlSAURs showed an accumulating pattern of mRNA along tomato flower and fruit development. Some of them displayed differential response to exogenous IAA treatment. The abiotic (cold, salt and drought) stresses significantly modified transcript levels of SlSAURs genes. Most of them were down-regulated in response to abiotic stresses (drought, heat and salinity), but SlSAUR58, as a histidine-rich SAUR gene, was up-regulated after salt treatment, indicating that it may play a specific role in the salt signaling transduction pathway. Our comparative analysis provides some basic genomic information for the SAUR genes in the Solanaceae species and will pave the way for deciphering their function during plant development.

  2. Genome-Wide Identification and Expression Analysis of WRKY Gene Family in Capsicum annuum L.

    PubMed Central

    Diao, Wei-Ping; Snyder, John C.; Wang, Shu-Bin; Liu, Jin-Bing; Pan, Bao-Gui; Guo, Guang-Jun; Wei, Ge

    2016-01-01

    The WRKY family of transcription factors is one of the most important families of plant transcriptional regulators with members regulating multiple biological processes, especially in regulating defense against biotic and abiotic stresses. However, little information is available about WRKYs in pepper (Capsicum annuum L.). The recent release of completely assembled genome sequences of pepper allowed us to perform a genome-wide investigation for pepper WRKY proteins. In the present study, a total of 71 WRKY genes were identified in the pepper genome. According to structural features of their encoded proteins, the pepper WRKY genes (CaWRKY) were classified into three main groups, with the second group further divided into five subgroups. Genome mapping analysis revealed that CaWRKY were enriched on four chromosomes, especially on chromosome 1, and 15.5% of the family members were tandemly duplicated genes. A phylogenetic tree was constructed depending on WRKY domain' sequences derived from pepper and Arabidopsis. The expression of 21 selected CaWRKY genes in response to seven different biotic and abiotic stresses (salt, heat shock, drought, Phytophtora capsici, SA, MeJA, and ABA) was evaluated by quantitative RT-PCR; Some CaWRKYs were highly expressed and up-regulated by stress treatment. Our results will provide a platform for functional identification and molecular breeding studies of WRKY genes in pepper. PMID:26941768

  3. Confluence of genes, environment, development, and behavior in a post Genome-Wide Association Study world.

    PubMed

    Vrieze, Scott I; Iacono, William G; McGue, Matt

    2012-11-01

    This article serves to outline a research paradigm to investigate main effects and interactions of genes, environment, and development on behavior and psychiatric illness. We provide a historical context for candidate gene studies and genome-wide association studies, including benefits, limitations, and expected payoffs. Using substance use and abuse as our driving example, we then turn to the importance of etiological psychological theory in guiding genetic, environmental, and developmental research, as well as the utility of refined phenotypic measures, such as endophenotypes, in the pursuit of etiological understanding and focused tests of genetic and environmental associations. Phenotypic measurement has received considerable attention in the history of psychology and is informed by psychometrics, whereas the environment remains relatively poorly measured and is often confounded with genetic effects (i.e., gene-environment correlation). Genetically informed designs, which are no longer limited to twin and adoption studies thanks to ever-cheaper genotyping, are required to understand environmental influences. Finally, we outline the vast amount of individual difference in structural genomic variation, most of which remains to be leveraged in genetic association tests. Although the genetic data can be massive and burdensome (tens of millions of variants per person), we argue that improved understanding of genomic structure and function will provide investigators with new tools to test specific a priori hypotheses derived from etiological psychological theory, much like current candidate gene research but with less confusion and more payoff than candidate gene research has to date. PMID:23062291

  4. Genome-Wide Identification, Characterization and Expression Profiling of ADF Family Genes in Solanum lycopersicum L.

    PubMed Central

    Khatun, Khadiza; Robin, Arif Hasan Khan; Park, Jong-In; Kim, Chang Kil; Lim, Ki-Byung; Kim, Min-Bae; Lee, Do-Jin; Nou, Ill Sup; Chung, Mi-Young

    2016-01-01

    The actin depolymerizing factor (ADF) proteins have growth, development, defense-related and growth regulatory functions in plants. The present study used genome-wide analysis to investigate ADF family genes in tomato. Eleven tomato ADF genes were identified and differential expression patterns were found in different organs. SlADF6 was preferentially expressed in roots, suggesting its function in root development. SlADF1, SlADF3 and SlADF10 were predominately expressed in the flowers compared to the other organs and specifically in the stamen compared to other flower parts, indicating their potential roles in pollen development. The comparatively higher expression of SlADF3 and SlADF11 at early fruit developmental stages might implicate them in determining final fruit size. SlADF5 and SlADF8 had relatively higher levels of expression five days after the breaker stage of fruit development, suggesting their possible role in fruit ripening. Notably, six genes were induced by cold and heat, seven by drought, five by NaCl, and four each by abscisic acid (ABA), jasmonic acid (JA) and wounding treatments. The differential expression patterns of the SlADF genes under different types of stresses suggested their function in stress tolerance in tomato plants. Our results will be helpful for the functional characterization of ADF genes during organ and fruit development of tomato under different stresses. PMID:27690110

  5. Genome-wide functional screen identifies a compendium of genes affecting sensitivity to tamoxifen

    PubMed Central

    Mendes-Pereira, Ana M.; Sims, David; Dexter, Tim; Fenwick, Kerry; Assiotis, Ioannis; Kozarewa, Iwanka; Mitsopoulos, Costas; Hakas, Jarle; Zvelebil, Marketa; Lord, Christopher J.; Ashworth, Alan

    2012-01-01

    Therapies that target estrogen signaling have made a very considerable contribution to reducing mortality from breast cancer. However, resistance to tamoxifen remains a major clinical problem. Here we have used a genome-wide functional profiling approach to identify multiple genes that confer resistance or sensitivity to tamoxifen. Combining whole-genome shRNA screening with massively parallel sequencing, we have profiled the impact of more than 56,670 RNA interference reagents targeting 16,487 genes on the cellular response to tamoxifen. This screen, along with subsequent validation experiments, identifies a compendium of genes whose silencing causes tamoxifen resistance (including BAP1, CLPP, GPRC5D, NAE1, NF1, NIPBL, NSD1, RAD21, RARG, SMC3, and UBA3) and also a set of genes whose silencing causes sensitivity to this endocrine agent (C10orf72, C15orf55/NUT, EDF1, ING5, KRAS, NOC3L, PPP1R15B, RRAS2, TMPRSS2, and TPM4). Multiple individual genes, including NF1, a regulator of RAS signaling, also correlate with clinical outcome after tamoxifen treatment. PMID:21482774

  6. Genome-wide methylation profiling identifies novel methylated genes in neuroblastoma tumors

    PubMed Central

    Olsson, Maja; Beck, Stephan; Kogner, Per; Martinsson, Tommy; Carén, Helena

    2016-01-01

    ABSTRACT Neuroblastoma is a very heterogeneous tumor of childhood. The clinical spectra range from very aggressive metastatic disease to spontaneous regression, even without therapy. Aberrant DNA methylation pattern is a common feature of most cancers. For neuroblastoma, it has been demonstrated both for single genes as well as genome-wide, where a so-called methylator phenotype has been described. Here, we present a study using Illumina 450K methylation arrays on 60 neuroblastoma tumors. We show that aggressive tumors, characterized by International Neuroblastoma Risk Group (INRG) as stage M, are hypermethylated compared to low-grade tumors. On the contrary, INRG stage L tumors display more non-CpG methylation. The genes with the highest number of hypermethylated CpG sites in INRG M tumors are TERT, PCDHGA4, DLX5, and DLX6-AS1. Gene ontology analysis showed a representation of neuronal tumor relevant gene functions among the differentially methylated genes. For validation, we used a set of independent tumors previously analyzed with the Illumina 27K methylation arrays, which confirmed the differentially methylated sites. Top candidate genes with aberrant methylation were analyzed for altered gene expression through the R2 platform (http://r2.amc.nl), and for correlations between methylation and gene expression in a public dataset. Altered expression in nonsurvivors was found for the genes B3GALT4 and KIAA1949, CLIC5, DLX6-AS, TERT, and PIRT, and strongest correlations were found for TRIM36, KIAA0513, and PIRT. Our data indicate that methylation profiling can be used for patient stratification and informs on epigenetically deregulated genes with the potential of increasing our knowledge about the underlying mechanisms of tumor development. PMID:26786290

  7. Semiparametric methods for genome-wide linkage analysis of human gene expression data.

    PubMed

    Diao, Guoqing; Lin, D Y

    2007-01-01

    With the availability of high-throughput microarray technologies, investigators can simultaneously measure the expression levels of many thousands of genes in a short period. Although there are rich statistical methods for analyzing microarray data in the literature, limited work has been done in mapping expression quantitative trait loci (eQTL) that influence the variation in levels of gene expression. Most existing eQTL mapping methods assume that the expression phenotypes follow a normal distribution and violation of the normality assumption may lead to inflated type I error and reduced power. QTL analysis of expression data involves the mapping of many expression phenotypes at thousands or hundreds of thousands of marker loci across the whole genome. An appropriate procedure to adjust for multiple testing is essential for guarding against an abundance of false positive results. In this study, we applied a semiparametric quantitative trait loci (SQTL) mapping method to human gene expression data. The SQTL mapping method is rank-based and therefore robust to non-normality and outliers. Furthermore, we apply an efficient Monte Carlo procedure to account for multiple testing and assess the genome-wide significance level. Particularly, we apply the SQTL mapping method and the Monte-Carlo approach to the gene expression data provided by Genetic Analysis Workshop 15.

  8. Semiparametric methods for genome-wide linkage analysis of human gene expression data

    PubMed Central

    Diao, Guoqing; Lin, DY

    2007-01-01

    With the availability of high-throughput microarray technologies, investigators can simultaneously measure the expression levels of many thousands of genes in a short period. Although there are rich statistical methods for analyzing microarray data in the literature, limited work has been done in mapping expression quantitative trait loci (eQTL) that influence the variation in levels of gene expression. Most existing eQTL mapping methods assume that the expression phenotypes follow a normal distribution and violation of the normality assumption may lead to inflated type I error and reduced power. QTL analysis of expression data involves the mapping of many expression phenotypes at thousands or hundreds of thousands of marker loci across the whole genome. An appropriate procedure to adjust for multiple testing is essential for guarding against an abundance of false positive results. In this study, we applied a semiparametric quantitative trait loci (SQTL) mapping method to human gene expression data. The SQTL mapping method is rank-based and therefore robust to non-normality and outliers. Furthermore, we apply an efficient Monte Carlo procedure to account for multiple testing and assess the genome-wide significance level. Particularly, we apply the SQTL mapping method and the Monte-Carlo approach to the gene expression data provided by Genetic Analysis Workshop 15. PMID:18466586

  9. A Genome-wide CRISPR Screen in Toxoplasma Identifies Essential Apicomplexan Genes.

    PubMed

    Sidik, Saima M; Huet, Diego; Ganesan, Suresh M; Huynh, My-Hang; Wang, Tim; Nasamu, Armiyaw S; Thiru, Prathapan; Saeij, Jeroen P J; Carruthers, Vern B; Niles, Jacquin C; Lourido, Sebastian

    2016-09-01

    Apicomplexan parasites are leading causes of human and livestock diseases such as malaria and toxoplasmosis, yet most of their genes remain uncharacterized. Here, we present the first genome-wide genetic screen of an apicomplexan. We adapted CRISPR/Cas9 to assess the contribution of each gene from the parasite Toxoplasma gondii during infection of human fibroblasts. Our analysis defines ∼200 previously uncharacterized, fitness-conferring genes unique to the phylum, from which 16 were investigated, revealing essential functions during infection of human cells. Secondary screens identify as an invasion factor the claudin-like apicomplexan microneme protein (CLAMP), which resembles mammalian tight-junction proteins and localizes to secretory organelles, making it critical to the initiation of infection. CLAMP is present throughout sequenced apicomplexan genomes and is essential during the asexual stages of the malaria parasite Plasmodium falciparum. These results provide broad-based functional information on T. gondii genes and will facilitate future approaches to expand the horizon of antiparasitic interventions. PMID:27594426

  10. Genome-wide characterization of phenylalanine ammonia-lyase gene family in watermelon (Citrullus lanatus).

    PubMed

    Dong, Chun-Juan; Shang, Qing-Mao

    2013-07-01

    Phenylalanine ammonia-lyase (PAL), the first enzyme in the phenylpropanoid pathway, plays a critical role in plant growth, development, and adaptation. PAL enzymes are encoded by a gene family in plants. Here, we report a genome-wide search for PAL genes in watermelon. A total of 12 PAL genes, designated ClPAL1-12, are identified . Nine are arranged in tandem in two duplication blocks located on chromosomes 4 and 7, and the other three ClPAL genes are distributed as single copies on chromosomes 2, 3, and 8. Both the cDNA and protein sequences of ClPALs share an overall high identity with each other. A phylogenetic analysis places 11 of the ClPALs into a separate cucurbit subclade, whereas ClPAL2, which belongs to neither monocots nor dicots, may serve as an ancestral PAL in plants. In the cucurbit subclade, seven ClPALs form homologous pairs with their counterparts from cucumber. Expression profiling reveals that 11 of the ClPAL genes are expressed and show preferential expression in the stems and male and female flowers. Six of the 12 ClPALs are moderately or strongly expressed in the fruits, particularly in the pulp, suggesting the potential roles of PAL in the development of fruit color and flavor. A promoter motif analysis of the ClPAL genes implies redundant but distinctive cis-regulatory structures for stress responsiveness. Finally, duplication events during the evolution and expansion of the ClPAL gene family are discussed, and the relationships between the ClPAL genes and their cucumber orthologs are estimated.

  11. Genome-wide association study of behavioral, physiological and gene expression traits in outbred CFW mice.

    PubMed

    Parker, Clarissa C; Gopalakrishnan, Shyam; Carbonetto, Peter; Gonzales, Natalia M; Leung, Emily; Park, Yeonhee J; Aryee, Emmanuel; Davis, Joe; Blizard, David A; Ackert-Bicknell, Cheryl L; Lionikas, Arimantas; Pritchard, Jonathan K; Palmer, Abraham A

    2016-08-01

    Although mice are the most widely used mammalian model organism, genetic studies have suffered from limited mapping resolution due to extensive linkage disequilibrium (LD) that is characteristic of crosses among inbred strains. Carworth Farms White (CFW) mice are a commercially available outbred mouse population that exhibit rapid LD decay in comparison to other available mouse populations. We performed a genome-wide association study (GWAS) of behavioral, physiological and gene expression phenotypes using 1,200 male CFW mice. We used genotyping by sequencing (GBS) to obtain genotypes at 92,734 SNPs. We also measured gene expression using RNA sequencing in three brain regions. Our study identified numerous behavioral, physiological and expression quantitative trait loci (QTLs). We integrated the behavioral QTL and eQTL results to implicate specific genes, including Azi2 in sensitivity to methamphetamine and Zmynd11 in anxiety-like behavior. The combination of CFW mice, GBS and RNA sequencing constitutes a powerful approach to GWAS in mice. PMID:27376237

  12. Genome-wide survey of the gene expression response to saprolegniasis in Atlantic salmon.

    PubMed

    Roberge, Christian; Páez, David J; Rossignol, Orlane; Guderley, Helga; Dodson, Julian; Bernatchez, Louis

    2007-02-01

    Pathogenic saprolegniaceae species are among the major disease-causing agents in farmed salmonids and in freshwater fish in general. Recent studies have used high-throughput cDNA-based methods to identify new potential actors of fish defence systems against various bacteria and viruses. However, the response of fish to fungal or fungus-like pathogens is still poorly documented. Here, we used a 16,006-gene salmonid cDNA microarray to identify genes which transcription levels are modified in juvenile Atlantic salmon (Salmo salar) affected with saprolegniasis compared to healthy fish from the same families. Our results confirmed the importance of non-specific immunity in the response of fish to saprolegniaceae infections and identified both similarities and differences in their genome-wide transcriptional response to oomycetes compared with their responses to bacterial or viral infections. Moreover, several clones with no known homologues were shown to be over-transcribed in infected fish. These may represent as yet unidentified immune-relevant genes in fish. PMID:16806477

  13. Genome-wide identification and comparison of legume MLO gene family.

    PubMed

    Rispail, Nicolas; Rubiales, Diego

    2016-01-01

    MLO proteins are highly conserved proteins with seven trans-membrane domains. Specific MLO genes have been linked to plant disease susceptibility. Others are involved in plant reproduction and in root thigmomorphogenesis. Functions of the remaining MLOs are still unknown. Here we performed a genome-wide survey of the MLO family in eight legume species from different clades of the Papillionoideae sub-family. A total of 118 MLO sequences were identified and characterized. Their deduced protein sequences shared the characteristics of MLO proteins. The total number of MLO genes per legume species varied from 13 to 20 depending on the species. Legume MLOs were evenly distributed over their genomes and tended to localize within syntenic blocks conserved across legume genomes. Phylogenetic analysis indicated that these sequences clustered in seven well-defined clades. Comparison of MLO protein sequences revealed 34 clade-specific motifs in the variable regions of the proteins. Comparative analyses of the MLO family between legume species also uncovered several evolutionary differences between the tropical legume species from the Phaseoloid clades and the other legume species. Altogether, this study provides interesting new features on the evolution of the MLO family. It also provides valuable clues to identify additional MLO genes from non-sequenced species. PMID:27596925

  14. Genome-wide quantification of homeolog expression ratio revealed nonstochastic gene regulation in synthetic allopolyploid Arabidopsis.

    PubMed

    Akama, Satoru; Shimizu-Inatsugi, Rie; Shimizu, Kentaro K; Sese, Jun

    2014-04-01

    Genome duplication with hybridization, or allopolyploidization, occurs commonly in plants, and is considered to be a strong force for generating new species. However, genome-wide quantification of homeolog expression ratios was technically hindered because of the high homology between homeologous gene pairs. To quantify the homeolog expression ratio using RNA-seq obtained from polyploids, a new method named HomeoRoq was developed, in which the genomic origin of sequencing reads was estimated using mismatches between the read and each parental genome. To verify this method, we first assembled the two diploid parental genomes of Arabidopsis halleri subsp. gemmifera and Arabidopsis lyrata subsp. petraea (Arabidopsis petraea subsp. umbrosa), then generated a synthetic allotetraploid, mimicking the natural allopolyploid Arabidopsis kamchatica. The quantified ratios corresponded well to those obtained by Pyrosequencing. We found that the ratios of homeologs before and after cold stress treatment were highly correlated (r = 0.870). This highlights the presence of nonstochastic polyploid gene regulation despite previous research identifying stochastic variation in expression. Moreover, our new statistical test incorporating overdispersion identified 226 homeologs (1.11% of 20 369 expressed homeologs) with significant ratio changes, many of which were related to stress responses. HomeoRoq would contribute to the study of the genes responsible for polyploid-specific environmental responses. PMID:24423873

  15. Genome-wide scans provide evidence for positive selection of genes implicated in Lassa fever

    PubMed Central

    Andersen, Kristian G.; Shylakhter, Ilya; Tabrizi, Shervin; Grossman, Sharon R.; Happi, Christian T.; Sabeti, Pardis C.

    2012-01-01

    Rapidly evolving viruses and other pathogens can have an immense impact on human evolution as natural selection acts to increase the prevalence of genetic variants providing resistance to disease. With the emergence of large datasets of human genetic variation, we can search for signatures of natural selection in the human genome driven by such disease-causing microorganisms. Based on this approach, we have previously hypothesized that Lassa virus (LASV) may have been a driver of natural selection in West African populations where Lassa haemorrhagic fever is endemic. In this study, we provide further evidence for this notion. By applying tests for selection to genome-wide data from the International Haplotype Map Consortium and the 1000 Genomes Consortium, we demonstrate evidence for positive selection in LARGE and interleukin 21 (IL21), two genes implicated in LASV infectivity and immunity. We further localized the signals of selection, using the recently developed composite of multiple signals method, to introns and putative regulatory regions of those genes. Our results suggest that natural selection may have targeted variants giving rise to alternative splicing or differential gene expression of LARGE and IL21. Overall, our study supports the hypothesis that selective pressures imposed by LASV may have led to the emergence of particular alleles conferring resistance to Lassa fever, and opens up new avenues of research pursuit. PMID:22312054

  16. Genome-wide identification and comparison of legume MLO gene family

    PubMed Central

    Rispail, Nicolas; Rubiales, Diego

    2016-01-01

    MLO proteins are highly conserved proteins with seven trans-membrane domains. Specific MLO genes have been linked to plant disease susceptibility. Others are involved in plant reproduction and in root thigmomorphogenesis. Functions of the remaining MLOs are still unknown. Here we performed a genome-wide survey of the MLO family in eight legume species from different clades of the Papillionoideae sub-family. A total of 118 MLO sequences were identified and characterized. Their deduced protein sequences shared the characteristics of MLO proteins. The total number of MLO genes per legume species varied from 13 to 20 depending on the species. Legume MLOs were evenly distributed over their genomes and tended to localize within syntenic blocks conserved across legume genomes. Phylogenetic analysis indicated that these sequences clustered in seven well-defined clades. Comparison of MLO protein sequences revealed 34 clade-specific motifs in the variable regions of the proteins. Comparative analyses of the MLO family between legume species also uncovered several evolutionary differences between the tropical legume species from the Phaseoloid clades and the other legume species. Altogether, this study provides interesting new features on the evolution of the MLO family. It also provides valuable clues to identify additional MLO genes from non-sequenced species. PMID:27596925

  17. Genome-Wide Association Study Identifies Candidate Genes for Starch Content Regulation in Maize Kernels.

    PubMed

    Liu, Na; Xue, Yadong; Guo, Zhanyong; Li, Weihua; Tang, Jihua

    2016-01-01

    Kernel starch content is an important trait in maize (Zea mays L.) as it accounts for 65-75% of the dry kernel weight and positively correlates with seed yield. A number of starch synthesis-related genes have been identified in maize in recent years. However, many loci underlying variation in starch content among maize inbred lines still remain to be identified. The current study is a genome-wide association study that used a set of 263 maize inbred lines. In this panel, the average kernel starch content was 66.99%, ranging from 60.60 to 71.58% over the three study years. These inbred lines were genotyped with the SNP50 BeadChip maize array, which is comprised of 56,110 evenly spaced, random SNPs. Population structure was controlled by a mixed linear model (MLM) as implemented in the software package TASSEL. After the statistical analyses, four SNPs were identified as significantly associated with starch content (P ≤ 0.0001), among which one each are located on chromosomes 1 and 5 and two are on chromosome 2. Furthermore, 77 candidate genes associated with starch synthesis were found within the 100-kb intervals containing these four QTLs, and four highly associated genes were within 20-kb intervals of the associated SNPs. Among the four genes, Glucose-1-phosphate adenylyltransferase (APS1; Gene ID GRMZM2G163437) is known as an important regulator of kernel starch content. The identified SNPs, QTLs, and candidate genes may not only be readily used for germplasm improvement by marker-assisted selection in breeding, but can also elucidate the genetic basis of starch content. Further studies on these identified candidate genes may help determine the molecular mechanisms regulating kernel starch content in maize and other important cereal crops. PMID:27512395

  18. Genome-Wide Association Study Identifies Candidate Genes for Starch Content Regulation in Maize Kernels

    PubMed Central

    Liu, Na; Xue, Yadong; Guo, Zhanyong; Li, Weihua; Tang, Jihua

    2016-01-01

    Kernel starch content is an important trait in maize (Zea mays L.) as it accounts for 65–75% of the dry kernel weight and positively correlates with seed yield. A number of starch synthesis-related genes have been identified in maize in recent years. However, many loci underlying variation in starch content among maize inbred lines still remain to be identified. The current study is a genome-wide association study that used a set of 263 maize inbred lines. In this panel, the average kernel starch content was 66.99%, ranging from 60.60 to 71.58% over the three study years. These inbred lines were genotyped with the SNP50 BeadChip maize array, which is comprised of 56,110 evenly spaced, random SNPs. Population structure was controlled by a mixed linear model (MLM) as implemented in the software package TASSEL. After the statistical analyses, four SNPs were identified as significantly associated with starch content (P ≤ 0.0001), among which one each are located on chromosomes 1 and 5 and two are on chromosome 2. Furthermore, 77 candidate genes associated with starch synthesis were found within the 100-kb intervals containing these four QTLs, and four highly associated genes were within 20-kb intervals of the associated SNPs. Among the four genes, Glucose-1-phosphate adenylyltransferase (APS1; Gene ID GRMZM2G163437) is known as an important regulator of kernel starch content. The identified SNPs, QTLs, and candidate genes may not only be readily used for germplasm improvement by marker-assisted selection in breeding, but can also elucidate the genetic basis of starch content. Further studies on these identified candidate genes may help determine the molecular mechanisms regulating kernel starch content in maize and other important cereal crops. PMID:27512395

  19. Genome-wide identification and characterization of WRKY gene family in Salix suchowensis.

    PubMed

    Bi, Changwei; Xu, Yiqing; Ye, Qiaolin; Yin, Tongming; Ye, Ning

    2016-01-01

    WRKY proteins are the zinc finger transcription factors that were first identified in plants. They can specifically interact with the W-box, which can be found in the promoter region of a large number of plant target genes, to regulate the expressions of downstream target genes. They also participate in diverse physiological and growing processes in plants. Prior to this study, a plenty of WRKY genes have been identified and characterized in herbaceous species, but there is no large-scale study of WRKY genes in willow. With the whole genome sequencing of Salix suchowensis, we have the opportunity to conduct the genome-wide research for willow WRKY gene family. In this study, we identified 85 WRKY genes in the willow genome and renamed them from SsWRKY1 to SsWRKY85 on the basis of their specific distributions on chromosomes. Due to their diverse structural features, the 85 willow WRKY genes could be further classified into three main groups (group I-III), with five subgroups (IIa-IIe) in group II. With the multiple sequence alignment and the manual search, we found three variations of the WRKYGQK heptapeptide: WRKYGRK, WKKYGQK and WRKYGKK, and four variations of the normal zinc finger motif, which might execute some new biological functions. In addition, the SsWRKY genes from the same subgroup share the similar exon-intron structures and conserved motif domains. Further studies of SsWRKY genes revealed that segmental duplication events (SDs) played a more prominent role in the expansion of SsWRKY genes. Distinct expression profiles of SsWRKY genes with RNA sequencing data revealed that diverse expression patterns among five tissues, including tender roots, young leaves, vegetative buds, non-lignified stems and barks. With the analyses of WRKY gene family in willow, it is not only beneficial to complete the functional and annotation information of WRKY genes family in woody plants, but also provide important references to investigate the expansion and evolution of

  20. Genome-Wide Analysis and Characterization of Aux/IAA Family Genes in Brassica rapa

    PubMed Central

    Rameneni, Jana Jeevan; Li, Xiaonan; Sivanandhan, Ganesan; Choi, Su Ryun; Pang, Wenxing; Im, Subin; Lim, Yong Pyo

    2016-01-01

    Auxins are the key players in plant growth development involving leaf formation, phototropism, root, fruit and embryo development. Auxin/Indole-3-Acetic Acid (Aux/IAA) are early auxin response genes noted as transcriptional repressors in plant auxin signaling. However, many studies focus on Aux/ARF gene families and much less is known about the Aux/IAA gene family in Brassica rapa (B. rapa). Here we performed a comprehensive genome-wide analysis and identified 55 Aux/IAA genes in B. rapa using four conserved motifs of Aux/IAA family (PF02309). Chromosomal mapping of the B. rapa Aux/IAA (BrIAA) genes facilitated understanding cluster rearrangement of the crucifer building blocks in the genome. Phylogenetic analysis of BrIAA with Arabidopsis thaliana, Oryza sativa and Zea mays identified 51 sister pairs including 15 same species (BrIAA—BrIAA) and 36 cross species (BrIAA—AtIAA) IAA genes. Among the 55 BrIAA genes, expression of 43 and 45 genes were verified using Genebank B. rapa ESTs and in home developed microarray data from mature leaves of Chiifu and RcBr lines. Despite their huge morphological difference, tissue specific expression analysis of BrIAA genes between the parental lines Chiifu and RcBr showed that the genes followed a similar pattern of expression during leaf development and a different pattern during bud, flower and siliqua development stages. The response of the BrIAA genes to abiotic and auxin stress at different time intervals revealed their involvement in stress response. Single Nucleotide Polymorphisms between IAA genes of reference genome Chiifu and RcBr were focused and identified. Our study examines the scope of conservation and divergence of Aux/IAA genes and their structures in B. rapa. Analyzing the expression and structural variation between two parental lines will significantly contribute to functional genomics of Brassica crops and we belive our study would provide a foundation in understanding the Aux/IAA genes in B. rapa. PMID

  1. Genome-wide identification and characterization of WRKY gene family in Salix suchowensis

    PubMed Central

    Ye, Qiaolin; Yin, Tongming

    2016-01-01

    WRKY proteins are the zinc finger transcription factors that were first identified in plants. They can specifically interact with the W-box, which can be found in the promoter region of a large number of plant target genes, to regulate the expressions of downstream target genes. They also participate in diverse physiological and growing processes in plants. Prior to this study, a plenty of WRKY genes have been identified and characterized in herbaceous species, but there is no large-scale study of WRKY genes in willow. With the whole genome sequencing of Salix suchowensis, we have the opportunity to conduct the genome-wide research for willow WRKY gene family. In this study, we identified 85 WRKY genes in the willow genome and renamed them from SsWRKY1 to SsWRKY85 on the basis of their specific distributions on chromosomes. Due to their diverse structural features, the 85 willow WRKY genes could be further classified into three main groups (group I–III), with five subgroups (IIa–IIe) in group II. With the multiple sequence alignment and the manual search, we found three variations of the WRKYGQK heptapeptide: WRKYGRK, WKKYGQK and WRKYGKK, and four variations of the normal zinc finger motif, which might execute some new biological functions. In addition, the SsWRKY genes from the same subgroup share the similar exon–intron structures and conserved motif domains. Further studies of SsWRKY genes revealed that segmental duplication events (SDs) played a more prominent role in the expansion of SsWRKY genes. Distinct expression profiles of SsWRKY genes with RNA sequencing data revealed that diverse expression patterns among five tissues, including tender roots, young leaves, vegetative buds, non-lignified stems and barks. With the analyses of WRKY gene family in willow, it is not only beneficial to complete the functional and annotation information of WRKY genes family in woody plants, but also provide important references to investigate the expansion and evolution

  2. Genome-Wide Analysis and Characterization of Aux/IAA Family Genes in Brassica rapa.

    PubMed

    Paul, Parameswari; Dhandapani, Vignesh; Rameneni, Jana Jeevan; Li, Xiaonan; Sivanandhan, Ganesan; Choi, Su Ryun; Pang, Wenxing; Im, Subin; Lim, Yong Pyo

    2016-01-01

    Auxins are the key players in plant growth development involving leaf formation, phototropism, root, fruit and embryo development. Auxin/Indole-3-Acetic Acid (Aux/IAA) are early auxin response genes noted as transcriptional repressors in plant auxin signaling. However, many studies focus on Aux/ARF gene families and much less is known about the Aux/IAA gene family in Brassica rapa (B. rapa). Here we performed a comprehensive genome-wide analysis and identified 55 Aux/IAA genes in B. rapa using four conserved motifs of Aux/IAA family (PF02309). Chromosomal mapping of the B. rapa Aux/IAA (BrIAA) genes facilitated understanding cluster rearrangement of the crucifer building blocks in the genome. Phylogenetic analysis of BrIAA with Arabidopsis thaliana, Oryza sativa and Zea mays identified 51 sister pairs including 15 same species (BrIAA-BrIAA) and 36 cross species (BrIAA-AtIAA) IAA genes. Among the 55 BrIAA genes, expression of 43 and 45 genes were verified using Genebank B. rapa ESTs and in home developed microarray data from mature leaves of Chiifu and RcBr lines. Despite their huge morphological difference, tissue specific expression analysis of BrIAA genes between the parental lines Chiifu and RcBr showed that the genes followed a similar pattern of expression during leaf development and a different pattern during bud, flower and siliqua development stages. The response of the BrIAA genes to abiotic and auxin stress at different time intervals revealed their involvement in stress response. Single Nucleotide Polymorphisms between IAA genes of reference genome Chiifu and RcBr were focused and identified. Our study examines the scope of conservation and divergence of Aux/IAA genes and their structures in B. rapa. Analyzing the expression and structural variation between two parental lines will significantly contribute to functional genomics of Brassica crops and we belive our study would provide a foundation in understanding the Aux/IAA genes in B. rapa.

  3. Genome-Wide Analysis and Characterization of Aux/IAA Family Genes in Brassica rapa.

    PubMed

    Paul, Parameswari; Dhandapani, Vignesh; Rameneni, Jana Jeevan; Li, Xiaonan; Sivanandhan, Ganesan; Choi, Su Ryun; Pang, Wenxing; Im, Subin; Lim, Yong Pyo

    2016-01-01

    Auxins are the key players in plant growth development involving leaf formation, phototropism, root, fruit and embryo development. Auxin/Indole-3-Acetic Acid (Aux/IAA) are early auxin response genes noted as transcriptional repressors in plant auxin signaling. However, many studies focus on Aux/ARF gene families and much less is known about the Aux/IAA gene family in Brassica rapa (B. rapa). Here we performed a comprehensive genome-wide analysis and identified 55 Aux/IAA genes in B. rapa using four conserved motifs of Aux/IAA family (PF02309). Chromosomal mapping of the B. rapa Aux/IAA (BrIAA) genes facilitated understanding cluster rearrangement of the crucifer building blocks in the genome. Phylogenetic analysis of BrIAA with Arabidopsis thaliana, Oryza sativa and Zea mays identified 51 sister pairs including 15 same species (BrIAA-BrIAA) and 36 cross species (BrIAA-AtIAA) IAA genes. Among the 55 BrIAA genes, expression of 43 and 45 genes were verified using Genebank B. rapa ESTs and in home developed microarray data from mature leaves of Chiifu and RcBr lines. Despite their huge morphological difference, tissue specific expression analysis of BrIAA genes between the parental lines Chiifu and RcBr showed that the genes followed a similar pattern of expression during leaf development and a different pattern during bud, flower and siliqua development stages. The response of the BrIAA genes to abiotic and auxin stress at different time intervals revealed their involvement in stress response. Single Nucleotide Polymorphisms between IAA genes of reference genome Chiifu and RcBr were focused and identified. Our study examines the scope of conservation and divergence of Aux/IAA genes and their structures in B. rapa. Analyzing the expression and structural variation between two parental lines will significantly contribute to functional genomics of Brassica crops and we belive our study would provide a foundation in understanding the Aux/IAA genes in B. rapa. PMID

  4. Genome-wide identification and characterization of WRKY gene family in Salix suchowensis

    PubMed Central

    Ye, Qiaolin; Yin, Tongming

    2016-01-01

    WRKY proteins are the zinc finger transcription factors that were first identified in plants. They can specifically interact with the W-box, which can be found in the promoter region of a large number of plant target genes, to regulate the expressions of downstream target genes. They also participate in diverse physiological and growing processes in plants. Prior to this study, a plenty of WRKY genes have been identified and characterized in herbaceous species, but there is no large-scale study of WRKY genes in willow. With the whole genome sequencing of Salix suchowensis, we have the opportunity to conduct the genome-wide research for willow WRKY gene family. In this study, we identified 85 WRKY genes in the willow genome and renamed them from SsWRKY1 to SsWRKY85 on the basis of their specific distributions on chromosomes. Due to their diverse structural features, the 85 willow WRKY genes could be further classified into three main groups (group I–III), with five subgroups (IIa–IIe) in group II. With the multiple sequence alignment and the manual search, we found three variations of the WRKYGQK heptapeptide: WRKYGRK, WKKYGQK and WRKYGKK, and four variations of the normal zinc finger motif, which might execute some new biological functions. In addition, the SsWRKY genes from the same subgroup share the similar exon–intron structures and conserved motif domains. Further studies of SsWRKY genes revealed that segmental duplication events (SDs) played a more prominent role in the expansion of SsWRKY genes. Distinct expression profiles of SsWRKY genes with RNA sequencing data revealed that diverse expression patterns among five tissues, including tender roots, young leaves, vegetative buds, non-lignified stems and barks. With the analyses of WRKY gene family in willow, it is not only beneficial to complete the functional and annotation information of WRKY genes family in woody plants, but also provide important references to investigate the expansion and evolution

  5. Genome-Wide Scans for Delineation of Candidate Genes Regulating Seed-Protein Content in Chickpea

    PubMed Central

    Upadhyaya, Hari D.; Bajaj, Deepak; Narnoliya, Laxmi; Das, Shouvik; Kumar, Vinod; Gowda, C. L. L.; Sharma, Shivali; Tyagi, Akhilesh K.; Parida, Swarup K.

    2016-01-01

    Identification of potential genes/alleles governing complex seed-protein content (SPC) is essential in marker-assisted breeding for quality trait improvement of chickpea. Henceforth, the present study utilized an integrated genomics-assisted breeding strategy encompassing trait association analysis, selective genotyping in traditional bi-parental mapping population and differential expression profiling for the first-time to understand the complex genetic architecture of quantitative SPC trait in chickpea. For GWAS (genome-wide association study), high-throughput genotyping information of 16376 genome-based SNPs (single nucleotide polymorphism) discovered from a structured population of 336 sequenced desi and kabuli accessions [with 150–200 kb LD (linkage disequilibrium) decay] was utilized. This led to identification of seven most effective genomic loci (genes) associated [10–20% with 41% combined PVE (phenotypic variation explained)] with SPC trait in chickpea. Regardless of the diverse desi and kabuli genetic backgrounds, a comparable level of association potential of the identified seven genomic loci with SPC trait was observed. Five SPC-associated genes were validated successfully in parental accessions and homozygous individuals of an intra-specific desi RIL (recombinant inbred line) mapping population (ICC 12299 × ICC 4958) by selective genotyping. The seed-specific expression, including differential up-regulation (>four fold) of six SPC-associated genes particularly in accessions, parents and homozygous individuals of the aforementioned mapping population with a high level of contrasting SPC (21–22%) was evident. Collectively, the integrated genomic approach delineated diverse naturally occurring novel functional SNP allelic variants in six potential candidate genes regulating SPC trait in chickpea. Of these, a non-synonymous SNP allele-carrying zinc finger transcription factor gene exhibiting strong association with SPC trait was found to be the most

  6. Impact of high predation risk on genome-wide hippocampal gene expression in snowshoe hares.

    PubMed

    Lavergne, Sophia G; McGowan, Patrick O; Krebs, Charles J; Boonstra, Rudy

    2014-11-01

    The population dynamics of snowshoe hares (Lepus americanus) are fundamental to the ecosystem dynamics of Canada's boreal forest. During the 8- to 11-year population cycle, hare densities can fluctuate up to 40-fold. Predators in this system (lynx, coyotes, great-horned owls) affect population numbers not only through direct mortality but also through sublethal effects. The chronic stress hypothesis posits that high predation risk during the decline severely stresses hares, leading to greater stress responses, heightened ability to mobilize cortisol and energy, and a poorer body condition. These effects may result in, or be mediated by, differential gene expression. We used an oligonucleotide microarray designed for a closely-related species, the European rabbit (Oryctolagus cuniculus), to characterize differences in genome-wide hippocampal RNA transcript abundance in wild hares from the Yukon during peak and decline phases of a single cycle. A total of 106 genes were differentially regulated between phases. Array results were validated with quantitative real-time PCR, and mammalian protein sequence similarity was used to infer gene function. In comparison to hares from the peak, decline phase hares showed increased expression of genes involved in metabolic processes and hormone response, and decreased expression of immune response and blood cell formation genes. We found evidence for predation risk effects on the expression of genes whose putative functions correspond with physiological impacts known to be induced by predation risk in snowshoe hares. This study shows, for the first time, a link between changes in demography and alterations in neural RNA transcript abundance in a natural population.

  7. Genome-wide identification and characterization of polygalacturonase genes in Cucumis sativus and Citrullus lanatus.

    PubMed

    Yu, Youjian; Liang, Ying; Lv, Meiling; Wu, Jian; Lu, Gang; Cao, Jiashu

    2014-01-01

    Polygalacturonase (PG, EC3.2.1.15), one of the hydrolytic enzymes associated with the modification of pectin network in plant cell wall, has an important role in various cell-separation processes that are essential for plant development. PGs are encoded by a large gene family in plants. However, information on this gene family in plant development remains limited. In the present study, 53 and 62 putative members of the PG gene family in cucumber and watermelon genomes, respectively, were identified by genome-wide search to explore the composition, structure, and evolution of the PG family in Cucurbitaceae crops. The results showed that tandem duplication could be an important factor that contributes to the expansion of the PG genes in the two crops. The phylogenetic and evolutionary analyses suggested that PGs could be classified into seven clades, and that the exon/intron structures and intron phases were conserved within but divergent between clades. At least 24 ancestral PGs were detected in the common ancestor of Arabidopsis and Cucumis sativus. Expression profile analysis by quantitative real-time polymerase chain reaction demonstrated that most CsPGs exhibit specific or high expression pattern in one of the organs/tissues. The 16 CsPGs associated with fruit development could be divided into three subsets based on their specific expression patterns and the cis-elements of fruit-specific, endosperm/seed-specific, and ethylene-responsive exhibited in their promoter regions. Our comparative analysis provided some basic information on the PG gene family, which would be valuable for further functional analysis of the PG genes during plant development.

  8. Genome-Wide Analysis of Antiviral Signature Genes in Porcine Macrophages at Different Activation Statuses

    PubMed Central

    Sang, Yongming; Brichalli, Wyatt; Rowland, Raymond R. R.; Blecha, Frank

    2014-01-01

    Macrophages (MФs) can be polarized to various activation statuses, including classical (M1), alternative (M2), and antiviral states. To study the antiviral activation status of porcine MФs during porcine reproductive and respiratory syndrome virus (PRRSV) infection, we used RNA Sequencing (RNA-Seq) for transcriptomic analysis of differentially expressed genes (DEGs). Sequencing assessment and quality evaluation showed that our RNA-Seq data met the criteria for genome-wide transcriptomic analysis. Comparisons of any two activation statuses revealed more than 20,000 DEGs that were normalized to filter out 153–5,303 significant DEGs [false discovery rate (FDR) ≤0.001, fold change ≥2] in each comparison. The highest 5,303 significant DEGs were found between lipopolysaccharide- (LPS) and interferon (IFN)γ-stimulated M1 cells, whereas only 153 significant DEGs were detected between interleukin (IL)-10-polarized M2 cells and control mock-activated cells. To identify signature genes for antiviral regulation pertaining to each activation status, we identified a set of DEGs that showed significant up-regulation in only one activation state. In addition, pathway analyses defined the top 20–50 significantly regulated pathways at each activation status, and we further analyzed DEGs pertinent to pathways mediated by AMP kinase (AMPK) and epigenetic mechanisms. For the first time in porcine macrophages, our transcriptomic analyses not only compared family-wide differential expression of most known immune genes at different activation statuses, but also revealed transcription evidence of multiple gene families. These findings show that using RNA-Seq transcriptomic analyses in virus-infected and status-synchronized macrophages effectively profiled signature genes and gene response pathways for antiviral regulation, which may provide a framework for optimizing antiviral immunity and immune homeostasis. PMID:24505295

  9. GDSL esterase/lipase genes in Brassica rapa L.: genome-wide identification and expression analysis.

    PubMed

    Dong, Xiangshu; Yi, Hankuil; Han, Ching-Tack; Nou, Ill-Sup; Hur, Yoonkang

    2016-04-01

    GDSL esterase/lipase proteins (GELPs), a very large subfamily of lipolytic enzymes, have been identified in microbes and many plants, but only a few have been characterized with respect to their roles in growth, development, and stress responses. In Brassica crops, as in many other species, genome-wide systematic analysis and functional studies of these genes are still lacking. As a first step to study their function in B. rapa ssp. pekinensis (Chinese cabbage), we comprehensively identified all GELP genes in the genome. We found a total of 121 Brassica rapa GDSL esterase/lipase protein genes (BrGELPs), forming three clades in the phylogenetic analysis (two major and one minor), with an asymmetrical chromosomal distribution. Most BrGELPs possess four strictly conserved residues (Ser-Gly-Asn-His) in four separate conserved regions, along with short conserved and clade-specific blocks, suggesting functional diversification of these proteins. Detailed expression profiling revealed that BrGELPs were expressed in various tissues, including floral organs, implying that BrGELPs play diverse roles in various tissues and during development. Ten percent of BrGELPs were specifically expressed in fertile buds, rather than male-sterile buds, implying their involvement in pollen development. Analyses of EXL6 (extracellular lipase 6) expression and its co-expressed genes in both B. rapa and Arabidopsis, as well as knockdown of this gene in Arabidopsis, revealed that this gene plays an important role in pollen development in both species. The data described in this study will facilitate future investigations of other BrGELP functions.

  10. A Genome-Wide Methylation Approach Identifies a New Hypermethylated Gene Panel in Ulcerative Colitis

    PubMed Central

    Kang, Keunsoo; Bae, Jin-Han; Han, Kyudong; Kim, Eun Soo; Kim, Tae-Oh; Yi, Joo Mi

    2016-01-01

    The cause of inflammatory bowel disease (IBD) is still unknown, but there is growing evidence that environmental factors such as epigenetic changes can contribute to the disease etiology. The aim of this study was to identify newly hypermethylated genes in ulcerative colitis (UC) using a genome-wide DNA methylation approach. Using an Infinium HumanMethylation450 BeadChip array, we screened the DNA methylation changes in three normal colon controls and eight UC patients. Using these methylation profiles, 48 probes associated with CpG promoter methylation showed differential hypermethylation between UC patients and normal controls. Technical validations for methylation analyses in a larger series of UC patients (n = 79) were performed by methylation-specific PCR (MSP) and bisulfite sequencing analysis. We finally found that three genes (FAM217B, KIAA1614 and RIBC2) that were significantly elevating the promoter methylation levels in UC compared to normal controls. Interestingly, we confirmed that three genes were transcriptionally silenced in UC patient samples by qRT-PCR, suggesting that their silencing is correlated with the promoter hypermethylation. Pathway analyses were performed using GO and KEGG databases with differentially hypermethylated genes in UC. Our results highlight that aberrant hypermethylation was identified in UC patients which can be a potential biomarker for detecting UC. Moreover, pathway-enriched hypermethylated genes are possibly implicating important cellular function in the pathogenesis of UC. Overall, this study describes a newly hypermethylated gene panel in UC patients and provides new clinical information that can be used for the diagnosis and therapeutic treatment of IBD. PMID:27517910

  11. Genome-Wide Scans for Delineation of Candidate Genes Regulating Seed-Protein Content in Chickpea.

    PubMed

    Upadhyaya, Hari D; Bajaj, Deepak; Narnoliya, Laxmi; Das, Shouvik; Kumar, Vinod; Gowda, C L L; Sharma, Shivali; Tyagi, Akhilesh K; Parida, Swarup K

    2016-01-01

    Identification of potential genes/alleles governing complex seed-protein content (SPC) is essential in marker-assisted breeding for quality trait improvement of chickpea. Henceforth, the present study utilized an integrated genomics-assisted breeding strategy encompassing trait association analysis, selective genotyping in traditional bi-parental mapping population and differential expression profiling for the first-time to understand the complex genetic architecture of quantitative SPC trait in chickpea. For GWAS (genome-wide association study), high-throughput genotyping information of 16376 genome-based SNPs (single nucleotide polymorphism) discovered from a structured population of 336 sequenced desi and kabuli accessions [with 150-200 kb LD (linkage disequilibrium) decay] was utilized. This led to identification of seven most effective genomic loci (genes) associated [10-20% with 41% combined PVE (phenotypic variation explained)] with SPC trait in chickpea. Regardless of the diverse desi and kabuli genetic backgrounds, a comparable level of association potential of the identified seven genomic loci with SPC trait was observed. Five SPC-associated genes were validated successfully in parental accessions and homozygous individuals of an intra-specific desi RIL (recombinant inbred line) mapping population (ICC 12299 × ICC 4958) by selective genotyping. The seed-specific expression, including differential up-regulation (>four fold) of six SPC-associated genes particularly in accessions, parents and homozygous individuals of the aforementioned mapping population with a high level of contrasting SPC (21-22%) was evident. Collectively, the integrated genomic approach delineated diverse naturally occurring novel functional SNP allelic variants in six potential candidate genes regulating SPC trait in chickpea. Of these, a non-synonymous SNP allele-carrying zinc finger transcription factor gene exhibiting strong association with SPC trait was found to be the most

  12. A Genome-Wide Methylation Approach Identifies a New Hypermethylated Gene Panel in Ulcerative Colitis.

    PubMed

    Kang, Keunsoo; Bae, Jin-Han; Han, Kyudong; Kim, Eun Soo; Kim, Tae-Oh; Yi, Joo Mi

    2016-01-01

    The cause of inflammatory bowel disease (IBD) is still unknown, but there is growing evidence that environmental factors such as epigenetic changes can contribute to the disease etiology. The aim of this study was to identify newly hypermethylated genes in ulcerative colitis (UC) using a genome-wide DNA methylation approach. Using an Infinium HumanMethylation450 BeadChip array, we screened the DNA methylation changes in three normal colon controls and eight UC patients. Using these methylation profiles, 48 probes associated with CpG promoter methylation showed differential hypermethylation between UC patients and normal controls. Technical validations for methylation analyses in a larger series of UC patients (n = 79) were performed by methylation-specific PCR (MSP) and bisulfite sequencing analysis. We finally found that three genes (FAM217B, KIAA1614 and RIBC2) that were significantly elevating the promoter methylation levels in UC compared to normal controls. Interestingly, we confirmed that three genes were transcriptionally silenced in UC patient samples by qRT-PCR, suggesting that their silencing is correlated with the promoter hypermethylation. Pathway analyses were performed using GO and KEGG databases with differentially hypermethylated genes in UC. Our results highlight that aberrant hypermethylation was identified in UC patients which can be a potential biomarker for detecting UC. Moreover, pathway-enriched hypermethylated genes are possibly implicating important cellular function in the pathogenesis of UC. Overall, this study describes a newly hypermethylated gene panel in UC patients and provides new clinical information that can be used for the diagnosis and therapeutic treatment of IBD. PMID:27517910

  13. GDSL esterase/lipase genes in Brassica rapa L.: genome-wide identification and expression analysis.

    PubMed

    Dong, Xiangshu; Yi, Hankuil; Han, Ching-Tack; Nou, Ill-Sup; Hur, Yoonkang

    2016-04-01

    GDSL esterase/lipase proteins (GELPs), a very large subfamily of lipolytic enzymes, have been identified in microbes and many plants, but only a few have been characterized with respect to their roles in growth, development, and stress responses. In Brassica crops, as in many other species, genome-wide systematic analysis and functional studies of these genes are still lacking. As a first step to study their function in B. rapa ssp. pekinensis (Chinese cabbage), we comprehensively identified all GELP genes in the genome. We found a total of 121 Brassica rapa GDSL esterase/lipase protein genes (BrGELPs), forming three clades in the phylogenetic analysis (two major and one minor), with an asymmetrical chromosomal distribution. Most BrGELPs possess four strictly conserved residues (Ser-Gly-Asn-His) in four separate conserved regions, along with short conserved and clade-specific blocks, suggesting functional diversification of these proteins. Detailed expression profiling revealed that BrGELPs were expressed in various tissues, including floral organs, implying that BrGELPs play diverse roles in various tissues and during development. Ten percent of BrGELPs were specifically expressed in fertile buds, rather than male-sterile buds, implying their involvement in pollen development. Analyses of EXL6 (extracellular lipase 6) expression and its co-expressed genes in both B. rapa and Arabidopsis, as well as knockdown of this gene in Arabidopsis, revealed that this gene plays an important role in pollen development in both species. The data described in this study will facilitate future investigations of other BrGELP functions. PMID:26423069

  14. Impact of high predation risk on genome-wide hippocampal gene expression in snowshoe hares.

    PubMed

    Lavergne, Sophia G; McGowan, Patrick O; Krebs, Charles J; Boonstra, Rudy

    2014-11-01

    The population dynamics of snowshoe hares (Lepus americanus) are fundamental to the ecosystem dynamics of Canada's boreal forest. During the 8- to 11-year population cycle, hare densities can fluctuate up to 40-fold. Predators in this system (lynx, coyotes, great-horned owls) affect population numbers not only through direct mortality but also through sublethal effects. The chronic stress hypothesis posits that high predation risk during the decline severely stresses hares, leading to greater stress responses, heightened ability to mobilize cortisol and energy, and a poorer body condition. These effects may result in, or be mediated by, differential gene expression. We used an oligonucleotide microarray designed for a closely-related species, the European rabbit (Oryctolagus cuniculus), to characterize differences in genome-wide hippocampal RNA transcript abundance in wild hares from the Yukon during peak and decline phases of a single cycle. A total of 106 genes were differentially regulated between phases. Array results were validated with quantitative real-time PCR, and mammalian protein sequence similarity was used to infer gene function. In comparison to hares from the peak, decline phase hares showed increased expression of genes involved in metabolic processes and hormone response, and decreased expression of immune response and blood cell formation genes. We found evidence for predation risk effects on the expression of genes whose putative functions correspond with physiological impacts known to be induced by predation risk in snowshoe hares. This study shows, for the first time, a link between changes in demography and alterations in neural RNA transcript abundance in a natural population. PMID:25234370

  15. Nutritional regulation of genome-wide association obesity genes in a tissue-dependent manner

    PubMed Central

    2012-01-01

    Background Genome-wide association studies (GWAS) have recently identified several new genetic variants associated with obesity. The majority of the variants are within introns or between genes, suggesting they affect gene expression, although it is not clear which of the nearby genes they affect. Understanding the regulation of these genes will be key to determining the role of these variants in the development of obesity and will provide support for a role of these genes in the development of obesity. Methods We examined the expression of 19 GWAS obesity genes in the brain and specifically the hypothalamus, adipose tissue and liver of mice by real-time quantitative PCR. To determine whether these genes are nutritionally regulated, as may be expected for genes affecting obesity, we compared tissues from fasting and non-fasting animals and tissues from mice consuming a high fat high sucrose diet in comparison to standard rodent chow. Results We found complex, tissue-dependent patterns of nutritional regulation of most of these genes. For example, Bat2 expression was increased ~10-fold in the brain of fed mice but was lower or unchanged in the hypothalamus and adipose tissue. Kctd15 expression was upregulated in the hypothalamus, brain and adipose tissue of fed mice and downregulated by high fat feeding in liver, adipose tissue and the hypothalamus but not the remainder of the brain. Sh2b1 expression in the brain and Faim2 expression in adipose tissue were specifically increased >20-fold in fed mice. Tmem18 expression in adipose tissue but not the brain was reduced 80% by high fat feeding. Few changes in the expression of these genes were observed in liver. Conclusions These data show nutritional regulation of nearly all these GWAS obesity genes, particularly in the brain and adipose tissue, and provide support for their role in the development of obesity. The complex patterns of nutritional and tissue-dependent regulation also highlight the difficulty that may be

  16. Transport genes and chemotaxis in Laribacter hongkongensis: a genome-wide analysis

    PubMed Central

    2011-01-01

    Background Laribacter hongkongensis is a Gram-negative, sea gull-shaped rod associated with community-acquired gastroenteritis. The bacterium has been found in diverse freshwater environments including fish, frogs and drinking water reservoirs. Using the complete genome sequence data of L. hongkongensis, we performed a comprehensive analysis of putative transport-related genes and genes related to chemotaxis, motility and quorum sensing, which may help the bacterium adapt to the changing environments and combat harmful substances. Results A genome-wide analysis using Transport Classification Database TCDB, similarity and keyword searches revealed the presence of a large diversity of transporters (n = 457) and genes related to chemotaxis (n = 52) and flagellar biosynthesis (n = 40) in the L. hongkongensis genome. The transporters included those from all seven major transporter categories, which may allow the uptake of essential nutrients or ions, and extrusion of metabolic end products and hazardous substances. L. hongkongensis is unique among closely related members of Neisseriaceae family in possessing higher number of proteins related to transport of ammonium, urea and dicarboxylate, which may reflect the importance of nitrogen and dicarboxylate metabolism in this assacharolytic bacterium. Structural modeling of two C4-dicarboxylate transporters showed that they possessed similar structures to the determined structures of other DctP-TRAP transporters, with one having an unusual disulfide bond. Diverse mechanisms for iron transport, including hemin transporters for iron acquisition from host proteins, were also identified. In addition to the chemotaxis and flagella-related genes, the L. hongkongensis genome also contained two copies of qseB/qseC homologues of the AI-3 quorum sensing system. Conclusions The large number of diverse transporters and genes involved in chemotaxis, motility and quorum sensing suggested that the bacterium may utilize a complex system to

  17. Genome-Wide Detection of Fitness Genes in Uropathogenic Escherichia coli during Systemic Infection

    PubMed Central

    Subashchandrabose, Sargurunathan; Smith, Sara N.; Spurbeck, Rachel R.; Kole, Monica M.; Mobley, Harry L. T.

    2013-01-01

    Uropathogenic Escherichia coli (UPEC) is a leading etiological agent of bacteremia in humans. Virulence mechanisms of UPEC in the context of urinary tract infections have been subjected to extensive research. However, understanding of the fitness mechanisms used by UPEC during bacteremia and systemic infection is limited. A forward genetic screen was utilized to detect transposon insertion mutants with fitness defects during colonization of mouse spleens. An inoculum comprised of 360,000 transposon mutants in the UPEC strain CFT073, cultured from the blood of a patient with pyelonephritis, was used to inoculate mice intravenously. Transposon insertion sites in the inoculum (input) and bacteria colonizing the spleen (output) were identified using high-throughput sequencing of transposon-chromosome junctions. Using frequencies of representation of each insertion mutant in the input and output samples, 242 candidate fitness genes were identified. Co-infection experiments with each of 11 defined mutants and the wild-type strain demonstrated that 82% (9 of 11) of the tested candidate fitness genes were required for optimal fitness in a mouse model of systemic infection. Genes involved in biosynthesis of poly-N-acetyl glucosamine (pgaABCD), major and minor pilin of a type IV pilus (c2394 and c2395), oligopeptide uptake periplasmic-binding protein (oppA), sensitive to antimicrobial peptides (sapABCDF), putative outer membrane receptor (yddB), zinc metallopeptidase (pqqL), a shikimate pathway gene (c1220) and autotransporter serine proteases (pic and vat) were further characterized. Here, we report the first genome-wide identification of genes that contribute to fitness in UPEC during systemic infection in a mammalian host. These fitness factors may represent targets for developing novel therapeutics against UPEC. PMID:24339777

  18. Genome-wide analysis of the omega-3 fatty acid desaturase gene family in Gossypium

    DOE PAGES

    Yurchenko, Olga P.; Park, Sunjung; Ilut, Daniel C.; Inmon, Jay J.; Millhollon, Jon C.; Liechty, Zach; Page, Justin T.; Jenks, Matthew A.; Chapman, Kent D.; Udall, Joshua A.; et al

    2014-11-18

    The majority of commercial cotton varieties planted worldwide are derived from Gossypium hirsutum, which is a naturally occurring allotetraploid produced by interspecific hybridization of A- and D-genome diploid progenitor species. While most cotton species are adapted to warm, semi-arid tropical and subtropical regions, and thus perform well in these geographical areas, cotton seedlings are sensitive to cold temperature, which can significantly reduce crop yields. One of the common biochemical responses of plants to cold temperatures is an increase in omega-3 fatty acids, which protects cellular function by maintaining membrane integrity. The purpose of our study was to identify and characterizemore » the omega-3 fatty acid desaturase (FAD) gene family in G. hirsutum, with an emphasis on identifying omega-3 FADs involved in cold temperature adaptation. Results: Eleven omega-3 FAD genes were identified in G. hirsutum, and characterization of the gene family in extant A and D diploid species (G. herbaceum and G. raimondii, respectively) allowed for unambiguous genome assignment of all homoeologs in tetraploid G. hirsutum. The omega-3 FAD family of cotton includes five distinct genes, two of which encode endoplasmic reticulum-type enzymes (FAD3-1 and FAD3-2) and three that encode chloroplast-type enzymes (FAD7/8-1, FAD7/8-2, and FAD7/8-3). The FAD3-2 gene was duplicated in the A genome progenitor species after the evolutionary split from the D progenitor, but before the interspecific hybridization event that gave rise to modern tetraploid cotton. RNA-seq analysis revealed conserved, gene-specific expression patterns in various organs and cell types and semi-quantitative RT-PCR further revealed that FAD7/8-1 was specifically induced during cold temperature treatment of G. hirsutum seedlings. Conclusions: The omega-3 FAD gene family in cotton was characterized at the genome-wide level in three species, showing relatively ancient establishment of the gene family prior

  19. Genome-Wide Profiling of PARP1 Reveals an Interplay with Gene Regulatory Regions and DNA Methylation

    PubMed Central

    Nalabothula, Narasimharao; Al-jumaily, Taha; Eteleeb, Abdallah M.; Flight, Robert M.; Xiaorong, Shao; Moseley, Hunter; Rouchka, Eric C.; Fondufe-Mittendorf, Yvonne N.

    2015-01-01

    Poly (ADP-ribose) polymerase-1 (PARP1) is a nuclear enzyme involved in DNA repair, chromatin remodeling and gene expression. PARP1 interactions with chromatin architectural multi-protein complexes (i.e. nucleosomes) alter chromatin structure resulting in changes in gene expression. Chromatin structure impacts gene regulatory processes including transcription, splicing, DNA repair, replication and recombination. It is important to delineate whether PARP1 randomly associates with nucleosomes or is present at specific nucleosome regions throughout the cell genome. We performed genome-wide association studies in breast cancer cell lines to address these questions. Our studies show that PARP1 associates with epigenetic regulatory elements genome-wide, such as active histone marks, CTCF and DNase hypersensitive sites. Additionally, the binding of PARP1 to chromatin genome-wide is mutually exclusive with DNA methylation pattern suggesting a functional interplay between PARP1 and DNA methylation. Indeed, inhibition of PARylation results in genome-wide changes in DNA methylation patterns. Our results suggest that PARP1 controls the fidelity of gene transcription and marks actively transcribed gene regions by selectively binding to transcriptionally active chromatin. These studies provide a platform for developing our understanding of PARP1’s role in gene regulation. PMID:26305327

  20. Genome-wide identification and expression analysis of the expansin gene family in tomato.

    PubMed

    Lu, Yongen; Liu, Lifeng; Wang, Xin; Han, Zhihui; Ouyang, Bo; Zhang, Junhong; Li, Hanxia

    2016-04-01

    Plant expansins are capable of inducing pH-dependent cell wall extension and stress relaxation. They may be useful as targets for crop improvement to enhance fruit development and stress resistance. Tomato is a major agricultural crop and a model plant for studying fruit development. Because only some tomato expansins have been studied, a genome-wide analysis of the tomato expansin family is necessary. In this study, we identified 25 SlEXPAs, eight SlEXPBs, one SlEXLA, four SlEXLBs, and five short homologs in the tomato genome. 25 of these genes were identified as being expressed. Bioinformatic analysis showed that although tomato expansins share similarities with those from other plants, they also exhibit specific features regarding genetic structure and amino acid sequences, which indicates a unique evolutionary process. Segmental and tandem duplication events have played important roles in expanding the tomato expansin family. Additionally, the 3-exon/2-intron structure may form the basic organization of expansin genes. We identified new expansin genes preferentially expressed in fruits (SlEXPA8, SlEXPB8, and SlEXLB1), roots (SlEXPA9, SlEXLB2, and SlEXLB4), and floral organs. Among the analyzed genes those that were inducible by hormone or stress treatments, including SlEXPA3, SlEXPA7, SlEXPB1-B2, SlEXPB8, SlEXLB1-LB2, and SlEXLB4. Our findings may further clarify the biological activities of tomato expansins, especially those related to fruit development and stress resistance, and contribute to the genetic modification of tomato plants to improve crop quality and yield.

  1. Sequencing mRNA from Cryo-Sliced Drosophila Embryos to Determine Genome-Wide Spatial Patterns of Gene Expression

    PubMed Central

    Combs, Peter A.; Eisen, Michael B.

    2013-01-01

    Complex spatial and temporal patterns of gene expression underlie embryo differentiation, yet methods do not yet exist for the efficient genome-wide determination of spatial expression patterns during development. In situ imaging of transcripts and proteins is the gold-standard, but it is difficult and time consuming to apply to an entire genome, even when highly automated. Sequencing, in contrast, is fast and genome-wide, but is generally applied to homogenized tissues, thereby discarding spatial information. To take advantage of the efficiency and comprehensiveness of sequencing while retaining spatial information, we cryosectioned individual blastoderm stage Drosophila melanogaster embryos along the anterior-posterior axis and developed methods to reliably sequence the mRNA isolated from each 25 µm slice. The spatial patterns of gene expression we infer closely match patterns previously determined by in situ hybridization and microscopy. We applied this method to generate a genome-wide timecourse of spatial gene expression from shortly after fertilization through gastrulation. We identified numerous genes with spatial patterns that have not yet been described in the several ongoing systematic in situ based projects. This simple experiment demonstrates the potential for combining careful anatomical dissection with high-throughput sequencing to obtain spatially resolved gene expression on a genome-wide scale. PMID:23951250

  2. Genome-wide gene expression analysis of mouse embryonic stem cells exposed to p-dichlorobenzene.

    PubMed

    Tani, Hidenori; Takeshita, Jun-Ichi; Aoki, Hiroshi; Abe, Ryosuke; Toyoda, Akinobu; Endo, Yasunori; Miyamoto, Sadaaki; Gamo, Masashi; Torimura, Masaki

    2016-09-01

    Because of the limitations of whole animal testing approaches for toxicological assessment, new cell-based assay systems have been widely studied. In this study, we focused on two biological products for toxicological assessment: mouse embryonic stem cells (mESCs) and long noncoding RNAs (lncRNAs). mESCs possess the abilities of self-renewal and differentiation into multiple cell types. LlncRNAs are an important class of pervasive non-protein-coding transcripts involved in the molecular mechanisms associated with responses to chemicals. We exposed mESCs to p-dichlorobenzene (p-DCB) for 1 or 28 days (daily dose), extracted total RNA, and performed deep sequencing analyses. The genome-wide gene expression analysis indicated that mechanisms modulating proteins occurred following acute and chronic exposures, and mechanisms modulating genomic DNA occurred following chronic exposure. Moreover, our results indicate that three novel lncRNAs (Snora41, Gm19947, and Scarna3a) in mESCs respond to p-DCB exposure. We propose that these lncRNAs have the potential to be surrogate indicators of p-DCB responses in mESCs. PMID:26975756

  3. Recent advances in globin research using genome-wide association studies and gene editing.

    PubMed

    Orkin, Stuart H

    2016-03-01

    A long-sought goal in the hemoglobin field has been an improved understanding of the mechanisms that regulate the switch from fetal (HbF) to adult (HbA) hemoglobin during development. With such knowledge, the hope is that strategies for directed reactivation of HbF in adults could be devised as an approach to therapy for the β-hemoglobinopathies thalassemia and sickle cell disease. Recent genome-wide association studies (GWAS) led to identification of three loci (BCL11A, HBS1L-MYB, and the β-globin cluster itself) in which natural genetic variation is correlated with different HbF levels in populations. Here, the central role of BCL11A in control of HbF is reviewed from the perspective of how findings may be translated to gene therapy in the not-too-distant future. This summary traces the evolution of recent studies from the initial recognition of BCL11A through GWAS to identification of critical sequences in an enhancer required for its erythroid-specific expression, thereby highlighting an Achilles heel for genome editing.

  4. Spotting and validation of a genome wide oligonucleotide chip with duplicate measurement of each gene

    SciTech Connect

    Thomassen, Mads . E-mail: mads.thomassen@ouh.fyns-amt.dk; Skov, Vibe; Eiriksdottir, Freyja; Tan, Qihua; Jochumsen, Kirsten; Fritzner, Niels; Brusgaard, Klaus; Dahlgaard, Jesper; Kruse, Torben A.

    2006-06-16

    The quality of DNA microarray based gene expression data relies on the reproducibility of several steps in a microarray experiment. We have developed a spotted genome wide microarray chip with oligonucleotides printed in duplicate in order to minimise undesirable biases, thereby optimising detection of true differential expression. The validation study design consisted of an assessment of the microarray chip performance using the MessageAmp and FairPlay labelling kits. Intraclass correlation coefficient (ICC) was used to demonstrate that MessageAmp was significantly more reproducible than FairPlay. Further examinations with MessageAmp revealed the applicability of the system. The linear range of the chips was three orders of magnitude, the precision was high, as 95% of measurements deviated less than 1.24-fold from the expected value, and the coefficient of variation for relative expression was 13.6%. Relative quantitation was more reproducible than absolute quantitation and substantial reduction of variance was attained with duplicate spotting. An analysis of variance (ANOVA) demonstrated no significant day-to-day variation.

  5. Identification of Genes Promoting Skin Youthfulness by Genome-Wide Association Study

    PubMed Central

    Chang, Anne L.S.; Atzmon, Gil; Bergman, Aviv; Brugmann, Samantha; Atwood, Scott X; Chang, Howard Y; Barzilai, Nir

    2014-01-01

    To identify genes that promote facial skin youthfulness (SY), a genome-wide association study on an Ashkenazi Jewish discovery group (n=428) was performed using Affymetrix 6.0 Single-Nucleotide Polymorphism (SNP) Array. After SNP quality controls, 901,470 SNPs remained for analysis. The eigenstrat method showed no stratification. Cases and controls were identified by global facial skin aging severity including intrinsic and extrinsic parameters. Linear regression adjusted for age and gender, with no significant differences in smoking history, body mass index, menopausal status, or personal or family history of centenarians. Six SNPs met the Bonferroni threshold with Pallele<10−8; two of these six had Pgenotype<10−8. Quantitative trait loci mapping confirmed linkage disequilibrium. The six SNPs were interrogated by MassARRAY in a replication group (n=436) with confirmation of rs6975107, an intronic region of KCND2 (potassium voltage-gated channel, Shal-related family member 2) (Pgenotype=0.023). A second replication group (n=371) confirmed rs318125, downstream of DIAPH2 (diaphanous homolog 2 (Drosophila)) (Pallele=0.010, Pgenotype=0.002) and rs7616661, downstream of EDEM1 (ER degradation enhancer, mannosidase α-like 1) (Pgenotype=0.042). DIAPH2 has been associated with premature ovarian insufficiency, an aging phenotype in humans. EDEM1 associates with lifespan in animal models, although not humans. KCND2 is expressed in human skin, but has not been associated with aging. These genes represent new candidate genes to study the molecular basis of healthy skin aging. PMID:24037343

  6. A Refined Study of FCRL Genes from a Genome-Wide Association Study for Graves’ Disease

    PubMed Central

    Song, Zhi-Yi; Yang, Shao-Ying; Xue, Li-Qiong; Pan, Chun-Ming; Gu, Zhao-Hui; Liu, Bing-Li; Wang, Hai-Ning; Liang, Liming; Liang, Jun; Zhang, Xiao-Mei; Yuan, Guo-Yue; Li, Chang-Gui; Chen, Ming-Dao; Chen, Jia-Lun; Gao, Guan-Qi; Song, Huai-Dong

    2013-01-01

    To pinpoint the exact location of the etiological variant/s present at 1q21.1 harboring FCRL1-5 and CD5L genes, we carried out a refined association study in the entire FCRL region in 1,536 patients with Graves’ disease (GD) and 1,516 sex-matched controls by imputation analysis, logistic regression, and cis-eQTL analysis. Among 516 SNPs with P<0.05 in the initial GWAS scan, the strongest signals associated with GD and correlated to FCRL3 expression were located at a cluster of SNPs including rs7528684 and rs3761959. And the allele-specific effects for rs3761959 and rs7528684 on FCRL3 expression level revealed that the risk alleles A of rs3761959 and C of rs7528684 were correlated with the elevated expression level of FCRL3 whether in PBMCs or its subsets, especially in CD19+ B cells and CD8+ T subsets. Next, the combined analysis with 5,300 GD cases and 4,916 control individuals confirmed FCRL3 was a susceptibility gene of GD in Chinese Han populations, and rs3761959 and rs7528684 met the genome-wide association significance level (Pcombined = 2.27×10−12 and 7.11×10−13, respectively). Moreover, the haplotypes with the risk allele A of rs3761959 and risk allele C of rs7528684 were associated with GD risk. Finally, our epigenetic analysis suggested the disease-associated C allele of rs7528684 increased affinity for NF-KB transcription factor. Above data indicated that FCRL3 gene and its proxy SNP rs7528684 may be involved in the pathogenesis of GD by excessive inhibiting B cell receptor signaling and the impairment of suppressing function of Tregs. PMID:23505439

  7. Identification of aluminium transport-related genes via genome-wide phenotypic screening of Saccharomyces cerevisiae.

    PubMed

    Tun, Nay M; O'Doherty, Patrick J; Chen, Zhong-Hua; Wu, Xi-Yang; Bailey, Trevor D; Kersaitis, Cindy; Wu, Ming J

    2014-08-01

    Genome-wide screening using gene deletion mutants has been widely carried out with numerous toxicants including oxidants and metal ions. The focus of such studies usually centres on identifying sensitive phenotypes against a given toxicant. Here, we screened the complete collection of yeast gene deletion mutants (5047) with increasing concentrations of aluminium sulphate (0.4, 0.8, 1.6 and 3.2 mM) in order to discover aluminium (Al(3+)) tolerant phenotypes. Fifteen genes were found to be associated with Al(3+) transport because their deletion mutants exhibited Al(3+) tolerance, including lem3Δ, hal5Δ and cka2Δ. Deletion of CKA2, a catalytic subunit of tetrameric protein kinase CK2, gives rise to the most pronounced resistance to Al(3+) by showing significantly higher growth compared to the wild type. Functional analysis revealed that both molecular regulation and endocytosis are involved in Al(3+) transport for yeast. Further investigations were extended to all the four subunits of CK2 (CKA1, CKA2, CKB1 and CKB2) and the other 14 identified mutants under a spectrum of metal ions, including Al(3+), Zn(2+), Mn(2+), Fe(2+), Fe(3+), Co(3+), Ga(3+), Cd(2+), In(3+), Ni(2+) and Cu(2+), as well as hydrogen peroxide and diamide, in order to unravel cross-tolerance amongst metal ions and the effect of the oxidants. Finally, the implication of the findings in Al(3+) transport for the other species like plants and humans is discussed.

  8. Identification of genes promoting skin youthfulness by genome-wide association study.

    PubMed

    Chang, Anne L S; Atzmon, Gil; Bergman, Aviv; Brugmann, Samantha; Atwood, Scott X; Chang, Howard Y; Barzilai, Nir

    2014-03-01

    To identify genes that promote facial skin youthfulness (SY), a genome-wide association study on an Ashkenazi Jewish discovery group (n=428) was performed using Affymetrix 6.0 Single-Nucleotide Polymorphism (SNP) Array. After SNP quality controls, 901,470 SNPs remained for analysis. The eigenstrat method showed no stratification. Cases and controls were identified by global facial skin aging severity including intrinsic and extrinsic parameters. Linear regression adjusted for age and gender, with no significant differences in smoking history, body mass index, menopausal status, or personal or family history of centenarians. Six SNPs met the Bonferroni threshold with Pallele<10(-8); two of these six had Pgenotype<10(-8). Quantitative trait loci mapping confirmed linkage disequilibrium. The six SNPs were interrogated by MassARRAY in a replication group (n=436) with confirmation of rs6975107, an intronic region of KCND2 (potassium voltage-gated channel, Shal-related family member 2) (Pgenotype=0.023). A second replication group (n=371) confirmed rs318125, downstream of DIAPH2 (diaphanous homolog 2 (Drosophila)) (Pallele=0.010, Pgenotype=0.002) and rs7616661, downstream of EDEM1 (ER degradation enhancer, mannosidase α-like 1) (Pgenotype=0.042). DIAPH2 has been associated with premature ovarian insufficiency, an aging phenotype in humans. EDEM1 associates with lifespan in animal models, although not humans. KCND2 is expressed in human skin, but has not been associated with aging. These genes represent new candidate genes to study the molecular basis of healthy skin aging.

  9. PICARA, an analytical pipeline providing probabilistic inference about a priori candidates genes underlying genome-wide association QTL in plants

    Technology Transfer Automated Retrieval System (TEKTRAN)

    PICARA is an analytical pipeline designed to systematically summarize observed SNP/trait associations identified by genome wide association studies (GWAS) and to identify candidate genes involved in the regulation of complex trait variation. The pipeline provides probabilistic inference about a prio...

  10. Alterations in hepatic gene expression and genome-wide DNA methylation in rat offspring exposed to maternal obesity in utero

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Adult offspring from obese (OB) rat dams gain greater body weight and fat mass than controls when fed HFD. At PND21, we examined energy expenditure (EE) (indirect calorimetry), hepatic gene expression (microarrays), and changes in genome-wide and global DNA methylation (enrichment-coupled DNA seque...

  11. Genome-wide linkage analysis and physical mapping of the rippling muscle disease gene

    SciTech Connect

    Stephan, D.A.; Buist, N.R.M.; Bhaskar, A.C.

    1994-09-01

    Rippling muscle disease (RMD) is an inherited disorder of skeletal muscle in which mechanical stimuli provoke electrically silent contractions. The patient`s symptoms are muscle cramps, pain, and stiffness, particularly during or following exercise. Clinical signs are balling of muscle following percussion, and a characteristic lateral rolling movement of muscle occurring after contraction followed by stretching. We report a new 44-member pedigree segregating RMD as an autosomal dominant trait. A genome-wide genetic linkage study in this family, using a novel approach of testing closely spaced highly polymorphic markers in affected individuals, localized the responsible gene to the distal end of the long arm of chromosome 1 with a maximum multi-point lod score of 3.56 ({theta}=0). In this family, RMD is localized to a 6 cM region near D1S235. Physical mapping of the linked region yielded several positive YAC clones, one of which spans the entire 6 cM distance. Several candidate genes not present in the YAC contig, but in the region of 1q4, have been excluded as causative by either linkage analysis of intragenic microsatellite repeats (alpha-actinin, angiotensinogen) or by SSCP of exons (skeletal muscle alpha-actinin). We studied two previously reported German families for linkage to the same locus and this same area did not co-segregate with the disease, a finding that shows that different genetic defects can cause a similar clinical phenotype (genetic heterogeneity). An understanding of the defect in contraction control within the muscle fibers in this disease may lead to a better understanding of muscle force transduction, intracellular calcium homeostasis, or both.

  12. Genome-wide analyses of Shavenbaby target genes reveals distinct features of enhancer organization

    PubMed Central

    2013-01-01

    Background Developmental programs are implemented by regulatory interactions between Transcription Factors (TFs) and their target genes, which remain poorly understood. While recent studies have focused on regulatory cascades of TFs that govern early development, little is known about how the ultimate effectors of cell differentiation are selected and controlled. We addressed this question during late Drosophila embryogenesis, when the finely tuned expression of the TF Ovo/Shavenbaby (Svb) triggers the morphological differentiation of epidermal trichomes. Results We defined a sizeable set of genes downstream of Svb and used in vivo assays to delineate 14 enhancers driving their specific expression in trichome cells. Coupling computational modeling to functional dissection, we investigated the regulatory logic of these enhancers. Extending the repertoire of epidermal effectors using genome-wide approaches showed that the regulatory models learned from this first sample are representative of the whole set of trichome enhancers. These enhancers harbor remarkable features with respect to their functional architectures, including a weak or non-existent clustering of Svb binding sites. The in vivo function of each site relies on its intimate context, notably the flanking nucleotides. Two additional cis-regulatory motifs, present in a broad diversity of composition and positioning among trichome enhancers, critically contribute to enhancer activity. Conclusions Our results show that Svb directly regulates a large set of terminal effectors of the remodeling of epidermal cells. Further, these data reveal that trichome formation is underpinned by unexpectedly diverse modes of regulation, providing fresh insights into the functional architecture of enhancers governing a terminal differentiation program. PMID:23972280

  13. Genome-wide identification of Saccharomyces cerevisiae genes required for tolerance to acetic acid

    PubMed Central

    2010-01-01

    Background Acetic acid is a byproduct of Saccharomyces cerevisiae alcoholic fermentation. Together with high concentrations of ethanol and other toxic metabolites, acetic acid may contribute to fermentation arrest and reduced ethanol productivity. This weak acid is also a present in lignocellulosic hydrolysates, a highly interesting non-feedstock substrate in industrial biotechnology. Therefore, the better understanding of the molecular mechanisms underlying S. cerevisiae tolerance to acetic acid is essential for the rational selection of optimal fermentation conditions and the engineering of more robust industrial strains to be used in processes in which yeast is explored as cell factory. Results The yeast genes conferring protection against acetic acid were identified in this study at a genome-wide scale, based on the screening of the EUROSCARF haploid mutant collection for susceptibility phenotypes to this weak acid (concentrations in the range 70-110 mM, at pH 4.5). Approximately 650 determinants of tolerance to acetic acid were identified. Clustering of these acetic acid-resistance genes based on their biological function indicated an enrichment of genes involved in transcription, internal pH homeostasis, carbohydrate metabolism, cell wall assembly, biogenesis of mitochondria, ribosome and vacuole, and in the sensing, signalling and uptake of various nutrients in particular iron, potassium, glucose and amino acids. A correlation between increased resistance to acetic acid and the level of potassium in the growth medium was found. The activation of the Snf1p signalling pathway, involved in yeast response to glucose starvation, is demonstrated to occur in response to acetic acid stress but no evidence was obtained supporting the acetic acid-induced inhibition of glucose uptake. Conclusions Approximately 490 of the 650 determinants of tolerance to acetic acid identified in this work are implicated, for the first time, in tolerance to this weak acid. These are

  14. Genome-wide profiling to analyze the effects of FXR activation on mouse renal proximal tubular cells

    PubMed Central

    Gui, Ting; Gai, Zhibo

    2015-01-01

    To assess the effect of farnesoid X receptor (FXR), a bile acid nuclear receptor, on renal proximal tubular cells, primary cultured mouse kidney proximal tubular cells were treated with GW4064 (a FXR agonist) or DMSO (as controls) overnight. Analysis of gene expression in the proximal tubular cells by whole genome microarrays indicated that FXR activation induced genes involved in fatty acid degradation and oxidation reduction. Among them, genes involved in glutathione metabolism were mostly induced. Here we describe in details the contents and quality controls for the gene expression and related results associated with the data uploaded to Gene Expression Omnibus (accession number GSE70296). PMID:26697325

  15. Genome-wide analysis and molecular dissection of the SPL gene family in Salvia miltiorrhiza.

    PubMed

    Zhang, Linsu; Wu, Bin; Zhao, Degang; Li, Caili; Shao, Fenjuan; Lu, Shanfa

    2014-01-01

    SQUAMOSA promoter binding protein-likes (SPLs) are plant-specific transcription factors playing vital regulatory roles in plant growth and development. There is no information about SPLs in Salvia miltiorrhiza (Danshen), a significant medicinal plant widely used in Traditional Chinese medicine (TCM) for >1,700 years and an emerging model plant for TCM studies. Through genome-wide identification and subsequent molecular cloning, we identified a total 15 SmSPLs with divergent sequence features, gene structures, and motifs. Comparative analysis showed sequence conservation between SmSPLs and their Arabidopsis counterparts. A phylogenetic tree clusters SmSPLs into six groups. Many of the motifs identified commonly exist in a group/subgroup, implying their functional redundancy. Eight SmSPLs were predicted and experimentally validated to be targets of miR156/157. SmSPLs were differentially expressed in various tissues of S. milltiorrhiza. The expression of miR156/157-targeted SmSPLs was increased with the maturation of S. miltiorrhiza, whereas the expression of miR156/157 was decreased, confirming the regulatory roles of miR156/157 in SmSPLs and suggesting the functions of SmSPLs in S. miltiorrhiza development. The expression of miR156/157 was negatively correlated with miR172 during the maturation of S. miltiorrhiza. The results indicate the significance and complexity of SmSPL-, miR156-, and miR172-mediated regulation of developmental timing in S. miltiorrhiza. PMID:24112769

  16. Genome-Wide Association Study of Intelligence: Additive Effects of Novel Brain Expressed Genes

    ERIC Educational Resources Information Center

    Loo, Sandra K.; Shtir, Corina; Doyle, Alysa E.; Mick, Eric; McGough, James J.; McCracken, James; Biederman, Joseph; Smalley, Susan L.; Cantor, Rita M.; Faraone, Stephen V.; Nelson, Stanley F.

    2012-01-01

    Objective: The purpose of the present study was to identify common genetic variants that are associated with human intelligence or general cognitive ability. Method: We performed a genome-wide association analysis with a dense set of 1 million single-nucleotide polymorphisms (SNPs) and quantitative intelligence scores within an ancestrally…

  17. Genome-wide SNP discovery and linkage analysis in barley based on genes responsive to abiotic stress.

    PubMed

    Rostoks, Nils; Mudie, Sharon; Cardle, Linda; Russell, Joanne; Ramsay, Luke; Booth, Allan; Svensson, Jan T; Wanamaker, Steve I; Walia, Harkamal; Rodriguez, Edmundo M; Hedley, Peter E; Liu, Hui; Morris, Jenny; Close, Timothy J; Marshall, David F; Waugh, Robbie

    2005-12-01

    More than 2,000 genome-wide barley single nucleotide polymorphisms (SNPs) were developed by resequencing unigene fragments from eight diverse accessions. The average genome-wide SNP frequency observed in 877 unigenes was 1 SNP per 200 bp. However, SNP frequency was highly variable with the least number of SNP and SNP haplotypes observed within European cultivated germplasm reflecting effects of breeding history on genetic diversity. More than 300 SNP loci were mapped genetically in three experimental mapping populations which allowed the construction of an integrated SNP map incorporating a large number of RFLP, AFLP and SSR markers (1,237 loci in total). The genes used for SNP discovery were selected based on their transcriptional response to a variety of abiotic stresses. A set of known barley abiotic stress QTL was positioned on the linkage map, while the available sequence and gene expression information facilitated the identification of genes potentially associated with these traits. Comparison of the sequenced SNP loci to the rice genome sequence identified several regions of highly conserved gene order providing a framework for marker saturation in barley genomic regions of interest. The integration of genome-wide SNP and expression data with available genetic and phenotypic information will facilitate the identification of gene function in barley and other non-model organisms. PMID:16244872

  18. A genome-wide function of THSC/TREX-2 at active genes prevents transcription–replication collisions

    PubMed Central

    Santos-Pereira, José M.; García-Rubio, María L.; González-Aguilera, Cristina; Luna, Rosa; Aguilera, Andrés

    2014-01-01

    The THSC/TREX-2 complex of Saccharomyces cerevisiae mediates the anchoring of transcribed genes to the nuclear pore, linking transcription elongation with mRNA export and genome stability, as shown for specific reporters. However, it is still unknown whether the function of TREX-2 is global and the reason for its relevant role in genome integrity. Here, by studying two TREX-2 representative subunits, Thp1 and Sac3, we show that TREX-2 has a genome-wide role in gene expression. Both proteins show similar distributions along the genome, with a gradient disposition at active genes that increases towards the 3′ end. Thp1 and Sac3 have a relevant impact on the expression of long, G+C-rich and highly transcribed genes. Interestingly, replication impairment detected by the genome-wide accumulation of the replicative Rrm3 helicase is increased preferentially at highly expressed genes in the thp1Δ and sac3Δ mutants analyzed. Therefore, our work provides evidence of a function of TREX-2 at the genome-wide level and suggests a role for TREX-2 in preventing transcription–replication conflicts, as a source of genome instability derived from a defective messenger ribonucleoprotein particle (mRNP) biogenesis. PMID:25294824

  19. Genome-Wide Association Identifies SLC2A9 and NLN Gene Regions as Associated with Entropion in Domestic Sheep

    PubMed Central

    Mousel, Michelle R.; Reynolds, James O.; White, Stephen N.

    2015-01-01

    Entropion is an inward rolling of the eyelid allowing contact between the eyelashes and cornea that may lead to blindness if not corrected. Although many mammalian species, including humans and dogs, are afflicted by congenital entropion, no specific genes or gene regions related to development of entropion have been reported in any mammalian species to date. Entropion in domestic sheep is known to have a genetic component therefore, we used domestic sheep as a model system to identify genomic regions containing genes associated with entropion. A genome-wide association was conducted with congenital entropion in 998 Columbia, Polypay, and Rambouillet sheep genotyped with 50,000 SNP markers. Prevalence of entropion was 6.01%, with all breeds represented. Logistic regression was performed in PLINK with additive allelic, recessive, dominant, and genotypic inheritance models. Two genome-wide significant (empirical P<0.05) SNP were identified, specifically markers in SLC2A9 (empirical P = 0.007; genotypic model) and near NLN (empirical P = 0.026; dominance model). Six additional genome-wide suggestive SNP (nominal P<1x10-5) were identified including markers in or near PIK3CB (P = 2.22x10-6; additive model), KCNB1 (P = 2.93x10-6; dominance model), ZC3H12C (P = 3.25x10-6; genotypic model), JPH1 (P = 4.68x20-6; genotypic model), and MYO3B (P = 5.74x10-6; recessive model). This is the first report of specific gene regions associated with congenital entropion in any mammalian species, to our knowledge. Further, none of these genes have previously been associated with any eyelid traits. These results represent the first genome-wide analysis of gene regions associated with entropion and provide target regions for the development of sheep genetic markers for marker-assisted selection. PMID:26098909

  20. Genome-Wide Association Identifies SLC2A9 and NLN Gene Regions as Associated with Entropion in Domestic Sheep.

    PubMed

    Mousel, Michelle R; Reynolds, James O; White, Stephen N

    2015-01-01

    Entropion is an inward rolling of the eyelid allowing contact between the eyelashes and cornea that may lead to blindness if not corrected. Although many mammalian species, including humans and dogs, are afflicted by congenital entropion, no specific genes or gene regions related to development of entropion have been reported in any mammalian species to date. Entropion in domestic sheep is known to have a genetic component therefore, we used domestic sheep as a model system to identify genomic regions containing genes associated with entropion. A genome-wide association was conducted with congenital entropion in 998 Columbia, Polypay, and Rambouillet sheep genotyped with 50,000 SNP markers. Prevalence of entropion was 6.01%, with all breeds represented. Logistic regression was performed in PLINK with additive allelic, recessive, dominant, and genotypic inheritance models. Two genome-wide significant (empirical P<0.05) SNP were identified, specifically markers in SLC2A9 (empirical P = 0.007; genotypic model) and near NLN (empirical P = 0.026; dominance model). Six additional genome-wide suggestive SNP (nominal P<1x10(-5)) were identified including markers in or near PIK3CB (P = 2.22x10(-6); additive model), KCNB1 (P = 2.93x10(-6); dominance model), ZC3H12C (P = 3.25x10(-6); genotypic model), JPH1 (P = 4.68x20(-6); genotypic model), and MYO3B (P = 5.74x10(-6); recessive model). This is the first report of specific gene regions associated with congenital entropion in any mammalian species, to our knowledge. Further, none of these genes have previously been associated with any eyelid traits. These results represent the first genome-wide analysis of gene regions associated with entropion and provide target regions for the development of sheep genetic markers for marker-assisted selection.

  1. A genome-wide admixture scan identifies MYH9 as a candidate locus associated with non-diabetic end stage renal disease in African Americans

    PubMed Central

    Linda Kao, WH; Klag, Michael J; Meoni, Lucy A; Reich, David; Berthier-Schaad, Yvette; Li, Man; Coresh, Josef; Patterson, Nick; Tandon, Arti; Powe, Neil R; Fink, Nancy E; Sadler, John H; Weir, Matthew R; Abboud, Hanna E; Adler, Sharon; Divers, Jasmin; Iyengar, Sudha K; Freedman, Barry I; Kimmel, Paul L; Knowler, William C; Kohn, Orly F; Kramp, Kristopher; Leehey, David J; Nicholas, Susanne; Pahl, Madeleine; Schelling, Jeffrey R; Sedor, John R; Thornley-Brown, Denyse; Winkler, Cheryl A; Smith, Michael W.; Parekh, Rulan S.

    2008-01-01

    End stage renal disease (ESRD) has a four times higher incidence in African Americans compared to European Americans. This led to the hypothesis that susceptibility alleles for ESRD have a higher frequency in West African than European gene pool. We performed a genome-wide admixture scan in 1,372 ESRD cases and 806 controls and demonstrated a highly significant association between excess African ancestry and non-diabetic ESRD (LOD 5.70) but not diabetic ESRD (LOD 0.47) on chromosome 22q12. Each copy of the European ancestral allele conferred a relative risk of 0.50 (95% credible interval 0.39 – 0.63) compared to African ancestry. Multiple common SNPs (allele frequency ranging from 0.2 to 0.6) in the gene that encodes non-muscle myosin heavy chain type II isoform A (MYH9) were associated with 2-4 times greater risk of non-diabetic ESRD and accounted for a large proportion of the excess risk of ESRD observed in African compared to European Americans. PMID:18794854

  2. Genome-Wide Meta-Analyses of Plasma Renin Activity and Concentration Reveal Association with the Kininogen 1 and Prekallikrein Genes

    PubMed Central

    Lieb, Wolfgang; Chen, Ming-Huei; Teumer, Alexander; de Boer, Rudolf A.; Lin, Honghuang; Fox, Ervin R.; Musani, Solomon K.; Wilson, James G.; Wang, Thomas J.; Völzke, Henry; Petersen, Ann-Kristin; Meisinger, Christine; Nauck, Matthias; Schlesinger, Sabrina; Li, Yong; Menard, Jöel; Hercberg, Serge; Wichmann, H.-Erich; Völker, Uwe; Rawal, Rajesh; Bidlingmaier, Martin; Hannemann, Anke; Dörr, Marcus; Rettig, Rainer; van Gilst, Wiek H.; van Veldhuisen, Dirk J.; Bakker, Stephan J.L.; Navis, Gerjan; Wallaschofski, Henri; Meneton, Pierre; van der Harst, Pim; Reincke, Martin; Vasan, Ramachandran S.; Consortium, CKDGen

    2015-01-01

    Background The renin-angiotensin-aldosterone-system (RAAS) is critical for regulation of blood pressure and fluid balance and influences cardiovascular remodeling. Dysregulation of the RAAS contributes to cardiovascular and renal morbidity. The genetic architecture of circulating RAAS components is incompletely understood. Methods and Results We meta-analyzed genome-wide association data for plasma renin activity (n=5,275), plasma renin concentrations (n=8,014) and circulating aldosterone (n=13,289) from up to four population-based cohorts of European and European-American ancestry, and assessed replication of the top results in an independent sample (n=6,487). Single nucleotide polymorphisms (SNPs) in two independent loci displayed associations with plasma renin activity atgenome-wide significance (p<5×10-8). A third locus was close to this threshold (rs4253311 in kallikrein B [KLKB1], p=5.5×10-8). Two of these loci replicated in an independent sample for both plasma renin and aldosterone concentrations (SNP rs5030062 in kininogen 1 [KNG1]: p=0.001 for plasma renin, p=0.024 for plasma aldosterone concentration; rs4253311 with p<0.001 for both plasma renin and aldosterone concentration). SNPs in the NEBL gene reached genome-wide significance for plasma renin concentration in the discovery sample (top SNP rs3915911, p= 8.81×10-9), but did not replicate (p=0.81). No locus reached genome-wide significance for aldosterone. SNPs rs5030062 and rs4253311 were not related to blood pressure or renal traits; in a companion study, variants in the kallikrein B locus were associated with B-type natriuretic peptide concentrations in African-Americans. Conclusions We identified two genetic loci (kininogen 1 and kallikrein B) influencing key components of the RAAS, consistent with the close interrelation between the kallikrein-kinin system and the RAAS. PMID:25477429

  3. High-resolution genome-wide scan of genes, gene-networks and cellular systems impacting the yeast ionome

    PubMed Central

    2012-01-01

    Background To balance the demand for uptake of essential elements with their potential toxicity living cells have complex regulatory mechanisms. Here, we describe a genome-wide screen to identify genes that impact the elemental composition (‘ionome’) of yeast Saccharomyces cerevisiae. Using inductively coupled plasma – mass spectrometry (ICP-MS) we quantify Ca, Cd, Co, Cu, Fe, K, Mg, Mn, Mo, Na, Ni, P, S and Zn in 11890 mutant strains, including 4940 haploid and 1127 diploid deletion strains, and 5798 over expression strains. Results We identified 1065 strains with an altered ionome, including 584 haploid and 35 diploid deletion strains, and 446 over expression strains. Disruption of protein metabolism or trafficking has the highest likelihood of causing large ionomic changes, with gene dosage also being important. Gene over expression produced more extreme ionomic changes, but over expression and loss of function phenotypes are generally not related. Ionomic clustering revealed the existence of only a small number of possible ionomic profiles suggesting fitness tradeoffs that constrain the ionome. Clustering also identified important roles for the mitochondria, vacuole and ESCRT pathway in regulation of the ionome. Network analysis identified hub genes such as PMR1 in Mn homeostasis, novel members of ionomic networks such as SMF3 in vacuolar retrieval of Mn, and cross-talk between the mitochondria and the vacuole. All yeast ionomic data can be searched and downloaded at http://www.ionomicshub.org. Conclusions Here, we demonstrate the power of high-throughput ICP-MS analysis to functionally dissect the ionome on a genome-wide scale. The information this reveals has the potential to benefit both human health and agriculture. PMID:23151179

  4. Genome-wide karyomapping accurately identifies the inheritance of single-gene defects in human preimplantation embryos in vitro

    PubMed Central

    Natesan, Senthilkumar A.; Bladon, Alex J.; Coskun, Serdar; Qubbaj, Wafa; Prates, Renata; Munne, Santiago; Coonen, Edith; Dreesen, Joseph C.F.M.; Stevens, Servi J.C.; Paulussen, Aimee D.C.; Stock-Myer, Sharyn E.; Wilton, Leeanda J.; Jaroudi, Souraya; Wells, Dagan; Brown, Anthony P.C.; Handyside, Alan H.

    2014-01-01

    Purpose: Our aim was to compare the accuracy of family- or disease-specific targeted haplotyping and direct mutation-detection strategies with the accuracy of genome-wide mapping of the parental origin of each chromosome, or karyomapping, by single-nucleotide polymorphism genotyping of the parents, a close relative of known disease status, and the embryo cell(s) used for preimplantation genetic diagnosis of single-gene defects in a single cell or small numbers of cells biopsied from human embryos following in vitro fertilization. Methods: Genomic DNA and whole-genome amplification products from embryo samples, which were previously diagnosed by targeted haplotyping, were genotyped for single-nucleotide polymorphisms genome-wide detection and retrospectively analyzed blind by karyomapping. Results: Single-nucleotide polymorphism genotyping and karyomapping were successful in 213/218 (97.7%) samples from 44 preimplantation genetic diagnosis cycles for 25 single-gene defects with various modes of inheritance distributed widely across the genome. Karyomapping was concordant with targeted haplotyping in 208 (97.7%) samples, and the five nonconcordant samples were all in consanguineous regions with limited or inconsistent haplotyping results. Conclusion: Genome-wide karyomapping is highly accurate and facilitates analysis of the inheritance of almost any single-gene defect, or any combination of loci, at the single-cell level, greatly expanding the range of conditions for which preimplantation genetic diagnosis can be offered clinically without the need for customized test development. PMID:24810687

  5. Genome-wide association study for acute otitis media in children identifies FNDC1 as disease contributing gene

    PubMed Central

    van Ingen, Gijs; Li, Jin; Goedegebure, André; Pandey, Rahul; Li, Yun Rose; March, Michael E.; Jaddoe, Vincent W. V.; Bakay, Marina; Mentch, Frank D.; Thomas, Kelly; Wei, Zhi; Chang, Xiao; Hain, Heather S.; Uitterlinden, André G.; Moll, Henriette A.; van Duijn, Cornelia M.; Rivadeneira, Fernando; Raat, Hein; Baatenburg de Jong, Robert J.; Sleiman, Patrick M.; van der Schroeff, Marc P.; Hakonarson, Hakon

    2016-01-01

    Acute otitis media (AOM) is among the most common pediatric diseases, and the most frequent reason for antibiotic treatment in children. Risk of AOM is dependent on environmental and host factors, as well as a significant genetic component. We identify genome-wide significance at a locus on 6q25.3 (rs2932989, Pmeta=2.15 × 10−09), and show that the associated variants are correlated with the methylation status of the FNDC1 gene (cg05678571, P=1.43 × 10−06), and further show it is an eQTL for FNDC1 (P=9.3 × 10−05). The mouse homologue, Fndc1, is expressed in middle ear tissue and its expression is upregulated upon lipopolysaccharide treatment. In this first GWAS of AOM and the largest OM genetic study to date, we identify the first genome-wide significant locus associated with AOM. PMID:27677580

  6. Genome-wide gene-gene interaction analysis for next-generation sequencing.

    PubMed

    Zhao, Jinying; Zhu, Yun; Xiong, Momiao

    2016-03-01

    The critical barrier in interaction analysis for next-generation sequencing (NGS) data is that the traditional pairwise interaction analysis that is suitable for common variants is difficult to apply to rare variants because of their prohibitive computational time, large number of tests and low power. The great challenges for successful detection of interactions with NGS data are (1) the demands in the paradigm of changes in interaction analysis; (2) severe multiple testing; and (3) heavy computations. To meet these challenges, we shift the paradigm of interaction analysis between two SNPs to interaction analysis between two genomic regions. In other words, we take a gene as a unit of analysis and use functional data analysis techniques as dimensional reduction tools to develop a novel statistic to collectively test interaction between all possible pairs of SNPs within two genome regions. By intensive simulations, we demonstrate that the functional logistic regression for interaction analysis has the correct type 1 error rates and higher power to detect interaction than the currently used methods. The proposed method was applied to a coronary artery disease dataset from the Wellcome Trust Case Control Consortium (WTCCC) study and the Framingham Heart Study (FHS) dataset, and the early-onset myocardial infarction (EOMI) exome sequence datasets with European origin from the NHLBI's Exome Sequencing Project. We discovered that 6 of 27 pairs of significantly interacted genes in the FHS were replicated in the independent WTCCC study and 24 pairs of significantly interacted genes after applying Bonferroni correction in the EOMI study.

  7. Genome-wide identification, isolation and expression analysis of auxin response factor (ARF) gene family in sweet orange (Citrus sinensis)

    PubMed Central

    Li, Si-Bei; OuYang, Wei-Zhi; Hou, Xiao-Jin; Xie, Liang-Liang; Hu, Chun-Gen; Zhang, Jin-Zhi

    2015-01-01

    Auxin response factors (ARFs) are an important family of proteins in auxin-mediated response, with key roles in various physiological and biochemical processes. To date, a genome-wide overview of the ARF gene family in citrus was not available. A systematic analysis of this gene family in citrus was begun by carrying out a genome-wide search for the homologs of ARFs. A total of 19 nonredundant ARF genes (CiARF) were found and validated from the sweet orange. A comprehensive overview of the CiARFs was undertaken, including the gene structures, phylogenetic analysis, chromosome locations, conserved motifs of proteins, and cis-elements in promoters of CiARF. Furthermore, expression profiling using real-time PCR revealed many CiARF genes, albeit with different patterns depending on types of tissues and/or developmental stages. Comprehensive expression analysis of these genes was also performed under two hormone treatments using real-time PCR. Indole-3-acetic acid (IAA) and N-1-napthylphthalamic acid (NPA) treatment experiments revealed differential up-regulation and down-regulation, respectively, of the 19 citrus ARF genes in the callus of sweet orange. Our comprehensive analysis of ARF genes further elucidates the roles of CiARF family members during citrus growth and development process. PMID:25870601

  8. Genome-wide identification, isolation and expression analysis of auxin response factor (ARF) gene family in sweet orange (Citrus sinensis).

    PubMed

    Li, Si-Bei; OuYang, Wei-Zhi; Hou, Xiao-Jin; Xie, Liang-Liang; Hu, Chun-Gen; Zhang, Jin-Zhi

    2015-01-01

    Auxin response factors (ARFs) are an important family of proteins in auxin-mediated response, with key roles in various physiological and biochemical processes. To date, a genome-wide overview of the ARF gene family in citrus was not available. A systematic analysis of this gene family in citrus was begun by carrying out a genome-wide search for the homologs of ARFs. A total of 19 nonredundant ARF genes (CiARF) were found and validated from the sweet orange. A comprehensive overview of the CiARFs was undertaken, including the gene structures, phylogenetic analysis, chromosome locations, conserved motifs of proteins, and cis-elements in promoters of CiARF. Furthermore, expression profiling using real-time PCR revealed many CiARF genes, albeit with different patterns depending on types of tissues and/or developmental stages. Comprehensive expression analysis of these genes was also performed under two hormone treatments using real-time PCR. Indole-3-acetic acid (IAA) and N-1-napthylphthalamic acid (NPA) treatment experiments revealed differential up-regulation and down-regulation, respectively, of the 19 citrus ARF genes in the callus of sweet orange. Our comprehensive analysis of ARF genes further elucidates the roles of CiARF family members during citrus growth and development process.

  9. Integration of genome-wide association studies with biological knowledge identifies six novel genes related to kidney function.

    PubMed

    Chasman, Daniel I; Fuchsberger, Christian; Pattaro, Cristian; Teumer, Alexander; Böger, Carsten A; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Taliun, Daniel; Li, Man; Gao, Xiaoyi; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C; O'Seaghdha, Conall M; Glazer, Nicole; Isaacs, Aaron; Liu, Ching-Ti; Smith, Albert V; O'Connell, Jeffrey R; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Johnson, Andrew D; Gierman, Hinco J; Feitosa, Mary F; Hwang, Shih-Jen; Atkinson, Elizabeth J; Lohman, Kurt; Cornelis, Marilyn C; Johansson, Asa; Tönjes, Anke; Dehghan, Abbas; Lambert, Jean-Charles; Holliday, Elizabeth G; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y; Murgia, Federico; Trompet, Stella; Imboden, Medea; Coassin, Stefan; Pistis, Giorgio; Harris, Tamara B; Launer, Lenore J; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D; Boerwinkle, Eric; Schmidt, Helena; Cavalieri, Margherita; Rao, Madhumathi; Hu, Frank; Demirkan, Ayse; Oostra, Ben A; de Andrade, Mariza; Turner, Stephen T; Ding, Jingzhong; Andrews, Jeanette S; Freedman, Barry I; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Meisinger, Christa; Gieger, Christian; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H; Wright, Alan F; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G; Rivadeneira, Fernando; Aulchenko, Yurii S; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Ketkar, Shamika; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K; Portas, Laura; Ford, Ian; Buckley, Brendan M; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Kim, Stuart K; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J Wouter; Probst-Hensch, Nicole M; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; Siscovick, David S; van Duijn, Cornelia M; Borecki, Ingrid B; Kardia, Sharon L R; Liu, Yongmei; Curhan, Gary C; Rudan, Igor; Gyllensten, Ulf; Wilson, James F; Franke, Andre; Pramstaller, Peter P; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M; Parsa, Afshin; Bochud, Murielle; Heid, Iris M; Kao, W H Linda; Fox, Caroline S; Köttgen, Anna

    2012-12-15

    In conducting genome-wide association studies (GWAS), analytical approaches leveraging biological information may further understanding of the pathophysiology of clinical traits. To discover novel associations with estimated glomerular filtration rate (eGFR), a measure of kidney function, we developed a strategy for integrating prior biological knowledge into the existing GWAS data for eGFR from the CKDGen Consortium. Our strategy focuses on single nucleotide polymorphism (SNPs) in genes that are connected by functional evidence, determined by literature mining and gene ontology (GO) hierarchies, to genes near previously validated eGFR associations. It then requires association thresholds consistent with multiple testing, and finally evaluates novel candidates by independent replication. Among the samples of European ancestry, we identified a genome-wide significant SNP in FBXL20 (P = 5.6 × 10(-9)) in meta-analysis of all available data, and additional SNPs at the INHBC, LRP2, PLEKHA1, SLC3A2 and SLC7A6 genes meeting multiple-testing corrected significance for replication and overall P-values of 4.5 × 10(-4)-2.2 × 10(-7). Neither the novel PLEKHA1 nor FBXL20 associations, both further supported by association with eGFR among African Americans and with transcript abundance, would have been implicated by eGFR candidate gene approaches. LRP2, encoding the megalin receptor, was identified through connection with the previously known eGFR gene DAB2 and extends understanding of the megalin system in kidney function. These findings highlight integration of existing genome-wide association data with independent biological knowledge to uncover novel candidate eGFR associations, including candidates lacking known connections to kidney-specific pathways. The strategy may also be applicable to other clinical phenotypes, although more testing will be needed to assess its potential for discovery in general.

  10. Integration of genome-wide association studies with biological knowledge identifies six novel genes related to kidney function

    PubMed Central

    Chasman, Daniel I.; Fuchsberger, Christian; Pattaro, Cristian; Teumer, Alexander; Böger, Carsten A.; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Taliun, Daniel; Li, Man; Gao, Xiaoyi; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C.; O'Seaghdha, Conall M.; Glazer, Nicole; Isaacs, Aaron; Liu, Ching-Ti; Smith, Albert V.; O'Connell, Jeffrey R.; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Johnson, Andrew D.; Gierman, Hinco J.; Feitosa, Mary F.; Hwang, Shih-Jen; Atkinson, Elizabeth J.; Lohman, Kurt; Cornelis, Marilyn C.; Johansson, Åsa; Tönjes, Anke; Dehghan, Abbas; Lambert, Jean-Charles; Holliday, Elizabeth G.; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y.; Murgia, Federico; Trompet, Stella; Imboden, Medea; Coassin, Stefan; Pistis, Giorgio; Harris, Tamara B.; Launer, Lenore J.; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D.; Boerwinkle, Eric; Schmidt, Helena; Cavalieri, Margherita; Rao, Madhumathi; Hu, Frank; Demirkan, Ayse; Oostra, Ben A.; de Andrade, Mariza; Turner, Stephen T.; Ding, Jingzhong; Andrews, Jeanette S.; Freedman, Barry I.; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Meisinger, Christa; Gieger, Christian; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E.; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H.; Wright, Alan F.; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K.; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G.; Rivadeneira, Fernando; Aulchenko, Yurii S.; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Ketkar, Shamika; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K.; Portas, Laura; Ford, Ian; Buckley, Brendan M.; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Kim, Stuart K.; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J. Wouter; Probst-Hensch, Nicole M.; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R.; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; Siscovick, David S.; van Duijn, Cornelia M.; Borecki, Ingrid B.; Kardia, Sharon L.R.; Liu, Yongmei; Curhan, Gary C.; Rudan, Igor; Gyllensten, Ulf; Wilson, James F.; Franke, Andre; Pramstaller, Peter P.; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M; Parsa, Afshin; Bochud, Murielle; Heid, Iris M.; Kao, W.H. Linda; Fox, Caroline S.; Köttgen, Anna

    2012-01-01

    In conducting genome-wide association studies (GWAS), analytical approaches leveraging biological information may further understanding of the pathophysiology of clinical traits. To discover novel associations with estimated glomerular filtration rate (eGFR), a measure of kidney function, we developed a strategy for integrating prior biological knowledge into the existing GWAS data for eGFR from the CKDGen Consortium. Our strategy focuses on single nucleotide polymorphism (SNPs) in genes that are connected by functional evidence, determined by literature mining and gene ontology (GO) hierarchies, to genes near previously validated eGFR associations. It then requires association thresholds consistent with multiple testing, and finally evaluates novel candidates by independent replication. Among the samples of European ancestry, we identified a genome-wide significant SNP in FBXL20 (P = 5.6 × 10−9) in meta-analysis of all available data, and additional SNPs at the INHBC, LRP2, PLEKHA1, SLC3A2 and SLC7A6 genes meeting multiple-testing corrected significance for replication and overall P-values of 4.5 × 10−4–2.2 × 10−7. Neither the novel PLEKHA1 nor FBXL20 associations, both further supported by association with eGFR among African Americans and with transcript abundance, would have been implicated by eGFR candidate gene approaches. LRP2, encoding the megalin receptor, was identified through connection with the previously known eGFR gene DAB2 and extends understanding of the megalin system in kidney function. These findings highlight integration of existing genome-wide association data with independent biological knowledge to uncover novel candidate eGFR associations, including candidates lacking known connections to kidney-specific pathways. The strategy may also be applicable to other clinical phenotypes, although more testing will be needed to assess its potential for discovery in general. PMID:22962313

  11. Genome-Wide Association of Bipolar Disorder Suggests an Enrichment of Replicable Associations in Regions near Genes

    PubMed Central

    Smith, Erin N.; Koller, Daniel L.; Panganiban, Corrie; Szelinger, Szabolcs; Zhang, Peng; Badner, Judith A.; Barrett, Thomas B.; Berrettini, Wade H.; Bloss, Cinnamon S.; Byerley, William; Coryell, William; Edenberg, Howard J.; Foroud, Tatiana; Gershon, Elliot S.; Greenwood, Tiffany A.; Guo, Yiran; Hipolito, Maria; Keating, Brendan J.; Lawson, William B.; Liu, Chunyu; Mahon, Pamela B.; McInnis, Melvin G.; McMahon, Francis J.; McKinney, Rebecca; Murray, Sarah S.; Nievergelt, Caroline M.; Nurnberger, John I.; Nwulia, Evaristus A.; Potash, James B.; Rice, John; Schulze, Thomas G.; Scheftner, William A.; Shilling, Paul D.; Zandi, Peter P.; Zöllner, Sebastian; Craig, David W.; Schork, Nicholas J.; Kelsoe, John R.

    2011-01-01

    Although a highly heritable and disabling disease, bipolar disorder's (BD) genetic variants have been challenging to identify. We present new genotype data for 1,190 cases and 401 controls and perform a genome-wide association study including additional samples for a total of 2,191 cases and 1,434 controls. We do not detect genome-wide significant associations for individual loci; however, across all SNPs, we show an association between the power to detect effects calculated from a previous genome-wide association study and evidence for replication (P = 1.5×10−7). To demonstrate that this result is not likely to be a false positive, we analyze replication rates in a large meta-analysis of height and show that, in a large enough study, associations replicate as a function of power, approaching a linear relationship. Within BD, SNPs near exons exhibit a greater probability of replication, supporting an enrichment of reproducible associations near functional regions of genes. These results indicate that there is likely common genetic variation associated with BD near exons (±10 kb) that could be identified in larger studies and, further, provide a framework for assessing the potential for replication when combining results from multiple studies. PMID:21738484

  12. Candidate genes for obesity-susceptibility show enriched association within a large genome-wide association study for BMI

    PubMed Central

    Vimaleswaran, Karani S.; Tachmazidou, Ioanna; Zhao, Jing Hua; Hirschhorn, Joel N.; Dudbridge, Frank; Loos, Ruth J.F.

    2012-01-01

    Before the advent of genome-wide association studies (GWASs), hundreds of candidate genes for obesity-susceptibility had been identified through a variety of approaches. We examined whether those obesity candidate genes are enriched for associations with body mass index (BMI) compared with non-candidate genes by using data from a large-scale GWAS. A thorough literature search identified 547 candidate genes for obesity-susceptibility based on evidence from animal studies, Mendelian syndromes, linkage studies, genetic association studies and expression studies. Genomic regions were defined to include the genes ±10 kb of flanking sequence around candidate and non-candidate genes. We used summary statistics publicly available from the discovery stage of the genome-wide meta-analysis for BMI performed by the genetic investigation of anthropometric traits consortium in 123 564 individuals. Hypergeometric, rank tail-strength and gene-set enrichment analysis tests were used to test for the enrichment of association in candidate compared with non-candidate genes. The hypergeometric test of enrichment was not significant at the 5% P-value quantile (P = 0.35), but was nominally significant at the 25% quantile (P = 0.015). The rank tail-strength and gene-set enrichment tests were nominally significant for the full set of genes and borderline significant for the subset without SNPs at P < 10−7. Taken together, the observed evidence for enrichment suggests that the candidate gene approach retains some value. However, the degree of enrichment is small despite the extensive number of candidate genes and the large sample size. Studies that focus on candidate genes have only slightly increased chances of detecting associations, and are likely to miss many true effects in non-candidate genes, at least for obesity-related traits. PMID:22791748

  13. Candidate genes for obesity-susceptibility show enriched association within a large genome-wide association study for BMI.

    PubMed

    Vimaleswaran, Karani S; Tachmazidou, Ioanna; Zhao, Jing Hua; Hirschhorn, Joel N; Dudbridge, Frank; Loos, Ruth J F

    2012-10-15

    Before the advent of genome-wide association studies (GWASs), hundreds of candidate genes for obesity-susceptibility had been identified through a variety of approaches. We examined whether those obesity candidate genes are enriched for associations with body mass index (BMI) compared with non-candidate genes by using data from a large-scale GWAS. A thorough literature search identified 547 candidate genes for obesity-susceptibility based on evidence from animal studies, Mendelian syndromes, linkage studies, genetic association studies and expression studies. Genomic regions were defined to include the genes ±10 kb of flanking sequence around candidate and non-candidate genes. We used summary statistics publicly available from the discovery stage of the genome-wide meta-analysis for BMI performed by the genetic investigation of anthropometric traits consortium in 123 564 individuals. Hypergeometric, rank tail-strength and gene-set enrichment analysis tests were used to test for the enrichment of association in candidate compared with non-candidate genes. The hypergeometric test of enrichment was not significant at the 5% P-value quantile (P = 0.35), but was nominally significant at the 25% quantile (P = 0.015). The rank tail-strength and gene-set enrichment tests were nominally significant for the full set of genes and borderline significant for the subset without SNPs at P < 10(-7). Taken together, the observed evidence for enrichment suggests that the candidate gene approach retains some value. However, the degree of enrichment is small despite the extensive number of candidate genes and the large sample size. Studies that focus on candidate genes have only slightly increased chances of detecting associations, and are likely to miss many true effects in non-candidate genes, at least for obesity-related traits.

  14. Genome-wide identification, classification, and expression analysis of sHSP genes in Chinese cabbage (Brassica rapa ssp pekinensis).

    PubMed

    Tao, P; Guo, W L; Li, B Y; Wang, W H; Yue, Z C; Lei, J L; Zhong, X M

    2015-10-05

    Small heat shock proteins (sHSPs) are essential for the plant's normal development and stress responses, especially the heat stress response. The information regarding sHSP genes in Chinese cabbage (Brassica rapa ssp pekinensis) is sparse, hence we performed a genome-wide analysis to identify sHSP genes in this species. We identified 26 non-redundant sHSP genes distributed on all chromosomes, except chromosome A7, with one additional sHSP gene identified from an expressed sequence tag library. Chinese cabbage was found to contain more sHSP genes than Arabidopsis. The 27 sHSP genes were classified into 11 subfamilies. We identified 22 groups of sHSP syntenic orthologous genes between Chinese cabbage and Arabidopsis. In addition, eight groups of paralogous genes were uncovered in Chinese cabbage. Protein structures of the 27 Chinese cabbage sHSPs were modeled using Phyre2, which revealed that all of them contain several conserved β strands across different subfamilies. In general, gene structure was conserved within each subfamily between Chinese cabbage and Arabidopsis, except for peroxisome sHSP. Analysis of promoter motifs showed that most sHSP genes contain heat shock elements or variants. We also found that biased gene loss has occurred during the evolution of the sHSP subfamily in Chinese cabbage. Expression analysis indicated that the greatest transcript abundance of most Chinese cabbage sHSP genes was found in siliques and early cotyledon embryos. Thus, genome-wide identification and characterization of sHSP genes is a first and important step in the investigation of sHSPs in Chinese cabbage.

  15. Genome-wide identification, classification, and expression analysis of sHSP genes in Chinese cabbage (Brassica rapa ssp pekinensis).

    PubMed

    Tao, P; Guo, W L; Li, B Y; Wang, W H; Yue, Z C; Lei, J L; Zhong, X M

    2015-01-01

    Small heat shock proteins (sHSPs) are essential for the plant's normal development and stress responses, especially the heat stress response. The information regarding sHSP genes in Chinese cabbage (Brassica rapa ssp pekinensis) is sparse, hence we performed a genome-wide analysis to identify sHSP genes in this species. We identified 26 non-redundant sHSP genes distributed on all chromosomes, except chromosome A7, with one additional sHSP gene identified from an expressed sequence tag library. Chinese cabbage was found to contain more sHSP genes than Arabidopsis. The 27 sHSP genes were classified into 11 subfamilies. We identified 22 groups of sHSP syntenic orthologous genes between Chinese cabbage and Arabidopsis. In addition, eight groups of paralogous genes were uncovered in Chinese cabbage. Protein structures of the 27 Chinese cabbage sHSPs were modeled using Phyre2, which revealed that all of them contain several conserved β strands across different subfamilies. In general, gene structure was conserved within each subfamily between Chinese cabbage and Arabidopsis, except for peroxisome sHSP. Analysis of promoter motifs showed that most sHSP genes contain heat shock elements or variants. We also found that biased gene loss has occurred during the evolution of the sHSP subfamily in Chinese cabbage. Expression analysis indicated that the greatest transcript abundance of most Chinese cabbage sHSP genes was found in siliques and early cotyledon embryos. Thus, genome-wide identification and characterization of sHSP genes is a first and important step in the investigation of sHSPs in Chinese cabbage. PMID:26505345

  16. A genome-wide association study of variations in maternal cardiometabolic genes and risk of placental abruption

    PubMed Central

    Moore, Amy; Enquobahrie, Daniel A; Sanchez, Sixto E; Ananth, Cande V; Pacora, Percy N; Williams, Michelle A

    2012-01-01

    Accumulating evidence suggests that placental abruption has a complex multifactorial pathogenesis that involves cardiovascular risk and metabolic dysfunction. However, comprehensive assessment of variations in genes involved in cardiometabolic traits associated with the risk of placental abruption is lacking. We conducted a case-control study investigating associations of variations in maternal cardiometabolic genes (characterized using 217,697 SNPs on the Illumina Cardio-Metabo Chip) with risk of placental abruption. A total of 253 abruption cases and 258 controls were selected from among participants enrolled in the Peruvian Abruptio Placentae Epidemiology Study in Lima, Peru. In the genome-wide association analyses, top hits did not surpass genome-wide significance. However, we observed suggestive associations of placental abruption with several SNPs, including SNPs in SMAD2 (P-value=1.88e-6), MIR17HG (P-value=7.8e-6], and DGKB (P-value=8.35e-6] loci. In candidate gene analyses, we observed associations of variations in a priori selected genes involved in coagulation, rennin-angiotensin, angiogenesis, inflammation, and B-vitamin metabolism with the risk of abruption. Our study suggests that variations in maternal cardiovascular and metabolic genes may be associated with risk of placental abruption. Future studies with large sample sizes are warranted. PMID:23205182

  17. Genome-Wide Analysis of Wilms’ Tumor 1-Controlled Gene Expression in Podocytes Reveals Key Regulatory Mechanisms

    PubMed Central

    Kann, Martin; Ettou, Sandrine; Jung, Youngsook L.; Lenz, Maximilian O.; Taglienti, Mary E.; Park, Peter J.; Schermer, Bernhard

    2015-01-01

    The transcription factor Wilms’ tumor suppressor 1 (WT1) is key to podocyte development and viability; however, WT1 transcriptional networks in podocytes remain elusive. We provide a comprehensive analysis of the genome-wide WT1 transcriptional network in podocytes in vivo using chromatin immunoprecipitation followed by sequencing (ChIPseq) and RNA sequencing techniques. Our data show a specific role for WT1 in regulating the podocyte-specific transcriptome through binding to both promoters and enhancers of target genes. Furthermore, we inferred a podocyte transcription factor network consisting of WT1, LMX1B, TCF21, Fox-class and TEAD family transcription factors, and MAFB that uses tissue-specific enhancers to control podocyte gene expression. In addition to previously described WT1-dependent target genes, ChIPseq identified novel WT1-dependent signaling systems. These targets included components of the Hippo signaling system, underscoring the power of genome-wide transcriptional-network analyses. Together, our data elucidate a comprehensive gene regulatory network in podocytes suggesting that WT1 gene regulatory function and podocyte cell-type specification can best be understood in the context of transcription factor-regulatory element network interplay. PMID:25636411

  18. Towards understanding the breast cancer epigenome: a comparison of genome-wide DNA methylation and gene expression data.

    PubMed

    Singhal, Sandeep K; Usmani, Nawaid; Michiels, Stefan; Metzger-Filho, Otto; Saini, Kamal S; Kovalchuk, Olga; Parliament, Matthew

    2016-01-19

    Until recently, an elevated disease risk has been ascribed to a genetic predisposition, however, exciting progress over the past years has discovered alternate elements of inheritance that involve epigenetic regulation. Epigenetic changes are heritably stable alterations that include DNA methylation, histone modifications and RNA-mediated silencing. Aberrant DNA methylation is a common molecular basis for a number of important human diseases, including breast cancer. Changes in DNA methylation profoundly affect global gene expression patterns. What is emerging is a more dynamic and complex association between DNA methylation and gene expression than previously believed. Although many tools have already been developed for analyzing genome-wide gene expression data, tools for analyzing genome-wide DNA methylation have not yet reached the same level of refinement. Here we provide an in-depth analysis of DNA methylation in parallel with gene expression data characteristics and describe the particularities of low-level and high-level analyses of DNA methylation data. Low-level analysis refers to pre-processing of methylation data (i.e. normalization, transformation and filtering), whereas high-level analysis is focused on illustrating the application of the widely used class comparison, class prediction and class discovery methods to DNA methylation data. Furthermore, we investigate the influence of DNA methylation on gene expression by measuring the correlation between the degree of CpG methylation and the level of expression and to explore the pattern of methylation as a function of the promoter region.

  19. Cyclic nucleotide gated channel gene family in tomato: genome-wide identification and functional analyses in disease resistance

    PubMed Central

    Saand, Mumtaz A.; Xu, You-Ping; Li, Wen; Wang, Ji-Peng; Cai, Xin-Zhong

    2015-01-01

    The cyclic nucleotide gated channel (CNGC) is suggested to be one of the important calcium conducting channels. Nevertheless, genome-wide identification and systemic functional analysis of CNGC gene family in crop plant species have not yet been conducted. In this study, we performed genome-wide identification of CNGC gene family in the economically important crop tomato (Solanum lycopersicum L.) and analyzed function of the group IVb SlCNGC genes in disease resistance. Eighteen CNGC genes were identified in tomato genome, and four CNGC loci that were misannotated at database were corrected by cloning and sequencing. Detailed bioinformatics analyses on gene structure, domain composition and phylogenetic relationship of the SlCNGC gene family were conducted and the group-specific feature was revealed. Comprehensive expression analyses demonstrated that SlCNGC genes were highly, widely but differently responsive to diverse stimuli. Pharmacological assays showed that the putative CNGC activators cGMP and cAMP enhanced resistance against Sclerotinia sclerotiorum. Silencing of group IVb SlCNGC genes significantly enhanced resistance to fungal pathogens Pythium aphanidermatum and S. sclerotiorum, strongly reduced resistance to viral pathogen Tobacco rattle virus, while attenuated PAMP- and DAMP-triggered immunity as shown by obvious decrease of the flg22- and AtPep1-elicited hydrogen peroxide accumulation in SlCNGC-silenced plants. Additionally, silencing of these SlCNGC genes significantly altered expression of a set of Ca2+ signaling genes including SlCaMs, SlCDPKs, and SlCAMTA3. Collectively, our results reveal that group IV SlCNGC genes regulate a wide range of resistance in tomato probably by affecting Ca2+ signaling. PMID:25999969

  20. Promising Loci and Genes for Yolk and Ovary Weight in Chickens Revealed by a Genome-Wide Association Study

    PubMed Central

    Yi, Guoqiang; Yuan, Jingwei; Duan, Zhongyi; Qu, Lujiang; Xu, Guiyun; Wang, Kehua; Yang, Ning

    2015-01-01

    Because it serves as the cytoplasm of the oocyte and provides a large amount of reserves, the egg yolk has biological significance for developing embryos. The ovary and its hierarchy of follicles are the main reproductive organs responsible for yolk deposition in chickens. However, the genetic architecture underlying the yolk and ovarian follicle weights remains elusive. Here, we measured the yolk weight (YW) at 11 age points from onset of egg laying to 72 weeks of age and measured the follicle weight (FW) and ovary weight (OW) at 73 weeks as part of a comprehensive genome-wide association study (GWAS) in 1,534 F2 hens derived from reciprocal crosses between White Leghorn (WL) and Dongxiang chickens (DX). For all ages, YWs exhibited moderate single nucleotide polymorphism (SNP)-based heritability estimates (0.25–0.38), while the estimates for FW (0.16) and OW (0.20) were relatively low. Independent univariate genome-wide screens for each trait identified 12, 3, and 31 novel significant associations with YW, FW, and OW, respectively. A list of candidate genes such as ZAR1, STARD13, ACER1b, ACSBG2, and DHRS12 were identified for having a plausible function in yolk and follicle development. These genes are important to the initiation of embryogenesis, lipid transport, lipoprotein synthesis, lipid droplet promotion, and steroid hormone metabolism, respectively. Our study provides for the first time a genome-wide association (GWA) analysis for follicle and ovary weight. Identification of the promising loci as well as potential candidate genes will greatly advance our understanding of the genetic basis underlying dynamic yolk weight and ovarian follicle development and has practical significance in breeding programs for the alteration of yolk weight at different age points. PMID:26332579

  1. GENOME WIDE IDENTIFICATION OF NEW GENES AND PATHWAYS IN PATIENTS WITH BOTH AUTOIMMUNE THYROIDITIS AND TYPE 1 DIABETES

    PubMed Central

    Tomer, Yaron; Dolan, Lawrence M.; Kahaly, George; Divers, Jasmin; D’Agostino, Ralph B.; Imperatore, Giuseppina; Dabelea, Dana; Marcovina, Santica; Black, Mary Helen; Pihoker, Catherine; Hasham, Alia; Salehi Hammerstad, Sara; Greenberg, David A.; Lotay, Vaneet; Zhang, Weijia; Monti, Maria Cristina; Matheis, Nina

    2015-01-01

    Autoimmune thyroid diseases (AITD) and Type 1 diabetes (T1D) frequently occur in the same individual pointing to a strong shared genetic susceptibility. Indeed, the cooccurrence of T1D and AITD in the same individual is classified as a variant of the autoimmune polyglandular syndrome type 3 (designated APS3v). Our aim was to identify new genes and mechanisms causing the co-occurrence of T1D+AITD (APS3v) in the same individual using a genome-wide approach. For our discovery set we analyzed 346 Caucasian APS3v patients and 727 gender and ethnicity matched healthy controls. Genotyping was performed using the Illumina Human660W-Quad.v1. The replication set included 185 APS3v patients and 340 controls. Association analyses were performed using the PLINK program, and pathway analyses were performed using the MAGENTA software. We identified multiple signals within the HLA region and conditioning studies suggested that a few of them contributed independently to the strong association of the HLA locus with APS3v. Outside the HLA region, variants in GPR103, a gene not suggested by previous studies of APS3v, T1D, or AITD, showed genome-wide significance (p<5×10−8). In addition, a locus on 1p13 containing the PTPN22 gene showed genome-wide significant associations. Pathway analysis demonstrated that cell cycle, B-cell development, CD40, and CTLA-4 signaling were the major pathways contributing to the pathogenesis of APS3v. These findings suggest that complex mechanisms involving T-cell and B-cell pathways are involved in the strong genetic association between AITD and T1D. PMID:25936594

  2. Genome wide identification of new genes and pathways in patients with both autoimmune thyroiditis and type 1 diabetes.

    PubMed

    Tomer, Yaron; Dolan, Lawrence M; Kahaly, George; Divers, Jasmin; D'Agostino, Ralph B; Imperatore, Giuseppina; Dabelea, Dana; Marcovina, Santica; Black, Mary Helen; Pihoker, Catherine; Hasham, Alia; Hammerstad, Sara Salehi; Greenberg, David A; Lotay, Vaneet; Zhang, Weijia; Monti, Maria Cristina; Matheis, Nina

    2015-06-01

    Autoimmune thyroid diseases (AITD) and Type 1 diabetes (T1D) frequently occur in the same individual pointing to a strong shared genetic susceptibility. Indeed, the co-occurrence of T1D and AITD in the same individual is classified as a variant of the autoimmune polyglandular syndrome type 3 (designated APS3v). Our aim was to identify new genes and mechanisms causing the co-occurrence of T1D + AITD (APS3v) in the same individual using a genome-wide approach. For our discovery set we analyzed 346 Caucasian APS3v patients and 727 gender and ethnicity matched healthy controls. Genotyping was performed using the Illumina Human660W-Quad.v1. The replication set included 185 APS3v patients and 340 controls. Association analyses were performed using the PLINK program, and pathway analyses were performed using the MAGENTA software. We identified multiple signals within the HLA region and conditioning studies suggested that a few of them contributed independently to the strong association of the HLA locus with APS3v. Outside the HLA region, variants in GPR103, a gene not suggested by previous studies of APS3v, T1D, or AITD, showed genome-wide significance (p < 5 × 10(-8)). In addition, a locus on 1p13 containing the PTPN22 gene showed genome-wide significant associations. Pathway analysis demonstrated that cell cycle, B-cell development, CD40, and CTLA-4 signaling were the major pathways contributing to the pathogenesis of APS3v. These findings suggest that complex mechanisms involving T-cell and B-cell pathways are involved in the strong genetic association between AITD and T1D.

  3. Genome wide identification of new genes and pathways in patients with both autoimmune thyroiditis and type 1 diabetes.

    PubMed

    Tomer, Yaron; Dolan, Lawrence M; Kahaly, George; Divers, Jasmin; D'Agostino, Ralph B; Imperatore, Giuseppina; Dabelea, Dana; Marcovina, Santica; Black, Mary Helen; Pihoker, Catherine; Hasham, Alia; Hammerstad, Sara Salehi; Greenberg, David A; Lotay, Vaneet; Zhang, Weijia; Monti, Maria Cristina; Matheis, Nina

    2015-06-01

    Autoimmune thyroid diseases (AITD) and Type 1 diabetes (T1D) frequently occur in the same individual pointing to a strong shared genetic susceptibility. Indeed, the co-occurrence of T1D and AITD in the same individual is classified as a variant of the autoimmune polyglandular syndrome type 3 (designated APS3v). Our aim was to identify new genes and mechanisms causing the co-occurrence of T1D + AITD (APS3v) in the same individual using a genome-wide approach. For our discovery set we analyzed 346 Caucasian APS3v patients and 727 gender and ethnicity matched healthy controls. Genotyping was performed using the Illumina Human660W-Quad.v1. The replication set included 185 APS3v patients and 340 controls. Association analyses were performed using the PLINK program, and pathway analyses were performed using the MAGENTA software. We identified multiple signals within the HLA region and conditioning studies suggested that a few of them contributed independently to the strong association of the HLA locus with APS3v. Outside the HLA region, variants in GPR103, a gene not suggested by previous studies of APS3v, T1D, or AITD, showed genome-wide significance (p < 5 × 10(-8)). In addition, a locus on 1p13 containing the PTPN22 gene showed genome-wide significant associations. Pathway analysis demonstrated that cell cycle, B-cell development, CD40, and CTLA-4 signaling were the major pathways contributing to the pathogenesis of APS3v. These findings suggest that complex mechanisms involving T-cell and B-cell pathways are involved in the strong genetic association between AITD and T1D. PMID:25936594

  4. Genome-Wide Analysis of the Musa WRKY Gene Family: Evolution and Differential Expression during Development and Stress.

    PubMed

    Goel, Ridhi; Pandey, Ashutosh; Trivedi, Prabodh K; Asif, Mehar H

    2016-01-01

    The WRKY gene family plays an important role in the development and stress responses in plants. As information is not available on the WRKY gene family in Musa species, genome-wide analysis has been carried out in this study using available genomic information from two species, Musa acuminata and Musa balbisiana. Analysis identified 147 and 132 members of the WRKY gene family in M. acuminata and M. balbisiana, respectively. Evolutionary analysis suggests that the WRKY gene family expanded much before the speciation in both the species. Most of the orthologs retained in two species were from the γ duplication event which occurred prior to α and β genome-wide duplication (GWD) events. Analysis also suggests that subtle changes in nucleotide sequences during the course of evolution have led to the development of new motifs which might be involved in neo-functionalization of different WRKY members in two species. Expression and cis-regulatory motif analysis suggest possible involvement of Group II and Group III WRKY members during various stresses and growth/development including fruit ripening process respectively. PMID:27014321

  5. The Csr system regulates genome-wide mRNA stability and transcription and thus gene expression in Escherichia coli.

    PubMed

    Esquerré, Thomas; Bouvier, Marie; Turlan, Catherine; Carpousis, Agamemnon J; Girbal, Laurence; Cocaign-Bousquet, Muriel

    2016-04-26

    Bacterial adaptation requires large-scale regulation of gene expression. We have performed a genome-wide analysis of the Csr system, which regulates many important cellular functions. The Csr system is involved in post-transcriptional regulation, but a role in transcriptional regulation has also been suggested. Two proteins, an RNA-binding protein CsrA and an atypical signaling protein CsrD, participate in the Csr system. Genome-wide transcript stabilities and levels were compared in wildtype E. coli (MG1655) and isogenic mutant strains deficient in CsrA or CsrD activity demonstrating for the first time that CsrA and CsrD are global negative and positive regulators of transcription, respectively. The role of CsrA in transcription regulation may be indirect due to the 4.6-fold increase in csrD mRNA concentration in the CsrA deficient strain. Transcriptional action of CsrA and CsrD on a few genes was validated by transcriptional fusions. In addition to an effect on transcription, CsrA stabilizes thousands of mRNAs. This is the first demonstration that CsrA is a global positive regulator of mRNA stability. For one hundred genes, we predict that direct control of mRNA stability by CsrA might contribute to metabolic adaptation by regulating expression of genes involved in carbon metabolism and transport independently of transcriptional regulation.

  6. The Csr system regulates genome-wide mRNA stability and transcription and thus gene expression in Escherichia coli

    PubMed Central

    Esquerré, Thomas; Bouvier, Marie; Turlan, Catherine; Carpousis, Agamemnon J.; Girbal, Laurence; Cocaign-Bousquet, Muriel

    2016-01-01

    Bacterial adaptation requires large-scale regulation of gene expression. We have performed a genome-wide analysis of the Csr system, which regulates many important cellular functions. The Csr system is involved in post-transcriptional regulation, but a role in transcriptional regulation has also been suggested. Two proteins, an RNA-binding protein CsrA and an atypical signaling protein CsrD, participate in the Csr system. Genome-wide transcript stabilities and levels were compared in wildtype E. coli (MG1655) and isogenic mutant strains deficient in CsrA or CsrD activity demonstrating for the first time that CsrA and CsrD are global negative and positive regulators of transcription, respectively. The role of CsrA in transcription regulation may be indirect due to the 4.6-fold increase in csrD mRNA concentration in the CsrA deficient strain. Transcriptional action of CsrA and CsrD on a few genes was validated by transcriptional fusions. In addition to an effect on transcription, CsrA stabilizes thousands of mRNAs. This is the first demonstration that CsrA is a global positive regulator of mRNA stability. For one hundred genes, we predict that direct control of mRNA stability by CsrA might contribute to metabolic adaptation by regulating expression of genes involved in carbon metabolism and transport independently of transcriptional regulation. PMID:27112822

  7. Genome-Wide Analysis of the Musa WRKY Gene Family: Evolution and Differential Expression during Development and Stress

    PubMed Central

    Goel, Ridhi; Pandey, Ashutosh; Trivedi, Prabodh K.; Asif, Mehar H.

    2016-01-01

    The WRKY gene family plays an important role in the development and stress responses in plants. As information is not available on the WRKY gene family in Musa species, genome-wide analysis has been carried out in this study using available genomic information from two species, Musa acuminata and Musa balbisiana. Analysis identified 147 and 132 members of the WRKY gene family in M. acuminata and M. balbisiana, respectively. Evolutionary analysis suggests that the WRKY gene family expanded much before the speciation in both the species. Most of the orthologs retained in two species were from the γ duplication event which occurred prior to α and β genome-wide duplication (GWD) events. Analysis also suggests that subtle changes in nucleotide sequences during the course of evolution have led to the development of new motifs which might be involved in neo-functionalization of different WRKY members in two species. Expression and cis-regulatory motif analysis suggest possible involvement of Group II and Group III WRKY members during various stresses and growth/development including fruit ripening process respectively. PMID:27014321

  8. Genome-Wide Distribution, Organisation and Functional Characterization of Disease Resistance and Defence Response Genes across Rice Species.

    PubMed

    Singh, Sangeeta; Chand, Suresh; Singh, N K; Sharma, Tilak Raj

    2015-01-01

    The resistance (R) genes and defense response (DR) genes have become very important resources for the development of disease resistant cultivars. In the present investigation, genome-wide identification, expression, phylogenetic and synteny analysis was done for R and DR-genes across three species of rice viz: Oryza sativa ssp indica cv 93-11, Oryza sativa ssp japonica and wild rice species, Oryza brachyantha. We used the in silico approach to identify and map 786 R -genes and 167 DR-genes, 672 R-genes and 142 DR-genes, 251 R-genes and 86 DR-genes in the japonica, indica and O. brachyanth a genomes, respectively. Our analysis showed that 60.5% and 55.6% of the R-genes are tandemly repeated within clusters and distributed over all the rice chromosomes in indica and japonica genomes, respectively. The phylogenetic analysis along with motif distribution shows high degree of conservation of R- and DR-genes in clusters. In silico expression analysis of R-genes and DR-genes showed more than 85% were expressed genes showing corresponding EST matches in the databases. This study gave special emphasis on mechanisms of gene evolution and duplication for R and DR genes across species. Analysis of paralogs across rice species indicated 17% and 4.38% R-genes, 29% and 11.63% DR-genes duplication in indica and Oryza brachyantha, as compared to 20% and 26% duplication of R-genes and DR-genes in japonica respectively. We found that during the course of duplication only 9.5% of R- and DR-genes changed their function and rest of the genes have maintained their identity. Syntenic relationship across three genomes inferred that more orthology is shared between indica and japonica genomes as compared to brachyantha genome. Genome wide identification of R-genes and DR-genes in the rice genome will help in allele mining and functional validation of these genes, and to understand molecular mechanism of disease resistance and their evolution in rice and related species.

  9. Genome-Wide Identification, Phylogenetic and Expression Analyses of the Ubiquitin-Conjugating Enzyme Gene Family in Maize

    PubMed Central

    Jue, Dengwei; Sang, Xuelian; Lu, Shengqiao; Dong, Chen; Zhao, Qiufang; Chen, Hongliang; Jia, Liqiang

    2015-01-01

    Background Ubiquitination is a post-translation modification where ubiquitin is attached to a substrate. Ubiquitin-conjugating enzymes (E2s) play a major role in the ubiquitin transfer pathway, as well as a variety of functions in plant biological processes. To date, no genome-wide characterization of this gene family has been conducted in maize (Zea mays). Methodology/Principal Findings In the present study, a total of 75 putative ZmUBC genes have been identified and located in the maize genome. Phylogenetic analysis revealed that ZmUBC proteins could be divided into 15 subfamilies, which include 13 ubiquitin-conjugating enzymes (ZmE2s) and two independent ubiquitin-conjugating enzyme variant (UEV) groups. The predicted ZmUBC genes were distributed across 10 chromosomes at different densities. In addition, analysis of exon-intron junctions and sequence motifs in each candidate gene has revealed high levels of conservation within and between phylogenetic groups. Tissue expression analysis indicated that most ZmUBC genes were expressed in at least one of the tissues, indicating that these are involved in various physiological and developmental processes in maize. Moreover, expression profile analyses of ZmUBC genes under different stress treatments (4°C, 20% PEG6000, and 200 mM NaCl) and various expression patterns indicated that these may play crucial roles in the response of plants to stress. Conclusions Genome-wide identification, chromosome organization, gene structure, evolutionary and expression analyses of ZmUBC genes have facilitated in the characterization of this gene family, as well as determined its potential involvement in growth, development, and stress responses. This study provides valuable information for better understanding the classification and putative functions of the UBC-encoding genes of maize. PMID:26606743

  10. Gene-environment interaction effects on lung function- a genome-wide association study within the Framingham heart study

    PubMed Central

    2013-01-01

    Background Previous studies in occupational exposure and lung function have focused only on the main effect of occupational exposure or genetics on lung function. Some disease-susceptible genes may be missed due to their low marginal effects, despite potential involvement in the disease process through interactions with the environment. Through comprehensive genome-wide gene-environment interaction studies, we can uncover these susceptibility genes. Our objective in this study was to explore gene by occupational exposure interaction effects on lung function using both the individual SNPs approach and the genetic network approach. Methods The study population comprised the Offspring Cohort and the Third Generation from the Framingham Heart Study. We used forced expiratory volume in one second (FEV1) and ratio of FEV1 to forced vital capacity (FVC) as outcomes. Occupational exposures were classified using a population-specific job exposure matrix. We performed genome-wide gene-environment interaction analysis, using the Affymetrix 550 K mapping array for genotyping. A linear regression-based generalized estimating equation was applied to account for within-family relatedness. Network analysis was conducted using results from single-nucleotide polymorphism (SNP)-level analyses and from gene expression study results. Results There were 4,785 participants in total. SNP-level analysis and network analysis identified SNP rs9931086 (Pinteraction =1.16 × 10-7) in gene SLC38A8, which may significantly modify the effects of occupational exposure on FEV1. Genes identified from the network analysis included CTLA-4, HDAC, and PPAR-alpha. Conclusions Our study implies that SNP rs9931086 in SLC38A8 and genes CTLA-4, HDAC, and PPAR-alpha, which are related to inflammatory processes, may modify the effect of occupational exposure on lung function. PMID:24289273

  11. Genome Wide Association Mapping in Arabidopsis thaliana Identifies Novel Genes Involved in Linking Allyl Glucosinolate to Altered Biomass and Defense

    PubMed Central

    Francisco, Marta; Joseph, Bindu; Caligagan, Hart; Li, Baohua; Corwin, Jason A.; Lin, Catherine; Kerwin, Rachel E.; Burow, Meike; Kliebenstein, Daniel J.

    2016-01-01

    A key limitation in modern biology is the ability to rapidly identify genes underlying newly identified complex phenotypes. Genome wide association studies (GWAS) have become an increasingly important approach for dissecting natural variation by associating phenotypes with genotypes at a genome wide level. Recent work is showing that the Arabidopsis thaliana defense metabolite, allyl glucosinolate (GSL), may provide direct feedback regulation, linking defense metabolism outputs to the growth, and defense responses of the plant. However, there is still a need to identify genes that underlie this process. To start developing a deeper understanding of the mechanism(s) that modulate the ability of exogenous allyl GSL to alter growth and defense, we measured changes in plant biomass and defense metabolites in a collection of natural 96 A. thaliana accessions fed with 50 μM of allyl GSL. Exogenous allyl GSL was introduced exclusively to the roots and the compound transported to the leaf leading to a wide range of heritable effects upon plant biomass and endogenous GSL accumulation. Using natural variation we conducted GWAS to identify a number of new genes which potentially control allyl responses in various plant processes. This is one of the first instances in which this approach has been successfully utilized to begin dissecting a novel phenotype to the underlying molecular/polygenic basis. PMID:27462337

  12. Genome Wide Association Mapping in Arabidopsis thaliana Identifies Novel Genes Involved in Linking Allyl Glucosinolate to Altered Biomass and Defense.

    PubMed

    Francisco, Marta; Joseph, Bindu; Caligagan, Hart; Li, Baohua; Corwin, Jason A; Lin, Catherine; Kerwin, Rachel E; Burow, Meike; Kliebenstein, Daniel J

    2016-01-01

    A key limitation in modern biology is the ability to rapidly identify genes underlying newly identified complex phenotypes. Genome wide association studies (GWAS) have become an increasingly important approach for dissecting natural variation by associating phenotypes with genotypes at a genome wide level. Recent work is showing that the Arabidopsis thaliana defense metabolite, allyl glucosinolate (GSL), may provide direct feedback regulation, linking defense metabolism outputs to the growth, and defense responses of the plant. However, there is still a need to identify genes that underlie this process. To start developing a deeper understanding of the mechanism(s) that modulate the ability of exogenous allyl GSL to alter growth and defense, we measured changes in plant biomass and defense metabolites in a collection of natural 96 A. thaliana accessions fed with 50 μM of allyl GSL. Exogenous allyl GSL was introduced exclusively to the roots and the compound transported to the leaf leading to a wide range of heritable effects upon plant biomass and endogenous GSL accumulation. Using natural variation we conducted GWAS to identify a number of new genes which potentially control allyl responses in various plant processes. This is one of the first instances in which this approach has been successfully utilized to begin dissecting a novel phenotype to the underlying molecular/polygenic basis.

  13. Rapid genome-wide evolution in Brassica rapa populations following drought revealed by sequencing of ancestral and descendant gene pools.

    PubMed

    Franks, Steven J; Kane, Nolan C; O'Hara, Niamh B; Tittes, Silas; Rest, Joshua S

    2016-08-01

    There is increasing evidence that evolution can occur rapidly in response to selection. Recent advances in sequencing suggest the possibility of documenting genetic changes as they occur in populations, thus uncovering the genetic basis of evolution, particularly if samples are available from both before and after selection. Here, we had a unique opportunity to directly assess genetic changes in natural populations following an evolutionary response to a fluctuation in climate. We analysed genome-wide differences between ancestors and descendants of natural populations of Brassica rapa plants from two locations that rapidly evolved changes in multiple phenotypic traits, including flowering time, following a multiyear late-season drought in California. These ancestor-descendant comparisons revealed evolutionary shifts in allele frequencies in many genes. Some genes showing evolutionary shifts have functions related to drought stress and flowering time, consistent with an adaptive response to selection. Loci differentiated between ancestors and descendants (FST outliers) were generally different from those showing signatures of selection based on site frequency spectrum analysis (Tajima's D), indicating that the loci that evolved in response to the recent drought and those under historical selection were generally distinct. Very few genes showed similar evolutionary responses between two geographically distinct populations, suggesting independent genetic trajectories of evolution yielding parallel phenotypic changes. The results show that selection can result in rapid genome-wide evolutionary shifts in allele frequencies in natural populations, and highlight the usefulness of combining resurrection experiments in natural populations with genomics for studying the genetic basis of adaptive evolution. PMID:27072809

  14. Genome Wide Association Mapping in Arabidopsis thaliana Identifies Novel Genes Involved in Linking Allyl Glucosinolate to Altered Biomass and Defense.

    PubMed

    Francisco, Marta; Joseph, Bindu; Caligagan, Hart; Li, Baohua; Corwin, Jason A; Lin, Catherine; Kerwin, Rachel E; Burow, Meike; Kliebenstein, Daniel J

    2016-01-01

    A key limitation in modern biology is the ability to rapidly identify genes underlying newly identified complex phenotypes. Genome wide association studies (GWAS) have become an increasingly important approach for dissecting natural variation by associating phenotypes with genotypes at a genome wide level. Recent work is showing that the Arabidopsis thaliana defense metabolite, allyl glucosinolate (GSL), may provide direct feedback regulation, linking defense metabolism outputs to the growth, and defense responses of the plant. However, there is still a need to identify genes that underlie this process. To start developing a deeper understanding of the mechanism(s) that modulate the ability of exogenous allyl GSL to alter growth and defense, we measured changes in plant biomass and defense metabolites in a collection of natural 96 A. thaliana accessions fed with 50 μM of allyl GSL. Exogenous allyl GSL was introduced exclusively to the roots and the compound transported to the leaf leading to a wide range of heritable effects upon plant biomass and endogenous GSL accumulation. Using natural variation we conducted GWAS to identify a number of new genes which potentially control allyl responses in various plant processes. This is one of the first instances in which this approach has been successfully utilized to begin dissecting a novel phenotype to the underlying molecular/polygenic basis. PMID:27462337

  15. Rapid genome-wide evolution in Brassica rapa populations following drought revealed by sequencing of ancestral and descendant gene pools.

    PubMed

    Franks, Steven J; Kane, Nolan C; O'Hara, Niamh B; Tittes, Silas; Rest, Joshua S

    2016-08-01

    There is increasing evidence that evolution can occur rapidly in response to selection. Recent advances in sequencing suggest the possibility of documenting genetic changes as they occur in populations, thus uncovering the genetic basis of evolution, particularly if samples are available from both before and after selection. Here, we had a unique opportunity to directly assess genetic changes in natural populations following an evolutionary response to a fluctuation in climate. We analysed genome-wide differences between ancestors and descendants of natural populations of Brassica rapa plants from two locations that rapidly evolved changes in multiple phenotypic traits, including flowering time, following a multiyear late-season drought in California. These ancestor-descendant comparisons revealed evolutionary shifts in allele frequencies in many genes. Some genes showing evolutionary shifts have functions related to drought stress and flowering time, consistent with an adaptive response to selection. Loci differentiated between ancestors and descendants (FST outliers) were generally different from those showing signatures of selection based on site frequency spectrum analysis (Tajima's D), indicating that the loci that evolved in response to the recent drought and those under historical selection were generally distinct. Very few genes showed similar evolutionary responses between two geographically distinct populations, suggesting independent genetic trajectories of evolution yielding parallel phenotypic changes. The results show that selection can result in rapid genome-wide evolutionary shifts in allele frequencies in natural populations, and highlight the usefulness of combining resurrection experiments in natural populations with genomics for studying the genetic basis of adaptive evolution.

  16. The histone modification pattern of active genes revealed through genome-wide chromatin analysis of a higher eukaryote

    PubMed Central

    Schübeler, Dirk; MacAlpine, David M.; Scalzo, David; Wirbelauer, Christiane; Kooperberg, Charles; van Leeuwen, Fred; Gottschling, Daniel E.; O'Neill, Laura P.; Turner, Bryan M.; Delrow, Jeffrey; Bell, Stephen P.; Groudine, Mark

    2004-01-01

    The covalent modification of nucleosomal histones has emerged as a major determinant of chromatin structure and gene activity. To understand the interplay between various histone modifications, including acetylation and methylation, we performed a genome-wide chromatin structure analysis in a higher eukaryote. We found a binary pattern of histone modifications among euchromatic genes, with active genes being hyperacetylated for H3 and H4 and hypermethylated at Lys 4 and Lys 79 of H3, and inactive genes being hypomethylated and deacetylated at the same residues. Furthermore, the degree of modification correlates with the level of transcription, and modifications are largely restricted to transcribed regions, suggesting that their regulation is tightly linked to polymerase activity. PMID:15175259

  17. Effect of MTHFR Gene Polymorphism Impact on Atherosclerosis via Genome-Wide Methylation

    PubMed Central

    Lin, Xuefeng; Zhang, Wei; Lu, Qun; Lei, Xinjun; Wang, Tingzhong; Han, Xuanmao; Ma, Aiqun

    2016-01-01

    Background Atherosclerosis seriously threats human health. Homocysteine is an independent risk factor closely related to DNA methylation. MTHFR C667T loci polymorphism is closely associated with homocysteine level. This study aimed to investigate the relationship among MTHFR C667T loci polymorphism, genome-wide methylation, and atherosclerosis. Material/Methods Blood sample was collected from 105 patients with coronary atherosclerosis and 105 healthy controls. Pyrosequencing methylation was used to detect LINE-1 methylation level. Polymerase chain reaction-restriction enzyme fragment length polymorphism (PCR-RFLP) was used to test MTHFR. Results LINE-1 methylation level in the patient group was significantly lower than in the controls (t=5.007, P<0.001). MTHFR C667T genotype distribution presented marked differences in the 2 groups. TT genotype carriers had significantly increased risk of atherosclerosis (OR=3.56, P=0.009). Three different genotypes of MTHFR C667T loci showed different LINE-1 methylation level between the 2 groups (P<0.01). LINE-1 methylation level in TT and CT genotype carriers was obviously lower than in CC genotype carriers (P<0.05). Conclusions MTHFR C667T loci polymorphism may affect atherosclerosis by regulating genome methylation level. PMID:26828698

  18. Joint Association of Genome-Wide Association Study–Identified Susceptibility Loci and Dietary Patterns in Risk of Renal Cell Carcinoma Among Non-Hispanic Whites

    PubMed Central

    Melkonian, Stephanie C.; Daniel, Carrie R.; Hildebrandt, Michelle A. T.; Tannir, Nizar M.; Ye, Yuanqing; Chow, Wong-Ho; Wood, Christopher G.; Wu, Xifeng

    2014-01-01

    Dietary factors may affect risk of renal cell carcinoma (RCC). In an ongoing case-control study of RCC initiated in Houston, Texas, in 2002, we identified 3 empirically derived dietary patterns: “fruits and vegetables,” “American/Western,” and “Tex-Mex.” Among 659 RCC cases and 699 controls, we evaluated associations of these dietary patterns with RCC risk and whether the associations varied by obesity status, smoking status, physical activity level, history of hypertension, and genetic variants previously identified via genome-wide association studies. Among persons in the highest categories of adherence versus the lowest, the “fruits and vegetables” dietary pattern was associated with an approximately 50% lower RCC risk (Ptrend < 0.001), while “American/Western” dietary pattern scores were positively associated with a 2-fold higher risk (Ptrend < 0.001). We observed synergistic interaction between the American/Western pattern and hypertension status: The odds ratio (highest tertile vs. lowest) among persons with hypertension was 2.23 (95% confidence interval: 1.43, 3.45), as compared with 1.76 (95% confidence interval: 1.16, 2.70) among persons without hypertension (additive Pinteraction = 0.01). A variant (rs718314) in the inositol 1,4,5-trisphosphate receptor, type 2 gene (ITPR2) was found to interact with the American/Western dietary pattern in relation to RCC risk (additive Pinteraction = 0.03). ITPR2 has been shown to affect nutrient metabolism and central obesity. Dietary patterns, genetic variants, and host characteristics may individually and jointly influence susceptibility to RCC. PMID:25053674

  19. Joint association of genome-wide association study-identified susceptibility loci and dietary patterns in risk of renal cell carcinoma among non-Hispanic whites.

    PubMed

    Melkonian, Stephanie C; Daniel, Carrie R; Hildebrandt, Michelle A T; Tannir, Nizar M; Ye, Yuanqing; Chow, Wong-Ho; Wood, Christopher G; Wu, Xifeng

    2014-09-01

    Dietary factors may affect risk of renal cell carcinoma (RCC). In an ongoing case-control study of RCC initiated in Houston, Texas, in 2002, we identified 3 empirically derived dietary patterns: "fruits and vegetables," "American/Western," and "Tex-Mex." Among 659 RCC cases and 699 controls, we evaluated associations of these dietary patterns with RCC risk and whether the associations varied by obesity status, smoking status, physical activity level, history of hypertension, and genetic variants previously identified via genome-wide association studies. Among persons in the highest categories of adherence versus the lowest, the "fruits and vegetables" dietary pattern was associated with an approximately 50% lower RCC risk (Ptrend < 0.001), while "American/Western" dietary pattern scores were positively associated with a 2-fold higher risk (Ptrend < 0.001). We observed synergistic interaction between the American/Western pattern and hypertension status: The odds ratio (highest tertile vs. lowest) among persons with hypertension was 2.23 (95% confidence interval: 1.43, 3.45), as compared with 1.76 (95% confidence interval: 1.16, 2.70) among persons without hypertension (additive Pinteraction = 0.01). A variant (rs718314) in the inositol 1,4,5-trisphosphate receptor, type 2 gene (ITPR2) was found to interact with the American/Western dietary pattern in relation to RCC risk (additive Pinteraction = 0.03). ITPR2 has been shown to affect nutrient metabolism and central obesity. Dietary patterns, genetic variants, and host characteristics may individually and jointly influence susceptibility to RCC. PMID:25053674

  20. Genome-Wide Identification and Expression Profile of Dof Transcription Factor Gene Family in Pepper (Capsicum annuum L.).

    PubMed

    Wu, Zhiming; Cheng, Jiaowen; Cui, Junjie; Xu, Xiaowan; Liang, Guansheng; Luo, Xirong; Chen, Xiaocui; Tang, Xiangqun; Hu, Kailin; Qin, Cheng

    2016-01-01

    Dof (DNA-binding One Zinc Finger) transcription factor family is unique to plants and has diverse roles associated with plant-specific phenomena, such as light, phytohormone and defense responses as well as seed development and germination. Although, genome-wide analysis of this family has been performed in many species, information regarding Dof genes in the pepper, Capsicum annuum L., is extremely limited. In this study, exhaustive searches of pepper genome revealed 33 potential CaDofs that were phylogenetically clustered into four subgroups. Twenty-nine of the 33 Dof genes could be mapped on 11 chromosomes, except for chromosome 7. The intron/exon organizations and conserved motif compositions of these genes were also analyzed. Additionally, phylogenetic analysis and classification of the Dof transcription factor family in eight plant species revealed that S. lycopersicum and C. annuum as well as O. sativa and S. bicolor Dof proteins may have evolved conservatively. Moreover, comprehensive expression analysis of CaDofs using a RNA-seq atlas and quantitative real-time polymerase chain reaction (qRT-PCR) revealed that these genes exhibit a variety of expression patterns. Most of the CaDofs were expressed in at least one of the tissues tested, whereas several genes were identified as being highly responsive to heat and salt stresses. Overall, this study describes the first genome-wide analysis of the pepper Dof family, whose genes exhibited different expression patterns in all primary fruit developmental stages and tissue types, as in response to abiotic stress. In particular, some Dof genes might be used as biomarkers for heat and salt stress. The results could expand our understanding of the roles of Dof genes in pepper.

  1. Genome-Wide Identification and Expression Profile of Dof Transcription Factor Gene Family in Pepper (Capsicum annuum L.)

    PubMed Central

    Wu, Zhiming; Cheng, Jiaowen; Cui, Junjie; Xu, Xiaowan; Liang, Guansheng; Luo, Xirong; Chen, Xiaocui; Tang, Xiangqun; Hu, Kailin; Qin, Cheng

    2016-01-01

    Dof (DNA-binding One Zinc Finger) transcription factor family is unique to plants and has diverse roles associated with plant-specific phenomena, such as light, phytohormone and defense responses as well as seed development and germination. Although, genome-wide analysis of this family has been performed in many species, information regarding Dof genes in the pepper, Capsicum annuum L., is extremely limited. In this study, exhaustive searches of pepper genome revealed 33 potential CaDofs that were phylogenetically clustered into four subgroups. Twenty-nine of the 33 Dof genes could be mapped on 11 chromosomes, except for chromosome 7. The intron/exon organizations and conserved motif compositions of these genes were also analyzed. Additionally, phylogenetic analysis and classification of the Dof transcription factor family in eight plant species revealed that S. lycopersicum and C. annuum as well as O. sativa and S. bicolor Dof proteins may have evolved conservatively. Moreover, comprehensive expression analysis of CaDofs using a RNA-seq atlas and quantitative real-time polymerase chain reaction (qRT-PCR) revealed that these genes exhibit a variety of expression patterns. Most of the CaDofs were expressed in at least one of the tissues tested, whereas several genes were identified as being highly responsive to heat and salt stresses. Overall, this study describes the first genome-wide analysis of the pepper Dof family, whose genes exhibited different expression patterns in all primary fruit developmental stages and tissue types, as in response to abiotic stress. In particular, some Dof genes might be used as biomarkers for heat and salt stress. The results could expand our understanding of the roles of Dof genes in pepper. PMID:27200047

  2. Genome-wide digital transcript analysis of putative fruitlet abscission related genes regulated by ethephon in litchi

    PubMed Central

    Li, Caiqin; Wang, Yan; Ying, Peiyuan; Ma, Wuqiang; Li, Jianguo

    2015-01-01

    The high level of physiological fruitlet abscission in litchi (Litchi chinensis Sonn.) causes severe yield loss. Cell separation occurs at the fruit abscission zone (FAZ) and can be triggered by ethylene. However, a deep knowledge of the molecular events occurring in the FAZ is still unknown. Here, genome-wide digital transcript abundance (DTA) analysis of putative fruit abscission related genes regulated by ethephon in litchi were studied. More than 81 million high quality reads from seven ethephon treated and untreated control libraries were obtained by high-throughput sequencing. Through DTA profile analysis in combination with Gene Ontology and KEGG pathway enrichment analyses, a total of 2730 statistically significant candidate genes were involved in the ethephon-promoted litchi fruitlet abscission. Of these, there were 1867 early-responsive genes whose expressions were up- or down-regulated from 0 to 1 d after treatment. The most affected genes included those related to ethylene biosynthesis and signaling, auxin transport and signaling, transcription factors (TFs), protein ubiquitination, ROS response, calcium signal transduction, and cell wall modification. These genes could be clustered into four groups and 13 subgroups according to their similar expression patterns. qRT-PCR displayed the expression pattern of 41 selected candidate genes, which proved the accuracy of our DTA data. Ethephon treatment significantly increased fruit abscission and ethylene production of fruitlet. The possible molecular events to control the ethephon-promoted litchi fruitlet abscission were prompted out. The increased ethylene evolution in fruitlet would suppress the synthesis and polar transport of auxin and trigger abscission signaling. To the best of our knowledge, it is the first time to monitor the gene expression profile occurring in the FAZ-enriched pedicel during litchi fruit abscission induced by ethephon on the genome-wide level. This study will contribute to a better

  3. Genome-Wide Identification and Expression Profile of Dof Transcription Factor Gene Family in Pepper (Capsicum annuum L.).

    PubMed

    Wu, Zhiming; Cheng, Jiaowen; Cui, Junjie; Xu, Xiaowan; Liang, Guansheng; Luo, Xirong; Chen, Xiaocui; Tang, Xiangqun; Hu, Kailin; Qin, Cheng

    2016-01-01

    Dof (DNA-binding One Zinc Finger) transcription factor family is unique to plants and has diverse roles associated with plant-specific phenomena, such as light, phytohormone and defense responses as well as seed development and germination. Although, genome-wide analysis of this family has been performed in many species, information regarding Dof genes in the pepper, Capsicum annuum L., is extremely limited. In this study, exhaustive searches of pepper genome revealed 33 potential CaDofs that were phylogenetically clustered into four subgroups. Twenty-nine of the 33 Dof genes could be mapped on 11 chromosomes, except for chromosome 7. The intron/exon organizations and conserved motif compositions of these genes were also analyzed. Additionally, phylogenetic analysis and classification of the Dof transcription factor family in eight plant species revealed that S. lycopersicum and C. annuum as well as O. sativa and S. bicolor Dof proteins may have evolved conservatively. Moreover, comprehensive expression analysis of CaDofs using a RNA-seq atlas and quantitative real-time polymerase chain reaction (qRT-PCR) revealed that these genes exhibit a variety of expression patterns. Most of the CaDofs were expressed in at least one of the tissues tested, whereas several genes were identified as being highly responsive to heat and salt stresses. Overall, this study describes the first genome-wide analysis of the pepper Dof family, whose genes exhibited different expression patterns in all primary fruit developmental stages and tissue types, as in response to abiotic stress. In particular, some Dof genes might be used as biomarkers for heat and salt stress. The results could expand our understanding of the roles of Dof genes in pepper. PMID:27200047

  4. Identification of Immune Related LRR-Containing Genes in Maize (Zea mays L.) by Genome-Wide Sequence Analysis

    PubMed Central

    Song, Wei; Wang, Baoqiang; Li, Xinghua; Wei, Jianfen; Chen, Ling; Zhang, Dongmin; Zhang, Wenying; Li, Ronggai

    2015-01-01

    A large number of immune receptors consist of nucleotide binding site-leucine rich repeat (NBS-LRR) proteins and leucine rich repeat-receptor-like kinases (LRR-RLK) that play a crucial role in plant disease resistance. Although many NBS-LRR genes have been previously identified in Zea mays, there are no reports on identifying NBS-LRR genes encoded in the N-terminal Toll/interleukin-1 receptor (TIR) motif and identifying genome-wide LRR-RLK genes. In the present study, 151 NBS-LRR genes and 226 LRR-RLK genes were identified after performing bioinformatics analysis of the entire maize genome. Of these identified genes, 64 NBS-LRR genes and four TIR-NBS-LRR genes were identified for the first time. The NBS-LRR genes are unevenly distributed on each chromosome with gene clusters located at the distal end of each chromosome, while LRR-RLK genes have a random chromosomal distribution with more paired genes. Additionally, six LRR-RLK/RLPs including FLS2, PSY1R, PSKR1, BIR1, SERK3, and Cf5 were characterized in Zea mays for the first time. Their predicted amino acid sequences have similar protein structures with their respective homologues in other plants, indicating that these maize LRR-RLK/RLPs have the same functions as their homologues act as immune receptors. The identified gene sequences would assist in the study of their functions in maize. PMID:26609518

  5. Functional Annotation of Rheumatoid Arthritis and Osteoarthritis Associated Genes by Integrative Genome-Wide Gene Expression Profiling Analysis

    PubMed Central

    Li, Zhan-Chun; Xiao, Jie; Peng, Jin-Liang; Chen, Jian-Wei; Ma, Tao; Cheng, Guang-Qi; Dong, Yu-Qi; Wang, Wei-li; Liu, Zu-De

    2014-01-01

    Background Rheumatoid arthritis (RA) and osteoarthritis (OA) are two major types of joint diseases that share multiple common symptoms. However, their pathological mechanism remains largely unknown. The aim of our study is to identify RA and OA related-genes and gain an insight into the underlying genetic basis of these diseases. Methods We collected 11 whole genome-wide expression profiling datasets from RA and OA cohorts and performed a meta-analysis to comprehensively investigate their expression signatures. This method can avoid some pitfalls of single dataset analyses. Results and Conclusion We found that several biological pathways (i.e., the immunity, inflammation and apoptosis related pathways) are commonly involved in the development of both RA and OA. Whereas several other pathways (i.e., vasopressin-related pathway, regulation of autophagy, endocytosis, calcium transport and endoplasmic reticulum stress related pathways) present significant difference between RA and OA. This study provides novel insights into the molecular mechanisms underlying this disease, thereby aiding the diagnosis and treatment of the disease. PMID:24551036

  6. Racial differences in genome-wide methylation profiling and gene expression in breast tissues from healthy women.

    PubMed

    Song, Min-Ae; Brasky, Theodore M; Marian, Catalin; Weng, Daniel Y; Taslim, Cenny; Dumitrescu, Ramona G; Llanos, Adana A; Freudenheim, Jo L; Shields, Peter G

    2015-01-01

    Breast cancer is more common in European Americans (EAs) than in African Americans (AAs) but mortality from breast cancer is higher among AAs. While there are racial differences in DNA methylation and gene expression in breast tumors, little is known whether such racial differences exist in breast tissues of healthy women. Genome-wide DNA methylation and gene expression profiling was performed in histologically normal breast tissues of healthy women. Linear regression models were used to identify differentially-methylated CpG sites (CpGs) between EAs (n = 61) and AAs (n = 22). Correlations for methylation and expression were assessed. Biological functions of the differentially-methylated genes were assigned using the Ingenuity Pathway Analysis. Among 485 differentially-methylated CpGs by race, 203 were hypermethylated in EAs, and 282 were hypermethylated in AAs. Promoter-related differentially-methylated CpGs were more frequently hypermethylated in EAs (52%) than AAs (27%) while gene body and intergenic CpGs were more frequently hypermethylated in AAs. The differentially-methylated CpGs were enriched for cancer-associated genes with roles in cell death and survival, cellular development, and cell-to-cell signaling. In a separate analysis for correlation in EAs and AAs, different patterns of correlation were found between EAs and AAs. The correlated genes showed different biological networks between EAs and AAs; networks were connected by Ubiquitin C. To our knowledge, this is the first comprehensive genome-wide study to identify differences in methylation and gene expression between EAs and AAs in breast tissues from healthy women. These findings may provide further insights regarding the contribution of epigenetic differences to racial disparities in breast cancer. PMID:26680018

  7. Racial differences in genome-wide methylation profiling and gene expression in breast tissues from healthy women.

    PubMed

    Song, Min-Ae; Brasky, Theodore M; Marian, Catalin; Weng, Daniel Y; Taslim, Cenny; Dumitrescu, Ramona G; Llanos, Adana A; Freudenheim, Jo L; Shields, Peter G

    2015-01-01

    Breast cancer is more common in European Americans (EAs) than in African Americans (AAs) but mortality from breast cancer is higher among AAs. While there are racial differences in DNA methylation and gene expression in breast tumors, little is known whether such racial differences exist in breast tissues of healthy women. Genome-wide DNA methylation and gene expression profiling was performed in histologically normal breast tissues of healthy women. Linear regression models were used to identify differentially-methylated CpG sites (CpGs) between EAs (n = 61) and AAs (n = 22). Correlations for methylation and expression were assessed. Biological functions of the differentially-methylated genes were assigned using the Ingenuity Pathway Analysis. Among 485 differentially-methylated CpGs by race, 203 were hypermethylated in EAs, and 282 were hypermethylated in AAs. Promoter-related differentially-methylated CpGs were more frequently hypermethylated in EAs (52%) than AAs (27%) while gene body and intergenic CpGs were more frequently hypermethylated in AAs. The differentially-methylated CpGs were enriched for cancer-associated genes with roles in cell death and survival, cellular development, and cell-to-cell signaling. In a separate analysis for correlation in EAs and AAs, different patterns of correlation were found between EAs and AAs. The correlated genes showed different biological networks between EAs and AAs; networks were connected by Ubiquitin C. To our knowledge, this is the first comprehensive genome-wide study to identify differences in methylation and gene expression between EAs and AAs in breast tissues from healthy women. These findings may provide further insights regarding the contribution of epigenetic differences to racial disparities in breast cancer.

  8. Accounting for Population Structure in Gene-by-Environment Interactions in Genome-Wide Association Studies Using Mixed Models.

    PubMed

    Sul, Jae Hoon; Bilow, Michael; Yang, Wen-Yun; Kostem, Emrah; Furlotte, Nick; He, Dan; Eskin, Eleazar

    2016-03-01

    Although genome-wide association studies (GWASs) have discovered numerous novel genetic variants associated with many complex traits and diseases, those genetic variants typically explain only a small fraction of phenotypic variance. Factors that account for phenotypic variance include environmental factors and gene-by-environment interactions (GEIs). Recently, several studies have conducted genome-wide gene-by-environment association analyses and demonstrated important roles of GEIs in complex traits. One of the main challenges in these association studies is to control effects of population structure that may cause spurious associations. Many studies have analyzed how population structure influences statistics of genetic variants and developed several statistical approaches to correct for population structure. However, the impact of population structure on GEI statistics in GWASs has not been extensively studied and nor have there been methods designed to correct for population structure on GEI statistics. In this paper, we show both analytically and empirically that population structure may cause spurious GEIs and use both simulation and two GWAS datasets to support our finding. We propose a statistical approach based on mixed models to account for population structure on GEI statistics. We find that our approach effectively controls population structure on statistics for GEIs as well as for genetic variants. PMID:26943367

  9. The Effects of Sequence Variation on Genome-wide NRF2 Binding—New Target Genes and Regulatory SNPs

    PubMed Central

    Kuosmanen, Suvi M.; Viitala, Sari; Laitinen, Tuomo; Peräkylä, Mikael; Pölönen, Petri; Kansanen, Emilia; Leinonen, Hanna; Raju, Suresh; Wienecke-Baldacchino, Anke; Närvänen, Ale; Poso, Antti; Heinäniemi, Merja; Heikkinen, Sami; Levonen, Anna-Liisa

    2016-01-01

    Transcription factor binding specificity is crucial for proper target gene regulation. Motif discovery algorithms identify the main features of the binding patterns, but the accuracy on the lower affinity sites is often poor. Nuclear factor E2-related factor 2 (NRF2) is a ubiquitous redox-activated transcription factor having a key protective role against endogenous and exogenous oxidant and electrophile stress. Herein, we decipher the effects of sequence variation on the DNA binding sequence of NRF2, in order to identify both genome-wide binding sites for NRF2 and disease-associated regulatory SNPs (rSNPs) with drastic effects on NRF2 binding. Interactions between NRF2 and DNA were studied using molecular modelling, and NRF2 chromatin immunoprecipitation-sequence datasets together with protein binding microarray measurements were utilized to study binding sequence variation in detail. The binding model thus generated was used to identify genome-wide binding sites for NRF2, and genomic binding sites with rSNPs that have strong effects on NRF2 binding and reside on active regulatory elements in human cells. As a proof of concept, miR-126–3p and -5p were identified as NRF2 target microRNAs, and a rSNP (rs113067944) residing on NRF2 target gene (Ferritin, light polypeptide, FTL) promoter was experimentally verified to decrease NRF2 binding and result in decreased transcriptional activity. PMID:26826707

  10. Bivariate genome-wide association study suggests that the DARC gene influences lean body mass and age at menarche.

    PubMed

    Hai, Rong; Zhang, Lei; Pei, Yufang; Zhao, Lanjuan; Ran, Shu; Han, Yingying; Zhu, Xuezhen; Shen, Hui; Tian, Qing; Deng, Hongwen

    2012-06-01

    Lean body mass (LBM) and age at menarche (AAM) are two important complex traits for human health. The aim of this study was to identify pleiotropic genes for both traits using a powerful bivariate genome-wide association study (GWAS). Two studies, a discovery study and a replication study, were performed. In the discovery study, 909622 single nucleotide polymorphisms (SNPs) were genotyped in 801 unrelated female Han Chinese subjects using the Affymetrix human genome-wide SNP array 6.0 platform. Then, a bivariate GWAS was performed to identify the SNPs that may be important for LBM and AAM. In the replication study, significant findings from the discovery study were validated in 1692 unrelated Caucasian female subjects. One SNP rs3027009 that was bivariately associated with left arm lean mass and AAM in the discovery samples (P=7.26×10(-6)) and in the replication samples (P=0.005) was identified. The SNP is located at the upstream of DARC (Duffy antigen receptor for chemokines) gene, suggesting that DARC may play an important role in regulating the metabolisms of both LBM and AAM.

  11. Accounting for Population Structure in Gene-by-Environment Interactions in Genome-Wide Association Studies Using Mixed Models.

    PubMed

    Sul, Jae Hoon; Bilow, Michael; Yang, Wen-Yun; Kostem, Emrah; Furlotte, Nick; He, Dan; Eskin, Eleazar

    2016-03-01

    Although genome-wide association studies (GWASs) have discovered numerous novel genetic variants associated with many complex traits and diseases, those genetic variants typically explain only a small fraction of phenotypic variance. Factors that account for phenotypic variance include environmental factors and gene-by-environment interactions (GEIs). Recently, several studies have conducted genome-wide gene-by-environment association analyses and demonstrated important roles of GEIs in complex traits. One of the main challenges in these association studies is to control effects of population structure that may cause spurious associations. Many studies have analyzed how population structure influences statistics of genetic variants and developed several statistical approaches to correct for population structure. However, the impact of population structure on GEI statistics in GWASs has not been extensively studied and nor have there been methods designed to correct for population structure on GEI statistics. In this paper, we show both analytically and empirically that population structure may cause spurious GEIs and use both simulation and two GWAS datasets to support our finding. We propose a statistical approach based on mixed models to account for population structure on GEI statistics. We find that our approach effectively controls population structure on statistics for GEIs as well as for genetic variants.

  12. Accounting for Population Structure in Gene-by-Environment Interactions in Genome-Wide Association Studies Using Mixed Models

    PubMed Central

    Yang, Wen-Yun; Kostem, Emrah; Furlotte, Nick; He, Dan; Eskin, Eleazar

    2016-01-01

    Although genome-wide association studies (GWASs) have discovered numerous novel genetic variants associated with many complex traits and diseases, those genetic variants typically explain only a small fraction of phenotypic variance. Factors that account for phenotypic variance include environmental factors and gene-by-environment interactions (GEIs). Recently, several studies have conducted genome-wide gene-by-environment association analyses and demonstrated important roles of GEIs in complex traits. One of the main challenges in these association studies is to control effects of population structure that may cause spurious associations. Many studies have analyzed how population structure influences statistics of genetic variants and developed several statistical approaches to correct for population structure. However, the impact of population structure on GEI statistics in GWASs has not been extensively studied and nor have there been methods designed to correct for population structure on GEI statistics. In this paper, we show both analytically and empirically that population structure may cause spurious GEIs and use both simulation and two GWAS datasets to support our finding. We propose a statistical approach based on mixed models to account for population structure on GEI statistics. We find that our approach effectively controls population structure on statistics for GEIs as well as for genetic variants. PMID:26943367

  13. Genome-wide analysis of the SBP-box gene family in Chinese cabbage (Brassica rapa subsp. pekinensis).

    PubMed

    Tan, Hua-Wei; Song, Xiao-Ming; Duan, Wei-Ke; Wang, Yan; Hou, Xi-Lin

    2015-11-01

    The SQUAMOSA PROMOTER BINDING PROTEIN (SBP)-box gene family contains highly conserved plant-specific transcription factors that play an important role in plant development, especially in flowering. Chinese cabbage (Brassica rapa subsp. pekinensis) is a leafy vegetable grown worldwide and is used as a model crop for research in genome duplication. The present study aimed to characterize the SBP-box transcription factor genes in Chinese cabbage. Twenty-nine SBP-box genes were identified in the Chinese cabbage genome and classified into six groups. We identified 23 orthologous and 5 co-orthologous SBP-box gene pairs between Chinese cabbage and Arabidopsis. An interaction network among these genes was constructed. Sixteen SBP-box genes were expressed more abundantly in flowers than in other tissues, suggesting their involvement in flowering. We show that the MiR156/157 family members may regulate the coding regions or 3'-UTR regions of Chinese cabbage SBP-box genes. As SBP-box genes were found to potentially participate in some plant development pathways, quantitative real-time PCR analysis was performed and showed that Chinese cabbage SBP-box genes were also sensitive to the exogenous hormones methyl jasmonic acid and salicylic acid. The SBP-box genes have undergone gene duplication and loss, evolving a more refined regulation for diverse stimulation in plant tissues. Our comprehensive genome-wide analysis provides insights into the SBP-box gene family of Chinese cabbage.

  14. Genome-wide analysis of the SBP-box gene family in Chinese cabbage (Brassica rapa subsp. pekinensis).

    PubMed

    Tan, Hua-Wei; Song, Xiao-Ming; Duan, Wei-Ke; Wang, Yan; Hou, Xi-Lin

    2015-11-01

    The SQUAMOSA PROMOTER BINDING PROTEIN (SBP)-box gene family contains highly conserved plant-specific transcription factors that play an important role in plant development, especially in flowering. Chinese cabbage (Brassica rapa subsp. pekinensis) is a leafy vegetable grown worldwide and is used as a model crop for research in genome duplication. The present study aimed to characterize the SBP-box transcription factor genes in Chinese cabbage. Twenty-nine SBP-box genes were identified in the Chinese cabbage genome and classified into six groups. We identified 23 orthologous and 5 co-orthologous SBP-box gene pairs between Chinese cabbage and Arabidopsis. An interaction network among these genes was constructed. Sixteen SBP-box genes were expressed more abundantly in flowers than in other tissues, suggesting their involvement in flowering. We show that the MiR156/157 family members may regulate the coding regions or 3'-UTR regions of Chinese cabbage SBP-box genes. As SBP-box genes were found to potentially participate in some plant development pathways, quantitative real-time PCR analysis was performed and showed that Chinese cabbage SBP-box genes were also sensitive to the exogenous hormones methyl jasmonic acid and salicylic acid. The SBP-box genes have undergone gene duplication and loss, evolving a more refined regulation for diverse stimulation in plant tissues. Our comprehensive genome-wide analysis provides insights into the SBP-box gene family of Chinese cabbage. PMID:26599708

  15. Comparison of genome-wide and gene-specific DNA methylation between ART and naturally conceived pregnancies.

    PubMed

    Melamed, Nir; Choufani, Sanaa; Wilkins-Haug, Louise E; Koren, Gideon; Weksberg, Rosanna

    2015-01-01

    Data linking assisted reproductive technologies (ART) with aberrant DNA methylation is limited and inconclusive. In addition, most studies to date have analyzed only a small number of CpG sites and focused on methylation changes in placentas, while data on cord blood are scarce. Our aim was to compare DNA methylation in cord blood samples from ART (N = 10) and control pregnancies (N = 8) using a genome-wide approach with the Illumina® Infinium Human Methylation27 array, which interrogates 27,578 CpG sites. A total of 733 (2.7%) of the CpG sites were significantly differentially methylated between the 2 groups (P < 0.05), with an overall relative hypomethylation in the ART group (P < 0.001). Differences in DNA methylation were more pronounced for CpG sites in certain types of genomic locations and were related to baseline methylation levels and distance from CpG islands and transcription start sites. ART was associated with significantly higher variation in DNA methylation, suggesting that differences in DNA methylation between cases and controls may result from stochastic (or random) genome-wide changes in DNA methylation in ART pregnancies. We identified 24 candidate genes with 2 or more CpG sites that were significantly different between the IVF and control groups. The current study provides support for the hypothesis that ART or associated subfertility may be associated with genome-wide changes in DNA methylation, and these changes appear to be, at least in part, due to epigenetic instability in ART pregnancies. Further studies are required in order to determine the extent to which such ART-related epigenetic instability may have phenotypic consequences.

  16. Family-based genome-wide copy number scan identifies five new genes of dyslexia involved in dendritic spinal plasticity.

    PubMed

    Veerappa, Avinash M; Saldanha, Marita; Padakannaya, Prakash; Ramachandra, Nallur B

    2013-08-01

    Genome-wide screening for copy number variations (CNVs) in ten Indian dyslexic families revealed the presence of five de novo CNVs in regions harboring GABARAP, NEGR1, ACCN1, DCDC5, and one in already known candidate gene CNTNAP2. These genes are located on regions of chromosomes 17p13.1, 1p31.1, 17q11.21, 11p14.1 and 7q35, respectively, and are implicated in learning, cognition and memory processes through dendritic spinal plasticity, though not formally associated with dyslexia. Molecular network analysis of these and other dyslexia-related module genes suggests them to be associated with synaptic transmission, axon guidance and cell adhesion. Thus, we suggest that dyslexia may also be caused by neuronal disconnection in addition to the earlier view that it is due to neuronal migrational disorder.

  17. A genome-wide survey of the secondary metabolite biosynthesis genes in the wheat pathogen Parastagonospora nodorum

    PubMed Central

    Chooi, Yit-Heng; Muria-Gonzalez, Mariano Jordi; Solomon, Peter S.

    2014-01-01

    The model pathogen Parastagonospora nodorum is a necrotroph and the causal agent of the wheat disease Septoria nodorum blotch (SNB). The sequenced P. nodorum genome has revealed that the fungus harbours a large number of secondary metabolite genes. Secondary metabolites are known to play important roles in the virulence of plant pathogens, but limited knowledge is available about the SM repertoire of this wheat pathogen. Here, we review the secondary metabolites that have been isolated from P. nodorum and related species of the same genus and provide an in-depth genome-wide overview of the secondary metabolite gene clusters encoded in the P. nodorum genome. The secondary metabolite gene survey reveals that P. nodorum is capable of producing a diverse range of small molecules and exciting prospects exist for discovery of novel virulence factors and bioactive molecules. PMID:25379341

  18. Biological interpretation of genome-wide association studies using predicted gene functions.

    PubMed

    Pers, Tune H; Karjalainen, Juha M; Chan, Yingleong; Westra, Harm-Jan; Wood, Andrew R; Yang, Jian; Lui, Julian C; Vedantam, Sailaja; Gustafsson, Stefan; Esko, Tonu; Frayling, Tim; Speliotes, Elizabeth K; Boehnke, Michael; Raychaudhuri, Soumya; Fehrmann, Rudolf S N; Hirschhorn, Joel N; Franke, Lude

    2015-01-19

    The main challenge for gaining biological insights from genetic associations is identifying which genes and pathways explain the associations. Here we present DEPICT, an integrative tool that employs predicted gene functions to systematically prioritize the most likely causal genes at associated loci, highlight enriched pathways and identify tissues/cell types where genes from associated loci are highly expressed. DEPICT is not limited to genes with established functions and prioritizes relevant gene sets for many phenotypes.

  19. Genome-wide analysis implicates microRNAs and their target genes in the development of bipolar disorder.

    PubMed

    Forstner, A J; Hofmann, A; Maaser, A; Sumer, S; Khudayberdiev, S; Mühleisen, T W; Leber, M; Schulze, T G; Strohmaier, J; Degenhardt, F; Treutlein, J; Mattheisen, M; Schumacher, J; Breuer, R; Meier, S; Herms, S; Hoffmann, P; Lacour, A; Witt, S H; Reif, A; Müller-Myhsok, B; Lucae, S; Maier, W; Schwarz, M; Vedder, H; Kammerer-Ciernioch, J; Pfennig, A; Bauer, M; Hautzinger, M; Moebus, S; Priebe, L; Sivalingam, S; Verhaert, A; Schulz, H; Czerski, P M; Hauser, J; Lissowska, J; Szeszenia-Dabrowska, N; Brennan, P; McKay, J D; Wright, A; Mitchell, P B; Fullerton, J M; Schofield, P R; Montgomery, G W; Medland, S E; Gordon, S D; Martin, N G; Krasnov, V; Chuchalin, A; Babadjanova, G; Pantelejeva, G; Abramova, L I; Tiganov, A S; Polonikov, A; Khusnutdinova, E; Alda, M; Cruceanu, C; Rouleau, G A; Turecki, G; Laprise, C; Rivas, F; Mayoral, F; Kogevinas, M; Grigoroiu-Serbanescu, M; Propping, P; Becker, T; Rietschel, M; Cichon, S; Schratt, G; Nöthen, M M

    2015-11-10

    Bipolar disorder (BD) is a severe and highly heritable neuropsychiatric disorder with a lifetime prevalence of 1%. Molecular genetic studies have identified the first BD susceptibility genes. However, the disease pathways remain largely unknown. Accumulating evidence suggests that microRNAs, a class of small noncoding RNAs, contribute to basic mechanisms underlying brain development and plasticity, suggesting their possible involvement in the pathogenesis of several psychiatric disorders, including BD. In the present study, gene-based analyses were performed for all known autosomal microRNAs using the largest genome-wide association data set of BD to date (9747 patients and 14 278 controls). Associated and brain-expressed microRNAs were then investigated in target gene and pathway analyses. Functional analyses of miR-499 and miR-708 were performed in rat hippocampal neurons. Ninety-eight of the six hundred nine investigated microRNAs showed nominally significant P-values, suggesting that BD-associated microRNAs might be enriched within known microRNA loci. After correction for multiple testing, nine microRNAs showed a significant association with BD. The most promising were miR-499, miR-708 and miR-1908. Target gene and pathway analyses revealed 18 significant canonical pathways, including brain development and neuron projection. For miR-499, four Bonferroni-corrected significant target genes were identified, including the genome-wide risk gene for psychiatric disorder CACNB2. First results of functional analyses in rat hippocampal neurons neither revealed nor excluded a major contribution of miR-499 or miR-708 to dendritic spine morphogenesis. The present results suggest that research is warranted to elucidate the precise involvement of microRNAs and their downstream pathways in BD.

  20. Genome-wide analysis implicates microRNAs and their target genes in the development of bipolar disorder

    PubMed Central

    Forstner, A J; Hofmann, A; Maaser, A; Sumer, S; Khudayberdiev, S; Mühleisen, T W; Leber, M; Schulze, T G; Strohmaier, J; Degenhardt, F; Treutlein, J; Mattheisen, M; Schumacher, J; Breuer, R; Meier, S; Herms, S; Hoffmann, P; Lacour, A; Witt, S H; Reif, A; Müller-Myhsok, B; Lucae, S; Maier, W; Schwarz, M; Vedder, H; Kammerer-Ciernioch, J; Pfennig, A; Bauer, M; Hautzinger, M; Moebus, S; Priebe, L; Sivalingam, S; Verhaert, A; Schulz, H; Czerski, P M; Hauser, J; Lissowska, J; Szeszenia-Dabrowska, N; Brennan, P; McKay, J D; Wright, A; Mitchell, P B; Fullerton, J M; Schofield, P R; Montgomery, G W; Medland, S E; Gordon, S D; Martin, N G; Krasnov, V; Chuchalin, A; Babadjanova, G; Pantelejeva, G; Abramova, L I; Tiganov, A S; Polonikov, A; Khusnutdinova, E; Alda, M; Cruceanu, C; Rouleau, G A; Turecki, G; Laprise, C; Rivas, F; Mayoral, F; Kogevinas, M; Grigoroiu-Serbanescu, M; Propping, P; Becker, T; Rietschel, M; Cichon, S; Schratt, G; Nöthen, M M

    2015-01-01

    Bipolar disorder (BD) is a severe and highly heritable neuropsychiatric disorder with a lifetime prevalence of 1%. Molecular genetic studies have identified the first BD susceptibility genes. However, the disease pathways remain largely unknown. Accumulating evidence suggests that microRNAs, a class of small noncoding RNAs, contribute to basic mechanisms underlying brain development and plasticity, suggesting their possible involvement in the pathogenesis of several psychiatric disorders, including BD. In the present study, gene-based analyses were performed for all known autosomal microRNAs using the largest genome-wide association data set of BD to date (9747 patients and 14 278 controls). Associated and brain-expressed microRNAs were then investigated in target gene and pathway analyses. Functional analyses of miR-499 and miR-708 were performed in rat hippocampal neurons. Ninety-eight of the six hundred nine investigated microRNAs showed nominally significant P-values, suggesting that BD-associated microRNAs might be enriched within known microRNA loci. After correction for multiple testing, nine microRNAs showed a significant association with BD. The most promising were miR-499, miR-708 and miR-1908. Target gene and pathway analyses revealed 18 significant canonical pathways, including brain development and neuron projection. For miR-499, four Bonferroni-corrected significant target genes were identified, including the genome-wide risk gene for psychiatric disorder CACNB2. First results of functional analyses in rat hippocampal neurons neither revealed nor excluded a major contribution of miR-499 or miR-708 to dendritic spine morphogenesis. The present results suggest that research is warranted to elucidate the precise involvement of microRNAs and their downstream pathways in BD. PMID:26556287

  1. Dorothy Hodgkin Lecture 2014. Understanding genes identified by genome-wide association studies for type 2 diabetes.

    PubMed

    Rutter, G A

    2014-12-01

    Whilst the heritable nature of Type 2 diabetes has been recognized for many years, only in the past two decades have linkage analyses in families and genome-wide association studies in large populations begun to reveal the genetic landscape of the disease in detail. Whilst the former have provided a powerful means of identifying the genes responsible for monogenic forms of the disease, the latter highlight relatively large genomic regions. These often harbour multiple genes, whose relative contribution to exaggerated disease risk is uncertain. In the present study, the approaches that have been used to dissect the role of just a few (TCF7L2, SLC30A8, ADCY5, MTNR1B and CDKAL1) of the ~ 500 genes identified at dozens of implicated loci are described. These are usually selected based on the strength of their effect on disease risk, and predictions as to their likely biological role. Direct determination of the effects of identified polymorphisms on gene expression in disease-relevant tissues, notably the pancreatic islet, are then performed to identify genes whose expression is affected by a particular polymorphism. Subsequent functional analyses then involve perturbing gene expression in vitro in β-cell lines or isolated islets and in vivo in animal models. Although the majority of polymorphisms affect insulin production rather than action, and mainly affect the β cell, effects via other tissues may also contribute, requiring careful consideration in the design and interpretation of experiments in model systems. These considerations illustrate the scale of the task needed to exploit genome-wide association study data for the development of new therapeutic strategies.

  2. Placental Genome and Maternal-Placental Genetic Interactions: A Genome-Wide and Candidate Gene Association Study of Placental Abruption

    PubMed Central

    Denis, Marie; Enquobahrie, Daniel A.; Tadesse, Mahlet G.; Gelaye, Bizu; Sanchez, Sixto E.; Salazar, Manuel; Ananth, Cande V.; Williams, Michelle A.

    2014-01-01

    While available evidence supports the role of genetics in the pathogenesis of placental abruption (PA), PA-related placental genome variations and maternal-placental genetic interactions have not been investigated. Maternal blood and placental samples collected from participants in the Peruvian Abruptio Placentae Epidemiology study were genotyped using Illumina’s Cardio-Metabochip platform. We examined 118,782 genome-wide SNPs and 333 SNPs in 32 candidate genes from mitochondrial biogenesis and oxidative phosphorylation pathways in placental DNA from 280 PA cases and 244 controls. We assessed maternal-placental interactions in the candidate gene SNPS and two imprinted regions (IGF2/H19 and C19MC). Univariate and penalized logistic regression models were fit to estimate odds ratios. We examined the combined effect of multiple SNPs on PA risk using weighted genetic risk scores (WGRS) with repeated ten-fold cross-validations. A multinomial model was used to investigate maternal-placental genetic interactions. In placental genome-wide and candidate gene analyses, no SNP was significant after false discovery rate correction. The top genome-wide association study (GWAS) hits were rs544201, rs1484464 (CTNNA2), rs4149570 (TNFRSF1A) and rs13055470 (ZNRF3) (p-values: 1.11e-05 to 3.54e-05). The top 200 SNPs of the GWAS overrepresented genes involved in cell cycle, growth and proliferation. The top candidate gene hits were rs16949118 (COX10) and rs7609948 (THRB) (p-values: 6.00e-03 and 8.19e-03). Participants in the highest quartile of WGRS based on cross-validations using SNPs selected from the GWAS and candidate gene analyses had a 8.40-fold (95% CI: 5.8–12.56) and a 4.46-fold (95% CI: 2.94–6.72) higher odds of PA compared to participants in the lowest quartile. We found maternal-placental genetic interactions on PA risk for two SNPs in PPARG (chr3∶12313450 and chr3∶12412978) and maternal imprinting effects for multiple SNPs in the C19MC and IGF2/H19 regions

  3. Genome-wide identification and characterization of aquaporin genes (AQPs) in Chinese cabbage (Brassica rapa ssp. pekinensis).

    PubMed

    Tao, Peng; Zhong, Xinmin; Li, Biyuan; Wang, Wuhong; Yue, Zhichen; Lei, Juanli; Guo, Weiling; Huang, Xiaoyun

    2014-12-01

    Aquaporins (AQPs) are members of a superfamily of integral membrane proteins and play a significant role in the transportation of small molecules across membranes. However, currently little is known about the AQP genes in Chinese cabbage (Brassica rapa ssp. pekinensis). In this study, a genome-wide analysis was carried out to identify the AQP genes in Chinese cabbage. In total, 53 non-redundant AQP genes were identified that were located on all of the 10 chromosomes. The number of AQP genes in Chinese cabbage was greater than in Arabidopsis. They were classified into four subfamilies, including PIP, TIP, NIP, and SIP. Thirty-three groups of AQP orthologous genes were identified between Chinese cabbage and Arabidopsis, but orthologs corresponding to AtNIP1;1 and AtPIP2;8 were not detected. Seventeen groups of paralogous genes were identified in Chinese cabbage. Three-dimensional models of the AQPs of Chinese cabbage were constructed using Phyre2, and ar/R selectivity filters were analyzed comparatively between Chinese cabbage and Arabidopsis. Generally, gene structure was conserved within each subfamily, especially in the SIP subfamily. Intron loss events have occurred during the evolution of the PIP, TIP, and NIP subfamilies. The expression of AQP genes in Chinese cabbage was analyzed in different organs. Most AQP genes were downregulated in response to salt stress. This work shows that the AQP genes of Chinese cabbage have undergone triplication and subsequent biased gene loss.

  4. Genome-wide identification and characterization of aquaporin genes (AQPs) in Chinese cabbage (Brassica rapa ssp. pekinensis).

    PubMed

    Tao, Peng; Zhong, Xinmin; Li, Biyuan; Wang, Wuhong; Yue, Zhichen; Lei, Juanli; Guo, Weiling; Huang, Xiaoyun

    2014-12-01

    Aquaporins (AQPs) are members of a superfamily of integral membrane proteins and play a significant role in the transportation of small molecules across membranes. However, currently little is known about the AQP genes in Chinese cabbage (Brassica rapa ssp. pekinensis). In this study, a genome-wide analysis was carried out to identify the AQP genes in Chinese cabbage. In total, 53 non-redundant AQP genes were identified that were located on all of the 10 chromosomes. The number of AQP genes in Chinese cabbage was greater than in Arabidopsis. They were classified into four subfamilies, including PIP, TIP, NIP, and SIP. Thirty-three groups of AQP orthologous genes were identified between Chinese cabbage and Arabidopsis, but orthologs corresponding to AtNIP1;1 and AtPIP2;8 were not detected. Seventeen groups of paralogous genes were identified in Chinese cabbage. Three-dimensional models of the AQPs of Chinese cabbage were constructed using Phyre2, and ar/R selectivity filters were analyzed comparatively between Chinese cabbage and Arabidopsis. Generally, gene structure was conserved within each subfamily, especially in the SIP subfamily. Intron loss events have occurred during the evolution of the PIP, TIP, and NIP subfamilies. The expression of AQP genes in Chinese cabbage was analyzed in different organs. Most AQP genes were downregulated in response to salt stress. This work shows that the AQP genes of Chinese cabbage have undergone triplication and subsequent biased gene loss. PMID:24972664

  5. Genome-wide identification of NBS resistance genes in Populus trichocarpa.

    PubMed

    Kohler, Annegret; Rinaldi, Cécile; Duplessis, Sébastien; Baucher, Marie; Geelen, Danny; Duchaussoy, Frédéric; Meyers, Blake C; Boerjan, Wout; Martin, Francis

    2008-04-01

    As the largest class of disease resistance R genes, the genes encoding nucleotide binding site and leucine-rich repeat proteins ("NBS-LRR genes") play a critical role in defending plants from a multitude of pathogens and pests. The diversity of NBS-LRR genes was examined in the Populus trichocarpa draft genome sequence. The NBS class of genes in this perennial tree is large and diverse, comprised of approximately 400 genes, at least twice the complement of Arabidopsis. The NBS family can be divided into multiple subfamilies with distinct domain organizations. It includes 119 Coiled-Coil-NBS-LRR genes, 64 TIR-NBS-LRR genes, 34 BED-finger-NBS-LRR, and both truncated and unusual NBS- and NBS-LRR-containing genes. The transcripts of only 34 NBS-LRR genes were detected in rust-infected and non-infected leaves using a whole-genome oligoarray. None showed an altered expression two days post inoculation. PMID:18247136

  6. Identification of a gene module associated with BMD through the integration of network analysis and genome-wide association data.

    PubMed

    Farber, Charles R

    2010-11-01

    Bone mineral density (BMD) is influenced by a complex network of gene interactions; therefore, elucidating the relationships between genes and how those genes, in turn, influence BMD is critical for developing a comprehensive understanding of osteoporosis. To investigate the role of transcriptional networks in the regulation of BMD, we performed a weighted gene coexpression network analysis (WGCNA) using microarray expression data on monocytes from young individuals with low or high BMD. WGCNA groups genes into modules based on patterns of gene coexpression. and our analysis identified 11 gene modules. We observed that the overall expression of one module (referred to as module 9) was significantly higher in the low-BMD group (p = .03). Module 9 was highly enriched for genes belonging to the immune system-related gene ontology (GO) category "response to virus" (p = 7.6 × 10(-11)). Using publically available genome-wide association study data, we independently validated the importance of module 9 by demonstrating that highly connected module 9 hubs were more likely, relative to less highly connected genes, to be genetically associated with BMD. This study highlights the advantages of systems-level analyses to uncover coexpression modules associated with bone mass and suggests that particular monocyte expression patterns may mediate differences in BMD.

  7. A genome-wide expression profile of salt-responsive genes in the apple rootstock Malus zumi.

    PubMed

    Li, Qingtian; Liu, Jia; Tan, Dunxian; Allan, Andrew C; Jiang, Yuzhuang; Xu, Xuefeng; Han, Zhenhai; Kong, Jin

    2013-10-18

    In some areas of cultivation, a lack of salt tolerance severely affects plant productivity. Apple, Malus x domestica Borkh., is sensitive to salt, and, as a perennial woody plant the mechanism of salt stress adaption will be different from that of annual herbal model plants, such as Arabidopsis. Malus zumi is a salt tolerant apple rootstock, which survives high salinity (up to 0.6% NaCl). To examine the mechanism underlying this tolerance, a genome-wide expression analysis was performed, using a cDNA library constructed from salt-treated seedlings of Malus zumi. A total of 15,000 cDNA clones were selected for microarray analysis. In total a group of 576 cDNAs, of which expression changed more than four-fold, were sequenced and 18 genes were selected to verify their expression pattern under salt stress by semi-quantitative RT-PCR. Our genome-wide expression analysis resulted in the isolation of 50 novel Malus genes and the elucidation of a new apple-specific mechanism of salt tolerance, including the stabilization of photosynthesis under stress, involvement of phenolic compounds, and sorbitol in ROS scavenging and osmoprotection. The promoter regions of 111 genes were analyzed by PlantCARE, suggesting an intensive cross-talking of abiotic stress in Malus zumi. An interaction network of salt responsive genes was constructed and molecular regulatory pathways of apple were deduced. Our research will contribute to gene function analysis and further the understanding of salt-tolerance mechanisms in fruit trees.

  8. A Genome-Wide Expression Profile of Salt-Responsive Genes in the Apple Rootstock Malus zumi

    PubMed Central

    Li, Qingtian; Liu, Jia; Tan, Dunxian; Allan, Andrew C.; Jiang, Yuzhuang; Xu, Xuefeng; Han, Zhenhai; Kong, Jin

    2013-01-01

    In some areas of cultivation, a lack of salt tolerance severely affects plant productivity. Apple, Malus x domestica Borkh., is sensitive to salt, and, as a perennial woody plant the mechanism of salt stress adaption will be different from that of annual herbal model plants, such as Arabidopsis. Malus zumi is a salt tolerant apple rootstock, which survives high salinity (up to 0.6% NaCl). To examine the mechanism underlying this tolerance, a genome-wide expression analysis was performed, using a cDNA library constructed from salt-treated seedlings of Malus zumi. A total of 15,000 cDNA clones were selected for microarray analysis. In total a group of 576 cDNAs, of which expression changed more than four-fold, were sequenced and 18 genes were selected to verify their expression pattern under salt stress by semi-quantitative RT-PCR. Our genome-wide expression analysis resulted in the isolation of 50 novel Malus genes and the elucidation of a new apple-specific mechanism of salt tolerance, including the stabilization of photosynthesis under stress, involvement of phenolic compounds, and sorbitol in ROS scavenging and osmoprotection. The promoter regions of 111 genes were analyzed by PlantCARE, suggesting an intensive cross-talking of abiotic stress in Malus zumi. An interaction network of salt responsive genes was constructed and molecular regulatory pathways of apple were deduced. Our research will contribute to gene function analysis and further the understanding of salt-tolerance mechanisms in fruit trees. PMID:24145753

  9. Genome-wide association study identifies TH1 pathway genes associated with lung function in asthmatic patients

    PubMed Central

    Li, Xingnan; Hawkins, Gregory A.; Ampleford, Elizabeth J.; Moore, Wendy C.; Li, Huashi; Hastie, Annette T.; Howard, Timothy D.; Boushey, Homer A.; Busse, William W.; Calhoun, William J.; Castro, Mario; Erzurum, Serpil C.; Israel, Elliot; Lemanske, Robert F.; Szefler, Stanley J.; Wasserman, Stephen I.; Wenzel, Sally E.; Peters, Stephen P.; Meyers, Deborah A.; Bleecker, Eugene R.

    2013-01-01

    Background Recent meta-analyses of genome-wide association studies in general populations of European descent have identified 28 loci for lung function. Objective We sought to identify novel lung function loci specifically for asthma and to confirm lung function loci identified in general populations. Methods Genome-wide association studies of lung function (percent predicted FEV1 [ppFEV1], percent predicted forced vital capacity, and FEV1/forced vital capacity ratio) were performed in 4 white populations of European descent (n = 1544), followed by meta-analyses. Results Seven of 28 previously identified lung function loci (HHIP, FAM13A, THSD4, GSTCD, NOTCH4-AGER, RARB, and ZNF323) identified in general populations were confirmed at single nucleotide polymorphism (SNP) levels (P < .05). Four of 32 loci (IL12A, IL12RB1, STAT4, and IRF2) associated with ppFEV1 (P < 10−4) belong to the TH1 or IL-12 cytokine family pathway. By using a linear additive model, these 4 TH1 pathway SNPs cumulatively explained 2.9% to 7.8% of the variance in ppFEV1 values in 4 populations (P = 3 × 10−11). Genetic scores of these 4 SNPs were associated with ppFEV1 values (P = 2 × 10−7) and the American Thoracic Society severe asthma classification (P = .005) in the Severe Asthma Research Program population. TH2 pathway genes (IL13, TSLP, IL33, and IL1RL1) conferring asthma susceptibility were not associated with lung function. Conclusion Genes involved in airway structure/remodeling are associated with lung function in both general populations and asthmatic subjects. TH1 pathway genes involved in anti-virus/bacterial infection and inflammation modify lung function in asthmatic subjects. Genes associated with lung function that might affect asthma severity are distinct from those genes associated with asthma susceptibility. PMID:23541324

  10. A K(ATP) channel gene effect on sleep duration: from genome-wide association studies to function in Drosophila.

    PubMed

    Allebrandt, K V; Amin, N; Müller-Myhsok, B; Esko, T; Teder-Laving, M; Azevedo, R V D M; Hayward, C; van Mill, J; Vogelzangs, N; Green, E W; Melville, S A; Lichtner, P; Wichmann, H-E; Oostra, B A; Janssens, A C J W; Campbell, H; Wilson, J F; Hicks, A A; Pramstaller, P P; Dogas, Z; Rudan, I; Merrow, M; Penninx, B; Kyriacou, C P; Metspalu, A; van Duijn, C M; Meitinger, T; Roenneberg, T

    2013-01-01

    Humans sleep approximately a third of their lifetime. The observation that individuals with either long or short sleep duration show associations with metabolic syndrome and psychiatric disorders suggests that the length of sleep is adaptive. Although sleep duration can be influenced by photoperiod (season) and phase of entrainment (chronotype), human familial sleep disorders indicate that there is a strong genetic modulation of sleep. Therefore, we conducted high-density genome-wide association studies for sleep duration in seven European populations (N=4251). We identified an intronic variant (rs11046205; P=3.99 × 10(-8)) in the ABCC9 gene that explains ≈5% of the variation in sleep duration. An influence of season and chronotype on sleep duration was solely observed in the replication sample (N=5949). Meta-analysis of the associations found in a subgroup of the replication sample, chosen for season of entry and chronotype, together with the discovery results showed genome-wide significance. RNA interference knockdown experiments of the conserved ABCC9 homologue in Drosophila neurons renders flies sleepless during the first 3 h of the night. ABCC9 encodes an ATP-sensitive potassium channel subunit (SUR2), serving as a sensor of intracellular energy metabolism. PMID:22105623

  11. A K(ATP) channel gene effect on sleep duration: from genome-wide association studies to function in Drosophila.

    PubMed

    Allebrandt, K V; Amin, N; Müller-Myhsok, B; Esko, T; Teder-Laving, M; Azevedo, R V D M; Hayward, C; van Mill, J; Vogelzangs, N; Green, E W; Melville, S A; Lichtner, P; Wichmann, H-E; Oostra, B A; Janssens, A C J W; Campbell, H; Wilson, J F; Hicks, A A; Pramstaller, P P; Dogas, Z; Rudan, I; Merrow, M; Penninx, B; Kyriacou, C P; Metspalu, A; van Duijn, C M; Meitinger, T; Roenneberg, T

    2013-01-01

    Humans sleep approximately a third of their lifetime. The observation that individuals with either long or short sleep duration show associations with metabolic syndrome and psychiatric disorders suggests that the length of sleep is adaptive. Although sleep duration can be influenced by photoperiod (season) and phase of entrainment (chronotype), human familial sleep disorders indicate that there is a strong genetic modulation of sleep. Therefore, we conducted high-density genome-wide association studies for sleep duration in seven European populations (N=4251). We identified an intronic variant (rs11046205; P=3.99 × 10(-8)) in the ABCC9 gene that explains ≈5% of the variation in sleep duration. An influence of season and chronotype on sleep duration was solely observed in the replication sample (N=5949). Meta-analysis of the associations found in a subgroup of the replication sample, chosen for season of entry and chronotype, together with the discovery results showed genome-wide significance. RNA interference knockdown experiments of the conserved ABCC9 homologue in Drosophila neurons renders flies sleepless during the first 3 h of the night. ABCC9 encodes an ATP-sensitive potassium channel subunit (SUR2), serving as a sensor of intracellular energy metabolism.

  12. Genome-wide selection of superior reference genes for expression studies in Ganoderma lucidum.

    PubMed

    Xu, Zhichao; Xu, Jiang; Ji, Aijia; Zhu, Yingjie; Zhang, Xin; Hu, Yuanlei; Song, Jingyuan; Chen, Shilin

    2015-12-15

    Quantitative real-time polymerase chain reaction (qRT-PCR) is widely used for the accurate analysis of gene expression. However, high homology among gene families might result in unsuitability of reference genes, which leads to the inaccuracy of qRT-PCR analysis. The release of the Ganoderma lucidum genome has triggered numerous studies to be done on the homology among gene families with the purpose of selecting reliable reference genes. Based on the G. lucdum genome and transcriptome database, 38 candidate reference genes including 28 novel genes were systematically selected and evaluated for qRT-PCR normalization. The result indicated that commonly used polyubiquitin (PUB), beta-actin (BAT), and glyceraldehyde 3-phosphate dehydrogenase (GAPDH) were unsuitable reference genes because of the high sequence similarity and low primer specificity. According to the evaluation of RefFinder, cyclophilin 5 (CYP5) was ranked as the most stable reference gene for 27 tested samples under all experimental conditions and eighteen mycelial samples. Based on sequence analysis and expression analysis, our study suggested that gene characteristic, primer specificity of high homologous genes, allele-specificity expression of candidate genes and under-evaluation of reference genes influenced the accuracy and sensitivity of qRT-PCR analysis. This investigation not only revealed potential factors influencing the unsuitability of reference genes but also selected the superior reference genes from more candidate genes and testing samples than those used in the previous study. Furthermore, our study established a model for reference gene analysis by using the genomic sequence.

  13. Genome-wide characterization of the Pectate Lyase-like (PLL) genes in Brassica rapa.

    PubMed

    Jiang, Jingjing; Yao, Lina; Miao, Ying; Cao, Jiashu

    2013-11-01

    Pectate lyases (PL) depolymerize demethylated pectin (pectate, EC 4.2.2.2) by catalyzing the eliminative cleavage of α-1,4-glycosidic linked galacturonan. Pectate Lyase-like (PLL) genes are one of the largest and most complex families in plants. However, studies on the phylogeny, gene structure, and expression of PLL genes are limited. To understand the potential functions of PLL genes in plants, we characterized their intron-exon structure, phylogenetic relationships, and protein structures, and measured their expression patterns in various tissues, specifically the reproductive tissues in Brassica rapa. Sequence alignments revealed two characteristic motifs in PLL genes. The chromosome location analysis indicated that 18 of the 46 PLL genes were located in the least fractionated sub-genome (LF) of B. rapa, while 16 were located in the medium fractionated sub-genome (MF1) and 12 in the more fractionated sub-genome (MF2). Quantitative RT-PCR analysis showed that BrPLL genes were expressed in various tissues, with most of them being expressed in flowers. Detailed qRT-PCR analysis identified 11 pollen specific PLL genes and several other genes with unique spatial expression patterns. In addition, some duplicated genes showed similar expression patterns. The phylogenetic analysis identified three PLL gene subfamilies in plants, among which subfamily II might have evolved from gene neofunctionalization or subfunctionalization. Therefore, this study opens the possibility for exploring the roles of PLL genes during plant development.

  14. Genome-wide gene expression perturbation induced by loss of C2 chromosome in allotetraploid Brassica napus L.

    PubMed

    Zhu, Bin; Shao, Yujiao; Pan, Qi; Ge, Xianhong; Li, Zaiyun

    2015-01-01

    Aneuploidy with loss of entire chromosomes from normal complement disrupts the balanced genome and is tolerable only by polyploidy plants. In this study, the monosomic and nullisomic plants losing one or two copies of C2 chromosome from allotetraploid Brassica napus L. (2n = 38, AACC) were produced and compared for their phenotype and transcriptome. The monosomics gave a plant phenotype very similar to the original donor, but the nullisomics had much smaller stature and also shorter growth period. By the comparative analyses on the global transcript profiles with the euploid donor, genome-wide alterations in gene expression were revealed in two aneuploids, and their majority of differentially expressed genes (DEGs) resulted from the trans-acting effects of the zero and one copy of C2 chromosome. The higher number of up-regulated genes than down-regulated genes on other chromosomes suggested that the genome responded to the C2 loss via enhancing the expression of certain genes. Particularly, more DEGs were detected in the monosomics than nullisomics, contrasting with their phenotypes. The gene expression of the other chromosomes was differently affected, and several dysregulated domains in which up- or downregulated genes obviously clustered were identifiable. But the mean gene expression (MGE) for homoeologous chromosome A2 reduced with the C2 loss. Some genes and their expressions on C2 were correlated with the phenotype deviations in the aneuploids. These results provided new insights into the transcriptomic perturbation of the allopolyploid genome elicited by the loss of individual chromosome. PMID:26442076

  15. Genome-wide identification and expression profiling of the SnRK2 gene family in Malus prunifolia.

    PubMed

    Shao, Yun; Qin, Yuan; Zou, Yangjun; Ma, Fengwang

    2014-11-15

    Sucrose non-fermenting-1-related protein kinase 2 (SnRK2) constitutes a small plant-specific serine/threonine kinase family with essential roles in the abscisic acid (ABA) signal pathway and in responses to osmotic stress. Although a genome-wide analysis of this family has been conducted in some species, little is known about SnRK2 genes in apple (Malus domestica). We identified 14 putative sequences encoding 12 deduced SnRK2 proteins within the apple genome. Gene chromosomal location and synteny analysis of the apple SnRK2 genes indicated that tandem and segmental duplications have likely contributed to the expansion and evolution of these genes. All 12 full-length coding sequences were confirmed by cloning from Malus prunifolia. The gene structure and motif compositions of the apple SnRK2 genes were analyzed. Phylogenetic analysis showed that MpSnRK2s could be classified into four groups. Profiling of these genes presented differential patterns of expression in various tissues. Under stress conditions, transcript levels for some family members were up-regulated in the leaves in response to drought, salinity, or ABA treatments. This suggested their possible roles in plant response to abiotic stress. Our findings provide essential information about SnRK2 genes in apple and will contribute to further functional dissection of this gene family.

  16. Genome-wide identification of lineage-specific genes in Arabidopsis, Oryza and Populus

    SciTech Connect

    Yang, Xiaohan; Jawdy, Sara; Tschaplinski, Timothy J; Tuskan, Gerald A

    2009-01-01

    Protein sequences were compared among Arabidopsis, Oryza and Populus to identify differential gene (DG) sets that are in one but not the other two genomes. The DG sets were screened against a plant transcript database, the NR protein database and six newly-sequenced genomes (Carica, Glycine, Medicago, Sorghum, Vitis and Zea) to identify a set of species-specific genes (SS). Gene expression, protein motif and intron number were examined. 192, 641 and 109 SS genes were identified in Arabidopsis, Oryza and Populus, respectively. Some SS genes were preferentially expressed in flowers, roots, xylem and cambium or up-regulated by stress. Six conserved motifs in Arabidopsis and Oryza SS proteins were found in other distant lineages. The SS gene sets were enriched with intronless genes. The results reflect functional and/or anatomical differences between monocots and eudicots or between herbaceous and woody plants. The Populus-specific genes are candidates for carbon sequestration and biofuel research.

  17. Genome-wide analysis of the GRAS gene family in Chinese cabbage (Brassica rapa ssp. pekinensis).

    PubMed

    Song, Xiao-Ming; Liu, Tong-Kun; Duan, Wei-Ke; Ma, Qing-Hua; Ren, Jun; Wang, Zhen; Li, Ying; Hou, Xi-Lin

    2014-01-01

    The GRAS gene family is one of the most important families of transcriptional regulators. In this study, 48 GRAS genes are identified from Chinese cabbage, and they are classified into eight groups according to the classification of Arabidopsis. The characterization, classification, gene structure and phylogenetic construction of GRAS proteins are performed. Distribution mapping shows that GRAS proteins are nonrandomly localized in 10 chromosomes. Fifty-five orthologous gene pairs are shared by Chinese cabbage and Arabidopsis, and interaction networks of these orthologous genes are constructed. The expansion of GRAS genes in Chinese cabbage results from genome triplication. Among the 17 species examined, 14 higher plants carry the GRAS genes, whereas two lower plants and one fungi species do not. Furthermore, the expression patterns of GRAS genes exhibit differences in three tissues based on RNA-seq data. Taken together, this comprehensive analysis will provide rich resources for studying GRAS protein functions in Chinese cabbage.

  18. Identification of potential driver genes in human liver carcinoma by genome-wide screening

    PubMed Central

    Woo, Hyun Goo; Park, Eun Sung; Lee, Ju-Seog; Lee, Yun-Han; Ishikawa, Tsuyoshi; Kim, Yoon Jun; Thorgeirsson, Snorri S.

    2009-01-01

    Genomic copy number aberrations and corresponding transcriptional deregulation in the cancer genome have been suggested to have regulatory roles in cancer development and progression. However, functional evaluation of individual genes from lengthy lists of candidate genes from genomic datasets presents a significant challenge. Here we report effective gene selection strategies to identify potential driver genes based on systematic integration of genome scale data of DNA copy numbers and gene expression profiles. Using regional pattern recognition approaches, we discovered the most probable copy number-dependent regions and 50 potential driver genes. At each step of gene selection process, functional relevance of the selected genes was evaluated by estimating the prognostic significance of the selected genes. Further validation using small interference RNA (siRNA)-mediated knockdown experiments demonstrated proof-of-principle evidence for the potential driver roles of the genes in HCC progression (i.e., NCSTN and SCRIB). In addition, systemic prediction of drug responses implicated the association of the 50 genes with specific signaling molecules (mTOR, AMPK, and EGFR). In conclusion, the application of an unbiased and integrative analysis of multidimensional genomic datasets can effectively screen for potential driver genes and provides novel mechanistic and clinical insights into pathobiology of HCC. PMID:19366792

  19. Genome-Wide Gene Expression in relation to Age in Large Laboratory Cohorts of Drosophila melanogaster

    PubMed Central

    Carlson, Kimberly A.; Gardner, Kylee; Pashaj, Anjeza; Carlson, Darby J.; Yu, Fang; Eudy, James D.; Zhang, Chi; Harshman, Lawrence G.

    2015-01-01

    Aging is a complex process characterized by a steady decline in an organism's ability to perform life-sustaining tasks. In the present study, two cages of approximately 12,000 mated Drosophila melanogaster females were used as a source of RNA from individuals sampled frequently as a function of age. A linear model for microarray data method was used for the microarray analysis to adjust for the box effect; it identified 1,581 candidate aging genes. Cluster analyses using a self-organizing map algorithm on the 1,581 significant genes identified gene expression patterns across different ages. Genes involved in immune system function and regulation, chorion assembly and function, and metabolism were all significantly differentially expressed as a function of age. The temporal pattern of data indicated that gene expression related to aging is affected relatively early in life span. In addition, the temporal variance in gene expression in immune function genes was compared to a random set of genes. There was an increase in the variance of gene expression within each cohort, which was not observed in the set of random genes. This observation is compatible with the hypothesis that D. melanogaster immune function genes lose control of gene expression as flies age. PMID:26090231

  20. A genome-wide survey reveals abundant rice blast R-genes in resistant cultivars

    PubMed Central

    Tan, Shengjun; Zhong, Yan; Wang, Ling; Gu, Longjiang; Chen, Jian-Qun; Pan, Qinghua; Bergelson, Joy; Tian, Dacheng

    2015-01-01

    Summary Plant resistance genes (R-genes) harbor tremendous allelic diversity, constituting a robust immune system effective against microbial pathogens. Nevertheless, few functional R-genes have been identified for even the best-studied pathosystems. Does this limited repertoire reflect specificity, with most R-genes having been defeated by former pests, or do plants harbor a rich diversity of functional R-genes whose composite behavior is yet to be characterized? Here, we survey 332 NBS-LRR genes cloned from 5 resistant rice cultivars for their ability to confer recognition of 12 rice blast isolates when transformed into susceptible cultivars. Our survey reveals that 48.5% of the 132 NBS-LRR loci tested contain functional rice blast R-genes, with most R-genes deriving from multi-copy clades containing especially diversified loci. Each R-gene recognized, on average, 2.42 of the 12 isolates screened. The abundant R-genes identified in resistant genomes provide extraordinary redundancy in the ability of host genotypes to recognize particular isolates. If the same is true for other pathogens, many extant NBS-LRR genes retain functionality. Our success at identifying rice blast R-genes also validates a highly efficient cloning and screening strategy. PMID:26248689

  1. Genome-Wide Analysis of Gene Expression during Early Arabidopsis Flower Development

    PubMed Central

    Wellmer, Frank; Alves-Ferreira, Márcio; Dubois, Annick; Riechmann, José Luis; Meyerowitz, Elliot M

    2006-01-01

    Detailed information about stage-specific changes in gene expression is crucial for the understanding of the gene regulatory networks underlying development. Here, we describe the global gene expression dynamics during early flower development, a key process in the life cycle of a plant, during which floral patterning and the specification of floral organs is established. We used a novel floral induction system in Arabidopsis, which allows the isolation of a large number of synchronized floral buds, in conjunction with whole-genome microarray analysis to identify genes with differential expression at distinct stages of flower development. We found that the onset of flower formation is characterized by a massive downregulation of genes in incipient floral primordia, which is followed by a predominance of gene activation during the differentiation of floral organs. Among the genes we identified as differentially expressed in the experiment, we detected a significant enrichment of closely related members of gene families. The expression profiles of these related genes were often highly correlated, indicating similar temporal expression patterns. Moreover, we found that the majority of these genes is specifically up-regulated during certain developmental stages. Because co-expressed members of gene families in Arabidopsis frequently act in a redundant manner, these results suggest a high degree of functional redundancy during early flower development, but also that its extent may vary in a stage-specific manner. PMID:16789830

  2. Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data

    PubMed Central

    2013-01-01

    Background Understanding how genes are expressed specifically in particular tissues is a fundamental question in developmental biology. Many tissue-specific genes are involved in the pathogenesis of complex human diseases. However, experimental identification of tissue-specific genes is time consuming and difficult. The accurate predictions of tissue-specific gene targets could provide useful information for biomarker development and drug target identification. Results In this study, we have developed a machine learning approach for predicting the human tissue-specific genes using microarray expression data. The lists of known tissue-specific genes for different tissues were collected from UniProt database, and the expression data retrieved from the previously compiled dataset according to the lists were used for input vector encoding. Random Forests (RFs) and Support Vector Machines (SVMs) were used to construct accurate classifiers. The RF classifiers were found to outperform SVM models for tissue-specific gene prediction. The results suggest that the candidate genes for brain or liver specific expression can provide valuable information for further experimental studies. Our approach was also applied for identifying tissue-selective gene targets for different types of tissues. Conclusions A machine learning approach has been developed for accurately identifying the candidate genes for tissue specific/selective expression. The approach provides an efficient way to select some interesting genes for developing new biomedical markers and improve our knowledge of tissue-specific expression. PMID:23369200

  3. Genome-wide characterization of the CBF/DREB1 gene family in Brassica rapa.

    PubMed

    Lee, Sang-Choon; Lim, Myung-Ho; Yu, Jae-Gyeong; Park, Beom-Seok; Yang, Tae-Jin

    2012-12-01

    The C-repeat/dehydration-responsive element binding transcription factors (CBF/DREBs) are important proteins in involved in responses to abiotic stress in plants. We identified ten BrDREB1 genes belonging to the CBF/DREB1 gene family in the Brassica rapa whole genome sequence, whereas six genes are found in the Arabidopsis thaliana genome. The deduced amino acid sequences of the B. rapa genes showed conserved motifs shared with other known plant CBF/DREB1s. Comparative analysis revealed that nine of the BrDREB1 genes were derived from the recent genome triplication in the tribe Brassiceae and the other one was translocated. The nine genes were located in seven of the 12 macrosyntenic blocks that are triplicated counterparts of four Arabidopsis macrosyntenic blocks harboring six CBF/DREB1 genes: one gene on each of three blocks and three tandemly arrayed genes on another block. We inspected the expression patterns of eight BrDREB1 genes by RT-PCR and microarray database searches. All eight genes were highly up-regulated during cold (4 °C) treatment, and some of them were also responsive to salt (250 mM NaCl), drought (air drying), and ABA (100 μM) treatment. Microarray data for plant developmental stages revealed that BrDREB1C2 was highly expressed during a period of cold treatment for vernalization, similar to abiotic stress-inducible genes homologous to Bn28a, Bn47, Bn115, and BoRS1, but almost opposite of BrFLC genes. Taken together, the number of BrDREB1 genes increased to 10 by genome triplication and reorganization, providing additional functions in B. rapa abiotic stress responses and development, as distinct from their Arabidopsis homologs.

  4. Genome-wide characterization of the CBF/DREB1 gene family in Brassica rapa.

    PubMed

    Lee, Sang-Choon; Lim, Myung-Ho; Yu, Jae-Gyeong; Park, Beom-Seok; Yang, Tae-Jin

    2012-12-01

    The C-repeat/dehydration-responsive element binding transcription factors (CBF/DREBs) are important proteins in involved in responses to abiotic stress in plants. We identified ten BrDREB1 genes belonging to the CBF/DREB1 gene family in the Brassica rapa whole genome sequence, whereas six genes are found in the Arabidopsis thaliana genome. The deduced amino acid sequences of the B. rapa genes showed conserved motifs shared with other known plant CBF/DREB1s. Comparative analysis revealed that nine of the BrDREB1 genes were derived from the recent genome triplication in the tribe Brassiceae and the other one was translocated. The nine genes were located in seven of the 12 macrosyntenic blocks that are triplicated counterparts of four Arabidopsis macrosyntenic blocks harboring six CBF/DREB1 genes: one gene on each of three blocks and three tandemly arrayed genes on another block. We inspected the expression patterns of eight BrDREB1 genes by RT-PCR and microarray database searches. All eight genes were highly up-regulated during cold (4 °C) treatment, and some of them were also responsive to salt (250 mM NaCl), drought (air drying), and ABA (100 μM) treatment. Microarray data for plant developmental stages revealed that BrDREB1C2 was highly expressed during a period of cold treatment for vernalization, similar to abiotic stress-inducible genes homologous to Bn28a, Bn47, Bn115, and BoRS1, but almost opposite of BrFLC genes. Taken together, the number of BrDREB1 genes increased to 10 by genome triplication and reorganization, providing additional functions in B. rapa abiotic stress responses and development, as distinct from their Arabidopsis homologs. PMID:23148914

  5. Genome-wide identification, classification, and expression analysis of CDPK and its closely related gene families in poplar (Populus trichocarpa).

    PubMed

    Zuo, Ran; Hu, Ruibo; Chai, Guohua; Xu, Meiling; Qi, Guang; Kong, Yingzhen; Zhou, Gongke

    2013-03-01

    Calcium-dependent protein kinases (CDPKs) are Ca(2+)-binding proteins known to play crucial roles in Ca(2+) signal transduction pathways which have been identified throughout plant kingdom and in certain types of protists. Genome-wide analysis of CDPKs have been carried out in Arabidopsis, rice and wheat, and quite a few of CDPKs were proved to play crucial roles in plant stress responsive signature pathways. In this study, a comprehensive analysis of Populus CDPK and its closely related gene families was performed, including phylogeny, chromosome locations, gene structures, and expression profiles. Thirty Populus CDPK genes and twenty closely related kinase genes were identified, which were phylogenetically clustered into eight distinct subfamilies and predominately distributed across fifteen linkage groups (LG). Genomic organization analyses indicated that purifying selection has played a pivotal role in the retention and maintenance of Populus CDPK gene family. Furthermore, microarray analysis showed that a number of Populus CDPK and its closely related genes differentially expressed across disparate tissues and under various stresses. The expression profiles of paralogous pairs were also investigated to reveal their evolution fates. In addition, quantitative real-time RT-PCR was performed on nine selected CDPK genes to confirm their responses to drought stress treatment. These observations may lay the foundation for future functional analysis of Populus CDPK and its closely related gene families to unravel their biological roles.

  6. Genome-wide identification of NBS-encoding resistance genes in Brassica rapa.

    PubMed

    Mun, Jeong-Hwan; Yu, Hee-Ju; Park, Soomin; Park, Beom-Seok

    2009-12-01

    Nucleotide-binding site (NBS)-encoding resistance genes are key plant disease-resistance genes and are abundant in plant genomes, comprising up to 2% of all genes. The availability of genome sequences from several plant models enables the identification and cloning of NBS-encoding genes from closely related species based on a comparative genomics approach. In this study, we used the genome sequence of Brassica rapa to identify NBS-encoding genes in the Brassica genome. We identified 92 non-redundant NBS-encoding genes [30 CC-NBS-LRR (CNL) and 62 TIR-NBS-LRR (TNL) genes] in approximately 100 Mbp of B. rapa euchromatic genome sequence. Despite the fact that B. rapa has a significantly larger genome than Arabidopsis thaliana due to a recent whole genome triplication event after speciation, B. rapa contains relatively small number of NBS-encoding genes compared to A. thaliana, presumably because of deletion of redundant genes related to genome diploidization. Phylogenetic and evolutionary analyses suggest that relatively higher relaxation of selective constraints on the TNL group after the old duplication event resulted in greater accumulation of TNLs than CNLs in both Arabidopsis and Brassica genomes. Recent tandem duplication and ectopic deletion are likely to have played a role in the generation of novel Brassica lineage-specific resistance genes.

  7. Genome-wide analysis of the WRKY gene family in physic nut (Jatropha curcas L.).

    PubMed

    Xiong, Wangdan; Xu, Xueqin; Zhang, Lin; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

    2013-07-25

    The WRKY proteins, which contain highly conserved WRKYGQK amino acid sequences and zinc-finger-like motifs, constitute a large family of transcription factors in plants. They participate in diverse physiological and developmental processes. WRKY genes have been identified and characterized in a number of plant species. We identified a total of 58 WRKY genes (JcWRKY) in the genome of the physic nut (Jatropha curcas L.). On the basis of their conserved WRKY domain sequences, all of the JcWRKY proteins could be assigned to one of the previously defined groups, I-III. Phylogenetic analysis of JcWRKY genes with Arabidopsis and rice WRKY genes, and separately with castor bean WRKY genes, revealed no evidence of recent gene duplication in JcWRKY gene family. Analysis of transcript abundance of JcWRKY gene products were tested in different tissues under normal growth condition. In addition, 47 WRKY genes responded to at least one abiotic stress (drought, salinity, phosphate starvation and nitrogen starvation) in individual tissues (leaf, root and/or shoot cortex). Our study provides a useful reference data set as the basis for cloning and functional analysis of physic nut WRKY genes. PMID:23644253

  8. Genome-wide identification, characterization, and expression analysis of lineage-specific genes within zebrafish

    PubMed Central

    2013-01-01

    Background The genomic basis of teleost phenotypic complexity remains obscure, despite increasing availability of genome and transcriptome sequence data. Fish-specific genome duplication cannot provide sufficient explanation for the morphological complexity of teleosts, considering the relatively large number of extinct basal ray-finned fishes. Results In this study, we performed comparative genomic analysis to discover the Conserved Teleost-Specific Genes (CTSGs) and orphan genes within zebrafish and found that these two sets of lineage-specific genes may have played important roles during zebrafish embryogenesis. Lineage-specific genes within zebrafish share many of the characteristics of their counterparts in other species: shorter length, fewer exon numbers, higher GC content, and fewer of them have transcript support. Chromosomal location analysis indicated that neither the CTSGs nor the orphan genes were distributed evenly in the chromosomes of zebrafish. The significant enrichment of immunity proteins in CTSGs annotated by gene ontology (GO) or predicted ab initio may imply that defense against pathogens may be an important reason for the diversification of teleosts. The evolutionary origin of the lineage-specific genes was determined and a very high percentage of lineage-specific genes were generated via gene duplications. The temporal and spatial expression profile of lineage-specific genes obtained by expressed sequence tags (EST) and RNA-seq data revealed two novel properties: in addition to being highly tissue-preferred expression, lineage-specific genes are also highly temporally restricted, namely they are expressed in narrower time windows than evolutionarily conserved genes and are specifically enriched in later-stage embryos and early larval stages. Conclusions Our study provides the first systematic identification of two different sets of lineage-specific genes within zebrafish and provides valuable information leading towards a better

  9. Genome-wide identification of lineage-specific genes within Caenorhabditis elegans.

    PubMed

    Zhou, Kun; Huang, Beibei; Zou, Ming; Lu, Dandan; He, Shunping; Wang, Guoxiu

    2015-10-01

    With the rapid growth of sequencing technology, a number of genomes and transcriptomes of various species have been sequenced, contributing to the study of lineage-specific genes (LSGs). We identified two sets of LSGs using BLAST: one included Caenorhabditis elegans species-specific genes (1423, SSGs), and the other consisted of Caenorhabditis genus-specific genes (4539, GSGs). The subsequent characterization and analysis of the SSGs and GSGs showed that they have significant differences in evolution and that most LSGs were generated by gene duplication and integration of transposable elements (TEs). We then performed temporal expression profiling and protein function prediction and observed that many SSGs and GSGs are expressed and that genes involved with sex determination, specific stress, immune response, and morphogenesis are over-represented, suggesting that these specific genes may be related to the Caenorhabditis nematodes' special ability to survive in severe and extreme environments.

  10. Comparative analysis of genome-wide Mlo gene family in Cajanus cajan and Phaseolus vulgaris.

    PubMed

    Deshmukh, Reena; Singh, V K; Singh, B D

    2016-04-01

    The Mlo gene was discovered in barley because the mutant 'mlo' allele conferred broad-spectrum, non-race-specific resistance to powdery mildew caused by Blumeria graminis f. sp. hordei. The Mlo genes also play important roles in growth and development of plants, and in responses to biotic and abiotic stresses. The Mlo gene family has been characterized in several crop species, but only a single legume species, soybean (Glycine max L.), has been investigated so far. The present report describes in silico identification of 18 CcMlo and 20 PvMlo genes in the important legume crops Cajanus cajan (L.) Millsp. and Phaseolus vulgaris L., respectively. In silico analysis of gene organization, protein properties and conserved domains revealed that the C. cajan and P. vulgaris Mlo gene paralogs are more divergent from each other than from their orthologous pairs. The comparative phylogenetic analysis classified CcMlo and PvMlo genes into three major clades. A comparative analysis of CcMlo and PvMlo proteins with the G. max Mlo proteins indicated close association of one CcMlo, one PvMlo with two GmMlo genes, indicating that there was no further expansion of the Mlo gene family after the separation of these species. Thus, most of the diploid species of eudicots might be expected to contain 15-20 Mlo genes. The genes CcMlo12 and 14, and PvMlo11 and 12 are predicted to participate in powdery mildew resistance. If this prediction were verified, these genes could be targeted by TILLING or CRISPR to isolate powdery mildew resistant mutants. PMID:26961357

  11. Genome-wide analysis of the MYB gene family in physic nut (Jatropha curcas L.).

    PubMed

    Zhou, Changpin; Chen, Yanbo; Wu, Zhenying; Lu, Wenjia; Han, Jinli; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

    2015-11-01

    The MYB proteins comprise one of the largest transcription factor families in plants, and play key roles in regulatory networks controlling development, metabolism, and stress responses. A total of 125 MYB genes (JcMYB) have been identified in the physic nut (Jatropha curcas L.) genome, including 120 2R-type MYB, 4 3R-MYB, and 1 4R-MYB genes. Based on exon-intron arrangement of MYBs from both lower (Physcomitrella patens) and higher (physic nut, Arabidopsis, and rice) plants, we can classify plant MYB genes into ten groups (MI-X), except for MIX genes which are nonexistent in higher plants. We also observed that MVIII genes may be one of the most ancient MYB types which consist of both R2R3- and 3R-MYB genes. Most MYB genes (76.8% in physic nut) belong to the MI group which can be divided into 34 subgroups. The JcMYB genes were nonrandomly distributed on its 11 linkage groups (LGs). The expansion of MYB genes across several subgroups was observed and resulted from genome triplication of ancient dicotyledons and from both ancient and recent tandem duplication events in the physic nut genome. The expression patterns of several MYB duplicates in the physic nut showed differences in four tissues (root, stem, leaf, and seed), and 34 MYB genes responded to at least one abiotic stressor (drought, salinity, phosphate starvation, and nitrogen starvation) in leaves and/or roots based on the data analysis of digital gene expression tags. Overexpression of the JcMYB001 gene in Arabidopsis increased its sensitivity to drought and salinity stresses. PMID:26142104

  12. Genome-wide analysis of the MYB gene family in physic nut (Jatropha curcas L.).

    PubMed

    Zhou, Changpin; Chen, Yanbo; Wu, Zhenying; Lu, Wenjia; Han, Jinli; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

    2015-11-01

    The MYB proteins comprise one of the largest transcription factor families in plants, and play key roles in regulatory networks controlling development, metabolism, and stress responses. A total of 125 MYB genes (JcMYB) have been identified in the physic nut (Jatropha curcas L.) genome, including 120 2R-type MYB, 4 3R-MYB, and 1 4R-MYB genes. Based on exon-intron arrangement of MYBs from both lower (Physcomitrella patens) and higher (physic nut, Arabidopsis, and rice) plants, we can classify plant MYB genes into ten groups (MI-X), except for MIX genes which are nonexistent in higher plants. We also observed that MVIII genes may be one of the most ancient MYB types which consist of both R2R3- and 3R-MYB genes. Most MYB genes (76.8% in physic nut) belong to the MI group which can be divided into 34 subgroups. The JcMYB genes were nonrandomly distributed on its 11 linkage groups (LGs). The expansion of MYB genes across several subgroups was observed and resulted from genome triplication of ancient dicotyledons and from both ancient and recent tandem duplication events in the physic nut genome. The expression patterns of several MYB duplicates in the physic nut showed differences in four tissues (root, stem, leaf, and seed), and 34 MYB genes responded to at least one abiotic stressor (drought, salinity, phosphate starvation, and nitrogen starvation) in leaves and/or roots based on the data analysis of digital gene expression tags. Overexpression of the JcMYB001 gene in Arabidopsis increased its sensitivity to drought and salinity stresses.

  13. Genome-Wide RNAi Screens in C. elegans to Identify Genes Influencing Lifespan and Innate Immunity.

    PubMed

    Sinha, Amit; Rae, Robbie

    2016-01-01

    RNA interference is a rapid, inexpensive, and highly effective tool used to inhibit gene function. In C. elegans, whole genome screens have been used to identify genes involved with numerous traits including aging and innate immunity. RNAi in C. elegans can be carried out via feeding, soaking, or injection. Here we outline protocols used to maintain, grow, and carry out RNAi via feeding in C. elegans and determine whether the inhibited genes are essential for lifespan or innate immunity. PMID:27581293

  14. Computational bacterial genome-wide analysis of phylogenetic profiles reveals potential virulence genes of Streptococcus agalactiae.

    PubMed

    Lin, Frank Po-Yen; Lan, Ruiting; Sintchenko, Vitali; Gilbert, Gwendolyn L; Kong, Fanrong; Coiera, Enrico

    2011-04-04

    The phylogenetic profile of a gene is a reflection of its evolutionary history and can be defined as the differential presence or absence of a gene in a set of reference genomes. It has been employed to facilitate the prediction of gene functions. However, the hypothesis that the application of this concept can also facilitate the discovery of bacterial virulence factors has not been fully examined. In this paper, we test this hypothesis and report a computational pipeline designed to identify previously unknown bacterial virulence genes using group B streptococcus (GBS) as an example. Phylogenetic profiles of all GBS genes across 467 bacterial reference genomes were determined by candidate-against-all BLAST searches,which were then used to identify candidate virulence genes by machine learning models. Evaluation experiments with known GBS virulence genes suggested good functional and model consistency in cross-validation analyses (areas under ROC curve, 0.80 and 0.98 respectively). Inspection of the top-10 genes in each of the 15 virulence functional groups revealed at least 15 (of 119) homologous genes implicated in virulence in other human pathogens but previously unrecognized as potential virulence genes in GBS. Among these highly-ranked genes, many encode hypothetical proteins with possible roles in GBS virulence. Thus, our approach has led to the identification of a set of genes potentially affecting the virulence potential of GBS, which are potential candidates for further in vitro and in vivo investigations. This computational pipeline can also be extended to in silico analysis of virulence determinants of other bacterial pathogens.

  15. Comparative analysis of genome-wide Mlo gene family in Cajanus cajan and Phaseolus vulgaris.

    PubMed

    Deshmukh, Reena; Singh, V K; Singh, B D

    2016-04-01

    The Mlo gene was discovered in barley because the mutant 'mlo' allele conferred broad-spectrum, non-race-specific resistance to powdery mildew caused by Blumeria graminis f. sp. hordei. The Mlo genes also play important roles in growth and development of plants, and in responses to biotic and abiotic stresses. The Mlo gene family has been characterized in several crop species, but only a single legume species, soybean (Glycine max L.), has been investigated so far. The present report describes in silico identification of 18 CcMlo and 20 PvMlo genes in the important legume crops Cajanus cajan (L.) Millsp. and Phaseolus vulgaris L., respectively. In silico analysis of gene organization, protein properties and conserved domains revealed that the C. cajan and P. vulgaris Mlo gene paralogs are more divergent from each other than from their orthologous pairs. The comparative phylogenetic analysis classified CcMlo and PvMlo genes into three major clades. A comparative analysis of CcMlo and PvMlo proteins with the G. max Mlo proteins indicated close association of one CcMlo, one PvMlo with two GmMlo genes, indicating that there was no further expansion of the Mlo gene family after the separation of these species. Thus, most of the diploid species of eudicots might be expected to contain 15-20 Mlo genes. The genes CcMlo12 and 14, and PvMlo11 and 12 are predicted to participate in powdery mildew resistance. If this prediction were verified, these genes could be targeted by TILLING or CRISPR to isolate powdery mildew resistant mutants.

  16. Genome-Wide Analysis of the NAC Gene Family in Physic Nut (Jatropha curcas L.).

    PubMed

    Wu, Zhenying; Xu, Xueqin; Xiong, Wangdan; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Wu, Guojiang; Jiang, Huawu

    2015-01-01

    The NAC proteins (NAM, ATAF1/2 and CUC2) are plant-specific transcriptional regulators that have a conserved NAM domain in the N-terminus. They are involved in various biological processes, including both biotic and abiotic stress responses. In the present study, a total of 100 NAC genes (JcNAC) were identified in physic nut (Jatropha curcas L.). Based on phylogenetic analysis and gene structures, 83 JcNAC genes were classified as members of, or proposed to be diverged from, 39 previously predicted orthologous groups (OGs) of NAC sequences. Physic nut has a single intron-containing NAC gene subfamily that has been lost in many plants. The JcNAC genes are non-randomly distributed across the 11 linkage groups of the physic nut genome, and appear to be preferentially retained duplicates that arose from both ancient and recent duplication events. Digital gene expression analysis indicates that some of the JcNAC genes have tissue-specific expression profiles (e.g. in leaves, roots, stem cortex or seeds), and 29 genes differentially respond to abiotic stresses (drought, salinity, phosphorus deficiency and nitrogen deficiency). Our results will be helpful for further functional analysis of the NAC genes in physic nut. PMID:26125188

  17. Genome-Wide Analysis of the NAC Gene Family in Physic Nut (Jatropha curcas L.)

    PubMed Central

    Wu, Zhenying; Xu, Xueqin; Xiong, Wangdan; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Wu, Guojiang; Jiang, Huawu

    2015-01-01

    The NAC proteins (NAM, ATAF1/2 and CUC2) are plant-specific transcriptional regulators that have a conserved NAM domain in the N-terminus. They are involved in various biological processes, including both biotic and abiotic stress responses. In the present study, a total of 100 NAC genes (JcNAC) were identified in physic nut (Jatropha curcas L.). Based on phylogenetic analysis and gene structures, 83 JcNAC genes were classified as members of, or proposed to be diverged from, 39 previously predicted orthologous groups (OGs) of NAC sequences. Physic nut has a single intron-containing NAC gene subfamily that has been lost in many plants. The JcNAC genes are non-randomly distributed across the 11 linkage groups of the physic nut genome, and appear to be preferentially retained duplicates that arose from both ancient and recent duplication events. Digital gene expression analysis indicates that some of the JcNAC genes have tissue-specific expression profiles (e.g. in leaves, roots, stem cortex or seeds), and 29 genes differentially respond to abiotic stresses (drought, salinity, phosphorus deficiency and nitrogen deficiency). Our results will be helpful for further functional analysis of the NAC genes in physic nut. PMID:26125188

  18. Genome-Wide Comparative Analysis of Flowering-Related Genes in Arabidopsis, Wheat, and Barley

    PubMed Central

    Peng, Fred Y.; Hu, Zhiqiu; Yang, Rong-Cai

    2015-01-01

    Early flowering is an important trait influencing grain yield and quality in wheat (Triticum aestivum L.) and barley (Hordeum vulgare L.) in short-season cropping regions. However, due to large and complex genomes of these species, direct identification of flowering genes and their molecular characterization remain challenging. Here, we used a bioinformatic approach to predict flowering-related genes in wheat and barley from 190 known Arabidopsis (Arabidopsis thaliana (L.) Heynh.) flowering genes. We identified 900 and 275 putative orthologs in wheat and barley, respectively. The annotated flowering-related genes were clustered into 144 orthologous groups with one-to-one, one-to-many, many-to-one, and many-to-many orthology relationships. Our approach was further validated by domain and phylogenetic analyses of flowering-related proteins and comparative analysis of publicly available microarray data sets for in silico expression profiling of flowering-related genes in 13 different developmental stages of wheat and barley. These further analyses showed that orthologous gene pairs in three critical flowering gene families (PEBP, MADS, and BBX) exhibited similar expression patterns among 13 developmental stages in wheat and barley, suggesting similar functions among the orthologous genes with sequence and expression similarities. The predicted candidate flowering genes can be confirmed and incorporated into molecular breeding for early flowering wheat and barley in short-season cropping regions. PMID:26435710

  19. Genome-wide identification, characterization, and expression analysis of the MLO gene family in Cucumis sativus.

    PubMed

    Zhou, S J; Jing, Z; Shi, J L

    2013-12-11

    Mildew resistance locus o (MLO) is a plant-specific seven-transmembrane (TM) gene family. Several studies have revealed that certain members of the MLO gene family mediate powdery mildew susceptibility in three plant species, namely, Arabidopsis, barley, and tomato. The sequenced cucumber genome provides an opportunity to conduct a comprehensive overview of the MLO gene family. Fourteen genes (designated CsMLO01 through CsMLO14) have been identified within the Cucumis sativus genome by using an in silico cloning method with the MLO amino acid sequences of Arabidopsis thaliana and rice as probes. Sequence alignment revealed that numerous features of the gene family, such as TMs, a calmodulin-binding domain, peptide domains I and II, and 30 important amino acid residues for MLO function, are well conserved. Phylogenetic analysis of the MLO genes from cucumber and other plant species reveals seven different clades (I through VII). Three of these clades comprised MLO genes from A. thaliana, rice, maize, and cucumber, suggesting that these genes may have evolved after the divergence of monocots and dicots. In silico mapping showed that these CsMLOs were located on chromosomes 1, 2, 3, 4, 5, and 6 without any obvious clustering, except CsMLO01. To our knowledge, this paper is the first comprehensive report on MLO genes in C. sativus. These findings will facilitate the functional characterization of the MLOs related to powdery mildew susceptibility and assist in the development of disease resistance in cucumber.

  20. Genome-Wide Identification, Evolution and Expression Analysis of mTERF Gene Family in Maize

    PubMed Central

    Zhao, Yanxin; Cai, Manjun; Zhang, Xiaobo; Li, Yurong; Zhang, Jianhua; Zhao, Hailiang; Kong, Fei; Zheng, Yonglian; Qiu, Fazhan

    2014-01-01

    Plant mitochondrial transcription termination factor (mTERF) genes comprise a large family with important roles in regulating organelle gene expression. In this study, a comprehensive database search yielded 31 potential mTERF genes in maize (Zea mays L.) and most of them were targeted to mitochondria or chloroplasts. Maize mTERF were divided into nine main groups based on phylogenetic analysis, and group IX represented the mitochondria and species-specific clade that diverged from other groups. Tandem and segmental duplication both contributed to the expansion of the mTERF gene family in the maize genome. Comprehensive expression analysis of these genes, using microarray data and RNA-seq data, revealed that these genes exhibit a variety of expression patterns. Environmental stimulus experiments revealed differential up or down-regulation expression of maize mTERF genes in seedlings exposed to light/dark, salts and plant hormones, respectively, suggesting various important roles of maize mTERF genes in light acclimation and stress-related responses. These results will be useful for elucidating the roles of mTERF genes in the growth, development and stress response of maize. PMID:24718683

  1. Genome-Wide Linkage, Exome Sequencing and Functional Analyses Identify ABCB6 as the Pathogenic Gene of Dyschromatosis Universalis Hereditaria

    PubMed Central

    Wang, Na; Wang, Chuan; Chen, Xuechao; Sheng, Donglai; Fu, Xi’an; See, Kelvin; Foo, Jia Nee; Low, Huiqi; Liany, Herty; Irwan, Ishak Darryl; Liu, Jian; Yang, Baoqi; Chen, Mingfei; Yu, Yongxiang; Yu, Gongqi; Niu, Guiye; You, Jiabao; Zhou, Yan; Ma, Shanshan; Wang, Ting; Yan, Xiaoxiao; Goh, Boon Kee; Common, John E. A.; Lane, Birgitte E.; Sun, Yonghu; Zhou, Guizhi; Lu, Xianmei; Wang, Zhenhua; Tian, Hongqing; Cao, Yuanhua; Chen, Shumin; Liu, Qiji; Liu, Jianjun; Zhang, Furen

    2014-01-01

    Background As a genetic disorder of abnormal pigmentation, the molecular basis of dyschromatosis universalis hereditaria (DUH) had remained unclear until recently when ABCB6 was reported as a causative gene of DUH. Methodology We performed genome-wide linkage scan using Illumina Human 660W-Quad BeadChip and exome sequencing analyses using Agilent SureSelect Human All Exon Kits in a multiplex Chinese DUH family to identify the pathogenic mutations and verified the candidate mutations using Sanger sequencing. Quantitative RT-PCR and Immunohistochemistry was performed to verify the expression of the pathogenic gene, Zebrafish was also used to confirm the functional role of ABCB6 in melanocytes and pigmentation. Results Genome-wide linkage (assuming autosomal dominant inheritance mode) and exome sequencing analyses identified ABCB6 as the disease candidate gene by discovering a coding mutation (c.1358C>T; p.Ala453Val) that co-segregates with the disease phenotype. Further mutation analysis of ABCB6 in four other DUH families and two sporadic cases by Sanger sequencing confirmed the mutation (c.1358C>T; p.Ala453Val) and discovered a second, co-segregating coding mutation (c.964A>C; p.Ser322Lys) in one of the four families. Both mutations were heterozygous in DUH patients and not present in the 1000 Genome Project and dbSNP database as well as 1,516 unrelated Chinese healthy controls. Expression analysis in human skin and mutagenesis interrogation in zebrafish confirmed the functional role of ABCB6 in melanocytes and pigmentation. Given the involvement of ABCB6 mutations in coloboma, we performed ophthalmological examination of the DUH carriers of ABCB6 mutations and found ocular abnormalities in them. Conclusion Our study has advanced our understanding of DUH pathogenesis and revealed the shared pathological mechanism between pigmentary DUH and ocular coloboma. PMID:24498303

  2. Genome-Wide Analysis of the Expansin Gene Superfamily Reveals Grapevine-Specific Structural and Functional Characteristics

    PubMed Central

    Tornielli, Giovanni Battista; Fasoli, Marianna; Venturini, Luca; Pezzotti, Mario; Zenoni, Sara

    2013-01-01

    Background Expansins are proteins that loosen plant cell walls in a pH-dependent manner, probably by increasing the relative movement among polymers thus causing irreversible expansion. The expansin superfamily (EXP) comprises four distinct families: expansin A (EXPA), expansin B (EXPB), expansin-like A (EXLA) and expansin-like B (EXLB). There is experimental evidence that EXPA and EXPB proteins are required for cell expansion and developmental processes involving cell wall modification, whereas the exact functions of EXLA and EXLB remain unclear. The complete grapevine (Vitis vinifera) genome sequence has allowed the characterization of many gene families, but an exhaustive genome-wide analysis of expansin gene expression has not been attempted thus far. Methodology/Principal Findings We identified 29 EXP superfamily genes in the grapevine genome, representing all four EXP families. Members of the same EXP family shared the same exon–intron structure, and phylogenetic analysis confirmed a closer relationship between EXP genes from woody species, i.e. grapevine and poplar (Populus trichocarpa), compared to those from Arabidopsis thaliana and rice (Oryza sativa). We also identified grapevine-specific duplication events involving the EXLB family. Global gene expression analysis confirmed a strong correlation among EXP genes expressed in mature and green/vegetative samples, respectively, as reported for other gene families in the recently-published grapevine gene expression atlas. We also observed the specific co-expression of EXLB genes in woody organs, and the involvement of certain grapevine EXP genes in berry development and post-harvest withering. Conclusion Our comprehensive analysis of the grapevine EXP superfamily confirmed and extended current knowledge about the structural and functional characteristics of this gene family, and also identified properties that are currently unique to grapevine expansin genes. Our data provide a model for the functional

  3. Identification of disease-associated pathways in pancreatic cancer by integrating genome-wide association study and gene expression data

    PubMed Central

    LONG, JIN; LIU, ZHE; WU, XINGDA; XU, YUANHONG; GE, CHUNLIN

    2016-01-01

    In order to additionally understand the pathogenesis of pancreatic cancer (PC), the present study conducted pathway analysis based on genome-wide association study (GWAS) and gene expression data to predict genes that are associated with PC. GWAS data (accession no., pha002874.1) were downloaded from National Center for Biotechnology Information (NCBI) database of Genotypes and Phenotypes, which included data concerning 1,896 patients with PC and 1,939 control individuals. Gene expression data [accession no., GSE23952; human pancreatic carcinoma Panc-1 transforming growth factor-β (TGF-β) treatment assay] were downloaded from NCBI Gene Expression Omnibus. Gene set enrichment analysis was used to identify significant pathways in the GWAS or gene expression profiles. Meta-analysis was performed based on pathway analysis of the two data sources. In total, 58 and 280 pathways were identified to be significant in the GWAS and gene expression data, respectively, with 7 pathways significant in both the data profiles. Hsa 04350 TGF-β signaling pathway had the smallest meta P-value. Other significant pathways in the two data sources were negative regulation of DNA-dependent transcription, the nucleolus, negative regulation of RNA metabolic process, the cellular defense response, exocytosis and galactosyltransferase activity. By constructing the gene-pathway network, 5 pathways were closely associated, apart from exocytosis and galactosyltransferase activity pathways. Among the 7 pathways, 11 key genes (2.9% out of a total of 380 genes) from the GWAS data and 43 genes (10.5% out of a total of 409 genes) from the gene expression data were differentially expressed. Only Abelson murine leukemia viral oncogene homolog 1 from the nucleolus pathway was significantly expressed in by both data sources. Overall, the results of the present analysis provide possible factors for the occurrence of PC, and the identification of the pathways and genes associated with PC provides

  4. Multi-species data integration and gene ranking enrich significant results in an alcoholism genome-wide association study

    PubMed Central

    2012-01-01

    Background A variety of species and experimental designs have been used to study genetic influences on alcohol dependence, ethanol response, and related traits. Integration of these heterogeneous data can be used to produce a ranked target gene list for additional investigation. Results In this study, we performed a unique multi-species evidence-based data integration using three microarray experiments in mice or humans that generated an initial alcohol dependence (AD) related genes list, human linkage and association results, and gene sets implicated in C. elegans and Drosophila. We then used permutation and false discovery rate (FDR) analyses on the genome-wide association studies (GWAS) dataset from the Collaborative Study on the Genetics of Alcoholism (COGA) to evaluate the ranking results and weighting matrices. We found one weighting score matrix could increase FDR based q-values for a list of 47 genes with a score greater than 2. Our follow up functional enrichment tests revealed these genes were primarily involved in brain responses to ethanol and neural adaptations occurring with alcoholism. Conclusions These results, along with our experimental validation of specific genes in mice, C. elegans and Drosophila, suggest that a cross-species evidence-based approach is useful to identify candidate genes contributing to alcoholism. PMID:23282140

  5. Genome-wide Computational Analysis Reveals Cardiomyocyte-specific Transcriptional Cis-regulatory Motifs That Enable Efficient Cardiac Gene Therapy

    PubMed Central

    Rincon, Melvin Y; Sarcar, Shilpita; Danso-Abeam, Dina; Keyaerts, Marleen; Matrai, Janka; Samara-Kuko, Ermira; Acosta-Sanchez, Abel; Athanasopoulos, Takis; Dickson, George; Lahoutte, Tony; De Bleser, Pieter; VandenDriessche, Thierry; Chuah, Marinee K

    2015-01-01

    Gene therapy is a promising emerging therapeutic modality for the treatment of cardiovascular diseases and hereditary diseases that afflict the heart. Hence, there is a need to develop robust cardiac-specific expression modules that allow for stable expression of the gene of interest in cardiomyocytes. We therefore explored a new approach based on a genome-wide bioinformatics strategy that revealed novel cardiac-specific cis-acting regulatory modules (CS-CRMs). These transcriptional modules contained evolutionary-conserved clusters of putative transcription factor binding sites that correspond to a “molecular signature” associated with robust gene expression in the heart. We then validated these CS-CRMs in vivo using an adeno-associated viral vector serotype 9 that drives a reporter gene from a quintessential cardiac-specific α-myosin heavy chain promoter. Most de novo designed CS-CRMs resulted in a >10-fold increase in cardiac gene expression. The most robust CRMs enhanced cardiac-specific transcription 70- to 100-fold. Expression was sustained and restricted to cardiomyocytes. We then combined the most potent CS-CRM4 with a synthetic heart and muscle-specific promoter (SPc5-12) and obtained a significant 20-fold increase in cardiac gene expression compared to the cytomegalovirus promoter. This study underscores the potential of rational vector design to improve the robustness of cardiac gene therapy. PMID:25195597

  6. Genome-wide identification and expression analysis of the IQD gene family in moso bamboo (Phyllostachys edulis)

    PubMed Central

    Wu, Min; Li, Yuan; Chen, Danmei; Liu, Huanlong; Zhu, Dongyue; Xiang, Yan

    2016-01-01

    Members of the plant-specific IQ67-domain (IQD) protein family are involved in various aspects of normal plant growth and developmental processes as well as basal defence response. Although hundreds of IQD proteins have been identified, only a small number of IQDs have been functionally characterized. Moreover, no systematic study has been performed on moso bamboo. In this study, we performed for the first time a genome-wide identification and expression analysis of the IQD gene family in moso bamboo. We identified 29 non-redundant PeIQD encoding genes. Analysis of the evolutionary patterns and divergence revealed that the IQD genes underwent a large-scale event around 12 million years ago and the division times of IQD family genes between moso bamboo and rice, and, between moso bamboo and Brachypodium, were found to be 20–35 MYA and 25–40 MYA, respectively. We surveyed the putative promoter regions of the PeIQD genes, which showed that largely stress-related cis-elements existed in these genes. The expression profiles of the IQD genes shed light on their functional divergence. Additionally, a yeast two-hybrid assay proved that PeIQD8 can interact with PeCaM2 and that IQ or I in the IQ motif is required for PeIQD8 to combine with CaM2. PMID:27094318

  7. Genome-wide computational analysis reveals cardiomyocyte-specific transcriptional Cis-regulatory motifs that enable efficient cardiac gene therapy.

    PubMed

    Rincon, Melvin Y; Sarcar, Shilpita; Danso-Abeam, Dina; Keyaerts, Marleen; Matrai, Janka; Samara-Kuko, Ermira; Acosta-Sanchez, Abel; Athanasopoulos, Takis; Dickson, George; Lahoutte, Tony; De Bleser, Pieter; VandenDriessche, Thierry; Chuah, Marinee K

    2015-01-01

    Gene therapy is a promising emerging therapeutic modality for the treatment of cardiovascular diseases and hereditary diseases that afflict the heart. Hence, there is a need to develop robust cardiac-specific expression modules that allow for stable expression of the gene of interest in cardiomyocytes. We therefore explored a new approach based on a genome-wide bioinformatics strategy that revealed novel cardiac-specific cis-acting regulatory modules (CS-CRMs). These transcriptional modules contained evolutionary-conserved clusters of putative transcription factor binding sites that correspond to a "molecular signature" associated with robust gene expression in the heart. We then validated these CS-CRMs in vivo using an adeno-associated viral vector serotype 9 that drives a reporter gene from a quintessential cardiac-specific α-myosin heavy chain promoter. Most de novo designed CS-CRMs resulted in a >10-fold increase in cardiac gene expression. The most robust CRMs enhanced cardiac-specific transcription 70- to 100-fold. Expression was sustained and restricted to cardiomyocytes. We then combined the most potent CS-CRM4 with a synthetic heart and muscle-specific promoter (SPc5-12) and obtained a significant 20-fold increase in cardiac gene expression compared to the cytomegalovirus promoter. This study underscores the potential of rational vector design to improve the robustness of cardiac gene therapy.

  8. Genome-wide misexpression of X-linked versus autosomal genes associated with hybrid male sterility.

    PubMed

    Lu, Xuemei; Shapiro, Joshua A; Ting, Chau-Ti; Li, Yan; Li, Chunyan; Xu, Jin; Huang, Huanwei; Cheng, Ya-Jen; Greenberg, Anthony J; Li, Shou-Hsien; Wu, Mao-Lien; Shen, Yang; Wu, Chung-I

    2010-08-01

    Postmating reproductive isolation is often manifested as hybrid male sterility, for which X-linked genes are overrepresented (the so-called large X effect). In contrast, X-linked genes are significantly under-represented among testis-expressing genes. This seeming contradiction may be germane to the X:autosome imbalance hypothesis on hybrid sterility, in which the X-linked effect is mediated mainly through the misexpression of autosomal genes. In this study, we compared gene expression in fertile and sterile males in the hybrids between two Drosophila species. These hybrid males differ only in a small region of the X chromosome containing the Ods-site homeobox (OdsH) (also known as Odysseus) locus of hybrid sterility. Of genes expressed in the testis, autosomal genes were, indeed, more likely to be misexpressed than X-linked genes under the sterilizing action of OdsH. Since this mechanism of X:autosome interaction is only associated with spermatogenesis, a connection between X:autosome imbalance and the high rate of hybrid male sterility seems plausible.

  9. Genome-wide analysis for identification of salt-responsive genes in common wheat.

    PubMed

    Kawaura, Kanako; Mochida, Keiichi; Ogihara, Yasunari

    2008-08-01

    To identify salt-responsive genes in wheat, global expression analysis of transcripts was carried out using oligo-DNA microarrays. Microarrays have been designed from approximately 32,000 unique wheat genes classified from a large number of expressed sequence tags (ESTs). Two-week-old seedlings of wheat were treated with 150 mM NaCl for 1, 6, and 24 h, and their roots and shoots were separately subjected to analyses. Consequently, 5,996 genes showed changes in expression of more than twofold and were classified into 12 groups according to correlations in expression patterns. These salt-responsive genes were assigned functions using the Gene Ontology (GO). Genes assigned to transcription factor, transcription-regulator activity, and DNA-binding functions were preferentially classified into early response groups. On the other hand, those assigned transferase and transporter activity were classified into late response groups. These data suggest that multiple signal transduction pathways in response to salinity exist in wheat. Transcription factors (TFs) which have been reported as participants in salt-tolerant pathway changed their expression levels in response to salt treatment. Among them, only a few TFs show high sequence homologies to genes in rice. These investigations suggest that salt-responsive genes identified by this study are candidates for salt-stress tolerance uniquely in wheat.

  10. Gene-Environment Interactions in Genome-Wide Association Studies: Current Approaches and New Directions

    ERIC Educational Resources Information Center

    Winham, Stacey J.; Biernacka, Joanna M.

    2013-01-01

    Background: Complex psychiatric traits have long been thought to be the result of a combination of genetic and environmental factors, and gene-environment interactions are thought to play a crucial role in behavioral phenotypes and the susceptibility and progression of psychiatric disorders. Candidate gene studies to investigate hypothesized…

  11. Insights into GATA-1 Mediated Gene Activation versus Repression via Genome-wide Chromatin Occupancy Analysis

    PubMed Central

    Yu, Ming; Riva, Laura; Xie, Huafeng; Schindler, Yocheved; Moran, Tyler B.; Cheng, Yong; Yu, Duonan; Hardison, Ross; Weiss, Mitchell J; Orkin, Stuart H.; Bernstein, Bradley E.; Fraenkel, Ernest; Cantor, Alan B.

    2009-01-01

    Summary The transcription factor GATA-1 is required for terminal erythroid maturation and functions as an activator or repressor depending on gene context. Yet its in vivo site selectivity and ability to distinguish between activated versus repressed genes remain incompletely understood. In this study, we performed GATA-1 ChIP-seq in erythroid cells and compared it to GATA-1 induced gene expression changes. Bound and differentially expressed genes contain a greater number of GATA binding motifs, a higher frequency of palindromic GATA sites, and closer occupancy to the transcriptional start site versus non-differentially expressed genes. Moreover, we show that the transcription factor Zbtb7a occupies GATA-1 bound regions of some direct GATA-1 target genes, that the presence of SCL/TAL1 helps distinguish transcriptional activation versus repression, and that Polycomb Repressive Complex 2 (PRC2) is involved in epigenetic silencing of a subset of GATA-1 repressed genes. These data provide insights into GATA-1 mediated gene regulation in vivo. PMID:19941827

  12. Genome-Wide Identification and Functional Classification of Tomato (Solanum lycopersicum) Aldehyde Dehydrogenase (ALDH) Gene Superfamily

    PubMed Central

    Lopez-Valverde, Francisco J.; Robles-Bolivar, Paula; Lima-Cabello, Elena; Gachomo, Emma W.; Kotchoni, Simeon O.

    2016-01-01

    Aldehyde dehydrogenases (ALDHs) is a protein superfamily that catalyzes the oxidation of aldehyde molecules into their corresponding non-toxic carboxylic acids, and responding to different environmental stresses, offering promising genetic approaches for improving plant adaptation. The aim of the current study is the functional analysis for systematic identification of S. lycopersicum ALDH gene superfamily. We performed genome-based ALDH genes identification and functional classification, phylogenetic relationship, structure and catalytic domains analysis, and microarray based gene expression. Twenty nine unique tomato ALDH sequences encoding 11 ALDH families were identified, including a unique member of the family 19 ALDH. Phylogenetic analysis revealed 13 groups, with a conserved relationship among ALDH families. Functional structure analysis of ALDH2 showed a catalytic mechanism involving Cys-Glu couple. However, the analysis of ALDH3 showed no functional gene duplication or potential neo-functionalities. Gene expression analysis reveals that particular ALDH genes might respond to wounding stress increasing the expression as ALDH2B7. Overall, this study reveals the complexity of S. lycopersicum ALDH gene superfamily and offers new insights into the structure-functional features and evolution of ALDH gene families in vascular plants. The functional characterization of ALDHs is valuable and promoting molecular breeding in tomato for the improvement of stress tolerance and signaling. PMID:27755582

  13. Genome-wide analysis of syntenic gene deletion in the grasses.

    PubMed

    Schnable, James C; Freeling, Michael; Lyons, Eric

    2012-01-01

    The grasses, Poaceae, are one of the largest and most successful angiosperm families. Like many radiations of flowering plants, the divergence of the major grass lineages was preceded by a whole-genome duplication (WGD), although these events are not rare for flowering plants. By combining identification of syntenic gene blocks with measures of gene pair divergence and different frequencies of ancient gene loss, we have separated the two subgenomes present in modern grasses. Reciprocal loss of duplicated genes or genomic regions has been hypothesized to reproductively isolate populations and, thus, speciation. However, in contrast to previous studies in yeast and teleost fishes, we found very little evidence of reciprocal loss of homeologous genes between the grasses, suggesting that post-WGD gene loss may not be the cause of the grass radiation. The sets of homeologous and orthologous genes and predicted locations of deleted genes identified in this study, as well as links to the CoGe comparative genomics web platform for analyzing pan-grass syntenic regions, are provided along with this paper as a resource for the grass genetics community.

  14. Genome-Wide Expression Analysis of Soybean MADS Genes Showing Potential Function in the Seed Development

    PubMed Central

    Hu, Rui-Bo; Zhang, Xiao-Mei; Chen, Jian-Xin; Fu, Yong-Fu

    2013-01-01

    The MADS family is an ancient and best-studied transcription factor and plays fundamental roles in almost every developmental process in plants. In the plant evolutionary history, the whole genome duplication (WGD) events are important not only to the plant species evolution, but to expansion of members of the gene families. Soybean as a model legume crop has experience three rounds of WGD events. Members of some MIKCC subfamilies, such as SOC, AGL6, SQUA, SVP, AGL17 and DEF/GLO, were expanded after soybean three rounds of WGD events. And some MIKCC subfamilies, MIKC* and type I MADS families had experienced faster birth-and-death evolution and their traces before the Glycine WGD event were not found. Transposed duplication played important roles in tandem arrangements among the members of different subfamilies. According to the expression profiles of type I and MIKC paralog pair genes, the fates of MIKC paralog gene pairs were subfunctionalization, and the fates of type I MADS paralog gene pairs were nonfunctionalization. 137 out of 163 MADS genes were close to 186 loci within 2 Mb genomic regions associated with seed-relative QTLs, among which 115 genes expressed during the seed development. Although MIKCC genes kept the important and conserved functions of the flower development, most MIKCC genes showed potentially essential roles in the seed development as well as the type I MADS. PMID:23638026

  15. Genome-wide experimental determination of barriers to horizontal gene transfer

    SciTech Connect

    Rubin, Edward; Sorek, Rotem; Zhu, Yiwen; Creevey, Christopher J.; Francino, M. Pilar; Bork, Peer; Rubin, Edward M.

    2007-09-24

    Horizontal gene transfer, in which genetic material is transferred from the genome of one organism to another, has been investigated in microbial species mainly through computational sequence analyses. To address the lack of experimental data, we studied the attempted movement of 246,045 genes from 79 prokaryotic genomes into E. coli and identified genes that consistently fail to transfer. We studied the mechanisms underlying transfer inhibition by placing coding regions from different species under the control of inducible promoters. Their toxicity to the host inhibited transfer regardless of the species of origin and our data suggest that increased gene dosage and associated increased expression is a predominant cause for transfer failure. While these experimental studies examined transfer solely into E. coli, a computational analysis of gene transfer rates across available bacterial and archaeal genomes indicates that the barriers observed in our study are general across the tree of life.

  16. The catalase gene family in cucumber: genome-wide identification and organization.

    PubMed

    Hu, Lifang; Yang, Yingui; Jiang, Lunwei; Liu, Shiqiang

    2016-01-01

    Catalase (CAT) is a common antioxidant enzyme in almost all living organisms. Currently, detailed reports on cucumber (Cucumis sativus L.) CAT (CsCAT) genes and tissue expression profiling are limited. In the present study, four candidate CsCAT genes were identified in cucumber. Phylogenetic analysis indicated that CsCAT1-CsCAT3 are closely related to Arabidopsis AtCAT1-AtCAT3, but no obvious counterpart was observed for CsCAT4. Intron/exon structure analysis revealed that only one of the 15 positions was completely conserved. Motif analysis showed that, unlike the CAT genes of other species, none of CsCAT genes contained all 10 motifs. Expression data showed that transcripts of all of the CsCAT genes, except CsCAT4, were detected in five tissues. Moreover, their transcription levels displayed differences under different stress treatments. PMID:27560990

  17. Genome-wide analysis of Aux/IAA and ARF gene families in Populus trichocarpa

    SciTech Connect

    Kalluri, Udaya C; DiFazio, Stephen P; Brunner, A.; Tuskan, Gerald A

    2007-01-01

    Auxin/Indole-3-Acetic Acid (Aux/IAA) and Auxin Response Factor (ARF) transcription factors are key regulators of auxin responses in plants. A total of 35 Aux/IAA and 39 ARF genes were identified in the Populus genome. Comparative phylogenetic analysis revealed that the subgroups PoptrARF2, 6, 9 and 16 and PoptrIAA3, 16, 27 and 29 have differentially expanded in Populus relative to Arabidopsis. Activator ARFs were found to be two fold-overrepresented in the Populus genome. PoptrIAA and PoptrARF gene families appear to have expanded due to high segmental and low tandem duplication events. Furthermore, expression studies showed that genes in the expanded PoptrIAA3 subgroup display differential expression. The gene-family analysis reported here will be useful in conducting future functional genomics studies to understand how the molecular roles of these large gene families translate into a diversity of biologically meaningful auxin effects.

  18. The catalase gene family in cucumber: genome-wide identification and organization

    PubMed Central

    Hu, Lifang; Yang, Yingui; Jiang, Lunwei; Liu, Shiqiang

    2016-01-01

    Abstract Catalase (CAT) is a common antioxidant enzyme in almost all living organisms. Currently, detailed reports on cucumber (Cucumis sativus L.) CAT (CsCAT) genes and tissue expression profiling are limited. In the present study, four candidate CsCAT genes were identified in cucumber. Phylogenetic analysis indicated that CsCAT1-CsCAT3 are closely related to Arabidopsis AtCAT1-AtCAT3, but no obvious counterpart was observed for CsCAT4. Intron/exon structure analysis revealed that only one of the 15 positions was completely conserved. Motif analysis showed that, unlike the CAT genes of other species, none of CsCAT genes contained all 10 motifs. Expression data showed that transcripts of all of the CsCAT genes, except CsCAT4, were detected in five tissues. Moreover, their transcription levels displayed differences under different stress treatments. PMID:27560990

  19. A genome-wide approach reveals novel imprinted genes expressed in the human placenta

    PubMed Central

    Barbaux, Sandrine; Gascoin-Lachambre, Géraldine; Buffat, Christophe; Monnier, Paul; Mondon, Françoise; Tonanny, Marie-Béatrice; Pinard, Amélie; Auer, Jana; Bessières, Bettina; Barlier, Anne; Jacques, Sébastien; Simeoni, Umberto; Dandolo, Luisa; Letourneur, Franck; Jammes, Hélène; Vaiman, Daniel

    2012-01-01

    Genomic imprinting characterizes genes with a monoallelic expression, which is dependent on the parental origin of each allele. Approximately 150 imprinted genes are known to date, in humans and mice but, though computational searches have tried to extract intrinsic characteristics of these genes to identify new ones, the existing list is probably far from being comprehensive. We used a high-throughput strategy by diverting the classical use of genotyping microarrays to compare the genotypes of mRNA/cDNA vs. genomic DNA to identify new genes presenting monoallelic expression, starting from human placental material. After filtering of data, we obtained a list of 1,082 putative candidate monoallelic SNPs located in more than one hundred candidate genes. Among these, we found known imprinted genes, such as IPW, GRB10, INPP5F and ZNF597, which contribute to validate the approach. We also explored some likely candidates of our list and identified seven new imprinted genes, including ZFAT, ZFAT-AS1, GLIS3, NTM, MAGI2, ZC3H12Cand LIN28B, four of which encode zinc finger transcription factors. They are, however, not imprinted in the mouse placenta, except for Magi2. We analyzed in more details the ZFAT gene, which is paternally expressed in the placenta (as ZFAT-AS1, a non-coding antisense RNA) but biallelic in other tissues. The ZFAT protein is expressed in endothelial cells, as well as in syncytiotrophoblasts. The expression of this gene is, moreover, downregulated in placentas from complicated pregnancies. With this work we increase by about 10% the number of known imprinted genes in humans. PMID:22894909

  20. Genome-wide identification and characterization of Fox genes in the silkworm, Bombyx mori.

    PubMed

    Song, JiangBo; Li, ZhiQuan; Tong, XiaoLing; Chen, Cong; Chen, Min; Meng, Gang; Chen, Peng; Li, ChunLin; Xin, YaQun; Gai, TingTing; Dai, FangYin; Lu, Cheng

    2015-09-01

    The forkhead box (Fox) transcription factor family has a characteristic of forkhead domain, a winged DNA-binding domain. The Fox genes have been classified into 23 subfamilies, designated FoxA to FoxS, of which the FoxR and FoxS subfamilies are specific to vertebrates. In this review, using whole-genome scanning, we identified 17 distinct Fox genes distributed on 13 chromosomes of the silkworm, Bombyx mori. A phylogenetic tree showed that the silkworm Fox genes could be classified into 13 subfamilies. The FoxK subfamily is specifically absent from the silkworm, although it is present in other lepidopteran insects, including Danaus plexippus and Heliconius melpomene. Microarray data revealed that the Fox genes have distinct expression patterns in the tissues on day 3 of the 5th instar larva. A Gene Ontology analysis suggested that the Fox genes have roles in cellular components, molecular functions, and biological processes, except in pore complex biogenesis. An analysis of the selective pressure on the proteins indicated that most of the amino acid sites in the Fox proteins are undergoing strong purifying selection. Here, we summarize the general characteristics of the Fox genes in the silkworm, which should support further functional studies of the silkworm Fox proteins.

  1. Genome wide in silico characterization of Dof gene families of pigeonpea (Cajanus cajan (L) Millsp.).

    PubMed

    Malviya, N; Gupta, S; Singh, V K; Yadav, M K; Bisht, N C; Sarangi, B K; Yadav, D

    2015-02-01

    The DNA binding with One Finger (Dof) protein is a plant specific transcription factor involved in the regulation of wide range of processes. The analysis of whole genome sequence of pigeonpea has identified 38 putative Dof genes (CcDof) distributed on 8 chromosomes. A total of 17 out of 38 CcDof genes were found to be intronless. A comprehensive in silico characterization of CcDof gene family including the gene structure, chromosome location, protein motif, phylogeny, gene duplication and functional divergence has been attempted. The phylogenetic analysis resulted in 3 major clusters with closely related members in phylogenetic tree revealed common motif distribution. The in silico cis-regulatory element analysis revealed functional diversity with predominance of light responsive and stress responsive elements indicating the possibility of these CcDof genes to be associated with photoperiodic control and biotic and abiotic stress. The duplication pattern showed that tandem duplication is predominant over segmental duplication events. The comparative phylogenetic analysis of these Dof proteins along with 78 soybean, 36 Arabidopsis and 30 rice Dof proteins revealed 7 major clusters. Several groups of orthologs and paralogs were identified based on phylogenetic tree constructed. Our study provides useful information for functional characterization of CcDof genes. PMID:25344821

  2. Genome-wide analysis of the GRAS gene family in physic nut (Jatropha curcas L.).

    PubMed

    Wu, Z Y; Wu, P Z; Chen, Y P; Li, M R; Wu, G J; Jiang, H W

    2015-01-01

    GRAS proteins play vital roles in plant growth and development. Physic nut (Jatropha curcas L.) was found to have a total of 48 GRAS family members (JcGRAS), 15 more than those found in Arabidopsis. The JcGRAS genes were divided into 12 subfamilies or 15 ancient monophyletic lineages based on the phylogenetic analysis of GRAS proteins from both flowering and lower plants. The functions of GRAS genes in 9 subfamilies have been reported previously for several plants, while the genes in the remaining 3 subfamilies were of unknown function; we named the latter families U1 to U3. No member of U3 subfamily is present in Arabidopsis and Poaceae species according to public genome sequence data. In comparison with the number of GRAS genes in Arabidopsis, more were detected in physic nut, resulting from the retention of many ancient GRAS subfamilies and the formation of tandem repeats during evolution. No evidence of recent duplication among JcGRAS genes was observed in physic nut. Based on digital gene expression data, 21 of the 48 genes exhibited differential expression in four tissues analyzed. Two members of subfamily U3 were expressed only in buds and flowers, implying that they may play specific roles. Our results provide valuable resources for future studies on the functions of GRAS proteins in physic nut. PMID:26782574

  3. The MAPKKK gene family in Gossypium raimondii: genome-wide identification, classification and expression analysis.

    PubMed

    Yin, Zujun; Wang, Junjuan; Wang, Delong; Fan, Weili; Wang, Shuai; Ye, Wuwei

    2013-01-01

    Mitogen-activated protein kinase (MAPK) cascades are conserved signal transduction pathways in all eukaryotic organisms. MAPKKKs (MAPK kinase kinases) operate at the top levels of these cascades. Recently, this family of genes has been systematically investigated in Arabidopsis, rice and maize, but has not yet been characterized in cotton. In this study, we identified 78 putative MAPKKK genes in the genome of the diploid cotton, Gossypium raimondii. They were classified into three subfamilies, of which 12 were ZIK, 22 were MEKK and 44 were Raf. The ZIK and MEKK genes displayed a scattered genomic distribution across 11 of the 13 chromosomes, whereas Raf genes were distributed across the entire genome. Their conserved patterns observed for introns and additional domains were consistent with the evolutionary relationships inferred from the phylogenetic analysis within subfamily. Transcriptome sequencing data were used to investigate their transcript profiles in mature leaves, 0 day and 3 days post-anthesis (DPA) ovules. Sixty MAPKKK genes were expressed, of which 41 were strongly expressed in mature leaves. Twelve MAPKKK genes were more highly expressed in 3-DPA ovules than in 0-DPA ovules. Our results provide a foundation for future evolutionary and functional characterizations of MAPKKK genes in cotton and probably other Gossypium plants.

  4. Genome-wide analysis of alternative promoters of human genes using a custom promoter tiling array

    PubMed Central

    Singer, Gregory AC; Wu, Jiejun; Yan, Pearlly; Plass, Christoph; Huang, Tim HM; Davuluri, Ramana V

    2008-01-01

    Background Independent lines of evidence suggested that a large fraction of human genes possess multiple promoters driving gene expression from distinct transcription start sites. Understanding which promoter is employed in which cellular context is required to unravel gene regulatory networks within the cell. Results We have developed a custom microarray platform that tiles roughly 35,000 alternative putative promoters from nearly 7,000 genes in the human genome. To demonstrate the utility of this array platform, we have analyzed the patterns of promoter usage in 17β-estradiol (E2)-treated and untreated MCF7 cells and show widespread usage of alternative promoters. Most intriguingly, we show that the downstream promoter in E2-sensitive multiple promoter genes tends to be very close to the 3'-terminus of the gene, suggesting exotic mechanisms of expression regulation in these genes. Conclusion The usage of alternative promoters greatly multiplies the transcriptional complexity available within the human genome. The fact that many of these promoters are incapable of driving the synthesis of a meaningful protein-encoding transcript further complicates the story. PMID:18655706

  5. Genome wide transcriptome profiling reveals differential gene expression in secondary metabolite pathway of Cymbopogon winterianus

    PubMed Central

    Devi, Kamalakshi; Mishra, Surajit K.; Sahu, Jagajjit; Panda, Debashis; Modi, Mahendra K.; Sen, Priyabrata

    2016-01-01

    Advances in transcriptome sequencing provide fast, cost-effective and reliable approach to generate large expression datasets especially suitable for non-model species to identify putative genes, key pathway and regulatory mechanism. Citronella (Cymbopogon winterianus) is an aromatic medicinal grass used for anti-tumoral, antibacterial, anti-fungal, antiviral, detoxifying and natural insect repellent properties. Despite of having number of utilities, the genes involved in terpenes biosynthetic pathway is not yet clearly elucidated. The present study is a pioneering attempt to generate an exhaustive molecular information of secondary metabolite pathway and to increase genomic resources in Citronella. Using high-throughput RNA-Seq technology, root and leaf transcriptome was analysed at an unprecedented depth (11.7 Gb). Targeted searches identified majority of the genes associated with metabolic pathway and other natural product pathway viz. antibiotics synthesis along with many novel genes. Terpenoid biosynthesis genes comparative expression results were validated for 15 unigenes by RT-PCR and qRT-PCR. Thus the coverage of these transcriptome is comprehensive enough to discover all known genes of major metabolic pathways. This transcriptome dataset can serve as important public information for gene expression, genomics and function genomics studies in Citronella and shall act as a benchmark for future improvement of the crop. PMID:26877149

  6. Genome-wide screening of Saccharomyces cerevisiae genes regulated by vanillin.

    PubMed

    Park, Eun-Hee; Kim, Myoung-Dong

    2015-01-01

    During pretreatment of lignocellulosic biomass, a variety of fermentation inhibitors, including acetic acid and vanillin, are released. Using DNA microarray analysis, this study explored genes of the budding yeast Saccharomyces cerevisiae that respond to vanillin-induced stress. The expression of 273 genes was upregulated and that of 205 genes was downregulated under vanillin stress. Significantly induced genes included MCH2, SNG1, GPH1, and TMA10, whereas NOP2, UTP18, FUR1, and SPR1 were down regulated. Sequence analysis of the 5'-flanking region of upregulated genes suggested that vanillin might regulate gene expression in a stress response element (STRE)-dependent manner, in addition to a pathway that involved the transcription factor Yap1p. Retardation in the cell growth of mutant strains indicated that MCH2, SNG1, and GPH1 are intimately involved in vanillin stress response. Deletion of the genes whose expression levels were decreased under vanillin stress did not result in a notable change in S. cerevisiae growth under vanillin stress. This study will provide the basis for a better understanding of the stress response of the yeast S. cerevisiae to fermentation inhibitors.

  7. Genome-wide profiles of methylation, microRNAs, and gene expression in chemoresistant breast cancer

    PubMed Central

    He, Dong-Xu; Gu, Feng; Gao, Fei; Hao, Jun-jun; Gong, Desheng; Gu, Xiao-Ting; Mao, Ai-Qin; Jin, Jian; Fu, Li; Ma, Xin

    2016-01-01

    Cancer chemoresistance is regulated by complex genetic and epigenetic networks. In this study, the features of gene expression, methylation, and microRNA (miRNA) expression were investigated with high-throughput sequencing in human breast cancer MCF-7 cells resistant to adriamycin (MCF-7/ADM) and paclitaxel (MCF-7/PTX). We found that: ① both of the chemoresistant cell lines had similar, massive changes in gene expression, methylation, and miRNA expression versus chemosensitive controls. ② Pairwise integration of the data highlighted sets of genes that were regulated by either methylation or miRNAs, and sets of miRNAs whose expression was controlled by DNA methylation in chemoresistant cells. ③ By combining the three sets of high-throughput data, we obtained a list of genes whose expression was regulated by both methylation and miRNAs in chemoresistant cells; ④ Expression of these genes was then validated in clinical breast cancer samples to generate a 17-gene signature that showed good predictive and prognostic power in triple-negative breast cancer patients receiving anthracycline-taxane-based neoadjuvant chemotherapy. In conclusion, our results have generated a new workflow for the integrated analysis of the effects of miRNAs and methylation on gene expression during the development of chemoresistance. PMID:27094684

  8. Genome wide transcriptome profiling reveals differential gene expression in secondary metabolite pathway of Cymbopogon winterianus.

    PubMed

    Devi, Kamalakshi; Mishra, Surajit K; Sahu, Jagajjit; Panda, Debashis; Modi, Mahendra K; Sen, Priyabrata

    2016-01-01

    Advances in transcriptome sequencing provide fast, cost-effective and reliable approach to generate large expression datasets especially suitable for non-model species to identify putative genes, key pathway and regulatory mechanism. Citronella (Cymbopogon winterianus) is an aromatic medicinal grass used for anti-tumoral, antibacterial, anti-fungal, antiviral, detoxifying and natural insect repellent properties. Despite of having number of utilities, the genes involved in terpenes biosynthetic pathway is not yet clearly elucidated. The present study is a pioneering attempt to generate an exhaustive molecular information of secondary metabolite pathway and to increase genomic resources in Citronella. Using high-throughput RNA-Seq technology, root and leaf transcriptome was analysed at an unprecedented depth (11.7 Gb). Targeted searches identified majority of the genes associated with metabolic pathway and other natural product pathway viz. antibiotics synthesis along with many novel genes. Terpenoid biosynthesis genes comparative expression results were validated for 15 unigenes by RT-PCR and qRT-PCR. Thus the coverage of these transcriptome is comprehensive enough to discover all known genes of major metabolic pathways. This transcriptome dataset can serve as important public information for gene expression, genomics and function genomics studies in Citronella and shall act as a benchmark for future improvement of the crop. PMID:26877149

  9. Genome-wide analysis of YY2 versus YY1 target genes

    PubMed Central

    Chen, Li; Shioda, Toshi; Coser, Kathryn R.; Lynch, Mary C.; Yang, Chuanwei; Schmidt, Emmett V.

    2010-01-01

    Yin Yang 1 (YY1) is a critical transcription factor controlling cell proliferation, development and DNA damage responses. Retrotranspositions have independently generated additional YY family members in multiple species. Although Drosophila YY1 [pleiohomeotic (Pho)] and its homolog [pleiohomeotic-like (Phol)] redundantly control homeotic gene expression, the regulatory contributions of YY1-homologs have not yet been examined in other species. Indeed, targets for the mammalian YY1 homolog YY2 are completely unknown. Using gene set enrichment analysis, we found that lentiviral constructs containing short hairpin loop inhibitory RNAs for human YY1 (shYY1) and its homolog YY2 (shYY2) caused significant changes in both shared and distinguishable gene sets in human cells. Ribosomal protein genes were the most significant gene set upregulated by both shYY1 and shYY2, although combined shYY1/2 knock downs were not additive. In contrast, shYY2 reversed the anti-proliferative effects of shYY1, and shYY2 particularly altered UV damage response, platelet-specific and mitochondrial function genes. We found that decreases in YY1 or YY2 caused inverse changes in UV sensitivity, and that their combined loss reversed their respective individual effects. Our studies show that human YY2 is not redundant to YY1, and YY2 is a significant regulator of genes previously identified as uniquely responding to YY1. PMID:20215434

  10. Genome-Wide Identification and Expression Analysis of Two-Component System Genes in Tomato.

    PubMed

    He, Yanjun; Liu, Xue; Ye, Lei; Pan, Changtian; Chen, Lifei; Zou, Tao; Lu, Gang

    2016-01-01

    The two-component system (TCS), which comprises histidine kinases (HKs), phosphotransfers (HPs), and response regulator proteins (RRs), plays pivotal roles in regulating plant growth, development, and responses to biotic and abiotic stresses. TCS genes have been comprehensively identified and investigated in various crops but poorly characterized in tomato. In this work, a total of 65 TCS genes consisting of 20 HK(L)s, six HPs, and 39 RRs were identified from tomato genome. The classification, gene structures, conserved domains, chromosome distribution, phylogenetic relationship, gene duplication events, and subcellular localization of the TCS gene family were predicted and analyzed in detail. The amino acid sequences of tomato TCS family members, except those of type-B RRs, are highly conserved. The gene duplication events of the TCS family mainly occurred in the RR family. Furthermore, the expansion of RRs was attributed to both segment and tandem duplication. The subcellular localizations of the selected green fluorescent protein (GFP) fusion proteins exhibited a diverse subcellular targeting, thereby confirming their predicted divergent functionality. The majority of TCS family members showed distinct organ- or development-specific expression patterns. In addition, most of TCS genes were induced by abiotic stresses and exogenous phytohormones. The full elucidation of TCS elements will be helpful for comprehensive analysis of the molecular biology and physiological role of the TCS superfamily. PMID:27472316

  11. Genome wide transcriptome profiling reveals differential gene expression in secondary metabolite pathway of Cymbopogon winterianus.

    PubMed

    Devi, Kamalakshi; Mishra, Surajit K; Sahu, Jagajjit; Panda, Debashis; Modi, Mahendra K; Sen, Priyabrata

    2016-01-01

    Advances in transcriptome sequencing provide fast, cost-effective and reliable approach to generate large expression datasets especially suitable for non-model species to identify putative genes, key pathway and regulatory mechanism. Citronella (Cymbopogon winterianus) is an aromatic medicinal grass used for anti-tumoral, antibacterial, anti-fungal, antiviral, detoxifying and natural insect repellent properties. Despite of having number of utilities, the genes involved in terpenes biosynthetic pathway is not yet clearly elucidated. The present study is a pioneering attempt to generate an exhaustive molecular information of secondary metabolite pathway and to increase genomic resources in Citronella. Using high-throughput RNA-Seq technology, root and leaf transcriptome was analysed at an unprecedented depth (11.7 Gb). Targeted searches identified majority of the genes associated with metabolic pathway and other natural product pathway viz. antibiotics synthesis along with many novel genes. Terpenoid biosynthesis genes comparative expression results were validated for 15 unigenes by RT-PCR and qRT-PCR. Thus the coverage of these transcriptome is comprehensive enough to discover all known genes of major metabolic pathways. This transcriptome dataset can serve as important public information for gene expression, genomics and function genomics studies in Citronella and shall act as a benchmark for future improvement of the crop.

  12. Genome-Wide Identification of the Invertase Gene Family in Populus

    PubMed Central

    Su, Xiaoxing; Rao, Pian; An, Xinmin

    2015-01-01

    Invertase plays a crucial role in carbohydrate partitioning and plant development as it catalyses the irreversible hydrolysis of sucrose into glucose and fructose. The invertase family in plants is composed of two sub-families: acid invertases, which are targeted to the cell wall and vacuole; and neutral/alkaline invertases, which function in the cytosol. In this study, 5 cell wall invertase genes (PtCWINV1-5), 3 vacuolar invertase genes (PtVINV1-3) and 16 neutral/alkaline invertase genes (PtNINV1-16) were identified in the Populus genome and found to be distributed on 14 chromosomes. A comprehensive analysis of poplar invertase genes was performed, including structures, chromosome location, phylogeny, evolutionary pattern and expression profiles. Phylogenetic analysis indicated that the two sub-families were both divided into two clades. Segmental duplication is contributed to neutral/alkaline sub-family expansion. Furthermore, the Populus invertase genes displayed differential expression in roots, stems, leaves, leaf buds and in response to salt/cold stress and pathogen infection. In addition, the analysis of enzyme activity and sugar content revealed that invertase genes play key roles in the sucrose metabolism of various tissues and organs in poplar. This work lays the foundation for future functional analysis of the invertase genes in Populus and other woody perennials. PMID:26393355

  13. Genome-Wide Identification and Expression Analysis of Two-Component System Genes in Tomato

    PubMed Central

    He, Yanjun; Liu, Xue; Ye, Lei; Pan, Changtian; Chen, Lifei; Zou, Tao; Lu, Gang

    2016-01-01

    The two-component system (TCS), which comprises histidine kinases (HKs), phosphotransfers (HPs), and response regulator proteins (RRs), plays pivotal roles in regulating plant growth, development, and responses to biotic and abiotic stresses. TCS genes have been comprehensively identified and investigated in various crops but poorly characterized in tomato. In this work, a total of 65 TCS genes consisting of 20 HK(L)s, six HPs, and 39 RRs were identified from tomato genome. The classification, gene structures, conserved domains, chromosome distribution, phylogenetic relationship, gene duplication events, and subcellular localization of the TCS gene family were predicted and analyzed in detail. The amino acid sequences of tomato TCS family members, except those of type-B RRs, are highly conserved. The gene duplication events of the TCS family mainly occurred in the RR family. Furthermore, the expansion of RRs was attributed to both segment and tandem duplication. The subcellular localizations of the selected green fluorescent protein (GFP) fusion proteins exhibited a diverse subcellular targeting, thereby confirming their predicted divergent functionality. The majority of TCS family members showed distinct organ- or development-specific expression patterns. In addition, most of TCS genes were induced by abiotic stresses and exogenous phytohormones. The full elucidation of TCS elements will be helpful for comprehensive analysis of the molecular biology and physiological role of the TCS superfamily. PMID:27472316

  14. Genome-wide distribution of Auts2 binding localizes with active neurodevelopmental genes

    PubMed Central

    Oksenberg, N; Haliburton, G D E; Eckalbar, W L; Oren, I; Nishizaki, S; Murphy, K; Pollard, K S; Birnbaum, R Y; Ahituv, N

    2014-01-01

    The autism susceptibility candidate 2 gene (AUTS2) has been associated with multiple neurological diseases including autism spectrum disorders (ASDs). Previous studies showed that AUTS2 has an important neurodevelopmental function and is a suspected master regulator of genes implicated in ASD-related pathways. However, the regulatory role and targets of Auts2 are not well known. Here, by using ChIP-seq (chromatin immunoprecipitation followed by deep sequencing) and RNA-seq on mouse embryonic day 16.5 forebrains, we elucidated the gene regulatory networks of Auts2. We find that the majority of promoters bound by Auts2 belong to genes highly expressed in the developing forebrain, suggesting that Auts2 is involved in transcriptional activation. Auts2 non-promoter-bound regions significantly overlap developing brain-associated enhancer marks and are located near genes involved in neurodevelopment. Auts2-marked sequences are enriched for binding site motifs of neurodevelopmental transcription factors, including Pitx3 and TCF3. In addition, we characterized two functional brain enhancers marked by Auts2 near NRXN1 and ATP2B2, both ASD-implicated genes. Our results implicate Auts2 as an active regulator of important neurodevelopmental genes and pathways and identify novel genomic regions that could be associated with ASD and other neurodevelopmental diseases. PMID:25180570

  15. Genome-Wide Identification of the Invertase Gene Family in Populus.

    PubMed

    Chen, Zhong; Gao, Kai; Su, Xiaoxing; Rao, Pian; An, Xinmin

    2015-01-01

    Invertase plays a crucial role in carbohydrate partitioning and plant development as it catalyses the irreversible hydrolysis of sucrose into glucose and fructose. The invertase family in plants is composed of two sub-families: acid invertases, which are targeted to the cell wall and vacuole; and neutral/alkaline invertases, which function in the cytosol. In this study, 5 cell wall invertase genes (PtCWINV1-5), 3 vacuolar invertase genes (PtVINV1-3) and 16 neutral/alkaline invertase genes (PtNINV1-16) were identified in the Populus genome and found to be distributed on 14 chromosomes. A comprehensive analysis of poplar invertase genes was performed, including structures, chromosome location, phylogeny, evolutionary pattern and expression profiles. Phylogenetic analysis indicated that the two sub-families were both divided into two clades. Segmental duplication is contributed to neutral/alkaline sub-family expansion. Furthermore, the Populus invertase genes displayed differential expression in roots, stems, leaves, leaf buds and in response to salt/cold stress and pathogen infection. In addition, the analysis of enzyme activity and sugar content revealed that invertase genes play key roles in the sucrose metabolism of various tissues and organs in poplar. This work lays the foundation for future functional analysis of the invertase genes in Populus and other woody perennials. PMID:26393355

  16. Genome-Wide Identification and Expression Analysis of the WRKY Gene Family in Cassava

    PubMed Central

    Wei, Yunxie; Shi, Haitao; Xia, Zhiqiang; Tie, Weiwei; Ding, Zehong; Yan, Yan; Wang, Wenquan; Hu, Wei; Li, Kaimian

    2016-01-01

    The WRKY family, a large family of transcription factors (TFs) found in higher plants, plays central roles in many aspects of physiological processes and adaption to environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta). In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing three exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava. PMID:26904033

  17. Genome-Wide Analysis Indicates Lineage-Specific Gene Loss during Papilionoideae Evolution.

    PubMed

    Gu, Yongzhe; Xing, Shilai; He, Chaoying

    2016-03-01

    Gene loss is the driving force for changes in genome and morphology; however, this particular evolutionary event has been poorly investigated in leguminous plants. Legumes (Fabaceae) have some lineage-specific and diagnostic characteristics that are distinct from other angiosperms. To understand the potential role of gene loss in the evolution of legumes, we compared six genome-sequenced legume species of Papilionoideae, the largest representative clade of Fabaceae, such as Glycine max, with 34 nonlegume plant species, such as Arabidopsis thaliana. The results showed that the putative orthologs of the 34 Arabidopsis genes belonging to 29 gene families were absent in these legume species but these were conserved in the sequenced nonlegume angiosperm lineages. Further evolutionary analyses indicated that the orthologs of these genes were almost completely lost in the Papillionoideae ancestors, thus designated as the legume lost genes (LLGs), and these underwent purifying selection in nonlegume plants. Most LLGs were functionally unknown. In Arabidopsis, two LLGs were well-known genes that played a role in plant immunity such as HARMLESS TO OZONE LAYER 1 and HOPZ-ACTIVATED RESISTANCE 1, and 16 additional LLGs were predicted to participate in plant-pathogen interactions in in silico expression and protein-protein interaction network analyses. Most of these LLGs' orthologs in various plants were also found to be associated with biotic stress response, indicating the conserved role of these genes in plant defense. The evolutionary implication of LLGs during the development of the ability of symbiotic nitrogen fixation involving plant and bacterial interactions, which is a well-known characteristic of most legumes, is also discussed. Our work sheds light on the evolutionary implication of gene loss events in Papilionoideae evolution, as well as provides new insights into crop design to improve nitrogen fixation capacity. PMID:26868598

  18. Genome-Wide Analysis Indicates Lineage-Specific Gene Loss during Papilionoideae Evolution

    PubMed Central

    Gu, Yongzhe; Xing, Shilai; He, Chaoying

    2016-01-01

    Gene loss is the driving force for changes in genome and morphology; however, this particular evolutionary event has been poorly investigated in leguminous plants. Legumes (Fabaceae) have some lineage-specific and diagnostic characteristics that are distinct from other angiosperms. To understand the potential role of gene loss in the evolution of legumes, we compared six genome-sequenced legume species of Papilionoideae, the largest representative clade of Fabaceae, such as Glycine max, with 34 nonlegume plant species, such as Arabidopsis thaliana. The results showed that the putative orthologs of the 34 Arabidopsis genes belonging to 29 gene families were absent in these legume species but these were conserved in the sequenced nonlegume angiosperm lineages. Further evolutionary analyses indicated that the orthologs of these genes were almost completely lost in the Papillionoideae ancestors, thus designated as the legume lost genes (LLGs), and these underwent purifying selection in nonlegume plants. Most LLGs were functionally unknown. In Arabidopsis, two LLGs were well-known genes that played a role in plant immunity such as HARMLESS TO OZONE LAYER 1 and HOPZ-ACTIVATED RESISTANCE 1, and 16 additional LLGs were predicted to participate in plant–pathogen interactions in in silico expression and protein–protein interaction network analyses. Most of these LLGs’ orthologs in various plants were also found to be associated with biotic stress response, indicating the conserved role of these genes in plant defense. The evolutionary implication of LLGs during the development of the ability of symbiotic nitrogen fixation involving plant and bacterial interactions, which is a well-known characteristic of most legumes, is also discussed. Our work sheds light on the evolutionary implication of gene loss events in Papilionoideae evolution, as well as provides new insights into crop design to improve nitrogen fixation capacity. PMID:26868598

  19. Genome-Wide Identification and Expression Analysis of the WRKY Gene Family in Cassava.

    PubMed

    Wei, Yunxie; Shi, Haitao; Xia, Zhiqiang; Tie, Weiwei; Ding, Zehong; Yan, Yan; Wang, Wenquan; Hu, Wei; Li, Kaimian

    2016-01-01

    The WRKY family, a large family of transcription factors (TFs) found in higher plants, plays central roles in many aspects of physiological processes and adaption to environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta). In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing three exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava. PMID:26904033

  20. Genome-wide analysis of the MADS-box gene family in Brassica rapa (Chinese cabbage).

    PubMed

    Duan, Weike; Song, Xiaoming; Liu, Tongkun; Huang, Zhinan; Ren, Jun; Hou, Xilin; Li, Ying

    2015-02-01

    The MADS-box gene family is an ancient and well-studied transcription factor family that functions in almost every developmental process in plants. There are a number of reports about the MADS-box family in different plant species, but systematic analysis of the MADS-box transcription factor family in Brassica rapa (Chinese cabbage) is still lacking. In this study, 160 MADS-box transcription factors were identified from the entire Chinese cabbage genome and compared with the MADS-box factors from 21 other representative plant species. A detailed list of MADS proteins from these 22 species was sorted. Phylogenetic analysis of the BrMADS genes, together with their Arabidopsis and rice counterparts, showed that the BrMADS genes were categorised into type I (Mα, Mβ, Mγ) and type II (MIKC(C), MIKC*) groups, and the MIKC(C) proteins were further divided into 13 subfamilies. The Chinese cabbage type II group has 95 members, which is twice as much as the Arabidopsis type II group, indicating that the Chinese cabbage type II genes have been retained more frequently than the type I genes. Finally, RNA-seq transcriptome data and quantitative real-time PCR analysis revealed that BrMADS genes are expressed in a tissue-specific manner similar to Arabidopsis. Interestingly, a number of BrMIKC genes showed responses to different abiotic stress treatments, suggesting a function for some of the genes in these processes as well. Taken together, the characterization of the B. rapa MADS-box family presented here, will certainly help in the selection of appropriate candidate genes and further facilitate functional studies in Chinese cabbage.

  1. Genome-Wide Identification and Expression Analysis of the WRKY Gene Family in Cassava.

    PubMed

    Wei, Yunxie; Shi, Haitao; Xia, Zhiqiang; Tie, Weiwei; Ding, Zehong; Yan, Yan; Wang, Wenquan; Hu, Wei; Li, Kaimian

    2016-01-01

    The WRKY family, a large family of transcription factors (TFs) found in higher plants, plays central roles in many aspects of physiological processes and adaption to environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta). In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing three exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava.

  2. Genome-Wide Analysis Indicates Lineage-Specific Gene Loss during Papilionoideae Evolution.

    PubMed

    Gu, Yongzhe; Xing, Shilai; He, Chaoying

    2016-03-01

    Gene loss is the driving force for changes in genome and morphology; however, this particular evolutionary event has been poorly investigated in leguminous plants. Legumes (Fabaceae) have some lineage-specific and diagnostic characteristics that are distinct from other angiosperms. To understand the potential role of gene loss in the evolution of legumes, we compared six genome-sequenced legume species of Papilionoideae, the largest representative clade of Fabaceae, such as Glycine max, with 34 nonlegume plant species, such as Arabidopsis thaliana. The results showed that the putative orthologs of the 34 Arabidopsis genes belonging to 29 gene families were absent in these legume species but these were conserved in the sequenced nonlegume angiosperm lineages. Further evolutionary analyses indicated that the orthologs of these genes were almost completely lost in the Papillionoideae ancestors, thus designated as the legume lost genes (LLGs), and these underwent purifying selection in nonlegume plants. Most LLGs were functionally unknown. In Arabidopsis, two LLGs were well-known genes that played a role in plant immunity such as HARMLESS TO OZONE LAYER 1 and HOPZ-ACTIVATED RESISTANCE 1, and 16 additional LLGs were predicted to participate in plant-pathogen interactions in in silico expression and protein-protein interaction network analyses. Most of these LLGs' orthologs in various plants were also found to be associated with biotic stress response, indicating the conserved role of these genes in plant defense. The evolutionary implication of LLGs during the development of the ability of symbiotic nitrogen fixation involving plant and bacterial interactions, which is a well-known characteristic of most legumes, is also discussed. Our work sheds light on the evolutionary implication of gene loss events in Papilionoideae evolution, as well as provides new insights into crop design to improve nitrogen fixation capacity.

  3. Genome-wide analysis of the MADS-box gene family in Brassica rapa (Chinese cabbage).

    PubMed

    Duan, Weike; Song, Xiaoming; Liu, Tongkun; Huang, Zhinan; Ren, Jun; Hou, Xilin; Li, Ying

    2015-02-01

    The MADS-box gene family is an ancient and well-studied transcription factor family that functions in almost every developmental process in plants. There are a number of reports about the MADS-box family in different plant species, but systematic analysis of the MADS-box transcription factor family in Brassica rapa (Chinese cabbage) is still lacking. In this study, 160 MADS-box transcription factors were identified from the entire Chinese cabbage genome and compared with the MADS-box factors from 21 other representative plant species. A detailed list of MADS proteins from these 22 species was sorted. Phylogenetic analysis of the BrMADS genes, together with their Arabidopsis and rice counterparts, showed that the BrMADS genes were categorised into type I (Mα, Mβ, Mγ) and type II (MIKC(C), MIKC*) groups, and the MIKC(C) proteins were further divided into 13 subfamilies. The Chinese cabbage type II group has 95 members, which is twice as much as the Arabidopsis type II group, indicating that the Chinese cabbage type II genes have been retained more frequently than the type I genes. Finally, RNA-seq transcriptome data and quantitative real-time PCR analysis revealed that BrMADS genes are expressed in a tissue-specific manner similar to Arabidopsis. Interestingly, a number of BrMIKC genes showed responses to different abiotic stress treatments, suggesting a function for some of the genes in these processes as well. Taken together, the characterization of the B. rapa MADS-box family presented here, will certainly help in the selection of appropriate candidate genes and further facilitate functional studies in Chinese cabbage. PMID:25216934

  4. A gene expression resource generated by genome-wide lacZ profiling in the mouse

    PubMed Central

    Tuck, Elizabeth; Estabel, Jeanne; Oellrich, Anika; Maguire, Anna Karin; Adissu, Hibret A.; Souter, Luke; Siragher, Emma; Lillistone, Charlotte; Green, Angela L.; Wardle-Jones, Hannah; Carragher, Damian M.; Karp, Natasha A.; Smedley, Damian; Adams, Niels C.; Bussell, James N.; Adams, David J.; Ramírez-Solis, Ramiro; Steel, Karen P.; Galli, Antonella; White, Jacqueline K.

    2015-01-01

    ABSTRACT Knowledge of the expression profile of a gene is a critical piece of information required to build an understanding of the normal and essential functions of that gene and any role it may play in the development or progression of disease. High-throughput, large-scale efforts are on-going internationally to characterise reporter-tagged knockout mouse lines. As part of that effort, we report an open access adult mouse expression resource, in which the expression profile of 424 genes has been assessed in up to 47 different organs, tissues and sub-structures using a lacZ reporter gene. Many specific and informative expression patterns were noted. Expression was most commonly observed in the testis and brain and was most restricted in white adipose tissue and mammary gland. Over half of the assessed genes presented with an absent or localised expression pattern (categorised as 0-10 positive structures). A link between complexity of expression profile and viability of homozygous null animals was observed; inactivation of genes expressed in ≥21 structures was more likely to result in reduced viability by postnatal day 14 compared with more restricted expression profiles. For validation purposes, this mouse expression resource was compared with Bgee, a federated composite of RNA-based expression data sets. Strong agreement was observed, indicating a high degree of specificity in our data. Furthermore, there were 1207 observations of expression of a particular gene in an anatomical structure where Bgee had no data, indicating a large amount of novelty in our data set. Examples of expression data corroborating and extending genotype-phenotype associations and supporting disease gene candidacy are presented to demonstrate the potential of this powerful resource. PMID:26398943

  5. Common gene expression strategies revealed by genome-wide analysis in yeast

    PubMed Central

    García-Martínez, José; González-Candelas, Fernando; Pérez-Ortín, José E

    2007-01-01

    Background Gene expression is a two-step synthesis process that ends with the necessary amount of each protein required to perform its function. Since the protein is the final product, the main focus of gene regulation should be centered on it. However, because mRNA is an intermediate step and the amounts of both mRNA and protein are controlled by their synthesis and degradation rates, the desired amount of protein can be achieved following different strategies. Results In this paper we present the first comprehensive analysis of the relationships among the six variables that characterize gene expression in a living organism: transcription and translation rates, mRNA and protein amounts, and mRNA and protein stabilities. We have used previously published data from exponentially growing Saccharomyces cerevisiae cells. We show that there is a general tendency to harmonize the levels of mRNA and protein by coordinating their synthesis rates and that functionally related genes tend to have similar values for the six variables. Conclusion We propose that yeast cells use common expression strategies for genes acting in the same physiological pathways. This trend is more evident for genes coding for large and stable protein complexes, such as ribosomes or the proteasome. Hence, each functional group can be defined by a 'six variable profile' that illustrates the common strategy followed by the genes included in it. Genes encoding subunits of protein complexes show a tendency to have relatively unstable mRNAs and a less balanced profile for mRNA than for protein, suggesting a stronger regulation at the transcriptional level. PMID:17945030

  6. Non-additive genome-wide association scan reveals a new gene associated with habitual coffee consumption

    PubMed Central

    Pirastu, Nicola; Kooyman, Maarten; Robino, Antonietta; van der Spek, Ashley; Navarini, Luciano; Amin, Najaf; Karssen, Lennart C.; Van Duijn, Cornelia M; Gasparini, Paolo

    2016-01-01

    Coffee is one of the most consumed beverages world-wide and one of the primary sources of caffeine intake. Given its important health and economic impact, the underlying genetics of its consumption has been widely studied. Despite these efforts, much has still to be uncovered. In particular, the use of non-additive genetic models may uncover new information about the genetic variants driving coffee consumption. We have conducted a genome-wide association study in two Italian populations using additive, recessive and dominant models for analysis. This has uncovered a significant association in the PDSS2 gene under the recessive model that has been replicated in an independent cohort from the Netherlands (ERF). The identified gene has been shown to negatively regulate the expression of the caffeine metabolism genes and can thus be linked to coffee consumption. Further bioinformatics analysis of eQTL and histone marks from Roadmap data has evidenced a possible role of the identified SNPs in regulating PDSS2 gene expression through enhancers present in its intron. Our results highlight a novel gene which regulates coffee consumption by regulating the expression of the genes linked to caffeine metabolism. Further studies will be needed to clarify the biological mechanism which links PDSS2 and coffee consumption. PMID:27561104

  7. Genome-wide screen identifies a novel p97/CDC-48-dependent pathway regulating ER-stress-induced gene transcription.

    PubMed

    Marza, Esther; Taouji, Saïd; Barroso, Kim; Raymond, Anne-Aurélie; Guignard, Léo; Bonneu, Marc; Pallares-Lupon, Néstor; Dupuy, Jean-William; Fernandez-Zapico, Martin E; Rosenbaum, Jean; Palladino, Francesca; Dupuy, Denis; Chevet, Eric

    2015-03-01

    The accumulation of misfolded proteins in the endoplasmic reticulum (ER) activates the Unfolded Protein Response (UPR(ER)) to restore ER homeostasis. The AAA(+) ATPase p97/CDC-48 plays key roles in ER stress by promoting both ER protein degradation and transcription of UPR(ER) genes. Although the mechanisms associated with protein degradation are now well established, the molecular events involved in the regulation of gene transcription by p97/CDC-48 remain unclear. Using a reporter-based genome-wide RNAi screen in combination with quantitative proteomic analysis in Caenorhabditis elegans, we have identified RUVB-2, a AAA(+) ATPase, as a novel repressor of a subset of UPR(ER) genes. We show that degradation of RUVB-2 by CDC-48 enhances expression of ER stress response genes through an XBP1-dependent mechanism. The functional interplay between CDC-48 and RUVB-2 in controlling transcription of select UPR(ER) genes appears conserved in human cells. Together, these results describe a novel role for p97/CDC-48, whereby its role in protein degradation is integrated with its role in regulating expression of ER stress response genes.

  8. Genome-wide transcriptional analysis of flagellar regeneration in Chlamydomonas reinhardtii identifies orthologs of ciliary disease genes

    NASA Technical Reports Server (NTRS)

    Stolc, Viktor; Samanta, Manoj Pratim; Tongprasit, Waraporn; Marshall, Wallace F.

    2005-01-01

    The important role that cilia and flagella play in human disease creates an urgent need to identify genes involved in ciliary assembly and function. The strong and specific induction of flagellar-coding genes during flagellar regeneration in Chlamydomonas reinhardtii suggests that transcriptional profiling of such cells would reveal new flagella-related genes. We have conducted a genome-wide analysis of RNA transcript levels during flagellar regeneration in Chlamydomonas by using maskless photolithography method-produced DNA oligonucleotide microarrays with unique probe sequences for all exons of the 19,803 predicted genes. This analysis represents previously uncharacterized whole-genome transcriptional activity profiling study in this important model organism. Analysis of strongly induced genes reveals a large set of known flagellar components and also identifies a number of important disease-related proteins as being involved with cilia and flagella, including the zebrafish polycystic kidney genes Qilin, Reptin, and Pontin, as well as the testis-expressed tubby-like protein TULP2.

  9. Non-additive genome-wide association scan reveals a new gene associated with habitual coffee consumption.

    PubMed

    Pirastu, Nicola; Kooyman, Maarten; Robino, Antonietta; van der Spek, Ashley; Navarini, Luciano; Amin, Najaf; Karssen, Lennart C; Van Duijn, Cornelia M; Gasparini, Paolo

    2016-01-01

    Coffee is one of the most consumed beverages world-wide and one of the primary sources of caffeine intake. Given its important health and economic impact, the underlying genetics of its consumption has been widely studied. Despite these efforts, much has still to be uncovered. In particular, the use of non-additive genetic models may uncover new information about the genetic variants driving coffee consumption. We have conducted a genome-wide association study in two Italian populations using additive, recessive and dominant models for analysis. This has uncovered a significant association in the PDSS2 gene under the recessive model that has been replicated in an independent cohort from the Netherlands (ERF). The identified gene has been shown to negatively regulate the expression of the caffeine metabolism genes and can thus be linked to coffee consumption. Further bioinformatics analysis of eQTL and histone marks from Roadmap data has evidenced a possible role of the identified SNPs in regulating PDSS2 gene expression through enhancers present in its intron. Our results highlight a novel gene which regulates coffee consumption by regulating the expression of the genes linked to caffeine metabolism. Further studies will be needed to clarify the biological mechanism which links PDSS2 and coffee consumption. PMID:27561104

  10. Genome-Wide Identification of R2R3-MYB Genes and Expression Analyses During Abiotic Stress in Gossypium raimondii

    PubMed Central

    He, Qiuling; Jones, Don C.; Li, Wei; Xie, Fuliang; Ma, Jun; Sun, Runrun; Wang, Qinglian; Zhu, Shuijin; Zhang, Baohong

    2016-01-01

    The R2R3-MYB is one of the largest families of transcription factors, which have been implicated in multiple biological processes. There is great diversity in the number of R2R3-MYB genes in different plants. However, there is no report on genome-wide characterization of this gene family in cotton. In the present study, a total of 205 putative R2R3-MYB genes were identified in cotton D genome (Gossypium raimondii), that are much larger than that found in other cash crops with fully sequenced genomes. These GrMYBs were classified into 13 groups with the R2R3-MYB genes from Arabidopsis and rice. The amino acid motifs and phylogenetic tree were predicted and analyzed. The sequences of GrMYBs were distributed across 13 chromosomes at various densities. The results showed that the expansion of the G. Raimondii R2R3-MYB family was mainly attributable to whole genome duplication and segmental duplication. Moreover, the expression pattern of 52 selected GrMYBs and 46 GaMYBs were tested in roots and leaves under different abiotic stress conditions. The results revealed that the MYB genes in cotton were differentially expressed under salt and drought stress treatment. Our results will be useful for determining the precise role of the MYB genes during stress responses with crop improvement. PMID:27009386

  11. Genome-wide screen identifies a novel p97/CDC-48-dependent pathway regulating ER-stress-induced gene transcription

    PubMed Central

    Marza, Esther; Taouji, Saïd; Barroso, Kim; Raymond, Anne-Aurélie; Guignard, Léo; Bonneu, Marc; Pallares-Lupon, Néstor; Dupuy, Jean-William; Fernandez-Zapico, Martin E; Rosenbaum, Jean; Palladino, Francesca; Dupuy, Denis; Chevet, Eric

    2015-01-01

    The accumulation of misfolded proteins in the endoplasmic reticulum (ER) activates the Unfolded Protein Response (UPRER) to restore ER homeostasis. The AAA+ ATPase p97/CDC-48 plays key roles in ER stress by promoting both ER protein degradation and transcription of UPRER genes. Although the mechanisms associated with protein degradation are now well established, the molecular events involved in the regulation of gene transcription by p97/CDC-48 remain unclear. Using a reporter-based genome-wide RNAi screen in combination with quantitative proteomic analysis in Caenorhabditis elegans, we have identified RUVB-2, a AAA+ ATPase, as a novel repressor of a subset of UPRER genes. We show that degradation of RUVB-2 by CDC-48 enhances expression of ER stress response genes through an XBP1-dependent mechanism. The functional interplay between CDC-48 and RUVB-2 in controlling transcription of select UPRER genes appears conserved in human cells. Together, these results describe a novel role for p97/CDC-48, whereby its role in protein degradation is integrated with its role in regulating expression of ER stress response genes. PMID:25652260

  12. Genome-wide screen identifies a novel p97/CDC-48-dependent pathway regulating ER-stress-induced gene transcription.

    PubMed

    Marza, Esther; Taouji, Saïd; Barroso, Kim; Raymond, Anne-Aurélie; Guignard, Léo; Bonneu, Marc; Pallares-Lupon, Néstor; Dupuy, Jean-William; Fernandez-Zapico, Martin E; Rosenbaum, Jean; Palladino, Francesca; Dupuy, Denis; Chevet, Eric

    2015-03-01

    The accumulation of misfolded proteins in the endoplasmic reticulum (ER) activates the Unfolded Protein Response (UPR(ER)) to restore ER homeostasis. The AAA(+) ATPase p97/CDC-48 plays key roles in ER stress by promoting both ER protein degradation and transcription of UPR(ER) genes. Although the mechanisms associated with protein degradation are now well established, the molecular events involved in the regulation of gene transcription by p97/CDC-48 remain unclear. Using a reporter-based genome-wide RNAi screen in combination with quantitative proteomic analysis in Caenorhabditis elegans, we have identified RUVB-2, a AAA(+) ATPase, as a novel repressor of a subset of UPR(ER) genes. We show that degradation of RUVB-2 by CDC-48 enhances expression of ER stress response genes through an XBP1-dependent mechanism. The functional interplay between CDC-48 and RUVB-2 in controlling transcription of select UPR(ER) genes appears conserved in human cells. Together, these results describe a novel role for p97/CDC-48, whereby its role in protein degradation is integrated with its role in regulating expression of ER stress response genes. PMID:25652260

  13. Comparison of Gene Expression and Genome-Wide DNA Methylation Profiling between Phenotypically Normal Cloned Pigs and Conventionally Bred Controls

    PubMed Central

    Li, Shengting; Li, Jian; Lin, Lin; Nielsen, Anders Lade; Sørensen, Charlotte Brandt; Vajta, Gábor; Wang, Jun; Zhang, Xiuqing; Du, Yutao; Yang, Huanming; Bolund, Lars

    2011-01-01

    Animal breeding via Somatic Cell Nuclear Transfer (SCNT) has enormous potential in agriculture and biomedicine. However, concerns about whether SCNT animals are as healthy or epigenetically normal as conventionally bred ones are raised as the efficiency of cloning by SCNT is much lower than natural breeding or In-vitro fertilization (IVF). Thus, we have conducted a genome-wide gene expression and DNA methylation profiling between phenotypically normal cloned pigs and control pigs in two tissues (muscle and liver), using Affymetrix Porcine expression array as well as modified methylation-specific digital karyotyping (MMSDK) and Solexa sequencing technology. Typical tissue-specific differences with respect to both gene expression and DNA methylation were observed in muscle and liver from cloned as well as control pigs. Gene expression profiles were highly similar between cloned pigs and controls, though a small set of genes showed altered expression. Cloned pigs presented a more different pattern of DNA methylation in unique sequences in both tissues. Especially a small set of genomic sites had different DNA methylation status with a trend towards slightly increased methylation levels in cloned pigs. Molecular network analysis of the genes that contained such differential methylation loci revealed a significant network related to tissue development. In conclusion, our study showed that phenotypically normal cloned pigs were highly similar with normal breeding pigs in their gene expression, but moderate alteration in DNA methylation aspects still exists, especially in certain unique genomic regions. PMID:22022462

  14. [Genome-wide identification and bioinformatic analysis of PPR gene family in tomato].

    PubMed

    Ding, Anming; Li, Ling; Qu, Xu; Sun, Tingting; Chen, Yaqiong; Zong, Peng; Li, Zunqiang; Gong, Daping; Sun, Yuhe

    2014-01-01

    Pentatricopeptide repeats (PPRs) genes constitute one of the largest gene families in plants, which play a broad and essential role in plant growth and development. In this study, the protein sequences annotated by the tomato (S. lycopersicum L.) genome project were screened with the Pfam PPR sequences. A total of 471 putative PPR-encoding genes were identified. Based on the motifs defined in A. thaliana L., protein structure and conserved sequences for each tomato motif were analyzed. We also analyzed phylogenetic relationship, subcellular localization, expression and GO analysis of the identified gene sequences. Our results demonstrate that tomato PPR gene family contains two subfamilies, P and PLS, each accounting for half of the family. PLS subfamily can be divided into four subclasses i.e., PLS, E, E+ and DYW. Each subclass of sequences forms a clade in the phylogenetic tree. The PPR motifs were found highly conserved among plants. The tomato PPR genes were distributed over 12 chromosomes and most of them lack introns. The majority of PPR proteins harbor mitochondrial or chloroplast localization sequences, whereas GO analysis showed that most PPR proteins participate in RNA-related biological processes.

  15. Genome-wide identification and gene expression profiling of ubiquitin ligases for endoplasmic reticulum protein degradation

    PubMed Central

    Kaneko, Masayuki; Iwase, Ikuko; Yamasaki, Yuki; Takai, Tomoko; Wu, Yan; Kanemoto, Soshi; Matsuhisa, Koji; Asada, Rie; Okuma, Yasunobu; Watanabe, Takeshi; Imaizumi, Kazunori; Nomura, Yausyuki

    2016-01-01

    Endoplasmic reticulum (ER)-associated degradation (ERAD) is a mechanism by which unfolded proteins that accumulate in the ER are transported to the cytosol for ubiquitin–proteasome-mediated degradation. Ubiquitin ligases (E3s) are a group of enzymes responsible for substrate selectivity and ubiquitin chain formation. The purpose of this study was to identify novel E3s involved in ERAD. Thirty-seven candidate genes were selected by searches for proteins with RING-finger motifs and transmembrane regions, which are the major features of ERAD E3s. We performed gene expression profiling for the identified E3s in human and mouse tissues. Several genes were specifically or selectively expressed in both tissues; the expression of four genes (RNFT1, RNF185, CGRRF1 and RNF19B) was significantly upregulated by ER stress. To determine the involvement of the ER stress-responsive genes in ERAD, we investigated their ER localisation, in vitro autoubiquitination activity and ER stress resistance. All were partially localised to the ER, whereas CGRRF1 did not possess E3 activity. RNFT1 and RNF185, but not CGRRF1 and RNF19B, exhibited significant resistance to ER stressor in an E3 activity-dependent manner. Thus, these genes are possible candidates for ERAD E3s. PMID:27485036

  16. Genome-wide analysis of the structural genes regulating defense phenylpropanoid metabolism in Populus

    SciTech Connect

    Tschaplinski, Timothy J; Tsai, Chung-Jui; Harding, Scott A; Lindroth, richard L; Yuan, Yinan

    2006-01-01

    Salicin-based phenolic glycosides, hydroxycinnamate derivatives and flavonoid-derived condensed tannins comprise up to one-third of Populus leaf dry mass. Genes regulating the abundance and chemical diversity of these substances have not been comprehensively analysed in tree species exhibiting this metabolically demanding level of phenolic metabolism. Here, shikimate-phenylpropanoid pathway genes thought to give rise to these phenolic products were annotated from the Populus genome, their expression assessed by semiquantitative or quantitative reverse transcription polymerase chain reaction (PCR), and metabolic evidence for function presented. Unlike Arabidopsis, Populus leaves accumulate an array of hydroxycinnamoyl-quinate esters, which is consistent with broadened function of the expanded hydroxycinnamoyl-CoA transferase gene family. Greater flavonoid pathway diversity is also represented, and flavonoid gene families are larger. Consistent with expanded pathway function, most of these genes were upregulated during wound-stimulated condensed tannin synthesis in leaves. The suite of Populus genes regulating phenylpropanoid product accumulation should have important application in managing phenolic carbon pools in relation to climate change and global carbon cycling.

  17. Genome-wide identification and characterization of reference genes with different transcript abundances for Streptomyces coelicolor

    PubMed Central

    Li, Shanshan; Wang, Weishan; Li, Xiao; Fan, Keqiang; Yang, Keqian

    2015-01-01

    The lack of reliable reference genes (RGs) in the genus Streptomyces hampers effort to obtain the precise data of transcript levels. To address this issue, we aimed to identify reliable RGs in the model organism Streptomyces coelicolor. A pool of potential RGs containing 1,471 genes was first identified by determining the intersection of genes with stable transcript levels from four time-series transcriptome microarray datasets of S. coelicolor M145 cultivated in different conditions. Then, following a strict rational selection scheme including homology analysis, disturbance analysis, function analysis and transcript abundance analysis, 13 candidates were selected from the 1,471 genes. Based on real-time quantitative reverse transcription PCR assays, SCO0710, SCO6185, SCO1544, SCO3183 and SCO4758 were identified as the top five genes with the most stable transcript levels among the 13 candidates. Further analyses showed these five genes also maintained stable transcript levels in different S. coelicolor strains, as well as in Streptomyces avermitilis MA-4680 and Streptomyces clavuligerus NRRL 3585, suggesting they could fulfill the requirements of accurate data normalization in streptomycetes. Moreover, the systematic strategy employed in this work could be used for reference in other microorganism to select reliable RGs. PMID:26527303

  18. The putative phytocyanin genes in Chinese cabbage (Brassica rapa L.): genome-wide identification, classification and expression analysis.

    PubMed

    Li, Jun; Gao, Guizhen; Zhang, Tianyao; Wu, Xiaoming

    2013-02-01

    Phytocyanins (PCs) are a plant-specific family of small copper-containing electron transfer proteins. PCs may bind with a single copper atom to function as electron transporters in various biological systems, such as copper trafficking and plant photosynthesis. Evidence indicates that PCs may also be involved in plant developmental processes and stress responses. Many PCs possess arabinogalactan protein-like regions and are therefore termed chimeric arabinogalactan proteins (CAGPs). Previously, 38 and 62 PC genes have been identified in Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa), respectively. The recent release of the Chinese cabbage genome (B. rapa ssp. Pekinensis line Chiifu-401-42) enabled us to perform a genome-wide identification and analysis. In this study we identified 84 putative PC genes in the B. rapa genome. All of the Brassica rapa phytocyanins (BrPCs) described here could be divided, based on motif constitution, into the following three main subclasses: 52 early nodulin-like proteins (ENODLs), 16 uclacyanin-like proteins (UCLs), and 11 stellacyanin-like proteins (SCLs). A structural analysis predicted that 71 BrPCs contained N-terminal secretion signals and 45 BrPCs may be glycosylphosphatidylinositol-anchored to the plasma membrane. Glycosylation prediction revealed that 48 BrPCs were CAGPs with putative arabinogalactan glycomodules, and 57 BrPCs had N-glycosylation sites. Additionally, gene duplication analysis demonstrated that almost all of the duplicated BrPC genes shared the same conserved collinear blocks and that segmental duplications play an important role in the diversification of this gene family. Surprisingly, all BrUCL genes were duplicated except for BrUCL16. Expression analyses indicated that BrENODL22/27 and BrSCL8/9 were highly expressed in reproductive organs; BrUCL6/16 was strongly expressed in roots and even more strongly expressed in stems. The genome-wide identification, classification and expression analysis of

  19. Genome-wide linkage and association analysis identifies major gene loci for guttural pouch tympany in Arabian and German warmblood horses.

    PubMed

    Metzger, Julia; Ohnesorge, Bernhard; Distl, Ottmar

    2012-01-01

    Equine guttural pouch tympany (GPT) is a hereditary condition affecting foals in their first months of life. Complex segregation analyses in Arabian and German warmblood horses showed the involvement of a major gene as very likely. Genome-wide linkage and association analyses including a high density marker set of single nucleotide polymorphisms (SNPs) were performed to map the genomic region harbouring the potential major gene for GPT. A total of 85 Arabian and 373 German warmblood horses were genotyped on the Illumina equine SNP50 beadchip. Non-parametric multipoint linkage analyses showed genome-wide significance on horse chromosomes (ECA) 3 for German warmblood at 16-26 Mb and 34-55 Mb and for Arabian on ECA15 at 64-65 Mb. Genome-wide association analyses confirmed the linked regions for both breeds. In Arabian, genome-wide association was detected at 64 Mb within the region with the highest linkage peak on ECA15. For German warmblood, signals for genome-wide association were close to the peak region of linkage at 52 Mb on ECA3. The odds ratio for the SNP with the highest genome-wide association was 0.12 for the Arabian. In conclusion, the refinement of the regions with the Illumina equine SNP50 beadchip is an important step to unravel the responsible mutations for GPT.

  20. Genome-wide linkage and association analysis identifies major gene loci for guttural pouch tympany in Arabian and German warmblood horses.

    PubMed

    Metzger, Julia; Ohnesorge, Bernhard; Distl, Ottmar

    2012-01-01

    Equine guttural pouch tympany (GPT) is a hereditary condition affecting foals in their first months of life. Complex segregation analyses in Arabian and German warmblood horses showed the involvement of a major gene as very likely. Genome-wide linkage and association analyses including a high density marker set of single nucleotide polymorphisms (SNPs) were performed to map the genomic region harbouring the potential major gene for GPT. A total of 85 Arabian and 373 German warmblood horses were genotyped on the Illumina equine SNP50 beadchip. Non-parametric multipoint linkage analyses showed genome-wide significance on horse chromosomes (ECA) 3 for German warmblood at 16-26 Mb and 34-55 Mb and for Arabian on ECA15 at 64-65 Mb. Genome-wide association analyses confirmed the linked regions for both breeds. In Arabian, genome-wide association was detected at 64 Mb within the region with the highest linkage peak on ECA15. For German warmblood, signals for genome-wide association were close to the peak region of linkage at 52 Mb on ECA3. The odds ratio for the SNP with the highest genome-wide association was 0.12 for the Arabian. In conclusion, the refinement of the regions with the Illumina equine SNP50 beadchip is an important step to unravel the responsible mutations for GPT. PMID:22848553

  1. Genome-Wide Linkage and Association Analysis Identifies Major Gene Loci for Guttural Pouch Tympany in Arabian and German Warmblood Horses

    PubMed Central

    Metzger, Julia; Ohnesorge, Bernhard; Distl, Ottmar

    2012-01-01

    Equine guttural pouch tympany (GPT) is a hereditary condition affecting foals in their first months of life. Complex segregation analyses in Arabian and German warmblood horses showed the involvement of a major gene as very likely. Genome-wide linkage and association analyses including a high density marker set of single nucleotide polymorphisms (SNPs) were performed to map the genomic region harbouring the potential major gene for GPT. A total of 85 Arabian and 373 German warmblood horses were genotyped on the Illumina equine SNP50 beadchip. Non-parametric multipoint linkage analyses showed genome-wide significance on horse chromosomes (ECA) 3 for German warmblood at 16–26 Mb and 34–55 Mb and for Arabian on ECA15 at 64–65 Mb. Genome-wide association analyses confirmed the linked regions for both breeds. In Arabian, genome-wide association was detected at 64 Mb within the region with the highest linkage peak on ECA15. For German warmblood, signals for genome-wide association were close to the peak region of linkage at 52 Mb on ECA3. The odds ratio for the SNP with the highest genome-wide association was 0.12 for the Arabian. In conclusion, the refinement of the regions with the Illumina equine SNP50 beadchip is an important step to unravel the responsible mutations for GPT. PMID:22848553

  2. Genome Wide Association Identifies PPFIA1 as a Candidate Gene for Acute Lung Injury Risk Following Major Trauma

    PubMed Central

    Feng, Rui; O'Keefe, Grant E.; Bradfield, Jonathan; Ware, Lorraine B.; Christiani, David C.; Calfee, Carolyn S.; Cohen, Mitchell J.; Matthay, Michael; Meyer, Nuala J.; Kim, Cecilia; Li, Mingyao; Akey, Joshua; Barnes, Kathleen C.; Sevransky, Jonathan; Lanken, Paul N.; May, Addison K.; Aplenc, Richard; Maloney, James P.; Hakonarson, Hakon

    2012-01-01

    Acute Lung Injury (ALI) is a syndrome with high associated mortality characterized by severe hypoxemia and pulmonary infiltrates in patients with critical illness. We conducted the first investigation to use the genome wide association (GWA) approach to identify putative risk variants for ALI. Genome wide genotyping was performed using the Illumina Human Quad 610 BeadChip. We performed a two-stage GWA study followed by a third stage of functional characterization. In the discovery phase (Phase 1), we compared 600 European American trauma-associated ALI cases with 2266 European American population-based controls. We carried forward the top 1% of single nucleotide polymorphisms (SNPs) at p<0.01 to a replication phase (Phase 2) comprised of a nested case-control design sample of 212 trauma-associated ALI cases and 283 at-risk trauma non-ALI controls from ongoing cohort studies. SNPs that replicated at the 0.05 level in Phase 2 were subject to functional validation (Phase 3) using expression quantitative trait loci (eQTL) analyses in stimulated B-lymphoblastoid cell lines (B-LCL) in family trios. 159 SNPs from the discovery phase replicated in Phase 2, including loci with prior evidence for a role in ALI pathogenesis. Functional evaluation of these replicated SNPs revealed rs471931 on 11q13.3 to exert a cis-regulatory effect on mRNA expression in the PPFIA1 gene (p = 0.0021). PPFIA1 encodes liprin alpha, a protein involved in cell adhesion, integrin expression, and cell-matrix interactions. This study supports the feasibility of future multi-center GWA investigations of ALI risk, and identifies PPFIA1 as a potential functional candidate ALI risk gene for future research. PMID:22295056

  3. Genome-Wide Analysis Reveals Novel Genes Essential for Heme Homeostasis in Caenorhabditis elegans

    PubMed Central

    Rao, Anita U.; Cerqueira, Gustavo C.; Mitreva, Makedonka; El-Sayed, Najib M.; Krause, Michael; Hamza, Iqbal

    2010-01-01

    Heme is a cofactor in proteins that function in almost all sub-cellular compartments and in many diverse biological processes. Heme is produced by a conserved biosynthetic pathway that is highly regulated to prevent the accumulation of heme—a cytotoxic, hydrophobic tetrapyrrole. Caenorhabditis elegans and related parasitic nematodes do not synthesize heme, but instead require environmental heme to grow and develop. Heme homeostasis in these auxotrophs is, therefore, regulated in accordance with available dietary heme. We have capitalized on this auxotrophy in C. elegans to study gene expression changes associated with precisely controlled dietary heme concentrations. RNA was isolated from cultures containing 4, 20, or 500 µM heme; derived cDNA probes were hybridized to Affymetrix C. elegans expression arrays. We identified 288 heme-responsive genes (hrgs) that were differentially expressed under these conditions. Of these genes, 42% had putative homologs in humans, while genomes of medically relevant heme auxotrophs revealed homologs for 12% in both Trypanosoma and Leishmania and 24% in parasitic nematodes. Depletion of each of the 288 hrgs by RNA–mediated interference (RNAi) in a transgenic heme-sensor worm strain identified six genes that regulated heme homeostasis. In addition, seven membrane-spanning transporters involved in heme uptake were identified by RNAi knockdown studies using a toxic heme analog. Comparison of genes that were positive in both of the RNAi screens resulted in the identification of three genes in common that were vital for organismal heme homeostasis in C. elegans. Collectively, our results provide a catalog of genes that are essential for metazoan heme homeostasis and demonstrate the power of C. elegans as a genetic animal model to dissect the regulatory circuits which mediate heme trafficking in both vertebrate hosts and their parasites, which depend on environmental heme for survival. PMID:20686661

  4. [Genome-wide identification and expression analysis of the WRKY gene family in peach].

    PubMed

    Yanbing, Gu; Zhirui, Ji; Fumei, Chi; Zhuang, Qiao; Chengnan, Xu; Junxiang, Zhang; Zongshan, Zhou; Qinglong, Dong

    2016-03-01

    The WRKY transcription factors are one of the largest families of transcriptional regulators and play diverse regulatory roles in biotic and abiotic stresses, plant growth and development processes. In this study, the WRKY DNA-binding domain (Pfam Database number: PF03106) downloaded from Pfam protein families database was exploited to identify WRKY genes from the peach (Prunus persica 'Lovell') genome using HMMER 3.0. The obtained amino acid sequences were analyzed with DNAMAN 5.0, WebLogo 3, MEGA 5.1, MapInspect and MEME bioinformatics softwares. Totally 61 peach WRKY genes were found in the peach genome. Our phylogenetic analysis revealed that peach WRKY genes were classified into three Groups: Ⅰ, Ⅱ and Ⅲ. The WRKY N-terminal and C-terminal domains of Group Ⅰ (group I-N and group I-C) were monophyletic. The Group Ⅱ was sub-divided into five distinct clades (groupⅡ-a, Ⅱ-b, Ⅱ-c, Ⅱ-d and Ⅱ-e). Our domain analysis indicated that the WRKY regions contained a highly conserved heptapeptide stretch WRKYGQK at its N-terminus followed by a zinc-finger motif. The chromosome mapping analysis showed that peach WRKY genes were distributed with different densities over 8 chromosomes. The intron-exon structure analysis revealed that structures of the WRKY gene were highly conserved in the peach. The conserved motif analysis showed that the conserved motifs 1, 2 and 3, which specify the WRKY domain, were observed in all peach WRKY proteins, motif 5 as the unknown domain was observed in group Ⅱ-d, two WRKY domains were assigned to GroupⅠ. SqRT-PCR and qRT-PCR results indicated that 16 PpWRKY genes were expressed in roots, stems, leaves, flowers and fruits at various expression levels. Our analysis thus identified the PpWRKY gene families, and future functional studies are needed to reveal its specific roles. PMID:27001479

  5. [Genome-wide identification and expression analysis of the WRKY gene family in peach].

    PubMed

    Yanbing, Gu; Zhirui, Ji; Fumei, Chi; Zhuang, Qiao; Chengnan, Xu; Junxiang, Zhang; Zongshan, Zhou; Qinglong, Dong

    2016-03-01

    The WRKY transcription factors are one of the largest families of transcriptional regulators and play diverse regulatory roles in biotic and abiotic stresses, plant growth and development processes. In this study, the WRKY DNA-binding domain (Pfam Database number: PF03106) downloaded from Pfam protein families database was exploited to identify WRKY genes from the peach (Prunus persica 'Lovell') genome using HMMER 3.0. The obtained amino acid sequences were analyzed with DNAMAN 5.0, WebLogo 3, MEGA 5.1, MapInspect and MEME bioinformatics softwares. Totally 61 peach WRKY genes were found in the peach genome. Our phylogenetic analysis revealed that peach WRKY genes were classified into three Groups: Ⅰ, Ⅱ and Ⅲ. The WRKY N-terminal and C-terminal domains of Group Ⅰ (group I-N and group I-C) were monophyletic. The Group Ⅱ was sub-divided into five distinct clades (groupⅡ-a, Ⅱ-b, Ⅱ-c, Ⅱ-d and Ⅱ-e). Our domain analysis indicated that the WRKY regions contained a highly conserved heptapeptide stretch WRKYGQK at its N-terminus followed by a zinc-finger motif. The chromosome mapping analysis showed that peach WRKY genes were distributed with different densities over 8 chromosomes. The intron-exon structure analysis revealed that structures of the WRKY gene were highly conserved in the peach. The conserved motif analysis showed that the conserved motifs 1, 2 and 3, which specify the WRKY domain, were observed in all peach WRKY proteins, motif 5 as the unknown domain was observed in group Ⅱ-d, two WRKY domains were assigned to GroupⅠ. SqRT-PCR and qRT-PCR results indicated that 16 PpWRKY genes were expressed in roots, stems, leaves, flowers and fruits at various expression levels. Our analysis thus identified the PpWRKY gene families, and future functional studies are needed to reveal its specific roles.

  6. Genome-wide identification, phylogeny, and expression of fibroblast growth genes in common carp.

    PubMed

    Jiang, Likun; Zhang, Songhao; Dong, Chuanju; Chen, Baohua; Feng, Jingyan; Peng, Wenzhu; Mahboob, Shahid; Al-Ghanim, Khalid A; Xu, Peng

    2016-03-10

    Fibroblast growth factors (FGFs) are a large family of polypeptide growth factors, which are found in organisms ranging from nematodes to humans. In vertebrates, a number of FGFs have been shown to play important roles in developing embryos and adult organisms. Among the vertebrate species, FGFs are highly conserved in both gene structure and amino-acid sequence. However, studies on teleost FGFs are mainly limited to model species, hence we investigated FGFs in the common carp genome. We identified 35 FGFs in the common carp genome. Phylogenetic analysis revealed that most of the FGFs are highly conserved, though recent gene duplication and gene losses do exist. By examining the copy number of FGFs in several vertebrate genomes, we found that eight FGFs in common carp have undergone gene duplications, including FGF6a, FGF6b, FGF7, FGF8b, FGF10a, FGF11b, FGF13a, and FGF18b. The expression patterns of all FGFs were examined in various tissues, including the blood, brain, gill, heart, intestine, muscle, skin, spleen and kidney, showing that most of the FGFs were ubiquitously expressed, indicating their critical role in common carp. To some extent, examination of gene families with detailed phylogenetic or orthology analysis verified the authenticity and accuracy of assembly and annotation of the recently published common carp whole genome sequences. Gene families are also considered as a unique source for evolutionary studies. Moreover, the whole set of common carp FGF gene family provides an important genomic resource for future biochemical, physiological, and phylogenetic studies on FGFs in teleosts.

  7. Genome-wide gene order distances support clustering the gram-positive bacteria

    PubMed Central

    House, Christopher H.; Pellegrini, Matteo; Fitz-Gibbon, Sorel T.

    2015-01-01

    Initially using 143 genomes, we developed a method for calculating the pair-wise distance between prokaryotic genomes using a Monte Carlo method to estimate the conservation of gene order. The method was based on repeatedly selecting five or six non-adjacent random orthologs from each of two genomes and determining if the chosen orthologs were in the same order. The raw distances were then corrected for gene order convergence using an adaptation of the Jukes-Cantor model, as well as using the common distance correction D′ = −ln(1-D). First, we compared the distances found via the order of six orthologs to distances found based on ortholog gene content and small subunit rRNA sequences. The Jukes-Cantor gene order distances are reasonably well correlated with the divergence of rRNA (R2 = 0.24), especially at rRNA Jukes-Cantor distances of less than 0.2 (R2 = 0.52). Gene content is only weakly correlated with rRNA divergence (R2 = 0.04) over all distances, however, it is especially strongly correlated at rRNA Jukes-Cantor distances of less than 0.1 (R2 = 0.67). This initial work suggests that gene order may be useful in conjunction with other methods to help understand the relatedness of genomes. Using the gene order distances in 143 genomes, the relations of prokaryotes were studied using neighbor joining and agreement subtrees. We then repeated our study of the relations of prokaryotes using gene order in 172 complete genomes better representing a wider-diversity of prokaryotes. Consistently, our trees show the Actinobacteria as a sister group to the bulk of the Firmicutes. In fact, the robustness of gene order support was found to be considerably greater for uniting these two phyla than for uniting any of the proteobacterial classes together. The results are supportive of the idea that Actinobacteria and Firmicutes are closely related, which in turn implies a single origin for the gram-positive cell. PMID:25653643

  8. Genome-wide identification of glucosinolate synthesis genes in Brassica rapa.

    PubMed

    Zang, Yun-Xiang; Kim, Hyun Uk; Kim, Jin A; Lim, Myung-Ho; Jin, Mina; Lee, Sang Choon; Kwon, Soo-Jin; Lee, Soo-In; Hong, Joon Ki; Park, Tae-Ho; Mun, Jeong-Hwan; Seol, Young-Joo; Hong, Seung-Beom; Park, Beom-Seok

    2009-07-01

    Glucosinolates play important roles in plant defense against herbivores and microbes, as well as in human nutrition. Some glucosinolate-derived isothiocyanate and nitrile compounds have been clinically proven for their anticarcinogenic activity. To better understand glucosinolate biosynthesis in Brassica rapa, we conducted a comparative genomics study with Arabidopsis thaliana and identified total 56 putative biosynthetic and regulator genes. This established a high colinearity in the glucosinolate biosynthesis pathway between Arabidopsis and B. rapa. Glucosinolate genes in B. rapa share 72-94% nucleotide sequence identity with the Arabidopsis orthologs and exist in different copy numbers. The exon/intron split pattern of B. rapa is almost identical to that of Arabidopsis, although inversion, insertion, deletion and intron size variations commonly occur. Four genes appear to be nonfunctional as a result of the presence of a frame shift mutation and retrotransposon insertion. At least 12 paralogs of desulfoglucosinolate sulfotransferase were found in B. rapa, whereas only three were found in Arabidopsis. The expression of those paralogs was not tissue-specific but varied greatly depending on B. rapa tissue types. Expression was also developmentally regulated in some paralogs but not in other paralogs. Most of the regulator genes are present as triple copies. Accordingly, glucosinolate synthesis and regulation in B. rapa appears to be more complex than that of Arabidopsis. With the isolation and further characterization of the endogenous genes, health-beneficial vegetables or desirable animal feed crops could be developed by metabolically engineering the glucosinolate pathway.

  9. Genome-wide assessment of differential effector gene use in embryogenesis.

    PubMed

    Barsi, Julius C; Tu, Qiang; Calestani, Cristina; Davidson, Eric H

    2015-11-15

    Six different populations of cells were isolated by fluorescence-activated cell sorting from disaggregated late blastula- and gastrula-stage sea urchin embryos according to the regulatory states expressed in these cells, as reported by recombineered bacterial artificial chromosomes producing fluorochromes. Transcriptomes recovered from these embryonic cell populations revealed striking, early differential expression of large cohorts of effector genes. The six cell populations were presumptive pigment cells, presumptive neurogenic cells, presumptive skeletogenic cells, cells from the stomodeal region of the oral ectoderm, ciliated band cells and cells from the endoderm/ectoderm boundary that will give rise both to hindgut and to border ectoderm. Transcriptome analysis revealed that each of these domains specifically expressed several hundred effector genes at significant levels. Annotation indicated the qualitative individuality of the functional nature of each cell population, even though they were isolated from embryos only 1-2 days old. In no case was more than a tiny fraction of the transcripts enriched in one population also enriched in any other of the six populations studied. As was particularly clear in the cases of the presumptive pigment, neurogenic and skeletogenic cells, all three of which represent precociously differentiating cell types of this embryo, most specifically expressed genes of given cell types are not significantly expressed at all in the other cell types. Thus, at the effector gene level, a dramatic, cell type-specific pattern of differential gene regulation is established well before any significant embryonic morphogenesis has occurred.

  10. Genome-wide identification and characterization of maize expansin genes expressed in endosperm.

    PubMed

    Zhang, Wei; Yan, Hanwei; Chen, Weijun; Liu, Jinyang; Jiang, Cuiping; Jiang, Haiyang; Zhu, Suwen; Cheng, Beijiu

    2014-12-01

    By promoting cell wall loosening, expansins contribute to cell enlargement during various developmental processes. Nevertheless, the role of expansins in the expansion and development of endosperm--a major seed component whose cell size is significantly associated with grain yield--is poorly understood. To explore associated biological processes and the evolution of expansins in maize, we performed a systematic analysis of the expansin gene family encompassing gene structure, phylogeny, chromosomal location, gene duplication, and gene ontology. A total of 88 maize expansin genes (ZmEXPs) were identified and categorized into three subfamilies according to their phylogenetic relationships. Expression patterns of ZmEXPs were also investigated in nine different tissues by semi-quantitative RT-PCR. The expression of eight ZmEXPs was detected in endosperm, with five showing endosperm-specific expression. Quantitative RT-PCR was used to analyze expression patterns of the eight ZmEXPs in endosperm (10 days after pollination) under abscisic acid (ABA) and gibberellic acid (GA3) treatments. All eight ZmEXPs were found to be significantly regulated by ABA and GA3 in endosperm, suggesting important roles for these hormones in the regulation of ZmEXPs during endosperm development. Our results provide essential information for ZmEXPs cloning and functional exploration, which will assist research on expansin-related mechanisms and contribute to future enhancement of maize grain yield.

  11. Genome-wide enrichment screening reveals multiple targets and resistance genes for triclosan in Escherichia coli.

    PubMed

    Yu, Byung Jo; Kim, Jung Ae; Ju, Hyun Mok; Choi, Soo-Kyung; Hwang, Seung Jin; Park, Sungyoo; Kim, Euijoong; Pan, Jae-Gu

    2012-10-01

    Triclosan is a widely used biocide effective against different microorganisms. At bactericidal concentrations, triclosan appears to affect multiple targets, while at bacteriostatic concentrations, triclosan targets FabI. The site-specific antibiotic-like mode-of-action and a widespread use of triclosan in household products claimed to possibly induce cross-resistance to other antibiotics. Thus, we set out to define more systematically the genes conferring resistance to triclosan; A genomic library of Escherichia coli strain W3110 was constructed and enriched in a selective medium containing a lethal concentration of triclosan. The genes enabling growth in the presence of triclosan were identified by using a DNA microarray and confirmed consequently by ASKA clones overexpressing the selected 62 candidate genes. Among these, forty-seven genes were further confirmed to enhance the resistance to triclosan; these genes, including the FabI target, were involved in inner or outer membrane synthesis, cell-surface material synthesis, transcriptional activation, sugar phosphotransferase (PTS) systems, various transporter systems, cell division, and ATPase and reductase/dehydrogenase reactions. In particular, overexpression of pgsA, rcsA, or gapC conferred to E. coli cells a similar level of triclosan resistance induced by fabI overexpression. These results indicate that triclosan may have multiple targets other than well-known FabI and that there are several undefined novel mechanisms for the resistance development to triclosan, thus probably inducing cross antibiotic resistance.

  12. Genome-wide gene expression profiling of acute metal exposures in male zebrafish.

    PubMed

    Baer, Christine E; Ippolito, Danielle L; Hussainzada, Naissan; Lewis, John A; Jackson, David A; Stallings, Jonathan D

    2014-12-01

    To capture global responses to metal poisoning and mechanistic insights into metal toxicity, gene expression changes were evaluated in whole adult male zebrafish following acute 24 h high dose exposure to three metals with known human health risks. Male adult zebrafish were exposed to nickel chloride, cobalt chloride or sodium dichromate at concentrations corresponding to their respective 96 h LC20, LC40 and LC60 (i.e. 96 h concentrations at which 20%, 40% and 60% lethality is expected, respectively). Histopathology was performed on a subset of metal-exposed zebrafish to phenotypically anchor transcriptional changes associated with each metal exposure. Here we describe in detail the contents and quality controls for the gene expression and other data associated with the study published by Hussainzada and colleagues in BMC Pharmacology and Toxicology (Hussainzada et al., 2014) with the data uploaded to Gene Expression Omnibus (accession number GSE50648).

  13. Genome-wide gene expression profiling of acute metal exposures in male zebrafish

    PubMed Central

    Baer, Christine E.; Ippolito, Danielle L.; Hussainzada, Naissan; Lewis, John A.; Jackson, David A.; Stallings, Jonathan D.

    2014-01-01

    To capture global responses to metal poisoning and mechanistic insights into metal toxicity, gene expression changes were evaluated in whole adult male zebrafish following acute 24 h high dose exposure to three metals with known human health risks. Male adult zebrafish were exposed to nickel chloride, cobalt chloride or sodium dichromate at concentrations corresponding to their respective 96 h LC20, LC40 and LC60 (i.e. 96 h concentrations at which 20%, 40% and 60% lethality is expected, respectively). Histopathology was performed on a subset of metal-exposed zebrafish to phenotypically anchor transcriptional changes associated with each metal exposure. Here we describe in detail the contents and quality controls for the gene expression and other data associated with the study published by Hussainzada and colleagues in BMC Pharmacology and Toxicology (Hussainzada et al., 2014) with the data uploaded to Gene Expression Omnibus (accession number GSE50648). PMID:26484131

  14. Genome-wide SNP association-based localization of a dwarfism gene in Friesian dwarf horses.

    PubMed

    Orr, N; Back, W; Gu, J; Leegwater, P; Govindarajan, P; Conroy, J; Ducro, B; Van Arendonk, J A M; MacHugh, D E; Ennis, S; Hill, E W; Brama, P A J

    2010-12-01

    The recent completion of the horse genome and commercial availability of an equine SNP genotyping array has facilitated the mapping of disease genes. We report putative localization of the gene responsible for dwarfism, a trait in Friesian horses that is thought to have a recessive mode of inheritance, to a 2-MB region of chromosome 14 using just 10 affected animals and 10 controls. We successfully genotyped 34,429 SNPs that were tested for association with dwarfism using chi-square tests. The most significant SNP in our study, BIEC2-239376 (P(2df)=4.54 × 10(-5), P(rec)=7.74 × 10(-6)), is located close to a gene implicated in human dwarfism. Fine-mapping and resequencing analyses did not aid in further localization of the causative variant, and replication of our findings in independent sample sets will be necessary to confirm these results.

  15. Genome-Wide Comparative Analysis of Chemosensory Gene Families in Five Tsetse Fly Species.

    PubMed

    Macharia, Rosaline; Mireji, Paul; Murungi, Edwin; Murilla, Grace; Christoffels, Alan; Aksoy, Serap; Masiga, Daniel

    2016-02-01

    For decades, odour-baited traps have been used for control of tsetse flies (Diptera; Glossinidae), vectors of African trypanosomes. However, differential responses to known attractants have been reported in different Glossina species, hindering establishment of a universal vector control tool. Availability of full genome sequences of five Glossina species offers an opportunity to compare their chemosensory repertoire and enhance our understanding of their biology in relation to chemosensation. Here, we identified and annotated the major chemosensory gene families in Glossina. We identified a total of 118, 115, 124, and 123 chemosensory genes in Glossina austeni, G. brevipalpis, G. f. fuscipes, G. pallidipes, respectively, relative to 127 reported in G. m. morsitans. Our results show that tsetse fly genomes have fewer chemosensory genes when compared to other dipterans such as Musca domestica (n>393), Drosophila melanogaster (n = 246) and Anopheles gambiae (n>247). We also found that Glossina chemosensory genes are dispersed across distantly located scaffolds in their respective genomes, in contrast to other insects like D. melanogaster whose genes occur in clusters. Further, Glossina appears to be devoid of sugar receptors and to have expanded CO2 associated receptors, potentially reflecting Glossina's obligate hematophagy and the need to detect hosts that may be out of sight. We also identified, in all species, homologs of Ir84a; a Drosophila-specific ionotropic receptor that promotes male courtship suggesting that this is a conserved trait in tsetse flies. Notably, our selection analysis revealed that a total of four gene loci (Gr21a, GluRIIA, Gr28b, and Obp83a) were under positive selection, which confers fitness advantage to species. These findings provide a platform for studies to further define the language of communication of tsetse with their environment, and influence development of novel approaches for control. PMID:26886411

  16. Genome-Wide Patterns of Gene Expression during Aging in the African Malaria Vector Anopheles gambiae

    PubMed Central

    Wang, Mei-Hui; Marinotti, Osvaldo; James, Anthony A.; Walker, Edward; Githure, John; Yan, Guiyun

    2010-01-01

    The primary means of reducing malaria transmission is through reduction in longevity in days of the adult female stage of the Anopheles vector. However, assessing chronological age is limited to crude physiologic methods which categorize the females binomially as either very young (nulliparous) or not very young (parous). Yet the epidemiologically relevant reduction in life span falls within the latter category. Age-grading methods that delineate chronological age, using accurate molecular surrogates based upon gene expression profiles, will allow quantification of the longevity-reducing effects of vector control tools aimed at the adult, female mosquito. In this study, microarray analyses of gene expression profiles in the African malaria vector Anopheles gambiae were conducted during natural senescence of females in laboratory conditions. Results showed that detoxification-related and stress-responsive genes were up-regulated as mosquitoes aged. A total of 276 transcripts had age-dependent expression, independently of blood feeding and egg laying events. Expression of 112 (40.6%) of these transcripts increased or decreased monotonically with increasing chronologic age. Seven candidate genes for practical age assessment were tested by quantitative gene amplification in the An. gambiae G3 strain in a laboratory experiment and the Mbita strain in field enclosures set up in western Kenya under conditions closely resembling natural ones. Results were similar between experiments, indicating that senescence is marked by changes in gene expression and that chronological age can be gauged accurately and repeatedly with this method. These results indicate that the method may be suitable for accurate gauging of the age in days of field-caught, female An. gambiae. PMID:20967211

  17. Genome-Wide Comparative Analysis of Chemosensory Gene Families in Five Tsetse Fly Species.

    PubMed

    Macharia, Rosaline; Mireji, Paul; Murungi, Edwin; Murilla, Grace; Christoffels, Alan; Aksoy, Serap; Masiga, Daniel

    2016-02-01

    For decades, odour-baited traps have been used for control of tsetse flies (Diptera; Glossinidae), vectors of African trypanosomes. However, differential responses to known attractants have been reported in different Glossina species, hindering establishment of a universal vector control tool. Availability of full genome sequences of five Glossina species offers an opportunity to compare their chemosensory repertoire and enhance our understanding of their biology in relation to chemosensation. Here, we identified and annotated the major chemosensory gene families in Glossina. We identified a total of 118, 115, 124, and 123 chemosensory genes in Glossina austeni, G. brevipalpis, G. f. fuscipes, G. pallidipes, respectively, relative to 127 reported in G. m. morsitans. Our results show that tsetse fly genomes have fewer chemosensory genes when compared to other dipterans such as Musca domestica (n>393), Drosophila melanogaster (n = 246) and Anopheles gambiae (n>247). We also found that Glossina chemosensory genes are dispersed across distantly located scaffolds in their respective genomes, in contrast to other insects like D. melanogaster whose genes occur in clusters. Further, Glossina appears to be devoid of sugar receptors and to have expanded CO2 associated receptors, potentially reflecting Glossina's obligate hematophagy and the need to detect hosts that may be out of sight. We also identified, in all species, homologs of Ir84a; a Drosophila-specific ionotropic receptor that promotes male courtship suggesting that this is a conserved trait in tsetse flies. Notably, our selection analysis revealed that a total of four gene loci (Gr21a, GluRIIA, Gr28b, and Obp83a) were under positive selection, which confers fitness advantage to species. These findings provide a platform for studies to further define the language of communication of tsetse with their environment, and influence development of novel approaches for control.

  18. Genome-Wide Comparative Analysis of Chemosensory Gene Families in Five Tsetse Fly Species

    PubMed Central

    Macharia, Rosaline; Mireji, Paul; Murungi, Edwin; Murilla, Grace; Christoffels, Alan; Aksoy, Serap; Masiga, Daniel

    2016-01-01

    For decades, odour-baited traps have been used for control of tsetse flies (Diptera; Glossinidae), vectors of African trypanosomes. However, differential responses to known attractants have been reported in different Glossina species, hindering establishment of a universal vector control tool. Availability of full genome sequences of five Glossina species offers an opportunity to compare their chemosensory repertoire and enhance our understanding of their biology in relation to chemosensation. Here, we identified and annotated the major chemosensory gene families in Glossina. We identified a total of 118, 115, 124, and 123 chemosensory genes in Glossina austeni, G. brevipalpis, G. f. fuscipes, G. pallidipes, respectively, relative to 127 reported in G. m. morsitans. Our results show that tsetse fly genomes have fewer chemosensory genes when compared to other dipterans such as Musca domestica (n>393), Drosophila melanogaster (n = 246) and Anopheles gambiae (n>247). We also found that Glossina chemosensory genes are dispersed across distantly located scaffolds in their respective genomes, in contrast to other insects like D. melanogaster whose genes occur in clusters. Further, Glossina appears to be devoid of sugar receptors and to have expanded CO2 associated receptors, potentially reflecting Glossina's obligate hematophagy and the need to detect hosts that may be out of sight. We also identified, in all species, homologs of Ir84a; a Drosophila-specific ionotropic receptor that promotes male courtship suggesting that this is a conserved trait in tsetse flies. Notably, our selection analysis revealed that a total of four gene loci (Gr21a, GluRIIA, Gr28b, and Obp83a) were under positive selection, which confers fitness advantage to species. These findings provide a platform for studies to further define the language of communication of tsetse with their environment, and influence development of novel approaches for control. PMID:26886411

  19. Genome-wide identification and characterization of novel genes involved in terpenoid biosynthesis in Salvia miltiorrhiza

    PubMed Central

    Ma, Yimian; Yuan, Lichai; Wu, Bin; Li, Xian’en; Chen, Shilin; Lu, Shanfa

    2012-01-01

    Terpenoids are the largest class of plant secondary metabolites and have attracted widespread interest. Salvia miltiorrhiza, belonging to the largest and most widely distributed genus in the mint family, is a model medicinal plant with great economic and medicinal value. Diterpenoid tanshinones are the major lipophilic bioactive components in S. miltiorrhiza. Systematic analysis of genes involved in terpenoid biosynthesis has not been reported to date. Searching the recently available working draft of the S. miltiorrhiza genome, 40 terpenoid biosynthesis-related genes were identified, of which 27 are novel. These genes are members of 19 families, which encode all of the enzymes involved in the biosynthesis of the universal isoprene precursor isopentenyl diphosphate and its isomer dimethylallyl diphosphate, and two enzymes associated with the biosynthesis of labdane-related diterpenoids. Through a systematic analysis, it was found that 20 of the 40 genes could be involved in tanshinone biosynthesis. Using a comprehensive approach, the intron/exon structures and expression patterns of all identified genes and their responses to methyl jasmonate treatment were analysed. The conserved domains and phylogenetic relationships among the deduced S. miltiorrhiza proteins and their homologues isolated from other plant species were revealed. It was discovered that some of the key enzymes, such as 1-deoxy-D-xylulose 5-phosphate synthase, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase, hydroxymethylglutaryl-CoA reductase, and geranylgeranyl diphosphate synthase, are encoded by multiple gene members with different expression patterns and subcellular localizations, and both homomeric and heteromeric geranyl diphosphate synthases exist in S. miltiorrhiza. The results suggest the complexity of terpenoid biosynthesis and the existence of metabolic channels for diverse terpenoids in S. miltiorrhiza and provide useful information for improving tanshinone production through genetic

  20. Genome-Wide Identification of Genes Required for Fitness of Group A Streptococcus in Human Blood

    PubMed Central

    Le Breton, Yoann; Mistry, Pragnesh; Valdes, Kayla M.; Quigley, Jeffrey; Kumar, Nikhil; Tettelin, Hervé

    2013-01-01

    The group A streptococcus (GAS) is a strict human pathogen responsible for a wide spectrum of diseases. Although GAS genome sequences are available, functional genomic analyses have been limited. We developed a mariner-based transposon, osKaR, designed to perform Transposon-Site Hybridization (TraSH) in GAS and successfully tested its use in several invasive serotypes. A complex osKaR mutant library in M1T1 GAS strain 5448 was subjected to negative selection in human blood to identify genes important for GAS fitness in this clinically relevant environment. Mutants underrepresented after growth in blood (output pool) compared to growth in rich media (input pool) were identified using DNA microarray hybridization of transposon-specific tags en masse. Using blood from three different donors, we identified 81 genes that met our criteria for reduced fitness in blood from at least two individuals. Genes known to play a role in survival of GAS in blood were found, including those encoding the virulence regulator Mga (mga), the peroxide response regulator PerR (perR), and the RofA-like regulator Ralp-3 (ralp3). We also identified genes previously reported for their contribution to sepsis in other pathogens, such as de novo nucleotide synthesis (purD, purA, pyrB, carA, carB, guaB), sugar metabolism (scrB, fruA), zinc uptake (adcC), and transcriptional regulation (cpsY). To validate our findings, independent mutants with mutations in 10 different genes identified in our screen were confirmed to be defective for survival in blood bactericidal assays. Overall, this work represents the first use of TraSH in GAS to identify potential virulence genes. PMID:23297387

  1. Genome-wide analysis of drought induced gene expression changes in flax (Linum usitatissimum)

    PubMed Central

    Dash, Prasanta K; Cao, Yongguo; Jailani, Abdul K; Gupta, Payal; Venglat, Prakash; Xiang, Daoquan; Rai, Rhitu; Sharma, Rinku; Thirunavukkarasu, Nepolean; Abdin, Malik Z; Yadava, Devendra K; Singh, Nagendra K; Singh, Jas; Selvaraj, Gopalan; Deyholos, Mike; Kumar, Polumetla Ananda; Datla, Raju

    2014-01-01

    A robust phenotypic plasticity to ward off adverse environmental conditions determines performance and productivity in crop plants. Flax (linseed), is an important cash crop produced for natural textile fiber (linen) or oilseed with many health promoting products. This crop is prone to drought stress and yield losses in many parts of the world. Despite recent advances in drought research in a number of important crops, related progress in flax is very limited. Since, response of this plant to drought stress has not been addressed at the molecular level; we conducted microarray analysis to capture transcriptome associated with induced drought in flax. This study identified 183 differentially expressed genes (DEGs) associated with diverse cellular, biophysical and metabolic programs in flax. The analysis also revealed especially the altered regulation of cellular and metabolic pathways governing photosynthesis. Additionally, comparative transcriptome analysis identified a plethora of genes that displayed differential regulation both spatially and temporally. These results revealed co-regulated expression of 26 genes in both shoot and root tissues with implications for drought stress response. Furthermore, the data also showed that more genes are upregulated in roots compared to shoots, suggesting that roots may play important and additional roles in response to drought in flax. With prolonged drought treatment, the number of DEGs increased in both tissue types. Differential expression of selected genes was confirmed by qRT-PCR, thus supporting the suggested functional association of these intrinsic genes in maintaining growth and homeostasis in response to imminent drought stress in flax. Together the present study has developed foundational and new transcriptome data sets for drought stress in flax. PMID:25072186

  2. Genome-wide analysis of drought induced gene expression changes in flax (Linum usitatissimum).

    PubMed

    Dash, Prasanta K; Cao, Yongguo; Jailani, Abdul K; Gupta, Payal; Venglat, Prakash; Xiang, Daoquan; Rai, Rhitu; Sharma, Rinku; Thirunavukkarasu, Nepolean; Abdin, Malik Z; Yadava, Devendra K; Singh, Nagendra K; Singh, Jas; Selvaraj, Gopalan; Deyholos, Mike; Kumar, Polumetla Ananda; Datla, Raju

    2014-01-01

    A robust phenotypic plasticity to ward off adverse environmental conditions determines performance and productivity in crop plants. Flax (linseed), is an important cash crop produced for natural textile fiber (linen) or oilseed with many health promoting products. This crop is prone to drought stress and yield losses in many parts of the world. Despite recent advances in drought research in a number of important crops, related progress in flax is very limited. Since, response of this plant to drought stress has not been addressed at the molecular level; we conducted microarray analysis to capture transcriptome associated with induced drought in flax. This study identified 183 differentially expressed genes (DEGs) associated with diverse cellular, biophysical and metabolic programs in flax. The analysis also revealed especially the altered regulation of cellular and metabolic pathways governing photosynthesis. Additionally, comparative transcriptome analysis identified a plethora of genes that displayed differential regulation both spatially and temporally. These results revealed co-regulated expression of 26 genes in both shoot and root tissues with implications for drought stress response. Furthermore, the data also showed that more genes are upregulated in roots compared to shoots, suggesting that roots may play important and additional roles in response to drought in flax. With prolonged drought treatment, the number of DEGs increased in both tissue types. Differential expression of selected genes was confirmed by qRT-PCR, thus supporting the suggested functional association of these intrinsic genes in maintaining growth and homeostasis in response to imminent drought stress in flax. Together the present study has developed foundational and new transcriptome data sets for drought stress in flax.

  3. Systematic, genome-wide identification of host genes affecting replication of a positive-strand RNA virus.

    PubMed

    Kushner, David B; Lindenbach, Brett D; Grdzelishvili, Valery Z; Noueiry, Amine O; Paul, Scott M; Ahlquist, Paul

    2003-12-23

    Positive-strand RNA viruses are the largest virus class and include many pathogens such as hepatitis C virus and the severe acute respiratory syndrome coronavirus (SARS). Brome mosaic virus (BMV) is a representative positive-strand RNA virus whose RNA replication, gene expression, and encapsidation have been reproduced in the yeast Saccharomyces cerevisiae. By using traditional yeast genetics, host genes have been identified that function in controlling BMV translation, selecting BMV RNAs as replication templates, activating the replication complex, maintaining a lipid composition required for membrane-associated RNA replication, and other steps. To more globally and systematically identify such host factors, we used engineered BMV derivatives to assay viral RNA replication in each strain of an ordered, genome-wide set of yeast single-gene deletion mutants. Each deletion strain was transformed to express BMV replicase proteins and a BMV RNA replication template with the capsid gene replaced by a luciferase reporter. Luciferase expression, which is dependent on viral RNA replication and RNA-dependent mRNA synthesis, was measured in intact yeast cells. Approximately 4500 yeast deletion strains ( approximately 80% of yeast genes) were screened in duplicate and selected strains analyzed further. This functional genomics approach revealed nearly 100 genes whose absence inhibited or stimulated BMV RNA replication and/or gene expression by 3- to >25-fold. Several of these genes were shown previously to function in BMV replication, validating the approach. Newly identified genes include some in RNA, protein, or membrane modification pathways and genes of unknown function. The results further illuminate virus and cell pathways. Further refinement of virus screening likely will reveal contributions from additional host genes.

  4. Genome-wide transcriptome analysis in the ovaries of two goats identifies differentially expressed genes related to fecundity.

    PubMed

    Miao, Xiangyang; Luo, Qingmiao; Qin, Xiaoyu

    2016-05-10

    The goats are widely kept as livestock throughout the world. Two excellent domestic breeds in China, the Laiwu Black and Jining Grey goats, have different fecundities and prolificacies. Although the goat genome sequences have been resolved recently, little is known about the gene regulations at the transcriptional level in goat. To understand the molecular and genetic mechanisms related to the fecundities and prolificacies, we performed genome-wide sequencing of the mRNAs from two breeds of goat using the next-generation RNA-Seq technology and used functional annotation to identify pathways of interest. Digital gene expression analysis showed 338 genes were up-regulated in the Jining Grey goats and 404 were up-regulated in the Laiwu Black goats. Quantitative real-time PCR verified the reliability of the RNA-Seq data. This study suggests that multiple genes responsible for various biological functions and signaling pathways are differentially expressed in the two different goat breeds, and these genes might be involved in the regulation of goat fecundity and prolificacy. Taken together, our study provides insight into the transcriptional regulation in the ovaries of 2 species of goats that might serve as a key resource for understanding goat fecundity, prolificacy and genetic diversity between species.

  5. Genome-wide significant localization for working and spatial memory: Identifying genes for psychosis using models of cognition.

    PubMed

    Knowles, Emma E M; Carless, Melanie A; de Almeida, Marcio A A; Curran, Joanne E; McKay, D Reese; Sprooten, Emma; Dyer, Thomas D; Göring, Harald H; Olvera, Rene; Fox, Peter; Almasy, Laura; Duggirala, Ravi; Kent, Jack W; Blangero, John; Glahn, David C

    2014-01-01

    It is well established that risk for developing psychosis is largely mediated by the influence of genes, but identifying precisely which genes underlie that risk has been problematic. Focusing on endophenotypes, rather than illness risk, is one solution to this problem. Impaired cognition is a well-established endophenotype of psychosis. Here we aimed to characterize the genetic architecture of cognition using phenotypically detailed models as opposed to relying on general IQ or individual neuropsychological measures. In so doing we hoped to identify genes that mediate cognitive ability, which might also contribute to psychosis risk. Hierarchical factor models of genetically clustered cognitive traits were subjected to linkage analysis followed by QTL region-specific association analyses in a sample of 1,269 Mexican American individuals from extended pedigrees. We identified four genome wide significant QTLs, two for working and two for spatial memory, and a number of plausible and interesting candidate genes. The creation of detailed models of cognition seemingly enhanced the power to detect genetic effects on cognition and provided a number of possible candidate genes for psychosis. PMID:24243780

  6. Genome-wide association links candidate genes to resistance to Plum Pox Virus in apricot (Prunus armeniaca).

    PubMed

    Mariette, Stéphanie; Wong Jun Tai, Fabienne; Roch, Guillaume; Barre, Aurélien; Chague, Aurélie; Decroocq, Stéphane; Groppi, Alexis; Laizet, Yec'han; Lambert, Patrick; Tricon, David; Nikolski, Macha; Audergon, Jean-Marc; Abbott, Albert G; Decroocq, Véronique

    2016-01-01

    In fruit tree species, many important traits have been characterized genetically by using single-family descent mapping in progenies segregating for the traits. However, most mapped loci have not been sufficiently resolved to the individual genes due to insufficient progeny sizes for high resolution mapping and the previous lack of whole-genome sequence resources of the study species. To address this problem for Plum Pox Virus (PPV) candidate resistance gene identification in Prunus species, we implemented a genome-wide association (GWA) approach in apricot. This study exploited the broad genetic diversity of the apricot (Prunus armeniaca) germplasm containing resistance to PPV, next-generation sequence-based genotyping, and the high-quality peach (Prunus persica) genome reference sequence for single nucleotide polymorphism (SNP) identification. The results of this GWA study validated previously reported PPV resistance quantitative trait loci (QTL) intervals, highlighted other potential resistance loci, and resolved each to a limited set of candidate genes for further study. This work substantiates the association genetics approach for resolution of QTL to candidate genes in apricot and suggests that this approach could simplify identification of other candidate genes for other marked trait intervals in this germplasm.

  7. Genome-wide RNAi Screen Identifies Cohesin Genes as Modifiers of Renewal and Differentiation in Human HSCs.

    PubMed

    Galeev, Roman; Baudet, Aurélie; Kumar, Praveen; Rundberg Nilsson, Alexandra; Nilsson, Björn; Soneji, Shamit; Törngren, Therese; Borg, Åke; Kvist, Anders; Larsson, Jonas

    2016-03-29

    To gain insights into the regulatory mechanisms of hematopoietic stem cells (HSCs), we employed a genome-wide RNAi screen in human cord-blood derived cells and identified candidate genes whose knockdown maintained the HSC phenotype during culture. A striking finding was the identification of members of the cohesin complex (STAG2, RAD21, STAG1, and SMC3) among the top 20 genes from the screen. Upon individual validation of these cohesin genes, we found that their knockdown led to an immediate expansion of cells with an HSC phenotype in vitro. A similar expansion was observed in vivo following transplantation to immunodeficient mice. Transcriptome analysis of cohesin-deficient CD34(+) cells showed an upregulation of HSC-specific genes, demonstrating an immediate shift toward a more stem-cell-like gene expression signature upon cohesin deficiency. Our findings implicate cohesin as a major regulator of HSCs and illustrate the power of global RNAi screens to identify modifiers of cell fate. PMID:26997282

  8. Genome-wide analysis of auxin response factor gene family members in medicinal model plant Salvia miltiorrhiza

    PubMed Central

    Xu, Zhichao; Ji, Aijia; Chen, Shilin

    2016-01-01

    ABSTRACT Auxin response factors (ARFs) can function as transcriptional activators or repressors to regulate the expression of auxin response genes by specifically binding to auxin response elements (AuxREs) during plant development. Based on a genome-wide strategy using the medicinal model plant Salvia miltiorrhiza, 25 S. miltiorrhiza ARF (SmARF) gene family members in four classes (class Ia, IIa, IIb and III) were comprehensively analyzed to identify characteristics including gene structures, conserved domains, phylogenetic relationships and expression patterns. In a hybrid analysis of the phylogenetic tree, microRNA targets, and expression patterns of SmARFs in different organs, root tissues, and methyl jasmonate or indole-3-acetic acid treatment conditions, we screened for candidate SmARFs involved in various developmental processes of S. miltiorrhiza. Based on this analysis, we predicted that SmARF25, SmARF7, SmARF16 and SmARF20 are involved in flower, leaf, stem and root development, respectively. With the further insight into the targets of miR160 and miR167, specific SmARF genes in S. miltiorrhiza might encode products that participate in biological processes as described for ARF genes in Arabidopsis. Our results provide a foundation for understanding the molecular basis and regulatory mechanisms of SmARFs in S. miltiorrhiza. PMID:27230647

  9. Genome-wide analysis in human colorectal cancer cells reveals ischemia-mediated expression of motility genes via DNA hypomethylation.

    PubMed

    Skowronski, Karolina; Skowronki, Karolina; Andrews, Joseph; Rodenhiser, David I; Coomber, Brenda L

    2014-01-01

    DNA hypomethylation is an important epigenetic modification found to occur in many different cancer types, leading to the upregulation of previously silenced genes and loss of genomic stability. We previously demonstrated that hypoxia and hypoglycaemia (ischemia), two common micro-environmental changes in solid tumours, decrease DNA methylation through the downregulation of DNMTs in human colorectal cancer cells. Here, we utilized a genome-wide cross-platform approach to identify genes hypomethylated and upregulated by ischemia. Following exposure to hypoxia or hypoglycaemia, methylated DNA from human colorectal cancer cells (HCT116) was immunoprecipitated and analysed with an Affymetrix promoter array. Additionally, RNA was isolated and analysed in parallel with an Affymetrix expression array. Ingenuity pathway analysis software revealed that a significant proportion of the genes hypomethylated and upregulated were involved in cellular movement, including PLAUR and CYR61. A Matrigel invasion assay revealed that indeed HCT116 cells grown in hypoxic or hypoglycaemic conditions have increased mobility capabilities. Confirmation of upregulated expression of cellular movement genes was performed with qPCR. The correlation between ischemia and metastasis is well established in cancer progression, but the molecular mechanisms responsible for this common observation have not been clearly identified. Our novel data suggests that hypoxia and hypoglycaemia may be driving changes in DNA methylation through downregulation of DNMTs. This is the first report to our knowledge that provides an explanation for the increased metastatic potential seen in ischemic cells; i.e. that ischemia could be driving DNA hypomethylation and increasing expression of cellular movement genes.

  10. Genome-wide analysis of auxin response factor gene family members in medicinal model plant Salvia miltiorrhiza.

    PubMed

    Xu, Zhichao; Ji, Aijia; Song, Jingyuan; Chen, Shilin

    2016-01-01

    Auxin response factors (ARFs) can function as transcriptional activators or repressors to regulate the expression of auxin response genes by specifically binding to auxin response elements (AuxREs) during plant development. Based on a genome-wide strategy using the medicinal model plant Salvia miltiorrhiza, 25 S. miltiorrhiza ARF (SmARF) gene family members in four classes (class Ia, IIa, IIb and III) were comprehensively analyzed to identify characteristics including gene structures, conserved domains, phylogenetic relationships and expression patterns. In a hybrid analysis of the phylogenetic tree, microRNA targets, and expression patterns of SmARFs in different organs, root tissues, and methyl jasmonate or indole-3-acetic acid treatment conditions, we screened for candidate SmARFs involved in various developmental processes of S. miltiorrhiza Based on this analysis, we predicted that SmARF25, SmARF7, SmARF16 and SmARF20 are involved in flower, leaf, stem and root development, respectively. With the further insight into the targets of miR160 and miR167, specific SmARF genes in S. miltiorrhiza might encode products that participate in biological processes as described for ARF genes in Arabidopsis Our results provide a foundation for understanding the molecular basis and regulatory mechanisms of SmARFs in S. miltiorrhiza. PMID:27230647

  11. Genome-Wide Identification of the Target Genes of AP2-O, a Plasmodium AP2-Family Transcription Factor

    PubMed Central

    Kaneko, Izumi; Iwanaga, Shiroh; Kato, Tomomi; Kobayashi, Issei; Yuda, Masao

    2015-01-01

    Stage-specific transcription is a fundamental biological process in the life cycle of the Plasmodium parasite. Proteins containing the AP2 DNA-binding domain are responsible for stage-specific transcriptional regulation and belong to the only known family of transcription factors in Plasmodium parasites. Comprehensive identification of their target genes will advance our understanding of the molecular basis of stage-specific transcriptional regulation and stage-specific parasite development. AP2-O is an AP2 family transcription factor that is expressed in the mosquito midgut-invading stage, called the ookinete, and is essential for normal morphogenesis of this stage. In this study, we identified the genome-wide target genes of AP2-O by chromatin immunoprecipitation-sequencing and elucidate how this AP2 family transcription factor contributes to the formation of this motile stage. The analysis revealed that AP2-O binds specifically to the upstream genomic regions of more than 500 genes, suggesting that approximately 10% of the parasite genome is directly regulated by AP2-O. These genes are involved in distinct biological processes such as morphogenesis, locomotion, midgut penetration, protection against mosquito immunity and preparation for subsequent oocyst development. This direct and global regulation by AP2-O provides a model for gene regulation in Plasmodium parasites and may explain how these parasites manage to control their complex life cycle using a small number of sequence-specific AP2 transcription factors. PMID:26018192

  12. Genome-Wide Functional Profiling Identifies Genes and Processes Important for Zinc-Limited Growth of Saccharomyces cerevisiae

    PubMed Central

    Loguinov, Alex V.; Zimmerman, Ginelle R.; Vulpe, Chris D.; Eide, David J.

    2012-01-01

    Zinc is an essential nutrient because it is a required cofactor for many enzymes and transcription factors. To discover genes and processes in yeast that are required for growth when zinc is limiting, we used genome-wide functional profiling. Mixed pools of ∼4,600 deletion mutants were inoculated into zinc-replete and zinc-limiting media. These cells were grown for several generations, and the prevalence of each mutant in the pool was then determined by microarray analysis. As a result, we identified more than 400 different genes required for optimal growth under zinc-limiting conditions. Among these were several targets of the Zap1 zinc-responsive transcription factor. Their importance is consistent with their up-regulation by Zap1 in low zinc. We also identified genes that implicate Zap1-independent processes as important. These include endoplasmic reticulum function, oxidative stress resistance, vesicular trafficking, peroxisome biogenesis, and chromatin modification. Our studies also indicated the critical role of macroautophagy in low zinc growth. Finally, as a result of our analysis, we discovered a previously unknown role for the ICE2 gene in maintaining ER zinc homeostasis. Thus, functional profiling has provided many new insights into genes and processes that are needed for cells to thrive under the stress of zinc deficiency. PMID:22685415

  13. Screening of Tissue-Specific Genes and Promoters in Tomato by Comparing Genome Wide Expression Profiles of Arabidopsis Orthologues

    PubMed Central

    Lim, Chan Ju; Lee, Ha Yeon; Kim, Woong Bom; Lee, Bok-Sim; Kim, Jungeun; Ahmad, Raza; Kim, Hyun A; Yi, So Young; Hur, Cheol-Goo; Kwon, Suk-Yoon

    2012-01-01

    Constitutive overexpression of transgenes occasionally interferes with normal growth and developmental processes in plants. Thus, the development of tissue-specific promoters that drive transgene expression has become agriculturally important. To identify tomato tissue-specific promoters, tissue-specific genes were screened using a series of in silico-based and experimental procedures, including genome-wide orthologue searches of tomato and Arabidopsis databases, isolation of tissue-specific candidates using an Arabidopsis microarray database, and validation of tissue specificity by reverse transcription-polymerase chain reaction (RT-PCR) analysis and promoter assay. Using these procedures, we found 311 tissue-specific candidate genes and validated 10 tissue-specific genes by RT-PCR. Among these identified genes, histochemical analysis of five isolated promoter::GUS transgenic tomato and Arabidopsis plants revealed that their promoters have different but distinct tissue-specific activities in anther, fruit, and root, respectively. Therefore, it appears these in silico-based screening approaches in addition to the identification of new tissue-specific genes and promoters will be helpful for the further development of tailored crop development. PMID:22699756

  14. Genome-wide identification and expression profiling of the late embryogenesis abundant genes in potato with emphasis on dehydrins.

    PubMed

    Charfeddine, Safa; Saïdi, Mohammed Najib; Charfeddine, Mariam; Gargouri-Bouzid, Radhia

    2015-07-01

    Late embryogenesis abundant (LEA) proteins were first described as accumulating late in plant seed development. They were also shown to be involved in plant responses to environmental stress and as well as in bacteria, yeast and invertebrates. They are known to play crucial roles in dehydration tolerance. This study describes a genome-wide analysis of LEA proteins and the corresponding genes in Solanum tuberosum. Twenty-nine LEA family members encoding genes in the Solanum genome were identified. Phylogenetic analyses allowed the classification of the potato LEA proteins into nine distinct groups. Some of them were identified as putative orthologs of Arabidopsis and rice LEA genes. In silico analyses confirmed the hydrophilicity of most of the StLEA proteins, whereas some of them can be folded. The in silico expression analyses showed that the identified genes displayed tissue-specific, stress and hormone-responsive expression profiles. Five StLEA classified as dehydrins were selected for expression analyses under salt and drought stresses. The data revealed that they were induced by both stresses. The analyses indicate that several factors such us developmental stages, hormones, and dehydration, can regulate the expression and activities of LEA protein. This report can be helpful for the further functional diversity studies and analyses of LEA proteins in potato. These genes can be overexpressed to improve potato abiotic stress response. PMID:25638043

  15. Genome-wide association links candidate genes to resistance to Plum Pox Virus in apricot (Prunus armeniaca).

    PubMed

    Mariette, Stéphanie; Wong Jun Tai, Fabienne; Roch, Guillaume; Barre, Aurélien; Chague, Aurélie; Decroocq, Stéphane; Groppi, Alexis; Laizet, Yec'han; Lambert, Patrick; Tricon, David; Nikolski, Macha; Audergon, Jean-Marc; Abbott, Albert G; Decroocq, Véronique

    2016-01-01

    In fruit tree species, many important traits have been characterized genetically by using single-family descent mapping in progenies segregating for the traits. However, most mapped loci have not been sufficiently resolved to the individual genes due to insufficient progeny sizes for high resolution mapping and the previous lack of whole-genome sequence resources of the study species. To address this problem for Plum Pox Virus (PPV) candidate resistance gene identification in Prunus species, we implemented a genome-wide association (GWA) approach in apricot. This study exploited the broad genetic diversity of the apricot (Prunus armeniaca) germplasm containing resistance to PPV, next-generation sequence-based genotyping, and the high-quality peach (Prunus persica) genome reference sequence for single nucleotide polymorphism (SNP) identification. The results of this GWA study validated previously reported PPV resistance quantitative trait loci (QTL) intervals, highlighted other potential resistance loci, and resolved each to a limited set of candidate genes for further study. This work substantiates the association genetics approach for resolution of QTL to candidate genes in apricot and suggests that this approach could simplify identification of other candidate genes for other marked trait intervals in this germplasm. PMID:26356603

  16. Genome-wide transcriptome analysis in the ovaries of two goats identifies differentially expressed genes related to fecundity.

    PubMed

    Miao, Xiangyang; Luo, Qingmiao; Qin, Xiaoyu

    2016-05-10

    The goats are widely kept as livestock throughout the world. Two excellent domestic breeds in China, the Laiwu Black and Jining Grey goats, have different fecundities and prolificacies. Although the goat genome sequences have been resolved recently, little is known about the gene regulations at the transcriptional level in goat. To understand the molecular and genetic mechanisms related to the fecundities and prolificacies, we performed genome-wide sequencing of the mRNAs from two breeds of goat using the next-generation RNA-Seq technology and used functional annotation to identify pathways of interest. Digital gene expression analysis showed 338 genes were up-regulated in the Jining Grey goats and 404 were up-regulated in the Laiwu Black goats. Quantitative real-time PCR verified the reliability of the RNA-Seq data. This study suggests that multiple genes responsible for various biological functions and signaling pathways are differentially expressed in the two different goat breeds, and these genes might be involved in the regulation of goat fecundity and prolificacy. Taken together, our study provides insight into the transcriptional regulation in the ovaries of 2 species of goats that might serve as a key resource for understanding goat fecundity, prolificacy and genetic diversity between species. PMID:26851539

  17. Genome Wide Host Gene Expression Analysis in Chicken Lungs Infected with Avian Influenza Viruses

    PubMed Central

    Gandhale, Pradeep N.; Kumar, Himanshu; Kulkarni, Diwakar D.

    2016-01-01

    The molecular pathogenesis of avian influenza infection varies greatly with individual bird species and virus strain. The molecular pathogenesis of the highly pathogenic avian influenza virus (HPAIV) or the low pathogenic avian influenza virus (LPAIV) infection in avian species remains poorly understood. Thus, global immune response of chickens infected with HPAI H5N1 (A/duck/India/02CA10/2011) and LPAI H9N2 (A/duck/India/249800/2010) viruses was studied using microarray to identify crucial host genetic components responsive to these infection. HPAI H5N1 virus induced excessive expression of type I IFNs (IFNA and IFNG), cytokines (IL1B, IL18, IL22, IL13, and IL12B), chemokines (CCL4, CCL19, CCL10, and CX3CL1) and IFN stimulated genes (OASL, MX1, RSAD2, IFITM5, IFIT5, GBP 1, and EIF2AK) in lung tissues. This dysregulation of host innate immune genes may be the critical determinant of the severity and the outcome of the influenza infection in chickens. In contrast, the expression levels of most of these genes was not induced in the lungs of LPAI H9N2 virus infected chickens. This study indicated the relationship between host immune genes and their roles in pathogenesis of HPAIV infection in chickens. PMID:27071061

  18. A Genome-Wide Screen for Dendritically Localized RNAs Identifies Genes Required for Dendrite Morphogenesis.

    PubMed

    Misra, Mala; Edmund, Hendia; Ennis, Darragh; Schlueter, Marissa A; Marot, Jessica E; Tambasco, Janet; Barlow, Ida; Sigurbjornsdottir, Sara; Mathew, Renjith; Vallés, Ana Maria; Wojciech, Waldemar; Roth, Siegfried; Davis, Ilan; Leptin, Maria; Gavis, Elizabeth R

    2016-01-01

    Localizing messenger RNAs at specific subcellular sites is a conserved mechanism for targeting the synthesis of cytoplasmic proteins to distinct subcellular domains, thereby generating the asymmetric protein distributions necessary for cellular and developmental polarity. However, the full range of transcripts that are asymmetrically distributed in specialized cell types, and the significance of their localization, especially in the nervous system, are not known. We used the EP-MS2 method, which combines EP transposon insertion with the MS2/MCP in vivo fluorescent labeling system, to screen for novel localized transcripts in polarized cells, focusing on the highly branched Drosophila class IV dendritic arborization neurons. Of a total of 541 lines screened, we identified 55 EP-MS2 insertions producing transcripts that were enriched in neuronal processes, particularly in dendrites. The 47 genes identified by these insertions encode molecularly diverse proteins, and are enriched for genes that function in neuronal development and physiology. RNAi-mediated knockdown confirmed roles for many of the candidate genes in dendrite morphogenesis. We propose that the transport of mRNAs encoded by these genes into the dendrites allows their expression to be regulated on a local scale during the dynamic developmental processes of dendrite outgrowth, branching, and/or remodeling. PMID:27260999

  19. Specific alleles at immune genes, rather than genome-wide heterozygosity, are related to immunity and survival in the critically endangered Attwater's prairie-chicken.

    PubMed

    Bateson, Zachary W; Hammerly, Susan C; Johnson, Jeff A; Morrow, Michael E; Whittingham, Linda A; Dunn, Peter O

    2016-10-01

    The negative effects of inbreeding on fitness are serious concerns for populations of endangered species. Reduced fitness has been associated with lower genome-wide heterozygosity and immune gene diversity in the wild; however, it is rare that both types of genetic measures are included in the same study. Thus, it is often unclear whether the variation in fitness is due to the general effects of inbreeding, immunity-related genes or both. Here, we tested whether genome-wide heterozygosity (20 990 SNPs) and diversity at nine immune genes were better predictors of two measures of fitness (immune response and survival) in the endangered Attwater's prairie-chicken (Tympanuchus cupido attwateri). We found that postrelease survival of captive-bred birds was related to alleles of the innate (Toll-like receptors, TLRs) and adaptive (major histocompatibility complex, MHC) immune systems, but not to genome-wide heterozygosity. Likewise, we found that the immune response at the time of release was related to TLR and MHC alleles, and not to genome-wide heterozygosity. Overall, this study demonstrates that immune genes may serve as important genetic markers when monitoring fitness in inbred populations and that in some populations specific functional genes may be better predictors of fitness than genome-wide heterozygosity.

  20. Comprehensive analysis of CCCH-type zinc finger gene family in citrus (Clementine mandarin) by genome-wide characterization.

    PubMed

    Liu, Shengrui; Khan, Muhammad Rehman Gul; Li, Yongping; Zhang, Jinzhi; Hu, Chungen

    2014-10-01

    The CCCH-type zinc finger proteins comprise a large gene family of regulatory proteins and are widely distributed in eukaryotic organisms. The CCCH proteins have been implicated in multiple biological processes and environmental responses in plants. Little information is available, however, about CCCH genes in plants, especially in woody plants such as citrus. The release of the whole-genome sequence of citrus allowed us to perform a genome-wide analysis of CCCH genes and to compare the identified proteins with their orthologs in model plants. In this study, 62 CCCH genes and a total of 132 CCCH motifs were identified, and a comprehensive analysis including the chromosomal locations, phylogenetic relationships, functional annotations, gene structures and conserved motifs was performed. Distribution mapping revealed that 54 of the 62 CCCH genes are unevenly dispersed on the nine citrus chromosomes. Based on phylogenetic analysis and gene structural features, we constructed 5 subfamilies of 62 CCCH members and integrative subfamilies from citrus, Arabidopsis, and rice, respectively. Importantly, large numbers of SNPs and InDels in 26 CCCH genes were identified from Poncirus trifoliata and Fortunella japonica using whole-genome deep re-sequencing. Furthermore, citrus CCCH genes showed distinct temporal and spatial expression patterns in different developmental processes and in response to various stress conditions. Our comprehensive analysis of CleC3Hs is a valuable resource that further elucidates the roles of CCCH family members in plant growth and development. In addition, variants and comparative genomics analyses deepen our understanding of the evolution of the CCCH gene family and will contribute to further genetics and genomics studies of citrus and other plant species.

  1. Genome-Wide Analysis of the AP2/ERF Superfamily Genes and their Responses to Abiotic Stress in Medicago truncatula

    PubMed Central

    Shu, Yongjun; Liu, Ying; Zhang, Jun; Song, Lili; Guo, Changhong

    2016-01-01

    The AP2/ERF superfamily is a large, plant-specific transcription factor family that is involved in many important processes, including plant growth, development, and stress responses. Using Medicago truncatula genome information, we identified and characterized 123 putative AP2/ERF genes, which were named as MtERF1–123. These genes were classified into four families based on phylogenetic analysis, which is consistent with the results of other plant species. MtERF genes are distributed throughout all chromosomes but are clustered on various chromosomes due to genomic tandem and segmental duplication. Using transcriptome, high-throughput sequencing data, and qRT-PCR analysis, we assessed the expression patterns of the MtERF genes in tissues during development and under abiotic stresses. In total, 87 MtERF genes were expressed in plant tissues, most of which were expressed in specific tissues during development or under specific abiotic stress treatments. These results support the notion that MtERF genes are involved in developmental regulation and environmental responses in M. truncatula. Furthermore, a cluster of DREB subfamily members on chromosome 6 was induced by both cold and freezing stress, representing a positive gene regulatory response under low temperature stress, which suggests that these genes might contribute to freezing tolerance to M. truncatula. In summary, our genome-wide characterization, evolutionary analysis, and expression pattern analysis of MtERF genes in M. truncatula provides valuable information for characterizing the molecular functions of these genes and utilizing them to improve stress tolerance in plants. PMID:26834762

  2. Genome Wide Screening of Candidate Genes for Improving Piglet Birth Weight Using High and Low Estimated Breeding Value Populations

    PubMed Central

    Zhang, Lifan; Zhou, Xiang; Michal, Jennifer J.; Ding, Bo; Li, Rui; Jiang, Zhihua

    2014-01-01

    Birth weight is an economically important trait in pig production because it directly impacts piglet growth and survival rate. In the present study, we performed a genome wide survey of candidate genes and pathways associated with individual birth weight (IBW) using the Illumina PorcineSNP60 BeadChip on 24 high (HEBV) and 24 low estimated breeding value (LEBV) animals. These animals were selected from a reference population of 522 individuals produced by three sires and six dam lines, which were crossbreds with multiple breeds. After quality-control, 43,257 SNPs (single nucleotide polymorphisms), including 42,243 autosomal SNPs and 1,014 SNPs on chromosome X, were used in the data analysis. A total of 27 differentially selected regions (DSRs), including 1 on Sus scrofa chromosome 1 (SSC1), 1 on SSC4, 2 on SSC5, 4 on SSC6, 2 on SSC7, 5 on SSC8, 3 on SSC9, 1 on SSC14, 3 on SSC18, and 5 on SSCX, were identified to show the genome wide separations between the HEBV and LEBV groups for IBW in piglets. A DSR with the most number of significant SNPs (including 7 top 0.1% and 31 top 5% SNPs) was located on SSC6, while another DSR with the largest genetic differences in FST was found on SSC18. These regions harbor known functionally important genes involved in growth and development, such as TNFRSF9 (tumor necrosis factor receptor superfamily member 9), CA6 (carbonic anhydrase VI) and MDFIC (MyoD family inhibitor domain containing). A DSR rich in imprinting genes appeared on SSC9, which included PEG10 (paternally expressed 10), SGCE (sarcoglycan, epsilon), PPP1R9A (protein phosphatase 1, regulatory subunit 9A) and ASB4 (ankyrin repeat and SOCS box containing 4). More importantly, our present study provided evidence to support six quantitative trait loci (QTL) regions for pig birth weight, six QTL regions for average birth weight (ABW) and three QTL regions for litter birth weight (LBW) reported previously by other groups. Furthermore, gene ontology analysis with 183 genes

  3. Genome-wide studies highlight indirect links between human replication origins and gene regulation.

    PubMed

    Cadoret, Jean-Charles; Meisch, Françoise; Hassan-Zadeh, Vahideh; Luyten, Isabelle; Guillet, Claire; Duret, Laurent; Quesneville, Hadi; Prioleau, Marie-Noëlle

    2008-10-14

    To get insights into the regulation of replication initiation, we systematically mapped replication origins along 1% of the human genome in HeLa cells. We identified 283 origins, 10 times more than previously known. Origin density is strongly correlated with genomic landscapes, with clusters of closely spaced origins in GC-rich regions and no origins in large GC-poor regions. Origin sequences are evolutionarily conserved, and half of them map within or near CpG islands. Most of the origins overlap transcriptional regulatory elements, providing further evidence of a connection with gene regulation. Moreover, we identify c-JUN and c-FOS as important regulators of origin selection. Half of the identified replication initiation sites do not have an open chromatin configuration, showing the absence of a direct link with gene regulation. Replication timing analyses coupled with our origin mapping suggest that a relatively strict origin-timing program regulates the replication of the human genome.

  4. Genome-wide gene expression profiling to predict resistance to anthracyclines in breast cancer patients

    PubMed Central

    Haibe-Kains, B.; Desmedt, C.; Di Leo, A.; Azambuja, E.; Larsimont, D.; Selleslags, J.; Delaloge, S.; Duhem, C.; Kains, J.P.; Carly, B.; Maerevoet, M.; Vindevoghel, A.; Rouas, G.; Lallemand, F.; Durbecq, V.; Cardoso, F.; Salgado, R.; Rovere, R.; Bontempi, G.; Michiels, S.; Buyse, M.; Nogaret, J.M.; Qi, Y.; Symmans, F.; Pusztai, L.; D'Hondt, V.; Piccart-Gebhart, M.; Sotiriou, C.

    2013-01-01

    Validated biomarkers predictive of response/resistance to anthracyclines in breast cancer are currently lacking. The neoadjuvant Trial of Principle (TOP) study, in which patients with estrogen receptor (ER)–negative tumors were treated with anthracycline (epirubicin) monotherapy, was specifically designed to evaluate the predictive value of topoisomerase II-alpha (TOP2A) and develop a gene expression signature to identify those patients who do not benefit from anthracyclines. Here we describe in details the contents and quality controls for the gene expression and clinical data associated with the study published by Desmedt and colleagues in the Journal of Clinical Oncology in 2011 (Desmedt et al., 2011). We also provide R code to easily access the data and perform the quality controls and basic analyses relevant to this dataset. PMID:26484051

  5. Genome Wide Binding Site Analysis Reveals Transcriptional Coactivation of Cytokinin-Responsive Genes by DELLA Proteins

    PubMed Central

    Marín-de la Rosa, Nora; Pfeiffer, Anne; Hill, Kristine; Locascio, Antonella; Bhalerao, Rishikesh P.; Miskolczi, Pal; Grønlund, Anne L.; Wanchoo-Kohli, Aakriti; Thomas, Stephen G.; Bennett, Malcolm J.; Lohmann, Jan U.; Blázquez, Miguel A.; Alabadí, David

    2015-01-01

    The ability of plants to provide a plastic response to environmental cues relies on the connectivity between signaling pathways. DELLA proteins act as hubs that relay environmental information to the multiple transcriptional circuits that control growth and development through physical interaction with transcription factors from different families. We have analyzed the presence of one DELLA protein at the Arabidopsis genome by chromatin immunoprecipitation coupled to large-scale sequencing and we find that it binds at the promoters of multiple genes. Enrichment analysis shows a strong preference for cis elements recognized by specific transcription factor families. In particular, we demonstrate that DELLA proteins are recruited by type-B ARABIDOPSIS RESPONSE REGULATORS (ARR) to the promoters of cytokinin-regulated genes, where they act as transcriptional co-activators. The biological relevance of this mechanism is underpinned by the necessity of simultaneous presence of DELLAs and ARRs to restrict root meristem growth and to promote photomorphogenesis. PMID:26134422

  6. Genome-wide gene expression profiling to predict resistance to anthracyclines in breast cancer patients.

    PubMed

    Haibe-Kains, B; Desmedt, C; Di Leo, A; Azambuja, E; Larsimont, D; Selleslags, J; Delaloge, S; Duhem, C; Kains, J P; Carly, B; Maerevoet, M; Vindevoghel, A; Rouas, G; Lallemand, F; Durbecq, V; Cardoso, F; Salgado, R; Rovere, R; Bontempi, G; Michiels, S; Buyse, M; Nogaret, J M; Qi, Y; Symmans, F; Pusztai, L; D'Hondt, V; Piccart-Gebhart, M; Sotiriou, C

    2013-12-01

    Validated biomarkers predictive of response/resistance to anthracyclines in breast cancer are currently lacking. The neoadjuvant Trial of Principle (TOP) study, in which patients with estrogen receptor (ER)-negative tumors were treated with anthracycline (epirubicin) monotherapy, was specifically designed to evaluate the predictive value of topoisomerase II-alpha (TOP2A) and develop a gene expression signature to identify those patients who do not benefit from anthracyclines. Here we describe in details the contents and quality controls for the gene expression and clinical data associated with the study published by Desmedt and colleagues in the Journal of Clinical Oncology in 2011 (Desmedt et al., 2011). We also provide R code to easily access the data and perform the quality controls and basic analyses relevant to this dataset. PMID:26484051

  7. Genomic-wide transcriptional profiling in primary myoblasts reveals Runx1-regulated genes in muscle regeneration

    PubMed Central

    Umansky, Kfir Baruch; Feldmesser, Ester; Groner, Yoram

    2015-01-01

    In response to muscle damage the muscle adult stem cells are activated and differentiate into myoblasts that regenerate the damaged tissue. We have recently showed that following myopathic damage the level of the Runx1 transcription factor (TF) is elevated and that during muscle regeneration this TF regulates the balance between myoblast proliferation and differentiation (Umansky et al.). We employed Runx1-dependent gene expression, Chromatin Immunoprecipitation sequencing (ChIP-seq), Assay for Transposase-Accessible Chromatin with high-throughput sequencing (ATAC-seq) and histone H3K4me1/H3K27ac modification analyses to identify a subset of Runx1-regulated genes that are co-occupied by the TFs MyoD and c-Jun and are involved in muscle regeneration (Umansky et al.). The data is available at the GEO database under the superseries accession number GSE56131. PMID:26697350

  8. A genome-wide analysis of antimicrobial effector genes and their transcription patterns in Manduca sexta.

    PubMed

    He, Yan; Cao, Xiaolong; Li, Kai; Hu, Yingxia; Chen, Yun-ru; Blissard, Gary; Kanost, Michael R; Jiang, Haobo

    2015-07-01

    Antimicrobial proteins/peptides (AMPs) are effectors of innate immune systems against pathogen infection in multicellular organisms. Over half of the AMPs reported so far come from insects, and these effectors act in concert to suppress or kill bacteria, fungi, viruses, and parasites. In this work, we have identified 86 AMP genes in the Manduca sexta genome, most of which seem likely to be functional. They encode 15 cecropins, 6 moricins, 6 defensins, 3 gallerimycins, 4 X-tox splicing variants, 14 diapausins, 15 whey acidic protein homologs, 11 attacins, 1 gloverin, 4 lebocins, 6 lysozyme-related proteins, and 4 transferrins. Some of these genes (e.g. attacins, cecropins) constitute large clusters, likely arising after rounds of gene duplication. We compared the amino acid sequences of M. sexta AMPs with their homologs in other insects to reveal conserved structural features and phylogenetic relationships. Expression data showed that many of them are synthesized in fat body and midgut during the larval-pupal molt. Certain genes contain one or more predicted κB binding sites and other regulatory elements in their promoter regions, which may account for the dramatic mRNA level increases in fat body and hemocytes after an immune challenge. Consistent with these strong mRNA increases, many AMPs become highly abundant in the larval plasma at 24 h after the challenge, as demonstrated in our previous peptidomic study. Taken together, these data suggest the existence of a large repertoire of AMPs in M. sexta, whose expression is up-regulated via immune signaling pathways to fight off invading pathogens in a coordinated manner. PMID:25662101

  9. C1-Pathways in Methyloversatilis universalis FAM5: Genome Wide Gene Expression and Mutagenesis Studies

    PubMed Central

    Good, Nathan M.; Lamb, Andrew; Beck, David A. C.; Martinez-Gomez, N. Cecilia; Kalyuzhnaya, Marina G.

    2015-01-01

    Methyloversatilis universalis FAM5 utilizes single carbon compounds such as methanol or methylamine as a sole source of carbon and energy. Expression profiling reveals distinct sets of genes altered during growth on methylamine vs methanol. As expected, all genes for the N-methylglutamate pathway were induced during growth on methylamine. Among other functions responding to the aminated source of C1-carbon, are a heme-containing amine dehydrogenase (Qhp), a distant homologue of formaldehyde activating enzyme (Fae3), molybdenum-containing formate dehydrogenase, ferredoxin reductase, a set of homologues to urea/ammonium transporters and amino-acid permeases. Mutants lacking one of the functional subunits of the amine dehydrogenase (ΔqhpA) or Δfae3 showed no growth defect on C1-compounds. M. universalis FAM5 strains with a lesion in the H4-folate pathway were not able to use any C1-compound, methanol or methylamine. Genes essential for C1-assimilation (the serine cycle and glyoxylate shunt) and H4MTP-pathway for formaldehyde oxidation showed similar levels of expression on both C1-carbon sources. M. universalis FAM5 possesses three homologs of the formaldehyde activating enzyme, a key enzyme of the H4MTP-pathway. Strains lacking the canonical Fae (fae1) lost the ability to grow on both C1-compounds. However, upon incubation on methylamine the fae1-mutant produced revertants (Δfae1R), which regained the ability to grow on methylamine. Double and triple mutants (Δfae1RΔfae3, or Δfae1RΔfae2 or Δfae1RΔfae2Δfae3) constructed in the revertant strain background showed growth similar to the Δfae1R phenotype. The metabolic pathways for utilization of methanol and methylamine in Methyloversatilis universalis FAM5 are reconstructed based on these gene expression and phenotypic data.

  10. C1-Pathways in Methyloversatilis universalis FAM5: Genome Wide Gene Expression and Mutagenesis Studies

    PubMed Central

    Good, Nathan M.; Lamb, Andrew; Beck, David A. C.; Martinez-Gomez, N. Cecilia; Kalyuzhnaya, Marina G.

    2015-01-01

    Methyloversatilis universalis FAM5 utilizes single carbon compounds such as methanol or methylamine as a sole source of carbon and energy. Expression profiling reveals distinct sets of genes altered during growth on methylamine vs methanol. As expected, all genes for the N-methylglutamate pathway were induced during growth on methylamine. Among other functions responding to the aminated source of C1-carbon, are a heme-containing amine dehydrogenase (Qhp), a distant homologue of formaldehyde activating enzyme (Fae3), molybdenum-containing formate dehydrogenase, ferredoxin reductase, a set of homologues to urea/ammonium transporters and amino-acid permeases. Mutants lacking one of the functional subunits of the amine dehydrogenase (ΔqhpA) or Δfae3 showed no growth defect on C1-compounds. M. universalis FAM5 strains with a lesion in the H4-folate pathway were not able to use any C1-compound, methanol or methylamine. Genes essential for C1-assimilation (the serine cycle and glyoxylate shunt) and H4MTP-pathway for formaldehyde oxidation showed similar levels of expression on both C1-carbon sources. M. universalis FAM5 possesses three homologs of the formaldehyde activating enzyme, a key enzyme of the H4MTP-pathway. Strains lacking the canonical Fae (fae1) lost the ability to grow on both C1-compounds. However, upon incubation on methylamine the fae1-mutant produced revertants (Δfae1R), which regained the ability to grow on methylamine. Double and triple mutants (Δfae1RΔfae3, or Δfae1RΔfae2 or Δfae1RΔfae2Δfae3) constructed in the revertant strain background showed growth similar to the Δfae1R phenotype. The metabolic pathways for utilization of methanol and methylamine in Methyloversatilis universalis FAM5 are reconstructed based on these gene expression and phenotypic data. PMID:27682085

  11. A genome-wide analysis of antimicrobial effector genes and their transcription patterns in Manduca sexta

    PubMed Central

    He, Yan; Cao, Xiaolong; Li, Kai; Hu, Yingxia; Chen, Yun-ru; Blissard, Gary; Kanost, Michael R.; Jiang, Haobo

    2015-01-01

    Antimicrobial proteins/peptides (AMPs) are effectors of innate immune systems against pathogen infection in multicellular organisms. Over half of the AMPs reported so far come from insects, and these effectors act in concert to suppress or kill bacteria, fungi, viruses, and parasites. In this work, we have identified 86 AMP genes in the Manduca sexta genome, most of which seem likely to be functional. They encode 15 cecropins, 6 moricins, 6 defensins, 3 gallerimycins, 4 X-tox splicing variants, 14 diapausins, 15 whey acidic protein homologs, 11 attacins, 1 gloverin, 4 lebocins, 6 lysozyme-related proteins, and 4 transferrins. Some of these genes (e.g. attacins, cecropins) constitute large clusters, likely arising after rounds of gene duplication. We compared the amino acid sequences of M. sexta AMPs with their homologs in other insects to reveal conserved structural features and phylogenetic relationships. Expression data showed that many of them are synthesized in fat body and midgut during the larval-pupal molt. Certain genes contain one or more predicted κB binding sites and other regulatory elements in their promoter regions, which may account for the dramatic mRNA level increases in fat body and hemocytes after an immune challenge. Consistent with these strong mRNA increases, many AMPs become highly abundant in the larval plasma at 24 h after the challenge, as demonstrated in our previous peptidomic study. Taken together, these data suggest the existence of a large repertoire of AMPs in M. sexta, whose expression is up-regulated via immune signaling pathways to fight off invading pathogens in a coordinated manner. PMID:25662101

  12. Genome-Wide Identification of Klebsiella pneumoniae Fitness Genes during Lung Infection

    PubMed Central

    Breen, Paul; Deornellas, Valerie; Mu, Qiao; Zhao, Lili; Wu, Weisheng; Cavalcoli, James D.; Mobley, Harry L. T.

    2015-01-01

    ABSTRACT Klebsiella pneumoniae is an urgent public health threat because of resistance to carbapenems, antibiotics of last resort against Gram-negative bacterial infections. Despite the fact that K. pneumoniae is a leading cause of pneumonia in hospitalized patients, the bacterial factors required to cause disease are poorly understood. Insertion site sequencing combines transposon mutagenesis with high-throughput sequencing to simultaneously screen thousands of insertion mutants for fitness defects during infection. Using the recently sequenced K. pneumoniae strain KPPR1 in a well-established mouse model of pneumonia, insertion site sequencing was performed on a pool of >25,000 transposon mutants. The relative fitness requirement of each gene was ranked based on the ratio of lung to inoculum read counts and concordance between insertions in the same gene. This analysis revealed over 300 mutants with at least a 2-fold fitness defect and 69 with defects ranging from 10- to >2,000-fold. Construction of 6 isogenic mutants for use in competitive infections with the wild type confirmed their requirement for lung fitness. Critical fitness genes included those for the synthesis of branched-chain and aromatic amino acids that are essential in mice and humans, the transcriptional elongation factor RfaH, and the copper efflux pump CopA. The majority of fitness genes were conserved among reference strains representative of diverse pathotypes. These results indicate that regulation of outer membrane components and synthesis of amino acids that are essential to its host are critical for K. pneumoniae fitness in the lung. PMID:26060277

  13. Genome-wide mapping of furfural tolerance genes in Escherichia coli.

    PubMed

    Glebes, Tirzah Y; Sandoval, Nicholas R; Reeder, Philippa J; Schilling, Katherine D; Zhang, Min; Gill, Ryan T

    2014-01-01

    Advances in genomics have improved the ability to map complex genotype-to-phenotype relationships, like those required for engineering chemical tolerance. Here, we have applied the multiSCale Analysis of Library Enrichments (SCALEs; Lynch et al. (2007) Nat. Method.) approach to map, in parallel, the effect of increased dosage for >10(5) different fragments of the Escherichia coli genome onto furfural tolerance (furfural is a key toxin of lignocellulosic hydrolysate). Only 268 of >4,000 E. coli genes (∼ 6%) were enriched after growth selections in the presence of furfural. Several of the enriched genes were cloned and tested individually for their effect on furfural tolerance. Overexpression of thyA, lpcA, or groESL individually increased growth in the presence of furfural. Overexpression of lpcA, but not groESL or thyA, resulted in increased furfural reduction rate, a previously identified mechanism underlying furfural tolerance. We additionally show that plasmid-based expression of functional LpcA or GroESL is required to confer furfural tolerance. This study identifies new furfural tolerant genes, which can be applied in future strain design efforts focused on the production of fuels and chemicals from lignocellulosic hydrolysate.

  14. Identification of candidate genes associated with porcine meat color traits by genome-wide transcriptome analysis

    PubMed Central

    Li, Bojiang; Dong, Chao; Li, Pinghua; Ren, Zhuqing; Wang, Han; Yu, Fengxiang; Ning, Caibo; Liu, Kaiqing; Wei, Wei; Huang, Ruihua; Chen, Jie; Wu, Wangjun; Liu, Honglin

    2016-01-01

    Meat color is considered to be the most important indicator of meat quality, however, the molecular mechanisms underlying traits related to meat color remain mostly unknown. In this study, to elucidate the molecular basis of meat color, we constructed six cDNA libraries from biceps femoris (Bf) and soleus (Sol), which exhibit obvious differences in meat color, and analyzed the whole-transcriptome differences between Bf (white muscle) and Sol (red muscle) using high-throughput sequencing technology. Using DEseq2 method, we identified 138 differentially expressed genes (DEGs) between Bf and Sol. Using DEGseq method, we identified 770, 810, and 476 DEGs in comparisons between Bf and Sol in three separate animals. Of these DEGs, 52 were overlapping DEGs. Using these data, we determined the enriched GO terms, metabolic pathways and candidate genes associated with meat color traits. Additionally, we mapped 114 non-redundant DEGs to the meat color QTLs via a comparative analysis with the porcine quantitative trait loci (QTL) database. Overall, our data serve as a valuable resource for identifying genes whose functions are critical for meat color traits and can accelerate studies of the molecular mechanisms of meat color formation. PMID:27748458

  15. Genome-Wide Mapping of Furfural Tolerance Genes in Escherichia coli

    PubMed Central

    Glebes, Tirzah Y.; Sandoval, Nicholas R.; Reeder, Philippa J.; Schilling, Katherine D.; Zhang, Min; Gill, Ryan T.

    2014-01-01

    Advances in genomics have improved the ability to map complex genotype-to-phenotype relationships, like those required for engineering chemical tolerance. Here, we have applied the multiSCale Analysis of Library Enrichments (SCALEs; Lynch et al. (2007) Nat. Method.) approach to map, in parallel, the effect of increased dosage for >105 different fragments of the Escherichia coli genome onto furfural tolerance (furfural is a key toxin of lignocellulosic hydrolysate). Only 268 of >4,000 E. coli genes (∼6%) were enriched after growth selections in the presence of furfural. Several of the enriched genes were cloned and tested individually for their effect on furfural tolerance. Overexpression of thyA, lpcA, or groESL individually increased growth in the presence of furfural. Overexpression of lpcA, but not groESL or thyA, resulted in increased furfural reduction rate, a previously identified mechanism underlying furfural tolerance. We additionally show that plasmid-based expression of functional LpcA or GroESL is required to confer furfural tolerance. This study identifies new furfural tolerant genes, which can be applied in future strain design efforts focused on the production of fuels and chemicals from lignocellulosic hydrolysate. PMID:24489935

  16. Genome-Wide Functional Profiling Reveals Genes Required for Tolerance to Benzene Metabolites in Yeast

    PubMed Central

    North, Matthew; Tandon, Vickram J.; Thomas, Reuben; Loguinov, Alex; Gerlovina, Inna; Hubbard, Alan E.; Zhang, Luoping; Smith, Martyn T.; Vulpe, Chris D.

    2011-01-01

    Benzene is a ubiquitous environmental contaminant and is widely used in industry. Exposure to benzene causes a number of serious health problems, including blood disorders and leukemia. Benzene undergoes complex metabolism in humans, making mechanistic determination of benzene toxicity difficult. We used a functional genomics approach to identify the genes that modulate the cellular toxicity of three of the phenolic metabolites of benzene, hydroquinone (HQ), catechol (CAT) and 1,2,4-benzenetriol (BT), in the model eukaryote Saccharomyces cerevisiae. Benzene metabolites generate oxidative and cytoskeletal stress, and tolerance requires correct regulation of iron homeostasis and the vacuolar ATPase. We have identified a conserved bZIP transcription factor, Yap3p, as important for a HQ-specific response pathway, as well as two genes that encode putative NAD(P)H:quinone oxidoreductases, PST2 and YCP4. Many of the yeast genes identified have human orthologs that may modulate human benzene toxicity in a similar manner and could play a role in benzene exposure-related disease. PMID:21912624

  17. Genome-Wide Characterization of Light-Regulated Genes in Neurospora crassa

    PubMed Central

    Wu, Cheng; Yang, Fei; Smith, Kristina M.; Peterson, Matthew; Dekhang, Rigzin; Zhang, Ying; Zucker, Jeremy; Bredeweg, Erin L.; Mallappa, Chandrashekara; Zhou, Xiaoying; Lyubetskaya, Anna; Townsend, Jeffrey P.; Galagan, James E.; Freitag, Michael; Dunlap, Jay C.; Bell-Pedersen, Deborah; Sachs, Matthew S.

    2014-01-01

    The filamentous fungus Neurospora crassa responds to light in complex ways. To thoroughly study the transcriptional response of this organism to light, RNA-seq was used to analyze capped and polyadenylated mRNA prepared from mycelium grown for 24 hr in the dark and then exposed to light for 0 (control) 15, 60, 120, and 240 min. More than three-quarters of all defined protein coding genes (79%) were expressed in these cells. The increased sensitivity of RNA-seq compared with previous microarray studies revealed that the RNA levels for 31% of expressed genes were affected two-fold or more by exposure to light. Additionally, a large class of mRNAs, enriched for transcripts specifying products involved in rRNA metabolism, showed decreased expression in response to light, indicating a heretofore undocumented effect of light on this pathway. Based on measured changes in mRNA levels, light generally increases cellular metabolism and at the same time causes significant oxidative stress to the organism. To deal with this stress, protective photopigments are made, antioxidants are produced, and genes involved in ribosome biogenesis are transiently repressed. PMID:25053707

  18. Genome-Wide Identification and Characterization of MicroRNAs and Target Genes in Lonicera japonica

    PubMed Central

    Wu, Gang; Fu, Chunhua; Long, Yan; Xiang, Jun; Gan, Jianping; Zhou, Yanhong; Yu, Longjiang; Li, Maoteng

    2016-01-01

    MiRNAs function in post-transcriptional regulation of gene expression and play very important roles in plant development. Lonicera japonica is one of the important medicinal plants in China. However, few studies on the discovery of conserved and novel miRNAs from L. japonica were reported. In this study, we employed deep sequencing technology to identify miRNAs in leaf and flower tissues of L. japonica. A total of 22.97 million clean reads from flower and leaf tissues were obtained, which generated 146 conserved miRNAs distributed in 20 families and 110 novel miRNAs. Accordingly, 72 differentially expressed miRNAs (P≤0.001) between leaves and flowers and their potential target genes were identified and validated. The qRT-PCR validation showed that majority of the differentially expressed miRNAs showed significant tissue-specific expression in L. japonica. Furthermore, the miRNA-mRNA and mRNA-mRNA regulatory networks were constructed using Cytoscape software. Taken together, this study identified a large number of miRNAs and target genes in L. japonica, which not only provides the first global miRNA expression profiles, but also sheds light on functional genomics research on L. japonica in the future. PMID:27711182

  19. Genome-wide discovery of cis-elements in promoter sequences using gene expression.

    PubMed

    Troukhan, Maxim; Tatarinova, Tatiana; Bouck, John; Flavell, Richard B; Alexandrov, Nickolai N

    2009-04-01

    The availability of complete or nearly complete genome sequences, a large number of 5' expressed sequence tags, and significant public expression data allow for a more accurate identification of cis-elements regulating gene expression. We have implemented a global approach that takes advantage of available expression data, genomic sequences, and transcript information to predict cis-elements associated with specific expression patterns. The key components of our approach are: (1) precise identification of transcription start sites, (2) specific locations of cis-elements relative to the transcription start site, and (3) assessment of statistical significance for all sequence motifs. By applying our method to promoters of Arabidopsis thaliana and Mus musculus, we have identified motifs that affect gene expression under specific environmental conditions or in certain tissues. We also found that the presence of the TATA box is associated with increased variability of gene expression. Strong correlation between our results and experimentally determined motifs shows that the method is capable of predicting new functionally important cis-elements in promoter sequences. PMID:19231992

  20. Genome-wide association study in East Asians suggests UHMK1 as a novel bone mineral density susceptibility gene.

    PubMed

    Choi, Hyung Jin; Park, Hyojung; Zhang, Lei; Kim, Jung Hee; Kim, Ye An; Yang, Jae-Yeon; Pei, Yu-Fang; Tian, Qing; Shen, Hui; Hwang, Joo-Yeon; Deng, Hong-Wen; Cho, Nam H; Shin, Soo

    2016-10-01

    To identify genetic variants that influence bone mineral density (BMD) in East Asians, we performed a quantitative trait analysis of lumbar spine, total hip and femoral neck BMD in a Korean population-based cohort (N=2729) and follow-up replication analysis in a Chinese Han population and two Caucasian populations (N=1547, 2250 and 987, respectively). From the meta-analysis of the stage 1 discovery analysis and stage 2 replication analysis, we identified four BMD loci that reached near-genome-wide significance level (P<5×10(-7)). One locus on 1q23 (UHMK1, rs16863247, P=4.1×10(-7) for femoral neck BMD and P=3.2×10(-6) for total hip BMD) was a novel BMD signal. Interestingly, rs16863247 was very rare in Caucasians (minor allele frequency<0.01), indicating that this association could be specific to East Asians. In gender specific analysis, rs1160574 on 1q32 (KCNH1) was associated with femoral neck BMD (P=2.1×10(-7)) in female subjects. rs9371538 in the known BMD region on 6q25 ESR1 was associated with lumbar spine BMD (P=5.6×10(-9)). rs7776725 in the known BMD region on 7q31 WTN16 was associated with total hip BMD (P=8.6×10(-9)). In osteoblasts, endogenous UHMK1 expression was increased during differentiation and UHMK1 knockdown decreased its differentiation, while UHMK1 overexpression increased its differentiation. In osteoclasts, endogenous UHMK1 expression was decreased during differentiation and UHMK1 knockdown increased its differentiation, while UHMK1 overexpression decreased its differentiation. In conclusion, our genome-wide association study identified the UHMK1 gene as a novel BMD locus specific to East Asians. Functional studies suggest a role of UHMK1 on regulation of osteoblasts and osteoclasts. PMID:27424934

  1. Genome-wide association study identifies SESTD1 as a novel risk gene for lithium-responsive bipolar disorder.

    PubMed

    Song, J; Bergen, S E; Di Florio, A; Karlsson, R; Charney, A; Ruderfer, D M; Stahl, E A; Chambert, K D; Moran, J L; Gordon-Smith, K; Forty, L; Green, E K; Jones, I; Jones, L; Scolnick, E M; Sklar, P; Smoller, J W; Lichtenstein, P; Hultman, C; Craddock, N; Landén, M; Smoller, Jordan W; Perlis, Roy H; Lee, Phil Hyoun; Castro, Victor M; Hoffnagle, Alison G; Sklar, Pamela; Stahl, Eli A; Purcell, Shaun M; Ruderfer, Douglas M; Charney, Alexander W; Roussos, Panos; Michele Pato, Carlos Pato; Medeiros, Helen; Sobel, Janet; Craddock, Nick; Jones, Ian; Forty, Liz; Florio, Arianna Di; Green, Elaine; Jones, Lisa; Gordon-Smith, Katherine; Landen, Mikael; Hultman, Christina; Jureus, Anders; Bergen, Sarah; McCarroll, Steven; Moran, Jennifer; Smoller, Jordan W; Chambert, Kimberly; Belliveau, Richard A

    2016-09-01

    Lithium is the mainstay prophylactic treatment for bipolar disorder (BD), but treatment response varies considerably across individuals. Patients who respond well to lithium treatment might represent a relatively homogeneous subtype of this genetically and phenotypically diverse disorder. Here, we performed genome-wide association studies (GWAS) to identify (i) specific genetic variations influencing lithium response and (ii) genetic variants associated with risk for lithium-responsive BD. Patients with BD and controls were recruited from Sweden and the United Kingdom. GWAS were performed on 2698 patients with subjectively defined (self-reported) lithium response and 1176 patients with objectively defined (clinically documented) lithium response. We next conducted GWAS comparing lithium responders with healthy controls (1639 subjective responders and 8899 controls; 323 objective responders and 6684 controls). Meta-analyses of Swedish and UK results revealed no significant associations with lithium response within the bipolar subjects. However, when comparing lithium-responsive patients with controls, two imputed markers attained genome-wide significant associations, among which one was validated in confirmatory genotyping (rs116323614, P=2.74 × 10(-8)). It is an intronic single-nucleotide polymorphism (SNP) on chromosome 2q31.2 in the gene SEC14 and spectrin domains 1 (SESTD1), which encodes a protein involved in regulation of phospholipids. Phospholipids have been strongly implicated as lithium treatment targets. Furthermore, we estimated the proportion of variance for lithium-responsive BD explained by common variants ('SNP heritability') as 0.25 and 0.29 using two definitions of lithium response. Our results revealed a genetic variant in SESTD1 associated with risk for lithium-responsive BD, suggesting that the understanding of BD etiology could be furthered by focusing on this subtype of BD. PMID:26503763

  2. Genome-wide association study identifies SESTD1 as a novel risk gene for lithium-responsive bipolar disorder

    PubMed Central

    Song, J; Bergen, S E; Di Florio, A; Karlsson, R; Charney, A; Ruderfer, D M; Stahl, E A; Chambert, K D; Moran, J L; Gordon-Smith, K; Forty, L; Green, E K; Jones, I; Jones, L; Scolnick, E M; Sklar, P; Smoller, J W; Lichtenstein, P; Hultman, C; Craddock, N; Landén, M; Smoller, Jordan W; Perlis, Roy H; Lee, Phil Hyoun; Castro, Victor M; Hoffnagle, Alison G; Sklar, Pamela; Stahl, Eli A; Purcell, Shaun M; Ruderfer, Douglas M; Charney, Alexander W; Roussos, Panos; Michele Pato, Carlos Pato; Medeiros, Helen; Sobel, Janet; Craddock, Nick; Jones, Ian; Forty, Liz; Florio, Arianna Di; Green, Elaine; Jones, Lisa; Gordon-Smith, Katherine; Landen, Mikael; Hultman, Christina; Jureus, Anders; Bergen, Sarah; McCarroll, Steven; Moran, Jennifer; Smoller, Jordan W; Chambert, Kimberly; Belliveau, Richard A

    2016-01-01

    Lithium is the mainstay prophylactic treatment for bipolar disorder (BD), but treatment response varies considerably across individuals. Patients who respond well to lithium treatment might represent a relatively homogeneous subtype of this genetically and phenotypically diverse disorder. Here, we performed genome-wide association studies (GWAS) to identify (i) specific genetic variations influencing lithium response and (ii) genetic variants associated with risk for lithium-responsive BD. Patients with BD and controls were recruited from Sweden and the United Kingdom. GWAS were performed on 2698 patients with subjectively defined (self-reported) lithium response and 1176 patients with objectively defined (clinically documented) lithium response. We next conducted GWAS comparing lithium responders with healthy controls (1639 subjective responders and 8899 controls; 323 objective responders and 6684 controls). Meta-analyses of Swedish and UK results revealed no significant associations with lithium response within the bipolar subjects. However, when comparing lithium-responsive patients with controls, two imputed markers attained genome-wide significant associations, among which one was validated in confirmatory genotyping (rs116323614, P=2.74 × 10−8). It is an intronic single-nucleotide polymorphism (SNP) on chromosome 2q31.2 in the gene SEC14 and spectrin domains 1 (SESTD1), which encodes a protein involved in regulation of phospholipids. Phospholipids have been strongly implicated as lithium treatment targets. Furthermore, we estimated the proportion of variance for lithium-responsive BD explained by common variants (‘SNP heritability') as 0.25 and 0.29 using two definitions of lithium response. Our results revealed a genetic variant in SESTD1 associated with risk for lithium-responsive BD, suggesting that the understanding of BD etiology could be furthered by focusing on this subtype of BD. PMID:26503763

  3. Genome-wide identification and analysis of the aldehyde dehydrogenase (ALDH) gene superfamily in apple (Malus × domestica Borkh.).

    PubMed

    Li, Xiaoqin; Guo, Rongrong; Li, Jun; Singer, Stacy D; Zhang, Yucheng; Yin, Xiangjing; Zheng, Yi; Fan, Chonghui; Wang, Xiping

    2013-10-01

    Aldehyde dehydrogenases (ALDHs) represent a protein superfamily encoding NAD(P)(+)-dependent enzymes that oxidize a wide range of endogenous and exogenous aliphatic and aromatic aldehydes. In plants, they are involved in many biological processes and play a role in the response to environmental stress. In this study, a total of 39 ALDH genes from ten families were identified in the apple (Malus × domestica Borkh.) genome. Synteny analysis of the apple ALDH (MdALDH) genes indicated that segmental and tandem duplications, as well as whole genome duplications, have likely contributed to the expansion and evolution of these gene families in apple. Moreover, synteny analysis between apple and Arabidopsis demonstrated that several MdALDH genes were found in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes appeared before the divergence of lineages that led to apple and Arabidopsis. In addition, phylogenetic analysis, as well as comparisons of exon-intron and protein structures, provided further insight into both their evolutionary relationships and their putative functions. Tissue-specific expression analysis of the MdALDH genes demonstrated diverse spatiotemporal expression patterns, while their expression profiles under abiotic stress and various hormone treatments indicated that many MdALDH genes were responsive to high salinity and drought, as well as different plant hormones. This genome-wide identification, as well as characterization of evolutionary relationships and expression profiles, of the apple MdALDH genes will not only be useful for the further analysis of ALDH genes and their roles in stress response, but may also aid in the future improvement of apple stress tolerance.

  4. Gene-based meta-analysis of genome-wide association studies implicates new loci involved in obesity.

    PubMed

    Hägg, Sara; Ganna, Andrea; Van Der Laan, Sander W; Esko, Tonu; Pers, Tune H; Locke, Adam E; Berndt, Sonja I; Justice, Anne E; Kahali, Bratati; Siemelink, Marten A; Pasterkamp, Gerard; Strachan, David P; Speliotes, Elizabeth K; North, Kari E; Loos, Ruth J F; Hirschhorn, Joel N; Pawitan, Yudi; Ingelsson, Erik

    2015-12-01

    To date, genome-wide association studies (GWASs) have identified >100 loci with single variants associated with body mass index (BMI). This approach may miss loci with high allelic heterogeneity; therefore, the aim of the present study was to use gene-based meta-analysis to identify regions with high allelic heterogeneity to discover additional obesity susceptibility loci. We included GWAS data from 123 865 individuals of European descent from 46 cohorts in Stage 1 and Metabochip data from additional 103 046 individuals from 43 cohorts in Stage 2, all within the Genetic Investigation of ANthropometric Traits (GIANT) consortium. Each cohort was tested for association between ∼2.4 million (Stage 1) or ∼200 000 (Stage 2) imputed or genotyped single variants and BMI, and summary statistics were subsequently meta-analyzed in 17 941 genes. We used the 'VErsatile Gene-based Association Study' (VEGAS) approach to assign variants to genes and to calculate gene-based P-values based on simulations. The VEGAS method was applied to each cohort separately before a gene-based meta-analysis was performed. In Stage 1, two known (FTO and TMEM18) and six novel (PEX2, MTFR2, SSFA2, IARS2, CEP295 and TXNDC12) loci were associated with BMI (P < 2.8 × 10(-6) for 17 941 gene tests). We confirmed all loci, and six of them were gene-wide significant in Stage 2 alone. We provide biological support for the loci by pathway, expression and methylation analyses. Our results indicate that gene-based meta-analysis of GWAS provides a useful strategy to find loci of interest that were not identified in standard single-marker analyses due to high allelic heterogeneity. PMID:26376864

  5. Identification of novel candidate genes involved in mineralization of dental enamel by genome-wide transcript profiling.

    PubMed

    Lacruz, Rodrigo S; Smith, Charles E; Bringas, Pablo; Chen, Yi-Bu; Smith, Susan M; Snead, Malcolm L; Kurtz, Ira; Hacia, Joseph G; Hubbard, Michael J; Paine, Michael L

    2012-05-01

    The gene repertoire regulating vertebrate biomineralization is poorly understood. Dental enamel, the most highly mineralized tissue in mammals, differs from other calcifying systems in that the formative cells (ameloblasts) lack remodeling activity and largely degrade and resorb the initial extracellular matrix. Enamel mineralization requires that ameloblasts undergo a profound functional switch from matrix-secreting to maturational (calcium transport, protein resorption) roles as mineralization progresses. During the maturation stage, extracellular pH decreases markedly, placing high demands on ameloblasts to regulate acidic environments present around the growing hydroxyapatite crystals. To identify the genetic events driving enamel mineralization, we conducted genome-wide transcript profiling of the developing enamel organ from rat incisors and highlight over 300 genes differentially expressed during maturation. Using multiple bioinformatics analyses, we identified groups of maturation-associated genes whose functions are linked to key mineralization processes including pH regulation, calcium handling, and matrix turnover. Subsequent qPCR and Western blot analyses revealed that a number of solute carrier (SLC) gene family members were up-regulated during maturation, including the novel protein Slc24a4 involved in calcium handling as well as other proteins of similar function (Stim1). By providing the first global overview of the cellular machinery required for enamel maturation, this study provide a strong foundation for improving basic understanding of biomineralization and its practical applications in healthcare.

  6. Genome-wide association study using extreme truncate selection identifies novel genes affecting bone mineral density and fracture risk.

    PubMed

    Duncan, Emma L; Danoy, Patrick; Kemp, John P; Leo, Paul J; McCloskey, Eugene; Nicholson, Geoffrey C; Eastell, Richard; Prince, Richard L; Eisman, John A; Jones, Graeme; Sambrook, Philip N; Reid, Ian R; Dennison, Elaine M; Wark, John; Richards, J Brent; Uitterlinden, Andre G; Spector, Tim D; Esapa, Chris; Cox, Roger D; Brown, Steve D M; Thakker, Rajesh V; Addison, Kathryn A; Bradbury, Linda A; Center, Jacqueline R; Cooper, Cyrus; Cremin, Catherine; Estrada, Karol; Felsenberg, Dieter; Glüer, Claus-C; Hadler, Johanna; Henry, Margaret J; Hofman, Albert; Kotowicz, Mark A; Makovey, Joanna; Nguyen, Sing C; Nguyen, Tuan V; Pasco, Julie A; Pryce, Karena; Reid, David M; Rivadeneira, Fernando; Roux, Christian; Stefansson, Kari; Styrkarsdottir, Unnur; Thorleifsson, Gudmar; Tichawangana, Rumbidzai; Evans, David M; Brown, Matthew A

    2011-04-01

    Osteoporotic fracture is a major cause of morbidity and mortality worldwide. Low bone mineral density (BMD) is a major predisposing factor to fracture and is known to be highly heritable. Site-, gender-, and age-specific genetic effects on BMD are thought to be significant, but have largely not been considered in the design of genome-wide association studies (GWAS) of BMD to date. We report here a GWAS using a novel study design focusing on women of a specific age (postmenopausal women, age 55-85 years), with either extreme high or low hip BMD (age- and gender-adjusted BMD z-scores of +1.5 to +4.0, n = 1055, or -4.0 to -1.5, n = 900), with replication in cohorts of women drawn from the general population (n = 20,898). The study replicates 21 of 26 known BMD-associated genes. Additionally, we report suggestive association of a further six new genetic associations in or around the genes CLCN7, GALNT3, IBSP, LTBP3, RSPO3, and SOX4, with replication in two independent datasets. A novel mouse model with a loss-of-function mutation in GALNT3 is also reported, which has high bone mass, supporting the involvement of this gene in BMD determination. In addition to identifying further genes associated with BMD, this study confirms the efficiency of extreme-truncate selection designs for quantitative trait association studies.

  7. Genome-Wide Association Study Using Extreme Truncate Selection Identifies Novel Genes Affecting Bone Mineral Density and Fracture Risk

    PubMed Central

    Duncan, Emma L.; Danoy, Patrick; Kemp, John P.; Leo, Paul J.; McCloskey, Eugene; Nicholson, Geoffrey C.; Eastell, Richard; Prince, Richard L.; Eisman, John A.; Jones, Graeme; Sambrook, Philip N.; Reid, Ian R.; Dennison, Elaine M.; Wark, John; Richards, J. Brent; Uitterlinden, Andre G.; Spector, Tim D.; Esapa, Chris; Cox, Roger D.; Brown, Steve D. M.; Thakker, Rajesh V.; Addison, Kathryn A.; Bradbury, Linda A.; Center, Jacqueline R.; Cooper, Cyrus; Cremin, Catherine; Estrada, Karol; Felsenberg, Dieter; Glüer, Claus-C.; Hadler, Johanna; Henry, Margaret J.; Hofman, Albert; Kotowicz, Mark A.; Makovey, Joanna; Nguyen, Sing C.; Nguyen, Tuan V.; Pasco, Julie A.; Pryce, Karena; Reid, David M.; Rivadeneira, Fernando; Roux, Christian; Stefansson, Kari; Styrkarsdottir, Unnur; Thorleifsson, Gudmar; Tichawangana, Rumbidzai; Evans, David M.; Brown, Matthew A.

    2011-01-01

    Osteoporotic fracture is a major cause of morbidity and mortality worldwide. Low bone mineral density (BMD) is a major predisposing factor to fracture and is known to be highly heritable. Site-, gender-, and age-specific genetic effects on BMD are thought to be significant, but have largely not been considered in the design of genome-wide association studies (GWAS) of BMD to date. We report here a GWAS using a novel study design focusing on women of a specific age (postmenopausal women, age 55–85 years), with either extreme high or low hip BMD (age- and gender-adjusted BMD z-scores of +1.5 to +4.0, n = 1055, or −4.0 to −1.5, n = 900), with replication in cohorts of women drawn from the general population (n = 20,898). The study replicates 21 of 26 known BMD–associated genes. Additionally, we report suggestive association of a further six new genetic associations in or around the genes CLCN7, GALNT3, IBSP, LTBP3, RSPO3, and SOX4, with replication in two independent datasets. A novel mouse model with a loss-of-function mutation in GALNT3 is also reported, which has high bone mass, supporting the involvement of this gene in BMD determination. In addition to identifying further genes associated with BMD, this study confirms the efficiency of extreme-truncate selection designs for quantitative trait association studies. PMID:21533022

  8. Genome-wide identification and analysis of MAPK and MAPKK gene families in Brachypodium distachyon.

    PubMed

    Chen, Lihong; Hu, Wei; Tan, Shenglong; Wang, Min; Ma, Zhanbing; Zhou, Shiyi; Deng, Xiaomin; Zhang, Yang; Huang, Chao; Yang, Guangxiao; He, Guangyuan

    2012-01-01

    MAPK cascades are universal signal transduction modules and play important roles in plant growth, development and in response to a variety of biotic and abiotic stresses. Although MAPKs and MAPKKs have been systematically investigated in several plant species including Arabidopsis, rice and poplar, no systematic analysis has been conducted in the emerging monocot model plant Brachypodium distachyon. In the present study, a total of 16 MAPK genes and 12 MAPKK genes were identified from B. distachyon. An analysis of the genomic evolution showed that both tandem and segment duplications contributed significantly to the expansion of MAPK and MAPKK families. Evolutionary relationships within subfamilies were supported by exon-intron organizations and the architectures of conserved protein motifs. Synteny analysis between B. distachyon and the other two plant species of rice and Arabidopsis showed that only one homolog of B. distachyon MAPKs was found in the corresponding syntenic blocks of Arabidopsis, while 13 homologs of B. distachyon MAPKs and MAPKKs were found in that of rice, which was consistent with the speciation process of the three species. In addition, several interactive protein pairs between the two families in B. distachyon were found through yeast two hybrid assay, whereas their orthologs of a pair in Arabidopsis and other plant species were not found to interact with each other. Finally, expression studies of closely related family members among B. distachyon, Arabidopsis and rice showed that even recently duplicated representatives may fulfill different functions and be involved in different signal pathways. Taken together, our data would provide a foundation for evolutionary and functional characterization of MAPK and MAPKK gene families in B. distachyon and other plant species to unravel their biological roles.

  9. Genome Wide Association Study Identifies 20 Novel Promising Genes Associated with Milk Fatty Acid Traits in Chinese Holstein

    PubMed Central

    Li, Cong; Sun, Dongxiao; Zhang, Shengli; Wang, Sheng; Wu, Xiaoping; Zhang, Qin; Liu, Lin; Li, Yanhua; Qiao, Lv

    2014-01-01

    Detecting genes associated with milk fat composition could provide valuable insights into the complex genetic networks of genes underling variation in fatty acids synthesis and point towards opportunities for changing milk fat composition via selective breeding. In this study, we conducted a genome-wide association study (GWAS) for 22 milk fatty acids in 784 Chinese Holstein cows with the PLINK software. Genotypes were obtained with the Illumina BovineSNP50 Bead chip and a total of 40,604 informative, high-quality single nucleotide polymorphisms (SNPs) were used. Totally, 83 genome-wide significant SNPs and 314 suggestive significant SNPs associated with 18 milk fatty acid traits were detected. Chromosome regions that affect milk fatty acid traits were mainly observed on BTA1, 2, 5, 6, 7, 9, 13, 14, 18, 19, 20, 21, 23, 26 and 27. Of these, 146 SNPs were associated with more than one milk fatty acid trait; most of studied fatty acid traits were significant associated with multiple SNPs, especially C18:0 (105 SNPs), C18 index (93 SNPs), and C14 index (84 SNPs); Several SNPs are close to or within the DGAT1, SCD1 and FASN genes which are well-known to affect milk composition traits of dairy cattle. Combined with the previously reported QTL regions and the biological functions of the genes, 20 novel promising candidates for C10:0, C12:0, C14:0, C14:1, C14 index, C18:0, C18:1n9c, C18 index, SFA, UFA and SFA/UFA were found, which composed of HTR1B, CPM, PRKG1, MINPP1, LIPJ, LIPK, EHHADH, MOGAT1, ECHS1, STAT1, SORBS1, NFKB2, AGPAT3, CHUK, OSBPL8, PRLR, IGF1R, ACSL3, GHR and OXCT1. Our findings provide a groundwork for unraveling the key genes and causal mutations affecting milk fatty acid traits in dairy cattle. PMID:24858810

  10. Genome wide association study identifies 20 novel promising genes associated with milk fatty acid traits in Chinese Holstein.

    PubMed

    Li, Cong; Sun, Dongxiao; Zhang, Shengli; Wang, Sheng; Wu, Xiaoping; Zhang, Qin; Liu, Lin; Li, Yanhua; Qiao, Lv

    2014-01-01

    Detecting genes associated with milk fat composition could provide valuable insights into the complex genetic networks of genes underling variation in fatty acids synthesis and point towards opportunities for changing milk fat composition via selective breeding. In this study, we conducted a genome-wide association study (GWAS) for 22 milk fatty acids in 784 Chinese Holstein cows with the PLINK software. Genotypes were obtained with the Illumina BovineSNP50 Bead chip and a total of 40,604 informative, high-quality single nucleotide polymorphisms (SNPs) were used. Totally, 83 genome-wide significant SNPs and 314 suggestive significant SNPs associated with 18 milk fatty acid traits were detected. Chromosome regions that affect milk fatty acid traits were mainly observed on BTA1, 2, 5, 6, 7, 9, 13, 14, 18, 19, 20, 21, 23, 26 and 27. Of these, 146 SNPs were associated with more than one milk fatty acid trait; most of studied fatty acid traits were significant associated with multiple SNPs, especially C18:0 (105 SNPs), C18 index (93 SNPs), and C14 index (84 SNPs); Several SNPs are close to or within the DGAT1, SCD1 and FASN genes which are well-known to affect milk composition traits of dairy cattle. Combined with the previously reported QTL regions and the biological functions of the genes, 20 novel promising candidates for C10:0, C12:0, C14:0, C14:1, C14 index, C18:0, C18:1n9c, C18 index, SFA, UFA and SFA/UFA were found, which composed of HTR1B, CPM, PRKG1, MINPP1, LIPJ, LIPK, EHHADH, MOGAT1, ECHS1, STAT1, SORBS1, NFKB2, AGPAT3, CHUK, OSBPL8, PRLR, IGF1R, ACSL3, GHR and OXCT1. Our findings provide a groundwork for unraveling the key genes and causal mutations affecting milk fatty acid traits in dairy cattle. PMID:24858810

  11. MADS-box gene family in rice: genome-wide identification, organization and expression profiling during reproductive development and stress

    PubMed Central

    Arora, Rita; Agarwal, Pinky; Ray, Swatismita; Singh, Ashok Kumar; Singh, Vijay Pal; Tyagi, Akhilesh K; Kapoor, Sanjay

    2007-01-01

    Background MADS-box transcription factors, besides being involved in floral organ specification, have also been implicated in several aspects of plant growth and development. In recent years, there have been reports on genomic localization, protein motif structure, phylogenetic relationships, gene structure and expression of the entire MADS-box family in the model plant system, Arabidopsis. Though there have been some studies in rice as well, an analysis of the complete MADS-box family along with a comprehensive expression profiling was still awaited after the completion of rice genome sequencing. Furthermore, owing to the role of MADS-box family in flower development, an analysis involving structure, expression and functional aspects of MADS-box genes in rice and Arabidopsis was required to understand the role of this gene family in reproductive development. Results A genome-wide molecular characterization and microarray-based expression profiling of the genes encoding MADS-box transcription factor family in rice is presented. Using a thorough annotation exercise, 75 MADS-box genes have been identified in rice and categorized into MIKCc, MIKC*, Mα, Mβ and Mγ groups based on phylogeny. Chromosomal localization of these genes reveals that 16 MADS-box genes, mostly MIKCc-type, are located within the duplicated segments of the rice genome, whereas most of the M-type genes, 20 in all, seem to have resulted from tandem duplications. Nine members belonging to the Mβ group, which was considered absent in monocots, have also been identified. The expression profiles of all the MADS-box genes have been analyzed under 11 temporal stages of panicle and seed development, three abiotic stress conditions, along with three stages of vegetative development. Transcripts for 31 genes accumulate preferentially in the reproductive phase, of which, 12 genes are specifically expressed in seeds, and six genes show expression specific to panicle development. Differential expression of

  12. Thiol peroxidases mediate specific genome-wide regulation of gene expression in response to hydrogen peroxide

    PubMed Central

    Fomenko, Dmitri E.; Koc, Ahmet; Agisheva, Natalia; Jacobsen, Michael; Kaya, Alaattin; Malinouski, Mikalai; Rutherford, Julian C.; Siu, Kam-Leung; Jin, Dong-Yan; Winge, Dennis R.; Gladyshev, Vadim N.

    2011-01-01

    Hydrogen peroxide is thought to regulate cellular processes by direct oxidation of numerous cellular proteins, whereas antioxidants, most notably thiol peroxidases, are thought to reduce peroxides and inhibit H2O2 response. However, thiol peroxidases have also been implicated in activation of transcription factors and signaling. It remains unclear if these enzymes stimulate or inhibit redox regulation and whether this regulation is widespread or limited to a few cellular components. Herein, we found that Saccharomyces cerevisiae cells lacking all eight thiol peroxidases were viable and withstood redox stresses. They transcriptionally responded to various redox treatments, but were unable to activate and repress gene expression in response to H2O2. Further studies involving redox transcription factors suggested that thiol peroxidases are major regulators of global gene expression in response to H2O2. The data suggest that thiol peroxidases sense and transfer oxidative signals to the signaling proteins and regulate transcription, whereas a direct interaction between H2O2 and other cellular proteins plays a secondary role. PMID:21282621

  13. Genome-wide gene expression analysis identifies K-ras as a regulator of alcohol intake.

    PubMed

    Repunte-Canonigo, Vez; van der Stap, Lena D; Chen, Jihuan; Sabino, Valentina; Wagner, Ulrich; Zorrilla, Eric P; Schumann, Gunter; Roberts, Amanda J; Sanna, Pietro Paolo

    2010-06-21

    Adaptations in the anterior cingulate cortex (ACC) have been implicated in alcohol and drug addiction. To identify genes that may contribute to excessive drinking, here we performed microarray analyses in laser microdissected rat ACC after a single or repeated administration of an intoxicating dose of alcohol (3 g/kg). Expression of the small G protein K-ras was differentially regulated following both single and repeated alcohol administration. We also observed that voluntary alcohol intake in K-ras heterozygous null mice (K-ras(+/-)) did not increase after withdrawal from repeated cycles of intermittent ethanol vapor exposure, unlike in their wild-type littermates. To identify K-ras regulated pathways, we then profiled gene expression in the ACC of K-ras(+/-), heterozygous null mice for the K-ras negative regulator Nf1 (Nf1(+/-)) and wild-type mice following repeated administration of an intoxicating dose of alcohol. Pathway analysis showed that alcohol differentially affected various pathways in a K-ras dependent manner - some of which previously shown to be regulated by alcohol - including the insulin/PI3K pathway, the NF-kappaB, the phosphodiesterases (PDEs) pathway, the Jak/Stat and the adipokine signaling pathways. Altogether, the data implicate K-ras-regulated pathways in the regulation of excessive alcohol drinking after a history of dependence.

  14. Genome Wide Association Study Identifies L3MBTL4 as a Novel Susceptibility Gene for Hypertension

    PubMed Central

    Liu, Xin; Hu, Cheng; Bao, Minghui; Li, Jing; Liu, Xiaoyan; Tan, Xuerui; Zhou, Yong; Chen, Yequn; Wu, Shouling; Chen, Shuohua; Zhang, Rong; Jiang, Feng; Jia, Weiping; Wang, Xingyu; Yang, Xinchun; Cai, Jun

    2016-01-01

    Hypertension is a major global health burden and a leading risk factor for cardiovascular diseases. Although its heritability has been documented previously, contributing loci identified to date account for only a small fraction of blood pressure (BP) variation, which strongly suggests the existence of undiscovered variants. To identify novel variants, we conducted a three staged genetic study in 21,990 hypertensive cases and normotensive controls. Four single nucleotide polymorphisms (SNPs) at three new genes (L3MBTL4 rs403814, Pmeta = 6.128 × 10−9; LOC729251, and TCEANC) and seven SNPs at five previously reported genes were identified as being significantly associated with hypertension. Through functional analysis, we found that L3MBTL4 is predominantly expressed in vascular smooth muscle cells and up-regulated in spontaneously hypertensive rats. Rats with ubiquitous over-expression of L3MBTL4 exhibited significantly elevated BP, increased thickness of the vascular media layer and cardiac hypertrophy. Mechanistically, L3MBTL4 over-expression could lead to down-regulation of latent transforming growth factor-β binding protein 1 (LTBP1), and phosphorylation activation of the mitogen-activated protein kinases (MAPK) signaling pathway, which is known to trigger the pathological progression of vascular remodeling and BP elevation. These findings pinpointed L3MBTL4 as a critical contributor to the development and progression of hypertension and uncovers a novel target for therapeutic intervention. PMID:27480026

  15. Genome-wide analysis of the genetic regulation of gene expression in human neutrophils.

    PubMed

    Andiappan, Anand Kumar; Melchiotti, Rossella; Poh, Tuang Yeow; Nah, Michelle; Puan, Kia Joo; Vigano, Elena; Haase, Doreen; Yusof, Nurhashikin; San Luis, Boris; Lum, Josephine; Kumar, Dilip; Foo, Shihui; Zhuang, Li; Vasudev, Anusha; Irwanto, Astrid; Lee, Bernett; Nardin, Alessandra; Liu, Hong; Zhang, Furen; Connolly, John; Liu, Jianjun; Mortellaro, Alessandra; Wang, De Yun; Poidinger, Michael; Larbi, Anis; Zolezzi, Francesca; Rotzschke, Olaf

    2015-01-01

    Neutrophils are an abundant immune cell type involved in both antimicrobial defence and autoimmunity. The regulation of their gene expression, however, is still largely unknown. Here we report an eQTL study on isolated neutrophils from 114 healthy individuals of Chinese ethnicity, identifying 21,210 eQTLs on 832 unique genes. Unsupervised clustering analysis of these eQTLs confirms their role in inflammatory responses and immunological diseases but also indicates strong involvement in dermatological pathologies. One of the strongest eQTL identified (rs2058660) is also the tagSNP of a linkage block reported to affect leprosy and Crohn's disease in opposite directions. In a functional study, we can link the C allele with low expression of the β-chain of IL18-receptor (IL18RAP). In neutrophils, this results in a reduced responsiveness to IL-18, detected both on the RNA and protein level. Thus, the polymorphic regulation of human neutrophils can impact beneficial as well as pathological inflammatory responses. PMID:26259071

  16. Genome-wide analysis of the genetic regulation of gene expression in human neutrophils

    PubMed Central

    Andiappan, Anand Kumar; Melchiotti, Rossella; Poh, Tuang Yeow; Nah, Michelle; Puan, Kia Joo; Vigano, Elena; Haase, Doreen; Yusof, Nurhashikin; San Luis, Boris; Lum, Josephine; Kumar, Dilip; Foo, Shihui; Zhuang, Li; Vasudev, Anusha; Irwanto, Astrid; Lee, Bernett; Nardin, Alessandra; Liu, Hong; Zhang, Furen; Connolly, John; Liu, Jianjun; Mortellaro, Alessandra; Wang, De Yun; Poidinger, Michael; Larbi, Anis; Zolezzi, Francesca; Rotzschke, Olaf

    2015-01-01

    Neutrophils are an abundant immune cell type involved in both antimicrobial defence and autoimmunity. The regulation of their gene expression, however, is still largely unknown. Here we report an eQTL study on isolated neutrophils from 114 healthy individuals of Chinese ethnicity, identifying 21,210 eQTLs on 832 unique genes. Unsupervised clustering analysis of these eQTLs confirms their role in inflammatory responses and immunological diseases but also indicates strong involvement in dermatological pathologies. One of the strongest eQTL identified (rs2058660) is also the tagSNP of a linkage block reported to affect leprosy and Crohn's disease in opposite directions. In a functional study, we can link the C allele with low expression of the β-chain of IL18-receptor (IL18RAP). In neutrophils, this results in a reduced responsiveness to IL-18, detected both on the RNA and protein level. Thus, the polymorphic regulation of human neutrophils can impact beneficial as well as pathological inflammatory responses. PMID:26259071

  17. Genome-wide identification and characterization of TCP genes involved in ovule development of Phalaenopsis equestris

    PubMed Central

    Lin, Yu-Fu; Chen, You-Yi; Hsiao, Yu-Yun; Shen, Ching-Yu; Hsu, Jui-Ling; Yeh, Chuan-Ming; Mitsuda, Nobutaka; Ohme-Takagi, Masaru; Liu, Zhong-Jian; Tsai, Wen-Chieh

    2016-01-01

    TEOSINTE-BRANCHED/CYCLOIDEA/PCF (TCP) proteins are plant-specific transcription factors known to have a role in multiple aspects of plant growth and development at the cellular, organ and tissue levels. However, there has been no related study of TCPs in orchids. Here we identified 23 TCP genes from the genome sequence of Phalaenopsis equestris. Phylogenetic analysis distinguished two homology classes of PeTCP transcription factor families: classes I and II. Class II was further divided into two subclasses, CIN and CYC/TB1. Spatial and temporal expression analysis showed that PePCF10 was predominantly expressed in ovules at early developmental stages and PeCIN8 had high expression at late developmental stages in ovules, with overlapping expression at day 16 after pollination. Subcellular localization and protein–protein interaction analyses revealed that PePCF10 and PeCIN8 could form homodimers and localize in the nucleus. However, PePCF10 and PeCIN8 could not form heterodimers. In transgenic Arabidopsis thaliana plants (overexpression and SRDX, a super repression motif derived from the EAR-motif of the repression domain of tobacco ETHYLENE-RESPONSIVE ELEMENT-BINDING FACTOR 3 and SUPERMAN, dominantly repressed), the two genes helped regulate cell proliferation. Together, these results suggest that PePCF10 and PeCIN8 play important roles in orchid ovule development by modulating cell division. PMID:27543606

  18. Genome-wide data substantiate Holocene gene flow from India to Australia.

    PubMed

    Pugach, Irina; Delfin, Frederick; Gunnarsdóttir, Ellen; Kayser, Manfred; Stoneking, Mark

    2013-01-29

    The Australian continent holds some of the earliest archaeological evidence for the expansion of modern humans out of Africa, with initial occupation at least 40,000 y ago. It is commonly assumed that Australia remained largely isolated following initial colonization, but the genetic history of Australians has not been explored in detail to address this issue. Here, we analyze large-scale genotyping data from aboriginal Australians, New Guineans, island Southeast Asians and Indians. We find an ancient association between Australia, New Guinea, and the Mamanwa (a Negrito group from the Philippines), with divergence times for these groups estimated at 36,000 y ago, and supporting the view that these populations represent the descendants of an early "southern route" migration out of Africa, whereas other populations in the region arrived later by a separate dispersal. We also detect a signal indicative of substantial gene flow between the Indian populations and Australia well before European contact, contrary to the prevailing view that there was no contact between Australia and the rest of the world. We estimate this gene flow to have occurred during the Holocene, 4,230 y ago. This is also approximately when changes in tool technology, food processing, and the dingo appear in the Australian archaeological record, suggesting that these may be related to the migration from India. PMID:23319617

  19. Genome-wide identification and characterization of TCP genes involved in ovule development of Phalaenopsis equestris.

    PubMed

    Lin, Yu-Fu; Chen, You-Yi; Hsiao, Yu-Yun; Shen, Ching-Yu; Hsu, Jui-Ling; Yeh, Chuan-Ming; Mitsuda, Nobutaka; Ohme-Takagi, Masaru; Liu, Zhong-Jian; Tsai, Wen-Chieh

    2016-09-01

    TEOSINTE-BRANCHED/CYCLOIDEA/PCF (TCP) proteins are plant-specific transcription factors known to have a role in multiple aspects of plant growth and development at the cellular, organ and tissue levels. However, there has been no related study of TCPs in orchids. Here we identified 23 TCP genes from the genome sequence of Phalaenopsis equestris Phylogenetic analysis distinguished two homology classes of PeTCP transcription factor families: classes I and II. Class II was further divided into two subclasses, CIN and CYC/TB1. Spatial and temporal expression analysis showed that PePCF10 was predominantly expressed in ovules at early developmental stages and PeCIN8 had high expression at late developmental stages in ovules, with overlapping expression at day 16 after pollination. Subcellular localization and protein-protein interaction analyses revealed that PePCF10 and PeCIN8 could form homodimers and localize in the nucleus. However, PePCF10 and PeCIN8 could not form heterodimers. In transgenic Arabidopsis thaliana plants (overexpression and SRDX, a super repression motif derived from the EAR-motif of the repression domain of tobacco ETHYLENE-RESPONSIVE ELEMENT-BINDING FACTOR 3 and SUPERMAN, dominantly repressed), the two genes helped regulate cell proliferation. Together, these results suggest that PePCF10 and PeCIN8 play important roles in orchid ovule development by modulating cell division. PMID:27543606

  20. Functional analysis of seven genes linked to body mass index and adiposity by genome-wide association studies: a review.

    PubMed

    Speakman, John R

    2013-01-01

    Genome-wide association studies (GWAS) have identified a total of about 40 single nucleotide polymorphisms (SNPs) that show significant linkage to body mass index, a widely utilised surrogate measure of adiposity. However, only 8 of these associations have been confirmed by follow-up GWAS using more sophisticated measures of adiposity (computed tomography). Among these 8, there is a SNP close to the gene FTO which has been the subject of considerable work to diagnose its function. The remaining 7 SNPs are adjacent to, or within, the genes NEGR1, TMEM18, ETV5, FLJ35779, LINGO2, SH2B1 and GIPR, most of which are less well studied than FTO, particularly in the context of obesity. This article reviews the available data on the functions of these genes, including information gleaned from studies in humans and animal models. At present, we have virtually no information on the putative mechanism associating the genes FLJ35779 and LINGO2 to obesity. All of these genes are expressed in the brain, and for 2 of them (SH2B1 and GIPR), a direct link to the appetite regulation system is known. SH2B1 is an enhancer of intracellular signalling in the JAK-STAT pathway, and GIPR is the receptor for an appetite-linked hormone (GIP) produced by the alimentary tract. NEGR1, ETV5 and SH2B1 all have suggested roles in neurite outgrowth, and hence SNPs adjacent to these genes may affect development of the energy balance circuitry. Although the genes have central patterns of gene expression, implying a central neuronal connection to energy balance, for at least 4 of them (NEGR1, TMEM18, SH2B1 and GIPR), there are also significant peripheral functions related to adipose tissue biology. These functions may contribute to their effects on the obese phenotype.

  1. Genome-Wide Identification of Pseudomonas aeruginosa Virulence-Related Genes Using a Caenorhabditis elegans Infection Model

    PubMed Central

    Feinbaum, Rhonda L.; Urbach, Jonathan M.; Liberati, Nicole T.; Djonovic, Slavica; Adonizio, Allison; Carvunis, Anne-Ruxandra; Ausubel, Frederick M.

    2012-01-01

    Pseudomonas aeruginosa strain PA14 is an opportunistic human pathogen capable of infecting a wide range of organisms including the nematode Caenorhabditis elegans. We used a non-redundant transposon mutant library consisting of 5,850 clones corresponding to 75% of the total and approximately 80% of the non-essential PA14 ORFs to carry out a genome-wide screen for attenuation of PA14 virulence in C. elegans. We defined a functionally diverse 180 mutant set (representing 170 unique genes) necessary for normal levels of virulence that included both known and novel virulence factors. Seven previously uncharacterized virulence genes (ABC transporters PchH and PchI, aminopeptidase PepP, ATPase/molecular chaperone ClpA, cold shock domain protein PA0456, putative enoyl-CoA hydratase/isomerase PA0745, and putative transcriptional regulator PA14_27700) were characterized with respect to pigment production and motility and all but one of these mutants exhibited pleiotropic defects in addition to their avirulent phenotype. We examined the collection of genes required for normal levels of PA14 virulence with respect to occurrence in P. aeruginosa strain-specific genomic regions, location on putative and known genomic islands, and phylogenetic distribution across prokaryotes. Genes predominantly contributing to virulence in C. elegans showed neither a bias for strain-specific regions of the P. aeruginosa genome nor for putatively horizontally transferred genomic islands. Instead, within the collection of virulence-related PA14 genes, there was an overrepresentation of genes with a broad phylogenetic distribution that also occur with high frequency in many prokaryotic clades, suggesting that in aggregate the genes required for PA14 virulence in C. elegans are biased towards evolutionarily conserved genes. PMID:22911607

  2. Genome-Wide Identification and Expression Profiling of Tomato Hsp20 Gene Family in Response to Biotic and Abiotic Stresses

    PubMed Central

    Yu, Jiahong; Cheng, Yuan; Feng, Kun; Ruan, Meiying; Ye, Qingjing; Wang, Rongqing; Li, Zhimiao; Zhou, Guozhi; Yao, Zhuping; Yang, Yuejian; Wan, Hongjian

    2016-01-01

    The Hsp20 genes are involved in the response of plants to environment stresses including heat shock and also play a vital role in plant growth and development. They represent the most abundant small heat shock proteins (sHsps) in plants, but little is known about this family in tomato (Solanum lycopersicum), an important vegetable crop in the world. Here, we characterized heat shock protein 20 (SlHsp20) gene family in tomato through integration of gene structure, chromosome location, phylogenetic relationship, and expression profile. Using bioinformatics-based methods, we identified at least 42 putative SlHsp20 genes in tomato. Sequence analysis revealed that most of SlHsp20 genes possessed no intron or a relatively short intron in length. Chromosome mapping indicated that inter-arm and intra-chromosome duplication events contributed remarkably to the expansion of SlHsp20 genes. Phylogentic tree of Hsp20 genes from tomato and other plant species revealed that SlHsp20 genes were grouped into 13 subfamilies, indicating that these genes may have a common ancestor that generated diverse subfamilies prior to the mono-dicot split. In addition, expression analysis using RNA-seq in various tissues and developmental stages of cultivated tomato and the wild relative Solanum pimpinellifolium revealed that most of these genes (83%) were expressed in at least one stage from at least one genotype. Out of 42 genes, 4 genes were expressed constitutively in almost all the tissues analyzed, implying that these genes might have specific housekeeping function in tomato cell under normal growth conditions. Two SlHsp20 genes displayed differential expression levels between cultivated tomato and S. pimpinellifolium in vegetative (leaf and root) and reproductive organs (floral bud and flower), suggesting inter-species diversification for functional specialization during the process of domestication. Based on genome-wide microarray analysis, we showed that the transcript levels of SlHsp20

  3. Genome-Wide Identification and Expression Profiling of Tomato Hsp20 Gene Family in Response to Biotic and Abiotic Stresses.

    PubMed

    Yu, Jiahong; Cheng, Yuan; Feng, Kun; Ruan, Meiying; Ye, Qingjing; Wang, Rongqing; Li, Zhimiao; Zhou, Guozhi; Yao, Zhuping; Yang, Yuejian; Wan, Hongjian

    2016-01-01

    The Hsp20 genes are involved in the response of plants to environment stresses including heat shock and also play a vital role in plant growth and development. They represent the most abundant small heat shock proteins (sHsps) in plants, but little is known about this family in tomato (Solanum lycopersicum), an important vegetable crop in the world. Here, we characterized heat shock protein 20 (SlHsp20) gene family in tomato through integration of gene structure, chromosome location, phylogenetic relationship, and expression profile. Using bioinformatics-based methods, we identified at least 42 putative SlHsp20 genes in tomato. Sequence analysis revealed that most of SlHsp20 genes possessed no intron or a relatively short intron in length. Chromosome mapping indicated that inter-arm and intra-chromosome duplication events contributed remarkably to the expansion of SlHsp20 genes. Phylogentic tree of Hsp20 genes from tomato and other plant species revealed that SlHsp20 genes were grouped into 13 subfamilies, indicating that these genes may have a common ancestor that generated diverse subfamilies prior to the mono-dicot split. In addition, expression analysis using RNA-seq in various tissues and developmental stages of cultivated tomato and the wild relative Solanum pimpinellifolium revealed that most of these genes (83%) were expressed in at least one stage from at least one genotype. Out of 42 genes, 4 genes were expressed constitutively in almost all the tissues analyzed, implying that these genes might have specific housekeeping function in tomato cell under normal growth conditions. Two SlHsp20 genes displayed differential expression levels between cultivated tomato and S. pimpinellifolium in vegetative (leaf and root) and reproductive organs (floral bud and flower), suggesting inter-species diversification for functional specialization during the process of domestication. Based on genome-wide microarray analysis, we showed that the transcript levels of SlHsp20

  4. Genome-Wide Identification and Expression Profiling of Tomato Hsp20 Gene Family in Response to Biotic and Abiotic Stresses.

    PubMed

    Yu, Jiahong; Cheng, Yuan; Feng, Kun; Ruan, Meiying; Ye, Qingjing; Wang, Rongqing; Li, Zhimiao; Zhou, Guozhi; Yao, Zhuping; Yang, Yuejian; Wan, Hongjian

    2016-01-01

    The Hsp20 genes are involved in the response of plants to environment stresses including heat shock and also play a vital role in plant growth and development. They represent the most abundant small heat shock proteins (sHsps) in plants, but little is known about this family in tomato (Solanum lycopersicum), an important vegetable crop in the world. Here, we characterized heat shock protein 20 (SlHsp20) gene family in tomato through integration of gene structure, chromosome location, phylogenetic relationship, and expression profile. Using bioinformatics-based methods, we identified at least 42 putative SlHsp20 genes in tomato. Sequence analysis revealed that most of SlHsp20 genes possessed no intron or a relatively short intron in length. Chromosome mapping indicated that inter-arm and intra-chromosome duplication events contributed remarkably to the expansion of SlHsp20 genes. Phylogentic tree of Hsp20 genes from tomato and other plant species revealed that SlHsp20 genes were grouped into 13 subfamilies, indicating that these genes may have a common ancestor that generated diverse subfamilies prior to the mono-dicot split. In addition, expression analysis using RNA-seq in various tissues and developmental stages of cultivated tomato and the wild relative Solanum pimpinellifolium revealed that most of these genes (83%) were expressed in at least one stage from at least one genotype. Out of 42 genes, 4 genes were expressed constitutively in almost all the tissues analyzed, implying that these genes might have specific housekeeping function in tomato cell under normal growth conditions. Two SlHsp20 genes displayed differential expression levels between cultivated tomato and S. pimpinellifolium in vegetative (leaf and root) and reproductive organs (floral bud and flower), suggesting inter-species diversification for functional specialization during the process of domestication. Based on genome-wide microarray analysis, we showed that the transcript levels of SlHsp20

  5. A genome-wide search for genes predisposing to manic-depression, assuming autosomal dominant inheritance

    SciTech Connect

    Coon, H.; Jensen, S.; Hoff, M.; Holik, J.; Plaetke, R.; Reimherr, F.; Wender, P.; Leppert, M.; Byerley, W. )

    1993-06-01

    Manic-depressive illness (MDI), also known as [open quotes]bipolar affective disorder[close quotes], is a common and devastating neuropsychiatric illness. Although pivotal biochemical alterations underlying the disease are unknown, results of family, twin, and adoption studies consistently implicate genetic transmission in the pathogenesis of MDI. In order to carry out linkage analysis, the authors ascertained eight moderately sized pedigrees containing multiple cases of the disease. For a four-allele marker mapping at 5 cM from the disease gene, the pedigree sample has >97% power to detect a dominant allele under genetic homogeneity and has >73% power under 20% heterogeneity. To date, the eight pedigrees have been genotyped with 328 polymorphic DNA loci throughout the genome. When autosomal dominant inheritance was assumed, 273 DNA markers gave lod scores <[minus]2.0 at [theta] = .05, and 4 DNA marker loci yielded lod scores >1 (chromosome 5 -- D5S39, D5S43, and D5S62; chromosome 11 -- D11S85). Of the markers giving lod scores >1, only D5S62 continued to show evidence for linkage when the affected-pedigree-member method was used. The D5S62 locus maps to distal 5q, a region containing neurotransmitter-receptor genes for dopamine, norepinephrine, glutamate, and gamma-aminobutyric acid. Although additional work in this region may be warranted, the linkage results should be interpreted as preliminary data, as 68 unaffected individuals are not past the age of risk. 72 refs., 2 tabs.

  6. Genome-wide identification and expression analysis of SWI1 genes in Boechera species.

    PubMed

    Sezer, Fatih; Yüzbaşioğlu, Gözde; Özbilen, Aslıhan; Taşkin, Kemal M

    2016-06-01

    As a mode of reproduction in plants, apomixis leads to the generation of clones via seeds. Apomictic plants form viable diploid female gametes without meiosis (apomeiosis) and produce embryos without fertilization (parthenogenesis). Apomeiosis, as a major component of apomixis, has recently been reported in some Arabidopsis thaliana mutants; dyad mutants of SWI1 showed developmental processes common to apomeiosis, such as producing functional diploid gametes. However, the orthologs of SWI1 genes in natural apomicts has not been previously reported. To identify the relationship between the SWI1 gene and the apomeiosis process, we isolated and sequenced SWI1 orthologs from Boechera species, including apomictic and sexual species. Boechera species are close relatives of A. thaliana and thus are advantageous model species for apomixis research. The SWI1 cDNAs were obtained by RT-PCR from apomictic and sexual Boechera young flower buds. We sequenced partial SWI1 transcripts that were 650bp for B. holboellii and 684bp for B. stricta. These SWI1-like sequences showed 86% similarity for B. holboellii and 92% for B. stricta to the A. thaliana SWI1 transcript. We also used available genome data and amplified genomic sequences for SWI1 orthologs in B. holboellii and B. stricta. The predicted proteins contain a phospholipase C domain and a nuclear localization signal. Sequence analysis did not show significant mutations related to apomixis, and phylogenetic analysis showed that SWI1-like sequences were common across plant families, regardless of the presence of a sexual or apomictic reproduction system. We also investigated the expression levels of SWI1 mRNA in the B. holboellii and B. stricta young unopened flower buds and found that relatively high levels of expression occurred in apomicts.

  7. Genome-wide identification and expression analysis of SWI1 genes in Boechera species.

    PubMed

    Sezer, Fatih; Yüzbaşioğlu, Gözde; Özbilen, Aslıhan; Taşkin, Kemal M

    2016-06-01

    As a mode of reproduction in plants, apomixis leads to the generation of clones via seeds. Apomictic plants form viable diploid female gametes without meiosis (apomeiosis) and produce embryos without fertilization (parthenogenesis). Apomeiosis, as a major component of apomixis, has recently been reported in some Arabidopsis thaliana mutants; dyad mutants of SWI1 showed developmental processes common to apomeiosis, such as producing functional diploid gametes. However, the orthologs of SWI1 genes in natural apomicts has not been previously reported. To identify the relationship between the SWI1 gene and the apomeiosis process, we isolated and sequenced SWI1 orthologs from Boechera species, including apomictic and sexual species. Boechera species are close relatives of A. thaliana and thus are advantageous model species for apomixis research. The SWI1 cDNAs were obtained by RT-PCR from apomictic and sexual Boechera young flower buds. We sequenced partial SWI1 transcripts that were 650bp for B. holboellii and 684bp for B. stricta. These SWI1-like sequences showed 86% similarity for B. holboellii and 92% for B. stricta to the A. thaliana SWI1 transcript. We also used available genome data and amplified genomic sequences for SWI1 orthologs in B. holboellii and B. stricta. The predicted proteins contain a phospholipase C domain and a nuclear localization signal. Sequence analysis did not show significant mutations related to apomixis, and phylogenetic analysis showed that SWI1-like sequences were common across plant families, regardless of the presence of a sexual or apomictic reproduction system. We also investigated the expression levels of SWI1 mRNA in the B. holboellii and B. stricta young unopened flower buds and found that relatively high levels of expression occurred in apomicts. PMID:27107180

  8. A genome-wide imaging-based screening to identify genes involved in synphilin-1 inclusion formation in Saccharomyces cerevisiae

    PubMed Central

    Zhao, Lei; Yang, Qian; Zheng, Ju; Zhu, Xuefeng; Hao, Xinxin; Song, Jia; Lebacq, Tom; Franssens, Vanessa; Winderickx, Joris; Nystrom, Thomas; Liu, Beidong

    2016-01-01

    Synphilin-1 is a major component of Parkinson’s disease (PD) inclusion bodies implicated in PD pathogenesis. However, the machinery controlling synphilin-1 inclusion formation remains unclear. Here, we investigated synphilin-1 inclusion formation using a systematic genome-wide, high-content imaging based screening approach (HCI) in the yeast Saccharomyces cerevisiae. By combining with a secondary screening for mutants showing significant changes on fluorescence signal intensity, we filtered out hits that significantly decreased the expression level of synphilin-1. We found 133 yeast genes that didn’t affect synphilin-1 expression but that were required for the formation of synphilin-1 inclusions. Functional enrichment and physical interaction network analysis revealed these genes to encode for functions involved in cytoskeleton organization, histone modification, sister chromatid segregation, glycolipid biosynthetic process, DNA repair and replication. All hits were confirmed by conventional microscopy. Complementation assays were performed with a selected group of mutants, results indicated that the observed phenotypic changes in synphilin-1 inclusion formation were directly caused by the loss of corresponding genes of the deletion mutants. Further growth assays of these mutants showed a significant synthetic sick effect upon synphilin-1 expression, which supports the hypothesis that matured inclusions represent an end stage of several events meant to protect cells against the synphilin-1 cytotoxicity. PMID:27440388

  9. Genes required for growth at high hydrostatic pressure in Escherichia coli K-12 identified by genome-wide screening.

    PubMed

    Black, S Lucas; Dawson, Angela; Ward, F Bruce; Allen, Rosalind J

    2013-01-01

    Despite the fact that much of the global microbial biosphere is believed to exist in high pressure environments, the effects of hydrostatic pressure on microbial physiology remain poorly understood. We use a genome-wide screening approach, combined with a novel high-throughput high-pressure cell culture method, to investigate the effects of hydrostatic pressure on microbial physiology in vivo. The Keio collection of single-gene deletion mutants in Escherichia coli K-12 was screened for growth at a range of pressures from 0.1 MPa to 60 MPa. This led to the identification of 6 genes, rodZ, holC, priA, dnaT, dedD and tatC, whose products were required for growth at 30 MPa and a further 3 genes, tolB, rffT and iscS, whose products were required for growth at 40 MPa. Our results support the view that the effects of pressure on cell physiology are pleiotropic, with DNA replication, cell division, the cytoskeleton and cell envelope physiology all being potential failure points for cell physiology during growth at elevated pressure.

  10. A bivariate genome-wide association study identifies ADAM12 as a novel susceptibility gene for Kashin-Beck disease

    PubMed Central

    Hao, Jingcan; Wang, Wenyu; Wen, Yan; Xiao, Xiao; He, Awen; Guo, Xiong; Yang, Tielin; Liu, Xiaogang; Shen, Hui; Chen, Xiangding; Tian, Qing; Deng, Hong-Wen; Zhang, Feng

    2016-01-01

    Kashin-Beck disease (KBD) is a chronic osteoarthropathy, which manifests as joint deformities and growth retardation. Only a few genetic studies of growth retardation associated with the KBD have been carried out by now. In this study, we conducted a two-stage bivariate genome-wide association study (BGWAS) of the KBD using joint deformities and body height as study phenotypes, totally involving 2,417 study subjects. Articular cartilage specimens from 8 subjects were collected for immunohistochemistry. In the BGWAS, ADAM12 gene achieved the most significant association (rs1278300 p-value = 9.25 × 10−9) with the KBD. Replication study observed significant association signal at rs1278300 (p-value = 0.007) and rs1710287 (p-value = 0.002) of ADAM12 after Bonferroni correction. Immunohistochemistry revealed significantly decreased expression level of ADAM12 protein in the KBD articular cartilage (average positive chondrocyte rate = 47.59 ± 7.79%) compared to healthy articular cartilage (average positive chondrocyte rate = 64.73 ± 5.05%). Our results suggest that ADAM12 gene is a novel susceptibility gene underlying both joint destruction and growth retardation of the KBD. PMID:27545300

  11. Genome-wide identification of long intergenic noncoding RNA genes and their potential association with domestication in pigs.

    PubMed

    Zhou, Zhong-Yin; Li, Ai-Min; Adeola, Adeniyi C; Liu, Yan-Hu; Irwin, David M; Xie, Hai-Bing; Zhang, Ya-Ping

    2014-06-02

    Thousands of long intergenic noncoding RNAs (lincRNAs) have been identified in the human and mouse genomes, some of which play important roles in fundamental biological processes. The pig is an important domesticated animal, however, pig lincRNAs remain poorly characterized and it is unknown if they were involved in the domestication of the pig. Here, we used available RNA-seq resources derived from 93 samples and expressed sequence tag data sets, and identified 6,621 lincRNA transcripts from 4,515 gene loci. Among the identified lincRNAs, some lincRNA genes exhibit synteny and sequence conservation, including linc-sscg2561, whose gene neighbor Dnmt3a is associated with