Science.gov

Sample records for genome-wide analysis reveals

  1. Genome-wide analysis reveals gene expression and metabolic network dynamics during embryo development in Arabidopsis.

    PubMed

    Xiang, Daoquan; Venglat, Prakash; Tibiche, Chabane; Yang, Hui; Risseeuw, Eddy; Cao, Yongguo; Babic, Vivijan; Cloutier, Mathieu; Keller, Wilf; Wang, Edwin; Selvaraj, Gopalan; Datla, Raju

    2011-05-01

    Embryogenesis is central to the life cycle of most plant species. Despite its importance, because of the difficulty associated with embryo isolation, global gene expression programs involved in plant embryogenesis, especially the early events following fertilization, are largely unknown. To address this gap, we have developed methods to isolate whole live Arabidopsis (Arabidopsis thaliana) embryos as young as zygote and performed genome-wide profiling of gene expression. These studies revealed insights into patterns of gene expression relating to: maternal and paternal contributions to zygote development, chromosomal level clustering of temporal expression in embryogenesis, and embryo-specific functions. Functional analysis of some of the modulated transcription factor encoding genes from our data sets confirmed that they are critical for embryogenesis. Furthermore, we constructed stage-specific metabolic networks mapped with differentially regulated genes by combining the microarray data with the available Kyoto Encyclopedia of Genes and Genomes metabolic data sets. Comparative analysis of these networks revealed the network-associated structural and topological features, pathway interactions, and gene expression with reference to the metabolic activities during embryogenesis. Together, these studies have generated comprehensive gene expression data sets for embryo development in Arabidopsis and may serve as an important foundational resource for other seed plants. PMID:21402797

  2. Genome-wide analysis reveals adaptation to high altitudes in Tibetan sheep.

    PubMed

    Wei, Caihong; Wang, Huihua; Liu, Gang; Zhao, Fuping; Kijas, James W; Ma, Youji; Lu, Jian; Zhang, Li; Cao, Jiaxue; Wu, Mingming; Wang, Guangkai; Liu, Ruizao; Liu, Zhen; Zhang, Shuzhen; Liu, Chousheng; Du, Lixin

    2016-01-01

    Tibetan sheep have lived on the Tibetan Plateau for thousands of years; however, the process and consequences of adaptation to this extreme environment have not been elucidated for important livestock such as sheep. Here, seven sheep breeds, representing both highland and lowland breeds from different areas of China, were genotyped for a genome-wide collection of single-nucleotide polymorphisms (SNPs). The FST and XP-EHH approaches were used to identify regions harbouring local positive selection between these highland and lowland breeds, and 236 genes were identified. We detected selection events spanning genes involved in angiogenesis, energy production and erythropoiesis. In particular, several candidate genes were associated with high-altitude hypoxia, including EPAS1, CRYAA, LONP1, NF1, DPP4, SOD1, PPARG and SOCS2. EPAS1 plays a crucial role in hypoxia adaption; therefore, we investigated the exon sequences of EPAS1 and identified 12 mutations. Analysis of the relationship between blood-related phenotypes and EPAS1 genotypes in additional highland sheep revealed that a homozygous mutation at a relatively conserved site in the EPAS1 3' untranslated region was associated with increased mean corpuscular haemoglobin concentration and mean corpuscular volume. Taken together, our results provide evidence of the genetic diversity of highland sheep and indicate potential high-altitude hypoxia adaptation mechanisms, including the role of EPAS1 in adaptation. PMID:27230812

  3. Genome-wide analysis reveals adaptation to high altitudes in Tibetan sheep

    PubMed Central

    Wei, Caihong; Wang, Huihua; Liu, Gang; Zhao, Fuping; Kijas, James W.; Ma, Youji; Lu, Jian; Zhang, Li; Cao, Jiaxue; Wu, Mingming; Wang, Guangkai; Liu, Ruizao; Liu, Zhen; Zhang, Shuzhen; Liu, Chousheng; Du, Lixin

    2016-01-01

    Tibetan sheep have lived on the Tibetan Plateau for thousands of years; however, the process and consequences of adaptation to this extreme environment have not been elucidated for important livestock such as sheep. Here, seven sheep breeds, representing both highland and lowland breeds from different areas of China, were genotyped for a genome-wide collection of single-nucleotide polymorphisms (SNPs). The FST and XP-EHH approaches were used to identify regions harbouring local positive selection between these highland and lowland breeds, and 236 genes were identified. We detected selection events spanning genes involved in angiogenesis, energy production and erythropoiesis. In particular, several candidate genes were associated with high-altitude hypoxia, including EPAS1, CRYAA, LONP1, NF1, DPP4, SOD1, PPARG and SOCS2. EPAS1 plays a crucial role in hypoxia adaption; therefore, we investigated the exon sequences of EPAS1 and identified 12 mutations. Analysis of the relationship between blood-related phenotypes and EPAS1 genotypes in additional highland sheep revealed that a homozygous mutation at a relatively conserved site in the EPAS1 3′ untranslated region was associated with increased mean corpuscular haemoglobin concentration and mean corpuscular volume. Taken together, our results provide evidence of the genetic diversity of highland sheep and indicate potential high-altitude hypoxia adaptation mechanisms, including the role of EPAS1 in adaptation. PMID:27230812

  4. Genetic architecture dissection by genome-wide association analysis reveals avian eggshell ultrastructure traits.

    PubMed

    Duan, Zhongyi; Sun, Congjiao; Shen, ManMan; Wang, Kehua; Yang, Ning; Zheng, Jiangxia; Xu, Guiyun

    2016-01-01

    The ultrastructure of an eggshell is considered the major determinant of eggshell quality, which has biological and economic significance for the avian and poultry industries. However, the interrelationships and genome-wide architecture of eggshell ultrastructure remain to be elucidated. Herein, we measured eggshell thickness (EST), effective layer thickness (ET), mammillary layer thickness (MT), and mammillary density (MD) and conducted genome-wide association studies in 927 F2 hens. The SNP-based heritabilities of eggshell ultrastructure traits were estimated to be 0.39, 0.36, 0.17 and 0.19 for EST, ET, MT and MD, respectively, and a total of 719, 784, 1 and 10 genome-wide significant SNPs were associated with EST, ET, MT and MD, respectively. ABCC9, ITPR2, KCNJ8 and WNK1, which are involved in ion transport, were suggested to be the key genes regulating EST and ET. ITM2C and KNDC1 likely affect MT and MD, respectively. Additionally, there were linear relationships between the chromosome lengths and the variance explained per chromosome for EST (R² = 0.57) and ET (R² = 0.67). In conclusion, the interrelationships and genetic architecture of eggshell ultrastructure traits revealed in this study are valuable for our understanding of the avian eggshell and contribute to research on a variety of other calcified shells. PMID:27456605

  5. Genetic architecture dissection by genome-wide association analysis reveals avian eggshell ultrastructure traits

    PubMed Central

    Duan, Zhongyi; Sun, Congjiao; Shen, ManMan; Wang, Kehua; Yang, Ning; Zheng, Jiangxia; Xu, Guiyun

    2016-01-01

    The ultrastructure of an eggshell is considered the major determinant of eggshell quality, which has biological and economic significance for the avian and poultry industries. However, the interrelationships and genome-wide architecture of eggshell ultrastructure remain to be elucidated. Herein, we measured eggshell thickness (EST), effective layer thickness (ET), mammillary layer thickness (MT), and mammillary density (MD) and conducted genome-wide association studies in 927 F2 hens. The SNP-based heritabilities of eggshell ultrastructure traits were estimated to be 0.39, 0.36, 0.17 and 0.19 for EST, ET, MT and MD, respectively, and a total of 719, 784, 1 and 10 genome-wide significant SNPs were associated with EST, ET, MT and MD, respectively. ABCC9, ITPR2, KCNJ8 and WNK1, which are involved in ion transport, were suggested to be the key genes regulating EST and ET. ITM2C and KNDC1 likely affect MT and MD, respectively. Additionally, there were linear relationships between the chromosome lengths and the variance explained per chromosome for EST (R² = 0.57) and ET (R² = 0.67). In conclusion, the interrelationships and genetic architecture of eggshell ultrastructure traits revealed in this study are valuable for our understanding of the avian eggshell and contribute to research on a variety of other calcified shells. PMID:27456605

  6. Differential network analysis reveals the genome-wide landscape of estrogen receptor modulation in hormonal cancers

    PubMed Central

    Hsiao, Tzu-Hung; Chiu, Yu-Chiao; Hsu, Pei-Yin; Lu, Tzu-Pin; Lai, Liang-Chuan; Tsai, Mong-Hsun; Huang, Tim H.-M.; Chuang, Eric Y.; Chen, Yidong

    2016-01-01

    Several mutual information (MI)-based algorithms have been developed to identify dynamic gene-gene and function-function interactions governed by key modulators (genes, proteins, etc.). Due to intensive computation, however, these methods rely heavily on prior knowledge and are limited in genome-wide analysis. We present the modulated gene/gene set interaction (MAGIC) analysis to systematically identify genome-wide modulation of interaction networks. Based on a novel statistical test employing conjugate Fisher transformations of correlation coefficients, MAGIC features fast computation and adaption to variations of clinical cohorts. In simulated datasets MAGIC achieved greatly improved computation efficiency and overall superior performance than the MI-based method. We applied MAGIC to construct the estrogen receptor (ER) modulated gene and gene set (representing biological function) interaction networks in breast cancer. Several novel interaction hubs and functional interactions were discovered. ER+ dependent interaction between TGFβ and NFκB was further shown to be associated with patient survival. The findings were verified in independent datasets. Using MAGIC, we also assessed the essential roles of ER modulation in another hormonal cancer, ovarian cancer. Overall, MAGIC is a systematic framework for comprehensively identifying and constructing the modulated interaction networks in a whole-genome landscape. MATLAB implementation of MAGIC is available for academic uses at https://github.com/chiuyc/MAGIC. PMID:26972162

  7. Differential network analysis reveals the genome-wide landscape of estrogen receptor modulation in hormonal cancers.

    PubMed

    Hsiao, Tzu-Hung; Chiu, Yu-Chiao; Hsu, Pei-Yin; Lu, Tzu-Pin; Lai, Liang-Chuan; Tsai, Mong-Hsun; Huang, Tim H-M; Chuang, Eric Y; Chen, Yidong

    2016-01-01

    Several mutual information (MI)-based algorithms have been developed to identify dynamic gene-gene and function-function interactions governed by key modulators (genes, proteins, etc.). Due to intensive computation, however, these methods rely heavily on prior knowledge and are limited in genome-wide analysis. We present the modulated gene/gene set interaction (MAGIC) analysis to systematically identify genome-wide modulation of interaction networks. Based on a novel statistical test employing conjugate Fisher transformations of correlation coefficients, MAGIC features fast computation and adaption to variations of clinical cohorts. In simulated datasets MAGIC achieved greatly improved computation efficiency and overall superior performance than the MI-based method. We applied MAGIC to construct the estrogen receptor (ER) modulated gene and gene set (representing biological function) interaction networks in breast cancer. Several novel interaction hubs and functional interactions were discovered. ER+ dependent interaction between TGFβ and NFκB was further shown to be associated with patient survival. The findings were verified in independent datasets. Using MAGIC, we also assessed the essential roles of ER modulation in another hormonal cancer, ovarian cancer. Overall, MAGIC is a systematic framework for comprehensively identifying and constructing the modulated interaction networks in a whole-genome landscape. MATLAB implementation of MAGIC is available for academic uses at https://github.com/chiuyc/MAGIC. PMID:26972162

  8. Genome-Wide Analysis Reveals Novel Regulators of Growth in Drosophila melanogaster.

    PubMed

    Vonesch, Sibylle Chantal; Lamparter, David; Mackay, Trudy F C; Bergmann, Sven; Hafen, Ernst

    2016-01-01

    Organismal size depends on the interplay between genetic and environmental factors. Genome-wide association (GWA) analyses in humans have implied many genes in the control of height but suffer from the inability to control the environment. Genetic analyses in Drosophila have identified conserved signaling pathways controlling size; however, how these pathways control phenotypic diversity is unclear. We performed GWA of size traits using the Drosophila Genetic Reference Panel of inbred, sequenced lines. We find that the top associated variants differ between traits and sexes; do not map to canonical growth pathway genes, but can be linked to these by epistasis analysis; and are enriched for genes and putative enhancers. Performing GWA on well-studied developmental traits under controlled conditions expands our understanding of developmental processes underlying phenotypic diversity. PMID:26751788

  9. Genome-Wide Analysis Reveals Novel Regulators of Growth in Drosophila melanogaster

    PubMed Central

    Vonesch, Sibylle Chantal; Lamparter, David; Mackay, Trudy F. C.; Bergmann, Sven; Hafen, Ernst

    2016-01-01

    Organismal size depends on the interplay between genetic and environmental factors. Genome-wide association (GWA) analyses in humans have implied many genes in the control of height but suffer from the inability to control the environment. Genetic analyses in Drosophila have identified conserved signaling pathways controlling size; however, how these pathways control phenotypic diversity is unclear. We performed GWA of size traits using the Drosophila Genetic Reference Panel of inbred, sequenced lines. We find that the top associated variants differ between traits and sexes; do not map to canonical growth pathway genes, but can be linked to these by epistasis analysis; and are enriched for genes and putative enhancers. Performing GWA on well-studied developmental traits under controlled conditions expands our understanding of developmental processes underlying phenotypic diversity. PMID:26751788

  10. Genome-wide enrichment analysis between endometriosis and obesity-related traits reveals novel susceptibility loci

    PubMed Central

    Rahmioglu, Nilufer; Macgregor, Stuart; Drong, Alexander W.; Hedman, Åsa K.; Harris, Holly R.; Randall, Joshua C.; Prokopenko, Inga; Nyholt, Dale R.; Morris, Andrew P.; Montgomery, Grant W.; Missmer, Stacey A.; Lindgren, Cecilia M.; Zondervan, Krina T.

    2015-01-01

    Endometriosis is a chronic inflammatory condition in women that results in pelvic pain and subfertility, and has been associated with decreased body mass index (BMI). Genetic variants contributing to the heritable component have started to emerge from genome-wide association studies (GWAS), although the majority remain unknown. Unexpectedly, we observed an intergenic locus on 7p15.2 that was genome-wide significantly associated with both endometriosis and fat distribution (waist-to-hip ratio adjusted for BMI; WHRadjBMI) in an independent meta-GWAS of European ancestry individuals. This led us to investigate the potential overlap in genetic variants underlying the aetiology of endometriosis, WHRadjBMI and BMI using GWAS data. Our analyses demonstrated significant enrichment of common variants between fat distribution and endometriosis (P = 3.7 × 10⁻³), which was stronger when we restricted the investigation to more severe (Stage B) cases (P = 4.5 × 10⁻⁴). However, no genetic enrichment was observed between endometriosis and BMI (P = 0.79). In addition to 7p15.2, we identify four more variants with statistically significant evidence of involvement in both endometriosis and WHRadjBMI (in/near KIFAP3, CAB39L, WNT4, GRB14); two of these, KIFAP3 and CAB39L, are novel associations for both traits. KIFAP3, WNT4 and 7p15.2 are associated with the WNT signalling pathway; formal pathway analysis confirmed a statistically significant (P = 6.41 × 10⁻⁴) overrepresentation of shared associations in developmental processes/WNT signalling between the two traits. Our results demonstrate an example of potential biological pleiotropy that was hitherto unknown, and represent an opportunity for functional follow-up of loci and further cross-phenotype comparisons to assess how fat distribution and endometriosis pathogenesis research fields can inform each other. PMID:25296917

  11. Genome-wide mutational spectra analysis reveals significant cancer-specific heterogeneity

    PubMed Central

    Tan, Hua; Bao, Jiguang; Zhou, Xiaobo

    2015-01-01

    Cancer is widely recognized as a genetic disease in which somatic mutations are sequentially accumulated to drive tumor progression. Although genomic landscape studies are informative for individual cancer types, a comprehensive comparative study of tumorigenic mutations across cancer types based on integrative data sources is still a pressing need. We systematically analyzed ~10⁶ non-synonymous mutations extracted from COSMIC, involving ~8000 genome-wide screened samples across 23 major human cancers at both the amino acid and gene levels. Our analysis identified cancer-specific heterogeneity that traditional nucleotide variation analysis alone usually overlooked. Particularly, the amino acid arginine (R) turns out to be the most favorable target of amino acid alteration in most cancer types studied (P < 10⁻⁹, binomial test), reflecting its important role in cellular physiology. The tumor suppressor gene TP53 is mutated exclusively with the HYDIN, KRAS, and PTEN genes in large intestine, lung, and endometrial cancers respectively, indicating that TP53 takes part in different signaling pathways in different cancers. While some of our analyses corroborated previous observations, others indicated relevant candidates with high priority for further experimental validation. Our findings have many ramifications in understanding the etiology of cancer and the underlying molecular mechanisms in particular cancers. PMID:26212640

  12. Genome Wide Association Analysis Reveals New Production Trait Genes in a Male Duroc Population

    PubMed Central

    Wang, Kejun; Liu, Dewu; Hernandez-Sanchez, Jules; Chen, Jie; Liu, Chengkun; Wu, Zhenfang; Fang, Meiying; Li, Ning

    2015-01-01

    In this study, 796 male Duroc pigs were used to identify genomic regions controlling growth traits. Three production traits were studied: food conversion ratio, days to 100 kg, and average daily gain, using a panel of 39,436 single nucleotide polymorphisms. In total, we detected 11 genome-wide and 162 chromosome-wide single nucleotide polymorphism trait associations. The Gene ontology analysis identified 14 candidate genes close to significant single nucleotide polymorphisms, with growth-related functions: six for days to 100 kg (WT1, FBXO3, DOCK7, PPP3CA, AGPAT9, and NKX6-1), seven for food conversion ratio (MAP2, TBX15, IVL, ARL15, CPS1, VWC2L, and VAV3), and one for average daily gain (COL27A1). Gene ontology analysis indicated that most of the candidate genes are involved in muscle, fat, bone or nervous system development, nutrient absorption, and metabolism, which are all either directly or indirectly related to growth traits in pigs. Additionally, we found four haplotype blocks composed of suggestive single nucleotide polymorphisms located in the growth trait-related quantitative trait loci and further narrowed down the ranges, the largest of which decreased by ~60 Mb. Hence, our results could be used to improve pig production traits by increasing the frequency of favorable alleles via artificial selection. PMID:26418247

  13. Genome-wide analysis reveals a cell cycle–dependent mechanism controlling centromere propagation

    PubMed Central

    Erhardt, Sylvia; Mellone, Barbara G.; Betts, Craig M.; Zhang, Weiguo; Karpen, Gary H.; Straight, Aaron F.

    2008-01-01

    Centromeres are the structural and functional foundation for kinetochore formation, spindle attachment, and chromosome segregation. In this study, we isolated factors required for centromere propagation using genome-wide RNA interference screening for defects in centromere protein A (CENP-A; centromere identifier [CID]) localization in Drosophila melanogaster. We identified the proteins CAL1 and CENP-C as essential factors for CID assembly at the centromere. CID, CAL1, and CENP-C coimmunoprecipitate and are mutually dependent for centromere localization and function. We also identified the mitotic cyclin A (CYCA) and the anaphase-promoting complex (APC) inhibitor RCA1/Emi1 as regulators of centromere propagation. We show that CYCA is centromere localized and that CYCA and RCA1/Emi1 couple centromere assembly to the cell cycle through regulation of the fizzy-related/CDH1 subunit of the APC. Our findings identify essential components of the epigenetic machinery that ensures proper specification and propagation of the centromere and suggest a mechanism for coordinating centromere inheritance with cell division. PMID:19047461

  14. Genome-wide analysis reveals mechanisms modulating autophagy in normal brain aging and in Alzheimer's disease

    PubMed Central

    Lipinski, Marta M.; Zheng, Bin; Lu, Tao; Yan, Zhenyu; Py, Bénédicte F.; Ng, Aylwin; Xavier, Ramnik J.; Li, Cheng; Yankner, Bruce A.; Scherzer, Clemens R.; Yuan, Junying

    2010-01-01

    Dysregulation of autophagy, a cellular catabolic mechanism essential for degradation of misfolded proteins, has been implicated in multiple neurodegenerative diseases. However, the mechanisms that lead to the autophagy dysfunction are still not clear. Based on the results of a genome-wide screen, we show that reactive oxygen species (ROS) serve as common mediators upstream of the activation of the type III PI3 kinase, which is critical for the initiation of autophagy. Furthermore, ROS play an essential function in the induction of the type III PI3 kinase and autophagy in response to amyloid β peptide, the main pathogenic mediator of Alzheimer's disease (AD). However, lysosomal blockage also caused by Aβ is independent of ROS. In addition, we demonstrate that autophagy is transcriptionally down-regulated during normal aging in the human brain. Strikingly, in contrast to normal aging, we observe transcriptional up-regulation of autophagy in the brains of AD patients, suggesting that there might be a compensatory regulation of autophagy. Interestingly, we show that an AD drug and an AD drug candidate have inhibitory effects on autophagy, raising the possibility that decreasing input into the lysosomal system may help to reduce cellular stress in AD. Finally, we provide a list of candidate drug targets that can be used to safely modulate levels of autophagy without causing cell death. PMID:20660724

  15. Comprehensive genome-wide analysis reveals different classes of enigmatic old yellow enzyme in fungi

    PubMed Central

    Nizam, Shadab; Verma, Sandhya; Borah, Nilam Nayan; Gazara, Rajesh Kumar; Verma, Praveen Kumar

    2014-01-01

    In this study, we systematically identify Old Yellow Enzymes (OYEs) from a diverse range of economically important fungi representing different ecology and lifestyle. Using active site residues and sequence alignments, we present a classification for these proteins into three distinct classes including a novel class (Class III) and assign names to sequences. Our in-depth phylogenetic analysis suggests a complex history of lineage-specific expansion and contraction for the OYE gene family in fungi. Comparative analyses reveal remarkable diversity in the number and classes of OYE among fungi. Quantitative real-time PCR (qRT-PCR) of Ascochyta rabiei OYEs indicates differential expression of OYE genes during oxidative stress and plant infection. This study shows relationship of OYE with fungal ecology and lifestyle, and provides a foundation for future functional analysis and characterization of OYE gene family. PMID:24500274

  16. Genome-wide analysis reveals the ancient and recent admixture history of East African Shorthorn Zebu (EASZ)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Indigenous zebu cattle are widespread across East Africa owing to their tropically-adapted physiology. Previous studies using microsatellite loci revealed the complex history of these populations with the presence of taurine and zebu genetic backgrounds. Here, we estimate at the genome-wide level th...

  17. Genome-Wide Methylation Analysis of Prostate Tissues Reveals Global Methylation Patterns of Prostate Cancer

    PubMed Central

    Luo, Jian-Hua; Ding, Ying; Chen, Rui; Michalopoulos, George; Nelson, Joel; Tseng, George; Yu, Yan P.

    2014-01-01

    Altered genome methylation is a hallmark of human malignancies. In this study, high-throughput analyses of concordant gene methylation and expression events were performed for 91 human prostate specimens, including prostate tumor (T), matched normal adjacent to tumor (AT), and organ donor (OD). Methylated DNA in genomic DNA was immunoprecipitated with anti-methylcytidine antibodies and detected by Affymetrix human whole genome SNP 6.0 chips. Among the methylated CpG islands, 11,481 islands were found located in the promoter and exon 1 regions of 9295 genes. Genes (7641) were methylated frequently across OD, AT, and T samples, whereas 239 genes were differentially methylated in only T and 785 genes in both AT and T but not OD. Genes with promoter methylation and concordantly suppressed expression were identified. Pathway analysis suggested that many of the methylated genes in T and AT are involved in cell growth and mitogenesis. Classification analysis of the differentially methylated genes in T or OD produced a specificity of 89.4% and a sensitivity of 85.7%. The T and AT groups, however, were only slightly separated by the prediction analysis, indicating a strong field effect. A gene methylation prediction model was shown to predict prostate cancer relapse with sensitivity of 80.0% and specificity of 85.0%. These results suggest methylation patterns useful in predicting clinical outcomes of prostate cancer. PMID:23583283

  18. Genome-wide analysis of Musashi-2 targets reveals novel functions in governing epithelial cell migration

    PubMed Central

    Bennett, Christopher G.; Riemondy, Kent; Chapnick, Douglas A.; Bunker, Eric; Liu, Xuedong; Kuersten, Scott; Yi, Rui

    2016-01-01

    The Musashi-2 (Msi2) RNA-binding protein maintains stem cell self-renewal and promotes oncogenesis by enhancing cell proliferation in hematopoietic and gastrointestinal tissues. However, it is unclear how Msi2 recognizes and regulates mRNA targets in vivo and whether Msi2 primarily controls cell growth in all cell types. Here we identified Msi2 targets with HITS-CLIP and revealed that Msi2 primarily recognizes mRNA 3′UTRs at sites enriched in multiple copies of UAG motifs in epithelial progenitor cells. RNA-seq and ribosome profiling demonstrated that Msi2 promotes targeted mRNA decay without affecting translation efficiency. Unexpectedly, the most prominent Msi2 targets identified are key regulators that govern cell motility with a high enrichment in focal adhesion and extracellular matrix-receptor interaction, in addition to regulators of cell growth and survival. Loss of Msi2 stimulates epithelial cell migration, increases the number of focal adhesions and also compromises cell growth. These findings provide new insights into the molecular mechanisms of Msi2's recognition and repression of targets and uncover a key function of Msi2 in restricting epithelial cell migration. PMID:27034466

  19. Genome-Wide Analysis Reveals Novel Genes Essential for Heme Homeostasis in Caenorhabditis elegans

    PubMed Central

    Rao, Anita U.; Cerqueira, Gustavo C.; Mitreva, Makedonka; El-Sayed, Najib M.; Krause, Michael; Hamza, Iqbal

    2010-01-01

    Heme is a cofactor in proteins that function in almost all sub-cellular compartments and in many diverse biological processes. Heme is produced by a conserved biosynthetic pathway that is highly regulated to prevent the accumulation of heme—a cytotoxic, hydrophobic tetrapyrrole. Caenorhabditis elegans and related parasitic nematodes do not synthesize heme, but instead require environmental heme to grow and develop. Heme homeostasis in these auxotrophs is, therefore, regulated in accordance with available dietary heme. We have capitalized on this auxotrophy in C. elegans to study gene expression changes associated with precisely controlled dietary heme concentrations. RNA was isolated from cultures containing 4, 20, or 500 µM heme; derived cDNA probes were hybridized to Affymetrix C. elegans expression arrays. We identified 288 heme-responsive genes (hrgs) that were differentially expressed under these conditions. Of these genes, 42% had putative homologs in humans, while genomes of medically relevant heme auxotrophs revealed homologs for 12% in both Trypanosoma and Leishmania and 24% in parasitic nematodes. Depletion of each of the 288 hrgs by RNA–mediated interference (RNAi) in a transgenic heme-sensor worm strain identified six genes that regulated heme homeostasis. In addition, seven membrane-spanning transporters involved in heme uptake were identified by RNAi knockdown studies using a toxic heme analog. Comparison of genes that were positive in both of the RNAi screens resulted in the identification of three genes in common that were vital for organismal heme homeostasis in C. elegans. Collectively, our results provide a catalog of genes that are essential for metazoan heme homeostasis and demonstrate the power of C. elegans as a genetic animal model to dissect the regulatory circuits which mediate heme trafficking in both vertebrate hosts and their parasites, which depend on environmental heme for survival. PMID:20686661

  20. Genome Wide Binding Site Analysis Reveals Transcriptional Coactivation of Cytokinin-Responsive Genes by DELLA Proteins

    PubMed Central

    Marín-de la Rosa, Nora; Pfeiffer, Anne; Hill, Kristine; Locascio, Antonella; Bhalerao, Rishikesh P.; Miskolczi, Pal; Grønlund, Anne L.; Wanchoo-Kohli, Aakriti; Thomas, Stephen G.; Bennett, Malcolm J.; Lohmann, Jan U.; Blázquez, Miguel A.; Alabadí, David

    2015-01-01

    The ability of plants to provide a plastic response to environmental cues relies on the connectivity between signaling pathways. DELLA proteins act as hubs that relay environmental information to the multiple transcriptional circuits that control growth and development through physical interaction with transcription factors from different families. We have analyzed the presence of one DELLA protein at the Arabidopsis genome by chromatin immunoprecipitation coupled to large-scale sequencing and we find that it binds at the promoters of multiple genes. Enrichment analysis shows a strong preference for cis elements recognized by specific transcription factor families. In particular, we demonstrate that DELLA proteins are recruited by type-B ARABIDOPSIS RESPONSE REGULATORS (ARR) to the promoters of cytokinin-regulated genes, where they act as transcriptional co-activators. The biological relevance of this mechanism is underpinned by the necessity of simultaneous presence of DELLAs and ARRs to restrict root meristem growth and to promote photomorphogenesis. PMID:26134422

  1. Genome-wide analysis reveals regulatory role of G4 DNA in gene transcription

    PubMed Central

    Du, Zhuo; Zhao, Yiqiang; Li, Ning

    2008-01-01

    G-quadruplex or G4 DNA, a four-stranded DNA structure formed in G-rich sequences, has been hypothesized to be a structural motif involved in gene regulation. In this study, we examined the regulatory role of potential G4 DNA motifs (PG4Ms) located in the putative transcriptional regulatory region (TRR, –500 to +500) of genes across the human genome. We found that PG4Ms in the 500-bp region downstream of the annotated transcription start site (TSS; PG4MD500) are associated with gene expression. Generally, PG4MD500-positive genes are expressed at higher levels than PG4MD500-negative genes, and an increased number of PG4MD500 provides a cumulative effect. This observation was validated by controlling for attributes, including gene family, function, and promoter similarity. We also observed an asymmetric pattern of PG4MD500 distribution between strands, whereby the frequency of PG4MD500 in the coding strand is generally higher than that in the template strand. Further analysis showed that the presence of PG4MD500 and its strand asymmetry are associated with significant enrichment of RNAP II at the putative TRR. On the basis of these results, we propose a model of G4 DNA-mediated stimulation of transcription with the hypothesis that PG4MD500 contributes to gene transcription by maintaining the DNA in an open conformation, while the asymmetric distribution of PG4MD500 considerably reduces the probability of blocking the progression of the RNA polymerase complex on the template strand. Our findings provide a comprehensive view of the regulatory function of G4 DNA in gene transcription. PMID:18096746
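
    As an illustration of how potential G4 motifs can be located on each strand, here is a minimal sketch. The abstract does not state the motif definition used; the pattern below (four runs of at least three guanines separated by loops of 1 to 7 nucleotides) is a commonly used convention and is an assumption here, as are the names `find_pg4m`, `G4_PLUS`, and `G4_MINUS`.

```python
import re

# Assumed motif definition (not taken from the abstract): four G-runs of
# length >= 3 separated by loops of 1-7 nucleotides.
G4_PLUS = re.compile(r"G{3,}(?:[ACGT]{1,7}G{3,}){3}")
# A G-rich motif on the complementary strand appears as C-runs on the given strand.
G4_MINUS = re.compile(r"C{3,}(?:[ACGT]{1,7}C{3,}){3}")


def find_pg4m(seq):
    """Return sorted (start, end, strand) tuples for non-overlapping motif hits.

    Coordinates are 0-based, end-exclusive, relative to the given sequence.
    """
    seq = seq.upper()
    hits = []
    for strand, pattern in (("+", G4_PLUS), ("-", G4_MINUS)):
        for match in pattern.finditer(seq):
            hits.append((match.start(), match.end(), strand))
    return sorted(hits)


if __name__ == "__main__":
    # Hypothetical sequences for illustration only.
    print(find_pg4m("TTGGGAGGGTGGGAGGGTT"))
    print(find_pg4m("TTCCCTCCCACCCTCCCTT"))
```

    Counting hits per strand within a window such as the 500 bp downstream of a transcription start site would give a rough analogue of the strand comparison described in the abstract, under the stated motif assumption.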

  2. Genome-wide analysis reveals selection for important traits in domestic horse breeds.

    PubMed

    Petersen, Jessica L; Mickelson, James R; Rendahl, Aaron K; Valberg, Stephanie J; Andersson, Lisa S; Axelsson, Jeanette; Bailey, Ernie; Bannasch, Danika; Binns, Matthew M; Borges, Alexandre S; Brama, Pieter; da Câmara Machado, Artur; Capomaccio, Stefano; Cappelli, Katia; Cothran, E Gus; Distl, Ottmar; Fox-Clipsham, Laura; Graves, Kathryn T; Guérin, Gérard; Haase, Bianca; Hasegawa, Telhisa; Hemmann, Karin; Hill, Emmeline W; Leeb, Tosso; Lindgren, Gabriella; Lohi, Hannes; Lopes, Maria Susana; McGivney, Beatrice A; Mikko, Sofia; Orr, Nicholas; Penedo, M Cecilia T; Piercy, Richard J; Raekallio, Marja; Rieder, Stefan; Røed, Knut H; Swinburne, June; Tozaki, Teruaki; Vaudin, Mark; Wade, Claire M; McCue, Molly E

    2013-01-01

    Intense selective pressures applied over short evolutionary time have resulted in homogeneity within, but substantial variation among, horse breeds. Utilizing this population structure, 744 individuals from 33 breeds, and a 54,000 SNP genotyping array, breed-specific targets of selection were identified using an F(ST)-based statistic calculated in 500-kb windows across the genome. A 5.5-Mb region of ECA18, in which the myostatin (MSTN) gene was centered, contained the highest signature of selection in both the Paint and Quarter Horse. Gene sequencing and histological analysis of gluteal muscle biopsies showed a promoter variant and intronic SNP of MSTN were each significantly associated with higher Type 2B and lower Type 1 muscle fiber proportions in the Quarter Horse, demonstrating a functional consequence of selection at this locus. Signatures of selection on ECA23 in all gaited breeds in the sample led to the identification of a shared, 186-kb haplotype including two doublesex related mab transcription factor genes (DMRT2 and 3). The recent identification of a DMRT3 mutation within this haplotype, which appears necessary for the ability to perform alternative gaits, provides further evidence for selection at this locus. Finally, putative loci for the determination of size were identified in the draft breeds and the Miniature horse on ECA11, as well as when signatures of selection surrounding candidate genes at other loci were examined. This work provides further evidence of the importance of MSTN in racing breeds, provides strong evidence for selection upon gait and size, and illustrates the potential for population-based techniques to find genomic regions driving important phenotypes in the modern horse. PMID:23349635
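
    To make the idea of a windowed F(ST) scan concrete, here is a minimal sketch. It is not the statistic used in the study, which the abstract describes only as "F(ST)-based"; it assumes two equally weighted populations, biallelic SNPs, and a simple heterozygosity-based F(ST) computed as a ratio of sums within each 500-kb window. The function name `windowed_fst` and the input layout are hypothetical.

```python
from collections import defaultdict


def windowed_fst(snps, window=500_000):
    """Heterozygosity-based F_ST per fixed-size window (illustrative only).

    snps: iterable of (position_bp, allele_freq_pop1, allele_freq_pop2).
    Returns {window_start_bp: F_ST}, using a ratio of sums within each window.
    """
    numerator = defaultdict(float)    # sum of (H_T - H_S) per window
    denominator = defaultdict(float)  # sum of H_T per window
    for pos, p1, p2 in snps:
        p_bar = (p1 + p2) / 2
        h_total = 2 * p_bar * (1 - p_bar)                       # pooled heterozygosity
        h_within = (2 * p1 * (1 - p1) + 2 * p2 * (1 - p2)) / 2  # mean within-population
        w = pos // window
        numerator[w] += h_total - h_within
        denominator[w] += h_total
    return {
        w * window: (numerator[w] / denominator[w] if denominator[w] > 0 else float("nan"))
        for w in sorted(denominator)
    }


if __name__ == "__main__":
    # Hypothetical allele frequencies for illustration only.
    example = [(100, 0.0, 1.0), (600_000, 0.5, 0.5), (700_000, 0.5, 0.5)]
    print(windowed_fst(example))
```

    Windows with unusually high values relative to the genome-wide distribution would be the candidates for selection signatures in a scan of this kind.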

  3. Genome-Wide Analysis Reveals Selection for Important Traits in Domestic Horse Breeds

    PubMed Central

    Petersen, Jessica L.; Mickelson, James R.; Rendahl, Aaron K.; Valberg, Stephanie J.; Andersson, Lisa S.; Axelsson, Jeanette; Bailey, Ernie; Bannasch, Danika; Binns, Matthew M.; Borges, Alexandre S.; Brama, Pieter; da Câmara Machado, Artur; Capomaccio, Stefano; Cappelli, Katia; Cothran, E. Gus; Distl, Ottmar; Fox-Clipsham, Laura; Graves, Kathryn T.; Guérin, Gérard; Haase, Bianca; Hasegawa, Telhisa; Hemmann, Karin; Hill, Emmeline W.; Leeb, Tosso; Lindgren, Gabriella; Lohi, Hannes; Lopes, Maria Susana; McGivney, Beatrice A.; Mikko, Sofia; Orr, Nicholas; Penedo, M. Cecilia T.; Piercy, Richard J.; Raekallio, Marja; Rieder, Stefan; Røed, Knut H.; Swinburne, June; Tozaki, Teruaki; Vaudin, Mark; Wade, Claire M.; McCue, Molly E.

    2013-01-01

    Intense selective pressures applied over short evolutionary time have resulted in homogeneity within, but substantial variation among, horse breeds. Utilizing this population structure, 744 individuals from 33 breeds, and a 54,000 SNP genotyping array, breed-specific targets of selection were identified using an FST-based statistic calculated in 500-kb windows across the genome. A 5.5-Mb region of ECA18, in which the myostatin (MSTN) gene was centered, contained the highest signature of selection in both the Paint and Quarter Horse. Gene sequencing and histological analysis of gluteal muscle biopsies showed a promoter variant and intronic SNP of MSTN were each significantly associated with higher Type 2B and lower Type 1 muscle fiber proportions in the Quarter Horse, demonstrating a functional consequence of selection at this locus. Signatures of selection on ECA23 in all gaited breeds in the sample led to the identification of a shared, 186-kb haplotype including two doublesex related mab transcription factor genes (DMRT2 and 3). The recent identification of a DMRT3 mutation within this haplotype, which appears necessary for the ability to perform alternative gaits, provides further evidence for selection at this locus. Finally, putative loci for the determination of size were identified in the draft breeds and the Miniature horse on ECA11, as well as when signatures of selection surrounding candidate genes at other loci were examined. This work provides further evidence of the importance of MSTN in racing breeds, provides strong evidence for selection upon gait and size, and illustrates the potential for population-based techniques to find genomic regions driving important phenotypes in the modern horse. PMID:23349635

  4. Genome-Wide Analysis of Group A Streptococci Reveals a Mutation That Modulates Global Phenotype and Disease Specificity

    PubMed Central

    2006-01-01

    Many human pathogens produce phenotypic variants as a means to circumvent the host immune system and enhance survival and, as a potential consequence, exhibit increased virulence. For example, it has been known for almost 90 y that clinical isolates of the human bacterial pathogen group A streptococci (GAS) have extensive phenotypic heterogeneity linked to variation in virulence. However, the complete underlying molecular mechanism(s) have not been defined. Expression microarray analysis of nine clinical isolates identified two fundamentally different transcriptomes, designated pharyngeal transcriptome profile (PTP) and invasive transcriptome profile (ITP). PTP and ITP GAS differed in approximately 10% of the transcriptome, including at least 23 proven or putative virulence factor genes. ITP organisms were recovered from skin lesions of mice infected subcutaneously with PTP GAS and were significantly more able to survive phagocytosis and killing by human polymorphonuclear leukocytes. Complete genome resequencing of a mouse-derived ITP GAS revealed that the organism differed from its precursor by only a 7-bp frameshift mutation in the gene (covS) encoding the sensor kinase component of a two-component signal transduction system implicated in virulence. Genetic complementation, and sequence analysis of covR/S in 42 GAS isolates confirmed the central role of covR/S in transcriptome, exoproteome, and virulence modulation. Genome-wide analysis provides a heretofore unattained understanding of phenotypic variation and disease specificity in microbial pathogens, resulting in new avenues for vaccine and therapeutics research. PMID:16446783

  5. A genome-wide association analysis of temozolomide response using lymphoblastoid cell lines reveals a clinically relevant association with MGMT

    PubMed Central

    Brown, Chad C.; Havener, Tammy M.; Medina, Marisa Wong; Auman, J. Todd; Mangravite, Lara M.; Krauss, Ronald M.; McLeod, Howard L.; Motsinger-Reif, Alison A.

    2013-01-01

    Recently, lymphoblastoid cell lines (LCLs) have emerged as an innovative model system for mapping gene variants that predict dose response to chemotherapy drugs. In the current study, this strategy was expanded to the in vitro genome-wide association approach, using 516 LCLs derived from a Caucasian cohort to assess cytotoxic response to temozolomide. Genome-wide association analysis using approximately 2.1 million quality-controlled single-nucleotide polymorphisms (SNPs) identified a statistically significant association (p < 10^-8) with SNPs in the O6-methylguanine–DNA methyltransferase (MGMT) gene. We also demonstrate that the primary SNP in this region is significantly associated with differential gene expression of MGMT (p < 10^-26) in LCLs, and differential methylation in glioblastoma samples from The Cancer Genome Atlas. The previously documented clinical and functional relationships between MGMT and temozolomide response highlight the potential of well-powered GWAS of the LCL model system to identify meaningful genetic associations. PMID:23047291

  6. Genome-wide analysis of longevity in nutrient-deprived Saccharomyces cerevisiae reveals importance of recycling in maintaining cell viability.

    PubMed

    Davey, Hazel M; Cross, Emma J M; Davey, Christopher L; Gkargkas, Konstantinos; Delneri, Daniela; Hoyle, David C; Oliver, Stephen G; Kell, Douglas B; Griffith, Gareth W

    2012-05-01

    Although typically cosseted in the laboratory with constant temperatures and plentiful nutrients, microbes are frequently exposed to much more stressful conditions in their natural environments where survival and competitive fitness depend upon both growth rate when conditions are favourable and on persistence in a viable and recoverable state when they are not. In order to determine the role of genetic heterogeneity in environmental fitness we present a novel approach that combines the power of fluorescence-activated cell sorting with barcode microarray analysis and apply this to determining the importance of every gene in the Saccharomyces cerevisiae genome in a high-throughput, genome-wide fitness screen. We have grown > 6000 heterozygous mutants together and exposed them to a starvation stress before using fluorescence-activated cell sorting to identify and isolate those individual cells that have not survived the stress applied. Barcode array analysis of the sorted and total populations reveals the importance of cellular recycling mechanisms (autophagy, pexophagy and ribosome breakdown) in maintaining cell viability during starvation and provides compelling evidence for an important role for fatty acid degradation in maintaining viability. In addition, we have developed a semi-batch fermentor system that is a more realistic model of environmental fitness than either batch or chemostat culture. Barcode array analysis revealed that arginine biosynthesis was important for fitness in semi-batch culture and modelling of this regime showed that rapid emergence from lag phase led to greatly increased fitness. One hundred and twenty-five strains with deletions in unclassified proteins were identified as being over-represented in the sorted fraction, while 27 unclassified proteins caused a haploinsufficient phenotype in semi-batch culture. These methods thus provide a screen to identifying other genes and pathways that have a role in maintaining cell viability. PMID

  7. Genome-wide analysis reveals the ancient and recent admixture history of East African Shorthorn Zebu from Western Kenya.

    PubMed

    Mbole-Kariuki, M N; Sonstegard, T; Orth, A; Thumbi, S M; Bronsvoort, B M de C; Kiara, H; Toye, P; Conradie, I; Jennings, A; Coetzer, K; Woolhouse, M E J; Hanotte, O; Tapio, M

    2014-10-01

    The Kenyan East African zebu cattle are valuable and widely used genetic resources. Previous studies using microsatellite loci revealed the complex history of these populations with the presence of taurine and zebu genetic backgrounds. Here, we estimate at genome-wide level the genetic composition and population structure of the East African Shorthorn Zebu (EASZ) of western Kenya. A total of 548 EASZ from 20 sub-locations were genotyped using the Illumina BovineSNP50 v. 1 beadchip. STRUCTURE analysis reveals admixture with Asian zebu, African and European taurine cattle. The EASZ were separated into three categories: substantial (⩾12.5%), moderate (1.56%

  8. Genome-wide analysis reveals the ancient and recent admixture history of East African Shorthorn Zebu from Western Kenya

    PubMed Central

    Mbole-Kariuki, M N; Sonstegard, T; Orth, A; Thumbi, S M; Bronsvoort, B M de C; Kiara, H; Toye, P; Conradie, I; Jennings, A; Coetzer, K; Woolhouse, M E J; Hanotte, O; Tapio, M

    2014-01-01

    The Kenyan East African zebu cattle are valuable and widely used genetic resources. Previous studies using microsatellite loci revealed the complex history of these populations with the presence of taurine and zebu genetic backgrounds. Here, we estimate at genome-wide level the genetic composition and population structure of the East African Shorthorn Zebu (EASZ) of western Kenya. A total of 548 EASZ from 20 sub-locations were genotyped using the Illumina BovineSNP50 v. 1 beadchip. STRUCTURE analysis reveals admixture with Asian zebu, African and European taurine cattle. The EASZ were separated into three categories: substantial (⩾12.5%), moderate (1.56%

  9. Meta-analysis of genome-wide association studies reveals genetic overlap between Hodgkin lymphoma and multiple sclerosis

    PubMed Central

    Khankhanian, Pouya; Cozen, Wendy; Himmelstein, Daniel S; Madireddy, Lohith; Din, Lennox; van den Berg, Anke; Matsushita, Takuya; Glaser, Sally L; Moré, Jayaji M; Smedby, Karin E.; Baranzini, Sergio E; Mack, Thomas M; Lizée, Antoine; de Sanjosé, Silvia; Gourraud, Pierre-Antoine; Nieters, Alexandra; Hauser, Stephen L; Cocco, Pierluigi; Maynadié, Marc; Foretová, Lenka; Staines, Anthony; Delahaye-Sourdeix, Manon; Li, Dalin; Bhatia, Smita; Melbye, Mads; Onel, Kenan; Jarrett, Ruth; McKay, James D; Oksenberg, Jorge R; Hjalgrim, Henrik

    2016-01-01

    Background: Based on epidemiological commonalities, multiple sclerosis (MS) and Hodgkin lymphoma (HL), two clinically distinct conditions, have long been suspected to be aetiologically related. MS and HL occur in roughly the same age groups, both are associated with Epstein-Barr virus infection and ultraviolet (UV) light exposure, and they cluster mutually in families (though not in individuals). We asked whether, in addition to sharing environmental risk factors, MS and HL are also genetically related. Using data from genome-wide association studies (GWAS) of 1816 HL patients, 9772 MS patients and 25 255 controls, we therefore investigated the genetic overlap between the two diseases. Methods: From among a common denominator of 404 K single nucleotide polymorphisms (SNPs) studied, we identified SNPs and human leukocyte antigen (HLA) alleles independently associated with both diseases. Next, we assessed the cumulative genome-wide effect of MS-associated SNPs on HL and of HL-associated SNPs on MS. To provide an interpretational frame of reference, we used data from published GWAS to create a genetic network of diseases within which we analysed proximity of HL and MS to autoimmune diseases and haematological and non-haematological malignancies. Results: SNP analyses revealed genome-wide overlap between HL and MS, most prominently in the HLA region. Polygenic HL risk scores explained 4.44% of HL risk (Nagelkerke R^2), but also 2.36% of MS risk. Conversely, polygenic MS risk scores explained 8.08% of MS risk and 1.94% of HL risk. In the genetic disease network, HL was closer to autoimmune diseases than to solid cancers. Conclusions: HL displays considerable genetic overlap with MS and other autoimmune diseases. PMID:26971321

  10. Meta-analysis of heterogeneous Down Syndrome data reveals consistent genome-wide dosage effects related to neurological processes

    PubMed Central

    2011-01-01

    Background: Down syndrome (DS; trisomy 21) is the most common genetic cause of mental retardation in the human population and key molecular networks dysregulated in DS are still unknown. Many different experimental techniques have been applied to analyse the effects of dosage imbalance at the molecular and phenotypical level; however, no integrative approach currently exists that attempts to extract the common information. Results: We have performed a statistical meta-analysis from 45 heterogeneous publicly available DS data sets in order to identify consistent dosage effects from these studies. We identified 324 genes with significant genome-wide dosage effects, including well investigated genes like SOD1, APP, RUNX1 and DYRK1A as well as a large proportion of novel genes (N = 62). Furthermore, we characterized these genes using gene ontology, molecular interactions and promoter sequence analysis. In order to judge relevance of the 324 genes for more general cerebral pathologies we used independent publicly available microarray data from brain studies not related to DS and identified a subset of 79 genes with potential impact for neurocognitive processes. All results have been made available through a web server under http://ds-geneminer.molgen.mpg.de/. Conclusions: Our study represents a comprehensive integrative analysis of heterogeneous data including genome-wide transcript levels in the domain of trisomy 21. The detected dosage effects build a resource for further studies of DS pathology and the development of new therapies. PMID:21569303

  11. Genome-wide meta-analysis reveals common splice site acceptor variant in CHRNA4 associated with nicotine dependence.

    PubMed

    Hancock, D B; Reginsson, G W; Gaddis, N C; Chen, X; Saccone, N L; Lutz, S M; Qaiser, B; Sherva, R; Steinberg, S; Zink, F; Stacey, S N; Glasheen, C; Chen, J; Gu, F; Frederiksen, B N; Loukola, A; Gudbjartsson, D F; Brüske, I; Landi, M T; Bickeböller, H; Madden, P; Farrer, L; Kaprio, J; Kranzler, H R; Gelernter, J; Baker, T B; Kraft, P; Amos, C I; Caporaso, N E; Hokanson, J E; Bierut, L J; Thorgeirsson, T E; Johnson, E O; Stefansson, K

    2015-01-01

    We conducted a 1000 Genomes-imputed genome-wide association study (GWAS) meta-analysis for nicotine dependence, defined by the Fagerström Test for Nicotine Dependence in 17 074 ever smokers from five European-ancestry samples. We followed up novel variants in 7469 ever smokers from five independent European-ancestry samples. We identified genome-wide significant association in the alpha-4 nicotinic receptor subunit (CHRNA4) gene on chromosome 20q13: lowest P=8.0 × 10^-9 across all the samples for rs2273500-C (frequency=0.15; odds ratio=1.12 and 95% confidence interval=1.08-1.17 for severe vs mild dependence). rs2273500-C, a splice site acceptor variant resulting in an alternate CHRNA4 transcript predicted to be targeted for nonsense-mediated decay, was associated with decreased CHRNA4 expression in physiologically normal human brains (lowest P=7.3 × 10^-4). Importantly, rs2273500-C was associated with increased lung cancer risk (N=28 998, odds ratio=1.06 and 95% confidence interval=1.00-1.12), likely through its effect on smoking, as rs2273500-C was no longer associated with lung cancer after adjustment for smoking. Using criteria for smoking behavior that encompass more than the single 'cigarettes per day' item, we identified a common CHRNA4 variant with important regulatory properties that contributes to nicotine dependence and smoking-related consequences. PMID:26440539

  12. Genome-Wide Analysis of Wilms' Tumor 1-Controlled Gene Expression in Podocytes Reveals Key Regulatory Mechanisms.

    PubMed

    Kann, Martin; Ettou, Sandrine; Jung, Youngsook L; Lenz, Maximilian O; Taglienti, Mary E; Park, Peter J; Schermer, Bernhard; Benzing, Thomas; Kreidberg, Jordan A

    2015-09-01

    The transcription factor Wilms' tumor suppressor 1 (WT1) is key to podocyte development and viability; however, WT1 transcriptional networks in podocytes remain elusive. We provide a comprehensive analysis of the genome-wide WT1 transcriptional network in podocytes in vivo using chromatin immunoprecipitation followed by sequencing (ChIPseq) and RNA sequencing techniques. Our data show a specific role for WT1 in regulating the podocyte-specific transcriptome through binding to both promoters and enhancers of target genes. Furthermore, we inferred a podocyte transcription factor network consisting of WT1, LMX1B, TCF21, Fox-class and TEAD family transcription factors, and MAFB that uses tissue-specific enhancers to control podocyte gene expression. In addition to previously described WT1-dependent target genes, ChIPseq identified novel WT1-dependent signaling systems. These targets included components of the Hippo signaling system, underscoring the power of genome-wide transcriptional-network analyses. Together, our data elucidate a comprehensive gene regulatory network in podocytes suggesting that WT1 gene regulatory function and podocyte cell-type specification can best be understood in the context of transcription factor-regulatory element network interplay. PMID:25636411

  13. Genome-wide meta-analysis reveals common splice site acceptor variant in CHRNA4 associated with nicotine dependence

    PubMed Central

    Hancock, D B; Reginsson, G W; Gaddis, N C; Chen, X; Saccone, N L; Lutz, S M; Qaiser, B; Sherva, R; Steinberg, S; Zink, F; Stacey, S N; Glasheen, C; Chen, J; Gu, F; Frederiksen, B N; Loukola, A; Gudbjartsson, D F; Brüske, I; Landi, M T; Bickeböller, H; Madden, P; Farrer, L; Kaprio, J; Kranzler, H R; Gelernter, J; Baker, T B; Kraft, P; Amos, C I; Caporaso, N E; Hokanson, J E; Bierut, L J; Thorgeirsson, T E; Johnson, E O; Stefansson, K

    2015-01-01

    We conducted a 1000 Genomes–imputed genome-wide association study (GWAS) meta-analysis for nicotine dependence, defined by the Fagerström Test for Nicotine Dependence in 17 074 ever smokers from five European-ancestry samples. We followed up novel variants in 7469 ever smokers from five independent European-ancestry samples. We identified genome-wide significant association in the alpha-4 nicotinic receptor subunit (CHRNA4) gene on chromosome 20q13: lowest P=8.0 × 10^-9 across all the samples for rs2273500-C (frequency=0.15; odds ratio=1.12 and 95% confidence interval=1.08–1.17 for severe vs mild dependence). rs2273500-C, a splice site acceptor variant resulting in an alternate CHRNA4 transcript predicted to be targeted for nonsense-mediated decay, was associated with decreased CHRNA4 expression in physiologically normal human brains (lowest P=7.3 × 10^-4). Importantly, rs2273500-C was associated with increased lung cancer risk (N=28 998, odds ratio=1.06 and 95% confidence interval=1.00–1.12), likely through its effect on smoking, as rs2273500-C was no longer associated with lung cancer after adjustment for smoking. Using criteria for smoking behavior that encompass more than the single ‘cigarettes per day’ item, we identified a common CHRNA4 variant with important regulatory properties that contributes to nicotine dependence and smoking-related consequences. PMID:26440539

  14. Genome-wide association analysis reveals a SOD1 mutation in canine degenerative myelopathy that resembles amyotrophic lateral sclerosis.

    PubMed

    Awano, Tomoyuki; Johnson, Gary S; Wade, Claire M; Katz, Martin L; Johnson, Gayle C; Taylor, Jeremy F; Perloski, Michele; Biagi, Tara; Baranowska, Izabella; Long, Sam; March, Philip A; Olby, Natasha J; Shelton, G Diane; Khan, Shahnawaz; O'Brien, Dennis P; Lindblad-Toh, Kerstin; Coates, Joan R

    2009-02-24

    Canine degenerative myelopathy (DM) is a fatal neurodegenerative disease prevalent in several dog breeds. Typically, the initial progressive upper motor neuron spastic and general proprioceptive ataxia in the pelvic limbs occurs at 8 years of age or older. If euthanasia is delayed, the clinical signs will ascend, causing flaccid tetraparesis and other lower motor neuron signs. DNA samples from 38 DM-affected Pembroke Welsh corgi cases and 17 related clinically normal controls were used for genome-wide association mapping, which produced the strongest associations with markers on CFA31 in a region containing the canine SOD1 gene. SOD1 was considered a regional candidate gene because mutations in human SOD1 can cause amyotrophic lateral sclerosis (ALS), an adult-onset fatal paralytic neurodegenerative disease with both upper and lower motor neuron involvement. The resequencing of SOD1 in normal and affected dogs revealed a G to A transition, resulting in an E40K missense mutation. Homozygosity for the A allele was associated with DM in 5 dog breeds: Pembroke Welsh corgi, Boxer, Rhodesian ridgeback, German Shepherd dog, and Chesapeake Bay retriever. Microscopic examination of spinal cords from affected dogs revealed myelin and axon loss affecting the lateral white matter and neuronal cytoplasmic inclusions that bind anti-superoxide dismutase 1 antibodies. These inclusions are similar to those seen in spinal cord sections from ALS patients with SOD1 mutations. Our findings identify canine DM to be the first recognized spontaneously occurring animal model for ALS. PMID:19188595

  15. Genome-wide analysis of the AP2/ERF family in Musa species reveals divergence and neofunctionalisation during evolution

    PubMed Central

    Lakhwani, Deepika; Pandey, Ashutosh; Dhar, Yogeshwar Vikram; Bag, Sumit Kumar; Trivedi, Prabodh Kumar; Asif, Mehar Hasan

    2016-01-01

    AP2/ERF domain containing transcription factor super family is one of the important regulators in the plant kingdom. The involvement of AP2/ERF family members has been elucidated in various processes associated with plant growth, development as well as in response to hormones, biotic and abiotic stresses. In this study, we carried out genome-wide analysis to identify members of AP2/ERF family in Musa acuminata (A genome) and Musa balbisiana (B genome) and changes leading to neofunctionalisation of genes. Analysis identified 265 and 318 AP2/ERF encoding genes in M. acuminata and M. balbisiana respectively which were further classified into ERF, DREB, AP2, RAV and Soloist groups. Comparative analysis indicated that AP2/ERF family has undergone duplication, loss and divergence during evolution and speciation of the Musa A and B genomes. We identified nine genes which are up-regulated during fruit ripening and might be components of the regulatory machinery operating during ethylene-dependent ripening in banana. Tissue-specific expression analysis of the genes suggests that different regulatory mechanisms might be involved in peel and pulp ripening process through recruiting specific ERFs in these tissues. Analysis also suggests that MaRAV-6 and MaERF026 have structurally diverged from their M. balbisiana counterparts and have attained new functions during ripening. PMID:26733055

  16. Genome-wide analysis of the AP2/ERF family in Musa species reveals divergence and neofunctionalisation during evolution.

    PubMed

    Lakhwani, Deepika; Pandey, Ashutosh; Dhar, Yogeshwar Vikram; Bag, Sumit Kumar; Trivedi, Prabodh Kumar; Asif, Mehar Hasan

    2016-01-01

    AP2/ERF domain containing transcription factor super family is one of the important regulators in the plant kingdom. The involvement of AP2/ERF family members has been elucidated in various processes associated with plant growth, development as well as in response to hormones, biotic and abiotic stresses. In this study, we carried out genome-wide analysis to identify members of AP2/ERF family in Musa acuminata (A genome) and Musa balbisiana (B genome) and changes leading to neofunctionalisation of genes. Analysis identified 265 and 318 AP2/ERF encoding genes in M. acuminata and M. balbisiana respectively which were further classified into ERF, DREB, AP2, RAV and Soloist groups. Comparative analysis indicated that AP2/ERF family has undergone duplication, loss and divergence during evolution and speciation of the Musa A and B genomes. We identified nine genes which are up-regulated during fruit ripening and might be components of the regulatory machinery operating during ethylene-dependent ripening in banana. Tissue-specific expression analysis of the genes suggests that different regulatory mechanisms might be involved in peel and pulp ripening process through recruiting specific ERFs in these tissues. Analysis also suggests that MaRAV-6 and MaERF026 have structurally diverged from their M. balbisiana counterparts and have attained new functions during ripening. PMID:26733055

  17. Host genetic determinants of microbiota-dependent nutrition revealed by genome-wide analysis of Drosophila melanogaster

    PubMed Central

    Dobson, Adam J.; Chaston, John M.; Newell, Peter D.; Donahue, Leanne; Hermann, Sara L.; Sannino, David R.; Westmiller, Stephanie; Wong, Adam C.-N.; Clark, Andrew G.; Lazzaro, Brian P.; Douglas, Angela E.

    2015-01-01

    Animals bear communities of gut microorganisms with substantial effects on animal nutrition, but the host genetic basis of these effects is unknown. Here, we use Drosophila to demonstrate substantial among-genotype variation in the effects of eliminating the gut microbiota on five host nutritional indices (weight, and protein, lipid, glucose and glycogen contents); this includes variation in both the magnitude and direction of microbiota-dependent effects. Genome-wide associations to identify the genetic basis of the microbiota-dependent variation reveal polymorphisms in largely non-overlapping sets of genes associated with variation in the nutritional traits, including strong representation of conserved genes functioning in signaling. Key genes identified by the GWA study are validated by loss-of-function mutations that altered microbiota-dependent nutritional effects. We conclude that the microbiota interacts with the animal at multiple points in the signaling and regulatory networks that determine animal nutrition. These interactions with the microbiota are likely conserved across animals, including humans. PMID:25692519

  18. Genome-Wide Analysis Reveals Novel Genes Influencing Temporal Lobe Structure with Relevance to Neurodegeneration in Alzheimer’s Disease

    PubMed Central

    Stein, Jason L.; Hua, Xue; Morra, Jonathan H.; Lee, Suh; Hibar, Derrek P.; Ho, April J.; Leow, Alex D.; Toga, Arthur W.; Sul, Jae Hoon; Kang, Hyun Min; Eskin, Eleazar; Saykin, Andrew J.; Shen, Li; Foroud, Tatiana; Pankratz, Nathan; Huentelman, Matthew J.; Craig, David W.; Gerber, Jill D.; Allen, April N.; Corneveaux, Jason J.; Stephan, Dietrich A.; Webster, Jennifer; DeChairo, Bryan M.; Potkin, Steven G.; Jack, Clifford R.; Weiner, Michael W.; Thompson, Paul M.

    2010-01-01

    In a genome-wide association study of structural brain degeneration, we mapped the 3D profile of temporal lobe volume differences in 742 brain MRI scans of Alzheimer’s disease patients, mildly impaired, and healthy elderly subjects. After searching 546,314 genomic markers, 2 single nucleotide polymorphisms (SNPs) were associated with bilateral temporal lobe volume (P < 5×10^-7). One SNP, rs10845840, is located in the GRIN2B gene which encodes the N-Methyl-D-Aspartate (NMDA) glutamate receptor NR2B subunit. This protein - involved in learning and memory, and excitotoxic cell death - has age-dependent prevalence in the synapse and is already a therapeutic target in Alzheimer’s disease. Risk alleles for lower temporal lobe volume at this SNP were significantly over-represented in AD and MCI subjects versus controls (odds ratio = 1.273; P = 0.039) and were associated with the mini-mental state exam (MMSE; t = −2.114; P = 0.035) demonstrating a negative effect on global cognitive function. Voxelwise maps of genetic association of this SNP with regional brain volumes, revealed intense temporal lobe effects (FDR correction at q = 0.05; critical P = 0.0257). This study uses large-scale brain mapping for gene discovery with implications for Alzheimer’s disease. PMID:20197096
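
    The "FDR correction at q = 0.05; critical P" reported above can be illustrated with a minimal sketch. It assumes the Benjamini-Hochberg step-up procedure, which the abstract does not name, and the function name and toy p-values are hypothetical rather than taken from the study.

```python
def bh_critical_p(p_values, q=0.05):
    """Largest p-value passing the Benjamini-Hochberg test, or None if none pass.

    Assumption: the abstract's "critical P" is read here as the BH cutoff;
    the study's actual FDR procedure is not stated in the record.
    """
    m = len(p_values)
    critical = None
    # Compare the k-th smallest p-value with q * k / m and keep the largest hit.
    for rank, p in enumerate(sorted(p_values), start=1):
        if p <= q * rank / m:
            critical = p
    return critical


# Hypothetical p-values, for illustration only.
toy_p_values = [0.001, 0.008, 0.039, 0.041, 0.042, 0.060, 0.074, 0.205, 0.212, 0.216]
critical_p = bh_critical_p(toy_p_values, q=0.05)
print(critical_p)
```

    Every test with a p-value at or below the returned cutoff would be declared significant at the chosen q.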

  19. Genome-Wide Scan Reveals Mutation Associated with Melanoma

    MedlinePlus

    ... 1999 Spotlight on Research 2012 July 2012 (historical) Genome-Wide Scan Reveals Mutation Associated with Melanoma A ... out to see if a technology called whole genome sequencing would help them find other genetic risk ...

  20. Genome-wide expression analysis reveals 100 adrenal gland-dependent circadian genes in the mouse liver.

    PubMed

    Oishi, Katsutaka; Amagai, Noriko; Shirai, Hidenori; Kadota, Koji; Ohkura, Naoki; Ishida, Norio

    2005-01-01

    Recent progress in genome-wide expression analysis has identified hundreds of circadian genes not only in the suprachiasmatic nucleus (the mammalian master clock) but also in peripheral tissues, such as heart, liver and kidney of mammals. Glucocorticoid is thought to be a circadian time cue for mammalian peripheral clocks. To identify the genes of which the circadian expression is regulated by endogenous glucocorticoids, we performed DNA microarray analysis using hepatic RNA from adrenalectomized (ADX) and sham-operated mice. We identified 169 genes that fluctuated between day and night in the livers of the sham-operated mice. Among these, 100 lost circadian rhythmicity in ADX mice. These included the genes for key enzymes of liver metabolic functions, such as glucokinase, HMG-CoA reductase and glucose-6-phosphatase. The circadian expression of Lpin1, FKBP51 and S-adenosyl methionine decarboxylase was also abolished in the ADX mice. On the other hand, although the circadian expression of clock or clock-related genes, such as mPer2, DBP, E4BP4, mDec1, Usp2 and Wee1 remained almost totally intact in the liver of ADX mice, it was extremely damped in homozygous Clock mutant mice. The present findings suggested that one type of hepatic circadian genes in mice is transcriptionally regulated by core components of the circadian clock, such as CLOCK and BMAL1, and that the other depends on the adrenal gland. PMID:16303750

  1. Genome-Wide Analysis in German Shepherd Dogs Reveals Association of a Locus on CFA 27 with Atopic Dermatitis

    PubMed Central

    Tengvall, Katarina; Kierczak, Marcin; Bergvall, Kerstin; Olsson, Mia; Frankowiack, Marcel; Farias, Fabiana H. G.; Pielberg, Gerli; Carlborg, Örjan; Leeb, Tosso; Andersson, Göran; Hammarström, Lennart; Hedhammar, Åke; Lindblad-Toh, Kerstin

    2013-01-01

    Humans and dogs are both affected by the allergic skin disease atopic dermatitis (AD), caused by an interaction between genetic and environmental factors. The German shepherd dog (GSD) is a high-risk breed for canine AD (CAD). In this study, we used a Swedish cohort of GSDs as a model for human AD. Serum IgA levels are known to be lower in GSDs compared to other breeds. We detected significantly lower IgA levels in the CAD cases compared to controls (p = 1.1×10^-5) in our study population. We also detected a separation within the GSD cohort, where dogs could be grouped into two different subpopulations. Disease prevalence differed significantly between the subpopulations contributing to population stratification (λ = 1.3), which was successfully corrected for using a mixed model approach. A genome-wide association analysis of CAD was performed (n_cases = 91, n_controls = 88). IgA levels were included in the model, due to the high correlation between CAD and low IgA levels. In addition, we detected a correlation between IgA levels and the age at the time of sampling (corr = 0.42, p = 3.0×10^-9), thus age was included in the model. A genome-wide significant association was detected on chromosome 27 (p_raw = 3.1×10^-7, p_genome = 0.03). The total associated region was defined as a ∼1.5-Mb-long haplotype including eight genes. Through targeted re-sequencing and additional genotyping of a subset of identified SNPs, we defined 11 smaller haplotype blocks within the associated region. Two blocks showed the strongest association to CAD. The ∼209-kb region, defined by the two blocks, harbors only the PKP2 gene, encoding Plakophilin 2 expressed in the desmosomes and important for skin structure. Our results may yield further insight into the genetics behind both canine and human AD. PMID:23671420
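
    The stratification measure λ quoted above is commonly computed as a genomic inflation factor: the median observed association statistic divided by its expected median under the null. The sketch below assumes 1-degree-of-freedom chi-square test statistics, which the abstract does not confirm; the function name and inputs are hypothetical.

```python
import statistics

# Median of a chi-square distribution with 1 degree of freedom (about 0.4549).
CHI2_1DF_MEDIAN = 0.454936


def genomic_inflation(chi2_stats):
    """Genomic inflation factor: median observed statistic over the null median.

    Assumption: each value is a 1-df chi-square association statistic,
    one per SNP. Values well above 1 suggest stratification or other inflation.
    """
    return statistics.median(chi2_stats) / CHI2_1DF_MEDIAN


# Hypothetical statistics whose median is 1.3 times the null median.
toy_stats = [0.1, 1.3 * CHI2_1DF_MEDIAN, 2.0]
print(round(genomic_inflation(toy_stats), 3))
```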

  2. Genome-wide Association Analysis of Psoriatic Arthritis and Cutaneous Psoriasis Reveals Differences in Their Genetic Architecture.

    PubMed

    Stuart, Philip E; Nair, Rajan P; Tsoi, Lam C; Tejasvi, Trilokraj; Das, Sayantan; Kang, Hyun Min; Ellinghaus, Eva; Chandran, Vinod; Callis-Duffin, Kristina; Ike, Robert; Li, Yanming; Wen, Xiaoquan; Enerbäck, Charlotta; Gudjonsson, Johann E; Kõks, Sulev; Kingo, Külli; Esko, Tõnu; Mrowietz, Ulrich; Reis, Andre; Wichmann, H Erich; Gieger, Christian; Hoffmann, Per; Nöthen, Markus M; Winkelmann, Juliane; Kunz, Manfred; Moreta, Elvia G; Mease, Philip J; Ritchlin, Christopher T; Bowcock, Anne M; Krueger, Gerald G; Lim, Henry W; Weidinger, Stephan; Weichenthal, Michael; Voorhees, John J; Rahman, Proton; Gregersen, Peter K; Franke, Andre; Gladman, Dafna D; Abecasis, Gonçalo R; Elder, James T

    2015-12-01

    Psoriasis vulgaris (PsV) is a common inflammatory and hyperproliferative skin disease. Up to 30% of people with PsV eventually develop psoriatic arthritis (PsA), an inflammatory musculoskeletal condition. To discern differences in genetic risk factors for PsA and cutaneous-only psoriasis (PsC), we carried out a genome-wide association study (GWAS) of 1,430 PsA case subjects and 1,417 unaffected control subjects. Meta-analysis of this study with three other GWASs and two targeted genotyping studies, encompassing a total of 9,293 PsV case subjects, 3,061 PsA case subjects, 3,110 PsC case subjects, and 13,670 unaffected control subjects of European descent, detected 10 regions associated with PsA and 11 with PsC at genome-wide (GW) significance. Several of these association signals (IFNLR1, IFIH1, NFKBIA for PsA; TNFRSF9, LCE3C/B, TRAF3IP2, IL23A, NFKBIA for PsC) have not previously achieved GW significance. After replication, we also identified a PsV-associated SNP near CDKAL1 (rs4712528, odds ratio [OR] = 1.16, p = 8.4 × 10^-11). Among identified psoriasis risk variants, three were more strongly associated with PsC than PsA (rs12189871 near HLA-C, p = 5.0 × 10^-19; rs4908742 near TNFRSF9, p = 0.00020; rs10888503 near LCE3A, p = 0.0014), and two were more strongly associated with PsA than PsC (rs12044149 near IL23R, p = 0.00018; rs9321623 near TNFAIP3, p = 0.00022). The PsA-specific variants were independent of previously identified psoriasis variants near IL23R and TNFAIP3. We also found multiple independent susceptibility variants in the IL12B, NOS2, and IFIH1 regions. These results provide insights into the pathogenetic similarities and differences between PsC and PsA. PMID:26626624

  3. Genome-wide Association Analysis of Psoriatic Arthritis and Cutaneous Psoriasis Reveals Differences in Their Genetic Architecture

    PubMed Central

    Stuart, Philip E.; Nair, Rajan P.; Tsoi, Lam C.; Tejasvi, Trilokraj; Das, Sayantan; Kang, Hyun Min; Ellinghaus, Eva; Chandran, Vinod; Callis-Duffin, Kristina; Ike, Robert; Li, Yanming; Wen, Xiaoquan; Enerbäck, Charlotta; Gudjonsson, Johann E.; Kõks, Sulev; Kingo, Külli; Esko, Tõnu; Mrowietz, Ulrich; Reis, Andre; Wichmann, H. Erich; Gieger, Christian; Hoffmann, Per; Nöthen, Markus M.; Winkelmann, Juliane; Kunz, Manfred; Moreta, Elvia G.; Mease, Philip J.; Ritchlin, Christopher T.; Bowcock, Anne M.; Krueger, Gerald G.; Lim, Henry W.; Weidinger, Stephan; Weichenthal, Michael; Voorhees, John J.; Rahman, Proton; Gregersen, Peter K.; Franke, Andre; Gladman, Dafna D.; Abecasis, Gonçalo R.; Elder, James T.

    2015-01-01

    Psoriasis vulgaris (PsV) is a common inflammatory and hyperproliferative skin disease. Up to 30% of people with PsV eventually develop psoriatic arthritis (PsA), an inflammatory musculoskeletal condition. To discern differences in genetic risk factors for PsA and cutaneous-only psoriasis (PsC), we carried out a genome-wide association study (GWAS) of 1,430 PsA case subjects and 1,417 unaffected control subjects. Meta-analysis of this study with three other GWASs and two targeted genotyping studies, encompassing a total of 9,293 PsV case subjects, 3,061 PsA case subjects, 3,110 PsC case subjects, and 13,670 unaffected control subjects of European descent, detected 10 regions associated with PsA and 11 with PsC at genome-wide (GW) significance. Several of these association signals (IFNLR1, IFIH1, NFKBIA for PsA; TNFRSF9, LCE3C/B, TRAF3IP2, IL23A, NFKBIA for PsC) have not previously achieved GW significance. After replication, we also identified a PsV-associated SNP near CDKAL1 (rs4712528, odds ratio [OR] = 1.16, p = 8.4 × 10^-11). Among identified psoriasis risk variants, three were more strongly associated with PsC than PsA (rs12189871 near HLA-C, p = 5.0 × 10^-19; rs4908742 near TNFRSF9, p = 0.00020; rs10888503 near LCE3A, p = 0.0014), and two were more strongly associated with PsA than PsC (rs12044149 near IL23R, p = 0.00018; rs9321623 near TNFAIP3, p = 0.00022). The PsA-specific variants were independent of previously identified psoriasis variants near IL23R and TNFAIP3. We also found multiple independent susceptibility variants in the IL12B, NOS2, and IFIH1 regions. These results provide insights into the pathogenetic similarities and differences between PsC and PsA. PMID:26626624

  4. Genome-Wide Analysis of Transposon and Retroviral Insertions Reveals Preferential Integrations in Regions of DNA Flexibility

    PubMed Central

    Vrljicak, Pavle; Tao, Shijie; Varshney, Gaurav K.; Quach, Helen Ngoc Bao; Joshi, Adita; LaFave, Matthew C.; Burgess, Shawn M.; Sampath, Karuna

    2016-01-01

    DNA transposons and retroviruses are important transgenic tools for genome engineering. An important consideration affecting the choice of transgenic vector is their insertion site preferences. Previous large-scale analyses of Ds transposon integration sites in plants were done on the basis of reporter gene expression or germ-line transmission, making it difficult to discern vertebrate integration preferences. Here, we compare over 1300 Ds transposon integration sites in zebrafish with Tol2 transposon and retroviral integration sites. Genome-wide analysis shows that Ds integration sites in the presence or absence of marker selection are remarkably similar and distributed throughout the genome. No strict motif was found, but a preference for structural features in the target DNA associated with DNA flexibility (Twist, Tilt, Rise, Roll, Shift, and Slide) was observed. Remarkably, this feature is also found in transposon and retroviral integrations in maize and mouse cells. Our findings show that structural features influence the integration of heterologous DNA in genomes, and have implications for targeted genome engineering. PMID:26818075

  5. Genome-Wide Analysis of Transposon and Retroviral Insertions Reveals Preferential Integrations in Regions of DNA Flexibility.

    PubMed

    Vrljicak, Pavle; Tao, Shijie; Varshney, Gaurav K; Quach, Helen Ngoc Bao; Joshi, Adita; LaFave, Matthew C; Burgess, Shawn M; Sampath, Karuna

    2016-01-01

    DNA transposons and retroviruses are important transgenic tools for genome engineering. An important consideration affecting the choice of transgenic vector is their insertion site preferences. Previous large-scale analyses of Ds transposon integration sites in plants were done on the basis of reporter gene expression or germ-line transmission, making it difficult to discern vertebrate integration preferences. Here, we compare over 1300 Ds transposon integration sites in zebrafish with Tol2 transposon and retroviral integration sites. Genome-wide analysis shows that Ds integration sites in the presence or absence of marker selection are remarkably similar and distributed throughout the genome. No strict motif was found, but a preference for structural features in the target DNA associated with DNA flexibility (Twist, Tilt, Rise, Roll, Shift, and Slide) was observed. Remarkably, this feature is also found in transposon and retroviral integrations in maize and mouse cells. Our findings show that structural features influence the integration of heterologous DNA in genomes, and have implications for targeted genome engineering. PMID:26818075

  6. Cross-Cancer Genome-Wide Analysis of Lung, Ovary, Breast, Prostate, and Colorectal Cancer Reveals Novel Pleiotropic Associations.

    PubMed

    Fehringer, Gordon; Kraft, Peter; Pharoah, Paul D; Eeles, Rosalind A; Chatterjee, Nilanjan; Schumacher, Fredrick R; Schildkraut, Joellen M; Lindström, Sara; Brennan, Paul; Bickeböller, Heike; Houlston, Richard S; Landi, Maria Teresa; Caporaso, Neil; Risch, Angela; Amin Al Olama, Ali; Berndt, Sonja I; Giovannucci, Edward L; Grönberg, Henrik; Kote-Jarai, Zsofia; Ma, Jing; Muir, Kenneth; Stampfer, Meir J; Stevens, Victoria L; Wiklund, Fredrik; Willett, Walter C; Goode, Ellen L; Permuth, Jennifer B; Risch, Harvey A; Reid, Brett M; Bezieau, Stephane; Brenner, Hermann; Chan, Andrew T; Chang-Claude, Jenny; Hudson, Thomas J; Kocarnik, Jonathan K; Newcomb, Polly A; Schoen, Robert E; Slattery, Martha L; White, Emily; Adank, Muriel A; Ahsan, Habibul; Aittomäki, Kristiina; Baglietto, Laura; Blomquist, Carl; Canzian, Federico; Czene, Kamila; Dos-Santos-Silva, Isabel; Eliassen, A Heather; Figueroa, Jonine D; Flesch-Janys, Dieter; Fletcher, Olivia; Garcia-Closas, Montserrat; Gaudet, Mia M; Johnson, Nichola; Hall, Per; Hazra, Aditi; Hein, Rebecca; Hofman, Albert; Hopper, John L; Irwanto, Astrid; Johansson, Mattias; Kaaks, Rudolf; Kibriya, Muhammad G; Lichtner, Peter; Liu, Jianjun; Lund, Eiliv; Makalic, Enes; Meindl, Alfons; Müller-Myhsok, Bertram; Muranen, Taru A; Nevanlinna, Heli; Peeters, Petra H; Peto, Julian; Prentice, Ross L; Rahman, Nazneen; Sanchez, Maria Jose; Schmidt, Daniel F; Schmutzler, Rita K; Southey, Melissa C; Tamimi, Rulla; Travis, Ruth C; Turnbull, Clare; Uitterlinden, Andre G; Wang, Zhaoming; Whittemore, Alice S; Yang, Xiaohong R; Zheng, Wei; Buchanan, Daniel D; Casey, Graham; Conti, David V; Edlund, Christopher K; Gallinger, Steven; Haile, Robert W; Jenkins, Mark; Le Marchand, Loïc; Li, Li; Lindor, Noralene M; Schmit, Stephanie L; Thibodeau, Stephen N; Woods, Michael O; Rafnar, Thorunn; Gudmundsson, Julius; Stacey, Simon N; Stefansson, Kari; Sulem, Patrick; Chen, Y Ann; Tyrer, Jonathan P; Christiani, David C; Wei, Yongyue; Shen, Hongbing; Hu, Zhibin; Shu, Xiao-Ou; Shiraishi, Kouya; Takahashi, Atsushi; Bossé, Yohan; Obeidat, Ma'en; Nickle, David; Timens, Wim; Freedman, Matthew L; Li, Qiyuan; Seminara, Daniela; Chanock, Stephen J; Gong, Jian; Peters, Ulrike; Gruber, Stephen B; Amos, Christopher I; Sellers, Thomas A; Easton, Douglas F; Hunter, David J; Haiman, Christopher A; Henderson, Brian E; Hung, Rayjean J

    2016-09-01

    Identifying genetic variants with pleiotropic associations can uncover common pathways influencing multiple cancers. We took a two-stage approach to conduct genome-wide association studies for lung, ovary, breast, prostate, and colorectal cancer from the GAME-ON/GECCO Network (61,851 cases, 61,820 controls) to identify pleiotropic loci. Findings were replicated in independent association studies (55,789 cases, 330,490 controls). We identified a novel pleiotropic association at 1q22 involving breast and lung squamous cell carcinoma, with eQTL analysis showing an association with ADAM15/THBS3 gene expression in lung. We also identified a known breast cancer locus CASP8/ALS2CR12 associated with prostate cancer, a known cancer locus at CDKN2B-AS1 with different variants associated with lung adenocarcinoma and prostate cancer, and confirmed the associations of a breast BRCA2 locus with lung and serous ovarian cancer. This is the largest study to date examining pleiotropy across multiple cancer-associated loci, identifying common mechanisms of cancer development and progression. Cancer Res; 76(17); 5103-14. ©2016 AACR. PMID:27197191

  7. Genome-wide analysis of glucocorticoid receptor binding regions in adipocytes reveal gene network involved in triglyceride homeostasis.

    PubMed

    Yu, Chi-Yi; Mayba, Oleg; Lee, Joyce V; Tran, Joanna; Harris, Charlie; Speed, Terence P; Wang, Jen-Chywan

    2010-01-01

    Glucocorticoids play important roles in the regulation of distinct aspects of adipocyte biology. Excess glucocorticoids in adipocytes are associated with metabolic disorders, including central obesity, insulin resistance and dyslipidemia. To understand the mechanisms underlying the glucocorticoid action in adipocytes, we used chromatin immunoprecipitation sequencing to isolate genome-wide glucocorticoid receptor (GR) binding regions (GBRs) in 3T3-L1 adipocytes. Furthermore, gene expression analyses were used to identify genes that were regulated by glucocorticoids. Overall, 274 glucocorticoid-regulated genes contain or locate nearby GBR. We found that many GBRs were located in or nearby genes involved in triglyceride (TG) synthesis (Scd-1, 2, 3, GPAT3, GPAT4, Agpat2, Lpin1), lipolysis (Lipe, Mgll), lipid transport (Cd36, Lrp-1, Vldlr, Slc27a2) and storage (S3-12). Gene expression analysis showed that except for Scd-3, the other 13 genes were induced in mouse inguinal fat upon 4-day glucocorticoid treatment. Reporter gene assays showed that except Agpat2, the other 12 glucocorticoid-regulated genes contain at least one GBR that can mediate hormone response. In agreement with the fact that glucocorticoids activated genes in both TG biosynthetic and lipolytic pathways, we confirmed that 4-day glucocorticoid treatment increased TG synthesis and lipolysis concomitantly in inguinal fat. Notably, we found that 9 of these 12 genes were induced in transgenic mice that have constant elevated plasma glucocorticoid levels. These results suggested that a similar mechanism was used to regulate TG homeostasis during chronic glucocorticoid treatment. In summary, our studies have identified molecular components in a glucocorticoid-controlled gene network involved in the regulation of TG homeostasis in adipocytes. Understanding the regulation of this gene network should provide important insight for future therapeutic developments for metabolic diseases. PMID:21187916

  8. Genome-wide analysis reveals artificial selection on coat colour and reproductive traits in Chinese domestic pigs.

    PubMed

    Wang, Chao; Wang, Hongyang; Zhang, Yu; Tang, Zhonglin; Li, Kui; Liu, Bang

    2015-03-01

    Pigs from Asia and Europe were independently domesticated from c. 9000 years ago. During this period, strong artificial selection has led to dramatic phenotypic changes in domestic pigs. However, the genetic basis underlying these morphological and behavioural adaptations is relatively unknown, particularly for indigenous Chinese pigs. Here, we performed a genome-wide analysis to screen 196 regions with selective sweep signals in Tongcheng pigs, which are a typical indigenous Chinese breed. Genes located in these regions have been found to be involved in lipid metabolism, melanocyte differentiation, neural development and other biological processes, which coincide with the evolutionary phenotypic changes in this breed. A synonymous substitution, c.669T>C, in ESR1, which colocalizes with a major quantitative trait locus for litter size, shows extreme differences in allele frequency between Tongcheng pigs and wild boars. Notably, the variant C allele in this locus exhibits high allele frequency in most Chinese populations, suggesting a consequence of positive selection. Five genes (PRM1, PRM2, TNP2, GPR149 and JMJD1C) related to reproductive traits were found to have high haplotype similarity in Chinese breeds. Two selected genes, MITF and EDNRB, are implied to shape the two-end black colour trait in Tongcheng pig. Subsequent SNP microarray studies of five Chinese white-spotted breeds displayed a concordant signature at both loci, suggesting that these two genes are responsible for colour variations in Chinese breeds. Utilizing massively parallel sequencing, we characterized the candidate sites that adapt to artificial and environmental selections during the Chinese pig domestication. This study provides fundamental proof for further research on the evolutionary adaptation of Chinese pigs. PMID:25132237

  9. Genome wide analysis reveals single nucleotide polymorphisms associated with fatness and putative novel copy number variants in three pig breeds

    PubMed Central

    2013-01-01

    Background: Obesity, excess fat tissue in the body, can underlie a variety of medical complaints including heart disease, stroke and cancer. The pig is an excellent model organism for the study of various human disorders, including obesity, as well as being the foremost agricultural species. In order to identify genetic variants associated with fatness, we used a selective genomic approach sampling DNA from animals at the extreme ends of the fat and lean spectrum using estimated breeding values derived from a total population size of over 70,000 animals. DNA from 3 breeds (Sire Line Large White, Duroc and a white Pietrain composite line (Titan)) was used to interrogate the Illumina Porcine SNP60 Genotyping Beadchip in order to identify significant associations in terms of single nucleotide polymorphisms (SNPs) and copy number variants (CNVs). Results: By sampling animals at each end of the fat/lean EBV (estimate breeding value) spectrum the whole population could be assessed using less than 300 animals, without losing statistical power. Indeed, several significant SNPs (at the 5% genome wide significance level) were discovered, 4 of these linked to genes with ontologies that had previously been correlated with fatness (NTS, FABP6, SST and NR3C2). Quantitative analysis of the data identified putative CNV regions containing genes whose ontology suggested fatness related functions (MCHR1, PPARα, SLC5A1 and SLC5A4). Conclusions: Selective genotyping of EBVs at either end of the phenotypic spectrum proved to be a cost effective means of identifying SNPs and CNVs associated with fatness and with estimated major effects in a large population of animals. PMID:24225222

  10. Genome-wide single nucleotide polymorphism array analysis reveals recurrent genomic alterations associated with histopathologic features in intrahepatic cholangiocarcinoma

    PubMed Central

    Huang, Wan-Ting; Weng, Shao-Wen; Wei, Yu-Ching; You, Huey-Ling; Wang, Jui-Tzu; Eng, Hock-Liew

    2014-01-01

    Recent studies indicate that genomic alterations (GAs) are associated with many human malignancies. Genome-wide analysis of GAs involved in intrahepatic cholangiocarcinoma (ICC) and association with histopathologic features are limited. To help characterize this relatively rare neoplasm, we collected 32 frozen tissue samples of ICC to study GAs and molecular karyotypes by using single-nucleotide polymorphism array. Recurrent GAs occurring in at least 40% of the patients were further correlated with histopathologic features. Gain of 1q21.3-q23.1 and losses of 1p36.33-p35.3 and 3p26.3-p13 were significantly associated with larger tumor size more than 5 cm in diameter; and loss of 4q13.2-q35.2 with tumor multiplicity. Moreover, losses of 1p36.32-p35.3, 3p26.3-p22.2, 4q13.1-q21.23, 4q31.3-q34.3 and 4q34.3-35.2 were inclined to be associated with high histological grade. As to tumor vascular invasion, gain of 1q21.3-q23.1 and losses of 3p22.1-p12.3 and 4q13.2-q35.2 were significantly associated with tumor vascular invasion. Some regions were concurrently associated with multiple histopathologic characteristics, including loss of 4q13.2-q35.2 associated with larger tumor size, high histological grade and vascular invasion; losses of 1p36.33-p35.3 and 3p26.3-p22.2 with larger tumor size and high histological grade; and gain of 1q21.3-q23.1 with larger tumor size and vascular invasion. Our study indicates that complex chromosomal instability is characteristic of ICC. Detecting crucial GAs will enable risk stratification and development of personalized therapies. PMID:25400767

  11. Integrated Expression Profiling and Genome-Wide Analysis of ChREBP Targets Reveals the Dual Role for ChREBP in Glucose-Regulated Gene Expression

    PubMed Central

    Lee, Yong Seok; Kim, Ha-Jung; Han, Jung-Youn; Im, Seung-Soon; Chong, Hansook Kim; Kwon, Je-Keun; Cho, Yun-Ho; Kim, Woo Kyung; Osborne, Timothy F.; Horton, Jay D.; Jun, Hee-Sook; Ahn, Yong-Ho; Ahn, Sung-Min; Cha, Ji-Young

    2011-01-01

    The carbohydrate response element binding protein (ChREBP), a basic helix-loop-helix/leucine zipper transcription factor, plays a critical role in the control of lipogenesis in the liver. To identify the direct targets of ChREBP on a genome-wide scale and provide more insight into the mechanism by which ChREBP regulates glucose-responsive gene expression, we performed chromatin immunoprecipitation-sequencing and gene expression analysis. We identified 1153 ChREBP binding sites and 783 target genes using the chromatin from HepG2, a human hepatocellular carcinoma cell line. A motif search revealed a refined consensus sequence (CABGTG-nnCnG-nGnSTG) to better represent critical elements of a functional ChREBP binding sequence. Gene ontology analysis shows that ChREBP target genes are particularly associated with lipid, fatty acid and steroid metabolism. In addition, other functional gene clusters related to transport, development and cell motility are significantly enriched. Gene set enrichment analysis reveals that ChREBP target genes are highly correlated with genes regulated by high glucose, providing a functional relevance to the genome-wide binding study. Furthermore, we have demonstrated that ChREBP may function as a transcriptional repressor as well as an activator. PMID:21811631
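
    The refined consensus sequence above is written with IUPAC nucleotide codes (B = C, G or T; S = C or G; n = any base). As an illustrative sketch only, not the authors' motif-search method, it can be turned into a regular expression for scanning a sequence. Treating the hyphens as visual separators is an assumption, and the function name and test sequence are hypothetical.

```python
import re

# IUPAC codes used by the consensus; other ambiguity codes are omitted here.
IUPAC_TO_REGEX = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "B": "[CGT]",   # any base except A
    "S": "[CG]",    # strong pairs
    "N": "[ACGT]",  # any base
}


def consensus_to_regex(consensus):
    """Compile an IUPAC consensus string into a regex (forward strand only).

    Assumption: hyphens only separate motif segments and match nothing.
    """
    parts = [IUPAC_TO_REGEX[ch] for ch in consensus.upper() if ch != "-"]
    return re.compile("".join(parts))


chrebp_pattern = consensus_to_regex("CABGTG-nnCnG-nGnSTG")

# Hypothetical 17-bp sequence that satisfies every position of the consensus.
print(bool(chrebp_pattern.search("CACGTGAACTGAGACTG")))
```

    A practical scan would also check the reverse complement, since this sketch matches the forward strand only.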

  12. Genome-Wide DNA Methylation Analysis in Melanoma Reveals the Importance of CpG Methylation in MITF Regulation.

    PubMed

    Lauss, Martin; Haq, Rizwan; Cirenajwis, Helena; Phung, Bengt; Harbst, Katja; Staaf, Johan; Rosengren, Frida; Holm, Karolina; Aine, Mattias; Jirström, Karin; Borg, Åke; Busch, Christian; Geisler, Jürgen; Lønning, Per E; Ringnér, Markus; Howlin, Jillian; Fisher, David E; Jönsson, Göran

    2015-07-01

    The microphthalmia-associated transcription factor (MITF) is a key regulator of melanocyte development and a lineage-specific oncogene in melanoma; a highly lethal cancer known for its unpredictable clinical course. MITF is regulated by multiple intracellular signaling pathways, although the exact mechanisms that determine MITF expression and activity remain incompletely understood. In this study, we obtained genome-wide DNA methylation profiles from 50 stage IV melanomas, normal melanocytes, keratinocytes, and dermal fibroblasts and utilized The Cancer Genome Atlas data for experimental validation. By integrating DNA methylation and gene expression data, we found that hypermethylation of MITF and its co-regulated differentiation pathway genes corresponded to decreased gene expression levels. In cell lines with a hypermethylated MITF-pathway, overexpression of MITF did not alter the expression level or methylation status of the MITF pathway genes. In contrast, however, demethylation treatment of these cell lines induced MITF-pathway activity, confirming that gene regulation was controlled via methylation. The discovery that the activity of the master regulator of pigmentation, MITF, and its downstream targets may be regulated by hypermethylation has significant implications for understanding the development and evolvement of melanoma. PMID:25705847

  13. Genome-wide analysis of tandem repeats in Tribolium castaneum genome reveals abundant and highly dynamic tandem repeat families with satellite DNA features in euchromatic chromosomal arms

    PubMed Central

    Pavlek, Martina; Gelfand, Yevgeniy; Plohl, Miroslav; Meštrović, Nevenka

    2015-01-01

    Although satellite DNAs are well-explored components of heterochromatin and centromeres, little is known about emergence, dispersal and possible impact of comparably structured tandem repeats (TRs) on the genome-wide scale. Our bioinformatics analysis of assembled Tribolium castaneum genome disclosed significant contribution of TRs in euchromatic chromosomal arms and clear predominance of satellite DNA-typical 170 bp monomers in arrays of ≥5 repeats. By applying different experimental approaches, we revealed that the nine most prominent TR families Cast1–Cast9 extracted from the assembly comprise ∼4.3% of the entire genome and reside almost exclusively in euchromatic regions. Among them, seven families that build ∼3.9% of the genome are based on ∼170 and ∼340 bp long monomers. Results of phylogenetic analyses of 2500 monomers originating from these families show high-sequence dynamics, evident by extensive exchanges between arrays on non-homologous chromosomes. In addition, our analysis shows that concerted evolution acts more efficiently on longer than on shorter arrays. Efficient genome-wide distribution of nine TR families implies the role of transposition only in expansion of the most dispersed family, and involvement of other mechanisms is anticipated. Despite similarities in sequence features, FISH experiments indicate high-level compartmentalization of centromeric and euchromatic tandem repeats. PMID:26428853

  14. Genome wide transcriptome analysis reveals ABA mediated response in Arabidopsis during gold (AuCl4−) treatment.

    PubMed

    Shukla, Devesh; Krishnamurthy, Sneha; Sahi, Shivendra V

    2014-01-01

    The unique physico-chemical properties of gold nanoparticles (AuNPs) find manifold applications in diagnostics, medicine and catalysis. Chemical synthesis produces reactive AuNPs and generates hazardous by-products. Alternatively, plants can be utilized to produce AuNPs in an eco-friendly manner. To better control the biosynthesis of AuNPs, we need to first understand the detailed molecular response induced by AuCl4−. In this study, we carried out global transcriptome analysis in root tissue of Arabidopsis grown for 12 h in presence of gold solution (HAuCl4) using the novel unbiased Affymetrix exon array. Transcriptomics analysis revealed differential regulation of a total of 704 genes and 4900 exons. Of these, 492 and 212 genes were up- and downregulated, respectively. The validation of the expressed key genes, such as glutathione-S-transferases, auxin responsive genes, cytochrome P450 82C2, methyl transferases, transducin (G protein beta subunit), ERF transcription factor, ABC, and MATE transporters, was carried out through quantitative RT-PCR. These key genes demonstrated specific induction under AuCl4− treatment relative to other heavy metals, suggesting a unique plant-gold interaction. GO enrichment analysis reveals the upregulation of processes like oxidative stress, glutathione binding, metal binding, transport, and plant hormonal responses. Changes predicted in biochemical pathways indicated major modulation in glutathione mediated detoxification, flavones and derivatives, and plant hormone biosynthesis. Motif search analysis identified a highly significant enriched motif, ACGT, which is an abscisic acid responsive core element (ABRE), suggesting the possibility of ABA-mediated signaling. Identification of abscisic acid response element (ABRE) points to the operation of a predominant signaling mechanism in response to AuCl4− exposure. Overall, this study presents a useful picture of plant-gold interaction with an identification of candidate genes

  15. Genome wide transcriptome analysis reveals ABA mediated response in Arabidopsis during gold (AuCl4−) treatment

    PubMed Central

    Shukla, Devesh; Krishnamurthy, Sneha; Sahi, Shivendra V.

    2014-01-01

    The unique physico-chemical properties of gold nanoparticles (AuNPs) find manifold applications in diagnostics, medicine and catalysis. Chemical synthesis produces reactive AuNPs and generates hazardous by-products. Alternatively, plants can be utilized to produce AuNPs in an eco-friendly manner. To better control the biosynthesis of AuNPs, we need to first understand the detailed molecular response induced by AuCl4−. In this study, we carried out global transcriptome analysis in root tissue of Arabidopsis grown for 12 h in the presence of gold solution (HAuCl4) using the novel unbiased Affymetrix exon array. Transcriptomics analysis revealed differential regulation of a total of 704 genes and 4900 exons. Of these, 492 and 212 genes were up- and downregulated, respectively. The validation of the expressed key genes, such as glutathione-S-transferases, auxin responsive genes, cytochrome P450 82C2, methyl transferases, transducin (G protein beta subunit), ERF transcription factor, ABC, and MATE transporters, was carried out through quantitative RT-PCR. These key genes demonstrated specific induction under AuCl4− treatment relative to other heavy metals, suggesting a unique plant-gold interaction. GO enrichment analysis reveals the upregulation of processes like oxidative stress, glutathione binding, metal binding, transport, and plant hormonal responses. Changes predicted in biochemical pathways indicated major modulation in glutathione mediated detoxification, flavones and derivatives, and plant hormone biosynthesis. Motif search analysis identified a highly significant enriched motif, ACGT, which is an abscisic acid responsive core element (ABRE), suggesting the possibility of ABA-mediated signaling. Identification of abscisic acid response element (ABRE) points to the operation of a predominant signaling mechanism in response to AuCl4− exposure. Overall, this study presents a useful picture of plant-gold interaction with an identification of candidate genes

  16. Genome-wide analysis of SREBP1 activity around the clock reveals its combined dependency on nutrient and circadian signals.

    PubMed

    Gilardi, Federica; Migliavacca, Eugenia; Naldi, Aurélien; Baruchet, Michaël; Canella, Donatella; Le Martelot, Gwendal; Guex, Nicolas; Desvergne, Béatrice

    2014-03-01

    In mammals, the circadian clock allows the organism to anticipate and adapt physiology around the 24-hour cycle. Conversely, metabolism and food consumption regulate the internal clock, pointing to the existence of an intricate relationship between nutrient state and circadian homeostasis that is far from being understood. The Sterol Regulatory Element Binding Protein 1 (SREBP1) is a key regulator of lipid homeostasis. Hepatic SREBP1 function is influenced by the nutrient-response cycle, but also by the circadian machinery. To systematically understand how the interplay of circadian clock and nutrient-driven rhythm regulates SREBP1 activity, we evaluated the genome-wide binding of SREBP1 to its targets throughout the day in C57BL/6 mice. The recruitment of SREBP1 to the DNA showed a highly circadian behaviour, with a maximum during the fed status. However, the temporal expression of SREBP1 targets was not always synchronized with its binding pattern. In particular, different expression phases were observed for SREBP1 target genes depending on their function, suggesting the involvement of other transcription factors in their regulation. Binding sites for Hepatocyte Nuclear Factor 4 (HNF4) were specifically enriched in close proximity to SREBP1 peaks of genes whose expression was shifted by about 8 hours with respect to SREBP1 binding. Thus, the cross-talk between hepatic HNF4 and SREBP1 may underlie the expression timing of this subgroup of SREBP1 targets. Interestingly, the proper temporal expression profile of these genes was dramatically changed in Bmal1-/- mice upon time-restricted feeding, for which a rhythmic, but slightly delayed, binding of SREBP1 was maintained. Collectively, our results show that besides the nutrient-driven regulation of SREBP1 nuclear translocation, a second layer of modulation of SREBP1 transcriptional activity, strongly dependent on the circadian clock, exists. This system allows us to fine tune the expression timing of SREBP1 target genes, thus

  17. Roles of Distal and Genic Methylation in the Development of Prostate Tumorigenesis Revealed by Genome-wide DNA Methylation Analysis

    PubMed Central

    Wang, Yao; Jadhav, Rohit Ramakant; Liu, Joseph; Wilson, Desiree; Chen, Yidong; Thompson, Ian M.; Troyer, Dean A.; Hernandez, Javier; Shi, Huidong; Leach, Robin J.; Huang, Tim H.-M.; Jin, Victor X.

    2016-01-01

    Aberrant DNA methylation at promoters is often linked to tumorigenesis. But many aspects of DNA methylation remain unexplored, including the individual roles of distal and gene body methylation, as well as their collaborative roles with promoter methylation. Here we performed an MBD-seq analysis on prostate specimens classified into low, high, and very high risk groups based on Gleason score and TNM stages. We identified gene sets with differential methylation regions (DMRs) in Distal, TSS, gene body and TES. To understand the collaborative roles, TSS was compared with the other three DMRs, resulting in 12 groups of genes with collaborative differential methylation patterns (CDMPs). We found several groups of genes that show opposite methylation patterns in Distal and Genic regions compared to the TSS region, and in general they are differentially expressed genes (DEGs) in tumors in TCGA RNA-seq data. IPA (Ingenuity Pathway Analysis) reveals the AR/TP53 signaling network to be a major signaling pathway, and survival analysis indicates gene subsets significantly associated with prostate cancer recurrence. Our results suggest that DNA methylation in Distal and Genic regions also plays critical roles in contributing to prostate tumorigenesis, and may act either positively or negatively with TSSs to alter gene regulation in tumors. PMID:26924343

  18. Genome-Wide Analysis of Arabidopsis Pentatricopeptide Repeat Proteins Reveals Their Essential Role in Organelle Biogenesis

    PubMed Central

    Lurin, Claire; Andrés, Charles; Aubourg, Sébastien; Bellaoui, Mohammed; Bitton, Frédérique; Bruyère, Clémence; Caboche, Michel; Debast, Cédrig; Gualberto, José; Hoffmann, Beate; Lecharny, Alain; Le Ret, Monique; Martin-Magniette, Marie-Laure; Mireau, Hakim; Peeters, Nemo; Renou, Jean-Pierre; Szurek, Boris; Taconnat, Ludivine; Small, Ian

    2004-01-01

    The complete sequence of the Arabidopsis thaliana genome revealed thousands of previously unsuspected genes, many of which cannot be ascribed even putative functions. One of the largest and most enigmatic gene families discovered in this way is characterized by tandem arrays of pentatricopeptide repeats (PPRs). We describe a detailed bioinformatic analysis of 441 members of the Arabidopsis PPR family plus genomic and genetic data on the expression (microarray data), localization (green fluorescent protein and red fluorescent protein fusions), and general function (insertion mutants and RNA binding assays) of many family members. The basic picture that arises from these studies is that PPR proteins play constitutive, often essential roles in mitochondria and chloroplasts, probably via binding to organellar transcripts. These results confirm, but massively extend, the very sparse observations previously obtained from detailed characterization of individual mutants in other organisms. PMID:15269332

  19. Genome-wide meta-analysis of maize heterosis reveals the potential role of additive gene expression at pericentromeric loci

    PubMed Central

    2014-01-01

    Background: The identification of QTL involved in heterosis formation is one approach to unravel the not yet fully understood genetic basis of heterosis, the improved agronomic performance of hybrid F1 plants compared to their inbred parents. The identification of candidate genes underlying a QTL is important both for developing markers and determining the molecular genetic basis of a trait, but remains difficult owing to the large number of genes often contained within individual QTL. To address this problem in heterosis analysis, we applied a meta-analysis strategy for grain yield (GY) of Zea mays L. as an example, incorporating QTL-, hybrid field-, and parental gene expression data. Results: For the identification of genes underlying known heterotic QTL, we made use of tight associations between gene expression pattern and the trait of interest, identified by correlation analyses. Using this approach, genes strongly associated with heterosis for GY were discovered to be clustered in pericentromeric regions of the complex maize genome. This suggests that expression differences of sequences in recombination-suppressed regions are important in the establishment of heterosis for GY in F1 hybrids and also in the conservation of heterosis for GY across genotypes. Importantly, functional analysis of heterosis-associated genes from these genomic regions revealed over-representation of a number of functional classes, identifying key processes contributing to heterosis for GY. Based on the finding that the majority of the analyzed heterosis-associated genes were additively expressed, we propose a model referring to the influence of cis-regulatory variation on heterosis for GY by the compensation of fixed detrimental expression levels in parents. Conclusions: The study highlights the utility of a meta-analysis approach that integrates phenotypic and multi-level molecular data to unravel complex traits in plants. It provides prospects for the identification of genes relevant for

  20. Genome wide analysis of acute myeloid leukemia reveal leukemia specific methylome and subtype specific hypomethylation of repeats.

    PubMed

    Saied, Marwa H; Marzec, Jacek; Khalid, Sabah; Smith, Paul; Down, Thomas A; Rakyan, Vardhman K; Molloy, Gael; Raghavan, Manoj; Debernardi, Silvana; Young, Bryan D

    2012-01-01

    Methylated DNA immunoprecipitation followed by high-throughput sequencing (MeDIP-seq) has the potential to identify changes in DNA methylation important in cancer development. In order to understand the role of epigenetic modulation in the development of acute myeloid leukemia (AML) we have applied MeDIP-seq to the DNA of 12 AML patients and 4 normal bone marrows. This analysis revealed leukemia-associated differentially methylated regions that included gene promoters, gene bodies, CpG islands and CpG island shores. Two genes (SPHKAP and DPP6) with significantly methylated promoters were of interest and further analysis of their expression showed them to be repressed in AML. We also demonstrated considerable cytogenetic subtype specificity in the methylomes affecting different genomic features. Significantly distinct patterns of hypomethylation of certain interspersed repeat elements were associated with cytogenetic subtypes. The methylation patterns of members of the SINE family tightly clustered all leukemic patients with an enrichment of Alu repeats with a high CpG density (P<0.0001). We were able to demonstrate significant inverse correlation between intragenic interspersed repeat sequence methylation and gene expression with SINEs showing the strongest inverse correlation (R² = 0.7). We conclude that the alterations in DNA methylation that accompany the development of AML affect not only the promoters, but also the non-promoter genomic features, with significant demethylation of certain interspersed repeat DNA elements being associated with AML cytogenetic subtypes. MeDIP-seq data were validated using bisulfite pyrosequencing and the Infinium array. PMID:22479372

  1. Genome-wide allelic methylation analysis reveals disease-specific susceptibility to multiple methylation defects in imprinting syndromes.

    PubMed

    Court, Franck; Martin-Trujillo, Alex; Romanelli, Valeria; Garin, Intza; Iglesias-Platas, Isabel; Salafsky, Ira; Guitart, Miriam; Perez de Nanclares, Guiomar; Lapunzina, Pablo; Monk, David

    2013-04-01

    Genomic imprinting is the parent-of-origin-specific allelic transcriptional silencing observed in mammals, which is governed by DNA methylation established in the gametes and maintained throughout the development. The frequency and extent of epimutations associated with the nine reported imprinting syndromes varies because it is evident that aberrant preimplantation maintenance of imprinted differentially methylated regions (DMRs) may affect multiple loci. Using a custom Illumina GoldenGate array targeting 27 imprinted DMRs, we profiled allelic methylation in 65 imprinting defect patients. We identify multilocus hypomethylation in numerous Beckwith-Wiedemann syndrome, transient neonatal diabetes mellitus (TNDM), and pseudohypoparathyroidism 1B patients, and an individual with Silver-Russell syndrome. Our data reveal a broad range of epimutations exist in certain imprinting syndromes, with the exception of Prader-Willi syndrome and Angelman syndrome patients that are associated with solitary SNRPN-DMR defects. A mutation analysis identified a 1 bp deletion in the ZFP57 gene in a TNDM patient with methylation defects at multiple maternal DMRs. In addition, we observe missense variants in ZFP57, NLRP2, and NLRP7 that are not consistent with maternal effect and aberrant establishment or methylation maintenance, and are likely benign. This work illustrates that further extensive molecular characterization of these rare patients is required to fully understand the mechanism underlying the etiology of imprint establishment and maintenance. PMID:23335487

  2. A Genome-Wide Analysis Reveals Stress and Hormone Responsive Patterns of TIFY Family Genes in Brassica rapa

    PubMed Central

    Saha, Gopal; Park, Jong-In; Kayum, Md. Abdul; Nou, Ill-Sup

    2016-01-01

    The TIFY family is a plant-specific group of proteins with a diversity of functions and includes four subfamilies, viz. ZML, TIFY, PPD, and JASMONATE ZIM-domain (JAZ) proteins. TIFY family members, particularly JAZ subfamily proteins, play roles in biological processes such as development and stress and hormone responses in Arabidopsis, rice, chickpea, and grape. However, there is no information about this family in any Brassica crop. This study identifies 36 TIFY genes in Brassica rapa, an economically important crop species in the Brassicaceae. An extensive in silico analysis of phylogenetic grouping, protein motif organization and intron-exon distribution confirmed that there are four subfamilies of BrTIFY proteins. Out of 36 BrTIFY genes, we identified 21 in the JAZ subfamily, seven in the TIFY subfamily, six in ZML and two in PPD. Extensive expression profiling of 21 BrTIFY JAZs in various tissues, especially in floral organs and at different flower growth stages revealed constitutive expression patterns, which suggest that BrTIFY JAZ genes are important during growth and development of B. rapa flowers. A protein interaction network analysis also pointed to association of these proteins with fertility and defense processes of B. rapa. Using a low temperature-treated whole-genome microarray data set, most of the JAZ genes were found to have variable transcript abundance between the contrasting inbred lines Chiifu and Kenshin of B. rapa. Subsequently, the expression of all 21 BrTIFY JAZs in response to cold stress was characterized in the same two lines via qPCR, demonstrating that nine genes were up-regulated. Importantly, the BrTIFY JAZs showed strong and differential expression upon JA treatment, pointing to their probable involvement in JA-mediated growth regulatory functions, especially during flower development and stress responses. Additionally, BrTIFY JAZs were induced in response to salt, drought, Fusarium, ABA, and SA treatments, and six genes (BrTIFY3

  3. A Genome-Wide Analysis Reveals Stress and Hormone Responsive Patterns of TIFY Family Genes in Brassica rapa.

    PubMed

    Saha, Gopal; Park, Jong-In; Kayum, Md Abdul; Nou, Ill-Sup

    2016-01-01

    The TIFY family is a plant-specific group of proteins with a diversity of functions and includes four subfamilies, viz. ZML, TIFY, PPD, and JASMONATE ZIM-domain (JAZ) proteins. TIFY family members, particularly JAZ subfamily proteins, play roles in biological processes such as development and stress and hormone responses in Arabidopsis, rice, chickpea, and grape. However, there is no information about this family in any Brassica crop. This study identifies 36 TIFY genes in Brassica rapa, an economically important crop species in the Brassicaceae. An extensive in silico analysis of phylogenetic grouping, protein motif organization and intron-exon distribution confirmed that there are four subfamilies of BrTIFY proteins. Out of 36 BrTIFY genes, we identified 21 in the JAZ subfamily, seven in the TIFY subfamily, six in ZML and two in PPD. Extensive expression profiling of 21 BrTIFY JAZs in various tissues, especially in floral organs and at different flower growth stages revealed constitutive expression patterns, which suggest that BrTIFY JAZ genes are important during growth and development of B. rapa flowers. A protein interaction network analysis also pointed to association of these proteins with fertility and defense processes of B. rapa. Using a low temperature-treated whole-genome microarray data set, most of the JAZ genes were found to have variable transcript abundance between the contrasting inbred lines Chiifu and Kenshin of B. rapa. Subsequently, the expression of all 21 BrTIFY JAZs in response to cold stress was characterized in the same two lines via qPCR, demonstrating that nine genes were up-regulated. Importantly, the BrTIFY JAZs showed strong and differential expression upon JA treatment, pointing to their probable involvement in JA-mediated growth regulatory functions, especially during flower development and stress responses. Additionally, BrTIFY JAZs were induced in response to salt, drought, Fusarium, ABA, and SA treatments, and six genes (BrTIFY3

  4. Genome-wide analysis correlates Ayurveda Prakriti

    PubMed Central

    Govindaraj, Periyasamy; Nizamuddin, Sheikh; Sharath, Anugula; Jyothi, Vuskamalla; Rotti, Harish; Raval, Ritu; Nayak, Jayakrishna; Bhat, Balakrishna K.; Prasanna, B. V.; Shintre, Pooja; Sule, Mayura; Joshi, Kalpana S.; Dedge, Amrish P.; Bharadwaj, Ramachandra; Gangadharan, G. G.; Nair, Sreekumaran; Gopinath, Puthiya M.; Patwardhan, Bhushan; Kondaiah, Paturu; Satyamoorthy, Kapaettu; Valiathan, Marthanda Varma Sankaran; Thangaraj, Kumarasamy

    2015-01-01

    The practice of Ayurveda, the traditional medicine of India, is based on the concept of three major constitutional types (Vata, Pitta and Kapha) defined as “Prakriti”. To the best of our knowledge, no study has convincingly correlated genomic variations with the classification of Prakriti. In the present study, we performed genome-wide SNP (single nucleotide polymorphism) analysis (Affymetrix, 6.0) of 262 well-classified male individuals (after screening 3416 subjects) belonging to three Prakritis. We found 52 SNPs (p ≤ 1 × 10⁻⁵) were significantly different between Prakritis, without any confounding effect of stratification, after 10⁶ permutations. Principal component analysis (PCA) of these SNPs classified 262 individuals into their respective groups (Vata, Pitta and Kapha) irrespective of their ancestry, which represent its power in categorization. We further validated our finding with 297 Indian population samples with known ancestry. Subsequently, we found that PGM1 correlates with phenotype of Pitta as described in the ancient text of Caraka Samhita, suggesting that the phenotypic classification of India’s traditional medicine has a genetic basis; and its Prakriti-based practice in vogue for many centuries resonates with personalized medicine. PMID:26511157

  5. Design and bioinformatics analysis of genome-wide CLIP experiments

    PubMed Central

    Wang, Tao; Xiao, Guanghua; Chu, Yongjun; Zhang, Michael Q.; Corey, David R.; Xie, Yang

    2015-01-01

    The past decades have witnessed a surge of discoveries revealing RNA regulation as a central player in cellular processes. RNAs are regulated by RNA-binding proteins (RBPs) at all post-transcriptional stages, including splicing, transportation, stabilization and translation. Defects in the functions of these RBPs underlie a broad spectrum of human pathologies. Systematic identification of RBP functional targets is among the key biomedical research questions and provides a new direction for drug discovery. The advent of cross-linking immunoprecipitation coupled with high-throughput sequencing (genome-wide CLIP) technology has recently enabled the investigation of genome-wide RBP–RNA binding at single base-pair resolution. This technology has evolved through the development of three distinct versions: HITS-CLIP, PAR-CLIP and iCLIP. Meanwhile, numerous bioinformatics pipelines for handling the genome-wide CLIP data have also been developed. In this review, we discuss the genome-wide CLIP technology and focus on bioinformatics analysis. Specifically, we compare the strengths and weaknesses, as well as the scopes, of various bioinformatics tools. To assist readers in choosing optimal procedures for their analysis, we also review experimental design and procedures that affect bioinformatics analyses. PMID:25958398

  6. Novel Comparative Pattern Count Analysis Reveals a Chronic Ethanol-Induced Dynamic Shift in Immediate Early NF-κB Genome-wide Promoter Binding During Liver Regeneration

    PubMed Central

    Kuttippurathu, Lakshmi; Patra, Biswanath; Hoek, Jan B; Vadigepalli, Rajanikanth

    2016-01-01

    Liver regeneration after partial hepatectomy is a clinically important process that is impaired by adaptation to chronic alcohol intake. We focused on the initial time points following partial hepatectomy (PHx) to analyze genome-wide binding activity of NF-κB, a key immediate early regulator. We investigated the effect of chronic alcohol intake on immediate early NF-κB genome-wide localization, in the adapted state as well as in response to partial hepatectomy, using chromatin immunoprecipitation followed by promoter microarray analysis. We found many ethanol-specific NF-κB binding target promoters in the ethanol-adapted state, corresponding to regulation of biosynthetic processes, oxidation-reduction and apoptosis. Partial hepatectomy induced a diet-independent shift in NF-κB binding loci relative to the transcription start sites. We employed a novel pattern count analysis to exhaustively enumerate and compare the number of promoters corresponding to the temporal binding patterns in ethanol and pair-fed control groups. The highest pattern count corresponded to promoters with NF-κB binding exclusively in the ethanol group at 1 h post PHx. This set was associated with regulation of cell death, response to oxidative stress, histone modification, mitochondrial function, and metabolic processes. Integration with the global gene expression profiles to identify putative transcriptional consequences of NF-κB binding patterns revealed that several of the ethanol-specific 1 h binding targets showed ethanol-specific differential expression through 6 h post PHx. Motif analysis yielded co-incident binding loci for STAT3, AP-1, CREB, C/EBP-β, PPAR-γ and C/EBP-α, likely participating in co-regulatory modules with NF-κB in shaping the immediate early response to PHx. We conclude that adaptation to chronic ethanol intake disrupts the NF-κB promoter binding landscape with consequences for the immediate early gene regulatory response to the acute challenge of PHx. PMID:26847025

  7. Genome-wide analysis reveals distinct substrate specificities of Rrp6, Dis3, and core exosome subunits.

    PubMed

    Kiss, Daniel L; Andrulis, Erik D

    2010-04-01

    The RNA processing exosome complex was originally defined as an evolutionarily conserved multisubunit complex of ribonucleases responsible for the processing and/or turnover of stable RNAs. The exosome complex is also involved in the surveillance of mRNAs in both the nucleus and the cytoplasm, including nonsense-mediated decay (NMD) targets. The detailed mechanisms for how individual exosome subunits participate in each of these RNA metabolic pathways remain unclear. Here, we use RNAi to deplete exosome subunits, the exonucleases Rrp6 and Dis3, and an exosome cofactor in Drosophila melanogaster S2 tissue culture cells and assay the effects on global mRNA levels using gene expression microarrays. Consistent with the RNA degradative activities ascribed to the exosome, most mRNAs are increased. Notably, these stabilized mRNAs possess 3' untranslated regions that are longer than the representative transcriptomic average. Moreover, our results reveal substantial differences in the pools of affected mRNAs for each depleted subunit. For example, approximately 25% of the affected transcripts in Rrp6 depleted cells represent NMD substrates. While the affected mRNAs were dissimilar, they encode proteins that function in similar cellular pathways. We conclude that individual exosome subunits are largely functionally independent at the transcript level, but are interdependent on a transcriptomic level. PMID:20185544

  8. Genome-wide analysis reveals Sall4 to be a major regulator of pluripotency in murine-embryonic stem cells.

    PubMed

    Yang, Jianchang; Chai, Li; Fowles, Taylor C; Alipio, Zaida; Xu, Dan; Fink, Louis M; Ward, David C; Ma, Yupo

    2008-12-16

    Embryonic stem cells have potential utility in regenerative medicine because of their pluripotent characteristics. Sall4, a zinc-finger transcription factor, is expressed very early in embryonic development with Oct4 and Nanog, two well-characterized pluripotency regulators. Sall4 plays an important role in governing the fate of stem cells through transcriptional regulation of both Oct4 and Nanog. By using chromatin immunoprecipitation coupled to microarray hybridization (ChIP-on-chip), we have mapped global gene targets of Sall4 to further investigate regulatory processes in W4 mouse ES cells. A total of 3,223 genes were identified that were bound by the Sall4 protein on duplicate assays with high confidence, and many of these have major functions in developmental and regulatory pathways. Sall4 bound approximately twice as many annotated genes within promoter regions as Nanog and approximately four times as many as Oct4. Immunoprecipitation revealed a heteromeric protein complex(es) between Sall4, Oct4, and Nanog, consistent with binding site co-occupancies. Decreasing Sall4 expression in W4 ES cells decreases the expression levels of Oct4, Sox2, c-Myc, and Klf4, four proteins capable of reprogramming somatic cells to an induced pluripotent state. Further, Sall4 bound many genes that are regulated in part by chromatin-based epigenetic events mediated by polycomb-repressive complexes and bivalent domains. This suggests that Sall4 plays a diverse role in regulating stem cell pluripotency during early embryonic development through integration of transcriptional and epigenetic controls. PMID:19060217

  9. Gene Expression Quantitative Trait Locus Analysis of 16,000 Barley Genes Reveals a Complex Pattern of Genome-wide Transcriptional Regulation

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Transcript abundance data from cRNA hybridizations to Affymetrix microarrays can be used for simultaneous marker development and genome-wide eQTL (expression Quantitative Trait Loci) analysis of crops. We have shown that it is easily possible to use the information from Affymetrix expression arrays ...

  10. Network Analysis of Genome-Wide Selective Constraint Reveals a Gene Network Active in Early Fetal Brain Intolerant of Mutation

    PubMed Central

    Choi, Jinmyung; Samocha, Kaitlin E.; Daly, Mark J.

    2016-01-01

    Using robust, integrated analysis of multiple genomic datasets, we show that genes depleted for non-synonymous de novo mutations form a subnetwork of 72 members under strong selective constraint. We further show this subnetwork is preferentially expressed in the early development of the human hippocampus and is enriched for genes mutated in neurological Mendelian disorders. We thus conclude that carefully orchestrated developmental processes are under strong constraint in early brain development, and perturbations caused by mutation have adverse outcomes subject to strong purifying selection. Our findings demonstrate that selective forces can act on groups of genes involved in the same process, supporting the notion that purifying selection can act coordinately on multiple genes. Our approach provides a statistically robust, interpretable way to identify the tissues and developmental times where groups of disease genes are active. PMID:27305007

  11. Genome-wide analysis of Italian sheep diversity reveals a strong geographic pattern and cryptic relationships between breeds.

    PubMed

    Ciani, E; Crepaldi, P; Nicoloso, L; Lasagna, E; Sarti, F M; Moioli, B; Napolitano, F; Carta, A; Usai, G; D'Andrea, M; Marletta, D; Ciampolini, R; Riggio, V; Occidente, M; Matassino, D; Kompan, D; Modesto, P; Macciotta, N; Ajmone-Marsan, P; Pilla, F

    2014-04-01

    Italy counts several sheep breeds, arisen over centuries as a consequence of ancient and recent genetic and demographic events. To finely reconstruct genetic structure and relationships between Italian sheep, 496 subjects from 19 breeds were typed at 50K single nucleotide polymorphism loci. A subset of foreign breeds from the Sheep HapMap dataset was also included in the analyses. Genetic distances (as visualized either in a network or in a multidimensional scaling analysis of identical by state distances) closely reflected geographic proximity between breeds, with a clear north-south gradient, likely because of high levels of past gene flow and admixture all along the peninsula. Sardinian breeds diverged more from other breeds, a probable consequence of the combined effect of ancient sporadic introgression of feral mouflon and long-lasting genetic isolation from continental sheep populations. The study allowed the detection of previously undocumented episodes of recent introgression (Delle Langhe into the endangered Altamurana breed) as well as signatures of known, or claimed, historical introgression (Merino into Sopravissana and Gentile di Puglia; Bergamasca into Fabrianese, Appenninica and, to a lesser extent, Leccese). Arguments that would question, from a genomic point of view, the current breed classification of Bergamasca and Biellese into two separate breeds are presented. Finally, a role for traditional transhumance practices in shaping the genetic makeup of Alpine sheep breeds is proposed. The study represents the first exhaustive analysis of Italian sheep diversity in a European context, and it bridges the gap in the previous HapMap panel between Western Mediterranean and Swiss breeds. PMID:24303943

  12. Functional specificity of shuttling hnRNPs revealed by genome-wide analysis of their RNA binding profiles.

    PubMed

    Kim Guisbert, Karen; Duncan, Kent; Li, Hao; Guthrie, Christine

    2005-04-01

    Nab2, Npl3, and Nab4/Hrp1 are essential RNA binding proteins of the shuttling hnRNP class that are required for the efficient export of mRNA. To characterize the in vivo transcript specificity of these proteins, we identified their mRNA binding partners using a microarray-based assay. Each of the three proteins was coimmunoprecipitated with many different mRNA transcripts. Interestingly, each protein exhibits preferential associations with a distinct set of mRNAs. Notably, some of these appear to denote specific functional classes. For example, the ribosomal protein mRNAs and other highly expressed transcripts significantly favor association with Npl3 over Nab2, and Nab4/Hrp1 is strongly enriched with transcripts required for amino acid metabolism. Significantly, nab4 mutants showed a striking, desensitized growth phenotype when exposed to amino acid stress conditions suggesting a biological consequence to the associations we observed. Supporting the hypothesis that these proteins display transcript specificity, we identified a unique 7-nucleotide sequence overrepresented in the transcripts highly associated with Nab2 and Nab4/Hrp1 using the REDUCE algorithm. Validating our approach, our bioinformatics analysis correctly identified the known binding site for Nab4/Hrp1. These specialized associations of the hnRNP proteins of Saccharomyces cerevisiae suggest the opportunity to regulate the processing of particular transcripts between transcription and translation. PMID:15703440

  13. Genome-wide Analysis of Host-Plasmodium yoelii Interactions Reveals Regulators of the Type I Interferon Response.

    PubMed

    Wu, Jian; Cai, Baowei; Sun, Wenxiang; Huang, Ruili; Liu, Xueqiao; Lin, Meng; Pattaradilokrat, Sittiporn; Martin, Scott; Qi, Yanwei; Nair, Sethu C; Bolland, Silvia; Cohen, Jeffrey I; Austin, Christopher P; Long, Carole A; Myers, Timothy G; Wang, Rong-Fu; Su, Xin-Zhuan

    2015-07-28

    Invading pathogens trigger specific host responses, an understanding of which might identify genes that function in pathogen recognition and elimination. In this study, we performed trans-species expression quantitative trait locus (ts-eQTL) analysis using genotypes of the Plasmodium yoelii malaria parasite and phenotypes of mouse gene expression. We significantly linked 1,054 host genes to parasite genetic loci (LOD score ≥ 3.0). Using LOD score patterns, which produced results that differed from direct expression-level clustering, we grouped host genes that function in related pathways, allowing functional prediction of unknown genes. As a proof of principle, 14 of 15 randomly selected genes predicted to function in type I interferon (IFN-I) responses were experimentally validated using overexpression, small hairpin RNA knockdown, viral infection, and/or infection of knockout mice. This study demonstrates an effective strategy for studying gene function, establishes a functional gene database, and identifies regulators in IFN-I pathways. PMID:26190101
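
    The LOD threshold quoted in this abstract can be made concrete with a small sketch. It assumes a single two-allele parasite marker and a regression-style LOD, (n/2) * log10(RSS_null / RSS_marker); the abstract gives only the cutoff (LOD score ≥ 3.0), so the formula choice, the function name and the toy data are all illustrative assumptions rather than the authors' pipeline.

```python
import math


def lod_score(genotypes, expression):
    """LOD = (n / 2) * log10(RSS_null / RSS_marker) for one marker.

    genotypes: marker allele per sample (e.g. 0 or 1 for the two parents).
    expression: host gene expression value per sample.
    """
    n = len(expression)
    grand_mean = sum(expression) / n
    # Residual sum of squares with no marker effect (one common mean).
    rss_null = sum((y - grand_mean) ** 2 for y in expression)

    # Residual sum of squares with a separate mean per marker allele.
    rss_marker = 0.0
    for allele in set(genotypes):
        group = [y for g, y in zip(genotypes, expression) if g == allele]
        group_mean = sum(group) / len(group)
        rss_marker += sum((y - group_mean) ** 2 for y in group)

    if rss_null == 0.0:
        return 0.0  # constant phenotype: no evidence of linkage
    if rss_marker == 0.0:
        return math.inf  # marker explains the phenotype perfectly
    return (n / 2.0) * math.log10(rss_null / rss_marker)


# Hypothetical data: six parasite progeny and one host transcript.
parasite_alleles = [0, 0, 0, 1, 1, 1]
host_expression = [1.0, 2.0, 3.0, 11.0, 12.0, 13.0]

lod = lod_score(parasite_alleles, host_expression)
print(lod >= 3.0)  # would this host gene pass the LOD >= 3.0 cutoff?
```

    A LOD of 3.0 corresponds to a likelihood ratio of 1,000 to 1 in favor of linkage.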

  14. Genome-wide analysis reveals conserved transcriptional responses downstream of resting potential change in Xenopus embryos, axolotl regeneration, and human mesenchymal cell differentiation.

    PubMed

    Pai, Vaibhav P; Martyniuk, Christopher J; Echeverri, Karen; Sundelacruz, Sarah; Kaplan, David L; Levin, Michael

    2016-02-01

    Endogenous bioelectric signaling via changes in cellular resting potential (Vmem) is a key regulator of patterning during regeneration and embryogenesis in numerous model systems. Depolarization of Vmem has been functionally implicated in dedifferentiation, tumorigenesis, anatomical re-specification, and appendage regeneration. However, no unbiased analyses have been performed to understand genome-wide transcriptional responses to Vmem change in vivo. Moreover, it is unknown which genes or gene networks represent conserved targets of bioelectrical signaling across different patterning contexts and species. Here, we use microarray analysis to comparatively analyze transcriptional responses to Vmem depolarization. We compare the response of the transcriptome during embryogenesis (Xenopus development), regeneration (axolotl regeneration), and stem cell differentiation (human mesenchymal stem cells in culture) to identify common networks across model species that are associated with depolarization. Both subnetwork enrichment and PANTHER analyses identified a number of key genetic modules as targets of Vmem change, and also revealed important (well-conserved) commonalities in bioelectric signal transduction, despite highly diverse experimental contexts and species. Depolarization regulates specific transcriptional networks across all three germ layers (ectoderm, mesoderm, and endoderm) such as cell differentiation and apoptosis, and this information will be used for developing mechanistic models of bioelectric regulation of patterning. Moreover, our analysis reveals that Vmem change regulates transcripts related to important disease pathways such as cancer and neurodegeneration, which may represent novel targets for emerging electroceutical therapies. PMID:27499876

  15. Genome-wide transcriptomic analysis reveals correlation between higher WRKY61 expression and reduced symptom severity in Turnip crinkle virus infected Arabidopsis thaliana

    PubMed Central

    Gao, Ruimin; Liu, Peng; Yong, Yuhan; Wong, Sek-Man

    2016-01-01

    Turnip crinkle virus (TCV) is a carmovirus that infects many Arabidopsis ecotypes. Most studies have focused on discovering resistance genes against TCV infection, and no next-generation sequencing based comparative genome-wide transcriptome analysis has been reported. In this study, RNA-seq based transcriptome analysis identified 238 significantly differentially expressed genes (155 up-regulated and 83 down-regulated) with at least 15-fold change. Fifteen genes (including up-regulated, unchanged and down-regulated genes) were selected for validation of the RNA-seq data by quantitative real-time PCR, and the two data sets were consistent. GO enrichment analysis showed that numerous terms such as stress, immunity, defence and chemical stimulus were affected in TCV-infected plants. One putative plant defence related gene, WRKY61, was selected for further investigation. WRKY61 overexpression plants displayed reduced symptoms and less virus accumulation than wild type (WT) and WRKY61 deficient lines, suggesting that a higher WRKY61 expression level reduced TCV accumulation. In conclusion, our transcriptome analysis detected global gene expression changes in TCV-infected Arabidopsis thaliana. WRKY61 expression was shown to be negatively correlated with TCV infection and viral symptoms, which may be connected to plant immunity pathways. PMID:27086702

  16. Genome-wide transcriptomic analysis reveals correlation between higher WRKY61 expression and reduced symptom severity in Turnip crinkle virus infected Arabidopsis thaliana.

    PubMed

    Gao, Ruimin; Liu, Peng; Yong, Yuhan; Wong, Sek-Man

    2016-01-01

    Turnip crinkle virus (TCV) is a carmovirus that infects many Arabidopsis ecotypes. Most studies have focused on discovering resistance genes against TCV infection, and no next-generation sequencing based comparative genome-wide transcriptome analysis has been reported. In this study, RNA-seq based transcriptome analysis identified 238 significantly differentially expressed genes (155 up-regulated and 83 down-regulated) with at least 15-fold change. Fifteen genes (including up-regulated, unchanged and down-regulated genes) were selected for validation of the RNA-seq data by quantitative real-time PCR, and the two data sets were consistent. GO enrichment analysis showed that numerous terms such as stress, immunity, defence and chemical stimulus were affected in TCV-infected plants. One putative plant defence related gene, WRKY61, was selected for further investigation. WRKY61 overexpression plants displayed reduced symptoms and less virus accumulation than wild type (WT) and WRKY61 deficient lines, suggesting that a higher WRKY61 expression level reduced TCV accumulation. In conclusion, our transcriptome analysis detected global gene expression changes in TCV-infected Arabidopsis thaliana. WRKY61 expression was shown to be negatively correlated with TCV infection and viral symptoms, which may be connected to plant immunity pathways. PMID:27086702

  17. Rank-based genome-wide analysis reveals the association of Ryanodine receptor-2 gene variants with childhood asthma among human populations

    PubMed Central

    2013-01-01

    Background: The standard approach to determine unique or shared genetic factors across populations is to identify risk alleles in one population and investigate replication in others. However, since populations differ in DNA sequence information, allele frequencies, effect sizes, and linkage disequilibrium patterns, SNP association using a uniform stringent threshold on p values may not be reproducible across populations. Here, we developed rank-based methods to investigate shared or population-specific loci and pathways for childhood asthma across individuals of diverse ancestry. We performed genome-wide association studies on 859,790 SNPs genotyped in 527 affected offspring trios of European, African, and Hispanic ancestry using a publicly available asthma data set in the Genotypes and Phenotypes database. Results: Rank-based analyses showed that there are shared genetic factors for asthma across populations, more at the gene and pathway levels than at the SNP level. Although the top 1,000 SNPs were not shared, 11 genes (RYR2, PDE4D, CSMD1, CDH13, ROBO2, RBFOX1, PTPRD, NPAS3, PDE1C, SEMA5A, and CTNNA2) mapped by these SNPs were shared across populations. Ryanodine receptor 2 (RYR2, a statin response-related gene) showed the strongest association in European (p value = 2.55 × 10^-7) and was replicated in African (2.57 × 10^-4) and Hispanic (1.18 × 10^-3) Americans. Imputation analyses based on the 1000 Genomes Project uncovered additional RYR2 variants associated with asthma. Network and functional ontology analyses revealed that RYR2 is an integral part of dermatological or allergic disorder biological networks, specifically in the functional classes involving inflammatory, eosinophilic, and respiratory diseases. Conclusion: Our rank-based genome-wide analysis revealed for the first time an association of RYR2 variants with asthma and replicated the previously discovered PDE4D asthma gene across human populations. The replication of top

  18. Genome-Wide Association Analysis for Blood Lipid Traits Measured in Three Pig Populations Reveals a Substantial Level of Genetic Heterogeneity

    PubMed Central

    Yang, Hui; Huang, Xiaochang; Zeng, Zhijun; Zhang, Wanchang; Liu, Chenlong; Fang, Shaoming; Huang, Lusheng; Chen, Congying

    2015-01-01

    Serum lipids are associated with myocardial infarction and cardiovascular disease in humans. Here we dissected the genetic architecture of blood lipid traits by applying genome-wide association studies (GWAS) in 1,256 pigs from Laiwu, Erhualian and Duroc × (Landrace × Yorkshire) populations, and a meta-analysis of GWAS in more than 2,400 pigs from five diverse populations. A total of 22 genomic loci surpassing the suggestive significance level were detected on 11 pig chromosomes (SSC) for six blood lipid traits. Meta-analysis of GWAS identified 5 novel loci associated with blood lipid traits. Comparison of GWAS loci across the tested populations revealed a substantial level of genetic heterogeneity for porcine blood lipid levels. We further evaluated the causality of nine polymorphisms nearby or within the APOB gene on SSC3 for serum LDL-C and TC levels. Of the 9 polymorphisms, an indel showed the most significant association with LDL-C and TC in Laiwu pigs. But the significant association was not identified in the White Duroc × Erhualian F2 resource population, in which the QTL for LDL-C and TC was also detected on SSC3. This indicates that population-specific signals may exist for the SSC3 QTL. Further investigations are warranted to validate this assumption. PMID:26121138

  19. Genome-wide analysis of YB-1-RNA interactions reveals a novel role of YB-1 in miRNA processing in glioblastoma multiforme

    PubMed Central

    Wu, Shuai-Lai; Fu, Xing; Huang, Jinyan; Jia, Ting-Ting; Zong, Feng-Yang; Mu, Shi-Rong; Zhu, Hong; Yan, Yong; Qiu, Shuwei; Wu, Qun; Yan, Wei; Peng, Ying; Chen, Juxiang; Hui, Jingyi

    2015-01-01

    Altered miRNA expression is believed to play a crucial role in a variety of human cancers; however, the mechanisms leading to the dysregulation of miRNA expression remain elusive. In this study, we report that the human Y box-binding protein (YB-1), a major mRNA packaging protein, is a novel modulator of miRNA processing in glioblastoma multiforme (GBM). Using individual nucleotide-resolution crosslinking immunoprecipitation coupled to deep sequencing (iCLIP-seq), we performed the first genome-wide analysis of the in vivo YB-1-RNA interactions and found that YB-1 preferentially recognizes a UYAUC consensus motif and binds to the majority of coding gene transcripts including pre-mRNAs and mature mRNAs. Remarkably, our data show that YB-1 also binds extensively to the terminal loop region of pri-/pre-miR-29b-2 and regulates the biogenesis of miR-29b-2 by blocking the recruitment of microprocessor and Dicer to its precursors. Furthermore, we show that down-regulation of miR-29b by YB-1, which is up-regulated in GBM, is important for cell proliferation. Together, our findings reveal a novel function of YB-1 in regulating non-coding RNA expression, which has important implications in tumorigenesis. PMID:26240386

  20. Genome-wide analysis of histone methylation reveals chromatin state-based complex regulation of differential gene transcription and function of CD8 memory T cells

    PubMed Central

    Araki, Yasuto; Wang, Zhibin; Zang, Chongzhi; Wood, William H.; Schones, Dustin; Cui, Kairong; Roh, Tae-Young; Lhotsky, Brad; Wersto, Robert P.; Peng, Weiqun; Becker, Kevin G.; Zhao, Keji; Weng, Nan-ping

    2009-01-01

    Summary: Memory lymphocytes are characterized by their ability to exhibit a rapid response to the recall antigen, in which differential transcription plays a significant role, yet the underlying mechanism is not understood. We report here a genome-wide analysis of histone methylation on two histone H3 lysine residues (H3K4me3 and H3K27me3) and gene expression profiles in naïve and memory CD8 T cells. We found that a general correlation exists between the levels of gene expression and the levels of H3K4me3 (positive correlation) and H3K27me3 (negative correlation) across the gene body. These correlations display four distinct modes: repressive, active, poised, and bivalent, reflecting different functions of these genes. Furthermore, a permissive chromatin state of each gene is established by a combination of different histone modifications. Our findings reveal a complex regulation by histone methylation in differential gene expression and suggest that histone methylation may be responsible for memory CD8 T cell function. PMID:19523850

  1. Genome-wide analysis of hydrogen peroxide-regulated gene expression in Arabidopsis reveals a high light-induced transcriptional cluster involved in anthocyanin biosynthesis.

    PubMed

    Vanderauwera, Sandy; Zimmermann, Philip; Rombauts, Stéphane; Vandenabeele, Steven; Langebartels, Christian; Gruissem, Wilhelm; Inzé, Dirk; Van Breusegem, Frank

    2005-10-01

    In plants, reactive oxygen species and, more particularly, hydrogen peroxide (H2O2) play a dual role as toxic by-products of normal cell metabolism and as regulatory molecules in stress perception and signal transduction. Peroxisomal catalases are an important sink for photorespiratory H2O2. Using ATH1 Affymetrix microarrays, expression profiles were compared between control and catalase-deficient Arabidopsis (Arabidopsis thaliana) plants. Reduced catalase levels already provoked differences in nuclear gene expression under ambient growth conditions, and these effects were amplified by high light exposure in a sun simulator for 3 and 8 h. This genome-wide expression analysis allowed us to reveal the expression characteristics of complete pathways and functional categories during H2O2 stress. In total, 349 transcripts were significantly up-regulated by high light in catalase-deficient plants and 88 were down-regulated. From this data set, H2O2 was inferred to play a key role in the transcriptional up-regulation of small heat shock proteins during high light stress. In addition, several transcription factors and candidate regulatory genes involved in H2O2 transcriptional gene networks were identified. Comparisons with other publicly available transcriptome data sets of abiotically stressed Arabidopsis revealed an important intersection with H2O2-deregulated genes, positioning elevated H2O2 levels as an important signal within abiotic stress-induced gene expression. Finally, analysis of transcriptional changes in a combination of a genetic (catalase deficiency) and an environmental (high light) perturbation identified a transcriptional cluster that was strongly and rapidly induced by high light in control plants, but impaired in catalase-deficient plants. This cluster comprises the complete known anthocyanin regulatory and biosynthetic pathway, together with genes encoding unknown proteins. PMID:16183842

  2. Genome-Wide Analysis Reveals a Major Role in Cell Fate Maintenance and an Unexpected Role in Endoreduplication for the Drosophila FoxA Gene Fork Head

    PubMed Central

    Maruyama, Rika; Grevengoed, Elizabeth; Stempniewicz, Peter; Andrew, Deborah J.

    2011-01-01

    Transcription factors drive organogenesis, from the initiation of cell fate decisions to the maintenance and implementation of these decisions. The Drosophila embryonic salivary gland provides an excellent platform for unraveling the underlying transcriptional networks of organ development because Drosophila is relatively unencumbered by significant genetic redundancy. The highly conserved FoxA family transcription factors are essential for various aspects of organogenesis in all animals that have been studied. Here, we explore the role of the single Drosophila FoxA protein Fork head (Fkh) in salivary gland organogenesis using two genome-wide strategies. A large-scale in situ hybridization analysis reveals a major role for Fkh in maintaining the salivary gland fate decision and controlling salivary gland physiological activity, in addition to its previously known roles in morphogenesis and survival. The majority of salivary gland genes (59%) are affected by fkh loss, mainly at later stages of salivary gland development. We show that global expression of Fkh cannot drive ectopic salivary gland formation. Thus, unlike the worm FoxA protein PHA-4, Fkh does not function to specify cell fate. In addition, Fkh only indirectly regulates many salivary gland genes, which is also distinct from the role of PHA-4 in organogenesis. Our microarray analyses reveal unexpected roles for Fkh in blocking terminal differentiation and in endoreduplication in the salivary gland and in other Fkh-expressing embryonic tissues. Overall, this study demonstrates an important role for Fkh in determining how an organ preserves its identity throughout development and provides an alternative paradigm for how FoxA proteins function in organogenesis. PMID:21698206

  3. A genome-wide analysis of open chromatin in human epididymis epithelial cells reveals candidate regulatory elements for genes coordinating epididymal function.

    PubMed

    Bischof, Jared M; Gillen, Austin E; Song, Lingyun; Gosalia, Nehal; London, Darin; Furey, Terrence S; Crawford, Gregory E; Harris, Ann

    2013-10-01

    The epithelium lining the epididymis has a pivotal role in ensuring a luminal environment that can support normal sperm maturation. Many of the individual genes that encode proteins involved in establishing the epididymal luminal fluid are well characterized. They include ion channels, ion exchangers, transporters, and solute carriers. However, the molecular mechanisms that coordinate expression of these genes and modulate their activities in response to biological stimuli are less well understood. To identify cis-regulatory elements for genes expressed in human epididymis epithelial cells, we generated genome-wide maps of open chromatin by DNase-seq. This analysis identified 33,542 epididymis-selective DNase I hypersensitive sites (DHS), which were not evident in five cell types of different lineages. Identification of genes with epididymis-selective DHS at their promoters revealed gene pathways that are active in immature epididymis epithelial cells. These include processes correlating with epithelial function and also others with specific roles in the epididymis, including retinol metabolism and ascorbate and aldarate metabolism. Peaks of epididymis-selective chromatin were seen in the androgen receptor gene and the cystic fibrosis transmembrane conductance regulator (CFTR) gene, which has a critical role in regulating ion transport across the epididymis epithelium. In silico prediction of transcription factor binding sites that were overrepresented in epididymis-selective DHS identified epithelial transcription factors, including ELF5 and ELF3, the androgen receptor, Pax2, and Sox9, as components of epididymis transcriptional networks. Active genes, which are targets of each transcription factor, reveal important biological processes in the epididymis epithelium. PMID:24006278

  4. Genome-wide analysis reveals NRP1 as a direct HIF1α-E2F7 target in the regulation of motorneuron guidance in vivo.

    PubMed

    de Bruin, Alain; A Cornelissen, Peter W; Kirchmaier, Bettina C; Mokry, Michal; Iich, Elhadi; Nirmala, Ella; Liang, Kuo-Hsuan; D Végh, Anna M; Scholman, Koen T; Groot Koerkamp, Marian J; Holstege, Frank C; Cuppen, Edwin; Schulte-Merker, Stefan; Bakker, Walbert J

    2016-05-01

    In this study, we explored the existence of a transcriptional network co-regulated by E2F7 and HIF1α, as we show that expression of E2F7, like HIF1α, is induced in hypoxia, and because of the previously reported ability of E2F7 to interact with HIF1α. Our genome-wide analysis uncovers a transcriptional network that is directly controlled by HIF1α and E2F7, and demonstrates both stimulatory and repressive functions of the HIF1α-E2F7 complex. Among this network we reveal Neuropilin 1 (NRP1) as a HIF1α-E2F7 repressed gene. By performing in vitro and in vivo reporter assays we demonstrate that the HIF1α-E2F7 mediated NRP1 repression depends on a 41 base pairs 'E2F-binding site hub', providing a molecular mechanism for a previously unanticipated role for HIF1α in transcriptional repression. To explore the biological significance of this regulation we performed in situ hybridizations and observed enhanced nrp1a expression in spinal motorneurons (MN) of zebrafish embryos, upon morpholino-inhibition of e2f7/8 or hif1α. Consistent with the chemo-repellent role of nrp1a, morpholino-inhibition of e2f7/8 or hif1α caused MN truncations, which was rescued in TALEN-induced nrp1a(hu10012) mutants, and phenocopied in e2f7/8 mutant zebrafish. Therefore, we conclude that repression of NRP1 by the HIF1α-E2F7 complex regulates MN axon guidance in vivo. PMID:26681691

  5. Genome-wide analysis reveals NRP1 as a direct HIF1α-E2F7 target in the regulation of motorneuron guidance in vivo

    PubMed Central

    de Bruin, Alain; A. Cornelissen, Peter W.; Kirchmaier, Bettina C.; Mokry, Michal; Iich, Elhadi; Nirmala, Ella; Liang, Kuo-Hsuan; D. Végh, Anna M.; Scholman, Koen T.; Groot Koerkamp, Marian J.; Holstege, Frank C.; Cuppen, Edwin; Schulte-Merker, Stefan; Bakker, Walbert J.

    2016-01-01

    In this study, we explored the existence of a transcriptional network co-regulated by E2F7 and HIF1α, as we show that expression of E2F7, like HIF1α, is induced in hypoxia, and because of the previously reported ability of E2F7 to interact with HIF1α. Our genome-wide analysis uncovers a transcriptional network that is directly controlled by HIF1α and E2F7, and demonstrates both stimulatory and repressive functions of the HIF1α-E2F7 complex. Among this network we reveal Neuropilin 1 (NRP1) as a HIF1α-E2F7 repressed gene. By performing in vitro and in vivo reporter assays we demonstrate that the HIF1α-E2F7 mediated NRP1 repression depends on a 41 base pairs ‘E2F-binding site hub’, providing a molecular mechanism for a previously unanticipated role for HIF1α in transcriptional repression. To explore the biological significance of this regulation we performed in situ hybridizations and observed enhanced nrp1a expression in spinal motorneurons (MN) of zebrafish embryos, upon morpholino-inhibition of e2f7/8 or hif1α. Consistent with the chemo-repellent role of nrp1a, morpholino-inhibition of e2f7/8 or hif1α caused MN truncations, which was rescued in TALEN-induced nrp1a(hu10012) mutants, and phenocopied in e2f7/8 mutant zebrafish. Therefore, we conclude that repression of NRP1 by the HIF1α-E2F7 complex regulates MN axon guidance in vivo. PMID:26681691

  6. Genome-wide association interaction analysis for Alzheimer's disease

    PubMed Central

    Gusareva, Elena S.; Carrasquillo, Minerva M.; Bellenguez, Céline; Cuyvers, Elise; Colon, Samuel; Graff-Radford, Neill R.; Petersen, Ronald C.; Dickson, Dennis W.; Mahachie Johna, Jestinah M.; Bessonov, Kyrylo; Van Broeckhoven, Christine; Williams, Julie; Amouyel, Philippe; Sleegers, Kristel; Ertekin-Taner, Nilüfer; Lambert, Jean-Charles; Van Steen, Kristel

    2015-01-01

    We propose a minimal protocol for exhaustive genome-wide association interaction analysis that involves screening for epistasis over large-scale genomic data, combining the strengths of different methods and statistical tools. The different steps of this protocol are illustrated on a real-life data application for Alzheimer's disease (AD) (2259 patients and 6017 controls from France). In particular, in the exhaustive genome-wide epistasis screening we identified an AD-associated interacting SNP pair from chromosome 6q11.1 (rs6455128, the KHDRBS2 gene) and 13q12.11 (rs7989332, the CRYL1 gene) (p = 0.006, corrected for multiple testing). A replication analysis in the independent AD cohort from Germany (555 patients and 824 controls) confirmed the discovered epistasis signal (p = 0.036). This signal was also supported by a meta-analysis approach in 5 independent AD cohorts that was applied in the context of epistasis for the first time. Transcriptome analysis revealed negative correlation between expression levels of KHDRBS2 and CRYL1 in both the temporal cortex (β = −0.19, p = 0.0006) and cerebellum (β = −0.23, p < 0.0001) brain regions. This is the first time a replicable epistasis associated with AD was identified using a hypothesis-free screening approach. PMID:24958192

  7. Genome-wide Association Analysis of Blood-Pressure Traits in African-Ancestry Individuals Reveals Common Associated Genes in African and Non-African Populations

    PubMed Central

    Franceschini, Nora; Fox, Ervin; Zhang, Zhaogong; Edwards, Todd L.; Nalls, Michael A.; Sung, Yun Ju; Tayo, Bamidele O.; Sun, Yan V.; Gottesman, Omri; Adeyemo, Adebawole; Johnson, Andrew D.; Young, J. Hunter; Rice, Ken; Duan, Qing; Chen, Fang; Li, Yun; Tang, Hua; Fornage, Myriam; Keene, Keith L.; Andrews, Jeanette S.; Smith, Jennifer A.; Faul, Jessica D.; Guangfa, Zhang; Guo, Wei; Liu, Yu; Murray, Sarah S.; Musani, Solomon K.; Srinivasan, Sathanur; Velez Edwards, Digna R.; Wang, Heming; Becker, Lewis C.; Bovet, Pascal; Bochud, Murielle; Broeckel, Ulrich; Burnier, Michel; Carty, Cara; Chasman, Daniel I.; Ehret, Georg; Chen, Wei-Min; Chen, Guanjie; Chen, Wei; Ding, Jingzhong; Dreisbach, Albert W.; Evans, Michele K.; Guo, Xiuqing; Garcia, Melissa E.; Jensen, Rich; Keller, Margaux F.; Lettre, Guillaume; Lotay, Vaneet; Martin, Lisa W.; Moore, Jason H.; Morrison, Alanna C.; Mosley, Thomas H.; Ogunniyi, Adesola; Palmas, Walter; Papanicolaou, George; Penman, Alan; Polak, Joseph F.; Ridker, Paul M.; Salako, Babatunde; Singleton, Andrew B.; Shriner, Daniel; Taylor, Kent D.; Vasan, Ramachandran; Wiggins, Kerri; Williams, Scott M.; Yanek, Lisa R.; Zhao, Wei; Zonderman, Alan B.; Becker, Diane M.; Berenson, Gerald; Boerwinkle, Eric; Bottinger, Erwin; Cushman, Mary; Eaton, Charles; Nyberg, Fredrik; Heiss, Gerardo; Hirschhron, Joel N.; Howard, Virginia J.; Karczewsk, Konrad J.; Lanktree, Matthew B.; Liu, Kiang; Liu, Yongmei; Loos, Ruth; Margolis, Karen; Snyder, Michael; Go, Min Jin; Kim, Young Jin; Lee, Jong-Young; Jeon, Jae-Pil; Kim, Sung Soo; Han, Bok-Ghee; Cho, Yoon Shin; Sim, Xueling; Tay, Wan Ting; Ong, Rick Twee Hee; Seielstad, Mark; Liu, Jian Jun; Aung, Tin; Wong, Tien Yin; Teo, Yik Ying; Tai, E. Shyong; Chen, Chien-Hsiun; Chang, Li-ching; Chen, Yuan-Tsong; Wu, Jer-Yuarn; Kelly, Tanika N.; Gu, Dongfeng; Hixson, James E.; Sung, Yun Ju; He, Jiang; Tabara, Yasuharu; Kokubo, Yoshihiro; Miki, Tetsuro; Iwai, Naoharu; Kato, Norihiro; Takeuchi, Fumihiko; Katsuya, Tomohiro; Nabika, Toru; Sugiyama, Takao; Zhang, Yi; Huang, Wei; Zhang, Xuegong; Zhou, Xueya; Jin, Li; Zhu, Dingliang; Psaty, Bruce M.; Schork, Nicholas J.; Weir, David R.; Rotimi, Charles N.; Sale, Michele M.; Harris, Tamara; Kardia, Sharon L.R.; Hunt, Steven C.; Arnett, Donna; Redline, Susan; Cooper, Richard S.; Risch, Neil J.; Rao, D.C.; Rotter, Jerome I.; Chakravarti, Aravinda; Reiner, Alex P.; Levy, Daniel; Keating, Brendan J.; Zhu, Xiaofeng

    2013-01-01

    High blood pressure (BP) is more prevalent and contributes to more severe manifestations of cardiovascular disease (CVD) in African Americans than in any other United States ethnic group. Several small African-ancestry (AA) BP genome-wide association studies (GWASs) have been published, but their findings have failed to replicate to date. We report on a large AA BP GWAS meta-analysis that includes 29,378 individuals from 19 discovery cohorts and subsequent replication in additional samples of AA (n = 10,386), European ancestry (EA) (n = 69,395), and East Asian ancestry (n = 19,601). Five loci (EVX1-HOXA, ULK4, RSPO3, PLEKHG1, and SOX6) reached genome-wide significance (p < 1.0 × 10^-8) for either systolic or diastolic BP in a transethnic meta-analysis after correction for multiple testing. Three of these BP loci (EVX1-HOXA, RSPO3, and PLEKHG1) lack previous associations with BP. We also identified one independent signal in a known BP locus (SOX6) and provide evidence for fine mapping in four additional validated BP loci. We also demonstrate that validated EA BP GWAS loci, considered jointly, show significant effects in AA samples. Consequently, these findings suggest that BP loci might have universal effects across studied populations, demonstrating that multiethnic samples are an essential component in identifying, fine mapping, and understanding their trait variability. PMID:23972371

  8. Genome-wide association and pathway analysis of feed efficiency in pigs reveal candidate genes and pathways for residual feed intake

    PubMed Central

    Do, Duy N.; Strathe, Anders B.; Ostersen, Tage; Pant, Sameer D.; Kadarmideen, Haja N.

    2014-01-01

    Residual feed intake (RFI) is a complex trait that is economically important for livestock production; however, the genetic and biological mechanisms regulating RFI are largely unknown in pigs. Therefore, the study aimed to identify single nucleotide polymorphisms (SNPs), candidate genes and biological pathways involved in regulating RFI using genome-wide association (GWA) and pathway analyses. A total of 596 Yorkshire boars with phenotypes for two different measures of RFI (RFI1 and RFI2) and 60k genotypic data were used. GWA analysis was performed using a univariate mixed model, and 12 and 7 SNPs were found to be significantly associated with RFI1 and RFI2, respectively. Several genes such as xin actin-binding repeat-containing protein 2 (XIRP2), tetratricopeptide repeat domain 29 (TTC29), suppressor of glucose, autophagy associated 1 (SOGA1), MAS1, G-protein-coupled receptor (GPCR) kinase 5 (GRK5), prospero-homeobox protein 1 (PROX1), GPCR 155 (GPR155), and FYVE domain containing 26 (ZFYVE26) were identified as putative candidates for RFI based on their genomic location in the vicinity of these SNPs. Genes located within 50 kbp of SNPs significantly associated with RFI1 and RFI2 (q-value ≤ 0.2) were subsequently used for pathway analyses. These analyses were performed by assigning genes to biological pathways and then testing the association of individual pathways with RFI using a Fisher’s exact test. The metabolic pathway was significantly associated with both RFIs. Other biological pathways regulating phagosome, tight junctions, olfactory transduction, and insulin secretion were significantly associated with both RFI traits when a relaxed cut-off p-value threshold was used (p ≤ 0.05). These results implied porcine RFI is regulated by multiple biological mechanisms, although the metabolic processes might be the most important. Olfactory transduction pathway controlling the perception of feed via smell, insulin pathway controlling food intake might be important
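
    The pathway test named in this abstract (assigning genes to pathways, then applying Fisher's exact test) can be sketched as a one-sided enrichment calculation. The sketch below assumes a one-sided test on a 2×2 table of candidate versus background genes; the sidedness, the function name and all gene counts are hypothetical and are not taken from the study.

```python
from math import comb


def enrichment_p_value(hits, candidates, pathway_size, background):
    """One-sided Fisher's exact test (hypergeometric upper tail).

    hits:         candidate genes that belong to the pathway
    candidates:   all candidate genes (e.g. genes near associated SNPs)
    pathway_size: pathway genes among the background genes
    background:   all annotated genes considered
    Returns P(at least `hits` pathway genes among the candidates by chance).
    """
    upper = min(candidates, pathway_size)
    if not 0 <= hits <= upper:
        raise ValueError("hits must lie between 0 and min(candidates, pathway_size)")
    total = comb(background, candidates)
    tail = sum(
        comb(pathway_size, i) * comb(background - pathway_size, candidates - i)
        for i in range(hits, upper + 1)
    )
    return tail / total


# Hypothetical counts: 40 candidate genes, 8 of them in a 300-gene pathway,
# drawn from a background of 20,000 annotated genes.
p = enrichment_p_value(hits=8, candidates=40, pathway_size=300, background=20000)
print(p <= 0.05)  # compare with the relaxed cut-off quoted in the abstract
```

    Exact integer arithmetic through math.comb avoids floating-point underflow in the individual terms; only the final ratio is converted to a float.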

  9. Genome-wide analysis reveals the differential regulations of mRNAs and miRNAs in Dorset and Small Tail Han sheep muscles.

    PubMed

    Miao, Xiangyang; Luo, Qingmiao; Qin, Xiaoyu

    2015-05-15

    Sheep are highly diverse species raised for meat and other agricultural products. The aim of the present study was to investigate the genetic regulators that could control muscle growth and development in different sheep breeds. The study showed that the differentially expressed genes are involved in various cellular activities, such as metabolic cascades, catalytic function and signaling pathway. Many signaling molecules are also found to be differentially expressed, suggesting important roles of signaling pathways contributing to genetic diversity and sheep development. Analysis of miRNAs suggested important roles of miRNAs in controlling muscle differences. This study provided a genome-wide resolution of mRNA and miRNA regulations in muscles from Dorset and Han sheep. PMID:25732516

  10. Genome-wide SNP analysis reveals population structure and demographic history of the Ryukyu Islanders in the southern part of the Japanese archipelago.

    PubMed

    Sato, Takehiro; Nakagome, Shigeki; Watanabe, Chiaki; Yamaguchi, Kyoko; Kawaguchi, Akira; Koganebuchi, Kae; Haneji, Kuniaki; Yamaguchi, Tetsutaro; Hanihara, Tsunehiko; Yamamoto, Ken; Ishida, Hajime; Mano, Shuhei; Kimura, Ryosuke; Oota, Hiroki

    2014-11-01

    The Ryukyu Islands are located to the southwest of the Japanese archipelago. Archaeological evidence has revealed the existence of prehistoric cultural differentiation between the northern Ryukyu islands of Amami and Okinawa, and the southern Ryukyu islands of Miyako and Yaeyama. To examine a genetic subdivision in the Ryukyu Islands, we conducted genome-wide single nucleotide polymorphism typing of inhabitants from the Okinawa Islands, the Miyako Islands, and the Yaeyama Islands. Principal component and cluster analyses revealed genetic differentiation among the island groups, especially between Okinawa and Miyako. No genetic affinity was observed between aboriginal Taiwanese and any of the Ryukyu populations. The genetic differentiation observed between the inhabitants of the Okinawa Islands and the Miyako Islands is likely to have arisen due to genetic drift rather than admixture with people from neighboring regions. Based on the observed genetic differences, the divergence time between the inhabitants of Okinawa and Miyako islands was dated to the Holocene. These findings suggest that the Pleistocene inhabitants, whose bones have been found on the southern Ryukyu Islands, did not make a major genetic contribution, if any, to the present-day inhabitants of the southern Ryukyu Islands. PMID:25086001

  11. Genome-wide parent-of-origin DNA methylation analysis reveals the intricacies of human imprinting and suggests a germline methylation-independent mechanism of establishment

    PubMed Central

    Court, Franck; Tayama, Chiharu; Romanelli, Valeria; Martin-Trujillo, Alex; Iglesias-Platas, Isabel; Okamura, Kohji; Sugahara, Naoko; Simón, Carlos; Moore, Harry; Harness, Julie V.; Keirstead, Hans; Sanchez-Mut, Jose Vicente; Kaneki, Eisuke; Lapunzina, Pablo; Soejima, Hidenobu; Wake, Norio; Esteller, Manel; Ogata, Tsutomu; Hata, Kenichiro; Nakabayashi, Kazuhiko; Monk, David

    2014-01-01

    Differential methylation between the two alleles of a gene has been observed in imprinted regions, where the methylation of one allele occurs on a parent-of-origin basis, the inactive X-chromosome in females, and at those loci whose methylation is driven by genetic variants. We have extensively characterized imprinted methylation in a substantial range of normal human tissues, reciprocal genome-wide uniparental disomies, and hydatidiform moles, using a combination of whole-genome bisulfite sequencing and high-density methylation microarrays. This approach allowed us to define methylation profiles at known imprinted domains at base-pair resolution, as well as to identify 21 novel loci harboring parent-of-origin methylation, 15 of which are restricted to the placenta. We observe that the extent of imprinted differentially methylated regions (DMRs) is extremely similar between tissues, with the exception of the placenta. This extra-embryonic tissue often adopts a different methylation profile compared to somatic tissues. Further, we profiled all imprinted DMRs in sperm and embryonic stem cells derived from parthenogenetically activated oocytes, individual blastomeres, and blastocysts, in order to identify primary DMRs and reveal the extent of reprogramming during preimplantation development. Intriguingly, we find that in contrast to ubiquitous imprints, the majority of placenta-specific imprinted DMRs are unmethylated in sperm and all human embryonic stem cells. Therefore, placental-specific imprinting provides evidence for an inheritable epigenetic state that is independent of DNA methylation and the existence of a novel imprinting mechanism at these loci. PMID:24402520

  12. A genome-wide systems analysis reveals strong link between colorectal cancer and trimethylamine N-oxide (TMAO), a gut microbial metabolite of dietary meat and fat

    PubMed Central

    2015-01-01

    Background: Dietary intakes of red meat and fat are established risk factors for both colorectal cancer (CRC) and cardiovascular diseases (CVDs). Recent studies have shown a mechanistic link between TMAO, an intestinal microbial metabolite of red meat and fat, and risk of CVDs. Data linking TMAO directly to CRC are, however, lacking. Here, we present an unbiased data-driven network-based systems approach to uncover a potential genetic relationship between TMAO and CRC. Materials and methods: We constructed two different epigenetic interaction networks (EINs) using chemical-gene, disease-gene and protein-protein interaction data from multiple large-scale data resources. We developed a network-based ranking algorithm to ascertain TMAO-related diseases from EINs. We systematically analyzed disease categories among TMAO-related diseases at different ranking cutoffs. We then determined which genetic pathways were associated with both TMAO and CRC. Results: We show that CVDs and their major risk factors were ranked highly among TMAO-related diseases, confirming the newly discovered mechanistic link between CVDs and TMAO, and thus validating our algorithms. CRC was ranked highly among TMAO-related diseases retrieved from both EINs (top 0.02%, #1 out of 4,372 diseases retrieved based on Mendelian genetics and top 10.9% among 882 diseases based on genome-wide association genetics), providing strong supporting evidence for our hypothesis that TMAO is genetically related to CRC. We have also identified putative genetic pathways that may link TMAO to CRC, which warrant further investigation. Through systematic disease enrichment analysis, we also demonstrated that TMAO is related to metabolic syndromes and cancers in general. Conclusions: Our genome-wide analysis demonstrates that systems approaches to studying the epigenetic interactions among diet, microbiome metabolisms, and disease genetics hold promise for understanding disease pathogenesis. Our results show that TMAO is

  13. Genome-Wide Analysis of the Fasciclin-Like Arabinogalactan Protein Gene Family Reveals Differential Expression Patterns, Localization, and Salt Stress Response in Populus

    PubMed Central

    Zang, Lina; Zheng, Tangchun; Chu, Yanguang; Ding, Changjun; Zhang, Weixi; Huang, Qinjun; Su, Xiaohua

    2015-01-01

    Fasciclin-like arabinogalactan proteins (FLAs) are a subclass of arabinogalactan proteins (AGPs) involved in plant growth, development and response to abiotic stress. Although many studies have been performed to identify molecular functions of individual family members, little information is available on genome-wide identification and characterization of FLAs in the genus Populus. Based on genome-wide analysis, we have identified 35 Populus FLAs which were distributed on 16 chromosomes and phylogenetically clustered into four major groups. Gene structure and motif composition were relatively conserved in each group. All the members contained N-terminal signal peptide, 23 of which included predicted glycosylphosphatidylinositol (GPI) modification sites and were anchored to plasma membranes. Subcellular localization analysis showed that PtrFLA2/20/26 were localized in cell membrane and cytoplasm of protoplasts from Populus stem-differentiating xylem. The Ka/Ks ratios showed that purifying selection has played a leading role in the long-term evolutionary period which greatly maintained the function of this family. The expression profiles showed that 32 PtrFLAs were differentially expressed in four tissues at four seasons based on publicly available microarray data. 18 FLAs were further verified with qRT-PCR in different tissues, which indicated that PtrFLA1/2/3/7/11/12/20/21/22/24/26/30 were significantly expressed in male and female flowers, suggesting close correlations with the reproductive development. In addition, PtrFLA1/9/10/11/17/21/23/24/26/28 were highly expressed in the stems and differentiating xylem, which may be involved in stem development. To determine salt response of FLAs, qRT-PCR was performed to analyze the expression of 18 genes under salinity stress across two time points. Results demonstrated that all the 18 FLAs were expressed in root tissues; especially, PtrFLA2/12/20/21/24/30 were significantly induced at different time points. In summary

  14. Genome-wide meta-analysis identifies 56 bone mineral density loci and reveals 14 loci associated with risk of fracture

    PubMed Central

    Estrada, Karol; Styrkarsdottir, Unnur; Evangelou, Evangelos; Hsu, Yi-Hsiang; Duncan, Emma L; Ntzani, Evangelia E; Oei, Ling; Albagha, Omar M E; Amin, Najaf; Kemp, John P; Koller, Daniel L; Li, Guo; Liu, Ching-Ti; Minster, Ryan L; Moayyeri, Alireza; Vandenput, Liesbeth; Willner, Dana; Xiao, Su-Mei; Yerges-Armstrong, Laura M; Zheng, Hou-Feng; Alonso, Nerea; Eriksson, Joel; Kammerer, Candace M; Kaptoge, Stephen K; Leo, Paul J; Thorleifsson, Gudmar; Wilson, Scott G; Wilson, James F; Aalto, Ville; Alen, Markku; Aragaki, Aaron K; Aspelund, Thor; Center, Jacqueline R; Dailiana, Zoe; Duggan, David J; Garcia, Melissa; Garcia-Giralt, Natàlia; Giroux, Sylvie; Hallmans, Göran; Hocking, Lynne J; Husted, Lise Bjerre; Jameson, Karen A; Khusainova, Rita; Kim, Ghi Su; Kooperberg, Charles; Koromila, Theodora; Kruk, Marcin; Laaksonen, Marika; Lacroix, Andrea Z; Lee, Seung Hun; Leung, Ping C; Lewis, Joshua R; Masi, Laura; Mencej-Bedrac, Simona; Nguyen, Tuan V; Nogues, Xavier; Patel, Millan S; Prezelj, Janez; Rose, Lynda M; Scollen, Serena; Siggeirsdottir, Kristin; Smith, Albert V; Svensson, Olle; Trompet, Stella; Trummer, Olivia; van Schoor, Natasja M; Woo, Jean; Zhu, Kun; Balcells, Susana; Brandi, Maria Luisa; Buckley, Brendan M; Cheng, Sulin; Christiansen, Claus; Cooper, Cyrus; Dedoussis, George; Ford, Ian; Frost, Morten; Goltzman, David; González-Macías, Jesús; Kähönen, Mika; Karlsson, Magnus; Khusnutdinova, Elza; Koh, Jung-Min; Kollia, Panagoula; Langdahl, Bente Lomholt; Leslie, William D; Lips, Paul; Ljunggren, Östen; Lorenc, Roman S; Marc, Janja; Mellström, Dan; Obermayer-Pietsch, Barbara; Olmos, José M; Pettersson-Kymmer, Ulrika; Reid, David M; Riancho, José A; Ridker, Paul M; Rousseau, François; Slagboom, P Eline; Tang, Nelson LS; Urreizti, Roser; Van Hul, Wim; Viikari, Jorma; Zarrabeitia, María T; Aulchenko, Yurii S; Castano-Betancourt, Martha; Grundberg, Elin; Herrera, Lizbeth; Ingvarsson, Thorvaldur; Johannsdottir, Hrefna; Kwan, Tony; Li, Rui; Luben, Robert; Medina-Gómez, Carolina; Palsson, Stefan Th; Reppe, Sjur; Rotter, Jerome I; Sigurdsson, Gunnar; van Meurs, Joyce B J; Verlaan, Dominique; Williams, Frances MK; Wood, Andrew R; Zhou, Yanhua; Gautvik, Kaare M; Pastinen, Tomi; Raychaudhuri, Soumya; Cauley, Jane A; Chasman, Daniel I; Clark, Graeme R; Cummings, Steven R; Danoy, Patrick; Dennison, Elaine M; Eastell, Richard; Eisman, John A; Gudnason, Vilmundur; Hofman, Albert; Jackson, Rebecca D; Jones, Graeme; Jukema, J Wouter; Khaw, Kay-Tee; Lehtimäki, Terho; Liu, Yongmei; Lorentzon, Mattias; McCloskey, Eugene; Mitchell, Braxton D; Nandakumar, Kannabiran; Nicholson, Geoffrey C; Oostra, Ben A; Peacock, Munro; Pols, Huibert A P; Prince, Richard L; Raitakari, Olli; Reid, Ian R; Robbins, John; Sambrook, Philip N; Sham, Pak Chung; Shuldiner, Alan R; Tylavsky, Frances A; van Duijn, Cornelia M; Wareham, Nick J; Cupples, L Adrienne; Econs, Michael J; Evans, David M; Harris, Tamara B; Kung, Annie Wai Chee; Psaty, Bruce M; Reeve, Jonathan; Spector, Timothy D; Streeten, Elizabeth A; Zillikens, M Carola; Thorsteinsdottir, Unnur; Ohlsson, Claes; Karasik, David; Richards, J Brent; Brown, Matthew A; Stefansson, Kari; Uitterlinden, André G; Ralston, Stuart H; Ioannidis, John P A; Kiel, Douglas P; Rivadeneira, Fernando

    2012-01-01

    Bone mineral density (BMD) is the most important predictor of fracture risk. We performed the largest meta-analysis to date on lumbar spine and femoral neck BMD, including 17 genome-wide association studies and 32,961 individuals of European and East Asian ancestry. We tested the top-associated BMD markers for replication in 50,933 independent subjects and for risk of low-trauma fracture in 31,016 cases and 102,444 controls. We identified 56 loci (32 novel) associated with BMD at genome-wide significant level (P<5×10^−8). Several of these factors cluster within the RANK-RANKL-OPG, mesenchymal-stem-cell differentiation, endochondral ossification and the Wnt signalling pathways. However, we also discovered loci containing genes not known to play a role in bone biology. Fourteen BMD loci were also associated with fracture risk (P<5×10^−4, Bonferroni corrected), of which six reached P<5×10^−8 including: 18p11.21 (C18orf19), 7q21.3 (SLC25A13), 11q13.2 (LRP5), 4q22.1 (MEPE), 2p16.2 (SPTBN1) and 10q21.1 (DKK1). These findings shed light on the genetic architecture and pathophysiological mechanisms underlying BMD variation and fracture susceptibility. PMID:22504420

  15. Massively expedited genome-wide heritability analysis (MEGHA)

    PubMed Central

    Ge, Tian; Nichols, Thomas E.; Lee, Phil H.; Holmes, Avram J.; Roffman, Joshua L.; Buckner, Randy L.; Sabuncu, Mert R.; Smoller, Jordan W.

    2015-01-01

    The discovery and prioritization of heritable phenotypes is a computational challenge in a variety of settings, including neuroimaging genetics and analyses of the vast phenotypic repositories in electronic health record systems and population-based biobanks. Classical estimates of heritability require twin or pedigree data, which can be costly and difficult to acquire. Genome-wide complex trait analysis is an alternative tool to compute heritability estimates from unrelated individuals, using genome-wide data that are increasingly ubiquitous, but is computationally demanding and becomes difficult to apply in evaluating very large numbers of phenotypes. Here we present a fast and accurate statistical method for high-dimensional heritability analysis using genome-wide SNP data from unrelated individuals, termed massively expedited genome-wide heritability analysis (MEGHA) and accompanying nonparametric sampling techniques that enable flexible inferences for arbitrary statistics of interest. MEGHA produces estimates and significance measures of heritability with several orders of magnitude less computational time than existing methods, making heritability-based prioritization of millions of phenotypes based on data from unrelated individuals tractable for the first time to our knowledge. As a demonstration of application, we conducted heritability analyses on global and local morphometric measurements derived from brain structural MRI scans, using genome-wide SNP data from 1,320 unrelated young healthy adults of non-Hispanic European ancestry. We also computed surface maps of heritability for cortical thickness measures and empirically localized cortical regions where thickness measures were significantly heritable. Our analyses demonstrate the unique capability of MEGHA for large-scale heritability-based screening and high-dimensional heritability profile construction. PMID:25675487

  16. Massively expedited genome-wide heritability analysis (MEGHA).

    PubMed

    Ge, Tian; Nichols, Thomas E; Lee, Phil H; Holmes, Avram J; Roffman, Joshua L; Buckner, Randy L; Sabuncu, Mert R; Smoller, Jordan W

    2015-02-24

    The discovery and prioritization of heritable phenotypes is a computational challenge in a variety of settings, including neuroimaging genetics and analyses of the vast phenotypic repositories in electronic health record systems and population-based biobanks. Classical estimates of heritability require twin or pedigree data, which can be costly and difficult to acquire. Genome-wide complex trait analysis is an alternative tool to compute heritability estimates from unrelated individuals, using genome-wide data that are increasingly ubiquitous, but is computationally demanding and becomes difficult to apply in evaluating very large numbers of phenotypes. Here we present a fast and accurate statistical method for high-dimensional heritability analysis using genome-wide SNP data from unrelated individuals, termed massively expedited genome-wide heritability analysis (MEGHA) and accompanying nonparametric sampling techniques that enable flexible inferences for arbitrary statistics of interest. MEGHA produces estimates and significance measures of heritability with several orders of magnitude less computational time than existing methods, making heritability-based prioritization of millions of phenotypes based on data from unrelated individuals tractable for the first time to our knowledge. As a demonstration of application, we conducted heritability analyses on global and local morphometric measurements derived from brain structural MRI scans, using genome-wide SNP data from 1,320 unrelated young healthy adults of non-Hispanic European ancestry. We also computed surface maps of heritability for cortical thickness measures and empirically localized cortical regions where thickness measures were significantly heritable. Our analyses demonstrate the unique capability of MEGHA for large-scale heritability-based screening and high-dimensional heritability profile construction. PMID:25675487

  17. Genome-wide CNV analysis in 221 unrelated patients and targeted high-throughput sequencing reveal novel causative candidate genes for colorectal adenomatous polyposis.

    PubMed

    Horpaopan, Sukanya; Spier, Isabel; Zink, Alexander M; Altmüller, Janine; Holzapfel, Stefanie; Laner, Andreas; Vogt, Stefanie; Uhlhaas, Siegfried; Heilmann, Stefanie; Stienen, Dietlinde; Pasternack, Sandra M; Keppler, Kathleen; Adam, Ronja; Kayser, Katrin; Moebus, Susanne; Draaken, Markus; Degenhardt, Franziska; Engels, Hartmut; Hofmann, Andrea; Nöthen, Markus M; Steinke, Verena; Perez-Bouza, Alberto; Herms, Stefan; Holinski-Feder, Elke; Fröhlich, Holger; Thiele, Holger; Hoffmann, Per; Aretz, Stefan

    2015-03-15

    To uncover novel causative genes in patients with unexplained adenomatous polyposis, a model disease for colorectal cancer, we performed a genome-wide analysis of germline copy number variants (CNV) in a large, well characterized APC and MUTYH mutation negative patient cohort followed by a targeted next generation sequencing (NGS) approach. Genomic DNA from 221 unrelated German patients was genotyped on high-resolution SNP arrays. Putative CNVs were filtered according to stringent criteria, compared with those of 531 population-based German controls, and validated by qPCR. Candidate genes were prioritized using in silico, expression, and segregation analyses, data mining and enrichment analyses of genes and pathways. In 27% of the 221 unrelated patients, a total of 77 protein coding genes displayed rare, nonrecurrent, germline CNVs. The set included 26 candidates with molecular and cellular functions related to tumorigenesis. Targeted high-throughput sequencing found truncating point mutations in 12% (10/77) of the prioritized genes. No clear evidence was found for autosomal recessive subtypes. Six patients had potentially causative mutations in more than one of the 26 genes. Combined with data from recent studies of early-onset colorectal and breast cancer, recurrent potential loss-of-function alterations were detected in CNTN6, FOCAD (KIAA1797), HSPH1, KIF26B, MCM3AP, YBEY and in three genes from the ARHGAP family. In the canonical Wnt pathway oncogene CTNNB1 (β-catenin), two potential gain-of-function mutations were found. In conclusion, the present study identified a group of rarely affected genes which are likely to predispose to colorectal adenoma formation and confirmed previously published candidates for tumor predisposition as etiologically relevant. PMID:25219767

  18. Genome-wide DNA methylation analysis in hepatocellular carcinoma.

    PubMed

    Yamada, Nobuhisa; Yasui, Kohichiroh; Dohi, Osamu; Gen, Yasuyuki; Tomie, Akira; Kitaichi, Tomoko; Iwai, Naoto; Mitsuyoshi, Hironori; Sumida, Yoshio; Moriguchi, Michihisa; Yamaguchi, Kanji; Nishikawa, Taichiro; Umemura, Atsushi; Naito, Yuji; Tanaka, Shinji; Arii, Shigeki; Itoh, Yoshito

    2016-04-01

    Epigenetic changes as well as genetic changes are mechanisms of tumorigenesis. We aimed to identify novel genes that are silenced by DNA hypermethylation in hepatocellular carcinoma (HCC). We screened for genes with promoter DNA hypermethylation using a genome-wide methylation microarray analysis in primary HCC (the discovery set). The microarray analysis revealed 2,670 CpG sites that differed significantly in methylation level between the tumor and non-tumor liver tissues; 875 were significantly hypermethylated and 1,795 were significantly hypomethylated in the HCC tumors compared to the non-tumor tissues. Further analyses using methylation-specific PCR, combined with expression analysis, in the validation set of primary HCC showed that, in addition to three known tumor-suppressor genes (APC, CDKN2A, and GSTP1), eight genes (AKR1B1, GRASP, MAP9, NXPE3, RSPH9, SPINT2, STEAP4, and ZNF154) were significantly hypermethylated and downregulated in the HCC tumors compared to the non-tumor liver tissues. Our results suggest that epigenetic silencing of these genes may be associated with HCC. PMID:26883180

  19. Genome-wide pathway analysis of genome-wide association studies on systemic lupus erythematosus and rheumatoid arthritis.

    PubMed

    Lee, Young Ho; Bae, Sang-Cheol; Choi, Sung Jae; Ji, Jong Dae; Song, Gwan Gyu

    2012-12-01

    The aim of this study was to explore candidate single nucleotide polymorphisms (SNPs) and candidate mechanisms of systemic lupus erythematosus (SLE) and rheumatoid arthritis (RA). Two SLE genome-wide association study (GWAS) datasets were included in this study. Meta-analysis was conducted using 737,984 SNPs in 1,527 SLE cases and 3,421 controls of European ancestry, and 4,429 SNPs that met a threshold of p < 0.01 in a Korean RA GWAS dataset were used. ICSNPathway (identify candidate causal SNPs and pathways) analysis was applied to the meta-analysis results of the SLE GWAS datasets, and to an RA GWAS dataset. The most significant result of SLE GWAS meta-analysis concerned rs2051549 in the human leukocyte antigen (HLA) region (p = 3.36E-22). In the non-HLA region, meta-analysis identified 6 SNPs associated with SLE with genome-wide significance (STAT4, TNPO3, BLK, FAM167A, and IRF5). ICSNPathway identified five candidate causal SNPs and 13 candidate causal pathways. This pathway-based analysis provides three hypotheses of the biological mechanism involved. First, rs8084 and rs7192 → HLA-DRA → bystander B cell activation. Second, rs1800629 → TNF → cytokine network. Third, rs1150752 and rs185819 → TNXB → collagen metabolic process. ICSNPathway analysis identified three candidate causal non-HLA SNPs and four candidate causal pathways involving the PADI4, MTR, PADI2, and TPH2 genes of RA. We identified five candidate SNPs and thirteen pathways, involving bystander B cell activation, cytokine network, and collagen metabolic processing, which may contribute to SLE susceptibility, and we revealed candidate causal non-HLA SNPs, genes, and pathways of RA. PMID:23053960

  20. Genome-wide analysis of BMI in adolescents and young adults reveals additional insight into the effects of genetic loci over the life course

    PubMed Central

    Graff, Mariaelisa; Ngwa, Julius S.; Workalemahu, Tsegaselassie; Homuth, Georg; Schipf, Sabine; Teumer, Alexander; Völzke, Henry; Wallaschofski, Henri; Abecasis, Goncalo R.; Edward, Lakatta; Francesco, Cucca; Sanna, Serena; Scheet, Paul; Schlessinger, David; Sidore, Carlo; Xiao, Xiangjun; Wang, Zhaoming; Chanock, Stephen J.; Jacobs, Kevin B.; Hayes, Richard B.; Hu, Frank; Van Dam, Rob M.; Crout, Richard J.; Marazita, Mary L.; Shaffer, John R; Atwood, Larry D.; Fox, Caroline S.; Heard-Costa, Nancy L.; White, Charles; Choh, Audrey C.; Czerwinski, Stefan A.; Demerath, Ellen W.; Dyer, Thomas D.; Towne, Bradford; Amin, Najaf; Oostra, Ben A.; Van Duijn, Cornelia M.; Zillikens, M. Carola; Esko, Tõnu; Nelis, Mari; Nikopensius, Tit; Metspalu, Andres; Strachan, David P.; Monda, Keri; Qi, Lu; North, Kari E.; Cupples, L. Adrienne; Gordon-Larsen, Penny; Berndt, Sonja I.

    2013-01-01

    Genetic loci for body mass index (BMI) in adolescence and young adulthood, a period of high risk for weight gain, are understudied, yet may yield important insight into the etiology of obesity and early intervention. To identify novel genetic loci and examine the influence of known loci on BMI during this critical time period in late adolescence and early adulthood, we performed a two-stage meta-analysis using 14 genome-wide association studies in populations of European ancestry with data on BMI between ages 16 and 25 in up to 29 880 individuals. We identified seven independent loci (P < 5.0 × 10^−8) near FTO (P = 3.72 × 10^−23), TMEM18 (P = 3.24 × 10^−17), MC4R (P = 4.41 × 10^−17), TNNI3K (P = 4.32 × 10^−11), SEC16B (P = 6.24 × 10^−9), GNPDA2 (P = 1.11 × 10^−8) and POMC (P = 4.94 × 10^−8) as well as a potential secondary signal at the POMC locus (rs2118404, P = 2.4 × 10^−5 after conditioning on the established single-nucleotide polymorphism at this locus) in adolescents and young adults. To evaluate the impact of the established genetic loci on BMI at these young ages, we examined differences between the effect sizes of 32 published BMI loci in European adult populations (aged 18–90) and those observed in our adolescent and young adult meta-analysis. Four loci (near PRKD1, TNNI3K, SEC16B and CADM2) had larger effects and one locus (near SH2B1) had a smaller effect on BMI during adolescence and young adulthood compared with older adults (P < 0.05). These results suggest that genetic loci for BMI can vary in their effects across the life course, underscoring the importance of evaluating BMI at different ages. PMID:23669352

  1. A genome-wide linkage and association scan reveals novel loci for autism

    PubMed Central

    Weiss, Lauren A.; Arking, Dan E.

    2009-01-01

    Summary: Although autism is a highly heritable neurodevelopmental disorder, attempts to identify specific susceptibility genes have thus far met with limited success (1). Genome-wide association studies (GWAS) using half a million or more markers, particularly those with very large sample sizes achieved through meta-analysis, have shown great success in mapping genes for other complex genetic traits (http://www.genome.gov/26525384). Consequently, we initiated a linkage and association mapping study using half a million genome-wide SNPs in a common set of 1,031 multiplex autism families (1,553 affected offspring). We identified regions of suggestive and significant linkage on chromosomes 6q27 and 20p13, respectively. Initial analysis did not yield genome-wide significant associations; however, genotyping of top hits in additional families revealed a SNP on chromosome 5p15 (between SEMA5A and TAS2R1) that was significantly associated with autism (P = 2 × 10^−7). We also demonstrated that expression of SEMA5A is reduced in brains from autistic patients, further implicating SEMA5A as an autism susceptibility gene. The linkage regions reported here provide targets for rare variation screening while the discovery of a single novel association demonstrates the action of common variants. PMID:19812673

  2. Genome-Wide Analysis of Human Metapneumovirus Evolution

    PubMed Central

    Kim, Jin Il; Park, Sehee; Lee, Ilseob; Park, Kwang Sook; Kwak, Eun Jung; Moon, Kwang Mee; Lee, Chang Kyu; Bae, Joon-Yong; Park, Man-Seong; Song, Ki-Joon

    2016-01-01

    Human metapneumovirus (HMPV) has been described as an important etiologic agent of upper and lower respiratory tract infections, especially in young children and the elderly. Most school-aged children might be introduced to HMPVs, and exacerbation with other viral or bacterial super-infection is common. However, our understanding of the molecular evolution of HMPVs remains limited. To address the comprehensive evolutionary dynamics of HMPVs, we report a genome-wide analysis of the eight genes (N, P, M, F, M2, SH, G, and L) using 103 complete genome sequences. Phylogenetic reconstruction revealed that the eight genes from one HMPV strain grouped into the same genetic group among the five distinct lineages (A1, A2a, A2b, B1, and B2). A few exceptions of phylogenetic incongruence might suggest past recombination events, and we detected possible recombination breakpoints in the F, SH, and G coding regions. The five genetic lineages of HMPVs shared quite remote common ancestors ranging from more than 220 to 470 years of age, with the most recent origins for the A2b sublineage. Purifying selection was common, but most protein genes except the F and M2-2 coding regions also appeared to experience episodic diversifying selection. Taken together, these suggest that the five lineages of HMPVs maintain their individual evolutionary dynamics and that recombination and selection forces might work on shaping the genetic diversity of HMPVs. PMID:27046055

  3. Genome-Wide Detection and Analysis of Multifunctional Genes

    PubMed Central

    Pritykin, Yuri; Ghersi, Dario; Singh, Mona

    2015-01-01

    Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655
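
    The idea in the record above of extracting genes that play a role in at least two distinct biological processes can be sketched as follows. This is only an illustration, not the authors' method: it assumes that two annotation terms count as "distinct" when neither is an ancestor of the other in the annotation hierarchy, and the term names, the `parents` map and the function names are hypothetical.

```python
from itertools import combinations


def ancestors(term, parents):
    """Return all ancestors of a term, given a child -> list-of-parents map."""
    seen = set()
    stack = [term]
    while stack:
        for parent in parents.get(stack.pop(), ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def is_multifunctional(terms, parents):
    """True if some pair of annotated terms is unrelated by ancestry."""
    for a, b in combinations(sorted(terms), 2):
        if a not in ancestors(b, parents) and b not in ancestors(a, parents):
            return True
    return False


# Toy hierarchy and annotations (hypothetical, for illustration only).
parents = {
    "dna_repair": ["dna_metabolism"],
    "dna_metabolism": ["metabolism"],
    "cell_adhesion": [],
}
annotations = {
    "geneA": {"dna_repair", "dna_metabolism"},  # nested terms: one process
    "geneB": {"dna_repair", "cell_adhesion"},   # unrelated terms: two processes
}
multifunctional = sorted(g for g, t in annotations.items()
                         if is_multifunctional(t, parents))
```

    With the toy data, only `geneB` is flagged, because the two terms of `geneA` lie on the same branch of the hierarchy. A real analysis would need a stricter notion of distinctness than this ancestry check.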

  4. Genome-Wide Detection and Analysis of Multifunctional Genes.

    PubMed

    Pritykin, Yuri; Ghersi, Dario; Singh, Mona

    2015-10-01

    Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms--H. sapiens, D. melanogaster, and S. cerevisiae--and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655

  5. The combination of a genome-wide association study of lymphocyte count and analysis of gene expression data reveals novel asthma candidate genes

    PubMed Central

    Cusanovich, Darren A.; Billstrand, Christine; Zhou, Xiang; Chavarria, Claudia; De Leon, Sherryl; Michelini, Katelyn; Pai, Athma A.; Ober, Carole; Gilad, Yoav

    2012-01-01

    Recent genome-wide association studies (GWAS) have identified a number of novel genetic associations with complex human diseases. In spite of these successes, results from GWAS generally explain only a small proportion of disease heritability, an observation termed the ‘missing heritability problem’. Several sources for the missing heritability have been proposed, including the contribution of many common variants with small individual effect sizes, which cannot be reliably found using the standard GWAS approach. The goal of our study was to explore a complementary approach, which combines GWAS results with functional data in order to identify novel genetic associations with small effect sizes. To do so, we conducted a GWAS for lymphocyte count, a physiologic quantitative trait associated with asthma, in 462 Hutterites. In parallel, we performed a genome-wide gene expression study in lymphoblastoid cell lines from 96 Hutterites. We found significant support for genetic associations using the GWAS data when we considered variants near the 193 genes whose expression levels across individuals were most correlated with lymphocyte counts. Interestingly, these variants are also enriched with signatures of an association with asthma susceptibility, an observation we were able to replicate. The associated loci include genes previously implicated in asthma susceptibility as well as novel candidate genes enriched for functions related to T cell receptor signaling and adenosine triphosphate synthesis. Our results, therefore, establish a new set of asthma susceptibility candidate genes. More generally, our observations support the notion that many loci of small effects influence variation in lymphocyte count and asthma susceptibility. PMID:22286170

  6. Genome-Wide Analysis Reveals Diverged Patterns of Codon Bias, Gene Expression, and Rates of Sequence Evolution in Picea Gene Families

    PubMed Central

    De La Torre, Amanda R.; Lin, Yao-Cheng; Van de Peer, Yves; Ingvarsson, Pär K.

    2015-01-01

    The recent sequencing of several gymnosperm genomes has greatly facilitated studying the evolution of their genes and gene families. In this study, we examine the evidence for expression-mediated selection in the first two fully sequenced representatives of the gymnosperm plant clade (Picea abies and Picea glauca). We use genome-wide estimates of gene expression (>50,000 expressed genes) to study the relationship between gene expression, codon bias, rates of sequence divergence, protein length, and gene duplication. We found that gene expression is correlated with rates of sequence divergence and codon bias, suggesting that natural selection is acting on Picea protein-coding genes for translational efficiency. Gene expression, rates of sequence divergence, and codon bias are correlated with the size of gene families, with large multicopy gene families having, on average, a lower expression level and breadth, lower codon bias, and higher rates of sequence divergence than single-copy gene families. Tissue-specific patterns of gene expression were more common in large gene families with large gene expression divergence than in single-copy families. Recent family expansions combined with large gene expression variation in paralogs and increased rates of sequence evolution suggest that some Picea gene families are rapidly evolving to cope with biotic and abiotic stress. Our study highlights the importance of gene expression and natural selection in shaping the evolution of protein-coding genes in Picea species, and sets the ground for further studies investigating the evolution of individual gene families in gymnosperms. PMID:25747252

  7. Genome-wide localization analysis of a complete set of Tafs reveals a specific effect of the taf1 mutation on Taf2 occupancy and provides indirect evidence for different TFIID conformations at different promoters.

    PubMed

    Ohtsuki, Kazushige; Kasahara, Koji; Shirahige, Katsuhiko; Kokubo, Tetsuro

    2010-04-01

    In Saccharomyces cerevisiae, TFIID and SAGA principally mediate transcription of constitutive housekeeping genes and stress-inducible genes, respectively, by delivering TBP to the core promoter. Both are multi-protein complexes composed of 15 and 20 subunits, respectively, five of which are common and which may constitute a core sub-module in each complex. Although genome-wide gene expression studies have been conducted extensively in several TFIID and/or SAGA mutants, there are only a limited number of studies investigating genome-wide localization of the components of these two complexes. Specifically, there are no previous reports on localization of a complete set of Tafs and the effects of taf mutations on localization. Here, we examine the localization profiles of a complete set of Tafs, Gcn5, Bur6/Ncb2, Sua7, Tfa2, Tfg1, Tfb3 and Rpb1, on chromosomes III, IV and V by chromatin immunoprecipitation (ChIP)-chip analysis in wild-type and taf1-T657K mutant strains. In addition, we conducted conventional and sequential ChIP analysis of several ribosomal protein genes (RPGs) and non-RPGs. Intriguingly, the results revealed a novel relationship between TFIIB and NC2, simultaneous co-localization of SAGA and TFIID on RPG promoters, specific effects of the taf1 mutation on Taf2 occupancy, and indirect evidence for the existence of different TFIID conformations. PMID:20026583

  8. Genome-Wide Ultrabithorax Binding Analysis Reveals Highly Targeted Genomic Loci at Developmental Regulators and a Potential Connection to Polycomb-Mediated Regulation

    PubMed Central

    Meireles-Filho, Antonio C. A.; Pagani, Michaela; Stark, Alexander

    2016-01-01

    Hox homeodomain transcription factors are key regulators of animal development. They specify the identity of segments along the anterior-posterior body axis in metazoans by controlling the expression of diverse downstream targets, including transcription factors and signaling pathway components. The Drosophila melanogaster Hox factor Ultrabithorax (Ubx) directs the development of thoracic and abdominal segments and appendages, and loss of Ubx function can lead, for example, to the transformation of third thoracic segment appendages (e.g. halteres) into second thoracic segment appendages (e.g. wings), resulting in a characteristic four-wing phenotype. Here we present a Drosophila melanogaster strain with a V5-epitope tagged Ubx allele, which we employed to obtain a high quality genome-wide map of Ubx binding sites using ChIP-seq. We confirm the sensitivity of the V5 ChIP-seq by recovering 7/8 of well-studied Ubx-dependent cis-regulatory regions. Moreover, we show that Ubx binding is predictive of enhancer activity as suggested by comparison with a genome-scale resource of in vivo tested enhancer candidates. We observed densely clustered Ubx binding sites at 12 extended genomic loci that included ANTP-C, BX-C, Polycomb complex genes, and other regulators, and the clustered binding sites were frequently active enhancers. Furthermore, Ubx binding was detected at known Polycomb response elements (PREs) and was associated with significant enrichments of Pc and Pho ChIP signals in contrast to binding sites of other developmental TFs. Together, our results show that Ubx targets developmental regulators via strongly clustered binding sites and allow us to hypothesize that regulation by Ubx might involve Polycomb group proteins to maintain specific regulatory states in cooperative or mutually exclusive fashion, an attractive model that combines two groups of proteins with prominent gene regulatory roles during animal development. PMID:27575958

  9. Genome-Wide Ultrabithorax Binding Analysis Reveals Highly Targeted Genomic Loci at Developmental Regulators and a Potential Connection to Polycomb-Mediated Regulation.

    PubMed

    Shlyueva, Daria; Meireles-Filho, Antonio C A; Pagani, Michaela; Stark, Alexander

    2016-01-01

    Hox homeodomain transcription factors are key regulators of animal development. They specify the identity of segments along the anterior-posterior body axis in metazoans by controlling the expression of diverse downstream targets, including transcription factors and signaling pathway components. The Drosophila melanogaster Hox factor Ultrabithorax (Ubx) directs the development of thoracic and abdominal segments and appendages, and loss of Ubx function can lead, for example, to the transformation of third thoracic segment appendages (e.g. halteres) into second thoracic segment appendages (e.g. wings), resulting in a characteristic four-wing phenotype. Here we present a Drosophila melanogaster strain with a V5-epitope tagged Ubx allele, which we employed to obtain a high quality genome-wide map of Ubx binding sites using ChIP-seq. We confirm the sensitivity of the V5 ChIP-seq by recovering 7/8 of well-studied Ubx-dependent cis-regulatory regions. Moreover, we show that Ubx binding is predictive of enhancer activity as suggested by comparison with a genome-scale resource of in vivo tested enhancer candidates. We observed densely clustered Ubx binding sites at 12 extended genomic loci that included ANTP-C, BX-C, Polycomb complex genes, and other regulators, and the clustered binding sites were frequently active enhancers. Furthermore, Ubx binding was detected at known Polycomb response elements (PREs) and was associated with significant enrichments of Pc and Pho ChIP signals in contrast to binding sites of other developmental TFs. Together, our results show that Ubx targets developmental regulators via strongly clustered binding sites and allow us to hypothesize that regulation by Ubx might involve Polycomb group proteins to maintain specific regulatory states in cooperative or mutually exclusive fashion, an attractive model that combines two groups of proteins with prominent gene regulatory roles during animal development. PMID:27575958

  10. Genome-wide analysis of DNA methylation in hepatoblastoma tissues

    PubMed Central

    Cui, Ximao; Liu, Baihui; Zheng, Shan; Dong, Kuiran; Dong, Rui

    2016-01-01

    DNA methylation has a crucial role in cancer biology. In the present study, a genome-wide analysis of DNA methylation in hepatoblastoma (HB) tissues was performed to verify differential methylation levels between HB and normal tissues. As alpha-fetoprotein (AFP) has a critical role in HB, AFP methylation levels were also detected using pyrosequencing. Normal and HB liver tissue samples (frozen tissue) were obtained from patients with HB. Genome-wide analysis of DNA methylation in these tissues was performed using an Infinium HumanMethylation450 BeadChip, and the results were confirmed with reverse transcription-quantitative polymerase chain reaction. The Infinium HumanMethylation450 BeadChip demonstrated distinctively less methylation in HB tissues than in non-tumor tissues. In addition, methylation enrichment was observed in positions near the transcription start site of AFP, which exhibited lower methylation levels in HB tissues than in non-tumor liver tissues. Lastly, a significant negative correlation was observed between AFP messenger RNA expression and DNA methylation percentage, using linear Pearson's R correlation coefficients. The present results demonstrate differential methylation levels between HB and normal tissues, and imply that aberrant methylation of AFP in HB could reflect HB development. Expansion of these findings could provide useful insight into HB biology. PMID:27446465

  11. Genome-wide association analysis identifies three psoriasis susceptibility loci

    PubMed Central

    Stuart, Philip E.; Nair, Rajan P.; Ellinghaus, Eva; Ding, Jun; Tejasvi, Trilokraj; Gudjonsson, Johann E.; Li, Yun; Weidinger, Stephan; Eberlein, Bernadette; Gieger, Christian; Wichmann, H. Erich; Kunz, Manfred; Ike, Robert; Krueger, Gerald G.; Bowcock, Anne M.; Mroweitz, Ulrich; Lim, Henry W.; Voorhees, John J.; Abecasis, Goncalo R.; Weichenthal, Michael; Franke, Andre; Rahman, Proton; Gladman, Dafna D.; Elder, James T.

    2010-01-01

    To identify novel psoriasis susceptibility loci, we carried out a meta-analysis of two recent genome-wide association studies [1,2], yielding a discovery sample of 1,831 cases and 2,546 controls. 102 of the most promising loci in the discovery analysis were followed up in a three-stage replication study using 4,064 cases and 4,685 controls from Michigan, Toronto, Newfoundland, and Germany. Association at a genome-wide level of significance for the combined discovery and replication samples was found for three genomic regions. One contains NOS2 (rs4795067, p = 4 × 10^−11), another contains FBXL19 (rs10782001, p = 9 × 10^−10), and a third contains PSMA6 and NFKBIA (rs12586317, p = 2 × 10^−8). All three loci were also strongly associated with the subphenotypes of psoriatic arthritis and purely cutaneous psoriasis. Finally, we confirmed a recently identified [3] association signal near RNF114. PMID:20953189

  12. Advances in genome-wide DNA methylation analysis

    PubMed Central

    Gupta, Romi; Nagarajan, Arvindhan; Wajapeyee, Narendra

    2013-01-01

    The covalent DNA modification of cytosine at position 5 (5-methylcytosine; 5mC) has emerged as an important epigenetic mark most commonly present in the context of CpG dinucleotides in mammalian cells. In pluripotent stem cells and plants, it is also found in non-CpG and CpNpG contexts, respectively. 5mC has important implications in a diverse set of biological processes, including transcriptional regulation. Aberrant DNA methylation has been shown to be associated with a wide variety of human ailments and thus is the focus of active investigation. Methods used for detecting DNA methylation have revolutionized our understanding of this epigenetic mark and provided new insights into its role in diverse biological functions. Here we describe recent technological advances in genome-wide DNA methylation analysis and discuss their relative utility and drawbacks, providing specific examples from studies that have used these technologies for genome-wide DNA methylation analysis to address important biological questions. Finally, we discuss a newly identified covalent DNA modification, 5-hydroxymethylcytosine (5hmC), and speculate on its possible biological function, as well as describe a new methodology that can distinguish 5hmC from 5mC. PMID:20964631

  13. Signatures of positive selection in East African Shorthorn Zebu: a genome-wide SNP analysis

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The small East African Shorthorn Zebu is the main indigenous cattle across East Africa. A recent genome wide SNPs analysis has revealed their ancient stable African taurine x Asian zebu admixture. Here, we assess the presence of candidate signature of positive selection in their genome, with the aim...

  14. Genome-wide analysis of small nucleolar RNAs of Leishmania major reveals a rich repertoire of RNAs involved in modification and processing of rRNA.

    PubMed

    Eliaz, Dror; Doniger, Tirza; Tkacz, Itai Dov; Biswas, Viplov Kumar; Gupta, Sachin Kumar; Kolev, Nikolay G; Unger, Ron; Ullu, Elisabetta; Tschudi, Christian; Michaeli, Shulamit

    2015-01-01

    Trypanosomatids are protozoan parasites and the causative agents of infamous infectious diseases. These organisms regulate their gene expression mainly at the post-transcriptional level and possess characteristic RNA processing mechanisms. In this study, we analyzed the complete repertoire of Leishmania major small nucleolar RNAs (snoRNAs) by performing RNA-seq analysis on RNAs that were affinity-purified using the C/D snoRNA core protein, SNU13, and the H/ACA core protein, NHP2. This study revealed a large collection of C/D and H/ACA snoRNAs, organized in gene clusters generally containing both snoRNA types. Abundant snoRNAs were identified and predicted to guide trypanosome-specific rRNA cleavages. The repertoire of snoRNAs was compared to that of the closely related Trypanosoma brucei, and 80% of both C/D and H/ACA molecules were found to have functional homologues. The comparative analyses elucidated how snoRNAs evolved to generate molecules with analogous functions in both species. Interestingly, H/ACA RNAs have great flexibility in their ability to guide modifications, and several of the RNA species can guide more than one modification, compensating for the presence of single hairpin H/ACA snoRNA in these organisms. Placing the predicted modifications on the rRNA secondary structure revealed hypermodification regions mostly in domains which are modified in other eukaryotes, in addition to trypanosome-specific modifications. PMID:25970223

  15. Genome-wide Medicago truncatula small RNA analysis revealed novel microRNAs and isoforms differentially regulated in roots and nodules.

    PubMed

    Lelandais-Brière, Christine; Naya, Loreto; Sallet, Erika; Calenge, Fanny; Frugier, Florian; Hartmann, Caroline; Gouzy, Jérome; Crespi, Martin

    2009-09-01

    Posttranscriptional regulation of a variety of mRNAs by small 21- to 24-nucleotide RNAs, notably the microRNAs (miRNAs), is emerging as a novel developmental mechanism. In legumes like the model Medicago truncatula, roots are able to develop a de novo meristem through the symbiotic interaction with nitrogen-fixing rhizobia. We used deep sequencing of small RNAs from root apexes and nodules of M. truncatula to identify 100 novel candidate miRNAs encoded by 265 hairpin precursors. New atypical precursor classes producing only specific 21- and 24-nucleotide small RNAs were found. Statistical analysis of sequencing read abundance revealed specific miRNA isoforms in the same family showing contrasting expression patterns between nodules and root apexes. The differentially expressed conserved and nonconserved miRNAs may target a large variety of mRNAs. In root nodules, which show diverse cell types ranging from a persistent meristem to a fully differentiated central region, we discovered miRNAs spatially enriched in nodule meristematic tissues, vascular bundles, and bacterial infection zones using in situ hybridization. Spatial regulation of miRNAs may determine specialization of regulatory RNA networks in plant differentiation processes, such as root nodule formation. PMID:19767456

  16. Systems-Level Analysis of Genome-Wide Association Data

    PubMed Central

    Farber, Charles R.

    2013-01-01

    Genome-wide association studies (GWAS) have emerged as the method of choice for identifying common variants affecting complex disease. In a GWAS, particular attention is placed, for obvious reasons, on single-nucleotide polymorphisms (SNPs) that exceed stringent genome-wide significance thresholds. However, it is expected that many SNPs with only nominal evidence of association (e.g., P < 0.05) truly influence disease. Efforts to extract additional biological information from entire GWAS datasets have primarily focused on pathway-enrichment analyses. However, these methods suffer from a number of limitations and typically fail to lead to testable hypotheses. To evaluate alternative approaches, we performed a systems-level analysis of GWAS data using weighted gene coexpression network analysis. A weighted gene coexpression network was generated for 1918 genes harboring SNPs that displayed nominal evidence of association (P ≤ 0.05) from a GWAS of bone mineral density (BMD) using microarray data on circulating monocytes isolated from individuals with extremely low or high BMD. Thirteen distinct gene modules were identified, each comprising coexpressed and highly interconnected GWAS genes. Through the characterization of module content and topology, we illustrate how network analysis can be used to discover disease-associated subnetworks and characterize novel interactions for genes with a known role in the regulation of BMD. In addition, we provide evidence that network metrics can be used as a prioritizing tool when selecting genes and SNPs for replication studies. Our results highlight the advantages of using systems-level strategies to add value to and inform GWAS. PMID:23316444

  17. Development and application of a novel genome-wide SNP array reveals domestication history in soybean

    PubMed Central

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-01-01

    Domestication of soybeans occurred under intense human-directed selection aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied by a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean. PMID:26856884

  18. Development and application of a novel genome-wide SNP array reveals domestication history in soybean.

    PubMed

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-01-01

    Domestication of soybeans occurred under intense human-directed selection aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied by a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean. PMID:26856884

  19. Genome-Wide Transcriptome Analysis During Anthesis Reveals New Insights into the Molecular Basis of Heat Stress Responses in Tolerant and Sensitive Rice Varieties.

    PubMed

    González-Schain, Nahuel; Dreni, Ludovico; Lawas, Lovely M F; Galbiati, Massimo; Colombo, Lucia; Heuer, Sigrid; Jagadish, Krishna S V; Kater, Martin M

    2016-01-01

    Rice is one of the main food crops in the world. In the near future, yield is expected to be under pressure due to unfavorable climatic conditions, such as increasing temperatures. Therefore, improving rice germplasm in order to guarantee rice production under harsh environmental conditions is of top priority. Although many physiological studies have contributed to understanding heat responses during anthesis, the most heat-sensitive stage, molecular data are still largely lacking. In this study, an RNA-sequencing approach of heat- and control-treated reproductive tissues during anthesis was carried out using N22, one of the most heat-tolerant rice cultivars known to date. This analysis revealed that expression of genes encoding a number of transcription factor families, together with signal transduction and metabolic pathway genes, is repressed. On the other hand, expression of genes encoding heat shock factors and heat shock proteins was highly activated. Many of these genes are predominantly expressed at late stages of anther development. Further physiological experiments using heat-tolerant N22 and two sensitive cultivars suggest that reduced yield in heat-sensitive plants may be associated with poor pollen development or production in anthers prior to anthesis. In parallel, induction levels of a set of heat-responsive genes in these tissues correlated well with heat tolerance. Altogether, these findings suggest that proper expression of protective chaperones in anthers is needed before anthesis to overcome stress damage and to ensure fertilization. Genes putatively controlling this process were identified and are valuable candidates to consider for molecular breeding of highly productive heat-tolerant cultivars. PMID:26561535

  20. Comparative genome-wide analysis reveals that Burkholderia contaminans MS14 possesses multiple antimicrobial biosynthesis genes but not major genetic loci required for pathogenesis.

    PubMed

    Deng, Peng; Wang, Xiaoqiang; Baird, Sonya M; Showmaker, Kurt C; Smith, Leif; Peterson, Daniel G; Lu, Shien

    2016-06-01

    Burkholderia contaminans MS14 shows significant antimicrobial activities against plant and animal pathogenic fungi and bacteria. The antifungal agent occidiofungin produced by MS14 has great potential for development of biopesticides and pharmaceutical drugs. However, the use of Burkholderia species as biocontrol agents in agriculture is restricted due to the difficulties in distinguishing between plant growth-promoting bacteria and the pathogenic bacteria. The complete MS14 genome was sequenced and analyzed to find what beneficial and virulence-related genes it harbors. The phylogenetic relatedness of B. contaminans MS14 and 17 other Burkholderia species was also analyzed. To research MS14's potential virulence, the gene regions related to antibiotic production, antibiotic resistance, and virulence were compared between MS14 and other Burkholderia genomes. The genome of B. contaminans MS14 was sequenced and annotated. The genomic analyses reveal the presence of multiple gene sets for antimicrobial biosynthesis, which contribute to its antimicrobial activities. BLAST results indicate that the MS14 genome harbors a large number of unique regions. MS14 is closely related to another plant growth-promoting Burkholderia strain, B. lata 383, according to the average nucleotide identity data. Moreover, according to the phylogenetic analysis, plant growth-promoting species isolated from soils and mammalian pathogenic species are clustered together, respectively. MS14 has multiple antimicrobial activity-related genes identified from the genome, but it lacks key virulence-related gene loci found in the pathogenic strains. Additionally, plant growth-promoting Burkholderia species have one or more antimicrobial biosynthesis genes in their genomes as compared with nonplant growth-promoting soil-isolated Burkholderia species. On the other hand, pathogenic species harbor multiple virulence-associated gene loci that are not present in nonpathogenic Burkholderia species.
The MS14

  1. Genome-wide search for eliminylating domains reveals novel function for BLES03-like proteins.

    PubMed

    Khater, Shradha; Mohanty, Debasisa

    2014-08-01

    Bacterial phosphothreonine lyases catalyze a novel posttranslational modification involving formation of dehydrobutyrine/dehydroalanine by β elimination of the phosphate group of phosphothreonine or phosphoserine residues in their substrate proteins. Though there is experimental evidence for the presence of dehydro amino acids in human proteins, no eukaryotic homologs of these lyases have been identified as of today. A comprehensive genome-wide search for identifying phosphothreonine lyase homologs in eukaryotes was carried out. Our fold-based search revealed structural and catalytic site similarity between bacterial phosphothreonine lyases and BLES03 (basophilic leukemia-expressed protein 03), a human protein with unknown function. Ligand-induced conformational changes similar to those of bacterial phosphothreonine lyases, and movement of crucial arginines in the loop region to the catalytic pocket upon binding of phosphothreonine-containing peptides, were seen during docking and molecular dynamics studies. Genome-wide search for BLES03 homologs using sensitive profile-based methods revealed their presence not only in eukaryotic classes such as chordata and fungi but also in bacterial and archaebacterial classes. The synteny of these archaebacterial BLES03-like proteins was remarkably similar to that of type IV lantibiotic synthetases which harbor LanL-like phosphothreonine lyase domains. Hence, context-based analysis reinforced our earlier sequence/structure-based prediction of phosphothreonine lyase catalytic function for BLES03. Our in silico analysis has revealed that BLES03-like proteins with previously unknown function are novel eukaryotic phosphothreonine lyases involved in biosynthesis of dehydro amino acids, whereas their bacterial and archaebacterial counterparts might be involved in biosynthesis of natural products similar to lantibiotics. PMID:25062915

  2. Genome-Wide Small RNA Analysis of Soybean Reveals Auxin-Responsive microRNAs that are Differentially Expressed in Response to Salt Stress in Root Apex

    PubMed Central

    Sun, Zhengxi; Wang, Youning; Mou, Fupeng; Tian, Yinping; Chen, Liang; Zhang, Senlei; Jiang, Qiong; Li, Xia

    2016-01-01

    Root growth and the architecture of the root system in Arabidopsis are largely determined by root meristematic activity. Legume roots show strong developmental plasticity in response to both abiotic and biotic stimuli, including symbiotic rhizobia. However, a global analysis of gene regulation in the root meristem of soybean plants is lacking. In this study, we performed a global analysis of the small RNA transcriptome of root tips from soybean seedlings grown under normal and salt stress conditions. In total, 71 miRNA candidates, including known and novel variants of 59 miRNA families, were identified. We found 66 salt-responsive miRNAs in the soybean root meristem; among them, 22 are novel miRNAs. Interestingly, we found auxin-responsive cis-elements in the promoters of many salt-responsive miRNAs, implying that these miRNAs may be regulated by auxin and auxin signaling plays a key role in regulating the plasticity of the miRNAome and root development in soybean. A functional analysis of miR399, a salt-responsive miRNA in the root meristem, indicates the crucial role of this miRNA in modulating soybean root developmental plasticity. Our data provide novel insight into the miRNAome-mediated regulatory mechanism in soybean root growth under salt stress. PMID:26834773

  3. Genome-wide analysis on Chlamydomonas reinhardtii reveals the impact of hydrogen peroxide on protein stress responses and overlap with other stress transcriptomes.

    PubMed

    Blaby, Ian K; Blaby-Haas, Crysten E; Pérez-Pérez, María Esther; Schmollinger, Stefan; Fitz-Gibbon, Sorel; Lemaire, Stéphane D; Merchant, Sabeeha S

    2015-12-01

    Reactive oxygen species (ROS) are produced by and have the potential to be damaging to all aerobic organisms. In photosynthetic organisms, they are an unavoidable byproduct of electron transfer in both the chloroplast and mitochondrion. Here, we employ the reference unicellular green alga Chlamydomonas reinhardtii to identify the effect of H2O2 on gene expression by monitoring the changes in the transcriptome in a time-course experiment. Comparison of transcriptomes from cells sampled immediately prior to the addition of H2O2 and 0.5 and 1 h subsequently revealed 1278 differentially abundant transcripts. Of those transcripts that increase in abundance, many encode proteins involved in ROS detoxification, protein degradation and stress responses, whereas among those that decrease are transcripts encoding proteins involved in photosynthesis and central carbon metabolism. In addition to these transcriptomic adjustments, we observe that addition of H2O2 is followed by an accumulation and oxidation of the total intracellular glutathione pool, and a decrease in photosynthetic O2 output. Additionally, we analyze our transcriptomes in the context of changes in transcript abundance in response to singlet O2 (O2*), and relate our H2O2-induced transcripts to a diurnal transcriptome, where we demonstrate enrichments of H2O2-induced transcripts early in the light phase, late in the light phase and 2 h prior to light. On this basis several genes that are highlighted in this work may be involved in previously undiscovered stress remediation pathways or acclimation responses. PMID:26473430

  4. Genome-Wide Transcriptome Analysis Reveals that Cadmium Stress Signaling Controls the Expression of Genes in Drought Stress Signal Pathways in Rice

    PubMed Central

    Oono, Youko; Yazawa, Takayuki; Kawahara, Yoshihiro; Kanamori, Hiroyuki; Kobayashi, Fuminori; Sasaki, Harumi; Mori, Satomi; Wu, Jianzhong; Handa, Hirokazu; Itoh, Takeshi; Matsumoto, Takashi

    2014-01-01

    Plant growth is severely affected by toxic concentrations of the non-essential heavy metal cadmium (Cd). Comprehensive transcriptome analysis by RNA-Seq following cadmium exposure is required to further understand plant responses to Cd and facilitate future systems-based analyses of the underlying regulatory networks. In this study, rice plants were hydroponically treated with 50 µM Cd for 24 hours and ∼60,000 expressed transcripts, including transcripts that could not be characterized by microarray-based approaches, were evaluated. Upregulation of various ROS-scavenging enzymes, chelators and metal transporters demonstrated the appropriate expression profiles to Cd exposure. Gene Ontology enrichment analysis of the responsive transcripts indicated the upregulation of many drought stress-related genes under Cd exposure. Further investigation into the expression of drought stress marker genes such as DREB suggested that expression of genes in several drought stress signal pathways was activated under Cd exposure. Furthermore, qRT-PCR analyses of randomly selected Cd-responsive metal transporter transcripts under various metal ion stresses suggested that the expression of Cd-responsive transcripts might be easily affected by other ions. Our transcriptome analysis demonstrated a new transcriptional network linking Cd and drought stresses in rice. Considering our data and that Cd is a non-essential metal, the network underlying Cd stress responses and tolerance, which plants have developed to adapt to other stresses, could help to acclimate to Cd exposure. Our examination of this transcriptional network provides useful information for further studies of the molecular mechanisms of plant adaptation to Cd exposure and the improvement of tolerance in crop species. PMID:24816929

  5. Genome-wide array-CGH analysis reveals YRF1 gene copy number variation that modulates genetic stability in distillery yeasts

    PubMed Central

    Adamczyk, Jagoda; Kwiatkowska, Aleksandra; Rawska, Ewa; Skoneczna, Adrianna

    2015-01-01

    Industrial yeasts, economically important microorganisms, are widely used in diverse biotechnological processes including brewing, winemaking and distilling. In contrast to a well-established genome of brewer's and wine yeast strains, the comprehensive evaluation of genomic features of distillery strains is lacking. In the present study, twenty-two distillery yeast strains were subjected to electrophoretic karyotyping and array-based comparative genomic hybridization (array-CGH). The strains analyzed were assigned to the Saccharomyces sensu stricto complex and grouped into four species categories: S. bayanus, S. paradoxus, S. cerevisiae and S. kudriavzevii. The genomic diversity was mainly revealed within subtelomeric regions and the losses and/or gains of fragments of chromosomes I, III, VI and IX were the most frequently observed. Statistically significant differences in the gene copy number were documented in six functional gene categories: 1) telomere maintenance via recombination, DNA helicase activity or DNA binding; 2) maltose metabolism process, glucose transmembrane transporter activity; 3) asparagine catabolism, cellular response to nitrogen starvation, localized in cell wall-bounded periplasmic space; 4) siderophore transport; 5) response to copper ion, cadmium ion binding; and 6) L-iditol 2-dehydrogenase activity. The losses of YRF1 genes (Y' element ATP-dependent helicase) were accompanied by a decreased level of Y' sequences and an increase in DNA double and single strand breaks, and oxidative DNA damage in the S. paradoxus group compared to the S. bayanus group. We postulate that naturally occurring diversity in the YRF1 gene copy number may promote genetic stability in the S. bayanus group of distillery yeast strains. PMID:26384347

  6. Genome-wide array-CGH analysis reveals YRF1 gene copy number variation that modulates genetic stability in distillery yeasts.

    PubMed

    Deregowska, Anna; Skoneczny, Marek; Adamczyk, Jagoda; Kwiatkowska, Aleksandra; Rawska, Ewa; Skoneczna, Adrianna; Lewinska, Anna; Wnuk, Maciej

    2015-10-13

    Industrial yeasts, economically important microorganisms, are widely used in diverse biotechnological processes including brewing, winemaking and distilling. In contrast to a well-established genome of brewer's and wine yeast strains, the comprehensive evaluation of genomic features of distillery strains is lacking. In the present study, twenty-two distillery yeast strains were subjected to electrophoretic karyotyping and array-based comparative genomic hybridization (array-CGH). The strains analyzed were assigned to the Saccharomyces sensu stricto complex and grouped into four species categories: S. bayanus, S. paradoxus, S. cerevisiae and S. kudriavzevii. The genomic diversity was mainly revealed within subtelomeric regions and the losses and/or gains of fragments of chromosomes I, III, VI and IX were the most frequently observed. Statistically significant differences in the gene copy number were documented in six functional gene categories: 1) telomere maintenance via recombination, DNA helicase activity or DNA binding; 2) maltose metabolism process, glucose transmembrane transporter activity; 3) asparagine catabolism, cellular response to nitrogen starvation, localized in cell wall-bounded periplasmic space; 4) siderophore transport; 5) response to copper ion, cadmium ion binding; and 6) L-iditol 2-dehydrogenase activity. The losses of YRF1 genes (Y' element ATP-dependent helicase) were accompanied by a decreased level of Y' sequences and an increase in DNA double and single strand breaks, and oxidative DNA damage in the S. paradoxus group compared to the S. bayanus group. We postulate that naturally occurring diversity in the YRF1 gene copy number may promote genetic stability in the S. bayanus group of distillery yeast strains. PMID:26384347

  7. Genome-wide comparative analysis of the Brassica rapa gene space reveals genome shrinkage and differential loss of duplicated genes after whole genome triplication

    PubMed Central

    2009-01-01

    Background: Brassica rapa is one of the most economically important vegetable crops worldwide. Owing to its agronomic importance and phylogenetic position, B. rapa provides a crucial reference to understand polyploidy-related crop genome evolution. The high degree of sequence identity and remarkably conserved genome structure between Arabidopsis and Brassica genomes enables comparative tiling sequencing using Arabidopsis sequences as references to select the counterpart regions in B. rapa, which is a strong challenge of structural and comparative crop genomics. Results: We assembled 65.8 megabase-pairs of non-redundant euchromatic sequence of B. rapa and compared this sequence to the Arabidopsis genome to investigate chromosomal relationships, macrosynteny blocks, and microsynteny within blocks. The triplicated B. rapa genome contains only approximately twice the number of genes as in Arabidopsis because of genome shrinkage. Genome comparisons suggest that B. rapa has a distinct organization of ancestral genome blocks as a result of recent whole genome triplication followed by a unique diploidization process. A lack of the most recent whole genome duplication (3R) event in the B. rapa genome, atypical of other Brassica genomes, may account for the emergence of B. rapa from the Brassica progenitor around 8 million years ago. Conclusions: This work demonstrates the potential of using comparative tiling sequencing for genome analysis of crop species. Based on a comparative analysis of the B. rapa sequences and the Arabidopsis genome, it appears that polyploidy and chromosomal diploidization are ongoing processes that collectively stabilize the B. rapa genome and facilitate its evolution. PMID:19821981

  8. Genome-wide analysis of AP2/ERF transcription factors in carrot (Daucus carota L.) reveals evolution and expression profiles under abiotic stress.

    PubMed

    Li, Meng-Yao; Xu, Zhi-Sheng; Huang, Ying; Tian, Chang; Wang, Feng; Xiong, Ai-Sheng

    2015-12-01

    AP2/ERF is a large transcription factor family that regulates plant physiological processes, such as plant development and stress response. Carrot (Daucus carota L.) is an economically important crop with a genome size of 480 Mb; the draft genome sequencing of this crop has been completed by our group. However, little is known about the AP2/ERF factors in carrot. In this study, a total of 267 putative AP2/ERF factors were identified from the whole-genome sequence of carrot. These AP2/ERF proteins were phylogenetically clustered into five subfamilies based on their similarity to the amino acid sequences from Arabidopsis. The distribution and comparative genome analysis of the AP2/ERF factors among plants showed that the AP2/ERF family expanded during the evolutionary process and that the AP2 domain was highly conserved during evolution. The number of AP2/ERF factors in land plants expanded during their evolution. A total of 60 orthologous and 145 coorthologous AP2/ERF gene pairs between carrot and Arabidopsis were identified, and the interaction network of orthologous genes was constructed. The expression patterns of eight AP2/ERF family genes from each subfamily (DREB, ERF, AP2, and RAV) were related to abiotic stresses. Yeast one-hybrid and β-galactosidase activity assays confirmed the DRE and GCC box-binding activities of DREB subfamily genes. This study is the first to identify and characterize the AP2/ERF transcription factors in carrot using whole-genome analysis, and the findings may serve as references for future functional research on the transcription factors in carrot. PMID:25971861

  9. metaseq: a Python package for integrative genome-wide analysis reveals relationships between chromatin insulators and associated nuclear mRNA

    PubMed Central

    Dale, Ryan K.; Matzat, Leah H.; Lei, Elissa P.

    2014-01-01

    Here we introduce metaseq, a software library written in Python, which enables loading multiple genomic data formats into standard Python data structures and allows flexible, customized manipulation and visualization of data from high-throughput sequencing studies. We demonstrate its practical use by analyzing multiple datasets related to chromatin insulators, which are DNA–protein complexes proposed to organize the genome into distinct transcriptional domains. Recent studies in Drosophila and mammals have implicated RNA in the regulation of chromatin insulator activities. Moreover, the Drosophila RNA-binding protein Shep has been shown to antagonize gypsy insulator activity in a tissue-specific manner, but the precise role of RNA in this process remains unclear. Better understanding of chromatin insulator regulation requires integration of multiple datasets, including those from chromatin-binding, RNA-binding, and gene expression experiments. We use metaseq to integrate RIP- and ChIP-seq data for Shep and the core gypsy insulator protein Su(Hw) in two different cell types, along with publicly available ChIP-chip and RNA-seq data. Based on the metaseq-enabled analysis presented here, we propose a model where Shep associates with chromatin cotranscriptionally, then is recruited to insulator complexes in trans where it plays a negative role in insulator activity. PMID:25063299

  10. Integrative effect of drought and low temperature on litchi (Litchi chinensis Sonn.) floral initiation revealed by dynamic genome-wide transcriptome analysis.

    PubMed

    Shen, Jiyuan; Xiao, Qiusheng; Qiu, Haiji; Chen, Chengjie; Chen, Houbin

    2016-01-01

    Floral induction in litchi is influenced by multiple environment cues including temperature and soil water condition. In the present study, we determined that a combined treatment consisting of 14-day drought imposed prior to exposure to 35-day low temperature (T3) significantly promoted litchi flowering relative to the low temperature alone (T2), suggesting an integrative effect of drought and low temperature on litchi floral initiation. Analysis of transcriptomic changes in leaves from different treatments showed that 2,198 and 4,407 unigenes were differentially expressed in response to drought and low temperature, respectively. 1,227 of these unigenes were expressed in response to both treatments, implying an interaction of drought and low temperature on expression of genes involved in litchi floral initiation. Additionally, 932 unigenes were consistently differentially expressed during floral induction between T2 and T3 plants, which potentially accounts for the difference of flowering time. Thirty-eight transcription factors out of these 932 unigenes were identified as hub genes with central roles in regulation of litchi floral induction. The expression of litchi homologs of well-known flowering genes was also investigated, and one Flowering Locus T (FT) homolog may play a crucial role in litchi flowering in responses to drought and low temperature. PMID:27557749
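    The unigene counts reported in this abstract can be broken down with simple inclusion-exclusion. The sketch below is only an illustration: it assumes the 1,227 shared unigenes are counted within both the drought (2,198) and low-temperature (4,407) totals, which the abstract implies but does not state outright, and the function name `partition_counts` is hypothetical.

```python
# Counts taken from the abstract; the subset relationship is an assumption.
DROUGHT = 2198    # unigenes responding to drought
LOW_TEMP = 4407   # unigenes responding to low temperature
SHARED = 1227     # unigenes responding to both treatments


def partition_counts(a: int, b: int, shared: int) -> dict:
    """Split two overlapping set sizes into exclusive parts (inclusion-exclusion)."""
    if shared > min(a, b):
        raise ValueError("shared count cannot exceed either set size")
    return {
        "a_only": a - shared,
        "b_only": b - shared,
        "shared": shared,
        "union": a + b - shared,
    }


counts = partition_counts(DROUGHT, LOW_TEMP, SHARED)
print(counts)
```

    Under that assumption, the three exclusive groups add up to the total number of distinct responsive unigenes.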

  11. Genome-wide gene expression profiling analysis of Leishmania major and Leishmania infantum developmental stages reveals substantial differences between the two species

    PubMed Central

    Rochette, Annie; Raymond, Frédéric; Ubeda, Jean-Michel; Smith, Martin; Messier, Nadine; Boisvert, Sébastien; Rigault, Philippe; Corbeil, Jacques; Ouellette, Marc; Papadopoulou, Barbara

    2008-01-01

    Background: Leishmania parasites cause a diverse spectrum of diseases in humans ranging from spontaneously healing skin lesions (e.g., L. major) to life-threatening visceral diseases (e.g., L. infantum). The high conservation in gene content and genome organization between Leishmania major and Leishmania infantum contrasts their distinct pathophysiologies, suggesting that highly regulated hierarchical and temporal changes in gene expression may be involved. Results: We used a multispecies DNA oligonucleotide microarray to compare whole-genome expression patterns of promastigote (sandfly vector) and amastigote (mammalian macrophages) developmental stages between L. major and L. infantum. Seven per cent of the total L. infantum genome and 9.3% of the L. major genome were differentially expressed at the RNA level throughout development. The main variations were found in genes involved in metabolism, cellular organization and biogenesis, transport and genes encoding unknown function. Remarkably, this comparative global interspecies analysis demonstrated that only 10–12% of the differentially expressed genes were common to L. major and L. infantum. Differentially expressed genes are randomly distributed across chromosomes further supporting a posttranscriptional control, which is likely to involve a variety of 3'UTR elements. Conclusion: This study highlighted substantial differences in gene expression patterns between L. major and L. infantum. These important species-specific differences in stage-regulated gene expression may contribute to the disease tropism that distinguishes L. major from L. infantum. PMID:18510761

  12. Genome-wide analysis of gene expression during adipogenesis in human adipose-derived stromal cells reveals novel patterns of gene expression during adipocyte differentiation.

    PubMed

    Ambele, Melvin Anyasi; Dessels, Carla; Durandt, Chrisna; Pepper, Michael Sean

    2016-05-01

    We have undertaken an in-depth transcriptome analysis of adipogenesis in human adipose-derived stromal cells (ASCs) induced to differentiate into adipocytes in vitro. Gene expression was assessed on days 1, 7, 14 and 21 post-induction and genes differentially expressed numbered 128, 218, 253 and 240 respectively. Up-regulated genes were associated with blood vessel development, leukocyte migration, as well as tumor growth, invasion and metastasis. They also shared common pathways with certain obesity-related pathophysiological conditions. Down-regulated genes were enriched for immune response processes. KLF15, LMO3, FOXO1 and ZBTB16 transcription factors were up-regulated throughout the differentiation process. CEBPA, PPARG, ZNF117, MLXIPL, MMP3 and RORB were up-regulated only on days 14 and 21, which coincide with the maturation of adipocytes and could possibly serve as candidates for controlling fat accumulation and the size of mature adipocytes. In summary, we have identified genes that were up-regulated only on days 1 and 7 or days 14 and 21 that could serve as potential early and late-stage differentiation markers. PMID:27108396

  13. Genome-Wide Analysis of Stowaway-Like MITEs in Wheat Reveals High Sequence Conservation, Gene Association, and Genomic Diversification

    PubMed Central

    Yaakov, Beery; Ben-David, Smadar; Kashkush, Khalil

    2013-01-01

    The diversity and evolution of wheat (Triticum-Aegilops group) genomes is determined, in part, by the activity of transposable elements that constitute a large fraction of the genome (up to 90%). In this study, we retrieved sequences from publicly available wheat databases, including a 454-pyrosequencing database, and analyzed 18,217 insertions of 18 Stowaway-like miniature inverted-repeat transposable element (MITE) families previously characterized in wheat that together account for approximately 1.3 Mb of sequence. All 18 families showed high conservation in length, sequence, and target site preference. Furthermore, approximately 55% of the elements were inserted in transcribed regions, into or near known wheat genes. Notably, we observed significant correlation between the mean length of the MITEs and their copy number. In addition, the genomic composition of nine MITE families was studied by real-time quantitative polymerase chain reaction analysis in 40 accessions of Triticum spp. and Aegilops spp., including diploids, tetraploids, and hexaploids. The quantitative polymerase chain reaction data showed massive and significant intraspecific and interspecific variation as well as genome-specific proliferation and nonadditive quantities in the polyploids. We also observed significant differences in the methylation status of the insertion sites among MITE families. Our data thus suggest a possible role for MITEs in generating genome diversification and in the establishment of nascent polyploid species in wheat. PMID:23104862

  14. Genome-Wide Analysis of NBS-LRR Genes in Sorghum Genome Revealed Several Events Contributing to NBS-LRR Gene Evolution in Grass Species

    PubMed Central

    Yang, Xiping; Wang, Jianping

    2016-01-01

    The nucleotide-binding site (NBS)–leucine-rich repeat (LRR) gene family is crucially important for offering resistance to pathogens. To explore evolutionary conservation and variability of NBS-LRR genes across grass species, we identified 88, 107, 24, and 44 full-length NBS-LRR genes in sorghum, rice, maize, and Brachypodium, respectively. A comprehensive analysis was performed on classification, genome organization, evolution, expression, and regulation of these NBS-LRR genes using sorghum as a representative of grass species. In general, the full-length NBS-LRR genes are highly clustered and duplicated in sorghum genome mainly due to local duplications. NBS-LRR genes have basal expression levels and are highly potentially targeted by miRNA. The number of NBS-LRR genes in the four grass species is positively correlated with the gene clustering rate. The results provided a valuable genomic resource and insights for functional and evolutionary studies of NBS-LRR genes in grass species. PMID:26792976

  15. Integrative effect of drought and low temperature on litchi (Litchi chinensis Sonn.) floral initiation revealed by dynamic genome-wide transcriptome analysis

    PubMed Central

    Shen, Jiyuan; Xiao, Qiusheng; Qiu, Haiji; Chen, Chengjie; Chen, Houbin

    2016-01-01

    Floral induction in litchi is influenced by multiple environment cues including temperature and soil water condition. In the present study, we determined that a combined treatment consisting of 14-day drought imposed prior to exposure to 35-day low temperature (T3) significantly promoted litchi flowering relative to the low temperature alone (T2), suggesting an integrative effect of drought and low temperature on litchi floral initiation. Analysis of transcriptomic changes in leaves from different treatments showed that 2,198 and 4,407 unigenes were differentially expressed in response to drought and low temperature, respectively. 1,227 of these unigenes were expressed in response to both treatments, implying an interaction of drought and low temperature on expression of genes involved in litchi floral initiation. Additionally, 932 unigenes were consistently differentially expressed during floral induction between T2 and T3 plants, which potentially accounts for the difference of flowering time. Thirty-eight transcription factors out of these 932 unigenes were identified as hub genes with central roles in regulation of litchi floral induction. The expression of litchi homologs of well-known flowering genes was also investigated, and one Flowering Locus T (FT) homolog may play a crucial role in litchi flowering in responses to drought and low temperature. PMID:27557749

  16. Aspergillus niger genome-wide analysis reveals a large number of novel alpha-glucan acting enzymes with unexpected expression profiles.

    PubMed

    Yuan, Xiao-Lian; van der Kaaij, Rachel M; van den Hondel, Cees A M J J; Punt, Peter J; van der Maarel, Marc J E C; Dijkhuizen, Lubbert; Ram, Arthur F J

    2008-06-01

    The filamentous ascomycete Aspergillus niger is well known for its ability to produce a large variety of enzymes for the degradation of plant polysaccharide material. A major carbon and energy source for this soil fungus is starch, which can be degraded by the concerted action of alpha-amylase, glucoamylase and alpha-glucosidase enzymes, members of the glycoside hydrolase (GH) families 13, 15 and 31, respectively. In this study we have combined analysis of the genome sequence of A. niger CBS 513.88 with microarray experiments to identify novel enzymes from these families and to predict their physiological functions. We have identified 17 previously unknown family GH13, 15 and 31 enzymes in the A. niger genome, all of which have orthologues in other aspergilli. Only two of the newly identified enzymes, a putative alpha-glucosidase (AgdB) and an alpha-amylase (AmyC), were predicted to play a role in starch degradation. The expression of the majority of the genes identified was not induced by maltose as carbon source, and not dependent on the presence of AmyR, the transcriptional regulator for starch degrading enzymes. The possible physiological functions of the other predicted family GH13, GH15 and GH31 enzymes, including intracellular enzymes and cell wall associated proteins, in alternative alpha-glucan modifying processes are discussed. PMID:18320228

  17. Genome-wide identification of citrus ATP-citrate lyase genes and their transcript analysis in fruits reveals their possible role in citrate utilization.

    PubMed

    Hu, Xiao-Mei; Shi, Cai-Yun; Liu, Xiao; Jin, Long-Fei; Liu, Yong-Zhong; Peng, Shu-Ang

    2015-02-01

    ATP-citrate lyase (ACL, EC 4.1.3.8) catalyzes the cleavage of citrate to oxaloacetate and acetyl-CoA in the cell cytosol, and has important roles in normal plant growth and in the biosynthesis of some secondary metabolites. We identified three ACL genes, CitACLα1, CitACLα2, and CitACLβ1, in the citrus genome database. Both CitACLα1 and CitACLα2 encode putative ACL α subunits with 82.5% amino acid identity, whereas CitACLβ1 encodes a putative ACL β subunit. Gene structure analysis showed that CitACLα1 and CitACLα2 had 12 exons and 11 introns, and CitACLβ1 had 16 exons and 15 introns. CitACLα1 and CitACLβ1 were predominantly expressed in flower, and CitACLα2 was predominantly expressed in stem and fibrous roots. As fruits ripen, the transcript levels of CitACLα1, CitACLβ1, and/or CitACLα2 in cultivars 'Niuher' and 'Owari' increased, accompanied by significant decreases in citrate content, while their transcript levels decreased significantly in 'Egan No. 1' and 'Iyokan', although citrate content also decreased. In 'HB pummelo', in which acid content increased as fruit ripened, and in acid-free pummelo, transcript levels of CitACLα2, CitACLβ1, and/or CitACLα1 increased. Moreover, mild drought stress and ABA treatment significantly increased citrate contents in fruits. Transcript levels of the three genes were significantly reduced by mild drought stress, and the transcript level of only CitACLβ1 was significantly reduced by ABA treatment. Taken together, these data indicate that the effects of ACL on citrate use during fruit ripening depend on the cultivar, and the reduction in ACL gene expression may be attributed to citrate increases under mild drought stress or ABA treatment. PMID:25120169

  18. Homozygous loss of ADAM3A revealed by genome-wide analysis of pediatric high-grade glioma and diffuse intrinsic pontine gliomas

    PubMed Central

    Barrow, Jennifer; Adamowicz-Brice, Martyna; Cartmill, Maria; MacArthur, Donald; Lowe, James; Robson, Keith; Brundler, Marie-Anne; Walker, David A.; Coyle, Beth; Grundy, Richard

    2011-01-01

    Overall, pediatric high-grade glioma (pHGG) has a poor prognosis, in part due to the lack of understanding of the underlying biology. High-resolution 244 K oligo array comparative genomic hybridization (CGH) was used to analyze DNA from 38 formalin-fixed paraffin-embedded predominantly pretreatment pHGG samples, including 13 diffuse intrinsic pontine gliomas (DIPGs). The patterns of gains and losses were distinct from those seen in HGG arising in adults. In particular, we found 1q gain in up to 27% of our cohort compared with 9% reported in adults. A total of 13% had a balanced genetic profile with no large-scale copy number alterations. Homozygous loss at 8p12 was seen in 6 of 38 (16%) cases of pHGG. This novel deletion, which includes the ADAM3A gene, was confirmed by quantitative real-time PCR (qPCR). Loss of CDKN2A/CDKN2B in 4 of 38 (10%) samples by oligo array CGH was confirmed by fluorescent in situ hybridization on tissue microarrays and was restricted to supratentorial tumors. Only ∼50% of supratentorial tumors were positive for CDKN2B expression by immunohistochemistry (IHC), while ∼75% of infratentorial tumors were positive for CDKN2B expression (P = 0.03). Amplification of the 4q11–13 region was detected in 8% of cases and included PDGFRA and KIT, and subsequent qPCR analysis was consistent with the amplification of PDGFRA. MYCN amplification was seen in 5% of samples, being significantly associated with anaplastic astrocytomas (P = 0.03). Overall, DIPG shared a similar spectrum of changes with supratentorial HGG with some notable differences, including high-frequency loss of 17p and 14q and lack of CDKN2A/CDKN2B deletion. Informative genetic data providing insight into the underlying biology and potential therapeutic possibilities can be generated from archival tissue and typically small biopsies from DIPG. Our findings highlight the importance of obtaining pretreatment samples. PMID:21138945

  19. Comparative analysis of methods for genome-wide nucleosome cartography.

    PubMed

    Quintales, Luis; Vázquez, Enrique; Antequera, Francisco

    2015-07-01

    Nucleosomes contribute to compacting the genome into the nucleus and regulate the physical access of regulatory proteins to DNA either directly or through the epigenetic modifications of the histone tails. Precise mapping of nucleosome positioning across the genome is, therefore, essential to understanding the genome regulation. In recent years, several experimental protocols have been developed for this purpose that include the enzymatic digestion, chemical cleavage or immunoprecipitation of chromatin followed by next-generation sequencing of the resulting DNA fragments. Here, we compare the performance and resolution of these methods from the initial biochemical steps through the alignment of the millions of short-sequence reads to a reference genome to the final computational analysis to generate genome-wide maps of nucleosome occupancy. Because of the lack of a unified protocol to process data sets obtained through the different approaches, we have developed a new computational tool (NUCwave), which facilitates their analysis, comparison and assessment and will enable researchers to choose the most suitable method for any particular purpose. NUCwave is freely available at http://nucleosome.usal.es/nucwave along with a step-by-step protocol for its use. PMID:25296770

  20. Genome-wide transcriptome analysis of human epidermal melanocytes

    PubMed Central

    Haltaufderhyde, Kirk D.; Oancea, Elena

    2015-01-01

    Because human epidermal melanocytes (HEMs) provide critical protection against skin cancer, sunburn, and photoaging, a genome-wide perspective of gene expression in these cells is vital to understanding human skin physiology. In this study we performed high throughput sequencing of HEMs to obtain a complete data set of transcript sizes, abundances, and splicing. As expected, we found that melanocyte specific genes that function in pigmentation were among the highest expressed genes. We analyzed receptor, ion channel and transcription factor gene families to get a better understanding of the cell signalling pathways used by melanocytes. We also performed a comparative transcriptomic analysis of lightly versus darkly pigmented HEMs and found 16 genes differentially expressed in the two pigmentation phenotypes; of those, only one putative melanosomal transporter (SLC45A2) has known function in pigmentation. In addition, we found 166 genes with splice isoforms expressed exclusively in one pigmentation phenotype, 17 of which are genes involved in signal transduction. Our melanocyte transcriptome study provides a comprehensive view and may help identify novel pigmentation genes and potential pharmacological targets. PMID:25451175

  1. Genome-wide view of genetic diversity reveals paths of selection and cultivar differentiation in peach domestication.

    PubMed

    Akagi, Takashi; Hanada, Toshio; Yaegaki, Hideaki; Gradziel, Thomas M; Tao, Ryutaro

    2016-06-01

    Domestication and cultivar differentiation are requisite processes for establishing cultivated crops. These processes inherently involve substantial changes in population structure, including those from artificial selection of key genes. In this study, accessions of peach (Prunus persica) and its wild relatives were analysed genome-wide to identify changes in genetic structures and gene selections associated with their differentiation. Analysis of genome-wide informative single-nucleotide polymorphism loci revealed distinct changes in genetic structures and delineations among domesticated peach and its wild relatives and among peach landraces and modern fruit (F) and modern ornamental (O-A) cultivars. Indications of distinct changes in linkage disequilibrium extension/decay and of strong population bottlenecks or inbreeding were identified. Site frequency spectrum- and extended haplotype homozygosity-based evaluation of genome-wide genetic diversities supported selective sweeps distinguishing the domesticated peach from its wild relatives and each F/O-A cluster from the landrace clusters. The regions with strong selective sweeps harboured promising candidates for genes subjected to selection. Further sequence-based evaluation further defined the candidates and revealed their characteristics. All results suggest opportunities for identifying critical genes associated with each differentiation by analysing genome-wide genetic diversity in currently established populations. This approach obviates the special development of genetic populations, which is particularly difficult for long-lived tree crops. PMID:27085183

  2. Genome-wide view of genetic diversity reveals paths of selection and cultivar differentiation in peach domestication

    PubMed Central

    Akagi, Takashi; Hanada, Toshio; Yaegaki, Hideaki; Gradziel, Thomas M.; Tao, Ryutaro

    2016-01-01

    Domestication and cultivar differentiation are requisite processes for establishing cultivated crops. These processes inherently involve substantial changes in population structure, including those from artificial selection of key genes. In this study, accessions of peach (Prunus persica) and its wild relatives were analysed genome-wide to identify changes in genetic structures and gene selections associated with their differentiation. Analysis of genome-wide informative single-nucleotide polymorphism loci revealed distinct changes in genetic structures and delineations among domesticated peach and its wild relatives and among peach landraces and modern fruit (F) and modern ornamental (O-A) cultivars. Indications of distinct changes in linkage disequilibrium extension/decay and of strong population bottlenecks or inbreeding were identified. Site frequency spectrum- and extended haplotype homozygosity-based evaluation of genome-wide genetic diversities supported selective sweeps distinguishing the domesticated peach from its wild relatives and each F/O-A cluster from the landrace clusters. The regions with strong selective sweeps harboured promising candidates for genes subjected to selection. Further sequence-based evaluation further defined the candidates and revealed their characteristics. All results suggest opportunities for identifying critical genes associated with each differentiation by analysing genome-wide genetic diversity in currently established populations. This approach obviates the special development of genetic populations, which is particularly difficult for long-lived tree crops. PMID:27085183

  3. Improved Statistics for Genome-Wide Interaction Analysis

    PubMed Central

    Ueki, Masao; Cordell, Heather J.

    2012-01-01

    Recently, Wu and colleagues [1] proposed two novel statistics for genome-wide interaction analysis using case/control or case-only data. In computer simulations, their proposed case/control statistic outperformed competing approaches, including the fast-epistasis option in PLINK and logistic regression analysis under the correct model; however, reasons for its superior performance were not fully explored. Here we investigate the theoretical properties and performance of Wu et al.'s proposed statistics and explain why, in some circumstances, they outperform competing approaches. Unfortunately, we find minor errors in the formulae for their statistics, resulting in tests that have higher than nominal type 1 error. We also find minor errors in PLINK's fast-epistasis and case-only statistics, although theory and simulations suggest that these errors have only negligible effect on type 1 error. We propose adjusted versions of all four statistics that, both theoretically and in computer simulations, maintain correct type 1 error rates under the null hypothesis. We also investigate statistics based on correlation coefficients that maintain similar control of type 1 error. Although designed to test specifically for interaction, we show that some of these previously-proposed statistics can, in fact, be sensitive to main effects at one or both loci, particularly in the presence of linkage disequilibrium. We propose two new “joint effects” statistics that, provided the disease is rare, are sensitive only to genuine interaction effects. In computer simulations we find, in most situations considered, that highest power is achieved by analysis under the correct genetic model. Such an analysis is unachievable in practice, as we do not know this model. However, generally high power over a wide range of scenarios is exhibited by our joint effects and adjusted Wu statistics. We recommend use of these alternative or adjusted statistics and urge caution when using Wu et al

  4. Genome-Wide Mapping of the Binding Sites and Structural Analysis of Kaposi's Sarcoma-Associated Herpesvirus Viral Interferon Regulatory Factor 2 Reveal that It Is a DNA-Binding Transcription Factor

    PubMed Central

    Hu, Haidai; Dong, Jiazhen; Liang, Deguang; Gao, Zengqiang; Bai, Lei; Sun, Rui; Hu, Hao; Zhang, Heng

    2015-01-01

    ABSTRACT The oncogenic herpesvirus Kaposi's sarcoma-associated herpesvirus (KSHV) is known to encode four viral interferon regulatory factors (vIRF1 to -4) to subvert the host antiviral immune response, but their detailed DNA-binding profiles as transcription factors in the host remain uncharacterized. Here, we first performed genome-wide vIRF2-binding site mapping in the human genome using chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq). vIRF2 was capable of binding to the promoter regions of 100 putative target genes. Importantly, we confirmed that vIRF2 can specifically interact with the promoters of the genes encoding PIK3C3, HMGCR, and HMGCL, which are associated with autophagosome formation or tumor progression and metastasis, and regulate their transcription in vivo. The crystal structure of the vIRF2 DNA-binding domain (DBD) (referred to here as vIRF2DBD) showed variable loop conformations and positive-charge distributions different from those of vIRF1 and cellular IRFs that are associated with DNA-binding specificities. Structure-based mutagenesis revealed that Arg82 and Arg85 are required for the in vitro DNA-binding activity of vIRF2DBD and can abolish the transcription regulation function of vIRF2 on the promoter reporter activity of PIK3C3, HMGCR, and HMGCL. Collectively, our study provided unique insights into the DNA-binding potency of vIRF2 and suggested that vIRF2 could act as a transcription factor of its target genes in the host antiviral immune response. IMPORTANCE The oncogenic herpesvirus KSHV is the etiological agent of Kaposi's sarcoma, primary effusion lymphoma, and multicentric Castleman's disease. KSHV has developed a unique mechanism to subvert the host antiviral immune responses by encoding four homologues of cellular interferon regulatory factors (vIRF1 to -4). However, none of their DNA-binding profiles in the human genome have been characterized until now, and the structural basis for their diverse

  5. Genome-wide Analysis Reveals New Roles for the Activation Domains of the Saccharomyces cerevisiae Heat Shock Transcription Factor (Hsf1) during the Transient Heat Shock Response

    PubMed Central

    Eastmond, Dawn L.; Nelson, Hillary C. M.

    2008-01-01

    In response to elevated temperatures, cells from many organisms rapidly transcribe a number of mRNAs. In Saccharomyces cerevisiae, this protective response involves two regulatory systems: the heat shock transcription factor (Hsf1) and the Msn2 and Msn4 (Msn2/4) transcription factors. Both systems modulate the induction of specific heat shock genes. However, the contribution of Hsf1, independent of Msn2/4, is only beginning to emerge. To address this question, we constructed an msn2/4 double mutant and used microarrays to elucidate the genome-wide expression program of Hsf1. The data showed that 7.6% of the genome was heat-induced. The up-regulated genes belong to a wide range of functional categories, with a significant increase in the chaperone and metabolism genes. We then focused on the contribution of the activation domains of Hsf1 to the expression profile and extended our analysis to include msn2/4Δ strains deleted for the N-terminal or C-terminal activation domain of Hsf1. Cluster analysis of the heat-induced genes revealed activation domain-specific patterns of expression, with each cluster also showing distinct preferences for functional categories. Computational analysis of the promoters of the induced genes affected by the loss of an activation domain showed a distinct preference for positioning and topology of the Hsf1 binding site. This study provides insight into the important role that both activation domains play for the Hsf1 regulatory system to rapidly and effectively transcribe its regulon in response to heat shock. PMID:16926161

  6. Genome-Wide Sequencing Reveals Two Major Sub-Lineages in the Genetically Monomorphic Pathogen Xanthomonas Campestris Pathovar Musacearum

    PubMed Central

    Wasukira, Arthur; Tayebwa, Johnbosco; Thwaites, Richard; Paszkiewicz, Konrad; Aritua, Valente; Kubiriba, Jerome; Smith, Julian; Grant, Murray; Studholme, David J.

    2012-01-01

    The bacterium Xanthomonas campestris pathovar musacearum (Xcm) is the causal agent of banana Xanthomonas wilt (BXW). This disease has devastated economies based on banana and plantain crops (Musa species) in East Africa. Here we use genome-wide sequencing to discover a set of single-nucleotide polymorphisms (SNPs) among East African isolates of Xcm. These SNPs have potential as molecular markers for phylogeographic studies of the epidemiology and spread of the pathogen. Our analysis reveals two major sub-lineages of the pathogen, suggesting that the current outbreaks of BXW on Musa species in the region may have more than one introductory event, perhaps from Ethiopia. Also, based on comparisons of genome-wide sequence data from multiple isolates of Xcm and multiple strains of X. vasicola pathovar vasculorum, we identify genes specific to Xcm that could be used to specifically detect Xcm by PCR-based methods. PMID:24704974

  7. Genome-wide sequencing reveals two major sub-lineages in the genetically monomorphic pathogen Xanthomonas campestris pathovar musacearum.

    PubMed

    Wasukira, Arthur; Tayebwa, Johnbosco; Thwaites, Richard; Paszkiewicz, Konrad; Aritua, Valente; Kubiriba, Jerome; Smith, Julian; Grant, Murray; Studholme, David J

    2012-01-01

    The bacterium Xanthomonas campestris pathovar musacearum (Xcm) is the causal agent of banana Xanthomonas wilt (BXW). This disease has devastated economies based on banana and plantain crops (Musa species) in East Africa. Here we use genome-wide sequencing to discover a set of single-nucleotide polymorphisms (SNPs) among East African isolates of Xcm. These SNPs have potential as molecular markers for phylogeographic studies of the epidemiology and spread of the pathogen. Our analysis reveals two major sub-lineages of the pathogen, suggesting that the current outbreaks of BXW on Musa species in the region may have more than one introductory event, perhaps from Ethiopia. Also, based on comparisons of genome-wide sequence data from multiple isolates of Xcm and multiple strains of X. vasicola pathovar vasculorum, we identify genes specific to Xcm that could be used to specifically detect Xcm by PCR-based methods. PMID:24704974

  8. Phenome-wide analysis of genome-wide polygenic scores.

    PubMed

    Krapohl, E; Euesden, J; Zabaneh, D; Pingault, J-B; Rimfeld, K; von Stumm, S; Dale, P S; Breen, G; O'Reilly, P F; Plomin, R

    2016-09-01

    Genome-wide polygenic scores (GPS), which aggregate the effects of thousands of DNA variants from genome-wide association studies (GWAS), have the potential to make genetic predictions for individuals. We conducted a systematic investigation of associations between GPS and many behavioral traits, the behavioral phenome. For 3152 unrelated 16-year-old individuals representative of the United Kingdom, we created 13 GPS from the largest GWAS for psychiatric disorders (for example, schizophrenia, depression and dementia) and cognitive traits (for example, intelligence, educational attainment and intracranial volume). The behavioral phenome included 50 traits from the domains of psychopathology, personality, cognitive abilities and educational achievement. We examined phenome-wide profiles of associations for the entire distribution of each GPS and for the extremes of the GPS distributions. The cognitive GPS yielded stronger predictive power than the psychiatric GPS in our UK-representative sample of adolescents. For example, education GPS explained variation in adolescents' behavior problems (~0.6%) and in educational achievement (~2%) but psychiatric GPS were associated with neither. Despite the modest effect sizes of current GPS, quantile analyses illustrate the ability to stratify individuals by GPS and opportunities for research. For example, the highest and lowest septiles for the education GPS yielded a 0.5 s.d. difference in mean math grade and a 0.25 s.d. difference in mean behavior problems. We discuss the usefulness and limitations of GPS based on adult GWAS to predict genetic propensities earlier in development. PMID:26303664
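    The aggregation step described in this abstract can be illustrated with a minimal sketch. It assumes the common definition of a polygenic score as a weighted sum of allele counts, with the weights taken from GWAS effect sizes; it is not the authors' pipeline. The SNP identifiers, weights, and genotypes below are hypothetical toy values, and SNPs missing from an individual's genotype are simply skipped, which is a simplification.

```python
from statistics import mean, pstdev


def polygenic_score(dosages, weights):
    """Weighted sum of allele counts (0, 1 or 2) over the SNPs that have a weight."""
    return sum(w * dosages[snp] for snp, w in weights.items() if snp in dosages)


def standardize(scores):
    """Convert raw scores to z-scores across individuals (population s.d.)."""
    mu = mean(scores.values())
    sd = pstdev(scores.values())
    return {person: (s - mu) / sd for person, s in scores.items()}


# Hypothetical per-SNP effect sizes, standing in for GWAS summary statistics.
weights = {"rs_a": 0.10, "rs_b": -0.05, "rs_c": 0.02}

# Hypothetical genotypes: number of effect alleles carried at each SNP.
genotypes = {
    "id1": {"rs_a": 2, "rs_b": 0, "rs_c": 1},
    "id2": {"rs_a": 0, "rs_b": 2, "rs_c": 1},
    "id3": {"rs_a": 1, "rs_b": 1, "rs_c": 0},
}

raw = {person: polygenic_score(d, weights) for person, d in genotypes.items()}
z = standardize(raw)
```

    Standardizing the scores is what makes comparisons between groups expressible in s.d. units, as in the quantile comparisons the abstract reports.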

  9. Meta-Analysis of Genome-Wide Association Studies of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Neale, Benjamin M.; Medland, Sarah E.; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V.; Nguyen, Thuy Trang; Schafer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J. L.; Langley, Kate; O'Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P.; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D.; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan

    2010-01-01

    Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association studies (GWAS) have not yielded significant results, we conducted a meta-analysis of…

  10. Systematic Pathway Enrichment Analysis of a Genome-Wide Association Study on Breast Cancer Survival Reveals an Influence of Genes Involved in Cell Adhesion and Calcium Signaling on the Patients’ Clinical Outcome

    PubMed Central

    Woltmann, Andrea; Chen, Bowang; Lascorz, Jesús; Johansson, Robert; Eyfjörd, Jorunn E.; Hamann, Ute; Manjer, Jonas; Enquist-Olsson, Kerstin; Henriksson, Roger; Herms, Stefan; Hoffmann, Per; Hemminki, Kari; Lenner, Per; Försti, Asta

    2014-01-01

    Genome-wide association studies (GWASs) may help to understand the effects of genetic polymorphisms on breast cancer (BC) progression and survival. However, they give only a focused view, which cannot capture the tremendous complexity of this disease. Therefore, we investigated data from a previously conducted GWAS on BC survival for enriched pathways by different enrichment analysis tools using the two main annotation databases Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG). The goal was to identify the functional categories (GO terms and KEGG pathways) that are consistently overrepresented in a statistically significant way in the list of genes generated from the single nucleotide polymorphism (SNP) data. The SNPs with allelic p-value cut-offs 0.005 and 0.01 were annotated to the genes by excluding or including a 20 kb up-and down-stream sequence of the genes and analyzed by six different tools. We identified eleven consistently enriched categories, the most significant ones relating to cell adhesion and calcium ion binding. Moreover, we investigated the similarity between our GWAS and the enrichment analyses of twelve published gene expression signatures for breast cancer prognosis. Five of them were commonly used and commercially available, five were based on different aspects of metastasis formation and two were developed from meta-analyses of published prognostic signatures. This comparison revealed similarities between our GWAS data and the general and the specific brain metastasis gene signatures as well as the Oncotype DX signature. As metastasis formation is a strong indicator of a patient’s prognosis, this result reflects the survival aspect of the conducted GWAS and supports cell adhesion and calcium signaling as important pathways in cancer progression. PMID:24886783
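    The idea of a category being "overrepresented in a statistically significant way" in a gene list can be sketched with a one-sided hypergeometric test, one common basis for over-representation analysis. This is an illustrative assumption and not necessarily what any of the six tools used in the study compute. The gene names and categories below are hypothetical toy data, and no multiple-testing correction is applied.

```python
from math import comb


def hypergeom_tail(k, population, successes, draws):
    """Return P(X >= k) for a hypergeometric draw without replacement."""
    total = comb(population, draws)
    upper = min(successes, draws)
    favourable = sum(
        comb(successes, i) * comb(population - successes, draws - i)
        for i in range(k, upper + 1)
    )
    return favourable / total


def over_representation(gene_list, categories, background):
    """Map each category name to its one-sided enrichment p-value."""
    universe = set(background)
    genes = set(gene_list) & universe
    p_values = {}
    for name, members in categories.items():
        members = set(members) & universe
        overlap = len(genes & members)
        p_values[name] = hypergeom_tail(
            overlap, len(universe), len(members), len(genes)
        )
    return p_values


# Hypothetical toy data: 10 background genes and 2 made-up categories.
background = [f"g{i}" for i in range(10)]
categories = {"adhesion": ["g0", "g1", "g2"], "other": ["g7", "g8", "g9"]}
hits = ["g0", "g1", "g2", "g5"]

p_values = over_representation(hits, categories, background)
```

    In practice the background would be the set of genes the SNP array can map to, and the resulting p-values would be corrected for the number of categories tested.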

  11. Genome-wide association reveals the locus responsible for four-horned ruminant.

    PubMed

    Kijas, James W; Hadfield, Tracy; Naval Sanchez, Marina; Cockett, Noelle

    2016-04-01

    Phenotypic variability in horn characteristics, such as their size, number and shape, offers the opportunity to elucidate the molecular basis of horn development. The objective of this study was to map the genetic determinant controlling the production of four horns in two breeds, Jacob sheep and Navajo-Churro, and examine whether an eyelid abnormality occurring in the same populations is related. Genome-wide association mapping was performed using 125 animals from the two breeds that contain two- and four-horned individuals. A case-control design analysis of 570 712 SNPs genotyped with the ovine HD SNP Beadchip revealed a strong association signal on sheep chromosome 2. The 10 most strongly associated SNPs were all located in a region spanning Mb positions 131.9-132.6, indicating the genetic architecture underpinning the production of four horns is likely to involve a single gene. The closest genes to the most strongly associated marker (OAR2_132568092) were MTX2 and the HOXD cluster, located approximately 93 Kb and 251 Kb upstream respectively. The occurrence of an eyelid malformation across both breeds was restricted to polled animals and those carrying more than two horns. This suggests the eyelid abnormality may be associated with departures from the normal developmental production of two-horned animals and that the two conditions are developmentally linked. This study demonstrated the presence of separate loci responsible for the polled and four-horned phenotypes in sheep and advanced our understanding of the complexity that underpins horn morphology in ruminants. PMID:26767438

  12. Genetic Structure of the Han Chinese Population Revealed by Genome-wide SNP Variation

    PubMed Central

    Chen, Jieming; Zheng, Houfeng; Bei, Jin-Xin; Sun, Liangdan; Jia, Wei-hua; Li, Tao; Zhang, Furen; Seielstad, Mark; Zeng, Yi-Xin; Zhang, Xuejun; Liu, Jianjun

    2009-01-01

    Population stratification is a potential problem for genome-wide association studies (GWAS), confounding results and causing spurious associations. Hence, understanding how allele frequencies vary across geographic regions or among subpopulations is an important prelude to analyzing GWAS data. Using over 350,000 genome-wide autosomal SNPs in over 6000 Han Chinese samples from ten provinces of China, our study revealed a one-dimensional “north-south” population structure and a close correlation between geography and the genetic structure of the Han Chinese. The north-south population structure is consistent with the historical migration pattern of the Han Chinese population. Metropolitan cities in China were, however, more diffused “outliers,” probably because of the impact of modern migration of peoples. At a very local scale within the Guangdong province, we observed evidence of population structure among dialect groups, probably on account of endogamy within these dialects. Via simulation, we show that empirical levels of population structure observed across modern China can cause spurious associations in GWAS if not properly handled. In the Han Chinese, geographic matching is a good proxy for genetic matching, particularly in validation and candidate-gene studies in which population stratification cannot be directly accessed and accounted for because of the lack of genome-wide data, with the exception of the metropolitan cities, where geographical location is no longer a good indicator of ancestral origin. Our findings are important for designing GWAS in the Chinese population, an activity that is expected to intensify greatly in the near future. PMID:19944401

  13. Principal Component Analysis Characterizes Shared Pathogenetics from Genome-Wide Association Studies

    PubMed Central

    Chang, Diana; Keinan, Alon

    2014-01-01

    Genome-wide association studies (GWASs) have recently revealed many genetic associations that are shared between different diseases. We propose a method, disPCA, for genome-wide characterization of shared and distinct risk factors between and within disease classes. It flips the conventional GWAS paradigm by analyzing the diseases themselves, across GWAS datasets, to explore their “shared pathogenetics”. The method applies principal component analysis (PCA) to gene-level significance scores across all genes and across GWASs, thereby revealing shared pathogenetics between diseases in an unsupervised fashion. Importantly, it adjusts for potential sources of heterogeneity present between GWAS which can confound investigation of shared disease etiology. We applied disPCA to 31 GWASs, including autoimmune diseases, cancers, psychiatric disorders, and neurological disorders. The leading principal components separate these disease classes, as well as inflammatory bowel diseases from other autoimmune diseases. Generally, distinct diseases from the same class tend to be less separated, which is in line with their increased shared etiology. Enrichment analysis of genes contributing to leading principal components revealed pathways that are implicated in the immune system, while also pointing to pathways that have yet to be explored before in this context. Our results point to the potential of disPCA in going beyond epidemiological findings of the co-occurrence of distinct diseases, to highlighting novel genes and pathways that unsupervised learning suggest to be key players in the variability across diseases. PMID:25211452

  14. Genome-Wide Screen Reveals Valosin-Containing Protein Requirement for Coronavirus Exit from Endosomes

    PubMed Central

    Wong, Hui Hui; Kumar, Pankaj; Tay, Felicia Pei Ling; Moreau, Dimitri

    2015-01-01

    ABSTRACT Coronaviruses are RNA viruses with a large zoonotic reservoir and propensity for host switching, representing a real threat for public health, as evidenced by severe acute respiratory syndrome (SARS) and the emerging Middle East respiratory syndrome (MERS). Cellular factors required for their replication are poorly understood. Using genome-wide small interfering RNA (siRNA) screening, we identified 83 novel genes supporting infectious bronchitis virus (IBV) replication in human cells. Thirty of these hits can be placed in a network of interactions with viral proteins and are involved in RNA splicing, membrane trafficking, and ubiquitin conjugation. In addition, our screen reveals an unexpected role for valosin-containing protein (VCP/p97) in early steps of infection. Loss of VCP inhibits a previously uncharacterized degradation of the nucleocapsid N protein. This inhibition derives from virus accumulation in early endosomes, suggesting a role for VCP in the maturation of virus-loaded endosomes. The several host factors identified in this study may provide avenues for targeted therapeutics. IMPORTANCE Coronaviruses are RNA viruses representing a real threat for public health, as evidenced by SARS and the emerging MERS. However, cellular factors required for their replication are poorly understood. Using genome-wide siRNA screening, we identified novel genes supporting infectious bronchitis virus (IBV) replication in human cells. The several host factors identified in this study may provide directions for future research on targeted therapeutics. PMID:26311884

  15. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    SciTech Connect

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMahon, Katherine D.; Malmstrom, Rex R.

    2014-06-18

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ‘ecotype model’ of diversification, but not previously observed in natural populations.

  16. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    SciTech Connect

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMahon, Katherine D.; Malmstrom, Rex R.

    2014-05-12

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ‘ecotype model’ of diversification, but not previously observed in natural populations.

  17. Assessing statistical significance in multivariable genome wide association analysis

    PubMed Central

    Buzdugan, Laura; Kalisch, Markus; Navarro, Arcadi; Schunk, Daniel; Fehr, Ernst; Bühlmann, Peter

    2016-01-01

    Motivation: Although Genome Wide Association Studies (GWAS) genotype a very large number of single nucleotide polymorphisms (SNPs), the data are often analyzed one SNP at a time. The low predictive power of single SNPs, coupled with the high significance threshold needed to correct for multiple testing, greatly decreases the power of GWAS. Results: We propose a procedure in which all the SNPs are analyzed in a multiple generalized linear model, and we show its use for extremely high-dimensional datasets. Our method yields P-values for assessing significance of single SNPs or groups of SNPs while controlling for all other SNPs and the family wise error rate (FWER). Thus, our method tests whether or not a SNP carries any additional information about the phenotype beyond that available by all the other SNPs. This rules out spurious correlations between phenotypes and SNPs that can arise from marginal methods because the ‘spuriously correlated’ SNP merely happens to be correlated with the ‘truly causal’ SNP. In addition, the method offers a data driven approach to identifying and refining groups of SNPs that jointly contain informative signals about the phenotype. We demonstrate the value of our method by applying it to the seven diseases analyzed by the Wellcome Trust Case Control Consortium (WTCCC). We show, in particular, that our method is also capable of finding significant SNPs that were not identified in the original WTCCC study, but were replicated in other independent studies. Availability and implementation: Reproducibility of our research is supported by the open-source Bioconductor package hierGWAS. Contact: peter.buehlmann@stat.math.ethz.ch Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153677

  18. Genome-wide Comparative Analysis of Annexin Superfamily in Plants

    PubMed Central

    Jami, Sravan Kumar; Clark, Greg B.; Ayele, Belay T.; Ashe, Paula; Kirti, Pulugurtha Bharadwaja

    2012-01-01

    Most annexins are calcium-dependent, phospholipid-binding proteins with suggested functions in response to environmental stresses and signaling during plant growth and development. They have previously been identified and characterized in Arabidopsis and rice, and constitute a multigene family in plants. In this study, we performed a comparative analysis of annexin gene families in the sequenced genomes of Viridiplantae ranging from unicellular green algae to multicellular plants, and identified 149 genes. Phylogenetic studies of these deduced annexins classified them into nine different arbitrary groups. The occurrence and distribution of bona fide type II calcium binding sites within the four annexin domains were found to be different in each of these groups. Analysis of chromosomal distribution of annexin genes in rice, Arabidopsis and poplar revealed their localization on various chromosomes with some members also found on duplicated chromosomal segments leading to gene family expansion. Analysis of gene structure suggests sequential or differential loss of introns during the evolution of land plant annexin genes. Intron positions and phases are well conserved in annexin genes from representative genomes ranging from Physcomitrella to higher plants. The occurrence of alternative motifs such as K/R/HGD was found to be overlapping or at the mutated regions of the type II calcium binding sites indicating potential functional divergence in certain plant annexins. This study provides a basis for further functional analysis and characterization of annexin multigene families in the plant lineage. PMID:23133603

  19. A Genome Wide Survey of SNP Variation Reveals the Genetic Structure of Sheep Breeds

    PubMed Central

    Kijas, James W.; Townley, David; Dalrymple, Brian P.; Heaton, Michael P.; Maddox, Jillian F.; McGrath, Annette; Wilson, Peter; Ingersoll, Roxann G.; McCulloch, Russell; McWilliam, Sean; Tang, Dave; McEwan, John; Cockett, Noelle; Oddy, V. Hutton; Nicholas, Frank W.; Raadsma, Herman

    2009-01-01

    The genetic structure of sheep reflects their domestication and subsequent formation into discrete breeds. Understanding genetic structure is essential for achieving genetic improvement through genome-wide association studies, genomic selection and the dissection of quantitative traits. After identifying the first genome-wide set of SNP for sheep, we report on levels of genetic variability both within and between a diverse sample of ovine populations. Then, using cluster analysis and the partitioning of genetic variation, we demonstrate sheep are characterised by weak phylogeographic structure, overlapping genetic similarity and generally low differentiation which is consistent with their short evolutionary history. The degree of population substructure was, however, sufficient to cluster individuals based on geographic origin and known breed history. Specifically, African and Asian populations clustered separately from breeds of European origin sampled from Australia, New Zealand, Europe and North America. Furthermore, we demonstrate the presence of stratification within some, but not all, ovine breeds. The results emphasize that careful documentation of genetic structure will be an essential prerequisite when mapping the genetic basis of complex traits. Furthermore, the identification of a subset of SNP able to assign individuals into broad groupings demonstrates even a small panel of markers may be suitable for applications such as traceability. PMID:19270757

  20. Dating the age of admixture via wavelet transform analysis of genome-wide data.

    PubMed

    Pugach, Irina; Matveyev, Rostislav; Wollstein, Andreas; Kayser, Manfred; Stoneking, Mark

    2011-01-01

    We describe a PCA-based genome scan approach to analyze genome-wide admixture structure, and introduce wavelet transform analysis as a method for estimating the time of admixture. We test the wavelet transform method with simulations and apply it to genome-wide SNP data from eight admixed human populations. The wavelet transform method offers better resolution than existing methods for dating admixture, and can be applied to either SNP or sequence data from humans or other species. PMID:21352535

  1. Dating the age of admixture via wavelet transform analysis of genome-wide data

    PubMed Central

    2011-01-01

    We describe a PCA-based genome scan approach to analyze genome-wide admixture structure, and introduce wavelet transform analysis as a method for estimating the time of admixture. We test the wavelet transform method with simulations and apply it to genome-wide SNP data from eight admixed human populations. The wavelet transform method offers better resolution than existing methods for dating admixture, and can be applied to either SNP or sequence data from humans or other species. PMID:21352535

  2. A genome-wide analysis of putative functional and exonic variation associated with extremely high intelligence

    PubMed Central

    Kadeva, Neli; Miller, Mike B.; Iacono, William G.; McGue, Matt; Stergiakouli, Evie; Davey Smith, George; Putallaz, Martha; Lubinski, David; Meaburn, Emma L.; Plomin, Robert; Simpson, Michael A.

    2015-01-01

    Although individual differences in intelligence (general cognitive ability) are highly heritable, molecular genetic analyses to date have had limited success in identifying specific loci responsible for its heritability. The present study is the first to investigate exome variation in individuals of extremely high intelligence. Under the quantitative genetic model, sampling from the high extreme of the distribution should provide increased power to detect associations. We therefore performed a case-control association analysis with 1,409 individuals drawn from the top 0.0003 (IQ > 170) of the population distribution of intelligence and 3,253 unselected population-based controls. Our analysis focused on putative functional exonic variants assayed on the Illumina Human Exome BeadChip. We did not observe any individual protein-altering variants that are reproducibly associated with extremely high intelligence and within the entire distribution of intelligence. Moreover, no significant associations were found for multiple rare alleles within individual genes. However, analyses using genome-wide similarity between unrelated individuals (Genome-wide Complex Trait Analysis) indicate that the genotyped functional protein-altering variation yields a heritability estimate of 17.4% (SE 1.7%) based on a liability model. In addition, investigation of nominally significant associations revealed fewer rare alleles associated with extremely high intelligence than would be expected under the null hypothesis. This observation is consistent with the hypothesis that rare functional alleles are more frequently detrimental than beneficial to intelligence. PMID:26239293

  3. A genome-wide analysis of putative functional and exonic variation associated with extremely high intelligence.

    PubMed

    Spain, S L; Pedroso, I; Kadeva, N; Miller, M B; Iacono, W G; McGue, M; Stergiakouli, E; Smith, G D; Putallaz, M; Lubinski, D; Meaburn, E L; Plomin, R; Simpson, M A

    2016-08-01

    Although individual differences in intelligence (general cognitive ability) are highly heritable, molecular genetic analyses to date have had limited success in identifying specific loci responsible for its heritability. This study is the first to investigate exome variation in individuals of extremely high intelligence. Under the quantitative genetic model, sampling from the high extreme of the distribution should provide increased power to detect associations. We therefore performed a case-control association analysis with 1409 individuals drawn from the top 0.0003 (IQ >170) of the population distribution of intelligence and 3253 unselected population-based controls. Our analysis focused on putative functional exonic variants assayed on the Illumina HumanExome BeadChip. We did not observe any individual protein-altering variants that are reproducibly associated with extremely high intelligence and within the entire distribution of intelligence. Moreover, no significant associations were found for multiple rare alleles within individual genes. However, analyses using genome-wide similarity between unrelated individuals (genome-wide complex trait analysis) indicate that the genotyped functional protein-altering variation yields a heritability estimate of 17.4% (s.e. 1.7%) based on a liability model. In addition, investigation of nominally significant associations revealed fewer rare alleles associated with extremely high intelligence than would be expected under the null hypothesis. This observation is consistent with the hypothesis that rare functional alleles are more frequently detrimental than beneficial to intelligence. PMID:26239293

  4. Genome-Wide Association and Functional Follow-Up Reveals New Loci for Kidney Function

    PubMed Central

    Fuchsberger, Christian; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Taliun, Daniel; Li, Man; Gao, Xiaoyi; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C.; O'Seaghdha, Conall M.; Glazer, Nicole; Isaacs, Aaron; Liu, Ching-Ti; Smith, Albert V.; O'Connell, Jeffrey R.; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Johnson, Andrew D.; Gierman, Hinco J.; Feitosa, Mary; Hwang, Shih-Jen; Atkinson, Elizabeth J.; Lohman, Kurt; Cornelis, Marilyn C.; Johansson, Åsa; Tönjes, Anke; Dehghan, Abbas; Chouraki, Vincent; Holliday, Elizabeth G.; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y.; Murgia, Federico; Trompet, Stella; Imboden, Medea; Kollerits, Barbara; Pistis, Giorgio; Harris, Tamara B.; Launer, Lenore J.; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D.; Boerwinkle, Eric; Schmidt, Helena; Cavalieri, Margherita; Rao, Madhumathi; Hu, Frank B.; Demirkan, Ayse; Oostra, Ben A.; de Andrade, Mariza; Turner, Stephen T.; Ding, Jingzhong; Andrews, Jeanette S.; Freedman, Barry I.; Koenig, Wolfgang; Illig, Thomas; Döring, Angela; Wichmann, H.-Erich; Kolcic, Ivana; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E.; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H.; Wright, Alan F.; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Endlich, Karlhans; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K.; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G.; Rivadeneira, Fernando; Aulchenko, Yurii S.; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Ketkar, Shamika; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Giulianini, Franco; Krämer, Bernhard K.; Portas, Laura; Ford, Ian; Buckley, Brendan M.; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Metzger, Marie; Mitchell, Paul; Ciullo, Marina; Kim, Stuart K.; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J. Wouter; Probst-Hensch, Nicole M.; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R.; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; Siscovick, David S.; van Duijn, Cornelia M.; Borecki, Ingrid; Kardia, Sharon L. R.; Liu, Yongmei; Curhan, Gary C.; Rudan, Igor; Gyllensten, Ulf; Wilson, James F.; Franke, Andre; Pramstaller, Peter P.; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline C. M.; Hayward, Caroline; Ridker, Paul; Parsa, Afshin; Bochud, Murielle; Heid, Iris M.; Goessling, Wolfram; Chasman, Daniel I.; Kao, W. H. Linda; Fox, Caroline S.

    2012-01-01

    Chronic kidney disease (CKD) is an important public health problem with a genetic component. We performed genome-wide association studies in up to 130,600 European ancestry participants overall, and stratified for key CKD risk factors. We uncovered 6 new loci in association with estimated glomerular filtration rate (eGFR), the primary clinical measure of CKD, in or near MPPED2, DDX1, SLC47A1, CDK12, CASP9, and INO80. Morpholino knockdown of mpped2 and casp9 in zebrafish embryos revealed podocyte and tubular abnormalities with altered dextran clearance, suggesting a role for these genes in renal function. By providing new insights into genes that regulate renal function, these results could further our understanding of the pathogenesis of CKD. PMID:22479191

  5. Genome-wide association study of toxic metals and trace elements reveals novel associations

    PubMed Central

    Ng, Esther; Lind, P. Monica; Lindgren, Cecilia; Ingelsson, Erik; Mahajan, Anubha; Morris, Andrew; Lind, Lars

    2015-01-01

    The accumulation of toxic metals in the human body is influenced by exposure and mechanisms involved in metabolism, some of which may be under genetic control. This is the first genome-wide association study to investigate variants associated with whole blood levels of a range of toxic metals. Eleven toxic metals and trace elements (aluminium, cadmium, cobalt, copper, chromium, mercury, manganese, molybdenum, nickel, lead and zinc) were assayed in a cohort of 949 individuals using mass spectrometry. DNA samples were genotyped on the Infinium Omni Express bead microarray and imputed up to reference panels from the 1000 Genomes Project. Analyses revealed two regions associated with manganese level at genome-wide significance, mapping to 4q24 and 1q41. The lead single nucleotide polymorphism (SNP) in the 4q24 locus was rs13107325 (P-value = 5.1 × 10^−11, β = −0.77), located in an exon of SLC39A8, which encodes a protein involved in manganese and zinc transport. The lead SNP in the 1q41 locus is rs1776029 (P-value = 2.2 × 10^−14, β = −0.46). The SNP lies within the intronic region of SLC30A10, another transporter protein. Among other metals, the loci 6q14.1 and 3q26.32 were associated with cadmium and mercury levels (P = 1.4 × 10^−10, β = −1.2 and P = 1.8 × 10^−9, β = −1.8, respectively). Whole blood measurements of toxic metals are associated with genetic variants in metal transporter genes and others. This is relevant in inferring metabolic pathways of metals and identifying subsets of individuals who may be more susceptible to metal toxicity. PMID:26025379

  6. Genome-Wide Transcriptional Profiling Reveals Connective Tissue Mast Cell Accumulation in Bronchopulmonary Dysplasia

    PubMed Central

    Bhattacharya, Soumyaroop; Go, Diana; Krenitsky, Daria L.; Huyck, Heidi L.; Solleti, Siva Kumar; Lunger, Valerie A.; Metlay, Leon; Srisuma, Sorachai; Wert, Susan E.; Pryhuber, Gloria S.

    2012-01-01

    Rationale: Bronchopulmonary dysplasia (BPD) is a major complication of premature birth. Risk factors for BPD are complex and include prenatal infection and O2 toxicity. BPD pathology is equally complex and characterized by inflammation and dysmorphic airspaces and vasculature. Due to the limited availability of clinical samples, an understanding of the molecular pathogenesis of this disease and its causal mechanisms and associated biomarkers is limited. Objectives: Apply genome-wide expression profiling to define pathways affected in BPD lungs. Methods: Lung tissue was obtained at autopsy from 11 BPD cases and 17 age-matched control subjects without BPD. RNA isolated from these tissue samples was interrogated using microarrays. Standard gene selection and pathway analysis methods were applied to the data set. Abnormal expression patterns were validated by quantitative reverse transcriptase–polymerase chain reaction and immunohistochemistry. Measurements and Main Results: We identified 159 genes differentially expressed in BPD tissues. Pathway analysis indicated previously appreciated (e.g., DNA damage regulation of cell cycle) as well as novel (e.g., B-cell development) biological functions were affected. Three of the five most highly induced genes were mast cell (MC)-specific markers. We confirmed an increased accumulation of connective tissue MCTC (chymase expressing) mast cells in BPD tissues. Increased expression of MCTC markers was also demonstrated in an animal model of BPD-like pathology. Conclusions: We present a unique genome-wide expression data set from human BPD lung tissue. Our data provide information on gene expression patterns associated with BPD and facilitated the discovery that MCTC accumulation is a prominent feature of this disease. These observations have significant clinical and mechanistic implications. PMID:22723293

  7. Identification of Promising Mutants Associated with Egg Production Traits Revealed by Genome-Wide Association Study

    PubMed Central

    Dou, Taocun; Yi, Guoqiang; Qu, LuJiang; Qu, Liang; Wang, Kehua; Yang, Ning

    2015-01-01

    Egg number (EN), egg laying rate (LR) and age at first egg (AFE) are important egg production traits in the poultry industry. To better understand the genetic architecture of dynamic EN during the whole laying cycle and to locate variants associated with EN, LR and AFE precisely, laying records from 21 to 72 weeks of age were collected individually for 1,534 F2 hens produced by reciprocal crosses between White Leghorn and Dongxiang Blue-shelled chickens, and their genotypes were assayed with chicken 600 K Affymetrix high-density genotyping arrays. Subsequently, pedigree- and SNP-based genetic parameters were estimated and a genome-wide association study (GWAS) was conducted on EN, LR and AFE. The pedigree- and SNP-based heritability estimates were similar, varying from 0.17 to 0.36. In the GWA analysis, we identified nine genome-wide significant loci associated with EN of the laying periods from 21 to 26 weeks, 27 to 36 weeks and 37 to 72 weeks. Analysis of GTF2A1 and CLSPN suggested that they influence the function of the ovary and uterus, and they may be considered relevant candidates. The identified SNP rs314448799 for accumulative EN from 21 to 40 weeks on chromosome 5 created a phenotypic difference of 6.86 eggs between the two homozygous genotypes, and could potentially be applied in molecular breeding for EN selection. Moreover, our findings showed that LR was a moderately polygenic trait. The suggestive significant region on chromosome 16 for AFE suggested a relationship between sexual maturity and immunity in the current population. The present study comprehensively evaluates the role of genetic variants in the development of egg laying. The findings will be helpful for investigating the function of causative genes and for future marker-assisted selection and genomic selection in chickens. PMID:26496084

  8. Genome-wide analysis of mRNA polysomal profiles with spotted DNA microarrays.

    PubMed

    Melamed, Daniel; Arava, Yoav

    2007-01-01

    The sedimentation of an mRNA in sucrose gradients is highly affected by its ribosomal association. Sedimentation analysis has therefore become routine for studying changes in ribosomal association of mRNAs of interest. DNA microarray technology has been combined with sedimentation analysis to characterize changes in ribosomal association for thousands of mRNAs in parallel. Such analyses revealed mRNAs that are translationally regulated and have provided new insights into the translation process. In this chapter, we describe possible experimental designs for analyzing genome-wide changes in ribosomal association, and discuss some of their advantages and disadvantages. We then provide a detailed protocol for analysis of polysomal fractions using spotted DNA microarrays. PMID:17923236

  9. Genome-wide association analyses reveal complex genetic architecture underlying natural variation for flowering time in canola.

    PubMed

    Raman, H; Raman, R; Coombes, N; Song, J; Prangnell, R; Bandaranayake, C; Tahira, R; Sundaramoorthi, V; Killian, A; Meng, J; Dennis, E S; Balasubramanian, S

    2016-06-01

    Optimum flowering time is key to maximizing canola production in order to meet global demand for vegetable oil, biodiesel and canola meal. We reveal extensive variation in flowering time across diverse genotypes of canola under field, glasshouse and controlled environmental conditions. We conduct a genome-wide association study and identify 69 single nucleotide polymorphism (SNP) markers associated with flowering time, which are repeatedly detected across experiments. Several associated SNPs occur in clusters across the canola genome; seven of them were detected within 20 Kb regions of a priori candidate genes (FLOWERING LOCUS T, FRUITFUL, FLOWERING LOCUS C, CONSTANS, FRIGIDA and PHYTOCHROME B), and an additional five SNPs were localized within 14 Kb of a previously identified quantitative trait locus for flowering time. Expression analyses showed that among FLC paralogs, BnFLC.A2 accounts for ~23% of natural variation in diverse accessions. Genome-wide association analysis for FLC expression levels mapped not only BnFLC.C2 but also other loci that contribute to variation in FLC expression. In addition to revealing the complex genetic architecture of flowering time variation, we demonstrate that the identified SNPs can be modelled to accurately predict flowering time in diverse canola germplasm and hence are suitable for genomic selection of adaptive traits in canola improvement programmes. PMID:26428711

  10. Genome-wide analysis of the MYB transcription factor superfamily in soybean

    PubMed Central

    2012-01-01

    Background: The MYB superfamily constitutes one of the most abundant groups of transcription factors described in plants. Nevertheless, their functions appear to be highly diverse and remain rather unclear. To date, no genome-wide characterization of this gene family has been conducted in a legume species. Here we report the first genome-wide analysis of the whole MYB superfamily in a legume species, soybean (Glycine max), including the gene structures, phylogeny, chromosome locations, conserved motifs, and expression patterns, as well as a comparative genomic analysis with Arabidopsis. Results: A total of 244 R2R3-MYB genes were identified and further classified into 48 subfamilies based on a phylogenetic comparative analysis with their putative orthologs, which showed both gene loss and duplication events. The phylogenetic analysis showed that most characterized MYB genes with similar functions are clustered in the same subfamily; together with the identification of orthologs by synteny analysis, this strongly indicated functional conservation among subgroups of MYB genes. The phylogenetic relationships of each subgroup of MYB genes were well supported by the highly conserved intron/exon structures and motifs outside the MYB domain. Analysis of the ratio of nonsynonymous to synonymous nucleotide substitution rates (dN/dS) showed that the soybean MYB DNA-binding domain is under strong negative selection. The chromosome distribution pattern strongly indicated that genome-wide segmental and tandem duplication contribute to the expansion of soybean MYB genes. In addition, we found that ~4% of soybean R2R3-MYB genes had undergone alternative splicing events, producing a variety of transcripts from a single gene, which illustrated the extremely high complexity of transcriptome regulation. Comparative expression profile analysis of R2R3-MYB genes in soybean and Arabidopsis revealed that MYB genes play conserved and various roles in plants, which is indicative of a divergence in function. Conclusions: In this

  11. Genomic-Wide Analysis with Microarrays in Human Oncology

    PubMed Central

    Inaoka, Kenichi; Inokawa, Yoshikuni; Nomoto, Shuji

    2015-01-01

    DNA microarray technologies have advanced rapidly and have had a profound impact on the examination of gene expression on a genomic scale in research. This review discusses the history and development of microarray and DNA chip devices, and specific microarrays are described along with their methods and applications. In particular, microarrays have detected many novel cancer-related genes by comparing cancer tissues and non-cancerous tissues in oncological research. Recently, new methods have been in development, such as the double-combination array and triple-combination array, which allow more effective analysis of gene expression and epigenetic changes. Analyses of gene expression alterations in precancerous regions compared with normal regions, and array analyses of drug-resistant cancer tissues, have also been performed successfully. Several important differences distinguish microarrays from next-generation sequencing, a similar method of genome analysis, in both technique and application. Development of novel microarray technologies is expected to contribute to further cancer research.

  12. Genome-wide Association Study and Meta-Analysis Identify ISL1 as Genome-wide Significant Susceptibility Gene for Bladder Exstrophy

    PubMed Central

    Draaken, Markus; Knapp, Michael; Pennimpede, Tracie; Schmidt, Johanna M.; Ebert, Anne-Karolin; Rösch, Wolfgang; Stein, Raimund; Utsch, Boris; Hirsch, Karin; Boemers, Thomas M.; Mangold, Elisabeth; Heilmann, Stefanie; Ludwig, Kerstin U.; Jenetzky, Ekkehart; Zwink, Nadine; Moebus, Susanne; Herrmann, Bernhard G.; Mattheisen, Manuel; Nöthen, Markus M.

    2015-01-01

    The bladder exstrophy-epispadias complex (BEEC) represents the severe end of the uro-rectal malformation spectrum, and is thought to result from aberrant embryonic morphogenesis of the cloacal membrane and the urorectal septum. The most common form of BEEC is isolated classic bladder exstrophy (CBE). To identify susceptibility loci for CBE, we performed a genome-wide association study (GWAS) of 110 CBE patients and 1,177 controls of European origin. Here, an association was found with a region of approximately 220 kb on chromosome 5q11.1. This region harbors the ISL1 (ISL LIM homeobox 1) gene. Multiple markers in this region showed evidence for association with CBE, including 84 markers with genome-wide significance. We then performed a meta-analysis using data from a previous GWAS by our group of 98 CBE patients and 526 controls of European origin. This meta-analysis also implicated the 5q11.1 locus in CBE risk. A total of 138 markers at this locus reached genome-wide significance in the meta-analysis, and the most significant marker (rs9291768) achieved a P value of 2.13 × 10^−12. No other locus in the meta-analysis achieved genome-wide significance. We then performed murine expression analyses to follow up this finding. Here, Isl1 expression was detected in the genital region within the critical time frame for human CBE development. Genital regions with Isl1 expression included the peri-cloacal mesenchyme and the urorectal septum. The present study identified the first genome-wide significant locus for CBE at chromosomal region 5q11.1, and provides strong evidence for the hypothesis that ISL1 is the responsible candidate gene in this region. PMID:25763902

  13. A Genome Wide Survey of SNP Variation Reveals the Genetic Structure of Sheep Breeds

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The genetic structure of sheep reflects their domestication and subsequent formation into discrete breeds. Understanding genetic structure is essential for achieving genetic improvement through genome-wide association studies, genomic selection and the dissection of quantitative traits. After identi...

  14. Genome-Wide Transcriptome Analysis of Cadmium Stress in Rice

    PubMed Central

    Oono, Youko; Yazawa, Takayuki; Kanamori, Hiroyuki; Sasaki, Harumi; Mori, Satomi; Handa, Hirokazu; Matsumoto, Takashi

    2016-01-01

    Rice growth is severely affected by toxic concentrations of the nonessential heavy metal cadmium (Cd). To elucidate the molecular basis of the response to Cd stress, we performed mRNA sequencing of rice following our previous study on exposure to high concentrations of Cd (Oono et al., 2014). In this study, rice plants were hydroponically treated with low concentrations of Cd and approximately 211 million sequence reads were mapped onto the IRGSP-1.0 reference rice genome sequence. Many genes, including some identified under high Cd concentration exposure in our previous study, were found to be responsive to low Cd exposure, with an average of about 11,000 transcripts from each condition. However, genes expressed constitutively across the developmental course responded only slightly to low Cd concentrations, in contrast to their clear response to high Cd concentration, which causes fatal damage to rice seedlings according to phenotypic changes. The expression of metal ion transporter genes tended to correlate with Cd concentration, suggesting the potential of the RNA-Seq strategy to reveal novel Cd-responsive transporters by analyzing gene expression under different Cd concentrations. This study could help to develop novel strategies for improving tolerance to Cd exposure in rice and other cereal crops. PMID:27034955

  15. Rapid genome-wide evolution in Brassica rapa populations following drought revealed by sequencing of ancestral and descendant gene pools.

    PubMed

    Franks, Steven J; Kane, Nolan C; O'Hara, Niamh B; Tittes, Silas; Rest, Joshua S

    2016-08-01

    There is increasing evidence that evolution can occur rapidly in response to selection. Recent advances in sequencing suggest the possibility of documenting genetic changes as they occur in populations, thus uncovering the genetic basis of evolution, particularly if samples are available from both before and after selection. Here, we had a unique opportunity to directly assess genetic changes in natural populations following an evolutionary response to a fluctuation in climate. We analysed genome-wide differences between ancestors and descendants of natural populations of Brassica rapa plants from two locations that rapidly evolved changes in multiple phenotypic traits, including flowering time, following a multiyear late-season drought in California. These ancestor-descendant comparisons revealed evolutionary shifts in allele frequencies in many genes. Some genes showing evolutionary shifts have functions related to drought stress and flowering time, consistent with an adaptive response to selection. Loci differentiated between ancestors and descendants (FST outliers) were generally different from those showing signatures of selection based on site frequency spectrum analysis (Tajima's D), indicating that the loci that evolved in response to the recent drought and those under historical selection were generally distinct. Very few genes showed similar evolutionary responses between two geographically distinct populations, suggesting independent genetic trajectories of evolution yielding parallel phenotypic changes. The results show that selection can result in rapid genome-wide evolutionary shifts in allele frequencies in natural populations, and highlight the usefulness of combining resurrection experiments in natural populations with genomics for studying the genetic basis of adaptive evolution. PMID:27072809

  16. Decoding genome-wide GadEWX-transcriptional regulatory networks reveals multifaceted cellular responses to acid stress in Escherichia coli.

    PubMed

    Seo, Sang Woo; Kim, Donghyuk; O'Brien, Edward J; Szubin, Richard; Palsson, Bernhard O

    2015-01-01

    The regulators GadE, GadW and GadX (which we refer to as GadEWX) play a critical role in the transcriptional regulation of the glutamate-dependent acid resistance (GDAR) system in Escherichia coli K-12 MG1655. However, the genome-wide regulatory role of GadEWX is still unknown. Here we comprehensively reconstruct the genome-wide GadEWX transcriptional regulatory network and RpoS involvement in E. coli K-12 MG1655 under acidic stress. Integrative data analysis reveals that GadEWX regulons consist of 45 genes in 31 transcription units and 28 of these genes were associated with RpoS-binding sites. We demonstrate that GadEWX directly and coherently regulate several proton-generating/consuming enzymes with pairs of negative-feedback loops for pH homeostasis. In addition, GadEWX regulate genes with assorted functions, including molecular chaperones, acid resistance, stress response and other regulatory activities. These results show how GadEWX simultaneously coordinate many cellular processes to produce the overall response of E. coli to acid stress. PMID:26258987

  17. Decoding genome-wide GadEWX-transcriptional regulatory networks reveals multifaceted cellular responses to acid stress in Escherichia coli

    PubMed Central

    Seo, Sang Woo; Kim, Donghyuk; O'Brien, Edward J.; Szubin, Richard; Palsson, Bernhard O.

    2015-01-01

    The regulators GadE, GadW and GadX (which we refer to as GadEWX) play a critical role in the transcriptional regulation of the glutamate-dependent acid resistance (GDAR) system in Escherichia coli K-12 MG1655. However, the genome-wide regulatory role of GadEWX is still unknown. Here we comprehensively reconstruct the genome-wide GadEWX transcriptional regulatory network and RpoS involvement in E. coli K-12 MG1655 under acidic stress. Integrative data analysis reveals that GadEWX regulons consist of 45 genes in 31 transcription units and 28 of these genes were associated with RpoS-binding sites. We demonstrate that GadEWX directly and coherently regulate several proton-generating/consuming enzymes with pairs of negative-feedback loops for pH homeostasis. In addition, GadEWX regulate genes with assorted functions, including molecular chaperones, acid resistance, stress response and other regulatory activities. These results show how GadEWX simultaneously coordinate many cellular processes to produce the overall response of E. coli to acid stress. PMID:26258987

  18. Genome-wide analysis of promoter architecture in Drosophila melanogaster

    SciTech Connect

    Hoskins, Roger A.; Landolin, Jane M.; Brown, James B.; Sandler, Jeremy E.; Takahashi, Hazuki; Lassmann, Timo; Yu, Charles; Booth, Benjamin W.; Zhang, Dayu; Wan, Kenneth H.; Yang, Li; Boley, Nathan; Andrews, Justen; Kaufman, Thomas C.; Graveley, Brenton R.; Bickel, Peter J.; Carninci, Piero; Carlson, Joseph W.; Celniker, Susan E.

    2010-10-20

    Core promoters are critical regions for gene regulation in higher eukaryotes. However, the boundaries of promoter regions, the relative rates of initiation at the transcription start sites (TSSs) distributed within them, and the functional significance of promoter architecture remain poorly understood. We produced a high-resolution map of promoters active in the Drosophila melanogaster embryo by integrating data from three independent and complementary methods: 21 million cap analysis of gene expression (CAGE) tags, 1.2 million RNA ligase mediated rapid amplification of cDNA ends (RLMRACE) reads, and 50,000 cap-trapped expressed sequence tags (ESTs). We defined 12,454 promoters of 8037 genes. Our analysis indicates that, due to non-promoter-associated RNA background signal, previous studies have likely overestimated the number of promoter-associated CAGE clusters by fivefold. We show that TSS distributions form a complex continuum of shapes, and that promoters active in the embryo and adult have highly similar shapes in 95% of cases. This suggests that these distributions are generally determined by static elements such as local DNA sequence and are not modulated by dynamic signals such as histone modifications. Transcription factor binding motifs are differentially enriched as a function of promoter shape, and peaked promoter shape is correlated with both temporal and spatial regulation of gene expression. Our results contribute to the emerging view that core promoters are functionally diverse and control patterning of gene expression in Drosophila and mammals.

  19. Genome wide analysis of blood pressure variability and ischemic stroke

    PubMed Central

    Khan, Muhammad S; Nalls, Michael A; Bevan, Steve; Cheng, Yu-Ching; Chen, Wei-Min; Malik, Rainer; McCarthy, Nina S; Holliday, Elizabeth G; Speed, Douglas; Hasan, Nazeeha; Pucek, Mateusz; Rinne, Paul E.; Sever, Peter; Stanton, Alice; Shields, Denis C; Maguire, Jane M; McEvoy, Mark; Scott, Rodney J; Ferrucci, Luigi; Macleod, Mary J; Attia, John; Markus, Hugh S; Sale, Michele M; Worrall, Bradford B; Mitchell, Braxton D; Dichgans, Martin; Sudlow, Cathy; Meschia, James F; Rothwell, Peter M

    2013-01-01

    Background and Purpose: Visit-to-visit variability in BP is associated with ischemic stroke. We sought to determine whether such variability has a genetic aetiology and whether genetic variants associated with BP variability are also associated with ischemic stroke. Methods: A GWAS for loci influencing BP variability was undertaken in 3,802 individuals from the Anglo-Scandinavian Cardiac Outcome Trial (ASCOT) study where long-term visit-to-visit and within-visit BP measures were available. Since BP variability is strongly associated with ischemic stroke, we genotyped the sentinel SNP in an independent ischemic stroke population comprising 8,624 cases and 12,722 controls and in 3,900 additional (Scandinavian) participants from the ASCOT study in order to replicate our findings. Results: The ASCOT discovery GWAS identified a cluster of 17 correlated SNPs within the NLGN1 gene (3q26.31) associated with BP variability. The strongest association was with rs976683 (p=1.4×10^−8). Conditional analysis on rs976683 provided no evidence of additional independent associations at the locus. Analysis of rs976683 in ischemic stroke patients found no association for overall stroke (OR 1.02; 95% CI 0.97-1.07; p=0.52) or its sub-types: CE (OR 1.07; 95% CI 0.97-1.16; p=0.17), LVD (OR 0.98; 95% CI 0.89-1.07; p=0.60) and SVD (OR 1.07; 95% CI 0.97-1.17; p=0.19). No evidence for association was found between rs976683 and BP variability in the additional (Scandinavian) ASCOT participants (p=0.18). Conclusions: We identified a cluster of SNPs at the NLGN1 locus showing significant association with BP variability. Follow-up analyses did not support an association with risk of ischemic stroke and its subtypes. PMID:23929743

  20. Genome-wide analysis of TCP family in tobacco.

    PubMed

    Chen, L; Chen, Y Q; Ding, A M; Chen, H; Xia, F; Wang, W F; Sun, Y H

    2016-01-01

    The TCP family is a transcription factor family, members of which are extensively involved in plant growth and development as well as in signal transduction in the response against many physiological and biochemical stimuli. In the present study, 61 TCP genes were identified in the tobacco (Nicotiana tabacum) genome. Bioinformatic methods were employed for predicting and analyzing the gene structure, gene expression, phylogenetic relationships, and conserved domains of TCP proteins in tobacco. The 61 NtTCP genes were divided into three diverse groups, based on the division of TCP genes in tomato and Arabidopsis, and the results of the conserved domain and sequence analyses further confirmed the classification of the NtTCP genes. The expression pattern of NtTCP also demonstrated that the majority of these genes play important roles in all the tissues, while some special genes exercise their functions only in specific tissues. In brief, the comprehensive and thorough study of the TCP family in other plants provides sufficient resources for studying the structure and functions of TCPs in tobacco. PMID:27323069

  1. Genome-wide methylome analyses reveal novel epigenetic regulation patterns in schizophrenia and bipolar disorder.

    PubMed

    Li, Yongsheng; Camarillo, Cynthia; Xu, Juan; Arana, Tania Bedard; Xiao, Yun; Zhao, Zheng; Chen, Hong; Ramirez, Mercedes; Zavala, Juan; Escamilla, Michael A; Armas, Regina; Mendoza, Ricardo; Ontiveros, Alfonso; Nicolini, Humberto; Magaña, Alvaro Antonio Jerez; Rubin, Lewis P; Li, Xia; Xu, Chun

    2015-01-01

    Schizophrenia (SZ) and bipolar disorder (BP) are complex genetic disorders. Their appearance is also likely informed by as yet only partially described epigenetic contributions. Using a sequencing-based method for genome-wide analysis, we quantitatively compared the blood DNA methylation landscapes in SZ and BP subjects to control, both in an understudied population, Hispanics along the US-Mexico border. Remarkably, we identified thousands of differentially methylated regions for SZ and BP preferentially located in promoters, 3'-UTRs and 5'-UTRs of genes. Distinct patterns of aberrant methylation of promoter sequences were located surrounding transcription start sites. In these instances, aberrant methylation occurred in CpG islands (CGIs), in flanking regions, and in CGI-sparse promoters. Pathway analysis of genes displaying these distinct aberrant promoter methylation patterns showed enhancement of epigenetic changes in numerous genes previously related to psychiatric disorders and neurodevelopment. Integration of gene expression data further suggests that in SZ aberrant promoter methylation is significantly associated with altered gene transcription. In particular, we found significant associations between (1) promoter CGI hypermethylation and gene repression and (2) CGI 3'-shore hypomethylation and increased gene expression. Finally, we constructed a specific methylation analysis platform that facilitates viewing and comparing aberrant genome methylation in human neuropsychiatric disorders. PMID:25734057

  2. Dynamics of oscillatory phenotypes in S. cerevisiae reveal a network of genome-wide transcriptional oscillators

    PubMed Central

    Chin, Shwe L.; Marcus, Ian M.; Klevecz, Robert R.; Li, Caroline M.

    2012-01-01

    Genetic and environmental factors are well-studied influences on phenotype; however, time is a variable that is rarely considered when studying changes in cellular phenotype. Time-resolved microarray data revealed genome-wide transcriptional oscillation in a yeast continuous culture system with ~2 and ~4 h periods. We mapped the global patterns of transcriptional oscillations into a 3D map to represent different cellular phenotypes of redox cycles. This map shows the dynamic nature of gene expression in that transcripts are ordered and coupled to each other through time and concentration space. Although cells differed in oscillation periods, transcripts involved in certain processes were conserved in a deterministic way. When oscillation period lengthened, the peak to trough ratio of transcripts increased and the fraction of cells in the unbudded (G0/G1) phase of the cell division cycle increased. Decreasing the glucose level in the culture media was one way to increase the redox cycle, possibly from changes in metabolic flux. The period may be responding to lower glucose levels by increasing the fraction of cells in G1 and reducing S-phase gating so that cells can spend more time in catabolic processes. Our results support that gene transcripts are coordinated with metabolic functions and the cell division cycle. PMID:22289124

  3. Genome-wide RNAi screen for nuclear actin reveals a network of cofilin regulators

    PubMed Central

    Dopie, Joseph; Rajakylä, Eeva K.; Joensuu, Merja S.; Huet, Guillaume; Ferrantelli, Evelina; Xie, Tiao; Jäälinoja, Harri; Jokitalo, Eija; Vartiainen, Maria K.

    2015-01-01

    Nuclear actin plays an important role in many processes that regulate gene expression. Cytoplasmic actin dynamics are tightly controlled by numerous actin-binding proteins, but regulation of nuclear actin has remained unclear. Here, we performed a genome-wide RNA interference (RNAi) screen in Drosophila cells to identify proteins that influence either nuclear polymerization or import of actin. We validate 19 factors as specific hits, and show that Chinmo (known as Bach2 in mammals), SNF4Aγ (Prkag1 in mammals) and Rab18 play a role in nuclear localization of actin in both fly and mammalian cells. We identify several new regulators of cofilin activity, and characterize modulators of both cofilin kinases and phosphatase. For example, Chinmo/Bach2, which regulates nuclear actin levels also in vivo, maintains active cofilin by repressing the expression of the kinase Cdi (Tesk in mammals). Finally, we show that Nup98 and lamin are candidates for regulating nuclear actin polymerization. Our screen therefore reveals new aspects of actin regulation and links nuclear actin to many cellular processes. PMID:26021350

  4. Genome-wide RNAi screen for nuclear actin reveals a network of cofilin regulators.

    PubMed

    Dopie, Joseph; Rajakylä, Eeva K; Joensuu, Merja S; Huet, Guillaume; Ferrantelli, Evelina; Xie, Tiao; Jäälinoja, Harri; Jokitalo, Eija; Vartiainen, Maria K

    2015-07-01

    Nuclear actin plays an important role in many processes that regulate gene expression. Cytoplasmic actin dynamics are tightly controlled by numerous actin-binding proteins, but regulation of nuclear actin has remained unclear. Here, we performed a genome-wide RNA interference (RNAi) screen in Drosophila cells to identify proteins that influence either nuclear polymerization or import of actin. We validate 19 factors as specific hits, and show that Chinmo (known as Bach2 in mammals), SNF4Aγ (Prkag1 in mammals) and Rab18 play a role in nuclear localization of actin in both fly and mammalian cells. We identify several new regulators of cofilin activity, and characterize modulators of both cofilin kinases and phosphatase. For example, Chinmo/Bach2, which regulates nuclear actin levels also in vivo, maintains active cofilin by repressing the expression of the kinase Cdi (Tesk in mammals). Finally, we show that Nup98 and lamin are candidates for regulating nuclear actin polymerization. Our screen therefore reveals new aspects of actin regulation and links nuclear actin to many cellular processes. PMID:26021350

  5. Genome-wide Association Study of Dermatomyositis Reveals Genetic Overlap with other Autoimmune Disorders

    PubMed Central

    Miller, Frederick W.; Cooper, Robert G.; Vencovsky, Jiri; Rider, Lisa G.; Danko, Katalin; Wedderburn, Lucy R.; Lundberg, Ingrid E.; Pachman, Lauren M.; Reed, Ann M.; Ytterberg, Steven R.; Padyukov, Leonid; Selva-O’Callaghan, Albert; Radstake, Timothy; Isenberg, David A.; Chinoy, Hector; Ollier, William E. R.; O’Hanlon, Terrance P.; Peng, Bo; Lee, Annette; Lamb, Janine A.; Chen, Wei; Amos, Christopher I.; Gregersen, Peter K.

    2014-01-01

    Objective: To identify new genetic associations with juvenile and adult dermatomyositis (DM). Methods: We performed a genome-wide association study (GWAS) of adult and juvenile DM patients of European ancestry (n = 1178) and controls (n = 4724). To assess genetic overlap with other autoimmune disorders, we examined whether 141 single nucleotide polymorphisms (SNPs) outside the major histocompatibility complex (MHC) locus, and previously associated with autoimmune diseases, predispose to DM. Results: Compared to controls, patients with DM had a strong signal in the MHC region consisting of GWAS-level significance (P < 5×10^−8) at 80 genotyped SNPs. An analysis of 141 non-MHC SNPs previously associated with autoimmune diseases showed that three SNPs linked with three genes were associated with DM, with a false discovery rate (FDR) < 0.05. These genes were phospholipase C like 1 (PLCL1, rs6738825, FDR=0.00089), B lymphoid tyrosine kinase (BLK, rs2736340, FDR=0.00031), and chemokine (C-C motif) ligand 21 (CCL21, rs951005, FDR=0.0076). None of these genes was previously reported to be associated with DM. Conclusion: Our findings confirm the MHC as the major genetic region associated with DM and indicate that DM shares non-MHC genetic features with other autoimmune diseases, suggesting the presence of additional novel risk loci. This first identification of autoimmune disease genetic predispositions shared with DM may lead to enhanced understanding of pathogenesis and novel diagnostic and therapeutic approaches. PMID:23983088
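
    The abstract above reports a false discovery rate (FDR) for each candidate SNP but does not say which FDR procedure was used. As an illustration only, the following sketch applies the widely used Benjamini-Hochberg adjustment to a list of p-values. The function name and the example p-values are hypothetical and are not taken from the study.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical p-values for five candidate SNPs (not data from the study).
example_p = [0.001, 0.008, 0.039, 0.041, 0.6]
example_q = benjamini_hochberg(example_p)

# SNPs whose adjusted value falls below 0.05 would pass an FDR < 0.05 cut-off.
passing = [i for i, q in enumerate(example_q) if q < 0.05]
```

    The adjustment controls the expected share of false positives among the reported hits, which is less strict than the genome-wide threshold applied to the MHC signal.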

  6. Modeling genome-wide replication kinetics reveals a mechanism for regulation of replication timing

    PubMed Central

    Yang, Scott Cheng-Hsin; Rhind, Nicholas; Bechhoefer, John

    2010-01-01

    Microarrays are powerful tools to probe genome-wide replication kinetics. The rich data sets that result contain more information than has been extracted by current methods of analysis. In this paper, we present an analytical model that incorporates probabilistic initiation of origins and passive replication. Using the model, we performed least-squares fits to a set of recently published time course microarray data on Saccharomyces cerevisiae. We extracted the distribution of firing times for each origin and found that the later an origin fires on average, the greater the variation in firing times. To explain this trend, we propose a model where earlier-firing origins have more initiator complexes loaded and a more accessible chromatin environment. The model demonstrates how initiation can be stochastic and yet occur at defined times during S phase, without an explicit timing program. Furthermore, we hypothesize that the initiators in this model correspond to loaded minichromosome maintenance complexes. This model is the first to suggest a detailed, testable, biochemically plausible mechanism for the regulation of replication timing in eukaryotes. PMID:20739926
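
    The interplay of probabilistic origin firing and passive replication described above can be illustrated with a small Monte Carlo sketch. This is not the authors' analytical model or their least-squares fit. It assumes Gaussian firing-time distributions clipped at zero, a constant fork speed, and made-up origin parameters, all of which are hypothetical.

```python
import random


def replication_times(origins, fork_speed, positions, rng):
    """One stochastic realization of the time at which each position replicates.

    origins: list of (position, mean_firing_time, sd_firing_time) tuples.
    A position is copied by whichever fork arrives first, so an origin that
    has not yet fired can be replicated passively by a neighbouring fork.
    """
    # Draw one firing time per origin (assumed Gaussian, clipped at zero).
    fired = [(x, max(0.0, rng.gauss(mean, sd))) for x, mean, sd in origins]
    return [min(t + abs(pos - x) / fork_speed for x, t in fired)
            for pos in positions]


def mean_replication_profile(origins, fork_speed, positions,
                             n_cells=2000, seed=0):
    """Average the replication time at each position over many simulated cells."""
    rng = random.Random(seed)
    totals = [0.0] * len(positions)
    for _ in range(n_cells):
        times = replication_times(origins, fork_speed, positions, rng)
        for k, t in enumerate(times):
            totals[k] += t
    return [s / n_cells for s in totals]


# Hypothetical origins: (position in kb, mean firing time in min, sd in min).
# The later-firing origin is given a wider spread, echoing the reported trend.
ORIGINS = [(100.0, 10.0, 3.0), (300.0, 25.0, 8.0)]
POSITIONS = [0.0, 100.0, 200.0, 300.0, 400.0]
profile = mean_replication_profile(ORIGINS, fork_speed=2.0, positions=POSITIONS)
```

    Averaging over simulated cells gives a population-level timing profile of the kind that time-course microarray data measure.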

  7. Genome-Wide Analyses Reveal a Role for Peptide Hormones in Planarian Germline Development

    PubMed Central

    Collins, James J.; Hou, Xiaowen; Romanova, Elena V.; Lambrus, Bramwell G.; Miller, Claire M.; Saberi, Amir; Sweedler, Jonathan V.; Newmark, Phillip A.

    2010-01-01

    Bioactive peptides (i.e., neuropeptides or peptide hormones) represent the largest class of cell-cell signaling molecules in metazoans and are potent regulators of neural and physiological function. In vertebrates, peptide hormones play an integral role in endocrine signaling between the brain and the gonads that controls reproductive development, yet few of these molecules have been shown to influence reproductive development in invertebrates. Here, we define a role for peptide hormones in controlling reproductive physiology of the model flatworm, the planarian Schmidtea mediterranea. Based on our observation that defective neuropeptide processing results in defects in reproductive system development, we employed peptidomic and functional genomic approaches to characterize the planarian peptide hormone complement, identifying 51 prohormone genes and validating 142 peptides biochemically. Comprehensive in situ hybridization analyses of prohormone gene expression revealed the unanticipated complexity of the flatworm nervous system and identified a prohormone specifically expressed in the nervous system of sexually reproducing planarians. We show that this member of the neuropeptide Y superfamily is required for the maintenance of mature reproductive organs and differentiated germ cells in the testes. Additionally, comparative analyses of our biochemically validated prohormones with the genomes of the parasitic flatworms Schistosoma mansoni and Schistosoma japonicum identified new schistosome prohormones and validated half of all predicted peptide-encoding genes in these parasites. These studies describe the peptide hormone complement of a flatworm on a genome-wide scale and reveal a previously uncharacterized role for peptide hormones in flatworm reproduction. Furthermore, they suggest new opportunities for using planarians as free-living models for understanding the reproductive biology of flatworm parasites. PMID:20967238

  8. Genome-wide Identification and Structural, Functional and Evolutionary Analysis of WRKY Components of Mulberry.

    PubMed

    Baranwal, Vinay Kumar; Negi, Nisha; Khurana, Paramjit

    2016-01-01

    Mulberry is known to be sensitive to several biotic and abiotic stresses, which in turn have a direct impact on the yield of silk, because it is the sole food source for the silkworm. WRKYs are a family of transcription factors, which play an important role in combating various biotic and abiotic stresses. In this study, we identified 54 genes with conserved WRKY motifs in the Morus notabilis genome. Motif searches coupled with a phylogenetic analysis revealed seven sub-groups as well as the absence of members of Group Ib in mulberry. Analyses of the 2K upstream region in addition to a gene ontology terms enrichment analysis revealed putative functions of mulberry WRKYs under biotic and abiotic stresses. An RNA-seq-based analysis showed that several of the identified WRKYs are preferentially expressed in the leaf, bark, root, male flower, and winter bud of M. notabilis. Finally, expression analysis by qPCR under different stress and hormone treatments revealed genotype-specific responses. Taken together, our results provide a brief account of the genome-wide identification of WRKYs as well as their differential response to stresses and hormones. Importantly, these data can also be utilized to identify potential molecular targets for conferring tolerance to various stresses in mulberry. PMID:27477686

  9. Genome-wide Identification and Structural, Functional and Evolutionary Analysis of WRKY Components of Mulberry

    PubMed Central

    Baranwal, Vinay Kumar; Negi, Nisha; Khurana, Paramjit

    2016-01-01

    Mulberry is known to be sensitive to several biotic and abiotic stresses, which in turn have a direct impact on the yield of silk, because it is the sole food source for the silkworm. WRKYs are a family of transcription factors, which play an important role in combating various biotic and abiotic stresses. In this study, we identified 54 genes with conserved WRKY motifs in the Morus notabilis genome. Motif searches coupled with a phylogenetic analysis revealed seven sub-groups as well as the absence of members of Group Ib in mulberry. Analyses of the 2K upstream region in addition to a gene ontology terms enrichment analysis revealed putative functions of mulberry WRKYs under biotic and abiotic stresses. An RNA-seq-based analysis showed that several of the identified WRKYs are preferentially expressed in the leaf, bark, root, male flower, and winter bud of M. notabilis. Finally, expression analysis by qPCR under different stress and hormone treatments revealed genotype-specific responses. Taken together, our results provide a brief account of the genome-wide identification of WRKYs as well as their differential response to stresses and hormones. Importantly, these data can also be utilized to identify potential molecular targets for conferring tolerance to various stresses in mulberry. PMID:27477686

  10. Sympatric speciation revealed by genome-wide divergence in the blind mole rat Spalax.

    PubMed

    Li, Kexin; Hong, Wei; Jiao, Hengwu; Wang, Guo-Dong; Rodriguez, Karl A; Buffenstein, Rochelle; Zhao, Yang; Nevo, Eviatar; Zhao, Huabin

    2015-09-22

    Sympatric speciation (SS), i.e., speciation within a freely breeding population or in contiguous populations, was first proposed by Darwin [Darwin C (1859) On the Origins of Species by Means of Natural Selection] and is still controversial despite theoretical support [Gavrilets S (2004) Fitness Landscapes and the Origin of Species (MPB-41)] and mounting empirical evidence. Speciation of subterranean mammals generally, including the genus Spalax, was considered hitherto allopatric, whereby new species arise primarily through geographic isolation. Here we show in Spalax a case of genome-wide divergence analysis in mammals, demonstrating that SS in continuous populations, with gene flow, encompasses multiple widespread genomic adaptive complexes, associated with the sharply divergent ecologies. The two abutting soil populations of S. galili in northern Israel habituate the ancestral Senonian chalk population and abutting derivative Plio-Pleistocene basalt population. Population divergence originated ∼0.2-0.4 Mya based on both nuclear and mitochondrial genome analyses. Population structure analysis displayed two distinctly divergent clusters of chalk and basalt populations. Natural selection has acted on 300+ genes across the genome, diverging Spalax chalk and basalt soil populations. Gene ontology enrichment analysis highlights strong but differential soil population adaptive complexes: in basalt, sensory perception, musculature, metabolism, and energetics, and in chalk, nutrition and neurogenetics are outstanding. Population differentiation of chemoreceptor genes suggests intersoil population's mate and habitat choice substantiating SS. Importantly, distinctions in protein degradation may also contribute to SS. Natural selection and natural genetic engineering [Shapiro JA (2011) Evolution: A View From the 21st Century] overrule gene flow, evolving divergent ecological adaptive complexes. Sharp ecological divergences abound in nature; therefore, SS appears to be an

  11. Genome-wide gene expression profiling reveals unsuspected molecular alterations in pemphigus foliaceus

    PubMed Central

    Malheiros, Danielle; Panepucci, Rodrigo A; Roselino, Ana M; Araújo, Amélia G; Zago, Marco A; Petzl-Erler, Maria Luiza

    2014-01-01

    Pemphigus foliaceus (PF) is a complex autoimmune disease characterized by bullous skin lesions and the presence of antibodies against desmoglein 1. In this study we sought to contribute to a better understanding of the molecular processes in endemic PF, as the identification of factors that participate in the pathogenesis is a prerequisite for understanding its biological basis and may lead to novel therapeutic interventions. CD4+ T lymphocytes are central to the development of the disease. Therefore, we compared genome-wide gene expression profiles of peripheral CD4+ T cells of various PF patient subgroups with each other and with that of healthy individuals. The patient sample was subdivided into three groups: untreated patients with the generalized form of the disease, patients submitted to immunosuppressive treatment, and patients with the localized form of the disease. Comparisons between different subgroups resulted in 135, 54 and 64 genes differentially expressed. These genes are mainly related to lymphocyte adhesion and migration, apoptosis, cellular proliferation, cytotoxicity and antigen presentation. Several of these genes were differentially expressed when comparing lesional and uninvolved skin from the same patient. The chromosomal regions 19q13 and 12p13 concentrate differentially expressed genes and are candidate regions for PF susceptibility genes and disease markers. Our results reveal genes involved in disease severity, potential therapeutic targets and previously unsuspected processes involved in the pathogenesis. Besides, this study adds original information that will contribute to the understanding of PF's pathogenesis and of the still poorly defined in vivo functions of most of these genes. PMID:24813052

  12. Methods for meta-analysis of genome-wide association studies

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. For increasing N, results from different GWA can be combined in a meta-analysis (MA-...
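
    The record above is truncated and does not state which meta-analysis method the authors used. As a generic illustration of how results from several association studies can be pooled, the sketch below implements a standard inverse-variance-weighted fixed-effect combination of per-study effect estimates. The function name and the example numbers are hypothetical.

```python
import math
from statistics import NormalDist


def fixed_effect_meta(betas, standard_errors):
    """Combine per-study effect estimates with inverse-variance weights.

    Returns the pooled effect, its standard error, the z statistic and
    the two-sided p-value under a normal approximation.
    """
    weights = [1.0 / se ** 2 for se in standard_errors]
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = pooled_beta / pooled_se
    p = 2.0 * (1.0 - NormalDist().cdf(abs(z)))
    return pooled_beta, pooled_se, z, p


# Hypothetical effect estimates for one locus from two studies.
pooled_beta, pooled_se, z_stat, p_value = fixed_effect_meta(
    betas=[0.10, 0.20], standard_errors=[0.10, 0.10]
)
```

    Because the pooled standard error is smaller than that of either study, combining studies acts like an increase in sample size for loci with small effects.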

  13. On the analysis of a repeated measure design in genome-wide association analysis.

    PubMed

    Lee, Young; Park, Suyeon; Moon, Sanghoon; Lee, Juyoung; Elston, Robert C; Lee, Woojoo; Won, Sungho

    2014-12-01

    Longitudinal data enable detection of the effect of aging/time and, as a repeated measures design, are statistically more efficient than cross-sectional data if the correlations between repeated measurements are not large. In particular, when genotyping cost is more expensive than phenotyping cost, the collection of longitudinal data can be an efficient strategy for genetic association analysis. However, in spite of these advantages, genome-wide association studies (GWAS) with longitudinal data have rarely been analyzed taking this into account. In this report, we calculate the required sample size to achieve 80% power at the genome-wide significance level for both longitudinal and cross-sectional data, and compare their statistical efficiency. Furthermore, we analyzed the GWAS of eight phenotypes with three observations on each individual in the Korean Association Resource (KARE). A linear mixed model allowing for the correlations between observations for each individual was applied to analyze the longitudinal data, and linear regression was used to analyze the first observation on each individual as cross-sectional data. We found 12 novel genome-wide significant disease susceptibility loci that were then confirmed in the Health Examination cohort, as well as some significant interactions between age/sex and SNPs. PMID:25464127
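
    The sample-size comparison described above can be approximated with a short calculation. The sketch below is a rough normal-approximation formula, not the authors' derivation. It assumes a quantitative trait, a SNP that explains a fraction q of the phenotypic variance, equally correlated repeated measures (compound symmetry), and analysis of each individual's mean over k measurements. The function name and the example values of q and rho are hypothetical.

```python
import math
from statistics import NormalDist


def required_sample_size(q, alpha=5e-8, power=0.80, k=1, rho=0.0):
    """Approximate number of individuals needed to detect a SNP effect.

    q:     fraction of phenotypic variance explained by the SNP (assumed).
    alpha: two-sided significance level (5e-8 is the usual genome-wide level).
    k:     number of repeated measurements per individual.
    rho:   correlation between repeated measurements (assumed constant).
    """
    z_alpha = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    z_power = NormalDist().inv_cdf(power)
    n_cross_sectional = (z_alpha + z_power) ** 2 * (1.0 - q) / q
    # Variance of the mean of k equally correlated measurements, relative to
    # a single measurement; equals 1 when k == 1 or when rho == 1.
    design_factor = (1.0 + (k - 1) * rho) / k
    return math.ceil(n_cross_sectional * design_factor)


# Hypothetical scenario: SNP explains 1% of variance, three visits, rho = 0.5.
n_cross = required_sample_size(q=0.01)
n_long = required_sample_size(q=0.01, k=3, rho=0.5)
```

    Under these assumptions the saving from repeated measures shrinks as rho approaches 1, which matches the abstract's caveat that the efficiency gain holds only when the correlations are not large.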

  14. CNV-based genome wide association study reveals additional variants contributing to meat quality in swine

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pork quality is important both to the meat processing industry and consumers’ purchasing attitudes. Copy number variation (CNV) is a burgeoning kind of variant that may influence meat quality. Herein, a genome-wide association study (GWAS) was performed between CNVs and meat quality traits in swine....

  15. Genome-wide scan revealed genetic loci for energy metabolism in Hispanic children and adolescents

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome-wide scans were conducted in a search for genetic locations linked to energy expenditure and substrate oxidation in children. Pedigreed data of 1030 Hispanic children and adolescents were from the Viva La Familia Study, which was designed to investigate genetic and environmental risk factors ...

  16. Sparse principal component analysis for identifying ancestry-informative markers in genome-wide association studies.

    PubMed

    Lee, Seokho; Epstein, Michael P; Duncan, Richard; Lin, Xihong

    2012-05-01

    Genome-wide association studies (GWAS) routinely apply principal component analysis (PCA) to infer population structure within a sample to correct for confounding due to ancestry. GWAS implementation of PCA uses tens of thousands of single-nucleotide polymorphisms (SNPs) to infer structure, despite the fact that only a small fraction of such SNPs provides useful information on ancestry. The identification of this reduced set of ancestry-informative markers (AIMs) from a GWAS has practical value; for example, researchers can genotype the AIM set to correct for potential confounding due to ancestry in follow-up studies that utilize custom SNP or sequencing technology. We propose a novel technique to identify AIMs from genome-wide SNP data using sparse PCA. The procedure uses penalized regression methods to identify those SNPs in a genome-wide panel that significantly contribute to the principal components while encouraging SNPs that provide negligible loadings to vanish from the analysis. We found that sparse PCA leads to negligible loss of ancestry information compared to traditional PCA analysis of genome-wide SNP data. We further demonstrate the value of sparse PCA for AIM selection using real data from the International HapMap Project and a genomewide study of inflammatory bowel disease. We have implemented our approach in open-source R software for public use. PMID:22508067
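
    The idea of letting negligible SNP loadings vanish can be illustrated with a much simpler stand-in than the penalized-regression procedure the abstract describes: a power iteration on the covariance matrix with soft-thresholding at each step. The sketch below is only that simplified variant. The function names, the penalty value and the toy covariance matrix are hypothetical.

```python
import math


def soft_threshold(vector, penalty):
    """Shrink each entry toward zero and set entries below the penalty to zero."""
    return [math.copysign(max(abs(x) - penalty, 0.0), x) for x in vector]


def sparse_first_component(cov, penalty, n_iter=200):
    """Sparse leading loading vector via soft-thresholded power iteration."""
    p = len(cov)
    v = [1.0 / math.sqrt(p)] * p
    for _ in range(n_iter):
        # Ordinary power-iteration step: multiply by the covariance matrix.
        w = [sum(cov[i][j] * v[j] for j in range(p)) for i in range(p)]
        # Sparsity step: small loadings are driven exactly to zero.
        w = soft_threshold(w, penalty)
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            return [0.0] * p
        v = [x / norm for x in w]
    return v


# Toy covariance for four SNPs: the first two co-vary strongly (ancestry-like
# signal), the last two carry almost no shared structure.
TOY_COV = [
    [1.00, 0.80, 0.05, 0.00],
    [0.80, 1.00, 0.00, 0.05],
    [0.05, 0.00, 0.20, 0.00],
    [0.00, 0.05, 0.00, 0.20],
]
loadings = sparse_first_component(TOY_COV, penalty=0.1)

# SNPs with non-zero loadings would be retained as candidate AIMs.
selected = [i for i, x in enumerate(loadings) if x != 0.0]
```

    On this toy matrix the two uninformative SNPs receive zero loadings and drop out of the selected set.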

  17. Application of genome-wide expression analysis to human health and disease

    PubMed Central

    Cobb, J. Perren; Mindrinos, Michael N.; Miller-Graziano, Carol; Calvano, Steve E.; Baker, Henry V.; Xiao, Wenzhong; Laudanski, Krzysztof; Brownstein, Bernard H.; Elson, Constance M.; Hayden, Douglas L.; Herndon, David N.; Lowry, Stephen F.; Maier, Ronald V.; Schoenfeld, David A.; Moldawer, Lyle L.; Davis, Ronald W.; Tompkins, Ronald G.

    2005-01-01

    The application of genome-wide expression analysis to a large-scale, multicentered program in critically ill patients poses a number of theoretical and technical challenges. We describe here an analytical and organizational approach to a systematic evaluation of the variance associated with genome-wide expression analysis specifically tailored to study human disease. We analyzed sources of variance in genome-wide expression analyses performed with commercial oligonucleotide arrays. In addition, variance in gene expression in human blood leukocytes caused by repeated sampling in the same subject, among different healthy subjects, among different leukocyte subpopulations, and the effect of traumatic injury, were also explored. We report that analytical variance caused by sample processing was acceptably small. Blood leukocyte gene expression in the same individual over a 24-h period was remarkably constant. In contrast, genome-wide expression varied significantly among different subjects and leukocyte subpopulations. Expectedly, traumatic injury induced dramatic changes in apparent gene expression that were greater in magnitude than the analytical noise and interindividual variance. We demonstrate that the development of a nation-wide program for gene expression analysis with careful attention to analytical details can reduce the variance in the clinical setting to a level where patterns of gene expression are informative among different healthy human subjects, and can be studied with confidence in human disease. PMID:15781863

  18. Genome-Wide Analysis Reveals Selective Modulation of microRNAs and mRNAs by Histone Deacetylase Inhibitor in B Cells Induced to Undergo Class-Switch DNA Recombination and Plasma Cell Differentiation

    PubMed Central

    Shen, Tian; Sanchez, Helia N.; Zan, Hong; Casali, Paolo

    2015-01-01

    As we have suggested, epigenetic factors, such as microRNAs (miRNAs), can interact with genetic programs to regulate B cell functions, thereby informing antibody and autoantibody responses. We have shown that histone deacetylase (HDAC) inhibitors (HDI) inhibit the differentiation events critical to the maturation of the antibody response: class-switch DNA recombination (CSR), somatic hypermutation (SHM), and plasma cell differentiation, by modulating intrinsic B cell mechanisms. HDI repress the expression of AID and Blimp-1, which are critical for CSR/SHM and plasma cell differentiation, respectively, in mouse and human B cells by upregulating selected miRNAs that silenced AICDA/Aicda and PRDM1/Prdm1 mRNAs, as demonstrated by multiple qRT-PCRs (J Immunol 193:5933–5950, 2014). To further define the selectivity of HDI-mediated modulation of miRNA and gene expression, we performed genome-wide miRNA-Seq and mRNA-Seq analysis in B cells stimulated by LPS plus IL-4 and treated with HDI or nil. Consistent with what we have shown using qRT-PCR, these HDI-treated B cells displayed reduced expression of Aicda and Prdm1, and increased expression of miR-155, miR-181b, and miR-361, which target Aicda, and miR-23b, miR-30a, and miR-125b, which target Prdm1. In B cells induced to undergo CSR and plasma cell differentiation, about 23% of over 22,000 mRNAs analyzed were expressed at a significantly high copy number (more than 20 copies/cell). Only 18 (0.36%) of these highly expressed mRNAs, including Aicda, Prdm1, and Xbp1, were downregulated by HDI by 50% or more. Further, only 16 (0.30%) of the highly expressed mRNAs were upregulated (more than twofold) by HDI. The selectivity of HDI-mediated modulation of gene expression was emphasized by unchanged expression of the genes that are involved in regulation, targeting, or DNA repair processes of CSR, as well as unchanged expression of the genes encoding epigenetic regulators and factors that are important for cell signaling or

  19. Meta-analysis Reveals Genome-Wide Significance at 15q13 for Nonsyndromic Clefting of Both the Lip and the Palate, and Functional Analyses Implicate GREM1 As a Plausible Causative Gene.

    PubMed

    Ludwig, Kerstin U; Ahmed, Syeda Tasnim; Böhmer, Anne C; Sangani, Nasim Bahram; Varghese, Sheryil; Klamt, Johanna; Schuenke, Hannah; Gültepe, Pinar; Hofmann, Andrea; Rubini, Michele; Aldhorae, Khalid Ahmed; Steegers-Theunissen, Regine P; Rojas-Martinez, Augusto; Reiter, Rudolf; Borck, Guntram; Knapp, Michael; Nakatomi, Mitsushiro; Graf, Daniel; Mangold, Elisabeth; Peters, Heiko

    2016-03-01

    Nonsyndromic orofacial clefts are common birth defects with multifactorial etiology. The most common type is cleft lip, which occurs with or without cleft palate (nsCLP and nsCLO, respectively). Although genetic components play an important role in nsCLP, the genetic factors that predispose to palate involvement are largely unknown. In this study, we carried out a meta-analysis on genetic and clinical data from three large cohorts and identified strong association between a region on chromosome 15q13 and nsCLP (P = 8.13 × 10^-14 for rs1258763; relative risk (RR): 1.46, 95% confidence interval (CI): 1.32-1.61) but not nsCLO (P = 0.27; RR: 1.09 (0.94-1.27)). The 5 kb region of strongest association maps downstream of Gremlin-1 (GREM1), which encodes a secreted antagonist of the BMP4 pathway. We show that during mouse embryogenesis, Grem1 is expressed in the developing lip and soft palate but not in the hard palate. This is consistent with genotype-phenotype correlations between rs1258763 and a specific nsCLP subphenotype, since a more than two-fold increase in risk was observed in patients displaying clefts of both the lip and soft palate but who had an intact hard palate (RR: 3.76, CI: 1.47-9.61, Pdiff<0.05). While we did not find lip or palate defects in Grem1-deficient mice, wild type embryonic palatal shelves developed divergent shapes when cultured in the presence of ectopic Grem1 protein (P = 0.0014). The present study identified a non-coding region at 15q13 as the second, genome-wide significant locus specific for nsCLP, after 13q31. Moreover, our data suggest that the closely located GREM1 gene contributes to a rare clinical nsCLP entity. This entity specifically involves abnormalities of the lip and soft palate, which develop at different time-points and in separate anatomical regions. PMID:26968009
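
    As a rough consistency check on the figures in this abstract, the z-score and two-sided P value can be approximately recovered from a reported relative risk and its 95% confidence interval. The sketch below assumes a Wald-type interval that is symmetric on the log scale and a normal approximation; the function name `z_and_p_from_ci` is illustrative, and this is not the authors' analysis.

```python
import math


def z_and_p_from_ci(rr, lower, upper, z_crit=1.959964):
    """Approximate z-score and two-sided P value from a ratio and its 95% CI.

    Assumption: the interval is a Wald interval, symmetric on the log scale.
    """
    # Half-width of the log-scale interval divided by the critical value
    se = (math.log(upper) - math.log(lower)) / (2 * z_crit)
    z = math.log(rr) / se
    # Two-sided tail probability of the standard normal distribution
    p = math.erfc(abs(z) / math.sqrt(2))
    return z, p


# Values as reported in the abstract (rounded, so results are approximate)
z, p = z_and_p_from_ci(1.46, 1.32, 1.61)    # nsCLP, rs1258763
print(f"nsCLP: z = {z:.2f}, P = {p:.1e}")

z0, p0 = z_and_p_from_ci(1.09, 0.94, 1.27)  # nsCLO
print(f"nsCLO: z = {z0:.2f}, P = {p0:.2f}")
```

    Because the inputs are rounded to two decimals, the recovered values should land near the reported ones (a P value on the order of 10^-14 for nsCLP and a non-significant P value for nsCLO) without matching them exactly.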

  20. Genome-Wide Analysis Reveals Selective Modulation of microRNAs and mRNAs by Histone Deacetylase Inhibitor in B Cells Induced to Undergo Class-Switch DNA Recombination and Plasma Cell Differentiation.

    PubMed

    Shen, Tian; Sanchez, Helia N; Zan, Hong; Casali, Paolo

    2015-01-01

    As we have suggested, epigenetic factors, such as microRNAs (miRNAs), can interact with genetic programs to regulate B cell functions, thereby informing antibody and autoantibody responses. We have shown that histone deacetylase (HDAC) inhibitors (HDI) inhibit the differentiation events critical to the maturation of the antibody response: class-switch DNA recombination (CSR), somatic hypermutation (SHM), and plasma cell differentiation, by modulating intrinsic B cell mechanisms. HDI repress the expression of AID and Blimp-1, which are critical for CSR/SHM and plasma cell differentiation, respectively, in mouse and human B cells by upregulating selected miRNAs that silenced AICDA/Aicda and PRDM1/Prdm1 mRNAs, as demonstrated by multiple qRT-PCRs (J Immunol 193:5933-5950, 2014). To further define the selectivity of HDI-mediated modulation of miRNA and gene expression, we performed genome-wide miRNA-Seq and mRNA-Seq analysis in B cells stimulated by LPS plus IL-4 and treated with HDI or nil. Consistent with what we have shown using qRT-PCR, these HDI-treated B cells displayed reduced expression of Aicda and Prdm1, and increased expression of miR-155, miR-181b, and miR-361, which target Aicda, and miR-23b, miR-30a, and miR-125b, which target Prdm1. In B cells induced to undergo CSR and plasma cell differentiation, about 23% of over 22,000 mRNAs analyzed were expressed at a significantly high copy number (more than 20 copies/cell). Only 18 (0.36%) of these highly expressed mRNAs, including Aicda, Prdm1, and Xbp1, were downregulated by HDI by 50% or more. Further, only 16 (0.30%) of the highly expressed mRNAs were upregulated (more than twofold) by HDI. The selectivity of HDI-mediated modulation of gene expression was emphasized by unchanged expression of the genes that are involved in regulation, targeting, or DNA repair processes of CSR, as well as unchanged expression of the genes encoding epigenetic regulators and factors that are important for cell signaling or

  1. Meta-analysis Reveals Genome-Wide Significance at 15q13 for Nonsyndromic Clefting of Both the Lip and the Palate, and Functional Analyses Implicate GREM1 As a Plausible Causative Gene

    PubMed Central

    Ludwig, Kerstin U.; Ahmed, Syeda Tasnim; Böhmer, Anne C.; Sangani, Nasim Bahram; Varghese, Sheryil; Klamt, Johanna; Schuenke, Hannah; Gültepe, Pinar; Hofmann, Andrea; Rubini, Michele; Aldhorae, Khalid Ahmed; Steegers-Theunissen, Regine P.; Rojas-Martinez, Augusto; Reiter, Rudolf; Borck, Guntram; Knapp, Michael; Nakatomi, Mitsushiro; Graf, Daniel; Mangold, Elisabeth; Peters, Heiko

    2016-01-01

    Nonsyndromic orofacial clefts are common birth defects with multifactorial etiology. The most common type is cleft lip, which occurs with or without cleft palate (nsCLP and nsCLO, respectively). Although genetic components play an important role in nsCLP, the genetic factors that predispose to palate involvement are largely unknown. In this study, we carried out a meta-analysis on genetic and clinical data from three large cohorts and identified strong association between a region on chromosome 15q13 and nsCLP (P = 8.13 × 10^-14 for rs1258763; relative risk (RR): 1.46, 95% confidence interval (CI): 1.32–1.61) but not nsCLO (P = 0.27; RR: 1.09 (0.94–1.27)). The 5 kb region of strongest association maps downstream of Gremlin-1 (GREM1), which encodes a secreted antagonist of the BMP4 pathway. We show that during mouse embryogenesis, Grem1 is expressed in the developing lip and soft palate but not in the hard palate. This is consistent with genotype-phenotype correlations between rs1258763 and a specific nsCLP subphenotype, since a more than two-fold increase in risk was observed in patients displaying clefts of both the lip and soft palate but who had an intact hard palate (RR: 3.76, CI: 1.47–9.61, Pdiff<0.05). While we did not find lip or palate defects in Grem1-deficient mice, wild type embryonic palatal shelves developed divergent shapes when cultured in the presence of ectopic Grem1 protein (P = 0.0014). The present study identified a non-coding region at 15q13 as the second, genome-wide significant locus specific for nsCLP, after 13q31. Moreover, our data suggest that the closely located GREM1 gene contributes to a rare clinical nsCLP entity. This entity specifically involves abnormalities of the lip and soft palate, which develop at different time-points and in separate anatomical regions. PMID:26968009

  2. Genome-wide meta-analysis of cerebral white matter hyperintensities in patients with stroke

    PubMed Central

    Zhang, Cathy R.; Adib-Samii, Poneh; Devan, William J.; Parsons, Owen E.; Lanfranconi, Silvia; Gregory, Sarah; Cloonan, Lisa; Falcone, Guido J.; Radmanesh, Farid; Fitzpatrick, Kaitlin; Kanakis, Allison; Barrick, Thomas R.; Moynihan, Barry; Lewis, Cathryn M.; Boncoraglio, Giorgio B.; Lemmens, Robin; Thijs, Vincent; Sudlow, Cathie; Wardlaw, Joanna; Rothwell, Peter M.; Meschia, James F.; Worrall, Bradford B.; Levi, Christopher; Bevan, Steve; Furie, Karen L.; Dichgans, Martin; Rosand, Jonathan; Markus, Hugh S.; Rost, Natalia

    2016-01-01

    Objective: For 3,670 stroke patients from the United Kingdom, United States, Australia, Belgium, and Italy, we performed a genome-wide meta-analysis of white matter hyperintensity volumes (WMHV) on data imputed to the 1000 Genomes reference dataset to provide insights into disease mechanisms. Methods: We first sought to identify genetic associations with white matter hyperintensities in a stroke population, and then examined whether genetic loci previously linked to WMHV in community populations are also associated in stroke patients. Having established that genetic associations are shared between the 2 populations, we performed a meta-analysis testing which associations with WMHV in stroke-free populations are associated overall when combined with stroke populations. Results: There were no associations at genome-wide significance with WMHV in stroke patients. All previously reported genome-wide significant associations with WMHV in community populations shared direction of effect in stroke patients. In a meta-analysis of the genome-wide significant and suggestive loci (p < 5 × 10^-6) from community populations (15 single nucleotide polymorphisms in total) and from stroke patients, 6 independent loci were associated with WMHV in both populations. Four of these are novel associations at the genome-wide level (rs72934505 [NBEAL1], p = 2.2 × 10^-8; rs941898 [EVL], p = 4.0 × 10^-8; rs962888 [C1QL1], p = 1.1 × 10^-8; rs9515201 [COL4A2], p = 6.9 × 10^-9). Conclusions: Genetic associations with WMHV are shared in otherwise healthy individuals and patients with stroke, indicating common genetic susceptibility in cerebral small vessel disease. PMID:26674333
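
    The significance tiers mentioned in this abstract can be illustrated with a short sketch that sorts the four reported P values into categories. The suggestive cutoff (5 × 10^-6) is taken from the abstract; the genome-wide cutoff of 5 × 10^-8 is the conventional value and is an assumption here, since the abstract does not state it. The function name `classify` is hypothetical.

```python
GENOME_WIDE = 5e-8  # conventional genome-wide threshold (assumed, not stated in the abstract)
SUGGESTIVE = 5e-6   # suggestive threshold as given in the abstract

# P values exactly as reported in the abstract
reported = {
    "rs72934505 (NBEAL1)": 2.2e-8,
    "rs941898 (EVL)": 4.0e-8,
    "rs962888 (C1QL1)": 1.1e-8,
    "rs9515201 (COL4A2)": 6.9e-9,
}


def classify(p):
    """Label a P value by the tier it falls into."""
    if p < GENOME_WIDE:
        return "genome-wide significant"
    if p < SUGGESTIVE:
        return "suggestive"
    return "not significant"


labels = {snp: classify(p) for snp, p in reported.items()}

# Print from strongest to weakest association
for snp in sorted(reported, key=reported.get):
    print(f"{snp}: p = {reported[snp]:.1e} -> {labels[snp]}")
```

    Under these thresholds all four reported loci fall into the genome-wide tier, which matches how the abstract describes them.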

  3. Integrative Network-based Analysis of Magnetic Resonance Spectroscopy and Genome Wide Expression in Glioblastoma multiforme.

    PubMed

    Heiland, Dieter Henrik; Mader, Irina; Schlosser, Pascal; Pfeifer, Dietmar; Carro, Maria Stella; Lange, Thomas; Schwarzwald, Ralf; Vasilikos, Ioannis; Urbach, Horst; Weyerbrock, Astrid

    2016-01-01

    The goal of this study was to identify correlations between metabolites from proton MR spectroscopy and genetic pathway activity in glioblastoma multiforme (GBM). Twenty patients with primary GBM were analysed by short echo-time chemical shift imaging and genome-wide expression analyses. Weighted Gene Co-Expression Analysis was used for an integrative analysis of imaging and genetic data. N-acetylaspartate, normalised to the contralateral healthy side (nNAA), was significantly correlated to oligodendrocytic and neural development. For normalised creatine (nCr), a group with low nCr was linked to the mesenchymal subtype, while high nCr could be assigned to the proneural subtype. Moreover, clustering of normalised glutamine and glutamate (nGlx) revealed two groups, one with high nGlx being attributed to the neural subtype, and one with low nGlx associated with the classical subtype. Hence, the metabolites nNAA, nCr, and nGlx correlate with a specific gene expression pattern reflecting the previously described subtypes of GBM. Moreover, high nNAA was associated with better clinical prognosis, whereas patients with lower nNAA revealed a shorter progression-free survival (PFS). PMID:27350391

  4. Integrative Network-based Analysis of Magnetic Resonance Spectroscopy and Genome Wide Expression in Glioblastoma multiforme

    PubMed Central

    Heiland, Dieter Henrik; Mader, Irina; Schlosser, Pascal; Pfeifer, Dietmar; Carro, Maria Stella; Lange, Thomas; Schwarzwald, Ralf; Vasilikos, Ioannis; Urbach, Horst; Weyerbrock, Astrid

    2016-01-01

    The goal of this study was to identify correlations between metabolites from proton MR spectroscopy and genetic pathway activity in glioblastoma multiforme (GBM). Twenty patients with primary GBM were analysed by short echo-time chemical shift imaging and genome-wide expression analyses. Weighted Gene Co-Expression Analysis was used for an integrative analysis of imaging and genetic data. N-acetylaspartate, normalised to the contralateral healthy side (nNAA), was significantly correlated to oligodendrocytic and neural development. For normalised creatine (nCr), a group with low nCr was linked to the mesenchymal subtype, while high nCr could be assigned to the proneural subtype. Moreover, clustering of normalised glutamine and glutamate (nGlx) revealed two groups, one with high nGlx being attributed to the neural subtype, and one with low nGlx associated with the classical subtype. Hence, the metabolites nNAA, nCr, and nGlx correlate with a specific gene expression pattern reflecting the previously described subtypes of GBM. Moreover, high nNAA was associated with better clinical prognosis, whereas patients with lower nNAA revealed a shorter progression-free survival (PFS). PMID:27350391

  5. Genome-wide analysis and expression profiling of the Solanum tuberosum aquaporins.

    PubMed

    Venkatesh, Jelli; Yu, Jae-Woong; Park, Se Won

    2013-12-01

    Aquaporins belong to the major intrinsic proteins involved in the transcellular membrane transport of water and other small solutes. A comprehensive genome-wide search for the homologues of Solanum tuberosum major intrinsic protein (MIP) revealed 41 full-length potato aquaporin genes. All potato aquaporins are grouped into five subfamilies: plasma membrane intrinsic proteins (PIPs), tonoplast intrinsic proteins (TIPs), NOD26-like intrinsic proteins (NIPs), small basic intrinsic proteins (SIPs) and x-intrinsic proteins (XIPs). Functional predictions based on the aromatic/arginine (ar/R) selectivity filters and Froger's positions showed a remarkable difference in substrate transport specificity among subfamilies. The expression pattern of potato aquaporins, examined by qPCR analysis, showed distinct expression profiles in various organs and tuber developmental stages. Furthermore, qPCR analysis of potato plantlets subjected to various abiotic stresses revealed the marked effect of stresses on expression levels of aquaporins. Taken together, the expression profiles of aquaporins imply that aquaporins play important roles in plant growth and development, in addition to maintaining water homeostasis in response to environmental stresses. PMID:24215931

  6. Genome-Wide Analysis of Branched-Chain Amino Acid Levels in Arabidopsis Seeds

    PubMed Central

    Angelovici, Ruthie; Lipka, Alexander E.; Deason, Nicholas; Gonzalez-Jorge, Sabrina; Lin, Haining; Cepela, Jason; Buell, Robin; Gore, Michael A.; DellaPenna, Dean

    2013-01-01

    Branched-chain amino acids (BCAAs) are three of the nine essential amino acids in human and animal diets and are important for numerous processes in development and growth. However, seed BCAA levels in major crops are insufficient to meet dietary requirements, making genetic improvement for increased and balanced seed BCAAs an important nutritional target. Addressing this issue requires a better understanding of the genetics underlying seed BCAA content and composition. Here, a genome-wide association study and haplotype analysis for seed BCAA traits in Arabidopsis thaliana revealed a strong association with a chromosomal interval containing two BRANCHED-CHAIN AMINO ACID TRANSFERASES, BCAT1 and BCAT2. Linkage analysis, reverse genetic approaches, and molecular complementation analysis demonstrated that allelic variation at BCAT2 is responsible for the natural variation of seed BCAAs in this interval. Complementation analysis of a bcat2 null mutant with two significantly different alleles from accessions Bayreuth-0 and Shahdara is consistent with BCAT2 contributing to natural variation in BCAA levels, glutamate recycling, and free amino acid homeostasis in seeds in an allele-dependent manner. The seed-specific phenotype of bcat2 null alleles, its strong transcription induction during late seed development, and its subcellular localization to the mitochondria are consistent with a unique, catabolic role for BCAT2 in BCAA metabolism in seeds. PMID:24368787

  7. Meta-analysis of genome-wide association studies in five cohorts reveals common variants in RBFOX1, a regulator of tissue-specific splicing, associated with refractive error

    PubMed Central

    Stambolian, Dwight; Wojciechowski, Robert; Oexle, Konrad; Pirastu, Mario; Li, Xiaohui; Raffel, Leslie J.; Cotch, Mary Frances; Chew, Emily Y.; Klein, Barbara; Klein, Ronald; Wong, Tien Y.; Simpson, Claire L.; Klaver, Caroline C.W.; van Duijn, Cornelia M.; Verhoeven, Virginie J.M.; Baird, Paul N.; Vitart, Veronique; Paterson, Andrew D.; Mitchell, Paul; Saw, Seang Mei; Fossarello, Maurizio; Kazmierkiewicz, Krista; Murgia, Federico; Portas, Laura; Schache, Maria; Richardson, Andrea; Xie, Jing; Wang, Jie Jin; Rochtchina, Elena; Viswanathan, Ananth C.; Hayward, Caroline; Wright, Alan F.; Polašek, Ozren; Campbell, Harry; Rudan, Igor; Oostra, Ben A.; Uitterlinden, André G.; Hofman, Albert; Rivadeneira, Fernando; Amin, Najaf; Karssen, Lennart C.; Vingerling, Johannes R.; Hosseini, S.M.; Döring, Angela; Bettecken, Thomas; Vatavuk, Zoran; Gieger, Christian; Wichmann, H.-Erich; Wilson, James F.; Fleck, Brian; Foster, Paul J.; Topouzis, Fotis; McGuffin, Peter; Sim, Xueling; Inouye, Michael; Holliday, Elizabeth G.; Attia, John; Scott, Rodney J.; Rotter, Jerome I.; Meitinger, Thomas; Bailey-Wilson, Joan E.

    2013-01-01

    Visual refractive errors (REs) are complex genetic traits with a largely unknown etiology. To date, genome-wide association studies (GWASs) of moderate size have identified several novel risk markers for RE, measured here as mean spherical equivalent (MSE). We performed a GWAS using a total of 7280 samples from five cohorts: the Age-Related Eye Disease Study (AREDS); the KORA study (‘Cooperative Health Research in the Region of Augsburg’); the Framingham Eye Study (FES); the Ogliastra Genetic Park-Talana (OGP-Talana) Study and the Multiethnic Study of Atherosclerosis (MESA). Genotyping was performed on Illumina and Affymetrix platforms with additional markers imputed to the HapMap II reference panel. We identified a new genome-wide significant locus on chromosome 16 (rs10500355, P = 3.9 × 10^-9) in a combined discovery and replication set (26 953 samples). This single nucleotide polymorphism (SNP) is located within the RBFOX1 gene which is a neuron-specific splicing factor regulating a wide range of alternative splicing events implicated in neuronal development and maturation, including transcription factors, other splicing factors and synaptic proteins. PMID:23474815

  8. AUTOGSCAN: powerful tools for automated genome-wide linkage and linkage disequilibrium analysis.

    PubMed

    Hiekkalinna, Tero; Terwilliger, Joseph D; Sammalisto, Sampo; Peltonen, Leena; Perola, Markus

    2005-02-01

    Genome-wide linkage analysis using multiple traits and statistical software packages is a tedious process which requires a significant amount of manual file manipulation. Different linkage analysis programs require different input file formats, making the task of analyzing data with multiple methods even more time-consuming. We have developed a software tool, AUTOGSCAN, that automates file formatting, the running of statistical analyses, and the summarizing of resulting statistics for whole genome scans with a push of a button, using several independent, and often idiosyncratic, statistical software packages such as MERLIN, SOLAR and GENEHUNTER. We also describe a program, ANALYZE, designed to run qualitative linkage analysis with several different statistical strategies and programs to efficiently screen for linkage and linkage disequilibrium for a given discrete trait. The ANALYZE program can also be used by AUTOGSCAN in a genome-wide sense. PMID:15836805

  9. Genome-Wide Analysis of the Lysine Biosynthesis Pathway Network during Maize Seed Development

    PubMed Central

    Liu, Yuwei; Xie, Shaojun; Yu, Jingjuan

    2016-01-01

    Lysine is one of the most limiting essential amino acids for humans and livestock. The nutritional value of maize (Zea mays L.) is reduced by its poor lysine content. To better understand the lysine biosynthesis pathway in maize seed, we conducted a genome-wide analysis of the genes involved in lysine biosynthesis. We identified lysine biosynthesis pathway genes (LBPGs) and investigated whether a diaminopimelate pathway variant exists in maize. We analyzed two genes encoding the key enzyme dihydrodipicolinate synthase, and determined that they contribute differently to lysine synthesis during maize seed development. A coexpression network of LBPGs was constructed using RNA-sequencing data from 21 developmental stages of B73 maize seed. We found a large set of genes encoding ribosomal proteins, elongation factors and zein proteins that were coexpressed with LBPGs. The coexpressed genes were enriched in cellular metabolism terms and protein related terms. A phylogenetic analysis of the LBPGs from different plant species revealed different relationships. Additionally, six transcription factor (TF) families containing 13 TFs were identified as the Hub TFs of the LBPGs modules. Several expression quantitative trait loci of LBPGs were also identified. Our results should help to elucidate the lysine biosynthesis pathway network in maize seed. PMID:26829553

  10. Genome-wide analysis of FOXO3 mediated transcription regulation through RNA polymerase II profiling.

    PubMed

    Eijkelenboom, Astrid; Mokry, Michal; de Wit, Elzo; Smits, Lydia M; Polderman, Paulien E; van Triest, Miranda H; van Boxtel, Ruben; Schulze, Almut; de Laat, Wouter; Cuppen, Edwin; Burgering, Boudewijn M T

    2013-01-01

    Forkhead box O (FOXO) transcription factors are key players in diverse cellular processes affecting tumorigenesis, stem cell maintenance and lifespan. To gain insight into the mechanisms of FOXO-regulated target gene expression, we studied genome-wide effects of FOXO3 activation. Profiling RNA polymerase II changes shows that FOXO3 regulates gene expression through transcription initiation. Correlative analysis of FOXO3 and RNA polymerase II ChIP-seq profiles demonstrates FOXO3 to act as a transcriptional activator. Furthermore, this analysis reveals a significant part of FOXO3 gene regulation proceeds through enhancer regions. FOXO3 binds to pre-existing enhancers and further activates these enhancers as shown by changes in histone acetylation and RNA polymerase II recruitment. In addition, FOXO3-mediated enhancer activation correlates with regulation of adjacent genes and pre-existence of chromatin loops between FOXO3 bound enhancers and target genes. Combined, our data elucidate how FOXOs regulate gene transcription and provide insight into mechanisms by which FOXOs can induce different gene expression programs depending on chromatin architecture. PMID:23340844

  11. Genome-wide analysis and expression profiling of the phospholipase D gene family in Gossypium arboreum.

    PubMed

    Tang, Kai; Dong, Chunjuan; Liu, Jinyuan

    2016-02-01

    The plant phospholipase D (PLD) plays versatile roles in multiple aspects of plant growth, development, and stress responses. However, until now, our knowledge concerning the PLD gene family members and their expression patterns in cotton has been limited. In this study, we performed for the first time a genome-wide analysis and expression profiling of the PLD gene family in Gossypium arboreum, and a total of 19 non-redundant PLD genes (GaPLDs) were identified. Based on the phylogenetic analysis, they were divided into six well-supported clades (α, β/γ, δ, ε, ζ and φ). Most of the GaPLD genes within the same clade showed similar exon-intron organization and highly conserved motif structures. Additionally, the chromosomal distribution pattern revealed that GaPLD genes were unevenly distributed across 10 of the 13 cotton chromosomes. Segmental duplication is the major contributor to the expansion of the GaPLD gene family and is estimated to have occurred from 19.61 to 20.44 million years ago, when a recent large-scale genome duplication occurred in cotton. Moreover, the expression profiling reveals the functional divergence of GaPLD genes in cotton and sheds new light on the molecular mechanisms of GaPLDα1 and GaPLDδ2 in fiber development. PMID:26718354

  12. Genome-wide association study reveals novel variants for growth and egg traits in Dongxiang blue-shelled and White Leghorn chickens.

    PubMed

    Liao, R; Zhang, X; Chen, Q; Wang, Z; Wang, Q; Yang, C; Pan, Y

    2016-10-01

    This study was designed to investigate the genetic basis of growth and egg traits in Dongxiang blue-shelled chickens and White Leghorn chickens. In this study, we employed a reduced representation sequencing approach called genotyping by genome reducing and sequencing to detect genome-wide SNPs in 252 Dongxiang blue-shelled chickens and 252 White Leghorn chickens. The Dongxiang blue-shelled chicken breed has many specific traits and is characterized by blue-shelled eggs, black plumage, black skin, black bone and black organs. The White Leghorn chicken is an egg-type breed with high productivity. As multibreed genome-wide association studies (GWASs) can improve precision due to less linkage disequilibrium across breeds, a multibreed GWAS was performed with 156 575 SNPs to identify the associated variants underlying growth and egg traits within the two chicken breeds. The analysis revealed 32 SNPs exhibiting a significant genome-wide association with growth and egg traits. Some of the significant SNPs are located in genes that are known to impact growth and egg traits, but nearly half of the significant SNPs are located in genes with unclear functions in chickens. To our knowledge, this is the first multibreed genome-wide report for the genetics of growth and egg traits in the Dongxiang blue-shelled and White Leghorn chickens. PMID:27166871

  13. Genome-wide meta-analysis identifies five new susceptibility loci for cutaneous malignant melanoma.

    PubMed

    Law, Matthew H; Bishop, D Timothy; Lee, Jeffrey E; Brossard, Myriam; Martin, Nicholas G; Moses, Eric K; Song, Fengju; Barrett, Jennifer H; Kumar, Rajiv; Easton, Douglas F; Pharoah, Paul D P; Swerdlow, Anthony J; Kypreou, Katerina P; Taylor, John C; Harland, Mark; Randerson-Moor, Juliette; Akslen, Lars A; Andresen, Per A; Avril, Marie-Françoise; Azizi, Esther; Scarrà, Giovanna Bianchi; Brown, Kevin M; Dȩbniak, Tadeusz; Duffy, David L; Elder, David E; Fang, Shenying; Friedman, Eitan; Galan, Pilar; Ghiorzo, Paola; Gillanders, Elizabeth M; Goldstein, Alisa M; Gruis, Nelleke A; Hansson, Johan; Helsing, Per; Hočevar, Marko; Höiom, Veronica; Ingvar, Christian; Kanetsky, Peter A; Chen, Wei V; Landi, Maria Teresa; Lang, Julie; Lathrop, G Mark; Lubiński, Jan; Mackie, Rona M; Mann, Graham J; Molven, Anders; Montgomery, Grant W; Novaković, Srdjan; Olsson, Håkan; Puig, Susana; Puig-Butille, Joan Anton; Qureshi, Abrar A; Radford-Smith, Graham L; van der Stoep, Nienke; van Doorn, Remco; Whiteman, David C; Craig, Jamie E; Schadendorf, Dirk; Simms, Lisa A; Burdon, Kathryn P; Nyholt, Dale R; Pooley, Karen A; Orr, Nick; Stratigos, Alexander J; Cust, Anne E; Ward, Sarah V; Hayward, Nicholas K; Han, Jiali; Schulze, Hans-Joachim; Dunning, Alison M; Bishop, Julia A Newton; Demenais, Florence; Amos, Christopher I; MacGregor, Stuart; Iles, Mark M

    2015-09-01

    Thirteen common susceptibility loci have been reproducibly associated with cutaneous malignant melanoma (CMM). We report the results of an international 2-stage meta-analysis of CMM genome-wide association studies (GWAS). This meta-analysis combines 11 GWAS (5 previously unpublished) and a further three stage 2 data sets, totaling 15,990 CMM cases and 26,409 controls. Five loci not previously associated with CMM risk reached genome-wide significance (P < 5 × 10^-8), as did 2 previously reported but unreplicated loci and all 13 established loci. Newly associated SNPs fall within putative melanocyte regulatory elements, and bioinformatic and expression quantitative trait locus (eQTL) data highlight candidate genes in the associated regions, including one involved in telomere biology. PMID:26237428

  14. Novel R tools for analysis of genome-wide population genetic data with emphasis on clonality

    PubMed Central

    Kamvar, Zhian N.; Brooks, Jonah C.; Grünwald, Niklaus J.

    2015-01-01

    To gain a detailed understanding of how plant microbes evolve and adapt to hosts, pesticides, and other factors, knowledge of the population dynamics and evolutionary history of populations is crucial. Plant pathogen populations are often clonal or partially clonal which requires different analytical tools. With the advent of high throughput sequencing technologies, obtaining genome-wide population genetic data has become easier than ever before. We previously contributed the R package poppr specifically addressing issues with analysis of clonal populations. In this paper we provide several significant extensions to poppr with a focus on large, genome-wide SNP data. Specifically, we provide several new functionalities including the new function mlg.filter to define clone boundaries allowing for inspection and definition of what is a clonal lineage, minimum spanning networks with reticulation, a sliding-window analysis of the index of association, modular bootstrapping of any genetic distance, and analyses across any level of hierarchies. PMID:26113860
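
    The idea of defining clone boundaries by genetic distance can be sketched in a few lines. The example below is not poppr code and does not reproduce `mlg.filter`; it is a hypothetical Python illustration that assumes genotypes are encoded as tuples of allele codes, uses a simple count of differing loci as the distance, and merges genotypes with a single-linkage rule chosen here only for simplicity. The names `hamming` and `collapse_genotypes` are made up for this sketch.

```python
from itertools import combinations


def hamming(a, b):
    """Number of loci at which two genotype tuples differ."""
    return sum(x != y for x, y in zip(a, b))


def collapse_genotypes(genotypes, threshold):
    """Group genotypes whose pairwise distance is <= threshold (single linkage).

    Returns one lineage label per input genotype, numbered in order of
    first appearance.
    """
    parent = list(range(len(genotypes)))

    def find(i):
        # Follow parent links to the group representative, compressing the path
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(genotypes)), 2):
        if hamming(genotypes[i], genotypes[j]) <= threshold:
            parent[find(j)] = find(i)

    roots = {}
    return [roots.setdefault(find(i), len(roots)) for i in range(len(genotypes))]


# Hypothetical data: five samples scored at five loci
samples = [
    (0, 1, 2, 0, 1),
    (0, 1, 2, 0, 2),
    (0, 1, 2, 1, 2),
    (2, 2, 0, 1, 0),
    (2, 2, 0, 1, 0),
]

strict = collapse_genotypes(samples, threshold=0)   # only identical genotypes merge
relaxed = collapse_genotypes(samples, threshold=1)  # one mismatch is tolerated
print(strict)   # [0, 1, 2, 3, 3]
print(relaxed)  # [0, 0, 0, 1, 1]
```

    Raising the threshold merges near-identical genotypes into one lineage, which is the kind of inspection the abstract describes; the choice of distance and linkage rule strongly affects the result, so both should be treated as assumptions of this sketch.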

  15. Genome-wide meta-analysis identifies five new susceptibility loci for cutaneous malignant melanoma

    PubMed Central

    Law, Matthew H.; Bishop, D. Timothy; Martin, Nicholas G.; Moses, Eric K.; Song, Fengju; Barrett, Jennifer H.; Kumar, Rajiv; Easton, Douglas F.; Pharoah, Paul D. P.; Swerdlow, Anthony J.; Kypreou, Katerina P.; Taylor, John C.; Harland, Mark; Randerson-Moor, Juliette; Akslen, Lars A.; Andresen, Per A.; Avril, Marie-Françoise; Azizi, Esther; Scarrà, Giovanna Bianchi; Brown, Kevin M.; Dębniak, Tadeusz; Duffy, David L.; Elder, David E.; Fang, Shenying; Friedman, Eitan; Galan, Pilar; Ghiorzo, Paola; Gillanders, Elizabeth M.; Goldstein, Alisa M.; Gruis, Nelleke A.; Hansson, Johan; Helsing, Per; Hočevar, Marko; Höiom, Veronica; Ingvar, Christian; Kanetsky, Peter A.; Chen, Wei V.; Landi, Maria Teresa; Lang, Julie; Lathrop, G. Mark; Lubiński, Jan; Mackie, Rona M.; Mann, Graham J.; Molven, Anders; Montgomery, Grant W.; Novaković, Srdjan; Olsson, Håkan; Puig, Susana; Puig-Butille, Joan Anton; Qureshi, Abrar A.; Radford-Smith, Graham L.; van der Stoep, Nienke; van Doorn, Remco; Whiteman, David C.; Craig, Jamie E.; Schadendorf, Dirk; Simms, Lisa A.; Burdon, Kathryn P.; Nyholt, Dale R.; Pooley, Karen A.; Orr, Nick; Stratigos, Alexander J.; Cust, Anne E.; Ward, Sarah V.; Hayward, Nicholas K.; Han, Jiali; Schulze, Hans-Joachim; Dunning, Alison M.; Bishop, Julia A. Newton; MacGregor, Stuart; Iles, Mark M.

    2015-01-01

    Thirteen common susceptibility loci have been reproducibly associated with cutaneous malignant melanoma (CMM). We report the results of an international 2-stage meta-analysis of CMM genome-wide association studies (GWAS). This meta-analysis combines 11 GWAS (5 previously unpublished) and a further three stage 2 data sets, totaling 15,990 CMM cases and 26,409 controls. Five loci not previously associated with CMM risk reached genome-wide significance (P < 5 × 10⁻⁸), as did two previously-reported but un-replicated loci and all thirteen established loci. Novel SNPs fall within putative melanocyte regulatory elements, and bioinformatic and expression quantitative trait locus (eQTL) data highlight candidate genes including one involved in telomere biology. PMID:26237428
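    The genome-wide significance threshold quoted in this abstract is conventionally explained as a Bonferroni correction of a 0.05 family-wise error rate over roughly one million independent common-variant tests. The sketch below reproduces that arithmetic; the count of one million tests is the usual rule-of-thumb assumption, not a figure taken from the paper, and the helper name is hypothetical.

```python
import math

# Assumption: about one million independent common-variant tests,
# the usual rationale given for the conventional GWAS threshold.
family_wise_alpha = 0.05
independent_tests = 1_000_000

# Bonferroni correction: divide the family-wise error rate by the test count.
genome_wide_threshold = family_wise_alpha / independent_tests


def is_genome_wide_significant(p_value, threshold=genome_wide_threshold):
    """Return True when a p-value falls below the genome-wide threshold."""
    return p_value < threshold


print(f"threshold = {genome_wide_threshold:.1e}")
```

    A locus with P = 1e-9 would pass this check, while one with P = 1e-6 would not.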

  16. Novel R tools for analysis of genome-wide population genetic data with emphasis on clonality.

    PubMed

    Kamvar, Zhian N; Brooks, Jonah C; Grünwald, Niklaus J

    2015-01-01

    To gain a detailed understanding of how plant microbes evolve and adapt to hosts, pesticides, and other factors, knowledge of the population dynamics and evolutionary history of populations is crucial. Plant pathogen populations are often clonal or partially clonal which requires different analytical tools. With the advent of high throughput sequencing technologies, obtaining genome-wide population genetic data has become easier than ever before. We previously contributed the R package poppr specifically addressing issues with analysis of clonal populations. In this paper we provide several significant extensions to poppr with a focus on large, genome-wide SNP data. Specifically, we provide several new functionalities including the new function mlg.filter to define clone boundaries allowing for inspection and definition of what is a clonal lineage, minimum spanning networks with reticulation, a sliding-window analysis of the index of association, modular bootstrapping of any genetic distance, and analyses across any level of hierarchies. PMID:26113860

  17. Genome-Wide Association Study Reveals Multiple Loci Influencing Normal Human Facial Morphology

    PubMed Central

    Raffensperger, Zachary D.; Heike, Carrie L.; Cunningham, Michael L.; Hecht, Jacqueline T.; Kau, Chung How; Moreno, Lina M.; Wehby, George L.; Murray, Jeffrey C.; Laurie, Cecelia A.; Laurie, Cathy C.; Santorico, Stephanie; Klein, Ophir; Feingold, Eleanor; Hallgrimsson, Benedikt; Spritz, Richard A.; Marazita, Mary L.; Weinberg, Seth M.

    2016-01-01

    Numerous lines of evidence point to a genetic basis for facial morphology in humans, yet little is known about how specific genetic variants relate to the phenotypic expression of many common facial features. We conducted genome-wide association meta-analyses of 20 quantitative facial measurements derived from the 3D surface images of 3118 healthy individuals of European ancestry belonging to two US cohorts. Analyses were performed on just under one million genotyped SNPs (Illumina OmniExpress+Exome v1.2 array) imputed to the 1000 Genomes reference panel (Phase 3). We observed genome-wide significant associations (p < 5 × 10⁻⁸) for cranial base width at 14q21.1 and 20q12, intercanthal width at 1p13.3 and Xq13.2, nasal width at 20p11.22, nasal ala length at 14q11.2, and upper facial depth at 11q22.1. Several genes in the associated regions are known to play roles in craniofacial development or in syndromes affecting the face: MAFB, PAX9, MIPOL1, ALX3, HDAC8, and PAX1. We also tested genotype-phenotype associations reported in two previous genome-wide studies and found evidence of replication for nasal ala length and SNPs in CACNA2D3 and PRDM16. These results provide further evidence that common variants in regions harboring genes of known craniofacial function contribute to normal variation in human facial features. Improved understanding of the genes associated with facial morphology in healthy individuals can provide insights into the pathways and mechanisms controlling normal and abnormal facial morphogenesis. PMID:27560520

  18. Genome-Wide Association Study Reveals Multiple Loci Influencing Normal Human Facial Morphology.

    PubMed

    Shaffer, John R; Orlova, Ekaterina; Lee, Myoung Keun; Leslie, Elizabeth J; Raffensperger, Zachary D; Heike, Carrie L; Cunningham, Michael L; Hecht, Jacqueline T; Kau, Chung How; Nidey, Nichole L; Moreno, Lina M; Wehby, George L; Murray, Jeffrey C; Laurie, Cecelia A; Laurie, Cathy C; Cole, Joanne; Ferrara, Tracey; Santorico, Stephanie; Klein, Ophir; Mio, Washington; Feingold, Eleanor; Hallgrimsson, Benedikt; Spritz, Richard A; Marazita, Mary L; Weinberg, Seth M

    2016-08-01

    Numerous lines of evidence point to a genetic basis for facial morphology in humans, yet little is known about how specific genetic variants relate to the phenotypic expression of many common facial features. We conducted genome-wide association meta-analyses of 20 quantitative facial measurements derived from the 3D surface images of 3118 healthy individuals of European ancestry belonging to two US cohorts. Analyses were performed on just under one million genotyped SNPs (Illumina OmniExpress+Exome v1.2 array) imputed to the 1000 Genomes reference panel (Phase 3). We observed genome-wide significant associations (p < 5 × 10⁻⁸) for cranial base width at 14q21.1 and 20q12, intercanthal width at 1p13.3 and Xq13.2, nasal width at 20p11.22, nasal ala length at 14q11.2, and upper facial depth at 11q22.1. Several genes in the associated regions are known to play roles in craniofacial development or in syndromes affecting the face: MAFB, PAX9, MIPOL1, ALX3, HDAC8, and PAX1. We also tested genotype-phenotype associations reported in two previous genome-wide studies and found evidence of replication for nasal ala length and SNPs in CACNA2D3 and PRDM16. These results provide further evidence that common variants in regions harboring genes of known craniofacial function contribute to normal variation in human facial features. Improved understanding of the genes associated with facial morphology in healthy individuals can provide insights into the pathways and mechanisms controlling normal and abnormal facial morphogenesis. PMID:27560520

  19. Genome-Wide Identification and Expression Analysis of WRKY Gene Family in Capsicum annuum L.

    PubMed Central

    Diao, Wei-Ping; Snyder, John C.; Wang, Shu-Bin; Liu, Jin-Bing; Pan, Bao-Gui; Guo, Guang-Jun; Wei, Ge

    2016-01-01

    The WRKY family of transcription factors is one of the most important families of plant transcriptional regulators, with members regulating multiple biological processes, especially defense against biotic and abiotic stresses. However, little information is available about WRKYs in pepper (Capsicum annuum L.). The recent release of completely assembled genome sequences of pepper allowed us to perform a genome-wide investigation of pepper WRKY proteins. In the present study, a total of 71 WRKY genes were identified in the pepper genome. According to structural features of their encoded proteins, the pepper WRKY genes (CaWRKY) were classified into three main groups, with the second group further divided into five subgroups. Genome mapping analysis revealed that CaWRKY genes were enriched on four chromosomes, especially on chromosome 1, and 15.5% of the family members were tandemly duplicated genes. A phylogenetic tree was constructed based on WRKY domain sequences derived from pepper and Arabidopsis. The expression of 21 selected CaWRKY genes in response to seven different biotic and abiotic stresses (salt, heat shock, drought, Phytophthora capsici, SA, MeJA, and ABA) was evaluated by quantitative RT-PCR; some CaWRKYs were highly expressed and up-regulated by stress treatment. Our results will provide a platform for functional identification and molecular breeding studies of WRKY genes in pepper. PMID:26941768

  20. Integrated genome-wide analysis of genomic changes and gene regulation in human adrenocortical tissue samples

    PubMed Central

    Gara, Sudheer Kumar; Wang, Yonghong; Patel, Dhaval; Liu-Chittenden, Yi; Jain, Meenu; Boufraqech, Myriem; Zhang, Lisa; Meltzer, Paul S.; Kebebew, Electron

    2015-01-01

    To gain insight into the pathogenesis of adrenocortical carcinoma (ACC) and whether there is progression from normal-to-adenoma-to-carcinoma, we performed genome-wide gene expression, gene methylation, microRNA expression and comparative genomic hybridization (CGH) analysis in human adrenocortical tissue (normal, adrenocortical adenomas and ACC) samples. A pairwise comparison of normal, adrenocortical adenoma and ACC gene expression profiles with more than four-fold expression differences and an adjusted P-value < 0.05 revealed no major differences in normal versus adrenocortical adenoma, whereas there are 808 and 1085 dysregulated genes, respectively, between ACC versus adrenocortical adenoma and ACC versus normal. The majority of the dysregulated genes in ACC were downregulated. By integrating the CGH, gene methylation and expression profiles of potential miRNAs with the gene expression of dysregulated genes, we found that there are more alterations in ACC versus normal compared to ACC versus adrenocortical adenoma. Importantly, we identified several novel molecular pathways that are associated with dysregulated genes and further experimentally validated that oncostatin M signaling induces caspase 3 dependent apoptosis and suppresses cell proliferation. Finally, we propose that there is a higher number of genomic changes from normal-to-adenoma-to-carcinoma and identify oncostatin M signaling as a plausible druggable pathway for therapeutics. PMID:26446994
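    The selection rule stated in this abstract (more than four-fold expression difference and adjusted P-value < 0.05) can be sketched as follows. The abstract does not name the adjustment method, so Benjamini-Hochberg is used here purely as an assumption; the function names and the toy values in the usage note are hypothetical and are not data from the study.

```python
import math


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def dysregulated(genes, fold_changes, p_values, min_fold=4.0, alpha=0.05):
    """Keep genes changed more than min_fold (up or down) with adjusted p < alpha.

    fold_changes are expression ratios (e.g. tumour / normal), so a value of
    0.1 counts as a ten-fold decrease.
    """
    adjusted = benjamini_hochberg(p_values)
    log_threshold = math.log2(min_fold)
    return [
        gene
        for gene, ratio, q in zip(genes, fold_changes, adjusted)
        if abs(math.log2(ratio)) > log_threshold and q < alpha
    ]
```

    For example, with hypothetical genes g1 to g4, ratios 8.0, 0.1, 2.0 and 0.5, and raw p-values 0.01, 0.04, 0.03 and 0.005, only g1 and g2 pass both filters.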

  1. Genome-Wide Expression Analysis in Down Syndrome: Insight into Immunodeficiency

    PubMed Central

    Li, Chong; Jin, Lei; Bai, Yun; Chen, Qimin; Fu, Lijun; Yang, Minjun; Xiao, Huasheng; Zhao, Guoping; Wang, Shengyue

    2012-01-01

    Down syndrome (DS) is caused by triplication of Human chromosome 21 (Hsa21) and associated with an array of deleterious phenotypes, including mental retardation, heart defects and immunodeficiency. Genome-wide expression patterns of uncultured peripheral blood cells are useful for understanding DS-associated immune dysfunction. We used a Human Exon microarray to characterize gene expression in uncultured peripheral blood cells derived from DS individuals and age-matched controls from two age groups: neonate (N) and child (C). A total of 174 transcript clusters (gene-level), eight of them located on Hsa21, in the N group and 383 transcript clusters, including 56 on Hsa21, in the C group were significantly dysregulated in DS individuals. Microarray data were validated by quantitative polymerase chain reaction. Functional analysis revealed that the dysregulated genes in DS were significantly enriched in two and six KEGG pathways in the N and C groups, respectively. These pathways included leukocyte trans-endothelial migration, B cell receptor signaling pathway and primary immunodeficiency, etc., which causally implicated dysfunctional immunity in DS. Our results provided a comprehensive picture of gene expression patterns in DS at the two developmental stages and pointed towards candidate genes and molecular pathways potentially associated with the immune dysfunction in DS. PMID:23155455

  2. Genome-wide analysis suggests divergent evolution of lipid phosphotases/phosphotransferase genes in plants.

    PubMed

    Wang, Peng; Chen, Zhenxi; Kasimu, Rena; Chen, Yinhua; Zhang, Xiaoxiao; Gai, Jiangtao

    2016-08-01

    Genes of the LPPT (lipid phosphatase/phosphotransferase) family play important roles in lipid phosphorus transfer and triacylglycerol accumulation in plants. To provide overviews of the plant LPPT family and their overall relationships, here we carried out genome-wide identifications and analyses of plant LPPT family members. A total of 643 putative LPPT genes were identified from 48 sequenced plant genomes, among which 205 genes from 14 plants were chosen for further analyses. Plant LPPT genes belonged to three distinctive groups, namely the LPT (lipid phosphotransferase), LPP (lipid phosphatase), and pLPP (plastidic lipid phosphotransferase) groups. Genes of the LPT group could be further partitioned into three groups, two of which were only identified in terrestrial plants. Genes in the LPP and pLPP groups experienced duplications in early stages of plant evolution. Among 17 Zea mays LPPT genes, divergence of temporal-spatial expression patterns was revealed based on microarray data analysis. Peptide sequences of plant LPPT genes harbored different conserved motifs. A test of Branch Model versus One-ratio Model did not support significant selective pressures acting on different groups of LPPT genes, although quite different nonsynonymous evolutionary rates and selective pressures were observed. The complete picture of the plant LPPT family provided here should facilitate further investigations of plant LPPT genes and offer a better understanding of lipid biosynthesis in plants. PMID:27501416

  3. Genome-wide analysis of high risk human papillomavirus E2 proteins in human primary keratinocytes.

    PubMed

    Sunthamala, Nuchsupha; Pang, Chai Ling; Thierry, Francoise; Teissier, Sebastien; Pientong, Chamsai; Ekalaksananan, Tipaya

    2014-12-01

    The E2 protein is expressed in the early stage of human papillomavirus (HPV) infection that is associated with cervical lesions. This protein plays important roles in regulation of viral replication and transcription. To characterize the role of E2 protein in modulation of cellular gene expression in HPV infected cells, genome-wide expression profiling of human primary keratinocytes (HPK) harboring HPV16 E2 and HPV18 E2 was investigated using microarray. Principal Component Analysis (PCA) revealed that the expression data of HPV16 E2 and HPV18 E2-transduced HPKs were rather closely clustered. The Venn diagram of modulated genes showed an overlap of 10 common genes in HPV16 E2 expressing HPK and HPV18 E2 expressing HPK. These genes were expressed at significantly different levels compared with control cells. In addition, distinct sets of 14 and 34 modulated genes were detected in HPV16 E2 and HPV18 E2 expressing HPKs, respectively. PMID:26484085
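    The Venn-diagram comparison described in this abstract amounts to set intersection and set difference on two lists of modulated genes. A minimal sketch follows; the gene identifiers are placeholders invented so that the set sizes match the counts reported above (10 shared, 14 and 34 distinct), and they are not the genes from the study.

```python
# Placeholder identifiers only; the real gene lists are not given in the abstract.
shared = {f"shared_{i}" for i in range(10)}
hpv16_modulated = shared | {f"only16_{i}" for i in range(14)}
hpv18_modulated = shared | {f"only18_{i}" for i in range(34)}

# Overlap region of the Venn diagram.
common_genes = hpv16_modulated & hpv18_modulated

# Genes modulated by only one of the two E2 proteins.
hpv16_only = hpv16_modulated - hpv18_modulated
hpv18_only = hpv18_modulated - hpv16_modulated

print(len(common_genes), len(hpv16_only), len(hpv18_only))
```

    With real data, the two input sets would come from the differential-expression calls for each E2-transduced sample against control cells.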

  4. Genome-wide association analysis of age at onset and psychotic symptoms in bipolar disorder

    PubMed Central

    Mahon, Pamela Belmonte; Pirooznia, Mehdi; Goes, Fernando S.; Seifuddin, Fayaz; Steele, Jo; Lee, Phil Hyoun; Huang, Jie; Hamshere, Marian; DePaulo, J. Raymond; Kelsoe, John R.; Rietschel, Marcella; Nöthen, Markus; Cichon, Sven; Gurling, Hugh; Purcell, Shaun; Smoller, Jordan W.; Craddock, Nick; Schulze, Thomas G.; McMahon, Francis J.; Potash, James B.; Zandi, Peter P.

    2011-01-01

    Genome-wide association studies (GWAS) have identified several susceptibility loci for bipolar disorder (BP), most notably ANK3. However, most of the inherited risk for BP remains unexplained. One reason for the limited success may be the genetic heterogeneity of BP. Clinical sub-phenotypes of BP may identify more etiologically homogeneous subsets of patients, which can be studied with increased power to detect genetic variation. Here, we report on a mega-analysis of two widely studied sub-phenotypes of BP, age at onset and psychotic symptoms, which are familial and clinically significant. We combined data from three GWAS: NIMH Bipolar Disorder Genetic Association Information Network (GAIN-BP), NIMH Bipolar Disorder Genome Study (BiGS), and a German sample. The combined sample consisted of 2836 BP cases with information on sub-phenotypes and 2744 controls. Imputation was performed, resulting in 2.3 million SNPs available for analysis. No SNP reached genome-wide significance for either sub-phenotype. In addition, no SNP reached genome-wide significance in a meta-analysis with an independent replication sample. We had 80% power to detect associations with a common SNP at an OR of 1.6 for psychotic symptoms and a mean difference of 1.8 years in age at onset. Age at onset and psychotic symptoms in BP may be influenced by many genes of smaller effect sizes or other variants not measured well by SNP arrays, such as rare alleles. PMID:21305692

  5. Meta-analysis of genome-wide association studies of attention deficit/hyperactivity disorder

    PubMed Central

    Neale, Benjamin M; Medland, Sarah E.; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V.; Nguyen, Thuy Trang; Schäfer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J.L.; Langely, Kate; O’Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P.; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D.; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan

    2010-01-01

    Objective: Although twin and family studies have shown Attention Deficit/Hyperactivity Disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association scans (GWAS) have not yielded significant results, we conducted a meta-analysis of existing studies to boost statistical power. Method: We used data from four projects: a) the Children’s Hospital of Philadelphia (CHOP), b) phase I of the International Multicenter ADHD Genetics project (IMAGE), c) phase II of IMAGE (IMAGE II), and d) the Pfizer funded study from the University of California, Los Angeles, Washington University and the Massachusetts General Hospital (PUWMa). The final sample size consisted of 2,064 trios, 896 cases and 2,455 controls. For each study, we imputed HapMap SNPs, computed association test statistics and transformed them to Z-scores, and then combined weighted Z-scores in a meta-analysis. Results: No genome-wide significant associations were found, although an analysis of candidate genes suggests they may be involved in the disorder. Conclusions: Given that ADHD is a highly heritable disorder, our negative results suggest that the effects of common ADHD risk variants must, individually, be very small or that other types of variants, e.g. rare ones, account for much of the disorder’s heritability. PMID:20732625
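    The combination step named in this abstract, a weighted Z-score meta-analysis, can be sketched as below. The abstract does not state the weights; weighting each study by the square root of its sample size is a common convention and is an assumption here, as are the function names and the toy numbers.

```python
import math


def combine_weighted_z(z_scores, sample_sizes):
    """Combine per-study Z-scores using sqrt(sample size) weights.

    Assumption: weights are sqrt(n); the abstract does not specify them.
    """
    weights = [math.sqrt(n) for n in sample_sizes]
    numerator = sum(w * z for w, z in zip(weights, z_scores))
    denominator = math.sqrt(sum(w * w for w in weights))
    return numerator / denominator


def two_sided_p(z):
    """Two-sided p-value for a standard normal Z statistic."""
    return math.erfc(abs(z) / math.sqrt(2))


# Hypothetical Z-scores for one SNP across four studies (not data from the paper).
study_z = [1.2, 0.8, 2.1, -0.3]
study_n = [1000, 2000, 1500, 500]  # hypothetical effective sample sizes

meta_z = combine_weighted_z(study_z, study_n)
meta_p = two_sided_p(meta_z)
print(f"meta Z = {meta_z:.3f}, two-sided p = {meta_p:.4f}")
```

    Dividing by the square root of the summed squared weights keeps the combined statistic on the standard normal scale under the null, so it can be compared directly with a genome-wide threshold.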

  6. Genome-wide polysomal analysis of a yeast strain with mutated ribosomal protein S9

    PubMed Central

    Pnueli, Lilach; Arava, Yoav

    2007-01-01

    Background: The yeast ribosomal protein S9 (S9) is located at the entrance tunnel of the mRNA into the ribosome. It is known to play a role in accurate decoding and its bacterial homolog (S4) has recently been shown to be involved in opening RNA duplexes. Here we examined the effects of changing the C terminus of S9, which is rich in acidic amino acids and extends out of the ribosome surface. Results: We performed a genome-wide analysis to reveal effects at the transcription and translation levels of all yeast genes. While negligible relative changes were observed in steady-state mRNA levels, a significant number of mRNAs appeared to have altered ribosomal density. Notably, 40% of the genes having reliable signals changed their ribosomal association by more than one ribosome. Yet, no general correlations with physical or functional features of the mRNA were observed. Ribosome Density Mapping (RDM) along four of the mRNAs with increased association revealed an increase in ribosomal density towards the end of the coding region for at least two of them. Read-through analysis did not reveal any increase in read-through of a premature stop codon by the mutant strain. Conclusion: The ribosomal protein rpS9 appears to be involved in the translation of many mRNAs, since altering its C terminus led to a significant change in ribosomal association of many mRNAs. We did not find strong correlations between these changes and several physical features of the mRNA, yet future studies with advanced tools may allow such correlations to be determined. Importantly, our results indicate an accumulation of ribosomes towards the end of the coding regions of some mRNAs. This suggests an involvement of S9 in ribosomal dissociation during translation termination. PMID:17711575

  7. Genome-wide association analysis identifies six new loci associated with forced vital capacity

    PubMed Central

    Loth, Daan W.; Artigas, María Soler; Gharib, Sina A.; Wain, Louise V.; Franceschini, Nora; Koch, Beate; Pottinger, Tess; Smith, Albert Vernon; Duan, Qing; Oldmeadow, Chris; Lee, Mi Kyeong; Strachan, David P.; James, Alan L.; Huffman, Jennifer E.; Vitart, Veronique; Ramasamy, Adaikalavan; Wareham, Nicholas J.; Kaprio, Jaakko; Wang, Xin-Qun; Trochet, Holly; Kähönen, Mika; Flexeder, Claudia; Albrecht, Eva; Lopez, Lorna M.; de Jong, Kim; Thyagarajan, Bharat; Alves, Alexessander Couto; Enroth, Stefan; Omenaas, Ernst; Joshi, Peter K.; Fall, Tove; Viñuela, Ana; Launer, Lenore J.; Loehr, Laura R.; Fornage, Myriam; Li, Guo; Wilk, Jemma B.; Tang, Wenbo; Manichaikul, Ani; Lahousse, Lies; Harris, Tamara B.; North, Kari E.; Rudnicka, Alicja R.; Hui, Jennie; Gu, Xiangjun; Lumley, Thomas; Wright, Alan F.; Hastie, Nicholas D.; Campbell, Susan; Kumar, Rajesh; Pin, Isabelle; Scott, Robert A.; Pietiläinen, Kirsi H.; Surakka, Ida; Liu, Yongmei; Holliday, Elizabeth G.; Schulz, Holger; Heinrich, Joachim; Davies, Gail; Vonk, Judith M.; Wojczynski, Mary; Pouta, Anneli; Johansson, Åsa; Wild, Sarah H.; Ingelsson, Erik; Rivadeneira, Fernando; Völzke, Henry; Hysi, Pirro G.; Eiriksdottir, Gudny; Morrison, Alanna C.; Rotter, Jerome I.; Gao, Wei; Postma, Dirkje S.; White, Wendy B.; Rich, Stephen S.; Hofman, Albert; Aspelund, Thor; Couper, David; Smith, Lewis J.; Psaty, Bruce M.; Lohman, Kurt; Burchard, Esteban G.; Uitterlinden, André G.; Garcia, Melissa; Joubert, Bonnie R.; McArdle, Wendy L.; Musk, A. Bill; Hansel, Nadia; Heckbert, Susan R.; Zgaga, Lina; van Meurs, Joyce B.J.; Navarro, Pau; Rudan, Igor; Oh, Yeon-Mok; Redline, Susan; Jarvis, Deborah; Zhao, Jing Hua; Rantanen, Taina; O’Connor, George T.; Ripatti, Samuli; Scott, Rodney J.; Karrasch, Stefan; Grallert, Harald; Gaddis, Nathan C.; Starr, John M.; Wijmenga, Cisca; Minster, Ryan L.; Lederer, David J.; Pekkanen, Juha; Gyllensten, Ulf; Campbell, Harry; Morris, Andrew P.; Gläser, Sven; Hammond, Christopher J.; Burkart, Kristin M.; Beilby, John; Kritchevsky, Stephen B.; Gudnason, Vilmundur; Hancock, Dana B.; Williams, O. Dale; Polasek, Ozren; Zemunik, Tatijana; Kolcic, Ivana; Petrini, Marcy F.; Wjst, Matthias; Kim, Woo Jin; Porteous, David J.; Scotland, Generation; Smith, Blair H.; Viljanen, Anne; Heliövaara, Markku; Attia, John R.; Sayers, Ian; Hampel, Regina; Gieger, Christian; Deary, Ian J.; Boezen, H. Marike; Newman, Anne; Jarvelin, Marjo-Riitta; Wilson, James F.; Lind, Lars; Stricker, Bruno H.; Teumer, Alexander; Spector, Timothy D.; Melén, Erik; Peters, Marjolein J.; Lange, Leslie A.; Barr, R. Graham; Bracke, Ken R.; Verhamme, Fien M.; Sung, Joohon; Hiemstra, Pieter S.; Cassano, Patricia A.; Sood, Akshay; Hayward, Caroline; Dupuis, Josée; Hall, Ian P.; Brusselle, Guy G.; Tobin, Martin D.; London, Stephanie J.

    2014-01-01

    Forced vital capacity (FVC), a spirometric measure of pulmonary function, reflects lung volume and is used to diagnose and monitor lung diseases. We performed genome-wide association study meta-analysis of FVC in 52,253 individuals from 26 studies and followed up the top associations in 32,917 additional individuals of European ancestry. We found six new regions associated at genome-wide significance (P < 5 × 10⁻⁸) with FVC in or near EFEMP1, BMP6, MIR-129-2/HSD17B12, PRDM11, WWOX, and KCNJ2. Two (GSTCD and PTCH1) loci previously associated with spirometric measures were related to FVC. Newly implicated regions were followed-up in samples of African American, Korean, Chinese, and Hispanic individuals. We detected transcripts for all six newly implicated genes in human lung tissue. The new loci may inform mechanisms involved in lung development and pathogenesis of restrictive lung disease. PMID:24929828

  8. Genome-wide analysis of microRNA and mRNA expression signatures in cancer

    PubMed Central

    Li, Ming-hui; Fu, Sheng-bo; Xiao, Hua-sheng

    2015-01-01

    Cancer is an extremely diverse and complex disease that results from various genetic and epigenetic changes such as DNA copy-number variations, mutations, and aberrant mRNA and/or protein expression caused by abnormal transcriptional regulation. The expression profiles of certain microRNAs (miRNAs) and messenger RNAs (mRNAs) are closely related to cancer progression stages. In the past few decades, DNA microarray and next-generation sequencing techniques have been widely applied to identify miRNA and mRNA signatures for cancers on a genome-wide scale and have provided meaningful insights into cancer diagnosis, prognosis and personalized medicine. In this review, we summarize the progress in genome-wide analysis of miRNAs and mRNAs as cancer biomarkers, highlighting their diagnostic and prognostic roles. PMID:26299954

  9. Genome-wide association analysis identifies six new loci associated with forced vital capacity.

    PubMed

    Loth, Daan W; Soler Artigas, María; Gharib, Sina A; Wain, Louise V; Franceschini, Nora; Koch, Beate; Pottinger, Tess D; Smith, Albert Vernon; Duan, Qing; Oldmeadow, Chris; Lee, Mi Kyeong; Strachan, David P; James, Alan L; Huffman, Jennifer E; Vitart, Veronique; Ramasamy, Adaikalavan; Wareham, Nicholas J; Kaprio, Jaakko; Wang, Xin-Qun; Trochet, Holly; Kähönen, Mika; Flexeder, Claudia; Albrecht, Eva; Lopez, Lorna M; de Jong, Kim; Thyagarajan, Bharat; Alves, Alexessander Couto; Enroth, Stefan; Omenaas, Ernst; Joshi, Peter K; Fall, Tove; Viñuela, Ana; Launer, Lenore J; Loehr, Laura R; Fornage, Myriam; Li, Guo; Wilk, Jemma B; Tang, Wenbo; Manichaikul, Ani; Lahousse, Lies; Harris, Tamara B; North, Kari E; Rudnicka, Alicja R; Hui, Jennie; Gu, Xiangjun; Lumley, Thomas; Wright, Alan F; Hastie, Nicholas D; Campbell, Susan; Kumar, Rajesh; Pin, Isabelle; Scott, Robert A; Pietiläinen, Kirsi H; Surakka, Ida; Liu, Yongmei; Holliday, Elizabeth G; Schulz, Holger; Heinrich, Joachim; Davies, Gail; Vonk, Judith M; Wojczynski, Mary; Pouta, Anneli; Johansson, Asa; Wild, Sarah H; Ingelsson, Erik; Rivadeneira, Fernando; Völzke, Henry; Hysi, Pirro G; Eiriksdottir, Gudny; Morrison, Alanna C; Rotter, Jerome I; Gao, Wei; Postma, Dirkje S; White, Wendy B; Rich, Stephen S; Hofman, Albert; Aspelund, Thor; Couper, David; Smith, Lewis J; Psaty, Bruce M; Lohman, Kurt; Burchard, Esteban G; Uitterlinden, André G; Garcia, Melissa; Joubert, Bonnie R; McArdle, Wendy L; Musk, A Bill; Hansel, Nadia; Heckbert, Susan R; Zgaga, Lina; van Meurs, Joyce B J; Navarro, Pau; Rudan, Igor; Oh, Yeon-Mok; Redline, Susan; Jarvis, Deborah L; Zhao, Jing Hua; Rantanen, Taina; O'Connor, George T; Ripatti, Samuli; Scott, Rodney J; Karrasch, Stefan; Grallert, Harald; Gaddis, Nathan C; Starr, John M; Wijmenga, Cisca; Minster, Ryan L; Lederer, David J; Pekkanen, Juha; Gyllensten, Ulf; Campbell, Harry; Morris, Andrew P; Gläser, Sven; Hammond, Christopher J; Burkart, Kristin M; Beilby, John; Kritchevsky, Stephen B; Gudnason, Vilmundur; Hancock, Dana B; Williams, O Dale; Polasek, Ozren; Zemunik, Tatijana; Kolcic, Ivana; Petrini, Marcy F; Wjst, Matthias; Kim, Woo Jin; Porteous, David J; Scotland, Generation; Smith, Blair H; Viljanen, Anne; Heliövaara, Markku; Attia, John R; Sayers, Ian; Hampel, Regina; Gieger, Christian; Deary, Ian J; Boezen, H Marike; Newman, Anne; Jarvelin, Marjo-Riitta; Wilson, James F; Lind, Lars; Stricker, Bruno H; Teumer, Alexander; Spector, Timothy D; Melén, Erik; Peters, Marjolein J; Lange, Leslie A; Barr, R Graham; Bracke, Ken R; Verhamme, Fien M; Sung, Joohon; Hiemstra, Pieter S; Cassano, Patricia A; Sood, Akshay; Hayward, Caroline; Dupuis, Josée; Hall, Ian P; Brusselle, Guy G; Tobin, Martin D; London, Stephanie J

    2014-07-01

    Forced vital capacity (FVC), a spirometric measure of pulmonary function, reflects lung volume and is used to diagnose and monitor lung diseases. We performed genome-wide association study meta-analysis of FVC in 52,253 individuals from 26 studies and followed up the top associations in 32,917 additional individuals of European ancestry. We found six new regions associated at genome-wide significance (P < 5 × 10⁻⁸) with FVC in or near EFEMP1, BMP6, MIR129-2-HSD17B12, PRDM11, WWOX and KCNJ2. Two loci previously associated with spirometric measures (GSTCD and PTCH1) were related to FVC. Newly implicated regions were followed up in samples from African-American, Korean, Chinese and Hispanic individuals. We detected transcripts for all six newly implicated genes in human lung tissue. The new loci may inform mechanisms involved in lung development and the pathogenesis of restrictive lung disease. PMID:24929828

  10. A guide to genome-wide association analysis and post-analytic interrogation.

    PubMed

    Reed, Eric; Nunez, Sara; Kulp, David; Qian, Jing; Reilly, Muredach P; Foulkes, Andrea S

    2015-12-10

    This tutorial is a learning resource that outlines the basic process and provides specific software tools for implementing a complete genome-wide association analysis. Approaches to post-analytic visualization and interrogation of potentially novel findings are also presented. Applications are illustrated using the free and open-source R statistical computing and graphics software environment, Bioconductor software for bioinformatics and the UCSC Genome Browser. Complete genome-wide association data on 1401 individuals across 861,473 typed single nucleotide polymorphisms from the PennCATH study of coronary artery disease are used for illustration. All data and code, as well as additional instructional resources, are publicly available through the Open Resources in Statistical Genomics project: http://www.stat-gen.org. PMID:26343929

  11. Genome-wide association study reveals two new risk loci for bipolar disorder.

    PubMed

    Mühleisen, Thomas W; Leber, Markus; Schulze, Thomas G; Strohmaier, Jana; Degenhardt, Franziska; Treutlein, Jens; Mattheisen, Manuel; Forstner, Andreas J; Schumacher, Johannes; Breuer, René; Meier, Sandra; Herms, Stefan; Hoffmann, Per; Lacour, André; Witt, Stephanie H; Reif, Andreas; Müller-Myhsok, Bertram; Lucae, Susanne; Maier, Wolfgang; Schwarz, Markus; Vedder, Helmut; Kammerer-Ciernioch, Jutta; Pfennig, Andrea; Bauer, Michael; Hautzinger, Martin; Moebus, Susanne; Priebe, Lutz; Czerski, Piotr M; Hauser, Joanna; Lissowska, Jolanta; Szeszenia-Dabrowska, Neonila; Brennan, Paul; McKay, James D; Wright, Adam; Mitchell, Philip B; Fullerton, Janice M; Schofield, Peter R; Montgomery, Grant W; Medland, Sarah E; Gordon, Scott D; Martin, Nicholas G; Krasnow, Valery; Chuchalin, Alexander; Babadjanova, Gulja; Pantelejeva, Galina; Abramova, Lilia I; Tiganov, Alexander S; Polonikov, Alexey; Khusnutdinova, Elza; Alda, Martin; Grof, Paul; Rouleau, Guy A; Turecki, Gustavo; Laprise, Catherine; Rivas, Fabio; Mayoral, Fermin; Kogevinas, Manolis; Grigoroiu-Serbanescu, Maria; Propping, Peter; Becker, Tim; Rietschel, Marcella; Nöthen, Markus M; Cichon, Sven

    2014-01-01

    Bipolar disorder (BD) is a common and highly heritable mental illness and genome-wide association studies (GWAS) have robustly identified the first common genetic variants involved in disease aetiology. The data also provide strong evidence for the presence of multiple additional risk loci, each contributing a relatively small effect to BD susceptibility. Large samples are necessary to detect these risk loci. Here we present results from the largest BD GWAS to date by investigating 2.3 million single-nucleotide polymorphisms (SNPs) in a sample of 24,025 patients and controls. We detect 56 genome-wide significant SNPs in five chromosomal regions including previously reported risk loci ANK3, ODZ4 and TRANK1, as well as the risk locus ADCY2 (5p15.31) and a region between MIR2113 and POU3F2 (6q16.1). ADCY2 is a key enzyme in cAMP signalling and our finding provides new insights into the biological mechanisms involved in the development of BD. PMID:24618891

  12. Off-Target Effects of Psychoactive Drugs Revealed by Genome-Wide Assays in Yeast

    PubMed Central

    Ericson, Elke; Gebbia, Marinella; Heisler, Lawrence E.; Wildenhain, Jan; Tyers, Mike; Giaever, Guri; Nislow, Corey

    2008-01-01

    To better understand off-target effects of widely prescribed psychoactive drugs, we performed a comprehensive series of chemogenomic screens using the budding yeast Saccharomyces cerevisiae as a model system. Because the known human targets of these drugs do not exist in yeast, we could employ the yeast gene deletion collections and parallel fitness profiling to explore potential off-target effects in a genome-wide manner. Among 214 tested, documented psychoactive drugs, we identified 81 compounds that inhibited wild-type yeast growth and were thus selected for genome-wide fitness profiling. Many of these drugs had a propensity to affect multiple cellular functions. The sensitivity profiles of half of the analyzed drugs were enriched for core cellular processes such as secretion, protein folding, RNA processing, and chromatin structure. Interestingly, fluoxetine (Prozac) interfered with establishment of cell polarity, cyproheptadine (Periactin) targeted essential genes with chromatin-remodeling roles, while paroxetine (Paxil) interfered with essential RNA metabolism genes, suggesting potential secondary drug targets. We also found that the more recently developed atypical antipsychotic clozapine (Clozaril) had no fewer off-target effects in yeast than the typical antipsychotics haloperidol (Haldol) and pimozide (Orap). Our results suggest that model organism pharmacogenetic studies provide a rational foundation for understanding the off-target effects of clinically important psychoactive agents and suggest a rational means both for devising compound derivatives with fewer side effects and for tailoring drug treatment to individual patient genotypes. PMID:18688276

  13. Genome-Wide Comparative Analysis Reveals Similar Types of NBS Genes in Hybrid Citrus sinensis Genome and Original Citrus clementine Genome and Provides New Insights into Non-TIR NBS Genes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In this study, we identified and compared nucleotide-binding site (NBS) domain-containing genes from three Citrus genomes (C. clementina, C. sinensis from USA and C. sinensis from China). Phylogenetic analysis of all Citrus NBS genes across these three genomes revealed that there are three approxima...

  14. Genome-Wide Meta-Analysis of Longitudinal Alcohol Consumption Across Youth and Early Adulthood.

    PubMed

    Adkins, Daniel E; Clark, Shaunna L; Copeland, William E; Kennedy, Martin; Conway, Kevin; Angold, Adrian; Maes, Hermine; Liu, Youfang; Kumar, Gaurav; Erkanli, Alaattin; Patkar, Ashwin A; Silberg, Judy; Brown, Tyson H; Fergusson, David M; Horwood, L John; Eaves, Lindon; van den Oord, Edwin J C G; Sullivan, Patrick F; Costello, E J

    2015-08-01

    The public health burden of alcohol is unevenly distributed across the life course, with levels of use, abuse, and dependence increasing across adolescence and peaking in early adulthood. Here, we leverage this temporal patterning to search for common genetic variants predicting developmental trajectories of alcohol consumption. Comparable psychiatric evaluations measuring alcohol consumption were collected in three longitudinal community samples (N=2,126, obs=12,166). Repeated consumption measurements spanning adolescence and early adulthood were analyzed using linear mixed models, estimating individual consumption trajectories, which were then tested for association with Illumina 660W-Quad genotype data (866,099 SNPs after imputation and QC). Association results were combined across samples using standard meta-analysis methods. Four meta-analysis associations satisfied our pre-determined genome-wide significance criterion (FDR<0.1) and six others met our 'suggestive' criterion (FDR<0.2). Genome-wide significant associations were highly biologically plausible, including associations within GABA transporter 1, SLC6A1 (solute carrier family 6, member 1), and exonic hits in LOC100129340 (mitofusin-1-like). Pathway analyses elaborated single marker results, indicating significantly enriched associations with intuitive biological mechanisms, including neurotransmission, xenobiotic pharmacodynamics, and nuclear hormone receptors (NHR). These findings underscore the value of combining longitudinal behavioral data and genome-wide genotype information in order to study developmental patterns and improve statistical power in genomic studies. PMID:26081443
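
    The two FDR criteria in this abstract (FDR<0.1 for genome-wide significance, FDR<0.2 for 'suggestive') can be illustrated with a small sketch. The abstract does not say which FDR procedure was used; the code below assumes the Benjamini-Hochberg adjustment, and the p-values are made up for illustration only.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values (q-values) in input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value to the smallest so that the adjusted
    # values never increase as the raw p-values decrease.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def classify(q, significant=0.1, suggestive=0.2):
    """Apply the two FDR cut-offs quoted in the abstract."""
    if q < significant:
        return "significant"
    if q < suggestive:
        return "suggestive"
    return "none"


# Hypothetical p-values, not taken from the study.
p_values = [0.0001, 0.004, 0.03, 0.12, 0.9]
q_values = benjamini_hochberg(p_values)
labels = [classify(q) for q in q_values]
```

    In a real analysis the adjustment would run over all tested SNPs at once, since the q-value of each marker depends on the total number of tests.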

  15. Genome-wide meta-analysis of longitudinal alcohol consumption across youth and early adulthood

    PubMed Central

    Adkins, Daniel E.; Clark, Shaunna L.; Copeland, William E.; Kennedy, Martin; Conway, Kevin; Angold, Adrian; Maes, Hermine; Liu, Youfang; Kumar, Gaurav; Erkanli, Alaattin; Patkar, Ashwin A.; Silberg, Judy; Brown, Tyson H.; Fergusson, David M.; Horwood, L. John; Eaves, Lindon; van den Oord, Edwin J.C.G.; Sullivan, Patrick F.; Costello, E. J.

    2016-01-01

    The public health burden of alcohol is unevenly distributed across the life course, with levels of use, abuse and dependence increasing across adolescence and peaking in early adulthood. Here we leverage this temporal patterning to search for common genetic variants predicting developmental trajectories of alcohol consumption. Comparable psychiatric evaluations measuring alcohol consumption were collected in three longitudinal community samples (N=2,126, obs=12,166). Repeated consumption measurements spanning adolescence and early adulthood were analyzed using linear mixed models, estimating individual consumption trajectories, which were then tested for association with Illumina 660W-Quad genotype data (866,099 SNPs after imputation and QC). Association results were combined across samples using standard meta-analysis methods. Four meta-analysis associations satisfied our pre-determined genome-wide significance criterion (FDR<0.1) and 6 others met our “suggestive” criterion (FDR<0.2). Genome-wide significant associations were highly biologically plausible, including associations within GABA transporter 1, SLC6A1 (solute carrier family 6, member 1), and exonic hits in LOC100129340 (mitofusin-1-like). Pathway analyses elaborated single marker results, indicating significantly enriched associations with intuitive biological mechanisms including neurotransmission, xenobiotic pharmacodynamics and nuclear hormone receptors. These findings underscore the value of combining longitudinal behavioral data and genome-wide genotype information in order to study developmental patterns and improve statistical power in genomic studies. PMID:26081443

  16. Five endometrial cancer risk loci identified through genome-wide association analysis.

    PubMed

    Cheng, Timothy H T; Thompson, Deborah J; O'Mara, Tracy A; Painter, Jodie N; Glubb, Dylan M; Flach, Susanne; Lewis, Annabelle; French, Juliet D; Freeman-Mills, Luke; Church, David; Gorman, Maggie; Martin, Lynn; Hodgson, Shirley; Webb, Penelope M; Attia, John; Holliday, Elizabeth G; McEvoy, Mark; Scott, Rodney J; Henders, Anjali K; Martin, Nicholas G; Montgomery, Grant W; Nyholt, Dale R; Ahmed, Shahana; Healey, Catherine S; Shah, Mitul; Dennis, Joe; Fasching, Peter A; Beckmann, Matthias W; Hein, Alexander; Ekici, Arif B; Hall, Per; Czene, Kamila; Darabi, Hatef; Li, Jingmei; Dörk, Thilo; Dürst, Matthias; Hillemanns, Peter; Runnebaum, Ingo; Amant, Frederic; Schrauwen, Stefanie; Zhao, Hui; Lambrechts, Diether; Depreeuw, Jeroen; Dowdy, Sean C; Goode, Ellen L; Fridley, Brooke L; Winham, Stacey J; Njølstad, Tormund S; Salvesen, Helga B; Trovik, Jone; Werner, Henrica M J; Ashton, Katie; Otton, Geoffrey; Proietto, Tony; Liu, Tao; Mints, Miriam; Tham, Emma; Li, Mulin Jun; Yip, Shun H; Wang, Junwen; Bolla, Manjeet K; Michailidou, Kyriaki; Wang, Qin; Tyrer, Jonathan P; Dunlop, Malcolm; Houlston, Richard; Palles, Claire; Hopper, John L; Peto, Julian; Swerdlow, Anthony J; Burwinkel, Barbara; Brenner, Hermann; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Chang-Claude, Jenny; Couch, Fergus J; Giles, Graham G; Kristensen, Vessela N; Cox, Angela; Cunningham, Julie M; Pharoah, Paul D P; Dunning, Alison M; Edwards, Stacey L; Easton, Douglas F; Tomlinson, Ian; Spurdle, Amanda B

    2016-06-01

    We conducted a meta-analysis of three endometrial cancer genome-wide association studies (GWAS) and two follow-up phases totaling 7,737 endometrial cancer cases and 37,144 controls of European ancestry. Genome-wide imputation and meta-analysis identified five new risk loci of genome-wide significance at likely regulatory regions on chromosomes 13q22.1 (rs11841589, near KLF5), 6q22.31 (rs13328298, in LOC643623 and near HEY2 and NCOA7), 8q24.21 (rs4733613, telomeric to MYC), 15q15.1 (rs937213, in EIF2AK4, near BMF) and 14q32.33 (rs2498796, in AKT1, near SIVA1). We also found a second independent 8q24.21 signal (rs17232730). Functional studies of the 13q22.1 locus showed that rs9600103 (pairwise r² = 0.98 with rs11841589) is located in a region of active chromatin that interacts with the KLF5 promoter region. The rs9600103[T] allele that is protective in endometrial cancer suppressed gene expression in vitro, suggesting that regulation of the expression of KLF5, a gene linked to uterine development, is implicated in tumorigenesis. These findings provide enhanced insight into the genetic and biological basis of endometrial cancer. PMID:27135401

  17. Integrated analysis of genome-wide genetic and epigenetic association data for identification of disease mechanisms.

    PubMed

    Ke, Xiayi; Cortina-Borja, Mario; Silva, Bruno Cesar; Lowe, Robert; Rakyan, Vardhman; Balding, David

    2013-11-01

    Many human diseases are multifactorial, involving multiple genetic and environmental factors impacting on one or more biological pathways. Much of the environmental effect is believed to be mediated through epigenetic changes. Although many genome-wide genetic and epigenetic association studies have been conducted for different diseases and traits, it is still far from clear to what extent the genomic loci and biological pathways identified in the genetic and epigenetic studies are shared. There is also a lack of statistical tools to assess these important aspects of disease mechanisms. In the present study, we describe a protocol for the integrated analysis of genome-wide genetic and epigenetic data based on permutation of a sum statistic for the combined effects in a locus or pathway. The method was then applied to data from published type 1 diabetes (T1D) genome-wide and epigenome-wide association studies to identify genomic loci and biological pathways that are associated with T1D genetically and epigenetically. Through combined analysis, novel loci and pathways were also identified, which could add to our understanding of disease mechanisms of T1D as well as complex diseases in general. PMID:24071862
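
    The idea of permuting a sum statistic for the combined effects in a locus can be sketched as follows. The abstract does not define the per-marker statistic or the permutation scheme, so everything here is an assumption: each marker contributes the squared case-control difference in means, case/control labels are shuffled, and the toy data (one genotype marker, one methylation marker) are invented.

```python
import random


def sum_statistic(markers, labels):
    """Sum over markers of the squared case-control difference in means."""
    cases = [i for i, y in enumerate(labels) if y == 1]
    controls = [i for i, y in enumerate(labels) if y == 0]
    total = 0.0
    for values in markers:
        case_mean = sum(values[i] for i in cases) / len(cases)
        control_mean = sum(values[i] for i in controls) / len(controls)
        total += (case_mean - control_mean) ** 2
    return total


def permutation_p_value(markers, labels, n_perm=2000, seed=0):
    """Empirical p-value: share of label shuffles at least as extreme as observed."""
    rng = random.Random(seed)
    observed = sum_statistic(markers, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if sum_statistic(markers, shuffled) >= observed:
            hits += 1
    # Add one to numerator and denominator so the estimate is never exactly zero.
    return (hits + 1) / (n_perm + 1)


# Hypothetical locus: one genetic and one epigenetic marker for 10 samples.
labels = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]
genotype = [2, 2, 1, 2, 2, 0, 0, 1, 0, 0]
methylation = [0.9, 0.8, 0.85, 0.9, 0.8, 0.2, 0.3, 0.25, 0.2, 0.3]
p_locus = permutation_p_value([genotype, methylation], labels)
```

    Shuffling the labels keeps the correlation between markers within the locus intact, which is the usual reason for preferring permutation over an analytic null distribution in this kind of combined test.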

  18. Genetic determinants of common epilepsies: a meta-analysis of genome-wide association studies

    PubMed Central

    2014-01-01

    Summary. Background: The epilepsies are a clinically heterogeneous group of neurological disorders. Despite strong evidence for heritability, genome-wide association studies have had little success in identification of risk loci associated with epilepsy, probably because of relatively small sample sizes and insufficient power. We aimed to identify risk loci through meta-analyses of genome-wide association studies for all epilepsy and the two largest clinical subtypes (genetic generalised epilepsy and focal epilepsy). Methods: We combined genome-wide association data from 12 cohorts of individuals with epilepsy and controls from population-based datasets. Controls were ethnically matched with cases. We phenotyped individuals with epilepsy into categories of genetic generalised epilepsy, focal epilepsy, or unclassified epilepsy. After standardised filtering for quality control and imputation to account for different genotyping platforms across sites, investigators at each site conducted a linear mixed-model association analysis for each dataset. Combining summary statistics, we conducted fixed-effects meta-analyses of all epilepsy, focal epilepsy, and genetic generalised epilepsy. We set the genome-wide significance threshold at p<1·66 × 10⁻⁸. Findings: We included 8696 cases and 26 157 controls in our analysis. Meta-analysis of the all-epilepsy cohort identified loci at 2q24.3 (p=8·71 × 10⁻¹⁰), implicating SCN1A, and at 4p15.1 (p=5·44 × 10⁻⁹), harbouring PCDH7, which encodes a protocadherin molecule not previously implicated in epilepsy. For the cohort of genetic generalised epilepsy, we noted a single signal at 2p16.1 (p=9·99 × 10⁻⁹), implicating VRK2 or FANCL. No single nucleotide polymorphism achieved genome-wide significance for focal epilepsy. Interpretation: This meta-analysis describes a new locus not previously implicated in epilepsy and provides further evidence about the genetic architecture of these disorders, with the
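
    A fixed-effects meta-analysis of per-cohort summary statistics, as named in the Methods above, is commonly computed by inverse-variance weighting. The sketch below assumes that weighting scheme and a two-sided normal test; the abstract does not give these details, and the effect sizes and standard errors are invented. Only the significance threshold is taken from the abstract.

```python
import math

# Genome-wide significance threshold as stated in the abstract.
THRESHOLD = 1.66e-8


def fixed_effects_meta(betas, standard_errors):
    """Inverse-variance weighted fixed-effects pooling of per-cohort estimates."""
    weights = [1.0 / se ** 2 for se in standard_errors]
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = pooled_beta / pooled_se
    # Two-sided p-value under a standard normal null distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p


# Hypothetical per-cohort effect estimates for a single SNP.
pooled_beta, pooled_se, z, p = fixed_effects_meta([0.10, 0.20], [0.10, 0.10])
is_genome_wide_significant = p < THRESHOLD
```

    With equal standard errors the pooled estimate is the plain average of the cohort estimates; cohorts with smaller standard errors would otherwise dominate the pooled value.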

  19. Genome-wide analysis of epistasis in body mass index using multiple human populations

    PubMed Central

    Wei, Wen-Hua; Hemani, Gib; Gyenesei, Attila; Vitart, Veronique; Navarro, Pau; Hayward, Caroline; Cabrera, Claudia P; Huffman, Jennifer E; Knott, Sara A; Hicks, Andrew A; Rudan, Igor; Pramstaller, Peter P; Wild, Sarah H; Wilson, James F; Campbell, Harry; Hastie, Nicholas D; Wright, Alan F; Haley, Chris S

    2012-01-01

    We surveyed gene–gene interactions (epistasis) in human body mass index (BMI) in four European populations (n<1200) via exhaustive pair-wise genome scans where interactions were computed as F ratios by testing a linear regression model fitting two single-nucleotide polymorphisms (SNPs) with interactions against the one without. Before the association tests, BMI was corrected for sex and age, normalised and adjusted for relatedness. Neither single SNPs nor SNP interactions were genome-wide significant in any cohort based on the consensus threshold (P=5.0E−08) and a Bonferroni corrected threshold (P=1.1E−12), respectively. Next we compared sub genome-wide significant SNP interactions (P<5.0E−08) across cohorts to identify common epistatic signals, where SNPs were annotated to genes to test for gene ontology (GO) enrichment. Among the epistatic genes contributing to the commonly enriched GO terms, 19 were shared across study cohorts, of which 15 are previously published genome-wide association loci, including CDH13 (cadherin 13) associated with height and SORCS2 (sortilin-related VPS10 domain containing receptor 2) associated with circulating insulin-like growth factor 1 and binding protein 3. Interactions between the 19 shared epistatic genes and those involving BMI candidate loci (P<5.0E−08) were tested across cohorts; eight were replicated at the SNP level (P<0.05) in at least one cohort, and these were further tested and showed limited replication in a separate European population (n>5000). We conclude that genome-wide analysis of epistasis in multiple populations is an effective approach to provide new insights into the genetic regulation of BMI but requires additional efforts to confirm the findings. PMID:22333899
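
    Two pieces of arithmetic behind this abstract can be sketched briefly: the Bonferroni threshold for an exhaustive pairwise scan, and the F ratio comparing the regression model with interaction terms against the one without. The SNP count of 300,000 is a hypothetical value chosen because it gives a threshold close to the quoted P=1.1E−12; the abstract does not state the number of SNPs, and the residual sums of squares and parameter counts below are also illustrative.

```python
from math import comb


def bonferroni_pairwise_threshold(n_snps, alpha=0.05):
    """Per-test threshold when every unordered SNP pair is tested once."""
    return alpha / comb(n_snps, 2)


def interaction_f_ratio(rss_reduced, rss_full, extra_params, n_samples, full_params):
    """F ratio for nested linear models (full = reduced + interaction terms)."""
    numerator = (rss_reduced - rss_full) / extra_params
    denominator = rss_full / (n_samples - full_params)
    return numerator / denominator


# Hypothetical SNP count; 300,000 SNPs give about 4.5e10 pairs.
threshold = bonferroni_pairwise_threshold(300_000)

# Hypothetical fit: 1,000 samples, 9 parameters in the full model,
# 4 of which are interaction terms absent from the reduced model.
f_ratio = interaction_f_ratio(
    rss_reduced=120.0, rss_full=100.0, extra_params=4, n_samples=1000, full_params=9
)
```

    The F ratio would then be compared against an F distribution with `extra_params` and `n_samples - full_params` degrees of freedom to obtain the interaction p-value.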

  20. Genome-wide Association Analysis Identifies 14 New Risk Loci for Schizophrenia

    PubMed Central

    Ripke, Stephan; O'Dushlaine, Colm; Chambert, Kimberly; Moran, Jennifer L; Kähler, Anna K; Akterin, Susanne; Bergen, Sarah; Collins, Ann L; Crowley, James J; Fromer, Menachem; Kim, Yunjung; Lee, Sang Hong; Magnusson, Patrik KE; Sanchez, Nick; Stahl, Eli A; Williams, Stephanie; Wray, Naomi R; Xia, Kai; Bettella, Francesco; Børglum, Anders D; Bulik-Sullivan, Brendan K; Cormican, Paul; Craddock, Nick; de Leeuw, Christiaan; Durmishi, Naser; Gill, Michael; Golimbet, Vera; Hamshere, Marian L; Holmans, Peter; Hougaard, David M; Kendler, Kenneth S; Lin, Kuang; Morris, Derek W; Mors, Ole; Mortensen, Preben B; Neale, Benjamin M; O'Neill, Francis A; Owen, Michael J; Milovancevic, Milica Pejovic; Posthuma, Danielle; Powell, John; Richards, Alexander L; Riley, Brien P; Ruderfer, Douglas; Rujescu, Dan; Sigurdsson, Engilbert; Silagadze, Teimuraz; Smit, August B; Stefansson, Hreinn; Steinberg, Stacy; Suvisaari, Jaana; Tosato, Sarah; Verhage, Matthijs; Walters, James T; Bramon, Elvira; Corvin, Aiden P; O'Donovan, Michael C; Stefansson, Kari; Scolnick, Edward; Purcell, Shaun; McCarroll, Steve; Sklar, Pamela; Hultman, Christina M; Sullivan, Patrick F

    2013-01-01

    Schizophrenia is a heritable disorder with substantial public health impact. We conducted a multi-stage genome-wide association study (GWAS) for schizophrenia beginning with a Swedish national sample (5,001 cases, 6,243 controls) followed by meta-analysis with prior schizophrenia GWAS (8,832 cases, 12,067 controls) and finally by replication of SNPs in 168 genomic regions in independent samples (7,413 cases, 19,762 controls, and 581 trios). In total, 22 regions met genome-wide significance (14 novel and one previously implicated in bipolar disorder). The results strongly implicate calcium signaling in the etiology of schizophrenia, and include genome-wide significant results for CACNA1C and CACNB2 whose protein products interact. We estimate that ∼8,300 independent and predominantly common SNPs contribute to risk for schizophrenia and that these collectively account for most of its heritability. Common genetic variation plays an important role in the etiology of schizophrenia, and larger studies will allow more detailed understanding of this devastating disorder. PMID:23974872

  1. Genome-wide association analysis to predict optimal antipsychotic dosage in schizophrenia: a pilot study.

    PubMed

    Koga, Arthur T; Strauss, John; Zai, Clement; Remington, Gary; De Luca, Vincenzo

    2016-03-01

    In recent years, several studies have investigated genetic polymorphisms of antipsychotic drug-metabolizing enzymes and receptors. However, most studies focused on drug response and very few have investigated the genetic influence on antipsychotic dosage. The aim of the present study is to test the association between genetic variants and antipsychotic dosage at the genome-wide level. The current dosage of antipsychotic medications was collected from 79 schizophrenia patients. The dosage was standardized using three different methods: chlorpromazine equivalent (CPZe), defined daily dose (DDD), and percentage of maximum dose (PM %). The patients were then genotyped using the Illumina HumanOmni2.5-8 BeadChip Kit. All markers were screened for significance using linear regression, and the p values were visualized using a Manhattan plot. The genome-wide analysis showed that the top single-nucleotide polymorphisms (SNPs) associated with dosage variation were rs981975 on chromosome 14 for CPZe, rs4470690 on chromosome 4 for PM %, and rs79323383 on chromosome 8 for DDD. However, no SNPs reached genome-wide significance. In this pilot sample, we found promising trends for pharmacodynamic targets associated with antipsychotic dosage. Therefore, studies combining large prescription databases may identify genetic predictors to adjust the dose of antipsychotic medication. PMID:26821981

  2. Using genome-wide complex trait analysis to quantify 'missing heritability' in Parkinson's disease.

    PubMed

    Keller, Margaux F; Saad, Mohamad; Bras, Jose; Bettella, Francesco; Nicolaou, Nayia; Simón-Sánchez, Javier; Mittag, Florian; Büchel, Finja; Sharma, Manu; Gibbs, J Raphael; Schulte, Claudia; Moskvina, Valentina; Durr, Alexandra; Holmans, Peter; Kilarski, Laura L; Guerreiro, Rita; Hernandez, Dena G; Brice, Alexis; Ylikotila, Pauli; Stefánsson, Hreinn; Majamaa, Kari; Morris, Huw R; Williams, Nigel; Gasser, Thomas; Heutink, Peter; Wood, Nicholas W; Hardy, John; Martinez, Maria; Singleton, Andrew B; Nalls, Michael A

    2012-11-15

    Genome-wide association studies (GWASs) have been successful at identifying single-nucleotide polymorphisms (SNPs) highly associated with common traits; however, a great deal of the heritable variation associated with common traits remains unaccounted for within the genome. Genome-wide complex trait analysis (GCTA) is a statistical method that applies a linear mixed model to estimate phenotypic variance of complex traits explained by genome-wide SNPs, including those not associated with the trait in a GWAS. We applied GCTA to 8 cohorts containing 7096 case and 19 455 control individuals of European ancestry in order to examine the missing heritability present in Parkinson's disease (PD). We meta-analyzed our initial results to produce robust heritability estimates for PD types across cohorts. Our results identify 27% (95% CI 17-38, P = 8.08E-08) phenotypic variance associated with all types of PD, 15% (95% CI -0.2 to 33, P = 0.09) phenotypic variance associated with early-onset PD and 31% (95% CI 17-44, P = 1.34E-05) phenotypic variance associated with late-onset PD. This is a substantial increase from the genetic variance identified by top GWAS hits alone (between 3 and 5%) and indicates there are substantially more risk loci to be identified. Our results suggest that although GWASs are a useful tool in identifying the most common variants associated with complex disease, a great many common variants of small effect remain to be discovered. PMID:22892372

  3. GDSL esterase/lipase genes in Brassica rapa L.: genome-wide identification and expression analysis.

    PubMed

    Dong, Xiangshu; Yi, Hankuil; Han, Ching-Tack; Nou, Ill-Sup; Hur, Yoonkang

    2016-04-01

    GDSL esterase/lipase proteins (GELPs), a very large subfamily of lipolytic enzymes, have been identified in microbes and many plants, but only a few have been characterized with respect to their roles in growth, development, and stress responses. In Brassica crops, as in many other species, genome-wide systematic analysis and functional studies of these genes are still lacking. As a first step to study their function in B. rapa ssp. pekinensis (Chinese cabbage), we comprehensively identified all GELP genes in the genome. We found a total of 121 Brassica rapa GDSL esterase/lipase protein genes (BrGELPs), forming three clades in the phylogenetic analysis (two major and one minor), with an asymmetrical chromosomal distribution. Most BrGELPs possess four strictly conserved residues (Ser-Gly-Asn-His) in four separate conserved regions, along with short conserved and clade-specific blocks, suggesting functional diversification of these proteins. Detailed expression profiling revealed that BrGELPs were expressed in various tissues, including floral organs, implying that BrGELPs play diverse roles in various tissues and during development. Ten percent of BrGELPs were specifically expressed in fertile buds, rather than male-sterile buds, implying their involvement in pollen development. Analyses of EXL6 (extracellular lipase 6) expression and its co-expressed genes in both B. rapa and Arabidopsis, as well as knockdown of this gene in Arabidopsis, revealed that this gene plays an important role in pollen development in both species. The data described in this study will facilitate future investigations of other BrGELP functions. PMID:26423069

  4. Genome-wide analysis of antiviral signature genes in porcine macrophages at different activation statuses.

    PubMed

    Sang, Yongming; Brichalli, Wyatt; Rowland, Raymond R R; Blecha, Frank

    2014-01-01

    Macrophages (MΦs) can be polarized to various activation statuses, including classical (M1), alternative (M2), and antiviral states. To study the antiviral activation status of porcine MΦs during porcine reproductive and respiratory syndrome virus (PRRSV) infection, we used RNA Sequencing (RNA-Seq) for transcriptomic analysis of differentially expressed genes (DEGs). Sequencing assessment and quality evaluation showed that our RNA-Seq data met the criteria for genome-wide transcriptomic analysis. Comparisons of any two activation statuses revealed more than 20,000 DEGs that were normalized to filter out 153-5,303 significant DEGs [false discovery rate (FDR) ≤0.001, fold change ≥2] in each comparison. The highest number (5,303 significant DEGs) was found between lipopolysaccharide- (LPS) and interferon (IFN)-γ-stimulated M1 cells, whereas only 153 significant DEGs were detected between interleukin (IL)-10-polarized M2 cells and control mock-activated cells. To identify signature genes for antiviral regulation pertaining to each activation status, we identified a set of DEGs that showed significant up-regulation in only one activation state. In addition, pathway analyses defined the top 20-50 significantly regulated pathways at each activation status, and we further analyzed DEGs pertinent to pathways mediated by AMP kinase (AMPK) and epigenetic mechanisms. For the first time in porcine macrophages, our transcriptomic analyses not only compared family-wide differential expression of most known immune genes at different activation statuses, but also revealed transcription evidence of multiple gene families. These findings show that using RNA-Seq transcriptomic analyses in virus-infected and status-synchronized macrophages effectively profiled signature genes and gene response pathways for antiviral regulation, which may provide a framework for optimizing antiviral immunity and immune homeostasis. PMID:24505295
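
    The significance filter quoted in this abstract (FDR ≤0.001, fold change ≥2) can be written as a short predicate. The sketch assumes that the fold-change cut-off applies in both directions (at least two-fold up or down), which the abstract does not state; the gene names and values are hypothetical.

```python
import math


def is_significant_deg(fdr, fold_change, max_fdr=0.001, min_fold=2.0):
    """Apply the FDR and fold-change cut-offs to one gene.

    fold_change is the expression ratio between two activation statuses
    and must be positive; a ratio of 0.5 counts as a two-fold decrease.
    """
    return fdr <= max_fdr and abs(math.log2(fold_change)) >= math.log2(min_fold)


# Hypothetical records: (gene, FDR, fold change).
records = [
    ("geneA", 0.0005, 3.1),   # passes both cut-offs
    ("geneB", 0.0005, 1.4),   # fold change too small
    ("geneC", 0.02, 5.0),     # FDR too high
    ("geneD", 0.001, 0.5),    # two-fold decrease at the FDR boundary
]
significant = [gene for gene, fdr, fc in records if is_significant_deg(fdr, fc)]
```

    Working on the log2 scale treats increases and decreases symmetrically, so a single threshold covers both up- and down-regulated genes.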

  5. Genome-Wide Analysis of Antiviral Signature Genes in Porcine Macrophages at Different Activation Statuses

    PubMed Central

    Sang, Yongming; Brichalli, Wyatt; Rowland, Raymond R. R.; Blecha, Frank

    2014-01-01

    Macrophages (MΦs) can be polarized to various activation statuses, including classical (M1), alternative (M2), and antiviral states. To study the antiviral activation status of porcine MΦs during porcine reproductive and respiratory syndrome virus (PRRSV) infection, we used RNA Sequencing (RNA-Seq) for transcriptomic analysis of differentially expressed genes (DEGs). Sequencing assessment and quality evaluation showed that our RNA-Seq data met the criteria for genome-wide transcriptomic analysis. Comparisons of any two activation statuses revealed more than 20,000 DEGs that were normalized to filter out 153–5,303 significant DEGs [false discovery rate (FDR) ≤0.001, fold change ≥2] in each comparison. The highest number (5,303 significant DEGs) was found between lipopolysaccharide- (LPS) and interferon (IFN)-γ-stimulated M1 cells, whereas only 153 significant DEGs were detected between interleukin (IL)-10-polarized M2 cells and control mock-activated cells. To identify signature genes for antiviral regulation pertaining to each activation status, we identified a set of DEGs that showed significant up-regulation in only one activation state. In addition, pathway analyses defined the top 20–50 significantly regulated pathways at each activation status, and we further analyzed DEGs pertinent to pathways mediated by AMP kinase (AMPK) and epigenetic mechanisms. For the first time in porcine macrophages, our transcriptomic analyses not only compared family-wide differential expression of most known immune genes at different activation statuses, but also revealed transcription evidence of multiple gene families. These findings show that using RNA-Seq transcriptomic analyses in virus-infected and status-synchronized macrophages effectively profiled signature genes and gene response pathways for antiviral regulation, which may provide a framework for optimizing antiviral immunity and immune homeostasis. PMID:24505295

  6. Genome-wide analysis of gestational gene-environment interactions in the developing kidney

    PubMed Central

    Yan, Lei; Yao, Xiao; Bachvarov, Dimcho; Saifudeen, Zubaida

    2014-01-01

    The G protein-coupled bradykinin B2 receptor (Bdkrb2) plays an important role in regulation of blood pressure under conditions of excess salt intake. Our previous work has shown that Bdkrb2 also plays a developmental role since Bdkrb2−/− embryos, but not their wild-type or heterozygous littermates, are prone to renal dysgenesis in response to gestational high salt intake. Although impaired terminal differentiation and apoptosis are consistent findings in the Bdkrb2−/− mutant kidneys, the developmental pathways downstream of gene-environment interactions leading to the renal phenotype remain unknown. Here, we performed genome-wide transcriptional profiling on embryonic kidneys from salt-stressed Bdkrb2+/+ and Bdkrb2−/− embryos. The results reveal significant alterations in key pathways regulating Wnt signaling, apoptosis, embryonic development, and cell-matrix interactions. In silico analysis reveals that nearly 12% of differentially regulated genes harbor one or more Pax2 DNA-binding sites in their promoter region. Further analysis shows that metanephric kidneys of salt-stressed Bdkrb2−/− embryos have a significant downregulation of Pax2 gene expression. This was corroborated in Bdkrb2−/−;Pax2GFP+/tg mice, demonstrating that Pax2 transcriptional activity is significantly repressed by gestational salt-Bdkrb2 interactions. We conclude that gestational gene (Bdkrb2) and environment (salt) interactions cooperate to impact gene expression programs in the developing kidney. Suppression of Pax2 likely contributes to the defects in epithelial survival, growth, and differentiation in salt-stressed Bdkrb2−/− mice. PMID:25005792

  7. Genome-wide identification and analysis of the MADS-box gene family in sesame.

    PubMed

    Wei, Xin; Wang, Linhai; Yu, Jingyin; Zhang, Yanxin; Li, Donghua; Zhang, Xiurong

    2015-09-10

    MADS-box genes encode transcription factors that play crucial roles in plant growth and development. Sesame (Sesamum indicum L.) is an oil crop that contributes to the daily oil and protein requirements of almost half of the world's population; therefore, a genome-wide analysis of the MADS-box gene family is needed. Fifty-seven MADS-box genes were identified from 14 linkage groups of the sesame genome. Analysis of phylogenetic relationships with Arabidopsis thaliana, Utricularia gibba and Solanum lycopersicum MADS-box genes was performed. Sesame MADS-box genes were clustered into four groups: 28 MIKC(c)-type, 5 MIKC(*)-type, 14 Mα-type and 10 Mγ-type. Gene structure analysis revealed that sesame MADS-box genes contain from 1 to 22 exons. The number of exons in type II MADS-box genes greatly exceeded the number in type I genes. Motif distribution analysis of sesame MADS-box genes also indicated that type II MADS-box genes contained more motifs than type I genes. These results suggested that type II sesame MADS-box genes had more complex structures. By analyzing expression profiles of MADS-box genes in seven sesame transcriptomes, we determined that MIKC(c)-type MADS-box genes played significant roles in sesame flower and seed development. Although most MADS-box genes in the same clade showed similar expression features, some gene functions were diversified from the orthologous Arabidopsis genes. This research will contribute to uncovering the role of MADS-box genes in sesame development. PMID:25967387

  8. Genome-wide Mapping Reveals Conservation of Promoter DNA Methylation Following Chicken Domestication

    PubMed Central

    Li, Qinghe; Wang, Yuanyuan; Hu, Xiaoxiang; Zhao, Yaofeng; Li, Ning

    2015-01-01

    It is well known that environment influences DNA methylation; however, the extent of heritable DNA methylation variation following animal domestication remains largely unknown. Using meDIP-chip we mapped the promoter methylomes for 23,316 genes in muscle tissues of ancestral and domestic chickens. We systematically examined the variation of promoter DNA methylation in terms of different breeds, differentially expressed genes, SNPs and genes undergoing genetic selection sweeps. While considerable changes in DNA sequence and gene expression programs were prevalent, we found that the inter-strain DNA methylation patterns were highly conserved in promoter regions between the wild and domestic chicken breeds. Our data suggest a global preservation of DNA methylation between the wild and domestic chicken breeds at either a genome-wide or locus-specific scale in chick muscle tissues. PMID:25735894

  9. Genome-Wide Binding of MBD2 Reveals Strong Preference for Highly Methylated Loci

    PubMed Central

    Menafra, Roberta; Brinkman, Arie B.; Matarese, Filomena; Franci, Gianluigi; Bartels, Stefanie J. J.; Nguyen, Luan; Shimbo, Takashi; Wade, Paul A.; Hubner, Nina C.; Stunnenberg, Hendrik G.

    2014-01-01

    MBD2 is a subunit of the NuRD complex that is postulated to mediate gene repression via recruitment of the complex to methylated DNA. In this study we adopted an MBD2 tagging-approach to study its genome wide binding characteristics. We show that in vivo MBD2 is mainly recruited to CpG island promoters that are highly methylated. Interestingly, MBD2 binds around 1 kb downstream of the transcription start site of a subset of ∼400 CpG island promoters that are characterized by the presence of active histone marks, RNA polymerase II (Pol2) and low to medium gene expression levels and H3K36me3 deposition. These tagged-MBD2 binding sites in MCF-7 show increased methylation in a cohort of primary breast cancers but not in normal breast samples, suggesting a putative role for MBD2 in breast cancer. PMID:24927503

  10. A genome-wide screen for genes affecting eisosomes reveals Nce102 function in sphingolipid signaling

    PubMed Central

    Fröhlich, Florian; Moreira, Karen; Aguilar, Pablo S.; Hubner, Nina C.; Mann, Matthias; Walter, Peter

    2009-01-01

    The protein and lipid composition of eukaryotic plasma membranes is highly dynamic and regulated according to need. The sphingolipid-responsive Pkh kinases are candidates for mediating parts of this regulation, as they affect a diverse set of plasma membrane functions, such as cortical actin patch organization, efficient endocytosis, and eisosome assembly. Eisosomes are large protein complexes underlying the plasma membrane and help to sort a group of membrane proteins into distinct domains. In this study, we identify Nce102 in a genome-wide screen for genes involved in eisosome organization and Pkh kinase signaling. Nce102 accumulates in membrane domains at eisosomes where Pkh kinases also localize. The relative abundance of Nce102 in these domains compared with the rest of the plasma membrane is dynamically regulated by sphingolipids. Furthermore, Nce102 inhibits Pkh kinase signaling and is required for plasma membrane organization. Therefore, Nce102 might act as a sensor of sphingolipids that regulates plasma membrane function. PMID:19564405

  11. Genome-Wide and Paternal Diversity Reveal a Recent Origin of Human Populations in North Africa

    PubMed Central

    Martínez-Cruz, Begoña; Zalloua, Pierre; Benammar Elgaaied, Amel; Comas, David

    2013-01-01

    The geostrategic location of North Africa as a crossroad between three continents and as a stepping-stone outside Africa has evoked anthropological and genetic interest in this region. Numerous studies have described the genetic landscape of the human population in North Africa employing paternal, maternal, and biparental molecular markers. However, information from these markers, which have different inheritance patterns, has been mostly assessed independently, resulting in an incomplete description of the region. In this study, we analyze uniparental and genome-wide markers examining similarities or contrasts in the results and consequently provide a comprehensive description of the evolutionary history of North African populations. Our results show that both males and females in North Africa underwent a similar admixture history with slight differences in the proportions of admixture components. Consequently, genome-wide diversity shows similar patterns, with admixture tests suggesting North Africans are a mixture of ancestral populations related to current Africans and Eurasians with more affinity towards the out-of-Africa populations than to sub-Saharan Africans. We estimate from the paternal lineages that most North Africans emerged ∼15,000 years ago during the last glacial warming and that population splits started after the desiccation of the Sahara. Although most North Africans share a common admixture history, the Tunisian Berbers show long periods of genetic isolation and appear to have diverged from surrounding populations without subsequent mixture. On the other hand, continuous gene flow from the Middle East made Egyptians genetically closer to Eurasians than to other North Africans. We show that genetic diversity of today's North Africans mostly captures patterns from migrations post Last Glacial Maximum and therefore may be insufficient to inform on the initial population of the region during the Middle Paleolithic period. PMID:24312208

  12. A Genome-Wide Association Study Reveals Genes Associated with Fusarium Ear Rot Resistance in a Maize Core Diversity Panel

    PubMed Central

    Zila, Charles T.; Samayoa, L. Fernando; Santiago, Rogelio; Butrón, Ana; Holland, James B.

    2013-01-01

    Fusarium ear rot is a common disease of maize that affects food and feed quality globally. Resistance to the disease is highly quantitative, and maize breeders have difficulty incorporating polygenic resistance alleles from unadapted donor sources into elite breeding populations without having a negative impact on agronomic performance. Identification of specific allele variants contributing to improved resistance may be useful to breeders by allowing selection of resistance alleles in coupling phase linkage with favorable agronomic characteristics. We report the results of a genome-wide association study to detect allele variants associated with increased resistance to Fusarium ear rot in a maize core diversity panel of 267 inbred lines evaluated in two sets of environments. We performed association tests with 47,445 single-nucleotide polymorphisms (SNPs) while controlling for background genomic relationships with a mixed model and identified three marker loci significantly associated with disease resistance in at least one subset of environments. Each associated SNP locus had relatively small additive effects on disease resistance (±1.1% on a 0–100% scale), but nevertheless was associated with 3 to 12% of the genotypic variation within or across environment subsets. Two of three identified SNPs colocalized with genes that have been implicated in programmed cell death. An analysis of associated allele frequencies within the major maize subpopulations revealed enrichment for resistance alleles in the tropical/subtropical and popcorn subpopulations compared with other temperate breeding pools. PMID:24048647

  13. A genome-wide association study reveals genes associated with fusarium ear rot resistance in a maize core diversity panel.

    PubMed

    Zila, Charles T; Samayoa, L Fernando; Santiago, Rogelio; Butrón, Ana; Holland, James B

    2013-11-01

    Fusarium ear rot is a common disease of maize that affects food and feed quality globally. Resistance to the disease is highly quantitative, and maize breeders have difficulty incorporating polygenic resistance alleles from unadapted donor sources into elite breeding populations without having a negative impact on agronomic performance. Identification of specific allele variants contributing to improved resistance may be useful to breeders by allowing selection of resistance alleles in coupling phase linkage with favorable agronomic characteristics. We report the results of a genome-wide association study to detect allele variants associated with increased resistance to Fusarium ear rot in a maize core diversity panel of 267 inbred lines evaluated in two sets of environments. We performed association tests with 47,445 single-nucleotide polymorphisms (SNPs) while controlling for background genomic relationships with a mixed model and identified three marker loci significantly associated with disease resistance in at least one subset of environments. Each associated SNP locus had relatively small additive effects on disease resistance (±1.1% on a 0-100% scale), but nevertheless was associated with 3 to 12% of the genotypic variation within or across environment subsets. Two of three identified SNPs colocalized with genes that have been implicated in programmed cell death. An analysis of associated allele frequencies within the major maize subpopulations revealed enrichment for resistance alleles in the tropical/subtropical and popcorn subpopulations compared with other temperate breeding pools. PMID:24048647

  14. Refining genome-wide linkage intervals using a meta-analysis of genome-wide association studies identifies loci influencing personality dimensions.

    PubMed

    Amin, Najaf; Hottenga, Jouke-Jan; Hansell, Narelle K; Janssens, A Cecile J W; de Moor, Marleen H M; Madden, Pamela A F; Zorkoltseva, Irina V; Penninx, Brenda W; Terracciano, Antonio; Uda, Manuela; Tanaka, Toshiko; Esko, Tonu; Realo, Anu; Ferrucci, Luigi; Luciano, Michelle; Davies, Gail; Metspalu, Andres; Abecasis, Goncalo R; Deary, Ian J; Raikkonen, Katri; Bierut, Laura J; Costa, Paul T; Saviouk, Viatcheslav; Zhu, Gu; Kirichenko, Anatoly V; Isaacs, Aaron; Aulchenko, Yurii S; Willemsen, Gonneke; Heath, Andrew C; Pergadia, Michele L; Medland, Sarah E; Axenovich, Tatiana I; de Geus, Eco; Montgomery, Grant W; Wright, Margaret J; Oostra, Ben A; Martin, Nicholas G; Boomsma, Dorret I; van Duijn, Cornelia M

    2013-08-01

    Personality traits are complex phenotypes related to psychosomatic health. Individually, various gene finding methods have not achieved much success in finding genetic variants associated with personality traits. We performed a meta-analysis of four genome-wide linkage scans (N=6149 subjects) of five basic personality traits assessed with the NEO Five-Factor Inventory. We compared the significant regions from the meta-analysis of linkage scans with the results of a meta-analysis of genome-wide association studies (GWAS) (N∼17 000). We found significant evidence of linkage of neuroticism to chromosome 3p14 (rs1490265, LOD=4.67) and to chromosome 19q13 (rs628604, LOD=3.55); of extraversion to 14q32 (ATGG002, LOD=3.3); and of agreeableness to 3p25 (rs709160, LOD=3.67) and to two adjacent regions on chromosome 15, including 15q13 (rs970408, LOD=4.07) and 15q14 (rs1055356, LOD=3.52) in the individual scans. In the meta-analysis, we found strong evidence of linkage of extraversion to 4q34, 9q34, 10q24 and 11q22, openness to 2p25, 3q26, 9p21, 11q24, 15q26 and 19q13 and agreeableness to 4q34 and 19p13. Significant evidence of association in the GWAS was detected between openness and rs677035 at 11q24 (P-value=2.6 × 10^-6, KCNJ1). The findings of our linkage meta-analysis and those of the GWAS suggest that 11q24 is a susceptibility locus for openness, with KCNJ1 as the possible candidate gene. PMID:23211697

  15. Genome-Wide Identification and Analysis of the MYB Transcription Factor Superfamily in Solanum lycopersicum.

    PubMed

    Li, Zhenjun; Peng, Rihe; Tian, Yongsheng; Han, Hongjuan; Xu, Jing; Yao, Quanhong

    2016-08-01

    MYB proteins constitute one of the largest transcription factor families in the plant kingdom, members of which perform a variety of functions in plant biological processes. However, there are only very limited reports on the characterization of MYB transcription factors in tomato (Solanum lycopersicum). In our study, a total of 127 MYB genes have been identified in the tomato genome. A complete overview of these MYB genes is presented, including the phylogeny, gene structures, protein motifs, chromosome locations and expression patterns. The 127 SlMYB proteins could be classified into 18 subgroups based on domain similarity and phylogenetic topology. Phylogenetic analysis of SlMYBs along with MYBs from Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa) indicated 14 subfamilies. Conserved motifs outside the MYB domain may reflect their functional conservation. The identified tomato MYB genes were distributed on 12 chromosomes at various densities but mainly in chromosomes 6 and 10 (12.6% and 11.8%, respectively). Genome-wide segmental and tandem duplications were also found, which may contribute to the expansion of SlMYB genes. RNA-sequencing and microarray data revealed tissue-specific and stress-responsive expression patterns of SlMYB genes. The expression profiles of SlMYB genes in response to salicylic acid (SA) and jasmonic acid methyl ester (MeJA) were also investigated by real-time PCR. Moreover, ethylene-responsive element-binding factor-associated amphiphilic repression (EAR) motifs were found in 24 SlMYB proteins. Collectively, our comprehensive analysis of SlMYB genes will facilitate future functional studies of the tomato MYB gene family and probably other Solanaceae plants. PMID:27279646

  16. Genome-wide SNP analysis of the Systemic Capillary Leak Syndrome (Clarkson disease)

    PubMed Central

    Xie, Zhihui; Nagarajan, Vijayaraj; Sturdevant, Daniel E; Iwaki, Shoko; Chan, Eunice; Wisch, Laura; Young, Michael; Nelson, Celeste M; Porcella, Stephen F; Druey, Kirk M

    2013-01-01

    The Systemic Capillary Leak Syndrome (SCLS) is an extremely rare, orphan disease that resembles, and is frequently erroneously diagnosed as, systemic anaphylaxis. The disorder is characterized by repeated, transient, and seemingly unprovoked episodes of hypotensive shock and peripheral edema due to transient endothelial hyperpermeability. SCLS is often accompanied by a monoclonal gammopathy of unknown significance (MGUS). Using Affymetrix Single Nucleotide Polymorphism (SNP) microarrays, we performed the first genome-wide SNP analysis of SCLS in a cohort of 12 disease subjects and 18 controls. Exome capture sequencing was performed on genomic DNA from nine of these patients as validation for the SNP-chip discoveries and de novo data generation. We identified candidate susceptibility loci for SCLS, which included a region flanking CAV3 (3p25.3) as well as SNP clusters in PON1 (7q21.3), PSORS1C1 (6p21.3), and CHCHD3 (7q33). Among the most highly ranked discoveries were gene-associated SNPs in the uncharacterized LOC100130480 gene (rs6417039, rs2004296). Top case-associated SNPs were observed in BTRC (rs12355803, 3rs4436485), ARHGEF18 (rs11668246), CDH13 (rs4782779), and EDG2 (rs12552348), which encode proteins with known or suspected roles in B cell function and/or vascular integrity. 61 SNPs that were significantly associated with SCLS by microarray analysis were also detected and validated by exome deep sequencing. Functional annotation of highly ranked SNPs revealed enrichment of cell projections, cell junctions and adhesion, and molecules containing pleckstrin homology, Ras/Rho regulatory, and immunoglobulin Ig-like C2/fibronectin type III domains, all of which involve mechanistic functions that correlate with the SCLS phenotype. These results highlight SNPs with potential relevance to SCLS. PMID:24808988

  17. Genome-Wide Analysis of Polycistronic MicroRNAs in Cultivated and Wild Rice.

    PubMed

    Baldrich, Patricia; Hsing, Yue-Ie Caroline; San Segundo, Blanca

    2016-01-01

    MicroRNAs (miRNAs) are small noncoding RNAs that direct posttranscriptional gene silencing in eukaryotes. They are frequently clustered in the genomes of animals and can be independently transcribed or simultaneously transcribed into single polycistronic transcripts. Only a few miRNA clusters have been described in plants, and most of them are generated from independent transcriptional units. Here, we used a combination of bioinformatic tools and experimental analyses to discover new polycistronic miRNAs in rice. A genome-wide analysis of clustering patterns of MIRNA loci in the rice genome was carried out using a criterion of 3 kb as the maximal distance between two miRNAs. This analysis revealed 28 loci with the ability to form the typical hairpin structure of miRNA precursors in which 2 or more mature miRNAs mapped along the same structure. RT-PCR provided evidence for the polycistronic nature of seven miRNA precursors containing homologous or nonhomologous miRNA species. Polycistronic miRNAs and candidate polycistronic miRNAs are located across different rice chromosomes, except chromosome 12, and resided in both duplicated and nonduplicated chromosomal regions. Finally, most polycistronic and candidate polycistronic miRNAs showed a pattern of conservation in the genome of rice species with an AA genome. The diversity in the organization of MIR genes that are transcribed as polycistrons suggests a versatile mechanism for the control of gene expression in different biological processes and supports additional levels of complexity in miRNA functioning in plants. PMID:27190137
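
    The 3 kb clustering criterion described above can be illustrated with a minimal sketch. It assumes that "distance" means the gap between the end of one locus and the start of the next on the same chromosome, which the abstract does not specify; the function name, the tuple layout and the example loci are hypothetical and are not taken from the study.

```python
MAX_GAP_BP = 3000  # 3 kb criterion quoted in the abstract


def cluster_loci(loci, max_gap=MAX_GAP_BP):
    """Group (chrom, start, end, name) loci into candidate clusters.

    Two neighbouring loci join the same cluster when they sit on the same
    chromosome and the gap between them is at most max_gap bp (assumed
    definition of distance). Only groups of two or more loci are returned.
    """
    clusters = []
    current = []
    current_end = None  # furthest end coordinate seen in the open cluster
    for locus in sorted(loci, key=lambda item: (item[0], item[1])):
        chrom, start, end, _name = locus
        same_cluster = (
            current
            and chrom == current[-1][0]
            and start - current_end <= max_gap
        )
        if same_cluster:
            current.append(locus)
            current_end = max(current_end, end)
        else:
            if len(current) > 1:
                clusters.append(current)
            current = [locus]
            current_end = end
    if len(current) > 1:
        clusters.append(current)
    return clusters


# Hypothetical coordinates, for illustration only.
example_loci = [
    ("chr1", 1000, 1100, "miR-a"),
    ("chr1", 2500, 2600, "miR-b"),
    ("chr1", 9000, 9100, "miR-c"),
    ("chr2", 500, 600, "miR-d"),
]
for cluster in cluster_loci(example_loci):
    print([name for _chrom, _start, _end, name in cluster])
```

    A positional grouping of this kind only nominates candidates; as the abstract notes, polycistronic transcription was then tested experimentally by RT-PCR.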

  18. Transport genes and chemotaxis in Laribacter hongkongensis: a genome-wide analysis

    PubMed Central

    2011-01-01

    Background: Laribacter hongkongensis is a Gram-negative, sea gull-shaped rod associated with community-acquired gastroenteritis. The bacterium has been found in diverse freshwater environments including fish, frogs and drinking water reservoirs. Using the complete genome sequence data of L. hongkongensis, we performed a comprehensive analysis of putative transport-related genes and genes related to chemotaxis, motility and quorum sensing, which may help the bacterium adapt to the changing environments and combat harmful substances. Results: A genome-wide analysis using the Transport Classification Database (TCDB), similarity and keyword searches revealed the presence of a large diversity of transporters (n = 457) and genes related to chemotaxis (n = 52) and flagellar biosynthesis (n = 40) in the L. hongkongensis genome. The transporters included those from all seven major transporter categories, which may allow the uptake of essential nutrients or ions, and extrusion of metabolic end products and hazardous substances. L. hongkongensis is unique among closely related members of the Neisseriaceae family in possessing a higher number of proteins related to transport of ammonium, urea and dicarboxylate, which may reflect the importance of nitrogen and dicarboxylate metabolism in this asaccharolytic bacterium. Structural modeling of two C4-dicarboxylate transporters showed that they possessed similar structures to the determined structures of other DctP-TRAP transporters, with one having an unusual disulfide bond. Diverse mechanisms for iron transport, including hemin transporters for iron acquisition from host proteins, were also identified. In addition to the chemotaxis and flagella-related genes, the L. hongkongensis genome also contained two copies of qseB/qseC homologues of the AI-3 quorum sensing system. Conclusions: The large number of diverse transporters and genes involved in chemotaxis, motility and quorum sensing suggested that the bacterium may utilize a complex system to

  19. Genome-Wide Analysis of miRNA targets in Brachypodium and Biomass Energy Crops

    SciTech Connect

    Green, Pamela J.

    2015-08-11

    MicroRNAs (miRNAs) contribute to the control of numerous biological processes through the regulation of specific target mRNAs. Although the identities of these targets are essential to elucidate miRNA function, the targets are much more difficult to identify than the small RNAs themselves. Before this work, we pioneered the genome-wide identification of the targets of Arabidopsis miRNAs using an approach called PARE (German et al., Nature Biotech. 2008; Nature Protocols, 2009). Under this project, we applied PARE to Brachypodium distachyon (Brachypodium), a model plant in the Poaceae family, which includes the major food grain and bioenergy crops. Through in-depth global analysis and examination of specific examples, this research greatly expanded our knowledge of miRNAs and target RNAs of Brachypodium. New regulation in response to environmental stress or tissue type was found, and many new miRNAs were discovered. More than 260 targets of new and known miRNAs with PARE sequences at the precise sites of miRNA-guided cleavage were identified and characterized. Combining PARE data with the small RNA data also identified the miRNAs responsible for initiating approximately 500 phased loci, including one of the novel miRNAs. PARE analysis also revealed that differentially expressed miRNAs in the same family guide specific target RNA cleavage in a correspondingly tissue-preferential manner. The project included generation of small RNA and PARE resources for bioenergy crops, to facilitate ongoing discovery of conserved miRNA-target RNA regulation. By associating specific miRNA-target RNA pairs with known physiological functions, the research provides insights about gene regulation in different tissues and in response to environmental stress. This, and release of new PARE and small RNA data sets should contribute basic knowledge to enhance breeding and may suggest new strategies for improvement of biomass energy crops.

  20. Genome-Wide Analysis of Polycistronic MicroRNAs in Cultivated and Wild Rice

    PubMed Central

    Baldrich, Patricia; Hsing, Yue-Ie Caroline; San Segundo, Blanca

    2016-01-01

    MicroRNAs (miRNAs) are small noncoding RNAs that direct posttranscriptional gene silencing in eukaryotes. They are frequently clustered in the genomes of animals and can be independently transcribed or simultaneously transcribed into single polycistronic transcripts. Only a few miRNA clusters have been described in plants, and most of them are generated from independent transcriptional units. Here, we used a combination of bioinformatic tools and experimental analyses to discover new polycistronic miRNAs in rice. A genome-wide analysis of clustering patterns of MIRNA loci in the rice genome was carried out using a criterion of 3 kb as the maximal distance between two miRNAs. This analysis revealed 28 loci with the ability to form the typical hairpin structure of miRNA precursors in which 2 or more mature miRNAs mapped along the same structure. RT-PCR provided evidence for the polycistronic nature of seven miRNA precursors containing homologous or nonhomologous miRNA species. Polycistronic miRNAs and candidate polycistronic miRNAs are located across different rice chromosomes, except chromosome 12, and resided in both duplicated and nonduplicated chromosomal regions. Finally, most polycistronic and candidate polycistronic miRNAs showed a pattern of conservation in the genome of rice species with an AA genome. The diversity in the organization of MIR genes that are transcribed as polycistrons suggests a versatile mechanism for the control of gene expression in different biological processes and supports additional levels of complexity in miRNA functioning in plants. PMID:27190137

  1. Genome-Wide Identification, Characterization and Expression Analysis of the TCP Gene Family in Prunus mume

    PubMed Central

    Zhou, Yuzhen; Xu, Zongda; Zhao, Kai; Yang, Weiru; Cheng, Tangren; Wang, Jia; Zhang, Qixiang

    2016-01-01

    TCP proteins, which belong to a plant-specific transcription factor family, are known to play important roles in plant development, especially flower and leaf development. However, there is little information about this gene family in Prunus mume, which is widely cultivated in China as an ornamental and fruit tree. Here a genome-wide analysis of TCP genes was performed to explore their evolution in P. mume. Nineteen PmTCPs were identified and three of them contained putative miR319 target sites. Phylogenetic and comprehensive bioinformatics analyses of these genes revealed that different types of TCP genes had undergone different evolutionary processes and the genes in the same clade had similar chromosomal location, gene structure, and conserved domains. Expression analysis of these PmTCPs indicated that there were diverse expression patterns among different clades. Most TCP genes were predominantly expressed in flower, leaf, and stem, and showed high expression levels in the different stages of flower bud differentiation, especially in the petal formation stage and gametophyte development. Genes in the TCP-P subfamily had major roles in both flower development and gametophyte development. The CIN genes in double petal cultivars might have key roles in the formation of petals, while they were correlated with gametophyte development in the single petal cultivar. The CYC/TB1 type genes were detected at high levels during the formation of petal and pistil. The less-complex flower types of P. mume might result from the fact that there were only two CYC type genes present in P. mume and a lack of CYC2 genes to control the identity of flower types. These results lay the foundation for further study on the functions of TCP genes during flower development.

  2. Genome-wide identification and comparative expression analysis reveal a rapid expansion and functional divergence of duplicated genes in the WRKY gene family of cabbage, Brassica oleracea var. capitata.

    PubMed

    Yao, Qiu-Yang; Xia, En-Hua; Liu, Fei-Hu; Gao, Li-Zhi

    2015-02-15

    WRKY transcription factors (TFs), one of the ten largest TF families in higher plants, play important roles in regulating plant development and resistance. To date, little is known about the WRKY TF family in Brassica oleracea. Recently, the completed genome sequence of cabbage (B. oleracea var. capitata) allows us to systematically analyze WRKY genes in this species. A total of 148 WRKY genes were characterized and classified into seven subgroups that belong to three major groups. Phylogenetic and synteny analyses revealed that the repertoire of cabbage WRKY genes was derived from a common ancestor shared with Arabidopsis thaliana. The B. oleracea WRKY genes were found to be preferentially retained after the whole-genome triplication (WGT) event in its recent ancestor, suggesting that the WGT event had largely contributed to a rapid expansion of the WRKY gene family in B. oleracea. The analysis of RNA-Seq data from various tissues (i.e., roots, stems, leaves, buds, flowers and siliques) revealed that most of the identified WRKY genes were positively expressed in cabbage, and a large portion of them exhibited patterns of differential and tissue-specific expression, demonstrating that these gene members might play essential roles in plant developmental processes. Comparative analysis of the expression level among duplicated genes showed that gene expression divergence was evidently presented among cabbage WRKY paralogs, indicating functional divergence of these duplicated WRKY genes. PMID:25481634

  3. Ion Torrent sequencing for conducting genome-wide scans for mutation mapping analysis.

    PubMed

    Damerla, Rama Rao; Chatterjee, Bishwanath; Li, You; Francis, Richard J B; Fatakia, Sarosh N; Lo, Cecilia W

    2014-04-01

    Mutation mapping in mice can be readily accomplished by genome wide segregation analysis of polymorphic DNA markers. In this study, we showed the efficacy of Ion Torrent next generation sequencing for conducting genome-wide scans to map and identify a mutation causing congenital heart disease in a mouse mutant, Bishu, recovered from a mouse mutagenesis screen. The Bishu mutant line generated in a C57BL/6J (B6) background was intercrossed with another inbred strain, C57BL/10J (B10), and the resulting B6/B10 hybrid offspring were intercrossed to generate mutants used for the mapping analysis. For each mutant sample, a panel of 123 B6/B10 polymorphic SNPs distributed throughout the mouse genome was PCR amplified, bar coded, and then pooled to generate a single library used for Ion Torrent sequencing. Sequencing carried out using the 314 chip yielded >600,000 usable reads. These were aligned and mapped using a custom bioinformatics pipeline. Each SNP was sequenced to a depth >500×, allowing accurate automated calling of the B6/B10 genotypes. This analysis mapped the mutation in Bishu to an interval on the proximal region of mouse chromosome 4. This was confirmed by parallel capillary sequencing of the 123 polymorphic SNPs. Further analysis of genes in the map interval identified a splicing mutation in Dnaic1(c.204+1G>A), an intermediate chain dynein, as the disease causing mutation in Bishu. Overall, our experience shows Ion Torrent amplicon sequencing is high throughput and cost effective for conducting genome-wide mapping analysis and is easily scalable for other high volume genotyping analyses. PMID:24306492

  4. FVGWAS: Fast voxelwise genome wide association analysis of large-scale imaging genetic data.

    PubMed

    Huang, Meiyan; Nichols, Thomas; Huang, Chao; Yu, Yang; Lu, Zhaohua; Knickmeyer, Rebecca C; Feng, Qianjin; Zhu, Hongtu

    2015-09-01

    More and more large-scale imaging genetic studies are being widely conducted to collect a rich set of imaging, genetic, and clinical data to detect putative genes for complexly inherited neuropsychiatric and neurodegenerative disorders. Several major big-data challenges arise from testing genome-wide (N_C > 12 million known variants) associations with signals at millions of locations (N_V ~ 10^6) in the brain from thousands of subjects (n ~ 10^3). The aim of this paper is to develop a Fast Voxelwise Genome Wide Association analysiS (FVGWAS) framework to efficiently carry out whole-genome analyses of whole-brain data. FVGWAS consists of three components including a heteroscedastic linear model, a global sure independence screening (GSIS) procedure, and a detection procedure based on wild bootstrap methods. Specifically, for standard linear association, the computational complexity is O(n N_V N_C) for the voxelwise genome wide association analysis (VGWAS) method compared with O((N_C + N_V) n^2) for FVGWAS. Simulation studies show that FVGWAS is an efficient method of searching sparse signals in an extremely large search space, while controlling for the family-wise error rate. Finally, we have successfully applied FVGWAS to a large-scale imaging genetic data analysis of ADNI data with 708 subjects, 193,275 voxels in RAVENS maps, and 501,584 SNPs, and the total processing time was 203,645 s for a single CPU. Our FVGWAS may be a valuable statistical toolbox for large-scale imaging genetic analysis as the field is rapidly advancing with ultra-high-resolution imaging and whole-genome sequencing. PMID:26025292
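
    To make the complexity comparison above concrete, the following sketch evaluates the two leading-order terms (n times voxels times variants for VGWAS, and variants plus voxels, times n squared, for FVGWAS) with the ADNI figures quoted in the abstract. Big-O expressions hide constant factors, so the resulting ratio is only indicative of scaling and is not a measured speed-up; the function names are hypothetical.

```python
def vgwas_cost(n, n_voxels, n_variants):
    """Leading-order term of O(n * N_V * N_C); constant factors ignored."""
    return n * n_voxels * n_variants


def fvgwas_cost(n, n_voxels, n_variants):
    """Leading-order term of O((N_C + N_V) * n^2); constant factors ignored."""
    return (n_variants + n_voxels) * n ** 2


# ADNI figures quoted in the abstract: subjects, voxels, SNPs.
n, n_voxels, n_variants = 708, 193_275, 501_584

ratio = vgwas_cost(n, n_voxels, n_variants) / fvgwas_cost(n, n_voxels, n_variants)
print(f"VGWAS / FVGWAS leading-term ratio: {ratio:.0f}")
```

    Algebraically the ratio reduces to (voxels times variants) divided by ((variants plus voxels) times n), so the advantage grows as either the number of voxels or the number of variants increases while the sample size stays in the thousands.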

  5. Comparative analysis of genome-wide divergence, domestication footprints and genome-wide association study of root traits for Gossypium hirsutum and Gossypium barbadense

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Use of 10,129 singleton SNPs of known genomic location in tetraploid cotton provided unique opportunities to characterize genome-wide diversity among 440 Gossypium hirsutum and 219 G. barbadense cultivars and landrace accessions of widespread origin. Using genome-wide distributed SNPs, we examined ...

  6. Genome-wide comparison of African-ancestry populations from CARe and other cohorts reveals signals of natural selection.

    PubMed

    Bhatia, Gaurav; Patterson, Nick; Pasaniuc, Bogdan; Zaitlen, Noah; Genovese, Giulio; Pollack, Samuela; Mallick, Swapan; Myers, Simon; Tandon, Arti; Spencer, Chris; Palmer, Cameron D; Adeyemo, Adebowale A; Akylbekova, Ermeg L; Cupples, L Adrienne; Divers, Jasmin; Fornage, Myriam; Kao, W H Linda; Lange, Leslie; Li, Mingyao; Musani, Solomon; Mychaleckyj, Josyf C; Ogunniyi, Adesola; Papanicolaou, George; Rotimi, Charles N; Rotter, Jerome I; Ruczinski, Ingo; Salako, Babatunde; Siscovick, David S; Tayo, Bamidele O; Yang, Qiong; McCarroll, Steve; Sabeti, Pardis; Lettre, Guillaume; De Jager, Phil; Hirschhorn, Joel; Zhu, Xiaofeng; Cooper, Richard; Reich, David; Wilson, James G; Price, Alkes L

    2011-09-01

    The study of recent natural selection in human populations has important applications to human history and medicine. Positive natural selection drives the increase in beneficial alleles and plays a role in explaining diversity across human populations. By discovering traits subject to positive selection, we can better understand the population level response to environmental pressures including infectious disease. Our study examines unusual population differentiation between three large data sets to detect natural selection. The populations examined, African Americans, Nigerians, and Gambians, are genetically close to one another (F(ST) < 0.01 for all pairs), allowing us to detect selection even with moderate changes in allele frequency. We also develop a tree-based method to pinpoint the population in which selection occurred, incorporating information across populations. Our genome-wide significant results corroborate loci previously reported to be under selection in Africans including HBB and CD36. At the HLA locus on chromosome 6, results suggest the existence of multiple, independent targets of population-specific selective pressure. In addition, we report a genome-wide significant (p = 1.36 × 10(-11)) signal of selection in the prostate stem cell antigen (PSCA) gene. The most significantly differentiated marker in our analysis, rs2920283, is highly differentiated in both Africa and East Asia and has prior genome-wide significant associations to bladder and gastric cancers. PMID:21907010
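
    As a rough illustration of what an F(ST) below 0.01 means, the sketch below computes the textbook two-population fixation index at a single biallelic locus from allele frequencies, using heterozygosities with equal population weights. This is an assumption made for illustration and is not necessarily the estimator used by the authors; the allele frequencies are hypothetical.

```python
def wright_fst(p1, p2):
    """Textbook F_ST for two populations at one biallelic locus.

    F_ST = (H_T - H_S) / H_T, where H_T is the expected heterozygosity of the
    pooled population and H_S is the mean within-population heterozygosity.
    Equal population weights are assumed.
    """
    p_bar = (p1 + p2) / 2
    h_t = 2 * p_bar * (1 - p_bar)
    if h_t == 0:
        return 0.0  # locus is monomorphic in both populations
    h_s = (2 * p1 * (1 - p1) + 2 * p2 * (1 - p2)) / 2
    return (h_t - h_s) / h_t


# Hypothetical allele frequencies, for illustration only.
print(wright_fst(0.50, 0.55))  # small frequency difference
print(wright_fst(0.40, 0.60))  # larger frequency difference
```

    When the genome-wide background is this low, a locus whose allele frequencies differ only moderately between populations already stands out, which is the property the abstract relies on.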

  7. The Phytocyanin Gene Family in Rice (Oryza sativa L.): Genome-Wide Identification, Classification and Transcriptional Analysis

    PubMed Central

    Ma, Haoli; Zhao, Heming; Liu, Zhi; Zhao, Jie

    2011-01-01

    Background Phytocyanins (PCs) are plant-specific blue copper proteins involved in electron transport, and a large number of known PCs are considered to be chimeric arabinogalactan proteins (AGPs). To date there has not been a genome-wide overview of the OsPC gene family. Therefore, as the first step and a useful strategy to elucidate the functions of OsPCs, there is an urgent need for a thorough genome-wide analysis of this gene family. Methodology/Principal Findings In this study, a total of 62 OsPC genes were identified through a comprehensive bioinformatics analysis of the rice (Oryza sativa L.) genome. Based on phylogeny and motif constitution, the family of OsPCs was classified into three subclasses: uclacyanin-like proteins (OsUCLs), stellacyanin-like proteins (OsSCLs) and early nodulin-like proteins (OsENODLs). Structure and glycosylation prediction indicated that 46 OsPCs were glycosylphosphatidylinositol-anchored proteins and 38 OsPCs were chimeric AGPs. Gene duplication analysis revealed that chromosomal segment and tandem duplications contributed almost equally to the expansion of this gene family, and duplication events mostly occurred in the OsUCL subfamily. The expression profiles of OsPC genes were analyzed at different stages of vegetative and reproductive development and under abiotic stresses. The profiles revealed that a large number of OsPC genes were abundantly expressed in the various stages of development. Moreover, 17 genes were regulated under abiotic stress treatments. Conclusions/Significance The genome-wide identification and expression analysis of OsPC genes should facilitate research in this gene family and give new insights toward elucidating their functions in higher plants. PMID:21984902

  8. Genome-Wide Transcriptional Analysis Reveals the Protection against Hypoxia-Induced Oxidative Injury in the Intestine of Tibetans via the Inhibition of GRB2/EGFR/PTPN11 Pathways

    PubMed Central

    Gesang, Luobu; Dan, Zeng; Gusang, Lamu

    2016-01-01

    The molecular mechanisms by which a hypoxic environment injures the intestinal mucosal barrier (IMB) are largely unknown. To address the issue, Han Chinese living at 100 m altitude and Tibetans living at high altitude (more than 3650 m) were recruited. Histological and transcriptome analyses were performed. The results showed intestinal villi were reduced and appeared irregular, and glandular epithelium was destroyed in the IMB of Tibetans when compared with Han Chinese. Transcriptome analysis revealed 2573 genes with altered expression. The levels of 1137 genes increased and 1436 genes decreased in Tibetans when compared with Han Chinese. Gene ontology (GO) analysis indicated most immunological responses were reduced in the IMB of Tibetans when compared with Han Chinese. Gene microarray showed that there were 25-, 22-, and 18-fold downregulation for growth factor receptor-bound protein 2 (GRB2), epidermal growth factor receptor (EGFR), and tyrosine-protein phosphatase nonreceptor type 11 (PTPN11) in the IMB of Tibetans when compared with Han Chinese. The downregulation of EGFR, GRB2, and PTPN11 will reduce the production of reactive oxygen species and protect the intestine against oxidative stress-induced injury. Thus, the transcriptome analysis indicated that the IMB of Tibetans is protected against hypoxia-induced oxidative injury via effects on the GRB2/EGFR/PTPN11 pathways. PMID:27594973

  9. Genome-Wide Transcriptional Analysis Reveals the Protection against Hypoxia-Induced Oxidative Injury in the Intestine of Tibetans via the Inhibition of GRB2/EGFR/PTPN11 Pathways.

    PubMed

    Li, Kang; Gesang, Luobu; Dan, Zeng; Gusang, Lamu

    2016-01-01

    The molecular mechanisms by which a hypoxic environment injures the intestinal mucosal barrier (IMB) are largely unknown. To address the issue, Han Chinese living at 100 m altitude and Tibetans living at high altitude (more than 3650 m) were recruited. Histological and transcriptome analyses were performed. The results showed intestinal villi were reduced and appeared irregular, and glandular epithelium was destroyed in the IMB of Tibetans when compared with Han Chinese. Transcriptome analysis revealed 2573 genes with altered expression. The levels of 1137 genes increased and 1436 genes decreased in Tibetans when compared with Han Chinese. Gene ontology (GO) analysis indicated most immunological responses were reduced in the IMB of Tibetans when compared with Han Chinese. Gene microarray showed that there were 25-, 22-, and 18-fold downregulation for growth factor receptor-bound protein 2 (GRB2), epidermal growth factor receptor (EGFR), and tyrosine-protein phosphatase nonreceptor type 11 (PTPN11) in the IMB of Tibetans when compared with Han Chinese. The downregulation of EGFR, GRB2, and PTPN11 will reduce the production of reactive oxygen species and protect the intestine against oxidative stress-induced injury. Thus, the transcriptome analysis indicated that the IMB of Tibetans is protected against hypoxia-induced oxidative injury via effects on the GRB2/EGFR/PTPN11 pathways. PMID:27594973

  10. Genome-Wide Screen Reveals Replication Pathway for Quasi-Palindrome Fragility Dependent on Homologous Recombination

    PubMed Central

    Zhang, Yu; Saini, Natalie; Sheng, Ziwei; Lobachev, Kirill S.

    2013-01-01

    Inverted repeats capable of forming hairpin and cruciform structures present a threat to chromosomal integrity. They induce double strand breaks, which lead to gross chromosomal rearrangements, the hallmarks of cancers and hereditary diseases. Secondary structure formation at this motif has been proposed to be the driving force for the instability, albeit the mechanisms leading to the fragility are not well-understood. We carried out a genome-wide screen to uncover the genetic players that govern fragility of homologous and homeologous Alu quasi-palindromes in the yeast Saccharomyces cerevisiae. We found that depletion or lack of components of the DNA replication machinery, proteins involved in Fe-S cluster biogenesis, the replication-pausing checkpoint pathway, the telomere maintenance complex or the Sgs1-Top3-Rmi1 dissolvasome augment fragility at Alu-IRs. Rad51, a component of the homologous recombination pathway, was found to be required for replication arrest and breakage at the repeats specifically in replication-deficient strains. These data demonstrate that Rad51 is required for the formation of breakage-prone secondary structures in situations when replication is compromised while another mechanism operates in DSB formation in replication-proficient strains. PMID:24339793

  11. Genome-wide screen reveals replication pathway for quasi-palindrome fragility dependent on homologous recombination.

    PubMed

    Zhang, Yu; Saini, Natalie; Sheng, Ziwei; Lobachev, Kirill S

    2013-01-01

    Inverted repeats capable of forming hairpin and cruciform structures present a threat to chromosomal integrity. They induce double strand breaks, which lead to gross chromosomal rearrangements, the hallmarks of cancers and hereditary diseases. Secondary structure formation at this motif has been proposed to be the driving force for the instability, albeit the mechanisms leading to the fragility are not well-understood. We carried out a genome-wide screen to uncover the genetic players that govern fragility of homologous and homeologous Alu quasi-palindromes in the yeast Saccharomyces cerevisiae. We found that depletion or lack of components of the DNA replication machinery, proteins involved in Fe-S cluster biogenesis, the replication-pausing checkpoint pathway, the telomere maintenance complex or the Sgs1-Top3-Rmi1 dissolvasome augment fragility at Alu-IRs. Rad51, a component of the homologous recombination pathway, was found to be required for replication arrest and breakage at the repeats specifically in replication-deficient strains. These data demonstrate that Rad51 is required for the formation of breakage-prone secondary structures in situations when replication is compromised while another mechanism operates in DSB formation in replication-proficient strains. PMID:24339793

  12. Genome-wide association study reveals multiple loci associated with primary tooth development during infancy.

    PubMed

    Pillas, Demetris; Hoggart, Clive J; Evans, David M; O'Reilly, Paul F; Sipilä, Kirsi; Lähdesmäki, Raija; Millwood, Iona Y; Kaakinen, Marika; Netuveli, Gopalakrishnan; Blane, David; Charoen, Pimphen; Sovio, Ulla; Pouta, Anneli; Freimer, Nelson; Hartikainen, Anna-Liisa; Laitinen, Jaana; Vaara, Sarianna; Glaser, Beate; Crawford, Peter; Timpson, Nicholas J; Ring, Susan M; Deng, Guohong; Zhang, Weihua; McCarthy, Mark I; Deloukas, Panos; Peltonen, Leena; Elliott, Paul; Coin, Lachlan J M; Smith, George Davey; Jarvelin, Marjo-Riitta

    2010-02-01

    Tooth development is a highly heritable process which relates to other growth and developmental processes, and which interacts with the development of the entire craniofacial complex. Abnormalities of tooth development are common, with tooth agenesis being the most common developmental anomaly in humans. We performed a genome-wide association study of time to first tooth eruption and number of teeth at one year in 4,564 individuals from the 1966 Northern Finland Birth Cohort (NFBC1966) and 1,518 individuals from the Avon Longitudinal Study of Parents and Children (ALSPAC). We identified 5 loci at P < 5 × 10⁻⁸, and 5 with suggestive association (P < 5 × 10⁻⁶). The loci included several genes with links to tooth and other organ development (KCNJ2, EDA, HOXB2, RAD51L1, IGF2BP1, HMGA2, MSRB3). Genes at four of the identified loci are implicated in the development of cancer. A variant within the HOXB gene cluster associated with occlusion defects requiring orthodontic treatment by age 31 years. PMID:20195514

  13. Genome-Wide Association Study Reveals the Genetic Basis of Stalk Cell Wall Components in Maize

    PubMed Central

    Hu, Xiaojiao; Liu, Zhifang; Wu, Yujin; Huang, Changling

    2016-01-01

    Lignin, cellulose and hemicellulose are the three main components of the plant cell wall and can impact stalk quality by affecting cell wall structure and strength. In this study, we evaluated the lignin (LIG), cellulose (CEL) and hemicellulose (HC) contents in maize using an association mapping panel that included 368 inbred lines in seven environments. A genome-wide association study using approximately 0.56 million SNPs with a minor allele frequency of 0.05 identified 22, 18 and 24 loci significantly associated with LIG, CEL and HC at P < 1.0 × 10⁻⁴, respectively. The allelic variation of each significant association contributed 4 to 7% of the phenotypic variation. Candidate genes identified by GWAS mainly encode enzymes involved in cell wall metabolism, transcription factors, protein kinases and proteins related to other biological processes. Among the association signals, six candidate genes had pleiotropic effects on lignin and cellulose content. These results provide valuable information for better understanding the genetic basis of stalk cell wall components in maize. PMID:27479588

  14. Genome-wide transcriptomic profiling of Anopheles gambiae hemocytes reveals pathogen-specific signatures upon bacterial challenge and Plasmodium berghei infection

    PubMed Central

    Baton, Luke A; Robertson, Anne; Warr, Emma; Strand, Michael R; Dimopoulos, George

    2009-01-01

    Background The mosquito Anopheles gambiae is a major vector of human malaria. Increasing evidence indicates that blood cells (hemocytes) comprise an essential arm of the mosquito innate immune response against both bacteria and malaria parasites. To further characterize the role of hemocytes in mosquito immunity, we undertook the first genome-wide transcriptomic analyses of adult female An. gambiae hemocytes following infection by two species of bacteria and a malaria parasite. Results We identified 4047 genes expressed in hemocytes, using An. gambiae genome-wide microarrays. While 279 transcripts were significantly enriched in hemocytes relative to whole adult female mosquitoes, 959 transcripts exhibited immune challenge-related regulation. The global transcriptomic responses of hemocytes to challenge with different species of bacteria and/or different stages of malaria parasite infection revealed discrete, minimally overlapping, pathogen-specific signatures of infection-responsive gene expression; 105 of these represented putative immunity-related genes including anti-Plasmodium factors. Of particular interest was the specific co-regulation of various members of the Imd and JNK immune signaling pathways during malaria parasite invasion of the mosquito midgut epithelium. Conclusion Our genome-wide transcriptomic analysis of adult mosquito hemocytes reveals pathogen-specific signatures of gene regulation and identifies several novel candidate genes for future functional studies. PMID:19500340

  15. Genome-wide DNA methylation analysis in obsessive-compulsive disorder patients

    PubMed Central

    Yue, Weihua; Cheng, Weiqiu; Liu, Zhaorui; Tang, Yi; Lu, Tianlan; Zhang, Dai; Tang, Muni; Huang, Yueqin

    2016-01-01

    The literature suggests that genetic and environmental factors interactively account for susceptibility to obsessive-compulsive disorder (OCD). DNA methylation may regulate expression of genes as a heritable epigenetic modification. The examination for genome-wide DNA methylation was performed on blood samples from 65 patients with OCD, as well as 96 healthy control subjects. The DNA methylation was examined at over 485,000 CpG sites using the Illumina Infinium Human Methylation450 BeadChip. As a result, 8,417 probes corresponding to 2,190 unique genes were found to be differentially methylated between OCD and healthy control subjects. Of those genes, 4,013 loci were located in CpG islands and 2,478 were in promoter regions. These included BCYRN1, BCOR, FGF13, HLA-DRB1, ARX, etc., which have previously been reported to be associated with OCD. Pathway analyses indicated that regulation of actin cytoskeleton, cell adhesion molecules (CAMs), actin binding, transcription regulator activity, and other pathways might be further associated with risk of OCD. Unsupervised clustering analysis of the top 3,000 most variable probes revealed two distinct groups with significantly more people with OCD in cluster one compared with controls (67.74% of cases vs. 27.13% of controls, Chi-square = 26.011, df = 1, P = 3.41E-07). These results strongly suggested that differential DNA methylation might play an important role in etiology of OCD. PMID:27527274
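    The P value reported above for the clustering comparison can be roughly checked from the stated statistic alone. The following is a minimal sketch, not the authors' analysis: it assumes a plain chi-square test with 1 degree of freedom, for which the upper-tail probability has the closed form erfc(sqrt(x / 2)). The function name chi2_sf_df1 is illustrative and does not come from the abstract.

```python
import math


def chi2_sf_df1(stat: float) -> float:
    """Upper-tail probability of a chi-square statistic with 1 degree of freedom.

    For df = 1 the statistic is the square of a standard normal variable Z,
    so P(X > x) = P(|Z| > sqrt(x)) = erfc(sqrt(x / 2)).
    """
    return math.erfc(math.sqrt(stat / 2.0))


# Statistic as reported in the abstract (Chi-square = 26.011, df = 1).
p_value = chi2_sf_df1(26.011)
print(f"P = {p_value:.2e}")
```

    Under these assumptions the result is on the order of 3 × 10⁻⁷, in line with the reported P = 3.41E-07; a small difference in the last digits would be expected from rounding of the published statistic.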

  16. Genome-wide DNA methylation analysis in obsessive-compulsive disorder patients.

    PubMed

    Yue, Weihua; Cheng, Weiqiu; Liu, Zhaorui; Tang, Yi; Lu, Tianlan; Zhang, Dai; Tang, Muni; Huang, Yueqin

    2016-01-01

    The literature suggests that genetic and environmental factors interactively account for susceptibility to obsessive-compulsive disorder (OCD). DNA methylation may regulate expression of genes as a heritable epigenetic modification. The examination for genome-wide DNA methylation was performed on blood samples from 65 patients with OCD, as well as 96 healthy control subjects. The DNA methylation was examined at over 485,000 CpG sites using the Illumina Infinium Human Methylation450 BeadChip. As a result, 8,417 probes corresponding to 2,190 unique genes were found to be differentially methylated between OCD and healthy control subjects. Of those genes, 4,013 loci were located in CpG islands and 2,478 were in promoter regions. These included BCYRN1, BCOR, FGF13, HLA-DRB1, ARX, etc., which have previously been reported to be associated with OCD. Pathway analyses indicated that regulation of actin cytoskeleton, cell adhesion molecules (CAMs), actin binding, transcription regulator activity, and other pathways might be further associated with risk of OCD. Unsupervised clustering analysis of the top 3,000 most variable probes revealed two distinct groups with significantly more people with OCD in cluster one compared with controls (67.74% of cases vs. 27.13% of controls, Chi-square = 26.011, df = 1, P = 3.41E-07). These results strongly suggested that differential DNA methylation might play an important role in etiology of OCD. PMID:27527274

  17. Genome wide analysis of transcript levels after perturbation of the EGFR pathway in the Drosophila ovary.

    PubMed

    Jordan, Katherine C; Hatfield, Steven D; Tworoger, Michael; Ward, Ellen J; Fischer, Karin A; Bowers, Stuart; Ruohola-Baker, Hannele

    2005-03-01

    Defects in the epidermal growth factor receptor (EGFR) pathway can lead to aggressive tumor formation. Activation of this pathway during normal development produces multiple outcomes at the cellular level, leading to cellular differentiation and cell cycle activation. To elucidate the downstream events induced by this pathway, we used genome-wide cDNA microarray technology to identify potential EGFR targets in Drosophila oogenesis. We focused on genes for which the transcriptional responses due to EGFR pathway activation and inactivation were in opposite directions, as this is expected for genes that are directly regulated by the pathway in this tissue type. We perturbed the EGFR pathway in epithelial follicle cells using seven different genetic backgrounds. To activate the pathway, we overexpressed an activated form of the EGFR (UAS-caEGFR), and an activated form of the signal transducer Raf (UAS-caRaf); we also over- or ectopically expressed the downstream homeobox transcription factor Mirror (UAS-mirr) and the ligand-activating serine protease Rhomboid (UAS-rho). To reduce pathway activity we used loss-of-function mutations in the ligand (gurken) and receptor (torpedo). From microarrays containing 6,255 genes, we found 454 genes that responded in an opposite manner in gain-of-function and loss-of-function conditions among which are many Wingless signaling pathway components. Further analysis of two such components, sugarless and pangolin, revealed a function for these genes in late follicle cell patterning. Of interest, components of other signaling pathways were also enriched in the EGFR target group, suggesting that one reason for the pleiotropic effects seen with EGFR activity in cancer progression and development may be its ability to regulate many other signaling pathways. PMID:15704171

  18. Genome-Wide Association Analysis of Oxidative Stress Resistance in Drosophila melanogaster

    PubMed Central

    Weber, Allison L.; Khan, George F.; Magwire, Michael M.; Tabor, Crystal L.; Mackay, Trudy F. C.; Anholt, Robert R. H.

    2012-01-01

    Background Aerobic organisms are susceptible to damage by reactive oxygen species. Oxidative stress resistance is a quantitative trait with population variation attributable to the interplay between genetic and environmental factors. Drosophila melanogaster provides an ideal system to study the genetics of variation for resistance to oxidative stress. Methods and Findings We used 167 wild-derived inbred lines of the Drosophila Genetic Reference Panel for a genome-wide association study of acute oxidative stress resistance to two oxidizing agents, paraquat and menadione sodium bisulfite. We found significant genetic variation for both stressors. Single nucleotide polymorphisms (SNPs) associated with variation in oxidative stress resistance were often sex-specific and agent-dependent, with a small subset common for both sexes or treatments. Associated SNPs had moderately large effects, with an inverse relationship between effect size and allele frequency. Linear models with up to 12 SNPs explained 67–79% and 56–66% of the phenotypic variance for resistance to paraquat and menadione sodium bisulfite, respectively. Many genes implicated were novel with no known role in oxidative stress resistance. Bioinformatics analyses revealed a cellular network comprising DNA metabolism and neuronal development, consistent with targets of oxidative stress-inducing agents. We confirmed associations of seven candidate genes associated with natural variation in oxidative stress resistance through mutational analysis. Conclusions We identified novel candidate genes associated with variation in resistance to oxidative stress that have context-dependent effects. These results form the basis for future translational studies to identify oxidative stress susceptibility/resistance genes that are evolutionarily conserved and might play a role in human disease. PMID:22496853

  19. A mega-analysis of genome-wide association studies for major depressive disorder.

    PubMed

    Ripke, Stephan; Wray, Naomi R; Lewis, Cathryn M; Hamilton, Steven P; Weissman, Myrna M; Breen, Gerome; Byrne, Enda M; Blackwood, Douglas H R; Boomsma, Dorret I; Cichon, Sven; Heath, Andrew C; Holsboer, Florian; Lucae, Susanne; Madden, Pamela A F; Martin, Nicholas G; McGuffin, Peter; Muglia, Pierandrea; Noethen, Markus M; Penninx, Brenda P; Pergadia, Michele L; Potash, James B; Rietschel, Marcella; Lin, Danyu; Müller-Myhsok, Bertram; Shi, Jianxin; Steinberg, Stacy; Grabe, Hans J; Lichtenstein, Paul; Magnusson, Patrik; Perlis, Roy H; Preisig, Martin; Smoller, Jordan W; Stefansson, Kari; Uher, Rudolf; Kutalik, Zoltan; Tansey, Katherine E; Teumer, Alexander; Viktorin, Alexander; Barnes, Michael R; Bettecken, Thomas; Binder, Elisabeth B; Breuer, René; Castro, Victor M; Churchill, Susanne E; Coryell, William H; Craddock, Nick; Craig, Ian W; Czamara, Darina; De Geus, Eco J; Degenhardt, Franziska; Farmer, Anne E; Fava, Maurizio; Frank, Josef; Gainer, Vivian S; Gallagher, Patience J; Gordon, Scott D; Goryachev, Sergey; Gross, Magdalena; Guipponi, Michel; Henders, Anjali K; Herms, Stefan; Hickie, Ian B; Hoefels, Susanne; Hoogendijk, Witte; Hottenga, Jouke Jan; Iosifescu, Dan V; Ising, Marcus; Jones, Ian; Jones, Lisa; Jung-Ying, Tzeng; Knowles, James A; Kohane, Isaac S; Kohli, Martin A; Korszun, Ania; Landen, Mikael; Lawson, William B; Lewis, Glyn; Macintyre, Donald; Maier, Wolfgang; Mattheisen, Manuel; McGrath, Patrick J; McIntosh, Andrew; McLean, Alan; Middeldorp, Christel M; Middleton, Lefkos; Montgomery, Grant M; Murphy, Shawn N; Nauck, Matthias; Nolen, Willem A; Nyholt, Dale R; O'Donovan, Michael; Oskarsson, Högni; Pedersen, Nancy; Scheftner, William A; Schulz, Andrea; Schulze, Thomas G; Shyn, Stanley I; Sigurdsson, Engilbert; Slager, Susan L; Smit, Johannes H; Stefansson, Hreinn; Steffens, Michael; Thorgeirsson, Thorgeir; Tozzi, Federica; Treutlein, Jens; Uhr, Manfred; van den Oord, Edwin J C G; Van Grootheest, Gerard; Völzke, Henry; Weilburg, Jeffrey B; Willemsen, Gonneke; Zitman, Frans G; Neale, Benjamin; Daly, Mark; Levinson, Douglas F; Sullivan, Patrick F

    2013-04-01

    Prior genome-wide association studies (GWAS) of major depressive disorder (MDD) have met with limited success. We sought to increase statistical power to detect disease loci by conducting a GWAS mega-analysis for MDD. In the MDD discovery phase, we analyzed more than 1.2 million autosomal and X chromosome single-nucleotide polymorphisms (SNPs) in 18 759 independent and unrelated subjects of recent European ancestry (9240 MDD cases and 9519 controls). In the MDD replication phase, we evaluated 554 SNPs in independent samples (6783 MDD cases and 50 695 controls). We also conducted a cross-disorder meta-analysis using 819 autosomal SNPs with P < 0.0001 for either MDD or the Psychiatric GWAS Consortium bipolar disorder (BIP) mega-analysis (9238 MDD cases/8039 controls and 6998 BIP cases/7775 controls). No SNPs achieved genome-wide significance in the MDD discovery phase, the MDD replication phase or in pre-planned secondary analyses (by sex, recurrent MDD, recurrent early-onset MDD, age of onset, pre-pubertal onset MDD or typical-like MDD from a latent class analysis of the MDD criteria). In the MDD-bipolar cross-disorder analysis, 15 SNPs exceeded genome-wide significance (P < 5 × 10⁻⁸), and all were in a 248 kb interval of high LD on 3p21.1 (chr3:52 425 083-53 822 102, minimum P = 5.9 × 10⁻⁹ at rs2535629). Although this is the largest genome-wide analysis of MDD yet conducted, its high prevalence means that the sample is still underpowered to detect genetic effects typical for complex traits. Therefore, we were unable to identify robust and replicable findings. We discuss what this means for genetic research for MDD. The 3p21.1 MDD-BIP finding should be interpreted with caution as the most significant SNP did not replicate in MDD samples, and genotyping in independent samples will be needed to resolve its status. PMID:22472876
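    The genome-wide significance threshold quoted in this abstract is conventionally motivated as a Bonferroni correction of a 0.05 family-wise error rate over roughly one million independent tests. The sketch below assumes that convention (the abstract does not state how the threshold was derived) and also checks the discovery-phase sample arithmetic using only numbers given above. The helper name bonferroni_threshold is illustrative.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance level that keeps the family-wise error rate at alpha."""
    return alpha / n_tests


# Assumption: about one million independent common-variant tests.
threshold = bonferroni_threshold(0.05, 1_000_000)

# Values taken from the abstract.
min_p_cross_disorder = 5.9e-9           # minimum P at rs2535629
discovery_cases, discovery_controls = 9240, 9519
discovery_total = discovery_cases + discovery_controls

print(f"threshold = {threshold:.1e}")
print("cross-disorder signal passes:", min_p_cross_disorder < threshold)
print("discovery sample size:", discovery_total)
```

    Under this assumption the threshold is 5 × 10⁻⁸, which the cross-disorder minimum P value passes, and the case and control counts sum to the 18 759 subjects stated for the discovery phase.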

  20. Integrated genome-wide association, coexpression network, and expression single nucleotide polymorphism analysis identifies novel pathway in allergic rhinitis

    PubMed Central

    2014-01-01

    Background Allergic rhinitis is a common disease whose genetic basis is incompletely explained. We report an integrated genomic analysis of allergic rhinitis. Methods We performed genome wide association studies (GWAS) of allergic rhinitis in 5633 ethnically diverse North American subjects. Next, we profiled gene expression in disease-relevant tissue (peripheral blood CD4+ lymphocytes) collected from subjects who had been genotyped. We then integrated the GWAS and gene expression data using expression single nucleotide (eSNP), coexpression network, and pathway approaches to identify the biologic relevance of our GWAS. Results GWAS revealed ethnicity-specific findings, with 4 genome-wide significant loci among Latinos and 1 genome-wide significant locus in the GWAS meta-analysis across ethnic groups. To identify biologic context for these results, we constructed a coexpression network to define modules of genes with similar patterns of CD4+ gene expression (coexpression modules) that could serve as constructs of broader gene expression. 6 of the 22 GWAS loci with P-value ≤ 1 × 10⁻⁶ tagged one particular coexpression module (4.0-fold enrichment, P-value 0.0029), and this module also had the greatest enrichment (3.4-fold enrichment, P-value 2.6 × 10⁻²⁴) for allergic rhinitis-associated eSNPs (genetic variants associated with both gene expression and allergic rhinitis). The integrated GWAS, coexpression network, and eSNP results therefore supported this coexpression module as an allergic rhinitis module. Pathway analysis revealed that the module was enriched for mitochondrial pathways (8.6-fold enrichment, P-value 4.5 × 10⁻⁷²). Conclusions Our results highlight mitochondrial pathways as a target for further investigation of allergic rhinitis mechanism and treatment. Our integrated approach can be applied to provide biologic context for GWAS of other diseases. PMID:25085501
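    The fold-enrichment figure for the GWAS loci can be unpacked with simple arithmetic. The sketch below assumes the usual definition, fold enrichment = observed fraction / expected (background) fraction; the abstract does not spell out its exact definition or the background used, so the implied background fraction is a back-calculation, not a reported value.

```python
# Values taken from the abstract.
loci_in_module = 6       # GWAS loci tagging the coexpression module
loci_total = 22          # GWAS loci at the suggestive threshold
fold_enrichment = 4.0    # reported fold enrichment

# Observed fraction of suggestive loci that fall in the module.
observed_fraction = loci_in_module / loci_total

# Assumed definition: fold = observed / expected, so expected = observed / fold.
implied_background_fraction = observed_fraction / fold_enrichment

print(f"observed fraction:           {observed_fraction:.3f}")
print(f"implied background fraction: {implied_background_fraction:.3f}")
```

    Read this way, about 27% of the suggestive loci tag the module, against an implied background of about 7%.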

  1. Genome-wide association analysis in primary sclerosing cholangitis identifies two non-HLA susceptibility loci

    PubMed Central

    Melum, Espen; Franke, Andre; Schramm, Christoph; Weismüller, Tobias J; Gotthardt, Daniel Nils; Offner, Felix A; Juran, Brian D; Laerdahl, Jon K; Labi, Verena; Björnsson, Einar; Weersma, Rinse K; Henckaerts, Liesbet; Teufel, Andreas; Rust, Christian; Ellinghaus, Eva; Balschun, Tobias; Boberg, Kirsten Muri; Ellinghaus, David; Bergquist, Annika; Sauer, Peter; Ryu, Euijung; Hov, Johannes Roksund; Wedemeyer, Jochen; Lindkvist, Björn; Wittig, Michael; Porte, Robert J; Holm, Kristian; Gieger, Christian; Wichmann, H-Erich; Stokkers, Pieter; Ponsioen, Cyriel Y; Runz, Heiko; Stiehl, Adolf; Wijmenga, Cisca; Sterneck, Martina; Vermeire, Severine; Beuers, Ulrich; Villunger, Andreas; Schrumpf, Erik; Lazaridis, Konstantinos N; Manns, Michael P; Schreiber, Stefan; Karlsen, Tom H

    2015-01-01

    Primary sclerosing cholangitis (PSC) is a chronic bile duct disease affecting 2.4–7.5% of individuals with inflammatory bowel disease. We performed a genome-wide association analysis of 2,466,182 SNPs in 715 individuals with PSC and 2,962 controls, followed by replication in 1,025 PSC cases and 2,174 controls. We detected non-HLA associations at rs3197999 in MST1 and rs6720394 near BCL2L11 (combined P = 1.1 × 10⁻¹⁶ and P = 4.1 × 10⁻⁸, respectively). PMID:21151127

  2. Genome-wide association analysis in primary sclerosing cholangitis identifies two non-HLA susceptibility loci.

    PubMed

    Melum, Espen; Franke, Andre; Schramm, Christoph; Weismüller, Tobias J; Gotthardt, Daniel Nils; Offner, Felix A; Juran, Brian D; Laerdahl, Jon K; Labi, Verena; Björnsson, Einar; Weersma, Rinse K; Henckaerts, Liesbet; Teufel, Andreas; Rust, Christian; Ellinghaus, Eva; Balschun, Tobias; Boberg, Kirsten Muri; Ellinghaus, David; Bergquist, Annika; Sauer, Peter; Ryu, Euijung; Hov, Johannes Roksund; Wedemeyer, Jochen; Lindkvist, Björn; Wittig, Michael; Porte, Robert J; Holm, Kristian; Gieger, Christian; Wichmann, H-Erich; Stokkers, Pieter; Ponsioen, Cyriel Y; Runz, Heiko; Stiehl, Adolf; Wijmenga, Cisca; Sterneck, Martina; Vermeire, Severine; Beuers, Ulrich; Villunger, Andreas; Schrumpf, Erik; Lazaridis, Konstantinos N; Manns, Michael P; Schreiber, Stefan; Karlsen, Tom H

    2011-01-01

    Primary sclerosing cholangitis (PSC) is a chronic bile duct disease affecting 2.4-7.5% of individuals with inflammatory bowel disease. We performed a genome-wide association analysis of 2,466,182 SNPs in 715 individuals with PSC and 2,962 controls, followed by replication in 1,025 PSC cases and 2,174 controls. We detected non-HLA associations at rs3197999 in MST1 and rs6720394 near BCL2L11 (combined P = 1.1 × 10⁻¹⁶ and P = 4.1 × 10⁻⁸, respectively). PMID:21151127

  3. Novel insights into the relationships between dendritic cell subsets in human and mouse revealed by genome-wide expression profiling

    PubMed Central

    Robbins, Scott H; Walzer, Thierry; Dembélé, Doulaye; Thibault, Christelle; Defays, Axel; Bessou, Gilles; Xu, Huichun; Vivier, Eric; Sellars, MacLean; Pierre, Philippe; Sharp, Franck R; Chan, Susan; Kastner, Philippe; Dalod, Marc

    2008-01-01

    Background Dendritic cells (DCs) are a complex group of cells that play a critical role in vertebrate immunity. Lymph-node resident DCs (LN-DCs) are subdivided into conventional DC (cDC) subsets (CD11b and CD8α in mouse; BDCA1 and BDCA3 in human) and plasmacytoid DCs (pDCs). It is currently unclear if these various DC populations belong to a unique hematopoietic lineage and if the subsets identified in the mouse and human systems are evolutionary homologs. To gain novel insights into these questions, we sought conserved genetic signatures for LN-DCs and in vitro derived granulocyte-macrophage colony stimulating factor (GM-CSF) DCs through the analysis of a compendium of genome-wide expression profiles of mouse or human leukocytes. Results We show through clustering analysis that all LN-DC subsets form a distinct branch within the leukocyte family tree, and reveal a transcriptomal signature evolutionarily conserved in all LN-DC subsets. Moreover, we identify a large gene expression program shared between mouse and human pDCs, and smaller conserved profiles shared between mouse and human LN-cDC subsets. Importantly, most of these genes have not been previously associated with DC function and many have unknown functions. Finally, we use compendium analysis to re-evaluate the classification of interferon-producing killer DCs, lin-CD16+HLA-DR+ cells and in vitro derived GM-CSF DCs, and show that these cells are more closely linked to natural killer and myeloid cells, respectively. Conclusion Our study provides a unique database resource for future investigation of the evolutionarily conserved molecular pathways governing the ontogeny and functions of leukocyte subsets, especially DCs. PMID:18218067

  4. Genome-wide expression analysis upon constitutive activation of the HacA bZIP transcription factor in Aspergillus niger reveals a coordinated cellular response to counteract ER stress

    PubMed Central

    2012-01-01

    Background: HacA/Xbp1 is a conserved bZIP transcription factor in eukaryotic cells which regulates gene expression in response to various forms of secretion stress and as part of secretory cell differentiation. In the present study, we replaced the endogenous hacA gene of an Aspergillus niger strain with a gene encoding a constitutively active form of the HacA transcription factor (HacACA). The impact of constitutive HacA activity during exponential growth was explored in bioreactor controlled cultures using transcriptomic analysis to identify affected genes and processes. Results: Transcription profiles for the wild-type strain (HacAWT) and the HacACA strain were obtained using Affymetrix GeneChip analysis of three replicate batch cultures of each strain. In addition to the well known HacA targets such as the ER resident foldases and chaperones, GO enrichment analysis revealed up-regulation of genes involved in protein glycosylation, phospholipid biosynthesis, intracellular protein transport, exocytosis and protein complex assembly in the HacACA mutant. Biological processes over-represented in the down-regulated genes include those belonging to central metabolic pathways, translation and transcription. A remarkable transcriptional response in the HacACA strain was the down-regulation of the AmyR transcription factor and its target genes. Conclusions: The results indicate that the constitutive activation of the HacA leads to a coordinated regulation of the folding and secretion capacity of the cell, but with consequences on growth and fungal physiology to reduce secretion stress. PMID:22846479

  5. Genome-Wide Comparative Analysis Reveals Similar Types of NBS Genes in Hybrid Citrus sinensis Genome and Original Citrus clementine Genome and Provides New Insights into Non-TIR NBS Genes

    PubMed Central

    Wang, Yunsheng; Zhou, Lijuan; Li, Dazhi; Dai, Liangying; Lawton-Rauh, Amy; Srimani, Pradip K.; Duan, Yongping; Luo, Feng

    2015-01-01

    In this study, we identified and compared nucleotide-binding site (NBS) domain-containing genes from three Citrus genomes (C. clementina, C. sinensis from USA and C. sinensis from China). Phylogenetic analysis of all Citrus NBS genes across these three genomes revealed that there are three approximately evenly numbered groups: one group contains the Toll-Interleukin receptor (TIR) domain and two different Non-TIR groups in which most proteins contain the Coiled Coil (CC) domain. Motif analysis confirmed that the two groups of CC-containing NBS genes are from different evolutionary origins. We partitioned NBS genes into clades using NBS domain sequence distances and found most clades include NBS genes from all three Citrus genomes. This suggests that the three Citrus genomes have similar numbers and types of NBS genes. We also mapped the re-sequenced reads of three pomelo and three mandarin genomes onto the C. sinensis genome. We found that most NBS genes of the hybrid C. sinensis genome have corresponding homologous genes in both pomelo and mandarin genomes. The homologous NBS genes in pomelo and mandarin suggest that the parental species of C. sinensis may contain similar types of NBS genes. This explains why the hybrid C. sinensis and original C. clementina have similar types of NBS genes in this study. Furthermore, we found that sequence variation amongst Citrus NBS genes was shaped by multiple independent and shared accelerated mutation accumulation events among different groups of NBS genes and in different Citrus genomes. Our comparative analyses yield valuable insight into the structure, organization and evolution of NBS genes in Citrus genomes. Furthermore, our comprehensive analysis showed that the non-TIR NBS genes can be divided into two groups that come from different evolutionary origins. This provides new insights into non-TIR genes, which have not received much attention. PMID:25811466

  6. Genome-wide Meta-analysis on the Sense of Smell Among US Older Adults.

    PubMed

    Dong, Jing; Yang, Jingyun; Tranah, Greg; Franceschini, Nora; Parimi, Neeta; Alkorta-Aranburu, Gorka; Xu, Zongli; Alonso, Alvaro; Cummings, Steven R; Fornage, Myriam; Huang, Xuemei; Kritchevsky, Stephen; Liu, Yongmei; London, Stephanie; Niu, Liang; Wilson, Robert S; De Jager, Philip L; Yu, Lei; Singleton, Andrew B; Harris, Tamara; Mosley, Thomas H; Pinto, Jayant M; Bennett, David A; Chen, Honglei

    2015-11-01

    Olfactory dysfunction is common among older adults and affects their safety, nutrition, quality of life, and mortality. More importantly, the decreased sense of smell is an early symptom of neurodegenerative diseases such as Parkinson disease (PD) and Alzheimer disease. However, the genetic determinants for the sense of smell have been poorly investigated. We here performed the first genome-wide meta-analysis on the sense of smell among 6252 US older adults of European descent from the Atherosclerosis Risk in Communities (ARIC) study, the Health, Aging, and Body Composition (Health ABC) study, and the Religious Orders Study and the Rush Memory and Aging Project (ROS/MAP). Genome-wide association study analysis was performed first by individual cohorts and then meta-analyzed using fixed-effect models with inverse variance weights. Although no SNPs reached genome-wide statistical significance, we identified 13 loci with suggestive evidence for an association with the sense of smell (Pmeta < 1 × 10). Of these, 2 SNPs at chromosome 17q21.31 (rs199443 in NSF, P = 3.02 × 10; and rs2732614 in KIAA1267-LRRC37A, P = 6.65 × 10) exhibited cis effects on the expression of microtubule-associated protein tau (MAPT, 17q21.31) in 447 frontal-cortex samples obtained postmortem and profiled by RNA-seq (P < 1 × 10). Gene-based and pathway-enrichment analyses further implicated MAPT in regulating the sense of smell in older adults. Similar results were obtained after excluding participants who reported a physician-diagnosed PD or use of PD medications. In conclusion, we provide preliminary evidence that the MAPT locus may play a role in regulating the sense of smell in older adults and therefore offer a potential genetic link between poor sense of smell and major neurodegenerative diseases. PMID:26632684
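    The fixed-effect, inverse-variance-weighted meta-analysis named in the abstract above can be sketched in a few lines. This is a minimal illustration, not the authors' pipeline: the function name `fixed_effect_meta` and the three per-cohort effect sizes and standard errors are hypothetical, and a two-sided normal approximation is assumed for the p-value.

```python
import math

def fixed_effect_meta(betas, ses):
    """Combine per-cohort effect estimates with inverse-variance weights.

    betas: per-cohort effect estimates for one SNP
    ses:   matching standard errors
    Returns (pooled beta, pooled SE, z-score, two-sided p-value).
    """
    weights = [1.0 / se ** 2 for se in ses]      # w_i = 1 / SE_i^2
    total = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total
    se = math.sqrt(1.0 / total)                  # SE of the pooled estimate
    z = beta / se
    p = math.erfc(abs(z) / math.sqrt(2.0))       # two-sided normal p-value
    return beta, se, z, p

# Hypothetical per-cohort estimates for a single SNP (illustration only).
betas = [0.10, 0.20, 0.15]
ses = [0.05, 0.10, 0.05]
pooled_beta, pooled_se, z, p = fixed_effect_meta(betas, ses)
print(pooled_beta, pooled_se, z, p)
```

    Cohorts with smaller standard errors receive larger weights, so the pooled estimate is pulled toward the most precisely measured cohorts.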

  7. A genome-wide meta-analysis of association studies of Cloninger's Temperament Scales

    PubMed Central

    Service, S K; Verweij, K J H; Lahti, J; Congdon, E; Ekelund, J; Hintsanen, M; Räikkönen, K; Lehtimäki, T; Kähönen, M; Widen, E; Taanila, A; Veijola, J; Heath, A C; Madden, P A F; Montgomery, G W; Sabatti, C; Järvelin, M-R; Palotie, A; Raitakari, O; Viikari, J; Martin, N G; Eriksson, J G; Keltikangas-Järvinen, L; Wray, N R; Freimer, N B

    2012-01-01

    Temperament has a strongly heritable component, yet multiple independent genome-wide studies have failed to identify significant genetic associations. We have assembled the largest sample to date of persons with genome-wide genotype data, who have been assessed with Cloninger's Temperament and Character Inventory. Sum scores for novelty seeking, harm avoidance, reward dependence and persistence have been measured in over 11 000 persons collected in four different cohorts. Our study had >80% power to identify genome-wide significant loci (P<1.25 × 10−8, with correction for testing four scales) accounting for ⩾0.4% of the phenotypic variance in temperament scales. Using meta-analysis techniques, gene-based tests and pathway analysis we have tested over 1.2 million single-nucleotide polymorphisms (SNPs) for association to each of the four temperament dimensions. We did not discover any SNPs, genes, or pathways to be significantly related to the four temperament dimensions, after correcting for multiple testing. Less than 1% of the variability in any temperament dimension appears to be accounted for by a risk score derived from the SNPs showing strongest association to the temperament dimensions. Elucidation of genetic loci significantly influencing temperament and personality will require potentially very large samples, and/or a more refined phenotype. Item response theory methodology may be a way to incorporate data from cohorts assessed with multiple personality instruments, and might be a method by which a large sample of a more refined phenotype could be acquired. PMID:22832960
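    The threshold quoted above (P < 1.25 × 10−8) is consistent with dividing a genome-wide significance level by the four scales tested. The abstract does not state the base level, so the conventional 5 × 10−8 used below is an assumption, as are the variable names.

```python
import math

# Assumed conventional genome-wide significance level (not stated in the abstract).
GENOME_WIDE_ALPHA = 5e-8
# Number of temperament scales tested, as given in the abstract.
N_SCALES = 4

# Bonferroni-style correction: divide the level by the number of tests.
threshold = GENOME_WIDE_ALPHA / N_SCALES
print(threshold)

# Sanity check against the value quoted in the abstract.
assert math.isclose(threshold, 1.25e-8)
```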

  8. Genome-Wide Collation of the Plasmodium falciparum WDR Protein Superfamily Reveals Malarial Parasite-Specific Features

    PubMed Central

    Chahar, Priyanka; Kaushik, Manjeri; Gill, Sarvajeet Singh; Gakhar, Surendra Kumar; Gopalan, Natrajan; Datt, Manish; Sharma, Amit; Gill, Ritu

    2015-01-01

    Despite a significant drop in malaria deaths during the past decade, malaria continues to be one of the biggest health problems around the globe. WD40 repeat (WDR)-containing proteins comprise one of the largest and most functionally diverse protein superfamilies in eukaryotes, acting as scaffolds for assembling large protein complexes. In the present study, we report an extensive in silico analysis of the WDR gene family in the human malaria parasite Plasmodium falciparum. Our genome-wide identification has revealed 80 putative WDR genes in P. falciparum (PfWDRs). Five distinct domain compositions were discovered in Plasmodium as compared to the human host. Notably, 31 PfWDRs were annotated/re-annotated on the basis of their orthologs in other species. Interestingly, most PfWDRs were larger than their human homologs, highlighting the presence of parasite-specific insertions. Fifteen PfWDRs appeared specific to Plasmodium, with no assigned orthologs. Expression profiling of PfWDRs revealed a mixture of linear and nonlinear relationships between transcriptome and proteome, and only nine PfWDRs were found to be stage-specific. Homology modeling identified conservation of major binding sites in PfCAF-1 and PfRACK. Protein-protein interaction network analyses suggested that PfWDRs are highly connected proteins with ~1928 potential interactions, supporting their role as hubs in cellular networks. The present study highlights the roles and relevance of the WDR family in P. falciparum, and identifies unique features that lay a foundation for further experimental dissection of PfWDRs. PMID:26043001

  9. Genome-wide identification, isolation and expression analysis of auxin response factor (ARF) gene family in sweet orange (Citrus sinensis)

    PubMed Central

    Li, Si-Bei; OuYang, Wei-Zhi; Hou, Xiao-Jin; Xie, Liang-Liang; Hu, Chun-Gen; Zhang, Jin-Zhi

    2015-01-01

    Auxin response factors (ARFs) are an important family of proteins in auxin-mediated response, with key roles in various physiological and biochemical processes. To date, a genome-wide overview of the ARF gene family in citrus has not been available. A systematic analysis of this gene family in citrus was begun by carrying out a genome-wide search for the homologs of ARFs. A total of 19 nonredundant ARF genes (CiARF) were found and validated from the sweet orange. A comprehensive overview of the CiARFs was undertaken, including the gene structures, phylogenetic analysis, chromosome locations, conserved motifs of proteins, and cis-elements in promoters of CiARF. Furthermore, expression profiling using real-time PCR revealed expression of many CiARF genes, albeit with different patterns depending on tissue type and/or developmental stage. Comprehensive expression analysis of these genes was also performed under two hormone treatments using real-time PCR. Indole-3-acetic acid (IAA) and N-1-naphthylphthalamic acid (NPA) treatment experiments revealed differential up-regulation and down-regulation, respectively, of the 19 citrus ARF genes in the callus of sweet orange. Our comprehensive analysis of ARF genes further elucidates the roles of CiARF family members during citrus growth and development. PMID:25870601

  10. Integrative pathway analysis of genome-wide association studies and gene expression data in prostate cancer

    PubMed Central

    2012-01-01

    Background: Pathway analysis of large-scale omics data assists us with the examination of the cumulative effects of multiple functionally related genes, which are difficult to detect using the traditional single gene/marker analysis. So far, most of the genomic studies have been conducted in a single domain, e.g., by genome-wide association studies (GWAS) or microarray gene expression investigation. A combined analysis of disease susceptibility genes across multiple platforms at the pathway level is an urgent need because it can reveal more reliable and more biologically important information. Results: We performed an integrative pathway analysis of a GWAS dataset and a microarray gene expression dataset in prostate cancer. We obtained a comprehensive pathway annotation set from knowledge-based public resources, including KEGG pathways and the prostate cancer candidate gene set, and gene sets specifically defined based on cross-platform information. By leveraging on this pathway collection, we first searched for significant pathways in the GWAS dataset using four methods, which represent two broad groups of pathway analysis approaches. The significant pathways identified by each method varied greatly, but the results were more consistent within each method group than between groups. Next, we conducted a gene set enrichment analysis of the microarray gene expression data and found 13 pathways with cross-platform evidence, including "Fc gamma R-mediated phagocytosis" (PGWAS = 0.003, Pexpr < 0.001, and Pcombined = 6.18 × 10−8), "regulation of actin cytoskeleton" (PGWAS = 0.003, Pexpr = 0.009, and Pcombined = 3.34 × 10−4), and "Jak-STAT signaling pathway" (PGWAS = 0.001, Pexpr = 0.084, and Pcombined = 8.79 × 10−4). Conclusions: Our results provide evidence at both the genetic variation and expression levels that several key pathways might have been involved in the pathological development of prostate cancer. Our framework that employs gene expression data to facilitate

  11. Genome-Wide Diversity in the Levant Reveals Recent Structuring by Culture

    PubMed Central

    Haber, Marc; Gauguier, Dominique; Youhanna, Sonia; Patterson, Nick; Moorjani, Priya; Botigué, Laura R.; Platt, Daniel E.; Matisoo-Smith, Elizabeth; Soria-Hernanz, David F.; Wells, R. Spencer; Bertranpetit, Jaume; Tyler-Smith, Chris

    2013-01-01

    The Levant is a region in the Near East with an impressive record of continuous human existence and major cultural developments since the Paleolithic period. Genetic and archeological studies present solid evidence placing the Middle East and the Arabian Peninsula as the first stepping-stone outside Africa. There is, however, little understanding of demographic changes in the Middle East, particularly the Levant, after the first Out-of-Africa expansion and how the Levantine peoples relate genetically to each other and to their neighbors. In this study we analyze more than 500,000 genome-wide SNPs in 1,341 new samples from the Levant and compare them to samples from 48 populations worldwide. Our results show recent genetic stratifications in the Levant are driven by the religious affiliations of the populations within the region. Cultural changes within the last two millennia appear to have facilitated/maintained admixture between culturally similar populations from the Levant, Arabian Peninsula, and Africa. The same cultural changes seem to have resulted in genetic isolation of other groups by limiting admixture with culturally different neighboring populations. Consequently, Levant populations today fall into two main groups: one sharing more genetic characteristics with modern-day Europeans and Central Asians, and the other with closer genetic affinities to other Middle Easterners and Africans. Finally, we identify a putative Levantine ancestral component that diverged from other Middle Easterners ∼23,700–15,500 years ago during the last glacial period, and diverged from Europeans ∼15,900–9,100 years ago between the last glacial warming and the start of the Neolithic. PMID:23468648

  12. Genome-wide modeling of transcription kinetics reveals patterns of RNA production delays

    PubMed Central

    Honkela, Antti; Peltonen, Jaakko; Topa, Hande; Charapitsa, Iryna; Matarese, Filomena; Grote, Korbinian; Stunnenberg, Hendrik G.; Reid, George; Lawrence, Neil D.; Rattray, Magnus

    2015-01-01

    Genes with similar transcriptional activation kinetics can display very different temporal mRNA profiles because of differences in transcription time, degradation rate, and RNA-processing kinetics. Recent studies have shown that a splicing-associated RNA production delay can be significant. To investigate this issue more generally, it is useful to develop methods applicable to genome-wide datasets. We introduce a joint model of transcriptional activation and mRNA accumulation that can be used for inference of transcription rate, RNA production delay, and degradation rate given data from high-throughput sequencing time course experiments. We combine a mechanistic differential equation model with a nonparametric statistical modeling approach allowing us to capture a broad range of activation kinetics, and we use Bayesian parameter estimation to quantify the uncertainty in estimates of the kinetic parameters. We apply the model to data from estrogen receptor α activation in the MCF-7 breast cancer cell line. We use RNA polymerase II ChIP-Seq time course data to characterize transcriptional activation and mRNA-Seq time course data to quantify mature transcripts. We find that 11% of genes with a good signal in the data display a delay of more than 20 min between completing transcription and mature mRNA production. The genes displaying these long delays are significantly more likely to be short. We also find a statistical association between high delay and late intron retention in pre-mRNA data, indicating significant splicing-associated production delays in many genes. PMID:26438844
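    To make the idea of an RNA production delay concrete, the sketch below simulates a delayed linear ODE for mature mRNA, dm/dt = β p(t − Δ) − α m(t), where p is transcriptional activity, β a production rate, α a degradation rate and Δ the delay. The abstract does not give the equation, so this form is an assumption; the forward-Euler integrator, the step-shaped activity function and all parameter values are hypothetical and chosen only for illustration. The Bayesian inference and nonparametric modeling described in the abstract are not reproduced here.

```python
def simulate_mrna(p, beta, alpha, delay, t_end, dt=0.1, m0=0.0):
    """Forward-Euler simulation of dm/dt = beta * p(t - delay) - alpha * m(t).

    p:     function giving transcriptional activity at time t (minutes)
    beta:  production rate; alpha: degradation rate (per minute)
    delay: lag between finished transcription and mature mRNA (minutes)
    Returns (times, mrna) as lists of equal length.
    """
    n_steps = int(round(t_end / dt))
    times = [0.0]
    mrna = [m0]
    for i in range(n_steps):
        t = i * dt
        production = beta * p(t - delay)         # activity seen 'delay' minutes ago
        mrna.append(mrna[-1] + dt * (production - alpha * mrna[-1]))
        times.append(t + dt)
    return times, mrna

def step_activity(t):
    """Hypothetical activity profile: transcription switches on at t = 0."""
    return 1.0 if t >= 0.0 else 0.0

# Hypothetical parameters: 20 min delay, steady state at beta / alpha = 20.
times, mrna = simulate_mrna(step_activity, beta=1.0, alpha=0.05,
                            delay=20.0, t_end=200.0)
print(mrna[200], mrna[201], mrna[-1])
```

    With these settings the simulated mRNA level stays at zero for the first 20 minutes and then rises toward β/α, which is the qualitative signature of a production delay.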

  13. Genome-wide mutagenesis reveals that ORF7 is a novel VZV skin-tropic factor.

    PubMed

    Zhang, Zhen; Selariu, Anca; Warden, Charles; Huang, Grace; Huang, Ying; Zaccheus, Oluleke; Cheng, Tong; Xia, Ningshao; Zhu, Hua

    2010-01-01

    The Varicella Zoster Virus (VZV) is a ubiquitous human alpha-herpesvirus that is the causative agent of chicken pox and shingles. Although an attenuated VZV vaccine (v-Oka) has been widely used in children in the United States, chicken pox outbreaks are still seen, and the shingles vaccine only reduces the risk of shingles by 50%. Therefore, VZV still remains an important public health concern. Knowledge of VZV replication and pathogenesis remains limited due to its highly cell-associated nature in cultured cells, the difficulty of generating recombinant viruses, and VZV's almost exclusive tropism for human cells and tissues. In order to circumvent these hurdles, we cloned the entire VZV (p-Oka) genome into a bacterial artificial chromosome that included a dual-reporter system (GFP and luciferase reporter genes). We used PCR-based mutagenesis and the homologous recombination system in the E. coli to individually delete each of the genome's 70 unique ORFs. The collection of viral mutants obtained was systematically examined both in MeWo cells and in cultured human fetal skin organ samples. We use our genome-wide deletion library to provide novel functional annotations to 51% of the VZV proteome. We found 44 out of 70 VZV ORFs to be essential for viral replication. Among the 26 non-essential ORF deletion mutants, eight have discernable growth defects in MeWo. Interestingly, four ORFs were found to be required for viral replication in skin organ cultures, but not in MeWo cells, suggesting their potential roles as skin tropism factors. One of the genes (ORF7) has never been described as a skin tropic factor. The global profiling of the VZV genome gives further insights into the replication and pathogenesis of this virus, which can lead to improved prevention and therapy of chicken pox and shingles. PMID:20617166

  14. Genome-wide modeling of transcription kinetics reveals patterns of RNA production delays.

    PubMed

    Honkela, Antti; Peltonen, Jaakko; Topa, Hande; Charapitsa, Iryna; Matarese, Filomena; Grote, Korbinian; Stunnenberg, Hendrik G; Reid, George; Lawrence, Neil D; Rattray, Magnus

    2015-10-20

    Genes with similar transcriptional activation kinetics can display very different temporal mRNA profiles because of differences in transcription time, degradation rate, and RNA-processing kinetics. Recent studies have shown that a splicing-associated RNA production delay can be significant. To investigate this issue more generally, it is useful to develop methods applicable to genome-wide datasets. We introduce a joint model of transcriptional activation and mRNA accumulation that can be used for inference of transcription rate, RNA production delay, and degradation rate given data from high-throughput sequencing time course experiments. We combine a mechanistic differential equation model with a nonparametric statistical modeling approach allowing us to capture a broad range of activation kinetics, and we use Bayesian parameter estimation to quantify the uncertainty in estimates of the kinetic parameters. We apply the model to data from estrogen receptor α activation in the MCF-7 breast cancer cell line. We use RNA polymerase II ChIP-Seq time course data to characterize transcriptional activation and mRNA-Seq time course data to quantify mature transcripts. We find that 11% of genes with a good signal in the data display a delay of more than 20 min between completing transcription and mature mRNA production. The genes displaying these long delays are significantly more likely to be short. We also find a statistical association between high delay and late intron retention in pre-mRNA data, indicating significant splicing-associated production delays in many genes. PMID:26438844

  15. Genome-Wide Analysis of PHOSPHOLIPID:DIACYLGLYCEROL ACYLTRANSFERASE (PDAT) Genes in Plants Reveals the Eudicot-Wide PDAT Gene Expansion and Altered Selective Pressures Acting on the Core Eudicot PDAT Paralogs

    PubMed Central

    Pan, Xue; Peng, Fred Y.; Weselake, Randall J.

    2015-01-01

    PHOSPHOLIPID:DIACYLGLYCEROL ACYLTRANSFERASE (PDAT) is an enzyme that catalyzes the transfer of a fatty acyl moiety from the sn-2 position of a phospholipid to the sn-3 position of sn-1,2-diacylglycerol, thus forming triacylglycerol and a lysophospholipid. Although the importance of PDAT in triacylglycerol biosynthesis has been illustrated in some previous studies, the evolutionary relationship of plant PDATs has not been studied in detail. In this study, we investigated the evolutionary relationship of the PDAT gene family across the green plants using a comparative phylogenetic framework. We found that the PDAT candidate genes are present in all examined green plants, including algae, lowland plants (a moss and a lycophyte), monocots, and eudicots. Phylogenetic analysis revealed the evolutionary division of the PDAT gene family into seven major clades. The separation is supported by the conservation and variation in the gene structure, protein properties, motif patterns, and/or selection constraints. We further demonstrated that there is a eudicot-wide PDAT gene expansion, which appears to have been mainly caused by the eudicot-shared ancient gene duplication and subsequent species-specific segmental duplications. In addition, selection pressure analyses showed that different selection constraints have acted on three core eudicot clades, which might enable paleoduplicated PDAT paralogs to either become nonfunctionalized or develop divergent expression patterns during evolution. Overall, our study provides important insights into the evolution of the plant PDAT gene family and explores the evolutionary mechanism underlying the functional diversification among the core eudicot PDAT paralogs. PMID:25585619

  16. Genome-Wide Analysis of DNA Methylation and Cigarette Smoking in a Chinese Population

    PubMed Central

    Zhu, Xiaoyan; Li, Jun; Deng, Siyun; Yu, Kuai; Liu, Xuezhen; Deng, Qifei; Sun, Huizhen; Zhang, Xiaomin; He, Meian; Guo, Huan; Chen, Weihong; Yuan, Jing; Zhang, Bing; Kuang, Dan; He, Xiaosheng; Bai, Yansen; Han, Xu; Liu, Bing; Li, Xiaoliang; Yang, Liangle; Jiang, Haijing; Zhang, Yizhi; Hu, Jie; Cheng, Longxian; Luo, Xiaoting; Mei, Wenhua; Zhou, Zhiming; Sun, Shunchang; Zhang, Liyun; Liu, Chuanyao; Guo, Yanjun; Zhang, Zhihong; Hu, Frank B.; Liang, Liming; Wu, Tangchun

    2016-01-01

    Background: Smoking is a risk factor for many human diseases. DNA methylation has been related to smoking, but genome-wide methylation data for smoking in Chinese populations is limited. Objectives: We aimed to investigate epigenome-wide methylation in relation to smoking in a Chinese population. Methods: We measured the methylation levels at > 485,000 CpG sites (CpGs) in DNA from leukocytes using a methylation array and conducted a genome-wide meta-analysis of DNA methylation and smoking in a total of 596 Chinese participants. We further evaluated the associations of smoking-related CpGs with internal polycyclic aromatic hydrocarbon (PAH) biomarkers and their correlations with the expression of corresponding genes. Results: We identified 318 CpGs whose methylation levels were associated with smoking at a genome-wide significance level (false discovery rate < 0.05), among which 161 CpGs annotated to 123 genes were not associated with smoking in recent studies of Europeans and African Americans. Of these smoking-related CpGs, methylation levels at 80 CpGs showed significant correlations with the expression of corresponding genes (including RUNX3, IL6R, PTAFR, ANKRD11, CEP135 and CDH23), and methylation at 15 CpGs was significantly associated with urinary 2-hydroxynaphthalene, the most representative internal monohydroxy-PAH biomarker for smoking. Conclusion: We identified DNA methylation markers associated with smoking in a Chinese population, including some markers that were also correlated with gene expression. Exposure to naphthalene, a byproduct of tobacco smoke, may contribute to smoking-related methylation. Citation: Zhu X, Li J, Deng S, Yu K, Liu X, Deng Q, Sun H, Zhang X, He M, Guo H, Chen W, Yuan J, Zhang B, Kuang D, He X, Bai Y, Han X, Liu B, Li X, Yang L, Jiang H, Zhang Y, Hu J, Cheng L, Luo X, Mei W, Zhou Z, Sun S, Zhang L, Liu C, Guo Y, Zhang Z, Hu FB, Liang L, Wu T. 2016. Genome-wide analysis of DNA methylation and cigarette smoking in Chinese. Environ
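    The abstract for record 16 reports significance at a false discovery rate below 0.05 without naming the procedure. A common way to control the false discovery rate is the Benjamini-Hochberg step-up procedure, sketched below as an assumption rather than as the authors' method; the function name and the example p-values are hypothetical.

```python
def benjamini_hochberg(pvalues, fdr=0.05):
    """Benjamini-Hochberg step-up procedure.

    Returns a list of booleans, in the input order, marking which
    hypotheses are rejected at the given false discovery rate.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])   # indices by ascending p
    max_rank = 0
    for rank, idx in enumerate(order, start=1):
        # Largest rank k with p_(k) <= k * fdr / m determines the cutoff.
        if pvalues[idx] <= rank * fdr / m:
            max_rank = rank
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= max_rank:
            rejected[idx] = True
    return rejected

# Hypothetical p-values for eight CpG sites (illustration only).
pvals = [0.041, 0.001, 0.205, 0.008, 0.039, 0.06, 0.042, 0.074]
print(benjamini_hochberg(pvals))
```

    Because the procedure is step-up, a p-value that misses its own rank threshold is still rejected when a larger p-value meets a later threshold, as in the inputs 0.02, 0.03 and 0.9, where the first two are both rejected.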

  17. Genome-wide association analysis of red blood cell traits in African Americans: the COGENT Network

    PubMed Central

    Chen, Zhao; Tang, Hua; Qayyum, Rehan; Schick, Ursula M.; Nalls, Michael A.; Handsaker, Robert; Li, Jin; Lu, Yingchang; Yanek, Lisa R.; Keating, Brendan; Meng, Yan; van Rooij, Frank J.A.; Okada, Yukinori; Kubo, Michiaki; Rasmussen-Torvik, Laura; Keller, Margaux F.; Lange, Leslie; Evans, Michele; Bottinger, Erwin P.; Linderman, Michael D.; Ruderfer, Douglas M.; Hakonarson, Hakon; Papanicolaou, George; Zonderman, Alan B.; Gottesman, Omri; Thomson, Cynthia; Ziv, Elad; Singleton, Andrew B.; Loos, Ruth J.F.; Sleiman, Patrick M.A.; Ganesh, Santhi; McCarroll, Steven; Becker, Diane M.; Wilson, James G.; Lettre, Guillaume; Reiner, Alexander P.

    2013-01-01

    Laboratory red blood cell (RBC) measurements are clinically important, heritable and differ among ethnic groups. To identify genetic variants that contribute to RBC phenotypes in African Americans (AAs), we conducted a genome-wide association study in up to ∼16 500 AAs. The alpha-globin locus on chromosome 16pter [lead SNP rs13335629 in ITFG3 gene; P < 1E−13 for hemoglobin (Hgb), RBC count, mean corpuscular volume (MCV), MCH and MCHC] and the G6PD locus on Xq28 [lead SNP rs1050828; P < 1E − 13 for Hgb, hematocrit (Hct), MCV, RBC count and red cell distribution width (RDW)] were each associated with multiple RBC traits. At the alpha-globin region, both the common African 3.7 kb deletion and common single nucleotide polymorphisms (SNPs) appear to contribute independently to RBC phenotypes among AAs. In the 2p21 region, we identified a novel variant of PRKCE distinctly associated with Hct in AAs. In a genome-wide admixture mapping scan, local European ancestry at the 6p22 region containing HFE and LRRC16A was associated with higher Hgb. LRRC16A has been previously associated with the platelet count and mean platelet volume in AAs, but not with Hgb. Finally, we extended to AAs the findings of association of erythrocyte traits with several loci previously reported in Europeans and/or Asians, including CD164 and HBS1L-MYB. In summary, this large-scale genome-wide analysis in AAs has extended the importance of several RBC-associated genetic loci to AAs and identified allelic heterogeneity and pleiotropy at several previously known genetic loci associated with blood cell traits in AAs. PMID:23446634

  18. Using genome-wide complex trait analysis to quantify ‘missing heritability’ in Parkinson's disease

    PubMed Central

    Keller, Margaux F.; Saad, Mohamad; Bras, Jose; Bettella, Francesco; Nicolaou, Nayia; Simón-Sánchez, Javier; Mittag, Florian; Büchel, Finja; Sharma, Manu; Gibbs, J. Raphael; Schulte, Claudia; Moskvina, Valentina; Durr, Alexandra; Holmans, Peter; Kilarski, Laura L.; Guerreiro, Rita; Hernandez, Dena G.; Brice, Alexis; Ylikotila, Pauli; Stefánsson, Hreinn; Majamaa, Kari; Morris, Huw R.; Williams, Nigel; Gasser, Thomas; Heutink, Peter; Wood, Nicholas W.; Hardy, John; Martinez, Maria; Singleton, Andrew B.; Nalls, Michael A.

    2012-01-01

    Genome-wide association studies (GWASs) have been successful at identifying single-nucleotide polymorphisms (SNPs) highly associated with common traits; however, a great deal of the heritable variation associated with common traits remains unaccounted for within the genome. Genome-wide complex trait analysis (GCTA) is a statistical method that applies a linear mixed model to estimate phenotypic variance of complex traits explained by genome-wide SNPs, including those not associated with the trait in a GWAS. We applied GCTA to 8 cohorts containing 7096 case and 19 455 control individuals of European ancestry in order to examine the missing heritability present in Parkinson's disease (PD). We meta-analyzed our initial results to produce robust heritability estimates for PD types across cohorts. Our results identify 27% (95% CI 17–38, P = 8.08E − 08) phenotypic variance associated with all types of PD, 15% (95% CI −0.2 to 33, P = 0.09) phenotypic variance associated with early-onset PD and 31% (95% CI 17–44, P = 1.34E − 05) phenotypic variance associated with late-onset PD. This is a substantial increase from the genetic variance identified by top GWAS hits alone (between 3 and 5%) and indicates there are substantially more risk loci to be identified. Our results suggest that although GWASs are a useful tool in identifying the most common variants associated with complex disease, a great deal of common variants of small effect remain to be discovered. PMID:22892372

  19. Genome-Wide Analysis of miRNA-mRNA Interactions in Marrow Stromal Cells

    PubMed Central

    Balakrishnan, Ilango; Yang, Xiaodong; Brown, Joseph; Ramakrishnan, Aravind; Torok–Storb, Beverly; Kabos, Peter; Hesselberth, Jay R.; Pillai, Manoj M.

    2014-01-01

    Regulation of hematopoietic stem cell proliferation, lineage commitment, and differentiation in adult vertebrates requires extrinsic signals provided by cells in the marrow microenvironment (ME) located within the bone marrow. Both secreted and cell-surface bound factors critical to this regulation have been identified, yet control of their expression by cells within the ME has not been addressed. Herein we hypothesize that microRNAs (miRNAs) contribute to their controlled expression. MiRNAs are small noncoding RNAs that bind to target mRNAs and downregulate gene expression by either initiating mRNA degradation or preventing peptide translation. Testing the role of miRNAs in downregulating gene expression has been difficult since conventional techniques used to define miRNA-mRNA interactions are indirect and have high false-positive and negative rates. In this report, a genome-wide biochemical technique (high-throughput sequencing of RNA isolated by cross-linking immunoprecipitation or HITS-CLIP) was used to generate unbiased genome-wide maps of miRNA-mRNA interactions in two critical cellular components of the marrow ME: marrow stromal cells and bone marrow endothelial cells. Analysis of these datasets identified miRNAs as direct regulators of JAG1, WNT5A, MMP2, and VEGFA; four factors that are important to ME function. Our results show the feasibility and utility of unbiased genome-wide biochemical techniques in dissecting the role of miRNAs in regulation of complex tissues such as the marrow ME. PMID:24038734

  20. Genome Wide Analysis of Fertility and Production Traits in Italian Holstein Cattle

    PubMed Central

    Stella, Alessandra; Biffani, Stefano; Negrini, Riccardo; Lazzari, Barbara; Ajmone-Marsan, Paolo; Williams, John L.

    2013-01-01

    A genome wide scan was performed on a total of 2093 Italian Holstein proven bulls genotyped with 50K single nucleotide polymorphisms (SNPs), with the objective of identifying loci associated with fertility related traits and testing their effects on milk production traits. The analysis was carried out using estimated breeding values for the aggregate fertility index and for each trait contributing to the index: angularity, calving interval, non-return rate at 56 days, days to first service, and 305 day first parity lactation. In addition, two production traits not included in the aggregate fertility index were analysed: fat yield and protein yield. Analyses were carried out using all SNPs treated separately; in addition, the most significant marker on BTA14 associated with milk quality, located in the DGAT1 region, was treated as a fixed effect. Genome wide association analysis identified 61 significant SNPs and 75 significant marker-trait associations. Eight additional SNP associations were detected when the SNP located near DGAT1 was included as a fixed effect. As there were no obvious common SNPs between the traits analyzed independently in this study, a network analysis was carried out to identify unforeseen relationships that may link production and fertility traits. PMID:24265800

  1. Quantifying the heritability of glioma using genome-wide complex trait analysis

    PubMed Central

    Kinnersley, Ben; Mitchell, Jonathan S.; Gousias, Konstantinos; Schramm, Johannes; Idbaih, Ahmed; Labussière, Marianne; Marie, Yannick; Rahimian, Amithys; Wichmann, H.-Erich; Schreiber, Stefan; Hoang-Xuan, Khe; Delattre, Jean-Yves; Nöthen, Markus M.; Mokhtari, Karima; Lathrop, Mark; Bondy, Melissa; Simon, Matthias; Sanson, Marc; Houlston, Richard S.

    2015-01-01

    Genome-wide association studies (GWAS) have successfully identified a number of common single-nucleotide polymorphisms (SNPs) influencing glioma risk. While these SNPs only explain a small proportion of the genetic risk it is unclear how much is left to be detected by other, yet to be identified, common SNPs. Therefore, we applied Genome-Wide Complex Trait Analysis (GCTA) to three GWAS datasets totalling 3,373 cases and 4,571 controls and performed a meta-analysis to estimate the heritability of glioma. Our results identify heritability estimates of 25% (95% CI: 20–31%, P = 1.15 × 10^−17) for all forms of glioma - 26% (95% CI: 17–35%, P = 1.05 × 10^−8) for glioblastoma multiforme (GBM) and 25% (95% CI: 17–32%, P = 1.26 × 10^−10) for non-GBM tumors. This is a substantial increase from the genetic variance identified by the currently identified GWAS risk loci (~6% of common heritability), indicating that most of the heritable risk attributable to common genetic variants remains to be identified. PMID:26625949
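
    The abstract above pools heritability estimates from three datasets by meta-analysis but does not say which pooling model was used. The following is a minimal sketch of one common choice, a fixed-effect inverse-variance meta-analysis; the function names and the per-dataset numbers are illustrative assumptions, not values or code from the paper.

```python
import math


def se_from_ci(lower, upper, z=1.96):
    """Approximate a standard error from a symmetric 95% confidence interval."""
    return (upper - lower) / (2.0 * z)


def fixed_effect_meta(estimates, std_errors, z=1.96):
    """Inverse-variance weighted pooled estimate, its SE, and a 95% CI."""
    weights = [1.0 / (se * se) for se in std_errors]
    total = sum(weights)
    pooled = sum(w * e for w, e in zip(weights, estimates)) / total
    pooled_se = math.sqrt(1.0 / total)
    return pooled, pooled_se, (pooled - z * pooled_se, pooled + z * pooled_se)


# Hypothetical per-dataset heritability estimates and standard errors
# (made up for illustration; the abstract reports only pooled values).
h2 = [0.22, 0.27, 0.26]
se = [0.05, 0.06, 0.04]
pooled, pooled_se, ci = fixed_effect_meta(h2, se)
print(f"pooled h2 = {pooled:.3f}, 95% CI = ({ci[0]:.3f}, {ci[1]:.3f})")
```

    More precise datasets receive larger weights, so the pooled interval is narrower than any single-dataset interval. A random-effects model would be the usual alternative if the datasets were expected to differ systematically.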

  2. Genome-wide Comparative Analysis of Atopic Dermatitis and Psoriasis Gives Insight into Opposing Genetic Mechanisms

    PubMed Central

    Baurecht, Hansjörg; Hotze, Melanie; Brand, Stephan; Büning, Carsten; Cormican, Paul; Corvin, Aiden; Ellinghaus, David; Ellinghaus, Eva; Esparza-Gordillo, Jorge; Fölster-Holst, Regina; Franke, Andre; Gieger, Christian; Hubner, Norbert; Illig, Thomas; Irvine, Alan D.; Kabesch, Michael; Lee, Young A.E.; Lieb, Wolfgang; Marenholz, Ingo; McLean, W.H. Irwin; Morris, Derek W.; Mrowietz, Ulrich; Nair, Rajan; Nöthen, Markus M.; Novak, Natalija; O’Regan, Grainne M.; Schreiber, Stefan; Smith, Catherine; Strauch, Konstantin; Stuart, Philip E.; Trembath, Richard; Tsoi, Lam C.; Weichenthal, Michael; Barker, Jonathan; Elder, James T.; Weidinger, Stephan; Cordell, Heather J.; Brown, Sara J.

    2015-01-01

    Atopic dermatitis and psoriasis are the two most common immune-mediated inflammatory disorders affecting the skin. Genome-wide studies demonstrate a high degree of genetic overlap, but these diseases have mutually exclusive clinical phenotypes and opposing immune mechanisms. Despite their prevalence, atopic dermatitis and psoriasis very rarely co-occur within one individual. By utilizing genome-wide association study and ImmunoChip data from >19,000 individuals and methodologies developed from meta-analysis, we have identified opposing risk alleles at shared loci as well as independent disease-specific loci within the epidermal differentiation complex (chromosome 1q21.3), the Th2 locus control region (chromosome 5q31.1), and the major histocompatibility complex (chromosome 6p21–22). We further identified previously unreported pleiotropic alleles with opposing effects on atopic dermatitis and psoriasis risk in PRKRA and ANXA6/TNIP1. In contrast, there was no evidence for shared loci with effects operating in the same direction on both diseases. Our results show that atopic dermatitis and psoriasis have distinct genetic mechanisms with opposing effects in shared pathways influencing epidermal differentiation and immune response. The statistical analysis methods developed in the conduct of this study have produced additional insight from previously published data sets. The approach is likely to be applicable to the investigation of the genetic basis of other complex traits with overlapping and distinct clinical features. PMID:25574825

  3. Meta-analysis of genome-wide linkage scans for renal function traits

    PubMed Central

    Rao, Madhumathi; Mottl, Amy K.; Cole, Shelley A.; Umans, Jason G.; Freedman, Barry I.; Bowden, Donald W.; Langefeld, Carl D.; Fox, Caroline S.; Yang, Qiong; Cupples, Adrienne; Iyengar, Sudha K.; Hunt, Steven C.

    2012-01-01

    Background. Several genome scans have explored the linkage of chronic kidney disease phenotypes to chromosomic regions with disparate results. Genome scan meta-analysis (GSMA) is a quantitative method to synthesize linkage results from independent studies and assess their concordance. Methods. We searched PubMed to identify genome linkage analyses of renal function traits in humans, such as estimated glomerular filtration rate (GFR), albuminuria, serum creatinine concentration and creatinine clearance. We contacted authors for numerical data and extracted information from individual studies. We applied the GSMA nonparametric approach to combine results across 14 linkage studies for GFR, 11 linkage studies for albumin creatinine ratio, 11 linkage studies for serum creatinine and 4 linkage studies for creatinine clearance. Results. No chromosomal region reached genome-wide statistical significance in the main analysis which included all scans under each phenotype; however, regions on Chromosomes 7, 10 and 16 reached suggestive significance for linkage to two or more phenotypes. Subgroup analyses by disease status or ethnicity did not yield additional information. Conclusions. While heterogeneity across populations, methodologies and study designs likely explain this lack of agreement, it is possible that linkage scan methodologies lack the resolution for investigating complex traits. Combining family-based linkage studies with genome-wide association studies may be a powerful approach to detect private mutations contributing to complex renal phenotypes. PMID:21622988
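
    As it is usually described, the nonparametric GSMA approach mentioned above divides the genome into bins, ranks the bins within each study by their strongest linkage signal, sums the ranks across studies, and judges the summed ranks against a permutation distribution. The sketch below follows that outline under simplifying assumptions: equal study weights, no special tie handling, and made-up bin scores. The function names are hypothetical and this is not the authors' implementation.

```python
import random


def rank_bins(scores):
    """Rank bins within one study; the strongest signal gets the highest rank.

    Ties are broken by bin order here, which is a simplification.
    """
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0] * len(scores)
    for rank, idx in enumerate(order, start=1):
        ranks[idx] = rank
    return ranks


def summed_ranks(studies):
    """Sum the within-study ranks of each bin across all studies."""
    per_study = [rank_bins(s) for s in studies]
    return [sum(col) for col in zip(*per_study)]


def permutation_pvalues(studies, n_perm=2000, seed=0):
    """Empirical p-value per bin: how often a random rank assignment
    yields a summed rank at least as large as the observed one."""
    rng = random.Random(seed)
    per_study = [rank_bins(s) for s in studies]
    observed = [sum(col) for col in zip(*per_study)]
    exceed = [0] * len(observed)
    for _ in range(n_perm):
        shuffled = [rng.sample(r, len(r)) for r in per_study]
        simulated = [sum(col) for col in zip(*shuffled)]
        for i, obs in enumerate(observed):
            if simulated[i] >= obs:
                exceed[i] += 1
    return [(count + 1) / (n_perm + 1) for count in exceed]


# Two hypothetical studies, three bins each (maximum linkage score per bin).
studies = [[0.1, 2.0, 0.5], [0.3, 1.5, 0.2]]
print(summed_ranks(studies))
print(permutation_pvalues(studies))
```

    Working on ranks rather than raw scores is what lets scans with different markers and statistics be combined, at the cost of discarding the magnitude of each signal.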

  4. Genome-wide transcription analysis of histidine-related cataract in Atlantic salmon (Salmo salar L)

    PubMed Central

    Waagbø, Rune; Breck, Olav; Stavrum, Anne-Kristin; Petersen, Kjell; Olsvik, Pål A.

    2009-01-01

    progression in cataract formation. Conclusions Dietary histidine regimes affected cataract formation and lens gene expression in adult Atlantic salmon. Regulated transcripts selected from the results of this genome-wide transcription analysis might be used as possible biological markers for cataract development in Atlantic salmon. PMID:19597568

  5. Transcriptome Sequencing and Genome-wide Association Analyses Reveal Lysosomal Function and Actin Cytoskeleton Remodeling in Schizophrenia and Bipolar Disorder

    PubMed Central

    Kim, Sanghyeon; Reimers, Mark; Bacanu, Silviu-Alin; Yu, Hui; Liu, Chunyu; Sun, Jingchun; Wang, Quan; Jia, Peilin; Xu, Fengping; Zhang, Yong; Kendler, Kenneth S.; Peng, Zhiyu; Chen, Xiangning

    2014-01-01

    Schizophrenia (SCZ) and bipolar disorder (BPD) are severe mental disorders with high heritability. Clinicians have long noticed the similarities of clinical symptoms between these disorders. In recent years, accumulating evidence indicates some shared genetic liabilities. However, what is shared remains elusive. In this study, we conducted whole transcriptome analysis of postmortem brain tissues (cingulate cortex) from SCZ, BPD and control subjects, and identified differentially expressed genes in these disorders. We found 105 and 153 genes differentially expressed in SCZ and BPD, respectively. By comparing the t-test scores, we found that many of the genes differentially expressed in SCZ and BPD are concordant in their expression level (q ≤ 0.01, 53 genes; q ≤ 0.05, 213 genes; q ≤ 0.1, 885 genes). Using genome-wide association data from the Psychiatric Genomics Consortium, we found that these differentially and concordantly expressed genes were enriched in association signals for both SCZ (p < 10^−7) and BPD (p = 0.029). To our knowledge, this is the first time that a substantially large number of genes shows concordant expression and association for both SCZ and BPD. Pathway analyses of these genes indicated that they are involved in the lysosome, Fc gamma receptor mediated phagocytosis, regulation of actin skeleton pathways, along with several cancer pathways. Functional analyses of these genes revealed an interconnected pathway network centered on lysosomal function and the regulation of actin cytoskeleton. These pathways and their interacting network were principally confirmed by an independent transcriptome sequencing dataset of hippocampus. Dysregulation of lysosomal function and cytoskeleton remodeling has direct impacts on endocytosis, phagocytosis, exocytosis, vesicle trafficking, neuronal maturation and migration, neurite outgrowth, and synaptic density and plasticity, and different aspects of these processes have been implicated in SCZ and BPD

  6. Genome-wide transcriptional analysis of grapevine berry ripening reveals a set of genes similarly modulated during three seasons and the occurrence of an oxidative burst at vèraison

    PubMed Central

    Pilati, Stefania; Perazzolli, Michele; Malossini, Andrea; Cestaro, Alessandro; Demattè, Lorenzo; Fontana, Paolo; Dal Ri, Antonio; Viola, Roberto; Velasco, Riccardo; Moser, Claudio

    2007-01-01

    Background Grapevine (Vitis species) is among the most important fruit crops in terms of cultivated area and economic impact. Despite this relevance, little is known about the transcriptional changes and the regulatory circuits underlying the biochemical and physical changes occurring during berry development. Results Fruit ripening in the non-climacteric crop species Vitis vinifera L. has been investigated at the transcriptional level by the use of the Affymetrix Vitis GeneChip® which contains approximately 14,500 unigenes. Gene expression data obtained from berries sampled before and after véraison in three growing years, were analyzed to identify genes specifically involved in fruit ripening and to investigate seasonal influences on the process. From these analyses a core set of 1477 genes was found which was similarly modulated in all seasons. We were able to separate ripening specific isoforms within gene families and to identify ripening related genes which appeared strongly regulated also by the seasonal weather conditions. Transcripts annotation by Gene Ontology vocabulary revealed five overrepresented functional categories of which cell wall organization and biogenesis, carbohydrate and secondary metabolisms and stress response were specifically induced during the ripening phase, while photosynthesis was strongly repressed. About 19% of the core gene set was characterized by genes involved in regulatory processes, such as transcription factors and transcripts related to hormonal metabolism and signal transduction. Auxin, ethylene and light emerged as the main stimuli influencing berry development. In addition, an oxidative burst, previously not detected in grapevine, characterized by rapid accumulation of H2O2 starting from véraison and by the modulation of many ROS scavenging enzymes, was observed. Conclusion The time-course gene expression analysis of grapevine berry development has identified the occurrence of two well distinct phases along the

  7. Genome-wide Generation and Systematic Phenotyping of Knockout Mice Reveals New Roles for Many Genes

    PubMed Central

    White, Jacqueline K.; Gerdin, Anna-Karin; Karp, Natasha A.; Ryder, Ed; Buljan, Marija; Bussell, James N.; Salisbury, Jennifer; Clare, Simon; Ingham, Neil J.; Podrini, Christine; Houghton, Richard; Estabel, Jeanne; Bottomley, Joanna R.; Melvin, David G.; Sunter, David; Adams, Niels C.; Baker, Lauren; Barnes, Caroline; Beveridge, Ryan; Cambridge, Emma; Carragher, Damian; Chana, Prabhjoat; Clarke, Kay; Hooks, Yvette; Igosheva, Natalia; Ismail, Ozama; Jackson, Hannah; Kane, Leanne; Lacey, Rosalind; Lafont, David Tino; Lucas, Mark; Maguire, Simon; McGill, Katherine; McIntyre, Rebecca E.; Messager, Sophie; Mottram, Lynda; Mulderrig, Lee; Pearson, Selina; Protheroe, Hayley J.; Roberson, Laura-Anne; Salsbury, Grace; Sanderson, Mark; Sanger, Daniel; Shannon, Carl; Thompson, Paul C.; Tuck, Elizabeth; Vancollie, Valerie E.; Brackenbury, Lisa; Bushell, Wendy; Cook, Ross; Dalvi, Priya; Gleeson, Diane; Habib, Bishoy; Hardy, Matt; Liakath-Ali, Kifayathullah; Miklejewska, Evelina; Price, Stacey; Sethi, Debarati; Trenchard, Elizabeth; von Schiller, Dominique; Vyas, Sapna; West, Anthony P.; Woodward, John; Wynn, Elizabeth; Evans, Arthur; Gannon, David; Griffiths, Mark; Holroyd, Simon; Iyer, Vivek; Kipp, Christian; Lewis, Morag; Li, Wei; Oakley, Darren; Richardson, David; Smedley, Damian; Agu, Chukwuma; Bryant, Jackie; Delaney, Liz; Gueorguieva, Nadia I.; Tharagonnet, Helen; Townsend, Anne J.; Biggs, Daniel; Brown, Ellen; Collinson, Adam; Dumeau, Charles-Etienne; Grau, Evelyn; Harrison, Sarah; Harrison, James; Ingle, Catherine E.; Kundi, Helen; Madich, Alla; Mayhew, Danielle; Metcalf, Tom; Newman, Stuart; Pass, Johanna; Pearson, Laila; Reynolds, Helen; Sinclair, Caroline; Wardle-Jones, Hannah; Woods, Michael; Alexander, Liam; Brown, Terry; Flack, Francesca; Frost, Carole; Griggs, Nicola; Hrnciarova, Silvia; Kirton, Andrea; McDermott, Jordan; Rogerson, Claire; White, Gemma; Zielezinski, Pawel; DiTommaso, Tia; Edwards, Andrew; Heath, Emma; Mahajan, Mary Ann; Yalcin, Binnaz; 
    Tannahill, David; Logan, Darren W.; MacArthur, Daniel G.; Flint, Jonathan; Mahajan, Vinit B.; Tsang, Stephen H.; Smyth, Ian; Watt, Fiona M.; Skarnes, William C.; Dougan, Gordon; Adams, David J.; Ramirez-Solis, Ramiro; Bradley, Allan; Steel, Karen P.

    2013-01-01

    Summary Mutations in whole organisms are powerful ways of interrogating gene function in a realistic context. We describe a program, the Sanger Institute Mouse Genetics Project, that provides a step toward the aim of knocking out all genes and screening each line for a broad range of traits. We found that hitherto unpublished genes were as likely to reveal phenotypes as known genes, suggesting that novel genes represent a rich resource for investigating the molecular basis of disease. We found many unexpected phenotypes detected only because we screened for them, emphasizing the value of screening all mutants for a wide range of traits. Haploinsufficiency and pleiotropy were both surprisingly common. Forty-two percent of genes were essential for viability, and these were less likely to have a paralog and more likely to contribute to a protein complex than other genes. Phenotypic data and more than 900 mutants are openly available for further analysis. PMID:23870131

  8. Genome-Wide Analysis and Characterization of Aux/IAA Family Genes in Brassica rapa

    PubMed Central

    Rameneni, Jana Jeevan; Li, Xiaonan; Sivanandhan, Ganesan; Choi, Su Ryun; Pang, Wenxing; Im, Subin; Lim, Yong Pyo

    2016-01-01

    Auxins are the key players in plant growth development involving leaf formation, phototropism, root, fruit and embryo development. Auxin/Indole-3-Acetic Acid (Aux/IAA) are early auxin response genes noted as transcriptional repressors in plant auxin signaling. However, many studies focus on Aux/ARF gene families and much less is known about the Aux/IAA gene family in Brassica rapa (B. rapa). Here we performed a comprehensive genome-wide analysis and identified 55 Aux/IAA genes in B. rapa using four conserved motifs of Aux/IAA family (PF02309). Chromosomal mapping of the B. rapa Aux/IAA (BrIAA) genes facilitated understanding cluster rearrangement of the crucifer building blocks in the genome. Phylogenetic analysis of BrIAA with Arabidopsis thaliana, Oryza sativa and Zea mays identified 51 sister pairs including 15 same species (BrIAA-BrIAA) and 36 cross species (BrIAA-AtIAA) IAA genes. Among the 55 BrIAA genes, expression of 43 and 45 genes were verified using Genebank B. rapa ESTs and in home developed microarray data from mature leaves of Chiifu and RcBr lines. Despite their huge morphological difference, tissue specific expression analysis of BrIAA genes between the parental lines Chiifu and RcBr showed that the genes followed a similar pattern of expression during leaf development and a different pattern during bud, flower and siliqua development stages. The response of the BrIAA genes to abiotic and auxin stress at different time intervals revealed their involvement in stress response. Single Nucleotide Polymorphisms between IAA genes of reference genome Chiifu and RcBr were focused and identified. Our study examines the scope of conservation and divergence of Aux/IAA genes and their structures in B. rapa. Analyzing the expression and structural variation between two parental lines will significantly contribute to functional genomics of Brassica crops and we believe our study would provide a foundation in understanding the Aux/IAA genes in B. rapa. PMID

  9. Genome-Wide Analysis and Characterization of Aux/IAA Family Genes in Brassica rapa.

    PubMed

    Paul, Parameswari; Dhandapani, Vignesh; Rameneni, Jana Jeevan; Li, Xiaonan; Sivanandhan, Ganesan; Choi, Su Ryun; Pang, Wenxing; Im, Subin; Lim, Yong Pyo

    2016-01-01

    Auxins are the key players in plant growth development involving leaf formation, phototropism, root, fruit and embryo development. Auxin/Indole-3-Acetic Acid (Aux/IAA) are early auxin response genes noted as transcriptional repressors in plant auxin signaling. However, many studies focus on Aux/ARF gene families and much less is known about the Aux/IAA gene family in Brassica rapa (B. rapa). Here we performed a comprehensive genome-wide analysis and identified 55 Aux/IAA genes in B. rapa using four conserved motifs of Aux/IAA family (PF02309). Chromosomal mapping of the B. rapa Aux/IAA (BrIAA) genes facilitated understanding cluster rearrangement of the crucifer building blocks in the genome. Phylogenetic analysis of BrIAA with Arabidopsis thaliana, Oryza sativa and Zea mays identified 51 sister pairs including 15 same species (BrIAA-BrIAA) and 36 cross species (BrIAA-AtIAA) IAA genes. Among the 55 BrIAA genes, expression of 43 and 45 genes were verified using Genebank B. rapa ESTs and in home developed microarray data from mature leaves of Chiifu and RcBr lines. Despite their huge morphological difference, tissue specific expression analysis of BrIAA genes between the parental lines Chiifu and RcBr showed that the genes followed a similar pattern of expression during leaf development and a different pattern during bud, flower and siliqua development stages. The response of the BrIAA genes to abiotic and auxin stress at different time intervals revealed their involvement in stress response. Single Nucleotide Polymorphisms between IAA genes of reference genome Chiifu and RcBr were focused and identified. Our study examines the scope of conservation and divergence of Aux/IAA genes and their structures in B. rapa. Analyzing the expression and structural variation between two parental lines will significantly contribute to functional genomics of Brassica crops and we believe our study would provide a foundation in understanding the Aux/IAA genes in B. rapa. PMID

  10. Genome-wide analysis of signal peptide functionality in Lactobacillus plantarum WCFS1

    PubMed Central

    Mathiesen, Geir; Sveen, Anita; Brurberg, May Bente; Fredriksen, Lasse; Axelsson, Lars; Eijsink, Vincent GH

    2009-01-01

    Background Lactobacillus plantarum is a normal, potentially probiotic, inhabitant of the human gastrointestinal (GI) tract. The bacterium has great potential as food-grade cell factory and for in situ delivery of biomolecules. Since protein secretion is important both for probiotic activity and in biotechnological applications, we have carried out a genome-wide experimental study of signal peptide (SP) functionality. Results We have constructed a library of 76 Sec-type signal peptides from L. plantarum WCFS1 that were predicted to be cleaved by signal peptidase I. SP functionality was studied using staphylococcal nuclease (NucA) as a reporter protein. 82% of the SPs gave significant extracellular NucA activity. Levels of secreted NucA varied by a dramatic 1800-fold and this variation was shown not to be the result of different mRNA levels. For the best-performing SPs all produced NucA was detected in the culture supernatant, but the secretion efficiency decreased for the less well performing SPs. Sequence analyses of the SPs and their cognate proteins revealed four properties that correlated positively with SP performance for NucA: high hydrophobicity, the presence of a transmembrane helix predicted by TMHMM, the absence of an anchoring motif in the cognate protein, and the length of the H+C domain. Analysis of a subset of SPs with a lactobacillal amylase (AmyA) showed large variation in production levels and secretion efficiencies. Importantly, there was no correlation between SP performance with NucA and the performance with AmyA. Conclusion This is the first comprehensive experimental study showing that predicted SPs in the L. plantarum genome actually are capable of driving protein secretion. The results reveal considerable variation between the SPs that is at least in part dependent on the protein that is secreted. Several SPs stand out as promising candidates for efficient secretion of heterologous proteins in L. plantarum. The results for NucA provide some

  11. Genome-wide analysis of the omega-3 fatty acid desaturase gene family in Gossypium

    DOE PAGESBeta

    Yurchenko, Olga P.; Park, Sunjung; Ilut, Daniel C.; Inmon, Jay J.; Millhollon, Jon C.; Liechty, Zach; Page, Justin T.; Jenks, Matthew A.; Chapman, Kent D.; Udall, Joshua A.; et al

    2014-11-18

    The majority of commercial cotton varieties planted worldwide are derived from Gossypium hirsutum, which is a naturally occurring allotetraploid produced by interspecific hybridization of A- and D-genome diploid progenitor species. While most cotton species are adapted to warm, semi-arid tropical and subtropical regions, and thus perform well in these geographical areas, cotton seedlings are sensitive to cold temperature, which can significantly reduce crop yields. One of the common biochemical responses of plants to cold temperatures is an increase in omega-3 fatty acids, which protects cellular function by maintaining membrane integrity. The purpose of our study was to identify and characterize the omega-3 fatty acid desaturase (FAD) gene family in G. hirsutum, with an emphasis on identifying omega-3 FADs involved in cold temperature adaptation. Results: Eleven omega-3 FAD genes were identified in G. hirsutum, and characterization of the gene family in extant A and D diploid species (G. herbaceum and G. raimondii, respectively) allowed for unambiguous genome assignment of all homoeologs in tetraploid G. hirsutum. The omega-3 FAD family of cotton includes five distinct genes, two of which encode endoplasmic reticulum-type enzymes (FAD3-1 and FAD3-2) and three that encode chloroplast-type enzymes (FAD7/8-1, FAD7/8-2, and FAD7/8-3). The FAD3-2 gene was duplicated in the A genome progenitor species after the evolutionary split from the D progenitor, but before the interspecific hybridization event that gave rise to modern tetraploid cotton. RNA-seq analysis revealed conserved, gene-specific expression patterns in various organs and cell types and semi-quantitative RT-PCR further revealed that FAD7/8-1 was specifically induced during cold temperature treatment of G. hirsutum seedlings. Conclusions: The omega-3 FAD gene family in cotton was characterized at the genome-wide level in three species, showing relatively ancient establishment of the gene family prior

  12. Genome-wide linkage analysis in a Dutch multigenerational family with attention deficit hyperactivity disorder

    PubMed Central

    Vegt, Rinus; Bertoli-Avella, Aida M; Tulen, Joke H M; de Graaf, Bianca; Verkerk, Annemieke J M H; Vervoort, Jeroen; Twigt, Carla M; Maat-Kievit, Anneke; van Tuijl, Ruud; van der Lijn, Marieke; Hengeveld, Michiel W; Oostra, Ben A

    2010-01-01

    Attention deficit hyperactivity disorder (ADHD) is a common neuropsychiatric disorder. Genetics has an important role in the aetiology of this disease. In this study, we describe the clinical findings in a Dutch family with eight patients suffering from ADHD, in whom five had at least one other psychiatric disorder. We performed a genome-wide (parametric and nonparametric) affected-only linkage analysis. Two genomic regions on chromosomes 7 and 14 showed an excess of allele sharing among the definitely affected members of the family with suggestive LOD scores (2.1 and 2.08). Nonparametric linkage analyses (NPL) yielded a maxNPL of 2.92 (P=0.001) for marker D7S502 and a maxNPL score of 2.56 (P=0.003) for marker D14S275. We confirmed that all patients share the same haplotype in each region of 7p15.1–q31.33 and 14q11.2–q22.3. Interestingly, both loci have been reported before in Dutch (affected sib pairs) and German (extended families) ADHD linkage studies. Hopefully, the genome-wide association studies in ADHD will help to highlight specific polymorphisms and genes within the broad areas detected by our, as well as other, linkage studies. PMID:19707245
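
    The LOD scores quoted above compare the likelihood of the family data under linkage with the likelihood under free recombination, on a log10 scale. The sketch below shows only the simplest textbook case, a two-point LOD for phase-known meioses; the study itself used parametric and nonparametric affected-only analyses, which are considerably more involved. The function names and the counts in the example are illustrative assumptions.

```python
import math


def lod_score(recombinants, meioses, theta):
    """Two-point LOD for phase-known meioses.

    LOD(theta) = log10( L(theta) / L(0.5) ), with
    L(theta) = theta**k * (1 - theta)**(n - k).
    """
    if not 0.0 < theta <= 0.5:
        raise ValueError("theta must lie in (0, 0.5]")
    k, n = recombinants, meioses
    log_l_theta = k * math.log10(theta) + (n - k) * math.log10(1.0 - theta)
    log_l_null = n * math.log10(0.5)
    return log_l_theta - log_l_null


def max_lod(recombinants, meioses, grid=500):
    """Grid search for the recombination fraction that maximises the LOD."""
    thetas = [0.5 * (i + 1) / grid for i in range(grid)]
    best = max(thetas, key=lambda t: lod_score(recombinants, meioses, t))
    return best, lod_score(recombinants, meioses, best)


# Hypothetical example: 1 recombinant observed in 10 informative meioses.
theta_hat, lod = max_lod(1, 10)
print(f"theta = {theta_hat:.3f}, LOD = {lod:.2f}")
```

    A LOD of 3 is the conventional threshold for significant linkage in this setting, which is why scores of 2.1 and 2.08 are described in the abstract as suggestive.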

  13. Technical advances: genome-wide cDNA-AFLP analysis of the Arabidopsis transcriptome.

    PubMed

    Volkmuth, Wayne; Turk, Stefan; Shapiro, Amy; Fang, Yiwen; Kiegle, Ed; van Haaren, Mark; Donson, Jonathan

    2003-01-01

    cDNA-AFLP, a technology historically used to identify small numbers of differentially expressed genes, was adapted as a genome-wide transcript profiling method. mRNA levels were assayed in a diverse range of tissues from Arabidopsis thaliana plants grown under a variety of environmental conditions. The resulting cDNA-AFLP fragments were sequenced. By linking cDNA-AFLP fragments to their corresponding mRNAs via these sequences, a database was generated that contained quantitative expression information for up to two-thirds of gene loci in A. thaliana, ecotype Ws. Using this resource, the expression levels of genes, including those with high nucleotide sequence similarity, could be determined in a high-throughput manner merely by comparing cDNA-AFLP profiles with the database. The lengths of cDNA-AFLP fragments inferred from their electrophoretic mobilities correlated well with actual fragment lengths determined by sequencing. In addition, the concentrations of AFLP fragments from single cDNAs were highly correlated, illustrating the validity of cDNA-AFLP as a quantitative, genome-wide, transcript profiling method. cDNA-AFLP profiles were also qualitatively consistent with mRNA profiles obtained from parallel microarray analysis, and with data from previous studies. PMID:14506844

  14. An efficient hierarchical generalized linear mixed model for pathway analysis of genome-wide association studies

    PubMed Central

    Wang, Lily; Jia, Peilin; Wolfinger, Russell D.; Chen, Xi; Grayson, Britney L.; Aune, Thomas M.; Zhao, Zhongming

    2011-01-01

    Motivation: In genome-wide association studies (GWAS) of complex diseases, genetic variants having real but weak associations often fail to be detected at the stringent genome-wide significance level. Pathway analysis, which tests disease association with combined association signals from a group of variants in the same pathway, has become increasingly popular. However, because of the complexities in genetic data and the large sample sizes in typical GWAS, pathway analysis remains challenging. We propose a new statistical model for pathway analysis of GWAS. This model includes a fixed effects component that models mean disease association for a group of genes, and a random effects component that models how each gene's association with disease varies about the gene group mean, and thus belongs to the class of mixed effects models. Results: The proposed model is computationally efficient and uses only summary statistics. In addition, it corrects for the presence of overlapping genes and linkage disequilibrium (LD). Via simulated and real GWAS data, we showed our model improved power over currently available pathway analysis methods while preserving type I error rate. Furthermore, using the WTCCC Type 1 Diabetes (T1D) dataset, we demonstrated mixed model analysis identified meaningful biological processes that agreed well with previous reports on T1D. Therefore, the proposed methodology provides an efficient statistical modeling framework for systems analysis of GWAS. Availability: The software code for mixed models analysis is freely available at http://biostat.mc.vanderbilt.edu/LilyWang. Contact: lily.wang@vanderbilt.edu; zhongming.zhao@vanderbilt.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:21266443

  15. Promising Loci and Genes for Yolk and Ovary Weight in Chickens Revealed by a Genome-Wide Association Study

    PubMed Central

    Yi, Guoqiang; Yuan, Jingwei; Duan, Zhongyi; Qu, Lujiang; Xu, Guiyun; Wang, Kehua; Yang, Ning

    2015-01-01

    Because it serves as the cytoplasm of the oocyte and provides a large amount of reserves, the egg yolk has biological significance for developing embryos. The ovary and its hierarchy of follicles are the main reproductive organs responsible for yolk deposition in chickens. However, the genetic architecture underlying the yolk and ovarian follicle weights remains elusive. Here, we measured the yolk weight (YW) at 11 age points from onset of egg laying to 72 weeks of age and measured the follicle weight (FW) and ovary weight (OW) at 73 weeks as part of a comprehensive genome-wide association study (GWAS) in 1,534 F2 hens derived from reciprocal crosses between White Leghorn (WL) and Dongxiang chickens (DX). For all ages, YWs exhibited moderate single nucleotide polymorphism (SNP)-based heritability estimates (0.25–0.38), while the estimates for FW (0.16) and OW (0.20) were relatively low. Independent univariate genome-wide screens for each trait identified 12, 3, and 31 novel significant associations with YW, FW, and OW, respectively. A list of candidate genes such as ZAR1, STARD13, ACER1b, ACSBG2, and DHRS12 were identified for having a plausible function in yolk and follicle development. These genes are important to the initiation of embryogenesis, lipid transport, lipoprotein synthesis, lipid droplet promotion, and steroid hormone metabolism, respectively. Our study provides for the first time a genome-wide association (GWA) analysis for follicle and ovary weight. Identification of the promising loci as well as potential candidate genes will greatly advance our understanding of the genetic basis underlying dynamic yolk weight and ovarian follicle development and has practical significance in breeding programs for the alteration of yolk weight at different age points. PMID:26332579

  16. Meta-analysis of New Genome-wide Association Studies of Colorectal Cancer Risk

    PubMed Central

    Peters, Ulrike; Hutter, Carolyn M.; Hsu, Li; Schumacher, Fredrick R.; Conti, David V.; Carlson, Christopher S.; Edlund, Christopher K.; Haile, Robert W.; Gallinger, Steven; Zanke, Brent W.; Lemire, Mathieu; Rangrej, Jagadish; Vijayaraghavan, Raakhee; Chan, Andrew T.; Hazra, Aditi; Hunter, David J.; Ma, Jing; Fuchs, Charles S.; Giovannucci, Edward L.; Kraft, Peter; Liu, Yan; Chen, Lin; Jiao, Shuo; Makar, Karen W.; Taverna, Darin; Gruber, Stephen B.; Rennert, Gad; Moreno, Victor; Ulrich, Cornelia M.; Woods, Michael O.; Green, Roger C.; Parfrey, Patrick S.; Prentice, Ross L.; Kooperberg, Charles; Jackson, Rebecca D.; LaCroix, Andrea Z.; Caan, Bette J.; Hayes, Richard B.; Berndt, Sonja I.; Chanock, Stephen J.; Schoen, Robert E.; Chang-Claude, Jenny; Hoffmeister, Michael; Brenner, Hermann; Frank, Bernd; Bézieau, Stéphane; Küry, Sébastien; Slattery, Martha L.; Hopper, John L.; Jenkins, Mark A.; Le Marchand, Loic; Lindor, Noralane M.; Newcomb, Polly A.; Seminara, Daniela; Hudson, Thomas J.; Duggan, David J.; Potter, John D.; Casey, Graham

    2011-01-01

    Colorectal cancer is the second leading cause of cancer death in developed countries. Genome-wide association studies (GWAS) have successfully identified novel susceptibility loci for colorectal cancer. To follow up on these findings, and try to identify novel colorectal cancer susceptibility loci, we present results for genome-wide association studies (GWAS) of colorectal cancer (2,906 cases, 3,416 controls) that have not previously published main associations. Specifically, we calculated odds ratios (ORs) and 95% confidence intervals (CIs) using log-additive models for each study. In order to improve our power to detect novel colorectal cancer susceptibility loci, we performed a meta-analysis combining the results across studies. We selected the most statistically significant single nucleotide polymorphisms (SNPs) for replication using 10 independent studies (8,161 cases and 9,101 controls). We again used a meta-analysis to summarize results for the replication studies alone, and for a combined analysis of GWAS and replication studies. We measured 10 SNPs previously identified in colorectal cancer susceptibility loci and found eight to be associated with colorectal cancer (p-value range: 0.02 to 1.8 × 10−8). When we excluded studies that have previously published on these SNPs, five SNPs remained significant at p<0.05 in the combined analysis. No novel susceptibility loci were significant in the replication study after adjustment for multiple testing, and none reached genome-wide significance from a combined analysis of GWAS and replication. We observed marginally significant evidence for a second independent SNP in the BMP2 region at chromosomal location 20p12 (rs4813802; replication p-value 0.03; combined p-value 7.3 × 10−5). In a region on 5p15.33, which includes the coding regions of the TERT-CLPTM1L genes and has been identified in GWAS to be associated with susceptibility to at least seven other cancers, we observed a marginally significant
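
    The per-study odds ratios and 95% confidence intervals described in this record can be pooled in several ways; the abstract does not say which model was used. The sketch below assumes a standard inverse-variance fixed-effect meta-analysis on the log-odds scale. The function name `fixed_effect_meta` and its inputs are hypothetical and are not taken from the study.

```python
import math

Z95 = 1.959963984540054  # two-sided 95% normal quantile


def fixed_effect_meta(odds_ratios, ci_lowers, ci_uppers):
    """Pool per-study odds ratios with inverse-variance weights.

    Each study contributes log(OR) with a standard error recovered
    from the width of its 95% CI on the log scale (assumes the CI is
    symmetric in log-odds, as produced by a logistic model).
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for or_, lo, hi in zip(odds_ratios, ci_lowers, ci_uppers):
        beta = math.log(or_)
        se = (math.log(hi) - math.log(lo)) / (2 * Z95)
        weight = 1.0 / se ** 2
        weighted_sum += weight * beta
        weight_total += weight

    pooled_beta = weighted_sum / weight_total
    pooled_se = math.sqrt(1.0 / weight_total)
    z = pooled_beta / pooled_se
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return {
        "or": math.exp(pooled_beta),
        "ci_lower": math.exp(pooled_beta - Z95 * pooled_se),
        "ci_upper": math.exp(pooled_beta + Z95 * pooled_se),
        "se": pooled_se,
        "p": p_value,
    }
```

    With a single study the pooled estimate reproduces that study's OR and CI; adding studies with consistent effects narrows the interval, which is the gain in power the abstract refers to.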

  17. Stepwise Evolution of Coral Biomineralization Revealed with Genome-Wide Proteomics and Transcriptomics.

    PubMed

    Takeuchi, Takeshi; Yamada, Lixy; Shinzato, Chuya; Sawada, Hitoshi; Satoh, Noriyuki

    2016-01-01

    Despite the importance of stony corals in many research fields related to global issues, such as marine ecology, climate change, paleoclimatology, and metazoan evolution, very little is known about the evolutionary origin of coral skeleton formation. In order to investigate the evolution of coral biomineralization, we have identified skeletal organic matrix proteins (SOMPs) in the skeletal proteome of the scleractinian coral, Acropora digitifera, for which large genomic and transcriptomic datasets are available. Scrupulous gene annotation was conducted based on comparisons of functional domain structures among metazoans. We found that SOMPs include not only coral-specific proteins, but also protein families that are widely conserved among cnidarians and other metazoans. We also identified several conserved transmembrane proteins in the skeletal proteome. Gene expression analysis revealed that expression of these conserved genes continues throughout development. Therefore, these genes are involved not only in skeleton formation, but also in basic cellular functions, such as cell-cell interaction and signaling. On the other hand, genes encoding coral-specific proteins, including extracellular matrix domain-containing proteins, galaxins, and acidic proteins, were prominently expressed in post-settlement stages, indicating their role in skeleton formation. Taken together, the process of coral skeleton formation is hypothesized as: 1) formation of initial extracellular matrix between epithelial cells and substrate, employing pre-existing transmembrane proteins; 2) additional extracellular matrix formation using novel proteins that have emerged by domain shuffling and rapid molecular evolution; and 3) calcification controlled by coral-specific SOMPs. PMID:27253604

  18. Stepwise Evolution of Coral Biomineralization Revealed with Genome-Wide Proteomics and Transcriptomics

    PubMed Central

    Sawada, Hitoshi; Satoh, Noriyuki

    2016-01-01

    Despite the importance of stony corals in many research fields related to global issues, such as marine ecology, climate change, paleoclimatology, and metazoan evolution, very little is known about the evolutionary origin of coral skeleton formation. In order to investigate the evolution of coral biomineralization, we have identified skeletal organic matrix proteins (SOMPs) in the skeletal proteome of the scleractinian coral, Acropora digitifera, for which large genomic and transcriptomic datasets are available. Scrupulous gene annotation was conducted based on comparisons of functional domain structures among metazoans. We found that SOMPs include not only coral-specific proteins, but also protein families that are widely conserved among cnidarians and other metazoans. We also identified several conserved transmembrane proteins in the skeletal proteome. Gene expression analysis revealed that expression of these conserved genes continues throughout development. Therefore, these genes are involved not only in skeleton formation, but also in basic cellular functions, such as cell-cell interaction and signaling. On the other hand, genes encoding coral-specific proteins, including extracellular matrix domain-containing proteins, galaxins, and acidic proteins, were prominently expressed in post-settlement stages, indicating their role in skeleton formation. Taken together, the process of coral skeleton formation is hypothesized as: 1) formation of initial extracellular matrix between epithelial cells and substrate, employing pre-existing transmembrane proteins; 2) additional extracellular matrix formation using novel proteins that have emerged by domain shuffling and rapid molecular evolution; and 3) calcification controlled by coral-specific SOMPs. PMID:27253604

  19. HLA has strongest association with IgA nephropathy in genome-wide analysis.

    PubMed

    Feehally, John; Farrall, Martin; Boland, Anne; Gale, Daniel P; Gut, Ivo; Heath, Simon; Kumar, Ashish; Peden, John F; Maxwell, Patrick H; Morris, David L; Padmanabhan, Sandosh; Vyse, Timothy J; Zawadzka, Anna; Rees, Andrew J; Lathrop, Mark; Ratcliffe, Peter J

    2010-10-01

    Demographic and family studies support the existence of a genetic contribution to the pathogenesis of IgA nephropathy, but results from genetic association studies of candidate genes are inconsistent. To systematically survey common genetic variation in this disease, we performed a genome-wide analysis in a cohort of patients with IgA nephropathy selected from the UK Glomerulonephritis DNA Bank. We used two groups of controls: parents of affected individuals and previously genotyped, unaffected, ancestry-matched individuals from the 1958 British Birth Cohort and the UK Blood Service. We genotyped 914 affected or family controls for 318,127 single nucleotide polymorphisms (SNPs). Filtering for low genotype call rates and inferred non-European ancestry left 533 genotyped individuals (187 affected children) for the family-based association analysis and 244 cases and 4980 controls for the case-control analysis. A total of 286,200 SNPs with call rates >95% were available for analysis. Genome-wide analysis showed a strong signal of association on chromosome 6p in the region of the MHC (P = 1 × 10−9). The two most strongly associated SNPs showed consistent association in both family-based and case-control analyses. HLA imputation analysis showed that the strongest association signal arose from a combination of DQ loci with some support for an independent HLA-B signal. These results suggest that the HLA region contains the strongest common susceptibility alleles that predispose to IgA nephropathy in the European population. PMID:20595679
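
    The call-rate filter mentioned in this record (SNPs with call rates >95% retained) can be illustrated with a minimal sketch. The data layout (a dict mapping SNP IDs to lists of genotype calls, with `None` for a missing call) and the function names are assumptions made for illustration; the study's actual pipeline is not described in the abstract.

```python
def snp_call_rates(genotypes):
    """Return the fraction of non-missing calls for each SNP.

    `genotypes` maps a SNP ID to a list of per-sample calls,
    where None marks a failed (missing) genotype call.
    """
    return {
        snp: sum(call is not None for call in calls) / len(calls)
        for snp, calls in genotypes.items()
    }


def filter_by_call_rate(genotypes, threshold=0.95):
    """Keep SNP IDs whose call rate is strictly above `threshold`."""
    rates = snp_call_rates(genotypes)
    return [snp for snp, rate in rates.items() if rate > threshold]


if __name__ == "__main__":
    # Toy data for 20 samples; SNP names are placeholders.
    toy = {
        "snp_a": ["AA"] * 20,            # 20/20 called
        "snp_b": ["AG"] * 19 + [None],   # 19/20 called, exactly 0.95
        "snp_c": ["GG"] * 10 + [None] * 10,
    }
    print(filter_by_call_rate(toy))
```

    Because the comparison is strict, a SNP called in exactly 95% of samples is dropped, matching the ">95%" wording of the abstract.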

  20. Heritability and genome-wide linkage analysis of migraine in the genetic isolate of Norfolk Island.

    PubMed

    Cox, Hannah C; Lea, Rod A; Bellis, Claire; Nyholt, Dale R; Dyer, Thomas D; Haupt, Larisa M; Charlesworth, Jac; Matovinovic, Elizabeth; Blangero, John; Griffiths, Lyn R

    2012-02-15

    Migraine is a common neurovascular disorder with a complex envirogenomic aetiology. In an effort to identify migraine susceptibility genes, we conducted a study of the isolated population of Norfolk Island, Australia. A large portion of the permanent inhabitants of Norfolk Island are descended from 18th Century English sailors involved in the infamous mutiny on the Bounty and their Polynesian consorts. In total, 600 subjects were recruited including a large pedigree of 377 individuals with lineage to the founders. All individuals were phenotyped for migraine using International Classification of Headache Disorders-II criteria. All subjects were genotyped for a genome-wide panel of microsatellite markers. Genotype and phenotype data for the pedigree were analysed using heritability and linkage methods implemented in the programme SOLAR. Follow-up association analysis was performed using the CLUMP programme. A total of 154 migraine cases (25%) were identified indicating the Norfolk Island population is high-risk for migraine. Heritability estimation of the 377-member pedigree indicated a significant genetic component for migraine (h²=0.53, P=0.016). Linkage analysis showed peaks on chromosome 13q33.1 (P=0.003) and chromosome 9q22.32 (P=0.008). Association analysis of the key microsatellites in the remaining 223 unrelated Norfolk Island individuals showed evidence of association, which strengthens support for the linkage findings (P≤0.05). In conclusion, a genome-wide linkage analysis and follow-up association analysis of migraine in the genetic isolate of Norfolk Island provided evidence for migraine susceptibility loci on chromosomes 9q22.22 and 13q33.1. PMID:22197687

  1. Insights into the genetic history of Green-legged Partridgelike fowl: mtDNA and genome-wide SNP analysis

    PubMed Central

    Siwek, M; Wragg, D; Sławińska, A; Malek, M; Hanotte, O; Mwacharo, JM

    2013-01-01

    The Green-legged Partridgelike (GP) fowl, an old native Polish breed, is characterised by reseda green-coloured shanks rather than yellow, white, slate or black commonly observed across most domestic breeds of chicken. Here, we investigate the origin, genetic relationships and structure of the GP fowl using mtDNA D-loop sequencing and genome-wide SNP analysis. Genome-wide association analysis between breeds enables us to verify the genetic control of the reseda green shank phenotype, a defining trait for the breed. Two mtDNA D-loop haplogroups and three autosomal genetic backgrounds are revealed. Significant associations of SNPs on chromosomes GGA24 and GGAZ indicate that the reseda green leg phenotype is associated with recessive alleles linked to the W and Id loci. Our results provide new insights into the genetic history of European chicken, indicating an admixed origin of East European traditional breeds of chicken on the continent, as supported by the presence of the reseda green phenotype and the knowledge that the GP fowl as a breed was developed before the advent of commercial stocks. PMID:23611337

  2. Exploring genome wide bisulfite sequencing for DNA methylation analysis in livestock: a technical assessment.

    PubMed

    Doherty, Rachael; Couldrey, Christine

    2014-01-01

    Recent advances made in "omics" technologies are contributing to a revolution in livestock selection and breeding practices. Epigenetic mechanisms, including DNA methylation, are important determinants for the control of gene expression in mammals. DNA methylation research will help our understanding of how environmental factors contribute to phenotypic variation of complex production and health traits. High-throughput sequencing is a vital tool for the comprehensive analysis of DNA methylation, and bisulfite-based strategies coupled with DNA sequencing allow for quantitative, site-specific methylation analysis at the genome level or genome wide. Reduced representation bisulfite sequencing (RRBS) and more recently whole genome bisulfite sequencing (WGBS) have proven to be effective techniques for studying DNA methylation in both humans and mice. Here we report the development of RRBS and WGBS for use in sheep, the first application of this technology in livestock species. Important technical issues associated with these methodologies, including fragment size selection and sequence depth, are examined and discussed. PMID:24860595

  3. Genome-Wide Expression Profiling Reveals S100B as Biomarker for Invasive Aspergillosis.

    PubMed

    Dix, Andreas; Czakai, Kristin; Springer, Jan; Fliesser, Mirjam; Bonin, Michael; Guthke, Reinhard; Schmitt, Anna L; Einsele, Hermann; Linde, Jörg; Löffler, Jürgen

    2016-01-01

    Invasive aspergillosis (IA) is a devastating opportunistic infection and its treatment constitutes a considerable burden for the health care system. Immunocompromised patients are at an increased risk for IA, which is mainly caused by the species Aspergillus fumigatus. An early and reliable diagnosis is required to initiate the appropriate antifungal therapy. However, diagnostic sensitivity and accuracy still need to be improved, which can be achieved at least partly by the definition of new biomarkers. Besides the direct detection of the pathogen by the current diagnostic methods, the analysis of the host response is a promising strategy toward this aim. Following this approach, we sought to identify new biomarkers for IA. For this purpose, we analyzed gene expression profiles of hematological patients and compared profiles of patients suffering from IA with non-IA patients. Based on microarray data, we applied a comprehensive feature selection using a random forest classifier. We identified the transcript coding for the S100 calcium-binding protein B (S100B) as a potential new biomarker for the diagnosis of IA. Considering the expression of this gene, we were able to classify samples from patients with IA with 82.3% sensitivity and 74.6% specificity. Moreover, we validated the expression of S100B in a real-time reverse transcription polymerase chain reaction (RT-PCR) assay and we also found a down-regulation of S100B in A. fumigatus stimulated DCs. An influence on the IL1B and CXCL1 downstream levels was demonstrated by this S100B knockdown. In conclusion, this study covers an effective feature selection revealing a key regulator of the human immune response during IA. S100B may represent an additional diagnostic marker that in combination with the established techniques may improve the accuracy of IA diagnosis. PMID:27047454
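
    Sensitivity and specificity, the two figures reported for the S100B classifier, are computed from a confusion matrix. The sketch below shows the arithmetic on placeholder labels; the function name and the toy data are hypothetical and do not reproduce the study's samples or its 82.3%/74.6% result.

```python
def sensitivity_specificity(y_true, y_pred):
    """Compute sensitivity and specificity for binary labels.

    Labels are 1 for the positive class (e.g. IA) and 0 for the
    negative class (e.g. non-IA).
    sensitivity = TP / (TP + FN), specificity = TN / (TN + FP)
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)


if __name__ == "__main__":
    # Placeholder labels: 3 positive and 2 negative samples.
    truth = [1, 1, 1, 0, 0]
    predicted = [1, 1, 0, 0, 1]
    sens, spec = sensitivity_specificity(truth, predicted)
    print(f"sensitivity={sens:.3f} specificity={spec:.3f}")
```

    Sensitivity is the share of true IA samples the classifier catches, and specificity is the share of non-IA samples it correctly rejects, so the two numbers describe different kinds of error.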

  4. Genome-wide comparison of cowpox viruses reveals a new clade related to Variola virus.

    PubMed

    Dabrowski, Piotr Wojtek; Radonić, Aleksandar; Kurth, Andreas; Nitsche, Andreas

    2013-01-01

    Zoonotic infections caused by several orthopoxviruses (OPV) like monkeypox virus or vaccinia virus have a significant impact on human health. In Europe, the number of diagnosed infections with cowpox viruses (CPXV) is increasing in animals as well as in humans. CPXV used to be enzootic in cattle; however, such infections were not being diagnosed over the last decades. Instead, individual cases of cowpox are being found in cats or exotic zoo animals that transmit the infection to humans. Both animals and humans reveal local exanthema on arms and legs or on the face. Although cowpox is generally regarded as a self-limiting disease, immunosuppressed patients can develop a lethal systemic disease resembling smallpox. To date, only limited information on the complex and, compared to other OPV, sparsely conserved CPXV genomes is available. Since CPXV displays the widest host range of all OPV known, it seems important to comprehend the genetic repertoire of CPXV which in turn may help elucidate specific mechanisms of CPXV pathogenesis and origin. Therefore, 22 genomes of independent CPXV strains from clinical cases, involving ten humans, four rats, two cats, two jaguarundis, one beaver, one elephant, one marah and one mongoose, were sequenced by using massive parallel pyrosequencing. The extensive phylogenetic analysis showed that the CPXV strains sequenced clearly cluster into several distinct clades, some of which are closely related to Vaccinia viruses while others represent different clades in a CPXV cluster. Particularly one CPXV clade is more closely related to Camelpox virus, Taterapox virus and Variola virus than to any other known OPV. These results support and extend recent data from other groups who postulate that CPXV does not form a monophyletic clade and should be divided into multiple lineages. PMID:24312452

  5. Genome-Wide Expression Profiling Reveals S100B as Biomarker for Invasive Aspergillosis

    PubMed Central

    Dix, Andreas; Czakai, Kristin; Springer, Jan; Fliesser, Mirjam; Bonin, Michael; Guthke, Reinhard; Schmitt, Anna L.; Einsele, Hermann; Linde, Jörg; Löffler, Jürgen

    2016-01-01

    Invasive aspergillosis (IA) is a devastating opportunistic infection and its treatment constitutes a considerable burden for the health care system. Immunocompromised patients are at an increased risk for IA, which is mainly caused by the species Aspergillus fumigatus. An early and reliable diagnosis is required to initiate the appropriate antifungal therapy. However, diagnostic sensitivity and accuracy still need to be improved, which can be achieved at least partly by the definition of new biomarkers. Besides the direct detection of the pathogen by the current diagnostic methods, the analysis of the host response is a promising strategy toward this aim. Following this approach, we sought to identify new biomarkers for IA. For this purpose, we analyzed gene expression profiles of hematological patients and compared profiles of patients suffering from IA with non-IA patients. Based on microarray data, we applied a comprehensive feature selection using a random forest classifier. We identified the transcript coding for the S100 calcium-binding protein B (S100B) as a potential new biomarker for the diagnosis of IA. Considering the expression of this gene, we were able to classify samples from patients with IA with 82.3% sensitivity and 74.6% specificity. Moreover, we validated the expression of S100B in a real-time reverse transcription polymerase chain reaction (RT-PCR) assay and we also found a down-regulation of S100B in A. fumigatus stimulated DCs. An influence on the IL1B and CXCL1 downstream levels was demonstrated by this S100B knockdown. In conclusion, this study covers an effective feature selection revealing a key regulator of the human immune response during IA. S100B may represent an additional diagnostic marker that in combination with the established techniques may improve the accuracy of IA diagnosis. PMID:27047454

  6. Genome-Wide Linkage Analysis Identifies Loci for Physical Appearance Traits in Chickens

    PubMed Central

    Sun, Yanfa; Liu, Ranran; Zhao, Guiping; Zheng, Maiqing; Sun, Yan; Yu, Xiaoqiong; Li, Peng; Wen, Jie

    2015-01-01

    Physical appearance traits, such as feather-crested head, comb size and type, beard, wattles size, and feathered feet, are used to distinguish between breeds of chicken and also may be associated with economic traits. In this study, a genome-wide linkage analysis was used to identify candidate regions and genes for physical appearance traits and to potentially provide further knowledge of the molecular mechanisms that underlie these traits. The linkage analysis was conducted with an F2 population derived from Beijing-You chickens and a commercial broiler line. Single-nucleotide polymorphisms were analyzed using the Illumina 60K Chicken SNP Beadchip. The data were used to map quantitative trait loci and genes for six physical appearance traits. A 10-cM/0.51-Mb region (0.0−10.0 cM/0.00−0.51 Mb) with 1% genome-wide significance level on LGE22C19W28_E50C23 linkage group (LGE22) for crest trait was identified, which is likely very closely linked to the HOXC8. A QTL with 5% chromosome-wide significance level for comb weight, which partly overlaps with a region identified in a previous study, was identified at 74 cM/25.55 Mb on chicken (Gallus gallus; GG) chromosome 3 (i.e., GGA3). For beard and wattles traits, an identical region 11 cM/2.23 Mb (0.0−11.0 cM/0.00−2.23 Mb) including WNT3 and GH genes on GGA27 was identified. Two QTL with 1% genome-wide significance level for feathered feet trait, one 9-cM/2.80-Mb (48.0−57.0 cM/13.40−16.20 Mb) region on GGA13, and another 12-cM/1.45-Mb (41.0−53.0 cM/11.37−12.82 Mb) region on GGA15 were identified. These candidate regions and genes provide important genetic information for the physical appearance traits in chicken. PMID:26248982

  7. Genome-wide meta-analysis of Psoriatic Arthritis Identifies Susceptibility Locus at REL

    PubMed Central

    Ellinghaus, Eva; Stuart, Philip E.; Ellinghaus, David; Nair, Rajan P.; Debrus, Sophie; Raelson, John V.; Belouchi, Majid; Tejasvi, Trilokraj; Li, Yanming; Tsoi, Lam C.; Onken, Anna T.; Esko, Tonu; Metspalu, Andres; Rahman, Proton; Gladman, Dafna D.; Bowcock, Anne M.; Helms, Cynthia; Krueger, Gerald G.; Koks, Sulev; Kingo, Külli; Gieger, Christian; Wichmann, H. Erich; Mrowietz, Ulrich; Weidinger, Stephan; Schreiber, Stefan; Abecasis, Gonçalo R.; Elder, James T.; Weichenthal, Michael; Franke, Andre

    2011-01-01

    Psoriatic arthritis (PsA) is a chronic inflammatory musculoskeletal disease affecting up to 30% of psoriasis vulgaris (PsV) cases and approximately 0.25% to 1% of the general population. To identify common susceptibility loci, we performed a meta-analysis of three imputed genome-wide association studies (GWAS) on psoriasis, stratified for PsA. A total of 1,160,703 SNPs were analyzed in the discovery set consisting of 535 PsA cases and 3,432 controls from Germany, the United States and Canada. We followed up two SNPs in 1,931 PsA cases and 6,785 controls comprising six independent replication panels from Germany, Estonia, the United States and Canada. In the combined analysis, a genome-wide significant association was detected at 2p16 near the REL locus encoding c-Rel (rs13017599, P=1.18×10−8, OR=1.27, 95% CI=1.18–1.35). The rs13017599 polymorphism is known to associate with rheumatoid arthritis (RA), and another SNP near REL (rs702873) was recently implicated in PsV susceptibility. However, conditional analysis indicated that rs13017599, rather than rs702873, accounts for the PsA association at REL. We hypothesize that c-Rel, as a member of the Rel/NF-κB family, is associated with PsA in the context of disease pathways that involve other identified PsA and PsV susceptibility genes including TNIP1, TNFAIP3 and NFκBIA. PMID:22170493

  8. MPE-seq, a new method for the genome-wide analysis of chromatin structure.

    PubMed

    Ishii, Haruhiko; Kadonaga, James T; Ren, Bing

    2015-07-01

    The analysis of chromatin structure is essential for the understanding of transcriptional regulation in eukaryotes. Here we describe methidiumpropyl-EDTA sequencing (MPE-seq), a method for the genome-wide characterization of chromatin that involves the digestion of nuclei with MPE-Fe(II) followed by massively parallel sequencing. Like micrococcal nuclease (MNase), MPE-Fe(II) preferentially cleaves the linker DNA between nucleosomes. However, there are differences in the cleavage of nuclear chromatin by MPE-Fe(II) relative to MNase. Most notably, immediately upstream of the transcription start site of active promoters, we frequently observed nucleosome-sized (141-190 bp) and subnucleosome-sized (such as 101-140 bp) peaks of digested chromatin fragments with MPE-seq but not with MNase-seq. These peaks also correlate with the presence of core histones and could thus be due, at least in part, to noncanonical chromatin structures such as labile nucleosome-like particles that have been observed in other contexts. The subnucleosome-sized MPE-seq peaks exhibit a particularly distinct association with active promoters. In addition, unlike MNase, MPE-Fe(II) cleaves nuclear DNA with little sequence bias. In this regard, we found that DNA sequences at RNA splice sites are hypersensitive to digestion by MNase but not by MPE-Fe(II). This phenomenon may have affected the analysis of nucleosome occupancy over exons. These findings collectively indicate that MPE-seq provides a unique and straightforward means for the genome-wide analysis of chromatin structure with minimal DNA sequence bias. In particular, the combined use of MPE-seq and MNase-seq enables the identification of noncanonical chromatin structures that are likely to be important for the regulation of gene expression. PMID:26080409
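
    The fragment-size classes named in this record can be expressed as a simple binning rule. The sketch below uses the two ranges quoted in the abstract (141-190 bp and 101-140 bp); the abstract gives the second range only as an example ("such as"), so treating it as a hard boundary is an assumption, as are the function names and the "other" label.

```python
from collections import Counter


def classify_fragment(length_bp):
    """Assign a fragment length (bp) to a size class.

    Ranges follow the abstract: 141-190 bp is nucleosome-sized and
    101-140 bp is subnucleosome-sized. Anything else is labelled
    "other" (a label introduced here for illustration only).
    """
    if 141 <= length_bp <= 190:
        return "nucleosome-sized"
    if 101 <= length_bp <= 140:
        return "subnucleosome-sized"
    return "other"


def tally_fragments(lengths_bp):
    """Count fragments per size class."""
    return Counter(classify_fragment(n) for n in lengths_bp)


if __name__ == "__main__":
    # Placeholder fragment lengths, not real sequencing data.
    print(tally_fragments([95, 101, 120, 140, 141, 165, 190, 210]))
```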

  9. Forty-three loci associated with plasma lipoprotein size, concentration, and cholesterol content in genome-wide analysis.

    PubMed

    Chasman, Daniel I; Paré, Guillaume; Mora, Samia; Hopewell, Jemma C; Peloso, Gina; Clarke, Robert; Cupples, L Adrienne; Hamsten, Anders; Kathiresan, Sekar; Mälarstig, Anders; Ordovas, José M; Ripatti, Samuli; Parker, Alex N; Miletich, Joseph P; Ridker, Paul M

    2009-11-01

    While conventional LDL-C, HDL-C, and triglyceride measurements reflect aggregate properties of plasma lipoprotein fractions, NMR-based measurements more accurately reflect lipoprotein particle concentrations according to class (LDL, HDL, and VLDL) and particle size (small, medium, and large). The concentrations of these lipoprotein sub-fractions may be related to risk of cardiovascular disease and related metabolic disorders. We performed a genome-wide association study of 17 lipoprotein measures determined by NMR together with LDL-C, HDL-C, triglycerides, ApoA1, and ApoB in 17,296 women from the Women's Genome Health Study (WGHS). Among 36 loci with genome-wide significance (P<5×10−8) in primary and secondary analysis, ten (PCCB/STAG1 (3q22.3), GMPR/MYLIP (6p22.3), BTNL2 (6p21.32), KLF14 (7q32.2), 8p23.1, JMJD1C (10q21.3), SBF2 (11p15.4), 12q23.2, CCDC92/DNAH10/ZNF664 (12q24.31.B), and WIPI1 (17q24.2)) have not been reported in prior genome-wide association studies for plasma lipid concentration. Associations with mean lipoprotein particle size but not cholesterol content were found for LDL at four loci (7q11.23, LPL (8p21.3), 12q24.31.B, and LIPG (18q21.1)) and for HDL at one locus (GCKR (2p23.3)). In addition, genetic determinants of total IDL and total VLDL concentration were found at many loci, most strongly at LIPC (15q22.1) and APOC-APOE complex (19q13.32), respectively. Associations at seven more loci previously known for effects on conventional plasma lipid measures reveal additional genetic influences on lipoprotein profiles and bring the total number of loci to 43. Thus, genome-wide associations identified novel loci involved with lipoprotein metabolism, including loci that affect the NMR-based measures of concentration or size of LDL, HDL, and VLDL particles, all characteristics of lipoprotein profiles that may impact disease risk but are not available by conventional assay. PMID:19936222
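
    The genome-wide significance cutoff quoted in this record (P below 5×10−8) is conventionally read as a Bonferroni correction of a 0.05 error rate over roughly one million independent tests. That reading is a common convention and an assumption here, not something the abstract states; the function name below is likewise hypothetical.

```python
def is_genome_wide_significant(p_value, alpha=0.05, n_tests=1_000_000):
    """Apply a Bonferroni threshold of alpha / n_tests.

    With the defaults the threshold is about 5e-8, the conventional
    genome-wide significance level. n_tests is an assumed count of
    independent tests, not a number taken from any study.
    """
    return p_value < alpha / n_tests


if __name__ == "__main__":
    # p-values quoted elsewhere in this listing, used as examples.
    for p in (1.18e-8, 7.3e-5):
        print(p, is_genome_wide_significant(p))
```

    The same record's locus count follows from simple addition: 36 loci at genome-wide significance plus seven previously known loci gives the stated total of 43.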

  10. A high definition look at the NF-Y regulome reveals genome-wide associations with selected transcription factors.

    PubMed

    Dolfini, Diletta; Zambelli, Federico; Pedrazzoli, Maurizio; Mantovani, Roberto; Pavesi, Giulio

    2016-06-01

    NF-Y is a trimeric transcription factor (TF), binding the CCAAT box element, for which several results suggest a pioneering role in activation of transcription. In this work, we integrated 380 ENCODE ChIP-Seq experiments for 154 TFs and cofactors with sequence analysis, protein-protein interactions and RNA profiling data, in order to identify genome-wide regulatory modules resulting from the co-association of NF-Y with other TFs. We identified three main degrees of co-association with NF-Y for sequence-specific TFs. In the most relevant one, we found TFs having a significant overlap with NF-Y in their DNA binding loci, some with a precise spacing of binding sites with respect to the CCAAT box, others (FOS, Sp1/2, RFX5, IRF3, PBX3) mostly lacking their canonical binding site and bound to arrays of well spaced CCAAT boxes. As expected, NF-Y binding also correlates with RNA Pol II General TFs and with subunits of complexes involved in the control of H3K4 methylations. Co-association patterns are confirmed by protein-protein interactions, and correspond to specific functional categorizations and expression level changes of target genes following NF-Y inactivation. These data define genome-wide rules for the organization of NF-Y-centered regulatory modules, supporting a model of distinct categorization and synergy with well defined sets of TFs. PMID:26896797

  11. A high definition look at the NF-Y regulome reveals genome-wide associations with selected transcription factors

    PubMed Central

    Dolfini, Diletta; Zambelli, Federico; Pedrazzoli, Maurizio; Mantovani, Roberto; Pavesi, Giulio

    2016-01-01

    NF-Y is a trimeric transcription factor (TF), binding the CCAAT box element, for which several results suggest a pioneering role in activation of transcription. In this work, we integrated 380 ENCODE ChIP-Seq experiments for 154 TFs and cofactors with sequence analysis, protein–protein interactions and RNA profiling data, in order to identify genome-wide regulatory modules resulting from the co-association of NF-Y with other TFs. We identified three main degrees of co-association with NF-Y for sequence-specific TFs. In the most relevant one, we found TFs having a significant overlap with NF-Y in their DNA binding loci, some with a precise spacing of binding sites with respect to the CCAAT box, others (FOS, Sp1/2, RFX5, IRF3, PBX3) mostly lacking their canonical binding site and bound to arrays of well spaced CCAAT boxes. As expected, NF-Y binding also correlates with RNA Pol II General TFs and with subunits of complexes involved in the control of H3K4 methylations. Co-association patterns are confirmed by protein–protein interactions, and correspond to specific functional categorizations and expression level changes of target genes following NF-Y inactivation. These data define genome-wide rules for the organization of NF-Y-centered regulatory modules, supporting a model of distinct categorization and synergy with well defined sets of TFs. PMID:26896797

  12. Genome-wide expression analysis of genetic networks in Neurospora crassa

    PubMed Central

    Logan, David A; Koch, Allison L; Dong, Wubei; Griffith, James; Nilsen, Roger; Case, Mary E; Schüttler, Heinz-Bernd; Arnold, Jonathan

    2007-01-01

    The products of five structural genes and two regulatory genes of the qa gene cluster of Neurospora crassa control the metabolism of quinic acid (QA) as a carbon source. A detailed genetic network model of this metabolic process has been reported. This investigation is designed to expand the current model of the QA reaction network. The ensemble method of network identification was used to model RNA profiling data on the qa gene cluster. Through microarray and cluster analysis, genome-wide identification of RNA transcripts associated with quinic acid metabolism in N. crassa is described and suggests a connection to other metabolic circuits. More than 100 genes whose products include carbon metabolism, protein degradation and modification, amino acid metabolism and ribosome synthesis appear to be connected to quinic acid metabolism. The core of the qa gene cluster network is validated with respect to RNA profiling data obtained from microarrays. PMID:17597928

  13. Pharmacogenetic meta-analysis of genome-wide association studies of LDL cholesterol response to statins.

    PubMed

    Postmus, Iris; Trompet, Stella; Deshmukh, Harshal A; Barnes, Michael R; Li, Xiaohui; Warren, Helen R; Chasman, Daniel I; Zhou, Kaixin; Arsenault, Benoit J; Donnelly, Louise A; Wiggins, Kerri L; Avery, Christy L; Griffin, Paula; Feng, QiPing; Taylor, Kent D; Li, Guo; Evans, Daniel S; Smith, Albert V; de Keyser, Catherine E; Johnson, Andrew D; de Craen, Anton J M; Stott, David J; Buckley, Brendan M; Ford, Ian; Westendorp, Rudi G J; Slagboom, P Eline; Sattar, Naveed; Munroe, Patricia B; Sever, Peter; Poulter, Neil; Stanton, Alice; Shields, Denis C; O'Brien, Eoin; Shaw-Hawkins, Sue; Chen, Y-D Ida; Nickerson, Deborah A; Smith, Joshua D; Dubé, Marie Pierre; Boekholdt, S Matthijs; Hovingh, G Kees; Kastelein, John J P; McKeigue, Paul M; Betteridge, John; Neil, Andrew; Durrington, Paul N; Doney, Alex; Carr, Fiona; Morris, Andrew; McCarthy, Mark I; Groop, Leif; Ahlqvist, Emma; Bis, Joshua C; Rice, Kenneth; Smith, Nicholas L; Lumley, Thomas; Whitsel, Eric A; Stürmer, Til; Boerwinkle, Eric; Ngwa, Julius S; O'Donnell, Christopher J; Vasan, Ramachandran S; Wei, Wei-Qi; Wilke, Russell A; Liu, Ching-Ti; Sun, Fangui; Guo, Xiuqing; Heckbert, Susan R; Post, Wendy; Sotoodehnia, Nona; Arnold, Alice M; Stafford, Jeanette M; Ding, Jingzhong; Herrington, David M; Kritchevsky, Stephen B; Eiriksdottir, Gudny; Launer, Leonore J; Harris, Tamara B; Chu, Audrey Y; Giulianini, Franco; MacFadyen, Jean G; Barratt, Bryan J; Nyberg, Fredrik; Stricker, Bruno H; Uitterlinden, André G; Hofman, Albert; Rivadeneira, Fernando; Emilsson, Valur; Franco, Oscar H; Ridker, Paul M; Gudnason, Vilmundur; Liu, Yongmei; Denny, Joshua C; Ballantyne, Christie M; Rotter, Jerome I; Adrienne Cupples, L; Psaty, Bruce M; Palmer, Colin N A; Tardif, Jean-Claude; Colhoun, Helen M; Hitman, Graham; Krauss, Ronald M; Wouter Jukema, J; Caulfield, Mark J

    2014-01-01

    Statins effectively lower LDL cholesterol levels in large studies and the observed interindividual response variability may be partially explained by genetic variation. Here we perform a pharmacogenetic meta-analysis of genome-wide association studies (GWAS) in studies addressing the LDL cholesterol response to statins, including up to 18,596 statin-treated subjects. We validate the most promising signals in a further 22,318 statin recipients and identify two loci, SORT1/CELSR2/PSRC1 and SLCO1B1, not previously identified in GWAS. Moreover, we confirm the previously described associations with APOE and LPA. Our findings advance the understanding of the pharmacogenetic architecture of statin response. PMID:25350695

  14. Pharmacogenetic meta-analysis of genome-wide association studies of LDL cholesterol response to statins

    PubMed Central

    Postmus, Iris; Trompet, Stella; Deshmukh, Harshal A.; Barnes, Michael R.; Li, Xiaohui; Warren, Helen R.; Chasman, Daniel I.; Zhou, Kaixin; Arsenault, Benoit J.; Donnelly, Louise A.; Wiggins, Kerri L.; Avery, Christy L.; Griffin, Paula; Feng, QiPing; Taylor, Kent D.; Li, Guo; Evans, Daniel S.; Smith, Albert V.; de Keyser, Catherine E.; Johnson, Andrew D.; de Craen, Anton J. M.; Stott, David J.; Buckley, Brendan M.; Ford, Ian; Westendorp, Rudi G. J.; Eline Slagboom, P.; Sattar, Naveed; Munroe, Patricia B.; Sever, Peter; Poulter, Neil; Stanton, Alice; Shields, Denis C.; O’Brien, Eoin; Shaw-Hawkins, Sue; Ida Chen, Y.-D.; Nickerson, Deborah A.; Smith, Joshua D.; Pierre Dubé, Marie; Matthijs Boekholdt, S.; Kees Hovingh, G.; Kastelein, John J. P.; McKeigue, Paul M.; Betteridge, John; Neil, Andrew; Durrington, Paul N.; Doney, Alex; Carr, Fiona; Morris, Andrew; McCarthy, Mark I.; Groop, Leif; Ahlqvist, Emma; Bis, Joshua C.; Rice, Kenneth; Smith, Nicholas L.; Lumley, Thomas; Whitsel, Eric A.; Stürmer, Til; Boerwinkle, Eric; Ngwa, Julius S.; O’Donnell, Christopher J.; Vasan, Ramachandran S.; Wei, Wei-Qi; Wilke, Russell A.; Liu, Ching-Ti; Sun, Fangui; Guo, Xiuqing; Heckbert, Susan R.; Post, Wendy; Sotoodehnia, Nona; Arnold, Alice M.; Stafford, Jeanette M.; Ding, Jingzhong; Herrington, David M.; Kritchevsky, Stephen B.; Eiriksdottir, Gudny; Launer, Leonore J.; Harris, Tamara B.; Chu, Audrey Y.; Giulianini, Franco; MacFadyen, Jean G.; Barratt, Bryan J.; Nyberg, Fredrik; Stricker, Bruno H.; Uitterlinden, André G.; Hofman, Albert; Rivadeneira, Fernando; Emilsson, Valur; Franco, Oscar H.; Ridker, Paul M.; Gudnason, Vilmundur; Liu, Yongmei; Denny, Joshua C.; Ballantyne, Christie M.; Rotter, Jerome I.; Adrienne Cupples, L.; Psaty, Bruce M.; Palmer, Colin N. A.; Tardif, Jean-Claude; Colhoun, Helen M.; Hitman, Graham; Krauss, Ronald M.; Wouter Jukema, J.; Caulfield, Mark J.; Donnelly, Peter; Barroso, Ines; Blackwell, Jenefer M.; Bramon, Elvira; Brown, Matthew A.; Casas, Juan P.; Corvin, Aiden; Deloukas, Panos; Duncanson, Audrey; Jankowski, Janusz; Markus, Hugh S.; Mathew, Christopher G.; Palmer, Colin N. A.; Plomin, Robert; Rautanen, Anna; Sawcer, Stephen J.; Trembath, Richard C.; Viswanathan, Ananth C.; Wood, Nicholas W.; Spencer, Chris C. A.; Band, Gavin; Bellenguez, Céline; Freeman, Colin; Hellenthal, Garrett; Giannoulatou, Eleni; Pirinen, Matti; Pearson, Richard; Strange, Amy; Su, Zhan; Vukcevic, Damjan; Donnelly, Peter; Langford, Cordelia; Hunt, Sarah E.; Edkins, Sarah; Gwilliam, Rhian; Blackburn, Hannah; Bumpstead, Suzannah J.; Dronov, Serge; Gillman, Matthew; Gray, Emma; Hammond, Naomi; Jayakumar, Alagurevathi; McCann, Owen T.; Liddle, Jennifer; Potter, Simon C.; Ravindrarajah, Radhi; Ricketts, Michelle; Waller, Matthew; Weston, Paul; Widaa, Sara; Whittaker, Pamela; Barroso, Ines; Deloukas, Panos; Mathew, Christopher G.; Blackwell, Jenefer M.; Brown, Matthew A.; Corvin, Aiden; McCarthy, Mark I.; Spencer, Chris C. A.

    2014-01-01

    Statins effectively lower LDL cholesterol levels in large studies and the observed interindividual response variability may be partially explained by genetic variation. Here we perform a pharmacogenetic meta-analysis of genome-wide association studies (GWAS) in studies addressing the LDL cholesterol response to statins, including up to 18,596 statin-treated subjects. We validate the most promising signals in a further 22,318 statin recipients and identify two loci, SORT1/CELSR2/PSRC1 and SLCO1B1, not previously identified in GWAS. Moreover, we confirm the previously described associations with APOE and LPA. Our findings advance the understanding of the pharmacogenetic architecture of statin response. PMID:25350695

  15. A genome-wide resource for the analysis of protein localisation in Drosophila.

    PubMed

    Sarov, Mihail; Barz, Christiane; Jambor, Helena; Hein, Marco Y; Schmied, Christopher; Suchold, Dana; Stender, Bettina; Janosch, Stephan; K J, Vinay Vikas; Krishnan, R T; Krishnamoorthy, Aishwarya; Ferreira, Irene R S; Ejsmont, Radoslaw K; Finkl, Katja; Hasse, Susanne; Kämpfer, Philipp; Plewka, Nicole; Vinis, Elisabeth; Schloissnig, Siegfried; Knust, Elisabeth; Hartenstein, Volker; Mann, Matthias; Ramaswami, Mani; VijayRaghavan, K; Tomancak, Pavel; Schnorrer, Frank

    2016-01-01

    The Drosophila genome contains >13000 protein-coding genes, the majority of which remain poorly investigated. Important reasons include the lack of antibodies or reporter constructs to visualise these proteins. Here, we present a genome-wide fosmid library of 10000 GFP-tagged clones, comprising tagged genes and most of their regulatory information. For 880 tagged proteins, we created transgenic lines, and for a total of 207 lines, we assessed protein expression and localisation in ovaries, embryos, pupae or adults by stainings and live imaging approaches. Importantly, we visualised many proteins at endogenous expression levels and found a large fraction of them localising to subcellular compartments. By applying genetic complementation tests, we estimate that about two-thirds of the tagged proteins are functional. Moreover, these tagged proteins enable interaction proteomics from developing pupae and adult flies. Taken together, this resource will boost systematic analysis of protein expression and localisation in various cellular and developmental contexts. PMID:26896675

  16. Identification of Genetic Susceptibility Loci for Colorectal Tumors in a Genome-wide Meta-analysis

    PubMed Central

    Peters, Ulrike; Jiao, Shuo; Schumacher, Fredrick R.; Hutter, Carolyn M.; Aragaki, Aaron K.; Baron, John A.; Berndt, Sonja I.; Bézieau, Stéphane; Brenner, Hermann; Butterbach, Katja; Caan, Bette J.; Campbell, Peter T.; Carlson, Christopher S.; Casey, Graham; Chan, Andrew T.; Chang-Claude, Jenny; Chanock, Stephen J.; Chen, Lin S.; Coetzee, Gerhard A.; Coetzee, Simon G.; Conti, David V.; Curtis, Keith R.; Duggan, David; Edwards, Todd; Fuchs, Charles S.; Gallinger, Steven; Giovannucci, Edward L.; Gogarten, Stephanie M.; Gruber, Stephen B.; Haile, Robert W.; Harrison, Tabitha A.; Hayes, Richard B.; Henderson, Brian E.; Hoffmeister, Michael; Hopper, John L.; Hudson, Thomas J.; Hunter, David J.; Jackson, Rebecca D.; Jee, Sun Ha; Jenkins, Mark A.; Jia, Wei-Hua; Kolonel, Laurence N.; Kooperberg, Charles; Küry, Sébastien; Lacroix, Andrea Z.; Laurie, Cathy C.; Laurie, Cecelia A.; Le Marchand, Loic; Lemire, Mathieu; Levine, David; Lindor, Noralane M.; Liu, Yan; Ma, Jing; Makar, Karen W.; Matsuo, Keitaro; Newcomb, Polly A.; Potter, John D.; Prentice, Ross L.; Qu, Conghui; Rohan, Thomas; Rosse, Stephanie A.; Schoen, Robert E.; Seminara, Daniela; Shrubsole, Martha; Shu, Xiao-Ou; Slattery, Martha L.; Taverna, Darin; Thibodeau, Stephen N.; Ulrich, Cornelia M.; White, Emily; Xiang, Yongbing; Zanke, Brent W.; Zeng, Yi-Xin; Zhang, Ben; Zheng, Wei; Hsu, Li

    2013-01-01

    BACKGROUND & AIMS Heritable factors contribute to the development of colorectal cancer. Identifying the genetic loci associated with colorectal tumor formation could elucidate the mechanisms of pathogenesis. METHODS We conducted a genome-wide association study that included 14 studies, 12,696 cases of colorectal tumors (11,870 cancer, 826 adenoma), and 15,113 controls of European descent. The 10 most statistically significant, previously unreported findings were followed up in 6 studies; these included 3056 colorectal tumor cases (2098 cancer, 958 adenoma) and 6658 controls of European and Asian descent. RESULTS Based on the combined analysis, we identified a locus that reached the conventional genome-wide significance level at less than 5.0 × 10⁻⁸: an intergenic region on chromosome 2q32.3, close to nucleic acid binding protein 1 (most significant single nucleotide polymorphism: rs11903757; odds ratio [OR], 1.15 per risk allele; P = 3.7 × 10⁻⁸). We also found evidence for 3 additional loci with P values less than 5.0 × 10⁻⁷: a locus within the laminin gamma 1 gene on chromosome 1q25.3 (rs10911251; OR, 1.10 per risk allele; P = 9.5 × 10⁻⁸), a locus within the cyclin D2 gene on chromosome 12p13.32 (rs3217810; OR, 0.84 per risk allele; P = 5.9 × 10⁻⁸), and a locus in the T-box 3 gene on chromosome 12q24.21 (rs59336; OR, 0.91 per risk allele; P = 3.7 × 10⁻⁷). CONCLUSIONS In a large genome-wide association study, we associated polymorphisms close to nucleic acid binding protein 1 (which encodes a DNA-binding protein involved in DNA repair) with colorectal tumor risk. We also provided evidence for an association between colorectal tumor risk and polymorphisms in laminin gamma 1 (this is the second gene in the laminin family to be associated with colorectal cancers), cyclin D2 (which encodes cyclin D2), and T-box 3 (which encodes a T-box transcription factor and is a target of Wnt signaling to β-catenin). The roles of these genes and their products

  17. Genome-wide survey and expression analysis of the bHLH-PAS genes in the amphioxus Branchiostoma floridae reveal both conserved and diverged expression patterns between cephalochordates and vertebrates

    PubMed Central

    2014-01-01

    Background The bHLH-PAS transcription factors are found in both protostomes and deuterostomes. They are involved in many developmental and physiological processes, including regional differentiation of the central nervous system, tube-formation, hypoxia signaling, aromatic hydrocarbon sensing, and circadian rhythm regulation. To understand the evolution of these genes in chordates, we analyzed the bHLH-PAS genes of the basal chordate amphioxus (Branchiostoma floridae). Results From the amphioxus draft genome database, we identified ten bHLH-PAS genes, nine of which could be assigned to known orthologous families. The tenth bHLH-PAS gene could not be assigned confidently to any known bHLH family; however, phylogenetic analysis clustered this gene with arthropod Met family genes and two spiralian bHLH-PAS-containing sequences, suggesting that they may share the same ancestry. We examined temporal and spatial expression patterns of these bHLH-PAS genes in developing amphioxus embryos. We found that BfArnt, BfNcoa, BfSim, and BfHifα were expressed in the central nervous system in patterns similar to those of their vertebrate homologs, suggesting that their functions may be conserved. By contrast, the amphioxus BfAhr and BfNpas4 had expression patterns distinct from those in vertebrates. These results imply that there were changes in gene regulation after the divergence of cephalochordates and vertebrates. Conclusions We have identified ten bHLH-PAS genes from the amphioxus genome and determined the embryonic expression profiles for these genes. In addition to the nine currently recognized bHLH-PAS families, our survey suggests that the BfbHLHPAS-orphan gene along with arthropod Met genes and the newly identified spiralian bHLH-PAS-containing sequences represent an ancient group of genes that were lost in the vertebrate lineage. In a comparison with the expression patterns of the vertebrate bHLH-PAS paralogs, which are the result of whole-genome duplication, we found

  18. Genome-Wide Analysis of Seed Acid Detergent Lignin (ADL) and Hull Content in Rapeseed (Brassica napus L.).

    PubMed

    Wang, Jia; Jian, Hongju; Wei, Lijuan; Qu, Cunmin; Xu, Xinfu; Lu, Kun; Qian, Wei; Li, Jiana; Li, Maoteng; Liu, Liezhao

    2015-01-01

    A stable yellow-seeded variety is the breeding goal for obtaining the ideal rapeseed (Brassica napus L.) plant, and the amount of acid detergent lignin (ADL) in the seeds and the hull content (HC) are often used as yellow-seeded rapeseed screening indices. In this study, a genome-wide association analysis of 520 accessions was performed using the Q + K model with a total of 31,839 single-nucleotide polymorphism (SNP) sites. As a result, three significant associations on the B. napus chromosomes A05, A09, and C05 were detected for seed ADL content. The peak SNPs were within 9.27, 14.22, and 20.86 kb of the key genes BnaA.PAL4, BnaA.CAD2/BnaA.CAD3, and BnaC.CCR1, respectively. Further analyses were performed on the major locus of A05, which was also detected in the seed HC examination. A comparison of our genome-wide association study (GWAS) results and previous linkage mappings revealed a common chromosomal region on A09, which indicates that GWAS can be used as a powerful complementary strategy for dissecting complex traits in B. napus. Genomic selection (GS) utilizing the significant SNP markers based on the GWAS results exhibited increased predictive ability, indicating that the predictive ability of a given model can be substantially improved by using GWAS and GS. PMID:26673885

  19. Genome-Wide Analysis of Seed Acid Detergent Lignin (ADL) and Hull Content in Rapeseed (Brassica napus L.)

    PubMed Central

    Wei, Lijuan; Qu, Cunmin; Xu, Xinfu; Lu, Kun; Qian, Wei; Li, Jiana; Li, Maoteng; Liu, Liezhao

    2015-01-01

    A stable yellow-seeded variety is the breeding goal for obtaining the ideal rapeseed (Brassica napus L.) plant, and the amount of acid detergent lignin (ADL) in the seeds and the hull content (HC) are often used as yellow-seeded rapeseed screening indices. In this study, a genome-wide association analysis of 520 accessions was performed using the Q + K model with a total of 31,839 single-nucleotide polymorphism (SNP) sites. As a result, three significant associations on the B. napus chromosomes A05, A09, and C05 were detected for seed ADL content. The peak SNPs were within 9.27, 14.22, and 20.86 kb of the key genes BnaA.PAL4, BnaA.CAD2/BnaA.CAD3, and BnaC.CCR1, respectively. Further analyses were performed on the major locus of A05, which was also detected in the seed HC examination. A comparison of our genome-wide association study (GWAS) results and previous linkage mappings revealed a common chromosomal region on A09, which indicates that GWAS can be used as a powerful complementary strategy for dissecting complex traits in B. napus. Genomic selection (GS) utilizing the significant SNP markers based on the GWAS results exhibited increased predictive ability, indicating that the predictive ability of a given model can be substantially improved by using GWAS and GS. PMID:26673885

  20. Meta-analysis of genome-wide association studies for personality

    PubMed Central

    de Moor, Marleen H.M.; Costa, Paul T.; Terracciano, Antonio; Krueger, Robert F.; de Geus, Eco J.C.; Toshiko, Tanaka; Penninx, Brenda W.J.H.; Esko, Tõnu; Madden, Pamela A F; Derringer, Jaime; Amin, Najaf; Willemsen, Gonneke; Hottenga, Jouke-Jan; Distel, Marijn A.; Uda, Manuela; Sanna, Serena; Spinhoven, Philip; Hartman, Catharina A.; Sullivan, Patrick; Realo, Anu; Allik, Jüri; Heath, Andrew C; Pergadia, Michele L; Agrawal, Arpana; Lin, Peng; Grucza, Richard; Nutile, Teresa; Ciullo, Marina; Rujescu, Dan; Giegling, Ina; Konte, Bettina; Widen, Elisabeth; Cousminer, Diana L; Eriksson, Johan G.; Palotie, Aarno; Luciano, Michelle; Tenesa, Albert; Davies, Gail; Lopez, Lorna M.; Hansell, Narelle K.; Medland, Sarah E.; Ferrucci, Luigi; Schlessinger, David; Montgomery, Grant W.; Wright, Margaret J.; Aulchenko, Yurii S.; Janssens, A.Cecile J.W.; Oostra, Ben A.; Metspalu, Andres; Abecasis, Gonçalo R.; Deary, Ian J.; Räikkönen, Katri; Bierut, Laura J.; Martin, Nicholas G.; van Duijn, Cornelia M.; Boomsma, Dorret I.

    2013-01-01

    Personality can be thought of as a set of characteristics that influence people’s thoughts, feelings, and behaviour across a variety of settings. Variation in personality is predictive of many outcomes in life, including mental health. Here we report on a meta-analysis of genome-wide association (GWA) data for personality in ten discovery samples (17 375 adults) and five in-silico replication samples (3 294 adults). All participants were of European ancestry. Personality scores for Neuroticism, Extraversion, Openness to Experience, Agreeableness, and Conscientiousness were based on the NEO Five-Factor Inventory. Genotype data were available for ~2.4M Single Nucleotide Polymorphisms (SNPs; directly typed and imputed using HAPMAP data). In the discovery samples, classical association analyses were performed under an additive model followed by meta-analysis using the weighted inverse variance method. Results showed genome-wide significance for Openness to Experience near the RASA1 gene on 5q14.3 (rs1477268 and rs2032794, P = 2.8 × 10⁻⁸ and 3.1 × 10⁻⁸) and for Conscientiousness in the brain-expressed KATNAL2 gene on 18q21.1 (rs2576037, P = 4.9 × 10⁻⁸). We further conducted a gene-based test that confirmed the association of KATNAL2 to Conscientiousness. In-silico replication did not, however, show significant associations of the top SNPs with Openness and Conscientiousness, although the direction of effect of the KATNAL2 SNP on Conscientiousness was consistent in all replication samples. Larger scale GWA studies and alternative approaches are required for confirmation of KATNAL2 as a novel gene affecting Conscientiousness. PMID:21173776
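
    The weighted inverse variance method named in this abstract can be illustrated with a short sketch. This is a minimal fixed-effect version in Python, not the authors' pipeline; the function name `inverse_variance_meta` and the example numbers are assumptions chosen for illustration. Each study's effect estimate is weighted by the reciprocal of its squared standard error, and the two-sided P value comes from a normal approximation.

```python
import math


def inverse_variance_meta(betas, ses):
    """Fixed-effect inverse-variance weighted meta-analysis (illustrative).

    betas: per-study effect estimates for one SNP
    ses:   per-study standard errors (same order as betas)
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    if not betas or len(betas) != len(ses):
        raise ValueError("betas and ses must be non-empty and equal in length")
    # Weight each study by the inverse of its sampling variance.
    weights = [1.0 / (se * se) for se in ses]
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = pooled_beta / pooled_se
    # Two-sided P value under the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p


if __name__ == "__main__":
    # Hypothetical per-study estimates for a single SNP.
    beta, se, z, p = inverse_variance_meta([0.10, 0.20], [0.05, 0.05])
    print(f"beta={beta:.3f} se={se:.4f} z={z:.2f} p={p:.2e}")
```

    A pooled P value would then be compared with the genome-wide threshold of 5 × 10⁻⁸ used in these studies.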

  1. Genome-wide association analysis identifies three new susceptibility loci for childhood body mass index.

    PubMed

    Felix, Janine F; Bradfield, Jonathan P; Monnereau, Claire; van der Valk, Ralf J P; Stergiakouli, Evie; Chesi, Alessandra; Gaillard, Romy; Feenstra, Bjarke; Thiering, Elisabeth; Kreiner-Møller, Eskil; Mahajan, Anubha; Pitkänen, Niina; Joro, Raimo; Cavadino, Alana; Huikari, Ville; Franks, Steve; Groen-Blokhuis, Maria M; Cousminer, Diana L; Marsh, Julie A; Lehtimäki, Terho; Curtin, John A; Vioque, Jesus; Ahluwalia, Tarunveer S; Myhre, Ronny; Price, Thomas S; Vilor-Tejedor, Natalia; Yengo, Loïc; Grarup, Niels; Ntalla, Ioanna; Ang, Wei; Atalay, Mustafa; Bisgaard, Hans; Blakemore, Alexandra I; Bonnefond, Amelie; Carstensen, Lisbeth; Eriksson, Johan; Flexeder, Claudia; Franke, Lude; Geller, Frank; Geserick, Mandy; Hartikainen, Anna-Liisa; Haworth, Claire M A; Hirschhorn, Joel N; Hofman, Albert; Holm, Jens-Christian; Horikoshi, Momoko; Hottenga, Jouke Jan; Huang, Jinyan; Kadarmideen, Haja N; Kähönen, Mika; Kiess, Wieland; Lakka, Hanna-Maaria; Lakka, Timo A; Lewin, Alexandra M; Liang, Liming; Lyytikäinen, Leo-Pekka; Ma, Baoshan; Magnus, Per; McCormack, Shana E; McMahon, George; Mentch, Frank D; Middeldorp, Christel M; Murray, Clare S; Pahkala, Katja; Pers, Tune H; Pfäffle, Roland; Postma, Dirkje S; Power, Christine; Simpson, Angela; Sengpiel, Verena; Tiesler, Carla M T; Torrent, Maties; Uitterlinden, André G; van Meurs, Joyce B; Vinding, Rebecca; Waage, Johannes; Wardle, Jane; Zeggini, Eleftheria; Zemel, Babette S; Dedoussis, George V; Pedersen, Oluf; Froguel, Philippe; Sunyer, Jordi; Plomin, Robert; Jacobsson, Bo; Hansen, Torben; Gonzalez, Juan R; Custovic, Adnan; Raitakari, Olli T; Pennell, Craig E; Widén, Elisabeth; Boomsma, Dorret I; Koppelman, Gerard H; Sebert, Sylvain; Järvelin, Marjo-Riitta; Hyppönen, Elina; McCarthy, Mark I; Lindi, Virpi; Harri, Niinikoski; Körner, Antje; Bønnelykke, Klaus; Heinrich, Joachim; Melbye, Mads; Rivadeneira, Fernando; Hakonarson, Hakon; Ring, Susan M; Smith, George Davey; Sørensen, Thorkild I A; Timpson, Nicholas J; Grant, Struan F A; Jaddoe, Vincent W V

    2016-01-15

    A large number of genetic loci are associated with adult body mass index. However, the genetics of childhood body mass index are largely unknown. We performed a meta-analysis of genome-wide association studies of childhood body mass index, using sex- and age-adjusted standard deviation scores. We included 35 668 children from 20 studies in the discovery phase and 11 873 children from 13 studies in the replication phase. In total, 15 loci reached genome-wide significance (P-value < 5 × 10(-8)) in the joint discovery and replication analysis, of which 12 are previously identified loci in or close to ADCY3, GNPDA2, TMEM18, SEC16B, FAIM2, FTO, TFAP2B, TNNI3K, MC4R, GPR61, LMX1B and OLFM4 associated with adult body mass index or childhood obesity. We identified three novel loci: rs13253111 near ELP3, rs8092503 near RAB27B and rs13387838 near ADAM23. Per additional risk allele, body mass index increased 0.04 Standard Deviation Score (SDS) [Standard Error (SE) 0.007], 0.05 SDS (SE 0.008) and 0.14 SDS (SE 0.025), for rs13253111, rs8092503 and rs13387838, respectively. A genetic risk score combining all 15 SNPs showed that each additional average risk allele was associated with a 0.073 SDS (SE 0.011, P-value = 3.12 × 10(-10)) increase in childhood body mass index in a population of 1955 children. This risk score explained 2% of the variance in childhood body mass index. This study highlights the shared genetic background between childhood and adult body mass index and adds three novel loci. These loci likely represent age-related differences in strength of the associations with body mass index. PMID:26604143
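
    The genetic risk score described in this abstract can be sketched in a few lines. The abstract does not spell out how the score was constructed, so this is only an assumed illustration: an unweighted score counts risk alleles across SNPs, and a weighted score multiplies each count by a per-allele effect size. The function name `genetic_risk_score` is hypothetical; the three example weights are the per-allele SDS effects reported above for the novel loci.

```python
def genetic_risk_score(risk_allele_counts, weights=None):
    """Sum risk alleles across SNPs, optionally weighted by effect size.

    risk_allele_counts: number of risk alleles (0, 1 or 2) at each SNP
    weights:            optional per-allele effect sizes, one per SNP
    """
    for count in risk_allele_counts:
        if count not in (0, 1, 2):
            raise ValueError("each risk allele count must be 0, 1 or 2")
    if weights is None:
        # Unweighted score: total number of risk alleles carried.
        return float(sum(risk_allele_counts))
    if len(weights) != len(risk_allele_counts):
        raise ValueError("weights and counts must have the same length")
    return sum(c * w for c, w in zip(risk_allele_counts, weights))


# Per-allele effects (in SDS) reported in the abstract for the three novel loci.
NOVEL_SNP_EFFECTS = {
    "rs13253111": 0.04,
    "rs8092503": 0.05,
    "rs13387838": 0.14,
}

if __name__ == "__main__":
    # Hypothetical child carrying 0, 1 and 2 risk alleles at the three SNPs.
    counts = [0, 1, 2]
    print(genetic_risk_score(counts))
    print(genetic_risk_score(counts, list(NOVEL_SNP_EFFECTS.values())))
```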

  2. Genome-wide meta-analysis of common variant differences between men and women

    PubMed Central

    Boraska, Vesna; Jerončić, Ana; Colonna, Vincenza; Southam, Lorraine; Nyholt, Dale R.; William Rayner, Nigel; Perry, John R.B.; Toniolo, Daniela; Albrecht, Eva; Ang, Wei; Bandinelli, Stefania; Barbalic, Maja; Barroso, Inês; Beckmann, Jacques S.; Biffar, Reiner; Boomsma, Dorret; Campbell, Harry; Corre, Tanguy; Erdmann, Jeanette; Esko, Tõnu; Fischer, Krista; Franceschini, Nora; Frayling, Timothy M.; Girotto, Giorgia; Gonzalez, Juan R.; Harris, Tamara B.; Heath, Andrew C.; Heid, Iris M.; Hoffmann, Wolfgang; Hofman, Albert; Horikoshi, Momoko; Hua Zhao, Jing; Jackson, Anne U.; Hottenga, Jouke-Jan; Jula, Antti; Kähönen, Mika; Khaw, Kay-Tee; Kiemeney, Lambertus A.; Klopp, Norman; Kutalik, Zoltán; Lagou, Vasiliki; Launer, Lenore J.; Lehtimäki, Terho; Lemire, Mathieu; Lokki, Marja-Liisa; Loley, Christina; Luan, Jian'an; Mangino, Massimo; Mateo Leach, Irene; Medland, Sarah E.; Mihailov, Evelin; Montgomery, Grant W.; Navis, Gerjan; Newnham, John; Nieminen, Markku S.; Palotie, Aarno; Panoutsopoulou, Kalliope; Peters, Annette; Pirastu, Nicola; Polašek, Ozren; Rehnström, Karola; Ripatti, Samuli; Ritchie, Graham R.S.; Rivadeneira, Fernando; Robino, Antonietta; Samani, Nilesh J.; Shin, So-Youn; Sinisalo, Juha; Smit, Johannes H.; Soranzo, Nicole; Stolk, Lisette; Swinkels, Dorine W.; Tanaka, Toshiko; Teumer, Alexander; Tönjes, Anke; Traglia, Michela; Tuomilehto, Jaakko; Valsesia, Armand; van Gilst, Wiek H.; van Meurs, Joyce B.J.; Smith, Albert Vernon; Viikari, Jorma; Vink, Jacqueline M.; Waeber, Gerard; Warrington, Nicole M.; Widen, Elisabeth; Willemsen, Gonneke; Wright, Alan F.; Zanke, Brent W.; Zgaga, Lina; Boehnke, Michael; d'Adamo, Adamo Pio; de Geus, Eco; Demerath, Ellen W.; den Heijer, Martin; Eriksson, Johan G.; Ferrucci, Luigi; Gieger, Christian; Gudnason, Vilmundur; Hayward, Caroline; Hengstenberg, Christian; Hudson, Thomas J.; Järvelin, Marjo-Riitta; Kogevinas, Manolis; Loos, Ruth J.F.; Martin, Nicholas G.; Metspalu, Andres; Pennell, Craig E.; Penninx, Brenda W.; Perola, Markus; Raitakari, Olli; Salomaa, Veikko; Schreiber, Stefan; Schunkert, Heribert; Spector, Tim D.; Stumvoll, Michael; Uitterlinden, André G.; Ulivi, Sheila; van der Harst, Pim; Vollenweider, Peter; Völzke, Henry; Wareham, Nicholas J.; Wichmann, H.-Erich; Wilson, James F.; Rudan, Igor; Xue, Yali; Zeggini, Eleftheria

    2012-01-01

    The male-to-female sex ratio at birth is constant across world populations with an average of 1.06 (106 male to 100 female live births) for populations of European descent. The sex ratio is considered to be affected by numerous biological and environmental factors and to have a heritable component. The aim of this study was to investigate the presence of common allele modest effects at autosomal and chromosome X variants that could explain the observed sex ratio at birth. We conducted a large-scale genome-wide association scan (GWAS) meta-analysis across 51 studies, comprising overall 114 863 individuals (61 094 women and 53 769 men) of European ancestry and 2 623 828 common (minor allele frequency >0.05) single-nucleotide polymorphisms (SNPs). Allele frequencies were compared between men and women for directly-typed and imputed variants within each study. Forward-time simulations for unlinked, neutral, autosomal, common loci were performed under the demographic model for European populations with a fixed sex ratio and a random mating scheme to assess the probability of detecting significant allele frequency differences. We do not detect any genome-wide significant (P < 5 × 10⁻⁸) common SNP differences between men and women in this well-powered meta-analysis. The simulated data provided results entirely consistent with these findings. This large-scale investigation across ∼115 000 individuals shows no detectable contribution from common genetic variants to the observed skew in the sex ratio. The absence of sex-specific differences is useful in guiding genetic association study design, for example when using mixed controls for sex-biased traits. PMID:22843499
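
    The within-study comparison of allele frequencies between men and women can be pictured with a simple test. The abstract does not state which statistic was used, so the following Python sketch assumes a two-proportion z-test on allele counts at one autosomal SNP; the function name `allele_frequency_difference_test` and the example counts are hypothetical.

```python
import math

# Conventional genome-wide significance threshold quoted in the abstract.
GENOME_WIDE_ALPHA = 5e-8


def allele_frequency_difference_test(alt_men, total_men, alt_women, total_women):
    """Two-proportion z-test on allele counts (illustrative).

    alt_*:   number of copies of the tested allele observed in each sex
    total_*: total number of alleles observed in each sex
             (two per person for an autosomal SNP)
    Returns (frequency_difference, z, two_sided_p).
    """
    freq_men = alt_men / total_men
    freq_women = alt_women / total_women
    pooled = (alt_men + alt_women) / (total_men + total_women)
    se = math.sqrt(pooled * (1.0 - pooled) * (1.0 / total_men + 1.0 / total_women))
    if se == 0.0:
        # Monomorphic site: no difference can be tested.
        return freq_men - freq_women, 0.0, 1.0
    z = (freq_men - freq_women) / se
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return freq_men - freq_women, z, p


if __name__ == "__main__":
    diff, z, p = allele_frequency_difference_test(300, 1000, 200, 1000)
    print(f"diff={diff:.3f} z={z:.2f} p={p:.2e} "
          f"genome-wide significant: {p < GENOME_WIDE_ALPHA}")
```

    Chromosome X variants would need different allele totals in men, who carry one copy; that case is not handled here.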

  3. Human genome-wide expression analysis reorients the study of inflammatory mediators and biomechanics in osteoarthritis.

    PubMed

    Sandy, J D; Chan, D D; Trevino, R L; Wimmer, M A; Plaas, A

    2015-11-01

    A major objective of this article is to examine the research implications of recently available genome-wide expression profiles of cartilage from human osteoarthritis (OA) joints. We propose that, when viewed in the light of extensive earlier work, this novel data provides a unique opportunity to reorient the design of experimental systems toward clinical relevance. Specifically, in the area of cartilage explant biology, this will require a fresh evaluation of existing paradigms, so as to optimize the choices of tissue source, cytokine/growth factor/nutrient addition, and biomechanical environment for discovery. Within this context, we firstly discuss the literature on the nature and role of potential catabolic mediators in OA pathology, including data from human OA cartilage, animal models of OA, and ex vivo studies. Secondly, due to the number and breadth of studies on IL-1β in this area, a major focus of the article is a critical analysis of the design and interpretation of cartilage studies where IL-1β has been used as a model cytokine. Thirdly, the article provides a data-driven perspective (including genome-wide analysis of clinical samples, studies on mutant mice, and clinical trials), which concludes that IL-1β should be replaced by soluble mediators such as IL-17 or TGF-β1, which are much more likely to mimic the disease in OA model systems. We also discuss the evidence that changes in early OA can be attributed to the activity of such soluble mediators, whereas late-stage disease results more from a chronic biomechanical effect on the matrix and cells of the remaining cartilage and on other local mediator-secreting cells. Lastly, an updated protocol for in vitro studies with cartilage explants and chondrocytes (including the use of specific gene expression arrays) is provided to motivate more disease-relevant studies on the interplay of cytokines, growth factors, and biomechanics on cellular behavior. PMID:26521740

  4. Mouse Genome-Wide Association Mapping Needs Linkage Analysis to Avoid False-Positive Loci

    PubMed Central

    Manenti, Giacomo; Galvan, Antonella; Pettinicchio, Angela; Trincucci, Gaia; Spada, Elena; Zolin, Anna; Milani, Silvano; Gonzalez-Neira, Anna; Dragani, Tommaso A.

    2009-01-01

    We carried out genome-wide association (GWA) studies in inbred mouse strains characterized for their lung tumor susceptibility phenotypes (spontaneous or urethane-induced) with panels of 12,959 (13K) or 138,793 (140K) single-nucleotide polymorphisms (SNPs). Above the statistical thresholds, we detected only SNP rs3681853 on Chromosome 5, two SNPs in the pulmonary adenoma susceptibility 1 (Pas1) locus, and SNP rs4174648 on Chromosome 16 for spontaneous tumor incidence, urethane-induced tumor incidence, and urethane-induced tumor multiplicity, respectively, with the 13K SNP panel, but only the Pas1 locus with the 140K SNP panel. Haplotype analysis carried out in the latter panel detected four additional loci. Loci reported in previous GWA studies failed to replicate. Genome-wide genetic linkage analysis in urethane-treated (BALB/c×C3H/He)F2, (BALB/c×SWR/J)F2, and (A/J×C3H/He)F2 mice showed that Pas1, but none of the other loci detected previously or herein by GWA, had a significant effect. The Lasc1 gene, identified by GWA as a functional element (Nat. Genet., 38:888–95, 2006), showed no genetic effects in the two independent intercross mouse populations containing both alleles, nor was it expressed in mouse normal lung or lung tumors. Our results indicate that GWA studies in mouse inbred strains can suffer a high rate of false-positive results and that such an approach should be used in conjunction with classical linkage mapping in genetic crosses. PMID:19132132

  5. Genome-wide recombination and chromosome segregation in human oocytes and embryos reveal selection for maternal recombination rates

    PubMed Central

    Natesan, Senthilkumar A.; Joshi, Hrishikesh A.; Cimadomo, Danilo; Griffin, Darren K.; Sage, Karen; Summers, Michael C.; Thornhill, Alan R.; Housworth, Elizabeth; Herbert, Alex D.; Rienzi, Laura; Ubaldi, Filippo M.; Handyside, Alan H.; Hoffmann, Eva R.

    2015-01-01

    Crossover recombination reshuffles genes and prevents errors in segregation that lead to extra or missing chromosomes (aneuploidy) in human eggs, a major cause of pregnancy failure and congenital disorders. Here, we generate genome-wide maps of crossovers and chromosome segregation patterns by recovering all three products of single female meioses. Genotyping > 4 million informative single-nucleotide polymorphisms (SNPs) from 23 complete meioses allowed us to map 2,032 maternal and 1,342 paternal crossovers and to infer the segregation patterns of 529 chromosome pairs. We uncover a novel reverse chromosome segregation pattern in which both homologs separate their sister chromatids at meiosis I; detect selection for higher recombination rates in the female germline by the elimination of aneuploid embryos; and report chromosomal drive against non-recombinant chromatids at meiosis II. Collectively, our findings reveal that recombination not only affects homolog segregation at meiosis I but also the fate of sister chromatids at meiosis II. PMID:25985139

  6. What Cure Models Can Teach us About Genome-Wide Survival Analysis.

    PubMed

    Stringer, Sven; Denys, Damiaan; Kahn, René S; Derks, Eske M

    2016-03-01

    The aim of logistic regression is to estimate genetic effects on disease risk, while survival analysis aims to determine effects on age of onset. In practice, genetic variants may affect both types of outcomes. A cure survival model analyzes logistic and survival effects simultaneously. The aim of this simulation study is to assess the performance of logistic regression and traditional survival analysis under a cure model and to investigate the benefits of cure survival analysis. We simulated data under a cure model and varied the percentage of subjects at risk for disease (cure fraction), the logistic and survival effect sizes, and the contribution of genetic background risk factors. We then computed the error rates and estimation bias of logistic, Cox proportional hazards (PH), and cure PH analysis, respectively. The power of logistic and Cox PH analysis is sensitive to the cure fraction and background heritability. Our results show that traditional Cox PH analysis may erroneously detect age of onset effects if no such effects are present in the data. In the presence of genetic background risk even the cure model results in biased estimates of both the odds ratio and the hazard ratio. Cure survival analysis takes cure fractions into account and can be used to simultaneously estimate the effect of genetic variants on disease risk and age of onset. Since genome-wide cure survival analysis is not computationally feasible, we recommend this analysis for genetic variants that are significant in a traditional survival analysis. PMID:26552795
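
    The mixture structure behind a cure model can be sketched as a small simulation. This is a minimal illustration, not the authors' simulation code: the function name, the exponential onset distribution, the fixed follow-up time, and all parameter values are assumptions, and no genetic effects are modelled here.

```python
import random

def simulate_cure_data(n, at_risk_prob, hazard, follow_up, seed=0):
    """Simulate (time, event) pairs under a simple mixture cure model.

    Assumptions (hypothetical, not taken from the abstract):
    - each subject is at risk with probability at_risk_prob;
    - at-risk subjects have exponential age of onset with rate `hazard`;
    - subjects not at risk never develop the disease;
    - everyone is censored at `follow_up`.
    """
    rng = random.Random(seed)
    records = []
    for _ in range(n):
        susceptible = rng.random() < at_risk_prob
        onset = rng.expovariate(hazard) if susceptible else float("inf")
        event = onset <= follow_up
        time = onset if event else follow_up
        records.append((time, event))
    return records

# Example: roughly 30% of subjects at risk, 10 time units of follow-up.
data = simulate_cure_data(n=1000, at_risk_prob=0.3, hazard=0.2, follow_up=10.0)
observed_fraction = sum(event for _, event in data) / len(data)
```

    In such data the observed event fraction can never exceed the at-risk fraction, which is the feature that a logistic-only or survival-only analysis does not model jointly.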

  7. Genome-wide association analysis of canine atopic dermatitis and identification of disease related SNPs.

    PubMed

    Wood, Shona Hiedi; Ke, Xiayi; Nuttall, Tim; McEwan, Neil; Ollier, William E; Carter, Stuart D

    2009-12-01

    In humans, genome-wide association studies (GWAS) have been shown to be an effective and thorough approach for identifying polymorphisms associated with disease phenotypes. Here, we describe the first study to perform a genome-wide association study in canine atopic dermatitis (cAD) using the Illumina Canine SNP20 array, containing 22,362 single-nucleotide polymorphisms (SNPs). The aim of the study was to identify SNPs associated with cAD using affected and unaffected Golden Retrievers. Further validation studies were performed for potentially associated SNPs using Sequenom genotyping of larger numbers of cases and controls across eight breeds (Boxer, German Shepherd Dog, Labrador, Golden Retriever, Shiba Inu, Shih Tzu, Pit Bull, and West Highland White Terriers). Using meta-analysis, two SNPs were associated with cAD in all breeds tested. RS22114085 was identified as a susceptibility locus (p=0.00014, odds ratio=2) and RS23472497 as a protective locus (p=0.0015, odds ratio=0.6). Both of these SNPs were located in intergenic regions, and their effects have been demonstrated to be independent of each other, highlighting that further fine mapping and resequencing is required of these areas. Further, 12 SNPs were validated by Sequenom genotyping as associated with cAD, but these were not associated with all breeds. This study suggests that GWAS will be a useful approach for identifying genetic risk factors for cAD. Given the clinical heterogeneity within this condition and the likelihood that the relative genetic effect sizes are small, greater sample sizes and further studies will be required. PMID:19838693

  8. A "candidate-interactome" aggregate analysis of genome-wide association data in multiple sclerosis.

    PubMed

    Mechelli, Rosella; Umeton, Renato; Policano, Claudia; Annibali, Viviana; Coarelli, Giulia; Ricigliano, Vito A G; Vittori, Danila; Fornasiero, Arianna; Buscarinu, Maria Chiara; Romano, Silvia; Salvetti, Marco; Ristori, Giovanni

    2013-01-01

    Though difficult, the study of gene-environment interactions in multifactorial diseases is crucial for interpreting the relevance of non-heritable factors and prevents overlooking genetic associations with small but measurable effects. We propose a "candidate interactome" (i.e. a group of genes whose products are known to physically interact with environmental factors that may be relevant for disease pathogenesis) analysis of genome-wide association data in multiple sclerosis. We looked for statistical enrichment of associations among interactomes that, at the current state of knowledge, may be representative of gene-environment interactions of potential, uncertain or unlikely relevance for multiple sclerosis pathogenesis: Epstein-Barr virus, human immunodeficiency virus, hepatitis B virus, hepatitis C virus, cytomegalovirus, HHV8-Kaposi sarcoma, H1N1-influenza, JC virus, human innate immunity interactome for type I interferon, autoimmune regulator, vitamin D receptor, aryl hydrocarbon receptor and a panel of proteins targeted by 70 innate immune-modulating viral open reading frames from 30 viral species. Interactomes were either obtained from the literature or were manually curated. The P values of all single nucleotide polymorphisms mapping to a given interactome were obtained from the last genome-wide association study of the International Multiple Sclerosis Genetics Consortium & the Wellcome Trust Case Control Consortium 2. The interaction between genotype and Epstein-Barr virus emerges as relevant for multiple sclerosis etiology. However, in line with recent data on the coexistence of common and unique strategies used by viruses to perturb the human molecular system, other viruses also have a similar potential, though probably less relevant in epidemiological terms. PMID:23696811

  9. Genome-wide promoter methylation analysis in neuroblastoma identifies prognostic methylation biomarkers

    PubMed Central

    2012-01-01

    Background Accurate outcome prediction in neuroblastoma, which is necessary to enable the optimal choice of risk-related therapy, remains a challenge. To improve neuroblastoma patient stratification, this study aimed to identify prognostic tumor DNA methylation biomarkers. Results To identify genes silenced by promoter methylation, we first applied two independent genome-wide methylation screening methodologies to eight neuroblastoma cell lines. Specifically, we used re-expression profiling upon 5-aza-2'-deoxycytidine (DAC) treatment and massively parallel sequencing after capturing with a methyl-CpG-binding domain (MBD-seq). Putative methylation markers were selected from DAC-upregulated genes through a literature search and an upfront methylation-specific PCR on 20 primary neuroblastoma tumors, as well as through MBD-seq in combination with publicly available neuroblastoma tumor gene expression data. This yielded 43 candidate biomarkers that were subsequently tested by high-throughput methylation-specific PCR on an independent cohort of 89 primary neuroblastoma tumors that had been selected for risk classification and survival. Based on this analysis, methylation of KRT19, FAS, PRPH, CNR1, QPCT, HIST1H3C, ACSS3 and GRB10 was found to be associated with at least one of the classical risk factors, namely age, stage or MYCN status. Importantly, HIST1H3C and GNAS methylation was associated with overall and/or event-free survival. Conclusions This study combines two genome-wide methylation discovery methodologies and is the most extensive validation study in neuroblastoma performed thus far. We identified several novel prognostic DNA methylation markers and provide a basis for the development of a DNA methylation-based prognostic classifier in neuroblastoma. PMID:23034519

  10. A “Candidate-Interactome” Aggregate Analysis of Genome-Wide Association Data in Multiple Sclerosis

    PubMed Central

    Policano, Claudia; Annibali, Viviana; Coarelli, Giulia; Ricigliano, Vito A. G.; Vittori, Danila; Fornasiero, Arianna; Buscarinu, Maria Chiara; Romano, Silvia; Salvetti, Marco; Ristori, Giovanni

    2013-01-01

    Though difficult, the study of gene-environment interactions in multifactorial diseases is crucial for interpreting the relevance of non-heritable factors and prevents overlooking genetic associations with small but measurable effects. We propose a “candidate interactome” (i.e. a group of genes whose products are known to physically interact with environmental factors that may be relevant for disease pathogenesis) analysis of genome-wide association data in multiple sclerosis. We looked for statistical enrichment of associations among interactomes that, at the current state of knowledge, may be representative of gene-environment interactions of potential, uncertain or unlikely relevance for multiple sclerosis pathogenesis: Epstein-Barr virus, human immunodeficiency virus, hepatitis B virus, hepatitis C virus, cytomegalovirus, HHV8-Kaposi sarcoma, H1N1-influenza, JC virus, human innate immunity interactome for type I interferon, autoimmune regulator, vitamin D receptor, aryl hydrocarbon receptor and a panel of proteins targeted by 70 innate immune-modulating viral open reading frames from 30 viral species. Interactomes were either obtained from the literature or were manually curated. The P values of all single nucleotide polymorphisms mapping to a given interactome were obtained from the last genome-wide association study of the International Multiple Sclerosis Genetics Consortium & the Wellcome Trust Case Control Consortium 2. The interaction between genotype and Epstein-Barr virus emerges as relevant for multiple sclerosis etiology. However, in line with recent data on the coexistence of common and unique strategies used by viruses to perturb the human molecular system, other viruses also have a similar potential, though probably less relevant in epidemiological terms. PMID:23696811

  11. Analysis of genome-wide structure, diversity and fine mapping of Mendelian traits in traditional and village chickens

    PubMed Central

    Wragg, D; Mwacharo, J M; Alcalde, J A; Hocking, P M; Hanotte, O

    2012-01-01

    Extensive phenotypic variation is a common feature among village chickens found throughout much of the developing world, and in traditional chicken breeds that have been artificially selected for traits such as plumage variety. We present here an assessment of traditional and village chicken populations, for fine mapping of Mendelian traits using genome-wide single-nucleotide polymorphism (SNP) genotyping while providing information on their genetic structure and diversity. Bayesian clustering analysis reveals two main genetic backgrounds in traditional breeds, Kenyan, Ethiopian and Chilean village chickens. Analysis of linkage disequilibrium (LD) reveals useful LD (r² ≥ 0.3) in both traditional and village chickens at pairwise marker distances of ∼10 Kb; while haplotype block analysis indicates a median block size of 11–12 Kb. Association mapping yielded refined mapping intervals for duplex comb (Gga 2:38.55–38.89 Mb) and rose comb (Gga 7:18.41–22.09 Mb) phenotypes in traditional breeds. Combined mapping information from traditional breeds and Chilean village chicken allows the oocyan phenotype to be fine mapped to two small regions (Gga 1:67.25–67.28 Mb, Gga 1:67.28–67.32 Mb) totalling ∼75 Kb. Mapping the unmapped earlobe pigmentation phenotype supports previous findings that the trait is sex-linked and polygenic. A critical assessment of the number of SNPs required to map simple traits indicates that between 90 and 110K SNPs are required for full genome-wide analysis of haplotype block structure/ancestry, and for association mapping in both traditional and village chickens. Our results demonstrate the importance and uniqueness of phenotypic diversity and genetic structure of traditional chicken breeds for fine-scale mapping of Mendelian traits in the species, with village chicken populations providing further opportunities to enhance mapping resolutions. PMID:22395157

  12. Analysis of genome-wide structure, diversity and fine mapping of Mendelian traits in traditional and village chickens.

    PubMed

    Wragg, D; Mwacharo, J M; Alcalde, J A; Hocking, P M; Hanotte, O

    2012-07-01

    Extensive phenotypic variation is a common feature among village chickens found throughout much of the developing world, and in traditional chicken breeds that have been artificially selected for traits such as plumage variety. We present here an assessment of traditional and village chicken populations, for fine mapping of Mendelian traits using genome-wide single-nucleotide polymorphism (SNP) genotyping while providing information on their genetic structure and diversity. Bayesian clustering analysis reveals two main genetic backgrounds in traditional breeds, Kenyan, Ethiopian and Chilean village chickens. Analysis of linkage disequilibrium (LD) reveals useful LD (r² ≥ 0.3) in both traditional and village chickens at pairwise marker distances of ~10 Kb; while haplotype block analysis indicates a median block size of 11-12 Kb. Association mapping yielded refined mapping intervals for duplex comb (Gga 2:38.55-38.89 Mb) and rose comb (Gga 7:18.41-22.09 Mb) phenotypes in traditional breeds. Combined mapping information from traditional breeds and Chilean village chicken allows the oocyan phenotype to be fine mapped to two small regions (Gga 1:67.25-67.28 Mb, Gga 1:67.28-67.32 Mb) totalling ~75 Kb. Mapping the unmapped earlobe pigmentation phenotype supports previous findings that the trait is sex-linked and polygenic. A critical assessment of the number of SNPs required to map simple traits indicates that between 90 and 110K SNPs are required for full genome-wide analysis of haplotype block structure/ancestry, and for association mapping in both traditional and village chickens. Our results demonstrate the importance and uniqueness of phenotypic diversity and genetic structure of traditional chicken breeds for fine-scale mapping of Mendelian traits in the species, with village chicken populations providing further opportunities to enhance mapping resolutions. PMID:22395157

  13. Pathway Analysis Based on a Genome-Wide Association Study of Polycystic Ovary Syndrome

    PubMed Central

    Shim, Unjin; Kim, Han-Na; Lee, Hyejin; Oh, Jee-Young

    2015-01-01

    Background Polycystic ovary syndrome (PCOS) is one of the most common endocrine disorders in women of reproductive age, and it is affected by both environmental and genetic factors. Although the genetic component of PCOS is evident, studies aiming to identify susceptibility genes have shown controversial results. This study conducted a pathway-based analysis using a dataset obtained through a genome-wide association study (GWAS) to elucidate the biological pathways that contribute to PCOS susceptibility and the associated genes. Methods We used GWAS data on 636,797 autosomal single nucleotide polymorphisms (SNPs) from 1,221 individuals (432 PCOS patients and 789 controls) for analysis. A pathway analysis was conducted using meta-analysis gene-set enrichment of variant associations (MAGENTA). Top-ranking pathways or gene sets associated with PCOS were identified, and significant genes within the pathways were analyzed. Results The pathway analysis of the GWAS dataset identified significant pathways related to oocyte meiosis and the regulation of insulin secretion by acetylcholine and free fatty acids (all nominal gene-set enrichment analysis (GSEA) P-values < 0.05). In addition, INS, GNAQ, STXBP1, PLCB3, PLCB2, SMC3 and PLCZ1 were significant genes observed within the biological pathways (all gene P-values < 0.05). Conclusions By applying MAGENTA pathway analysis to PCOS GWAS data, we identified significant pathways and candidate genes involved in PCOS. Our findings may provide new leads for understanding the mechanisms underlying the development of PCOS. PMID:26308735

  14. Gene set analysis of genome-wide association studies: methodological issues and perspectives

    PubMed Central

    Wang, Lily; Jia, Peilin; Wolfinger, Russell D; Chen, Xi; Zhao, Zhongming

    2013-01-01

    Recent studies have demonstrated that gene set analysis, which tests disease association with genetic variants in a group of functionally related genes, is a promising approach for analyzing and interpreting genome-wide association studies (GWAS) data. These approaches aim to increase power by combining association signals from multiple genes in the same gene set. In addition, gene set analysis can also shed more light on the biological processes underlying complex diseases. However, current approaches for gene set analysis are still in an early stage of development in that analysis results are often prone to sources of bias, including gene set size and gene length, linkage disequilibrium patterns and the presence of overlapping genes. In this paper, we provide an in-depth review of the gene set analysis procedures, along with parameter choices and the particular methodology challenges at each stage. In addition to providing a survey of recently developed tools, we also classify the analysis methods into larger categories and discuss their strengths and limitations. In the last section, we outline several important areas for improving the analytical strategies in gene set analysis. PMID:21565265

  15. A genome-wide map of hyper-edited RNA reveals numerous new sites.

    PubMed

    Porath, Hagit T; Carmi, Shai; Levanon, Erez Y

    2014-01-01

    Adenosine-to-inosine editing is one of the most frequent post-transcriptional modifications, manifested as A-to-G mismatches when comparing RNA sequences with their source DNA. Recently, a number of RNA-seq data sets have been screened for the presence of A-to-G editing, and hundreds of thousands of editing sites identified. Here we show that existing screens missed the majority of sites by ignoring reads with excessive ('hyper') editing that do not easily align to the genome. We show that careful alignment and examination of the unmapped reads in RNA-seq studies reveal numerous new sites, usually many more than originally discovered, and in precisely those regions that are most heavily edited. Specifically, we discover 327,096 new editing sites in the heavily studied Illumina Human BodyMap data and more than double the number of detected sites in several published screens. We also identify thousands of new sites in mouse, rat, opossum and fly. Our results establish that hyper-editing events account for the majority of editing sites. PMID:25158696

  16. A genome-wide survey reveals abundant rice blast R genes in resistant cultivars.

    PubMed

    Zhang, Xiaohui; Yang, Sihai; Wang, Jiao; Jia, Yanxiao; Huang, Ju; Tan, Shengjun; Zhong, Yan; Wang, Ling; Gu, Longjiang; Chen, Jian-Qun; Pan, Qinghua; Bergelson, Joy; Tian, Dacheng

    2015-10-01

    Plant resistance genes (R genes) harbor tremendous allelic diversity, constituting a robust immune system effective against microbial pathogens. Nevertheless, few functional R genes have been identified for even the best-studied pathosystems. Does this limited repertoire reflect specificity, with most R genes having been defeated by former pests, or do plants harbor a rich diversity of functional R genes, the composite behavior of which is yet to be characterized? Here, we survey 332 NBS-LRR genes cloned from five resistant Oryza sativa (rice) cultivars for their ability to confer recognition of 12 rice blast isolates when transformed into susceptible cultivars. Our survey reveals that 48.5% of the 132 NBS-LRR loci tested contain functional rice blast R genes, with most R genes deriving from multi-copy clades containing especially diversified loci. Each R gene recognized, on average, 2.42 of the 12 isolates screened. The abundant R genes identified in resistant genomes provide extraordinary redundancy in the ability of host genotypes to recognize particular isolates. If the same is true for other pathogens, many extant NBS-LRR genes retain functionality. Our success at identifying rice blast R genes also validates a highly efficient cloning and screening strategy. PMID:26248689

  17. Genome-wide DNA methylation map of human neutrophils reveals widespread inter-individual epigenetic variation

    PubMed Central

    Chatterjee, Aniruddha; Stockwell, Peter A.; Rodger, Euan J.; Duncan, Elizabeth J.; Parry, Matthew F.; Weeks, Robert J.; Morison, Ian M.

    2015-01-01

    The extent of variation in DNA methylation patterns in healthy individuals is not yet well documented. Identification of inter-individual epigenetic variation is important for understanding phenotypic variation and disease susceptibility. Using neutrophils from a cohort of healthy individuals, we generated base-resolution DNA methylation maps to document inter-individual epigenetic variation. We identified 12851 autosomal inter-individual variably methylated fragments (iVMFs). Gene promoters were the least variable, whereas gene body and upstream regions showed higher variation in DNA methylation. The iVMFs were relatively enriched in repetitive elements compared to non-iVMFs, and were associated with genome regulation and chromatin function elements. Further, variably methylated genes were disproportionately associated with regulation of transcription, responsive function and signal transduction pathways. Transcriptome analysis indicates that iVMF methylation at differentially expressed exons has a positive correlation and local effect on the inclusion of that exon in the mRNA transcript. PMID:26612583

  18. Genome-wide Analysis of AP-3–dependent Protein Transport in Yeast

    PubMed Central

    Anand, Vikram C.; Daboussi, Lydia; Lorenz, Todd C.

    2009-01-01

    The evolutionarily conserved adaptor protein-3 (AP-3) complex mediates cargo-selective transport to lysosomes and lysosome-related organelles. To identify proteins that function in AP-3–mediated transport, we performed a genome-wide screen in Saccharomyces cerevisiae for defects in the vacuolar maturation of alkaline phosphatase (ALP), a cargo of the AP-3 pathway. Forty-nine gene deletion strains were identified that accumulated precursor ALP, many with established defects in vacuolar protein transport. Maturation of a vacuolar membrane protein delivered via a separate, clathrin-dependent pathway, was affected in all strains except those with deletions of YCK3, encoding a vacuolar type I casein kinase; SVP26, encoding an endoplasmic reticulum (ER) export receptor for ALP; and AP-3 subunit genes. Subcellular fractionation and fluorescence microscopy revealed ALP transport defects in yck3Δ cells. Characterization of svp26Δ cells revealed a role for Svp26p in ER export of only a subset of type II membrane proteins. Finally, ALP maturation kinetics in vac8Δ and vac17Δ cells suggests that vacuole inheritance is important for rapid generation of proteolytically active vacuolar compartments in daughter cells. We propose that the cargo-selective nature of the AP-3 pathway in yeast is achieved by AP-3 and Yck3p functioning in concert with machinery shared by other vacuolar transport pathways. PMID:19116312

  19. Genome-wide association study for refractive astigmatism reveals genetic co-determination with spherical equivalent refractive error: the CREAM consortium.

    PubMed

    Li, Qing; Wojciechowski, Robert; Simpson, Claire L; Hysi, Pirro G; Verhoeven, Virginie J M; Ikram, Mohammad Kamran; Höhn, René; Vitart, Veronique; Hewitt, Alex W; Oexle, Konrad; Mäkelä, Kari-Matti; MacGregor, Stuart; Pirastu, Mario; Fan, Qiao; Cheng, Ching-Yu; St Pourcain, Beaté; McMahon, George; Kemp, John P; Northstone, Kate; Rahi, Jugnoo S; Cumberland, Phillippa M; Martin, Nicholas G; Sanfilippo, Paul G; Lu, Yi; Wang, Ya Xing; Hayward, Caroline; Polašek, Ozren; Campbell, Harry; Bencic, Goran; Wright, Alan F; Wedenoja, Juho; Zeller, Tanja; Schillert, Arne; Mirshahi, Alireza; Lackner, Karl; Yip, Shea Ping; Yap, Maurice K H; Ried, Janina S; Gieger, Christian; Murgia, Federico; Wilson, James F; Fleck, Brian; Yazar, Seyhan; Vingerling, Johannes R; Hofman, Albert; Uitterlinden, André; Rivadeneira, Fernando; Amin, Najaf; Karssen, Lennart; Oostra, Ben A; Zhou, Xin; Teo, Yik-Ying; Tai, E Shyong; Vithana, Eranga; Barathi, Veluchamy; Zheng, Yingfeng; Siantar, Rosalynn Grace; Neelam, Kumari; Shin, Youchan; Lam, Janice; Yonova-Doing, Ekaterina; Venturini, Cristina; Hosseini, S Mohsen; Wong, Hoi-Suen; Lehtimäki, Terho; Kähönen, Mika; Raitakari, Olli; Timpson, Nicholas J; Evans, David M; Khor, Chiea-Chuen; Aung, Tin; Young, Terri L; Mitchell, Paul; Klein, Barbara; van Duijn, Cornelia M; Meitinger, Thomas; Jonas, Jost B; Baird, Paul N; Mackey, David A; Wong, Tien Yin; Saw, Seang-Mei; Pärssinen, Olavi; Stambolian, Dwight; Hammond, Christopher J; Klaver, Caroline C W; Williams, Cathy; Paterson, Andrew D; Bailey-Wilson, Joan E; Guggenheim, Jeremy A

    2015-02-01

    To identify genetic variants associated with refractive astigmatism in the general population, meta-analyses of genome-wide association studies were performed for: White Europeans aged at least 25 years (20 cohorts, N = 31,968); Asian subjects aged at least 25 years (7 cohorts, N = 9,295); White Europeans aged <25 years (4 cohorts, N = 5,640); and all independent individuals from the above three samples combined with a sample of Chinese subjects aged <25 years (N = 45,931). Participants were classified as cases with refractive astigmatism if the average cylinder power in their two eyes was at least 1.00 diopter and as controls otherwise. Genome-wide association analysis was carried out for each cohort separately using logistic regression. Meta-analysis was conducted using a fixed effects model. In the older European group the most strongly associated marker was downstream of the neurexin-1 (NRXN1) gene (rs1401327, P = 3.92E-8). No other region reached genome-wide significance, and association signals were lower for the younger European group and Asian group. In the meta-analysis of all cohorts, no marker reached genome-wide significance; the most strongly associated regions were NRXN1 (rs1401327, P = 2.93E-07), TOX (rs7823467, P = 3.47E-07) and LINC00340 (rs12212674, P = 1.49E-06). For 34 markers identified in prior GWAS for spherical equivalent refractive error, the beta coefficients for genotype versus spherical equivalent, and genotype versus refractive astigmatism, were highly correlated (r = -0.59, P = 2.10E-04). This work revealed no consistent or strong genetic signals for refractive astigmatism; however, the TOX gene region previously identified in GWAS for spherical equivalent refractive error was the second most strongly associated region. Analysis of additional markers provided evidence supporting widespread genetic co-susceptibility for spherical and astigmatic refractive errors. PMID:25367360
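
    A fixed effects meta-analysis of per-cohort results is commonly computed by inverse-variance weighting. The sketch below assumes that scheme; the abstract does not state which weighting or software was used, and the function name and input values are hypothetical.

```python
import math

def fixed_effect_meta(betas, ses):
    """Combine per-cohort effect estimates by inverse-variance weighting.

    betas: per-cohort log odds ratios (e.g. from logistic regression).
    ses:   their standard errors.
    Returns the pooled estimate, its standard error, the z score and a
    two-sided p-value from the normal approximation.
    """
    weights = [1.0 / se ** 2 for se in ses]
    total = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total
    se = math.sqrt(1.0 / total)
    z = beta / se
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal p-value
    return beta, se, z, p

# Hypothetical per-cohort estimates for a single marker.
pooled_beta, pooled_se, z_score, p_value = fixed_effect_meta(
    betas=[0.1, 0.3], ses=[0.1, 0.1]
)
```

    With equal standard errors the pooled estimate is simply the mean of the cohort estimates; cohorts with smaller standard errors receive proportionally more weight.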

  20. A Genome-Wide Association Study Reveals Dominance Effects on Number of Teats in Pigs

    PubMed Central

    Lopes, Marcos S.; Bastiaansen, John W. M.; Harlizius, Barbara; Knol, Egbert F.; Bovenhuis, Henk

    2014-01-01

    Dominance has been suggested as one of the genetic mechanisms explaining heterosis. However, using traditional quantitative genetic methods it is difficult to obtain accurate estimates of dominance effects. With the availability of dense SNP (Single Nucleotide Polymorphism) panels, we now have new opportunities for the detection and use of dominance at individual loci. Thus, the aim of this study was to detect additive and dominance effects on number of teats (NT), specifically to investigate the importance of dominance in a Landrace-based population of pigs. In total, 1,550 animals, genotyped for 32,911 SNPs, were used in single SNP analysis. SNPs with a significant genetic effect were tested for their mode of gene action being additive, dominant or a combination. In total, 21 SNPs were associated with NT, located in three regions with additive (SSC6, 7 and 12) and one region with dominant effects (SSC4). Estimates of additive effects ranged from 0.24 to 0.29 teats. The dominance effect of the QTL located on SSC4 was negative (−0.26 teats). The additive variance of the four QTLs together explained 7.37% of the total phenotypic variance. The dominance variance of the four QTLs together explained 1.82% of the total phenotypic variance, which corresponds to one-fourth of the variance explained by additive effects. The results suggest that dominance effects play a relevant role in the genetic architecture of NT. The QTL region on SSC7 contains the most promising candidate gene: VRTN. This gene has been suggested to be related to the number of vertebrae, a trait correlated with NT. PMID:25158056
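
    The split of a single locus's genetic variance into additive and dominance parts can be illustrated with the standard quantitative-genetics formulas for a biallelic locus under Hardy-Weinberg proportions. This is a sketch under stated assumptions, not the authors' model: the allele frequency and the effect sizes below are hypothetical values chosen for illustration only.

```python
def locus_variances(p, a, d):
    """Additive and dominance variance for one biallelic locus.

    p: frequency of the reference allele (q = 1 - p).
    a: additive effect (half the difference between the homozygotes).
    d: dominance deviation of the heterozygote from the homozygote mean.
    Assumes Hardy-Weinberg genotype proportions.
    """
    q = 1.0 - p
    alpha = a + d * (q - p)                   # average effect of allele substitution
    var_additive = 2.0 * p * q * alpha ** 2
    var_dominance = (2.0 * p * q * d) ** 2
    return var_additive, var_dominance

# Hypothetical locus: allele frequency 0.5, a = 0.27 teats, d = -0.26 teats.
va, vd = locus_variances(p=0.5, a=0.27, d=-0.26)

# Ratio of the variance shares reported in the abstract (1.82% vs 7.37%),
# which is about one-fourth.
reported_ratio = 1.82 / 7.37
```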

  1. Genome-wide RNAi screen reveals a role for the ESCRT complex in rotavirus cell entry.

    PubMed

    Silva-Ayala, Daniela; López, Tomás; Gutiérrez, Michelle; Perrimon, Norbert; López, Susana; Arias, Carlos F

    2013-06-18

    Rotavirus (RV) is the major cause of childhood gastroenteritis worldwide. This study presents a functional genome-scale analysis of cellular proteins and pathways relevant for RV infection using RNAi. Among the 522 proteins selected in the screen for their ability to affect viral infectivity, an enriched group that participates in endocytic processes was identified. Within these proteins, subunits of the vacuolar ATPase, small GTPases, actinin 4, and, of special interest, components of the endosomal sorting complex required for transport (ESCRT) machinery were found. Here we provide evidence for a role of the ESCRT complex in the entry of simian and human RV strains in both monkey and human epithelial cells. In addition, the ESCRT-associated ATPase VPS4A and phospholipid lysobisphosphatidic acid, both crucial for the formation of intralumenal vesicles in multivesicular bodies, were also found to be required for cell entry. Interestingly, it seems that regardless of the molecules that rhesus RV and human RV strains use for cell-surface attachment and the distinct endocytic pathway used, all these viruses converge in early endosomes and use multivesicular bodies for cell entry. Furthermore, the small GTPases RHOA and CDC42, which regulate different types of clathrin-independent endocytosis, as well as early endosomal antigen 1 (EEA1), were found to be involved in this process. This work reports the direct involvement of the ESCRT machinery in the life cycle of a nonenveloped virus and highlights the complex mechanism that these viruses use to enter cells. It also illustrates the efficiency of high-throughput RNAi screenings as genetic tools for comprehensively studying the interaction between viruses and their host cells. PMID:23733942

  2. A comprehensive genome-wide analysis of melanoma Breslow thickness identifies interaction between CDC42 and SCIN genetic variants.

    PubMed

    Vaysse, Amaury; Fang, Shenying; Brossard, Myriam; Wei, Qingyi; Chen, Wei V; Mohamdi, Hamida; Vincent-Fetita, Lynda; Margaritte-Jeannin, Patricia; Lavielle, Nolwenn; Maubec, Eve; Lathrop, Mark; Avril, Marie-Françoise; Amos, Christopher I; Lee, Jeffrey E; Demenais, Florence

    2016-11-01

    Breslow thickness (BT) is a major prognostic factor of cutaneous melanoma (CM), the most fatal skin cancer. The genetic component of BT has only been explored by candidate gene studies with inconsistent results. Our objective was to uncover the genetic factors underlying BT using a hypothesis-free genome-wide approach. Our analysis strategy integrated a genome-wide association study (GWAS) of single nucleotide polymorphisms (SNPs) for BT followed by pathway analysis of GWAS outcomes using the gene-set enrichment analysis (GSEA) method and epistasis analysis within BT-associated pathways. This strategy was applied to two large CM datasets with Hapmap3-imputed SNP data: the French MELARISK study for discovery (966 cases) and the MD Anderson Cancer Center study (1,546 cases) for replication. While no marginal effect of individual SNPs was revealed through GWAS, three pathways, defined by gene ontology (GO) categories, were significantly enriched in genes associated with BT (false discovery rate ≤5% in both studies): hormone activity, cytokine activity and myeloid cell differentiation. Epistasis analysis, within each significant GO category, identified a statistically significant interaction between CDC42 and SCIN SNPs (p_meta-int = 2.2 × 10⁻⁶, which met the overall multiple-testing corrected threshold of 2.5 × 10⁻⁶). These two SNPs (and proxies) are strongly associated with CDC42 and SCIN gene expression levels and map to regulatory elements in skin cells. This interaction has important biological relevance since CDC42 and SCIN proteins have opposite effects in actin cytoskeleton organization and dynamics, a key mechanism underlying melanoma cell migration and invasion. PMID:27347659

  3. Genome-wide association for grain morphology in synthetic hexaploid wheats using digital imaging analysis

    PubMed Central

    2014-01-01

    Background: Grain size and shape greatly influence grain weight, which ultimately enhances grain yield in wheat. Digital imaging (DI) based phenomic characterization can capture the three-dimensional variation in grain size and shape more fully than has hitherto been possible. In this study, we report the results from using digital imaging of grain size and shape to understand the relationship among different components of this trait, their contribution to enhance grain weight, and to identify genomic regions (QTLs) controlling grain morphology using genome wide association mapping with high density diversity array technology (DArT) and allele-specific markers. Results: Significant positive correlations were observed between grain weight and grain size measurements such as grain length (r = 0.43), width, thickness (r = 0.64) and factor from density (FFD) (r = 0.69). A total of 231 synthetic hexaploid wheats (SHWs) were grouped into five different sub-clusters by Bayesian structure analysis using unlinked DArT markers. Linkage disequilibrium (LD) decay was observed among DArT loci > 10 cM distance and approximately 28% marker pairs were in significant LD. In total, 197 loci over 60 chromosomal regions and 79 loci over 31 chromosomal regions were associated with grain morphology by genome wide analysis using general linear model (GLM) and mixed linear model (MLM) approaches, respectively. They were mainly distributed on homoeologous group 2, 3, 6 and 7 chromosomes. Twenty eight marker-trait associations (MTAs) on the D genome chromosomes 2D, 3D and 6D may carry novel alleles with potential to enhance grain weight due to the use of untapped wild accessions of Aegilops tauschii. Statistical simulations showed that favorable alleles for thousand kernel weight (TKW), grain length, width and thickness have additive genetic effects. Allelic variations for known genes controlling grain size and weight, viz. TaCwi-2A, TaSus-2B, TaCKX6-3D and TaGw2-6A, were also associated

  4. Genome-Wide Comparative Analyses Reveal the Dynamic Evolution of Nucleotide-Binding Leucine-Rich Repeat Gene Family among Solanaceae Plants

    PubMed Central

    Seo, Eunyoung; Kim, Seungill; Yeom, Seon-In; Choi, Doil

    2016-01-01

    Plants have evolved an elaborate innate immune system against invading pathogens. Within this system, intracellular nucleotide-binding leucine-rich repeat (NLR) immune receptors are known to play critical roles in effector-triggered immunity (ETI) plant defense. We performed genome-wide identification and classification of NLR-coding sequences from the genomes of pepper, tomato, and potato using fixed criteria. We then compared genomic duplication and evolution features. We identified 267, 443, and 755 intact NLR-encoding genes in tomato, potato, and pepper genomes, respectively. Phylogenetic analysis and classification of Solanaceae NLRs revealed that the majority of NLR superfamily members fell into 14 subgroups, including a TIR-NLR (TNL) subgroup and 13 non-TNL subgroups. Specific subgroups have expanded in each genome, with the expansion in pepper showing subgroup-specific physical clusters. Comparative analysis of duplications showed distinct duplication patterns within pepper and among Solanaceae plants, suggesting subgroup- or species-specific gene duplication events after speciation, resulting in divergent evolution. Taken together, genome-wide analysis of NLR family members provides insights into their evolutionary history in Solanaceae. These findings also provide important foundational knowledge for understanding NLR evolution and will empower broader characterization of disease resistance genes to be used for crop breeding. PMID:27559340

  5. Genome-Wide Comparative Analyses Reveal the Dynamic Evolution of Nucleotide-Binding Leucine-Rich Repeat Gene Family among Solanaceae Plants.

    PubMed

    Seo, Eunyoung; Kim, Seungill; Yeom, Seon-In; Choi, Doil

    2016-01-01

    Plants have evolved an elaborate innate immune system against invading pathogens. Within this system, intracellular nucleotide-binding leucine-rich repeat (NLR) immune receptors are known to play critical roles in effector-triggered immunity (ETI) plant defense. We performed genome-wide identification and classification of NLR-coding sequences from the genomes of pepper, tomato, and potato using fixed criteria. We then compared genomic duplication and evolution features. We identified 267, 443, and 755 intact NLR-encoding genes in tomato, potato, and pepper genomes, respectively. Phylogenetic analysis and classification of Solanaceae NLRs revealed that the majority of NLR superfamily members fell into 14 subgroups, including a TIR-NLR (TNL) subgroup and 13 non-TNL subgroups. Specific subgroups have expanded in each genome, with the expansion in pepper showing subgroup-specific physical clusters. Comparative analysis of duplications showed distinct duplication patterns within pepper and among Solanaceae plants, suggesting subgroup- or species-specific gene duplication events after speciation, resulting in divergent evolution. Taken together, genome-wide analysis of NLR family members provides insights into their evolutionary history in Solanaceae. These findings also provide important foundational knowledge for understanding NLR evolution and will empower broader characterization of disease resistance genes to be used for crop breeding. PMID:27559340

  6. Pathway analysis of genome-wide association datasets of personality traits.

    PubMed

    Kim, H-N; Kim, B-H; Cho, J; Ryu, S; Shin, H; Sung, J; Shin, C; Cho, N H; Sung, Y A; Choi, B-O; Kim, H-L

    2015-04-01

    Although several genome-wide association (GWA) studies of human personality have been recently published, genetic variants that are highly associated with certain personality traits remain unknown, due to difficulty reproducing results. To further investigate these genetic variants, we assessed biological pathways using GWA datasets. Pathway analysis using GWA data was performed on 1089 Korean women whose personality traits were measured with the Revised NEO Personality Inventory for the 5-factor model of personality. A total of 1042 pathways containing 8297 genes were included in our study. Of these, 14 pathways were highly enriched with association signals that were validated in 1490 independent samples. These pathways include association of: Neuroticism with axon guidance [L1 cell adhesion molecule (L1CAM) interactions]; Extraversion with neuronal system and voltage-gated potassium channels; Agreeableness with L1CAM interaction, neurotransmitter receptor binding and downstream transmission in postsynaptic cells; and Conscientiousness with the interferon-gamma and platelet-derived growth factor receptor beta polypeptide pathways. Several genes that contribute to top-ranked pathways in this study were previously identified in GWA studies or by pathway analysis in schizophrenia or other neuropsychiatric disorders. Here we report the first pathway analysis of all five personality traits. Importantly, our analysis identified novel pathways that contribute to understanding the etiology of personality traits. PMID:25809424

  7. Genome-Wide Analysis of the Cyclin Gene Family in Tomato

    PubMed Central

    Zhang, Tingyan; Wang, Xin; Lu, Yongen; Cai, Xiaofeng; Ye, Zhibiao; Zhang, Junhong

    2014-01-01

    Cyclins play important roles in cell division and cell expansion. They also interact with cyclin-dependent kinases to control cell cycle progression in plants. Our genome-wide analysis identified 52 expressed cyclin genes in tomato. Phylogenetic analysis of the deduced amino acid sequences of tomato and Arabidopsis cyclin genes divided them into 10 types, A-, B-, C-, D-, H-, L-, T-, U-, SDS- and J18. Pfam analysis indicated that most tomato cyclins contain a cyclin-N domain. C-, H- and J18 types only contain a cyclin-C domain, and U-type cyclins contain another potential cyclin domain. The cyclin genes are distributed throughout the tomato genome except for chromosome 8, and 30 of them were found to be segmentally duplicated; they are found on the duplicate segments of chromosomes 1, 2, 3, 4, 5, 6, 10, 11 and 12, suggesting that tomato cyclin genes experienced extensive segmental duplication. Quantitative real-time polymerase chain reaction analysis indicates that the expression patterns of tomato cyclin genes were significantly different in vegetative and reproductive stages. Transcription of most cyclin genes can be enhanced or repressed by exogenous application of gibberellin, which implies that gibberellin may be a direct regulator of cyclin genes. The study presented here may be useful as a guide for further functional research on tomato cyclins. PMID:24366066

  8. PATHWAY-BASED ANALYSIS FOR GENOME-WIDE ASSOCIATION STUDIES USING SUPERVISED PRINCIPAL COMPONENTS

    PubMed Central

    Chen, Xi; Wang, Lily; Hu, Bo; Guo, Mingsheng; Barnard, John; Zhu, Xiaofeng

    2012-01-01

    Many complex diseases are influenced by genetic variations in multiple genes, each with only a small marginal effect on disease susceptibility. Pathway analysis, which identifies biological pathways associated with disease outcome, has become increasingly popular for genome-wide association studies (GWAS). In addition to combining weak signals from a number of SNPs in the same pathway, results from pathway analysis also shed light on the biological processes underlying disease. We propose a new pathway-based analysis method for GWAS, the supervised principal component analysis (SPCA) model. In the proposed SPCA model, a selected subset of SNPs most associated with disease outcome is used to estimate the latent variable for a pathway. The estimated latent variable for each pathway is an optimal linear combination of a selected subset of SNPs; therefore, the proposed SPCA model provides the ability to borrow strength across the SNPs in a pathway. In addition to identifying pathways associated with disease outcome, SPCA also carries out additional within-category selection to identify the most important SNPs within each gene set. The proposed model operates in a well-established statistical framework and can handle design information such as covariate adjustment and matching information in GWAS. We compare the proposed method with currently available methods using data with realistic linkage disequilibrium structures and we illustrate the SPCA method using the Wellcome Trust Case-Control Consortium Crohn Disease (CD) dataset. PMID:20842628

  9. Meta-analysis of genome-wide association studies for circulating phylloquinone concentrations

    PubMed Central

    Dashti, Hassan S; Shea, M Kyla; Smith, Caren E; Tanaka, Toshiko; Hruby, Adela; Richardson, Kris; Wang, Thomas J; Nalls, Mike A; Guo, Xiuqing; Liu, Yongmei; Yao, Jie; Li, Dalin; Johnson, W Craig; Benjamin, Emelia J; Kritchevsky, Stephen B; Siscovick, David S; Ordovás, José M

    2014-01-01

    Background: Poor vitamin K status is linked to greater risk of several chronic diseases. Age, sex, and diet are determinants of circulating vitamin K; however, there is still large unexplained interindividual variability in vitamin K status. Although a strong genetic component has been hypothesized, this has yet to be examined by a genome-wide association (GWA) study. Objective: The objective was to identify common genetic variants associated with concentrations of circulating phylloquinone, the primary circulating form of vitamin K. Design: We conducted a 2-stage GWA meta-analysis of circulating phylloquinone in 2 populations of European descent from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium Nutrition Working Group. Circulating phylloquinone was measured by using reversed-phase high-performance liquid chromatography. Results from adjusted cohort-specific discovery GWA analyses were meta-analyzed with inverse variance weights (n = 2138). Associations with circulating phylloquinone at P < 1 × 10⁻⁶ were then evaluated in a second-stage analysis consisting of one independent cohort (n = 265). Results: No significant association was observed for circulating phylloquinone at the genome-wide significance level of 5 × 10⁻⁸. However, from the discovery GWA, there were 11 single-nucleotide polymorphism (SNP) associations with circulating phylloquinone at P < 1 × 10⁻⁶, including a functional variant previously associated with warfarin dose and altered phylloquinone metabolism. These SNPs are on 5 independent loci on 11q23.3, 8q24.3, 5q22.3, 2p12, and 19p13.12, and they fall within or near the candidate genes APOA1/C3/A4/A5 cluster (involved in lipoprotein metabolism), COL22A1, CDO1, CTNAA2, and CYP4F2 (a phylloquinone oxidase), respectively. Second-stage analysis in an independent cohort further suggests the association of the 5q22.3 locus with circulating phylloquinone (P < 0.05). Conclusions: Multiple candidate genes related to
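
    Note on the method: the abstract above states that cohort-specific results were meta-analyzed with inverse variance weights. The sketch below shows the standard fixed-effect form of that calculation for a single SNP. It is an illustration only, not the consortium's pipeline: the function name `inverse_variance_meta` and the example effect sizes and standard errors are hypothetical and are not taken from the study.

```python
import math


def inverse_variance_meta(betas, standard_errors):
    """Fixed-effect inverse-variance-weighted pooling of per-cohort estimates.

    betas and standard_errors hold one effect estimate and its standard
    error per cohort (hypothetical inputs, not data from the study).
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    if not betas or len(betas) != len(standard_errors):
        raise ValueError("need one standard error per effect estimate")
    if any(se <= 0 for se in standard_errors):
        raise ValueError("standard errors must be positive")

    # Each cohort is weighted by the inverse of its sampling variance.
    weights = [1.0 / (se * se) for se in standard_errors]
    total_weight = sum(weights)

    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)

    # Two-sided P value from the normal approximation.
    z = pooled_beta / pooled_se
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p_value


if __name__ == "__main__":
    # Two made-up cohorts; the more precise cohort dominates the pooled estimate.
    print(inverse_variance_meta([0.10, 0.20], [0.05, 0.10]))
```

    In a two-stage design such as the one described, a P value computed this way in the discovery stage would be compared against the stated follow-up threshold before a SNP is carried into the second stage.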

  10. Pathway-based analysis of primary biliary cirrhosis genome-wide association studies

    PubMed Central

    Kar, SP; Seldin, MF; Chen, W; Lu, E; Hirschfield, GM; Invernizzi, P; Heathcote, J; Cusi, D; Gershwin, ME; Siminovitch, KA; Amos, CI

    2013-01-01

    Genome-wide association studies (GWAS) have successfully identified several loci associated with primary biliary cirrhosis (PBC) risk. Pathway analysis complements conventional GWAS analysis. We applied the recently developed linear combination test for pathways to datasets drawn from independent PBC GWAS in Italian and Canadian subjects. Of the Kyoto Encyclopedia of Genes and Genomes and BioCarta pathways tested, 25 pathways in the Italian dataset (449 cases, 940 controls) and 26 pathways in the Canadian dataset (530 cases, 398 controls) were associated with PBC susceptibility (P < 0.05). After correcting for multiple comparisons, only the eight most significant pathways in the Italian dataset had FDR < 0.25 with tumor necrosis factor/stress-related signaling emerging as the top pathway (P = 7.38 × 10⁻⁴, FDR = 0.18). Two pathways, phosphatidylinositol signaling and hedgehog signaling, were replicated in both datasets (P < 0.05), and subjected to two additional complementary pathway tests. Both pathway signals remained significant in the Italian dataset on modified gene set enrichment analysis (P < 0.05). In both GWAS, variants nominally associated with PBC were significantly overrepresented in the phosphatidylinositol pathway (Fisher exact P < 0.05). These results point to established and novel pathway-level associations with inherited predisposition to PBC that, on further independent replication and functional validation, may provide fresh insights into PBC etiology. PMID:23392275
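
    Note on the method: the abstract above reports a Fisher exact test for overrepresentation of nominally associated variants in a pathway. A minimal sketch of a one-sided test of that kind follows, computed as a hypergeometric upper tail. The function name `fisher_overrepresentation_p` and all counts are hypothetical, and the sketch assumes variants are independent, which linkage disequilibrium in real GWAS data violates.

```python
from math import comb


def fisher_overrepresentation_p(hits_in_pathway, pathway_size,
                                total_hits, total_variants):
    """One-sided Fisher exact P value for overrepresentation.

    Probability of seeing at least hits_in_pathway nominally associated
    variants among pathway_size pathway variants, when total_hits of the
    total_variants tested are nominally associated overall.
    All arguments are illustrative counts, not values from the study.
    """
    if not (0 <= pathway_size <= total_variants
            and 0 <= total_hits <= total_variants):
        raise ValueError("pathway_size and total_hits must lie in [0, total_variants]")
    max_hits = min(pathway_size, total_hits)
    if not (0 <= hits_in_pathway <= max_hits):
        raise ValueError("hits_in_pathway is outside the possible range")

    # Hypergeometric upper tail: sum over all tables at least as extreme.
    outside = total_variants - pathway_size
    tail = sum(
        comb(pathway_size, k) * comb(outside, total_hits - k)
        for k in range(hits_in_pathway, max_hits + 1)
    )
    return tail / comb(total_variants, total_hits)


if __name__ == "__main__":
    # Made-up example: 12 of 200 pathway variants are nominal hits,
    # against 300 nominal hits among 10,000 variants overall.
    print(fisher_overrepresentation_p(12, 200, 300, 10_000))
```

    Exact integer arithmetic via `math.comb` avoids an external dependency; for genome-scale counts a statistics library would usually be the more practical choice.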

  11. A Genome-Wide Longitudinal Transcriptome Analysis of the Aging Model Podospora anserina

    PubMed Central

    Philipp, Oliver; Hamann, Andrea; Servos, Jörg; Werner, Alexandra; Koch, Ina; Osiewacz, Heinz D.

    2013-01-01

    Aging of biological systems is controlled by various processes which have a potential impact on gene expression. Here we report a genome-wide transcriptome analysis of the fungal aging model Podospora anserina. Total RNA of three individuals of defined age was pooled and analyzed by SuperSAGE (serial analysis of gene expression). A bioinformatics analysis identified different molecular pathways to be affected during aging. While the abundance of transcripts linked to ribosomes and to the proteasome quality control system was found to decrease during aging, that of transcripts associated with autophagy increased, suggesting that autophagy may act as a compensatory quality control pathway. Transcript profiles associated with energy metabolism, including mitochondrial functions, were found to fluctuate during aging. Comparison of wild-type transcripts, which are continuously down-regulated during aging, with those down-regulated in the long-lived, copper-uptake mutant grisea, validated the relevance of age-related changes in cellular copper metabolism. Overall, we (i) present a unique age-related data set of a longitudinal study of the experimental aging model P. anserina which represents a reference resource for future investigations in a variety of organisms, (ii) suggest autophagy to be a key quality control pathway that becomes active once other pathways fail, and (iii) present testable predictions for subsequent experimental investigations. PMID:24376646

  12. Analysis of gene-specific and genome-wide sperm DNA methylation.

    PubMed

    Hammoud, Saher Sue; Cairns, Bradley R; Carrell, Douglas T

    2013-01-01

    Epigenetic modifications on the DNA sequence (DNA methylation) or on chromatin-associated proteins (i.e., histones) comprise the "cellular epigenome"; together these modifications play an important role in the regulation of gene expression. Unlike the genome, the epigenome is highly variable between cells and is dynamic and plastic in response to cellular stress and environmental cues. The role of the epigenome, specifically the methylome, has been increasingly highlighted, and it has been implicated in many cellular and developmental processes such as embryonic reprogramming, cellular differentiation, imprinting, X chromosome inactivation, genomic stability, and complex diseases such as cancer. Over the past decade several methods have been developed and applied to characterize DNA methylation at gene-specific loci (using either traditional bisulfite sequencing or pyrosequencing) or its genome-wide distribution (microarray analysis following methylated DNA immunoprecipitation (MeDIP-chip), analysis by sequencing (MeDIP-seq), reduced representation bisulfite sequencing (RRBS), or shotgun bisulfite sequencing). This chapter reviews traditional bisulfite sequencing and shotgun bisulfite sequencing approaches, with a greater emphasis on shotgun bisulfite sequencing methods and data analysis. PMID:22992936

  13. Genome-wide pathway analysis in attention-deficit/hyperactivity disorder.

    PubMed

    Lee, Young Ho; Song, Gwan Gyu

    2014-08-01

    This study aimed (1) to identify candidate single-nucleotide polymorphisms (SNPs) and mechanisms of attention-deficit/hyperactivity disorder (ADHD) and (2) to generate SNP-to-gene-to-pathway hypotheses. An ADHD genome-wide association study (GWAS) dataset that included 428,074 SNPs in 924 trios (2,758 individuals) of European descent was used in this study. The Identify candidate Causal SNPs and Pathways (ICSNPathway) analysis was applied to the GWAS dataset. ICSNPathway analysis identified 11 candidate SNPs, 6 genes, and 6 pathways, which provided 6 hypothetical biological mechanisms. The strongest hypothetical biological mechanism was that rs2532502 alters the role of CD27 in the context of the pathways of positive regulation of nucleocytoplasmic transport [nominal p < 0.001; false discovery rate (FDR) = 0.028]. The second strongest mechanism was the rs1820204, rs1052571, rs1052576 → CASP9 → mitochondrial pathway (nominal p < 0.001; FDR = 0.032). The third mechanism was the rs1801516 → ATM → CD25 pathway (nominal p < 0.001; FDR = 0.034). By applying the ICSNPathway analysis to the ADHD GWAS data, 11 candidate SNPs, 6 genes that included CD27, CASP9, ATM, CD12orf65, OXER1, and ACRY, and 6 pathways were identified that may contribute to ADHD susceptibility. PMID:24531918

  14. Genome-Wide DNA Methylation Analysis and Epigenetic Variations Associated with Congenital Aortic Valve Stenosis (AVS).

    PubMed

    Radhakrishna, Uppala; Albayrak, Samet; Alpay-Savasan, Zeynep; Zeb, Amna; Turkoglu, Onur; Sobolewski, Paul; Bahado-Singh, Ray O

    2016-01-01

    Congenital heart defect (CHD) is the most common cause of death from congenital anomaly. Among several candidate epigenetic mechanisms, DNA methylation may play an important role in the etiology of CHDs. We conducted a genome-wide DNA methylation analysis using an Illumina Infinium 450k human methylation assay in a cohort of 24 newborns who had aortic valve stenosis (AVS), with gestational age-matched controls. The study identified significantly altered CpG methylation at 59 sites in 52 genes in AVS subjects as compared to controls (either hypermethylated or demethylated). Gene Ontology analysis identified biological processes and functions for these genes including positive regulation of receptor-mediated endocytosis. Consistent with prior clinical data, the molecular function categories as determined using DAVID identified low-density lipoprotein receptor binding, lipoprotein receptor binding and identical protein binding to be over-represented in the AVS group. A significant epigenetic change in the APOA5 and PCSK9 genes known to be involved in AVS was also observed. A large number of CpG methylation sites individually demonstrated good to excellent diagnostic accuracy for the prediction of AVS status, thus raising the possibility of molecular screening markers for this disorder. Using epigenetic analysis we were able to identify genes significantly involved in the pathogenesis of AVS. PMID:27152866

  15. Genome-Wide DNA Methylation Analysis and Epigenetic Variations Associated with Congenital Aortic Valve Stenosis (AVS)

    PubMed Central

    Radhakrishna, Uppala; Albayrak, Samet; Alpay-Savasan, Zeynep; Zeb, Amna; Turkoglu, Onur; Sobolewski, Paul; Bahado-Singh, Ray O.

    2016-01-01

    Congenital heart defect (CHD) is the most common cause of death from congenital anomaly. Among several candidate epigenetic mechanisms, DNA methylation may play an important role in the etiology of CHDs. We conducted a genome-wide DNA methylation analysis using an Illumina Infinium 450k human methylation assay in a cohort of 24 newborns who had aortic valve stenosis (AVS), with gestational age-matched controls. The study identified significantly altered CpG methylation at 59 sites in 52 genes in AVS subjects as compared to controls (either hypermethylated or demethylated). Gene Ontology analysis identified biological processes and functions for these genes including positive regulation of receptor-mediated endocytosis. Consistent with prior clinical data, the molecular function categories as determined using DAVID identified low-density lipoprotein receptor binding, lipoprotein receptor binding and identical protein binding to be over-represented in the AVS group. A significant epigenetic change in the APOA5 and PCSK9 genes known to be involved in AVS was also observed. A large number of CpG methylation sites individually demonstrated good to excellent diagnostic accuracy for the prediction of AVS status, thus raising the possibility of molecular screening markers for this disorder. Using epigenetic analysis we were able to identify genes significantly involved in the pathogenesis of AVS. PMID:27152866

  16. Genome-Wide Identification and Expression Analysis of Calcium-dependent Protein Kinase in Tomato

    PubMed Central

    Hu, Zhangjian; Lv, Xiangzhang; Xia, Xiaojian; Zhou, Jie; Shi, Kai; Yu, Jingquan; Zhou, Yanhong

    2016-01-01

    Calcium-dependent protein kinases (CDPKs) play critical roles in regulating growth, development and stress response in plants. Information about CDPKs in tomato, however, remains limited, although it is one of the most important model crops in the world. In this study, we performed a bioinformatics analysis of the entire tomato genome and identified 29 CDPK genes. These CDPK genes are located on 12 chromosomes and can be divided into four groups. Analysis of gene structure and splicing sites revealed high structural conservation within CDPK gene groups in both exon-intron pattern and mRNA splicing. Transcripts of most CDPK genes varied with plant organs and developmental stages, and could be differentially induced by abscisic acid (ABA), brassinosteroids (BRs), methyl jasmonate (MeJA), and salicylic acid (SA), as well as by exposure to heat, cold, and drought. To our knowledge, this is the first report of a genome-wide analysis of the CDPK gene family in tomato, and the findings offer a clue to the elaborate regulatory role of CDPKs in plant growth, development and stress response in tomato. PMID:27092168

  17. Genome-wide analysis of over 106 000 individuals identifies 9 neuroticism-associated loci

    PubMed Central

    Smith, D J; Escott-Price, V; Davies, G; Bailey, M E S; Colodro-Conde, L; Ward, J; Vedernikov, A; Marioni, R; Cullen, B; Lyall, D; Hagenaars, S P; Liewald, D C M; Luciano, M; Gale, C R; Ritchie, S J; Hayward, C; Nicholl, B; Bulik-Sullivan, B; Adams, M; Couvy-Duchesne, B; Graham, N; Mackay, D; Evans, J; Smith, B H; Porteous, D J; Medland, S E; Martin, N G; Holmans, P; McIntosh, A M; Pell, J P; Deary, I J; O'Donovan, M C

    2016-01-01

    Neuroticism is a personality trait of fundamental importance for psychological well-being and public health. It is strongly associated with major depressive disorder (MDD) and several other psychiatric conditions. Although neuroticism is heritable, attempts to identify the alleles involved in previous studies have been limited by relatively small sample sizes. Here we report a combined meta-analysis of genome-wide association study (GWAS) of neuroticism that includes 91 370 participants from the UK Biobank cohort, 6659 participants from the Generation Scotland: Scottish Family Health Study (GS:SFHS) and 8687 participants from a QIMR (Queensland Institute of Medical Research) Berghofer Medical Research Institute (QIMR) cohort. All participants were assessed using the same neuroticism instrument, the Eysenck Personality Questionnaire-Revised (EPQ-R-S) Short Form's Neuroticism scale. We found a single-nucleotide polymorphism-based heritability estimate for neuroticism of ∼15% (s.e.=0.7%). Meta-analysis identified nine novel loci associated with neuroticism. The strongest evidence for association was at a locus on chromosome 8 (P=1.5 × 10⁻¹⁵) spanning 4 Mb and containing at least 36 genes. Other associated loci included interesting candidate genes on chromosome 1 (GRIK3 (glutamate receptor ionotropic kainate 3)), chromosome 4 (KLHL2 (Kelch-like protein 2)), chromosome 17 (CRHR1 (corticotropin-releasing hormone receptor 1) and MAPT (microtubule-associated protein Tau)) and on chromosome 18 (CELF4 (CUGBP elav-like family member 4)). We found no evidence for genetic differences in the common allelic architecture of neuroticism by sex. By comparing our findings with those of the Psychiatric Genetics Consortia, we identified a strong genetic correlation between neuroticism and MDD and a less strong but significant genetic correlation with schizophrenia, although not with bipolar disorder. Polygenic risk scores derived from the primary UK Biobank sample captured

  18. Genome-wide analysis of over 106 000 individuals identifies 9 neuroticism-associated loci.

    PubMed

    Smith, D J; Escott-Price, V; Davies, G; Bailey, M E S; Colodro-Conde, L; Ward, J; Vedernikov, A; Marioni, R; Cullen, B; Lyall, D; Hagenaars, S P; Liewald, D C M; Luciano, M; Gale, C R; Ritchie, S J; Hayward, C; Nicholl, B; Bulik-Sullivan, B; Adams, M; Couvy-Duchesne, B; Graham, N; Mackay, D; Evans, J; Smith, B H; Porteous, D J; Medland, S E; Martin, N G; Holmans, P; McIntosh, A M; Pell, J P; Deary, I J; O'Donovan, M C

    2016-06-01

    Neuroticism is a personality trait of fundamental importance for psychological well-being and public health. It is strongly associated with major depressive disorder (MDD) and several other psychiatric conditions. Although neuroticism is heritable, attempts to identify the alleles involved in previous studies have been limited by relatively small sample sizes. Here we report a combined meta-analysis of genome-wide association study (GWAS) of neuroticism that includes 91 370 participants from the UK Biobank cohort, 6659 participants from the Generation Scotland: Scottish Family Health Study (GS:SFHS) and 8687 participants from a QIMR (Queensland Institute of Medical Research) Berghofer Medical Research Institute (QIMR) cohort. All participants were assessed using the same neuroticism instrument, the Eysenck Personality Questionnaire-Revised (EPQ-R-S) Short Form's Neuroticism scale. We found a single-nucleotide polymorphism-based heritability estimate for neuroticism of ∼15% (s.e.=0.7%). Meta-analysis identified nine novel loci associated with neuroticism. The strongest evidence for association was at a locus on chromosome 8 (P=1.5 × 10⁻¹⁵) spanning 4 Mb and containing at least 36 genes. Other associated loci included interesting candidate genes on chromosome 1 (GRIK3 (glutamate receptor ionotropic kainate 3)), chromosome 4 (KLHL2 (Kelch-like protein 2)), chromosome 17 (CRHR1 (corticotropin-releasing hormone receptor 1) and MAPT (microtubule-associated protein Tau)) and on chromosome 18 (CELF4 (CUGBP elav-like family member 4)). We found no evidence for genetic differences in the common allelic architecture of neuroticism by sex. By comparing our findings with those of the Psychiatric Genetics Consortia, we identified a strong genetic correlation between neuroticism and MDD and a less strong but significant genetic correlation with schizophrenia, although not with bipolar disorder. Polygenic risk scores derived from the primary UK Biobank sample captured

  19. Genome-wide identification, classification, and expression analysis of sHSP genes in Chinese cabbage (Brassica rapa ssp pekinensis).

    PubMed

    Tao, P; Guo, W L; Li, B Y; Wang, W H; Yue, Z C; Lei, J L; Zhong, X M

    2015-01-01

    Small heat shock proteins (sHSPs) are essential for the plant's normal development and stress responses, especially the heat stress response. The information regarding sHSP genes in Chinese cabbage (Brassica rapa ssp pekinensis) is sparse, hence we performed a genome-wide analysis to identify sHSP genes in this species. We identified 26 non-redundant sHSP genes distributed on all chromosomes, except chromosome A7, with one additional sHSP gene identified from an expressed sequence tag library. Chinese cabbage was found to contain more sHSP genes than Arabidopsis. The 27 sHSP genes were classified into 11 subfamilies. We identified 22 groups of sHSP syntenic orthologous genes between Chinese cabbage and Arabidopsis. In addition, eight groups of paralogous genes were uncovered in Chinese cabbage. Protein structures of the 27 Chinese cabbage sHSPs were modeled using Phyre2, which revealed that all of them contain several conserved β strands across different subfamilies. In general, gene structure was conserved within each subfamily between Chinese cabbage and Arabidopsis, except for peroxisome sHSP. Analysis of promoter motifs showed that most sHSP genes contain heat shock elements or variants. We also found that biased gene loss has occurred during the evolution of the sHSP subfamily in Chinese cabbage. Expression analysis indicated that the greatest transcript abundance of most Chinese cabbage sHSP genes was found in siliques and early cotyledon embryos. Thus, genome-wide identification and characterization of sHSP genes is a first and important step in the investigation of sHSPs in Chinese cabbage. PMID:26505345

  20. Meta-analysis of genome-wide association studies discovers multiple loci for chronic lymphocytic leukemia

    PubMed Central

    Berndt, Sonja I.; Camp, Nicola J.; Skibola, Christine F.; Vijai, Joseph; Wang, Zhaoming; Gu, Jian; Nieters, Alexandra; Kelly, Rachel S.; Smedby, Karin E.; Monnereau, Alain; Cozen, Wendy; Cox, Angela; Wang, Sophia S.; Lan, Qing; Teras, Lauren R.; Machado, Moara; Yeager, Meredith; Brooks-Wilson, Angela R.; Hartge, Patricia; Purdue, Mark P.; Birmann, Brenda M.; Vajdic, Claire M.; Cocco, Pierluigi; Zhang, Yawei; Giles, Graham G.; Zeleniuch-Jacquotte, Anne; Lawrence, Charles; Montalvan, Rebecca; Burdett, Laurie; Hutchinson, Amy; Ye, Yuanqing; Call, Timothy G.; Shanafelt, Tait D.; Novak, Anne J.; Kay, Neil E.; Liebow, Mark; Cunningham, Julie M.; Allmer, Cristine; Hjalgrim, Henrik; Adami, Hans-Olov; Melbye, Mads; Glimelius, Bengt; Chang, Ellen T.; Glenn, Martha; Curtin, Karen; Cannon-Albright, Lisa A.; Diver, W Ryan; Link, Brian K.; Weiner, George J.; Conde, Lucia; Bracci, Paige M.; Riby, Jacques; Arnett, Donna K.; Zhi, Degui; Leach, Justin M.; Holly, Elizabeth A.; Jackson, Rebecca D.; Tinker, Lesley F.; Benavente, Yolanda; Sala, Núria; Casabonne, Delphine; Becker, Nikolaus; Boffetta, Paolo; Brennan, Paul; Foretova, Lenka; Maynadie, Marc; McKay, James; Staines, Anthony; Chaffee, Kari G.; Achenbach, Sara J.; Vachon, Celine M.; Goldin, Lynn R.; Strom, Sara S.; Leis, Jose F.; Weinberg, J. Brice; Caporaso, Neil E.; Norman, Aaron D.; De Roos, Anneclaire J.; Morton, Lindsay M.; Severson, Richard K.; Riboli, Elio; Vineis, Paolo; Kaaks, Rudolph; Masala, Giovanna; Weiderpass, Elisabete; Chirlaque, María-Dolores; Vermeulen, Roel C. H.; Travis, Ruth C.; Southey, Melissa C.; Milne, Roger L.; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie; Clavel, Jacqueline; Zheng, Tongzhang; Holford, Theodore R.; Villano, Danylo J.; Maria, Ann; Spinelli, John J.; Gascoyne, Randy D.; Connors, Joseph M.; Bertrand, Kimberly A.; Giovannucci, Edward; Kraft, Peter; Kricker, Anne; Turner, Jenny; Ennas, Maria Grazia; Ferri, Giovanni M.; Miligi, Lucia; Liang, Liming; Ma, Baoshan; Huang, Jinyan; Crouch, Simon; Park, Ju-Hyun; Chatterjee, Nilanjan; North, Kari E.; Snowden, John A.; Wright, Josh; Fraumeni, Joseph F.; Offit, Kenneth; Wu, Xifeng; de Sanjose, Silvia; Cerhan, James R.; Chanock, Stephen J.; Rothman, Nathaniel; Slager, Susan L.

    2016-01-01

    Chronic lymphocytic leukemia (CLL) is a common lymphoid malignancy with strong heritability. To further understand the genetic susceptibility for CLL and identify common loci associated with risk, we conducted a meta-analysis of four genome-wide association studies (GWAS) composed of 3,100 cases and 7,667 controls with follow-up replication in 1,958 cases and 5,530 controls. Here we report three new loci at 3p24.1 (rs9880772, EOMES, P=2.55 × 10(-11)), 6p25.2 (rs73718779, SERPINB6, P=1.97 × 10(-8)) and 3q28 (rs9815073, LPP, P=3.62 × 10(-8)), as well as a new independent SNP at the known 2q13 locus (rs9308731, BCL2L11, P=1.00 × 10(-11)) in the combined analysis. We find suggestive evidence (P<5 × 10(-7)) for two additional new loci at 4q24 (rs10028805, BANK1, P=7.19 × 10(-8)) and 3p22.2 (rs1274963, CSRNP1, P=2.12 × 10(-7)). Pathway analyses of new and known CLL loci consistently show a strong role for apoptosis, providing further evidence for the importance of this biological pathway in CLL susceptibility. PMID:26956414

  1. Methods for Genome-Wide Analysis of Gene Expression Changes in Polyploids

    PubMed Central

    Wang, Jianlin; Lee, Jinsuk J.; Tian, Lu; Lee, Hyeon-Se; Chen, Meng; Rao, Sheetal; Wei, Edward N.; Doerge, R. W.; Comai, Luca; Jeffrey Chen, Z.

    2007-01-01

    Polyploidy is an evolutionary innovation, providing extra sets of genetic material for phenotypic variation and adaptation. It is predicted that changes of gene expression by genetic and epigenetic mechanisms are responsible for novel variation in nascent and established polyploids (Liu and Wendel, 2002; Osborn et al., 2003; Pikaard, 2001). Studying gene expression changes in allopolyploids is more complicated than in autopolyploids, because allopolyploids contain more than two sets of genomes originating from divergent, but related, species. Here we describe two methods that are applicable to the genome-wide analysis of gene expression differences resulting from genome duplication in autopolyploids or interactions between homoeologous genomes in allopolyploids. First, we describe an amplified fragment length polymorphism (AFLP)–complementary DNA (cDNA) display method that allows the discrimination of homoeologous loci based on restriction polymorphisms between the progenitors. Second, we describe microarray analyses that can be used to compare gene expression differences between the allopolyploids and respective progenitors using appropriate experimental design and statistical analysis. We demonstrate the utility of these two complementary methods and discuss the pros and cons of using the methods to analyze gene expression changes in autopolyploids and allopolyploids. Furthermore, we describe these methods in general terms to be of wider applicability for comparative gene expression in a variety of evolutionary, genetic, biological, and physiological contexts. PMID:15865985

  2. Multiple SNP Set Analysis for Genome-Wide Association Studies Through Bayesian Latent Variable Selection.

    PubMed

    Lu, Zhao-Hua; Zhu, Hongtu; Knickmeyer, Rebecca C; Sullivan, Patrick F; Williams, Stephanie N; Zou, Fei

    2015-12-01

    The power of genome-wide association studies (GWAS) for mapping complex traits with single-SNP analysis (where SNP is single-nucleotide polymorphism) may be undermined by modest SNP effect sizes, unobserved causal SNPs, correlation among adjacent SNPs, and SNP-SNP interactions. Alternative approaches for testing the association between a single SNP set and individual phenotypes have been shown to be promising for improving the power of GWAS. We propose a Bayesian latent variable selection (BLVS) method to simultaneously model the joint association mapping between a large number of SNP sets and complex traits. Compared with single SNP set analysis, such joint association mapping not only accounts for the correlation among SNP sets but also is capable of detecting causal SNP sets that are marginally uncorrelated with traits. The spike-and-slab prior assigned to the effects of SNP sets can greatly reduce the dimension of effective SNP sets, while speeding up computation. An efficient Markov chain Monte Carlo algorithm is developed. Simulations demonstrate that BLVS outperforms several competing variable selection methods in some important scenarios. PMID:26515609

  3. Meta-analysis of genome-wide association studies discovers multiple loci for chronic lymphocytic leukemia.

    PubMed

    Berndt, Sonja I; Camp, Nicola J; Skibola, Christine F; Vijai, Joseph; Wang, Zhaoming; Gu, Jian; Nieters, Alexandra; Kelly, Rachel S; Smedby, Karin E; Monnereau, Alain; Cozen, Wendy; Cox, Angela; Wang, Sophia S; Lan, Qing; Teras, Lauren R; Machado, Moara; Yeager, Meredith; Brooks-Wilson, Angela R; Hartge, Patricia; Purdue, Mark P; Birmann, Brenda M; Vajdic, Claire M; Cocco, Pierluigi; Zhang, Yawei; Giles, Graham G; Zeleniuch-Jacquotte, Anne; Lawrence, Charles; Montalvan, Rebecca; Burdett, Laurie; Hutchinson, Amy; Ye, Yuanqing; Call, Timothy G; Shanafelt, Tait D; Novak, Anne J; Kay, Neil E; Liebow, Mark; Cunningham, Julie M; Allmer, Cristine; Hjalgrim, Henrik; Adami, Hans-Olov; Melbye, Mads; Glimelius, Bengt; Chang, Ellen T; Glenn, Martha; Curtin, Karen; Cannon-Albright, Lisa A; Diver, W Ryan; Link, Brian K; Weiner, George J; Conde, Lucia; Bracci, Paige M; Riby, Jacques; Arnett, Donna K; Zhi, Degui; Leach, Justin M; Holly, Elizabeth A; Jackson, Rebecca D; Tinker, Lesley F; Benavente, Yolanda; Sala, Núria; Casabonne, Delphine; Becker, Nikolaus; Boffetta, Paolo; Brennan, Paul; Foretova, Lenka; Maynadie, Marc; McKay, James; Staines, Anthony; Chaffee, Kari G; Achenbach, Sara J; Vachon, Celine M; Goldin, Lynn R; Strom, Sara S; Leis, Jose F; Weinberg, J Brice; Caporaso, Neil E; Norman, Aaron D; De Roos, Anneclaire J; Morton, Lindsay M; Severson, Richard K; Riboli, Elio; Vineis, Paolo; Kaaks, Rudolph; Masala, Giovanna; Weiderpass, Elisabete; Chirlaque, María-Dolores; Vermeulen, Roel C H; Travis, Ruth C; Southey, Melissa C; Milne, Roger L; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie; Clavel, Jacqueline; Zheng, Tongzhang; Holford, Theodore R; Villano, Danylo J; Maria, Ann; Spinelli, John J; Gascoyne, Randy D; Connors, Joseph M; Bertrand, Kimberly A; Giovannucci, Edward; Kraft, Peter; Kricker, Anne; Turner, Jenny; Ennas, Maria Grazia; Ferri, Giovanni M; Miligi, Lucia; Liang, Liming; Ma, Baoshan; Huang, Jinyan; Crouch, Simon; Park, Ju-Hyun; Chatterjee, Nilanjan; North, Kari E; Snowden, John A; Wright, Josh; Fraumeni, Joseph F; Offit, Kenneth; Wu, Xifeng; de Sanjose, Silvia; Cerhan, James R; Chanock, Stephen J; Rothman, Nathaniel; Slager, Susan L

    2016-01-01

    Chronic lymphocytic leukemia (CLL) is a common lymphoid malignancy with strong heritability. To further understand the genetic susceptibility for CLL and identify common loci associated with risk, we conducted a meta-analysis of four genome-wide association studies (GWAS) composed of 3,100 cases and 7,667 controls with follow-up replication in 1,958 cases and 5,530 controls. Here we report three new loci at 3p24.1 (rs9880772, EOMES, P=2.55 × 10(-11)), 6p25.2 (rs73718779, SERPINB6, P=1.97 × 10(-8)) and 3q28 (rs9815073, LPP, P=3.62 × 10(-8)), as well as a new independent SNP at the known 2q13 locus (rs9308731, BCL2L11, P=1.00 × 10(-11)) in the combined analysis. We find suggestive evidence (P<5 × 10(-7)) for two additional new loci at 4q24 (rs10028805, BANK1, P=7.19 × 10(-8)) and 3p22.2 (rs1274963, CSRNP1, P=2.12 × 10(-7)). Pathway analyses of new and known CLL loci consistently show a strong role for apoptosis, providing further evidence for the importance of this biological pathway in CLL susceptibility. PMID:26956414

  4. Genome-wide meta-analysis identifies multiple novel associations and ethnic heterogeneity of psoriasis susceptibility.

    PubMed

    Yin, Xianyong; Low, Hui Qi; Wang, Ling; Li, Yonghong; Ellinghaus, Eva; Han, Jiali; Estivill, Xavier; Sun, Liangdan; Zuo, Xianbo; Shen, Changbing; Zhu, Caihong; Zhang, Anping; Sanchez, Fabio; Padyukov, Leonid; Catanese, Joseph J; Krueger, Gerald G; Duffin, Kristina Callis; Mucha, Sören; Weichenthal, Michael; Weidinger, Stephan; Lieb, Wolfgang; Foo, Jia Nee; Li, Yi; Sim, Karseng; Liany, Herty; Irwan, Ishak; Teo, Yikying; Theng, Colin T S; Gupta, Rashmi; Bowcock, Anne; De Jager, Philip L; Qureshi, Abrar A; de Bakker, Paul I W; Seielstad, Mark; Liao, Wilson; Ståhle, Mona; Franke, Andre; Zhang, Xuejun; Liu, Jianjun

    2015-01-01

    Psoriasis is a common inflammatory skin disease with complex genetics and different degrees of prevalence across ethnic populations. Here we present the largest trans-ethnic genome-wide meta-analysis (GWMA) of psoriasis in 15,369 cases and 19,517 controls of Caucasian and Chinese ancestries. We identify four novel associations at LOC144817, COG6, RUNX1 and TP63, as well as three novel secondary associations within IFIH1 and IL12B. Fine-mapping analysis of MHC region demonstrates an important role for all three HLA class I genes and a complex and heterogeneous pattern of HLA associations between Caucasian and Chinese populations. Further, trans-ethnic comparison suggests population-specific effect or allelic heterogeneity for 11 loci. These population-specific effects contribute significantly to the ethnic diversity of psoriasis prevalence. This study not only provides novel biological insights into the involvement of immune and keratinocyte development mechanism, but also demonstrates a complex and heterogeneous genetic architecture of psoriasis susceptibility across ethnic populations. PMID:25903422

  5. A Tool Set for the Genome-Wide Analysis of Neurospora crassa by RT-PCR.

    PubMed

    Hurley, Jennifer H; Dasgupta, Arko; Andrews, Peter; Crowell, Alexander M; Ringelberg, Carol; Loros, Jennifer J; Dunlap, Jay C

    2015-10-01

    Neurospora crassa is an important model organism for filamentous fungi as well as for circadian biology and photobiology. Although the community-accumulated tool set for the molecular analysis of Neurospora is extensive, two components are missing: (1) dependable reference genes whose levels of expression are relatively constant across light/dark cycles and as a function of time of day and (2) a catalog of primers specifically designed for real-time PCR (RT-PCR). To address the first of these we have identified genes that are optimal for use as reference genes in RT-PCR across a wide range of expression levels; the mRNA/transcripts from these genes have potential for use as reference noncycling transcripts outside of Neurospora. In addition, we have generated a genome-wide set of RT-PCR primers, thereby streamlining the analysis of gene expression. In validation studies these primers successfully identified target mRNAs arising from 70% (34 of 49) of all tested genes and from all (28) of the moderately to highly expressed tested genes. PMID:26248984

  6. A genome-wide association meta-analysis identifies new childhood obesity loci

    PubMed Central

    Bradfield, Jonathan P.; Taal, H. Rob; Timpson, Nicholas J.; Scherag, André; Lecoeur, Cecile; Warrington, Nicole M.; Hypponen, Elina; Holst, Claus; Valcarcel, Beatriz; Thiering, Elisabeth; Salem, Rany M.; Schumacher, Fredrick R.; Cousminer, Diana L.; Sleiman, Patrick M.A.; Zhao, Jianhua; Berkowitz, Robert I.; Vimaleswaran, Karani S.; Jarick, Ivonne; Pennell, Craig E.; Evans, David M.; St. Pourcain, Beate; Berry, Diane J.; Mook-Kanamori, Dennis O; Hofman, Albert; Rivadeinera, Fernando; Uitterlinden, André G.; van Duijn, Cornelia M.; van der Valk, Ralf J.P.; de Jongste, Johan C.; Postma, Dirkje S.; Boomsma, Dorret I.; Gauderman, William J.; Hassanein, Mohamed T.; Lindgren, Cecilia M.; Mägi, Reedik; Boreham, Colin A.G.; Neville, Charlotte E.; Moreno, Luis A.; Elliott, Paul; Pouta, Anneli; Hartikainen, Anna-Liisa; Li, Mingyao; Raitakari, Olli; Lehtimäki, Terho; Eriksson, Johan G.; Palotie, Aarno; Dallongeville, Jean; Das, Shikta; Deloukas, Panos; McMahon, George; Ring, Susan M.; Kemp, John P.; Buxton, Jessica L.; Blakemore, Alexandra I.F.; Bustamante, Mariona; Guxens, Mònica; Hirschhorn, Joel N.; Gillman, Matthew W.; Kreiner-Møller, Eskil; Bisgaard, Hans; Gilliland, Frank D.; Heinrich, Joachim; Wheeler, Eleanor; Barroso, Inês; O'Rahilly, Stephen; Meirhaeghe, Aline; Sørensen, Thorkild I.A.; Power, Chris; Palmer, Lyle J.; Hinney, Anke; Widen, Elisabeth; Farooqi, I. Sadaf; McCarthy, Mark I.; Froguel, Philippe; Meyre, David; Hebebrand, Johannes; Jarvelin, Marjo-Riitta; Jaddoe, Vincent W.V.; Smith, George Davey; Hakonarson, Hakon; Grant, Struan F.A.

    2012-01-01

    Multiple genetic variants have been associated with adult obesity and a few with severe obesity in childhood; however, less progress has been made to establish genetic influences on common early-onset obesity. We performed a North American-Australian-European collaborative meta-analysis of fourteen studies consisting of 5,530 cases (≥95th percentile of body mass index (BMI)) and 8,318 controls (<50th percentile of BMI) of European ancestry. Taking forward the eight novel signals yielding association with P < 5×10(-6) into nine independent datasets (n = 2,818 cases and 4,083 controls) we observed two loci that yielded a genome-wide significant combined P-value, namely near OLFM4 on 13q14 (rs9568856; P=1.82×10(-9); OR=1.22) and within HOXB5 on 17q21 (rs9299; P=3.54×10(-9); OR=1.14). Both loci continued to show association when including two extreme childhood obesity cohorts (n = 2,214 cases and 2,674 controls). Finally, these two loci yielded directionally consistent associations in the GIANT meta-analysis of adult BMI. PMID:22484627

  7. Genome-wide association meta-analysis identifies new endometriosis risk loci

    PubMed Central

    Nyholt, Dale R.; Low, Siew-Kee; Anderson, Carl A.; Painter, Jodie N.; Uno, Satoko; Morris, Andrew P.; MacGregor, Stuart; Gordon, Scott D.; Henders, Anjali K.; Martin, Nicholas G.; Attia, John; Holliday, Elizabeth G.; McEvoy, Mark; Scott, Rodney J.; Kennedy, Stephen H.; Treloar, Susan A.; Missmer, Stacey A.; Adachi, Sosuke; Tanaka, Kenichi; Nakamura, Yusuke; Zondervan, Krina T.; Zembutsu, Hitoshi; Montgomery, Grant W.

    2012-01-01

    We conducted a genome-wide association (GWA) meta-analysis of 4,604 endometriosis cases and 9,393 controls of Japanese and European ancestry. We show that rs12700667 on chromosome 7p15.2, previously found in Europeans, replicates in Japanese (P = 3.6 × 10(-3)), and confirm association of rs7521902 on 1p36.12 near WNT4. In addition, we establish association of rs13394619 in GREB1 on 2p25.1 and identify a novel locus on 12q22 near VEZT (rs10859871). Excluding European cases with minimal or unknown severity, we identified additional novel loci on 2p14 (rs4141819), 6p22.3 (rs7739264) and 9p21.3 (rs1537377). All seven SNP effects were replicated in an independent cohort and produced P < 5 × 10(-8) in a combined analysis. Finally, we found a significant overlap in polygenic risk for endometriosis between the European and Japanese GWA cohorts (P = 8.8 × 10(-11)), indicating that many weakly associated SNPs represent true endometriosis risk loci and risk prediction and future targeted disease therapy may be transferred across these populations. PMID:23104006

  8. Genome-wide meta-analysis identifies multiple novel associations and ethnic heterogeneity of psoriasis susceptibility

    PubMed Central

    Yin, Xianyong; Low, Hui Qi; Wang, Ling; Li, Yonghong; Ellinghaus, Eva; Han, Jiali; Estivill, Xavier; Sun, Liangdan; Zuo, Xianbo; Shen, Changbing; Zhu, Caihong; Zhang, Anping; Sanchez, Fabio; Padyukov, Leonid; Catanese, Joseph J.; Krueger, Gerald G.; Duffin, Kristina Callis; Mucha, Sören; Weichenthal, Michael; Weidinger, Stephan; Lieb, Wolfgang; Foo, Jia Nee; Li, Yi; Sim, Karseng; Liany, Herty; Irwan, Ishak; Teo, Yikying; Theng, Colin T. S.; Gupta, Rashmi; Bowcock, Anne; De Jager, Philip L.; Qureshi, Abrar A.; de Bakker, Paul I. W.; Seielstad, Mark; Liao, Wilson; Ståhle, Mona; Franke, Andre; Zhang, Xuejun; Liu, Jianjun

    2015-01-01

    Psoriasis is a common inflammatory skin disease with complex genetics and different degrees of prevalence across ethnic populations. Here we present the largest trans-ethnic genome-wide meta-analysis (GWMA) of psoriasis in 15,369 cases and 19,517 controls of Caucasian and Chinese ancestries. We identify four novel associations at LOC144817, COG6, RUNX1 and TP63, as well as three novel secondary associations within IFIH1 and IL12B. Fine-mapping analysis of MHC region demonstrates an important role for all three HLA class I genes and a complex and heterogeneous pattern of HLA associations between Caucasian and Chinese populations. Further, trans-ethnic comparison suggests population-specific effect or allelic heterogeneity for 11 loci. These population-specific effects contribute significantly to the ethnic diversity of psoriasis prevalence. This study not only provides novel biological insights into the involvement of immune and keratinocyte development mechanism, but also demonstrates a complex and heterogeneous genetic architecture of psoriasis susceptibility across ethnic populations. PMID:25903422

  9. Meta-analysis of genome-wide association from genomic prediction models.

    PubMed

    Bernal Rubio, Y L; Gualdrón Duarte, J L; Bates, R O; Ernst, C W; Nonneman, D; Rohrer, G A; King, A; Shackelford, S D; Wheeler, T L; Cantet, R J C; Steibel, J P

    2016-02-01

    Genome-wide association (GWA) studies based on GBLUP models are a common practice in animal breeding. However, effect sizes of GWA tests are small, requiring larger sample sizes to enhance power of detection of rare variants. Because of difficulties in increasing sample size in animal populations, one alternative is to implement a meta-analysis (MA), combining information and results from independent GWA studies. Although this methodology has been used widely in human genetics, implementation in animal breeding has been limited. Thus, we present methods to implement a MA of GWA, describing the proper approach to compute weights derived from multiple genomic evaluations based on animal-centric GBLUP models. Application to real datasets shows that MA increases power of detection of associations in comparison with population-level GWA, allowing for population structure and heterogeneity of variance components across populations to be accounted for. Another advantage of MA is that it does not require access to genotype data that is required for a joint analysis. Scripts related to the implementation of this approach, which consider the strength of association as well as the sign, are distributed and thus account for heterogeneity in association phase between QTL and SNPs. Thus, MA of GWA is an attractive alternative to summarizing results from multiple genomic studies, avoiding restrictions with genotype data sharing, definition of fixed effects and different scales of measurement of evaluated traits. PMID:26607299

  10. Genome-Wide Association Study and Pathway-Level Analysis of Tocochromanol Levels in Maize Grain

    PubMed Central

    Lipka, Alexander E.; Gore, Michael A.; Magallanes-Lundback, Maria; Mesberg, Alex; Lin, Haining; Tiede, Tyler; Chen, Charles; Buell, C. Robin; Buckler, Edward S.; Rocheford, Torbert; DellaPenna, Dean

    2013-01-01

    Tocopherols and tocotrienols, collectively known as tocochromanols, are the major lipid-soluble antioxidants in maize (Zea mays L.) grain. Given that individual tocochromanols differ in their degree of vitamin E activity, variation for tocochromanol composition and content in grain from among diverse maize inbred lines has important nutritional and health implications for enhancing the vitamin E and antioxidant contents of maize-derived foods through plant breeding. Toward this end, we conducted a genome-wide association study of six tocochromanol compounds and 14 of their sums, ratios, and proportions with a 281 maize inbred association panel that was genotyped for 591,822 SNP markers. In addition to providing further insight into the association between ZmVTE4 (γ-tocopherol methyltransferase) haplotypes and α-tocopherol content, we also detected a novel association between ZmVTE1 (tocopherol cyclase) and tocotrienol composition. In a pathway-level analysis, we assessed the genetic contribution of 60 a priori candidate genes encoding the core tocochromanol pathway (VTE genes) and reactions for pathways supplying the isoprenoid tail and aromatic head group of tocochromanols. This analysis identified two additional genes, ZmHGGT1 (homogentisate geranylgeranyltransferase) and one prephenate dehydratase paralog (of four in the genome) that also modestly contribute to tocotrienol variation in the panel. Collectively, our results provide the most favorable ZmVTE4 haplotype and suggest three new gene targets for increasing vitamin E and antioxidant levels through marker-assisted selection. PMID:23733887

  11. Multiple SNP-sets Analysis for Genome-wide Association Studies through Bayesian Latent Variable Selection

    PubMed Central

    Lu, Zhaohua; Zhu, Hongtu; Knickmeyer, Rebecca C; Sullivan, Patrick F.; Williams, Stephanie N.; Zou, Fei

    2015-01-01

    The power of genome-wide association studies (GWAS) for mapping complex traits with single SNP analysis may be undermined by modest SNP effect sizes, unobserved causal SNPs, correlation among adjacent SNPs, and SNP-SNP interactions. Alternative approaches for testing the association between a single SNP-set and individual phenotypes have been shown to be promising for improving the power of GWAS. We propose a Bayesian latent variable selection (BLVS) method to simultaneously model the joint association mapping between a large number of SNP-sets and complex traits. Compared to single SNP-set analysis, such joint association mapping not only accounts for the correlation among SNP-sets, but also is capable of detecting causal SNP-sets that are marginally uncorrelated with traits. The spike-slab prior assigned to the effects of SNP-sets can greatly reduce the dimension of effective SNP-sets, while speeding up computation. An efficient MCMC algorithm is developed. Simulations demonstrate that BLVS outperforms several competing variable selection methods in some important scenarios. PMID:26515609

  12. Genome-wide analysis uncovers novel recurrent alterations in primary central nervous system lymphomas

    PubMed Central

    Braggio, Esteban; Van Wier, Scott; Ojha, Juhi; McPhail, Ellen; Asmann, Yan W.; Egan, Jan; da Silva, Jackline Ayres; Schiff, David; Lopes, M Beatriz; Decker, Paul A; Valdez, Riccardo; Tibes, Raoul; Eckloff, Bruce; Witzig, Thomas E.; Stewart, A Keith; Fonseca, Rafael; O’Neill, Brian Patrick

    2015-01-01

    Purpose: Primary central nervous system lymphoma (PCNSL) is an aggressive non-Hodgkin lymphoma confined to the CNS. Whether there is a PCNSL-specific genomic signature and, if so, how it differs from systemic diffuse large B-cell lymphoma (DLBCL) is uncertain. Experimental design: We performed a comprehensive genomic study of tumor samples from 19 immunocompetent PCNSL patients. Testing comprised array-comparative genomic hybridization and whole exome sequencing. Results: Biallelic inactivation of TOX and PRKCD was recurrently found in PCNSL but not in systemic DLBCL, suggesting a specific role in PCNSL pathogenesis. Additionally, we found a high prevalence of MYD88 mutations (79%) and CDKN2A biallelic loss (60%). Several genes recurrently affected in PCNSL were common with systemic DLBCL, including loss of TNFAIP3, PRDM1, GNA13, TMEM30A, TBL1XR1, B2M, CD58, activating mutations of CD79B, CARD11 and translocations IgH-BCL6. Overall, BCR/TLR/NF-κB pathways were altered in >90% of PCNSL, highlighting its value for targeted therapeutic approaches. Furthermore, integrated analysis showed enrichment of pathways associated with immune response, proliferation, apoptosis, and lymphocyte differentiation. Conclusions: In summary, genome-wide analysis uncovered novel recurrent alterations, including TOX and PRKCD, helping to differentiate PCNSL from systemic DLBCL and related lymphomas. PMID:25991819

  13. Genome-wide association meta-analysis of neuropathologic features of Alzheimer's disease and related dementias.

    PubMed

    Beecham, Gary W; Hamilton, Kara; Naj, Adam C; Martin, Eden R; Huentelman, Matt; Myers, Amanda J; Corneveaux, Jason J; Hardy, John; Vonsattel, Jean-Paul; Younkin, Steven G; Bennett, David A; De Jager, Philip L; Larson, Eric B; Crane, Paul K; Kamboh, M Ilyas; Kofler, Julia K; Mash, Deborah C; Duque, Linda; Gilbert, John R; Gwirtsman, Harry; Buxbaum, Joseph D; Kramer, Patricia; Dickson, Dennis W; Farrer, Lindsay A; Frosch, Matthew P; Ghetti, Bernardino; Haines, Jonathan L; Hyman, Bradley T; Kukull, Walter A; Mayeux, Richard P; Pericak-Vance, Margaret A; Schneider, Julie A; Trojanowski, John Q; Reiman, Eric M; Schellenberg, Gerard D; Montine, Thomas J

    2014-09-01

    Alzheimer's disease (AD) and related dementias are a major public health challenge and present a therapeutic imperative for which we need additional insight into molecular pathogenesis. We performed a genome-wide association study and analysis of known genetic risk loci for AD dementia using neuropathologic data from 4,914 brain autopsies. Neuropathologic data were used to define clinico-pathologic AD dementia or controls, assess core neuropathologic features of AD (neuritic plaques, NPs; neurofibrillary tangles, NFTs), and evaluate commonly co-morbid neuropathologic changes: cerebral amyloid angiopathy (CAA), Lewy body disease (LBD), hippocampal sclerosis of the elderly (HS), and vascular brain injury (VBI). Genome-wide significance was observed for clinico-pathologic AD dementia, NPs, NFTs, CAA, and LBD with a number of variants in and around the apolipoprotein E gene (APOE). GalNAc transferase 7 (GALNT7), ATP-Binding Cassette, Sub-Family G (WHITE), Member 1 (ABCG1), and an intergenic region on chromosome 9 were associated with NP score; and Potassium Large Conductance Calcium-Activated Channel, Subfamily M, Beta Member 2 (KCNMB2) was strongly associated with HS. Twelve of the 21 non-APOE genetic risk loci for clinically-defined AD dementia were confirmed in our clinico-pathologic sample: CR1, BIN1, CLU, MS4A6A, PICALM, ABCA7, CD33, PTK2B, SORL1, MEF2C, ZCWPW1, and CASS4 with 9 of these 12 loci showing larger odds ratio in the clinico-pathologic sample. Correlation of effect sizes for risk of AD dementia with effect size for NFTs or NPs showed positive correlation, while those for risk of VBI showed a moderate negative correlation. The other co-morbid neuropathologic features showed only nominal association with the known AD loci. Our results discovered new genetic associations with specific neuropathologic features and aligned known genetic risk for AD dementia with specific neuropathologic changes in the largest brain autopsy study of AD and related dementias.

  14. A cellular genome-wide association study reveals human variation in microtubule stability and a role in inflammatory cell death

    PubMed Central

    Salinas, Raul E.; Ogohara, Cassandra; Thomas, Monica I.; Shukla, Kajal P.; Miller, Samuel I.; Ko, Dennis C.

    2014-01-01

    Pyroptosis is proinflammatory cell death that occurs in response to certain microbes. Activation of the protease caspase-1 by molecular platforms called inflammasomes is required for pyroptosis. We performed a cellular genome-wide association study (GWAS) using Salmonella typhimurium infection of human lymphoblastoid cell lines as a means of dissecting the genetic architecture of susceptibility to pyroptosis and identifying unknown regulatory mechanisms. Cellular GWAS revealed that a common human genetic difference that regulates pyroptosis also alters microtubule stability. An intergenic single-nucleotide polymorphism on chromosome 18 is associated with decreased pyroptosis and increased expression of TUBB6 (tubulin, β 6 class V). TUBB6 is unique among tubulin isoforms in that its overexpression can completely disrupt the microtubule network. Cells from individuals with higher levels of TUBB6 expression have lower microtubule stability and less pyroptosis. Reducing TUBB6 expression or stabilizing microtubules pharmacologically with paclitaxel (Taxol) increases pyroptosis without affecting the other major readout of caspase-1 activation, interleukin-1β secretion. The results reveal a new role for microtubules and possibly specific tubulin isoforms in the execution of pyroptosis. Furthermore, the finding that there is common diversity in TUBB6 expression and microtubule stability could have broad consequences for other microtubule-dependent phenotypes, diseases, and pharmacological responses. PMID:24173717

  15. A genome-wide linkage analysis of dementia in the Amish

    PubMed Central

    Hahs, Daniel W.; McCauley, Jacob L.; Crunk, Amy E.; McFarland, Lynne L.; Gaskell, Perry C.; Jiang, Lan; Slifer, Susan H.; Vance, Jeffery M.; Scott, William K.; Welsh-Bohmer, Kathleen A.; Johnson, Stephanie R.; Jackson, Charles E.; Pericak-Vance, Margaret A.; Haines, Jonathan L.

    2008-01-01

    Susceptibility genes for Alzheimer's disease are proving to be highly challenging to detect and verify. Population heterogeneity may be a significant confounding factor contributing to this difficulty. To increase the power for disease susceptibility gene detection we conducted a genome-wide genetic linkage screen using individuals from the relatively isolated, genetically homogeneous, Amish population. Our genome linkage analysis used a 407 microsatellite marker map (average density 7 cM) to search for autosomal genes linked to dementia in five Amish families from four Midwestern U.S. counties. Our highest two-point lod score (3.01) was observed at marker D4S1548 on chromosome 4q31. Five other regions (10q22, 3q28, 11p13, 4q28, 19p13) also demonstrated suggestive linkage with markers having two-point lod scores >2.0. While two of these regions are novel (4q31 and 11p13), the other regions lie close to regions identified in previous genome scans in other populations. Our results identify regions of the genome that may harbor genes involved in a subset of dementia patients, in particular the North American Amish community. PMID:16389594

  16. Genome-wide association analysis of susceptibility and clinical phenotype in multiple sclerosis.

    PubMed

    Baranzini, Sergio E; Wang, Joanne; Gibson, Rachel A; Galwey, Nicholas; Naegelin, Yvonne; Barkhof, Frederik; Radue, Ernst-Wilhelm; Lindberg, Raija L P; Uitdehaag, Bernard M G; Johnson, Michael R; Angelakopoulou, Aspasia; Hall, Leslie; Richardson, Jill C; Prinjha, Rab K; Gass, Achim; Geurts, Jeroen J G; Kragt, Jolijn; Sombekke, Madeleine; Vrenken, Hugo; Qualley, Pamela; Lincoln, Robin R; Gomez, Refujia; Caillier, Stacy J; George, Michaela F; Mousavi, Hourieh; Guerrero, Rosa; Okuda, Darin T; Cree, Bruce A C; Green, Ari J; Waubant, Emmanuelle; Goodin, Douglas S; Pelletier, Daniel; Matthews, Paul M; Hauser, Stephen L; Kappos, Ludwig; Polman, Chris H; Oksenberg, Jorge R

    2009-02-15

    Multiple sclerosis (MS), a chronic disorder of the central nervous system and common cause of neurological disability in young adults, is characterized by moderate but complex risk heritability. Here we report the results of a genome-wide association study performed in a 1000 prospective case series of well-characterized individuals with MS and group-matched controls using the Sentrix HumanHap550 BeadChip platform from Illumina. After stringent quality control data filtering, we compared allele frequencies for 551 642 SNPs in 978 cases and 883 controls and assessed genotypic influences on susceptibility, age of onset, disease severity, as well as brain lesion load and normalized brain volume from magnetic resonance imaging exams. A multi-analytical strategy identified 242 susceptibility SNPs exceeding established thresholds of significance, including 65 within the MHC locus in chromosome 6p21.3. Independent replication confirms a role for GPC5, a heparan sulfate proteoglycan, in disease risk. Gene ontology-based analysis shows a functional dichotomy between genes involved in the susceptibility pathway and those affecting the clinical phenotype. PMID:19010793
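
    The abstract states that allele frequencies were compared between cases and controls but does not name the test. As an illustration only, the sketch below applies a generic 1-degree-of-freedom chi-square test to a 2x2 table of allele counts for a single SNP. The function name and all counts are hypothetical, and this should not be read as the authors' multi-analytical strategy.

```python
import math


def allelic_chi_square(case_a: int, case_b: int,
                       ctrl_a: int, ctrl_b: int) -> tuple[float, float]:
    """Pearson chi-square (1 df) on a 2x2 table of allele counts.

    Rows are cases and controls; columns are counts of allele A and allele B.
    Returns (chi-square statistic, p-value).
    """
    n = case_a + case_b + ctrl_a + ctrl_b
    row_cases = case_a + case_b
    row_ctrls = ctrl_a + ctrl_b
    col_a = case_a + ctrl_a
    col_b = case_b + ctrl_b
    if min(row_cases, row_ctrls, col_a, col_b) == 0:
        raise ValueError("every row and column total must be non-zero")

    chi2 = (n * (case_a * ctrl_b - case_b * ctrl_a) ** 2
            / (row_cases * row_ctrls * col_a * col_b))
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value
```

    In a genome-wide setting the resulting p-value would still need correction for the number of SNPs tested; that step is omitted here.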

  17. Genome-wide gene expression analysis of mouse embryonic stem cells exposed to p-dichlorobenzene.

    PubMed

    Tani, Hidenori; Takeshita, Jun-Ichi; Aoki, Hiroshi; Abe, Ryosuke; Toyoda, Akinobu; Endo, Yasunori; Miyamoto, Sadaaki; Gamo, Masashi; Torimura, Masaki

    2016-09-01

    Because of the limitations of whole animal testing approaches for toxicological assessment, new cell-based assay systems have been widely studied. In this study, we focused on two biological products for toxicological assessment: mouse embryonic stem cells (mESCs) and long noncoding RNAs (lncRNAs). mESCs possess the abilities of self-renewal and differentiation into multiple cell types. LncRNAs are an important class of pervasive non-protein-coding transcripts involved in the molecular mechanisms associated with responses to chemicals. We exposed mESCs to p-dichlorobenzene (p-DCB) for 1 or 28 days (daily dose), extracted total RNA, and performed deep sequencing analyses. The genome-wide gene expression analysis indicated that mechanisms modulating proteins occurred following acute and chronic exposures, and mechanisms modulating genomic DNA occurred following chronic exposure. Moreover, our results indicate that three novel lncRNAs (Snora41, Gm19947, and Scarna3a) in mESCs respond to p-DCB exposure. We propose that these lncRNAs have the potential to be surrogate indicators of p-DCB responses in mESCs. PMID:26975756

  18. A genome-wide resource for the analysis of protein localisation in Drosophila

    PubMed Central

    Sarov, Mihail; Barz, Christiane; Jambor, Helena; Hein, Marco Y; Schmied, Christopher; Suchold, Dana; Stender, Bettina; Janosch, Stephan; KJ, Vinay Vikas; Krishnan, RT; Krishnamoorthy, Aishwarya; Ferreira, Irene RS; Ejsmont, Radoslaw K; Finkl, Katja; Hasse, Susanne; Kämpfer, Philipp; Plewka, Nicole; Vinis, Elisabeth; Schloissnig, Siegfried; Knust, Elisabeth; Hartenstein, Volker; Mann, Matthias; Ramaswami, Mani; VijayRaghavan, K; Tomancak, Pavel; Schnorrer, Frank

    2016-01-01

    The Drosophila genome contains >13000 protein-coding genes, the majority of which remain poorly investigated. Important reasons include the lack of antibodies or reporter constructs to visualise these proteins. Here, we present a genome-wide fosmid library of 10000 GFP-tagged clones, comprising tagged genes and most of their regulatory information. For 880 tagged proteins, we created transgenic lines, and for a total of 207 lines, we assessed protein expression and localisation in ovaries, embryos, pupae or adults by stainings and live imaging approaches. Importantly, we visualised many proteins at endogenous expression levels and found a large fraction of them localising to subcellular compartments. By applying genetic complementation tests, we estimate that about two-thirds of the tagged proteins are functional. Moreover, these tagged proteins enable interaction proteomics from developing pupae and adult flies. Taken together, this resource will boost systematic analysis of protein expression and localisation in various cellular and developmental contexts. DOI: http://dx.doi.org/10.7554/eLife.12068.001 PMID:26896675

  19. Genome-wide DNA methylation analysis in cohesin mutant human cell lines

    PubMed Central

    Liu, Jinglan; Zhang, Zhe; Bando, Masashige; Itoh, Takehiko; Deardorff, Matthew A.; Li, Jennifer R.; Clark, Dinah; Kaur, Maninder; Tatsuro, Kondo; Kline, Antonie D.; Chang, Celia; Vega, Hugo; Jackson, Laird G.; Spinner, Nancy B.; Shirahige, Katsuhiko; Krantz, Ian D.

    2010-01-01

    The cohesin complex has recently been shown to be a key regulator of eukaryotic gene expression, although the mechanisms by which it exerts its effects are poorly understood. We have undertaken a genome-wide analysis of DNA methylation in cohesin-deficient cell lines from probands with Cornelia de Lange syndrome (CdLS). Heterozygous mutations in NIPBL, SMC1A and SMC3 genes account for ∼65% of individuals with CdLS. SMC1A and SMC3 are subunits of the cohesin complex that controls sister chromatid cohesion, whereas NIPBL facilitates cohesin loading and unloading. We have examined the methylation status of 27 578 CpG dinucleotides in 72 CdLS and control samples. We have documented the DNA methylation pattern in human lymphoblastoid cell lines (LCLs) as well as identified specific differential DNA methylation in CdLS. Subgroups of CdLS probands and controls can be classified using selected CpG loci. The X chromosome was also found to have a unique DNA methylation pattern in CdLS. Cohesin preferentially binds to hypo-methylated DNA in control LCLs, whereas the differential DNA methylation alters cohesin binding in CdLS. Our results suggest that in addition to DNA methylation multiple mechanisms may be involved in transcriptional regulation in human cells and in the resultant gene misexpression in CdLS. PMID:20448023

  20. A molecular scheme for Yersinia enterocolitica patho-serotyping derived from genome-wide analysis.

    PubMed

    Garzetti, Debora; Susen, Rosa; Fruth, Angelika; Tietze, Erhard; Heesemann, Jürgen; Rakin, Alexander

    2014-05-01

    Yersinia enterocolitica is a food-borne, gastro-intestinal pathogen with world-wide distribution. Only 11 serotypes have been isolated from patients, with O:3, O:9, O:8 and O:5,27 being the serotypes most commonly associated with human yersiniosis. Serotype is an important characteristic of Y. enterocolitica strains, allowing differentiation for epidemiology, diagnosis and phylogeny studies. Conventional serotyping, performed by slide agglutination, is a tedious and laborious procedure whose interpretation tends to be subjective, leading to poor reproducibility. Here we present a PCR-based typing scheme for molecular identification and patho-serotyping of Y. enterocolitica. Genome-wide comparison of Y. enterocolitica sequences allowed analysis of the O-antigen gene clusters of different serotypes, uncovering their formerly unknown genomic locations, and selection of targets for serotype-specific amplification. Two multiplex PCRs and one additional PCR were designed and tested on various reference strains and isolates from different origins. Our genotypic assay proved to be highly specific for identification of Y. enterocolitica species, discrimination between virulent and non-virulent strains, distinguishing the main human-related serotypes, and typing of conventionally untypeable strains. This genotyping scheme could be applied in microbiology laboratories as an alternative or complementary method to the traditional phenotypic assays, providing data for epidemiological studies. PMID:24246413

  1. High-Performance Mixed Models Based Genome-Wide Association Analysis with omicABEL software

    PubMed Central

    Fabregat-Traver, Diego; Sharapov, Sodbo Zh.; Hayward, Caroline; Rudan, Igor; Campbell, Harry; Aulchenko, Yurii; Bientinesi, Paolo

    2014-01-01

    To raise the power of genome-wide association studies (GWAS) and avoid false-positive results in structured populations, one can rely on mixed model based tests. When large samples are used, and when multiple traits are to be studied in the ’omics’ context, this approach becomes computationally challenging. Here we consider the problem of mixed-model based GWAS for arbitrary number of traits, and demonstrate that for the analysis of single-trait and multiple-trait scenarios different computational algorithms are optimal. We implement these optimal algorithms in a high-performance computing framework that uses state-of-the-art linear algebra kernels, incorporates optimizations, and avoids redundant computations, increasing throughput while reducing memory usage and energy consumption. We show that, compared to existing libraries, our algorithms and software achieve considerable speed-ups. The OmicABEL software described in this manuscript is available under the GNU GPL v. 3 license as part of the GenABEL project for statistical genomics at http://www.genabel.org/packages/OmicABEL. PMID:25717363

  2. High-Performance Mixed Models Based Genome-Wide Association Analysis with omicABEL software.

    PubMed

    Fabregat-Traver, Diego; Sharapov, Sodbo Zh; Hayward, Caroline; Rudan, Igor; Campbell, Harry; Aulchenko, Yurii; Bientinesi, Paolo

    2014-01-01

    To raise the power of genome-wide association studies (GWAS) and avoid false-positive results in structured populations, one can rely on mixed model based tests. When large samples are used, and when multiple traits are to be studied in the 'omics' context, this approach becomes computationally challenging. Here we consider the problem of mixed-model based GWAS for arbitrary number of traits, and demonstrate that for the analysis of single-trait and multiple-trait scenarios different computational algorithms are optimal. We implement these optimal algorithms in a high-performance computing framework that uses state-of-the-art linear algebra kernels, incorporates optimizations, and avoids redundant computations, increasing throughput while reducing memory usage and energy consumption. We show that, compared to existing libraries, our algorithms and software achieve considerable speed-ups. The OmicABEL software described in this manuscript is available under the GNU GPL v. 3 license as part of the GenABEL project for statistical genomics at http://www.genabel.org/packages/OmicABEL. PMID:25717363

  3. Genome-Wide Association Analysis of Adaptation Using Environmentally Predicted Traits.

    PubMed

    van Heerwaarden, Joost; van Zanten, Martijn; Kruijer, Willem

    2015-10-01

    Current methods for studying the genetic basis of adaptation evaluate genetic associations with ecologically relevant traits or single environmental variables, under the implicit assumption that natural selection imposes correlations between phenotypes, environments and genotypes. In practice, observed trait and environmental data are manifestations of unknown selective forces and are only indirectly associated with adaptive genetic variation. In theory, improved estimation of these forces could enable more powerful detection of loci under selection. Here we present an approach in which we approximate adaptive variation by modeling phenotypes as a function of the environment and using the predicted trait in multivariate and univariate genome-wide association analysis (GWAS). Based on computer simulations and published flowering time data from the model plant Arabidopsis thaliana, we find that environmentally predicted traits lead to higher recovery of functional loci in multivariate GWAS and are more strongly correlated to allele frequencies at adaptive loci than individual environmental variables. Our results provide an example of the use of environmental data to obtain independent and meaningful information on adaptive genetic variation. PMID:26496492

  4. Genome-Wide Analysis of Polymorphisms Associated with Cytokine Responses in Smallpox Vaccine Recipients

    PubMed Central

    Kennedy, Richard B.; Ovsyannikova, Inna G.; Pankratz, V. Shane; Haralambieva, Iana H.; Vierkant, Robert A.; Poland, Gregory A.

    2014-01-01

    The role that genetics plays in response to infection or disease is becoming increasingly clear as we learn more about immunogenetics and host-pathogen interactions. Here we report a genome-wide analysis of the effects of host genetic variation on cytokine responses to vaccinia virus stimulation in smallpox vaccine recipients. Our data show that vaccinia stimulation of immune individuals results in secretion of inflammatory and Th1 cytokines. We identified multiple SNPs significantly associated with variations in cytokine secretion. These SNPs are found in genes with known immune function, as well as in genes encoding for proteins involved in signal transduction, cytoskeleton, membrane channels and ion transport, as well as others with no previously identified connection to immune responses. The large number of significant SNP associations implies that cytokine secretion in response to vaccinia virus is a complex process controlled by multiple genes and gene families. Follow-up studies to replicate these findings and then pursue mechanistic studies will provide a greater understanding of how genetic variation influences vaccine responses. PMID:22610502

  5. The Genome-Wide Analysis of Carcinoembryonic Antigen Signaling by Colorectal Cancer Cells Using RNA Sequencing.

    PubMed

    Bajenova, Olga; Gorbunova, Anna; Evsyukov, Igor; Rayko, Michael; Gapon, Svetlana; Bozhokina, Ekaterina; Shishkin, Alexander; O'Brien, Stephen J

    2016-01-01

    Carcinoembryonic antigen (CEA, CEACAM5, CD66) is a promoter of metastasis in epithelial cancers that is widely used as a prognostic clinical marker of metastasis. The aim of this study is to identify the network of genes that are associated with CEA-induced colorectal cancer liver metastasis. We compared the genome-wide transcriptomic profiles of CEA positive (MIP101 clone 8) and CEA negative (MIP 101) colorectal cancer cell lines with different metastatic potential in vivo. The CEA-producing cells displayed quantitative changes in the level of expression for 100 genes (over-expressed or down-regulated). They were confirmed by quantitative RT-PCR. The KEGG pathway analysis identified 4 significantly enriched pathways: cytokine-cytokine receptor interaction, MAPK signaling pathway, TGF-beta signaling pathway and pyrimidine metabolism. Our results suggest that CEA production by colorectal cancer cells triggers colorectal cancer progression by inducing the epithelial-mesenchymal transition, increasing tumor cell invasiveness into the surrounding tissues and suppressing stress and apoptotic signaling. The novel gene expression distinctions establish the relationships between the existing cancer markers and implicate new potential biomarkers for colorectal cancer hepatic metastasis. PMID:27583792

  6. Genome-wide characterization and comparative analysis of the MLO gene family in cotton.

    PubMed

    Wang, Xiaoyan; Ma, Qifeng; Dou, Lingling; Liu, Zhen; Peng, Renhai; Yu, Shuxun

    2016-06-01

    In plants, MLO (Mildew Locus O) gene encodes a plant-specific seven transmembrane (TM) domain protein involved in several cellular processes, including susceptibility to powdery mildew (PM). In this study, a genome-wide characterization of the MLO gene family in G. raimondii L., G. arboreum L. and G. hirsutum L. was performed. In total, 22, 17 and 38 homologous sequences were identified for each species, respectively. Gene organization, including chromosomal location, gene clustering and gene duplication, was investigated. Homologues related to PM susceptibility in upland cotton were inferred by phylogenetic relationships with functionally characterized MLO proteins. To conduct a comparative analysis between MLO candidate genes from G. raimondii L., G. arboreum L. and G. hirsutum L., orthologous relationships and conserved synteny blocks were constructed. The transcriptional variation of 38 GhMLO genes in response to exogenous application of salt, mannitol (Man), abscisic acid (ABA), ethylene (ETH), jasmonic acid (JA) and salicylic acid (SA) was monitored. Further studies should be conducted to elucidate the functions of MLO genes in PM susceptibility and phytohormone signalling pathways. PMID:26986931

  7. A genome-wide SNP-based phylogenetic analysis distinguishes different biovars of Brucella suis.

    PubMed

    Sankarasubramanian, Jagadesan; Vishnu, Udayakumar S; Gunasekaran, Paramasamy; Rajendhran, Jeyaprakash

    2016-07-01

    Brucellosis is an important zoonotic disease caused by Brucella spp. Brucella suis is the etiological agent of porcine brucellosis. B. suis is the most genetically diverged species within the genus Brucella. We present the first large-scale B. suis phylogenetic analysis based on an alignment-free k-mer approach of gathering polymorphic sites from whole genome sequences. Genome-wide core-SNP based phylogenetic tree clearly differentiated and discriminated the B. suis biovars and the vaccine strain into different clades. A total of 16,756 SNPs were identified from the genome sequences of 54 B. suis strains. Also, biovar-specific SNPs were identified. The vaccine strain B. suis S2-30 is extensively used in China, which was discriminated from all biovars with the accumulation of the highest number of SNPs. We have also identified the SNPs between B. suis vaccine strain S2-30 and its closest homolog, B. suis biovar 513UK. The highest number of mutations (22) was observed in the phosphomannomutase (pmm) gene essential for the synthesis of O-antigen. Also, mutations were identified in several virulent genes including genes coding for type IV secretion system and the effector proteins, which could be responsible for the attenuated virulence of B. suis S2-30. PMID:27085292

  8. GENOME-WIDE LINKAGE ANALYSIS OF PULSE PRESSURE IN AMERICAN INDIANS: THE STRONG HEART STUDY

    PubMed Central

    Franceschini, Nora; MacCluer, Jean W.; Rose, Kathreen M.; Rutherford, Sue; Cole, Shelley A.; Laston, Sandy; Göring, Harald H.H.; Diego, Vincent P.; Roman, Mary J.; Lee, Elisa T.; Best, Lyle G.; Howard, Barbara V.; Fabsitz, Richard R.; North, Kari E.

    2010-01-01

    Background Pulse pressure, a measure of central arterial stiffness and a predictor of cardiovascular mortality, has known genetic components. Methods To localize the genetic effects of pulse pressure, we conducted a genome-wide linkage analysis of 1,892 American Indian participants of the Strong Heart Family Study. Blood pressure was measured three times and the average of the last two measures was used for analyses. Pulse pressure, the difference between systolic and diastolic blood pressures, was log-transformed and adjusted for the effects of age and sex within each study center. Variance component linkage analyses were performed using marker allele frequencies derived from all individuals and multipoint identity-by-descent matrices calculated in Loki. Results We identified a quantitative trait locus influencing pulse pressure on chromosome 7 at 37 cM (marker D7S493, LOD=3.3) and suggestive evidence of linkage on chromosome 19 at 92 cM (marker D19S888, LOD=1.8). Conclusions The signal on 7p15.3 overlaps positive findings for pulse pressure among Utah population samples, suggesting that this region may harbor gene variants for blood pressure related traits. PMID:18188160
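
    The phenotype definition in the Methods can be sketched as follows, assuming three (systolic, diastolic) readings in mmHg per participant. The function name is hypothetical, the natural logarithm is an assumption (the abstract does not state the base), and the adjustment for age and sex within study center is omitted.

```python
import math
from statistics import mean


def pulse_pressure_phenotype(readings: list[tuple[float, float]]) -> float:
    """Log-transformed pulse pressure from three blood pressure readings.

    Follows the abstract: the first reading is discarded, the last two are
    averaged, and pulse pressure is systolic minus diastolic pressure.
    The natural log is an assumption; the abstract does not give the base.
    """
    if len(readings) != 3:
        raise ValueError("expected exactly three (systolic, diastolic) readings")

    last_two = readings[1:]
    systolic = mean(s for s, _ in last_two)
    diastolic = mean(d for _, d in last_two)

    pulse_pressure = systolic - diastolic
    if pulse_pressure <= 0:
        raise ValueError("pulse pressure must be positive to take its log")
    return math.log(pulse_pressure)
```

    For hypothetical readings of (130, 80), (120, 80) and (124, 76) mmHg, the averaged pressures are 122 and 78 mmHg, so the function returns the natural log of 44.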

  9. Mammalian NET-seq analysis defines nascent RNA profiles and associated RNA processing genome-wide.

    PubMed

    Nojima, Takayuki; Gomes, Tomás; Carmo-Fonseca, Maria; Proudfoot, Nicholas J

    2016-03-01

    The transcription cycle of RNA polymerase II (Pol II) correlates with changes to the phosphorylation state of its large subunit C-terminal domain (CTD). We recently developed Native Elongation Transcript sequencing using mammalian cells (mNET-seq), which generates single-nucleotide-resolution genome-wide profiles of nascent RNA and co-transcriptional RNA processing that are associated with different CTD phosphorylation states. Here we provide a detailed protocol for mNET-seq. First, Pol II elongation complexes are isolated with specific phospho-CTD antibodies from chromatin solubilized by micrococcal nuclease digestion. Next, RNA derived from within the Pol II complex is size fractionated and Illumina sequenced. Using mNET-seq, we have previously shown that Pol II pauses at both ends of protein-coding genes but with different CTD phosphorylation patterns, and we have also detected phosphorylation at serine 5 (Ser5-P) CTD-specific splicing intermediates and Pol II accumulation over co-transcriptionally spliced exons. With moderate biochemical and bioinformatic skills, mNET-seq can be completed in ∼6 d, not including sequencing and data analysis. PMID:26844429

  10. Genome-Wide Association Analysis of Adaptation Using Environmentally Predicted Traits

    PubMed Central

    van Zanten, Martijn

    2015-01-01

    Current methods for studying the genetic basis of adaptation evaluate genetic associations with ecologically relevant traits or single environmental variables, under the implicit assumption that natural selection imposes correlations between phenotypes, environments and genotypes. In practice, observed trait and environmental data are manifestations of unknown selective forces and are only indirectly associated with adaptive genetic variation. In theory, improved estimation of these forces could enable more powerful detection of loci under selection. Here we present an approach in which we approximate adaptive variation by modeling phenotypes as a function of the environment and using the predicted trait in multivariate and univariate genome-wide association analysis (GWAS). Based on computer simulations and published flowering time data from the model plant Arabidopsis thaliana, we find that environmentally predicted traits lead to higher recovery of functional loci in multivariate GWAS and are more strongly correlated to allele frequencies at adaptive loci than individual environmental variables. Our results provide an example of the use of environmental data to obtain independent and meaningful information on adaptive genetic variation. PMID:26496492

  11. Meta-analysis and genome-wide interpretation of genetic susceptibility to drug addiction

    PubMed Central

    2011-01-01

    Background Classical genetic studies provide strong evidence for heritable contributions to susceptibility to developing dependence on addictive substances. Candidate gene and genome-wide association studies (GWAS) have sought genes, chromosomal regions and allelic variants likely to contribute to susceptibility to drug addiction. Results Here, we performed a meta-analysis of addiction candidate gene association studies and GWAS to investigate possible functional mechanisms associated with addiction susceptibility. From meta-data retrieved from 212 publications on candidate gene association studies and 5 GWAS reports, we linked a total of 843 haplotypes to addiction susceptibility. We mapped the SNPs in these haplotypes to functional and regulatory elements in the genome and estimated the magnitude of the contributions of different molecular mechanisms to their effects on addiction susceptibility. In addition to SNPs in coding regions, these data suggest that haplotypes in gene regulatory regions may also contribute to addiction susceptibility. When we compared the lists of genes identified by association studies and those identified by molecular biological studies of drug-regulated genes, we observed significantly higher participation in the same gene interaction networks than expected by chance, despite little overlap between the two gene lists. Conclusions These results appear to offer new insights into the genetic factors underlying drug addiction. PMID:21999673

  12. Genome-wide identification, evolutionary and expression analysis of the aspartic protease gene superfamily in grape

    PubMed Central

    2013-01-01

    Background Aspartic proteases (APs) are a large family of proteolytic enzymes found in almost all organisms. In plants, they are involved in many biological processes, such as senescence, stress responses, programmed cell death, and reproduction. Prior to the present study, no grape AP gene(s) had been reported, and their research on woody species was very limited. Results In this study, a total of 50 AP genes (VvAP) were identified in the grape genome, among which 30 contained the complete ASP domain. Synteny analysis within grape indicated that segmental and tandem duplication events contributed to the expansion of the grape AP family. Additional analysis between grape and Arabidopsis demonstrated that several grape AP genes were found in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of grape and Arabidopsis. Phylogenetic relationships of the 30 VvAPs with the complete ASP domain and their Arabidopsis orthologs, as well as their gene and protein features were analyzed and their cellular localization was predicted. Moreover, expression profiles of VvAP genes in six different tissues were determined, and their transcript abundance under various stresses and hormone treatments were measured. Twenty-seven VvAP genes were expressed in at least one of the six tissues examined; nineteen VvAPs responded to at least one abiotic stress, 12 VvAPs responded to powdery mildew infection, and most of the VvAPs responded to SA and ABA treatments. Furthermore, integrated synteny and phylogenetic analysis identified orthologous AP genes between grape and Arabidopsis, providing a unique starting point for investigating the function of grape AP genes. Conclusions The genome-wide identification, evolutionary and expression analyses of grape AP genes provide a framework for future analysis of AP genes in defining their roles during stress response. Integrated synteny and phylogenetic analyses provide novel insight into the

  13. Genome-Wide Transcriptome and Expression Profile Analysis of Phalaenopsis during Explant Browning

    PubMed Central

    Xu, Chuanjun; Zeng, Biyu; Huang, Junmei; Huang, Wen; Liu, Yumei

    2015-01-01

    Background Explant browning presents a major problem for in vitro culture, and can lead to the death of the explant and failure of regeneration. Considerable work has examined the physiological mechanisms underlying Phalaenopsis leaf explant browning, but the molecular mechanisms of browning remain elusive. In this study, we used whole genome RNA sequencing to examine Phalaenopsis leaf explant browning at genome-wide level. Methodology/Principal Findings We first used Illumina high-throughput technology to sequence the transcriptome of Phalaenopsis and then performed de novo transcriptome assembly. We assembled 79,434,350 clean reads into 31,708 isogenes and generated 26,565 annotated unigenes. We assigned Gene Ontology (GO) terms, Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations, and potential Pfam domains to each transcript. Using the transcriptome data as a reference, we next analyzed the differential gene expression of explants cultured for 0, 3, and 6 d, respectively. We then identified differentially expressed genes (DEGs) before and after Phalaenopsis explant browning. We also performed GO, KEGG functional enrichment and Pfam analysis of all DEGs. Finally, we selected 11 genes for quantitative real-time PCR (qPCR) analysis to confirm the expression profile analysis. Conclusions/Significance Here, we report the first comprehensive analysis of transcriptome and expression profiles during Phalaenopsis explant browning. Our results suggest that Phalaenopsis explant browning may be due in part to gene expression changes that affect the secondary metabolism, such as: phenylpropanoid pathway and flavonoid biosynthesis. Genes involved in photosynthesis and ATPase activity have been found to be changed at transcription level; these changes may perturb energy metabolism and thus lead to the decay of plant cells and tissues. This study provides comprehensive gene expression data for Phalaenopsis browning. Our data constitute an important resource for further