Ben-Yakar, Adela; Bourgeois, Frederic
Summary The use of ultrafast laser pulses in surgery has allowed for unprecedented precision with minimal collateral damage to surrounding tissues. For these reasons, ultrafast laser nanosurgery, as an injury model, has gained tremendous momentum in experimental biology ranging from in-vitro manipulations of subcellular structures to in-vivo studies in whole living organisms. For example, femtosecond laser nanosurgery on such model organism as the nematode Caenorhabditis elegans (C. elegans) has opened new opportunities for in-vivo nerve regeneration studies. Meanwhile, the development of novel microfluidic devices has brought the control in experimental environment to the level required for precise nanosurgery in various animal models. Merging microfluidics and laser nanosurgery has recently improved the specificities and increased the speed of laser surgeries enabling fast genome-wide screenings that can more readily decode the genetic map of various biological processes. PMID:19278850
The capacity to identify immunogens for vaccine development by genome-wide screening has been markedly enhanced by the availability of complete microbial genome sequences coupled to rapid proteomic and bioinformatic analysis. Critical to this genome-wide screening is in vivo testing in the context o...
Bassett, Andrew R.; Kong, Lesheng; Liu, Ji-Long
The simplicity of the CRISPR/Cas9 system of genome engineering has opened up the possibility of performing genome-wide targeted mutagenesis in cell lines, enabling screening for cellular phenotypes resulting from genetic aberrations. Drosophila cells have proven to be highly effective in identifying genes involved in cellular processes through similar screens using partial knockdown by RNAi. This is in part due to the lower degree of redundancy between genes in this organism, whilst still maintaining highly conserved gene networks and orthologs of many human disease-causing genes. The ability of CRISPR to generate genetic loss of function mutations not only increases the magnitude of any effect over currently employed RNAi techniques, but allows analysis over longer periods of time which can be critical for certain phenotypes. In this study, we have designed and built a genome-wide CRISPR library covering 13,501 genes, among which 8989 genes are targeted by three or more independent single guide RNAs (sgRNAs). Moreover, we describe strategies to monitor the population of guide RNAs by high throughput sequencing (HTS). We hope that this library will provide an invaluable resource for the community to screen loss of function mutations for cellular phenotypes, and as a source of guide RNA designs for future studies. PMID:26165496
Bassett, Andrew R; Kong, Lesheng; Liu, Ji-Long
The simplicity of the CRISPR/Cas9 system of genome engineering has opened up the possibility of performing genome-wide targeted mutagenesis in cell lines, enabling screening for cellular phenotypes resulting from genetic aberrations. Drosophila cells have proven to be highly effective in identifying genes involved in cellular processes through similar screens using partial knockdown by RNAi. This is in part due to the lower degree of redundancy between genes in this organism, whilst still maintaining highly conserved gene networks and orthologs of many human disease-causing genes. The ability of CRISPR to generate genetic loss of function mutations not only increases the magnitude of any effect over currently employed RNAi techniques, but allows analysis over longer periods of time which can be critical for certain phenotypes. In this study, we have designed and built a genome-wide CRISPR library covering 13,501 genes, among which 8989 genes are targeted by three or more independent single guide RNAs (sgRNAs). Moreover, we describe strategies to monitor the population of guide RNAs by high throughput sequencing (HTS). We hope that this library will provide an invaluable resource for the community to screen loss of function mutations for cellular phenotypes, and as a source of guide RNA designs for future studies.
Hammond, A S; Klein, M R; Corrah, T; Fox, A; Jaye, A; McAdam, K P; Brookes, R H
Mounting evidence suggests human leucocyte antigen (HLA) class I-restricted CD8+ T cells play a role in protective immunity against tuberculosis yet relatively few epitopes specific for the causative organism, Mycobacterium tuberculosis, are reported. Here a total genome-wide screen of M. tuberculosis was used to identify putative HLA-B*3501 T cell epitopes. Of 479 predicted epitopes, 13 with the highest score were synthesized and used to restimulate lymphocytes from naturally exposed HLA-B*3501 healthy individuals in cultured and ex vivo enzyme-linked immunospot (ELISPOT) assays for interferon (IFN)-γ. All 13 peptides elicited a response that varied considerably between individuals. For three peptides CD8+ T cell lines were expanded and four of the 13 were recognized permissively through the HLA-B7 supertype family. Although further testing is required we show the genome-wide screen to be feasible for the identification of unknown mycobacterial antigens involved in immunity against natural infection. While the mechanisms of protective immunity against M. tuberculosis infection remain unclear, conventional class I-restricted CD8+ T cell responses appear to be widespread throughout the genome. PMID:15762882
Panavas, Tadas; Serviene, Elena; Brasher, Jeremy; Nagy, Peter D.
Viruses are devastating pathogens of humans, animals, and plants. To further our understanding of how viruses use the resources of infected cells, we systematically tested the yeast single-gene-knockout library for the effect of each host gene on the replication of tomato bushy stunt virus (TBSV), a positive-strand RNA virus of plants. The genome-wide screen identified 96 host genes whose absence either reduced or increased the accumulation of the TBSV replicon. The identified genes are involved in the metabolism of nucleic acids, lipids, proteins, and other compounds and in protein targeting/transport. Comparison with published genome-wide screens reveals that the replication of TBSV and brome mosaic virus (BMV), which belongs to a different supergroup among plus-strand RNA viruses, is affected by vastly different yeast genes. Moreover, a set of yeast genes involved in vacuolar targeting of proteins and vesicle-mediated transport both affected replication of the TBSV replicon and enhanced the cytotoxicity of the Parkinson's disease-related α-synuclein when this protein was expressed in yeast. In addition, a set of host genes involved in ubiquitin-dependent protein catabolism affected both TBSV replication and the cytotoxicity of a mutant huntingtin protein, a candidate agent in Huntington's disease. This finding suggests that virus infection and disease-causing proteins might use or alter similar host pathways and may suggest connections between chronic diseases and prior virus infection. PMID:15883361
Karlas, Alexander; Berre, Stefano; Couderc, Thérèse; Varjak, Margus; Braun, Peter; Meyer, Michael; Gangneux, Nicolas; Karo-Astover, Liis; Weege, Friderike; Raftery, Martin; Schönrich, Günther; Klemm, Uwe; Wurzlbauer, Anne; Bracher, Franz; Merits, Andres; Meyer, Thomas F.; Lecuit, Marc
Chikungunya virus (CHIKV) is a globally spreading alphavirus against which there is no commercially available vaccine or therapy. Here we use a genome-wide siRNA screen to identify 156 proviral and 41 antiviral host factors affecting CHIKV replication. We analyse the cellular pathways in which human proviral genes are involved and identify druggable targets. Twenty-one small-molecule inhibitors, some of which are FDA approved, targeting six proviral factors or pathways, have high antiviral activity in vitro, with low toxicity. Three identified inhibitors have prophylactic antiviral effects in mouse models of chikungunya infection. Two of them, the calmodulin inhibitor pimozide and the fatty acid synthesis inhibitor TOFA, have a therapeutic effect in vivo when combined. These results demonstrate the value of loss-of-function screening and pathway analysis for the rational identification of small molecules with therapeutic potential and pave the way for the development of new, host-directed, antiviral agents. PMID:27177310
Gao, Xiaoyi; Martin, Eden R.; Liu, Yutao; Mayhew, Gregory; Vance, Jeffery M.; Scott, William K.
Parkinson disease (PD) is a complex, multifactorial neurodegenerative disease with substantial evidence for genetic risk factors. We conducted a genome-wide linkage screen of 5824 single-nucleotide polymorphisms in 278 families of European, non-Hispanic descent to localize regions that harbor susceptibility loci for PD. By using parametric and nonparametric linkage analyses and allowing for genetic heterogeneity among families, we found two loci for PD. Significant evidence for linkage was detected on chromosome 18q11 (maximum lod score [MLOD] = 4.1) and suggestive evidence for linkage was obtained on chromosome 3q25 (MLOD = 2.5). These results were strongest in families not previously screened for linkage, and simulation studies suggest that these findings are likely due to locus heterogeneity rather than random statistical error. The finding of two loci (one highly statistically significant) suggests that additional PD susceptibility genes might be identified through targeted candidate gene studies in these regions. PMID:19327735
Díaz-Martínez, Laura A; Karamysheva, Zemfira N; Warrington, Ross; Li, Bing; Wei, Shuguang; Xie, Xian-Jin; Roth, Michael G; Yu, Hongtao
The antimitotic anti-cancer drugs, including taxol, perturb spindle dynamics, and induce prolonged, spindle checkpoint-dependent mitotic arrest in cancer cells. These cells then either undergo apoptosis triggered by the intrinsic mitochondrial pathway or exit mitosis without proper cell division in an adaptation pathway. Using a genome-wide small interfering RNA (siRNA) screen in taxol-treated HeLa cells, we systematically identify components of the mitotic apoptosis and adaptation pathways. We show that the Mad2 inhibitor p31comet actively promotes mitotic adaptation through cyclin B1 degradation and has a minor separate function in suppressing apoptosis. Conversely, the pro-apoptotic Bcl2 family member, Noxa, is a critical initiator of mitotic cell death. Unexpectedly, the upstream components of the mitochondrial apoptosis pathway and the mitochondrial fission protein Drp1 contribute to mitotic adaption. Our results reveal crosstalk between the apoptosis and adaptation pathways during mitotic arrest. PMID:25024437
de Vos, Bob D.; van Setten, Jessica; de Jong, Pim A.; Mali, Willem P.; Oudkerk, Matthijs; Viergever, Max A.; Išgum, Ivana
Arterial calcification has been related to cardiovascular disease (CVD) and osteoporosis. However, little is known about the role of genetics and exact pathways leading to arterial calcification and its relation to bone density changes indicating osteoporosis. In this study, we conducted a genome-wide association study of arterial calcification burden, followed by a look-up of known single nucleotide polymorphisms (SNPs) for coronary artery disease (CAD) and myocardial infarction (MI), and bone mineral density (BMD) to test for a shared genetic basis between the traits. The study included a subcohort of the Dutch-Belgian lung cancer screening trial comprised of 2,561 participants. Participants underwent baseline CT screening in one of two hospitals participating in the trial. Low-dose chest CT images were acquired without contrast enhancement and without ECG-synchronization. In these images coronary and aortic calcifications were identified automatically. Subsequently, the detected calcifications were quantified using coronary artery calcium Agatston and volume scores. Genotype data was available for these participants. A genome-wide association study was conducted on 10,220,814 SNPs using a linear regression model. To reduce multiple testing burden, known CAD/MI and BMD SNPs were specifically tested (45 SNPs from the CARDIoGRAMplusC4D consortium and 60 SNPS from the GEFOS consortium). No novel significant SNPs were found. Significant enrichment for CAD/MI SNPs was observed in testing Agatston and coronary artery calcium volume scores. Moreover, a significant enrichment of BMD SNPs was shown in aortic calcium volume scores. This may indicate genetic relation of BMD SNPs and arterial calcification burden.
Chen, Xiaochu; Xu, Lan
The transforming growth factor-β (TGF-β) family of cytokines figures prominently in regulation of embryonic development and adult tissue homeostasis from Drosophila to mammals. Genetic defects affecting TGF-β signaling underlie developmental disorders and diseases such as cancer in human. Therefore, delineating the molecular mechanism by which TGF-β regulates cell biology is critical for understanding normal biology and disease mechanisms. Forward genetic screens in model organisms and biochemical approaches in mammalian tissue culture were instrumental in initial characterization of the TGF-β signal transduction pathway. With complete sequence information of the genomes and the advent of RNA interference (RNAi) technology, genome-wide RNAi screening emerged as a powerful functional genomics approach to systematically delineate molecular components of signal transduction pathways. Here, we describe a protocol for image-based whole-genome RNAi screening aimed at identifying molecules required for TGF-β signaling into the nucleus. Using this protocol we examined >90 % of annotated Drosophila open reading frames (ORF) individually and successfully uncovered several novel factors serving critical roles in the TGF-β pathway. Thus cell-based high-throughput functional genomics can uncover new mechanistic insights on signaling pathways beyond what the classical genetics had revealed.
de Macedo Dossin, Fernando; Choi, Seo Yeon; Kim, Nam Youl; Kim, Hi Chul; Jung, Sung Yong; Schenkman, Sergio; Almeida, Igor C.; Emans, Neil; Freitas-Junior, Lucio H.
The protozoan parasite Trypanosoma cruzi is the etiologic agent of Chagas disease, a neglected tropical infection that affects millions of people in the Americas. Current chemotherapy relies on only two drugs that have limited efficacy and considerable side effects. Therefore, the development of new and more effective drugs is of paramount importance. Although some host cellular factors that play a role in T. cruzi infection have been uncovered, the molecular requirements for intracellular parasite growth and persistence are still not well understood. To further study these host-parasite interactions and identify human host factors required for T. cruzi infection, we performed a genome-wide RNAi screen using cellular microarrays of a printed siRNA library that spanned the whole human genome. The screening was reproduced 6 times and a customized algorithm was used to select as hits those genes whose silencing visually impaired parasite infection. The 162 strongest hits were subjected to a secondary screening and subsequently validated in two different cell lines. Among the fourteen hits confirmed, we recognized some cellular membrane proteins that might function as cell receptors for parasite entry and others that may be related to calcium release triggered by parasites during cell invasion. In addition, two of the hits are related to the TGF-beta signaling pathway, whose inhibition is already known to diminish levels of T. cruzi infection. This study represents a significant step toward unveiling the key molecular requirements for host cell invasion and revealing new potential targets for antiparasitic therapy. PMID:21625474
Roos, Thomas R.; Roos, Andrew K.; Kleimeyer, John P.; Ahmed, Marwa A.; Goodlin, Gabrielle T.; Fredericson, Michael; Ioannidis, John P. A.; Avins, Andrew L.; Dragoo, Jason L.
Achilles tendinopathy or rupture and anterior cruciate ligament (ACL) rupture are substantial injuries affecting athletes, associated with delayed recovery or inability to return to competition. To identify genetic markers that might be used to predict risk for these injuries, we performed genome-wide association screens for these injuries using data from the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort consisting of 102,979 individuals. We did not find any single nucleotide polymorphisms (SNPs) associated with either of these injuries with a p-value that was genome-wide significant (p<5x10-8). We found, however, four and three polymorphisms with p-values that were borderline significant (p<10−6) for Achilles tendon injury and ACL rupture, respectively. We then tested SNPs previously reported to be associated with either Achilles tendon injury or ACL rupture. None showed an association in our cohort with a false discovery rate of less than 5%. We obtained, however, moderate to weak evidence for replication in one case; specifically, rs4919510 in MIR608 had a p-value of 5.1x10-3 for association with Achilles tendon injury, corresponding to a 7% chance of false replication. Finally, we tested 2855 SNPs in 90 candidate genes for musculoskeletal injury, but did not find any that showed a significant association below a false discovery rate of 5%. We provide data containing summary statistics for the entire genome, which will be useful for future genetic studies on these injuries. PMID:28358823
Bae, Jung-Hoon; Hyun Sung, Bong; Kim, Hyun-Jin; Park, Soon-Ho; Lim, Kwang-Mook; Kim, Mi-Jin; Lee, Cho-Ryong; Sohn, Jung-Hoon
To produce rarely secreted recombinant proteins in the yeast Saccharomyces cerevisiae, we developed a novel genome-wide optimal translational fusion partner (TFP) screening system that involves recruitment of an optimal secretion signal and fusion partner. A TFP library was constructed from a genomic and truncated cDNA library by using the invertase-based signal sequence trap technique. The efficiency of the system was demonstrated using two rarely secreted proteins, human interleukin (hIL)-2 and hIL-32. Optimal TFPs for secretion of hIL-2 and hIL-32 were easily selected, yielding secretion of these proteins up to hundreds of mg/L. Moreover, numerous uncovered yeast secretion signals and fusion partners were identified, leading to efficient secretion of various recombinant proteins. Selected TFPs were found to be useful for the hypersecretion of other recombinant proteins at yields of up to several g/L. This screening technique could provide new methods for the production of various types of difficult-to-express proteins. PMID:26195161
Wendler, Franz; Gillingham, Alison K; Sinka, Rita; Rosa-Ferreira, Cláudia; Gordon, David E; Franch-Marro, Xavier; Peden, Andrew A; Vincent, Jean-Paul; Munro, Sean
Genetic screens in the yeast Saccharomyces cerevisiae have identified many proteins involved in the secretory pathway, most of which have orthologues in higher eukaryotes. To investigate whether there are additional proteins that are required for secretion in metazoans but are absent from yeast, we used genome-wide RNA interference (RNAi) to look for genes required for secretion of recombinant luciferase from Drosophila S2 cells. This identified two novel components of the secretory pathway that are conserved from humans to plants. Gryzun is distantly related to, but distinct from, the Trs130 subunit of the TRAPP complex but is absent from S. cerevisiae. RNAi of human Gryzun (C4orf41) blocks Golgi exit. Kish is a small membrane protein with a previously uncharacterised orthologue in yeast. The screen also identified Drosophila orthologues of almost 60% of the yeast genes essential for secretion. Given this coverage, the small number of novel components suggests that contrary to previous indications the number of essential core components of the secretory pathway is not much greater in metazoans than in yeasts. PMID:19942856
Park, Ryan J; Wang, Tim; Koundakjian, Dylan; Hultquist, Judd F; Lamothe-Molina, Pedro; Monel, Blandine; Schumann, Kathrin; Yu, Haiyan; Krupzcak, Kevin M; Garcia-Beltran, Wilfredo; Piechocka-Trocha, Alicja; Krogan, Nevan J; Marson, Alexander; Sabatini, David M; Lander, Eric S; Hacohen, Nir; Walker, Bruce D
Host proteins are essential for HIV entry and replication and can be important nonviral therapeutic targets. Large-scale RNA interference (RNAi)-based screens have identified nearly a thousand candidate host factors, but there is little agreement among studies and few factors have been validated. Here we demonstrate that a genome-wide CRISPR-based screen identifies host factors in a physiologically relevant cell system. We identify five factors, including the HIV co-receptors CD4 and CCR5, that are required for HIV infection yet are dispensable for cellular proliferation and viability. Tyrosylprotein sulfotransferase 2 (TPST2) and solute carrier family 35 member B2 (SLC35B2) function in a common pathway to sulfate CCR5 on extracellular tyrosine residues, facilitating CCR5 recognition by the HIV envelope. Activated leukocyte cell adhesion molecule (ALCAM) mediates cell aggregation, which is required for cell-to-cell HIV transmission. We validated these pathways in primary human CD4(+) T cells through Cas9-mediated knockout and antibody blockade. Our findings indicate that HIV infection and replication rely on a limited set of host-dispensable genes and suggest that these pathways can be studied for therapeutic intervention.
Sidik, Saima M; Huet, Diego; Ganesan, Suresh M; Huynh, My-Hang; Wang, Tim; Nasamu, Armiyaw S; Thiru, Prathapan; Saeij, Jeroen P J; Carruthers, Vern B; Niles, Jacquin C; Lourido, Sebastian
Apicomplexan parasites are leading causes of human and livestock diseases such as malaria and toxoplasmosis, yet most of their genes remain uncharacterized. Here, we present the first genome-wide genetic screen of an apicomplexan. We adapted CRISPR/Cas9 to assess the contribution of each gene from the parasite Toxoplasma gondii during infection of human fibroblasts. Our analysis defines ∼200 previously uncharacterized, fitness-conferring genes unique to the phylum, from which 16 were investigated, revealing essential functions during infection of human cells. Secondary screens identify as an invasion factor the claudin-like apicomplexan microneme protein (CLAMP), which resembles mammalian tight-junction proteins and localizes to secretory organelles, making it critical to the initiation of infection. CLAMP is present throughout sequenced apicomplexan genomes and is essential during the asexual stages of the malaria parasite Plasmodium falciparum. These results provide broad-based functional information on T. gondii genes and will facilitate future approaches to expand the horizon of antiparasitic interventions.
Roosing, Susanne; Hofree, Matan; Kim, Sehyun; Scott, Eric; Copeland, Brett; Romani, Marta; Silhavy, Jennifer L; Rosti, Rasim O; Schroth, Jana; Mazza, Tommaso; Miccinilli, Elide; Zaki, Maha S; Swoboda, Kathryn J; Milisa-Drautz, Joanne; Dobyns, William B; Mikati, Mohamed A; İncecik, Faruk; Azam, Matloob; Borgatti, Renato; Romaniello, Romina; Boustany, Rose-Mary; Clericuzio, Carol L; D'Arrigo, Stefano; Strømme, Petter; Boltshauser, Eugen; Stanzial, Franco; Mirabelli-Badenier, Marisol; Moroni, Isabella; Bertini, Enrico; Emma, Francesco; Steinlin, Maja; Hildebrandt, Friedhelm; Johnson, Colin A; Freilinger, Michael; Vaux, Keith K; Gabriel, Stacey B; Aza-Blanc, Pedro; Heynen-Genel, Susanne; Ideker, Trey; Dynlacht, Brian D; Lee, Ji Eun; Valente, Enza Maria; Kim, Joon; Gleeson, Joseph G
Defective primary ciliogenesis or cilium stability forms the basis of human ciliopathies, including Joubert syndrome (JS), with defective cerebellar vermis development. We performed a high-content genome-wide small interfering RNA (siRNA) screen to identify genes regulating ciliogenesis as candidates for JS. We analyzed results with a supervised-learning approach, using SYSCILIA gold standard, Cildb3.0, a centriole siRNA screen and the GTex project, identifying 591 likely candidates. Intersection of this data with whole exome results from 145 individuals with unexplained JS identified six families with predominantly compound heterozygous mutations in KIAA0586. A c.428del base deletion in 0.1% of the general population was found in trans with a second mutation in an additional set of 9 of 163 unexplained JS patients. KIAA0586 is an orthologue of chick Talpid3, required for ciliogenesis and Sonic hedgehog signaling. Our results uncover a relatively high frequency cause for JS and contribute a list of candidates for future gene discoveries in ciliopathies.
Roosing, Susanne; Hofree, Matan; Kim, Sehyun; Scott, Eric; Copeland, Brett; Romani, Marta; Silhavy, Jennifer L; Rosti, Rasim O; Schroth, Jana; Mazza, Tommaso; Miccinilli, Elide; Zaki, Maha S; Swoboda, Kathryn J; Milisa-Drautz, Joanne; Dobyns, William B; Mikati, Mohamed A; İncecik, Faruk; Azam, Matloob; Borgatti, Renato; Romaniello, Romina; Boustany, Rose-Mary; Clericuzio, Carol L; D'Arrigo, Stefano; Strømme, Petter; Boltshauser, Eugen; Stanzial, Franco; Mirabelli-Badenier, Marisol; Moroni, Isabella; Bertini, Enrico; Emma, Francesco; Steinlin, Maja; Hildebrandt, Friedhelm; Johnson, Colin A; Freilinger, Michael; Vaux, Keith K; Gabriel, Stacey B; Aza-Blanc, Pedro; Heynen-Genel, Susanne; Ideker, Trey; Dynlacht, Brian D; Lee, Ji Eun; Valente, Enza Maria; Kim, Joon; Gleeson, Joseph G
Defective primary ciliogenesis or cilium stability forms the basis of human ciliopathies, including Joubert syndrome (JS), with defective cerebellar vermis development. We performed a high-content genome-wide small interfering RNA (siRNA) screen to identify genes regulating ciliogenesis as candidates for JS. We analyzed results with a supervised-learning approach, using SYSCILIA gold standard, Cildb3.0, a centriole siRNA screen and the GTex project, identifying 591 likely candidates. Intersection of this data with whole exome results from 145 individuals with unexplained JS identified six families with predominantly compound heterozygous mutations in KIAA0586. A c.428del base deletion in 0.1% of the general population was found in trans with a second mutation in an additional set of 9 of 163 unexplained JS patients. KIAA0586 is an orthologue of chick Talpid3, required for ciliogenesis and Sonic hedgehog signaling. Our results uncover a relatively high frequency cause for JS and contribute a list of candidates for future gene discoveries in ciliopathies. DOI: http://dx.doi.org/10.7554/eLife.06602.001 PMID:26026149
Suratanee, Apichat; Schaefer, Martin H.; Betts, Matthew J.; Soons, Zita; Mannsperger, Heiko; Harder, Nathalie; Oswald, Marcus; Gipp, Markus; Ramminger, Ellen; Marcus, Guillermo; Männer, Reinhard; Rohr, Karl; Wanker, Erich; Russell, Robert B.; Andrade-Navarro, Miguel A.; Eils, Roland; König, Rainer
Characterizing the activating and inhibiting effect of protein-protein interactions (PPI) is fundamental to gain insight into the complex signaling system of a human cell. A plethora of methods has been suggested to infer PPI from data on a large scale, but none of them is able to characterize the effect of this interaction. Here, we present a novel computational development that employs mitotic phenotypes of a genome-wide RNAi knockdown screen and enables identifying the activating and inhibiting effects of PPIs. Exemplarily, we applied our technique to a knockdown screen of HeLa cells cultivated at standard conditions. Using a machine learning approach, we obtained high accuracy (82% AUC of the receiver operating characteristics) by cross-validation using 6,870 known activating and inhibiting PPIs as gold standard. We predicted de novo unknown activating and inhibiting effects for 1,954 PPIs in HeLa cells covering the ten major signaling pathways of the Kyoto Encyclopedia of Genes and Genomes, and made these predictions publicly available in a database. We finally demonstrate that the predicted effects can be used to cluster knockdown genes of similar biological processes in coherent subgroups. The characterization of the activating or inhibiting effect of individual PPIs opens up new perspectives for the interpretation of large datasets of PPIs and thus considerably increases the value of PPIs as an integrated resource for studying the detailed function of signaling pathways of the cellular system of interest. PMID:25255318
Bateman, Jack R; Larschan, Erica; D'Souza, Ryan; Marshall, Lauren S; Dempsey, Kyle E; Johnson, Justine E; Mellone, Barbara G; Kuroda, Mitzi I
In Drosophila and other Dipterans, homologous chromosomes are in close contact in virtually all nuclei, a phenomenon known as somatic homolog pairing. Although homolog pairing has been recognized for over a century, relatively little is known about its regulation. We performed a genome-wide RNAi-based screen that monitored the X-specific localization of the male-specific lethal (MSL) complex, and we identified 59 candidate genes whose knockdown via RNAi causes a change in the pattern of MSL staining that is consistent with a disruption of X-chromosomal homolog pairing. Using DNA fluorescent in situ hybridization (FISH), we confirmed that knockdown of 17 of these genes has a dramatic effect on pairing of the 359 bp repeat at the base of the X. Furthermore, dsRNAs targeting Pr-set7, which encodes an H4K20 methyltransferase, cause a modest disruption in somatic homolog pairing. Consistent with our results in cultured cells, a classical mutation in one of the strongest candidate genes, pebble (pbl), causes a decrease in somatic homolog pairing in developing embryos. Interestingly, many of the genes identified by our screen have known roles in diverse cell-cycle events, suggesting an important link between somatic homolog pairing and the choreography of chromosomes during the cell cycle.
Liakath-Ali, Kifayathullah; Vancollie, Valerie E.; Heath, Emma; Smedley, Damian P.; Estabel, Jeanne; Sunter, David; DiTommaso, Tia; White, Jacqueline K.; Ramirez-Solis, Ramiro; Smyth, Ian; Steel, Karen P.; Watt, Fiona M.
Permanent stop-and-shop large-scale mouse mutant resources provide an excellent platform to decipher tissue phenogenomics. Here we analyse skin from 538 knockout mouse mutants generated by the Sanger Institute Mouse Genetics Project. We optimize immunolabelling of tail epidermal wholemounts to allow systematic annotation of hair follicle, sebaceous gland and interfollicular epidermal abnormalities using ontology terms from the Mammalian Phenotype Ontology. Of the 50 mutants with an epidermal phenotype, 9 map to human genetic conditions with skin abnormalities. Some mutant genes are expressed in the skin, whereas others are not, indicating systemic effects. One phenotype is affected by diet and several are incompletely penetrant. In-depth analysis of three mutants, Krt76, Myo5a (a model of human Griscelli syndrome) and Mysm1, provides validation of the screen. Our study is the first large-scale genome-wide tissue phenotype screen from the International Knockout Mouse Consortium and provides an open access resource for the scientific community. PMID:24721909
Wu, Jing; Devlin, Bernie; Ringquist, Steven; Trucco, Massimo; Roeder, Kathryn
Epistasis could be an important source of risk for disease. How interacting loci might be discovered is an open question for genome-wide association studies (GWAS). Most researchers limit their statistical analyses to testing individual pairwise interactions (i.e., marginal tests for association). A more effective means of identifying important predictors is to fit models that include many predictors simultaneously (i.e., higher-dimensional models). We explore a procedure called screen and clean (SC) for identifying liability loci, including interactions, by using the lasso procedure, which is a model selection tool for high-dimensional regression. We approach the problem by using a varying dictionary consisting of terms to include in the model. In the first step the lasso dictionary includes only main effects. The most promising single-nucleotide polymorphisms (SNPs) are identified using a screening procedure. Next the lasso dictionary is adjusted to include these main effects and the corresponding interaction terms. Again, promising terms are identified using lasso screening. Then significant terms are identified through the cleaning process. Implementation of SC for GWAS requires algorithms to explore the complex model space induced by the many SNPs genotyped and their interactions. We propose and explore a set of algorithms and find that SC successfully controls Type I error while yielding good power to identify risk loci and their interactions. When the method is applied to data obtained from the Wellcome Trust Case Control Consortium study of Type 1 Diabetes it uncovers evidence supporting interaction within the HLA class II region as well as within Chromosome 12q24.
Wu, Jing; Devlin, Bernie; Ringquist, Steven; Trucco, Massimo; Roeder, Kathryn
Epistasis could be an important source of risk for disease. How interacting loci might be discovered is an open question for genome-wide association studies (GWAS). Most researchers limit their statistical analyses to testing individual pairwise interactions (i.e., marginal tests for association). A more effective means of identifying important predictors is to fit models that include many predictors simultaneously (i.e., higher dimensional models). We explore a procedure called screen and clean (SC) for identifying liability loci, including interactions, by using the lasso procedure, which is a model selection tool for high dimensional regression. We approach the problem by using a varying dictionary consisting of terms to include in the model. In the first step the lasso dictionary includes only main effects. The most promising SNPs are identified using a screening procedure. Next the lasso dictionary is adjusted to include these main effects and the corresponding interaction terms. Again, promising terms are identified using lasso screening. Then significant terms are identified through the cleaning process. Implementation of SC for GWAS requires algorithms to explore the complex model space induced by the many SNPs genotyped and their interactions. We propose and explore a set of algorithms and find that SC successfully controls Type I error while yielding good power to identify risk loci and their interactions. When the method is applied to data obtained from the Wellcome Trust Case Control Consortium study of Type 1 Diabetes it uncovers evidence supporting interaction within the HLA class II region as well as within Chromosome 12q24. PMID:20088021
Nollen, Ellen A. A.; Garcia, Susana M.; van Haaften, Gijs; Kim, Soojin; Chavez, Alejandro; Morimoto, Richard I.; Plasterk, Ronald H. A.
Protein misfolding and the formation of aggregates are increasingly recognized components of the pathology of human genetic disease and hallmarks of many neurodegenerative disorders. As exemplified by polyglutamine diseases, the propensity for protein misfolding is associated with the length of polyglutamine expansions and age-dependent changes in protein-folding homeostasis, suggesting a critical role for a protein homeostatic buffer. To identify the complement of protein factors that protects cells against the formation of protein aggregates, we tested transgenic Caenorhabditis elegans strains expressing polyglutamine expansion yellow fluorescent protein fusion proteins at the threshold length associated with the age-dependent appearance of protein aggregation. We used genome-wide RNA interference to identify genes that, when suppressed, resulted in the premature appearance of protein aggregates. Our screen identified 186 genes corresponding to five principal classes of polyglutamine regulators: genes involved in RNA metabolism, protein synthesis, protein folding, and protein degradation; and those involved in protein trafficking. We propose that each of these classes represents a molecular machine collectively comprising the protein homeostatic buffer that responds to the expression of damaged proteins to prevent their misfolding and aggregation. protein misfolding | neurodegenerative diseases
Zeng, Xiankun; Han, Lili; Singh, Shree Ram; Liu, Hanhan; Neumüller, Ralph A; Yan, Dong; Hu, Yanhui; Liu, Ying; Liu, Wei; Lin, Xinhua; Hou, Steven X
The intestinal epithelium is the most rapidly self-renewing tissue in adult animals and maintained by intestinal stem cells (ISCs) in both Drosophila and mammals. To comprehensively identify genes and pathways that regulate ISC fates, we performed a genome-wide transgenic RNAi screen in adult Drosophila intestine and identified 405 genes that regulate ISC maintenance and lineage-specific differentiation. By integrating these genes into publicly available interaction databases, we further developed functional networks that regulate ISC self-renewal, ISC proliferation, ISC maintenance of diploid status, ISC survival, ISC-to-enterocyte (EC) lineage differentiation, and ISC-to-enteroendocrine (EE) lineage differentiation. By comparing regulators among ISCs, female germline stem cells, and neural stem cells, we found that factors related to basic stem cell cellular processes are commonly required in all stem cells, and stem-cell-specific, niche-related signals are required only in the unique stem cell type. Our findings provide valuable insights into stem cell maintenance and lineage-specific differentiation.
Wu, Kan Xing; Phuektes, Patchara; Kumar, Pankaj; Goh, Germaine Yen Lin; Moreau, Dimitri; Chow, Vincent Tak Kwong; Bard, Frederic; Chu, Justin Jang Hann
Enterovirus 71 (EV71) is a neurotropic enterovirus without antivirals or vaccine, and its host-pathogen interactions remain poorly understood. Here we use a human genome-wide RNAi screen to identify 256 host factors involved in EV71 replication in human rhabdomyosarcoma cells. Enrichment analyses reveal overrepresentation in processes like mitotic cell cycle and transcriptional regulation. We have carried out orthogonal experiments to characterize the roles of selected factors involved in cell cycle regulation and endoplasmatic reticulum-associated degradation. We demonstrate nuclear egress of CDK6 in EV71 infected cells, and identify CDK6 and AURKB as resistance factors. NGLY1, which co-localizes with EV71 replication complexes at the endoplasmatic reticulum, supports EV71 replication. We confirm importance of these factors for EV71 replication in a human neuronal cell line and for coxsackievirus A16 infection. A small molecule inhibitor of NGLY1 reduces EV71 replication. This study provides a comprehensive map of EV71 host factors and reveals potential antiviral targets. PMID:27748395
Shende, Akhilesh; Singh, Anupama; Meena, Anil; Ghosal, Ritika; Ranganathan, Madhav; Bandyopadhyay, Amitabha
Differentiated tissues may be considered as materials with distinct properties. The differentiation program of a given tissue ensures that it acquires material properties commensurate with its function. It may be hypothesized that some of these properties are acquired through production of tissue-specific metabolites synthesized by metabolic enzymes. To establish correlation between metabolism and organogenesis we have carried out a genome-wide expression study of metabolism related genes by RNA in-situ hybridization. 23% of the metabolism related genes studied are expressed in a tissue-restricted but not tissue-exclusive manner. We have conducted the screen on whole mount chicken (Gallus gallus) embryos from four distinct developmental stages to correlate dynamic changes in expression patterns of metabolic enzymes with spatio-temporally unique developmental events. Our data strongly suggests that unique combinations of metabolism related genes, and not specific metabolic pathways, are upregulated during differentiation. Further, expression of metabolism related genes in well established signaling centers that regulate different aspects of morphogenesis indicates developmental roles of some of the metabolism related genes. The database of tissue-restricted expression patterns of metabolism related genes, generated in this study, should serve as a resource for systematic identification of these genes with tissue-specific functions during development. Finally, comprehensive understanding of differentiation is not possible unless the downstream genes of a differentiation cascade are identified. We propose, metabolic enzymes constitute a significant portion of these downstream target genes. Thus our study should help elucidate different aspects of tissue differentiation. PMID:23717462
Roy, Priti; Kumar, Brijesh; Shende, Akhilesh; Singh, Anupama; Meena, Anil; Ghosal, Ritika; Ranganathan, Madhav; Bandyopadhyay, Amitabha
Differentiated tissues may be considered as materials with distinct properties. The differentiation program of a given tissue ensures that it acquires material properties commensurate with its function. It may be hypothesized that some of these properties are acquired through production of tissue-specific metabolites synthesized by metabolic enzymes. To establish correlation between metabolism and organogenesis we have carried out a genome-wide expression study of metabolism related genes by RNA in-situ hybridization. 23% of the metabolism related genes studied are expressed in a tissue-restricted but not tissue-exclusive manner. We have conducted the screen on whole mount chicken (Gallus gallus) embryos from four distinct developmental stages to correlate dynamic changes in expression patterns of metabolic enzymes with spatio-temporally unique developmental events. Our data strongly suggests that unique combinations of metabolism related genes, and not specific metabolic pathways, are upregulated during differentiation. Further, expression of metabolism related genes in well established signaling centers that regulate different aspects of morphogenesis indicates developmental roles of some of the metabolism related genes. The database of tissue-restricted expression patterns of metabolism related genes, generated in this study, should serve as a resource for systematic identification of these genes with tissue-specific functions during development. Finally, comprehensive understanding of differentiation is not possible unless the downstream genes of a differentiation cascade are identified. We propose, metabolic enzymes constitute a significant portion of these downstream target genes. Thus our study should help elucidate different aspects of tissue differentiation.
Peña, Pedro V; Glasker, Steven; Srienc, Friedrich
The production of biofuels from cellulosic biomass is a promising technology for developing a renewable source of energy. Efforts to produce ethanol from cellulosic biomass using microbes, such as the yeast Saccharomyces cerevisiae, face major challenges, including the need for detoxification. Here, we apply a strategy to discover genetic alterations that lead to improved robustness of S. cerevisiae in the presence of acetate, which is present at toxic concentrations in hemicellulose hydrolysates. Acetate in its protonated form (acetic acid) enters the cell through passive diffusion and dissociates into a proton and acetate, acidifying the cytosol and inhibiting cell function, an effect that is exacerbated in the presence of sodium. Through flow cytometry analysis, implemented as part of a novel cell culture technique, the Cytostat, we characterized the deleterious effects of sodium acetate on growth and on cell size homeostasis. Further, using the Cytostat to screen a genome-wide, gene overexpression library, we identified that overexpressing the ENA2 gene, a P-type sodium pump ATPase, provides a significant growth improvement in the presence of sodium acetate. Together, our data support the proposed mechanism for the synergistic growth inhibition exerted by acetate and sodium, as well as the mechanism that develops resistance.
Galamb, Orsolya; Wichmann, Barna; Sipos, Ferenc; Péterfia, Bálint; Csabai, István; Kovalszky, Ilona; Semsey, Szabolcs; Tulassay, Zsolt; Molnár, Béla
Tumorigenesis is accompanied by changes in the DNA methylation pattern. Our aim was to test a novel approach for identification of transcripts at whole transcript level which are regulated by DNA methylation. Our approach is based on comparison of data obtained from transcriptome profiling of primary human samples and in vitro cell culture models. Epithelial cells were collected by LCM from normal, adenoma, and tumorous colonic samples. Using gene expression analysis, we identified downregulated genes in the tumors compared to normal tissues. In parallel 3000 upregulated genes were determined in HT-29 colon adenocarcinoma cell culture model after DNA demethylation treatment. Of the 2533 transcripts showing reduced expression in the tumorous samples, 154 had increased expression as a result of DNA demethylation treatment. Approximately 2/3 of these genes had decreased expression already in the adenoma samples. Expression of five genes (GCG, NMES-1, LRMP, FAM161B and PTGDR), was validated using RT-PCR. PTGDR showed ambiguous results, therefore it was further studied to verify the extent of DNA methylation and its effect on the protein level. Results confirmed that our approach is suitable for genome-wide screening of genes which are regulated or inactivated by DNA methylation. Activity of these genes possibly interferes with tumor progression, therefore genes identified can be key factors in the formation and in the progression of the disease. PMID:23049694
Kraehling, Jan R.; Chidlow, John H.; Rajagopal, Chitra; Sugiyama, Michael G.; Fowler, Joseph W.; Lee, Monica Y.; Zhang, Xinbo; Ramírez, Cristina M.; Park, Eon Joo; Tao, Bo; Chen, Keyang; Kuruvilla, Leena; Larriveé, Bruno; Folta-Stogniew, Ewa; Ola, Roxana; Rotllan, Noemi; Zhou, Wenping; Nagle, Michael W.; Herz, Joachim; Williams, Kevin Jon; Eichmann, Anne; Lee, Warren L.; Fernández-Hernando, Carlos; Sessa, William C.
In humans and animals lacking functional LDL receptor (LDLR), LDL from plasma still readily traverses the endothelium. To identify the pathways of LDL uptake, a genome-wide RNAi screen was performed in endothelial cells and cross-referenced with GWAS-data sets. Here we show that the activin-like kinase 1 (ALK1) mediates LDL uptake into endothelial cells. ALK1 binds LDL with lower affinity than LDLR and saturates only at hypercholesterolemic concentrations. ALK1 mediates uptake of LDL into endothelial cells via an unusual endocytic pathway that diverts the ligand from lysosomal degradation and promotes LDL transcytosis. The endothelium-specific genetic ablation of Alk1 in Ldlr-KO animals leads to less LDL uptake into the aortic endothelium, showing its physiological role in endothelial lipoprotein metabolism. In summary, identification of pathways mediating LDLR-independent uptake of LDL may provide unique opportunities to block the initiation of LDL accumulation in the vessel wall or augment hepatic LDLR-dependent clearance of LDL. PMID:27869117
Taylor, Gwen M.; Kielian, Margaret
The enveloped alphaviruses include important and emerging human pathogens such as Chikungunya virus and Eastern equine encephalitis virus. Alphaviruses enter cells by clathrin-mediated endocytosis, and exit by budding from the plasma membrane. While there has been considerable progress in defining the structure and function of the viral proteins, relatively little is known about the host factors involved in alphavirus infection. We used a genome-wide siRNA screen to identify host factors that promote or inhibit alphavirus infection in human cells. Fuzzy homologue (FUZ), a protein with reported roles in planar cell polarity and cilia biogenesis, was required for the clathrin-dependent internalization of both alphaviruses and the classical endocytic ligand transferrin. The tetraspanin membrane protein TSPAN9 was critical for the efficient fusion of low pH-triggered virus with the endosome membrane. FUZ and TSPAN9 were broadly required for infection by the alphaviruses Sindbis virus, Semliki Forest virus, and Chikungunya virus, but were not required by the structurally-related flavivirus Dengue virus. Our results highlight the unanticipated functions of FUZ and TSPAN9 in distinct steps of alphavirus entry and suggest novel host proteins that may serve as targets for antiviral therapy. PMID:24367265
Williams, Steven P.; Gould, Cathryn M.; Nowell, Cameron J.; Karnezis, Tara; Achen, Marc G.; Simpson, Kaylene J.; Stacker, Steven A.
Many cell types undergo migration during embryogenesis and disease. Endothelial cells line blood vessels and lymphatics, which migrate during development as part of angiogenesis, lymphangiogenesis and other types of vessel remodelling. These processes are also important in wound healing, cancer metastasis and cardiovascular conditions. However, the molecular control of endothelial cell migration is poorly understood. Here, we present a dataset containing siRNA screens that identify known and novel components of signalling pathways regulating migration of lymphatic endothelial cells. These components are compared to signalling in blood vascular endothelial cells. Further, using high-content microscopy, we captured a dataset of images of migrating cells following transfection with a genome-wide siRNA library. These datasets are suitable for the identification and analysis of genes involved in endothelial cell migration and morphology, and for computational approaches to identify signalling networks controlling the migratory response and integration of cell morphology, gene function and cell signaling. This may facilitate identification of protein targets for therapeutically modulating angiogenesis and lymphangiogenesis in the context of human disease. PMID:28248931
Dekanty, Andrés; Romero, Nuria M.; Bertolin, Agustina P.; Thomas, María G.; Leishman, Claudia C.; Perez-Perri, Joel I.; Boccaccio, Graciela L.; Wappner, Pablo
Hypoxia-inducible factors (HIFs) are a family of evolutionary conserved alpha-beta heterodimeric transcription factors that induce a wide range of genes in response to low oxygen tension. Molecular mechanisms that mediate oxygen-dependent HIF regulation operate at the level of the alpha subunit, controlling protein stability, subcellular localization, and transcriptional coactivator recruitment. We have conducted an unbiased genome-wide RNA interference (RNAi) screen in Drosophila cells aimed to the identification of genes required for HIF activity. After 3 rounds of selection, 30 genes emerged as critical HIF regulators in hypoxia, most of which had not been previously associated with HIF biology. The list of genes includes components of chromatin remodeling complexes, transcription elongation factors, and translational regulators. One remarkable hit was the argonaute 1 (ago1) gene, a central element of the microRNA (miRNA) translational silencing machinery. Further studies confirmed the physiological role of the miRNA machinery in HIF–dependent transcription. This study reveals the occurrence of novel mechanisms of HIF regulation, which might contribute to developing novel strategies for therapeutic intervention of HIF–related pathologies, including heart attack, cancer, and stroke. PMID:20585616
Jevsinek Skok, D; Godnic, I; Zorc, M; Horvat, S; Dovc, P; Kovac, M; Kunej, T
MicroRNAs are a class of non-coding RNAs that post-transcriptionally regulate target gene expression. Previous studies have shown that microRNA gene variability can interfere with its function, resulting in phenotypic variation. Polymorphisms within microRNA genes present a source of novel biomarkers for phenotypic traits in animal breeding. However, little is known about microRNA genetic variability in livestock species, which is also due to incomplete data in genomic resource databases. Therefore, the aim of this study was to perform a genome-wide in silico screening of genomic sources and determine the genetic variability of microRNA genes in livestock species using mirna sniper 3.0 (http://www.integratomics-time.com/miRNA-SNiPer/), a new version of our previously developed tool. By examining Ensembl and miRBase genome builds, it was possible to design a tool-based generated search of 16 genomes including four livestock species: pig, horse, cattle and chicken. The analysis revealed 65 polymorphisms located within mature microRNA regions in these four species, including 28% within the seed region in cattle and chicken. Polymorphic microRNA genes in cattle and chicken were further examined for mapping to quantitative trait loci regions associated with production and health traits. The developed bioinformatics tool enables the analysis of polymorphic microRNA genes and prioritization of potential regulatory polymorphisms and therefore contributes to the development of microRNA-based biomarkers in livestock species. The assembled catalog and the developed tool can serve the animal science community to efficiently select microRNA SNPs for further quantitative and molecular genetic evaluations of their phenotypic effects and causal associations with livestock production traits.
Parnas, Oren; Jovanovic, Marko; Eisenhaure, Thomas M.; Herbst, Rebecca H.; Dixit, Atray; Ye, Chun Jimmie; Przybylski, Dariusz; Platt, Randall J.; Tirosh, Itay; Sanjana, Neville E.; Shalem, Ophir; Satija, Rahul; Raychowdhury, Raktima; Mertins, Philipp; Carr, Steven A.; Zhang, Feng; Hacohen, Nir; Regev, Aviv
Finding the components of cellular circuits and determining their functions systematically remains a major challenge in mammalian cells. Here, we introduced genome-wide pooled CRISPR-Cas9 libraries into dendritic cells (DCs) to identify genes that control the induction of tumor necrosis factor (Tnf) by bacterial lipopolysaccharide (LPS), a key process in the host response to pathogens, mediated by the Tlr4 pathway. We found many of the known regulators of Tlr4 signaling, as well as dozens of previously unknown candidates that we validated. By measuring protein markers and mRNA profiles in DCs that are deficient in the known or candidate genes, we classified the genes into three functional modules with distinct effects on the canonical responses to LPS, and highlighted functions for the PAF complex and oligosaccharyltransferase (OST) complex. Our findings uncover new facets of innate immune circuits in primary cells, and provide a genetic approach for dissection of mammalian cell circuits. PMID:26189680
Chen, Jian; Shi, Xiaoying; Padmanabhan, Ranjani; Wang, Qiong; Wu, Zhidan; Stevenson, Susan C.; Hild, Marc; Garza, Dan; Li, Hao
Mitochondrial dysfunction is associated with many human diseases. There has not been a systematic genetic approach for identifying regulators of basal mitochondrial biogenesis and function in higher eukaryotes. We performed a genome-wide RNA interference (RNAi) screen in Drosophila cells using mitochondrial Citrate synthase (CS) activity as the primary readout. We screened 13,071 dsRNAs and identified 152 genes that modulate CS activity. These modulators are involved in a wide range of biological processes and pathways including mitochondrial-related functions, transcriptional and translational regulation, and signaling pathways. Selected hits among the 152 genes were further analyzed for their effect on mitochondrial CS activity in transgenic flies or fly mutants. We confirmed a number of gene hits including HDAC6, Rpd3(HDAC1), CG3249, vimar, Src42A, klumpfuss, barren, and smt3 which exert effects on mitochondrial CS activities in vivo, demonstrating the value of Drosophila genome-wide RNAi screens for identifying genes and pathways that modulate mitochondrial function. PMID:18042644
Background Genome-scale RNA-interference (RNAi) screens are becoming ever more common gene discovery tools. However, whilst every screen identifies interacting genes, less attention has been given to how factors such as library design and post-screening bioinformatics may be effecting the data generated. Results Here we present a new genome-wide RNAi screen of the Drosophila JAK/STAT signalling pathway undertaken in the Sheffield RNAi Screening Facility (SRSF). This screen was carried out using a second-generation, computationally optimised dsRNA library and analysed using current methods and bioinformatic tools. To examine advances in RNAi screening technology, we compare this screen to a biologically very similar screen undertaken in 2005 with a first-generation library. Both screens used the same cell line, reporters and experimental design, with the SRSF screen identifying 42 putative regulators of JAK/STAT signalling, 22 of which verified in a secondary screen and 16 verified with an independent probe design. Following reanalysis of the original screen data, comparisons of the two gene lists allows us to make estimates of false discovery rates in the SRSF data and to conduct an assessment of off-target effects (OTEs) associated with both libraries. We discuss the differences and similarities between the resulting data sets and examine the relative improvements in gene discovery protocols. Conclusions Our work represents one of the first direct comparisons between first- and second-generation libraries and shows that modern library designs together with methodological advances have had a significant influence on genome-scale RNAi screens. PMID:23006893
Sathyanarayanan, Sriram; Zheng, Xiangzhong; Kumar, Shailesh; Chen, Chun-Hong; Chen, Dechun; Hay, Bruce; Sehgal, Amita
Circadian clocks regulate many different physiological processes and synchronize these to environmental light:dark cycles. In Drosophila, light is transmitted to the clock by a circadian blue light photoreceptor CRYPTOCHROME (CRY). In response to light, CRY promotes the degradation of the circadian clock protein TIMELESS (TIM) and then is itself degraded. To identify novel genes involved in circadian entrainment, we performed an unbiased genome-wide screen in Drosophila cells using a sensitive and quantitative assay that measures light-induced degradation of CRY. We systematically knocked down the expression of approximately 21,000 genes and identified those that regulate CRY stability. These genes include ubiquitin ligases, signal transduction molecules, and redox molecules. Many of the genes identified in the screen are specific for CRY degradation and do not affect degradation of the TIM protein in response to light, suggesting that, for the most part, these two pathways are distinct. We further validated the effect of three candidate genes on CRY stability in vivo by assaying flies mutant for each of these genes. This work identifies a novel regulatory network involved in light-dependent CRY degradation and demonstrates the power of a genome-wide RNAi approach for understanding circadian biology.
Gialluisi, A; Newbury, D F; Wilcutt, E G; Olson, R K; DeFries, J C; Brandler, W M; Pennington, B F; Smith, S D; Scerri, T S; Simpson, N H; Luciano, M; Evans, D M; Bates, T C; Stein, J F; Talcott, J B; Monaco, A P; Paracchini, S; Francks, C; Fisher, S E
Reading and language abilities are heritable traits that are likely to share some genetic influences with each other. To identify pleiotropic genetic variants affecting these traits, we first performed a genome-wide association scan (GWAS) meta-analysis using three richly characterized datasets comprising individuals with histories of reading or language problems, and their siblings. GWAS was performed in a total of 1862 participants using the first principal component computed from several quantitative measures of reading- and language-related abilities, both before and after adjustment for performance IQ. We identified novel suggestive associations at the SNPs rs59197085 and rs5995177 (uncorrected P ≈ 10–7 for each SNP), located respectively at the CCDC136/FLNC and RBFOX2 genes. Each of these SNPs then showed evidence for effects across multiple reading and language traits in univariate association testing against the individual traits. FLNC encodes a structural protein involved in cytoskeleton remodelling, while RBFOX2 is an important regulator of alternative splicing in neurons. The CCDC136/FLNC locus showed association with a comparable reading/language measure in an independent sample of 6434 participants from the general population, although involving distinct alleles of the associated SNP. Our datasets will form an important part of on-going international efforts to identify genes contributing to reading and language skills. PMID:25065397
Teng, Yue; Wang, Yuzhuo; Zhang, Xianglilan; Liu, Wenli; Fan, Hang; Yao, Hongwu; Lin, Baihan; Zhu, Ping; Yuan, Wenjun; Tong, Yigang; Cao, Wuchun
Recently, several thousand people have been killed by the Ebolavirus disease (EVD) in West Africa, yet no current antiviral medications and treatments are available. Systematic investigation of ebolavirus whole genomes during the 2014 outbreak may shed light on the underlying mechanisms of EVD development. Here, using the genome-wide screening in ebolavirus genome sequences, we predicted four putative viral microRNA precursors (pre-miRNAs) and seven putative mature microRNAs (miRNAs). Combing bioinformatics analysis and prediction of the potential ebolavirus miRNA target genes, we suggest that two ebolavirus coding possible miRNAs may be silence and down-regulate the target genes NFKBIE and RIPK1, which are the central mediator of the pathways related with host cell defense mechanism. Additionally, the ebolavirus exploits the miRNAs to inhibit the NF-kB and TNF factors to evade the host defense mechanisms that limit replication by killing infected cells, or to conversely trigger apoptosis as a mechanism to increase virus spreading. This is the first study to use the genome-wide scanning to predict microRNAs in the 2014 outbreak EVD and then to apply systematic bioinformatics to analyze their target genes. We revealed a potential mechanism of miRNAs in ebolavirus infection and possible therapeutic targets for Ebola viral infection treatment. PMID:26011078
Forment, Josep V; Herzog, Mareike; Coates, Julia; Konopka, Tomasz; Gapp, Bianca V; Nijman, Sebastian M; Adams, David J; Keane, Thomas M; Jackson, Stephen P
In model organisms, classical genetic screening via random mutagenesis provides key insights into the molecular bases of genetic interactions, helping to define synthetic lethality, synthetic viability and drug-resistance mechanisms. The limited genetic tractability of diploid mammalian cells, however, precludes this approach. Here, we demonstrate the feasibility of classical genetic screening in mammalian systems by using haploid cells, chemical mutagenesis and next-generation sequencing, providing a new tool to explore mammalian genetic interactions.
Neely, G Gregory; Hess, Andreas; Costigan, Michael; Keene, Alex C; Goulas, Spyros; Langeslag, Michiel; Griffin, Robert S; Belfer, Inna; Dai, Feng; Smith, Shad B; Diatchenko, Luda; Gupta, Vaijayanti; Xia, Cui-Ping; Amann, Sabina; Kreitz, Silke; Heindl-Erdmann, Cornelia; Wolz, Susanne; Ly, Cindy V; Arora, Suchir; Sarangi, Rinku; Dan, Debasis; Novatchkova, Maria; Rosenzweig, Mark; Gibson, Dustin G; Truong, Darwin; Schramek, Daniel; Zoranovic, Tamara; Cronin, Shane J F; Angjeli, Belinda; Brune, Kay; Dietzl, Georg; Maixner, William; Meixner, Arabella; Thomas, Winston; Pospisilik, J Andrew; Alenius, Mattias; Kress, Michaela; Subramaniam, Sai; Garrity, Paul A; Bellen, Hugo J; Woolf, Clifford J; Penninger, Josef M
Worldwide, acute, and chronic pain affects 20% of the adult population and represents an enormous financial and emotional burden. Using genome-wide neuronal-specific RNAi knockdown in Drosophila, we report a global screen for an innate behavior and identify hundreds of genes implicated in heat nociception, including the α2δ family calcium channel subunit straightjacket (stj). Mice mutant for the stj ortholog CACNA2D3 (α2δ3) also exhibit impaired behavioral heat pain sensitivity. In addition, in humans, α2δ3 SNP variants associate with reduced sensitivity to acute noxious heat and chronic back pain. Functional imaging in α2δ3 mutant mice revealed impaired transmission of thermal pain-evoked signals from the thalamus to higher-order pain centers. Intriguingly, in α2δ3 mutant mice, thermal pain and tactile stimulation triggered strong cross-activation, or synesthesia, of brain regions involved in vision, olfaction, and hearing.
Gaspa, Laura; González-Medina, Alberto; Hidalgo, Elena; Ayté, José
The Schizosaccharomyces pombe MBF complex activates the transcription of genes required for DNA synthesis and S phase. The MBF complex contains several proteins, including the core components Cdc10, Res1 and Res2, the co-repressor proteins Yox1 and Nrm1 and the co-activator Rep2. It has recently been shown how MBF is regulated when either the DNA damage or the DNA synthesis checkpoints are activated. However, how MBF is regulated in a normal unperturbed cell cycle is still not well understood. We have set up a genome-wide genomic screen searching for global regulators of MBF. We have crossed our knock-out collection library with a reporter strain that allows the measurement of MBF activity in live cells by flow cytometry. We confirm previously known regulators of MBF and show that COP9/signalosome and tRNA methyltransferases also regulate MBF activity.
Liu, Ying; Ge, Qinglan; Chan, Brian; Liu, Hanhan; Singh, Shree Ram; Manley, Jacob; Lee, Jae; Weideman, Ann Marie; Hou, Gerald; Hou, Steven X.
Stem cells are regulated both intrinsically and externally, including by signals from the local environment and distant organs. To identify genes and pathways that regulate stem-cell fates in the whole organism, we perform a genome-wide transgenic RNAi screen through ubiquitous gene knockdowns, focusing on regulators of adult Drosophila testis germline stem cells (GSCs). Here we identify 530 genes that regulate GSC maintenance and differentiation. Of these, we further knock down 113 selected genes using cell-type-specific Gal4s and find that more than half were external regulators, that is, from the local microenvironment or more distal sources. Some genes, for example, versatile (vers), encoding a heterochromatin protein, regulates GSC fates differentially in different cell types and through multiple pathways. We also find that mitosis/cytokinesis proteins are especially important for male GSC maintenance. Our findings provide valuable insights and resources for studying stem cell regulation at the organismal level. PMID:27484291
You, Lei; Chang, De; Du, Hong-Zhen; Zhao, Yu-Pei
Gemcitabine has been a first-line chemotherapy agent for advanced pancreatic cancer, which is associated with one of the lowest 5 years survival rates among human cancers. Due to our lack of understanding of the genetic determinants of Gemcitabine sensitivity in pancreatic cancer, the therapeutic effectiveness of Gemcitabine chemotherapy is typically unpredictable. Using a genome-wide and piggyBac transposon-based genetic screening platform, we identified the PVT1 gene as a regulator of Gemcitabine sensitivity and showed that functional inactivation of the PVT1 gene led to enhanced Gemcitabine sensitivity in human pancreatic cancer ASPC-1 cells. The integration of the piggyBac transposon-based vector system into intron 3 of PVT1 was within a common site of oncogenic retroviral insertions and chromosomal translocations. PVT1 is a non-protein encoding gene; the genomic arrangement of PVT1 and its co-amplification with MYC have been implicated in the tumorigenesis of a variety of cancers. The molecular mechanism of PVT1 transcripts in gene regulation remains a puzzle. We demonstrated that overexpression of a full length PVT1 cDNA in the antisense orientation reconstituted enhanced sensitivity to Gemcitabine in naïve ASPC-1 cells, whereas overexpression of a full length PVT1 cDNA in the sense orientation resulted in decreased sensitivity to Gemcitabine. Our results identified PVT1 as a regulator of Gemcitabine sensitivity in pancreatic cancer cells and validated the genome-wide genetic screening approach for the identification of genetic determinants as well as potential biomarkers for the rational design of Gemcitabine chemotherapies for pancreatic cancer.
Yoshino, Seiko; Hara, Toshiro; Weng, Jane S; Takahashi, Yuka; Seiki, Motoharu; Sakamoto, Takeharu
Oxygen is a vital requirement for multi-cellular organisms to generate energy and cells have developed multiple compensatory mechanisms to adapt to stressful hypoxic conditions. Such adaptive mechanisms are intricately interconnected with other signaling pathways that regulate cellular functions such as cell growth. However, our understanding of the overall system governing the cellular response to the availability of oxygen remains limited. To identify new genes involved in the response to hypoxic stress, we have performed a genome-wide gene knockdown analysis in human lung carcinoma PC8 cells using an shRNA library carried by a lentiviral vector. The knockdown analysis was performed under both normoxic and hypoxic conditions to identify shRNA sequences enriched or lost in the resulting selected cell populations. Consequently, we identified 56 candidate genes that might contribute to the cellular response to hypoxia. Subsequent individual knockdown of each gene demonstrated that 13 of these have a significant effect upon oxygen-sensitive cell growth. The identification of BCL2L1, which encodes a Bcl-2 family protein that plays a role in cell survival by preventing apoptosis, validates the successful design of our screen. The other selected genes have not previously been directly implicated in the cellular response to hypoxia. Interestingly, hypoxia did not directly enhance the expression of any of the identified genes, suggesting that we have identified a new class of genes that have been missed by conventional gene expression analyses to identify hypoxia response genes. Thus, our genetic screening method using a genome-wide shRNA library and the newly-identified genes represent useful tools to analyze the cellular systems that respond to hypoxic stress.
Schäfer, Antje; Wolf, Dieter H
In this chapter, a genetic approach is presented that leads to the isolation of mutants and to the identification of proteins involved in protein quality control and endoplasmic reticulum-associated degradation (ERAD). The method makes use of a genomic screen of a yeast deletion library (EUROSCARF). Transformation of each of the approx 5000 strains deleted in one nonvital gene each with a CPY* chimera containing CPY* C-terminally fused to a transmembrane domain and the cytosolic Leu2 protein (3-isopropylmalate dehydrogenase) constitutes the basic screening procedure. Because of a Leu2p deficiency in all deletion strains, cells can grow only when the CTL* chimera is present. As the CPY* module of CTL* will be recognized in ERAD-proficient cells, CTL* will be degraded and the strain is unable to grow. Therefore the absence of genes necessary for ER quality control and ERAD will allow cell growth and indicate the necessity of the respective gene for these processes.
Sinha, Amit; Rae, Robbie
RNA interference is a rapid, inexpensive, and highly effective tool used to inhibit gene function. In C. elegans, whole genome screens have been used to identify genes involved with numerous traits including aging and innate immunity. RNAi in C. elegans can be carried out via feeding, soaking, or injection. Here we outline protocols used to maintain, grow, and carry out RNAi via feeding in C. elegans and determine whether the inhibited genes are essential for lifespan or innate immunity.
Kennedy, Patrick J; Vashisht, Ajay A; Hoe, Kwang-Lae; Kim, Dong-Uk; Park, Han-Oh; Hayles, Jacqueline; Russell, Paul
Cadmium is a worldwide environmental toxicant responsible for a range of human diseases including cancer. Cellular injury from cadmium is minimized by stress-responsive detoxification mechanisms. We explored the genetic requirements for cadmium tolerance by individually screening mutants from the fission yeast (Schizosaccharomyces pombe) haploid deletion collection for inhibited growth on agar growth media containing cadmium. Cadmium-sensitive mutants were further tested for sensitivity to oxidative stress (hydrogen peroxide) and osmotic stress (potassium chloride). Of 2649 mutants screened, 237 were sensitive to cadmium, of which 168 were cadmium specific. Most were previously unknown to be involved in cadmium tolerance. The 237 genes represent a number of pathways including sulfate assimilation, phytochelatin synthesis and transport, ubiquinone (Coenzyme Q10) biosynthesis, stress signaling, cell wall biosynthesis and cell morphology, gene expression and chromatin remodeling, vacuole function, and intracellular transport of macromolecules. The ubiquinone biosynthesis mutants are acutely sensitive to cadmium but only mildly sensitive to hydrogen peroxide, indicating that Coenzyme Q10 plays a larger role in cadmium tolerance than just as an antioxidant. These and several other mutants turn yellow when exposed to cadmium, suggesting cadmium sulfide accumulation. This phenotype can potentially be used as a biomarker for cadmium. There is remarkably little overlap with a comparable screen of the Saccharomyces cerevisiae haploid deletion collection, indicating that the two distantly related yeasts utilize significantly different strategies for coping with cadmium stress. These strategies and their relation to cadmium detoxification in humans are discussed.
Misra, Mala; Edmund, Hendia; Ennis, Darragh; Schlueter, Marissa A.; Marot, Jessica E.; Tambasco, Janet; Barlow, Ida; Sigurbjornsdottir, Sara; Mathew, Renjith; Vallés, Ana Maria; Wojciech, Waldemar; Roth, Siegfried; Davis, Ilan; Leptin, Maria; Gavis, Elizabeth R.
Localizing messenger RNAs at specific subcellular sites is a conserved mechanism for targeting the synthesis of cytoplasmic proteins to distinct subcellular domains, thereby generating the asymmetric protein distributions necessary for cellular and developmental polarity. However, the full range of transcripts that are asymmetrically distributed in specialized cell types, and the significance of their localization, especially in the nervous system, are not known. We used the EP-MS2 method, which combines EP transposon insertion with the MS2/MCP in vivo fluorescent labeling system, to screen for novel localized transcripts in polarized cells, focusing on the highly branched Drosophila class IV dendritic arborization neurons. Of a total of 541 lines screened, we identified 55 EP-MS2 insertions producing transcripts that were enriched in neuronal processes, particularly in dendrites. The 47 genes identified by these insertions encode molecularly diverse proteins, and are enriched for genes that function in neuronal development and physiology. RNAi-mediated knockdown confirmed roles for many of the candidate genes in dendrite morphogenesis. We propose that the transport of mRNAs encoded by these genes into the dendrites allows their expression to be regulated on a local scale during the dynamic developmental processes of dendrite outgrowth, branching, and/or remodeling. PMID:27260999
Misra, Mala; Edmund, Hendia; Ennis, Darragh; Schlueter, Marissa A; Marot, Jessica E; Tambasco, Janet; Barlow, Ida; Sigurbjornsdottir, Sara; Mathew, Renjith; Vallés, Ana Maria; Wojciech, Waldemar; Roth, Siegfried; Davis, Ilan; Leptin, Maria; Gavis, Elizabeth R
Localizing messenger RNAs at specific subcellular sites is a conserved mechanism for targeting the synthesis of cytoplasmic proteins to distinct subcellular domains, thereby generating the asymmetric protein distributions necessary for cellular and developmental polarity. However, the full range of transcripts that are asymmetrically distributed in specialized cell types, and the significance of their localization, especially in the nervous system, are not known. We used the EP-MS2 method, which combines EP transposon insertion with the MS2/MCP in vivo fluorescent labeling system, to screen for novel localized transcripts in polarized cells, focusing on the highly branched Drosophila class IV dendritic arborization neurons. Of a total of 541 lines screened, we identified 55 EP-MS2 insertions producing transcripts that were enriched in neuronal processes, particularly in dendrites. The 47 genes identified by these insertions encode molecularly diverse proteins, and are enriched for genes that function in neuronal development and physiology. RNAi-mediated knockdown confirmed roles for many of the candidate genes in dendrite morphogenesis. We propose that the transport of mRNAs encoded by these genes into the dendrites allows their expression to be regulated on a local scale during the dynamic developmental processes of dendrite outgrowth, branching, and/or remodeling.
Cheng, V.; Stotz, H. U.; Hippchen, K.; Bakalinsky, A. T.
Oxalic acid is an important virulence factor produced by phytopathogenic filamentous fungi. In order to discover yeast genes whose orthologs in the pathogen may confer self-tolerance and whose plant orthologs may protect the host, a Saccharomyces cerevisiae deletion library consisting of 4,827 haploid mutants harboring deletions in nonessential genes was screened for growth inhibition and survival in a rich medium containing 30 mM oxalic acid at pH 3. A total of 31 mutants were identified that had significantly lower cell yields in oxalate medium than in an oxalate-free medium. About 35% of these mutants had not previously been detected in published screens for sensitivity to sorbic or citric acid. Mutants impaired in endosomal transport, the rgp1Δ, ric1Δ, snf7Δ, vps16Δ, vps20Δ, and vps51Δ mutants, were significantly overrepresented relative to their frequency among all verified yeast open reading frames. Oxalate exposure to a subset of five mutants, the drs2Δ, vps16Δ, vps51Δ, ric1Δ, and rib4Δ mutants, was lethal. With the exception of the rib4Δ mutant, all of these mutants are impaired in vesicle-mediated transport. Indirect evidence is provided suggesting that the sensitivity of the rib4Δ mutant, a riboflavin auxotroph, is due to oxalate-mediated interference with riboflavin uptake by the putative monocarboxylate transporter Mch5. PMID:17644632
San Martin-Uriz, Patxi; Mirete, Salvador; Alcolea, Pedro J; Gomez, Manuel J; Amils, Ricardo; Gonzalez-Pastor, Jose E
Acidiphilium spp. are conspicuous dwellers of acidic, metal-rich environments. Indeed, they are among the most metal-resistant organisms; yet little is known about the mechanisms behind the metal tolerance in this genus. Acidiphilium sp. PM is an environmental isolate from Rio Tinto, an acidic, metal-laden river located in southwestern Spain. The characterization of its metal resistance revealed a remarkable ability to tolerate high Ni concentrations. Here we report the screening of a genomic library of Acidiphilium sp. PM to identify genes involved in Ni resistance. This approach revealed seven different genes conferring Ni resistance to E. coli, two of which form an operon encoding the ATP-dependent protease HslVU (ClpQY). This protease was found to enhance resistance to both Ni and Co in E. coli, a function not previously reported. Other Ni-resistance determinants include genes involved in lipopolysaccharide biosynthesis and the synthesis of branched amino acids. The diversity of molecular functions of the genes recovered in the screening suggests that Ni resistance in Acidiphilium sp. PM probably relies on different molecular mechanisms.
Mano, Miguel; Ippodrino, Rudy; Zentilin, Lorena; Zacchigna, Serena; Giacca, Mauro
Viral vectors based on the adeno-associated virus (AAV) hold great promise for in vivo gene transfer; several unknowns, however, still limit the vectors' broader and more efficient application. Here, we report the results of a high-throughput, whole-genome siRNA screening aimed at identifying cellular factors regulating AAV transduction. We identified 1,483 genes affecting vector efficiency more than 4-fold and up to 50-fold, either negatively or positively. Most of these factors have not previously been associated to AAV infection. The most effective siRNAs were independent from the virus serotype or analyzed cell type and were equally evident for single-stranded and self-complementary AAV vectors. A common characteristic of the most effective siRNAs was the induction of cellular DNA damage and activation of a cell cycle checkpoint. This information can be exploited for the development of more efficient AAV-based gene delivery procedures. Administration of the most effective siRNAs identified by the screening to the liver significantly improved in vivo AAV transduction efficiency.
San Martin-Uriz, Patxi; Mirete, Salvador; Alcolea, Pedro J.; Gomez, Manuel J.; Amils, Ricardo; Gonzalez-Pastor, Jose E.
Acidiphilium spp. are conspicuous dwellers of acidic, metal-rich environments. Indeed, they are among the most metal-resistant organisms; yet little is known about the mechanisms behind the metal tolerance in this genus. Acidiphilium sp. PM is an environmental isolate from Rio Tinto, an acidic, metal-laden river located in southwestern Spain. The characterization of its metal resistance revealed a remarkable ability to tolerate high Ni concentrations. Here we report the screening of a genomic library of Acidiphilium sp. PM to identify genes involved in Ni resistance. This approach revealed seven different genes conferring Ni resistance to E. coli, two of which form an operon encoding the ATP-dependent protease HslVU (ClpQY). This protease was found to enhance resistance to both Ni and Co in E. coli, a function not previously reported. Other Ni-resistance determinants include genes involved in lipopolysaccharide biosynthesis and the synthesis of branched amino acids. The diversity of molecular functions of the genes recovered in the screening suggests that Ni resistance in Acidiphilium sp. PM probably relies on different molecular mechanisms. PMID:24740277
Baptist, Guillaume; Pinel, Corinne; Ranquet, Caroline; Izard, Jérôme; Ropers, Delphine; de Jong, Hidde; Geiselmann, Johannes
We have developed a new screening methodology for identifying all genes that control the expression of a target gene through genetic or metabolic interactions. The screen combines mutant libraries with luciferase reporter constructs, whose expression can be monitored in vivo and over time in different environmental conditions. We apply the method to identify the genes that control the expression of the gene acs, encoding the acetyl coenzyme A synthetase, in Escherichia coli. We confirm most of the known genetic regulators, including CRP–cAMP, IHF and components of the phosphotransferase system. In addition, we identify new regulatory interactions, many of which involve metabolic intermediates or metabolic sensing, such as the genes pgi, pfkA, sucB and lpdA, encoding enzymes in glycolysis and the TCA cycle. Some of these novel interactions were validated by quantitative reverse transcriptase-polymerase chain reaction. More generally, we observe that a large number of mutants directly or indirectly influence acs expression, an effect confirmed for a second promoter, sdhC. The method is applicable to any promoter fused to a luminescent reporter gene in combination with a deletion mutant library. PMID:23892289
Pettitt, Stephen J.; Krastev, Dragomir B.; Pemberton, Helen N.; Fontebasso, Yari; Frankum, Jessica; Rehman, Farah L.; Brough, Rachel; Song, Feifei; Bajrami, Ilirjana; Rafiq, Rumana; Wallberg, Fredrik; Kozarewa, Iwanka; Fenwick, Kerry; Armisen-Garrido, Javier; Swain, Amanda; Gulati, Aditi; Campbell, James; Ashworth, Alan; Lord, Christopher J.
We describe a screen for cellular response to drugs that makes use of haploid embryonic stem cells. We generated ten libraries of mutants with piggyBac gene trap transposon integrations, totalling approximately 100,000 mutant clones. Random barcode sequences were inserted into the transposon vector to allow the number of cells bearing each insertion to be measured by amplifying and sequencing the barcodes. These barcodes were associated with their integration sites by inverse PCR. We exposed these libraries to commonly used cancer drugs and profiled changes in barcode abundance by Ion Torrent sequencing in order to identify mutations that conferred sensitivity. Drugs tested included conventional chemotherapeutics as well as targeted inhibitors of topoisomerases, poly(ADP-ribose) polymerase (PARP), Hsp90 and WEE1. PMID:28248920
Chaillot, Julien; Cook, Michael A.; Corbeil, Jacques; Sellam, Adnane
One of the most critical but still poorly understood aspects of eukaryotic cell proliferation is the basis for commitment to cell division in late G1 phase, called Start in yeast and the Restriction Point in metazoans. In all species, a critical cell size threshold coordinates cell growth with cell division and thereby establishes a homeostatic cell size. While a comprehensive survey of cell size genetic determinism has been performed in the saprophytic yeasts Saccharomyces cerevisiae and Schizosaccharomyces pombe, very little is known in pathogenic fungi. As a number of critical Start regulators are haploinsufficient for cell size, we applied a quantitative analysis of the size phenome, using elutriation-barcode sequencing methodology, to 5639 barcoded heterozygous deletion strains of the opportunistic yeast Candida albicans. Our screen identified conserved known regulators and biological processes required to maintain size homeostasis in the opportunistic yeast C. albicans. We also identified novel C. albicans-specific size genes and provided a conceptual framework for future mechanistic studies. Interestingly, some of the size genes identified were required for fungal pathogenicity suggesting that cell size homeostasis may be elemental to C. albicans fitness or virulence inside the host. PMID:28040776
van der Weyden, Louise; Arends, Mark J; Campbell, Andrew D; Bald, Tobias; Wardle-Jones, Hannah; Griggs, Nicola; Velasco-Herrera, Martin Del Castillo; Tüting, Thomas; Sansom, Owen J; Karp, Natasha A; Clare, Simon; Gleeson, Diane; Ryder, Edward; Galli, Antonella; Tuck, Elizabeth; Cambridge, Emma L; Voet, Thierry; Macaulay, Iain C; Wong, Kim; Spiegel, Sarah; Speak, Anneliese O; Adams, David J
Metastasis is the leading cause of death for cancer patients. This multi-stage process requires tumour cells to survive in the circulation, extravasate at distant sites, then proliferate; it involves contributions from both the tumour cell and tumour microenvironment ('host', which includes stromal cells and the immune system). Studies suggest the early steps of the metastatic process are relatively efficient, with the post-extravasation regulation of tumour growth ('colonization') being critical in determining metastatic outcome. Here we show the results of screening 810 mutant mouse lines using an in vivo assay to identify microenvironmental regulators of metastatic colonization. We identify 23 genes that, when disrupted in mouse, modify the ability of tumour cells to establish metastatic foci, with 19 of these genes not previously demonstrated to play a role in host control of metastasis. The largest reduction in pulmonary metastasis was observed in sphingosine-1-phosphate (S1P) transporter spinster homologue 2 (Spns2)-deficient mice. We demonstrate a novel outcome of S1P-mediated regulation of lymphocyte trafficking, whereby deletion of Spns2, either globally or in a lymphatic endothelial-specific manner, creates a circulating lymphopenia and a higher percentage of effector T cells and natural killer (NK) cells present in the lung. This allows for potent tumour cell killing, and an overall decreased metastatic burden.
Lee, Won Jae; Škalamera, Dubravka; Dahmer-Heath, Mareike; Shakhbazov, Konstanin; Ranall, Max V; Fox, Carly; Lambie, Duncan; Stevenson, Alexander J; Yaswen, Paul; Gonda, Thomas J; Gabrielli, Brian
Malignant melanomas often arise from nevi, which result from initial oncogene-induced hyperproliferation of melanocytes that are maintained in a CDKN2A/p16-mediated senescent state. Thus, genes that can bypass this senescence barrier are likely to contribute to melanoma development. We have performed a gain-of-function screen of 17,030 lentivirally expressed human open reading frames (ORFs) in a melanoma cell line containing an inducible p16 construct to identify such genes. Genes known to bypass p16-induced senescence arrest, including the human papilloma virus 18 E7 gene ( HPV18E7), and genes such as the p16-binding CDK6 with expected functions, as well as panel of novel genes, were identified, including high-mobility group box (HMGB) proteins. A number of these were further validated in two other models of p16-induced senescence. Tissue immunohistochemistry demonstrated higher levels of CDK6 in primary melanomas compared with normal skin and nevi. Reduction of CDK6 levels drove melanoma cells expressing functional p16 into senescence, demonstrating its contribution to bypass senescence.
White, Cory H; Ohmen, Jeffrey D; Sheth, Sonal; Zebboudj, Amina F; McHugh, Richard K; Hoffman, Larry F; Lusis, Aldons J; Davis, Richard C; Friedman, Rick A
Noise-induced hearing loss (NIHL) is one of the more common sources of environmentally induced hearing loss in adults. In a mouse model, Castaneous (CAST/Ei) is an inbred strain that is resistant to NIHL, while the C57BL/6J strain is susceptible. We have used the genome-tagged mice (GTM) library of congenic strains, carrying defined segments of the CAST/Ei genome introgressed onto the C57BL/6J background, to search for loci modifying the noise-induced damage seen in the C57BL/6J strain. NIHL was induced by exposing 6-8-week old mice to 108 dB SPL intensity noise. We tested the hearing of each mouse strain up to 23 days after noise exposure using auditory brainstem response (ABR). This study identifies a number of genetic loci that modify the initial response to damaging noise, as well as long-term recovery. The data suggest that multiple alleles within the CAST/Ei genome modify the pathogenesis of NIHL and that screening congenic libraries for loci that underlie traits of interest can be easily carried out in a high-throughput fashion.
Lamitina, Todd; Huang, Chunyi George; Strange, Kevin
The detection, stabilization, and repair of stress-induced damage are essential requirements for cellular life. All cells respond to osmotic stress-induced water loss with increased expression of genes that mediate accumulation of organic osmolytes, solutes that function as chemical chaperones and restore osmotic homeostasis. The signals and signaling mechanisms that regulate osmoprotective gene expression in animal cells are poorly understood. Here, we show that gpdh-1 and gpdh-2, genes that mediate the accumulation of the organic osmolyte glycerol, are essential for survival of the nematode Caenorhabditis elegans during osmotic stress. Expression of GFP driven by the gpdh-1 promoter (Pgpdh-1::GFP) is detected only during hypertonic stress but is not induced by other stressors. Using Pgpdh-1::GFP expression as a phenotype, we screened ≈16,000 genes by RNAi feeding and identified 122 that cause constitutive activation of gpdh-1 expression and glycerol accumulation. Many of these genes function to regulate protein translation and cotranslational protein folding and to target and degrade denatured proteins, suggesting that the accumulation of misfolded proteins functions as a signal to activate osmoprotective gene expression and organic osmolyte accumulation in animal cells. Consistent with this hypothesis, 73% of these protein-homeostasis genes have been shown to slow age-dependent protein aggregation in C. elegans. Because diverse environmental stressors and numerous disease states result in protein misfolding, mechanisms must exist that discriminate between osmotically induced and other forms of stress-induced protein damage. Our findings provide a foundation for understanding how these damage-selectivity mechanisms function. PMID:16880390
Wong, Lai H.; Flibotte, Stephane; Sinha, Sunita; Chiang, Jennifer; Giaever, Guri; Nislow, Corey
Drug resistance is a consequence of how most modern medicines work. Drugs exert pressure on cells that causes death or the evolution of resistance. Indeed, highly specific drugs are rendered ineffective by a single DNA mutation. In this study, we apply the drug methotrexate, which is widely used in cancer and rheumatoid arthritis, and perform evolution experiments on Baker’s yeast to ask the different ways in which cells become drug resistant. Because of the conserved nature of biological pathways between yeast and man, our results can inform how the same mechanism may operate to render human cells resistant to treatment. Exposure of cells to small molecules and drug therapies imposes a strong selective pressure. As a result, cells rapidly acquire mutations in order to survive. These include resistant variants of the drug target as well as those that modulate drug transport and detoxification. To systematically explore how cells acquire drug resistance in an unbiased manner, rapid cost-effective approaches are required. Methotrexate, as one of the first rationally designed anticancer drugs, has served as a prototypic example of such acquired resistance. Known methotrexate resistance mechanisms include mutations that increase expression of the dihydrofolate reductase (DHFR) target as well as those that maintain function yet reduce the drug’s binding affinity. Recent evidence suggests that target-independent, epistatic mutations can also result in resistance to methotrexate. Currently, however, the relative contribution of such unlinked resistance mutations is not well understood. To address this issue, we took advantage of Saccharomyces cerevisiae as a model eukaryotic system that combined with whole-genome sequencing and a rapid screening methodology, allowed the identification of causative mutations that modulate resistance to methotrexate. We found a recurrent missense mutation in SEC21 (orthologous to human COPG1), which we confirmed in 10 de novo
Toret, Christopher P.; D’Ambrosio, Michael V.; Vale, Ronald D.; Simon, Michael A.
Cadherins and associated catenins provide an important structural interface between neighboring cells, the actin cytoskeleton, and intracellular signaling pathways in a variety of cell types throughout the Metazoa. However, the full inventory of the proteins and pathways required for cadherin-mediated adhesion has not been established. To this end, we completed a genome-wide (∼14,000 genes) ribonucleic acid interference (RNAi) screen that targeted Ca2+-dependent adhesion in DE-cadherin–expressing Drosophila melanogaster S2 cells in suspension culture. This novel screen eliminated Ca2+-independent cell–cell adhesion, integrin-based adhesion, cell spreading, and cell migration. We identified 17 interconnected regulatory hubs, based on protein functions and protein–protein interactions that regulate the levels of the core cadherin–catenin complex and coordinate cadherin-mediated cell–cell adhesion. Representative proteins from these hubs were analyzed further in Drosophila oogenesis, using targeted germline RNAi, and adhesion was analyzed in Madin–Darby canine kidney mammalian epithelial cell–cell adhesion. These experiments reveal roles for a diversity of cellular pathways that are required for cadherin function in Metazoa, including cytoskeleton organization, cell–substrate interactions, and nuclear and cytoplasmic signaling. PMID:24446484
Nybakken, Kent; Vokes, Steven A; Lin, Ting-Yi; McMahon, Andrew P; Perrimon, Norbert
Members of the Hedgehog (Hh) family of signaling proteins are powerful regulators of developmental processes in many organisms and have been implicated in many human disease states. Here we report the results of a genome-wide RNA interference screen in Drosophila melanogaster cells for new components of the Hh signaling pathway. The screen identified hundreds of potential new regulators of Hh signaling, including many large protein complexes with pleiotropic effects, such as the coat protein complex I (COPI) complex, the ribosome and the proteasome. We identified the multimeric protein phosphatase 2A (PP2A) and two new kinases, the D. melanogaster orthologs of the vertebrate PITSLRE and cyclin-dependent kinase-9 (CDK9) kinases, as Hh regulators. We also identified a large group of constitutive and alternative splicing factors, two nucleoporins involved in mRNA export and several RNA-regulatory proteins as potent regulators of Hh signal transduction, indicating that splicing regulation and mRNA transport have a previously unrecognized role in Hh signaling. Finally, we showed that several of these genes have conserved roles in mammalian Hh signaling.
Zhao, Lei; Yang, Qian; Zheng, Ju; Zhu, Xuefeng; Hao, Xinxin; Song, Jia; Lebacq, Tom; Franssens, Vanessa; Winderickx, Joris; Nystrom, Thomas; Liu, Beidong
Synphilin-1 is a major component of Parkinson’s disease (PD) inclusion bodies implicated in PD pathogenesis. However, the machinery controlling synphilin-1 inclusion formation remains unclear. Here, we investigated synphilin-1 inclusion formation using a systematic genome-wide, high-content imaging based screening approach (HCI) in the yeast Saccharomyces cerevisiae. By combining with a secondary screening for mutants showing significant changes on fluorescence signal intensity, we filtered out hits that significantly decreased the expression level of synphilin-1. We found 133 yeast genes that didn’t affect synphilin-1 expression but that were required for the formation of synphilin-1 inclusions. Functional enrichment and physical interaction network analysis revealed these genes to encode for functions involved in cytoskeleton organization, histone modification, sister chromatid segregation, glycolipid biosynthetic process, DNA repair and replication. All hits were confirmed by conventional microscopy. Complementation assays were performed with a selected group of mutants, results indicated that the observed phenotypic changes in synphilin-1 inclusion formation were directly caused by the loss of corresponding genes of the deletion mutants. Further growth assays of these mutants showed a significant synthetic sick effect upon synphilin-1 expression, which supports the hypothesis that matured inclusions represent an end stage of several events meant to protect cells against the synphilin-1 cytotoxicity. PMID:27440388
McKnight, Nicole C; Jefferies, Harold B J; Alemu, Endalkachew A; Saunders, Rebecca E; Howell, Michael; Johansen, Terje; Tooze, Sharon A
Autophagy is a catabolic process by which cytoplasmic components are sequestered and transported by autophagosomes to lysosomes for degradation, enabling recycling of these components and providing cells with amino acids during starvation. It is a highly regulated process and its deregulation contributes to multiple diseases. Despite its importance in cell homeostasis, autophagy is not fully understood. To find new proteins that modulate starvation-induced autophagy, we performed a genome-wide siRNA screen in a stable human cell line expressing GFP–LC3, the marker-protein for autophagosomes. Using stringent validation criteria, our screen identified nine novel autophagy regulators. Among the hits required for autophagosome formation are SCOC (short coiled-coil protein), a Golgi protein, which interacts with fasciculation and elongation protein zeta 1 (FEZ1), an ULK1-binding protein. SCOC forms a starvation-sensitive trimeric complex with UVRAG (UV radiation resistance associated gene) and FEZ1 and may regulate ULK1 and Beclin 1 complex activities. A second candidate WAC is required for starvation-induced autophagy but also acts as a potential negative regulator of the ubiquitin-proteasome system. The identification of these novel regulatory proteins with diverse functions in autophagy contributes towards a fuller understanding of autophagosome formation. PMID:22354037
Kwon, Yong-Jae; Hu, Jennifer A.; Krupinski, Thomas; Casalino-Matsuda, S. Marina; Sporn, Peter H. S.; Sznajder, Jacob I.; Beitel, Greg J.
Hypercapnia, elevated partial pressure of carbon dioxide (PCO2) in blood and tissue, develops in many patients with chronic severe obstructive pulmonary disease and other advanced lung disorders. Patients with advanced disease frequently develop bacterial lung infections, and hypercapnia is a risk factor for mortality in such individuals. We previously demonstrated that hypercapnia suppresses induction of NF-κB-regulated innate immune response genes required for host defense in human, mouse and Drosophila cells, and increases mortality from bacterial infections in both mice and Drosophila. However, the molecular mediator(s) of hypercapnic immune suppression are undefined. Here, we report a genome-wide RNAi screen in Drosophila S2* cells stimulated with bacterial peptidoglycan (PGN). The screen identified 16 genes with human orthologs whose knockdown reduced hypercapnic suppression of the gene encoding the antimicrobial peptide (AMPs) Diptericin (Dipt), but did not increase Dipt mRNA levels in air. In vivo tests of one of the strongest screen hits, Zfh2 (mammalian orthologs ZFHX3/ATBF1 and ZFHX4), demonstrate that reducing zfh2 function using a mutation or RNAi improves survival of flies exposed to elevated CO2 and infected with S. aureus. Tissue-specific knockdown of zfh2 in the fat body, the major immune and metabolic organ of the fly, mitigates hypercapnia-induced reductions in Dipt and other AMPs and improves resistance of CO2-exposed flies to infection. Zfh2 mutations also partially rescue hypercapnia-induced delays in egg hatching, suggesting that Zfh2's role in mediating responses to hypercapnia extends beyond the immune system. Together, these results identify Zfh2 as the first in vivo mediator of hypercapnic immune suppression. PMID:26643480
Marza, Esther; Taouji, Saïd; Barroso, Kim; Raymond, Anne-Aurélie; Guignard, Léo; Bonneu, Marc; Pallares-Lupon, Néstor; Dupuy, Jean-William; Fernandez-Zapico, Martin E; Rosenbaum, Jean; Palladino, Francesca; Dupuy, Denis; Chevet, Eric
The accumulation of misfolded proteins in the endoplasmic reticulum (ER) activates the Unfolded Protein Response (UPR(ER)) to restore ER homeostasis. The AAA(+) ATPase p97/CDC-48 plays key roles in ER stress by promoting both ER protein degradation and transcription of UPR(ER) genes. Although the mechanisms associated with protein degradation are now well established, the molecular events involved in the regulation of gene transcription by p97/CDC-48 remain unclear. Using a reporter-based genome-wide RNAi screen in combination with quantitative proteomic analysis in Caenorhabditis elegans, we have identified RUVB-2, a AAA(+) ATPase, as a novel repressor of a subset of UPR(ER) genes. We show that degradation of RUVB-2 by CDC-48 enhances expression of ER stress response genes through an XBP1-dependent mechanism. The functional interplay between CDC-48 and RUVB-2 in controlling transcription of select UPR(ER) genes appears conserved in human cells. Together, these results describe a novel role for p97/CDC-48, whereby its role in protein degradation is integrated with its role in regulating expression of ER stress response genes.
Moris, Naomi; Shrivastava, Jaya; Jeffery, Linda; Li, Juan-Juan; Hayles, Jacqueline; Nurse, Paul
We have carried out a haploinsufficiency (HI) screen in fission yeast using heterozygous deletion diploid mutants of a genome-wide set of cell cycle genes to identify genes encoding products whose level determines the rate of progression through the cell cycle. Cell size at division was used as a measure of advancement or delay of the G2-M transition of rod-shaped fission yeast cells. We found that 13 mutants were significantly longer or shorter (greater than 10%) than control cells at cell division. These included mutants of the cdc2, cdc25, wee1 and pom1 genes, which have previously been shown to play a role in the timing of entry into mitosis, and which validate this approach. Seven of these genes are involved in regulation of the G2-M transition, 5 for nuclear transport and one for nucleotide metabolism. In addition we identified 4 more genes that were 8-10% longer or shorter than the control that also had roles in regulation of the G2-M transition or in nuclear transport. The genes identified here are all conserved in human cells, suggesting that this dataset will be useful as a basis for further studies to identify rate-limiting steps for progression through the cell cycle in other eukaryotes.
Wang, Huaishan; Sherbini, Omar; Ling-lin Pai, Emily; Kwon, Ji-Sun; He, Wei; Wang, Hong; Chi, Zhikai; Xu, Jinchong; Jiang, Haisong; Andrabi, Shaida A.
Neuronal loss caused by ischemic injury, trauma, or disease can lead to devastating consequences for the individual. With the goal of limiting neuronal loss, a number of cell death pathways have been studied, but there may be additional contributors to neuronal death that are yet unknown. To identify previously unknown cell death mediators, we performed a high-content genome-wide screening of short, interfering RNA (siRNA) with an siRNA library in murine neural stem cells after exposure to N-methyl-N-nitroso-N′-nitroguanidine (MNNG), which leads to DNA damage and cell death. Eighty genes were identified as key mediators for cell death. Among them, 14 are known cell death mediators and 66 have not previously been linked to cell death pathways. Using an integrated approach with functional and bioinformatics analysis, we provide possible molecular networks, interconnected pathways, and/or protein complexes that may participate in cell death. Of the 66 genes, we selected CCR3 for further evaluation and found that CCR3 is a mediator of neuronal injury. CCR3 inhibition or deletion protects murine cortical cultures from oxygen-glucose deprivation–induced cell death, and CCR3 deletion in mice provides protection from ischemia in vivo. Taken together, our findings suggest that CCR3 is a previously unknown mediator of cell death. Future identification of the neural cell death network in which CCR3 participates will enhance our understanding of the molecular mechanisms of neural cell death. PMID:27822494
Tamba, Cox Lwaka; Ni, Yuan-Li; Zhang, Yuan-Ming
Genome-wide association study (GWAS) entails examining a large number of single nucleotide polymorphisms (SNPs) in a limited sample with hundreds of individuals, implying a variable selection problem in the high dimensional dataset. Although many single-locus GWAS approaches under polygenic background and population structure controls have been widely used, some significant loci fail to be detected. In this study, we used an iterative modified-sure independence screening (ISIS) approach in reducing the number of SNPs to a moderate size. Expectation-Maximization (EM)-Bayesian least absolute shrinkage and selection operator (BLASSO) was used to estimate all the selected SNP effects for true quantitative trait nucleotide (QTN) detection. This method is referred to as ISIS EM-BLASSO algorithm. Monte Carlo simulation studies validated the new method, which has the highest empirical power in QTN detection and the highest accuracy in QTN effect estimation, and it is the fastest, as compared with efficient mixed-model association (EMMA), smoothly clipped absolute deviation (SCAD), fixed and random model circulating probability unification (FarmCPU), and multi-locus random-SNP-effect mixed linear model (mrMLM). To further demonstrate the new method, six flowering time traits in Arabidopsis thaliana were re-analyzed by four methods (New method, EMMA, FarmCPU, and mrMLM). As a result, the new method identified most previously reported genes. Therefore, the new method is a good alternative for multi-locus GWAS.
Deffrasnes, Celine; Marsh, Glenn A; Foo, Chwan Hong; Rootes, Christina L; Gould, Cathryn M; Grusovin, Julian; Monaghan, Paul; Lo, Michael K; Tompkins, S Mark; Adams, Timothy E; Lowenthal, John W; Simpson, Kaylene J; Stewart, Cameron R; Bean, Andrew G D; Wang, Lin-Fa
Hendra and Nipah viruses (genus Henipavirus, family Paramyxoviridae) are highly pathogenic bat-borne viruses. The need for high biocontainment when studying henipaviruses has hindered the development of therapeutics and knowledge of the viral infection cycle. We have performed a genome-wide siRNA screen at biosafety level 4 that identified 585 human proteins required for henipavirus infection. The host protein with the largest impact was fibrillarin, a nucleolar methyltransferase that was also required by measles, mumps and respiratory syncytial viruses for infection. While not required for cell entry, henipavirus RNA and protein syntheses were greatly impaired in cells lacking fibrillarin, indicating a crucial role in the RNA replication phase of infection. During infection, the Hendra virus matrix protein co-localized with fibrillarin in cell nucleoli, and co-associated as a complex in pulldown studies, while its nuclear import was unaffected in fibrillarin-depleted cells. Mutagenesis studies showed that the methyltransferase activity of fibrillarin was required for henipavirus infection, suggesting that this enzyme could be targeted therapeutically to combat henipavirus infections.
Foo, Chwan Hong; Rootes, Christina L.; Gould, Cathryn M.; Grusovin, Julian; Monaghan, Paul; Lo, Michael K.; Tompkins, S. Mark; Adams, Timothy E.; Lowenthal, John W.; Simpson, Kaylene J.; Stewart, Cameron R.; Bean, Andrew G. D.; Wang, Lin-Fa
Hendra and Nipah viruses (genus Henipavirus, family Paramyxoviridae) are highly pathogenic bat-borne viruses. The need for high biocontainment when studying henipaviruses has hindered the development of therapeutics and knowledge of the viral infection cycle. We have performed a genome-wide siRNA screen at biosafety level 4 that identified 585 human proteins required for henipavirus infection. The host protein with the largest impact was fibrillarin, a nucleolar methyltransferase that was also required by measles, mumps and respiratory syncytial viruses for infection. While not required for cell entry, henipavirus RNA and protein syntheses were greatly impaired in cells lacking fibrillarin, indicating a crucial role in the RNA replication phase of infection. During infection, the Hendra virus matrix protein co-localized with fibrillarin in cell nucleoli, and co-associated as a complex in pulldown studies, while its nuclear import was unaffected in fibrillarin-depleted cells. Mutagenesis studies showed that the methyltransferase activity of fibrillarin was required for henipavirus infection, suggesting that this enzyme could be targeted therapeutically to combat henipavirus infections. PMID:27010548
Wu, Jingyan; Bao, Alicia; Chatterjee, Kunal; Wan, Yao; Hopper, Anita K.
Transfer ribonucleic acids (tRNAs) are essential for protein synthesis. However, key gene products involved in tRNA biogenesis and subcellular movement remain to be discovered. We conducted the first comprehensive unbiased analysis of the role of nearly an entire proteome in tRNA biology and describe 162 novel and 12 previously known Saccharomyces cerevisiae gene products that function in tRNA processing, turnover, and subcellular movement. tRNA nuclear export is of particular interest because it is essential, but the known tRNA exporters (Los1 [exportin-t] and Msn5 [exportin-5]) are unessential. We report that mutations of CRM1 (Exportin-1), MEX67/MTR2 (TAP/p15), and five nucleoporins cause accumulation of unspliced tRNA, a hallmark of defective tRNA nuclear export. CRM1 mutation genetically interacts with los1Δ and causes altered tRNA nuclear–cytoplasmic distribution. The data implicate roles for the protein and mRNA nuclear export machineries in tRNA nuclear export. Mutations of genes encoding actin cytoskeleton components and mitochondrial outer membrane proteins also cause accumulation of unspliced tRNA, likely due to defective splicing on mitochondria. Additional gene products, such as chromatin modification enzymes, have unanticipated effects on pre-tRNA end processing. Thus, this genome-wide screen uncovered putative novel pathways for tRNA nuclear export and extensive links between tRNA biology and other aspects of cell physiology. PMID:26680305
Lipovsky, Alex; Popa, Andreea; Pimienta, Genaro; Wyler, Michael; Bhan, Ashima; Kuruvilla, Leena; Guie, Marie-Aude; Poffenberger, Adrian C.; Nelson, Christian D. S.; Atwood, Walter J.; DiMaio, Daniel
Despite major advances in our understanding of many aspects of human papillomavirus (HPV) biology, HPV entry is poorly understood. To identify cellular genes required for HPV entry, we conducted a genome-wide screen for siRNAs that inhibited infection of HeLa cells by HPV16 pseudovirus. Many retrograde transport factors were required for efficient infection, including multiple subunits of the retromer, which initiates retrograde transport from the endosome to the trans-Golgi network (TGN). The retromer has not been previously implicated in virus entry. Furthermore, HPV16 capsid proteins arrive in the TGN/Golgi in a retromer-dependent fashion during entry, and incoming HPV proteins form a stable complex with retromer subunits. We propose that HPV16 directly engages the retromer at the early or late endosome and traffics to the TGN/Golgi via the retrograde pathway during cell entry. These results provide important insights into HPV entry, identify numerous potential antiviral targets, and suggest that the role of the retromer in infection by other viruses should be assessed. PMID:23569269
Lipovsky, Alex; Popa, Andreea; Pimienta, Genaro; Wyler, Michael; Bhan, Ashima; Kuruvilla, Leena; Guie, Marie-Aude; Poffenberger, Adrian C; Nelson, Christian D S; Atwood, Walter J; DiMaio, Daniel
Despite major advances in our understanding of many aspects of human papillomavirus (HPV) biology, HPV entry is poorly understood. To identify cellular genes required for HPV entry, we conducted a genome-wide screen for siRNAs that inhibited infection of HeLa cells by HPV16 pseudovirus. Many retrograde transport factors were required for efficient infection, including multiple subunits of the retromer, which initiates retrograde transport from the endosome to the trans-Golgi network (TGN). The retromer has not been previously implicated in virus entry. Furthermore, HPV16 capsid proteins arrive in the TGN/Golgi in a retromer-dependent fashion during entry, and incoming HPV proteins form a stable complex with retromer subunits. We propose that HPV16 directly engages the retromer at the early or late endosome and traffics to the TGN/Golgi via the retrograde pathway during cell entry. These results provide important insights into HPV entry, identify numerous potential antiviral targets, and suggest that the role of the retromer in infection by other viruses should be assessed.
Tamba, Cox Lwaka; Ni, Yuan-Li; Zhang, Yuan-Ming
Genome-wide association study (GWAS) entails examining a large number of single nucleotide polymorphisms (SNPs) in a limited sample with hundreds of individuals, implying a variable selection problem in the high dimensional dataset. Although many single-locus GWAS approaches under polygenic background and population structure controls have been widely used, some significant loci fail to be detected. In this study, we used an iterative modified-sure independence screening (ISIS) approach in reducing the number of SNPs to a moderate size. Expectation-Maximization (EM)-Bayesian least absolute shrinkage and selection operator (BLASSO) was used to estimate all the selected SNP effects for true quantitative trait nucleotide (QTN) detection. This method is referred to as ISIS EM-BLASSO algorithm. Monte Carlo simulation studies validated the new method, which has the highest empirical power in QTN detection and the highest accuracy in QTN effect estimation, and it is the fastest, as compared with efficient mixed-model association (EMMA), smoothly clipped absolute deviation (SCAD), fixed and random model circulating probability unification (FarmCPU), and multi-locus random-SNP-effect mixed linear model (mrMLM). To further demonstrate the new method, six flowering time traits in Arabidopsis thaliana were re-analyzed by four methods (New method, EMMA, FarmCPU, and mrMLM). As a result, the new method identified most previously reported genes. Therefore, the new method is a good alternative for multi-locus GWAS. PMID:28141824
Zhang, Jianmin; Wang, Huaishan; Sherbini, Omar; Ling-Lin Pai, Emily; Kang, Sung-Ung; Kwon, Ji-Sun; Yang, Jia; He, Wei; Wang, Hong; Eacker, Stephen M; Chi, Zhikai; Mao, Xiaobo; Xu, Jinchong; Jiang, Haisong; Andrabi, Shaida A; Dawson, Ted M; Dawson, Valina L
Neuronal loss caused by ischemic injury, trauma, or disease can lead to devastating consequences for the individual. With the goal of limiting neuronal loss, a number of cell death pathways have been studied, but there may be additional contributors to neuronal death that are yet unknown. To identify previously unknown cell death mediators, we performed a high-content genome-wide screening of short, interfering RNA (siRNA) with an siRNA library in murine neural stem cells after exposure to N-methyl-N-nitroso-N'-nitroguanidine (MNNG), which leads to DNA damage and cell death. Eighty genes were identified as key mediators for cell death. Among them, 14 are known cell death mediators and 66 have not previously been linked to cell death pathways. Using an integrated approach with functional and bioinformatics analysis, we provide possible molecular networks, interconnected pathways, and/or protein complexes that may participate in cell death. Of the 66 genes, we selected CCR3 for further evaluation and found that CCR3 is a mediator of neuronal injury. CCR3 inhibition or deletion protects murine cortical cultures from oxygen-glucose deprivation-induced cell death, and CCR3 deletion in mice provides protection from ischemia in vivo. Taken together, our findings suggest that CCR3 is a previously unknown mediator of cell death. Future identification of the neural cell death network in which CCR3 participates will enhance our understanding of the molecular mechanisms of neural cell death.
Wolman, Marc A.; Jain, Roshan A.; Marsden, Kurt C.; Bell, Hannah; Skinner, Julianne; Hayer, Katharina E.; Hogenesch, John B.; Granato, Michael
Summary Habituation represents a fundamental form of learning, yet the underlying molecular genetic mechanisms are not well defined. Here we report on a genome-wide genetic screen, coupled with whole genome sequencing, that identified 14 zebrafish startle habituation mutants including mutants of the vertebrate specific gene pregnancy associated plasma protein-aa (pappaa). PAPP-AA encodes an extracellular metalloprotease known to increase IGF bioavailability thereby enhancing IGF receptor signaling. We find that pappaa is expressed by startle circuit neurons, and expression of wildtype, but not a metalloprotease-inactive version of pappaa restores habituation in pappaa mutants. Furthermore, acutely inhibiting IGF1R function in wild-type reduces habituation, while activation of IGF1R downstream effectors in pappaa mutants restores habituation, demonstrating that pappaa promotes learning by acutely and locally increasing IGF bioavailability. In sum, our results define the first functional gene set for habituation learning in a vertebrate, and identify PAPPAA-regulated IGF signaling as a novel mechanism regulating habituation learning. PMID:25754827
Miles, Rebecca R; Perry, William; Haas, Joseph V; Mosior, Marian K; N'Cho, Mathias; Wang, Jian W J; Yu, Peng; Calley, John; Yue, Yong; Carter, Quincy; Han, Bomie; Foxworthy, Patricia; Kowala, Mark C; Ryan, Timothy P; Solenberg, Patricia J; Michael, Laura F
Control of plasma cholesterol levels is a major therapeutic strategy for management of coronary artery disease (CAD). Although reducing LDL cholesterol (LDL-c) levels decreases morbidity and mortality, this therapeutic intervention only translates into a 25-40% reduction in cardiovascular events. Epidemiological studies have shown that a high LDL-c level is not the only risk factor for CAD; low HDL cholesterol (HDL-c) is an independent risk factor for CAD. Apolipoprotein A-I (ApoA-I) is the major protein component of HDL-c that mediates reverse cholesterol transport from tissues to the liver for excretion. Therefore, increasing ApoA-I levels is an attractive strategy for HDL-c elevation. Using genome-wide siRNA screening, targets that regulate hepatocyte ApoA-I secretion were identified through transfection of 21,789 siRNAs into hepatocytes whereby cell supernatants were assayed for ApoA-I. Approximately 800 genes were identified and triaged using a convergence of information, including genetic associations with HDL-c levels, tissue-specific gene expression, druggability assessments, and pathway analysis. Fifty-nine genes were selected for reconfirmation; 40 genes were confirmed. Here we describe the siRNA screening strategy, assay implementation and validation, data triaging, and example genes of interest. The genes of interest include known and novel genes encoding secreted enzymes, proteases, G-protein-coupled receptors, metabolic enzymes, ion transporters, and proteins of unknown function. Repression of farnesyltransferase (FNTA) by siRNA and the enzyme inhibitor manumycin A caused elevation of ApoA-I secretion from hepatocytes and from transgenic mice expressing hApoA-I and cholesterol ester transfer protein transgenes. In total, this work underscores the power of functional genetic assessment to identify new therapeutic targets.
The Sheffield RNAi Screening Facility (SRSF) was established in November 2008, as Britain's first Drosophila RNAi screening centre, funded by the University of Sheffield, Biomedical Sciences Department and the Wellcome Trust. The SRSF was formed to service the needs of research groups wanting to carry out high-throughput RNAi screens with Drosophila cells. The rationale for the SRSF is to provide RNAi libraries and the specialist equipment and expertise to do such screens. The facility supports both plate reader assays, high-content microscopy as well as the equipment needed to process these samples in a high-throughput fashion. The SRSF can either be used to identify genes involved in disease representing future drug targets, or to identify genes involved in drug resistance and efficacy.
Godnic, Irena; Zorc, Minja; Jevsinek Skok, Dasa; Calin, George Adrian; Horvat, Simon; Dovc, Peter; Kovac, Milena; Kunej, Tanja
MicroRNAs (miRNAs) are non-coding RNAs (ncRNAs) involved in regulation of gene expression. Intragenic miRNAs, especially those exhibiting a high degree of evolutionary conservation, have been shown to be coordinately regulated and/or expressed with their host genes, either with synergistic or antagonistic correlation patterns. However, the degree of cross-species conservation of miRNA/host gene co-location is not known and co-expression information is incomplete and fragmented among several studies. Using the genomic resources (miRBase and Ensembl) we performed a genome-wide in silico screening (GWISS) for miRNA/host gene pairs in three well-annotated vertebrate species: human, mouse, and chicken. Approximately half of currently annotated miRNA genes resided within host genes: 53.0% (849/1,600) in human, 48.8% (418/855) in mouse, and 42.0% (210/499) in chicken, which we present in a central publicly available Catalog of intragenic miRNAs (http://www.integratomics-time.com/miR-host/catalog). The miRNA genes resided within either protein-coding or ncRNA genes, which include long intergenic ncRNAs (lincRNAs) and small nucleolar RNAs (snoRNAs). Twenty-seven miRNA genes were found to be located within the same host genes in all three species and the data integration from literature and databases showed that most (26/27) have been found to be co-expressed. Particularly interesting are miRNA genes located within genes encoding for miRNA silencing machinery (DGCR8, DICER1, and SND1 in human and Cnot3, Gdcr8, Eif4e, Tnrc6b, and Xpo5 in mouse). We furthermore discuss a potential for phenotype misattribution of miRNA host gene polymorphism or gene modification studies due to possible collateral effects on miRNAs hosted within them. In conclusion, the catalog of intragenic miRNAs and identified 27 miRNA/host gene pairs with cross-species conserved co-location, co-expression, and potential co-regulation, provide excellent candidates for further functional annotation of
Li, Ning; Katz, Samuel; Dutta, Bhaskar; Benet, Zachary L.; Sun, Jing; Fraser, Iain D.C.
The mammalian innate immune system senses many bacterial stimuli through the toll-like receptor (TLR) family. Activation of the TLR4 receptor by bacterial lipopolysaccharide (LPS) is the most widely studied TLR pathway due to its central role in host responses to gram-negative bacterial infection and its contribution to endotoxemia and sepsis. Here we describe a genome-wide siRNA screen to identify genes regulating the mouse macrophage TNF-α and NF-κB responses to LPS. We include a secondary validation screen conducted with six independent siRNAs per gene to facilitate removal of off-target screen hits. We also provide microarray data from the same LPS-treated macrophage cells to facilitate downstream data analysis. These data provide a resource for analyzing gene function in the predominant pathway driving inflammatory signaling and cytokine expression in mouse macrophages. PMID:28248925
Zhang, Lifan; Zhou, Xiang; Michal, Jennifer J; Ding, Bo; Li, Rui; Jiang, Zhihua
Birth weight is an economically important trait in pig production because it directly impacts piglet growth and survival rate. In the present study, we performed a genome wide survey of candidate genes and pathways associated with individual birth weight (IBW) using the Illumina PorcineSNP60 BeadChip on 24 high (HEBV) and 24 low estimated breeding value (LEBV) animals. These animals were selected from a reference population of 522 individuals produced by three sires and six dam lines, which were crossbreds with multiple breeds. After quality-control, 43,257 SNPs (single nucleotide polymorphisms), including 42,243 autosomal SNPs and 1,014 SNPs on chromosome X, were used in the data analysis. A total of 27 differentially selected regions (DSRs), including 1 on Sus scrofa chromosome 1 (SSC1), 1 on SSC4, 2 on SSC5, 4 on SSC6, 2 on SSC7, 5 on SSC8, 3 on SSC9, 1 on SSC14, 3 on SSC18, and 5 on SSCX, were identified to show the genome wide separations between the HEBV and LEBV groups for IBW in piglets. A DSR with the most number of significant SNPs (including 7 top 0.1% and 31 top 5% SNPs) was located on SSC6, while another DSR with the largest genetic differences in F ST was found on SSC18. These regions harbor known functionally important genes involved in growth and development, such as TNFRSF9 (tumor necrosis factor receptor superfamily member 9), CA6 (carbonic anhydrase VI) and MDFIC (MyoD family inhibitor domain containing). A DSR rich in imprinting genes appeared on SSC9, which included PEG10 (paternally expressed 10), SGCE (sarcoglycan, epsilon), PPP1R9A (protein phosphatase 1, regulatory subunit 9A) and ASB4 (ankyrin repeat and SOCS box containing 4). More importantly, our present study provided evidence to support six quantitative trait loci (QTL) regions for pig birth weight, six QTL regions for average birth weight (ABW) and three QTL regions for litter birth weight (LBW) reported previously by other groups. Furthermore, gene ontology analysis with 183 genes
Li Wang; Zeng Chan Wang; Cui Xie; Xiao Feng Liu; Mao Sheng Yang
The etiology of recurrent miscarriage (RM) is extremely heterogeneous, including genetic, immunologic, anatomic, endocrinological, and infectious anomalies. About 50% of RM is unexplained or poorly understood, which is called idiopathic recurrent miscarriage (IRM). The primary aim of this study was to identify the genetic loci that might be susceptible to IRM. Forty-four Han Chinese patients with IRM during the first trimester of their pregnancies and 44 healthy sex- and ethnic-matched controls were enrolled in this study. A case-control and genome-wide study was performed and 430 polymorphic microsatellite markers were analyzed. Three loci, 6q27 (D6S446, P = .028), 9q33.1 (D9S1776, P = .037), and Xp22.11 (DXS1226, P = .008), significantly associated with IRM were found. This work identified 3 genetic regions that might harbor genes predisposed to IRM and provided new insights for future genetic and etiological study of IRM. Further study is required to confirm it.
López-Saavedra, Ana; Gómez-Cabello, Daniel; Domínguez-Sánchez, María Salud; Mejías-Navarro, Fernando; Fernández-Ávila, María Jesús; Dinant, Christoffel; Martínez-Macías, María Isabel; Bartek, Jiri; Huertas, Pablo
There are two major and alternative pathways to repair DNA double-strand breaks: non-homologous end-joining and homologous recombination. Here we identify and characterize novel factors involved in choosing between these pathways; in this study we took advantage of the SeeSaw Reporter, in which the repair of double-strand breaks by homology-independent or -dependent mechanisms is distinguished by the accumulation of green or red fluorescence, respectively. Using a genome-wide human esiRNA (endoribonuclease-prepared siRNA) library, we isolate genes that control the recombination/end-joining ratio. Here we report that two distinct sets of genes are involved in the control of the balance between NHEJ and HR: those that are required to facilitate recombination and those that favour NHEJ. This last category includes CCAR2/DBC1, which we show inhibits recombination by limiting the initiation and the extent of DNA end resection, thereby acting as an antagonist of CtIP.
Brammeld, Jonathan S; Petljak, Mia; Martincorena, Inigo; Williams, Steven P; Alonso, Luz Garcia; Dalmases, Alba; Bellosillo, Beatriz; Robles-Espinoza, Carla Daniela; Price, Stacey; Barthorpe, Syd; Tarpey, Patrick; Alifrangis, Constantine; Bignell, Graham; Vidal, Joana; Young, Jamie; Stebbings, Lucy; Beal, Kathryn; Stratton, Michael R; Saez-Rodriguez, Julio; Garnett, Mathew; Montagut, Clara; Iorio, Francesco; McDermott, Ultan
Drug resistance is an almost inevitable consequence of cancer therapy and ultimately proves fatal for the majority of patients. In many cases, this is the consequence of specific gene mutations that have the potential to be targeted to resensitize the tumor. The ability to uniformly saturate the genome with point mutations without chromosome or nucleotide sequence context bias would open the door to identify all putative drug resistance mutations in cancer models. Here, we describe such a method for elucidating drug resistance mechanisms using genome-wide chemical mutagenesis allied to next-generation sequencing. We show that chemically mutagenizing the genome of cancer cells dramatically increases the number of drug-resistant clones and allows the detection of both known and novel drug resistance mutations. We used an efficient computational process that allows for the rapid identification of involved pathways and druggable targets. Such a priori knowledge would greatly empower serial monitoring strategies for drug resistance in the clinic as well as the development of trials for drug-resistant patients.
Brammeld, Jonathan S.; Petljak, Mia; Martincorena, Inigo; Williams, Steven P.; Alonso, Luz Garcia; Dalmases, Alba; Bellosillo, Beatriz; Robles-Espinoza, Carla Daniela; Price, Stacey; Barthorpe, Syd; Tarpey, Patrick; Alifrangis, Constantine; Bignell, Graham; Vidal, Joana; Young, Jamie; Stebbings, Lucy; Beal, Kathryn; Stratton, Michael R.; Saez-Rodriguez, Julio; Garnett, Mathew; Montagut, Clara; Iorio, Francesco; McDermott, Ultan
Drug resistance is an almost inevitable consequence of cancer therapy and ultimately proves fatal for the majority of patients. In many cases, this is the consequence of specific gene mutations that have the potential to be targeted to resensitize the tumor. The ability to uniformly saturate the genome with point mutations without chromosome or nucleotide sequence context bias would open the door to identify all putative drug resistance mutations in cancer models. Here, we describe such a method for elucidating drug resistance mechanisms using genome-wide chemical mutagenesis allied to next-generation sequencing. We show that chemically mutagenizing the genome of cancer cells dramatically increases the number of drug-resistant clones and allows the detection of both known and novel drug resistance mutations. We used an efficient computational process that allows for the rapid identification of involved pathways and druggable targets. Such a priori knowledge would greatly empower serial monitoring strategies for drug resistance in the clinic as well as the development of trials for drug-resistant patients. PMID:28179366
López-Saavedra, Ana; Gómez-Cabello, Daniel; Domínguez-Sánchez, María Salud; Mejías-Navarro, Fernando; Fernández-Ávila, María Jesús; Dinant, Christoffel; Martínez-Macías, María Isabel; Bartek, Jiri; Huertas, Pablo
There are two major and alternative pathways to repair DNA double-strand breaks: non-homologous end-joining and homologous recombination. Here we identify and characterize novel factors involved in choosing between these pathways; in this study we took advantage of the SeeSaw Reporter, in which the repair of double-strand breaks by homology-independent or -dependent mechanisms is distinguished by the accumulation of green or red fluorescence, respectively. Using a genome-wide human esiRNA (endoribonuclease-prepared siRNA) library, we isolate genes that control the recombination/end-joining ratio. Here we report that two distinct sets of genes are involved in the control of the balance between NHEJ and HR: those that are required to facilitate recombination and those that favour NHEJ. This last category includes CCAR2/DBC1, which we show inhibits recombination by limiting the initiation and the extent of DNA end resection, thereby acting as an antagonist of CtIP. PMID:27503537
Kanoh, Hirotaka; Kuraishi, Takayuki; Tong, Li-Li; Watanabe, Ryo; Nagata, Shinji; Kurata, Shoichiro
Damage-associated molecular patterns (DAMPs), so-called "danger signals," play important roles in host defense and pathophysiology in mammals and insects. In Drosophila, the Toll pathway confers damage responses during bacterial infection and improper cell-fate control. However, the intrinsic ligands and signaling mechanisms that potentiate innate immune responses remain unknown. Here, we demonstrate that a Drosophila larva-derived tissue extract strongly elicits Toll pathway activation via the Toll receptor. Using this extract, we performed ex vivo genome-wide RNAi screening in Drosophila cultured cells, and identified several signaling factors that are required for host defense and antimicrobial-peptide expression in Drosophila adults. These results suggest that our larva-derived tissue extract contains active ingredients that mediate Toll pathway activation, and the screening data will shed light on the mechanisms of damage-related Toll pathway signaling in Drosophila.
Park, Jeongbin; Kim, Jin-Soo; Bae, Sangsu
Motivation: CRISPR-derived RNA guided endonucleases (RGENs) have been widely used for both gene knockout and knock-in at the level of single or multiple genes. RGENs are now available for forward genetic screens at genome scale, but single guide RNA (sgRNA) selection at this scale is difficult. Results: We develop an online tool, Cas-Database, a genome-wide gRNA library design tool for Cas9 nucleases from Streptococcus pyogenes (SpCas9). With an easy-to-use web interface, Cas-Database allows users to select optimal target sequences simply by changing the filtering conditions. Furthermore, it provides a powerful way to select multiple optimal target sequences from thousands of genes at once for the creation of a genome-wide library. Cas-Database also provides a web application programming interface (web API) for advanced bioinformatics users. Availability and implementation: Free access at http://www.rgenome.net/cas-database/. Contact: email@example.com or firstname.lastname@example.org Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153724
Westmoreland, Tammy J.; Wickramasekara, Sajith M.; Guo, Andrew Y.; Selim, Alice L.; Winsor, Tiffany S.; Greenleaf, Arno L.; Blackwell, Kimberly L.; Olson, John A.; Marks, Jeffrey R.; Bennett, Craig B.
The chemotherapeutic doxorubicin (DOX) induces DNA double-strand break (DSB) damage. In order to identify conserved genes that mediate DOX resistance, we screened the Saccharomyces cerevisiae diploid deletion collection and identified 376 deletion strains in which exposure to DOX was lethal or severely reduced growth fitness. This diploid screen identified 5-fold more DOX resistance genes than a comparable screen using the isogenic haploid derivative. Since DSB damage is repaired primarily by homologous recombination in yeast, and haploid cells lack an available DNA homolog in G1 and early S phase, this suggests that our diploid screen may have detected the loss of repair functions in G1 or early S phase prior to complete DNA replication. To test this, we compared the relative DOX sensitivity of 30 diploid deletion mutants identified under our screening conditions to their isogenic haploid counterpart, most of which (n = 26) were not detected in the haploid screen. For six mutants (bem1Δ, ctf4Δ, ctk1Δ, hfi1Δ,nup133Δ, tho2Δ) DOX-induced lethality was absent or greatly reduced in the haploid as compared to the isogenic diploid derivative. Moreover, unlike WT, all six diploid mutants displayed severe G1/S phase cell cycle progression defects when exposed to DOX and some were significantly enhanced (ctk1Δ and hfi1Δ) or deficient (tho2Δ) for recombination. Using these and other “THO2-like” hypo-recombinogenic, diploid-specific DOX sensitive mutants (mft1Δ, thp1Δ, thp2Δ) we utilized known genetic/proteomic interactions to construct an interactive functional genomic network which predicted additional DOX resistance genes not detected in the primary screen. Most (76%) of the DOX resistance genes detected in this diploid yeast screen are evolutionarily conserved suggesting the human orthologs are candidates for mediating DOX resistance by impacting on checkpoint and recombination functions in G1 and/or early S phases. PMID:19503795
Remmele, Steffen; Ritzerfeld, Julia; Nickel, Walter; Hesser, Jürgen
RNAi-based high-throughput microscopy screens have become an important tool in biological sciences in order to decrypt mostly unknown biological functions of human genes. However, manual analysis is impossible for such screens since the amount of image data sets can often be in the hundred thousands. Reliable automated tools are thus required to analyse the fluorescence microscopy image data sets usually containing two or more reaction channels. The herein presented image analysis tool is designed to analyse an RNAi screen investigating the intracellular trafficking and targeting of acylated Src kinases. In this specific screen, a data set consists of three reaction channels and the investigated cells can appear in different phenotypes. The main issue of the image processing task is an automatic cell segmentation which has to be robust and accurate for all different phenotypes and a successive phenotype classification. The cell segmentation is done in two steps by segmenting the cell nuclei first and then using a classifier-enhanced region growing on basis of the cell nuclei to segment the cells. The classification of the cells is realized by a support vector machine which has to be trained manually using supervised learning. Furthermore, the tool is brightness invariant allowing different staining quality and it provides a quality control that copes with typical defects during preparation and acquisition. A first version of the tool has already been successfully applied for an RNAi-screen containing three hundred thousand image data sets and the SVM extended version is designed for additional screens.
Sun, Jing; Katz, Samuel; Dutta, Bhaskar; Wang, Ze; Fraser, Iain D.C.
The mammalian innate immune system senses many bacterial stimuli through the toll-like receptor (TLR) family. Activation of the TLR4 receptor by bacterial lipopolysaccharide (LPS) is the most widely studied TLR pathway due to its central role in host responses to gram-negative bacterial infection and its contribution to endotoxemia and sepsis. Here we describe a genome-wide siRNA screen to identify genes regulating the human macrophage TNF-α response to LPS. We include a secondary validation screen conducted with six independent siRNAs per gene to facilitate removal of off-target screen hits. We also provide microarray data from the same LPS-treated macrophage cells to facilitate downstream data analysis. Tertiary screening with multiple TLR ligands and a microbial extract demonstrate that novel screen hits have broad effects on the innate inflammatory response to microbial stimuli. These data provide a resource for analyzing gene function in the predominant pathway driving inflammatory cytokine expression in human macrophages. PMID:28248930
Neely, G. Gregory; Hess, Andreas; Costigan, Michael; Keene, Alex C.; Goulas, Spyros; Langeslag, Michiel; Griffin, Robert S; Belfer, Inna; Dai, Feng; Smith, Shad; Diatchenko, Luda; Gupta, Vaijayanti; Xia, Cui-ping; Amann, Sabina; Kreitz, Silke; Heindl-Erdmann, Cornelia; Wolz, Susanne; Ly, Cindy V.; Arora, Suchir; Sarangi, Rinku; Dan, Debasis; Novatchkova, Maria; Rosenzweig, Mark; Gibson, Dustin; Truong, Darwin; Schramek, Daniel; Zoranovic, Tamara; Cronin, Shane J. F.; Angjeli, Belinda; Brune, Kay; Dietzl, Georg; Maixner, William; Meixner, Arabella; Thomas, Winston; Pospisilik, J. Andrew; Alenius, Mattias; Kress, Michaela; Subramaniam, Sai; Garrity, Paul A.; Bellen, Hugo J.; Woolf, Clifford J.; Penninger, Josef M.
SUMMARY Worldwide, acute and chronic pain affects 20% of the adult population and represents an enormous financial and emotional burden. Using genome-wide neuronal-specific RNAi knock-down in Drosophila, we report a global screen for an innate behavior and identify hundreds of novel genes implicated in heat nociception, including the α2δ-family calcium channel subunit straightjacket (stj). Mice mutant for the stj ortholog CACNA2D3 (α2δ3) also exhibit impaired behavioral heat pain sensitivity. In addition, in humans, α2δ3 SNP variants associate with reduced sensitivity to acute noxious heat and chronic back pain. Functional imaging in α2δ3 mutant mice revealed impaired transmission of thermal pain evoked signals from the thalamus to higher order pain centers. Intriguingly, in α2δ3 mutant mice thermal pain and tactile stimulation triggered strong cross-activation or synesthesia of brain regions involved in vision, olfaction, and hearing. PMID:21074052
Schmid-Burgk, Jonathan L; Chauhan, Dhruv; Schmidt, Tobias; Ebert, Thomas S; Reinhardt, Julia; Endl, Elmar; Hornung, Veit
Inflammasomes are high molecular weight protein complexes that assemble in the cytosol upon pathogen encounter. This results in caspase-1-dependent pro-inflammatory cytokine maturation, as well as a special type of cell death, known as pyroptosis. The Nlrp3 inflammasome plays a pivotal role in pathogen defense, but at the same time, its activity has also been implicated in many common sterile inflammatory conditions. To this effect, several studies have identified Nlrp3 inflammasome engagement in a number of common human diseases such as atherosclerosis, type 2 diabetes, Alzheimer disease, or gout. Although it has been shown that known Nlrp3 stimuli converge on potassium ion efflux upstream of Nlrp3 activation, the exact molecular mechanism of Nlrp3 activation remains elusive. Here, we describe a genome-wide CRISPR/Cas9 screen in immortalized mouse macrophages aiming at the unbiased identification of gene products involved in Nlrp3 inflammasome activation. We employed a FACS-based screen for Nlrp3-dependent cell death, using the ionophoric compound nigericin as a potassium efflux-inducing stimulus. Using a genome-wide guide RNA (gRNA) library, we found that targeting Nek7 rescued macrophages from nigericin-induced lethality. Subsequent studies revealed that murine macrophages deficient in Nek7 displayed a largely blunted Nlrp3 inflammasome response, whereas Aim2-mediated inflammasome activation proved to be fully intact. Although the mechanism of Nek7 functioning upstream of Nlrp3 yet remains elusive, these studies provide a first genetic handle of a component that specifically functions upstream of Nlrp3.
Zhang, Siwei; Li, Jingjing; Lea, Robert; Amaya, Enrique; Dorey, Karel
Embryonic development requires exquisite regulation of several essential processes, such as patterning of tissues and organs, cell fate decisions, and morphogenesis. Intriguingly, these diverse processes are controlled by only a handful of signalling pathways, and mis-regulation in one or more of these pathways may result in a variety of congenital defects and diseases. Consequently, investigating how these signalling pathways are regulated at the molecular level is essential to understanding the mechanisms underlying vertebrate embryogenesis, as well as developing treatments for human diseases. Here, we designed and performed a large-scale gain-of-function screen in Xenopus embryos aimed at identifying new regulators of MAPK/Erk, PI3K/Akt, BMP, and TGF-β/Nodal signalling pathways. Our gain-of-function screen is based on the identification of gene products that alter the phosphorylation state of key signalling molecules, which report the activation state of the pathways. In total, we have identified 20 new molecules that regulate the activity of one or more signalling pathways during early Xenopus development. This is the first time that such a functional screen has been performed, and the findings pave the way toward a more comprehensive understanding of the molecular mechanisms regulating the activity of important signalling pathways under normal and pathological conditions. PMID:24244509
Kadoya, Ryosuke; Kodama, Yu; Matsumoto, Ken'ichiro; Ooi, Toshihiko; Taguchi, Seiichi
Engineered Escherichia coli is a useful platform for production of lactate (LA)-based polyester poly[LA-co-3-hydroxybutyrate (3HB)] from renewable sugars. Here we screened all non-lethal transcription factor deletions of E. coli for efficient production of the polymer. This approach aimed at drawing out the latent potential of the host for efficient polymer production via indirect positive effects. Among 252 mutants from Keio Collection tested, eight mutants (ΔpdhR, ΔcspG, ΔyneJ, ΔchbR, ΔyiaU, ΔcreB, ΔygfI and ΔnanK) accumulated greater amount of polymer (6.2-10.1 g/L) compared to the parent strain E. coli BW25113 (5.1 g/L). The mutants increased polymer production per cell (1.1-1.5-fold) without significant change in cell density. The yield of the polymer from glucose was also higher for the selected mutants (0.34-0.38 g/g) than the parent strain (0.27 g/g). Therefore, the deletions of transcription factors should channel the carbon flux towards polymer production. It should be noted that the screening employed in this study identified beneficial mutants without analyzing causal relationship between the mutation and the enhanced polymer production. This approach, therefore, should be applicable to broad range of fermentation productions.
King, Isabelle N; Qian, Li; Liang, Jianping; Huang, Yu; Shieh, Joseph T C; Kwon, Chulan; Srivastava, Deepak
Many molecular pathways involved in heart disease have their roots in evolutionarily ancient developmental programs that depend critically on gene dosage and timing. MicroRNAs (miRNAs) modulate gene dosage posttranscriptionally, and among these, the muscle-specific miR-1 is particularly important for developing and maintaining somatic/skeletal and cardiac muscle. To identify pathways regulated by miR-1, we performed a forward genetic screen in Drosophila using wing-vein patterning as a biological assay. We identified several unexpected genes that genetically interacted with dmiR-1, one of which was kayak, encodes a developmentally regulated transcription factor. Additional studies directed at this genetic relationship revealed a previously unappreciated function of dmiR-1 in regulating the polarity of cardiac progenitor cells. The mammalian ortholog of kayak, c-Fos, was dysregulated in hearts of gain- or loss-of-function miR-1 mutant mice in a stress-dependent manner. These findings illustrate the power of Drosophila-based screens to find points of intersection between miRNAs and conserved pathways in mammals.
Pereira, Francisco B; Teixeira, Miguel C; Mira, Nuno P; Sá-Correia, Isabel; Domingues, Lucília
The presence of toxic compounds derived from biomass pre-treatment in fermentation media represents an important drawback in second-generation bio-ethanol production technology and overcoming this inhibitory effect is one of the fundamental challenges to its industrial production. The aim of this study was to systematically identify, in industrial medium and at a genomic scale, the Saccharomyces cerevisiae genes required for simultaneous and maximal tolerance to key inhibitors of lignocellulosic fermentations. Based on the screening of EUROSCARF haploid mutant collection, 242 and 216 determinants of tolerance to inhibitory compounds present in industrial wheat straw hydrolysate (WSH) and in inhibitor-supplemented synthetic hydrolysate were identified, respectively. Genes associated to vitamin metabolism, mitochondrial and peroxisomal functions, ribosome biogenesis and microtubule biogenesis and dynamics are among the newly found determinants of WSH resistance. Moreover, PRS3, VMA8, ERG2, RAV1 and RPB4 were confirmed as key genes on yeast tolerance and fermentation of industrial WSH.
Duquet, Arnaud; Melotti, Alice; Mishra, Sonakshi; Malerba, Monica; Seth, Chandan; Conod, Arwen; Ruiz i Altaba, Ariel
The progression of tumors to the metastatic state involves the loss of metastatic suppressor functions. Finding these, however, is difficult as in vitro assays do not fully predict metastatic behavior, and the majority of studies have used cloned cell lines, which do not reflect primary tumor heterogeneity. Here, we have designed a novel genome-wide screen to identify metastatic suppressors using primary human tumor cells in mice, which allows saturation screens. Using this unbiased approach, we have tested the hypothesis that endogenous colon cancer metastatic suppressors affect WNT-TCF signaling. Our screen has identified two novel metastatic suppressors: TMED3 and SOX12, the knockdown of which increases metastatic growth after direct seeding. Moreover, both modify the type of self-renewing spheroids, but only knockdown of TMED3 also induces spheroid cell spreading and lung metastases from a subcutaneous xenograft. Importantly, whereas TMED3 and SOX12 belong to different families involved in protein secretion and transcriptional regulation, both promote endogenous WNT-TCF activity. Treatments for advanced or metastatic colon cancer may thus not benefit from WNT blockers, and these may promote a worse outcome. PMID:24920608
Škalamera, Dubravka; Dahmer-Heath, Mareike; Stevenson, Alexander J; Pinto, Cletus; Shah, Esha T; Daignault, Sheena M; Said, Nur Akmarina B M; Davis, Melissa; Haass, Nikolas K; Williams, Elizabeth D; Hollier, Brett G; Thompson, Erik W; Gabrielli, Brian; Gonda, Thomas J
Epithelial to mesenchymal transition (EMT) is a developmental program that has been implicated in progression, metastasis and therapeutic resistance of some carcinomas. To identify genes whose overexpression drives EMT, we screened a lentiviral expression library of 17000 human open reading frames (ORFs) using high-content imaging to quantitate cytoplasmic vimentin. Hits capable of increasing vimentin in the mammary carcinoma-derived cell line MDA-MB-468 were confirmed in the non-tumorigenic breast-epithelial cell line MCF10A. When overexpressed in this model, they increased the rate of cell invasion through Matrigel™, induced mesenchymal marker expression and reduced expression of the epithelial marker E-cadherin. In gene-expression datasets derived from breast cancer patients, the expression of several novel genes correlated with expression of known EMT marker genes, indicating their in vivo relevance. As EMT-associated properties are thought to contribute in several ways to cancer progression, genes identified in this study may represent novel targets for anti-cancer therapy.
Wallace, Jared; Hu, Ruozhen; Mosbruger, Timothy L; Dahlem, Timothy J; Stephens, W Zac; Rao, Dinesh S; Round, June L; O'Connell, Ryan M
Mammalian microRNA expression is dysregulated in human cancer. However, the functional relevance of many microRNAs in the context of tumor biology remains unclear. Using CRISPR-Cas9 technology, we performed a global loss-of-function screen to simultaneously test the functions of individual microRNAs and protein-coding genes during the growth of a myeloid leukemia cell line. This approach identified evolutionarily conserved human microRNAs that suppress or promote cell growth, revealing that microRNAs are extensively integrated into the molecular networks that control tumor cell physiology. miR-155 was identified as a top microRNA candidate promoting cellular fitness, which we confirmed with two distinct miR-155-targeting CRISPR-Cas9 lentiviral constructs. Further, we performed anti-correlation functional profiling to predict relevant microRNA-tumor suppressor gene or microRNA-oncogene interactions in these cells. This analysis identified miR-150 targeting of p53, a connection that was experimentally validated. Taken together, our study describes a powerful genetic approach by which the function of individual microRNAs can be assessed on a global level, and its use will rapidly advance our understanding of how microRNAs contribute to human disease.
Škalamera, Dubravka; Dahmer-Heath, Mareike; Stevenson, Alexander J.; Pinto, Cletus; Shah, Esha T.; Daignault, Sheena M.; Said, Nur Akmarina B.M.; Davis, Melissa; Haass, Nikolas K.; Williams, Elizabeth D.; Hollier, Brett G.; Thompson, Erik W.; Gabrielli, Brian; Gonda, Thomas J.
Epithelial to mesenchymal transition (EMT) is a developmental program that has been implicated in progression, metastasis and therapeutic resistance of some carcinomas. To identify genes whose overexpression drives EMT, we screened a lentiviral expression library of 17000 human open reading frames (ORFs) using high-content imaging to quantitate cytoplasmic vimentin. Hits capable of increasing vimentin in the mammary carcinoma-derived cell line MDA-MB-468 were confirmed in the non-tumorigenic breast-epithelial cell line MCF10A. When overexpressed in this model, they increased the rate of cell invasion through Matrigel™, induced mesenchymal marker expression and reduced expression of the epithelial marker E-cadherin. In gene-expression datasets derived from breast cancer patients, the expression of several novel genes correlated with expression of known EMT marker genes, indicating their in vivo relevance. As EMT-associated properties are thought to contribute in several ways to cancer progression, genes identified in this study may represent novel targets for anti-cancer therapy. PMID:27876705
Maia, André F.; Tanenbaum, Marvin E.; Galli, Matilde; Lelieveld, Daphne; Egan, David A.; Gassmann, Reto; Sunkel, Claudio E.; van den Heuvel, Sander; Medema, René H.
Kinesins are a superfamily of microtubule-based molecular motors that perform various transport needs and have essential roles in cell division. Among these, the kinesin-5 family has been shown to play a major role in the formation and maintenance of the bipolar mitotic spindle. Moreover, recent work suggests that kinesin-5 motors may have additional roles. In contrast to most model organisms, the sole kinesin-5 gene in Caenorhabditis elegans, bmk-1, is not required for successful mitosis and animals lacking bmk-1 are viable and fertile. To gain insight into factors that may act redundantly with BMK-1 in spindle assembly and to identify possible additional cellular pathways involving BMK-1, we performed a synthetic lethal screen using the bmk-1 deletion allele ok391. We successfully knocked down 82% of the C. elegans genome using RNAi and assayed viability in bmk-1(ok391) and wild type strains using an automated high-throughput approach based on fluorescence microscopy. The dataset includes a final list of 37 synthetic lethal interactions whose further study is likely to provide insight into kinesin-5 function. PMID:25984351
Sebastiani, Paola; Solovieff, Nadia
The availability of high throughput technology for parallel genotyping has opened the field of genetics to genome-wide association studies (GWAS). These studies generate massive amount of genetic data that challenge investigators with issues related to data management, statistical analysis of large data sets, visualization, and annotation of results. We will review the common approach to analysis of GWAS data and then discuss options to learn more from these data.
Wiese, Carrie B.; Ireland, Sara; Fleming, Nicole L.; Yu, Jing; Valerius, M. Todd; Georgas, Kylie; Chiu, Han Sheng; Brennan, Jane; Armstrong, Jane; Little, Melissa H.; McMahon, Andrew P.; Southard-Smith, E. Michelle
Relative positions of neurons within mature murine pelvic ganglia based on expression of neurotransmitters have been described. However the spatial organization of developing innervation in the murine urogenital tract (UGT) and the gene networks that regulate specification and maturation of neurons within the pelvic ganglia of the lower urinary tract (LUT) are unknown. We used whole-mount immunohistochemistry and histochemical stains to localize neural elements in 15.5 days post coitus (dpc) fetal mice. To identify potential regulatory factors expressed in pelvic ganglia, we surveyed expression patterns for known or probable transcription factors (TF) annotated in the mouse genome by screening a whole-mount in situ hybridization library of fetal UGTs. Of the 155 genes detected in pelvic ganglia, 88 encode TFs based on the presence of predicted DNA-binding domains. Neural crest (NC)-derived progenitors within the LUT were labeled by Sox10, a well-known regulator of NC development. Genes identified were categorized based on patterns of restricted expression in pelvic ganglia, pelvic ganglia and urethral epithelium, or pelvic ganglia and urethral mesenchyme. Gene expression patterns and the distribution of Sox10+, Phox2b+, Hu+, and PGP9.5+ cells within developing ganglia suggest previously unrecognized regional segregation of Sox10+ progenitors and differentiating neurons in early development of pelvic ganglia. Reverse transcription-PCR of pelvic ganglia RNA from fetal and post-natal stages demonstrated that multiple TFs maintain post-natal expression, although Pax3 is extinguished before weaning. Our analysis identifies multiple potential regulatory genes including TFs that may participate in segregation of discrete lineages within pelvic ganglia. The genes identified here are attractive candidate disease genes that may now be further investigated for their roles in malformation syndromes or in LUT dysfunction. PMID:22988430
Johnson, Rory; Richter, Nadine; Bogu, Gireesh K; Bhinge, Akshay; Teng, Siaw Wei; Choo, Siew Hua; Andrieux, Lise O; de Benedictis, Cinzia; Jauch, Ralf; Stanton, Lawrence W
Increasing numbers of human diseases are being linked to genetic variants, but our understanding of the mechanistic links leading from DNA sequence to disease phenotype is limited. The majority of disease-causing nucleotide variants fall within the non-protein-coding portion of the genome, making it likely that they act by altering gene regulatory sequences. We hypothesised that SNPs within the binding sites of the transcriptional repressor REST alter the degree of repression of target genes. Given that changes in the effective concentration of REST contribute to several pathologies-various cancers, Huntington's disease, cardiac hypertrophy, vascular smooth muscle proliferation-these SNPs should alter disease-susceptibility in carriers. We devised a strategy to identify SNPs that affect the recruitment of REST to target genes through the alteration of its DNA recognition element, the RE1. A multi-step screen combining genetic, genomic, and experimental filters yielded 56 polymorphic RE1 sequences with robust and statistically significant differences of affinity between alleles. These SNPs have a considerable effect on the the functional recruitment of REST to DNA in a range of in vitro, reporter gene, and in vivo analyses. Furthermore, we observe allele-specific biases in deeply sequenced chromatin immunoprecipitation data, consistent with predicted differenes in RE1 affinity. Amongst the targets of polymorphic RE1 elements are important disease genes including NPPA, PTPRT, and CDH4. Thus, considerable genetic variation exists in the DNA motifs that connect gene regulatory networks. Recently available ChIP-seq data allow the annotation of human genetic polymorphisms with regulatory information to generate prior hypotheses about their disease-causing mechanism.
Kurata, Morito; Rathe, Susan K.; Bailey, Natashay J.; Aumann, Natalie K.; Jones, Justine M.; Veldhuijzen, G. Willemijn; Moriarity, Branden S.; Largaespada, David A.
Acute myeloid leukemia (AML) can display de novo or acquired resistance to cytosine arabinoside (Ara-C), a primary component of induction chemotherapy. To identify genes capable of independently imposing Ara-C resistance, we applied a genome-wide CRISPR library to human U937 cells and exposed to them to Ara-C. Interestingly, all drug resistant clones contained guide RNAs for DCK. To avoid DCK gene modification, gRNA resistant DCK cDNA was created by the introduction of silent mutations. The CRISPR screening was repeated using the gRNA resistant DCK, and loss of SLC29A was identified as also being capable of conveying Ara-C drug resistance. To determine if loss of Dck results in increased sensitivity to other drugs, we conducted a screen of 446 FDA approved drugs using two Dck-defective BXH-2 derived murine AML cell lines and their Ara-C sensitive parental lines. Both cell lines showed an increase in sensitivity to prednisolone. Guide RNA resistant cDNA rescue was a legitimate strategy and multiple DCK or SLC29A deficient human cell clones were established with one clone becoming prednisolone sensitive. Dck-defective leukemic cells may become prednisolone sensitive indicating prednisolone may be an effective adjuvant therapy in some cases of DCK-negative AML. PMID:27808171
Kurata, Morito; Rathe, Susan K; Bailey, Natashay J; Aumann, Natalie K; Jones, Justine M; Veldhuijzen, G Willemijn; Moriarity, Branden S; Largaespada, David A
Acute myeloid leukemia (AML) can display de novo or acquired resistance to cytosine arabinoside (Ara-C), a primary component of induction chemotherapy. To identify genes capable of independently imposing Ara-C resistance, we applied a genome-wide CRISPR library to human U937 cells and exposed to them to Ara-C. Interestingly, all drug resistant clones contained guide RNAs for DCK. To avoid DCK gene modification, gRNA resistant DCK cDNA was created by the introduction of silent mutations. The CRISPR screening was repeated using the gRNA resistant DCK, and loss of SLC29A was identified as also being capable of conveying Ara-C drug resistance. To determine if loss of Dck results in increased sensitivity to other drugs, we conducted a screen of 446 FDA approved drugs using two Dck-defective BXH-2 derived murine AML cell lines and their Ara-C sensitive parental lines. Both cell lines showed an increase in sensitivity to prednisolone. Guide RNA resistant cDNA rescue was a legitimate strategy and multiple DCK or SLC29A deficient human cell clones were established with one clone becoming prednisolone sensitive. Dck-defective leukemic cells may become prednisolone sensitive indicating prednisolone may be an effective adjuvant therapy in some cases of DCK-negative AML.
Sokolova, Maria; Turunen, Mikko; Mortusewicz, Oliver; Björklund, Mikael; Taipale, Minna; Helleday, Thomas; Taipale, Jussi
ABSTRACT To identify cell cycle regulators that enable cancer cells to replicate DNA and divide in an unrestricted manner, we performed a parallel genome-wide RNAi screen in normal and cancer cell lines. In addition to many shared regulators, we found that tumor and normal cells are differentially sensitive to loss of the histone genes transcriptional regulator CASP8AP2. In cancer cells, loss of CASP8AP2 leads to a failure to synthesize sufficient amount of histones in the S-phase of the cell cycle, resulting in slowing of individual replication forks. Despite this, DNA replication fails to arrest, and tumor cells progress in an elongated S-phase that lasts several days, finally resulting in death of most of the affected cells. In contrast, depletion of CASP8AP2 in normal cells triggers a response that arrests viable cells in S-phase. The arrest is dependent on p53, and preceded by accumulation of markers of DNA damage, indicating that nucleosome depletion is sensed in normal cells via a DNA-damage -like response that is defective in tumor cells. PMID:27929715
Lipinski, Marta M.; Hoffman, Greg; Ng, Aylwin; Zhou, Wen; Py, Bénédicte F.; Hsu, Emily; Liu, Xuxin; Eisenberg, Jason; Liu, Jun; Blenis, John; Xavier, Ramnik J.; Yuan, Junying
Summary Autophagy is a cellular catabolic mechanism that plays an essential function in protecting multicellular eukaryotes from neurodegeneration, cancer and other diseases. However, we still know very little about mechanisms regulating autophagy under normal homeostatic conditions when nutrients are not limiting. In a genome-wide human siRNA screen, we demonstrate that under normal nutrient conditions up regulation of autophagy requires the type III PI3 kinase, but not inhibition of mTORC1, the essential negative regulator of starvation-induced autophagy. We show that a group of growth factors and cytokines inhibit the type III PI3 kinase through multiple pathways, including the MAPK-ERK1/2, Stat3, Akt/Foxo3 and CXCR4/GPCR, which are all known to positively regulate cell growth and proliferation. Our study suggests that the type III PI3 kinase integrates diverse signals to regulate cellular levels of autophagy, and that autophagy and cell proliferation may represent two alternative cell fates that are regulated in a mutually exclusive manner. PMID:20627085
Simpson, Craig D; Hurren, Rose; Kasimer, Dahlia; MacLean, Neil; Eberhard, Yanina; Ketela, Troy; Moffat, Jason; Schimmer, Aaron D
Acquisition of resistance to anchorage dependant cell death, a process termed anoikis, is a requirement for cancer cell metastasis. However, the molecular determinants of anoikis resistance and sensitivity are poorly understood. To better understand resistance to anoikis we conducted a genome wide lentiviral shRNA screen to identify genes whose knockdown render anoikis-sensitive RWPE-1 prostate cells resistant to anoikis. RWPE-1 cells were infected with a pooled lentiviral shRNA library with 54,021 shRNA targeting 11,255 genes. After infection, an anoikis-resistant cell population was selected and shRNA sequences were amplified and sequenced. Thirty-four shRNA sequences reproducibly protected RWPE-1 cells from anoikis after culture under suspension conditions including the top validated hit, α/β hydrolase domain containing 4 (ABHD4). In validation studies, ABHD4 knockdown inhibited anoikis in RWPE-1 cells as well as anoikis sensitive NP69 nasopharyngeal and OVCAR3 ovarian cancer cells, while over-expression of the gene increased sensitivity. Induction of anoikis after ABHD4 knockdown was associated with cleavage of PARP and activation of caspases-3, but was independent in changes of FLIP, FAK and Src expression. Interestingly, induction of anoikis after ABHD4 knockdown was independent of the known role of ABHD4 in the anandamide synthesis pathway and the generation of glycerophospho-N-acyl ethanolamines. Thus, ABHD4 is a novel genetic regulator of anoikis sensitivity.
Nagao, Kazuaki; Fujii, Katsunori; Yamada, Masao; Miyashita, Toshiyuki
Mutations in the human homologue of the Drosophila patched gene (PTCH) are responsible for the hereditary disorder called nevoid basal cell carcinoma syndrome (NBCCS). PTCH has a CGG triplet repeat located 4 bp upstream of the first methionine codon. Here we report a novel polymorphism involving the number of the CGG-repeat. The major allele (86.3%) contained a repeat size of seven, whereas the minor allele contained eight. No significant difference in the distributions of genotypes was observed between normal and NBCCS individuals. However, when the repeat was inserted between a heterologous promoter and the luciferase gene, the longer repeats tended to induce higher luciferase activities, suggesting that the repeat length potentially affects the levels of gene expression. A genome-wide screening revealed that 68 and 146 genes contained a CGG/CCG repeat in the coding region and in the 5'-untranslated region (5'-UTR), respectively. None of the genes had this repeat in 3'-UTR. Interestingly, the number of genes with a CGG repeat in the 5'-UTR was significantly higher than that with a CCG repeat in the 5'-UTR. The localization of a CGG/CCG repeat in PTCH is quite unique in that only four other genes have been found in which the repeat is localized up to 4 bp upstream of the first methionine.
Furuya, Yusui; Denda, Miwako; Sakane, Kyohei; Ogusu, Tomoko; Takahashi, Sumio; Magari, Masaki; Kanayama, Naoki; Morishita, Ryo; Tokumitsu, Hiroshi
To search for novel target(s) of the Ca(2+)-signaling transducer, calmodulin (CaM), we performed a newly developed genome-wide CaM interaction screening of 19,676 GST-fused proteins expressed in human. We identified striated muscle activator of Rho signaling (STARS) as a novel CaM target and characterized its CaM binding ability and found that the Ca(2+)/CaM complex interacted stoichiometrically with the N-terminal region (Ala13-Gln35) of STARS in vitro as well as in living cells. Mutagenesis studies identified Ile20 and Trp33 as the essential hydrophobic residues in CaM anchoring. Furthermore, the CaM binding deficient mutant (Ile20Ala, Trp33Ala) of STARS further enhanced its stimulatory effect on SRF-dependent transcriptional activation. These results suggest a connection between Ca(2+)-signaling via excitation-contraction coupling and the regulation of STARS-mediated gene expression in muscles.
Vaqué, Jose P; Dorsam, Robert T; Feng, Xiaodong; Iglesias-Bartolome, Ramiro; Forsthoefel, David J; Chen, Qianming; Debant, Anne; Seeger, Mark A; Ksander, Bruce R; Teramoto, Hidemi; Gutkind, J Silvio
Activating mutations in GNAQ and GNA11, encoding members of the Gα(q) family of G protein α subunits, are the driver oncogenes in uveal melanoma, and mutations in Gq-linked G protein-coupled receptors have been identified recently in numerous human malignancies. How Gα(q) and its coupled receptors transduce mitogenic signals is still unclear because of the complexity of signaling events perturbed upon Gq activation. Using a synthetic-biology approach and a genome-wide RNAi screen, we found that a highly conserved guanine nucleotide exchange factor, Trio, is essential for activating Rho- and Rac-regulated signaling pathways acting on JNK and p38, and thereby transducing proliferative signals from Gα(q) to the nucleus independently of phospholipase C-β. Indeed, whereas many biological responses elicited by Gq depend on the transient activation of second-messenger systems, Gq utilizes a hard-wired protein-protein-interaction-based signaling circuitry to achieve the sustained stimulation of proliferative pathways, thereby controlling normal and aberrant cell growth.
Herr, Patrick; Lundin, Cecilia; Evers, Bastiaan; Ebner, Daniel; Bauerschmidt, Christina; Kingham, Guy; Palmai-Pallag, Timea; Mortusewicz, Oliver; Frings, Oliver; Sonnhammer, Erik; Helleday, Thomas
To identify new regulators of homologous recombination repair, we carried out a genome-wide short-interfering RNA screen combined with ionizing irradiation using RAD51 foci formation as readout. All candidates were confirmed by independent short-interfering RNAs and validated in secondary assays like recombination repair activity and RPA foci formation. Network analysis of the top modifiers identified gene clusters involved in recombination repair as well as components of the ribosome, the proteasome and the spliceosome, which are known to be required for effective DNA repair. We identified and characterized the RNA polymerase II-associated protein CDC73/Parafibromin as a new player in recombination repair and show that it is critical for genomic stability. CDC73 interacts with components of the SCF/Cullin and INO80/NuA4 chromatin-remodeling complexes to promote Histone ubiquitination. Our findings indicate that CDC73 is involved in local chromatin decondensation at sites of DNA damage to promote DNA repair. This function of CDC73 is related to but independent of its role in transcriptional elongation. PMID:27462432
Harmon, Brooke; Bird, Sara W.; Schudel, Benjamin R.; Hatch, Anson V.; Rasley, Amy
ABSTRACT Rift Valley fever virus (RVFV) is an arbovirus within the Bunyaviridae family capable of causing serious morbidity and mortality in humans and livestock. To identify host factors involved in bunyavirus replication, we employed genome-wide RNA interference (RNAi) screening and identified 381 genes whose knockdown reduced infection. The Wnt pathway was the most represented pathway when gene hits were functionally clustered. With further investigation, we found that RVFV infection activated Wnt signaling, was enhanced when Wnt signaling was preactivated, was reduced with knockdown of β-catenin, and was blocked using Wnt signaling inhibitors. Similar results were found using distantly related bunyaviruses La Crosse virus and California encephalitis virus, suggesting a conserved role for Wnt signaling in bunyaviral infection. We propose a model where bunyaviruses activate Wnt-responsive genes to regulate optimal cell cycle conditions needed to promote efficient viral replication. The findings in this study should aid in the design of efficacious host-directed antiviral therapeutics. IMPORTANCE RVFV is a mosquito-borne bunyavirus that is endemic to Africa but has demonstrated a capacity for emergence in new territories (e.g., the Arabian Peninsula). As a zoonotic pathogen that primarily affects livestock, RVFV can also cause lethal hemorrhagic fever and encephalitis in humans. Currently, there are no treatments or fully licensed vaccines for this virus. Using high-throughput RNAi screening, we identified canonical Wnt signaling as an important host pathway regulating RVFV infection. The beneficial role of Wnt signaling was observed for RVFV, along with other disparate bunyaviruses, indicating a conserved bunyaviral replication mechanism involving Wnt signaling. These studies supplement our knowledge of the fundamental mechanisms of bunyavirus infection and provide new avenues for countermeasure development against pathogenic bunyaviruses. PMID:27226375
Gari, Hamid H; Gearheart, Christy M; Fosmire, Susan; DeGala, Gregory D; Fan, Zeying; Torkko, Kathleen C; Edgerton, Susan M; Lucia, M Scott; Ray, Rahul; Thor, Ann D; Porter, Christopher C; Lambert, James R
Triple-negative breast cancers (TNBC) are among the most aggressive and heterogeneous cancers with a high propensity to invade, metastasize and relapse. Here, we demonstrate that the anticancer compound, AMPI-109, is selectively efficacious in inhibiting proliferation and inducing apoptosis of multiple TNBC subtype cell lines as assessed by activation of pro-apoptotic caspases-3 and 7, PARP cleavage and nucleosomal DNA fragmentation. AMPI-109 had little to no effect on growth in the majority of non-TNBC cell lines examined. We therefore utilized AMPI-109 in a genome-wide shRNA screen in the TNBC cell line, BT-20, to investigate the utility of AMPI-109 as a tool in helping to identify molecular alterations unique to TNBC. Our screen identified the oncogenic phosphatase, PRL-3, as a potentially important driver of TNBC growth, migration and invasion. Through stable lentiviral knock downs and transfection with catalytically impaired PRL-3 in TNBC cells, loss of PRL-3 expression, or functionality, led to substantial growth inhibition. Moreover, AMPI-109 treatment, downregulation of PRL-3 expression or impairment of PRL-3 activity reduced TNBC cell migration and invasion. Histological evaluation of human breast cancers revealed PRL-3 was significantly, though not exclusively, associated with the TNBC subtype and correlated positively with regional and distant metastases, as well as 1 and 3 year relapse free survival. Collectively, our study is proof-of-concept that AMPI-109, a selectively active agent against TNBC cell lines, can be used as a molecular tool to uncover unique drivers of disease progression, such as PRL-3, which we show promotes oncogenic phenotypes in TNBC cells.
Harmon, Brooke; Bird, Sara W.; Schudel, Benjamin R.; ...
Rift Valley fever virus (RVFV) is an arbovirus within the Bunyaviridae family capable of causing serious morbidity and mortality in humans and livestock. To identify host factors involved in bunyavirus replication, we employed genome-wide RNA interference (RNAi) screening and identified 381 genes whose knockdown reduced infection. The Wnt pathway was the most represented pathway when gene hits were functionally clustered. With further investigation, we found that RVFV infection activated Wnt signaling, was enhanced when Wnt signaling was preactivated, was reduced with knockdown of β-catenin, and was blocked using Wnt signaling inhibitors. Similar results were found using distantly related bunyaviruses Lamore » Crosse virus and California encephalitis virus, suggesting a conserved role for Wnt signaling in bunyaviral infection. We propose a model where bunyaviruses activate Wnt-responsive genes to regulate optimal cell cycle conditions needed to promote efficient viral replication. The findings in this study should aid in the design of efficacious host-directed antiviral therapeutics. IMPORTANCE RVFV is a mosquito-borne bunyavirus that is endemic to Africa but has demonstrated a capacity for emergence in new territories (e.g., the Arabian Peninsula). As a zoonotic pathogen that primarily affects livestock, RVFV can also cause lethal hemorrhagic fever and encephalitis in humans. Currently, there are no treatments or fully licensed vaccines for this virus. Using high-throughput RNAi screening, we identified canonical Wnt signaling as an important host pathway regulating RVFV infection. The beneficial role of Wnt signaling was observed for RVFV, along with other disparate bunyaviruses, indicating a conserved bunyaviral replication mechanism involving Wnt signaling. Lastly, these studies supplement our knowledge of the fundamental mechanisms of bunyavirus infection and provide new avenues for countermeasure development against pathogenic bunyaviruses.« less
Harmon, Brooke; Bird, Sara W.; Schudel, Benjamin R.; Hatch, Anson V.; Rasley, Amy; Negrete, Oscar A.
Rift Valley fever virus (RVFV) is an arbovirus within the
IMPORTANCE RVFV is a mosquito-borne bunyavirus that is endemic to Africa but has demonstrated a capacity for emergence in new territories (e.g., the Arabian Peninsula). As a zoonotic pathogen that primarily affects livestock, RVFV can also cause lethal hemorrhagic fever and encephalitis in humans. Currently, there are no treatments or fully licensed vaccines for this virus. Using high-throughput RNAi screening, we identified canonical Wnt signaling as an important host pathway regulating RVFV infection. The beneficial role of Wnt signaling was observed for RVFV, along with other disparate bunyaviruses, indicating a conserved bunyaviral replication mechanism involving Wnt signaling. Lastly, these studies supplement our knowledge of the fundamental mechanisms of bunyavirus infection and provide new avenues for
Gari, Hamid H.; Gearheart, Christy M.; Fosmire, Susan; DeGala, Gregory D.; Fan, Zeying; Torkko, Kathleen C.; Edgerton, Susan M.; Lucia, M. Scott; Ray, Rahul; Thor, Ann D.; Porter, Christopher C.; Lambert, James R.
Triple-negative breast cancers (TNBC) are among the most aggressive and heterogeneous cancers with a high propensity to invade, metastasize and relapse. Here, we demonstrate that the anticancer compound, AMPI-109, is selectively efficacious in inhibiting proliferation and inducing apoptosis of multiple TNBC subtype cell lines as assessed by activation of pro-apoptotic caspases-3 and 7, PARP cleavage and nucleosomal DNA fragmentation. AMPI-109 had little to no effect on growth in the majority of non-TNBC cell lines examined. We therefore utilized AMPI-109 in a genome-wide shRNA screen in the TNBC cell line, BT-20, to investigate the utility of AMPI-109 as a tool in helping to identify molecular alterations unique to TNBC. Our screen identified the oncogenic phosphatase, PRL-3, as a potentially important driver of TNBC growth, migration and invasion. Through stable lentiviral knock downs and transfection with catalytically impaired PRL-3 in TNBC cells, loss of PRL-3 expression, or functionality, led to substantial growth inhibition. Moreover, AMPI-109 treatment, downregulation of PRL-3 expression or impairment of PRL-3 activity reduced TNBC cell migration and invasion. Histological evaluation of human breast cancers revealed PRL-3 was significantly, though not exclusively, associated with the TNBC subtype and correlated positively with regional and distant metastases, as well as 1 and 3 year relapse free survival. Collectively, our study is proof-of-concept that AMPI-109, a selectively active agent against TNBC cell lines, can be used as a molecular tool to uncover unique drivers of disease progression, such as PRL-3, which we show promotes oncogenic phenotypes in TNBC cells. PMID:26909599
Ayer, Anita; Fellermeier, Sina; Fife, Christopher; Li, Simone S.; Smits, Gertien; Meyer, Andreas J.; Dawes, Ian W.; Perrone, Gabriel G.
Maintenance of an optimal redox environment is critical for appropriate functioning of cellular processes and cell survival. Despite the importance of maintaining redox homeostasis, it is not clear how the optimal redox potential is sensed and set, and the processes that impact redox on a cellular/organellar level are poorly understood. The genetic bases of cellular redox homeostasis were investigated using a green fluorescent protein (GFP) based redox probe, roGFP2 and a pH sensitive GFP-based probe, pHluorin. The use of roGFP2, in conjunction with pHluorin, enabled determination of pH-adjusted sub-cellular redox potential in a non-invasive and real-time manner. A genome-wide screen using both the non-essential and essential gene collections was carried out in Saccharomyces cerevisiae using cytosolic-roGFP2 to identify factors essential for maintenance of cytosolic redox state under steady-state conditions. 102 genes of diverse function were identified that are required for maintenance of cytosolic redox state. Mutations in these genes led to shifts in the half-cell glutathione redox potential by 75-10 mV. Interestingly, some specific oxidative stress-response processes were identified as over-represented in the data set. Further investigation of the role of oxidative stress-responsive systems in sub-cellular redox homeostasis was conducted using roGFP2 constructs targeted to the mitochondrial matrix and peroxisome and EGSH was measured in cells in exponential and stationary phase. Analyses allowed for the identification of key redox systems on a sub-cellular level and the identification of novel genes involved in the regulation of cellular redox homeostasis. PMID:22970195
Ayer, Anita; Fellermeier, Sina; Fife, Christopher; Li, Simone S; Smits, Gertien; Meyer, Andreas J; Dawes, Ian W; Perrone, Gabriel G
Maintenance of an optimal redox environment is critical for appropriate functioning of cellular processes and cell survival. Despite the importance of maintaining redox homeostasis, it is not clear how the optimal redox potential is sensed and set, and the processes that impact redox on a cellular/organellar level are poorly understood. The genetic bases of cellular redox homeostasis were investigated using a green fluorescent protein (GFP) based redox probe, roGFP2 and a pH sensitive GFP-based probe, pHluorin. The use of roGFP2, in conjunction with pHluorin, enabled determination of pH-adjusted sub-cellular redox potential in a non-invasive and real-time manner. A genome-wide screen using both the non-essential and essential gene collections was carried out in Saccharomyces cerevisiae using cytosolic-roGFP2 to identify factors essential for maintenance of cytosolic redox state under steady-state conditions. 102 genes of diverse function were identified that are required for maintenance of cytosolic redox state. Mutations in these genes led to shifts in the half-cell glutathione redox potential by 75-10 mV. Interestingly, some specific oxidative stress-response processes were identified as over-represented in the data set. Further investigation of the role of oxidative stress-responsive systems in sub-cellular redox homeostasis was conducted using roGFP2 constructs targeted to the mitochondrial matrix and peroxisome and E(GSH) was measured in cells in exponential and stationary phase. Analyses allowed for the identification of key redox systems on a sub-cellular level and the identification of novel genes involved in the regulation of cellular redox homeostasis.
Huang, Bo; Lu, Jian; Byström, Anders S.
We recently showed that the γ-subunit of Kluyveromyces lactis killer toxin (γ-toxin) is a tRNA endonuclease that cleaves , , and 3′ of the wobble nucleoside 5-methoxycarbonylmethyl-2-thiouridine (mcm5s2U). The 5-methoxycarbonylmethyl (mcm5) side chain was important for efficient cleavage by γ-toxin, and defects in mcm5 side-chain synthesis correlated with resistance to γ-toxin. Based on this correlation, a genome-wide screen was performed to identify gene products involved in the formation of the mcm5 side chain. From a collection of 4826 homozygous diploid Saccharomyces cerevisiae strains, each with one nonessential gene deleted, 63 mutants resistant to Kluyveromyces lactis killer toxin were identified. Among these, eight were earlier identified to have a defect in formation of the mcm5 side chain. Analysis of the remaining mutants and other known γ-toxin resistant mutants revealed that sit4, kti14, and KTI5 mutants also have a defect in the formation of mcm5. A mutant lacking two of the Sit4-associated proteins, Sap185 and Sap190, displays the same modification defect as a sit4-null mutant. Interestingly, several mutants were found to be defective in the synthesis of the 2-thio (s2) group of the mcm5s2U nucleoside. In addition to earlier described mutants, formation of the s2 group was also abolished in urm1, uba4, and ncs2 mutants and decreased in the yor251c mutant. Like the absence of the mcm5 side chain, the lack of the s2 group renders less sensitive to γ-toxin, reinforcing the importance of the wobble nucleoside mcm5s2U for tRNA cleavage by γ-toxin. PMID:18755837
Tanaka, Atsushi; Tumkosit, Uranan; Nakamura, Shota; Motooka, Daisuke; Kishishita, Natsuko; Priengprom, Thongkoon; Sa-Ngasang, Areerat; Kinoshita, Taroh; Takeda, Naokazu; Maeda, Yusuke
The molecular mechanisms underlying chikungunya virus (CHIKV) infection are poorly characterized. In this study, we analyzed the host factors involved in CHIKV infection using genome-wide screening. Human haploid HAP1 cells, into which an exon-trapping vector was introduced, were challenged with a vesicular stomatitis virus pseudotype bearing the CHIKV E3-E1 envelope proteins. Analysis of genes enriched in the cells resistant to the pseudotyped virus infection unveiled a critical role of N-sulfation of heparan sulfate (HS) for the infectivity of a clinically isolated CHIKV Thai #16856 strain to HAP1 cells. Knockout of NDST1 that catalyzes N-sulfation of HS greatly decreased the binding and infectivity of CHIKV Thai#16856 strain but not infectivity of Japanese encephalitis virus (JEV) and yellow fever virus (YFV). Whereas glycosaminoglycans were commonly required for efficient infectivity of CHIKV, JEV and YFV as shown by using B3GAT3 knockout cells, the tropism for N-sulfate was specific to CHIKV. Expression of chondroitin sulfate (CS) in NDST1-knockout HAP1 cells did not restore the binding of CHIKV Thai#16856 strain and the infectivity of its pseudotype but restored the infectivity of authentic CHIKV Thai#16856, suggesting that CS functions at the later steps after the CHIKV binding. Among the genes enriched in this screening, we found that TM9SF2 is critical for N-sulfation of HS and therefore for CHIKV infection, because it is involved in proper localization and stability of NDST1. Determination of the significance of and the relevant proteins to N-sulfation of HS may contribute to understanding mechanisms of CHIKV propagation, cell tropism and pathogenesis.IMPORTANCE Recent outbreaks of chikungunya fever have increased its clinical importance. Chikungunya virus (CHIKV) utilizes host glycosaminoglycans to bind efficiently to its target cells. However, the substructure in glycosaminoglycans required for CHIKV infection have not been characterized. Here, we
Identification of seven genes essential for male fertility through a genome-wide association study of non-obstructive azoospermia and RNA interference-mediated large-scale functional screening in Drosophila.
Yu, Jun; Wu, Hao; Wen, Yang; Liu, Yujuan; Zhou, Tao; Ni, Bixian; Lin, Yuan; Dong, Jing; Zhou, Zuomin; Hu, Zhibin; Guo, Xuejiang; Sha, Jiahao; Tong, Chao
Non-obstructive azoospermia (NOA) is a complex and severe condition whose etiology remains largely unknown. In a genome-wide association study (GWAS) of NOA in Chinese men, few loci reached genome-wide significance, although this might be a result of genetic heterogeneity. Single nucleotide polymorphisms (SNPs) without genome-wide significance may also indicate genes that are essential for fertility, and multiple stage validation can lead to false-negative results. To perform large-scale functional screening of the genes surrounding these SNPs, we used in vivo RNA interference (RNAi) in Drosophila, which has a short maturation cycle and is suitable for high-throughput analysis. The analysis found that 7 (31.8%) of the 22 analyzed orthologous Drosophila genes were essential for male fertility. These genes corresponded to nine loci. Of these genes, leukocyte-antigen-related-like (Lar) is primarily required in germ cells to sustain spermatogenesis, whereas CG12404, doublesex-Mab-related 11E (dmrt11E), CG6769, estrogen-related receptor (ERR) and sulfateless (sfl) function in somatic cells. Interestingly, ERR and sfl are also required for testis morphogenesis. Our study thus demonstrates that SNPs without genome-wide significance in GWAS may also provide clues to disease-related genes and therefore warrant functional analysis.
Chromosomal region 8q24 is of interest due to its consistent implication in GWA studies for prostate cancer. A rare non-recurrent 8486 base pair...indicates that some common CNVs have already been indirectly assessed for association with traits in SNP-based genome-wide association studies ( GWAS ...However, recurrent variants and risk-bearing alleles with low minor allele frequencies (MAF) may not be well tagged by SNPs in GWAS , and these variants
Wang, Shuai; Luo, Xuenong; Zhang, Shaohua; Yin, Cai; Dou, Yongxi; Cai, Xuepeng
No endogenous insulin-like peptides in parasitic flatworms have been reported. Insulin receptors from flukes and tapeworms have been shown to interact directly with the host-derived insulin molecule, which suggests the exploitation of host-derived insulin. In this study, a strategy of genome-wide searches followed by comprehensive analyses of strictly conserved features of the insulin family was used to demonstrate the presence of putative insulin-like peptides in the genomes of six tapeworms and two flukes. In addition, whole insulin signaling pathways were annotated on a genome-wide scale. Two putative insulin-like peptide genes in each genome of tapeworms and one insulin-like peptide gene in each genome of flukes were identified. The comprehensive analyses revealed that all of these peptides showed the common features shared by other members of the insulin family, and the phylogenetic analysis implied a putative gene duplication event in the Cestoda during the evolution of insulin-like peptide genes. The quantitative expression analysis and immunolocalization results suggested a putative role of these peptides in reproduction. Entire sets of major components of the classic insulin signaling pathway were successfully identified, suggesting that this pathway in parasitic flatworms might also regulate many other important biological activities. We believe that the identification of the insulin-like peptides gives us a better understanding of the insulin signaling pathway in these parasites, as well as host-parasite interactions.
Ferrari, Raffaele; Grassi, Mario; Salvi, Erika; Borroni, Barbara; Palluzzi, Fernando; Pepe, Daniele; D'Avila, Francesca; Padovani, Alessandro; Archetti, Silvana; Rainero, Innocenzo; Rubino, Elisa; Pinessi, Lorenzo; Benussi, Luisa; Binetti, Giuliano; Ghidoni, Roberta; Galimberti, Daniela; Scarpini, Elio; Serpente, Maria; Rossi, Giacomina; Giaccone, Giorgio; Tagliavini, Fabrizio; Nacmias, Benedetta; Piaceri, Irene; Bagnoli, Silvia; Bruni, Amalia C.; Maletta, Raffaele G.; Bernardi, Livia; Postiglione, Alfredo; Milan, Graziella; Franceschi, Massimo; Puca, Annibale A.; Novelli, Valeria; Barlassina, Cristina; Glorioso, Nicola; Manunta, Paolo; Singleton, Andrew; Cusi, Daniele; Hardy, John; Momeni, Parastoo
Frontotemporal dementia (FTD) is the second most prevalent form of early onset dementia after Alzheimer's disease (AD). We performed a case-control association study in an Italian FTD cohort (n = 530) followed by the novel single nucleotide polymorphisms (SNPs)-to-genes approach and functional annotation analysis. We identified 2 novel potential loci for FTD. Suggestive SNPs reached p-values ∼10−7 and odds ratio > 2.5 (2p16.3) and 1.5 (17q25.3). Suggestive alleles at 17q25.3 identified a disease-associated haplotype causing decreased expression of –cis genes such as RFNG and AATK involved in neuronal genesis and differentiation and axon outgrowth, respectively. We replicated this locus through the SNPs-to-genes approach. Our functional annotation analysis indicated significant enrichment for functions of the brain (neuronal genesis, differentiation, and maturation), the synapse (neurotransmission and synapse plasticity), and elements of the immune system, the latter supporting our recent international FTD–genome-wide association study. This is the largest genome-wide study in Italian FTD to date. Although our results are not conclusive, we set the basis for future replication studies and identification of susceptible molecular mechanisms involved in FTD pathogenesis. PMID:26154020
Warner, Neil; Burberry, Aaron; Pliakas, Maria; McDonald, Christine; Núñez, Gabriel
NOD2 encodes an intracellular multidomain pattern recognition receptor that is the strongest known genetic risk factor in the pathogenesis of Crohn disease (CD), a chronic relapsing inflammatory disorder of the intestinal tract. NOD2 functions as a sensor for bacterial cell wall components and activates proinflammatory and antimicrobial signaling pathways. Here, using a genome-wide small interfering RNA (siRNA) screen, we identify numerous genes that regulate secretion of the proinflammatory cytokine IL-8 in response to NOD2 activation. Moreover, many of the identified IL-8 regulators are linked by protein-protein interactions, revealing subnetworks of highly connected IL-8 regulators implicated in processes such as vesicle formation, mRNA stability, and protein ubiquitination and trafficking. A TNFα counterscreen to induce IL-8 secretion in an NOD2-independent manner reveals that the majority of the identified regulators affect IL-8 secretion irrespective of the initiating stimuli. Using immortalized macrophages, we validate the ubiquitin protease, USP8, and the endosomal sorting protein, VPS28, as negative regulators of NOD2-induced cytokine secretion. Interestingly, several genes that affect NOD2-induced IL-8 secretion are present in loci associated with CD risk by genome-wide association studies, supporting a role for the NOD2/IL-8 pathway, and not just NOD2, in the pathogenesis of CD. Overall, this screen provides a valuable resource in the advancement of our understanding of the genes that regulate the secretion of IL-8.
Andritschke, Daniel; Dilling, Sabrina; Emmenlauer, Mario; Welz, Tobias; Schmich, Fabian; Misselwitz, Benjamin; Rämö, Pauli; Rottner, Klemens; Kerkhoff, Eugen; Wada, Teiji; Penninger, Josef M.; Beerenwinkel, Niko; Horvath, Peter; Dehio, Christoph; Hardt, Wolf-Dietrich
Salmonella Typhimurium (S. Tm) is a leading cause of diarrhea. The disease is triggered by pathogen invasion into the gut epithelium. Invasion is attributed to the SPI-1 type 3 secretion system (T1). T1 injects effector proteins into epithelial cells and thereby elicits rearrangements of the host cellular actin cytoskeleton and pathogen invasion. The T1 effector proteins SopE, SopB, SopE2 and SipA are contributing to this. However, the host cell factors contributing to invasion are still not completely understood. To address this question comprehensively, we used Hela tissue culture cells, a genome-wide siRNA library, a modified gentamicin protection assay and S. TmSipA, a sopBsopE2sopE mutant which strongly relies on the T1 effector protein SipA to invade host cells. We found that S. TmSipA invasion does not elicit membrane ruffles, nor promote the entry of non-invasive bacteria "in trans". However, SipA-mediated infection involved the SPIRE family of actin nucleators, besides well-established host cell factors (WRC, ARP2/3, RhoGTPases, COPI). Stage-specific follow-up assays and knockout fibroblasts indicated that SPIRE1 and SPIRE2 are involved in different steps of the S. Tm infection process. Whereas SPIRE1 interferes with bacterial binding, SPIRE2 influences intracellular replication of S. Tm. Hence, these two proteins might fulfill non-redundant functions in the pathogen-host interaction. The lack of co-localization hints to a short, direct interaction between S. Tm and SPIRE proteins or to an indirect effect. PMID:27627128
Mohamed, Junaith S.; Hajira, Ameena; Lopez, Michael A.; Boriek, Aladin M.
Muscular dystrophies (MDs) are a heterogeneous group of genetic and neuromuscular disorders, which result in severe loss of motor ability and skeletal muscle mass and function. Aberrant mechanotransduction and dysregulated-microRNA pathways are often associated with the progression of MD. Here, we hypothesized that dysregulation of mechanosensitive microRNAs (mechanomiRs) in dystrophic skeletal muscle plays a major role in the progression of MD. To test our hypothesis, we performed a genome-wide expression profile of anisotropically regulated mechanomiRs and bioinformatically analyzed their target gene networks. We assessed their functional roles in the advancement of MD using diaphragm muscles from mdm (MD with myositis) mice, an animal model of human tibial MD (titinopathy), and their wild-type littermates. We were able to show that ex vivo anisotropic mechanical stretch significantly alters the miRNA expression profile in diaphragm muscles from WT and mdm mice; as a result, some of the genes associated with MDs are dysregulated in mdm mice due to differential regulation of a distinct set of mechanomiRs. Interestingly, we found a contrasting expression pattern of the highly expressed let-7 family mechanomiRs, let-7e-5p and miR-98–5p, and their target genes associated with the extracellular matrix and TGF-β pathways, respectively, between WT and mdm mice. Gain- and loss-of-function analysis of let-7e-5p in myocytes isolated from the diaphragms of WT and mdm mice confirmed Col1a1, Col1a2, Col3a1, Col24a1, Col27a1, Itga1, Itga4, Scd1, and Thbs1 as target genes of let-7e-5p. Furthermore, we found that miR-98 negatively regulates myoblast differentiation. Our study therefore introduces additional biological players in the regulation of skeletal muscle structure and myogenesis that may contribute to unexplained disorders of MD. PMID:26272747
Thornbrough, Joshua M.; Gopinath, Adarsh; Hundley, Tom; Worley, Micah J.
Salmonella enterica can actively invade the gastro-intestinal epithelium. This frequently leads to diarrheal disease, and also gives the pathogen access to phagocytes that can serve as vehicles for dissemination into deeper tissue. The ability to invade host cells is also important in maintaining the carrier state. While much is known about the bacterial factors that promote invasion, relatively little is known about the host factors involved. To gain insight into how Salmonella enterica serovar Typhimurium is able to invade normally non-phagocytic cells, we undertook a global RNAi screen with S. Typhimurium-infected human epithelial cells. In all, we identified 633 genes as contributing to bacterial internalization. These genes fall into a diverse group of functional categories revealing that cytoskeletal regulators are not the only factors that modulate invasion. In fact, potassium ion transport was the most enriched molecular function category in our screen, reinforcing a link between potassium and internalization. In addition to providing new insights into the molecular mechanisms underlying the ability of pathogens to invade host cells, all 633 host factors identified are candidates for new anti-microbial targets for treating Salmonella infections, and may be useful in curtailing infections with other pathogens as well. PMID:27880807
Kwon, Soon-Kyeong; Park, Yon-Kyoung; Kim, Jihyun F.
A marine bacterium, Hahella chejuensis, recently has attracted attention due to its lytic activity against a red-tide dinoflagellate. The algicidal function originates from its red pigment, prodigiosin, which also exhibits immunosuppressive or anticancer activity. Genome sequencing and functional analysis revealed a gene set contained in the hap gene cluster that is responsible for the biosynthesis of prodigiosin. To screen for the factors affecting the prodigiosin biosynthesis, we constructed a plasmid library of the H. chejuensis genomic DNA, introduced it into Escherichia coli strains harboring the hap cluster, and observed changes in production of the red pigment. Among the screened clones, hapXY genes whose products constitute a two-component signal transduction system were elucidated as positive regulators of the pigment production. In addition, an Hfq-dependent, noncoding region located at one end of the hap cluster was confirmed to play roles in regulation. Identification of factors involved in the regulation of prodigiosin biosynthesis should help in understanding how the prodigiosin-biosynthetic pathway is organized and controlled and also aid in modulating the overexpression of prodigiosin in a heterologous host, such as E. coli, or in the natural producer, H. chejuensis. PMID:20038694
Mavrakis, Konstantinos J; Wolfe, Andrew L; Oricchio, Elisa; Palomero, Teresa; de Keersmaecker, Kim; McJunkin, Katherine; Zuber, Johannes; James, Taneisha; Khan, Aly A; Leslie, Christina S; Parker, Joel S; Paddison, Patrick J; Tam, Wayne; Ferrando, Adolfo; Wendel, Hans-Guido
MicroRNAs (miRNAs) have emerged as novel cancer genes. In particular, the miR-17-92 cluster, containing six individual miRNAs, is highly expressed in haematopoietic cancers and promotes lymphomagenesis in vivo. Clinical use of these findings hinges on isolating the oncogenic activity within the 17-92 cluster and defining its relevant target genes. Here we show that miR-19 is sufficient to promote leukaemogenesis in Notch1-induced T-cell acute lymphoblastic leukaemia (T-ALL) in vivo. In concord with the pathogenic importance of this interaction in T-ALL, we report a novel translocation that targets the 17-92 cluster and coincides with a second rearrangement that activates Notch1. To identify the miR-19 targets responsible for its oncogenic action, we conducted a large-scale short hairpin RNA screen for genes whose knockdown can phenocopy miR-19. Strikingly, the results of this screen were enriched for miR-19 target genes, and include Bim (Bcl2L11), AMP-activated kinase (Prkaa1) and the phosphatases Pten and PP2A (Ppp2r5e). Hence, an unbiased, functional genomics approach reveals a coordinate clampdown on several regulators of phosphatidylinositol-3-OH kinase-related survival signals by the leukaemogenic miR-19.
Pradeepkiran, Jangampalli Adi; Sainath, Sri Bhashyam; Kumar, Konidala Kranthi; Bhaskar, Matcha
Brucella melitensis 16M is a Gram-negative coccobacillus that infects both animals and humans. It causes a disease known as brucellosis, which is characterized by acute febrile illness in humans and causes abortions in livestock. To prevent and control brucellosis, identification of putative drug targets is crucial. The present study aimed to identify drug targets in B. melitensis 16M by using a subtractive genomic approach. We used available database repositories (Database of Essential Genes, Kyoto Encyclopedia of Genes and Genomes Automatic Annotation Server, and Kyoto Encyclopedia of Genes and Genomes) to identify putative genes that are nonhomologous to humans and essential for pathogen B. melitensis 16M. The results revealed that among 3 Mb genome size of pathogen, 53 putative characterized and 13 uncharacterized hypothetical genes were identified; further, from Basic Local Alignment Search Tool protein analysis, one hypothetical protein showed a close resemblance (50%) to Silicibacter pomeroyi DUF1285 family protein (2RE3). A further homology model of the target was constructed using MODELLER 9.12 and optimized through variable target function method by molecular dynamics optimization with simulating annealing. The stereochemical quality of the restrained model was evaluated by PROCHECK, VERIFY-3D, ERRAT, and WHATIF servers. Furthermore, structure-based virtual screening was carried out against the predicted active site of the respective protein using the glycerol structural analogs from the PubChem database. We identified five best inhibitors with strong affinities, stable interactions, and also with reliable drug-like properties. Hence, these leads might be used as the most effective inhibitors of modeled protein. The outcome of the present work of virtual screening of putative gene targets might facilitate design of potential drugs for better treatment against brucellosis. PMID:25834405
Pradeepkiran, Jangampalli Adi; Sainath, Sri Bhashyam; Kumar, Konidala Kranthi; Bhaskar, Matcha
Brucella melitensis 16M is a Gram-negative coccobacillus that infects both animals and humans. It causes a disease known as brucellosis, which is characterized by acute febrile illness in humans and causes abortions in livestock. To prevent and control brucellosis, identification of putative drug targets is crucial. The present study aimed to identify drug targets in B. melitensis 16M by using a subtractive genomic approach. We used available database repositories (Database of Essential Genes, Kyoto Encyclopedia of Genes and Genomes Automatic Annotation Server, and Kyoto Encyclopedia of Genes and Genomes) to identify putative genes that are nonhomologous to humans and essential for pathogen B. melitensis 16M. The results revealed that among 3 Mb genome size of pathogen, 53 putative characterized and 13 uncharacterized hypothetical genes were identified; further, from Basic Local Alignment Search Tool protein analysis, one hypothetical protein showed a close resemblance (50%) to Silicibacter pomeroyi DUF1285 family protein (2RE3). A further homology model of the target was constructed using MODELLER 9.12 and optimized through variable target function method by molecular dynamics optimization with simulating annealing. The stereochemical quality of the restrained model was evaluated by PROCHECK, VERIFY-3D, ERRAT, and WHATIF servers. Furthermore, structure-based virtual screening was carried out against the predicted active site of the respective protein using the glycerol structural analogs from the PubChem database. We identified five best inhibitors with strong affinities, stable interactions, and also with reliable drug-like properties. Hence, these leads might be used as the most effective inhibitors of modeled protein. The outcome of the present work of virtual screening of putative gene targets might facilitate design of potential drugs for better treatment against brucellosis.
García, Patricia; Encinar Del Dedo, Javier; Ayté, José; Hidalgo, Elena
In response to environmental cues, the mitogen-activated protein kinase Sty1-driven signaling cascade activates hundreds of genes to induce a robust anti-stress cellular response in fission yeast. Thus, upon stress imposition Sty1 transiently accumulates in the nucleus where it up-regulates transcription through the Atf1 transcription factor. Several regulators of transcription and translation have been identified as important to mount an integral response to oxidative stress, such as the Spt-Ada-Gcn5-acetyl transferase or Elongator complexes, respectively. With the aim of identifying new regulators of this massive gene expression program, we have used a GFP-based protein reporter and screened a fission yeast deletion collection using flow cytometry. We find that the levels of catalase fused to GFP, both before and after a threat of peroxides, are altered in hundreds of strains lacking components of chromatin modifiers, transcription complexes, and modulators of translation. Thus, the transcription elongation complex Paf1, the histone methylase Set1-COMPASS, and the translation-related Trm112 dimers are all involved in full expression of Ctt1-GFP and in wild-type tolerance to peroxides.
Asakawa, Jun-Ichi; Kodaira, Mieko; Miura, Akiko; Tsuji, Takahiro; Nakamoto, Yoshiko; Imanaka, Masaaki; Kitamura, Jun; Cullings, Harry; Nishimura, Mayumi; Shimada, Yoshiya; Nakamura, Nori
Until the end of the 20th century, mouse germ cell data on induced mutation rates, which were collected using classical genetic methods at preselected specific loci, provided the principal basis for estimates of genetic risks from radiation in humans. The work reported on here is an extension of earlier efforts in this area using molecular methods. It focuses on validating the use of array comparative genomic hybridization (array CGH) methods for identifying radiation-induced copy number variants (CNVs) and specifically for DNA deletions. The emphasis on deletions stems from the view that it constitutes the predominant type of radiation-induced genetic damage, which is relevant for estimating genetic risks in humans. In the current study, deletion mutations were screened in the genomes of F1 mice born to unirradiated or 4 Gy irradiated sires at the spermatogonia stage (100 offspring each). The array CGH analysis was performed using a "2M array" with over 2 million probes with a mean interprobe distance of approximately 1 kb. The results provide evidence of five molecularly-confirmed paternally-derived deletions in the irradiated group (5/100) and one in the controls (1/100). These data support a calculation, which estimates that the mutation rate is 1 × 10(-2)/Gy per genome for induced deletions; this is much lower than would be expected if one assumes that the specific locus rate of 1 × 10(-5)/locus per Gy (at 34 loci) is applicable to other genes in the genome. The low observed rate of induced deletions suggests that the effective number of genes/genomic regions at which recoverable deletions could be induced would be only approximately 1,000. This estimate is far lower than expected from the size of the mouse genome (>20,000 genes). Such a discrepancy between observation and expectation can occur if the genome contains numerous genes that are far less sensitive to radiation-induced deletions, if many deletion-bearing offspring are not viable or if the current
Su, Bing; Gao, Lingqiu; Baranowski, Catherine; Gillard, Bryan; Wang, Jianmin; Ransom, Ryan; Ko, Hyun-Kyung; Gelman, Irwin H.
Activation of the PI3K/AKT signal pathway is a known driving force for the progression to castration-recurrent prostate cancer (CR-CaP), which constitutes the major lethal phenotype of CaP. Here, we identify using a genomic shRNA screen the PI3K/AKT-inactivating downstream target, FOXO4, as a potential CaP metastasis suppressor. FOXO4 protein levels inversely correlate with the invasive potential of a panel of human CaP cell lines, with decreased mRNA levels correlating with increased incidence of clinical metastasis. Knockdown (KD) of FOXO4 in human LNCaP cells causes increased invasion in vitro and lymph node (LN) metastasis in vivo without affecting indices of proliferation or apoptosis. Increased Matrigel invasiveness was found by KD of FOXO1 but not FOXO3. Comparison of differentially expressed genes affected by FOXO4-KD in LNCaP cells in culture, in primary tumors and in LN metastases identified a panel of upregulated genes, including PIP, CAMK2N1, PLA2G16 and PGC, which, if knocked down by siRNA, could decrease the increased invasiveness associated with FOXO4 deficiency. Although only some of these genes encode FOXO promoter binding sites, they are all RUNX2-inducible, and RUNX2 binding to the PIP promoter is increased in FOXO4-KD cells. Indeed, the forced expression of FOXO4 reversed the increased invasiveness of LNCaP/shFOXO4 cells; the forced expression of FOXO4 did not alter RUNX2 protein levels, yet it decreased RUNX2 binding to the PIP promoter, resulting in PIP downregulation. Finally, there was a correlation between FOXO4, but not FOXO1 or FOXO3, downregulation and decreased metastasis-free survival in human CaP patients. Our data strongly suggest that increased PI3K/AKT-mediated metastatic invasiveness in CaP is associated with FOXO4 loss, and that mechanisms to induce FOXO4 re-expression might suppress CaP metastatic aggressiveness. PMID:24983969
Identification of Restriction Factors by Human Genome-Wide RNA Interference Screening of Viral Host Range Mutants Exemplified by Discovery of SAMD9 and WDR6 as Inhibitors of the Vaccinia Virus K1L−C7L− Mutant
Sivan, Gilad; Ormanoglu, Pinar; Buehler, Eugen C.; Martin, Scott E.
ABSTRACT RNA interference (RNAi) screens intended to identify host factors that restrict virus replication may fail if the virus already counteracts host defense mechanisms. To overcome this limitation, we are investigating the use of viral host range mutants that exhibit impaired replication in nonpermissive cells. A vaccinia virus (VACV) mutant with a deletion of both the C7L and K1L genes, K1L−C7L−, which abrogates replication in human cells at a step prior to late gene expression, was chosen for this strategy. We carried out a human genome-wide small interfering RNA (siRNA) screen in HeLa cells infected with a VACV K1L−C7L− mutant that expresses the green fluorescent protein regulated by a late promoter. This positive-selection screen had remarkably low background levels and resulted in the identification of a few cellular genes, notably SAMD9 and WDR6, from approximately 20,000 tested that dramatically enhanced green fluorescent protein expression. Replication of the mutant virus was enabled by multiple siRNAs to SAMD9 or WDR6. Moreover, SAMD9 and WDR6 clustered regularly interspaced short palindromic repeat (CRISPR)/Cas9 knockout HeLa cell lines were permissive for replication of the K1L−C7L− mutant, in agreement with the siRNA data. Expression of exogenous SAMD9 or interferon regulatory factor 1 restricted replication of the K1L−C7L− mutant in the SAMD9−/− cells. Independent interactions of SAMD9 with the K1 and C7 proteins were suggested by immunoprecipitation. Knockout of WDR6 did not reduce the levels of SAMD9 and interactions of WDR6 with SAMD9, C7, and K1 proteins were not detected, suggesting that these restriction factors act independently but possibly in the same innate defense pathway. PMID:26242627
Gonidakis, Stavros; Finkel, Steven E.; Longo, Valter D.
Summary Single-gene mutants with extended lifespan have been described in several model organisms. We performed a genome-wide screen for long-lived mutants in Escherichia coli which revealed strains lacking TCA cycle-related genes that exhibit longer stationary phase survival and increased resistance to heat stress compared to wild-type. Extended lifespan in the sdhA mutant, lacking subunit A of succinate dehydrogenase, is associated with reduced production of superoxide and increased stress resistance. On the other hand, the longer lifespan of the lipoic acid synthase mutant (lipA) is associated with reduced oxygen consumption and requires the acetate-producing enzyme pyruvate oxidase, as well as acetyl-CoA synthetase, the enzyme that converts extracellular acetate to acetyl-CoA. The hypoxia-inducible transcription factor ArcA, acting independently of acetate metabolism, is also required for maximum lifespan extension in the lipA and lpdA mutants, indicating that these mutations promote entry into a mode normally associated with a low-oxygen environment. Since analogous changes from respiration to fermentation have been observed in long-lived Saccharomyces cerevisiae and Caenorhabditis elegans strains, such metabolic alterations may represent an evolutionarily conserved strategy to extend lifespan. PMID:20707865
Genome-Wide siRNA Screen Identifies Complementary Signaling Pathways Involved in Listeria Infection and Reveals Different Actin Nucleation Mechanisms during Listeria Cell Invasion and Actin Comet Tail Formation
Kühbacher, Andreas; Emmenlauer, Mario; Rämo, Pauli; Kafai, Natasha; Dehio, Christoph
ABSTRACT Listeria monocytogenes enters nonphagocytic cells by a receptor-mediated mechanism that is dependent on a clathrin-based molecular machinery and actin rearrangements. Bacterial intra- and intercellular movements are also actin dependent and rely on the actin nucleating Arp2/3 complex, which is activated by host-derived nucleation-promoting factors downstream of the cell receptor Met during entry and by the bacterial nucleation-promoting factor ActA during comet tail formation. By genome-wide small interfering RNA (siRNA) screening for host factors involved in bacterial infection, we identified diverse cellular signaling networks and protein complexes that support or limit these processes. In addition, we could precise previously described molecular pathways involved in Listeria invasion. In particular our results show that the requirements for actin nucleators during Listeria entry and actin comet tail formation are different. Knockdown of several actin nucleators, including SPIRE2, reduced bacterial invasion while not affecting the generation of comet tails. Most interestingly, we observed that in contrast to our expectations, not all of the seven subunits of the Arp2/3 complex are required for Listeria entry into cells or actin tail formation and that the subunit requirements for each of these processes differ, highlighting a previously unsuspected versatility in Arp2/3 complex composition and function. PMID:25991686
Genome-Wide Linkage Screen for Systolic Blood Pressure in the Veterans Administration Genetic Epidemiology Study (VAGES) of Mexican-Americans and Confirmation of a Major Susceptibility Locus on Chromosome 6q14.1
Puppala, Sobha; Coletta, Dawn K.; Schneider, Jennifer; Hu, Shirley L.; Farook, Vidya S.; Dyer, Thomas D.; Arya, Rector; Blangero, John; Duggirala, Ravindranath; DeFronzo, Ralph A.; Jenkinson, Christopher P.
Objective Hypertension or high blood pressure is a strong correlate of diseases such as obesity and type 2 diabetes. We conducted a genome-wide linkage screen to identify susceptibility genes influencing systolic blood pressure (SBP) and diastolic blood pressure (DBP) in Mexican-Americans from the Veterans Administration Genetic Epidemiology Study (VAGES). Methods Using data from 1,089 individuals distributed across 266 families, we performed a multipoint linkage analysis to localize susceptibility loci for SBP and DBP by applying two models. In model 1, we added a sensible constant to the observed BP values in treated subjects [Tobin et al.; Stat Med 2005;24:2911–2935] to account for antihypertensive use (i.e. 15 and 10 mm Hg to SBP and DBP values, respectively). In model 2, we fixed values of 140 mm Hg for SBP and 90 mm Hg for DBP, if the treated values were less than the standard referenced treatment thresholds of 140/90 mm Hg for hypertensive status. However, if the observed treated BP values were found to be above these standard treatment thresholds, the actual observed treated BP values were retained in order not to reduce them by substitution of the treatment threshold values. Results The multipoint linkage analysis revealed strong linkage signals for SBP compared with DBP. The strongest evidence for linkage of SBP (model 1, LOD = 5.0; model 2, LOD = 3.6) was found on chromosome 6q14.1 near the marker D6S1031 (89 cM) in both models. In addition, some evidence for SBP linkage occurred on chromosomes 1q, 4p, and 16p. Most importantly, our major SBP linkage finding on chromosome 6q near marker D6S1031 was independently confirmed in a Caucasian population (LOD = 3.3). In summary, our study found evidence for a major locus on chromosome 6q influencing SBP levels in Mexican-Americans. PMID:21293138
A genome-wide siRNA screen for regulators of tumor suppressor p53 activity in human non-small lung cancer cells identifies components of the RNA splicing machinery as targets for anticancer treatment.
Siebring-van Olst, Ellen; Blijlevens, Maxime; de Menezes, Renee X; van der Meulen-Muileman, Ida H; Smit, Egbert F; van Beusechem, Victor W
Reinstating wild-type tumor suppressor p53 activity could be a valuable option for the treatment of cancer. To contribute to development of new treatment options for non-small cell lung cancer (NSCLC), we performed genome-wide siRNA screens for determinants of p53 activity in NSCLC cells. We identified many genes not previously known to be involved in regulating p53 activity. Silencing p53 pathway inhibitor genes was associated with loss of cell viability. The largest functional gene cluster influencing p53 activity was mRNA splicing. Prominent p53 activation was observed upon silencing of specific spliceosome components, rather than by general inhibition of the spliceosome. Ten genes were validated as inhibitors of p53 activity in multiple NSCLC cell lines: genes encoding the Ras-pathway activator SOS1, the zinc finger protein TSHZ3, the mitochondrial membrane protein COX16 and the spliceosome components SNRPD3, SF3A3, SF3B1, SF3B6, XAB2, CWC22 and HNRNPL. Silencing these genes generally increased p53 levels, with distinct effects on CDKN1A expression, induction of cell cycle arrest and cell death. Silencing spliceosome components was associated with alternative splicing of MDM4 mRNA, which could contribute to activation of p53. In addition, silencing splice factors was particularly effective in killing NSCLC cells, albeit in a p53-independent manner. Interestingly, silencing SNRPD3 and SF3A3 exerted much stronger cytotoxicity to NSCLC cells than to lung fibroblasts, suggesting that these genes could represent useful therapeutic targets. This article is protected by copyright. All rights reserved.
Govindaraj, Periyasamy; Nizamuddin, Sheikh; Sharath, Anugula; Jyothi, Vuskamalla; Rotti, Harish; Raval, Ritu; Nayak, Jayakrishna; Bhat, Balakrishna K.; Prasanna, B. V.; Shintre, Pooja; Sule, Mayura; Joshi, Kalpana S.; Dedge, Amrish P.; Bharadwaj, Ramachandra; Gangadharan, G. G.; Nair, Sreekumaran; Gopinath, Puthiya M.; Patwardhan, Bhushan; Kondaiah, Paturu; Satyamoorthy, Kapaettu; Valiathan, Marthanda Varma Sankaran; Thangaraj, Kumarasamy
The practice of Ayurveda, the traditional medicine of India, is based on the concept of three major constitutional types (Vata, Pitta and Kapha) defined as “Prakriti”. To the best of our knowledge, no study has convincingly correlated genomic variations with the classification of Prakriti. In the present study, we performed genome-wide SNP (single nucleotide polymorphism) analysis (Affymetrix, 6.0) of 262 well-classified male individuals (after screening 3416 subjects) belonging to three Prakritis. We found 52 SNPs (p ≤ 1 × 10−5) were significantly different between Prakritis, without any confounding effect of stratification, after 106 permutations. Principal component analysis (PCA) of these SNPs classified 262 individuals into their respective groups (Vata, Pitta and Kapha) irrespective of their ancestry, which represent its power in categorization. We further validated our finding with 297 Indian population samples with known ancestry. Subsequently, we found that PGM1 correlates with phenotype of Pitta as described in the ancient text of Caraka Samhita, suggesting that the phenotypic classification of India’s traditional medicine has a genetic basis; and its Prakriti-based practice in vogue for many centuries resonates with personalized medicine. PMID:26511157
Motaung, Thabiso E; Ells, Ruan; Pohl, Carolina H; Albertyn, Jacobus; Tsilo, Toi J
Candida albicans is an important etiological agent of superficial and life-threatening infections in individuals with compromised immune systems. To date, we know of several overlapping genetic networks that govern virulence attributes in this fungal pathogen. Classical use of deletion mutants has led to the discovery of numerous virulence factors over the years, and genome-wide functional analysis has propelled gene discovery at an even faster pace. Indeed, a number of recent studies using large-scale genetic screens followed by genome-wide functional analysis has allowed for the unbiased discovery of many new genes involved in C. albicans biology. Here we share our perspectives on the role of these studies in analyzing fundamental aspects of C. albicans virulence properties.
Gusareva, Elena S.; Carrasquillo, Minerva M.; Bellenguez, Céline; Cuyvers, Elise; Colon, Samuel; Graff-Radford, Neill R.; Petersen, Ronald C.; Dickson, Dennis W.; Mahachie Johna, Jestinah M.; Bessonov, Kyrylo; Van Broeckhoven, Christine; Williams, Julie; Amouyel, Philippe; Sleegers, Kristel; Ertekin-Taner, Nilüfer; Lambert, Jean-Charles; Van Steen, Kristel
We propose a minimal protocol for exhaustive genome-wide association interaction analysis that involves screening for epistasis over large-scale genomic data combining strengths of different methods and statistical tools. The different steps of this protocol are illustrated on a real-life data application for Alzheimer's disease (AD) (2259 patients and 6017 controls from France). Particularly, in the exhaustive genome-wide epistasis screening we identified AD-associated interacting SNPs-pair from chromosome 6q11.1 (rs6455128, the KHDRBS2 gene) and 13q12.11 (rs7989332, the CRYL1 gene) (p = 0.006, corrected for multiple testing). A replication analysis in the independent AD cohort from Germany (555 patients and 824 controls) confirmed the discovered epistasis signal (p = 0.036). This signal was also supported by a meta-analysis approach in 5 independent AD cohorts that was applied in the context of epistasis for the first time. Transcriptome analysis revealed negative correlation between expression levels of KHDRBS2 and CRYL1 in both the temporal cortex (β = −0.19, p = 0.0006) and cerebellum (β = −0.23, p < 0.0001) brain regions. This is the first time a replicable epistasis associated with AD was identified using a hypothesis free screening approach. PMID:24958192
The development of the dorsal vessel in Drosophila is one of the first systems in which key mechanisms regulating cardiogenesis have been defined in great detail at the genetic and molecular level. Due to evolutionary conservation, these findings have also provided major inputs into studies of cardiogenesis in vertebrates. Many of the major components that control Drosophila cardiogenesis were discovered based on candidate gene approaches and their functions were defined by employing the outstanding genetic tools and molecular techniques available in this system. More recently, approaches have been taken that aim to interrogate the entire genome in order to identify novel components and describe genomic features that are pertinent to the regulation of heart development. Apart from classical forward genetic screens, the availability of the thoroughly annotated Drosophila genome sequence made new genome-wide approaches possible, which include the generation of massive numbers of RNA interference (RNAi) reagents that were used in forward genetic screens, as well as studies of the transcriptomes and proteomes of the developing heart under normal and experimentally manipulated conditions. Moreover, genome-wide chromatin immunoprecipitation experiments have been performed with the aim to define the full set of genomic binding sites of the major cardiogenic transcription factors, their relevant target genes, and a more complete picture of the regulatory network that drives cardiogenesis. This review will give an overview on these genome-wide approaches to Drosophila heart development and on computational analyses of the obtained information that ultimately aim to provide a description of this process at the systems level. PMID:27294102
Yong, Wai-Shin; Hsu, Fei-Man; Chen, Pao-Yang
DNA methylation is an epigenetic modification that plays an important role in regulating gene expression and therefore a broad range of biological processes and diseases. DNA methylation is tissue-specific, dynamic, sequence-context-dependent and trans-generationally heritable, and these complex patterns of methylation highlight the significance of profiling DNA methylation to answer biological questions. In this review, we surveyed major methylation assays, along with comparisons and biological examples, to provide an overview of DNA methylation profiling techniques. The advances in microarray and sequencing technologies make genome-wide profiling possible at a single-nucleotide or even a single-cell resolution. These profiling approaches vary in many aspects, such as DNA input, resolution, genomic region coverage, and bioinformatics analysis, and selecting a feasible method requires knowledge of these methods. We first introduce the biological background of DNA methylation and its pattern in plants, animals and fungi. We present an overview of major experimental approaches to profiling genome-wide DNA methylation and hydroxymethylation and then extend to the single-cell methylome. To evaluate these methods, we outline their strengths and weaknesses and perform comparisons across the different platforms. Due to the increasing need to compute high-throughput epigenomic data, we interrogate the computational pipeline for bisulfite sequencing data and also discuss the concept of identifying differentially methylated regions (DMRs). This review summarizes the experimental and computational concepts for profiling genome-wide DNA methylation, followed by biological examples. Overall, this review provides researchers useful guidance for the selection of a profiling method suited to specific research questions.
Duan, Jubao; Sanders, Alan R.; Gejman, Pablo V.
Schizophrenia (SZ) is a common and severe psychiatric disorder with both environmental and genetic risk factors, and a high heritability. After over 20 years of molecular genetics research, new molecular strategies, primarily genome-wide association studies (GWAS), have generated major tangible progress. This new data provides evidence for: 1) A number of chromosomal regions with common polymorphisms showing genome-wide association with SZ (the major histocompatibility complex, MHC, region at 6p22-p21; 18q21.2; and 2q32.1). The associated alleles present small odds ratios (the odds of a risk variant being present in cases versus controls) and suggest causative involvement of gene regulatory mechanisms in SZ. 2) Polygenic inheritance. 3) Involvement of rare (<1%) and large (>100kb) copy number variants (CNVs). 4) A genetic overlap of SZ with autism and with bipolar disorder (BP) challenging the classical clinical classifications. Most new SZ findings (chromosomal regions and genes) have generated new biological leads. These new findings, however, still need to be translated into a better understanding of the underlying biology and into causal mechanisms. Furthermore, a considerable amount of heritability still remains unexplained (missing heritability). Deep resequencing for rare variants and system biology approaches (e.g., integrating DNA sequence and functional data) are expected to further improve our understanding of the genetic architecture of SZ and its underlying biology. PMID:20433910
As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide...
RNAi screening in primary human hepatocytes of genes implicated in genome-wide association studies for roles in type 2 diabetes identifies roles for CAMK1D and CDKAL1, among others, in hepatic glucose regulation.
Haney, Steven; Zhao, Juan; Tiwari, Shiwani; Eng, Kurt; Guey, Lin T; Tien, Eric
Genome-wide association (GWA) studies have described a large number of new candidate genes that contribute to of Type 2 Diabetes (T2D). In some cases, small clusters of genes are implicated, rather than a single gene, and in all cases, the genetic contribution is not defined through the effects on a specific organ, such as the pancreas or liver. There is a significant need to develop and use human cell-based models to examine the effects these genes may have on glucose regulation. We describe the development of a primary human hepatocyte model that adjusts glucose disposition according to hormonal signals. This model was used to determine whether candidate genes identified in GWA studies regulate hepatic glucose disposition through siRNAs corresponding to the list of identified genes. We find that several genes affect the storage of glucose as glycogen (glycolytic response) and/or affect the utilization of pyruvate, the critical step in gluconeogenesis. Of the genes that affect both of these processes, CAMK1D, TSPAN8 and KIF11 affect the localization of a mediator of both gluconeogenesis and glycolysis regulation, CRTC2, to the nucleus in response to glucagon. In addition, the gene CDKAL1 was observed to affect glycogen storage, and molecular experiments using mutant forms of CDK5, a putative target of CDKAL1, in HepG2 cells show that this is mediated by coordinate regulation of CDK5 and PKA on MEK, which ultimately regulates the phosphorylation of ribosomal protein S6, a critical step in the insulin signaling pathway.
Leduc, Frédéric; Faucher, David; Bikond Nkoma, Geneviève; Grégoire, Marie-Chantal; Arguin, Mélina; Wellinger, Raymund J; Boissonneault, Guylain
Determination of cellular DNA damage has so far been limited to global assessment of genome integrity whereas nucleotide-level mapping has been restricted to specific loci by the use of specific primers. Therefore, only limited DNA sequences can be studied and novel regions of genomic instability can hardly be discovered. Using a well-characterized yeast model, we describe a straightforward strategy to map genome-wide DNA strand breaks without compromising nucleotide-level resolution. This technique, termed "damaged DNA immunoprecipitation" (dDIP), uses immunoprecipitation and the terminal deoxynucleotidyl transferase-mediated dUTP-biotin end-labeling (TUNEL) to capture DNA at break sites. When used in combination with microarray or next-generation sequencing technologies, dDIP will allow researchers to map genome-wide DNA strand breaks as well as other types of DNA damage and to establish a clear profiling of altered genes and/or intergenic sequences in various experimental conditions. This mapping technique could find several applications for instance in the study of aging, genotoxic drug screening, cancer, meiosis, radiation and oxidative DNA damage.
Stadler, Zsofia K.; Thom, Peter; Robson, Mark E.; Weitzel, Jeffrey N.; Kauff, Noah D.; Hurley, Karen E.; Devlin, Vincent; Gold, Bert; Klein, Robert J.; Offit, Kenneth
Knowledge of the inherited risk for cancer is an important component of preventive oncology. In addition to well-established syndromes of cancer predisposition, much remains to be discovered about the genetic variation underlying susceptibility to common malignancies. Increased knowledge about the human genome and advances in genotyping technology have made possible genome-wide association studies (GWAS) of human diseases. These studies have identified many important regions of genetic variation associated with an increased risk for human traits and diseases including cancer. Understanding the principles, major findings, and limitations of GWAS is becoming increasingly important for oncologists as dissemination of genomic risk tests directly to consumers is already occurring through commercial companies. GWAS have contributed to our understanding of the genetic basis of cancer and will shed light on biologic pathways and possible new strategies for targeted prevention. To date, however, the clinical utility of GWAS-derived risk markers remains limited. PMID:20585100
Harari, Yaniv; Kupiec, Martin
Telomeres are specialized DNA-protein structures at the ends of eukaryotic chromosomes. Telomeres are essential for chromosomal stability and integrity, as they prevent chromosome ends from being recognized as double strand breaks. In rapidly proliferating cells, telomeric DNA is synthesized by the enzyme telomerase, which copies a short template sequence within its own RNA moiety, thus helping to solve the “end-replication problem”, in which information is lost at the ends of chromosomes with each DNA replication cycle. The basic mechanisms of telomere length, structure and function maintenance are conserved among eukaryotes. Studies in the yeast Saccharomyces cerevisiae have been instrumental in deciphering the basic aspects of telomere biology. In the last decade, technical advances, such as the availability of mutant collections, have allowed carrying out systematic genome-wide screens for mutants affecting various aspects of telomere biology. In this review we summarize these efforts, and the insights that this Systems Biology approach has produced so far.
Mullapudi, Nandita; Ye, Bin; Suzuki, Masako; Fazzari, Melissa; Han, Weiguo; Shi, Miao K; Marquardt, Gaby; Lin, Juan; Wang, Tao; Keller, Steven; Zhu, Changcheng; Locker, Joseph D; Spivack, Simon D
Aberrant cytosine 5-methylation underlies many deregulated elements of cancer. Among paired non-small cell lung cancers (NSCLC), we sought to profile DNA 5-methyl-cytosine features which may underlie genome-wide deregulation. In one of the more dense interrogations of the methylome, we sampled 1.2 million CpG sites from twenty-four NSCLC tumor (T)-non-tumor (NT) pairs using a methylation-sensitive restriction enzyme- based HELP-microarray assay. We found 225,350 differentially methylated (DM) sites in adenocarcinomas versus adjacent non-tumor tissue that vary in frequency across genomic compartment, particularly notable in gene bodies (GB; p<2.2E-16). Further, when DM was coupled to differential transcriptome (DE) in the same samples, 37,056 differential loci in adenocarcinoma emerged. Approximately 90% of the DM-DE relationships were non-canonical; for example, promoter DM associated with DE in the same direction. Of the canonical changes noted, promoter (PR) DM loci with reciprocal changes in expression in adenocarcinomas included HBEGF, AGER, PTPRM, DPT, CST1, MELK; DM GB loci with concordant changes in expression included FOXM1, FERMT1, SLC7A5, and FAP genes. IPA analyses showed adenocarcinoma-specific promoter DMxDE overlay identified familiar lung cancer nodes [tP53, Akt] as well as less familiar nodes [HBEGF, NQO1, GRK5, VWF, HPGD, CDH5, CTNNAL1, PTPN13, DACH1, SMAD6, LAMA3, AR]. The unique findings from this study include the discovery of numerous candidate The unique findings from this study include the discovery of numerous candidate methylation sites in both PR and GB regions not previously identified in NSCLC, and many non-canonical relationships to gene expression. These DNA methylation features could potentially be developed as risk or diagnostic biomarkers, or as candidate targets for newer methylation locus-targeted preventive or therapeutic agents.
Suzuki, Masako; Fazzari, Melissa; Han, Weiguo; Shi, Miao K.; Marquardt, Gaby; Lin, Juan; Wang, Tao; Keller, Steven; Zhu, Changcheng; Locker, Joseph D.; Spivack, Simon D.
Aberrant cytosine 5-methylation underlies many deregulated elements of cancer. Among paired non-small cell lung cancers (NSCLC), we sought to profile DNA 5-methyl-cytosine features which may underlie genome-wide deregulation. In one of the more dense interrogations of the methylome, we sampled 1.2 million CpG sites from twenty-four NSCLC tumor (T)–non-tumor (NT) pairs using a methylation-sensitive restriction enzyme- based HELP-microarray assay. We found 225,350 differentially methylated (DM) sites in adenocarcinomas versus adjacent non-tumor tissue that vary in frequency across genomic compartment, particularly notable in gene bodies (GB; p<2.2E-16). Further, when DM was coupled to differential transcriptome (DE) in the same samples, 37,056 differential loci in adenocarcinoma emerged. Approximately 90% of the DM-DE relationships were non-canonical; for example, promoter DM associated with DE in the same direction. Of the canonical changes noted, promoter (PR) DM loci with reciprocal changes in expression in adenocarcinomas included HBEGF, AGER, PTPRM, DPT, CST1, MELK; DM GB loci with concordant changes in expression included FOXM1, FERMT1, SLC7A5, and FAP genes. IPA analyses showed adenocarcinoma-specific promoter DMxDE overlay identified familiar lung cancer nodes [tP53, Akt] as well as less familiar nodes [HBEGF, NQO1, GRK5, VWF, HPGD, CDH5, CTNNAL1, PTPN13, DACH1, SMAD6, LAMA3, AR]. The unique findings from this study include the discovery of numerous candidate The unique findings from this study include the discovery of numerous candidate methylation sites in both PR and GB regions not previously identified in NSCLC, and many non-canonical relationships to gene expression. These DNA methylation features could potentially be developed as risk or diagnostic biomarkers, or as candidate targets for newer methylation locus-targeted preventive or therapeutic agents. PMID:26683690
Gois, I B; Borém, A; Cristofani-Yaly, M; de Resende, M D V; Azevedo, C F; Bastianel, M; Novelli, V M; Machado, M A
Genome wide selection (GWS) is essential for the genetic improvement of perennial species such as Citrus because of its ability to increase gain per unit time and to enable the efficient selection of characteristics with low heritability. This study assessed GWS efficiency in a population of Citrus and compared it with selection based on phenotypic data. A total of 180 individual trees from a cross between Pera sweet orange (Citrus sinensis Osbeck) and Murcott tangor (Citrus sinensis Osbeck x Citrus reticulata Blanco) were evaluated for 10 characteristics related to fruit quality. The hybrids were genotyped using 5287 DArT_seq(TM) (diversity arrays technology) molecular markers and their effects on phenotypes were predicted using the random regression - best linear unbiased predictor (rr-BLUP) method. The predictive ability, prediction bias, and accuracy of GWS were estimated to verify its effectiveness for phenotype prediction. The proportion of genetic variance explained by the markers was also computed. The heritability of the traits, as determined by markers, was 16-28%. The predictive ability of these markers ranged from 0.53 to 0.64, and the regression coefficients between predicted and observed phenotypes were close to unity. Over 35% of the genetic variance was accounted for by the markers. Accuracy estimates with GWS were lower than those obtained by phenotypic analysis; however, GWS was superior in terms of genetic gain per unit time. Thus, GWS may be useful for Citrus breeding as it can predict phenotypes early and accurately, and reduce the length of the selection cycle. This study demonstrates the feasibility of genomic selection in Citrus.
Ma, Meng; Dou, Taocun; Lu, Jian; Guo, Jun; Hu, Yuping; Yi, Guoqiang; Yuan, Jingwei; Sun, Congjiao; Wang, Kehua; Yang, Ning
The comb, as a secondary sexual character, is an important trait in chicken. Indicators of comb length (CL), comb height (CH), and comb weight (CW) are often selected in production. DNA-based marker-assisted selection could help chicken breeders to accelerate genetic improvement for comb or related economic characters by early selection. Although a number of quantitative trait loci (QTL) and candidate genes have been identified with advances in molecular genetics, candidate genes underlying comb traits are limited. The aim of the study was to use genome-wide association (GWA) studies by 600 K Affymetrix chicken SNP arrays to detect genes that are related to comb, using an F2 resource population. For all comb characters, comb exhibited high SNP-based heritability estimates (0.61–0.69). Chromosome 1 explained 20.80% genetic variance, while chromosome 4 explained 6.89%. Independent univariate genome-wide screens for each character identified 127, 197, and 268 novel significant SNPs with CL, CH, and CW, respectively. Three candidate genes, VPS36, AR, and WNT11B, were determined to have a plausible function in all comb characters. These genes are important to the initiation of follicle development, gonadal growth, and dermal development, respectively. The current study provides the first GWA analysis for comb traits. Identification of the genetic basis as well as promising candidate genes will help us understand the underlying genetic architecture of comb development and has practical significance in breeding programs for the selection of comb as an index for sexual maturity or reproduction. PMID:27427764
A genome-wide screen identifies Salmonella Enteritidis lipopolysaccharide biosynthesis and the HtrA heat shock protein as crucial factors involved in egg white persistence at chicken body temperature.
Raspoet, R; Shearer, N; Appia-Ayme, C; Haesebrouck, F; Ducatelle, R; Thompson, A; Van Immerseel, F
Eggs contaminated with Salmonella Enteritidis are an important source of human foodborne Salmonella infections. Salmonella Enteritidis is able to contaminate egg white during formation of the egg within the chicken oviduct, and it has developed strategies to withstand the antimicrobial properties of egg white to survive in this hostile environment. The mechanisms involved in the persistence of Salmonella Enteritidis in egg white are likely to be complex. To address this issue, a microarray-based transposon library screen was performed to identify genes necessary for survival of Salmonella Enteritidis in egg white at chicken body temperature. The majority of identified genes belonged to the lipopolysaccharide biosynthesis pathway. Additionally, we provide evidence that the serine protease/heat shock protein (HtrA) appears essential for the survival of Salmonella Enteritidis in egg white at chicken body temperature.
St Julien, Krystal R; Jelliffe-Pawlowski, Laura L; Shaw, Gary M; Stevenson, David K; O'Brodovich, Hugh M; Krasnow, Mark A
Spots of blood are routinely collected from newborn babies onto filter paper called Guthrie cards and used to screen for metabolic and genetic disorders. The archived dried blood spots are an important and precious resource for genomic research. Whole genome amplification of dried blood spot DNA has been used to provide DNA for genome-wide SNP genotyping. Here we describe a 96 well format procedure to extract DNA from a portion of a dried blood spot that provides sufficient unamplified genomic DNA for genome-wide single nucleotide polymorphism (SNP) genotyping. We show that SNP genotyping of the unamplified DNA is more robust than genotyping amplified dried blood spot DNA, is comparable in cost, and can be done with thousands of samples. This procedure can be used for genome-wide association studies and other large-scale genomic analyses that require robust, high-accuracy genotyping of dried blood spot DNA.
Ahmad, Aftab; Niwa, Yasuo; Goto, Shingo; Kobayashi, Kyoko; Shimizu, Masanori; Ito, Sohei; Usui, Yumiko; Nakayama, Tsutomu; Kobayashi, Hirokazu
Salinity represents a major abiotic stress factor that can adversely limit the production, quality and geographical distribution of crops. In this study we focused on dedifferentiated calli with fundamental cell functions, the salt tolerance of which had not been previously examined. The experimental approach was based on activation tagging without regeneration of plants for the identification of salt-tolerant mutants of Arabidopsis. Among 62,000 transformed calli that were screened, 18 potential mutants resistant to 150 mM NaCl were obtained. Thermal asymmetric interlaced (TAIL)-PCR was performed to determine the location of T-DNA integration in the genome. In one line, referred to as salt tolerant callus 1 (stc1), expression of a gene [At4g39800: myo-inositol-1-P-synthase 1 (MIPS1)] was considerably enhanced in calli. Plants regenerated from calli showed tolerance to salt in germination and subsequent growth. Retransformation of wild-type Arabidopsis with MIPS1 conferred salt tolerance, indicating that MIPS1 is the causal gene. The over-expression of MIPS1 increased the content of total inositol. The involvement of MIPS1 in salt tolerance through the fundamental cell growth has been proved in Arabidopsis. PMID:25978457
The aryl hydrocarbon receptor (AHR) has a plethora of physiological roles, and upon dysregulation, carcinogenesis can occur. One target gene of AHR encodes the xenobiotic and drug-metabolizing enzyme CYP1A1, which is inducible by the environmental contaminant 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD) via the AHR. An siRNA library targeted against over 5600 gene candidates in the druggable genome was used to transfect mouse Hepa-1 cells, which were then treated with TCDD, and subsequently assayed for CYP1A1-dependent ethoxyresorufin-o-deethylase (EROD) activity. Following redundant siRNA activity (RSA) statistical analysis, we identified 93 hits that reduced EROD activity with a p value ≤ .005 and substantiated 39 of these as positive hits in a secondary screening using endoribonuclease-prepared siRNAs (esiRNAs). Twelve of the corresponding gene products were subsequently confirmed to be necessary for the induction of CYP1A1 messenger RNA by TCDD. None of the candidates were deficient in aryl hydrocarbon nuclear translocator expression. However 6 gene products including UBE2i, RAB40C, CRYGD, DCTN4, RBM5, and RAD50 are required for the expression of AHR as well as for induction of CYP1A1. We also found 2 gene products, ARMC8 and TCF20, to be required for the induction of CYP1A1, but our data are ambiguous as to whether they are required for the expression of AHR. In contrast, SIN3A, PDC, TMEM5, and CD9 are not required for AHR expression but are required for the induction of CYP1A1, implicating a direct role in Cyp1a1 transcription. Our methods, although applied to Cyp1a1, could be modified for identifying proteins that regulate other inducible genes. PMID:23997114
Mondul, Alison M; Yu, Kai; Wheeler, William; Zhang, Hong; Weinstein, Stephanie J; Major, Jacqueline M; Cornelis, Marilyn C; Männistö, Satu; Hazra, Aditi; Hsing, Ann W; Jacobs, Kevin B; Eliassen, Heather; Tanaka, Toshiko; Reding, Douglas J; Hendrickson, Sara; Ferrucci, Luigi; Virtamo, Jarmo; Hunter, David J; Chanock, Stephen J; Kraft, Peter; Albanes, Demetrius
Retinol is one of the most biologically active forms of vitamin A and is hypothesized to influence a wide range of human diseases including asthma, cardiovascular disease, infectious diseases and cancer. We conducted a genome-wide association study of 5006 Caucasian individuals drawn from two cohorts of men: the Alpha-Tocopherol, Beta-Carotene Cancer Prevention (ATBC) Study and the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening Trial. We identified two independent single-nucleotide polymorphisms associated with circulating retinol levels, which are located near the transthyretin (TTR) and retinol binding protein 4 (RBP4) genes which encode major carrier proteins of retinol: rs1667255 (P =2.30× 10(-17)) and rs10882272 (P =6.04× 10(-12)). We replicated the association with rs10882272 in RBP4 in independent samples from the Nurses' Health Study and the Invecchiare in Chianti Study (InCHIANTI) that included 3792 women and 504 men (P =9.49× 10(-5)), but found no association for retinol with rs1667255 in TTR among women, thus suggesting evidence for gender dimorphism (P-interaction=1.31× 10(-5)). Discovery of common genetic variants associated with serum retinol levels may provide further insight into the contribution of retinol and other vitamin A compounds to the development of cancer and other complex diseases.
Lystig, Theodore C
Genome-wide scans for quantitative trait loci (QTL) have traditionally been summarized with plots of logarithm of odds (LOD) scores. A valuable modification is to supplement such plots with an additional vertical axis displaying quantiles of adjusted P values and labeling local maxima of the LOD scores with location-specific adjusted P values. This provides a visible gradation of genome-wide significance for the LOD score curve, instead of the stark dichotomy that a single threshold yields. Adjusted P values give genome-wide significance of individual LOD scores and are obtained through a straightforward modification of the familiar algorithm for generating permutation-based thresholds. PMID:12930772
... historical) Genome-Wide Scan Reveals Mutation Associated with Melanoma A team of international researchers supported by the ... when they divide and grow uncontrollably, develop into melanoma. Also, MITF activity is known to be amplified ...
Otowa, Takeshi; Kawamura, Yoshiya; Tsutsumi, Akizumi; Kawakami, Norito; Kan, Chiemi; Shimada, Takafumi; Umekage, Tadashi; Kasai, Kiyoto; Tokunaga, Katsushi; Sasaki, Tsukasa
Stressful events have been identified as a risk factor for depression. Although gene–environment (G × E) interaction in a limited number of candidate genes has been explored, no genome-wide search has been reported. The aim of the present study is to identify genes that influence the association of stressful events with depression. Therefore, we performed a genome-wide G × E interaction analysis in the Japanese population. A genome-wide screen with 320 subjects was performed using the Affymetrix Genome-Wide Human Array 6.0. Stressful life events were assessed using the Social Readjustment Rating Scale (SRRS) and depression symptoms were assessed with self-rating questionnaires using the Center for Epidemiologic Studies Depression (CES-D) scale. The p values for interactions between single nucleotide polymorphisms (SNPs) and stressful events were calculated using the linear regression model adjusted for sex and age. After quality control of genotype data, a total of 534,848 SNPs on autosomal chromosomes were further analyzed. Although none surpassed the level of the genome-wide significance, a marginal significant association of interaction between SRRS and rs10510057 with depression were found (p = 4.5 × 10−8). The SNP is located on 10q26 near Regulators of G-protein signaling 10 (RGS10), which encodes a regulatory molecule involved in stress response. When we investigated a similar G × E interaction between depression (K6 scale) and work-related stress in an independent sample (n = 439), a significant G × E effect on depression was observed (p = 0.015). Our findings suggest that rs10510057, interacting with stressors, may be involved in depression risk. Incorporating G × E interaction into GWAS can contribute to find susceptibility locus that are potentially missed by conventional GWAS. PMID:27529621
Shen, Xun; Collier, John Michael; Hlaing, Myint; Zhang, Leanne; Delshad, Elizabeth H.; Bristow, James; Bernstein, Harold S.
Skeletal and cardiac myocytes cease division within weeks of birth. Although skeletal muscle retains limited capacity for regeneration through recruitment of satellite cells, resident populations of adult myocardial stem cells have not been identified. Because cell cycle withdrawal accompanies myocyte differentiation, we hypothesized that C2C12 cells, a mouse myoblast cell line previously used to characterize myocyte differentiation, also would provide a model for studying cell cycle withdrawal during differentiation. C2C12 cells were differentiated in culture medium containing horse serum and harvested at various time points to characterize the expression profiles of known cell cycle and myogenic regulatory factors by immunoblot analysis. BrdU incorporation decreased dramatically in confluent cultures 48 hr after addition of horse serum, as cells started to form myotubes. This finding was preceded by up-regulation of MyoD, followed by myogenin, and activation of Bcl-2. Cyclin D1 was expressed in proliferating cultures and became undetectable in cultures containing 40 percent fused myotubes, as levels of p21(WAF1/Cip1) increased and alpha-actin became detectable. Because C2C12 myoblasts withdraw from the cell cycle during myocyte differentiation following a course that recapitulates this process in vivo, we performed a genome-wide screen to identify other gene products involved in this process. Using microarrays containing approximately 10,000 minimally redundant mouse sequences that map to the UniGene database of the National Center for Biotechnology Information, we compared gene expression profiles between proliferating, differentiating, and differentiated C2C12 cells and verified candidate genes demonstrating differential expression by RT-PCR. Cluster analysis of differentially expressed genes revealed groups of gene products involved in cell cycle withdrawal, muscle differentiation, and apoptosis. In addition, we identified several genes, including DDAH2 and Ly
Bushman, Frederic D; Malani, Nirav; Fernandes, Jason; D'Orso, Iván; Cagney, Gerard; Diamond, Tracy L; Zhou, Honglin; Hazuda, Daria J; Espeseth, Amy S; König, Renate; Bandyopadhyay, Sourav; Ideker, Trey; Goff, Stephen P; Krogan, Nevan J; Frankel, Alan D; Young, John A T; Chanda, Sumit K
We have analyzed host cell genes linked to HIV replication that were identified in nine genome-wide studies, including three independent siRNA screens. Overlaps among the siRNA screens were very modest (<7% for any pairwise combination), and similarly, only modest overlaps were seen in pairwise comparisons with other types of genome-wide studies. Combining all genes from the genome-wide studies together with genes reported in the literature to affect HIV yields 2,410 protein-coding genes, or fully 9.5% of all human genes (though of course some of these are false positive calls). Here we report an "encyclopedia" of all overlaps between studies (available at http://www.hostpathogen.org), which yielded a more extensively corroborated set of host factors assisting HIV replication. We used these genes to calculate refined networks that specify cellular subsystems recruited by HIV to assist in replication, and present additional analysis specifying host cell genes that are attractive as potential therapeutic targets.
Wu, Xuesen; Dong, Hua; Luo, Li; Zhu, Yun; Peng, Gang; Reveille, John D; Xiong, Momiao
Although great progress in genome-wide association studies (GWAS) has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interactional analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked). The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDR<0.001 and 0.001
Genome-wide association study (GWAS) has appeared as a widespread strategy in decoding genotype-phenotype associations in many species thanks to technical advances in next-generation sequencing (NGS) applications. Maize is an ideal crop for GWAS and significant progress has been made in the last dec...
Genome-Wide Association Studies shed light on the identification of genes underlying human diseases and agriculturally important traits. This potential has been shadowed by false positive findings. The Mixed Linear Model (MLM) method is flexible enough to simultaneously incorporate population struct...
We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded genome-wide significant associations with schizophrenia for seven loci, five of which are new (1p21.3, 2q32.3, 8p23.2, 8q21.3 and 10q24.32-q24.33) and two of which have been previously implicated (6p21.32-p22.1 and 18q21.2). The strongest new finding (P = 1.6 × 10−11) was with rs1625579 within an intron of a putative primary transcript for MIR137 (microRNA 137), a known regulator of neuronal development. Four other schizophrenia loci achieving genome-wide significance contain predicted targets of MIR137, suggesting MIR137-mediated dysregulation as a previously unknown etiologic mechanism in schizophrenia. In a joint analysis with a bipolar disorder sample (16,374 affected individuals and 14,044 controls), three loci reached genome-wide significance: CACNA1C (rs4765905, P = 7.0 × 10−9), ANK3 (rs10994359, P = 2.5 × 10−8) and the ITIH3-ITIH4 region (rs2239547, P = 7.8 × 10−9). PMID:21926974
Soybean aphid is the most damaging insect pest of soybean in the Upper Midwest and is primarily controlled by insecticides. Soybean aphid resistance (i.e., Rag genes) has been documented in some soybean lines at chromosomes 6, 7, 13, and 16, but more sources of resistance are needed. Genome-wide ass...
Weiss, Lauren A.; Arking, Dan E.
Summary Although autism is a highly heritable neurodevelopmental disorder, attempts to identify specific susceptibility genes have thus far met with limited success 1. Genome-wide association studies (GWAS) using half a million or more markers, particularly those with very large sample sizes achieved through meta-analysis, have shown great success in mapping genes for other complex genetic traits (http://www.genome.gov/26525384). Consequently, we initiated a linkage and association mapping study using half a million genome-wide SNPs in a common set of 1,031 multiplex autism families (1,553 affected offspring). We identified regions of suggestive and significant linkage on chromosomes 6q27 and 20p13, respectively. Initial analysis did not yield genome-wide significant associations; however, genotyping of top hits in additional families revealed a SNP on chromosome 5p15 (between SEMA5A and TAS2R1) that was significantly associated with autism (P = 2 × 10−7). We also demonstrated that expression of SEMA5A is reduced in brains from autistic patients, further implicating SEMA5A as an autism susceptibility gene. The linkage regions reported here provide targets for rare variation screening while the discovery of a single novel association demonstrates the action of common variants. PMID:19812673
Genovesio, Auguste; Kwon, Yong-Jun; Windisch, Marc P; Kim, Nam Youl; Choi, Seo Yeon; Kim, Hi Chul; Jung, Sungyong; Mammano, Fabrizio; Perrin, Virginie; Boese, Annette S; Casartelli, Nicoletta; Schwartz, Olivier; Nehrbass, Ulf; Emans, Neil
Recent genome-wide RNAi screens have identified >842 human genes that affect the human immunodeficiency virus (HIV) cycle. The list of genes implicated in infection differs between screens, and there is minimal overlap. A reason for this variance is the interdependence of HIV infection and host cell function, producing a multitude of indirect or pleiotropic cellular effects affecting the viral infection during RNAi screening. To overcome this, the authors devised a 15-dimensional phenotypic profile to define the viral infection block induced by CD4 silencing in HeLa cells. They demonstrate that this phenotypic profile excludes nonspecific, RNAi-based side effects and viral replication defects mediated by silencing of housekeeping genes. To achieve statistical robustness, the authors used automatically annotated RNAi arrays for seven independent genome-wide RNAi screens. This identified 56 host genes, which reliably reproduced CD4-like phenotypes upon HIV infection. The factors include 11 known HIV interactors and 45 factors previously not associated with HIV infection. As proof of concept, the authors confirmed that silencing of PAK1, Ku70, and RNAseH2A impaired HIV replication in Jurkat cells. In summary, multidimensional, visual profiling can identify genes required for HIV infection.
Lv, Mengxin; Zhong, Zhenyu; Chi, Hong; Huang, Mengge; Jiang, Rong; Chen, Junxia
miRNAs have emerged as promising markers for tumors. However, the underlying mechanism of specific miRNAs in bladder cancer (BC) remains largely unknown. Here, a comprehensive miRNA/mRNA expression profile was executed by microarray assay for four pairs of bladder carcinoma and para-carcinoma tissues from patients with grade 2 (G2) T2. A total of 99 miRNAs and 4416 mRNAs were discovered to be significantly differentially expressed in BC tissues compared with controls. Five microRNAs and two mRNAs were validated by qRT-PCR in 30 pairs of samples, including G1–G3/T1–T4. Subsequently, we constructed a network with the five miRNAs-target mRNAs; gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were utilized to recognize the functions and associated pathways. Moreover, we further found that miR-130b-3p was significantly up-regulated and negatively correlated with phosphatase and tensin homolog (PTEN) expression in bladder cancer tissues. Next, we demonstrated that miR-130b-3p might target PTEN through bioinformatics and dual-luciferase reporter assay. Finally, we showed that miR-130b-3p could down-regulate PTEN expression, which promoted proliferation, migration, invasion and rearranged cytoskeleton through the activation of the PI3K and integrin β1 signaling pathway in bladder cancer cells. Inversely, miR-130b-3p inhibitors induced apoptosis. Taken together, this research investigated, for the first time, miR-130b-3p by an incorporated analysis of microRNA/mRNA expressions of a genome-wide screen in BC. Our findings suggest that the miR-130b-3p/PTEN/integrin β1 axis could play a critical role in the progression and development of BC and that miR-130b-3p might be a valuable clinical marker and therapeutical target for BC patients. PMID:28042869
Eglė, Preikšaitienė; Laima, Ambrozaitytė; Živilė, Maldžienė; Aušra, Morkūnienė,; Loreta, Cimbalistienė; Tautvydas, Rančelis; Algirdas, Utkus; Vaidutis, Kučinskas
Background. Intellectual disability affects about 1–2% of the general population worldwide, and this is the leading socio-economic problem of health care. The evaluation of the genetic causes of intellectual disability is challenging because these conditions are genetically heterogeneous with many different genetic alterations resulting in clinically indistinguishable phenotypes. Genome wide molecular technologies are effective in a research setting for establishing the new genetic basis of a disease. We describe the first Lithuanian experience in genome-wide CNV detection and whole exome sequencing, presenting the results obtained in the research project UNIGENE. Materials and methods. The patients with developmental delay/intellectual disability have been investigated (n = 66). Diagnostic screening was performed using array-CGH technology. FISH and real time-PCR were used for the confirmation of gene-dose imbalances and investigation of parental samples. Whole exome sequencing using the next generation high throughput NGS technique was used to sequence the samples of 12 selected families. Results. 14 out of 66 patients had pathogenic copy number variants, and one patient had novel likely pathogenic aberration (microdeletion at 4p15.2). Twelve families have been processed for whole exome sequencing. Two identified sequence variants could be classified as pathogenic (in MECP2, CREBBP genes). The other families had several candidate intellectual disability gene variants that are of unclear clinical significance and must be further investigated for possible effect on the molecular pathways of intellectual disability. Conclusions. The genetic heterogeneity of intellectual disability requires genome wide approaches, including detection of chromosomal aberrations by chromosomal microarrays and whole exome sequencing capable of uncovering single gene mutations. This study demonstrates the benefits and challenges that accompany the use of genome wide molecular
Jahanshad, Neda; Rajagopalan, Priya; Hua, Xue; Hibar, Derrek P; Nir, Talia M; Toga, Arthur W; Jack, Clifford R; Saykin, Andrew J; Green, Robert C; Weiner, Michael W; Medland, Sarah E; Montgomery, Grant W; Hansell, Narelle K; McMahon, Katie L; de Zubicaray, Greig I; Martin, Nicholas G; Wright, Margaret J; Thompson, Paul M
Aberrant connectivity is implicated in many neurological and psychiatric disorders, including Alzheimer's disease and schizophrenia. However, other than a few disease-associated candidate genes, we know little about the degree to which genetics play a role in the brain networks; we know even less about specific genes that influence brain connections. Twin and family-based studies can generate estimates of overall genetic influences on a trait, but genome-wide association scans (GWASs) can screen the genome for specific variants influencing the brain or risk for disease. To identify the heritability of various brain connections, we scanned healthy young adult twins with high-field, high-angular resolution diffusion MRI. We adapted GWASs to screen the brain's connectivity pattern, allowing us to discover genetic variants that affect the human brain's wiring. The association of connectivity with the SPON1 variant at rs2618516 on chromosome 11 (11p15.2) reached connectome-wide, genome-wide significance after stringent statistical corrections were enforced, and it was replicated in an independent subsample. rs2618516 was shown to affect brain structure in an elderly population with varying degrees of dementia. Older people who carried the connectivity variant had significantly milder clinical dementia scores and lower risk of Alzheimer's disease. As a posthoc analysis, we conducted GWASs on several organizational and topological network measures derived from the matrices to discover variants in and around genes associated with autism (MACROD2), development (NEDD4), and mental retardation (UBE2A) significantly associated with connectivity. Connectome-wide, genome-wide screening offers substantial promise to discover genes affecting brain connectivity and risk for brain diseases.
Jahanshad, Neda; Rajagopalan, Priya; Hua, Xue; Hibar, Derrek P.; Nir, Talia M.; Toga, Arthur W.; Jack, Clifford R.; Saykin, Andrew J.; Green, Robert C.; Weiner, Michael W.; Medland, Sarah E.; Montgomery, Grant W.; Hansell, Narelle K.; McMahon, Katie L.; de Zubicaray, Greig I.; Martin, Nicholas G.; Wright, Margaret J.; Thompson, Paul M.; Weiner, Michael; Aisen, Paul; Weiner, Michael; Aisen, Paul; Petersen, Ronald; Jack, Clifford R.; Jagust, William; Trojanowski, John Q.; Toga, Arthur W.; Beckett, Laurel; Green, Robert C.; Saykin, Andrew J.; Morris, John; Liu, Enchi; Green, Robert C.; Montine, Tom; Petersen, Ronald; Aisen, Paul; Gamst, Anthony; Thomas, Ronald G.; Donohue, Michael; Walter, Sarah; Gessert, Devon; Sather, Tamie; Beckett, Laurel; Harvey, Danielle; Gamst, Anthony; Donohue, Michael; Kornak, John; Jack, Clifford R.; Dale, Anders; Bernstein, Matthew; Felmlee, Joel; Fox, Nick; Thompson, Paul; Schuff, Norbert; Alexander, Gene; DeCarli, Charles; Jagust, William; Bandy, Dan; Koeppe, Robert A.; Foster, Norm; Reiman, Eric M.; Chen, Kewei; Mathis, Chet; Morris, John; Cairns, Nigel J.; Taylor-Reinwald, Lisa; Trojanowki, J.Q.; Shaw, Les; Lee, Virginia M.Y.; Korecka, Magdalena; Toga, Arthur W.; Crawford, Karen; Neu, Scott; Saykin, Andrew J.; Foroud, Tatiana M.; Potkin, Steven; Shen, Li; Khachaturian, Zaven; Frank, Richard; Snyder, Peter J.; Molchan, Susan; Kaye, Jeffrey; Quinn, Joseph; Lind, Betty; Dolen, Sara; Schneider, Lon S.; Pawluczyk, Sonia; Spann, Bryan M.; Brewer, James; Vanderswag, Helen; Heidebrink, Judith L.; Lord, Joanne L.; Petersen, Ronald; Johnson, Kris; Doody, Rachelle S.; Villanueva-Meyer, Javier; Chowdhury, Munir; Stern, Yaakov; Honig, Lawrence S.; Bell, Karen L.; Morris, John C.; Ances, Beau; Carroll, Maria; Leon, Sue; Mintun, Mark A.; Schneider, Stacy; Marson, Daniel; Griffith, Randall; Clark, David; Grossman, Hillel; Mitsis, Effie; Romirowsky, Aliza; deToledo-Morrell, Leyla; Shah, Raj C.; Duara, Ranjan; Varon, Daniel; Roberts, Peggy; Albert, Marilyn; Onyike, Chiadi; Kielb, Stephanie; Rusinek, Henry; de Leon, Mony J.; Glodzik, Lidia; De Santi, Susan; Doraiswamy, P. Murali; Petrella, Jeffrey R.; Coleman, R. Edward; Arnold, Steven E.; Karlawish, Jason H.; Wolk, David; Smith, Charles D.; Jicha, Greg; Hardy, Peter; Lopez, Oscar L.; Oakley, MaryAnn; Simpson, Donna M.; Porsteinsson, Anton P.; Goldstein, Bonnie S.; Martin, Kim; Makino, Kelly M.; Ismail, M. Saleem; Brand, Connie; Mulnard, Ruth A.; Thai, Gaby; Mc-Adams-Ortiz, Catherine; Womack, Kyle; Mathews, Dana; Quiceno, Mary; Diaz-Arrastia, Ramon; King, Richard; Weiner, Myron; Martin-Cook, Kristen; DeVous, Michael; Levey, Allan I.; Lah, James J.; Cellar, Janet S.; Burns, Jeffrey M.; Anderson, Heather S.; Swerdlow, Russell H.; Apostolova, Liana; Lu, Po H.; Bartzokis, George; Silverman, Daniel H.S.; Graff-Radford, Neill R.; Parfitt, Francine; Johnson, Heather; Farlow, Martin R.; Hake, Ann Marie; Matthews, Brandy R.; Herring, Scott; van Dyck, Christopher H.; Carson, Richard E.; MacAvoy, Martha G.; Chertkow, Howard; Bergman, Howard; Hosein, Chris; Black, Sandra; Stefanovic, Bojana; Caldwell, Curtis; Hsiung, Ging-Yuek Robin; Feldman, Howard; Mudge, Benita; Assaly, Michele; Kertesz, Andrew; Rogers, John; Trost, Dick; Bernick, Charles; Munic, Donna; Kerwin, Diana; Mesulam, Marek-Marsel; Lipowski, Kristina; Wu, Chuang-Kuo; Johnson, Nancy; Sadowsky, Carl; Martinez, Walter; Villena, Teresa; Turner, Raymond Scott; Johnson, Kathleen; Reynolds, Brigid; Sperling, Reisa A.; Johnson, Keith A.; Marshall, Gad; Frey, Meghan; Yesavage, Jerome; Taylor, Joy L.; Lane, Barton; Rosen, Allyson; Tinklenberg, Jared; Sabbagh, Marwan; Belden, Christine; Jacobson, Sandra; Kowall, Neil; Killiany, Ronald; Budson, Andrew E.; Norbash, Alexander; Johnson, Patricia Lynn; Obisesan, Thomas O.; Wolday, Saba; Bwayo, Salome K.; Lerner, Alan; Hudson, Leon; Ogrocki, Paula; Fletcher, Evan; Carmichael, Owen; Olichney, John; DeCarli, Charles; Kittur, Smita; Borrie, Michael; Lee, T.-Y.; Bartha, Rob; Johnson, Sterling; Asthana, Sanjay; Carlsson, Cynthia M.; Potkin, Steven G.; Preda, Adrian; Nguyen, Dana; Tariot, Pierre; Fleisher, Adam; Reeder, Stephanie; Bates, Vernice; Capote, Horacio; Rainka, Michelle; Scharre, Douglas W.; Kataki, Maria; Zimmerman, Earl A.; Celmins, Dzintra; Brown, Alice D.; Pearlson, Godfrey D.; Blank, Karen; Anderson, Karen; Saykin, Andrew J.; Santulli, Robert B.; Schwartz, Eben S.; Sink, Kaycee M.; Williamson, Jeff D.; Garg, Pradeep; Watkins, Franklin; Ott, Brian R.; Querfurth, Henry; Tremont, Geoffrey; Salloway, Stephen; Malloy, Paul; Correia, Stephen; Rosen, Howard J.; Miller, Bruce L.; Mintzer, Jacobo; Longmire, Crystal Flynn; Spicer, Kenneth; Finger, Elizabeth; Rachinsky, Irina; Rogers, John; Kertesz, Andrew; Drost, Dick
Aberrant connectivity is implicated in many neurological and psychiatric disorders, including Alzheimer’s disease and schizophrenia. However, other than a few disease-associated candidate genes, we know little about the degree to which genetics play a role in the brain networks; we know even less about specific genes that influence brain connections. Twin and family-based studies can generate estimates of overall genetic influences on a trait, but genome-wide association scans (GWASs) can screen the genome for specific variants influencing the brain or risk for disease. To identify the heritability of various brain connections, we scanned healthy young adult twins with high-field, high-angular resolution diffusion MRI. We adapted GWASs to screen the brain’s connectivity pattern, allowing us to discover genetic variants that affect the human brain’s wiring. The association of connectivity with the SPON1 variant at rs2618516 on chromosome 11 (11p15.2) reached connectome-wide, genome-wide significance after stringent statistical corrections were enforced, and it was replicated in an independent subsample. rs2618516 was shown to affect brain structure in an elderly population with varying degrees of dementia. Older people who carried the connectivity variant had significantly milder clinical dementia scores and lower risk of Alzheimer’s disease. As a posthoc analysis, we conducted GWASs on several organizational and topological network measures derived from the matrices to discover variants in and around genes associated with autism (MACROD2), development (NEDD4), and mental retardation (UBE2A) significantly associated with connectivity. Connectome-wide, genome-wide screening offers substantial promise to discover genes affecting brain connectivity and risk for brain diseases. PMID:23471985
Hall, Jacob B; Bush, William S
Most analyses of genome-wide association data consider each variant independently without considering or adjusting for the genetic background present in the rest of the genome. New approaches to genome analysis use representations of genomic sharing to better account for confounding factors like population stratification or to directly approximate heritability through the estimated sharing of individuals in a dataset. These approaches use mixed linear models, which relate genotypic sharing to phenotypic sharing, and rely on the efficient computation of genetic sharing among individuals in a dataset. This unit describes the principles and practical application of mixed models for the analysis of genome-wide association study data. © 2016 by John Wiley & Sons, Inc.
Ronald, James; Akey, Joshua M
Natural selection, which can be defined as the differential contribution of genetic variants to future generations, is the driving force of Darwinian evolution. Identifying regions of the human genome that have been targets of natural selection is an important step in clarifying human evolutionary history and understanding how genetic variation results in phenotypic diversity, it may also facilitate the search for complex disease genes. Technological advances in high-throughput DNA sequencing and single nucleotide polymorphism genotyping have enabled several genome-wide scans of natural selection to be undertaken. Here, some of the observations that are beginning to emerge from these studies will be reviewed, including evidence for geographically restricted selective pressures (ie local adaptation) and a relationship between genes subject to natural selection and human disease. In addition, the paper will highlight several important problems that need to be addressed in future genome-wide studies of natural selection.
Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R; Llamas, Bastien; Dryomov, Stanislav; Pickrell, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vyacheslav; Guerra, Manuel A Rojo; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David
Ancient DNA makes it possible to observe natural selection directly by analysing samples from populations before, during and after adaptation events. Here we report a genome-wide scan for selection using ancient DNA, capitalizing on the largest ancient DNA data set yet assembled: 230 West Eurasians who lived between 6500 and 300 bc, including 163 with newly reported data. The new samples include, to our knowledge, the first genome-wide ancient DNA from Anatolian Neolithic farmers, whose genetic material we obtained by extracting from petrous bones, and who we show were members of the population that was the source of Europe's first farmers. We also report a transect of the steppe region in Samara between 5600 and 300 bc, which allows us to identify admixture into the steppe from at least two external sources. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height.
Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R.; Llamas, Bastien; Dryomov, Stanislav; Pickrel, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vayacheslav; Rojo Guerra, Manuel A.; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W.; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David
Ancient DNA makes it possible to directly witness natural selection by analyzing samples from populations before, during and after adaptation events. Here we report the first scan for selection using ancient DNA, capitalizing on the largest genome-wide dataset yet assembled: 230 West Eurasians dating to between 6500 and 1000 BCE, including 163 with newly reported data. The new samples include the first genome-wide data from the Anatolian Neolithic culture whose genetic material we extracted from the DNA-rich petrous bone and who we show were members of the population that was the source of Europe’s first farmers. We also report a complete transect of the steppe region in Samara between 5500 and 1200 BCE that allows us to recognize admixture from at least two external sources into steppe populations during this period. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height. PMID:26595274
Fall, Tove; Ingelsson, Erik
Until just a few years ago, the genetic determinants of obesity and metabolic syndrome were largely unknown, with the exception of a few forms of monogenic extreme obesity. Since genome-wide association studies (GWAS) became available, large advances have been made. The first single nucleotide polymorphism robustly associated with increased body mass index (BMI) was in 2007 mapped to a gene with for the time unknown function. This gene, now known as fat mass and obesity associated (FTO) has been repeatedly replicated in several ethnicities and is affecting obesity by regulating appetite. Since the first report from a GWAS of obesity, an increasing number of markers have been shown to be associated with BMI, other measures of obesity or fat distribution and metabolic syndrome. This systematic review of obesity GWAS will summarize genome-wide significant findings for obesity and metabolic syndrome and briefly give a few suggestions of what is to be expected in the next few years.
Wineinger, Nathan E.; Fu, Dong-Jing; Libiger, Ondrej; Alphs, Larry; Savitz, Adam; Gopal, Srihari; Cohen, Nadine; Schork, Nicholas J.
Objective Clinical response to the atypical antipsychotic paliperidone is known to vary among schizophrenic patients. We carried out a genome-wide association study to identify common genetic variants predictive of paliperidone efficacy. Methods We leveraged a collection of 1390 samples from individuals of European ancestry enrolled in 12 clinical studies investigating the efficacy of the extended-release tablet paliperidone ER (n1=490) and the once-monthly injection paliperidone palmitate (n2=550 and n3=350). We carried out a genome-wide association study using a general linear model (GLM) analysis on three separate cohorts, followed by meta-analysis and using a mixed linear model analysis on all samples. The variations in response explained by each single nucleotide polymorphism (h2SNP) were estimated. Results No SNP passed genome-wide significance in the GLM-based analyses with suggestive signals from rs56240334 [P=7.97×10−8 for change in the Clinical Global Impression Scale-Severity (CGI-S); P=8.72×10−7 for change in the total Positive and Negative Syndrome Scale (PANSS)] in the intron of ADCK1. The mixed linear model-based association P-values for rs56240334 were consistent with the results from GLM-based analyses and the association with change in CGI-S (P=4.26×10−8) reached genome-wide significance (i.e. P<5×10−8). We also found suggestive evidence for a polygenic contribution toward paliperidone treatment response with estimates of heritability, h2SNP, ranging from 0.31 to 0.43 for change in the total PANSS score, the PANSS positive Marder factor score, and CGI-S. Conclusion Genetic variations in the ADCK1 gene may differentially predict paliperidone efficacy in schizophrenic patients. However, this finding should be replicated in additional samples. PMID:27846195
Gao, Xiangwei; Wan, Ji; Qian, Shu-Bing
Regulation of translation initiation is a central control point in protein synthesis. Variations of start codon selection contribute to protein diversity and complexity. Systemic mapping of start codon positions and precise measurement of the corresponding initiation rate would transform our understanding of translational control. Here we describe a ribosome profiling approach that enables identification of translation initiation sites on a genome-wide scale. By capturing initiating ribosomes using lactimidomycin, this approach permits qualitative and quantitative analysis of alternative translation initiation.
Speliotes, Elizabeth K.
Sequencing of the human genome has opened up many opportunities to learn about our own genetic susceptibilities to disease. In this Foreword to this issue of Seminars in Liver Disease, I provide some required background to understanding genome-wide association analyses in general, including a list of terms (Table 1) often used in such studies. Five areas of particular significance are then reviewed in detail in the articles that follow. PMID:26676811
Liu, Xiangdong; Invernizzi, Pietro; Lu, Yue; Kosoy, Roman; Lu, Yan; Bianchi, Ilaria; Podda, Mauro; Xu, Chun; Xie, Gang; Macciardi, Fabio; Selmi, Carlo; Lupoli, Sara; Shigeta, Russell; Ransom, Michael; Lleo, Ana; Lee, Annette T; Mason, Andrew L; Myers, Robert P; Peltekian, Kevork M; Ghent, Cameron N; Bernuzzi, Francesca; Zuin, Massimo; Rosina, Floriano; Borghesio, Elisabetta; Floreani, Annarosa; Lazzari, Roberta; Niro, Grazia; Andriulli, Angelo; Muratori, Luigi; Muratori, Paolo; Almasio, Piero L; Andreone, Pietro; Margotti, Marzia; Brunetto, Maurizia; Coco, Barbara; Alvaro, Domenico; Bragazzi, Maria C; Marra, Fabio; Pisano, Alessandro; Rigamonti, Cristina; Colombo, Massimo; Marzioni, Marco; Benedetti, Antonio; Fabris, Luca; Strazzabosco, Mario; Portincasa, Piero; Palmieri, Vincenzo O; Tiribelli, Claudio; Croce, Lory; Bruno, Savino; Rossi, Sonia; Vinci, Maria; Prisco, Cleofe; Mattalia, Alberto; Toniutto, Pierluigi; Picciotto, Antonio; Galli, Andrea; Ferrari, Carlo; Colombo, Silvia; Casella, Giovanni; Morini, Lorenzo; Caporaso, Nicola; Colli, Agostino; Spinzi, Giancarlo; Montanari, Renzo; Gregersen, Peter K; Heathcote, E Jenny; Hirschfield, Gideon M; Siminovitch, Katherine A; Amos, Christopher I; Gershwin, M Eric; Seldin, Michael F
A genome-wide association screen for primary biliary cirrhosis risk alleles was performed in an Italian cohort. The results from the Italian cohort replicated IL12A and IL12RB associations, and a combined meta-analysis using a Canadian dataset identified newly associated loci at SPIB (P = 7.9 x 10(-11), odds ratio (OR) = 1.46), IRF5-TNPO3 (P = 2.8 x 10(-10), OR = 1.63) and 17q12-21 (P = 1.7 x 10(-10), OR = 1.38).
Liu, Xiangdong; Invernizzi, Pietro; Lu, Yue; Kosoy, Roman; Lu, Yan; Bianchi, Ilaria; Podda, Mauro; Xu, Chun; Xie, Gang; Macciardi, Fabio; Selmi, Carlo; Lupoli, Sara; Shigeta, Russell; Ransom, Michael; Lleo, Ana; Lee, Annette T; Mason, Andrew L; Myers, Robert P; Peltekian, Kevork M; Ghent, Cameron N; Bernuzzi, Francesca; Zuin, Massimo; Rosina, Floriano; Borghesio, Elisabetta; Floreani, Annarosa; Lazzari, Roberta; Niro, Grazia; Andriulli, Angelo; Muratori, Luigi; Muratori, Paolo; Almasio, Piero L; Andreone, Pietro; Margotti, Marzia; Brunetto, Maurizia; Coco, Barbara; Alvaro, Domenico; Bragazzi, Maria C; Marra, Fabio; Pisano, Alessandro; Rigamonti, Cristina; Colombo, Massimo; Marzioni, Marco; Benedetti, Antonio; Fabris, Luca; Strazzabosco, Mario; Portincasa, Piero; Palmieri, Vincenzo O; Tiribelli, Claudio; Croce, Lory; Bruno, Savino; Rossi, Sonia; Vinci, Maria; Prisco, Cleofe; Mattalia, Alberto; Toniutto, Pierluigi; Picciotto, Antonio; Galli, Andrea; Ferrari, Carlo; Colombo, Silvia; Casella, Giovanni; Morini, Lorenzo; Caporaso, Nicola; Colli, Agostino; Spinzi, Giancarlo; Montanari, Renzo; Gregersen, Peter K; Heathcote, E Jenny; Hirschfield, Gideon M; Siminovitch, Katherine A; Amos, Christopher I; Gershwin, M Eric; Seldin, Michael F
A genome-wide association screen for primary biliary cirrhosis risk alleles was performed in an Italian cohort. The results from the Italian cohort replicated IL12A and IL12RB associations, and a combined meta-analysis using a Canadian dataset identified newly associated loci at SPIB (P = 7.9 × 10–11, odds ratio (OR) = 1.46), IRF5-TNPO3 (P = 2.8 × 10–10, OR = 1.63) and 17q12-21 (P = 1.7 × 10–10, OR = 1.38). PMID:20639880
Chen, Chia-Yen; Liu, Xiang; Boris-Lawrie, Kathleen; Sharma, Amit; Jeang, Kuan-Teh
RNA helicases are ubiquitous in plants and animals and function in many cellular processes. Retroviruses, such as human immunodeficiency virus (HIV-1), encode no RNA helicases in their genomes and utilize host cellular RNA helicases at various stages of their life cycle. Here, we briefly summarize the roles RNA helicases play in HIV-1 replication that have been identified recently, in part, through genome-wide screenings, proteomics, and molecular studies. Some of these helicases augment virus propagation while others apparently participate in antiviral defenses against viral replication.
Pasanen, Anu; Karjalainen, Minna K.; Bont, Louis; Piippo-Savolainen, Eija; Ruotsalainen, Marja; Goksör, Emma; Kumawat, Kuldeep; Hodemaekers, Hennie; Nuolivirta, Kirsi; Jartti, Tuomas; Wennergren, Göran; Hallman, Mikko; Rämet, Mika; Korppi, Matti
Bronchiolitis is a major cause of hospitalization among infants. Severe bronchiolitis is associated with later asthma, suggesting a common genetic predisposition. Genetic background of bronchiolitis is not well characterized. To identify polymorphisms associated with bronchiolitis, we conducted a genome-wide association study (GWAS) in which 5,300,000 single nucleotide polymorphisms (SNPs) were tested for association in a Finnish–Swedish population of 217 children hospitalized for bronchiolitis and 778 controls. The most promising SNPs (n = 77) were genotyped in a Dutch replication population of 416 cases and 432 controls. Finally, we used a set of 202 Finnish bronchiolitis cases to further investigate candidate SNPs. We did not detect genome-wide significant associations, but several suggestive association signals (p < 10−5) were observed in the GWAS. In the replication population, three SNPs were nominally associated (p < 0.05). Of them, rs269094 was an expression quantitative trait locus (eQTL) for KCND3, previously shown to be associated with occupational asthma. In the additional set of Finnish cases, the association for another SNP (rs9591920) within a noncoding RNA locus was further strengthened. Our results provide a first genome-wide examination of the genetics underlying bronchiolitis. These preliminary findings require further validation in a larger sample size. PMID:28139761
Hutter, Stephan; Vilella, Albert J; Rozas, Julio
Background DNA sequence polymorphisms analysis can provide valuable information on the evolutionary forces shaping nucleotide variation, and provides an insight into the functional significance of genomic regions. The recent ongoing genome projects will radically improve our capabilities to detect specific genomic regions shaped by natural selection. Current available methods and software, however, are unsatisfactory for such genome-wide analysis. Results We have developed methods for the analysis of DNA sequence polymorphisms at the genome-wide scale. These methods, which have been tested on a coalescent-simulated and actual data files from mouse and human, have been implemented in the VariScan software package version 2.0. Additionally, we have also incorporated a graphical-user interface. The main features of this software are: i) exhaustive population-genetic analyses including those based on the coalescent theory; ii) analysis adapted to the shallow data generated by the high-throughput genome projects; iii) use of genome annotations to conduct a comprehensive analyses separately for different functional regions; iv) identification of relevant genomic regions by the sliding-window and wavelet-multiresolution approaches; v) visualization of the results integrated with current genome annotations in commonly available genome browsers. Conclusion VariScan is a powerful and flexible suite of software for the analysis of DNA polymorphisms. The current version implements new algorithms, methods, and capabilities, providing an important tool for an exhaustive exploratory analysis of genome-wide DNA polymorphism data. PMID:16968531
Stein, Jason L; Hua, Xue; Lee, Suh; Ho, April J; Leow, Alex D; Toga, Arthur W; Saykin, Andrew J; Shen, Li; Foroud, Tatiana; Pankratz, Nathan; Huentelman, Matthew J; Craig, David W; Gerber, Jill D; Allen, April N; Corneveaux, Jason J; Dechairo, Bryan M; Potkin, Steven G; Weiner, Michael W; Thompson, Paul
The structure of the human brain is highly heritable, and is thought to be influenced by many common genetic variants, many of which are currently unknown. Recent advances in neuroimaging and genetics have allowed collection of both highly detailed structural brain scans and genome-wide genotype information. This wealth of information presents a new opportunity to find the genes influencing brain structure. Here we explore the relation between 448,293 single nucleotide polymorphisms in each of 31,622 voxels of the entire brain across 740 elderly subjects (mean age+/-s.d.: 75.52+/-6.82 years; 438 male) including subjects with Alzheimer's disease, Mild Cognitive Impairment, and healthy elderly controls from the Alzheimer's Disease Neuroimaging Initiative (ADNI). We used tensor-based morphometry to measure individual differences in brain structure at the voxel level relative to a study-specific template based on healthy elderly subjects. We then conducted a genome-wide association at each voxel to identify genetic variants of interest. By studying only the most associated variant at each voxel, we developed a novel method to address the multiple comparisons problem and computational burden associated with the unprecedented amount of data. No variant survived the strict significance criterion, but several genes worthy of further exploration were identified, including CSMD2 and CADPS2. These genes have high relevance to brain structure. This is the first voxelwise genome wide association study to our knowledge, and offers a novel method to discover genetic influences on brain structure.
Domingue, Benjamin W.; Wedow, Robbee; Conley, Dalton; McQueen, Matt; Hoffmann, Thomas J.; Boardman, Jason D.
An increasing number of studies that are widely used in the demographic research community have collected genome-wide data from their respondents. It is therefore important that demographers have a proper understanding of some of the methodological tools needed to analyze such data. Our paper details the underlying methodology behind one of the most common techniques for analyzing genome-wide data, Genome-Wide Complex Trait Analysis (GCTA). GCTA models provide heritability estimates for health, health behaviors, or indicators of attainment using data from unrelated persons.. Our goal is to describe this model, to highlight the utility of the model for biodemographic research, and to demonstrate the performance of this approach under modifications of the underlying assumptions. The first set of modifications involves changing the nature of the genetic data used to compute genetic similarities between individuals (the genetic relationship matrix). We then explore the sensitivity of the model to heteroscedastic errors. In general, GCTA estimates are robust to the modifications proposed here but we also highlight potential limitations of GCTA estimates. PMID:27050030
Jeong, Seok Won; Chung, Myungguen; Park, Soo-Jung; Cho, Seong Beom
Metabolic syndrome (METS) is a disorder of energy utilization and storage and increases the risk of developing cardiovascular disease and diabetes. To identify the genetic risk factors of METS, we carried out a genome-wide association study (GWAS) for 2,657 cases and 5,917 controls in Korean populations. As a result, we could identify 2 single nucleotide polymorphisms (SNPs) with genome-wide significance level p-values (<5 × 10-8), 8 SNPs with genome-wide suggestive p-values (5 × 10-8 ≤ p < 1 × 10-5), and 2 SNPs of more functional variants with borderline p-values (5 × 10-5 ≤ p < 1 × 10-4). On the other hand, the multiple correction criteria of conventional GWASs exclude false-positive loci, but simultaneously, they discard many true-positive loci. To reconsider the discarded true-positive loci, we attempted to include the functional variants (nonsynonymous SNPs [nsSNPs] and expression quantitative trait loci [eQTL]) among the top 5,000 SNPs based on the proportion of phenotypic variance explained by genotypic variance. In total, 159 eQTLs and 18 nsSNPs were presented in the top 5,000 SNPs. Although they should be replicated in other independent populations, 6 eQTLs and 2 nsSNP loci were located in the molecular pathways of LPL, APOA5, and CHRM2, which were the significant or suggestive loci in the METS GWAS. Conclusively, our approach using the conventional GWAS, reconsidering functional variants and pathway-based interpretation, suggests a useful method to understand the GWAS results of complex traits and can be expanded in other genomewide association studies. PMID:25705157
Coleman, Jonathan R. I.; Ducci, Francesca; Aliev, Fazil; Newhouse, Stephen J.; Liu, Xiehe; Ma, Xiaohong; Wang, Yingcheng; Collier, David A.; Asherson, Philip; Li, Tao; Breen, Gerome
Drug addiction is a costly and recurring healthcare problem, necessitating a need to understand risk factors and mechanisms of addiction, and to identify new biomarkers. To date, genome-wide association studies (GWAS) for heroin addiction have been limited; moreover they have been restricted to examining samples of European and African-American origin due to difficulty of recruiting samples from other populations. This is the first study to test a Han Chinese population; we performed a GWAS on a homogeneous sample of 370 Han Chinese subjects diagnosed with heroin dependence using the DSM-IV criteria and 134 ethnically matched controls. Analysis using the diagnostic criteria of heroin dependence yielded suggestive evidence for association between variants in the genes CCDC42 (coiled coil domain 42; p = 2.8x10-7) and BRSK2 (BR serine/threonine 2; p = 4.110−6). In addition, we found evidence for risk variants within the ARHGEF10 (Rho guanine nucleotide exchange factor 10) gene on chromosome 8 and variants in a region on chromosome 20q13, which is gene-poor but has a concentration of mRNAs and predicted miRNAs. Gene-based association analysis identified genome-wide significant association between variants in CCDC42 and heroin addiction. Additionally, when we investigated shared risk variants between heroin addiction and risk of other addiction-related and psychiatric phenotypes using polygenic risk scores, we found a suggestive relationship with variants predicting tobacco addiction, and a significant relationship with variants predicting schizophrenia. Our genome wide association study of heroin dependence provides data in a novel sample, with functionally plausible results and evidence of genetic data of value to the field. PMID:27936112
Stewart, S Evelyn; Yu, Dongmei; Scharf, Jeremiah M; Neale, Benjamin M; Fagerness, Jesen A; Mathews, Carol A; Arnold, Paul D; Evans, Patrick D; Gamazon, Eric R; Osiecki, Lisa; McGrath, Lauren; Haddad, Stephen; Crane, Jacquelyn; Hezel, Dianne; Illman, Cornelia; Mayerfeld, Catherine; Konkashbaev, Anuar; Liu, Chunyu; Pluzhnikov, Anna; Tikhomirov, Anna; Edlund, Christopher K; Rauch, Scott L; Moessner, Rainald; Falkai, Peter; Maier, Wolfgang; Ruhrmann, Stephan; Grabe, Hans-Jörgen; Lennertz, Leonard; Wagner, Michael; Bellodi, Laura; Cavallini, Maria Cristina; Richter, Margaret A; Cook, Edwin H; Kennedy, James L; Rosenberg, David; Stein, Dan J; Hemmings, Sian MJ; Lochner, Christine; Azzam, Amin; Chavira, Denise A; Fournier, Eduardo; Garrido, Helena; Sheppard, Brooke; Umaña, Paul; Murphy, Dennis L; Wendland, Jens R; Veenstra-VanderWeele, Jeremy; Denys, Damiaan; Blom, Rianne; Deforce, Dieter; Van Nieuwerburgh, Filip; Westenberg, Herman GM; Walitza, Susanne; Egberts, Karin; Renner, Tobias; Miguel, Euripedes Constantino; Cappi, Carolina; Hounie, Ana G; Conceição do Rosário, Maria; Sampaio, Aline S; Vallada, Homero; Nicolini, Humberto; Lanzagorta, Nuria; Camarena, Beatriz; Delorme, Richard; Leboyer, Marion; Pato, Carlos N; Pato, Michele T; Voyiaziakis, Emanuel; Heutink, Peter; Cath, Danielle C; Posthuma, Danielle; Smit, Jan H; Samuels, Jack; Bienvenu, O Joseph; Cullen, Bernadette; Fyer, Abby J; Grados, Marco A; Greenberg, Benjamin D; McCracken, James T; Riddle, Mark A; Wang, Ying; Coric, Vladimir; Leckman, James F; Bloch, Michael; Pittenger, Christopher; Eapen, Valsamma; Black, Donald W; Ophoff, Roel A; Strengman, Eric; Cusi, Daniele; Turiel, Maurizio; Frau, Francesca; Macciardi, Fabio; Gibbs, J Raphael; Cookson, Mark R; Singleton, Andrew; Hardy, John; Crenshaw, Andrew T; Parkin, Melissa A; Mirel, Daniel B; Conti, David V; Purcell, Shaun; Nestadt, Gerald; Hanna, Gregory L; Jenike, Michael A; Knowles, James A; Cox, Nancy; Pauls, David L
Obsessive-compulsive disorder (OCD) is a common, debilitating neuropsychiatric illness with complex genetic etiology. The International OCD Foundation Genetics Collaborative (IOCDF-GC) is a multi-national collaboration established to discover the genetic variation predisposing to OCD. A set of individuals affected with DSM-IV OCD, a subset of their parents, and unselected controls, were genotyped with several different Illumina SNP microarrays. After extensive data cleaning, 1,465 cases, 5,557 ancestry-matched controls and 400 complete trios remained, with a common set of 469,410 autosomal and 9,657 X-chromosome SNPs. Ancestry-stratified case-control association analyses were conducted for three genetically-defined subpopulations and combined in two meta-analyses, with and without the trio-based analysis. In the case-control analysis, the lowest two p-values were located within DLGAP1 (p=2.49×10-6 and p=3.44×10-6), a member of the neuronal postsynaptic density complex. In the trio analysis, rs6131295, near BTBD3, exceeded the genome-wide significance threshold with a p-value=3.84 × 10-8. However, when trios were meta-analyzed with the combined case-control samples, the p-value for this variant was 3.62×10-5, losing genome-wide significance. Although no SNPs were identified to be associated with OCD at a genome-wide significant level in the combined trio-case-control sample, a significant enrichment of methylation-QTLs (p<0.001) and frontal lobe eQTLs (p=0.001) was observed within the top-ranked SNPs (p<0.01) from the trio-case-control analysis, suggesting these top signals may have a broad role in gene expression in the brain, and possibly in the etiology of OCD. PMID:22889921
Aschebrook-Kilfoy, Briseis; Argos, Maria; Pierce, Brandon L; Tong, Lin; Jasmine, Farzana; Roy, Shantanu; Parvez, Faruque; Ahmed, Alauddin; Islam, Tariqul; Kibriya, Muhammad G; Ahsan, Habibul
Human fertility is a complex trait determined by gene-environment interactions in which genetic factors represent a significant component. To better understand inter-individual variability in fertility, we performed one of the first genome-wide association studies (GWAS) of common fertility phenotypes, lifetime number of pregnancies and number of children in a developing country population. The fertility phenotype data and DNA samples were obtained at baseline recruitment from individuals participating in a large prospective cohort study in Bangladesh. GWAS analyses of fertility phenotypes were conducted among 1,686 married women. One SNP on chromosome 4 was non-significantly associated with number of children at P <10(-7) and number of pregnancies at P <10(-6). This SNP is located in a region without a gene within 1 Mb. One SNP on chromosome 6 was non-significantly associated with extreme number of children at P <10(-6). The closest gene to this SNP is HDGFL1, a hepatoma-derived growth factor. When we excluded hormonal contraceptive users, a SNP on chromosome 5 was non-significantly associated at P <10(-5) for number of children and number of pregnancies. This SNP is located near C5orf64, an open reading frame, and ZSWIM6, a zinc ion binding gene. We also estimated the heritability of these phenotypes from our genotype data using GCTA (Genome-wide Complex Trait Analysis) for number of children (hg2 = 0.149, SE = 0.24, p-value = 0.265) and number of pregnancies (hg2 = 0.007, SE = 0.22, p-value = 0.487). Our genome-wide association study and heritability estimates of number of pregnancies and number of children in Bangladesh did not confer strong evidence of common variants for parity variation. However, our results suggest that future studies may want to consider the role of 3 notable SNPs in their analysis.
Ioannidis, John P A; Thomas, Gilles; Daly, Mark J
Studies using genome-wide platforms have yielded an unprecedented number of promising signals of association between genomic variants and human traits. This Review addresses the steps required to validate, augment and refine such signals to identify underlying causal variants for well-defined phenotypes. These steps include: large-scale exact replication across both similar and diverse populations; fine mapping and resequencing; determination of the most informative markers and multiple independent informative loci; incorporation of functional information; and improved phenotype mapping of the implicated genetic effects. Even in cases for which replication proves that an effect exists, confident localization of the causal variant often remains elusive.
Munroe, Patricia B.
The study of family pedigrees with rare monogenic cardiovascular disorders has revealed new molecular players in physiological processes. Genome-wide association studies of complex traits with a heritable component may afford a similar and potentially intellectually richer opportunity. In this review we focus on the interpretation of genetic associations and the issue of causality in relation to known and potentially new physiology. We mainly discuss cardiometabolic traits as it reflects our personal interests, but the issues pertain broadly in many other disciplines. We also describe some of the resources that are now available that may expedite follow up of genetic association signals into observations on causal mechanisms and pathophysiology. PMID:26106147
Fonseca, Gregory J; Seidman, Jason S; Glass, Christopher K
Macrophages play essential roles in the response to injury and infection and contribute to the development and/or homeostasis of the various tissues they reside in. Conversely, macrophages also influence the pathogenesis of metabolic, neurodegenerative, and neoplastic diseases. Mechanisms that contribute to the phenotypic diversity of macrophages in health and disease remain poorly understood. Here we review the recent application of genome-wide approaches to characterize the transcriptomes and epigenetic landscapes of tissue-resident macrophages. These studies are beginning to provide insights into how distinct tissue environments are interpreted by transcriptional regulatory elements to drive specialized programs of gene expression. PMID:28087927
Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G
The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.
The number of obese patients is increasing in Japan, due to the westernization of lifestyle. Obesity, especially visceral fat obesity, is important for the development of metabolic syndrome. Genetic factors are important for the development of obesity as well as environmental factors. Importance of genetic factors of fat distribution is also reported. Recent genome-wide association studies (GWASs) have revealed the obesity and fat distribution-related polymorphisms. GWAS will highlight a better understanding of the underlying molecular mechanisms in the regulation of obesity and distribution of body fat.
Ogura, Yoji; Kou, Ikuyo; Scoliosis, Japan; Matsumoto, Morio; Watanabe, Kota; Ikegawa, Shiro
Adolescent idiopathic scoliosis(AIS)is a polygenic disease. Genome-wide association studies(GWASs)have been performed for a lot of polygenic diseases. For AIS, we conducted GWAS and identified the first AIS locus near LBX1. After the discovery, we have extended our study by increasing the numbers of subjects and SNPs. In total, our Japanese GWAS has identified four susceptibility genes. GWASs for AIS have also been performed in the USA and China, which identified one and three susceptibility genes, respectively. Here we review GWASs in Japan and abroad and functional analysis to clarify the pathomechanism of AIS.
Srivastava, Prashant Kumar; Bagnati, Marta; Delahaye-Duriez, Andree; Ko, Jeong-Hun; Rotival, Maxime; Langley, Sarah R.; Shkura, Kirill; Mazzuferi, Manuela; Danis, Bénédicte; van Eyll, Jonathan; Foerch, Patrik; Behmoaras, Jacques; Kaminski, Rafal M.; Petretto, Enrico; Johnson, Michael R.
The recoding of genetic information through RNA editing contributes to proteomic diversity, but the extent and significance of RNA editing in disease is poorly understood. In particular, few studies have investigated the relationship between RNA editing and disease at a genome-wide level. Here, we developed a framework for the genome-wide detection of RNA sites that are differentially edited in disease. Using RNA-sequencing data from 100 hippocampi from mice with epilepsy (pilocarpine–temporal lobe epilepsy model) and 100 healthy control hippocampi, we identified 256 RNA sites (overlapping with 87 genes) that were significantly differentially edited between epileptic cases and controls. The degree of differential RNA editing in epileptic mice correlated with frequency of seizures, and the set of genes differentially RNA-edited between case and control mice were enriched for functional terms highly relevant to epilepsy, including “neuron projection” and “seizures.” Genes with differential RNA editing were preferentially enriched for genes with a genetic association to epilepsy. Indeed, we found that they are significantly enriched for genes that harbor nonsynonymous de novo mutations in patients with epileptic encephalopathy and for common susceptibility variants associated with generalized epilepsy. These analyses reveal a functional convergence between genes that are differentially RNA-edited in acquired symptomatic epilepsy and those that contribute risk for genetic epilepsy. Taken together, our results suggest a potential role for RNA editing in the epileptic hippocampus in the occurrence and severity of epileptic seizures. PMID:28250018
Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Peltonen, Leena; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E; Baron-Cohen, Simon
Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision.
Srivastava, Prashant Kumar; Bagnati, Marta; Delahaye-Duriez, Andree; Ko, Jeong-Hun; Rotival, Maxime; Langley, Sarah R; Shkura, Kirill; Mazzuferi, Manuela; Danis, Bénédicte; van Eyll, Jonathan; Foerch, Patrik; Behmoaras, Jacques; Kaminski, Rafal M; Petretto, Enrico; Johnson, Michael R
The recoding of genetic information through RNA editing contributes to proteomic diversity, but the extent and significance of RNA editing in disease is poorly understood. In particular, few studies have investigated the relationship between RNA editing and disease at a genome-wide level. Here, we developed a framework for the genome-wide detection of RNA sites that are differentially edited in disease. Using RNA-sequencing data from 100 hippocampi from mice with epilepsy (pilocarpine-temporal lobe epilepsy model) and 100 healthy control hippocampi, we identified 256 RNA sites (overlapping with 87 genes) that were significantly differentially edited between epileptic cases and controls. The degree of differential RNA editing in epileptic mice correlated with frequency of seizures, and the set of genes differentially RNA-edited between case and control mice were enriched for functional terms highly relevant to epilepsy, including "neuron projection" and "seizures." Genes with differential RNA editing were preferentially enriched for genes with a genetic association to epilepsy. Indeed, we found that they are significantly enriched for genes that harbor nonsynonymous de novo mutations in patients with epileptic encephalopathy and for common susceptibility variants associated with generalized epilepsy. These analyses reveal a functional convergence between genes that are differentially RNA-edited in acquired symptomatic epilepsy and those that contribute risk for genetic epilepsy. Taken together, our results suggest a potential role for RNA editing in the epileptic hippocampus in the occurrence and severity of epileptic seizures.
Scharf, Jeremiah M.; Yu, Dongmei; Mathews, Carol A.; Neale, Benjamin M.; Stewart, S. Evelyn; Fagerness, Jesen A; Evans, Patrick; Gamazon, Eric; Edlund, Christopher K.; Service, Susan; Tikhomirov, Anna; Osiecki, Lisa; Illmann, Cornelia; Pluzhnikov, Anna; Konkashbaev, Anuar; Davis, Lea K; Han, Buhm; Crane, Jacquelyn; Moorjani, Priya; Crenshaw, Andrew T.; Parkin, Melissa A.; Reus, Victor I.; Lowe, Thomas L.; Rangel-Lugo, Martha; Chouinard, Sylvain; Dion, Yves; Girard, Simon; Cath, Danielle C; Smit, Jan H; King, Robert A.; Fernandez, Thomas; Leckman, James F.; Kidd, Kenneth K.; Kidd, Judith R.; Pakstis, Andrew J.; State, Matthew; Herrera, Luis Diego; Romero, Roxana; Fournier, Eduardo; Sandor, Paul; Barr, Cathy L; Phan, Nam; Gross-Tsur, Varda; Benarroch, Fortu; Pollak, Yehuda; Budman, Cathy L.; Bruun, Ruth D.; Erenberg, Gerald; Naarden, Allan L; Lee, Paul C; Weiss, Nicholas; Kremeyer, Barbara; Berrío, Gabriel Bedoya; Campbell, Desmond; Silgado, Julio C. Cardona; Ochoa, William Cornejo; Restrepo, Sandra C. Mesa; Muller, Heike; Duarte, Ana V. Valencia; Lyon, Gholson J; Leppert, Mark; Morgan, Jubel; Weiss, Robert; Grados, Marco A.; Anderson, Kelley; Davarya, Sarah; Singer, Harvey; Walkup, John; Jankovic, Joseph; Tischfield, Jay A.; Heiman, Gary A.; Gilbert, Donald L.; Hoekstra, Pieter J.; Robertson, Mary M.; Kurlan, Roger; Liu, Chunyu; Gibbs, J. Raphael; Singleton, Andrew; Hardy, John; Strengman, Eric; Ophoff, Roel; Wagner, Michael; Moessner, Rainald; Mirel, Daniel B.; Posthuma, Danielle; Sabatti, Chiara; Eskin, Eleazar; Conti, David V.; Knowles, James A.; Ruiz-Linares, Andres; Rouleau, Guy A.; Purcell, Shaun; Heutink, Peter; Oostra, Ben A.; McMahon, William; Freimer, Nelson; Cox, Nancy J.; Pauls, David L.
Tourette Syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association study (GWAS) of TS in 1285 cases and 4964 ancestry-matched controls of European ancestry, including two European-derived population isolates, Ashkenazi Jews from North America and Israel, and French Canadians from Quebec, Canada. In a primary meta-analysis of GWAS data from these European ancestry samples, no markers achieved a genome-wide threshold of significance (p<5 × 10−8); the top signal was found in rs7868992 on chromosome 9q32 within COL27A1 (p=1.85 × 10−6). A secondary analysis including an additional 211 cases and 285 controls from two closely-related Latin-American population isolates from the Central Valley of Costa Rica and Antioquia, Colombia also identified rs7868992 as the top signal (p=3.6 × 10−7 for the combined sample of 1496 cases and 5249 controls following imputation with 1000 Genomes data). This study lays the groundwork for the eventual identification of common TS susceptibility variants in larger cohorts and helps to provide a more complete understanding of the full genetic architecture of this disorder. PMID:22889924
Walter, Stefan; Atzmon, Gil; Demerath, Ellen W.; Garcia, Melissa E.; Kaplan, Robert C.; Kumari, Meena; Lunetta, Kathryn L.; Milaneschi, Yuri; Tanaka, Toshiko; Tranah, Gregory J.; Völker, Uwe; Yu, Lei; Arnold, Alice; Benjamin, Emelia J.; Biffar, Reiner; Buchman, Aron S.; Boerwinkle, Eric; Couper, David; De Jager, Philip L.; Evans, Denis A.; Harris, Tamara B.; Hoffmann, Wolfgang; Hofman, Albert; Karasik, David; Kiel, Douglas P.; Kocher, Thomas; Kuningas, Maris; Launer, Lenore J.; Lohman, Kurt K.; Lutsey, Pamela L.; Mackenbach, Johan; Marciante, Kristin; Psaty, Bruce M.; Reiman, Eric M.; Rotter, Jerome I.; Seshadri, Sudha; Shardell, Michelle D.; Smith, Albert V.; van Duijn, Cornelia; Walston, Jeremy; Zillikens, M. Carola; Bandinelli, Stefania; Baumeister, Sebastian E.; Bennett, David A.; Ferrucci, Luigi; Gudnason, Vilmundur; Kivimaki, Mika; Liu, Yongmei; Murabito, Joanne M.; Newman, Anne B.; Tiemeier, Henning; Franceschini, Nora
Human longevity and healthy aging show moderate heritability (20–50%). We conducted a meta-analysis of genome-wide association studies from nine studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium for two outcomes: a) all-cause mortality and b) survival free of major disease or death. No single nucleotide polymorphism (SNP) was a genome-wide significant predictor of either outcome (p < 5 × 10−8). We found fourteen independent SNPs that predicted risk of death, and eight SNPs that predicted event-free survival (p < 10−5). These SNPs are in or near genes that are highly expressed in the brain (HECW2, HIP1, BIN2, GRIA1), genes involved in neural development and function (KCNQ4, LMO4, GRIA1, NETO1) and autophagy (ATG4C), and genes that are associated with risk of various diseases including cancer and Alzheimer’s disease. In addition to considerable overlap between the traits, pathway and network analysis corroborated these findings. These findings indicate that variation in genes involved in neurological processes may be an important factor in regulating aging free of major disease and achieving longevity. PMID:21782286
Preston, Jessica L.; Randel, Melissa A.; Johnson, Eric A.
Here we present a genome-wide method for de novo identification of enhancer regions. This approach enables massively parallel empirical investigation of DNA sequences that mediate transcriptional activation and provides a platform for discovery of regulatory modules capable of driving context-specific gene expression. The method links fragmented genomic DNA to the transcription of randomer molecule identifiers and measures the functional enhancer activity of the library by massively parallel sequencing. We transfected a Drosophila melanogaster library into S2 cells in normoxia and hypoxia, and assayed 4,599,881 genomic DNA fragments in parallel. The locations of the enhancer regions strongly correlate with genes up-regulated after hypoxia and previously described enhancers. Novel enhancer regions were identified and integrated with RNAseq data and transcription factor motifs to describe the hypoxic response on a genome-wide basis as a complex regulatory network involving multiple stress-response pathways. This work provides a novel method for high-throughput assay of enhancer activity and the genome-scale identification of 31 hypoxia-activated enhancers in Drosophila. PMID:26713262
Walter, Stefan; Atzmon, Gil; Demerath, Ellen W; Garcia, Melissa E; Kaplan, Robert C; Kumari, Meena; Lunetta, Kathryn L; Milaneschi, Yuri; Tanaka, Toshiko; Tranah, Gregory J; Völker, Uwe; Yu, Lei; Arnold, Alice; Benjamin, Emelia J; Biffar, Reiner; Buchman, Aron S; Boerwinkle, Eric; Couper, David; De Jager, Philip L; Evans, Denis A; Harris, Tamara B; Hoffmann, Wolfgang; Hofman, Albert; Karasik, David; Kiel, Douglas P; Kocher, Thomas; Kuningas, Maris; Launer, Lenore J; Lohman, Kurt K; Lutsey, Pamela L; Mackenbach, Johan; Marciante, Kristin; Psaty, Bruce M; Reiman, Eric M; Rotter, Jerome I; Seshadri, Sudha; Shardell, Michelle D; Smith, Albert V; van Duijn, Cornelia; Walston, Jeremy; Zillikens, M Carola; Bandinelli, Stefania; Baumeister, Sebastian E; Bennett, David A; Ferrucci, Luigi; Gudnason, Vilmundur; Kivimaki, Mika; Liu, Yongmei; Murabito, Joanne M; Newman, Anne B; Tiemeier, Henning; Franceschini, Nora
Human longevity and healthy aging show moderate heritability (20%-50%). We conducted a meta-analysis of genome-wide association studies from 9 studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium for 2 outcomes: (1) all-cause mortality, and (2) survival free of major disease or death. No single nucleotide polymorphism (SNP) was a genome-wide significant predictor of either outcome (p < 5 × 10(-8)). We found 14 independent SNPs that predicted risk of death, and 8 SNPs that predicted event-free survival (p < 10(-5)). These SNPs are in or near genes that are highly expressed in the brain (HECW2, HIP1, BIN2, GRIA1), genes involved in neural development and function (KCNQ4, LMO4, GRIA1, NETO1) and autophagy (ATG4C), and genes that are associated with risk of various diseases including cancer and Alzheimer's disease. In addition to considerable overlap between the traits, pathway and network analysis corroborated these findings. These findings indicate that variation in genes involved in neurological processes may be an important factor in regulating aging free of major disease and achieving longevity.
Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E.; Baron-Cohen, Simon
Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision. PMID:26176695
Motsinger-Reif, Alison; Dickey, Allison; Yale, Steven; Trepanier, Lauren A.
Background Hypersensitivity (HS) reactions to sulfonamide antibiotics occur uncommonly, but with potentially severe clinical manifestations. A familial predisposition to sulfonamide HS is suspected, but robust predictive genetic risk factors have yet to be identified. Strongly linked genetic polymorphisms have been used clinically as screening tests for other HS reactions prior to administration of high-risk drugs. Objective The purpose of this study was to evaluate for genetic risk of sulfonamide HS in the immunocompetent population using genome-wide association. Methods Ninety-one patients with symptoms after trimethoprim-sulfamethoxazole (TMP-SMX) attributable to “probable” drug HS based on medical record review and the Naranjo Adverse Drug Reaction Probability Scale, and 184 age- and sex-matched patients who tolerated a therapeutic course of TMP-SMX, were included in a genome-wide association study using both common and rare variant techniques. Additionally, two subgroups of HS patients with a more refined clinical phenotype (fever and rash; or fever, rash and eosinophilia) were evaluated separately. Results For the full dataset, no single nucleotide polymorphisms were suggestive of or reached genome-wide significance in the common variant analysis, nor was any genetic locus significant in the rare variant analysis. A single, possible gene locus association (COL12A1) was identified in the rare variant analysis for patients with both fever and rash, but the sample size was very small in this subgroup (n = 16), and this may be a false positive finding. No other significant associations were found for the subgroups. Conclusions No convincing genetic risk factors for sulfonamide HS were identified in this population. These negative findings may be due to challenges in accurately confirming the phenotype in exanthematous drug eruptions, or to unidentified gene-environment interactions influencing sulfonamide HS. PMID:27272151
Smith, Erin N.; Chen, Wei; Kähönen, Mika; Kettunen, Johannes; Lehtimäki, Terho; Peltonen, Leena; Raitakari, Olli T.; Salem, Rany M.; Schork, Nicholas J.; Shaw, Marian; Srinivasan, Sathanur R.; Topol, Eric J.; Viikari, Jorma S.; Berenson, Gerald S.; Murray, Sarah S.
Cardiovascular disease (CVD) is the leading cause of death worldwide. Recent genome-wide association (GWA) studies have pinpointed many loci associated with CVD risk factors in adults. It is unclear, however, if these loci predict trait levels at all ages, if they are associated with how a trait develops over time, or if they could be used to screen individuals who are pre-symptomatic to provide the opportunity for preventive measures before disease onset. We completed a genome-wide association study on participants in the longitudinal Bogalusa Heart Study (BHS) and have characterized the association between genetic factors and the development of CVD risk factors from childhood to adulthood. We report 7 genome-wide significant associations involving CVD risk factors, two of which have been previously reported. Top regions were tested for replication in the Young Finns Study (YF) and two associations strongly replicated: rs247616 in CETP with HDL levels (combined P = 9.7×10−24), and rs445925 at APOE with LDL levels (combined P = 8.7×10−19). We show that SNPs previously identified in adult cross-sectional studies tend to show age-independent effects in the BHS with effect sizes consistent with previous reports. Previously identified variants were associated with adult trait levels above and beyond those seen in childhood; however, variants with time-dependent effects were also promising predictors. This is the first GWA study to evaluate the role of common genetic variants in the development of CVD risk factors in children as they advance through adulthood and highlights the utility of using longitudinal studies to identify genetic predictors of adult traits in children. PMID:20838585
Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J
The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53–3.14), P=1.9 × 10-5). Two polymorphisms at 6p21.2 LINC00951–LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37–1.85), P=1.6 × 10−9) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder. PMID:27598967
Deng, Pingchuan; Wang, Meng; Feng, Kewei; Cui, Licao; Tong, Wei; Song, Weining; Nie, Xiaojun
Microsatellites are an important constituent of plant genome and distributed across entire genome. In this study, genome-wide analysis of microsatellites in 8 Triticeae species and 9 model plants revealed that microsatellite characteristics were similar among the Triticeae species. Furthermore, genome-wide microsatellite markers were designed in wheat and then used to analyze the evolutionary relationship of wheat and other Triticeae species. Results displayed that Aegilops tauschii was found to be the closest species to Triticum aestivum, followed by Triticum urartu, Triticum turgidum and Aegilops speltoides, while Triticum monococcum, Aegilops sharonensis and Hordeum vulgare showed a relatively lower PCR amplification effectivity. Additionally, a significantly higher PCR amplification effectivity was found in chromosomes at the same subgenome than its homoeologous when these markers were subjected to search against different chromosomes in wheat. After a rigorous screening process, a total of 20,666 markers showed high amplification and polymorphic potential in wheat and its relatives, which were integrated with the public available wheat markers and then anchored to the genome of wheat (CS). This study not only provided the useful resource for SSR markers development in Triticeae species, but also shed light on the evolution of polyploid wheat from the perspective of microsatellites. PMID:27561724
Pheiffer, Carmen; Humphries, Stephen E.; Gamieldien, Junaid; Erasmus, Rajiv T.
Aims. To conduct a genome-wide DNA methylation in individuals with type 2 diabetes, individuals with prediabetes, and control mixed ancestry individuals from South Africa. Methods. We used peripheral blood to perform genome-wide DNA methylation analysis in 3 individuals with screen detected diabetes, 3 individuals with prediabetes, and 3 individuals with normoglycaemia from the Bellville South Community, Cape Town, South Africa, who were age-, gender-, body mass index-, and duration of residency-matched. Methylated DNA immunoprecipitation (MeDIP) was performed by Arraystar Inc. (Rockville, MD, USA). Results. Hypermethylated DMRs were 1160 (81.97%) and 124 (43.20%), respectively, in individuals with diabetes and prediabetes when both were compared to subjects with normoglycaemia. Our data shows that genes related to the immune system, signal transduction, glucose transport, and pancreas development have altered DNA methylation in subjects with prediabetes and diabetes. Pathway analysis based on the functional analysis mapping of genes to KEGG pathways suggested that the linoleic acid metabolism and arachidonic acid metabolism pathways are hypomethylated in prediabetes and diabetes. Conclusions. Our study suggests that epigenetic changes are likely to be an early process that occurs before the onset of overt diabetes. Detailed analysis of DMRs that shows gradual methylation differences from control versus prediabetes to prediabetes versus diabetes in a larger sample size is required to confirm these findings. PMID:27555869
Roh, Tae-young; Ngau, Wing Chi; Cui, Kairong; Landsman, David; Zhao, Keji
The expression patterns of eukaryotic genomes are controlled by their chromatin structure, consisting of nucleosome subunits in which DNA of approximately 146 bp is wrapped around a core of 8 histone molecules. Post-translational histone modifications play an essential role in modifying chromatin structure. Here we apply a combination of SAGE and chromatin immunoprecipitation (ChIP) protocols to determine the distribution of hyperacetylated histones H3 and H4 in the Saccharomyces cerevisiae genome. We call this approach genome-wide mapping technique (GMAT). Using GMAT, we find that the highest acetylation levels are detected in the 5' end of a gene's coding region, but not in the promoter. Furthermore, we show that the histone acetyltransferase, GCN5p, regulates H3 acetylation in the promoter and 5' end of the coding regions. These findings indicate that GMAT should find valuable applications in mapping target sites of chromatin-modifying enzymes.
Murphy, Travis W.; Lu, Chang
Next-generation sequencing (NGS) has revolutionized how molecular biology studies are conducted. Its decreasing cost and increasing throughput permit profiling of genomic, transcriptomic, and epigenomic features for a wide range of applications. Microfluidics has been proven to be highly complementary to NGS technology with its unique capabilities for handling small volumes of samples and providing platforms for automation, integration, and multiplexing. In this article, we review recent progress on applying microfluidics to facilitate genome-wide studies. We emphasize on several technical aspects of NGS and how they benefit from coupling with microfluidic technology. We also summarize recent efforts on developing microfluidic technology for genomic, transcriptomic, and epigenomic studies, with emphasis on single cell analysis. We envision rapid growth in these directions, driven by the needs for testing scarce primary cell samples from patients in the context of precision medicine.
Patel, Jai N; McLeod, Howard L; Innocenti, Federico
Genome wide association studies (GWAS) provide an agnostic approach to identifying potential genetic variants associated with disease susceptibility, prognosis of survival and/or predictive of drug response. Although these techniques are costly and interpretation of study results is challenging, they do allow for a more unbiased interrogation of the entire genome, resulting in the discovery of novel genes and understanding of novel biological associations. This review will focus on the implications of GWAS in cancer therapy, in particular germ-line mutations, including findings from major GWAS which have identified predictive genetic loci for clinical outcome and/or toxicity. Lessons and challenges in cancer GWAS are also discussed, including the need for functional analysis and replication, as well as future perspectives for biological and clinical utility. Given the large heterogeneity in response to cancer therapeutics, novel methods of identifying mechanisms and biology of variable drug response and ultimately treatment individualization will be indispensable.
Diseases related to tobacco smoking are the second leading cause of death in the world. Despite increasing evidence of genetic determination, the susceptibility genes and loci underlying various aspects of smoking behavior are largely unknown. Genome-wide association studies (GWASs) provided a new conceptual framework in the search for variants underlying common traits/disorders. A massive scan of the genome and a "hypothesis-free" approach enable discovery of new aspects of genetics of complex traits. In this paper the results of GWASs and GWAS meta-analyzes of cigarette smoking behavior and nicotine dependence are reviewed with the particular attention to smoking cessation success and the replacement therapy. The results of these studies are discussed in the context of the results of the candidate gene association studies. Studies on the role of the genomic regions, identified in GWASs, in the development of smoking-related diseases are also discussed.
Wan, Yue; Qu, Kun; Ouyang, Zhengqing; Kertesz, Michael; Li, Jun; Tibshirani, Robert; Makino, Debora L; Nutter, Robert C; Segal, Eran; Chang, Howard Y
RNA structural transitions are important in the function and regulation of RNAs. Here, we reveal a layer of transcriptome organization in the form of RNA folding energies. By probing yeast RNA structures at different temperatures, we obtained relative melting temperatures (Tm) for RNA structures in over 4000 transcripts. Specific signatures of RNA Tm demarcated the polarity of mRNA open reading frames and highlighted numerous candidate regulatory RNA motifs in 3' untranslated regions. RNA Tm distinguished noncoding versus coding RNAs and identified mRNAs with distinct cellular functions. We identified thousands of putative RNA thermometers, and their presence is predictive of the pattern of RNA decay in vivo during heat shock. The exosome complex recognizes unpaired bases during heat shock to degrade these RNAs, coupling intrinsic structural stabilities to gene regulation. Thus, genome-wide structural dynamics of RNA can parse functional elements of the transcriptome and reveal diverse biological insights.
Lin, Eugene; Lane, Hsien-Yuan
Major depressive disorder (MDD) is one of the most common psychiatric disorders worldwide. Doctors must prescribe antidepressants based on educated guesses due to the fact that it is unmanageable to predict the effectiveness of any particular antidepressant in an individual patient. With the recent advent of scientific research, the genome-wide association study (GWAS) is extensively employed to analyze hundreds of thousands of single nucleotide polymorphisms by high-throughput genotyping technologies. In addition to the candidate-gene approach, the GWAS approach has recently been utilized to investigate the determinants of antidepressant response to therapy. In this study, we reviewed GWAS studies, their limitations and future directions with respect to the pharmacogenomics of antidepressants in MDD.
Boraska, Vesna; Franklin, Christopher S; Floyd, James AB; Thornton, Laura M; Huckins, Laura M; Southam, Lorraine; Rayner, N William; Tachmazidou, Ioanna; Klump, Kelly L; Treasure, Janet; Lewis, Cathryn M; Schmidt, Ulrike; Tozzi, Federica; Kiezebrink, Kirsty; Hebebrand, Johannes; Gorwood, Philip; Adan, Roger AH; Kas, Martien JH; Favaro, Angela; Santonastaso, Paolo; Fernández-Aranda, Fernando; Gratacos, Monica; Rybakowski, Filip; Dmitrzak-Weglarz, Monika; Kaprio, Jaakko; Keski-Rahkonen, Anna; Raevuori, Anu; Van Furth, Eric F; Slof-Op t Landt, Margarita CT; Hudson, James I; Reichborn-Kjennerud, Ted; Knudsen, Gun Peggy S; Monteleone, Palmiero; Kaplan, Allan S; Karwautz, Andreas; Hakonarson, Hakon; Berrettini, Wade H; Guo, Yiran; Li, Dong; Schork, Nicholas J.; Komaki, Gen; Ando, Tetsuya; Inoko, Hidetoshi; Esko, Tõnu; Fischer, Krista; Männik, Katrin; Metspalu, Andres; Baker, Jessica H; Cone, Roger D; Dackor, Jennifer; DeSocio, Janiece E; Hilliard, Christopher E; O’Toole, Julie K; Pantel, Jacques; Szatkiewicz, Jin P; Taico, Chrysecolla; Zerwas, Stephanie; Trace, Sara E; Davis, Oliver SP; Helder, Sietske; Bühren, Katharina; Burghardt, Roland; de Zwaan, Martina; Egberts, Karin; Ehrlich, Stefan; Herpertz-Dahlmann, Beate; Herzog, Wolfgang; Imgart, Hartmut; Scherag, André; Scherag, Susann; Zipfel, Stephan; Boni, Claudette; Ramoz, Nicolas; Versini, Audrey; Brandys, Marek K; Danner, Unna N; de Kovel, Carolien; Hendriks, Judith; Koeleman, Bobby PC; Ophoff, Roel A; Strengman, Eric; van Elburg, Annemarie A; Bruson, Alice; Clementi, Maurizio; Degortes, Daniela; Forzan, Monica; Tenconi, Elena; Docampo, Elisa; Escaramís, Geòrgia; Jiménez-Murcia, Susana; Lissowska, Jolanta; Rajewski, Andrzej; Szeszenia-Dabrowska, Neonila; Slopien, Agnieszka; Hauser, Joanna; Karhunen, Leila; Meulenbelt, Ingrid; Slagboom, P Eline; Tortorella, Alfonso; Maj, Mario; Dedoussis, George; Dikeos, Dimitris; Gonidakis, Fragiskos; Tziouvas, Konstantinos; Tsitsika, Artemis; Papezova, Hana; Slachtova, Lenka; Martaskova, Debora; Kennedy, James L.; Levitan, Robert D.; Yilmaz, Zeynep; Huemer, Julia; Koubek, Doris; Merl, Elisabeth; Wagner, Gudrun; Lichtenstein, Paul; Breen, Gerome; Cohen-Woods, Sarah; Farmer, Anne; McGuffin, Peter; Cichon, Sven; Giegling, Ina; Herms, Stefan; Rujescu, Dan; Schreiber, Stefan; Wichmann, H-Erich; Dina, Christian; Sladek, Rob; Gambaro, Giovanni; Soranzo, Nicole; Julia, Antonio; Marsal, Sara; Rabionet, Raquel; Gaborieau, Valerie; Dick, Danielle M; Palotie, Aarno; Ripatti, Samuli; Widén, Elisabeth; Andreassen, Ole A; Espeseth, Thomas; Lundervold, Astri; Reinvang, Ivar; Steen, Vidar M; Le Hellard, Stephanie; Mattingsdal, Morten; Ntalla, Ioanna; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Navratilova, Marie; Gallinger, Steven; Pinto, Dalila; Scherer, Stephen; Aschauer, Harald; Carlberg, Laura; Schosser, Alexandra; Alfredsson, Lars; Ding, Bo; Klareskog, Lars; Padyukov, Leonid; Finan, Chris; Kalsi, Gursharan; Roberts, Marion; Logan, Darren W; Peltonen, Leena; Ritchie, Graham RS; Barrett, Jeffrey C; Estivill, Xavier; Hinney, Anke; Sullivan, Patrick F; Collier, David A; Zeggini, Eleftheria; Bulik, Cynthia M
Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2,907 cases with AN from 14 countries (15 sites) and 14,860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery datasets. Seventy-six (72 independent) SNPs were taken forward for in silico (two datasets) or de novo (13 datasets) replication genotyping in 2,677 independent AN cases and 8,629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication datasets comprised 5,551 AN cases and 21,080 controls. AN subtype analyses (1,606 AN restricting; 1,445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01×10−7) in SOX2OT and rs17030795 (P=5.84×10−6) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76×10−6) between CUL3 and FAM124B and rs1886797 (P=8.05×10−6) near SPATA13. Comparing discovery to replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P= 4×10−6), strongly suggesting that true findings exist but that our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field. PMID:21079607
Boraska, V; Franklin, C S; Floyd, J A B; Thornton, L M; Huckins, L M; Southam, L; Rayner, N W; Tachmazidou, I; Klump, K L; Treasure, J; Lewis, C M; Schmidt, U; Tozzi, F; Kiezebrink, K; Hebebrand, J; Gorwood, P; Adan, R A H; Kas, M J H; Favaro, A; Santonastaso, P; Fernández-Aranda, F; Gratacos, M; Rybakowski, F; Dmitrzak-Weglarz, M; Kaprio, J; Keski-Rahkonen, A; Raevuori, A; Van Furth, E F; Slof-Op 't Landt, M C T; Hudson, J I; Reichborn-Kjennerud, T; Knudsen, G P S; Monteleone, P; Kaplan, A S; Karwautz, A; Hakonarson, H; Berrettini, W H; Guo, Y; Li, D; Schork, N J; Komaki, G; Ando, T; Inoko, H; Esko, T; Fischer, K; Männik, K; Metspalu, A; Baker, J H; Cone, R D; Dackor, J; DeSocio, J E; Hilliard, C E; O'Toole, J K; Pantel, J; Szatkiewicz, J P; Taico, C; Zerwas, S; Trace, S E; Davis, O S P; Helder, S; Bühren, K; Burghardt, R; de Zwaan, M; Egberts, K; Ehrlich, S; Herpertz-Dahlmann, B; Herzog, W; Imgart, H; Scherag, A; Scherag, S; Zipfel, S; Boni, C; Ramoz, N; Versini, A; Brandys, M K; Danner, U N; de Kovel, C; Hendriks, J; Koeleman, B P C; Ophoff, R A; Strengman, E; van Elburg, A A; Bruson, A; Clementi, M; Degortes, D; Forzan, M; Tenconi, E; Docampo, E; Escaramís, G; Jiménez-Murcia, S; Lissowska, J; Rajewski, A; Szeszenia-Dabrowska, N; Slopien, A; Hauser, J; Karhunen, L; Meulenbelt, I; Slagboom, P E; Tortorella, A; Maj, M; Dedoussis, G; Dikeos, D; Gonidakis, F; Tziouvas, K; Tsitsika, A; Papezova, H; Slachtova, L; Martaskova, D; Kennedy, J L; Levitan, R D; Yilmaz, Z; Huemer, J; Koubek, D; Merl, E; Wagner, G; Lichtenstein, P; Breen, G; Cohen-Woods, S; Farmer, A; McGuffin, P; Cichon, S; Giegling, I; Herms, S; Rujescu, D; Schreiber, S; Wichmann, H-E; Dina, C; Sladek, R; Gambaro, G; Soranzo, N; Julia, A; Marsal, S; Rabionet, R; Gaborieau, V; Dick, D M; Palotie, A; Ripatti, S; Widén, E; Andreassen, O A; Espeseth, T; Lundervold, A; Reinvang, I; Steen, V M; Le Hellard, S; Mattingsdal, M; Ntalla, I; Bencko, V; Foretova, L; Janout, V; Navratilova, M; Gallinger, S; Pinto, D; Scherer, S W; Aschauer, H; Carlberg, L; Schosser, A; Alfredsson, L; Ding, B; Klareskog, L; Padyukov, L; Courtet, P; Guillaume, S; Jaussent, I; Finan, C; Kalsi, G; Roberts, M; Logan, D W; Peltonen, L; Ritchie, G R S; Barrett, J C; Estivill, X; Hinney, A; Sullivan, P F; Collier, D A; Zeggini, E; Bulik, C M
Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome-wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2907 cases with AN from 14 countries (15 sites) and 14 860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery data sets. Seventy-six (72 independent) single nucleotide polymorphisms were taken forward for in silico (two data sets) or de novo (13 data sets) replication genotyping in 2677 independent AN cases and 8629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication data sets comprised 5551 AN cases and 21 080 controls. AN subtype analyses (1606 AN restricting; 1445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01 × 10(-7)) in SOX2OT and rs17030795 (P=5.84 × 10(-6)) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76 × 10(-)(6)) between CUL3 and FAM124B and rs1886797 (P=8.05 × 10(-)(6)) near SPATA13. Comparing discovery with replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4 × 10(-6)), strongly suggesting that true findings exist but our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field.
Boraska, Vesna; Franklin, Christopher S; Floyd, James AB; Thornton, Laura M; Huckins, Laura M; Southam, Lorraine; Rayner, N William; Tachmazidou, Ioanna; Klump, Kelly L; Treasure, Janet; Lewis, Cathryn M; Schmidt, Ulrike; Tozzi, Federica; Kiezebrink, Kirsty; Hebebrand, Johannes; Gorwood, Philip; Adan, Roger AH; Kas, Martien JH; Favaro, Angela; Santonastaso, Paolo; Fernández-Aranda, Fernando; Gratacos, Monica; Rybakowski, Filip; Dmitrzak-Weglarz, Monika; Kaprio, Jaakko; Keski-Rahkonen, Anna; Raevuori, Anu; Van Furth, Eric F; Landt, Margarita CT Slof-Op t; Hudson, James I; Reichborn-Kjennerud, Ted; Knudsen, Gun Peggy S; Monteleone, Palmiero; Kaplan, Allan S; Karwautz, Andreas; Hakonarson, Hakon; Berrettini, Wade H; Guo, Yiran; Li, Dong; Schork, Nicholas J.; Komaki, Gen; Ando, Tetsuya; Inoko, Hidetoshi; Esko, Tõnu; Fischer, Krista; Männik, Katrin; Metspalu, Andres; Baker, Jessica H; Cone, Roger D; Dackor, Jennifer; DeSocio, Janiece E; Hilliard, Christopher E; O'Toole, Julie K; Pantel, Jacques; Szatkiewicz, Jin P; Taico, Chrysecolla; Zerwas, Stephanie; Trace, Sara E; Davis, Oliver SP; Helder, Sietske; Bühren, Katharina; Burghardt, Roland; de Zwaan, Martina; Egberts, Karin; Ehrlich, Stefan; Herpertz-Dahlmann, Beate; Herzog, Wolfgang; Imgart, Hartmut; Scherag, André; Scherag, Susann; Zipfel, Stephan; Boni, Claudette; Ramoz, Nicolas; Versini, Audrey; Brandys, Marek K; Danner, Unna N; de Kovel, Carolien; Hendriks, Judith; Koeleman, Bobby PC; Ophoff, Roel A; Strengman, Eric; van Elburg, Annemarie A; Bruson, Alice; Clementi, Maurizio; Degortes, Daniela; Forzan, Monica; Tenconi, Elena; Docampo, Elisa; Escaramís, Geòrgia; Jiménez-Murcia, Susana; Lissowska, Jolanta; Rajewski, Andrzej; Szeszenia-Dabrowska, Neonila; Slopien, Agnieszka; Hauser, Joanna; Karhunen, Leila; Meulenbelt, Ingrid; Slagboom, P Eline; Tortorella, Alfonso; Maj, Mario; Dedoussis, George; Dikeos, Dimitris; Gonidakis, Fragiskos; Tziouvas, Konstantinos; Tsitsika, Artemis; Papezova, Hana; Slachtova, Lenka; Martaskova, Debora; Kennedy, James L.; Levitan, Robert D.; Yilmaz, Zeynep; Huemer, Julia; Koubek, Doris; Merl, Elisabeth; Wagner, Gudrun; Lichtenstein, Paul; Breen, Gerome; Cohen-Woods, Sarah; Farmer, Anne; McGuffin, Peter; Cichon, Sven; Giegling, Ina; Herms, Stefan; Rujescu, Dan; Schreiber, Stefan; Wichmann, H-Erich; Dina, Christian; Sladek, Rob; Gambaro, Giovanni; Soranzo, Nicole; Julia, Antonio; Marsal, Sara; Rabionet, Raquel; Gaborieau, Valerie; Dick, Danielle M; Palotie, Aarno; Ripatti, Samuli; Widén, Elisabeth; Andreassen, Ole A; Espeseth, Thomas; Lundervold, Astri; Reinvang, Ivar; Steen, Vidar M; Le Hellard, Stephanie; Mattingsdal, Morten; Ntalla, Ioanna; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Navratilova, Marie; Gallinger, Steven; Pinto, Dalila; Scherer, Stephen; Aschauer, Harald; Carlberg, Laura; Schosser, Alexandra; Alfredsson, Lars; Ding, Bo; Klareskog, Lars; Padyukov, Leonid; Finan, Chris; Kalsi, Gursharan; Roberts, Marion; Logan, Darren W; Peltonen, Leena; Ritchie, Graham RS; Barrett, Jeffrey C; Estivill, Xavier; Hinney, Anke; Sullivan, Patrick F; Collier, David A; Zeggini, Eleftheria; Bulik, Cynthia M
Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2,907 cases with AN from 14 countries (15 sites) and 14,860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery datasets. Seventy-six (72 independent) SNPs were taken forward for in silico (two datasets) or de novo (13 datasets) replication genotyping in 2,677 independent AN cases and 8,629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication datasets comprised 5,551 AN cases and 21,080 controls. AN subtype analyses (1,606 AN restricting; 1,445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01×10-7) in SOX2OT and rs17030795 (P=5.84×10-6) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76×10-6) between CUL3 and FAM124B and rs1886797 (P=8.05×10-6) near SPATA13. Comparing discovery to replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4×10-6), strongly suggesting that true findings exist but that our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field. PMID:24514567
Jin, Eun-Heui; Zhang, Enji; Ko, Youngkwon; Sim, Woo Seog; Moon, Dong Eon; Yoon, Keon Jung; Hong, Jang Hee; Lee, Won Hyung
Complex regional pain syndrome (CRPS) is a chronic, progressive, and devastating pain syndrome characterized by spontaneous pain, hyperalgesia, allodynia, altered skin temperature, and motor dysfunction. Although previous gene expression profiling studies have been conducted in animal pain models, there genome-wide expression profiling in the whole blood of CRPS patients has not been reported yet. Here, we successfully identified certain pain-related genes through genome-wide expression profiling in the blood from CRPS patients. We found that 80 genes were differentially expressed between 4 CRPS patients (2 CRPS I and 2 CRPS II) and 5 controls (cut-off value: 1.5-fold change and p<0.05). Most of those genes were associated with signal transduction, developmental processes, cell structure and motility, and immunity and defense. The expression levels of major histocompatibility complex class I A subtype (HLA-A29.1), matrix metalloproteinase 9 (MMP9), alanine aminopeptidase N (ANPEP), l-histidine decarboxylase (HDC), granulocyte colony-stimulating factor 3 receptor (G-CSF3R), and signal transducer and activator of transcription 3 (STAT3) genes selected from the microarray were confirmed in 24 CRPS patients and 18 controls by quantitative reverse transcription-polymerase chain reaction (qRT-PCR). We focused on the MMP9 gene that, by qRT-PCR, showed a statistically significant difference in expression in CRPS patients compared to controls with the highest relative fold change (4.0±1.23 times and p = 1.4×10−4). The up-regulation of MMP9 gene in the blood may be related to the pain progression in CRPS patients. Our findings, which offer a valuable contribution to the understanding of the differential gene expression in CRPS may help in the understanding of the pathophysiology of CRPS pain progression. PMID:24244504
Hayes, Ben; Goddard, Mike
Results from genome-wide association studies in livestock, and humans, has lead to the conclusion that the effect of individual quantitative trait loci (QTL) on complex traits, such as yield, are likely to be small; therefore, a large number of QTL are necessary to explain genetic variation in these traits. Given this genetic architecture, gains from marker-assisted selection (MAS) programs using only a small number of DNA markers to trace a limited number of QTL is likely to be small. This has lead to the development of alternative technology for using the available dense single nucleotide polymorphism (SNP) information, called genomic selection. Genomic selection uses a genome-wide panel of dense markers so that all QTL are likely to be in linkage disequilibrium with at least one SNP. The genomic breeding values are predicted to be the sum of the effect of these SNPs across the entire genome. In dairy cattle breeding, the accuracy of genomic estimated breeding values (GEBV) that can be achieved and the fact that these are available early in life have lead to rapid adoption of the technology. Here, we discuss the design of experiments necessary to achieve accurate prediction of GEBV in future generations in terms of the number of markers necessary and the size of the reference population where marker effects are estimated. We also present a simple method for implementing genomic selection using a genomic relationship matrix. Future challenges discussed include using whole genome sequence data to improve the accuracy of genomic selection and management of inbreeding through genomic relationships.
Begum, Ferdouse; Chowdhury, Reshmi; Cheung, Vivian G.; Sherman, Stephanie L.; Feingold, Eleanor
Meiotic recombination is an essential step in gametogenesis, and is one that also generates genetic diversity. Genome-wide association studies (GWAS) and molecular studies have identified genes that influence of human meiotic recombination. RNF212 is associated with total or average number of recombination events, and PRDM9 is associated with the locations of hotspots, or sequences where crossing over appears to cluster. In addition, a common inversion on chromosome 17 is strongly associated with recombination. Other genes have been identified by GWAS, but those results have not been replicated. In this study, using new datasets, we characterized additional recombination phenotypes to uncover novel candidates and further dissect the role of already known loci. We used three datasets totaling 1562 two-generation families, including 3108 parents with 4304 children. We estimated five different recombination phenotypes including two novel phenotypes (average recombination counts within recombination hotspots and outside of hotspots) using dense SNP array genotype data. We then performed gender-specific and combined-sex genome-wide association studies (GWAS) meta-analyses. We replicated associations for several previously reported recombination genes, including RNF212 and PRDM9. By looking specifically at recombination events outside of hotspots, we showed for the first time that PRDM9 has different effects in males and females. We identified several new candidate loci, particularly for recombination events outside of hotspots. These include regions near the genes SPINK6, EVC2, ARHGAP25, and DLGAP2. This study expands our understanding of human meiotic recombination by characterizing additional features that vary across individuals, and identifying regulatory variants influencing the numbers and locations of recombination events. PMID:27733454
Begum, Ferdouse; Chowdhury, Reshmi; Cheung, Vivian G; Sherman, Stephanie L; Feingold, Eleanor
Meiotic recombination is an essential step in gametogenesis, and is one that also generates genetic diversity. Genome-wide association studies (GWAS) and molecular studies have identified genes that influence of human meiotic recombination. RNF212 is associated with total or average number of recombination events, and PRDM9 is associated with the locations of hotspots, or sequences where crossing over appears to cluster. In addition, a common inversion on chromosome 17 is strongly associated with recombination. Other genes have been identified by GWAS, but those results have not been replicated. In this study, using new datasets, we characterized additional recombination phenotypes to uncover novel candidates and further dissect the role of already known loci. We used three datasets totaling 1562 two-generation families, including 3108 parents with 4304 children. We estimated five different recombination phenotypes including two novel phenotypes (average recombination counts within recombination hotspots and outside of hotspots) using dense SNP array genotype data. We then performed gender-specific and combined-sex genome-wide association studies (GWAS) meta-analyses. We replicated associations for several previously reported recombination genes, including RNF212 and PRDM9 By looking specifically at recombination events outside of hotspots, we showed for the first time that PRDM9 has different effects in males and females. We identified several new candidate loci, particularly for recombination events outside of hotspots. These include regions near the genes SPINK6, EVC2, ARHGAP25, and DLGAP2 This study expands our understanding of human meiotic recombination by characterizing additional features that vary across individuals, and identifying regulatory variants influencing the numbers and locations of recombination events.
Watters, James W; Kraja, Aldi; Meucci, Melissa A; Province, Michael A; McLeod, Howard L
Little is known about the heritability of chemotherapy activity or the identity of genes that may enable the individualization of cancer chemotherapy. Although numerous genes are likely to influence chemotherapy response, current candidate gene-based pharmacogenetics approaches require a priori knowledge and the selection of a small number of candidate genes for hypothesis testing. In this study, an ex vivo familial genetics strategy using lymphoblastoid cells derived from Centre d'Etude du Polymorphisme Humain reference pedigrees was used to discover genetic determinants of chemotherapy cytotoxicity. Cytotoxicity to the mechanistically distinct chemotherapy agents 5-fluorouracil and docetaxel were shown to be heritable traits, with heritability values ranging from 0.26 to 0.65 for 5-fluorouracil and 0.21 to 0.70 for docetaxel, varying with dose. Genome-wide linkage analysis was also used to map a quantitative trait locus influencing the cellular effects of 5-fluorouracil to chromosome 9q13-q22 [logarithm of odds (LOD) = 3.44], and two quantitative trait loci influencing the cellular effects of docetaxel to chromosomes 5q11-21 (LOD = 2.21) and 9q13-q22 (LOD = 2.73). Finally, 5-fluorouracil and docetaxel were shown to cause apoptotic cell death involving caspase-3 cleavage in Centre d'Etude du Polymorphisme Humain lymphoblastoid cells. This study identifies genomic regions likely to harbor genes important for chemotherapy cytotoxicity using genome-wide linkage analysis in human pedigrees and provides a widely applicable strategy for pharmacogenomic discovery without the requirement for a priori candidate gene selection.
Kanazawa, Tetsufumi; Ikeda, Masashi; Glatt, Stephen J; Tsutsumi, Atsushi; Kikuyama, Hiroki; Kawamura, Yoshiya; Nishida, Nao; Miyagawa, Taku; Hashimoto, Ryota; Takeda, Masatoshi; Sasaki, Tsukasa; Tokunaga, Katsushi; Koh, Jun; Iwata, Nakao; Yoneda, Hiroshi
Atypical psychosis with a periodic course of exacerbation and features of major psychiatric disorders [schizophrenia (SZ) and bipolar disorder (BD)] has a long history in clinical psychiatry in Japan. Based upon the new criteria of atypical psychosis, a Genome-Wide Association Study (GWAS) was conducted to identify the risk gene or variants. The relationships between atypical psychosis, SZ and BD were then assessed using independent GWAS data. Forty-seven patients with solid criteria of atypical psychosis and 882 normal controls (NCs) were scanned using an Affymetrics 6.0 chip. GWAS SZ data (560 SZ cases and 548 NCs) and GWAS BD (107 cases with BD type 1 and 107 NCs) were compared using gene-based analysis. The most significant SNPs were detected around the CHN2/CPVL genes (rs245914, P = 1.6 × 10(-7)) , COL21A1 gene (rs12196860, P = 2.45 × 10(-7) ), and PYGL/TRIM9 genes (rs1959536, P = 7.73 × 10(-7) ), although none of the single-nucleotide polymorphisms exhibited genome-wide significance (P = 5 × 10(-8) ). One of the highest peaks was detected on the major histocompatibility complex region, where large SZ GWASs have previously disclosed an association. The gene-based analysis suggested significant enrichment between SZ and atypical psychosis (P = 0.01), but not BD. This study provides clues about the types of patient whose diagnosis lies between SZ and BD. Studies with larger samples are required to determine the causal variant.
Ayers, Stephen; Switnicki, Michal Piotr; Angajala, Anusha; Lammel, Jan; Arumanayagam, Anithachristy S.; Webb, Paul
Thyroid hormone (TH) receptors (TRs) play central roles in metabolism and are major targets for pharmaceutical intervention. Presently, however, there is limited information about genome wide localizations of TR binding sites. Thus, complexities of TR genomic distribution and links between TRβ binding events and gene regulation are not fully appreciated. Here, we employ a BioChIP approach to capture TR genome-wide binding events in a liver cell line (HepG2). Like other NRs, TRβ appears widely distributed throughout the genome. Nevertheless, there is striking enrichment of TRβ binding sites immediately 5′ and 3′ of transcribed genes and TRβ can be detected near 50% of T3 induced genes. In contrast, no significant enrichment of TRβ is seen at negatively regulated genes or genes that respond to unliganded TRs in this system. Canonical TRE half-sites are present in more than 90% of TRβ peaks and classical TREs are also greatly enriched, but individual TRE organization appears highly variable with diverse half-site orientation and spacing. There is also significant enrichment of binding sites for TR associated transcription factors, including AP-1 and CTCF, near TR peaks. We conclude that T3-dependent gene induction commonly involves proximal TRβ binding events but that far-distant binding events are needed for T3 induction of some genes and that distinct, indirect, mechanisms are often at play in negative regulation and unliganded TR actions. Better understanding of genomic context of TR binding sites will help us determine why TR regulates genes in different ways and determine possibilities for selective modulation of TR action. PMID:24558356
Parker Beard, Jeannie
When college composition teachers carefully consider the role and function of multimodal composition in their classrooms, they can enhance the teaching of writing and communication, engage and empower students, and better prepare students for the challenges and possibilities of life in our rapidly changing digital age. To meet this teaching…
Avramopoulos, Dimitrios; Pearce, Brad D; McGrath, John; Wolyniec, Paula; Wang, Ruihua; Eckart, Nicole; Hatzimanolis, Alexandros; Goes, Fernando S; Nestadt, Gerald; Mulle, Jennifer; Coneely, Karen; Hopkins, Myfanwy; Ruczinski, Ingo; Yolken, Robert; Pulver, Ann E
Inflammation and maternal or fetal infections have been suggested as risk factors for schizophrenia (SZ) and bipolar disorder (BP). It is likely that such environmental effects are contingent on genetic background. Here, in a genome-wide approach, we test the hypothesis that such exposures increase the risk for SZ and BP and that the increase is dependent on genetic variants. We use genome-wide genotype data, plasma IgG antibody measurements against Toxoplasma gondii, Herpes simplex virus type 1, Cytomegalovirus, Human Herpes Virus 6 and the food antigen gliadin as well as measurements of C-reactive protein (CRP), a peripheral marker of inflammation. The subjects are SZ cases, BP cases, parents of cases and screened controls. We look for higher levels of our immunity/infection variables and interactions between them and common genetic variation genome-wide. We find many of the antibody measurements higher in both disorders. While individual tests do not withstand correction for multiple comparisons, the number of nominally significant tests and the comparisons showing the expected direction are in significant excess (permutation p=0.019 and 0.004 respectively). We also find CRP levels highly elevated in SZ, BP and the mothers of BP cases, in agreement with existing literature, but possibly confounded by our inability to correct for smoking or body mass index. In our genome-wide interaction analysis no signal reached genome-wide significance, yet many plausible candidate genes emerged. In a hypothesis driven test, we found multiple interactions among SZ-associated SNPs in the HLA region on chromosome 6 and replicated an interaction between CMV infection and genotypes near the CTNNA3 gene reported by a recent GWAS. Our results support that inflammatory processes and infection may modify the risk for psychosis and suggest that the genotype at SZ-associated HLA loci modifies the effect of these variables on the risk to develop SZ.
Zhai, Rihong; Zhao, Yang; Su, Li; Cassidy, Lauren; Liu, Geoffrey; Christiani, David C
Aberrant DNA methylation (DNAm) is a feature of most types of cancers. Genome-wide DNAm profiling has been performed successfully on tumor tissue DNA samples. However, the invasive procedure limits the utility of tumor tissue for epidemiological studies. While recent data indicate that cell-free circulating DNAm (cfDNAm) profiles reflect DNAm status in corresponding tumor tissues, no studies have examined the association of cfDNAm with cancer or precursors on a genome-wide scale. The objective of this pilot study was to evaluate the putative significance of genome-wide cfDNAm profiles in esophageal adenocarcinoma (EA) and Barrett esophagus (BE, EA precursor). We performed genome-wide DNAm profiling in EA tissue DNA (n = 8) and matched serum DNA (n = 8), in serum DNA of BE (n = 10), and in healthy controls (n = 10) using the Infinium HumanMethylation27 BeadChip that covers 27,578 CpG loci in 14,495 genes. We found that cfDNAm profiles were highly correlated to DNAm profiles in matched tumor tissue DNA (r = 0.92) in patients with EA. We selected the most differentially methylated loci to perform hierarchical clustering analysis. We found that 911 loci can discriminate perfectly between EA and control samples, 554 loci can separate EA from BE samples, and 46 loci can distinguish BE from control samples. These results suggest that genome-wide cfDNAm profiles are highly consistent with DNAm profiles detected in corresponding tumor tissues. Differential cfDNAm profiling may be a useful approach for the noninvasive screening of EA and EA premalignant lesions.
Use of 10,129 singleton SNPs of known genomic location in tetraploid cotton provided unique opportunities to characterize genome-wide diversity among 440 Gossypium hirsutum and 219 G. barbadense cultivars and landrace accessions of widespread origin. Using genome-wide distributed SNPs, we examined ...
Background A popular objective of many high-throughput genome projects is to discover various genomic markers associated with traits and develop statistical models to predict traits of future patients based on marker values. Results In this paper, we present a prediction method for time-to-event traits using genome-wide single-nucleotide polymorphisms (SNPs). We also propose a MaxTest associating between a time-to-event trait and a SNP accounting for its possible genetic models. The proposed MaxTest can help screen out nonprognostic SNPs and identify genetic models of prognostic SNPs. The performance of the proposed method is evaluated through simulations. Conclusions In conjunction with the MaxTest, the proposed method provides more parsimonious prediction models but includes more prognostic SNPs than some naive prediction methods. The proposed method is demonstrated with real GWAS data. PMID:23418752
Reich, T; Edenberg, H J; Goate, A; Williams, J T; Rice, J P; Van Eerdewegh, P; Foroud, T; Hesselbrock, V; Schuckit, M A; Bucholz, K; Porjesz, B; Li, T K; Conneally, P M; Nurnberger, J I; Tischfield, J A; Crowe, R R; Cloninger, C R; Wu, W; Shears, S; Carr, K; Crose, C; Willig, C; Begleiter, H
Alcohol dependence is a leading cause of morbidity and premature death. Several lines of evidence suggest a substantial genetic component to the risk for alcoholism: sibs of alcoholic probands have a 3-8 fold increased risk of also developing alcoholism, and twin heritability estimates of 50-60% are reported by contemporary studies of twins. We report on the results of a six-center collaborative study to identify susceptibility loci for alcohol dependence. A genome-wide screen examined 291 markers in 987 individuals from 105 families. Two-point and multipoint nonparametric linkage analyses were performed to detect susceptibility loci for alcohol dependence. Multipoint methods provided the strongest suggestions of linkage with susceptibility loci for alcohol dependence on chromosomes 1 and 7, and more modest evidence for a locus on chromosome 2. In addition, there was suggestive evidence for a protective locus on chromosome 4 near the alcohol dehydrogenase genes, for which protective effects have been reported in Asian populations.
Fernandez-Rozadilla, C; Cazier, J B; Moreno, V; Crous-Bou, M; Guinó, E; Durán, G; Lamas, M J; López, R; Candamio, S; Gallardo, E; Paré, L; Baiget, M; Páez, D; López-Fernández, L A; Cortejoso, L; García, M I; Bujanda, L; González, D; Gonzalo, V; Rodrigo, L; Reñé, J M; Jover, R; Brea-Fernández, A; Andreu, M; Bessa, X; Llor, X; Xicola, R; Palles, C; Tomlinson, I; Castellví-Bel, S; Castells, A; Ruiz-Ponte, C; Carracedo, A
The development of genotyping technologies has allowed for wider screening for inherited causes of variable outcomes following drug administration. We have performed a genome-wide association study (GWAS) on 221 colorectal cancer (CRC) patients that had been treated with 5-fluorouracil (5-FU), either alone or in combination with oxaliplatin (FOLFOX). A validation set of 791 patients was also studied. Seven SNPs (rs16857540, rs2465403, rs10876844, rs10784749, rs17626122, rs7325568 and rs4243761) showed evidence of association (pooled P-values 0.020, 9.426E-03, 0.010, 0.017, 0.042, 2.302E-04, 2.803E-03) with adverse drug reactions (ADRs). This is the first study to explore the genetic basis of inter-individual variation in toxicity responses to the administration of 5-FU or FOLFOX in CRC patients on a genome-wide scale.
Kim, Jin Il; Park, Sehee; Lee, Ilseob; Park, Kwang Sook; Kwak, Eun Jung; Moon, Kwang Mee; Lee, Chang Kyu; Bae, Joon-Yong; Park, Man-Seong; Song, Ki-Joon
Human metapneumovirus (HMPV) has been described as an important etiologic agent of upper and lower respiratory tract infections, especially in young children and the elderly. Most of school-aged children might be introduced to HMPVs, and exacerbation with other viral or bacterial super-infection is common. However, our understanding of the molecular evolution of HMPVs remains limited. To address the comprehensive evolutionary dynamics of HMPVs, we report a genome-wide analysis of the eight genes (N, P, M, F, M2, SH, G, and L) using 103 complete genome sequences. Phylogenetic reconstruction revealed that the eight genes from one HMPV strain grouped into the same genetic group among the five distinct lineages (A1, A2a, A2b, B1, and B2). A few exceptions of phylogenetic incongruence might suggest past recombination events, and we detected possible recombination breakpoints in the F, SH, and G coding regions. The five genetic lineages of HMPVs shared quite remote common ancestors ranging more than 220 to 470 years of age with the most recent origins for the A2b sublineage. Purifying selection was common, but most protein genes except the F and M2-2 coding regions also appeared to experience episodic diversifying selection. Taken together, these suggest that the five lineages of HMPVs maintain their individual evolutionary dynamics and that recombination and selection forces might work on shaping the genetic diversity of HMPVs. PMID:27046055
Fernandez, Christian A.; Smith, Colton; Yang, Wenjian; Mullighan, Charles G.; Qu, Chunxu; Larsen, Eric; Bowman, W. Paul; Liu, Chengcheng; Ramsey, Laura B.; Chang, Tamara; Karol, Seth E.; Loh, Mignon L.; Raetz, Elizabeth A.; Winick, Naomi J.; Hunger, Stephen P.; Carroll, William L.; Jeha, Sima; Pui, Ching-Hon; Evans, William E.; Devidas, Meenakshi
Asparaginase is used to treat acute lymphoblastic leukemia (ALL); however, hypersensitivity reactions can lead to suboptimal asparaginase exposure. Our objective was to use a genome-wide approach to identify loci associated with asparaginase hypersensitivity in children with ALL enrolled on St. Jude Children’s Research Hospital (SJCRH) protocols Total XIIIA (n = 154), Total XV (n = 498), and Total XVI (n = 271), or Children’s Oncology Group protocols POG 9906 (n = 222) and AALL0232 (n = 2163). Germline DNA was genotyped using the Affymetrix 500K, Affymetrix 6.0, or the Illumina Exome BeadChip array. In multivariate logistic regression, the intronic rs6021191 variant in nuclear factor of activated T cells 2 (NFATC2) had the strongest association with hypersensitivity (P = 4.1 × 10−8; odds ratio [OR] = 3.11). RNA-seq data available from 65 SJCRH ALL tumor samples and 52 Yoruba HapMap samples showed that samples carrying the rs6021191 variant had higher NFATC2 expression compared with noncarriers (P = 1.1 × 10−3 and 0.03, respectively). The top ranked nonsynonymous polymorphism was rs17885382 in HLA-DRB1 (P = 3.2 × 10−6; OR = 1.63), which is in near complete linkage disequilibrium with the HLA-DRB1*07:01 allele we previously observed in a candidate gene study. The strongest risk factors for asparaginase allergy are variants within genes regulating the immune response. PMID:25987655
Moorjani, Priya; Patterson, Nick; Loh, Po-Ru; Lipson, Mark; Kisfali, Péter; Melegh, Bela I.; Bonin, Michael; Kádaši, Ľudevít; Rieß, Olaf; Berger, Bonnie; Reich, David; Melegh, Béla
The Roma people, living throughout Europe and West Asia, are a diverse population linked by the Romani language and culture. Previous linguistic and genetic studies have suggested that the Roma migrated into Europe from South Asia about 1,000–1,500 years ago. Genetic inferences about Roma history have mostly focused on the Y chromosome and mitochondrial DNA. To explore what additional information can be learned from genome-wide data, we analyzed data from six Roma groups that we genotyped at hundreds of thousands of single nucleotide polymorphisms (SNPs). We estimate that the Roma harbor about 80% West Eurasian ancestry–derived from a combination of European and South Asian sources–and that the date of admixture of South Asian and European ancestry was about 850 years before present. We provide evidence for Eastern Europe being a major source of European ancestry, and North-west India being a major source of the South Asian ancestry in the Roma. By computing allele sharing as a measure of linkage disequilibrium, we estimate that the migration of Roma out of the Indian subcontinent was accompanied by a severe founder event, which appears to have been followed by a major demographic expansion after the arrival in Europe. PMID:23516520
Quintales, Luis; Vázquez, Enrique; Antequera, Francisco
Nucleosomes contribute to compacting the genome into the nucleus and regulate the physical access of regulatory proteins to DNA either directly or through the epigenetic modifications of the histone tails. Precise mapping of nucleosome positioning across the genome is, therefore, essential to understanding the genome regulation. In recent years, several experimental protocols have been developed for this purpose that include the enzymatic digestion, chemical cleavage or immunoprecipitation of chromatin followed by next-generation sequencing of the resulting DNA fragments. Here, we compare the performance and resolution of these methods from the initial biochemical steps through the alignment of the millions of short-sequence reads to a reference genome to the final computational analysis to generate genome-wide maps of nucleosome occupancy. Because of the lack of a unified protocol to process data sets obtained through the different approaches, we have developed a new computational tool (NUCwave), which facilitates their analysis, comparison and assessment and will enable researchers to choose the most suitable method for any particular purpose. NUCwave is freely available at http://nucleosome.usal.es/nucwave along with a step-by-step protocol for its use.
Larsson, Ola; Sonenberg, Nahum; Nadon, Robert
Regulation of gene expression through translational control is a fundamental mechanism implicated in many biological processes ranging from memory formation to innate immunity and whose dysregulation contributes to human diseases. Genome wide analyses of translational control strive to identify differential translation independent of cytosolic mRNA levels. For this reason, most studies measure genes' translation levels as log ratios (translation levels divided by corresponding cytosolic mRNA levels obtained in parallel). Counterintuitively, arising from a mathematical necessity, these log ratios tend to be highly correlated with the cytosolic mRNA levels. Accordingly, they do not effectively correct for cytosolic mRNA level and generate substantial numbers of biological false positives and false negatives. We show that analysis of partial variance, which produces estimates of translational activity that are independent of cytosolic mRNA levels, is a superior alternative. When combined with a variance shrinkage method for estimating error variance, analysis of partial variance has the additional benefit of having greater statistical power and identifying fewer genes as translationally regulated resulting merely from unrealistically low variance estimates rather than from large changes in translational activity. In contrast to log ratios, this formal analytical approach estimates translation effects in a statistically rigorous manner, eliminates the need for inefficient and error-prone heuristics, and produces results that agree with biological function. The method is applicable to datasets obtained from both the commonly used polysome microarray method and the sequencing-based ribosome profiling method.
Kim, Jin Il; Park, Sehee; Lee, Ilseob; Park, Kwang Sook; Kwak, Eun Jung; Moon, Kwang Mee; Lee, Chang Kyu; Bae, Joon-Yong; Park, Man-Seong; Song, Ki-Joon
Human metapneumovirus (HMPV) has been described as an important etiologic agent of upper and lower respiratory tract infections, especially in young children and the elderly. Most of school-aged children might be introduced to HMPVs, and exacerbation with other viral or bacterial super-infection is common. However, our understanding of the molecular evolution of HMPVs remains limited. To address the comprehensive evolutionary dynamics of HMPVs, we report a genome-wide analysis of the eight genes (N, P, M, F, M2, SH, G, and L) using 103 complete genome sequences. Phylogenetic reconstruction revealed that the eight genes from one HMPV strain grouped into the same genetic group among the five distinct lineages (A1, A2a, A2b, B1, and B2). A few exceptions of phylogenetic incongruence might suggest past recombination events, and we detected possible recombination breakpoints in the F, SH, and G coding regions. The five genetic lineages of HMPVs shared quite remote common ancestors ranging more than 220 to 470 years of age with the most recent origins for the A2b sublineage. Purifying selection was common, but most protein genes except the F and M2-2 coding regions also appeared to experience episodic diversifying selection. Taken together, these suggest that the five lineages of HMPVs maintain their individual evolutionary dynamics and that recombination and selection forces might work on shaping the genetic diversity of HMPVs.
Kang, Yang Jae; Bae, Ahra; Shim, Sangrea; Lee, Taeyoung; Lee, Jayern; Satyawan, Dani; Kim, Moon Young; Lee, Suk-Ha
DNA methylation on cytosine residues is known to affect gene expression and is potentially responsible for the phenotypic variations among different crop cultivars. Here, we present the whole-genome DNA methylation profiles and assess the potential effects of single nucleotide polymorphisms (SNPs) for two mungbean cultivars, Sunhwanogdu (VC1973A) and Kyunggijaerae#5 (V2984). By measuring the DNA methylation levels in leaf tissue with the bisulfite sequencing (BSseq) approach, we show both the frequencies of the various types of DNA methylation and the distribution of weighted gene methylation levels. SNPs that cause nucleotide changes from/to CHH – where C is cytosine and H is any other nucleotide – were found to affect DNA methylation status in VC1973A and V2984. In order to better understand the correlation between gene expression and DNA methylation levels, we surveyed gene expression in leaf tissues of VC1973A and V2984 using RNAseq. Transcript expressions of paralogous genes were controlled by DNA methylation within the VC1973A genome. Moreover, genes that were differentially expressed between the two cultivars showed distinct DNA methylation patterns. Our mungbean genome-wide methylation profiles will be valuable resources for understanding the phenotypic variations between different cultivars, as well as for molecular breeding. PMID:28084412
Cao, Jiaxve; Wu, Mingming; Ma, Xiaomeng; Liu, Zhen; Liu, Ruizao; Zhao, Fuping; Wei, Caihong; Du, Lixin
Background Commercial sheep raised for mutton grow faster than traditional Chinese sheep breeds. Here, we aimed to evaluate genetic selection among three different types of sheep breed: two well-known commercial mutton breeds and one indigenous Chinese breed. Results We first combined locus-specific branch lengths and di statistical methods to detect candidate regions targeted by selection in the three different populations. The results showed that the genetic distances reached at least medium divergence for each pairwise combination. We found these two methods were highly correlated, and identified many growth-related candidate genes undergoing artificial selection. For production traits, APOBR and FTO are associated with body mass index. For meat traits, ALDOA, STK32B and FAM190A are related to marbling. For reproduction traits, CCNB2 and SLC8A3 affect oocyte development. We also found two well-known genes, GHR (which affects meat production and quality) and EDAR (associated with hair thickness) were associated with German mutton merino sheep. Furthermore, four genes (POL, RPL7, MSL1 and SHISA9) were associated with pre-weaning gain in our previous genome-wide association study. Conclusions Our results indicated that combine locus-specific branch lengths and di statistical approaches can reduce the searching ranges for specific selection. And we got many credible candidate genes which not only confirm the results of previous reports, but also provide a suite of novel candidate genes in defined breeds to guide hybridization breeding. PMID:26083354
Hashimoto, Tatsunori; Sherwood, Richard I.; Kang, Daniel D.; Rajagopal, Nisha; Barkal, Amira A.; Zeng, Haoyang; Emons, Bart J.M.; Srinivasan, Sharanya; Jaakkola, Tommi; Gifford, David K.
Enhancers and promoters commonly occur in accessible chromatin characterized by depleted nucleosome contact; however, it is unclear how chromatin accessibility is governed. We show that log-additive cis-acting DNA sequence features can predict chromatin accessibility at high spatial resolution. We develop a new type of high-dimensional machine learning model, the Synergistic Chromatin Model (SCM), which when trained with DNase-seq data for a cell type is capable of predicting expected read counts of genome-wide chromatin accessibility at every base from DNA sequence alone, with the highest accuracy at hypersensitive sites shared across cell types. We confirm that a SCM accurately predicts chromatin accessibility for thousands of synthetic DNA sequences using a novel CRISPR-based method of highly efficient site-specific DNA library integration. SCMs are directly interpretable and reveal that a logic based on local, nonspecific synergistic effects, largely among pioneer TFs, is sufficient to predict a large fraction of cellular chromatin accessibility in a wide variety of cell types. PMID:27456004
Ott, Felix; Weigel, Detlef; Bowman, John L.; Heisler, Marcus G.; Wenkel, Stephan
Plant organ development and polarity establishment is mediated by the action of several transcription factors. Among these, the KANADI (KAN) subclade of the GARP protein family plays important roles in polarity-associated processes during embryo, shoot and root patterning. In this study, we have identified a set of potential direct target genes of KAN1 through a combination of chromatin immunoprecipitation/DNA sequencing (ChIP-Seq) and genome-wide transcriptional profiling using tiling arrays. Target genes are over-represented for genes involved in the regulation of organ development as well as in the response to auxin. KAN1 affects directly the expression of several genes previously shown to be important in the establishment of polarity during lateral organ and vascular tissue development. We also show that KAN1 controls through its target genes auxin effects on organ development at different levels: transport and its regulation, and signaling. In addition, KAN1 regulates genes involved in the response to abscisic acid, jasmonic acid, brassinosteroids, ethylene, cytokinins and gibberellins. The role of KAN1 in organ polarity is antagonized by HD-ZIPIII transcription factors, including REVOLUTA (REV). A comparison of their target genes reveals that the REV/KAN1 module acts in organ patterning through opposite regulation of shared targets. Evidence of mutual repression between closely related family members is also shown. PMID:24155946
Liu, Yong-Jun; Zhang, Lei; Papasian, Christopher J.
In the past few years, the bone field has witnessed great advances in genome-wide association studies (GWASs) of osteoporosis, with a number of promising genes identified. In particular, meta-analysis of GWASs, aimed at increasing the power of studies by combining the results from different study populations, have led to the identification of novel associations that would not otherwise have been identified in individual GWASs. Recently, the first whole genome sequencing study for osteoporosis and fractures was published, reporting a novel rare nonsense mutation. This review summarizes the important and representative findings published by December 2013. Comments are made on the notable findings and representative studies for their potential influence and implications on our present understanding of the genetics of osteoporosis. Potential limitations of GWASs and their meta-analyses are evaluated, with an emphasis on understanding the reasons for inconsistent results between different studies and clarification of misinterpretation of GWAS meta-analysis results. Implications and challenges of GWAS are also discussed, including the need for multi- and inter-disciplinary studies. PMID:25006567
Patra, Biranchi; Kon, Yoshiko; Yadav, Gitanjali; Sevold, Anthony W.; Frumkin, Jesse P.; Vallabhajosyula, Ravishankar R.; Hintze, Arend; Østman, Bjørn; Schossau, Jory; Bhan, Ashish; Marzolf, Bruz; Tamashiro, Jenna K.; Kaur, Amardeep; Baliga, Nitin S.; Grayhack, Elizabeth J.; Adami, Christoph; Galas, David J.; Raval, Alpan; Phizicky, Eric M.; Ray, Animesh
Genomic robustness is the extent to which an organism has evolved to withstand the effects of deleterious mutations. We explored the extent of genomic robustness in budding yeast by genome wide dosage suppressor analysis of 53 conditional lethal mutations in cell division cycle and RNA synthesis related genes, revealing 660 suppressor interactions of which 642 are novel. This collection has several distinctive features, including high co-occurrence of mutant-suppressor pairs within protein modules, highly correlated functions between the pairs and higher diversity of functions among the co-suppressors than previously observed. Dosage suppression of essential genes encoding RNA polymerase subunits and chromosome cohesion complex suggests a surprising degree of functional plasticity of macromolecular complexes, and the existence of numerous degenerate pathways for circumventing the effects of potentially lethal mutations. These results imply that organisms and cancer are likely able to exploit the genomic robustness properties, due the persistence of cryptic gene and pathway functions, to generate variation and adapt to selective pressures. PMID:27899637
Sanchez, Robersy; Mackenzie, Sally A.
Cytosine DNA methylation (CDM) is a highly abundant, heritable but reversible chemical modification to the genome. Herein, a machine learning approach was applied to analyze the accumulation of epigenetic marks in methylomes of 152 ecotypes and 85 silencing mutants of Arabidopsis thaliana. In an information-thermodynamics framework, two measurements were used: (1) the amount of information gained/lost with the CDM changes IR and (2) the uncertainty of not observing a SNP LCR. We hypothesize that epigenetic marks are chromosomal footprints accounting for different ontogenetic and phylogenetic histories of individual populations. A machine learning approach is proposed to verify this hypothesis. Results support the hypothesis by the existence of discriminatory information (DI) patterns of CDM able to discriminate between individuals and between individual subpopulations. The statistical analyses revealed a strong association between the topologies of the structured population of Arabidopsis ecotypes based on IR and on LCR, respectively. A statistical-physical relationship between IR and LCR was also found. Results to date imply that the genome-wide distribution of CDM changes is not only part of the biological signal created by the methylation regulatory machinery, but ensures the stability of the DNA molecule, preserving the integrity of the genetic message under continuous stress from thermal fluctuations in the cell environment. PMID:27322251
Tchurikov, Nickolai A.; Kretova, Olga V.; Sosin, Dmitri V.; Zykov, Ivan A.; Zhimulev, Igor F.; Kravatsky, Yuri V.
Forum domains are stretches of chromosomal DNA that are excised from eukaryotic chromosomes during their spontaneous non-random fragmentation. Most forum domains are 50–200 kb in length. We mapped forum domain termini using FISH on polytene chromosomes and we performed genome-wide mapping using a Drosophila melanogaster genomic tiling microarray consisting of overlapping 3 kb fragments. We found that forum termini very often correspond to regions of intercalary heterochromatin and regions of late replication in polytene chromosomes. We found that forum domains contain clusters of several or many genes. The largest forum domains correspond to the main clusters of homeotic genes inside BX-C and ANTP-C, cluster of histone genes and clusters of piRNAs. PRE/TRE and transcription factor binding sites often reside inside domains and do not overlap with forum domain termini. We also found that about 20% of forum domain termini correspond to small chromosomal regions where Ago1, Ago2, small RNAs and repressive chromatin structures are detected. Our results indicate that forum domains correspond to big multi-gene chromosomal units, some of which could be coordinately expressed. The data on the global mapping of forum domains revealed a strong correlation between fragmentation sites in chromosomes, particular sets of mobile elements and regions of intercalary heterochromatin. PMID:21247882
Kelemen, Linda E.; Lawrenson, Kate; Tyrer, Jonathan; Li, Qiyuan; M. Lee, Janet; Seo, Ji-Heui; Phelan, Catherine M.; Beesley, Jonathan; Chen, Xiaoqin; Spindler, Tassja J.; Aben, Katja K.H.; Anton-Culver, Hoda; Antonenkova, Natalia; Baker, Helen; Bandera, Elisa V.; Bean, Yukie; Beckmann, Matthias W.; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A.; Brooks-Wilson, Angela; Bruinsma, Fiona; Butzow, Ralf; Campbell, Ian G.; Carty, Karen; Chang-Claude, Jenny; Chen, Y. Ann; Chen, Zhihua; Cook, Linda S.; Cramer, Daniel W.; Cunningham, Julie M.; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A.; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas T.; Edwards, Robert P.; Eilber, Ursula; Ekici, Arif B.; Engelholm, Svend Aage; Fasching, Peter A.; Fridley, Brooke L.; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind; Goode, Ellen L.; Goodman, Marc T.; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hasmad, Hanis Nazihah; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A.T.; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus; Hosono, Satoyo; Iversen, Edwin S.; Jakubowska, Anna; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kellar, Melissa; Kelley, Joseph L.; Kiemeney, Lambertus A.; Krakstad, Camilla; Kjaer, Susanne K.; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D.; Lee, Alice W.; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A.; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F.A.G.; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R.; McNeish, Iain; Menon, Usha; Modugno, Francesmary; Moes-Sosnowska, Joanna; Moysich, Kirsten B.; Narod, Steven A.; Nedergaard, Lotte; Ness, Roberta B.; Nevanlinna, Heli; Azmi, Mat Adenan Noor; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Paul, James; Pearce, Celeste Leigh; Pejovic, Tanja; Pelttari, Liisa M.; Permuth-Wey, Jennifer; Pike, Malcolm C.; Poole, Elizabeth M.; Ramus, Susan J.; Risch, Harvey A.; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H.; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Schildkraut, Joellen M.; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Sucheston, Lara; Tangen, Ingvild L.; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J; Tworoger, Shelley S.; van Altena, Anne M.; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A.; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S.; Wicklund, Kristine G.; Wilkens, Lynne R.; Wlodzimierz, Sawicki; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna H.; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Sellers, Thomas A.; Freedman, Matthew L.; Chenevix-Trench, Georgia; Pharoah, Paul D.; Gayther, Simon A.; Berchuck, Andrew
Genome-wide association studies have identified several risk associations for ovarian carcinomas (OC) but not for mucinous ovarian carcinomas (MOC). Genotypes from OC cases and controls were imputed into the 1000 Genomes Project reference panel. Analysis of 1,644 MOC cases and 21,693 controls identified three novel risk associations: rs752590 at 2q13 (P = 3.3 × 10−8), rs711830 at 2q31.1 (P = 7.5 × 10−12) and rs688187 at 19q13.2 (P = 6.8 × 10−13). Expression Quantitative Trait Locus (eQTL) analysis in ovarian and colorectal tumors (which are histologically similar to MOC) identified significant eQTL associations for HOXD9 at 2q31.1 in ovarian (P = 4.95 × 10−4, FDR = 0.003) and colorectal (P = 0.01, FDR = 0.09) tumors, and for PAX8 at 2q13 in colorectal tumors (P = 0.03, FDR = 0.09). Chromosome conformation capture analysis identified interactions between the HOXD9 promoter and risk SNPs at 2q31.1. Overexpressing HOXD9 in MOC cells augmented the neoplastic phenotype. These findings provide the first evidence for MOC susceptibility variants and insights into the underlying biology of the disease. PMID:26075790
Lakshmanan, Vairavan; Bansal, Dhiru; Kulkarni, Jahnavi; Poduval, Deepak; Krishna, Srikar; Sasidharan, Vidyanand; Anand, Praveen; Seshasayee, Aswin; Palakodeti, Dasaradhi
In eukaryotes, 3′ untranslated regions (UTRs) play important roles in regulating posttranscriptional gene expression. The 3′UTR is defined by regulated cleavage/polyadenylation of the pre-mRNA. The advent of next-generation sequencing technology has now enabled us to identify these events on a genome-wide scale. In this study, we used poly(A)-position profiling by sequencing (3P-Seq) to capture all poly(A) sites across the genome of the freshwater planarian, Schmidtea mediterranea, an ideal model system for exploring the process of regeneration and stem cell function. We identified the 3′UTRs for ∼14,000 transcripts and thus improved the existing gene annotations. We found 97 transcripts, which are polyadenylated within an internal exon, resulting in the shrinking of the ORF and loss of a predicted protein domain. Around 40% of the transcripts in planaria were alternatively polyadenylated (ApA), resulting either in an altered 3′UTR or a change in coding sequence. We identified specific ApA transcript isoforms that were subjected to miRNA mediated gene regulation using degradome sequencing. In this study, we also confirmed a tissue-specific expression pattern for alternate polyadenylated transcripts. The insights from this study highlight the potential role of ApA in regulating the gene expression essential for planarian regeneration. PMID:27489207
Li, Zhenhui; Zheng, Ming; Abdalla, Bahareldin Ali; Zhang, Zhe; Xu, Zhenqiang; Ye, Qiao; Xu, Haiping; Luo, Wei; Nie, Qinghua; Zhang, Xiquan
In the poultry industry, aggressive behaviour is a large animal welfare issue all over the world. To date, little is known about the underlying genetics of the aggressive behaviour. Here, we performed a genome-wide association study (GWAS) to explore the genetic mechanism associated with aggressive behaviour in chickens. The GWAS results showed that a total of 33 SNPs were associated with aggressive behaviour traits (P < 4.6E-6). rs312463697 on chromosome 4 was significantly associated with aggression (P = 2.10905E-07), and it was in the intron region of the sortilin-related VPS10 domain containing receptor 2 (SORCS2) gene. In addition, biological function analysis of the nearest 26 genes around the significant SNPs was performed with Ingenuity Pathway Analysis. An interaction network contained 17 genes was obtained and SORCS2 was involved in this network, interacted with nerve growth factor (NGF), nerve growth factor receptor (NGFR), dopa decarboxylase (L-dopa) and dopamine. After knockdown of SORCS2, the mRNA levels of NGF, L-dopa and dopamine receptor genes DRD1, DRD2, DRD3 and DRD4 were significantly decreased (P < 0.05). In summary, our data indicated that SORCS2 might play an important role in chicken aggressive behaviour through the regulation of dopaminergic pathways and NGF. PMID:27485826
Dai, Hui; Zhao, Yang; Qian, Cheng; Cai, Min; Zhang, Ruyang; Chu, Minjie; Dai, Juncheng; Hu, Zhibin; Shen, Hongbing; Chen, Feng
Genome-wide association studies (GWAS) are popular for identifying genetic variants which are associated with disease risk. Many approaches have been proposed to test multiple single nucleotide polymorphisms (SNPs) in a region simultaneously which considering disadvantages of methods in single locus association analysis. Kernel machine based SNP set analysis is more powerful than single locus analysis, which borrows information from SNPs correlated with causal or tag SNPs. Four types of kernel machine functions and principal component based approach (PCA) were also compared. However, given the loss of power caused by low minor allele frequencies (MAF), we conducted an extension work on PCA and used a new method called weighted PCA (wPCA). Comparative analysis was performed for weighted principal component analysis (wPCA), logistic kernel machine based test (LKM) and principal component analysis (PCA) based on SNP set in the case of different minor allele frequencies (MAF) and linkage disequilibrium (LD) structures. We also applied the three methods to analyze two SNP sets extracted from a real GWAS dataset of non-small cell lung cancer in Han Chinese population. Simulation results show that when the MAF of the causal SNP is low, weighted principal component and weighted IBS are more powerful than PCA and other kernel machine functions at different LD structures and different numbers of causal SNPs. Application of the three methods to a real GWAS dataset indicates that wPCA and wIBS have better performance than the linear kernel, IBS kernel and PCA.
Kikuchi, Shinji; Bheemanahalli, Raju; Jagadish, Krishna S V; Kumagai, Etsushi; Masuya, Yusuke; Kuroda, Eiki; Raghavan, Chitra; Dingkuhn, Michael; Abe, Akira; Shimono, Hiroyuki
Phenotypic plasticity of plants in response to environmental changes is important for adapting to changing climate. Less attention has been paid to exploring the advantages of phenotypic plasticity in resource-rich environments to enhance the productivity of agricultural crops. Here, we examined genetic variation in phenotypic plasticity in indica rice (Oryza sativa L.) across two diverse panels: (i) a Phenomics of Rice Adaptation and Yield (PRAY) population comprising 301 accessions and (ii) a Multi-parent-Advanced-Generation-Inter-Cross (MAGIC) indica population comprising 151 accessions. Altered planting density was used as a proxy for elevated atmospheric CO2 response. Low planting density significantly increased panicle weight per plant compared with normal density, and the magnitude of the increase ranged from 1.10 to 2.78 times among accessions for the PRAY population and from 1.05 to 2.45 times for the MAGIC population. Genome-wide-association studies revealed three Environmental Responsiveness (ER) candidate alleles (qER1-3) that were associated with relative response of panicle weight to low density. Two of these alleles were tested in 13 genotypes to clarify their biomass responses during vegetative growth under elevated CO2 in Japan. Our study provides evidence for polymorphisms that control rice phenotypic plasticity in environments that are rich in resources such as light and CO2 .
Sailer, Anna; Nalls, Michael A.; Schulte, Claudia; Federoff, Monica; Price, T. Ryan; Lees, Andrew; Ross, Owen A.; Dickson, Dennis W.; Mok, Kin; Mencacci, Niccolo E.; Schottlaender, Lucia; Chelban, Viorica; Ling, Helen; O'Sullivan, Sean S.; Wood, Nicholas W.; Traynor, Bryan J.; Ferrucci, Luigi; Federoff, Howard J.; Mhyre, Timothy R.; Morris, Huw R.; Deuschl, Günther; Quinn, Niall; Widner, Hakan; Albanese, Alberto; Infante, Jon; Bhatia, Kailash P.; Poewe, Werner; Oertel, Wolfgang; Höglinger, Günter U.; Wüllner, Ullrich; Goldwurm, Stefano; Pellecchia, Maria Teresa; Ferreira, Joaquim; Tolosa, Eduardo; Bloem, Bastiaan R.; Rascol, Olivier; Meissner, Wassilios G.; Hardy, John A.; Revesz, Tamas; Holton, Janice L.; Gasser, Thomas; Wenning, Gregor K.; Singleton, Andrew B.
Objective: To identify genetic variants that play a role in the pathogenesis of multiple system atrophy (MSA), we undertook a genome-wide association study (GWAS). Methods: We performed a GWAS with >5 million genotyped and imputed single nucleotide polymorphisms (SNPs) in 918 patients with MSA of European ancestry and 3,864 controls. MSA cases were collected from North American and European centers, one third of which were neuropathologically confirmed. Results: We found no significant loci after stringent multiple testing correction. A number of regions emerged as potentially interesting for follow-up at p < 1 × 10−6, including SNPs in the genes FBXO47, ELOVL7, EDN1, and MAPT. Contrary to previous reports, we found no association of the genes SNCA and COQ2 with MSA. Conclusions: We present a GWAS in MSA. We have identified several potentially interesting gene loci, including the MAPT locus, whose significance will have to be evaluated in a larger sample set. Common genetic variation in SNCA and COQ2 does not seem to be associated with MSA. In the future, additional samples of well-characterized patients with MSA will need to be collected to perform a larger MSA GWAS, but this initial study forms the basis for these next steps. PMID:27629089
Haltaufderhyde, Kirk D.; Oancea, Elena
Because human epidermal melanocytes (HEMs) provide critical protection against skin cancer, sunburn, and photoaging, a genome-wide perspective of gene expression in these cells is vital to understanding human skin physiology. In this study we performed high throughput sequencing of HEMs to obtain a complete data set of transcript sizes, abundances, and splicing. As expected, we found that melanocyte specific genes that function in pigmentation were among the highest expressed genes. We analyzed receptor, ion channel and transcription factor gene families to get a better understanding of the cell signalling pathways used by melanocytes. We also performed a comparative transcriptomic analysis of lightly versus darkly pigmented HEMs and found 16 genes differentially expressed in the two pigmentation phenotypes; of those, only one putative melanosomal transporter (SLC45A2) has known function in pigmentation. In addition, we found 166 genes with splice isoforms expressed exclusively in one pigmentation phenotype, 17 of which are genes involved in signal transduction. Our melanocyte transcriptome study provides a comprehensive view and may help identify novel pigmentation genes and potential pharmacological targets. PMID:25451175
Tchurikov, Nickolai A; Kretova, Olga V; Sosin, Dmitri V; Zykov, Ivan A; Zhimulev, Igor F; Kravatsky, Yuri V
Forum domains are stretches of chromosomal DNA that are excised from eukaryotic chromosomes during their spontaneous non-random fragmentation. Most forum domains are 50-200 kb in length. We mapped forum domain termini using FISH on polytene chromosomes and we performed genome-wide mapping using a Drosophila melanogaster genomic tiling microarray consisting of overlapping 3 kb fragments. We found that forum termini very often correspond to regions of intercalary heterochromatin and regions of late replication in polytene chromosomes. We found that forum domains contain clusters of several or many genes. The largest forum domains correspond to the main clusters of homeotic genes inside BX-C and ANTP-C, cluster of histone genes and clusters of piRNAs. PRE/TRE and transcription factor binding sites often reside inside domains and do not overlap with forum domain termini. We also found that about 20% of forum domain termini correspond to small chromosomal regions where Ago1, Ago2, small RNAs and repressive chromatin structures are detected. Our results indicate that forum domains correspond to big multi-gene chromosomal units, some of which could be coordinately expressed. The data on the global mapping of forum domains revealed a strong correlation between fragmentation sites in chromosomes, particular sets of mobile elements and regions of intercalary heterochromatin.
Kim, Jinsil; Pitlick, Mitchell M.; Christine, Paul J.; Schaefer, Amanda R.; Saleme, Cesar; Comas, Belén; Cosentino, Viviana; Gadow, Enrique; Murray, Jeffrey C.
The amnion is a specialized tissue in contact with the amniotic fluid, which is in a constantly changing state. To investigate the importance of epigenetic events in this tissue in the physiology and pathophysiology of pregnancy, we performed genome-wide DNA methylation profiling of human amnion from term (with and without labor) and preterm deliveries. Using the Illumina Infinium HumanMethylation27 BeadChip, we identified genes exhibiting differential methylation associated with normal labor and preterm birth. Functional analysis of the differentially methylated genes revealed biologically relevant enriched gene sets. Bisulfite sequencing analysis of the promoter region of the oxytocin receptor (OXTR) gene detected two CpG dinucleotides showing significant methylation differences among the three groups of samples. Hypermethylation of the CpG island of the solute carrier family 30 member 3 (SLC30A3) gene in preterm amnion was confirmed by methylation-specific PCR. This work provides preliminary evidence that DNA methylation changes in the amnion may be at least partially involved in the physiological process of labor and the etiology of preterm birth and suggests that DNA methylation profiles, in combination with other biological data, may provide valuable insight into the mechanisms underlying normal and pathological pregnancies. PMID:23533356
Darcy, Diana; Atwal, Paldeep Singh; Angell, Cathy; Gadi, Inder; Wallerstein, Robert
We report on a 6-month-old girl with two apparent cell lines; one with trisomy 21, and the other with paternal genome-wide uniparental isodisomy (GWUPiD), identified using single nucleotide polymorphism (SNP) based microarray and microsatellite analysis of polymorphic loci. The patient has Beckwith-Wiedemann syndrome (BWS) due to paternal uniparental disomy (UPD) at chromosome location 11p15 (UPD 11p15), which was confirmed through methylation analysis. Hyperinsulinemic hypoglycemia is present, which is associated with paternal UPD 11p15.5; and she likely has medullary nephrocalcinosis, which is associated with paternal UPD 20, although this was not biochemically confirmed. Angelman syndrome (AS) analysis was negative but this testing is not completely informative; she has no specific features of AS. Clinical features of this patient include: dysmorphic features consistent with trisomy 21, tetralogy of Fallot, hemihypertrophy, swirled skin hyperpigmentation, hepatoblastoma, and Wilms tumor. Her karyotype is 47,XX,+21/46,XX, and microarray results suggest that the cell line with trisomy 21 is biparentally inherited and represents 40-50% of the genomic material in the tested specimen. The difference in the level of cytogenetically detected mosaicism versus the level of mosaicism observed via microarray analysis is likely caused by differences in the test methodologies. While a handful of cases of mosaic paternal GWUPiD have been reported, this patient is the only reported case that also involves trisomy 21. Other GWUPiD patients have presented with features associated with multiple imprinted regions, as does our patient.
Abrantes, Patrícia; Francisco, Vânia; Teixeira, Gilberto; Monteiro, Marta; Neves, João; Norte, Ana; Robalo Cordeiro, Carlos; Moura e Sá, João; Reis, Ernestina; Santos, Patrícia; Oliveira, Manuela; Sousa, Susana; Fradinho, Marta; Malheiro, Filipa; Negrão, Luís
Despite elevated incidence and recurrence rates for Primary Spontaneous Pneumothorax (PSP), little is known about its etiology, and the genetics of idiopathic PSP remains unexplored. To identify genetic variants contributing to sporadic PSP risk, we conducted the first PSP genome-wide association study. Two replicate pools of 92 Portuguese PSP cases and of 129 age- and sex-matched controls were allelotyped in triplicate on the Affymetrix Human SNP Array 6.0 arrays. Markers passing quality control were ranked by relative allele score difference between cases and controls (|RASdiff|), by a novel cluster method and by a combined Z-test. 101 single nucleotide polymorphisms (SNPs) were selected using these three approaches for technical validation by individual genotyping in the discovery dataset. 87 out of 94 successfully tested SNPs were nominally associated in the discovery dataset. Replication of the 87 technically validated SNPs was then carried out in an independent replication dataset of 100 Portuguese cases and 425 controls. The intergenic rs4733649 SNP in chromosome 8 (between LINC00824 and LINC00977) was associated with PSP in the discovery (P = 4.07E-03, ORC[95% CI] = 1.88[1.22–2.89]), replication (P = 1.50E-02, ORC[95% CI] = 1.50[1.08–2.09]) and combined datasets (P = 8.61E-05, ORC[95% CI] = 1.65[1.29–2.13]). This study identified for the first time one genetic risk factor for sporadic PSP, but future studies are warranted to further confirm this finding in other populations and uncover its functional role in PSP pathogenesis. PMID:27203581
Lai, Rose K; Chen, Yanwen; Guan, Xiaowei; Nousome, Darryl; Sharma, Charu; Canoll, Peter; Bruce, Jeffrey; Sloan, Andrew E; Cortes, Etty; Vonsattel, Jean-Paul; Su, Tao; Delgado-Cruzata, Lissette; Gurvich, Irina; Santella, Regina M; Ostrom, Quinn; Lee, Annette; Gregersen, Peter; Barnholtz-Sloan, Jill
Few studies had investigated genome-wide methylation in glioblastoma multiforme (GBM). Our goals were to study differential methylation across the genome in gene promoters using an array-based method, as well as repetitive elements using surrogate global methylation markers. The discovery sample set for this study consisted of 54 GBM from Columbia University and Case Western Reserve University, and 24 brain controls from the New York Brain Bank. We assembled a validation dataset using methylation data of 162 TCGA GBM and 140 brain controls from dbGAP. HumanMethylation27 Analysis Bead-Chips (Illumina) were used to interrogate 26,486 informative CpG sites in both the discovery and validation datasets. Global methylation levels were assessed by analysis of L1 retrotransposon (LINE1), 5 methyl-deoxycytidine (5m-dC) and 5 hydroxylmethyl-deoxycytidine (5hm-dC) in the discovery dataset. We validated a total of 1548 CpG sites (1307 genes) that were differentially methylated in GBM compared to controls. There were more than twice as many hypomethylated genes as hypermethylated ones. Both the discovery and validation datasets found 5 tumor methylation classes. Pathway analyses showed that the top ten pathways in hypomethylated genes were all related to functions of innate and acquired immunities. Among hypermethylated pathways, transcriptional regulatory network in embryonic stem cells was the most significant. In the study of global methylation markers, 5m-dC level was the best discriminant among methylation classes, whereas in survival analyses, high level of LINE1 methylation was an independent, favorable prognostic factor in the discovery dataset. Based on a pathway approach, hypermethylation in genes that control stem cell differentiation were significant, poor prognostic factors of overall survival in both the discovery and validation datasets. Approaches that targeted these methylated genes may be a future therapeutic goal.
Christopoulou, Marilena; Wo, Sebastian Reyes-Chin; Kozik, Alex; McHale, Leah K; Truco, Maria-Jose; Wroblewski, Tadeusz; Michelmore, Richard W
Genome-wide motif searches identified 1134 genes in the lettuce reference genome of cv. Salinas that are potentially involved in pathogen recognition, of which 385 were predicted to encode nucleotide binding-leucine rich repeat receptor (NLR) proteins. Using a maximum-likelihood approach, we grouped the NLRs into 25 multigene families and 17 singletons. Forty-one percent of these NLR-encoding genes belong to three families, the largest being RGC16 with 62 genes in cv. Salinas. The majority of NLR-encoding genes are located in five major resistance clusters (MRCs) on chromosomes 1, 2, 3, 4, and 8 and cosegregate with multiple disease resistance phenotypes. Most MRCs contain primarily members of a single NLR gene family but a few are more complex. MRC2 spans 73 Mb and contains 61 NLRs of six different gene families that cosegregate with nine disease resistance phenotypes. MRC3, which is 25 Mb, contains 22 RGC21 genes and colocates with Dm13. A library of 33 transgenic RNA interference tester stocks was generated for functional analysis of NLR-encoding genes that cosegregated with disease resistance phenotypes in each of the MRCs. Members of four NLR-encoding families, RGC1, RGC2, RGC21, and RGC12 were shown to be required for 16 disease resistance phenotypes in lettuce. The general composition of MRCs is conserved across different genotypes; however, the specific repertoire of NLR-encoding genes varied particularly of the rapidly evolving Type I genes. These tester stocks are valuable resources for future analyses of additional resistance phenotypes.
Yamada, Kazuo; Iwayama, Yoshimi; Hattori, Eiji; Iwamoto, Kazuya; Toyota, Tomoko; Ohnishi, Tetsuo; Ohba, Hisako; Maekawa, Motoko; Kato, Tadafumi; Yoshikawa, Takeo
Schizophrenia is a devastating neuropsychiatric disorder with genetically complex traits. Genetic variants should explain a considerable portion of the risk for schizophrenia, and genome-wide association study (GWAS) is a potentially powerful tool for identifying the risk variants that underlie the disease. Here, we report the results of a three-stage analysis of three independent cohorts consisting of a total of 2,535 samples from Japanese and Chinese populations for searching schizophrenia susceptibility genes using a GWAS approach. Firstly, we examined 115,770 single nucleotide polymorphisms (SNPs) in 120 patient-parents trio samples from Japanese schizophrenia pedigrees. In stage II, we evaluated 1,632 SNPs (1,159 SNPs of p<0.01 and 473 SNPs of p<0.05 that located in previously reported linkage regions). The second sample consisted of 1,012 case-control samples of Japanese origin. The most significant p value was obtained for the SNP in the ELAVL2 [(embryonic lethal, abnormal vision, Drosophila)-like 2] gene located on 9p21.3 (p = 0.00087). In stage III, we scrutinized the ELAVL2 gene by genotyping gene-centric tagSNPs in the third sample set of 293 family samples (1,163 individuals) of Chinese descent and the SNP in the gene showed a nominal association with schizophrenia in Chinese population (p = 0.026). The current data in Asian population would be helpful for deciphering ethnic diversity of schizophrenia etiology. PMID:21674006
Yuen, Ryan K C; Merico, Daniele; Cao, Hongzhi; Pellecchia, Giovanna; Alipanahi, Babak; Thiruvahindrapuram, Bhooma; Tong, Xin; Sun, Yuhui; Cao, Dandan; Zhang, Tao; Wu, Xueli; Jin, Xin; Zhou, Ze; Liu, Xiaomin; Nalpathamkalam, Thomas; Walker, Susan; Howe, Jennifer L.; Wang, Zhuozhi; MacDonald, Jeffrey R.; Chan, Ada; D’Abate, Lia; Deneault, Eric; Siu, Michelle T.; Tammimies, Kristiina; Uddin, Mohammed; Zarrei, Mehdi; Wang, Mingbang; Li, Yingrui; Wang, Jun; Wang, Jian; Yang, Huanming; Bookman, Matt; Bingham, Jonathan; Gross, Samuel S.; Loy, Dion; Pletcher, Mathew; Marshall, Christian R.; Anagnostou, Evdokia; Zwaigenbaum, Lonnie; Weksberg, Rosanna; Fernandez, Bridget A; Roberts, Wendy; Szatmari, Peter; Glazer, David; Frey, Brendan J.; Ring, Robert H.; Xu, Xun; Scherer, Stephen W.
De novo mutations (DNMs) are important in Autism Spectrum Disorder (ASD), but so far analyses have mainly been on the ~1.5% of the genome encoding genes. Here, we performed whole genome sequencing (WGS) of 200 ASD parent-child trios and characterized germline and somatic DNMs. We confirmed that the majority of germline DNMs (75.6%) originated from the father, and these increased significantly with paternal age only (p=4.2×10−10). However, when clustered DNMs (those within 20kb) were found in ASD, not only did they mostly originate from the mother (p=7.7×10−13), but they could also be found adjacent to de novo copy number variations (CNVs) where the mutation rate was significantly elevated (p=2.4×10−24). By comparing DNMs detected in controls, we found a significant enrichment of predicted damaging DNMs in ASD cases (p=8.0×10−9; OR=1.84), of which 15.6% (p=4.3×10−3) and 22.5% (p=7.0×10−5) were in the non-coding or genic non-coding, respectively. The non-coding elements most enriched for DNM were untranslated regions of genes, boundaries involved in exon-skipping and DNase I hypersensitive regions. Using microarrays and a novel outlier detection test, we also found aberrant methylation profiles in 2/185 (1.1%) of ASD cases. These same individuals carried independently identified DNMs in the ASD risk- and epigenetic- genes DNMT3A and ADNP. Our data begins to characterize different genome-wide DNMs, and highlight the contribution of non-coding variants, to the etiology of ASD. PMID:27525107
Verkouteren, Joris A. C.; Hofman, Albert; Uitterlinden, André G.; Kraft, Peter; Turman, Constance; Han, Jiali; Cho, Eunyoung; Murabito, Joanne M.; Levy, Daniel; Qureshi, Abrar A.; Nijsten, Tamar
There is strong evidence for a role of environmental risk factors involved in susceptibility to develop multiple keratinocyte cancers (mKCs), but whether genes are also involved in mKCs susceptibility has not been thoroughly investigated. We investigated whether single nucleotide polymorphisms (SNPs) are associated with susceptibility for mKCs. A genome-wide association study (GWAS) of 1,666 cases with mKCs and 1,950 cases with single KC (sKCs; controls) from Harvard cohorts (the Nurses' Health Study [NHS], NHS II, and the Health Professionals Follow-Up Study) and the Framingham Heart Study was carried-out using over 8 million SNPs (stage-1). We sought to replicate the most significant statistical associations (p-value≤ 5.5x10-6) in an independent cohort of 574 mKCs and 872 sKCs from the Rotterdam Study. In the discovery stage, 40 SNPs with suggestive associations (p-value ≤5.5x10-6) were identified, with eight independent SNPs tagging all 40 SNPs. The most significant SNP was located at chromosome 9 (rs7468390; p-value = 3.92x10-7). In stage-2, none of these SNPs replicated and only two of them were associated with mKCs in the same direction in the combined meta-analysis. We tested the associations for 19 previously reported basal cell carcinoma-related SNPs (candidate gene association analysis), and found that rs1805007 (MC1R locus) was significantly associated with risk of mKCs (p-value = 2.80x10-4). Although the suggestive SNPs with susceptibility for mKCs were not replicated, we found that previously identified BCC variants may also be associated with mKC, which the most significant association (rs1805007) located at the MC1R gene. PMID:28081215
Background Sleep is a highly conserved behavior, yet its duration and pattern vary extensively among species and between individuals within species. The genetic basis of natural variation in sleep remains unknown. Results We used the Drosophila Genetic Reference Panel (DGRP) to perform a genome-wide association (GWA) study of sleep in D. melanogaster. We identified candidate single nucleotide polymorphisms (SNPs) associated with differences in the mean as well as the environmental sensitivity of sleep traits; these SNPs typically had sex-specific or sex-biased effects, and were generally located in non-coding regions. The majority of SNPs (80.3%) affecting sleep were at low frequency and had moderately large effects. Additive models incorporating multiple SNPs explained as much as 55% of the genetic variance for sleep in males and females. Many of these loci are known to interact physically and/or genetically, enabling us to place them in candidate genetic networks. We confirmed the role of seven novel loci on sleep using insertional mutagenesis and RNA interference. Conclusions We identified many SNPs in novel loci that are potentially associated with natural variation in sleep, as well as SNPs within genes previously known to affect Drosophila sleep. Several of the candidate genes have human homologues that were identified in studies of human sleep, suggesting that genes affecting variation in sleep are conserved across species. Our discovery of genetic variants that influence environmental sensitivity to sleep may have a wider application to all GWA studies, because individuals with highly plastic genotypes will not have consistent phenotypes. PMID:23617951
Hughes, Austin L; Welch, Robert; Puri, Vinita; Matthews, Casey; Haque, Kashif; Chanock, Stephen J; Yeager, Meredith
Single-nucleotide polymorphism (SNP) arrays have become a popular technology for disease-association studies, but they also have potential for studying the genetic differentiation of human populations. Application of the Affymetrix GeneChip Human Mapping 500K Array Set to a population of 102 individuals representing the major ethnic groups in the United States (African, Asian, European, and Hispanic) revealed patterns of gene diversity and genetic distance that reflected population history. We analyzed allelic frequencies at 388,654 autosomal SNP sites that showed some variation in our study population and 10% or fewer missing values. Despite the small size (23-31 individuals) of each subpopulation, there were no fixed differences at any site between any two subpopulations. As expected from the African origin of modern humans, greater gene diversity was seen in Africans than in either Asians or Europeans, and the genetic distance between the Asian and the European populations was significantly lower than that between either of these two populations and Africans. Principal components analysis applied to a correlation matrix among individuals was able to separate completely the major continental groups of humans (Africans, Asians, and Europeans), while Hispanics overlapped all three of these groups. Genes containing two or more markers with extraordinarily high genetic distance between subpopulations were identified as candidate genes for health differences between subpopulations. The results show that, even with modest sample sizes, genome-wide SNP genotyping technologies have great promise for capturing signatures of gene frequency difference between human subpopulations, with applications in areas as diverse as forensics and the study of ethnic health disparities.
Buzdugan, Laura; Kalisch, Markus; Navarro, Arcadi; Schunk, Daniel; Fehr, Ernst; Bühlmann, Peter
Motivation: Although Genome Wide Association Studies (GWAS) genotype a very large number of single nucleotide polymorphisms (SNPs), the data are often analyzed one SNP at a time. The low predictive power of single SNPs, coupled with the high significance threshold needed to correct for multiple testing, greatly decreases the power of GWAS. Results: We propose a procedure in which all the SNPs are analyzed in a multiple generalized linear model, and we show its use for extremely high-dimensional datasets. Our method yields P-values for assessing significance of single SNPs or groups of SNPs while controlling for all other SNPs and the family wise error rate (FWER). Thus, our method tests whether or not a SNP carries any additional information about the phenotype beyond that available by all the other SNPs. This rules out spurious correlations between phenotypes and SNPs that can arise from marginal methods because the ‘spuriously correlated’ SNP merely happens to be correlated with the ‘truly causal’ SNP. In addition, the method offers a data driven approach to identifying and refining groups of SNPs that jointly contain informative signals about the phenotype. We demonstrate the value of our method by applying it to the seven diseases analyzed by the Wellcome Trust Case Control Consortium (WTCCC). We show, in particular, that our method is also capable of finding significant SNPs that were not identified in the original WTCCC study, but were replicated in other independent studies. Availability and implementation: Reproducibility of our research is supported by the open-source Bioconductor package hierGWAS. Contact: email@example.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153677
Legarra, A; Misztal, I
Genome-wide genetic evaluation might involve the computation of BLUP-like estimations, potentially including thousands of covariates (i.e., single-nucleotide polymorphism markers) for each record. This implies dense Henderson's mixed-model equations and considerable computing resources in time and storage, even for a few thousand records. Possible computing options include the type of storage and the solving algorithm. This work evaluated several computing options, including half-stored Cholesky decomposition, Gauss-Seidel, and 3 matrix-free strategies: Gauss-Seidel, Gauss-Seidel with residuals update, and preconditioned conjugate gradients. Matrix-free Gauss-Seidel with residuals update adjusts the residuals after computing the solution for each effect. This avoids adjusting the left-hand side of the equations by all other effects at every step of the algorithm and saves considerable computing time. Any Gauss-Seidel algorithm can easily be extended for variance component estimation by Markov chain-Monte Carlo. Let m and n be the number of records and markers, respectively. Computing time for Cholesky decomposition is proportional to n3. Computing times per round are proportional to mn2 in matrix-free Gauss-Seidel, to n2 for half-stored Gauss-Seidel, and to n and m for the rest of the algorithms. Algorithms were tested on a real mouse data set, which included 1,928 records and 10,946 single-nucleotide polymorphism markers. Computing times were in the order of a few minutes for Gauss-Seidel with residuals update and preconditioned conjugate gradients, more than 1 h for half-stored Gauss-Seidel, 2 h for Cholesky decomposition, and 4 d for matrix-free Gauss-Seidel. Preconditioned conjugate gradients was the fastest. Gauss-Seidel with residuals update would be the method of choice for variance component estimation as well as solving.
Gaouar, S B S; Lafri, M; Djaout, A; El-Bouyahiaoui, R; Bouri, A; Bouchatal, A; Maftah, A; Ciani, E; Da Silva, A B
Algeria represents a reservoir of genetic diversity with local sheep breeds adapted to a large range of environments and showing specific features necessary to deal with harsh conditions. This remarkable diversity results from the traditional management of dryland by pastoralists over centuries. Most of these breeds are poorly productive, and the economic pressure leads farmers to realize anarchic cross-breeding (that is, not carried out in the framework of selection plans) with the hope to increase animal's conformation. In this study, eight of the nine local Algerian sheep breeds (D'men, Hamra, Ouled-Djellal, Rembi, Sidaoun, Tazegzawt, Berber and Barbarine) were investigated for the first time by genome-wide single-nucleotide polymorphism genotyping. At an international scale, Algerian sheep occupied an original position shaped by relations with African and European (particularly Italian) breeds. The strong genetic proximity with Caribbean and Brazilian breeds confirmed that the genetic make-up of these American breeds was largely influenced by the Atlantic slave trade. At a national scale, an alarming genetic dilution of the Berber (a primitive breed) and the Rembi was observed, as a consequence of uncontrolled mating practices with Ouled-Djellal. A similar, though less pronounced, phenomenon was also detected for the Barbarine, another ancestral breed. Genetic originality appeared to be better preserved in Tazegzawt, Hamra, D'men and Sidaoun. These breeds should be given high priority in the establishment of conservation plans to halt their progressive loss. For Berber and Barbarine that also occur in the bordering neighbor countries, urgent concerted transnational actions are needed.
Bartels, Meike; Saviouk, Viatcheslav; de Moor, Marleen H M; Willemsen, Gonneke; van Beijsterveldt, Toos C E M; Hottenga, Jouke-Jan; de Geus, Eco J C; Boomsma, Dorret I
Causes of individual differences in happiness, as assessed with the Subjective Happiness Scale, are investigated in a large of sample twins and siblings from the Netherlands Twin Register. Over 12,000 twins and siblings, average age 24.7 years (range 12 to 88), took part in the study. A genetic model with an age by sex design was fitted to the data with structural equation modeling in Mx. The heritability of happiness was estimated at 22% for males and 41% in females. No effect of age was observed. To identify the genomic regions contributing to this heritability, a genome-wide linkage study for happiness was conducted in sibling pairs. A subsample of 1157 offspring from 441 families was genotyped with an average of 371 micro-satellite markers per individual. Phenotype and genotype data were analyzed in MERLIN with multipoint variance component linkage analysis and age and sex as covariates. A linkage signal (logarithm of odds score 2.73, empirical p value 0.095) was obtained at the end of the long arm of chromosome 19 for marker D19S254 at 110 cM. A second suggestive linkage peak was found at the short arm of chromosome 1 (LOD of 2.37) at 153 cM, marker D1S534 (empirical p value of .209). These two regions of interest are not overlapping with the regions found for contrasting phenotypes (such as depression, which is negatively associated with happiness). Further linkage and future association studies are warranted.
Background Condensins are multi-subunit protein complexes that are essential for chromosome condensation during mitosis and meiosis, and play key roles in transcription regulation during interphase. Metazoans contain two condensins, I and II, which perform different functions and localize to different chromosomal regions. Caenorhabditis elegans contains a third condensin, IDC, that is targeted to and represses transcription of the X chromosome for dosage compensation. Results To understand condensin binding and function, we performed ChIP-seq analysis of C. elegans condensins in mixed developmental stage embryos, which contain predominantly interphase nuclei. Condensins bind to a subset of active promoters, tRNA genes and putative enhancers. Expression analysis in kle-2-mutant larvae suggests that the primary effect of condensin II on transcription is repression. A DNA sequence motif, GCGC, is enriched at condensin II binding sites. A sequence extension of this core motif, AGGG, creates the condensin IDC motif. In addition to differences in recruitment that result in X-enrichment of condensin IDC and condensin II binding to all chromosomes, we provide evidence for a shared recruitment mechanism, as condensin IDC recruiter SDC-2 also recruits condensin II to the condensin IDC recruitment sites on the X. In addition, we found that condensin sites overlap extensively with the cohesin loader SCC-2, and that SDC-2 also recruits SCC-2 to the condensin IDC recruitment sites. Conclusions Our results provide the first genome-wide view of metazoan condensin II binding in interphase, define putative recruitment motifs, and illustrate shared loading mechanisms for condensin IDC and condensin II. PMID:24125077
Background Even before having its genome sequence published in 2004, Kluyveromyces lactis had long been considered a model organism for studies in genetics and physiology. Research on Kluyveromyces lactis is quite advanced and this yeast species is one of the few with which it is possible to perform formal genetic analysis. Nevertheless, until now, no complete metabolic functional annotation has been performed to the proteins encoded in the Kluyveromyces lactis genome. Results In this work, a new metabolic genome-wide functional re-annotation of the proteins encoded in the Kluyveromyces lactis genome was performed, resulting in the annotation of 1759 genes with metabolic functions, and the development of a methodology supported by merlin (software developed in-house). The new annotation includes novelties, such as the assignment of transporter superfamily numbers to genes identified as transporter proteins. Thus, the genes annotated with metabolic functions could be exclusively enzymatic (1410 genes), transporter proteins encoding genes (301 genes) or have both metabolic activities (48 genes). The new annotation produced by this work largely surpassed the Kluyveromyces lactis currently available annotations. A comparison with KEGG’s annotation revealed a match with 844 (~90%) of the genes annotated by KEGG, while adding 850 new gene annotations. Moreover, there are 32 genes with annotations different from KEGG. Conclusions The methodology developed throughout this work can be used to re-annotate any yeast or, with a little tweak of the reference organism, the proteins encoded in any sequenced genome. The new annotation provided by this study offers basic knowledge which might be useful for the scientific community working on this model yeast, because new functions have been identified for the so-called metabolic genes. Furthermore, it served as the basis for the reconstruction of a compartmentalized, genome-scale metabolic model of Kluyveromyces lactis, which is
Ferfouri, F; Boitrelle, F; Ghout, I; Albert, M; Molina Gomes, D; Wainer, R; Bailly, M; Selva, J; Vialard, F
The objective of this study was to assess genome-wide DNA methylation in testicular tissue from azoospermic patients. A total of 94 azoospermic patients were recruited and classified into three groups: 29 patients presented obstructive azoospermia (OA), 26 displayed non-obstructive azoospermia (NOA) and successful retrieval of spermatozoa by testicular sperm extraction (TESE+) and 39 displayed NOA and failure to retrieve spermatozoa by TESE (TESE-). An Illumina Infinium Human Methylation27 BeadChip DNA methylation array was used to establish a testicular DNA methylation pattern for each type of azoospermic patient. The OA and NOA groups were compared in terms of the relative M-value (the log2 ratio between methylated and non-methylated probe intensities) for each CpG site. We observed significantly different DNA methylation profiles for the NOA and OA groups, with differences at over 9000 of the 27 578 CpG sites; 212 CpG sites had a relative M-value >3. The results highlighted 14 testis-specific genes. Patient clustering with respect to these 212 CpG sites corresponded closely to the clinical classification. The DNA methylation patterns showed that in the NOA group, 78 of the 212 CpG sites were hypomethylated and 134 were hypermethylated (relative to the OA group). On the basis of these DNA methylation profiles, azoospermic patients could be classified as OA or NOA by considering the 212 CpG sites with the greatest methylation differences. Furthermore, we identified genes that may provide insight into the mechanism of idiopathic NOA.
Arend, Béatrice; Sunnen, Patrick; Fixmer, Pierre; Sujbert, Monika
This paper discusses theoretical and methodological issues arising from a video-based research design and the emergent tool "Joint Screen'"when grasping joint activity. We share our reflections regarding the combined reading of four synchronised camera perspectives combined in one screen. By these means we reconstruct and analyse…
Background DNA methylation (DNAm) has important regulatory roles in many biological processes and diseases. It is the only epigenetic mark with a clear mechanism of mitotic inheritance and the only one easily available on a genome scale. Aberrant cytosine-phosphate-guanine (CpG) methylation has been discussed in the context of disease aetiology, especially cancer. CpG hypermethylation of promoter regions is often associated with silencing of tumour suppressor genes and hypomethylation with activation of oncogenes. Supervised principal component analysis (SPCA) is a popular machine learning method. However, in a recent application to phenotype prediction from DNAm data SPCA was inferior to the specific method EVORA. Results We present Model-Selection-SPCA (MS-SPCA), an enhanced version of SPCA. MS-SPCA applies several models that perform well in the training data to the test data and selects the very best models for final prediction based on parameters of the test data. We have applied MS-SPCA for phenotype prediction from genome-wide DNAm data. CpGs used for prediction are selected based on the quantification of three features of their methylation (average methylation difference, methylation variation difference and methylation-age-correlation). We analysed four independent case–control datasets that correspond to different stages of cervical cancer: (i) cases currently cytologically normal, but will later develop neoplastic transformations, (ii, iii) cases showing neoplastic transformations and (iv) cases with confirmed cancer. The first dataset was split into several smaller case–control datasets (samples either Human Papilloma Virus (HPV) positive or negative). We demonstrate that cytology normal HPV+ and HPV- samples contain DNAm patterns which are associated with later neoplastic transformations. We present evidence that DNAm patterns exist in cytology normal HPV- samples that (i) predispose to neoplastic transformations after HPV infection and (ii
Panagiotou, Orestis A; Travis, Ruth C; Campa, Daniele; Berndt, Sonja I.; Lindstrom, Sara; Kraft, Peter; Schumacher, Fredrick R.; Siddiq, Afshan; Papatheodorou, Stefania I.; Stanford, Janet L.; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie J.; Diver, W. Ryan; Gapstur, Susan M.; Stevens, Victoria L.; Boeing, Heiner; Bueno-de-Mesquita, H. Bas; Gurrea, Aurelio Barricarte; Kaaks, Rudolf; Khaw, Kay-Tee; Krogh, Vittorio; Overvad, Kim; Riboli, Elio; Trichopoulos, Dimitrios; Giovannucci, Edward; Stampfer, Meir; Haiman, Christopher; Henderson, Brian; Le Marchand, Loic; Gaziano, J. Michael; Hunter, DavidJ.; Koutros, Stella; Yeager, Meredith; Hoover, Robert N.; Chanock, Stephen J.; Wacholder, Sholom; Key, Timothy J.; Tsilidis, Konstantinos K
Background No single-nucleotide polymorphisms (SNPs) specific for aggressive prostate cancer have been identified in genome-wide association studies (GWAS). Objective To test if SNPs associated with other traits may also affect the risk of aggressive prostate cancer. Design, setting, and participants SNPs implicated in any phenotype other than prostate cancer (p ≤ 10−7) were identified through the catalog of published GWAS and tested in 2891 aggressive prostate cancer cases and 4592 controls from the Breast and Prostate Cancer Cohort Consortium (BPC3). The 40 most significant SNPs were followed up in 4872 aggressive prostate cancer cases and 24 534 controls from the Prostate Cancer Association Group to Investigate Cancer Associated Alterations in the Genome (PRACTICAL) consortium. Outcome measurements and statistical analysis Odds ratios (ORs) and 95% confidence intervals (CIs) for aggressive prostate cancer were estimated. Results and limitations A total of 4666 SNPs were evaluated by the BPC3. Two signals were seen in regions already reported for prostate cancer risk. rs7014346 at 8q24.21 was marginally associated with aggressive prostate cancer in the BPC3 trial (p = 1.6 × 10-6), whereas after meta-analysis by PRACTICAL the summary OR was 1.21 (95%CI 1.16–1.27; p = 3.22 × 10−18). rs9900242 at 17q24.3 was also marginally associated with aggressive disease in the meta-analysis (OR 0.90, 95% CI 0.86–0.94; p = 2.5 × 10−6). Neither of these SNPs remained statistically significant when conditioning on correlated known prostate cancer SNPs. The meta-analysis by BPC3 and PRACTICAL identified a third promising signal, marked by rs16844874 at 2q34, independent of known prostate cancer loci (OR 1.12,95% CI 1.06–1.19; p = 4.67 × 10−5); it has been shown that SNPs correlated with this signal affect glycine concentrations. The main limitation is the heterogeneity in the definition of aggressive prostate cancer between BPC3 and PRACTICAL. Conclusions We did
Bhatti, Parveen; Zhang, Yuzheng; Song, Xiaoling; Makar, Karen W; Sather, Cassandra L; Kelsey, Karl T; Houseman, E Andres; Wang, Pei
The negative health effects of shift work, including carcinogenesis, may be mediated by changes in DNA methylation, particularly in the circadian genes. Using the Infinium HumanMethylation450 Bead Array (Illumina, San Diego, CA), we compared genome-wide methylation between 65 actively working dayshift workers and 59 actively working nightshift workers in the healthcare industry. A total of 473 800 loci, including 391 loci across the 12 core circadian genes, were analyzed to identify methylation markers associated with shift work status using linear regression models adjusted for gender, age, body mass index, race, smoking status and leukocyte cell profile as measured by flow cytometry. Analyses at the level of gene, CpG island and gene region were also conducted. To account for multiple comparisons, we controlled the false discovery rate (FDR ≤0.05). Significant differences between nightshift and dayshift workers were found at 16 135 of 473 800 loci, across 3769 of 20 164 genes, across 7173 of 22 721 CpG islands and across 5508 of 51 843 gene regions. For each significant loci, gene, CpG island or gene region, average methylation was consistently found to be decreased among nightshift workers compared to dayshift workers. Twenty-one loci located in the circadian genes were also found to be significantly hypomethylated among nightshift workers. The largest differences were observed for three loci located in the gene body of PER3. A total of nine significant loci were found in the CSNK1E gene, most of which were located in a CpG island and near the transcription start site of the gene. Methylation changes in these circadian genes may lead to altered expression of these genes which has been associated with cancer in previous studies. Gene ontology enrichment analysis revealed that among the significantly hypomethylated genes, processes related to host defense and immunity were represented. Our results indicate that the health effects of shift work may be
Patenge, Nadja; Pappesch, Roberto; Khani, Afsaneh; Kreikemeyer, Bernd
Streptococci represent a diverse group of Gram-positive bacteria, which colonize a wide range of hosts among animals and humans. Streptococcal species occur as commensal as well as pathogenic organisms. Many of the pathogenic species can cause severe, invasive infections in their hosts leading to a high morbidity and mortality. The consequence is a tremendous suffering on the part of men and livestock besides the significant financial burden in the agricultural and healthcare sectors. An environmentally stimulated and tightly controlled expression of virulence factor genes is of fundamental importance for streptococcal pathogenicity. Bacterial small non-coding RNAs (sRNAs) modulate the expression of genes involved in stress response, sugar metabolism, surface composition, and other properties that are related to bacterial virulence. Even though the regulatory character is shared by this class of RNAs, variation on the molecular level results in a high diversity of functional mechanisms. The knowledge about the role of sRNAs in streptococci is still limited, but in recent years, genome-wide screens for sRNAs have been conducted in an increasing number of species. Bioinformatics prediction approaches have been employed as well as expression analyses by classical array techniques or next generation sequencing. This review will give an overview of whole genome screens for sRNAs in streptococci with a focus on describing the different methods and comparing their outcome considering sRNA conservation among species, functional similarities, and relevance for streptococcal infection. PMID:26042151
Sabrautzki, Sibylle; Rubio-Aliaga, Isabel; Hans, Wolfgang; Fuchs, Helmut; Rathkolb, Birgit; Calzada-Wack, Julia; Cohrs, Christian M; Klaften, Matthias; Seedorf, Hartwig; Eck, Sebastian; Benet-Pagès, Ana; Favor, Jack; Esposito, Irene; Strom, Tim M; Wolf, Eckhard; Lorenz-Depiereux, Bettina; Hrabě de Angelis, Martin
Metabolic bone disorders arise as primary diseases or may be secondary due to a multitude of organ malfunctions. Animal models are required to understand the molecular mechanisms responsible for the imbalances of bone metabolism in disturbed bone mineralization diseases. Here we present the isolation of mutant mouse models for metabolic bone diseases by phenotyping blood parameters that target bone turnover within the large-scale genome-wide Munich ENU Mutagenesis Project. A screening panel of three clinical parameters, also commonly used as biochemical markers in patients with metabolic bone diseases, was chosen. Total alkaline phosphatase activity and total calcium and inorganic phosphate levels in plasma samples of F1 offspring produced from ENU-mutagenized C3HeB/FeJ male mice were measured. Screening of 9,540 mice led to the identification of 257 phenodeviants of which 190 were tested by genetic confirmation crosses. Seventy-one new dominant mutant lines showing alterations of at least one of the biochemical parameters of interest were confirmed. Fifteen mutations among three genes (Phex, Casr, and Alpl) have been identified by positional-candidate gene approaches and one mutation of the Asgr1 gene, which was identified by next-generation sequencing. All new mutant mouse lines are offered as a resource for the scientific community.
Neale, Benjamin M.; Medland, Sarah E.; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V.; Nguyen, Thuy Trang; Schafer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J. L.; Langely, Kate; O'Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P.; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D.; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan
Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association studies (GWAS) have not yielded significant results, we conducted a meta-analysis of…
Neale, Benjamin M.; Medland, Sarah; Ripke, Stephan; Anney, Richard J. L.; Asherson, Philip; Buitelaar, Jan; Franke, Barbara; Gill, Michael; Kent, Lindsey; Holmans, Peter; Middleton, Frank; Thapar, Anita; Lesch, Klaus-Peter; Faraone, Stephen V.; Daly, Mark; Nguyen, Thuy Trang; Schafer, Helmut; Steinhausen, Hans-Christoph; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Warnke, Andreas; Walitza, Susanne; Freitag, Christine; Meyer, Jobst; Palmason, Haukur; Rothenberger, Aribert; Hawi, Ziarih; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph
Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. Thus additional genome-wide association studies (GWAS) are needed. Method: We used case-control analyses of 896 cases…
Mick, Eric; Todorov, Alexandre; Smalley, Susan; Hu, Xiaolan; Loo, Sandra; Todd, Richard D.; Biederman, Joseph; Byrne, Deirdre; Dechairo, Bryan; Guiney, Allan; McCracken, James; McGough, James; Nelson, Stanley F.; Reiersen, Angela M.; Wilens, Timothy E.; Wozniak, Janet; Neale, Benjamin M.; Faraone, Stephen V.
Objective: Genes likely play a substantial role in the etiology of attention-deficit/hyperactivity disorder (ADHD). However, the genetic architecture of the disorder is unknown, and prior genome-wide association studies (GWAS) have not identified a genome-wide significant association. We have conducted a third, independent, multisite GWAS of…
Trop, I.; Lalonde, L.; Mayrand, M.H.; David, J.; Larouche, N.; Provencher, D.
Background Women with a predisposition for breast cancer require a tailored screening program for early cancer detection. We evaluated the performance of mammography (mg), ultrasonography (us), and magnetic resonance imaging (mri) screening in these women. Patients and Methods In asymptomatic women either confirmed as BRCA1/2 carriers, or having a greater than 30% probability of being so as estimated by brcapro [Berry D, Parmigiani G. Duke spore (Specialized Program of Research Excellence) in Breast Cancer. 1999], we conducted a prospective comparative trial consisting of annual mri and mg, and biannual us and clinical breast examination. All evaluations were done within 30 days of one another. For each screening round, imaging tests were independently interpreted by three radiologists. Results The study enrolled 184 women, and 387 screening rounds were performed, detecting 12 cancers (9 infiltrating, 3 in situ), for an overall cancer yield of 6.5%. At diagnosis, 7 infiltrating cancers were smaller than 2 cm (T1); only 1 woman presented with axillary nodal metastases. All tumours were negative for the human epidermal growth factor receptor 2. Of the 12 cancers, mri detected 10, and mg, 7; us did not identify any additional cancers. The overall recall rate after mri was 21.8%, as compared with 11.4% for us and 16.1% for mg. Recall rates declined with successive screening rounds. In total, 45 biopsies were performed: 21 as a result of an us abnormality; 17, because of an mri lesion; and 7, because of a mg anomaly. Interpretation In high-risk women, mri offers the best sensitivity for breast cancer screening. The combination of yearly mri and mg reached a negative predictive value of 100%. The recall rate is greatest with mri, but declines for all modalities with successive screening rounds. PMID:20567624
Rucker, James J.H.; Breen, Gerome; Pinto, Dalila; Pedroso, Inti; Lewis, Cathryn M.; Cohen-Woods, Sarah; Uher, Rudolf; Schosser, Alexandra; Rivera, Margarita; Aitchison, Katherine J.; Craddock, Nick; Owen, Michael J.; Jones, Lisa; Jones, Ian; Korszun, Ania; Muglia, Pierandrea; Barnes, Michael R.; Preisig, Martin; Mors, Ole; Gill, Mike; Maier, Wolfgang; Rice, John; Rietschel, Marcella; Holsboer, Florian; Farmer, Anne E.; Craig, Ian W.; Scherer, Stephen W.; McGuffin, Peter
Large, rare copy number variants (CNV) have been implicated in a variety of psychiatric disorders, but the role of CNVs in recurrent depression is unclear. We performed a genome-wide analysis of large, rare CNVs in 3,106 cases of recurrent depression, 459 controls screened for lifetime-absence of psychiatric disorder and 5,619 unscreened controls from phase 2 of the Wellcome Trust Case Control Consortium (WTCCC2). We compared the frequency of cases with CNVs against the frequency observed in each control group, analysing CNVs over the whole genome, genic, intergenic, intronic and exonic regions. We found that deletion CNVs were associated with recurrent depression while duplications were not. The effect was significant when comparing cases to WTCCC2 controls (p=7.7×10−6, OR =1.25 (95% CI 1.13 - 1.37)) and to screened controls (p=5.6×10−4, OR=1.52 (95% CI 1.20 - 1.93). Further analysis showed that CNVs deleting protein coding regions were largely responsible for the association. Within an analysis of regions previously implicated in schizophrenia, we found an overall enrichment of CNVs in our cases when compared to screened controls (p=0.019). We observe an ordered increase of samples with deletion CNVs, with the lowest proportion seen in screened controls, the next highest in unscreened controls and the highest in cases. This may suggest that the absence of deletion CNVs, especially in genes, is associated with resilience to recurrent depression. PMID:22042228
Jiang, Long; Liu, Lu; Cheng, Yuyan; Lin, Yan; Shen, Changbing; Zhu, Caihong; Yang, Sen; Yin, Xianyong; Zhang, Xuejun
Missing heritability is a common problem in genome-wide association studies in complex diseases/traits. To quantify the unbiased heritability estimate, we applied the phenotype correlation-genotype correlation regression in psoriasis genome-wide association data in Han Chinese which comprises 1139 cases and 1132 controls. We estimated that 45.7% heritability of psoriasis in Han Chinese were captured by common variants (s.e.=12.5%), which reinforced that the majority of psoriasis heritability can be covered by common variants in genome-wide association data (68.2%). The results provided evidence that the heritability covered by psoriasis genome-wide genotyping data was probably underestimated in previous restricted maximum likelihood method. Our study highlights the broad role of common variants in the etiology of psoriasis and sheds light on the possibility to identify more common variants of small effect by increasing the sample size in psoriasis genome-wide association studies.
Background Subtilisin/kexin-like proprotein convertase (PCSK) enzymes have important regulatory function in a wide variety of biological processes. PCSKs proteolytically process at a target sequence that contains basic amino acids arginine and lysine, which results in functional maturation of the target protein. In vitro assays have showed significant biochemical redundancy between the seven family members, but the phenotypes of PCSK deficient mice and patients carrying an inactive PCSK allele argue for a specific biological function. Modeling the structures of individual PCSK enzymes has offered little insights into the specificity determinants. However, previous studies have shown that there can be a coordinated expression between a PCSK and its target molecule. Here, we have surveyed the putative PCSK target proteins using genome-wide expression correlation analysis and cleavage site prediction algorithms. Results We first performed a gene expression correlation analysis over the whole genome for all PCSK enzymes. PCSKs were found to cluster differently based on the strength of correlations. The screen for putative PCSK target proteins showed a significant enrichment (p-values from 1.2e-4 to < 1.0e-10) of putative targets among the most positively correlating genes for most PCSKs. Interestingly, there was no enrichment in putative targets among the genes that correlated positively with the biologically redundant PCSK7, whereas PCSK5 showed an inverse correlation. PCSKs also showed a highly variable degree of shared target genes that were identified by expression correlation and cleavage site prediction. Multiple alignments were used to evaluate the putative targets to pinpoint the important residues for the substrate recognition. Finally, we validated our approach and identified biochemically PAPPA1 and ADAMTS6 as novel targets for FURIN proteolytic activity. Conclusions Most PCSK enzymes display strong positive expression correlation with predicted target
Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Canine Cohort, Canine Intelligence Assessment Regimen, Genome-Wide Single Nucleotide Polymorphism (SNP) Typing, and Unsupervised Classification Algorithm for Genome-Wide Association Data Analysis
were down-selected and successfully genotyped for whole genome (WG) single nucleotide polymorphism (SNP) markers by means of the Affymetrix Canine...SUBJECT TERMS Military working dog genome-wide association study genetic marker intelligence... marker , intelligence, Canine Intelligence Testing Protocol, classification technique, clustering analysis Technical Report: September 2011 2
Chiu, Yen-Feng; Liu, Su-Yun; Tsai, Ya-Yu
We conducted genome-wide linkage scans using both microsatellite and single-nucleotide polymorphism (SNP) markers. Regions showing the strongest evidence of linkage to alcoholism susceptibility genes were identified. Haplotype analyses using a sliding-window approach for SNPs in these regions were performed. In addition, we performed a genome-wide association scan using SNP data. SNPs in these regions with evidence of association (P
Black, James R M; Clark, Simon J
In recent years, genome-wide association studies (GWAS), which are able to analyze the contribution to disease of genetic variations that are common within a population, have attracted considerable investment. Despite identifying genetic variants for many conditions, they have been criticized for yielding data with minimal clinical utility. However, in this regard, age-related macular degeneration (AMD), the most common form of blindness in the Western world, is a striking exception. Through GWAS, common genetic variants at a number of loci have been discovered. Two loci in particular, including genes of the complement cascade on chromosome 1 and the ARMS2/HTRA1 genes on chromosome 10, have been shown to convey significantly increased susceptibility to developing AMD. Today, although it is possible to screen individuals for a genetic predisposition to the disease, effective interventional strategies for those at risk of developing AMD are scarce. Ongoing research in this area is nonetheless promising. After providing brief overviews of AMD and common disease genetics, we outline the main recent advances in the understanding of AMD, particularly those made through GWAS. Finally, the true merit of these findings and their current and potential translational value is examined.Genet Med 18 4, 283-289.
Black, James R. M.; Clark, Simon J.
In recent years, genome-wide association studies (GWAS), which are able to analyze the contribution to disease of genetic variations that are common within a population, have attracted considerable investment. Despite identifying genetic variants for many conditions, they have been criticized for yielding data with minimal clinical utility. However, in this regard, age-related macular degeneration (AMD), the most common form of blindness in the Western world, is a striking exception. Through GWAS, common genetic variants at a number of loci have been discovered. Two loci in particular, including genes of the complement cascade on chromosome 1 and the ARMS2/HTRA1 genes on chromosome 10, have been shown to convey significantly increased susceptibility to developing AMD. Today, although it is possible to screen individuals for a genetic predisposition to the disease, effective interventional strategies for those at risk of developing AMD are scarce. Ongoing research in this area is nonetheless promising. After providing brief overviews of AMD and common disease genetics, we outline the main recent advances in the understanding of AMD, particularly those made through GWAS. Finally, the true merit of these findings and their current and potential translational value is examined. Genet Med 18 4, 283–289. PMID:26020418
Saritas-Yildirim, Banu; Pliner, Hannah A; Ochoa, Angelica; Silva, Elena M
Protein degradation via the multistep ubiquitin/26S proteasome pathway is a rapid way to alter the protein profile and drive cell processes and developmental changes. Many key regulators of embryonic development are targeted for degradation by E3 ubiquitin ligases. The most studied family of E3 ubiquitin ligases is the SCF ubiquitin ligases, which use F-box adaptor proteins to recognize and recruit target proteins. Here, we used a bioinformatics screen and phylogenetic analysis to identify and annotate the family of F-box proteins in the Xenopus tropicalis genome. To shed light on the function of the F-box proteins, we analyzed expression of F-box genes during early stages of Xenopus development. Many F-box genes are broadly expressed with expression domains localized to diverse tissues including brain, spinal cord, eye, neural crest derivatives, somites, kidneys, and heart. All together, our genome-wide identification and expression profiling of the Xenopus F-box family of proteins provide a foundation for future research aimed to identify the precise role of F-box dependent E3 ubiquitin ligases and their targets in the regulatory circuits of development.
Donahue, Sarah L; Lin, Qing; Cao, Shang; Ruley, H Earl
Widespread losses of heterozygosity (LOH) in human cancer have been thought to result from chromosomal instability caused by mutations affecting DNA repair/genome maintenance. However, the origin of LOH in most tumors is unknown. The present study examined the ability of carcinogenic agents to induce LOH at 53 sites throughout the genome of normal diploid mouse ES cells. Brief exposures to nontoxic levels of methylnitrosourea, diepoxybutane, mitomycin C, hydroxyurea, doxorubicin, and UV light stimulated LOH at all loci at frequencies ranging from 1-8 x 10(-3) per cell (10-123 times higher than in untreated cells). This greatly exceeds the frequencies at which these agents have been reported to induce point mutations and is comparable to the rates of LOH observed in ES cells lacking the gene responsible for Bloom syndrome, an inherited DNA repair defect that results in greatly increased risk of cancer. These results suggest that LOH contributes significantly to the carcinogenicity of a variety of mutagens and raises the possibility that genome-wide LOH observed in some human cancers may reflect prior exposure to genotoxic agents rather than a state of chromosomal instability during the carcinogenic process. Finally, as a practical matter, chemically induced LOH is expected to enhance the recovery of homozygous recessive mutants from phenotype-based genetic screens in mammalian cells.
Donahue, Sarah L.; Lin, Qing; Cao, Shang; Ruley, H. Earl
Widespread losses of heterozygosity (LOH) in human cancer have been thought to result from chromosomal instability caused by mutations affecting DNA repair/genome maintenance. However, the origin of LOH in most tumors is unknown. The present study examined the ability of carcinogenic agents to induce LOH at 53 sites throughout the genome of normal diploid mouse ES cells. Brief exposures to nontoxic levels of methylnitrosourea, diepoxybutane, mitomycin C, hydroxyurea, doxorubicin, and UV light stimulated LOH at all loci at frequencies ranging from 1–8 × 10−3 per cell (10–123 times higher than in untreated cells). This greatly exceeds the frequencies at which these agents have been reported to induce point mutations and is comparable to the rates of LOH observed in ES cells lacking the gene responsible for Bloom syndrome, an inherited DNA repair defect that results in greatly increased risk of cancer. These results suggest that LOH contributes significantly to the carcinogenicity of a variety of mutagens and raises the possibility that genome-wide LOH observed in some human cancers may reflect prior exposure to genotoxic agents rather than a state of chromosomal instability during the carcinogenic process. Finally, as a practical matter, chemically induced LOH is expected to enhance the recovery of homozygous recessive mutants from phenotype-based genetic screens in mammalian cells. PMID:16868089
Yu, Lan; Shao, Chaogang; Ye, Xinghuo; Meng, Yijun; Zhou, Yincong; Chen, Ming
MicroRNAs (miRNAs) are important regulators of gene expression. The recent advances in high-throughput sequencing (HTS) technique have greatly facilitated large-scale detection of the miRNAs. However, thoroughly discovery of novel miRNAs from the available HTS data sets remains a major challenge. In this study, we observed that Dicer-mediated cleavage sites for the processing of the miRNA precursors could be mapped by using degradome sequencing data in both animals and plants. In this regard, a novel tool, miRNA Digger, was developed for systematical discovery of miRNA candidates through genome-wide screening of cleavage signals based on degradome sequencing data. To test its sensitivity and reliability, miRNA Digger was applied to discover miRNAs from four organs of Arabidopsis. The results revealed that a majority of already known mature miRNAs along with their miRNA*s expressed in these four organs were successfully recovered. Notably, a total of 30 novel miRNA-miRNA* pairs that have not been registered in miRBase were discovered by miRNA Digger. After target prediction and degradome sequencing data-based validation, eleven miRNA-target interactions involving six of the novel miRNAs were identified. Taken together, miRNA Digger could be applied for sensitive detection of novel miRNAs and it could be freely downloaded from http://www.bioinfolab.cn/miRNA_Digger/index.html.
Goto, Kaku; Annan, Dorcas A.; Morita, Tomoko; Li, Wenwen; Muroyama, Ryosuke; Matsubara, Yasuo; Ito, Sayaka; Nakagawa, Ryo; Tanoue, Yasushi; Jinushi, Masahisa; Kato, Naoya
Pharmacotherapeutic options are limited for hepatocellular carcinoma (HCC). Recently, we identified the anti-tumor ligand MHC class I polypeptide-related sequence A (MICA) gene as a susceptibility gene for hepatitis C virus-induced HCC in a genome-wide association study (GWAS). To prove the concept of HCC immunotherapy based on the results of a GWAS, in the present study, we searched for drugs that could restore MICA expression. A screen of the FDA-approved drug library identified the anti-cancer agent vorinostat as the strongest hit, suggesting histone deacetylase inhibitors (HDACis) as potent candidates. Indeed, the HDACi-induced expression of MICA specific to HCC cells enhanced natural killer (NK) cell-mediated cytotoxicity in co-culture, which was further reinforced by treatment with an inhibitor of MICA sheddase. Similarly augmented anti-tumor activity of NK cells via NK group 2D was observed in vivo. Metabolomics analysis revealed HDACi-mediated alterations in energy supply and stresses for MICA induction and HCC inhibition, providing a mechanism for the chemoimmunotherapeutic actions. These data are indicative of promising strategies for selective HCC innate immunotherapy. PMID:27910927
Rekaya, Romdhane; Smith, Shannon; Hay, El Hamidi; Farhat, Nourhene; Aggrey, Samuel E
Errors in the binary status of some response traits are frequent in human, animal, and plant applications. These error rates tend to differ between cases and controls because diagnostic and screening tests have different sensitivity and specificity. This increases the inaccuracies of classifying individuals into correct groups, giving rise to both false-positive and false-negative cases. The analysis of these noisy binary responses due to misclassification will undoubtedly reduce the statistical power of genome-wide association studies (GWAS). A threshold model that accommodates varying diagnostic errors between cases and controls was investigated. A simulation study was carried out where several binary data sets (case–control) were generated with varying effects for the most influential single nucleotide polymorphisms (SNPs) and different diagnostic error rate for cases and controls. Each simulated data set consisted of 2000 individuals. Ignoring misclassification resulted in biased estimates of true influential SNP effects and inflated estimates for true noninfluential markers. A substantial reduction in bias and increase in accuracy ranging from 12% to 32% was observed when the misclassification procedure was invoked. In fact, the majority of influential SNPs that were not identified using the noisy data were captured using the proposed method. Additionally, truly misclassified binary records were identified with high probability using the proposed method. The superiority of the proposed method was maintained across different simulation parameters (misclassification rates and odds ratios) attesting to its robustness. PMID:27942229
Zaykin, Dmitri V; Kozbur, Damian O
An appealing genome-wide association study design compares one large control group against several disease samples. A pioneering study by the Wellcome Trust Case Control Consortium that employed such a design has identified multiple susceptibility regions, many of which have been independently replicated. While reusing a control sample provides effective utilization of data, it also creates correlation between association statistics across diseases. An observation of a large association statistic for one of the diseases may greatly increase chances of observing a spuriously large association for a different disease. Accounting for the correlation is also particularly important when screening for SNPs that might be involved in a set of diseases with overlapping etiology. We describe methods that correct association statistics for dependency due to shared controls, and we describe ways to obtain a measure of overall evidence and to combine association signals across multiple diseases. The methods we describe require no access to individual subject data, instead, they efficiently utilize information contained in P-values for association reported for individual diseases. P-value based combined tests for association are flexible and essentially as powerful as the approach based on aggregating the individual subject data.
Svenningsson, Anna; Söderhäll, Cilla; Persson, Sofia; Lundberg, Fredrik; Luthman, Holger; Chung, Eddie; Gardiner, Mark; Kockum, Ingrid; Nordenskjöld, Agneta
Infantile hypertrophic pyloric stenosis (IHPS) is a common cause of upper gastrointestinal obstruction during infancy. A multifactorial background of the disease is well established. Multiple susceptibility loci including the neuronal nitric oxide synthase (NOS1) gene have previously been linked to IHPS, but contradictory results of linkage studies in different materials indicate genetic heterogeneity. To identify IHPS susceptibility loci, we conducted a genome-wide linkage analysis in 37 Swedish families. In regions where the Swedish material showed most evidence in favor of linkage, 31 additional British IHPS families were analyzed. Evidence in favor of significant linkage was observed in the Swedish material to two loci on chromosome 2q24 (non-parametric linkage (NPL) =3.77) and 7p21 (NPL=4.55). In addition, evidence of suggestive linkage was found to two loci on chromosome 6p21 (NPL=2.97) and 12q24 (NPL=2.63). Extending the material with British samples did not enhance the level of significance. Regions with linkage harbor interesting candidate genes, such as glucagon-like peptide-2 (GLP-2 encoded by the glucagon gene GCG), NOS1, motilin (MLN) and neuropeptide Y (NPY). The coding exons for GLP-2, and NPY were screened for mutations with negative results. In conclusion, we could confirm suggestive linkage to the region harboring the NOS1 gene and detected additional novel susceptibility loci for IHPS.
Lipinski, Marta M.; Zheng, Bin; Lu, Tao; Yan, Zhenyu; Py, Bénédicte F.; Ng, Aylwin; Xavier, Ramnik J.; Li, Cheng; Yankner, Bruce A.; Scherzer, Clemens R.; Yuan, Junying
Dysregulation of autophagy, a cellular catabolic mechanism essential for degradation of misfolded proteins, has been implicated in multiple neurodegenerative diseases. However, the mechanisms that lead to the autophagy dysfunction are still not clear. Based on the results of a genome-wide screen, we show that reactive oxygen species (ROS) serve as common mediators upstream of the activation of the type III PI3 kinase, which is critical for the initiation of autophagy. Furthermore, ROS play an essential function in the induction of the type III PI3 kinase and autophagy in response to amyloid β peptide, the main pathogenic mediator of Alzheimer's disease (AD). However, lysosomal blockage also caused by Aβ is independent of ROS. In addition, we demonstrate that autophagy is transcriptionally down-regulated during normal aging in the human brain. Strikingly, in contrast to normal aging, we observe transcriptional up-regulation of autophagy in the brains of AD patients, suggesting that there might be a compensatory regulation of autophagy. Interestingly, we show that an AD drug and an AD drug candidate have inhibitory effects on autophagy, raising the possibility that decreasing input into the lysosomal system may help to reduce cellular stress in AD. Finally, we provide a list of candidate drug targets that can be used to safely modulate levels of autophagy without causing cell death. PMID:20660724
Radhakrishna, Uppala; Albayrak, Samet; Alpay-Savasan, Zeynep; Zeb, Amna; Turkoglu, Onur; Sobolewski, Paul; Bahado-Singh, Ray O.
Congenital heart defect (CHD) is the most common cause of death from congenital anomaly. Among several candidate epigenetic mechanisms, DNA methylation may play an important role in the etiology of CHDs. We conducted a genome-wide DNA methylation analysis using an Illumina Infinium 450k human methylation assay in a cohort of 24 newborns who had aortic valve stenosis (AVS), with gestational-age matched controls. The study identified significantly-altered CpG methylation at 59 sites in 52 genes in AVS subjects as compared to controls (either hypermethylated or demethylated). Gene Ontology analysis identified biological processes and functions for these genes including positive regulation of receptor-mediated endocytosis. Consistent with prior clinical data, the molecular function categories as determined using DAVID identified low-density lipoprotein receptor binding, lipoprotein receptor binding and identical protein binding to be over-represented in the AVS group. A significant epigenetic change in the APOA5 and PCSK9 genes known to be involved in AVS was also observed. A large number CpG methylation sites individually demonstrated good to excellent diagnostic accuracy for the prediction of AVS status, thus raising possibility of molecular screening markers for this disorder. Using epigenetic analysis we were able to identify genes significantly involved in the pathogenesis of AVS. PMID:27152866
Mirabello, Lisa; Koster, Roelof; Moriarity, Branden S.; Spector, Logan G.; Meltzer, Paul S.; Gary, Joy; Machiela, Mitchell J.; Pankratz, Nathan; Panagiotou, Orestis A.; Largaespada, David; Wang, Zhaoming; Gastier-Foster, Julie M.; Gorlick, Richard; Khanna, Chand; de Toledo, Silvia Regina Caminada; Petrilli, Antonio S.; Patiño-Garcia, Ana; Sierrasesúmaga, Luis; Lecanda, Fernando; Andrulis, Irene L.; Wunder, Jay S.; Gokgoz, Nalan; Serra, Massimo; Hattinger, Claudia; Picci, Piero; Scotlandi, Katia; Flanagan, Adrienne M.; Tirabosco, Roberto; Amary, Maria Fernanda; Halai, Dina; Ballinger, Mandy L.; Thomas, David M.; Davis, Sean; Barkauskas, Donald A.; Marina, Neyssa; Helman, Lee; Otto, George M.; Becklin, Kelsie L.; Wolf, Natalie K.; Weg, Madison T.; Tucker, Margaret; Wacholder, Sholom; Fraumeni, Joseph F.; Caporaso, Neil E.; Boland, Joseph F.; Hicks, Belynda D.; Vogt, Aurelie; Burdett, Laurie; Yeager, Meredith; Hoover, Robert N.; Chanock, Stephen J.; Savage, Sharon A.
Metastasis is the leading cause of death in osteosarcoma patients, the most common pediatric bone malignancy. We conducted a multi-stage genome-wide association study of osteosarcoma metastasis at diagnosis in 935 osteosarcoma patients to determine whether germline genetic variation contributes to risk of metastasis. We identified a SNP, rs7034162, in NFIB significantly associated with metastasis in European osteosarcoma cases, as well as in cases of African and Brazilian ancestry (meta-analysis of all cases: P=1.2×10−9, OR 2.43, 95% CI 1.83–3.24). The risk allele was significantly associated with lowered NFIB expression, which led to increased osteosarcoma cell migration, proliferation, and colony formation. Additionally, a transposon screen in mice identified a significant proportion of osteosarcomas harboring inactivating insertions in Nfib, and had lowered Nfib expression. These data suggest that germline genetic variation at rs7034162 is important in osteosarcoma metastasis, and that NFIB is an osteosarcoma metastasis susceptibility gene. PMID:26084801
distribution unlimited. QC – quality control QTL – quantitative trait loci SNP – single nucleotide polymorphism TE – Tris + EDTA TBE – Tris + Boric Acid + EDTA WGSA – whole genome sampling assay ...canine intelligence testing protocol EDTA – ethylenediaminetetraacetic acid GWAS – genome-wide association study LD – linkage disequilibrium MWD
We generated 13,789 single nucleotide plymorphism (SNP) markers from 97 melon accessions using genotyping by sequencing and anchored them to chromosomes to understand genome-wide fixation index between various melon morphotypes and linkage disequilibrium (LD) decay for inodorus and cantalupensis, th...
The genetic structure of sheep reflects their domestication and subsequent formation into discrete breeds. Understanding genetic structure is essential for achieving genetic improvement through genome-wide association studies, genomic selection and the dissection of quantitative traits. After identi...
Background Although melon (Cucumis melo L.) is an economically important fruit crop, no genome-wide sequence information is openly available at the current time. We therefore sequenced BAC-ends representing a total of 33,024 clones, half of them from a previously described melon BAC library generated with restriction endonucleases and the remainder from a new random-shear BAC library. Results We generated a total of 47,140 high-quality BAC-end sequences (BES), 91.7% of which were paired-BES. Both libraries were assembled independently and then cross-assembled to obtain a final set of 33,372 non-redundant, high-quality sequences. These were grouped into 6,411 contigs (4.5 Mb) and 26,961 non-assembled BES (14.4 Mb), representing ~4.2% of the melon genome. The sequences were used to screen genomic databases, identifying 7,198 simple sequence repeats (corresponding to one microsatellite every 2.6 kb) and 2,484 additional repeats of which 95.9% represented transposable elements. The sequences were also used to screen expressed sequence tag (EST) databases, revealing 11,372 BES that were homologous to ESTs. This suggests that ~30% of the melon genome consists of coding DNA. We observed regions of microsynteny between melon paired-BES and six other dicotyledonous plant genomes. Conclusion The analysis of nearly 50,000 BES from two complementary genomic libraries covered ~4.2% of the melon genome, providing insight into properties such as microsatellite and transposable element distribution, and the percentage of coding DNA. The observed synteny between melon paired-BES and six other plant genomes showed that useful comparative genomic data can be derived through large scale BAC-end sequencing by anchoring a small proportion of the melon genome to other sequenced genomes. PMID:21054843
Zhou, Xiang; Stephens, Matthew
Multivariate linear mixed models (mvLMMs) are powerful tools for testing associations between single-nucleotide polymorphisms and multiple correlated phenotypes while controlling for population stratification in genome-wide association studies. We present efficient algorithms in the genome-wide efficient mixed model association (GEMMA) software for fitting mvLMMs and computing likelihood ratio tests. These algorithms offer improved computation speed, power and P-value calibration over existing methods, and can deal with more than two phenotypes.
Slovak, Radka; Göschl, Christian; Seren, Ümit; Busch, Wolfgang
Genome-wide association (GWA) mapping is a powerful technique to address the molecular basis of genotype to phenotype relationships and to map regulators of biological processes. This chapter presents a protocol for genome-wide association mapping in Arabidopsis thaliana using the user-friendly internet application GWAPP, and provides a specific protocol for acquiring root trait data suitable for GWA studies using the semi-automated, high-throughput phenotyping pipeline BRAT for early root growth.
Su, Jingliang; Yang, Jun; Zhao, Daimin; Kawula, Thomas H.; Banas, Jeffrey A.; Zhang, Jing-Ren
Francisella tularensis is a gram-negative pathogen that causes life-threatening infections in humans and has potential for use as a biological weapon. The genetic basis of the F. tularensis virulence is poorly understood. This study screened a total of 3,936 transposon mutants of the live vaccine strain for infection in a mouse model of respiratory tularemia by signature-tagged mutagenesis. We identified 341 mutants attenuated for infection in the lungs. The transposon disruptions were mapped to 95 different genes, virtually all of which are also present in the genomes of other F. tularensis strains, including human pathogenic F. tularensis strain Schu S4. A small subset of these attenuated mutants carried insertions in the genes encoding previously known virulence factors, but the majority of the identified genes have not been previously linked to F. tularensis virulence. Among these are genes encoding putative membrane proteins, proteins associated with stress responses, metabolic proteins, transporter proteins, and proteins with unknown functions. Several attenuated mutants contained disruptions in a putative capsule locus which partially resembles the poly-γ-glutamate capsule biosynthesis locus of Bacillus anthracis, the anthrax agent. Deletional mutation analysis confirmed that this locus is essential for F. tularensis virulence. PMID:17420240
Peltier, Jean-Benoît; Emanuelsson, Olof; Kalume, Dário E.; Ytterberg, Jimmy; Friso, Giulia; Rudella, Andrea; Liberles, David A.; Söderberg, Linda; Roepstorff, Peter; von Heijne, Gunnar; van Wijk, Klaas J.
Experimental proteome analysis was combined with a genome-wide prediction screen to characterize the protein content of the thylakoid lumen of Arabidopsis chloroplasts. Soluble thylakoid proteins were separated by two-dimensional electrophoresis and identified by mass spectrometry. The identities of 81 proteins were established, and N termini were sequenced to validate localization prediction. Gene annotation of the identified proteins was corrected by experimental data, and an interesting case of alternative splicing was discovered. Expression of a surprising number of paralogs was detected. Expression of five isomerases of different classes suggests strong (un)folding activity in the thylakoid lumen. These isomerases possibly are connected to a network of peripheral and lumenal proteins involved in antioxidative response, including peroxiredoxins, m-type thioredoxins, and a lumenal ascorbate peroxidase. Characteristics of the experimentally identified lumenal proteins and their orthologs were used for a genome-wide prediction of the lumenal proteome. Lumenal proteins with a typical twin-arginine translocation motif were predicted with good accuracy and sensitivity and included additional isomerases and proteases. Thus, prime functions of the lumenal proteome include assistance in the folding and proteolysis of thylakoid proteins as well as protection against oxidative stress. Many of the predicted lumenal proteins must be present at concentrations at least 10,000-fold lower than proteins of the photosynthetic apparatus. PMID:11826309
Yang, Fang; Silber, Sherman; Leu, N Adrian; Oates, Robert D; Marszalek, Janet D; Skaletsky, Helen; Brown, Laura G; Rozen, Steve; Page, David C; Wang, P Jeremy
Genome-wide recombination is essential for genome stability, evolution, and speciation. Mouse Tex11, an X-linked meiosis-specific gene, promotes meiotic recombination and chromosomal synapsis. Here, we report that TEX11 is mutated in infertile men with non-obstructive azoospermia and that an analogous mutation in the mouse impairs meiosis. Genetic screening of a large cohort of idiopathic infertile men reveals that TEX11 mutations, including frameshift and splicing acceptor site mutations, cause infertility in 1% of azoospermic men. Functional evaluation of three analogous human TEX11 missense mutations in transgenic mouse models identified one mutation (V748A) as a potential infertility allele and found two mutations non-causative. In the mouse model, an intronless autosomal Tex11 transgene functionally substitutes for the X-linked Tex11 gene, providing genetic evidence for the X-to-autosomal retrotransposition evolution phenomenon. Furthermore, we find that TEX11 protein levels modulate genome-wide recombination rates in both sexes. These studies indicate that TEX11 alleles affecting expression level or substituting single amino acids may contribute to variations in recombination rates between sexes and among individuals in humans. PMID:26136358
Veerappa, Avinash M; Saldanha, Marita; Padakannaya, Prakash; Ramachandra, Nallur B
Developmental dyslexia (DD) is a complex heritable disorder with unexpected difficulty in learning to read and spell despite adequate intelligence, education, environment, and normal senses. We performed genome-wide screening for copy number variations (CNVs) in 10 large Indian dyslexic families using Affymetrix Genome-Wide Human SNP Array 6.0. Results revealed the complex genomic rearrangements due to one non-contiguous deletion and five contiguous micro duplications and micro deletions at 17q21.31 region in three dyslexic families. CNVs in this region harbor the genes KIAA1267, LRRC37A, ARL17A/B, NSFP1, and NSF. The CNVs in case 1 and case 2 at this locus were found to be in homozygous state and case 3 was a de novo CNV. These CNVs were found with at least one CNV having a common break and end points in the parents. This cluster of genes containing NSF is implicated in learning, cognition, and memory, though not formally associated with dyslexia. Molecular network analysis of these and other dyslexia related module genes suggests NSF and other genes to be associated with cellular/vesicular membrane fusion and synaptic transmission. Thus, we suggest that NSF in this cluster would be the nearest gene responsible for the learning disability phenotype.
Hoffmann, Thomas J.; Passarelli, Michael N.; Graff, Rebecca E.; Emami, Nima C.; Sakoda, Lori C.; Jorgenson, Eric; Habel, Laurel A.; Shan, Jun; Ranatunga, Dilrini K.; Quesenberry, Charles P.; Chao, Chun R.; Ghai, Nirupa R.; Aaronson, David; Presti, Joseph; Nordström, Tobias; Wang, Zhaoming; Berndt, Sonja I.; Chanock, Stephen J.; Mosley, Jonathan D.; Klein, Robert J.; Middha, Mridu; Lilja, Hans; Melander, Olle; Kvale, Mark N.; Kwok, Pui-Yan; Schaefer, Catherine; Risch, Neil; Van Den Eeden, Stephen K.; Witte, John S.
Prostate-specific antigen (PSA) levels have been used for detection and surveillance of prostate cancer (PCa). However, factors other than PCa—such as genetics—can impact PSA. Here we present findings from a genome-wide association study (GWAS) of PSA in 28,503 Kaiser Permanente whites and 17,428 men from replication cohorts. We detect 40 genome-wide significant (P<5 × 10−8) single-nucleotide polymorphisms (SNPs): 19 novel, 15 previously identified for PSA (14 of which were also PCa-associated), and 6 previously identified for PCa only. Further analysis incorporating PCa cases suggests that at least half of the 40 SNPs are PSA-associated independent of PCa. The 40 SNPs explain 9.5% of PSA variation in non-Hispanic whites, and the remaining GWAS SNPs explain an additional 31.7%; this percentage is higher in younger men, supporting the genetic basis of PSA levels. These findings provide important information about genetic markers for PSA that may improve PCa screening, thereby reducing over-diagnosis and over-treatment. PMID:28139693
Wang, Jia; Jian, Hongju; Wei, Lijuan; Qu, Cunmin; Xu, Xinfu; Lu, Kun; Qian, Wei; Li, Jiana; Li, Maoteng; Liu, Liezhao
A stable yellow-seeded variety is the breeding goal for obtaining the ideal rapeseed (Brassica napus L.) plant, and the amount of acid detergent lignin (ADL) in the seeds and the hull content (HC) are often used as yellow-seeded rapeseed screening indices. In this study, a genome-wide association analysis of 520 accessions was performed using the Q + K model with a total of 31,839 single-nucleotide polymorphism (SNP) sites. As a result, three significant associations on the B. napus chromosomes A05, A09, and C05 were detected for seed ADL content. The peak SNPs were within 9.27, 14.22, and 20.86 kb of the key genes BnaA.PAL4, BnaA.CAD2/BnaA.CAD3, and BnaC.CCR1, respectively. Further analyses were performed on the major locus of A05, which was also detected in the seed HC examination. A comparison of our genome-wide association study (GWAS) results and previous linkage mappings revealed a common chromosomal region on A09, which indicates that GWAS can be used as a powerful complementary strategy for dissecting complex traits in B. napus. Genomic selection (GS) utilizing the significant SNP markers based on the GWAS results exhibited increased predictive ability, indicating that the predictive ability of a given model can be substantially improved by using GWAS and GS.
Niu, Yao-Fang; Ye, Chengyin; He, Ji; Han, Fang; Guo, Long-Biao; Zheng, Hou-Feng; Chen, Guo-Bo
In line with open-source genetics, we report a novel linear regression technique for genome-wide association studies (GWAS), called Open GWAS algoriTHm (OATH). When individual-level data are not available, OATH can not only completely reproduce reported results from an experimental model, but also recover underreported results from other alternative models with a different combination of nuisance parameters using naïve summary statistics (NSS). OATH can also reliably evaluate all reported results in-depth (e.g., p-value variance analysis), as demonstrated for 42 Arabidopsis phenotypes under three magnesium (Mg) conditions. In addition, OATH can be used for consortium-driven genome-wide association meta-analyses (GWAMA), and can greatly improve the flexibility of GWAMA. A prototype of OATH is available in the Genetic Analysis Repository (https://github.com/gc5k/GEAR). PMID:28122950
Background Deconjugation of ubiquitin and/or ubiquitin-like modified protein substrates is essential to modulate protein-protein interactions and, thus, signaling processes in cells. Although deubiquitylating (deubiquitinating) enzymes (DUBs) play a key role in this process, however, their function and regulation remain insufficiently understood. The "loss-of-function" phenotype studies can provide important information to elucidate the gene function, and zebrafish is an excellent model for this goal. Results From an in silico genome-wide search, we found more than 90 putative DUBs encoded in the zebrafish genome belonging to six different subclasses. Out of them, 85 from five classical subclasses have been tested with morpholino (MO) knockdown experiments and 57 of them were found to be important in early development of zebrafish. These DUB morphants resulted in a complex and pleiotropic phenotype that, regardless of gene target, always affected the notochord. Based on the huC neuronal marker expression, we grouped them into five sets (groups I to V). Group I DUBs (otud7b, uchl3 and bap1) appear to be involved in the Notch signaling pathway based on the neuronal hyperplasia, while group IV DUBs (otud4, usp5, usp15 and usp25) play a critical role in dorsoventral patterning through the BMP pathway. Conclusion We have identified an exhaustive list of genes in the zebrafish genome belonging to the five established classes of DUBs. Additionally, we performed the corresponding MO knockdown experiments in zebrafish as well as functional studies for a subset of the predicted DUB genes. The screen results in this work will stimulate functional follow-up studies of potential DUB genes using the zebrafish model system. PMID:20040115
Gandolfi, Barbara; Gruffydd-Jones, Timothy J.; Malik, Richard; Cortes, Alejandro; Jones, Boyd R.; Helps, Chris R.; Prinzenberg, Eva M.; Erhardt, George; Lyons, Leslie A.
Burmese is an old and popular cat breed, however, several health concerns, such as hypokalemia and a craniofacial defect, are prevalent, endangering the general health of the breed. Hypokalemia, a subnormal serum potassium ion concentration ([K+]), most often occurs as a secondary problem but can occur as a primary problem, such as hypokalaemic periodic paralysis in humans, and as feline hypokalaemic periodic polymyopathy primarily in Burmese. The most characteristic clinical sign of hypokalemia in Burmese is a skeletal muscle weakness that is frequently episodic in nature, either generalized, or sometimes localized to the cervical and thoracic limb girdle muscles. Burmese hypokalemia is suspected to be a single locus autosomal recessive trait. A genome wide case-control study using the illumina Infinium Feline 63K iSelect DNA array was performed using 35 cases and 25 controls from the Burmese breed that identified a locus on chromosome E1 associated with hypokalemia. Within approximately 1.2 Mb of the highest associated SNP, two candidate genes were identified, KCNH4 and WNK4. Direct sequencing of the genes revealed a nonsense mutation, producing a premature stop codon within WNK4 (c.2899C>T), leading to a truncated protein that lacks the C-terminal coiled-coil domain and the highly conserved Akt1/SGK phosphorylation site. All cases were homozygous for the mutation. Although the exact mechanism causing hypokalemia has not been determined, extrapolation from the homologous human and mouse genes suggests the mechanism may involve a potassium-losing nephropathy. A genetic test to screen for the genetic defect within the active breeding population has been developed, which should lead to eradication of the mutation and improved general health within the breed. Moreover, the identified mutation may help clarify the role of the protein in K+ regulation and the cat represents the first animal model for WNK4-associated hypokalemia. PMID:23285264
Gandolfi, Barbara; Gruffydd-Jones, Timothy J; Malik, Richard; Cortes, Alejandro; Jones, Boyd R; Helps, Chris R; Prinzenberg, Eva M; Erhardt, George; Lyons, Leslie A
Burmese is an old and popular cat breed, however, several health concerns, such as hypokalemia and a craniofacial defect, are prevalent, endangering the general health of the breed. Hypokalemia, a subnormal serum potassium ion concentration ([K(+)]), most often occurs as a secondary problem but can occur as a primary problem, such as hypokalaemic periodic paralysis in humans, and as feline hypokalaemic periodic polymyopathy primarily in Burmese. The most characteristic clinical sign of hypokalemia in Burmese is a skeletal muscle weakness that is frequently episodic in nature, either generalized, or sometimes localized to the cervical and thoracic limb girdle muscles. Burmese hypokalemia is suspected to be a single locus autosomal recessive trait. A genome wide case-control study using the illumina Infinium Feline 63K iSelect DNA array was performed using 35 cases and 25 controls from the Burmese breed that identified a locus on chromosome E1 associated with hypokalemia. Within approximately 1.2 Mb of the highest associated SNP, two candidate genes were identified, KCNH4 and WNK4. Direct sequencing of the genes revealed a nonsense mutation, producing a premature stop codon within WNK4 (c.2899C>T), leading to a truncated protein that lacks the C-terminal coiled-coil domain and the highly conserved Akt1/SGK phosphorylation site. All cases were homozygous for the mutation. Although the exact mechanism causing hypokalemia has not been determined, extrapolation from the homologous human and mouse genes suggests the mechanism may involve a potassium-losing nephropathy. A genetic test to screen for the genetic defect within the active breeding population has been developed, which should lead to eradication of the mutation and improved general health within the breed. Moreover, the identified mutation may help clarify the role of the protein in K⁺ regulation and the cat represents the first animal model for WNK4-associated hypokalemia.
Zuiderveen, Grady H.; Padder, Bilal A.; Kamfwa, Kelvin; Song, Qijian; Kelly, James D.
Anthracnose is a seed-borne disease of common bean (Phaseolus vulgaris L.) caused by the fungus Colletotrichum lindemuthianum, and the pathogen is cosmopolitan in distribution. The objectives of this study were to identify new sources of anthracnose resistance in a diverse panel of 230 Andean beans comprised of multiple seed types and market classes from the Americas, Africa, and Europe, and explore the genetic basis of this resistance using genome-wide association mapping analysis (GWAS). Twenty-eight of the 230 lines tested were resistant to six out of the eight races screened, but only one cultivar Uyole98 was resistant to all eight races (7, 39, 55, 65, 73, 109, 2047, and 3481) included in the study. Outputs from the GWAS indicated major quantitative trait loci (QTL) for resistance on chromosomes, Pv01, Pv02, and Pv04 and two minor QTL on Pv10 and Pv11. Candidate genes associated with the significant SNPs were detected on all five chromosomes. An independent QTL study was conducted to confirm the physical location of the Co-1 locus identified on Pv01 in an F4:6 recombinant inbred line (RIL) population. Resistance was determined to be conditioned by the single dominant gene Co-1 that mapped between 50.16 and 50.30 Mb on Pv01, and an InDel marker (NDSU_IND_1_50.2219) tightly linked to the gene was developed. The information reported will provide breeders with new and diverse sources of resistance and genomic regions to target in the development of anthracnose resistance in Andean beans. PMID:27270627
Jeemon, Panniyammakal; Pettigrew, Kerry; Sainsbury, Christopher; Prabhakaran, Dorairaj; Padmanabhan, Sandosh
Genome-wide association studies (GWAS) have identified several genetic variants associated with coronary heart disease (CHD), and variations in plasma lipoproteins and blood pressure (BP). Loci corresponding to CDKN2A/CDKN2B/ANRIL, MTHFD1L, CELSR2, PSRC1 and SORT1 genes have been associated with CHD, and TMEM57, DOCK7, CELSR2, APOB, ABCG5, HMGCR, TRIB1, FADS2/S3, LDLR, NCAN and TOMM40-APOE with total cholesterol. Similarly, CELSR2-PSRC1-SORT1, PCSK9, APOB, HMGCR, NCAN-CILP2-PBX4, LDLR, TOMM40-APOE, and APOC1-APOE are associated with variations in low-density lipoprotein cholesterol levels. Altogether, forty, forty three and twenty loci have been associated with high-density lipoprotein cholesterol, triglycerides and BP phenotypes, respectively. Some of these identified loci are common for all the traits, some do not map to functional genes, and some are located in genes that encode for proteins not previously known to be involved in the biological pathway of the trait. GWAS have been successful at identifying new and unexpected genetic loci common to diseases and traits, thus rapidly providing key novel insights into disease biology. Since genotype information is fixed, with minimum biological variability, it is useful in early life risk prediction. However, these variants explain only a small proportion of the observed variance of these traits. Therefore, the utility of genetic determinants in assessing risk at later stages of life has limited immediate clinical impact. The future application of genetic screening will be in identifying risk groups early in life to direct targeted preventive measures. PMID:21860704
Jeemon, Panniyammakal; Pettigrew, Kerry; Sainsbury, Christopher; Prabhakaran, Dorairaj; Padmanabhan, Sandosh
Genome-wide association studies (GWAS) have identified several genetic variants associated with coronary heart disease (CHD), and variations in plasma lipoproteins and blood pressure (BP). Loci corresponding to CDKN2A/CDKN2B/ANRIL, MTHFD1L, CELSR2, PSRC1 and SORT1 genes have been associated with CHD, and TMEM57, DOCK7, CELSR2, APOB, ABCG5, HMGCR, TRIB1, FADS2/S3, LDLR, NCAN and TOMM40-APOE with total cholesterol. Similarly, CELSR2-PSRC1-SORT1, PCSK9, APOB, HMGCR, NCAN-CILP2-PBX4, LDLR, TOMM40-APOE, and APOC1-APOE are associated with variations in low-density lipoprotein cholesterol levels. Altogether, forty, forty three and twenty loci have been associated with high-density lipoprotein cholesterol, triglycerides and BP phenotypes, respectively. Some of these identified loci are common for all the traits, some do not map to functional genes, and some are located in genes that encode for proteins not previously known to be involved in the biological pathway of the trait. GWAS have been successful at identifying new and unexpected genetic loci common to diseases and traits, thus rapidly providing key novel insights into disease biology. Since genotype information is fixed, with minimum biological variability, it is useful in early life risk prediction. However, these variants explain only a small proportion of the observed variance of these traits. Therefore, the utility of genetic determinants in assessing risk at later stages of life has limited immediate clinical impact. The future application of genetic screening will be in identifying risk groups early in life to direct targeted preventive measures.
Shirasawa, Kenta; Hirakawa, Hideki; Nunome, Tsukasa; Tabata, Satoshi; Isobe, Sachiko
Genome-wide mutations induced by ethyl methanesulfonate (EMS) and gamma irradiation in the tomato Micro-Tom genome were identified by a whole-genome shotgun sequencing analysis to estimate the spectrum and distribution of whole-genome DNA mutations and the frequency of deleterious mutations. A total of ~370 Gb of paired-end reads for four EMS-induced mutants and three gamma-ray-irradiated lines as well as a wild-type line were obtained by next-generation sequencing technology. Using bioinformatics analyses, we identified 5920 induced single nucleotide variations and insertion/deletion (indel) mutations. The predominant mutations in the EMS mutants were C/G to T/A transitions, while in the gamma-ray mutants, C/G to T/A transitions, A/T to T/A transversions, A/T to G/C transitions and deletion mutations were equally common. Biases in the base composition flanking mutations differed between the mutagenesis types. Regarding the effects of the mutations on gene function, >90% of the mutations were located in intergenic regions, and only 0.2% were deleterious. In addition, we detected 1,140,687 spontaneous single nucleotide polymorphisms and indel polymorphisms in wild-type Micro-Tom lines. We also found copy number variation, deletions and insertions of chromosomal segments in both the mutant and wild-type lines. The results provide helpful information not only for mutation research, but also for mutant screening methodology with reverse-genetic approaches.
Matsuda, Fumio; Nakabayashi, Ryo; Yang, Zhigang; Okazaki, Yozo; Yonemaru, Jun-ichi; Ebana, Kaworu; Yano, Masahiro; Saito, Kazuki
Plants produce structurally diverse secondary (specialized) metabolites to increase their fitness for survival under adverse environments. Several bioactive compounds for new drugs have been identified through screening of plant extracts. In this study, genome-wide association studies (GWAS) were conducted to investigate the genetic architecture behind the natural variation of rice secondary metabolites. GWAS using the metabolome data of 175 rice accessions successfully identified 323 associations among 143 single nucleotide polymorphisms (SNPs) and 89 metabolites. The data analysis highlighted that levels of many metabolites are tightly associated with a small number of strong quantitative trait loci (QTLs). The tight association may be a mechanism generating strains with distinct metabolic composition through the crossing of two different strains. The results indicate that one plant species produces more diverse phytochemicals than previously expected, and plants still contain many useful compounds for human applications. PMID:25267402
Jayakody, Lahiru N; Hayashi, Nobuyuki; Kitagaki, Hiroshi
Degradation of lignocellulose with pressurised hot water is an efficient method of bioethanol production. However, the resultant solution inhibits ethanol fermentation by Saccharomyces cerevisiae. Here, we first report that glycolaldehyde, which is formed when lignocellulose is treated with pressurised hot water, inhibits ethanol fermentation. The final concentration of glycolaldehyde formed by the treatment of lignocellulose with pressurised hot water ranges from 1 to 24 M, and 1-10 mM glycolaldehyde was sufficient to inhibit fermentation. This result indicates that glycolaldehyde is one of the main substances responsible for inhibiting fermentation after pressurised hot water degradation of lignocellulose. Genome-wide screening of S. cerevisiae revealed that genes encoding alcohol dehydrogenase, methylglyoxal reductase, polysomes, and the ubiquitin ligase complex are required for glycolaldehyde tolerance. These novel findings will provide new perspectives on breeding yeast for bioethanol production from biomass treated with pressurised hot water.
Zou, Fei; Fine, Jason P.; Hu, Jianhua; Lin, D. Y.
Assessing genome-wide statistical significance is an important and difficult problem in multipoint linkage analysis. Due to multiple tests on the same genome, the usual pointwise significance level based on the chi-square approximation is inappropriate. Permutation is widely used to determine genome-wide significance. Theoretical approximations are available for simple experimental crosses. In this article, we propose a resampling procedure to assess the significance of genome-wide QTL mapping for experimental crosses. The proposed method is computationally much less intensive than the permutation procedure (in the order of 102 or higher) and is applicable to complex breeding designs and sophisticated genetic models that cannot be handled by the permutation and theoretical methods. The usefulness of the proposed method is demonstrated through simulation studies and an application to a Drosophila backcross. PMID:15611194
Zhou, Xiang; Stephens, Matthew
Linear mixed models have attracted considerable attention recently as a powerful and effective tool for accounting for population stratification and relatedness in genetic association tests. However, existing methods for exact computation of standard test statistics are computationally impractical for even moderate-sized genome-wide association studies. To address this issue, several approximate methods have been proposed. Here, we present an efficient exact method, which we refer to as genome-wide efficient mixed-model association (GEMMA), that makes approximations unnecessary in many contexts. This method is approximately n times faster than the widely used exact method known as efficient mixed-model association (EMMA), where n is the sample size, making exact genome-wide association analysis computationally practical for large numbers of individuals.
Krapohl, E; Plomin, R
One of the best predictors of children's educational achievement is their family's socioeconomic status (SES), but the degree to which this association is genetically mediated remains unclear. For 3000 UK-representative unrelated children we found that genome-wide single-nucleotide polymorphisms could explain a third of the variance of scores on an age-16 UK national examination of educational achievement and half of the correlation between their scores and family SES. Moreover, genome-wide polygenic scores based on a previously published genome-wide association meta-analysis of total number of years in education accounted for ~3.0% variance in educational achievement and ~2.5% in family SES. This study provides the first molecular evidence for substantial genetic influence on differences in children's educational achievement and its association with family SES. PMID:25754083
Krapohl, E; Plomin, R
One of the best predictors of children's educational achievement is their family's socioeconomic status (SES), but the degree to which this association is genetically mediated remains unclear. For 3000 UK-representative unrelated children we found that genome-wide single-nucleotide polymorphisms could explain a third of the variance of scores on an age-16 UK national examination of educational achievement and half of the correlation between their scores and family SES. Moreover, genome-wide polygenic scores based on a previously published genome-wide association meta-analysis of total number of years in education accounted for ~3.0% variance in educational achievement and ~2.5% in family SES. This study provides the first molecular evidence for substantial genetic influence on differences in children's educational achievement and its association with family SES.
Wacholder, Sholom; Rotunno, Melissa
Investigators planning studies within cohorts have many options for choosing an efficient sampling design for genome-wide association and other molecular epidemiology studies. Consideration of person-year and proportional hazards analyses of full cohorts may add further insight into ramifications of different designs. Empirical evidence from genome-wide association studies can supplement intuition and simulations in comparing properties of various case-control designs within cohorts. Additional theoretical and empirical work, justification of sampling choice in publications, and consideration of context and scientific aims can improve designs and, thereby, increase the scientific value and cost effectiveness of future studies.
Romanelli, Valeria; Nevado, Julián; Fraga, Mario; Trujillo, Alex Martín; Mori, Maria Ángeles; Fernández, Luis; Pérez de Nanclares, Guiomar; Martínez-Glez, Víctor; Pita, Guillermo; Meneses, Heloisa; Gracia, Ricardo; García-Miñaur, Sixto; García de Miguel, Purificación; Lecumberri, Beatriz; Rodríguez, José Ignacio; González Neira, Anna; Monk, David; Lapunzina, Pablo
Molecular studies in a patient with Beckwith-Wiedemann syndrome phenotype who developed two different tumours revealed an unexpected observation of almost complete loss of heterozygosity of all chromosomes. It is shown, by means of numerous molecular methods, that the absence of maternal contribution in somatic cells is due to high-degree (∼ 85%) genome-wide paternal uniparental disomy (UPD). The observations indicate that the genome-wide UPD results from diploidisation, and have important implications for genetic counselling and tumour surveillance for the growing number of UPD associated imprinting disorders.
Kim, H; Gordon, S; Dionne, R
Underlying molecular genetic mechanisms of diseases can be deciphered with unbiased strategies using recently developed technologies enabling genome-wide scale investigations. These technologies have been applied in scanning for genetic variations, gene expression profiles, and epigenetic changes for oral and craniofacial diseases. However, these approaches as applied to oral and craniofacial conditions are in the initial stages, and challenges remain to be overcome, including analysis of high throughput data and their interpretation. Here, we review methodology and studies using genome-wide approaches in oral and craniofacial diseases and suggest future directions. PMID:22913301
Lemieux Perreault, Louis-Philippe; Legault, Marc-André; Asselin, Géraldine; Dubé, Marie-Pierre
Genotype imputation is now commonly performed following genome-wide genotyping experiments. Imputation increases the density of analyzed genotypes in the dataset, enabling fine-mapping across the genome. However, the process of imputation using the most recent publicly available reference datasets can require considerable computation power and the management of hundreds of large intermediate files. We have developed genipe, a complete genome-wide imputation pipeline which includes automatic reporting, imputed data indexing and management, and a suite of statistical tests for imputed data commonly used in genetic epidemiology (Sequence Kernel Association Test, Cox proportional hazards for survival analysis, and linear mixed models for repeated measurements in longitudinal studies).
Ristova, Daniela; Busch, Wolfgang
Genome-wide association (GWA) mapping is a powerful method for the identification of alleles that underlie quantitative traits. It enables one to understand how genetic variation translates into phenotypic variation. In particular, plant hormone signaling pathways play a key role in shaping phenotypes. This chapter presents a protocol for genome-wide association mapping of root traits of Arabidopsis thaliana in the context of hormone research. We describe a specific protocol for acquiring primary and lateral root trait data that is appropriate for GWA studies using FIJI (ImageJ), and subsequent GWA mapping using a user-friendly Internet application.
Suzuki, Masako; Greally, John M
"Epigenetics" refers to a heritable change in transcriptional status without alteration in the primary nucleotide sequence. Epigenetics provides an extra layer of transcriptional control and plays a crucial role in normal development, as well as in pathological conditions. DNA methylation is one of the best known and well-studied epigenetic modifications. Genome-wide DNA methylation profiling has become recognized as a biologically and clinically important epigenomic assay. In this review, we discuss the strengths and weaknesses of the protocols for genome-wide DNA methylation profiling using massively parallel sequencing (MPS) techniques. We will also describe recently discovered DNA modifications, and the protocols to detect these modifications.
Misiewicz, Zuzanna; Hiekkalinna, Tero; Paunio, Tiina; Varilo, Teppo; Terwilliger, Joseph D; Partonen, Timo; Hovatta, Iiris
Acrophobia, an abnormal fear of heights, is a specific phobia characterized as apprehension cued by the occurrence or anticipation of elevated spaces. It is considered a complex trait with onset influenced by both genetic and environmental factors. Identification of genetic risk variants would provide novel insight into the genetic basis of the fear of heights phenotype and contribute to the molecular-level understanding of its aetiology. Genetic isolates may facilitate identification of susceptibility alleles due to reduced genetic heterogeneity. We took advantage of an internal genetic isolate in Finland in which a distinct acrophobia phenotype appears to be segregating in pedigrees originally ascertained for schizophrenia. We conducted parametric, nonparametric, joint linkage and linkage disequilibrium analyses using a microsatellite marker panel, genotyped in families to search for chromosomal regions correlated with acrophobia. Our results implicated a few regions with suggestive evidence for linkage on chromosomes 4q28 (LOD = 2.17), 8q24 (LOD = 2.09) and 13q21-q22 (LOD = 2.22). We observed no risk haplotypes shared between different families. These results suggest that genetic predisposition to acrophobia in this genetic isolate is unlikely to be mediated by a small number of shared high-risk alleles, but rather has a complex genetic architecture.
Belone, Andrea de Faria F.; Rosa, Patrícia S.; Trombone, Ana P. F.; Fachin, Luciana R. V.; Guidella, Cássio C.; Ura, Somei; Barreto, Jaison A.; Pinilla, Mabel G.; de Carvalho, Alex F.; Carraro, Dirce M.; Soares, Fernando A.; Soares, Cleverson T.
Leprosy, an infectious disease caused by Mycobacterium leprae, affects millions of people worldwide. However, little is known regarding its molecular pathophysiological mechanisms. In this study, a comprehensive assessment of human mRNA was performed on leprosy skin lesions by using DNA chip microarrays, which included the entire spectrum of the disease along with its reactional states. Sixty-six samples from leprotic lesions (10TT, 10BT, 10BB, 10BL, 4LL, 14R1, and 10R2) and nine skin biopsies from healthy individuals were used as controls (CC) (ages ranged from 06 to 83 years, 48 were male and 29 female). The evaluation identified 1580 differentially expressed mRNAs [Fold Change (FC) ≥ 2.0, p ≤ 0.05] in diseased lesions vs. healthy controls. Some of these genes were observed in all forms of the disease (CD2, CD27, chit1, FA2H, FAM26F, GZMB, MMP9, SLAMF7, UBD) and others were exclusive to reactional forms (Type “1” reaction: GPNMB, IL1B, MICAL2, FOXQ1; Type “2” reaction: AKR1B10, FAM180B, FOXQ1, NNMT, NR1D1, PTX3, TNFRSF25). In literature, these mRNAs have been associated with numerous pathophysiological processes and signaling pathways and are present in a large number of diseases. The role of these mRNAs maybe studied in the context of developing new diagnostic markers and therapeutic targets for leprosy. PMID:26635870
Misiewicz, Zuzanna; Hiekkalinna, Tero; Paunio, Tiina; Varilo, Teppo; Terwilliger, Joseph D.; Partonen, Timo; Hovatta, Iiris
Acrophobia, an abnormal fear of heights, is a specific phobia characterized as apprehension cued by the occurrence or anticipation of elevated spaces. It is considered a complex trait with onset influenced by both genetic and environmental factors. Identification of genetic risk variants would provide novel insight into the genetic basis of the fear of heights phenotype and contribute to the molecular-level understanding of its aetiology. Genetic isolates may facilitate identification of susceptibility alleles due to reduced genetic heterogeneity. We took advantage of an internal genetic isolate in Finland in which a distinct acrophobia phenotype appears to be segregating in pedigrees originally ascertained for schizophrenia. We conducted parametric, nonparametric, joint linkage and linkage disequilibrium analyses using a microsatellite marker panel, genotyped in families to search for chromosomal regions correlated with acrophobia. Our results implicated a few regions with suggestive evidence for linkage on chromosomes 4q28 (LOD = 2.17), 8q24 (LOD = 2.09) and 13q21-q22 (LOD = 2.22). We observed no risk haplotypes shared between different families. These results suggest that genetic predisposition to acrophobia in this genetic isolate is unlikely to be mediated by a small number of shared high-risk alleles, but rather has a complex genetic architecture. PMID:27996024
Park, Eun-Hee; Kim, Myoung-Dong
During pretreatment of lignocellulosic biomass, a variety of fermentation inhibitors, including acetic acid and vanillin, are released. Using DNA microarray analysis, this study explored genes of the budding yeast Saccharomyces cerevisiae that respond to vanillin-induced stress. The expression of 273 genes was upregulated and that of 205 genes was downregulated under vanillin stress. Significantly induced genes included MCH2, SNG1, GPH1, and TMA10, whereas NOP2, UTP18, FUR1, and SPR1 were down regulated. Sequence analysis of the 5'-flanking region of upregulated genes suggested that vanillin might regulate gene expression in a stress response element (STRE)-dependent manner, in addition to a pathway that involved the transcription factor Yap1p. Retardation in the cell growth of mutant strains indicated that MCH2, SNG1, and GPH1 are intimately involved in vanillin stress response. Deletion of the genes whose expression levels were decreased under vanillin stress did not result in a notable change in S. cerevisiae growth under vanillin stress. This study will provide the basis for a better understanding of the stress response of the yeast S. cerevisiae to fermentation inhibitors.
Fusarium graminearum infection contaminates wheat and barley grain with the potent trichothecene mycotoxin deoxynivalenol (DON). Trichothecene mycotoxins are known to target cytosolic ribosomes and can cause cell death by permanently disruption translation. In addition to the inhibition of protein...
Khanna, Amit; Kumar, Jitendra; Vargas, Misha A.; Barrett, LaKisha; Katewa, Subhash; Li, Patrick; McCloskey, Tom; Sharma, Amit; Naudé, Nicole; Nelson, Christopher; Brem, Rachel; Killilea, David W.; Mooney, Sean D.; Gill, Matthew; Kapahi, Pankaj
Molecular pathways involved in dauer formation, an alternate larval stage that allows Caenorhabditis elegans to survive adverse environmental conditions during development, also modulate longevity and metabolism. The decision to proceed with reproductive development or undergo diapause depends on food abundance, population density, and temperature. In recent years, the chemical identities of pheromone signals that modulate dauer entry have been characterized. However, signals derived from bacteria, the major source of nutrients for C. elegans, remain poorly characterized. To systematically identify bacterial components that influence dauer formation and aging in C. elegans, we utilized the individual gene deletion mutants in E. coli (K12). We identified 56 diverse E. coli deletion mutants that enhance dauer formation in an insulin-like receptor mutant (daf-2) background. We describe the mechanism of action of a bacterial mutant cyaA, that is defective in the production of cyclic AMP, which extends lifespan and enhances dauer formation through the modulation of TGF-β (daf-7) signaling in C. elegans. Our results demonstrate the importance of bacterial components in influencing developmental decisions and lifespan in C. elegans. Furthermore, we demonstrate that C. elegans is a useful model to study bacterial-host interactions. PMID:27958277
Americans with Systemic Lupus Erythematosus PRINCIPAL INVESTIGATOR: John Harley, M.D., Ph.D...September 2012 – 31 August 2013 4. TITLE AND SUBTITLE 5a. CONTRACT NUMBER Genome-Wide Association Study in African-Americans with Systemic Lupus ...SUPPLEMENTARY NOTES 14. ABSTRACT Systemic lupus erythematosus ( lupus ) is a potentially deadly systemic autoimmune disease that disproportionately
The sorghum association panel exhibited extensive variation for seedling traits under cold and heat stress. Genome-wide analyses identified thirty single nucleotide polymorphisms (SNPs) that were strongly associated with traits measured at seedling stage under cold stress and tagged genes that act a...
Harlaar, Nicole; Meaburn, Emma L.; Hayiou-Thomas, Marianna E.; Davis, Oliver S. P.; Docherty, Sophia; Hanscombe, Ken B.; Haworth, Claire M. A.; Price, Thomas S.; Trzaskowski, Maciej; Dale, Philip S.; Plomin, Robert
Purpose Researchers have previously shown that individual differences in measures of receptive language ability at age 12 are highly heritable. In the current study, the authors attempted to identify some of the genes responsible for the heritability of receptive language ability using a genome-wide association approach. Method The authors administered 4 Internet-based measures of receptive language (vocabulary, semantics, syntax, and pragmatics) to a sample of 2,329 twelve-year-olds for whom DNA and genome-wide genotyping were available. Nearly 700,000 single-nucleotide polymorphisms (SNPs) and 1 million imputed SNPs were included in a genome-wide association analysis of receptive language composite scores. Results No SNP associations met the demanding criterion of genome-wide significance that corrects for multiple testing across the genome (p < 5 × 10–8). The strongest SNP association did not replicate in an additional sample of 2,639 twelve-year-olds. Conclusions These results indicate that individual differences in receptive language ability in the general population do not reflect common genetic variants that account for more than 3% of the phenotypic variance. The search for genetic variants associated with language skill will require larger samples and additional methods to identify and functionally characterize the full spectrum of risk variants. PMID:24687471
U.S. maize yield has increased eightfold in the past 80 years with half of the improvement attributed to genetics. Changes in maize leaf angle and size provided a basis for more efficient light capture as plant densities increased. Through a genome wide association study (GWAS) of the maize nested a...
Genotype-phenotype associations within the soybean (Glycine max) germplasm collection could provide valuable information on the frequency and distribution of alleles affecting economically important traits. Here we performed a genome-wide association study (GWAS) for seed protein and oil content in ...
Mixed linear model (MLM) methods have proven useful in controlling for population structure and relatedness within genome-wide association studies. However, MLM-based methods can be computationally challenging for large datasets. We report a compression approach, called ‘compressed MLM,’ that decrea...
Reproductive efficiency has a great impact on the economic success of pork production. Number born alive (NBA) and average piglet birth weight (ABW) contribute greatly to reproductive efficiency. To better understand the underlying genetics of birth traits, a genome wide association study (GWAS) w...
Low plasma B-vitamin levels and elevated homocysteine have been associated with cancer, cardiovascular disease, and neurodegenerative disorders. Common variants in FUT2 on chromosome 19q13 were associated with plasma vitamin B12 levels among women in a genome-wide association study (GWAS) in the Nur...
Mixed models improve the ability to detect phenotype-genotype associations in the presence of population stratification and multiple levels of relatedness in genome-wide association studies (GWAS), but for large data sets the resource consumption becomes impractical. At the same time, the sample siz...
Pork quality plays an important role in the meat processing industry, thus different methodologies have been implemented to elucidate the genetic architecture of traits affecting meat quality. One of the most common and widely used approaches is to perform genome-wide association (GWA) studies. Howe...
P1 encodes an R2R3-MYB transcription factor responsible for the accumulation of insecticidal flavones in maize silks and red phlobaphene pigments in pericarps and other floral tissues. Using genome-wide expression analyses (RNA-Seq) in pericarps and silks of plants with contrasting P1 alleles combin...
Mei, Hao; Li, Lianna; Jiang, Fan; Simino, Jeannette; Griswold, Michael; Mosley, Thomas; Liu, Shijian
Genome-wide studies (GWS) of SNP associations and differential gene expressions have generated abundant results; next-generation sequencing technology has further boosted the number of variants and genes identified. Effective interpretation requires massive annotation and downstream analysis of these genome-wide results, a computationally challenging task. We developed the snpGeneSets package to simplify annotation and analysis of GWS results. Our package integrates local copies of knowledge bases for SNPs, genes, and gene sets, and implements wrapper functions in the R language to enable transparent access to low-level databases for efficient annotation of large genomic data. The package contains functions that execute three types of annotations: (1) genomic mapping annotation for SNPs and genes and functional annotation for gene sets; (2) bidirectional mapping between SNPs and genes, and genes and gene sets; and (3) calculation of gene effect measures from SNP associations and performance of gene set enrichment analyses to identify functional pathways. We applied snpGeneSets to type 2 diabetes (T2D) results from the NHGRI genome-wide association study (GWAS) catalog, a Finnish GWAS, and a genome-wide expression study (GWES). These studies demonstrate the usefulness of snpGeneSets for annotating and performing enrichment analysis of GWS results. The package is open-source, free, and can be downloaded at: https://www.umc.edu/biostats_software/. PMID:27807048
Mei, Hao; Li, Lianna; Jiang, Fan; Simino, Jeannette; Griswold, Michael; Mosley, Thomas; Liu, Shijian
Genome-wide studies (GWS) of SNP associations and differential gene expressions have generated abundant results; next-generation sequencing technology has further boosted the number of variants and genes identified. Effective interpretation requires massive annotation and downstream analysis of these genome-wide results, a computationally challenging task. We developed the snpGeneSets package to simplify annotation and analysis of GWS results. Our package integrates local copies of knowledge bases for SNPs, genes, and gene sets, and implements wrapper functions in the R language to enable transparent access to low-level databases for efficient annotation of large genomic data. The package contains functions that execute three types of annotations: (1) genomic mapping annotation for SNPs and genes and functional annotation for gene sets; (2) bidirectional mapping between SNPs and genes, and genes and gene sets; and (3) calculation of gene effect measures from SNP associations and performance of gene set enrichment analyses to identify functional pathways. We applied snpGeneSets to type 2 diabetes (T2D) results from the NHGRI genome-wide association study (GWAS) catalog, a Finnish GWAS, and a genome-wide expression study (GWES). These studies demonstrate the usefulness of snpGeneSets for annotating and performing enrichment analysis of GWS results. The package is open-source, free, and can be downloaded at: https://www.umc.edu/biostats_software/.
Genome-wide association mapping has recently emerged as a valuable approach to refine genetic basis of polygenic resistance to plant diseases, which are increasingly used in integrated strategies for durable crop protection. Aphanomyces euteiches is a soil borne pathogen of pea and other legumes wor...
The small East African Shorthorn Zebu is the main indigenous cattle across East Africa. A recent genome wide SNPs analysis has revealed their ancient stable African taurine x Asian zebu admixture. Here, we assess the presence of candidate signature of positive selection in their genome, with the aim...
Wheat kernel weight is an important and heritable component of wheat grain yield and a key predictor of flour extraction. Genome-wide association analysis was conducted to identify genomic regions associated with kernel weight and kernel weight environmental response in 8 trials of 299 hard winter ...
Loo, Sandra K.; Shtir, Corina; Doyle, Alysa E.; Mick, Eric; McGough, James J.; McCracken, James; Biederman, Joseph; Smalley, Susan L.; Cantor, Rita M.; Faraone, Stephen V.; Nelson, Stanley F.
Objective: The purpose of the present study was to identify common genetic variants that are associated with human intelligence or general cognitive ability. Method: We performed a genome-wide association analysis with a dense set of 1 million single-nucleotide polymorphisms (SNPs) and quantitative intelligence scores within an ancestrally…
Harlaar, Nicole; Meaburn, Emma L.; Hayiou-Thomas, Marianna E.; Davis, Oliver S. P.; Docherty, Sophia; Hanscombe, Ken B.; Haworth, Claire M. A.; Price, Thomas S.; Trzaskowski, Maciej; Dale, Philip S.; Plomin, Robert
Purpose: Researchers have previously shown that individual differences in measures of receptive language ability at age 12 are highly heritable. In the current study, the authors attempted to identify some of the genes responsible for the heritability of receptive language ability using a "genome-wide association" approach. Method: The…
Pork quality has a large impact on consumer preference and perception of eating quality. A genome-wide association was performed for pork quality traits [intramuscular fat (IMF)], slice shear force (SSF), color attributes, purge, cooking loss, and pH] from 531 to 1,237 records on barrows and gilts o...
Pork quality is a critical concern in the meat industry. Implementation of genome-wide association studies (GWA) allows identification of genomic regions that explain a substantial portion of the variation of relevant traits. It is also important to determine the consistency of results of GWA across...
A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. To increase sample size, results from different GWA can be combined in a meta-analys...
A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. For increasing N, results from different GWA can be combined in a meta-analysis (MA-...
Given the importance of pork quality in the meat processing industry, genome-wide association studies were performed for eight meat quality traits and also, a meta-analysis (MA) of GWA was implemented combining independent results from pig populations. Data from three pig datasets (USMARC, Commercia...
Next-generation sequencing and related technologies have facilitated the creation of enormous public databases that catalogue genomic variation. These databases have facilitated a variety of approaches to discover new genes that regulate normal biology as well as disease. Genome wide association (...
We demonstrated a flexible Genome-Wide Association (GWA) Study (GWAS) platform built upon the iPlant Collaborative Cyber-infrastructure. The platform supports big data management, sharing, and large scale study of both genotype and phenotype data on clusters. End users can add their own analysis too...
Genome-wide association mapping using dense marker sets has identified some nucleotide variants affecting complex traits which have been validated with fine-mapping and functional analysis. Many sequence variants associated with complex traits in maize have small effects and low repeatability, howev...
A genome-wide SNP resource was developed for rice using the GoldenGate assay and used to genotype 400 landrace accessions of O. sativa. SNPs were originally discovered using Perlegen re-sequencing technology in 20 diverse landraces of O. sativa as part of OryzaSNP project (http://irfgc.irri.org). An...
Song, Chi; Chen, Gary K.; Millikan, Robert C.; Ambrosone, Christine B.; John, Esther M.; Bernstein, Leslie; Zheng, Wei; Hu, Jennifer J.; Ziegler, Regina G.; Nyante, Sarah; Bandera, Elisa V.; Ingles, Sue A.; Press, Michael F.; Deming, Sandra L.; Rodriguez-Gil, Jorge L.; Chanock, Stephen J.; Wan, Peggy; Sheng, Xin; Pooler, Loreall C.; Van Den Berg, David J.; Le Marchand, Loic; Kolonel, Laurence N.; Henderson, Brian E.; Haiman, Chris A.; Stram, Daniel O.
Genome-wide association studies (GWAS) simultaneously investigating hundreds of thousands of single nucleotide polymorphisms (SNP) have become a powerful tool in the investigation of new disease susceptibility loci. Haplotypes are sometimes thought to be superior to SNPs and are promising in genetic association analyses. The application of genome-wide haplotype analysis, however, is hindered by the complexity of haplotypes themselves and sophistication in computation. We systematically analyzed the haplotype effects for breast cancer risk among 5,761 African American women (3,016 cases and 2,745 controls) using a sliding window approach on the genome-wide scale. Three regions on chromosomes 1, 4 and 18 exhibited moderate haplotype effects. Furthermore, among 21 breast cancer susceptibility loci previously established in European populations, 10p15 and 14q24 are likely to harbor novel haplotype effects. We also proposed a heuristic of determining the significance level and the effective number of independent tests by the permutation analysis on chromosome 22 data. It suggests that the effective number was approximately half of the total (7,794 out of 15,645), thus the half number could serve as a quick reference to evaluating genome-wide significance if a similar sliding window approach of haplotype analysis is adopted in similar populations using similar genotype density. PMID:23468962
A genome-wide association study (GWAS) was conducted to explore the genetic basis of variation for drought tolerance and related traits in a Middle American diversity panel comprised of 96 common bean (Phaseolus vulgaris) genotypes. The panel grown under irrigated and rainfed conditions and single n...
A genome-wide association study (GWAS) using a global Andean diversity panel (ADP) of 237 genotypes of common bean, Phaseolus vulgaris was conducted to gain insight into the genetic architecture of several agronomic traits controlling phenology, biomass, yield components and seed yield. The panel wa...
Background: Apart from single nucleotide polymorphism (SNP), copy number variation (CNV) is another important type of genetic variation, which may affect growth traits and play key roles for the production of beef cattle. To date, no genome-wide association study (GWAS) for CNV and body traits in be...
Pork quality is important both to the meat processing industry and consumers’ purchasing attitudes. Copy number variation (CNV) is a burgeoning kind of variant that may influence meat quality. Herein, a genome-wide association study (GWAS) was performed between CNVs and meat quality traits in swine....
"Omic" tools are rapidly being employed to delineate the biological framework controlling phenotypes of interest in crop species. An advanced understanding of the genetic basis for quantitative trait variation has been made possible through genome wide association studies (GWAS) that make use of gen...
A genome-wide association study (GWAS) was conducted to explore the genetic basis of variation for symbiotic nitrogen fixation (SNF) and related traits in the Andean diversity panel (ADP) comprised of 259 common bean (Phaseolus vulgaris) genotypes. The ADP was evaluated for SNF and related traits in...
Boycott, Kym; Hartley, Taila; Adam, Shelin; Bernier, Francois; Chong, Karen; Fernandez, Bridget A; Friedman, Jan M; Geraghty, Michael T; Hume, Stacey; Knoppers, Bartha M; Laberge, Anne-Marie; Majewski, Jacek; Mendoza-Londono, Roberto; Meyn, M Stephen; Michaud, Jacques L; Nelson, Tanya N; Richer, Julie; Sadikovic, Bekim; Skidmore, David L; Stockley, Tracy; Taylor, Sherry; van Karnebeek, Clara; Zawati, Ma'n H; Lauzon, Julie; Armour, Christine M
Purpose and scope The aim of this Position Statement is to provide recommendations for Canadian medical geneticists, clinical laboratory geneticists, genetic counsellors and other physicians regarding the use of genome-wide sequencing of germline DNA in the context of clinical genetic diagnosis. This statement has been developed to facilitate the clinical translation and development of best practices for clinical genome-wide sequencing for genetic diagnosis of monogenic diseases in Canada; it does not address the clinical application of this technology in other fields such as molecular investigation of cancer or for population screening of healthy individuals. Methods of statement development Two multidisciplinary groups consisting of medical geneticists, clinical laboratory geneticists, genetic counsellors, ethicists, lawyers and genetic researchers were assembled to review existing literature and guidelines on genome-wide sequencing for clinical genetic diagnosis in the context of monogenic diseases, and to make recommendations relevant to the Canadian context. The statement was circulated for comment to the Canadian College of Medical Geneticists (CCMG) membership-at-large and, following incorporation of feedback, approved by the CCMG Board of Directors. The CCMG is a Canadian organisation responsible for certifying medical geneticists and clinical laboratory geneticists, and for establishing professional and ethical standards for clinical genetics services in Canada. Results and conclusions Recommendations include (1) clinical genome-wide sequencing is an appropriate approach in the diagnostic assessment of a patient for whom there is suspicion of a significant monogenic disease that is associated with a high degree of genetic heterogeneity, or where specific genetic tests have failed to provide a diagnosis; (2) until the benefits of reporting incidental findings are established, we do not endorse the intentional clinical analysis of disease-associated genes
Winham, Stacey J; Biernacka, Joanna M.
Background Complex psychiatric traits have long been thought to be the result of a combination of genetic and environmental factors, and gene-environment interactions are thought to play a crucial role in behavioral phenotypes and the susceptibility and progression of psychiatric disorders. Candidate gene studies to investigate hypothesized gene-environment interactions are now fairly common in human genetic research, and with the shift towards genome-wide association studies, genome-wide gene-environment interaction studies are beginning to emerge. Methods We summarize the basic ideas behind gene-environment interaction, and provide an overview of possible study designs and traditional analysis methods in the context of genome-wide analysis. We then discuss novel approaches beyond the traditional strategy of analyzing the interaction between the environmental factor and each polymorphism individually. Results Two-step filtering approaches that reduce the number of polymorphisms tested for interactions can substantially increase the power of genome-wide gene-environment studies. New analytical methods including data-mining approaches, and gene-level and pathway-level analyses, also have the capacity to improve our understanding of how complex genetic and environmental factors interact to influence psychological and psychiatric traits. Such methods, however, have not yet been utilized much in behavioral and mental health research. Conclusions Although methods to investigate gene-environment interactions are available, there is a need for further development and extension of these methods to identify gene-environment interactions in the context of genome-wide association studies. These novel approaches need to be applied in studies of psychology and psychiatry. PMID:23808649
Pappa, Irene; St Pourcain, Beate; Benke, Kelly; Cavadino, Alana; Hakulinen, Christian; Nivard, Michel G; Nolte, Ilja M; Tiesler, Carla M T; Bakermans-Kranenburg, Marian J; Davies, Gareth E; Evans, David M; Geoffroy, Marie-Claude; Grallert, Harald; Groen-Blokhuis, Maria M; Hudziak, James J; Kemp, John P; Keltikangas-Järvinen, Liisa; McMahon, George; Mileva-Seitz, Viara R; Motazedi, Ehsan; Power, Christine; Raitakari, Olli T; Ring, Susan M; Rivadeneira, Fernando; Rodriguez, Alina; Scheet, Paul A; Seppälä, Ilkka; Snieder, Harold; Standl, Marie; Thiering, Elisabeth; Timpson, Nicholas J; Veenstra, René; Velders, Fleur P; Whitehouse, Andrew J O; Smith, George Davey; Heinrich, Joachim; Hypponen, Elina; Lehtimäki, Terho; Middeldorp, Christel M; Oldehinkel, Albertine J; Pennell, Craig E; Boomsma, Dorret I; Tiemeier, Henning
Individual differences in aggressive behavior emerge in early childhood and predict persisting behavioral problems and disorders. Studies of antisocial and severe aggression in adulthood indicate substantial underlying biology. However, little attention has been given to genome-wide approaches of aggressive behavior in children. We analyzed data from nine population-based studies and assessed aggressive behavior using well-validated parent-reported questionnaires. This is the largest sample exploring children's aggressive behavior to date (N = 18,988), with measures in two developmental stages (N = 15,668 early childhood and N = 16,311 middle childhood/early adolescence). First, we estimated the additive genetic variance of children's aggressive behavior based on genome-wide SNP information, using genome-wide complex trait analysis (GCTA). Second, genetic associations within each study were assessed using a quasi-Poisson regression approach, capturing the highly right-skewed distribution of aggressive behavior. Third, we performed meta-analyses of genome-wide associations for both the total age-mixed sample and the two developmental stages. Finally, we performed a gene-based test using the summary statistics of the total sample. GCTA quantified variance tagged by common SNPs (10-54%). The meta-analysis of the total sample identified one region in chromosome 2 (2p12) at near genome-wide significance (top SNP rs11126630, P = 5.30 × 10(-8) ). The separate meta-analyses of the two developmental stages revealed suggestive evidence of association at the same locus. The gene-based analysis indicated association of variation within AVPR1A with aggressive behavior. We conclude that common variants at 2p12 show suggestive evidence for association with childhood aggression. Replication of these initial findings is needed, and further studies should clarify its biological meaning. © 2015 Wiley Periodicals, Inc.
Zhang, Cathy R.; Adib-Samii, Poneh; Devan, William J.; Parsons, Owen E.; Lanfranconi, Silvia; Gregory, Sarah; Cloonan, Lisa; Falcone, Guido J.; Radmanesh, Farid; Fitzpatrick, Kaitlin; Kanakis, Allison; Barrick, Thomas R.; Moynihan, Barry; Lewis, Cathryn M.; Boncoraglio, Giorgio B.; Lemmens, Robin; Thijs, Vincent; Sudlow, Cathie; Wardlaw, Joanna; Rothwell, Peter M.; Meschia, James F.; Worrall, Bradford B.; Levi, Christopher; Bevan, Steve; Furie, Karen L.; Dichgans, Martin; Rosand, Jonathan; Markus, Hugh S.; Rost, Natalia
Objective: For 3,670 stroke patients from the United Kingdom, United States, Australia, Belgium, and Italy, we performed a genome-wide meta-analysis of white matter hyperintensity volumes (WMHV) on data imputed to the 1000 Genomes reference dataset to provide insights into disease mechanisms. Methods: We first sought to identify genetic associations with white matter hyperintensities in a stroke population, and then examined whether genetic loci previously linked to WMHV in community populations are also associated in stroke patients. Having established that genetic associations are shared between the 2 populations, we performed a meta-analysis testing which associations with WMHV in stroke-free populations are associated overall when combined with stroke populations. Results: There were no associations at genome-wide significance with WMHV in stroke patients. All previously reported genome-wide significant associations with WMHV in community populations shared direction of effect in stroke patients. In a meta-analysis of the genome-wide significant and suggestive loci (p < 5 × 10−6) from community populations (15 single nucleotide polymorphisms in total) and from stroke patients, 6 independent loci were associated with WMHV in both populations. Four of these are novel associations at the genome-wide level (rs72934505 [NBEAL1], p = 2.2 × 10−8; rs941898 [EVL], p = 4.0 × 10−8; rs962888 [C1QL1], p = 1.1 × 10−8; rs9515201 [COL4A2], p = 6.9 × 10−9). Conclusions: Genetic associations with WMHV are shared in otherwise healthy individuals and patients with stroke, indicating common genetic susceptibility in cerebral small vessel disease. PMID:26674333
Elbein, S C; Hoffman, M D; Teng, K; Leppert, M F; Hasstedt, S J
Considerable evidence supports a major inherited component of type 2 diabetes. We initially conducted a genome-wide scan with 440 microsatellite markers at 10-cM intervals in 19 multigenerational families of Northern European ancestry with at least two diabetic siblings. Initial two-point analyses of these families directed marker typing of 23 additional families. Subsequently, all available marker data on the total of 42 families were analyzed using both parametric and nonparametric multipoint methods to test for linkage to type 2 diabetes. One locus on chromosome 1q21-1q23 met genome-wide criteria for significant linkage under a model of recessive inheritance with a common diabetes allele (logarithm of odds [LOD] = 4.295). Both pedigree-based nonparametric linkage (NPL) analysis and affected sib pair (MAPMAKER/SIBS) nonparametric methods also showed the highest genome-wide scores at this region, near markers CRP and APOA2, but failed to meet levels of genome-wide significance. The risk of type 2 diabetes to siblings of a diabetic person when compared with the population (lambdaS) was estimated from MAPMAKER/SIBS to be 2.8 in these 42 families. Simulation studies using study data confirmed a genome-wide significance level of P<0.05 (95% CI 0.005-0.0466). However, analysis of 20 similarly ascertained but smaller families failed to confirm this linkage. The LOD score with 50% heterogeneity for all 62 families considered together was only 2.25, with an estimated lambdaS of 1.87. Our data suggest a novel diabetes susceptibility locus near APOA2 on chromosome 1 in a region with many transcribed genes.
Cai, Guohong; Leadbetter, Clayton W; Muehlbauer, Megan F; Molnar, Thomas J; Hillman, Bradley I
High-throughput sequencing has been dramatically accelerating the discovery of microsatellite markers (also known as Simple Sequence Repeats). Both 454 and Illumina reads have been used directly in microsatellite discovery and primer design (the "Seq-to-SSR" approach). However, constraints of this approach include: 1) many microsatellite-containing reads do not have sufficient flanking sequences to allow primer design, and 2) difficulties in removing microsatellite loci residing in longer, repetitive regions. In the current study, we applied the novel "Seq-Assembly-SSR" approach to overcome these constraints in Anisogramma anomala. In our approach, Illumina reads were first assembled into a draft genome, and the latter was then used in microsatellite discovery. A. anomala is an obligate biotrophic ascomycete that causes eastern filbert blight disease of commercial European hazelnut. Little is known about its population structure or diversity. Approximately 26 M 146 bp Illumina reads were generated from a paired-end library of a fungal strain from Oregon. The reads were assembled into a draft genome of 333 Mb (excluding gaps), with contig N50 of 10,384 bp and scaffold N50 of 32,987 bp. A bioinformatics pipeline identified 46,677 microsatellite motifs at 44,247 loci, including 2,430 compound loci. Primers were successfully designed for 42,923 loci (97%). After removing 2,886 loci close to assembly gaps and 676 loci in repetitive regions, a genome-wide microsatellite database of 39,361 loci was generated for the fungus. In experimental screening of 236 loci using four geographically representative strains, 228 (96.6%) were successfully amplified and 214 (90.7%) produced single PCR products. Twenty-three (9.7%) were found to be perfect polymorphic loci. A small-scale population study using 11 polymorphic loci revealed considerable gene diversity. Clustering analysis grouped isolates of this fungus into two clades in accordance with their geographic origins. Thus, the
Yuan, Jing; Tao, Wenjing; Cheng, Yunying; Huang, Baofeng; Wang, Deshou
The fox genes play important roles in various biological processes, including sexual development. In the present study, we isolated 65 fox genes, belonging to 18 subfamilies named A-R, from Nile tilapia through genome-wide screening. Twenty-four of them have two or three (foxm1) copies. Furthermore, 16, 25, 68, and 45 fox members were isolated from nematodes, protochordates, teleosts, and tetrapods, respectively. Phylogenetic analyses indicated fox gene family had undergone three expansions parallel to the three rounds of genome duplication during evolution. We also analyzed the clustered fox genes and found that apparent linkage duplication existed in teleosts, which further supported fish-specific genome duplication hypothesis. In addition, species- and lineage-specific duplication is another reason for fox gene family expansion. Based on the four pairs of XX and XY gonadal transcriptome data from four critical developmental stages, we analyzed the expression profile of all fox genes and identified sexually dimorphic fox genes at each stage. All fox genes were detected in gonads, with 15 of them at the background expression level (total read per kb per million reads, RPKM < 10), 29 at moderate expression level (10 < total RPKM < 100), and 21 at high expression level (total RPKM > 100). There are 27, 24, 28, and 9 sexually dimorphic fox genes at 5, 30, 90, and 180 days after hatching (dah), respectively. foxq1a, foxf1, foxr1, and foxr1 were identified as the most differentially expressed genes at each stage. foxl2 was characterized as XX-dominant gene, while foxd5, foxi3, foxn3, foxj1a, foxj3b, and foxo6b were characterized as XY-dominant genes. qPCR and in situ hybridization of foxh1 and foxj1a were performed to confirm the expression profiles and to validate the transcriptome data. Our results suggest that fox genes might play important roles in sex determination and gonadal development in teleosts.
Demirkan, Ayşe; Henneman, Peter; Verhoeven, Aswin; Dharuri, Harish; Amin, Najaf; van Klinken, Jan Bert; Karssen, Lennart C; de Vries, Boukje; Meissner, Axel; Göraler, Sibel; van den Maagdenberg, Arn M J M; Deelder, André M; C 't Hoen, Peter A; van Duijn, Cornelia M; van Dijk, Ko Willems
Metabolite quantitative traits carry great promise for epidemiological studies, and their genetic background has been addressed using Genome-Wide Association Studies (GWAS). Thus far, the role of less common variants has not been exhaustively studied. Here, we set out a GWAS for metabolite quantitative traits in serum, followed by exome sequence analysis to zoom in on putative causal variants in the associated genes. 1H Nuclear Magnetic Resonance (1H-NMR) spectroscopy experiments yielded successful quantification of 42 unique metabolites in 2,482 individuals from The Erasmus Rucphen Family (ERF) study. Heritability of metabolites were estimated by SOLAR. GWAS was performed by linear mixed models, using HapMap imputations. Based on physical vicinity and pathway analyses, candidate genes were screened for coding region variation using exome sequence data. Heritability estimates for metabolites ranged between 10% and 52%. GWAS replicated three known loci in the metabolome wide significance: CPS1 with glycine (P-value = 1.27×10-32), PRODH with proline (P-value = 1.11×10-19), SLC16A9 with carnitine level (P-value = 4.81×10-14) and uncovered a novel association between DMGDH and dimethyl-glycine (P-value = 1.65×10-19) level. In addition, we found three novel, suggestively significant loci: TNP1 with pyruvate (P-value = 1.26×10-8), KCNJ16 with 3-hydroxybutyrate (P-value = 1.65×10-8) and 2p12 locus with valine (P-value = 3.49×10-8). Exome sequence analysis identified potentially causal coding and regulatory variants located in the genes CPS1, KCNJ2 and PRODH, and revealed allelic heterogeneity for CPS1 and PRODH. Combined GWAS and exome analyses of metabolites detected by high-resolution 1H-NMR is a robust approach to uncover metabolite quantitative trait loci (mQTL), and the likely causative variants in these loci. It is anticipated that insight in the genetics of intermediate phenotypes will provide additional insight into the
Verhoeven, Aswin; Dharuri, Harish; Amin, Najaf; van Klinken, Jan Bert; Karssen, Lennart C.; de Vries, Boukje; Meissner, Axel; Göraler, Sibel; van den Maagdenberg, Arn M. J. M.; Deelder, André M.; C ’t Hoen, Peter A.; van Duijn, Cornelia M.; van Dijk, Ko Willems
Metabolite quantitative traits carry great promise for epidemiological studies, and their genetic background has been addressed using Genome-Wide Association Studies (GWAS). Thus far, the role of less common variants has not been exhaustively studied. Here, we set out a GWAS for metabolite quantitative traits in serum, followed by exome sequence analysis to zoom in on putative causal variants in the associated genes. 1H Nuclear Magnetic Resonance (1H-NMR) spectroscopy experiments yielded successful quantification of 42 unique metabolites in 2,482 individuals from The Erasmus Rucphen Family (ERF) study. Heritability of metabolites were estimated by SOLAR. GWAS was performed by linear mixed models, using HapMap imputations. Based on physical vicinity and pathway analyses, candidate genes were screened for coding region variation using exome sequence data. Heritability estimates for metabolites ranged between 10% and 52%. GWAS replicated three known loci in the metabolome wide significance: CPS1 with glycine (P-value = 1.27×10−32), PRODH with proline (P-value = 1.11×10−19), SLC16A9 with carnitine level (P-value = 4.81×10−14) and uncovered a novel association between DMGDH and dimethyl-glycine (P-value = 1.65×10−19) level. In addition, we found three novel, suggestively significant loci: TNP1 with pyruvate (P-value = 1.26×10−8), KCNJ16 with 3-hydroxybutyrate (P-value = 1.65×10−8) and 2p12 locus with valine (P-value = 3.49×10−8). Exome sequence analysis identified potentially causal coding and regulatory variants located in the genes CPS1, KCNJ2 and PRODH, and revealed allelic heterogeneity for CPS1 and PRODH. Combined GWAS and exome analyses of metabolites detected by high-resolution 1H-NMR is a robust approach to uncover metabolite quantitative trait loci (mQTL), and the likely causative variants in these loci. It is anticipated that insight in the genetics of intermediate phenotypes will provide additional
Perseguini, Juliana Morini Küpper Cardoso; Oblessuc, Paula Rodrigues; Rosa, João Ricardo Bachega Feijó; Gomes, Kleber Alves; Chiorato, Alisson Fernando; Carbonell, Sérgio Augusto Morais; Garcia, Antonio Augusto Franco; Vianello, Rosana Pereira; Benchimol-Reis, Luciana Lasry
The common bean (Phaseolus vulgaris L.) is the world’s most important legume for human consumption. Anthracnose (ANT; Colletotrichum lindemuthianum) and angular leaf spot (ALS; Pseudocercospora griseola) are complex diseases that cause major yield losses in common bean. Depending on the cultivar and environmental conditions, anthracnose and angular leaf spot infections can reduce crop yield drastically. This study aimed to estimate linkage disequilibrium levels and identify quantitative resistance loci (QRL) controlling resistance to both ANT and ALS diseases of 180 accessions of common bean using genome-wide association analysis. A randomized complete block design with four replicates was performed for the ANT and ALS experiments, with four plants per genotype in each replicate. Association mapping analyses were performed for ANT and ALS using a mixed linear model approach implemented in TASSEL. A total of 17 and 11 significant statistically associations involving SSRs were detected for ANT and ALS resistance loci, respectively. Using SNPs, 21 and 17 significant statistically associations were obtained for ANT and angular ALS, respectively, providing more associations with this marker. The SSR-IAC167 and PvM95 markers, both located on chromosome Pv03, and the SNP scaffold00021_89379, were associated with both diseases. The other markers were distributed across the entire common bean genome, with chromosomes Pv03 and Pv08 showing the greatest number of loci associated with ANT resistance. The chromosome Pv04 was the most saturated one, with six markers associated with ALS resistance. The telomeric region of this chromosome showed four markers located between approximately 2.5 Mb and 4.4 Mb. Our results demonstrate the great potential of genome-wide association studies to identify QRLs related to ANT and ALS in common bean. The results indicate a quantitative and complex inheritance pattern for both diseases in common bean. Our findings will contribute to more
Perseguini, Juliana Morini Küpper Cardoso; Oblessuc, Paula Rodrigues; Rosa, João Ricardo Bachega Feijó; Gomes, Kleber Alves; Chiorato, Alisson Fernando; Carbonell, Sérgio Augusto Morais; Garcia, Antonio Augusto Franco; Vianello, Rosana Pereira; Benchimol-Reis, Luciana Lasry
The common bean (Phaseolus vulgaris L.) is the world's most important legume for human consumption. Anthracnose (ANT; Colletotrichum lindemuthianum) and angular leaf spot (ALS; Pseudocercospora griseola) are complex diseases that cause major yield losses in common bean. Depending on the cultivar and environmental conditions, anthracnose and angular leaf spot infections can reduce crop yield drastically. This study aimed to estimate linkage disequilibrium levels and identify quantitative resistance loci (QRL) controlling resistance to both ANT and ALS diseases of 180 accessions of common bean using genome-wide association analysis. A randomized complete block design with four replicates was performed for the ANT and ALS experiments, with four plants per genotype in each replicate. Association mapping analyses were performed for ANT and ALS using a mixed linear model approach implemented in TASSEL. A total of 17 and 11 significant statistically associations involving SSRs were detected for ANT and ALS resistance loci, respectively. Using SNPs, 21 and 17 significant statistically associations were obtained for ANT and angular ALS, respectively, providing more associations with this marker. The SSR-IAC167 and PvM95 markers, both located on chromosome Pv03, and the SNP scaffold00021_89379, were associated with both diseases. The other markers were distributed across the entire common bean genome, with chromosomes Pv03 and Pv08 showing the greatest number of loci associated with ANT resistance. The chromosome Pv04 was the most saturated one, with six markers associated with ALS resistance. The telomeric region of this chromosome showed four markers located between approximately 2.5 Mb and 4.4 Mb. Our results demonstrate the great potential of genome-wide association studies to identify QRLs related to ANT and ALS in common bean. The results indicate a quantitative and complex inheritance pattern for both diseases in common bean. Our findings will contribute to more
Haas, Stefan; Janetzki, Catrin; González Navarrete, Irene; Mundlos, Stefan
Homeotic genes code for key transcription factors (HOX-TFs) that pattern the animal body plan. During embryonic development, Hox genes are expressed in overlapping patterns and function in a partially redundant manner. In vitro biochemical screens probing the HOX-TF sequence specificity revealed largely overlapping sequence preferences, indicating that co-factors might modulate the biological function of HOX-TFs. However, due to their overlapping expression pattern, high protein homology, and insufficiently specific antibodies, little is known about their genome-wide binding preferences. In order to overcome this problem, we virally expressed tagged versions of limb-expressed posterior HOX genes (HOXA9-13, and HOXD9-13) in primary chicken mesenchymal limb progenitor cells (micromass). We determined the effect of each HOX-TF on cellular differentiation (chondrogenesis) and gene expression and found that groups of HOX-TFs induce distinct regulatory programs. We used ChIP-seq to determine their individual genome-wide binding profiles and identified between 12,721 and 28,572 binding sites for each of the nine HOX-TFs. Principal Component Analysis (PCA) of binding profiles revealed that the HOX-TFs are clustered in two subgroups (Group 1: HOXA/D9, HOXA/D10, HOXD12, and HOXA13 and Group 2: HOXA/D11 and HOXD13), which are characterized by differences in their sequence specificity and by the presence of cofactor motifs. Specifically, we identified CTCF binding sites in Group 1, indicating that this subgroup of HOX-proteins cooperates with CTCF. We confirmed this interaction by an independent biological assay (Proximity Ligation Assay) and demonstrated that CTCF is a novel HOX cofactor that specifically associates with Group 1 HOX-TFs, pointing towards a possible interplay between HOX-TFs and chromatin architecture. PMID:28103242
Xu, Yanshuai; Gao, Zhihong; Tao, Jianmin; Jiang, Weihua; Zhang, Shijie; Wang, Qiunan; Qu, Shenchun
Early ripening in grape (Vitis vinifera L.) is a crucial agronomic trait. The fruits of the grape line 'Summer Black' (SBBM), which contains a bud mutation, can be harvested approximately one week earlier than the 'Summer Black' (SBC)control. To investigate the molecular mechanism of the bud mutation related to early ripening, we detected genome-wide genetic variations based on re-sequencing. In total, 3,692,777 single nucleotide polymorphisms (SNPs) and 81,223 structure variations (SVs) in the SBC genome and 3,823,464 SNPs and 85,801 SVs in the SBBM genome were detected compared with the reference grape sequence. Of these, 635 SBC-specific genes and 665 SBBM-specific genes were screened. Ripening and colour-associated unigenes with non-synonymous mutations (NS), SVs or frame-shift mutations (F) were analysed. The results showed that 90 unigenes in SBC, 76 unigenes in SBBM and 13 genes that mapped to large fragment indels were filtered. The expression patterns of eight genes were confirmed using quantitative reverse transcription-polymerase chain reaction (qRT-PCR).The re-sequencing data showed that 635 SBC-specific genes and 665 SBBM-specific genes associated with early ripening were screened. Among these, NCED6 expression appears to be related to NCED1 and is involved in ABA biosynthesis in grape, which might play a role in the onset of anthocyanin accumulation. The SEP and ERF genes probably play roles in ethylene response.
Reed, Eric; Nunez, Sara; Kulp, David; Qian, Jing; Reilly, Muredach P; Foulkes, Andrea S
This tutorial is a learning resource that outlines the basic process and provides specific software tools for implementing a complete genome-wide association analysis. Approaches to post-analytic visualization and interrogation of potentially novel findings are also presented. Applications are illustrated using the free and open-source R statistical computing and graphics software environment, Bioconductor software for bioinformatics and the UCSC Genome Browser. Complete genome-wide association data on 1401 individuals across 861,473 typed single nucleotide polymorphisms from the PennCATH study of coronary artery disease are used for illustration. All data and code, as well as additional instructional resources, are publicly available through the Open Resources in Statistical Genomics project: http://www.stat-gen.org.
Winkler, Thomas W; Day, Felix R; Croteau-Chonka, Damien C; Wood, Andrew R; Locke, Adam E; Mägi, Reedik; Ferreira, Teresa; Fall, Tove; Graff, Mariaelisa; Justice, Anne E; Luan, Jian'an; Gustafsson, Stefan; Randall, Joshua C; Vedantam, Sailaja; Workalemahu, Tsegaselassie; Kilpeläinen, Tuomas O; Scherag, André; Esko, Tonu; Kutalik, Zoltán; Heid, Iris M; Loos, Ruth J F
Rigorous organization and quality control (QC) are necessary to facilitate successful genome-wide association meta-analyses (GWAMAs) of statistics aggregated across multiple genome-wide association studies. This protocol provides guidelines for (i) organizational aspects of GWAMAs, and for (ii) QC at the study file level, the meta-level across studies and the meta-analysis output level. Real-world examples highlight issues experienced and solutions developed by the GIANT Consortium that has conducted meta-analyses including data from 125 studies comprising more than 330,000 individuals. We provide a general protocol for conducting GWAMAs and carrying out QC to minimize errors and to guarantee maximum use of the data. We also include details for the use of a powerful and flexible software package called EasyQC. Precise timings will be greatly influenced by consortium size. For consortia of comparable size to the GIANT Consortium, this protocol takes a minimum of about 10 months to complete.
Roh, Tae-Young; Zhao, Keji
One major postgenomic challenge is to characterize the epigenomes that control genome functions. The epigenomes are mainly defined by the specific association of nonhistone proteins with chromatin and the covalent modifications of chromatin, including DNA methylation and posttranslational histone modifications. The in vivo protein-binding and chromatin-modification patterns can be revealed by the chromatin immunoprecipitation assay (ChIP). By combining the ChIP assays and the serial analysis of gene expression (SAGE) protocols, we have developed an unbiased and high-resolution genome-wide mapping technique (GMAT) to determine the genome-wide protein-targeting and chromatin-modification patterns. GMAT has been successfully applied to mapping the target sites of the histone acetyltransferase, Gcn5p, in yeast and to the discovery of the histone acetylation islands as an epigenetic mark for functional regulatory elements in the human genome.
Winkler, Thomas W; Day, Felix R; Croteau-Chonka, Damien C; Wood, Andrew R; Locke, Adam E; Mägi, Reedik; Ferreira, Teresa; Fall, Tove; Graff, Mariaelisa; Justice, Anne E; Luan, Jian'an; Gustafsson, Stefan; Randall, Joshua C; Vedantam, Sailaja; Workalemahu, Tsegaselassie; Kilpeläinen, Tuomas O; Scherag, André; Esko, Tonu; Kutalik, Zoltán; Heid, Iris M; Loos, Ruth JF
Rigorous organization and quality control (QC) are necessary to facilitate successful genome-wide association meta-analyses (GWAMAs) of statistics aggregated across multiple genome-wide association studies. This protocol provides guidelines for  organizational aspects of GWAMAs, and for  QC at the study file level, the meta-level across studies, and the meta-analysis output level. Real–world examples highlight issues experienced and solutions developed by the GIANT Consortium that has conducted meta-analyses including data from 125 studies comprising more than 330,000 individuals. We provide a general protocol for conducting GWAMAs and carrying out QC to minimize errors and to guarantee maximum use of the data. We also include details for use of a powerful and flexible software package called EasyQC. For consortia of comparable size to the GIANT consortium, the present protocol takes a minimum of about 10 months to complete. PMID:24762786
Barban, Nicola; Jansen, Rick; de Vlaming, Ronald; Vaez, Ahmad; Mandemakers, Jornt J; Tropf, Felix C; Shen, Xia; Wilson, James F; Chasman, Daniel I; Nolte, Ilja M; Tragante, Vinicius; van der Laan, Sander W; Perry, John R B; Kong, Augustine; Ahluwalia, Tarunveer S; Albrecht, Eva; Yerges-Armstrong, Laura; Atzmon, Gil; Auro, Kirsi; Ayers, Kristin; Bakshi, Andrew; Ben-Avraham, Danny; Berger, Klaus; Bergman, Aviv; Bertram, Lars; Bielak, Lawrence F; Bjornsdottir, Gyda; Bonder, Marc Jan; Broer, Linda; Bui, Minh; Barbieri, Caterina; Cavadino, Alana; Chavarro, Jorge E; Turman, Constance; Concas, Maria Pina; Cordell, Heather J; Davies, Gail; Eibich, Peter; Eriksson, Nicholas; Esko, Tõnu; Eriksson, Joel; Falahi, Fahimeh; Felix, Janine F; Fontana, Mark Alan; Franke, Lude; Gandin, Ilaria; Gaskins, Audrey J; Gieger, Christian; Gunderson, Erica P; Guo, Xiuqing; Hayward, Caroline; He, Chunyan; Hofer, Edith; Huang, Hongyan; Joshi, Peter K; Kanoni, Stavroula; Karlsson, Robert; Kiechl, Stefan; Kifley, Annette; Kluttig, Alexander; Kraft, Peter; Lagou, Vasiliki; Lecoeur, Cecile; Lahti, Jari; Li-Gao, Ruifang; Lind, Penelope A; Liu, Tian; Makalic, Enes; Mamasoula, Crysovalanto; Matteson, Lindsay; Mbarek, Hamdi; McArdle, Patrick F; McMahon, George; Meddens, S Fleur W; Mihailov, Evelin; Miller, Mike; Missmer, Stacey A; Monnereau, Claire; van der Most, Peter J; Myhre, Ronny; Nalls, Mike A; Nutile, Teresa; Kalafati, Ioanna Panagiota; Porcu, Eleonora; Prokopenko, Inga; Rajan, Kumar B; Rich-Edwards, Janet; Rietveld, Cornelius A; Robino, Antonietta; Rose, Lynda M; Rueedi, Rico; Ryan, Kathleen A; Saba, Yasaman; Schmidt, Daniel; Smith, Jennifer A; Stolk, Lisette; Streeten, Elizabeth; Tönjes, Anke; Thorleifsson, Gudmar; Ulivi, Sheila; Wedenoja, Juho; Wellmann, Juergen; Willeit, Peter; Yao, Jie; Yengo, Loic; Zhao, Jing Hua; Zhao, Wei; Zhernakova, Daria V; Amin, Najaf; Andrews, Howard; Balkau, Beverley; Barzilai, Nir; Bergmann, Sven; Biino, Ginevra; Bisgaard, Hans; Bønnelykke, Klaus; Boomsma, Dorret I; Buring, Julie E; Campbell, Harry; Cappellani, Stefania; Ciullo, Marina; Cox, Simon R; Cucca, Francesco; Toniolo, Daniela; Davey-Smith, George; Deary, Ian J; Dedoussis, George; Deloukas, Panos; van Duijn, Cornelia M; de Geus, Eco J C; Eriksson, Johan G; Evans, Denis A; Faul, Jessica D; Sala, Cinzia Felicita; Froguel, Philippe; Gasparini, Paolo; Girotto, Giorgia; Grabe, Hans-Jörgen; Greiser, Karin Halina; Groenen, Patrick J F; de Haan, Hugoline G; Haerting, Johannes; Harris, Tamara B; Heath, Andrew C; Heikkilä, Kauko; Hofman, Albert; Homuth, Georg; Holliday, Elizabeth G; Hopper, John; Hyppönen, Elina; Jacobsson, Bo; Jaddoe, Vincent W V; Johannesson, Magnus; Jugessur, Astanand; Kähönen, Mika; Kajantie, Eero; Kardia, Sharon L R; Keavney, Bernard; Kolcic, Ivana; Koponen, Päivikki; Kovacs, Peter; Kronenberg, Florian; Kutalik, Zoltan; La Bianca, Martina; Lachance, Genevieve; Iacono, William G; Lai, Sandra; Lehtimäki, Terho; Liewald, David C; Lindgren, Cecilia M; Liu, Yongmei; Luben, Robert; Lucht, Michael; Luoto, Riitta; Magnus, Per; Magnusson, Patrik K E; Martin, Nicholas G; McGue, Matt; McQuillan, Ruth; Medland, Sarah E; Meisinger, Christa; Mellström, Dan; Metspalu, Andres; Traglia, Michela; Milani, Lili; Mitchell, Paul; Montgomery, Grant W; Mook-Kanamori, Dennis; de Mutsert, Renée; Nohr, Ellen A; Ohlsson, Claes; Olsen, Jørn; Ong, Ken K; Paternoster, Lavinia; Pattie, Alison; Penninx, Brenda W J H; Perola, Markus; Peyser, Patricia A; Pirastu, Mario; Polasek, Ozren; Power, Chris; Kaprio, Jaakko; Raffel, Leslie J; Räikkönen, Katri; Raitakari, Olli; Ridker, Paul M; Ring, Susan M; Roll, Kathryn; Rudan, Igor; Ruggiero, Daniela; Rujescu, Dan; Salomaa, Veikko; Schlessinger, David; Schmidt, Helena; Schmidt, Reinhold; Schupf, Nicole; Smit, Johannes; Sorice, Rossella; Spector, Tim D; Starr, John M; Stöckl, Doris; Strauch, Konstantin; Stumvoll, Michael; Swertz, Morris A; Thorsteinsdottir, Unnur; Thurik, A Roy; Timpson, Nicholas J; Tung, Joyce Y; Uitterlinden, André G; Vaccargiu, Simona; Viikari, Jorma; Vitart, Veronique; Völzke, Henry; Vollenweider, Peter; Vuckovic, Dragana; Waage, Johannes; Wagner, Gert G; Wang, Jie Jin; Wareham, Nicholas J; Weir, David R; Willemsen, Gonneke; Willeit, Johann; Wright, Alan F; Zondervan, Krina T; Stefansson, Kari; Krueger, Robert F; Lee, James J; Benjamin, Daniel J; Cesarini, David; Koellinger, Philipp D; den Hoed, Marcel; Snieder, Harold; Mills, Melinda C
The genetic architecture of human reproductive behavior-age at first birth (AFB) and number of children ever born (NEB)-has a strong relationship with fitness, human development, infertility and risk of neuropsychiatric disorders. However, very few genetic loci have been identified, and the underlying mechanisms of AFB and NEB are poorly understood. We report a large genome-wide association study of both sexes including 251,151 individuals for AFB and 343,072 individuals for NEB. We identified 12 independent loci that are significantly associated with AFB and/or NEB in a SNP-based genome-wide association study and 4 additional loci associated in a gene-based effort. These loci harbor genes that are likely to have a role, either directly or by affecting non-local gene expression, in human reproduction and infertility, thereby increasing understanding of these complex traits.
Oud, Bart; van Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T
Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages.
Morris, Stephanie A; Baek, Songjoon; Sung, Myong-Hee; John, Sam; Wiench, Malgorzata; Johnson, Thomas A; Schiltz, R Louis; Hager, Gordon L
ATP-dependent chromatin remodeling is an essential process required for the dynamic organization of chromatin structure. Here we describe the genome-wide location and activity of three remodeler proteins with diverse physiological functions in the mouse genome: Brg1, Chd4 and Snf2h. The localization patterns of all three proteins substantially overlap with one another and with regions of accessible chromatin. Furthermore, using inducible mutant variants, we demonstrate that the catalytic activity of these proteins contributes to the remodeling of chromatin genome wide and that each of these remodelers can independently regulate chromatin reorganization at distinct sites. Many regions require the activity of more than one remodeler to regulate accessibility. These findings provide a dynamic view of chromatin organization and highlight the differential contributions of remodelers to chromatin maintenance in higher eukaryotes.
Kamvar, Zhian N.; Brooks, Jonah C.; Grünwald, Niklaus J.
To gain a detailed understanding of how plant microbes evolve and adapt to hosts, pesticides, and other factors, knowledge of the population dynamics and evolutionary history of populations is crucial. Plant pathogen populations are often clonal or partially clonal which requires different analytical tools. With the advent of high throughput sequencing technologies, obtaining genome-wide population genetic data has become easier than ever before. We previously contributed the R package poppr specifically addressing issues with analysis of clonal populations. In this paper we provide several significant extensions to poppr with a focus on large, genome-wide SNP data. Specifically, we provide several new functionalities including the new function mlg.filter to define clone boundaries allowing for inspection and definition of what is a clonal lineage, minimum spanning networks with reticulation, a sliding-window analysis of the index of association, modular bootstrapping of any genetic distance, and analyses across any level of hierarchies. PMID:26113860
The group that formed on the theme of genome-wide association analyses of quantitative traits (Group 2) in the Genetic Analysis Workshop 16 comprised eight sets of investigators. Three data sets were available: one on autoantibodies related to rheumatoid arthritis provided by the North American Rheumatoid Arthritis Consortium; the second on anthropometric, lipid, and biochemical measures provided by the Framingham Heart Study (FHS); and the third a simulated data set modeled after FHS. The different investigators in the group addressed a large set of statistical challenges and applied a wide spectrum of association methods in analyzing quantitative traits at the genome-wide level. While some previously reported genes were validated, some novel chromosomal regions provided significant evidence of association in multiple contributions in the group. In this report, we discuss the different strategies explored by the different investigators with the common goal of improving the power to detect association.
Jin, Ying; Birlea, Stanca A; Fain, Pamela R; Ferrara, Tracey M; Ben, Songtao; Riccardi, Sheri L; Cole, Joanne B; Gowan, Katherine; Holland, Paulene J; Bennett, Dorothy C; Luiten, Rosalie M; Wolkerstorfer, Albert; van der Veen, JP Wietze; Hartmann, Anke; Eichner, Saskia; Schuler, Gerold; van Geel, Nanja; Lambert, Jo; Kemp, E Helen; Gawkrodger, David J; Weetman, Anthony P; Taïeb, Alain; Jouary, Thomas; Ezzedine, Khaled; Wallace, Margaret R; McCormack, Wayne T; Picardo, Mauro; Leone, Giovanni; Overbeck, Andreas; Silverberg, Nanette B; Spritz, Richard A
In previous linkage and genome-wide association studies we identified 17 susceptibility loci for generalized vitiligo. By a second genome-wide association study, meta-analysis, and independent replication study, we have now identified 13 additional vitiligo-associated loci, including OCA2-HERC2, a region of 16q24.3 containing MC1R, a region of chromosome 11q21 near TYR, several immunoregulatory loci including IFIH1, CD80, CLNK, BACH2, SLA, CASP7, CD44, IKZF4, SH2B3, and a region of 22q13.2 where the causal gene remains uncertain. Functional pathway analysis shows that most vitiligo susceptibility loci encode immunoregulatory proteins or melanocyte components that likely mediate immune targeting and genetic relationships among vitiligo, malignant melanoma, and normal variation of eye, skin, and hair color. PMID:22561518
Savage, Sharon A.; Mirabello, Lisa; Wang, Zhaoming; Gastier-Foster, Julie M.; Gorlick, Richard; Khanna, Chand; Flanagan, Adrienne M.; Tirabosco, Roberto; Andrulis, Irene L.; Wunder, Jay S.; Gokgoz, Nalan; Patiño-Garcia, Ana; Sierrasesúmaga, Luis; Lecanda, Fernando; Kurucu, Nilgün; Ilhan, Inci Ergurhan; Sari, Neriman; Serra, Massimo; Hattinger, Claudia; Picci, Piero; Spector, Logan; Barkauskas, Donald A.; Marina, Neyssa; de Toledo, Silvia Regina Caminada; Petrilli, Antonio S.; Amary, Maria Fernanda; Halai, Dina; Thomas, David M.; Douglass, Chester; Meltzer, Paul S.; Jacobs, Kevin; Chung, Charles C.; Berndt, Sonja I.; Purdue, Mark P.; Caporaso, Neil E.; Tucker, Margaret; Rothman, Nathaniel; Landi, Maria Teresa; Silverman, Debra T.; Kraft, Peter; Hunter, David J.; Malats, Nuria; Kogevinas, Manolis; Wacholder, Sholom; Troisi, Rebecca; Helman, Lee; Fraumeni, Joseph F.; Yeager, Meredith; Hoover, Robert N.; Chanock, Stephen J.
Osteosarcoma is the most common primary bone malignancy of adolescents and young adults. In order to better understand the genetic etiology of osteosarcoma, we performed a multi-stage genome-wide association study (GWAS) consisting of 941 cases and 3,291 cancer-free adult controls of European ancestry. Two loci achieved genome-wide significance: rs1906953 at 6p21.3, in the glutamate receptor metabotropic 4 [GRM4] gene (P = 8.1 ×10-9), and rs7591996 and rs10208273 in a gene desert on 2p25.2 (P = 1.0 ×10-8 and 2.9 ×10-7). These two susceptibility loci warrant further exploration to uncover the biological mechanisms underlying susceptibility to osteosarcoma. PMID:23727862
Law, Matthew H; Bishop, D Timothy; Lee, Jeffrey E; Brossard, Myriam; Martin, Nicholas G; Moses, Eric K; Song, Fengju; Barrett, Jennifer H; Kumar, Rajiv; Easton, Douglas F; Pharoah, Paul D P; Swerdlow, Anthony J; Kypreou, Katerina P; Taylor, John C; Harland, Mark; Randerson-Moor, Juliette; Akslen, Lars A; Andresen, Per A; Avril, Marie-Françoise; Azizi, Esther; Scarrà, Giovanna Bianchi; Brown, Kevin M; Dȩbniak, Tadeusz; Duffy, David L; Elder, David E; Fang, Shenying; Friedman, Eitan; Galan, Pilar; Ghiorzo, Paola; Gillanders, Elizabeth M; Goldstein, Alisa M; Gruis, Nelleke A; Hansson, Johan; Helsing, Per; Hočevar, Marko; Höiom, Veronica; Ingvar, Christian; Kanetsky, Peter A; Chen, Wei V; Landi, Maria Teresa; Lang, Julie; Lathrop, G Mark; Lubiński, Jan; Mackie, Rona M; Mann, Graham J; Molven, Anders; Montgomery, Grant W; Novaković, Srdjan; Olsson, Håkan; Puig, Susana; Puig-Butille, Joan Anton; Qureshi, Abrar A; Radford-Smith, Graham L; van der Stoep, Nienke; van Doorn, Remco; Whiteman, David C; Craig, Jamie E; Schadendorf, Dirk; Simms, Lisa A; Burdon, Kathryn P; Nyholt, Dale R; Pooley, Karen A; Orr, Nick; Stratigos, Alexander J; Cust, Anne E; Ward, Sarah V; Hayward, Nicholas K; Han, Jiali; Schulze, Hans-Joachim; Dunning, Alison M; Bishop, Julia A Newton; Demenais, Florence; Amos, Christopher I; MacGregor, Stuart; Iles, Mark M
Thirteen common susceptibility loci have been reproducibly associated with cutaneous malignant melanoma (CMM). We report the results of an international 2-stage meta-analysis of CMM genome-wide association studies (GWAS). This meta-analysis combines 11 GWAS (5 previously unpublished) and a further three stage 2 data sets, totaling 15,990 CMM cases and 26,409 controls. Five loci not previously associated with CMM risk reached genome-wide significance (P < 5 × 10(-8)), as did 2 previously reported but unreplicated loci and all 13 established loci. Newly associated SNPs fall within putative melanocyte regulatory elements, and bioinformatic and expression quantitative trait locus (eQTL) data highlight candidate genes in the associated regions, including one involved in telomere biology.
Law, Matthew H.; Bishop, D. Timothy; Martin, Nicholas G.; Moses, Eric K.; Song, Fengju; Barrett, Jennifer H.; Kumar, Rajiv; Easton, Douglas F.; Pharoah, Paul D. P.; Swerdlow, Anthony J.; Kypreou, Katerina P.; Taylor, John C.; Harland, Mark; Randerson-Moor, Juliette; Akslen, Lars A.; Andresen, Per A.; Avril, Marie-Françoise; Azizi, Esther; Scarrà, Giovanna Bianchi; Brown, Kevin M.; Dębniak, Tadeusz; Duffy, David L.; Elder, David E.; Fang, Shenying; Friedman, Eitan; Galan, Pilar; Ghiorzo, Paola; Gillanders, Elizabeth M.; Goldstein, Alisa M.; Gruis, Nelleke A.; Hansson, Johan; Helsing, Per; Hočevar, Marko; Höiom, Veronica; Ingvar, Christian; Kanetsky, Peter A.; Chen, Wei V.; Landi, Maria Teresa; Lang, Julie; Lathrop, G. Mark; Lubiński, Jan; Mackie, Rona M.; Mann, Graham J.; Molven, Anders; Montgomery, Grant W.; Novaković, Srdjan; Olsson, Håkan; Puig, Susana; Puig-Butille, Joan Anton; Qureshi, Abrar A.; Radford-Smith, Graham L.; van der Stoep, Nienke; van Doorn, Remco; Whiteman, David C.; Craig, Jamie E.; Schadendorf, Dirk; Simms, Lisa A.; Burdon, Kathryn P.; Nyholt, Dale R.; Pooley, Karen A.; Orr, Nick; Stratigos, Alexander J.; Cust, Anne E.; Ward, Sarah V.; Hayward, Nicholas K.; Han, Jiali; Schulze, Hans-Joachim; Dunning, Alison M.; Bishop, Julia A. Newton; MacGregor, Stuart; Iles, Mark M.
Thirteen common susceptibility loci have been reproducibly associated with cutaneous malignant melanoma (CMM). We report the results of an international 2-stage meta-analysis of CMM genome-wide association studies (GWAS). This meta-analysis combines 11 GWAS (5 previously unpublished) and a further three stage 2 data sets, totaling 15,990 CMM cases and 26,409 controls. Five loci not previously associated with CMM risk reached genome-wide significance (P < 5×10–8), as did two previously-reported but un-replicated loci and all thirteen established loci. Novel SNPs fall within putative melanocyte regulatory elements, and bioinformatic and expression quantitative trait locus (eQTL) data highlight candidate genes including one involved in telomere biology. PMID:26237428
Yang, Jian; Bakshi, Andrew; Zhu, Zhihong; Hemani, Gibran; Vinkhuyzen, Anna A E; Nolte, Ilja M; van Vliet-Ostaptchouk, Jana V; Snieder, Harold; Esko, Tonu; Milani, Lili; Mägi, Reedik; Metspalu, Andres; Hamsten, Anders; Magnusson, Patrik K E; Pedersen, Nancy L; Ingelsson, Erik; Visscher, Peter M
Sex-specific genetic effects have been proposed to be an important source of variation for human complex traits. Here we use two distinct genome-wide methods to estimate the autosomal genetic correlation (rg) between men and women for human height and body mass index (BMI), using individual-level (n = ∼44 000) and summary-level (n = ∼133 000) data from genome-wide association studies. Results are consistent and show that the between-sex genetic correlation is not significantly different from unity for both traits. In contrast, we find evidence of genetic heterogeneity between sexes for waist-hip ratio (rg = ∼0.7) and between populations for BMI (rg = ∼0.9 between Europe and the USA) but not for height. The lack of evidence for substantial genetic heterogeneity for body size is consistent with empirical findings across traits and species.
Li, Ming-hui; Fu, Sheng-bo; Xiao, Hua-sheng
Cancer is an extremely diverse and complex disease that results from various genetic and epigenetic changes such as DNA copy-number variations, mutations, and aberrant mRNA and/or protein expression caused by abnormal transcriptional regulation. The expression profiles of certain microRNAs (miRNAs) and messenger RNAs (mRNAs) are closely related to cancer progression stages. In the past few decades, DNA microarray and next-generation sequencing techniques have been widely applied to identify miRNA and mRNA signatures for cancers on a genome-wide scale and have provided meaningful insights into cancer diagnosis, prognosis and personalized medicine. In this review, we summarize the progress in genome-wide analysis of miRNAs and mRNAs as cancer biomarkers, highlighting their diagnostic and prognostic roles. PMID:26299954
Chahal, Harvind S.; Wu, Wenting; Ransohoff, Katherine J.; Yang, Lingyao; Hedlin, Haley; Desai, Manisha; Lin, Yuan; Dai, Hong-Ji; Qureshi, Abrar A.; Li, Wen-Qing; Kraft, Peter; Hinds, David A.; Tang, Jean Y.; Han, Jiali; Sarin, Kavita Y.
Basal cell carcinoma (BCC) is the most common cancer worldwide with an annual incidence of 2.8 million cases in the United States alone. Previous studies have demonstrated an association between 21 distinct genetic loci and BCC risk. Here, we report the results of a two-stage genome-wide association study of BCC, totalling 17,187 cases and 287,054 controls. We confirm 17 previously reported loci and identify 14 new susceptibility loci reaching genome-wide significance (P<5 × 10−8, logistic regression). These newly associated SNPs lie within predicted keratinocyte regulatory elements and in expression quantitative trait loci; furthermore, we identify candidate genes and non-coding RNAs involved in telomere maintenance, immune regulation and tumour progression, providing deeper insight into the pathogenesis of BCC. PMID:27539887
Loth, Daan W; Soler Artigas, María; Gharib, Sina A; Wain, Louise V; Franceschini, Nora; Koch, Beate; Pottinger, Tess D; Smith, Albert Vernon; Duan, Qing; Oldmeadow, Chris; Lee, Mi Kyeong; Strachan, David P; James, Alan L; Huffman, Jennifer E; Vitart, Veronique; Ramasamy, Adaikalavan; Wareham, Nicholas J; Kaprio, Jaakko; Wang, Xin-Qun; Trochet, Holly; Kähönen, Mika; Flexeder, Claudia; Albrecht, Eva; Lopez, Lorna M; de Jong, Kim; Thyagarajan, Bharat; Alves, Alexessander Couto; Enroth, Stefan; Omenaas, Ernst; Joshi, Peter K; Fall, Tove; Viñuela, Ana; Launer, Lenore J; Loehr, Laura R; Fornage, Myriam; Li, Guo; Wilk, Jemma B; Tang, Wenbo; Manichaikul, Ani; Lahousse, Lies; Harris, Tamara B; North, Kari E; Rudnicka, Alicja R; Hui, Jennie; Gu, Xiangjun; Lumley, Thomas; Wright, Alan F; Hastie, Nicholas D; Campbell, Susan; Kumar, Rajesh; Pin, Isabelle; Scott, Robert A; Pietiläinen, Kirsi H; Surakka, Ida; Liu, Yongmei; Holliday, Elizabeth G; Schulz, Holger; Heinrich, Joachim; Davies, Gail; Vonk, Judith M; Wojczynski, Mary; Pouta, Anneli; Johansson, Asa; Wild, Sarah H; Ingelsson, Erik; Rivadeneira, Fernando; Völzke, Henry; Hysi, Pirro G; Eiriksdottir, Gudny; Morrison, Alanna C; Rotter, Jerome I; Gao, Wei; Postma, Dirkje S; White, Wendy B; Rich, Stephen S; Hofman, Albert; Aspelund, Thor; Couper, David; Smith, Lewis J; Psaty, Bruce M; Lohman, Kurt; Burchard, Esteban G; Uitterlinden, André G; Garcia, Melissa; Joubert, Bonnie R; McArdle, Wendy L; Musk, A Bill; Hansel, Nadia; Heckbert, Susan R; Zgaga, Lina; van Meurs, Joyce B J; Navarro, Pau; Rudan, Igor; Oh, Yeon-Mok; Redline, Susan; Jarvis, Deborah L; Zhao, Jing Hua; Rantanen, Taina; O'Connor, George T; Ripatti, Samuli; Scott, Rodney J; Karrasch, Stefan; Grallert, Harald; Gaddis, Nathan C; Starr, John M; Wijmenga, Cisca; Minster, Ryan L; Lederer, David J; Pekkanen, Juha; Gyllensten, Ulf; Campbell, Harry; Morris, Andrew P; Gläser, Sven; Hammond, Christopher J; Burkart, Kristin M; Beilby, John; Kritchevsky, Stephen B; Gudnason, Vilmundur; Hancock, Dana B; Williams, O Dale; Polasek, Ozren; Zemunik, Tatijana; Kolcic, Ivana; Petrini, Marcy F; Wjst, Matthias; Kim, Woo Jin; Porteous, David J; Scotland, Generation; Smith, Blair H; Viljanen, Anne; Heliövaara, Markku; Attia, John R; Sayers, Ian; Hampel, Regina; Gieger, Christian; Deary, Ian J; Boezen, H Marike; Newman, Anne; Jarvelin, Marjo-Riitta; Wilson, James F; Lind, Lars; Stricker, Bruno H; Teumer, Alexander; Spector, Timothy D; Melén, Erik; Peters, Marjolein J; Lange, Leslie A; Barr, R Graham; Bracke, Ken R; Verhamme, Fien M; Sung, Joohon; Hiemstra, Pieter S; Cassano, Patricia A; Sood, Akshay; Hayward, Caroline; Dupuis, Josée; Hall, Ian P; Brusselle, Guy G; Tobin, Martin D; London, Stephanie J
Forced vital capacity (FVC), a spirometric measure of pulmonary function, reflects lung volume and is used to diagnose and monitor lung diseases. We performed genome-wide association study meta-analysis of FVC in 52,253 individuals from 26 studies and followed up the top associations in 32,917 additional individuals of European ancestry. We found six new regions associated at genome-wide significance (P < 5 × 10(-8)) with FVC in or near EFEMP1, BMP6, MIR129-2-HSD17B12, PRDM11, WWOX and KCNJ2. Two loci previously associated with spirometric measures (GSTCD and PTCH1) were related to FVC. Newly implicated regions were followed up in samples from African-American, Korean, Chinese and Hispanic individuals. We detected transcripts for all six newly implicated genes in human lung tissue. The new loci may inform mechanisms involved in lung development and the pathogenesis of restrictive lung disease.
Loth, Daan W.; Artigas, María Soler; Gharib, Sina A.; Wain, Louise V.; Franceschini, Nora; Koch, Beate; Pottinger, Tess; Smith, Albert Vernon; Duan, Qing; Oldmeadow, Chris; Lee, Mi Kyeong; Strachan, David P.; James, Alan L.; Huffman, Jennifer E.; Vitart, Veronique; Ramasamy, Adaikalavan; Wareham, Nicholas J.; Kaprio, Jaakko; Wang, Xin-Qun; Trochet, Holly; Kähönen, Mika; Flexeder, Claudia; Albrecht, Eva; Lopez, Lorna M.; de Jong, Kim; Thyagarajan, Bharat; Alves, Alexessander Couto; Enroth, Stefan; Omenaas, Ernst; Joshi, Peter K.; Fall, Tove; Viňuela, Ana; Launer, Lenore J.; Loehr, Laura R.; Fornage, Myriam; Li, Guo; Wilk, Jemma B.; Tang, Wenbo; Manichaikul, Ani; Lahousse, Lies; Harris, Tamara B.; North, Kari E.; Rudnicka, Alicja R.; Hui, Jennie; Gu, Xiangjun; Lumley, Thomas; Wright, Alan F.; Hastie, Nicholas D.; Campbell, Susan; Kumar, Rajesh; Pin, Isabelle; Scott, Robert A.; Pietiläinen, Kirsi H.; Surakka, Ida; Liu, Yongmei; Holliday, Elizabeth G.; Schulz, Holger; Heinrich, Joachim; Davies, Gail; Vonk, Judith M.; Wojczynski, Mary; Pouta, Anneli; Johansson, Åsa; Wild, Sarah H.; Ingelsson, Erik; Rivadeneira, Fernando; Völzke, Henry; Hysi, Pirro G.; Eiriksdottir, Gudny; Morrison, Alanna C.; Rotter, Jerome I.; Gao, Wei; Postma, Dirkje S.; White, Wendy B.; Rich, Stephen S.; Hofman, Albert; Aspelund, Thor; Couper, David; Smith, Lewis J.; Psaty, Bruce M.; Lohman, Kurt; Burchard, Esteban G.; Uitterlinden, André G.; Garcia, Melissa; Joubert, Bonnie R.; McArdle, Wendy L.; Musk, A. Bill; Hansel, Nadia; Heckbert, Susan R.; Zgaga, Lina; van Meurs, Joyce B.J.; Navarro, Pau; Rudan, Igor; Oh, Yeon-Mok; Redline, Susan; Jarvis, Deborah; Zhao, Jing Hua; Rantanen, Taina; O’Connor, George T.; Ripatti, Samuli; Scott, Rodney J.; Karrasch, Stefan; Grallert, Harald; Gaddis, Nathan C.; Starr, John M.; Wijmenga, Cisca; Minster, Ryan L.; Lederer, David J.; Pekkanen, Juha; Gyllensten, Ulf; Campbell, Harry; Morris, Andrew P.; Gläser, Sven; Hammond, Christopher J.; Burkart, Kristin M.; Beilby, John; Kritchevsky, Stephen B.; Gudnason, Vilmundur; Hancock, Dana B.; Williams, O. Dale; Polasek, Ozren; Zemunik, Tatijana; Kolcic, Ivana; Petrini, Marcy F.; Wjst, Matthias; Kim, Woo Jin; Porteous, David J.; Scotland, Generation; Smith, Blair H.; Viljanen, Anne; Heliövaara, Markku; Attia, John R.; Sayers, Ian; Hampel, Regina; Gieger, Christian; Deary, Ian J.; Boezen, H. Marike; Newman, Anne; Jarvelin, Marjo-Riitta; Wilson, James F.; Lind, Lars; Stricker, Bruno H.; Teumer, Alexander; Spector, Timothy D.; Melén, Erik; Peters, Marjolein J.; Lange, Leslie A.; Barr, R. Graham; Bracke, Ken R.; Verhamme, Fien M.; Sung, Joohon; Hiemstra, Pieter S.; Cassano, Patricia A.; Sood, Akshay; Hayward, Caroline; Dupuis, Josée; Hall, Ian P.; Brusselle, Guy G.; Tobin, Martin D.; London, Stephanie J.
Forced vital capacity (FVC), a spirometric measure of pulmonary function, reflects lung volume and is used to diagnose and monitor lung diseases. We performed genome-wide association study meta-analysis of FVC in 52,253 individuals from 26 studies and followed up the top associations in 32,917 additional individuals of European ancestry. We found six new regions associated at genome-wide significance (P < 5 × 10−8) with FVC in or near EFEMP1, BMP6, MIR-129-2/HSD17B12, PRDM11, WWOX, and KCNJ2. Two (GSTCD and PTCH1) loci previously associated with spirometric measures were related to FVC. Newly implicated regions were followed-up in samples of African American, Korean, Chinese, and Hispanic individuals. We detected transcripts for all six newly implicated genes in human lung tissue. The new loci may inform mechanisms involved in lung development and pathogenesis of restrictive lung disease. PMID:24929828
Osteoporosis, the most common type of bone disease worldwide, is clinically characterized by low bone mineral density (BMD) and increased susceptibility to fracture. Multiple genetic and environmental factors and gene-environment interactions have been implicated in its pathogenesis. Osteoporosis has strong genetic determination, with the heritability of BMD estimated to be as high as 60%. More than 80 genes or genetic variants have been implicated in risk of osteoporosis by hypothesis-free genome-wide studies. However, these genes or genetic variants can only explain a small portion of BMD variation, suggesting that many other genes or genetic variants underlying osteoporosis risk await discovery. Here, we review recent progress in genome-wide studies of osteoporosis and discuss their implications for medicine and the major challenges in the field. PMID:23731620
Oud, Bart; Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T
Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages. PMID:22152095
Gottschalk, Maren; Metzger, Julia; Martinsson, Gunilla; Sieme, Harald; Distl, Ottmar
We performed a genome-wide association study for semen quality traits in 139 German Warmblood stallions. Stallions were genotyped using the Illumina equine SNP50 Beadchip. Traits analysed were de-regressed estimated breeding values (EBVs) for gel-free volume, sperm concentration, total number of sperm, progressive motility and the total number of progressively motile sperm. The GWAS revealed 29 SNPs on 12 different chromosomes as genome-wide significantly associated with semen quality traits. For ten genomic regions we could retrieve candidate genes influencing stallion fertility. Among the candidate genes, we could find the genes encoding cysteine-rich secretory proteins (CRISP1, CRISP2 and CRISP3). This was the first GWAS in horses performed for semen quality traits.
Xu, P; Wu, X; Wang, B; Luo, J; Liu, Y; Ehlers, J D; Close, T J; Roberts, P A; Lu, Z; Wang, S; Li, G
Association mapping of important traits of crop plants relies on first understanding the extent and patterns of linkage disequilibrium (LD) in the particular germplasm being investigated. We characterize here the genetic diversity, population structure and genome wide LD patterns in a set of asparagus bean (Vigna. unguiculata ssp. sesquipedialis) germplasm from China. A diverse collection of 99 asparagus bean and normal cowpea accessions were genotyped with 1127 expressed sequence tag-derived single nucleotide polymorphism markers (SNPs). The proportion of polymorphic SNPs across the collection was relatively low (39%), with an average number of SNPs per locus of 1.33. Bayesian population structure analysis indicated two subdivisions within the collection sampled that generally represented the 'standard vegetable' type (subgroup SV) and the 'non-standard vegetable' type (subgroup NSV), respectively. Level of LD (r(2)) was higher and extent of LD persisted longer in subgroup SV than in subgroup NSV, whereas LD decayed rapidly (0-2 cM) in both subgroups. LD decay distance varied among chromosomes, with the longest (≈ 5 cM) five times longer than the shortest (≈ 1 cM). Partitioning of LD variance into within- and between-subgroup components coupled with comparative LD decay analysis suggested that linkage group 5, 7 and 10 may have undergone the most intensive epistatic selection toward traits favorable for vegetable use. This work provides a first population genetic insight into domestication history of asparagus bean and demonstrates the feasibility of mapping complex traits by genome wide association study in asparagus bean using a currently available cowpea SNPs marker platform.
preinfected with adenovirus carrying either wild type or dominant-negative Stat5, were harvested and RNA was isolated as described in the Materials and...maintain tight control of signal transduction pathways, both for rapid induction and cessation of signaling. The protein CIS1 was isolated independently...differential suppression of Stat5 regulated transcripts and large-scale gene chip analysis. As a result of this work, rapid progress in genome-wide
Verhulst, Brad; Maes, Hermine H; Neale, Michael C
Improving the accuracy of phenotyping through the use of advanced psychometric tools will increase the power to find significant associations with genetic variants and expand the range of possible hypotheses that can be tested on a genome-wide scale. Multivariate methods, such as structural equation modeling (SEM), are valuable in the phenotypic analysis of psychiatric and substance use phenotypes, but these methods have not been integrated into standard genome-wide association analyses because fitting a SEM at each single nucleotide polymorphism (SNP) along the genome was hitherto considered to be too computationally demanding. By developing a method that can efficiently fit SEMs, it is possible to expand the set of models that can be tested. This is particularly necessary in psychiatric and behavioral genetics, where the statistical methods are often handicapped by phenotypes with large components of stochastic variance. Due to the enormous amount of data that genome-wide scans produce, the statistical methods used to analyze the data are relatively elementary and do not directly correspond with the rich theoretical development, and lack the potential to test more complex hypotheses about the measurement of, and interaction between, comorbid traits. In this paper, we present a method to test the association of a SNP with multiple phenotypes or a latent construct on a genome-wide basis using a diagonally weighted least squares (DWLS) estimator for four common SEMs: a one-factor model, a one-factor residuals model, a two-factor model, and a latent growth model. We demonstrate that the DWLS parameters and p-values strongly correspond with the more traditional full information maximum likelihood parameters and p-values. We also present the timing of simulations and power analyses and a comparison with and existing multivariate GWAS software package.
Wei, Sheng; Wang, Li-E; McHugh, Michelle K; Han, Younghun; Xiong, Momiao; Amos, Christopher I; Spitz, Margaret R; Wei, Qingyi Wei
Asbestos exposure is a known risk factor for lung cancer. Although recent genome-wide association studies (GWASs) have identified some novel loci for lung cancer risk, few addressed genome-wide gene-environment interactions. To determine gene-asbestos interactions in lung cancer risk, we conducted genome-wide gene-environment interaction analyses at levels of single nucleotide polymorphisms (SNPs), genes and pathways, using our published Texas lung cancer GWAS dataset. This dataset included 317 498 SNPs from 1154 lung cancer cases and 1137 cancer-free controls. The initial SNP-level P-values for interactions between genetic variants and self-reported asbestos exposure were estimated by unconditional logistic regression models with adjustment for age, sex, smoking status and pack-years. The P-value for the most significant SNP rs13383928 was 2.17×10(-6), which did not reach the genome-wide statistical significance. Using a versatile gene-based test approach, we found that the top significant gene was C7orf54, located on 7q32.1 (P = 8.90×10(-5)). Interestingly, most of the other significant genes were located on 11q13. When we used an improved gene-set-enrichment analysis approach, we found that the Fas signaling pathway and the antigen processing and presentation pathway were most significant (nominal P < 0.001; false discovery rate < 0.05) among 250 pathways containing 17 572 genes. We believe that our analysis is a pilot study that first describes the gene-asbestos interaction in lung cancer risk at levels of SNPs, genes and pathways. Our findings suggest that immune function regulation-related pathways may be mechanistically involved in asbestos-associated lung cancer risk.
likely to harbor cancer-related genes 6 Early-onset breast cancer GWAS • 3,203 non- Hispanic white participants – 1,647cases , 1,556 controls – From BCFR...cases than in controls, using logistic regression methods. Using genome-wide SNP data (525,000 SNPs) on 1,647 non- Hispanic white, early-onset...specific genomic locations, suggesting these regions harbor important cancer genes (10, 11). Homozygosity mapping is a natural extension of large
Markt, Sarah C; Nuttall, Elizabeth; Turman, Constance; Sinnott, Jennifer; Rimm, Eric B; Ecsedy, Ethan; Unger, Robert H; Fall, Katja; Finn, Stephen; Jensen, Majken K; Rider, Jennifer R; Kraft, Peter
Objective To determine the inherited factors associated with the ability to smell asparagus metabolites in urine. Design Genome wide association study. Setting Nurses’ Health Study and Health Professionals Follow-up Study cohorts. Participants 6909 men and women of European-American descent with available genetic data from genome wide association studies. Main outcome measure Participants were characterized as asparagus smellers if they strongly agreed with the prompt “after eating asparagus, you notice a strong characteristic odor in your urine,” and anosmic if otherwise. We calculated per-allele estimates of asparagus anosmia for about nine million single nucleotide polymorphisms using logistic regression. P values <5×10-8 were considered as genome wide significant. Results 58.0% of men (n=1449/2500) and 61.5% of women (n=2712/4409) had anosmia. 871 single nucleotide polymorphisms reached genome wide significance for asparagus anosmia, all in a region on chromosome 1 (1q44: 248139851-248595299) containing multiple genes in the olfactory receptor 2 (OR2) family. Conditional analyses revealed three independent markers associated with asparagus anosmia: rs13373863, rs71538191, and rs6689553. Conclusion A large proportion of people have asparagus anosmia. Genetic variation near multiple olfactory receptor genes is associated with the ability of an individual to smell the metabolites of asparagus in urine. Future replication studies are necessary before considering targeted therapies to help anosmic people discover what they are missing. PMID:27965198
Webb, Bradley T; Guo, An-Yuan; Maher, Brion S; Zhao, Zhongming; van den Oord, Edwin J; Kendler, Kenneth S; Riley, Brien P; Gillespie, Nathan A; Prescott, Carol A; Middeldorp, Christel M; Willemsen, Gonneke; de Geus, Eco JC; Hottenga, Jouke-Jan; Boomsma, Dorret I; Slagboom, Eline P; Wray, Naomi R; Montgomery, Grant W; Martin, Nicholas G; Wright, Margie J; Heath, Andrew C; Madden, Pamela A; Gelernter, Joel; Knowles, James A; Hamilton, Steven P; Weissman, Myrna M; Fyer, Abby J; Huezo-Diaz, Patricia; McGuffin, Peter; Farmer, Anne; Craig, Ian W; Lewis, Cathryn; Sham, Pak; Crowe, Raymond R; Flint, Jonathan; Hettema, John M
Genetic factors underlying trait neuroticism, reflecting a tendency towards negative affective states, may overlap genetic susceptibility for anxiety disorders and help explain the extensive comorbidity amongst internalizing disorders. Genome-wide linkage (GWL) data from several studies of neuroticism and anxiety disorders have been published, providing an opportunity to test such hypotheses and identify genomic regions that harbor genes common to these phenotypes. In all, 11 independent GWL studies of either neuroticism (n=8) or anxiety disorders (n=3) were collected, which comprised of 5341 families with 15 529 individuals. The rank-based genome scan meta-analysis (GSMA) approach was used to analyze each trait separately and combined, and global correlations between results were examined. False discovery rate (FDR) analysis was performed to test for enrichment of significant effects. Using 10 cM intervals, bins nominally significant for both GSMA statistics, PSR and POR, were found on chromosomes 9, 11, 12, and 14 for neuroticism and on chromosomes 1, 5, 15, and 16 for anxiety disorders. Genome-wide, the results for the two phenotypes were significantly correlated, and a combined analysis identified additional nominally significant bins. Although none reached genome-wide significance, an excess of significant PSRP-values were observed, with 12 bins falling under a FDR threshold of 0.50. As demonstrated by our identification of multiple, consistent signals across the genome, meta-analytically combining existing GWL data is a valuable approach to narrowing down regions relevant for anxiety-related phenotypes. This may prove useful for prioritizing emerging genome-wide association data for anxiety disorders. PMID:22473089
Huber, Peter E; Hauser, Kai; Abdollahi, Amir
Genome wide DNA expression profiling coupled with antibody array experiments using endostatin to probe the angiogenic signaling network in human endothelial cells were performed. The results reveal constraints on the measuring process that are of a similar kind as those implied by the uncertainty principle of quantum mechanics as described by Werner Heisenberg. We describe this analogy and argue for its heuristic utility in the conceptualization of angiogenesis as an important step in tumor formation.
For a global understanding of the physiological impact of the nuclear hormone 1α,25-dihydroxyvitamin D3 (1,25(OH)2D3) the analysis of the genome-wide locations of its high affinity receptor, the transcription factor vitamin D receptor (VDR), is essential. Chromatin immunoprecipitation sequencing (ChIP-seq) in GM10855 and GM10861 lymphoblastoid cells, undifferentiated and lipopolysaccharide-differentiated THP-1 monocytes, LS180 colorectal cancer cells and LX2 hepatic stellate cells revealed between 1000 and 13,000 VDR-specific genomic binding sites. The harmonized analysis of these ChIP-seq datasets indicates that the mechanistic basis for the action of the VDR is independent of the cell type. Formaldehyde-assisted isolation of regulatory elements sequencing (FAIRE-seq) data highlight accessible chromatin regions, which are under control of 1,25(OH)2D3. In addition, public data, such as from the ENCODE project, allow to relate the genome-wide actions of VDR and 1,25(OH)2D3 to those of other proteins within the nucleus. For example, locations of the insulator protein CTCF suggest a segregation of the human genome into chromatin domains, of which more than 1000 contain at least one VDR binding site. The integration of all these genome-wide data facilitates the identification of the most important VDR binding sites and associated primary 1,25(OH)2D3 target genes. Expression changes of these key genes can serve as biomarkers for the actions of vitamin D3 and its metabolites in different tissues and cell types of human individuals. Analysis of primary tissues obtained from vitamin D3 intervention studies using such markers indicated a large inter-individual variation for the efficiency of vitamin D3 supplementation. In conclusion, a genome-wide (over)view on the genomic locations of VDR provides a broader basis for addressing vitamin D's role in health and disease. PMID:24808867
Phipps, Amanda I.; Passarelli, Michael N.; Chan, Andrew T.; Harrison, Tabitha A.; Jeon, Jihyoun; Hutter, Carolyn M.; Berndt, Sonja I.; Brenner, Hermann; Caan, Bette J.; Campbell, Peter T.; Chang-Claude, Jenny; Chanock, Stephen J.; Cheadle, Jeremy P.; Curtis, Keith R.; Duggan, David; Fisher, David; Fuchs, Charles S.; Gala, Manish; Giovannucci, Edward L.; Hayes, Richard B.; Hoffmeister, Michael; Hsu, Li; Jacobs, Eric J.; Jansen, Lina; Kaplan, Richard; Kap, Elisabeth J.; Maughan, Timothy S.; Potter, John D.; Schoen, Robert E.; Seminara, Daniela; Slattery, Martha L.; West, Hannah; White, Emily; Peters, Ulrike; Newcomb, Polly A.
Genome-wide association studies have identified several germline single nucleotide polymorphisms (SNPs) significantly associated with colorectal cancer (CRC) incidence. Common germline genetic variation may also be related to CRC survival. We used a discovery-based approach to identify SNPs related to survival outcomes after CRC diagnosis. Genome-wide genotyping arrays were conducted for 3494 individuals with invasive CRC enrolled in six prospective cohort studies (median study-specific follow-up = 4.2–8.1 years). In pooled analyses, we used Cox regression to assess SNP-specific associations with CRC-specific and overall survival, with additional analyses stratified by stage at diagnosis. Top findings were followed-up in independent studies. A P value threshold of P < 5×10−8 in analyses combining discovery and follow-up studies was required for genome-wide significance. Among individuals with distant-metastatic CRC, several SNPs at 6p12.1, nearest the ELOVL5 gene, were statistically significantly associated with poorer survival, with the strongest associations noted for rs209489 [hazard ratio (HR) = 1.8, P = 7.6×10−10 and HR = 1.8, P = 3.7×10−9 for CRC-specific and overall survival, respectively). No SNPs were statistically significantly associated with survival among all cases combined or in cases without distant-metastases. SNPs in 6p12.1/ELOVL5 were associated with survival outcomes in individuals with distant-metastatic CRC, and merit further follow-up for functional significance. Findings from this genome-wide association study highlight the potential importance of genetic variation in CRC prognosis and provide clues to genomic regions of potential interest. PMID:26586795
Nalls, Michael A.; Martinez, Maria; Schulte, Claudia; Holmans, Peter; Gasser, Thomas; Hardy, John; Singleton, Andrew B.; Wood, Nicholas W.; Brice, Alexis; Heutink, Peter; Williams, Nigel; Morris, Huw R.
Parkinson's disease (PD) occurs in both familial and sporadic forms, and both monogenic and complex genetic factors have been identified. Early onset PD (EOPD) is particularly associated with autosomal recessive (AR) mutations, and three genes, PARK2, PARK7 and PINK1, have been found to carry mutations leading to AR disease. Since mutations in these genes account for less than 10% of EOPD patients, we hypothesized that further recessive genetic factors are involved in this disorder, which may appear in extended runs of homozygosity. We carried out genome wide SNP genotyping to look for extended runs of homozygosity (ROHs) in 1,445 EOPD cases and 6,987 controls. Logistic regression analyses showed an increased level of genomic homozygosity in EOPD cases compared to controls. These differences are larger for ROH of 9 Mb and above, where there is a more than three-fold increase in the proportion of cases carrying a ROH. These differences are not explained by occult recessive mutations at existing loci. Controlling for genome wide homozygosity in logistic regression analyses increased the differences between cases and controls, indicating that in EOPD cases ROHs do not simply relate to genome wide measures of inbreeding. Homozygosity at a locus on chromosome19p13.3 was identified as being more common in EOPD cases as compared to controls. Sequencing analysis of genes and predicted transcripts within this locus failed to identify a novel mutation causing EOPD in our cohort. There is an increased rate of genome wide homozygosity in EOPD, as measured by an increase in ROHs. These ROHs are a signature of inbreeding and do not necessarily harbour disease-causing genetic variants. Although there might be other regions of interest apart from chromosome 19p13.3, we lack the power to detect them with this analysis. PMID:22427796
Evans, Daniel S.; Cailotto, Frederic; Parimi, Neeta; Valdes, Ana M.; Castaño-Betancourt, Martha C.; Liu, Youfang; Kaplan, Robert C.; Bidlingmaier, Martin; Vasan, Ramachandran S.; Teumer, Alexander; Tranah, Gregory J.; Nevitt, Michael C.; Cummings, Steven R.; Orwoll, Eric S.; Barrett-Connor, Elizabeth; Renner, Jordan B.; Jordan, Joanne M.; Doherty, Michael; Doherty, Sally A.; Uitterlinden, Andre G.; van Meurs, Joyce B.J.; Spector, Tim D.; Lories, Rik J.; Lane, Nancy E.
Objectives To identify genetic associations with hip osteoarthritis (HOA), we performed a meta-analysis of genome-wide association studies (GWAS) of HOA. Methods The GWAS meta-analysis included approximately 2.5 million imputed HapMap single nucleotide polymorphisms (SNPs). HOA cases and controls defined radiographically and by total hip replacement were selected from the Osteoporotic Fractures in Men (MrOS) Study and the Study of Osteoporotic Fractures (SOF) (654 cases and 4697 controls, combined). Replication of genome-wide significant SNP associations (P-value ≤ 5x10−8) was examined in five studies (3243 cases and 6891 controls, combined). Functional studies were performed using in vitro models of chondrogenesis and osteogenesis. Results The A allele of rs788748, located 65 kb upstream of the IGFBP3 gene, was associated with lower HOA odds at the genome-wide significance level in the discovery stage (OR = 0.71, P-value = 2x10−8). The association replicated in five studies (OR = 0.92, P-value = 0.020), but the joint analysis of discovery and replication results was not genome-wide significant (P-value = 1x10−6). In separate study populations, the rs788748 A allele was also associated with lower circulating IGFBP3 protein levels (P-value = 4x10−13), suggesting that this SNP or a variant in linkage disequilibrium (LD) could be an IGFBP3 regulatory variant. Results from functional studies were consistent with association results. Chondrocyte hypertrophy, a deleterious event in OA pathogenesis, was largely prevented upon IGFBP3 knockdown in chondrocytes. Furthermore, IGFBP3 overexpression induced cartilage catabolism and osteogenic differentiation. Conclusions Results from GWAS and functional studies provided suggestive links between IGFBP3 and HOA. PMID:24928840
Li, Guangdi; De Clercq, Erik
The HIV genome encodes a small number of viral proteins (i.e., 16), invariably establishing cooperative associations among HIV proteins and between HIV and host proteins, to invade host cells and hijack their internal machineries. As a known example, the HIV envelope glycoprotein GP120 is closely associated with GP41 for viral entry. From a genome-wide perspective, a hypothesis can be worked out to determine whether 16 HIV proteins could develop 120 possible pairwise associations either by physical interactions or by functional associations mediated via HIV or host molecules. Here, we present the first systematic review of experimental evidence on HIV genome-wide protein associations using a large body of publications accumulated over the past 3 decades. Of 120 possible pairwise associations between 16 HIV proteins, at least 34 physical interactions and 17 functional associations have been identified. To achieve efficient viral replication and infection, HIV protein associations play essential roles (e.g., cleavage, inhibition, and activation) during the HIV life cycle. In either a dispensable or an indispensable manner, each HIV protein collaborates with another viral protein to accomplish specific activities that precisely take place at the proper stages of the HIV life cycle. In addition, HIV genome-wide protein associations have an impact on anti-HIV inhibitors due to the extensive cross talk between drug-inhibited proteins and other HIV proteins. Overall, this study presents for the first time a comprehensive overview of HIV genome-wide protein associations, highlighting meticulous collaborations between all viral proteins during the HIV life cycle.
Yucesoy, Berran; Kaufman, Kenneth M; Lummus, Zana L; Weirauch, Matthew T; Zhang, Ge; Cartier, André; Boulet, Louis-Philippe; Sastre, Joaquin; Quirce, Santiago; Tarlo, Susan M; Cruz, Maria-Jesus; Munoz, Xavier; Harley, John B; Bernstein, David I
Diisocyanates, reactive chemicals used to produce polyurethane products, are the most common causes of occupational asthma. The aim of this study is to identify susceptibility gene variants that could contribute to the pathogenesis of diisocyanate asthma (DA) using a Genome-Wide Association Study (GWAS) approach. Genome-wide single nucleotide polymorphism (SNP) genotyping was performed in 74 diisocyanate-exposed workers with DA and 824 healthy controls using Omni-2.5 and Omni-5 SNP microarrays. We identified 11 SNPs that exceeded genome-wide significance; the strongest association was for the rs12913832 SNP located on chromosome 15, which has been mapped to the HERC2 gene (p = 6.94 × 10(-14)). Strong associations were also found for SNPs near the ODZ3 and CDH17 genes on chromosomes 4 and 8 (rs908084, p = 8.59 × 10(-9) and rs2514805, p = 1.22 × 10(-8), respectively). We also prioritized 38 SNPs with suggestive genome-wide significance (p < 1 × 10(-6)). Among them, 17 SNPs map to the PITPNC1, ACMSD, ZBTB16, ODZ3, and CDH17 gene loci. Functional genomics data indicate that 2 of the suggestive SNPs (rs2446823 and rs2446824) are located within putative binding sites for the CCAAT/Enhancer Binding Protein (CEBP) and Hepatocyte Nuclear Factor 4, Alpha transcription factors (TFs), respectively. This study identified SNPs mapping to the HERC2, CDH17, and ODZ3 genes as potential susceptibility loci for DA. Pathway analysis indicated that these genes are associated with antigen processing and presentation, and other immune pathways. Overlap of 2 suggestive SNPs with likely TF binding sites suggests possible roles in disruption of gene regulation. These results provide new insights into the genetic architecture of DA and serve as a basis for future functional and mechanistic studies.
Gyori, Miklos; Borsos, Zsófia; Stefanik, Krisztina
At current, screening for, and diagnosis of, autism spectrum disorders (ASD) are based on purely behavioral data; established screening tools rely on human observation and ratings of relevant behaviors. The research and development project in the focus of this paper is aimed at designing, creating and evaluating a social serious game based multi-modal, interactive software system for screening for high functioning cases of ASD at kindergarten age. The aims of this paper are (1) to summarize the evidence-based design process and (2) to present results from the first usability test of the system. Game topic, candidate responses, and candidate game contents were identified via an iterative literature review. On this basis, the 1st partial prototype of the fully playable game has been created, with complete data recording functionality but without the decision making component. A first usability test was carried out on this prototype (n=13). Overall results were unambiguously promising. Although sporadic difficulties in, and slightly negative attitudes towards, using the game occasionally arose, these were confined to non-target-group children only. The next steps of development include (1) completing the game design; (2) carrying out first large-n field test; (3) creating the first prototype of the decision making component.
Xu, Yonghui; Min, Huaqing; Wu, Qingyao; Song, Hengjie; Ye, Bicui
Multi-Instance (MI) learning has been proven to be effective for the genome-wide protein function prediction problems where each training example is associated with multiple instances. Many studies in this literature attempted to find an appropriate Multi-Instance Learning (MIL) method for genome-wide protein function prediction under a usual assumption, the underlying distribution from testing data (target domain, i.e., TD) is the same as that from training data (source domain, i.e., SD). However, this assumption may be violated in real practice. To tackle this problem, in this paper, we propose a Multi-Instance Metric Transfer Learning (MIMTL) approach for genome-wide protein function prediction. In MIMTL, we first transfer the source domain distribution to the target domain distribution by utilizing the bag weights. Then, we construct a distance metric learning method with the reweighted bags. At last, we develop an alternative optimization scheme for MIMTL. Comprehensive experimental evidence on seven real-world organisms verifies the effectiveness and efficiency of the proposed MIMTL approach over several state-of-the-art methods.
Xu, Yonghui; Min, Huaqing; Song, Hengjie; Wu, Qingyao
Multi-instance multi-label (MIML) learning has been proven to be effective for the genome-wide protein function prediction problems where each training example is associated with not only multiple instances but also multiple class labels. To find an appropriate MIML learning method for genome-wide protein function prediction, many studies in the literature attempted to optimize objective functions in which dissimilarity between instances is measured using the Euclidean distance. But in many real applications, Euclidean distance may be unable to capture the intrinsic similarity/dissimilarity in feature space and label space. Unlike other previous approaches, in this paper, we propose to learn a multi-instance multi-label distance metric learning framework (MIMLDML) for genome-wide protein function prediction. Specifically, we learn a Mahalanobis distance to preserve and utilize the intrinsic geometric information of both feature space and label space for MIML learning. In addition, we try to deal with the sparsely labeled data by giving weight to the labeled data. Extensive experiments on seven real-world organisms covering the biological three-domain system (i.e., archaea, bacteria, and eukaryote; Woese et al., 1990) show that the MIMLDML algorithm is superior to most state-of-the-art MIML learning algorithms.
Wang, Yu; Li, Wei; Xia, Yingying; Wang, Chongzhi; Tang, Y Tom; Guo, Wenying; Li, Jinliang; Zhao, Xia; Sun, Yepeng; Hu, Juan; Zhen, Hefu; Zhang, Xiandong; Chen, Chao; Shi, Yujian; Li, Lin; Cao, Hongzhi; Du, Hongli; Li, Jian
Copy-number variations (CNV), loss of heterozygosity (LOH), and uniparental disomy (UPD) are large genomic aberrations leading to many common inherited diseases, cancers, and other complex diseases. An integrated tool to identify these aberrations is essential in understanding diseases and in designing clinical interventions. Previous discovery methods based on whole-genome sequencing (WGS) require very high depth of coverage on the whole genome scale, and are cost-wise inefficient. Another approach, whole exome genome sequencing (WEGS), is limited to discovering variations within exons. Thus, we are lacking efficient methods to detect genomic aberrations on the whole genome scale using next-generation sequencing technology. Here we present a method to identify genome-wide CNV, LOH and UPD for the human genome via selectively sequencing a small portion of genome termed Selected Target Regions (SeTRs). In our experiments, the SeTRs are covered by 99.73%~99.95% with sufficient depth. Our developed bioinformatics pipeline calls genome-wide CNVs with high confidence, revealing 8 credible events of LOH and 3 UPD events larger than 5M from 15 individual samples. We demonstrate that genome-wide CNV, LOH and UPD can be detected using a cost-effective SeTRs sequencing approach, and that LOH and UPD can be identified using just a sample grouping technique, without using a matched sample or familial information.
Byrne, Enda M; Gehrman, Philip R; Medland, Sarah E; Nyholt, Dale R; Heath, Andrew C; Madden, Pamela A F; Hickie, Ian B; Van Duijn, Cornelia M; Henders, Anjali K; Montgomery, Grant W; Martin, Nicholas G; Wray, Naomi R
Several aspects of sleep behavior such as timing, duration and quality have been demonstrated to be heritable. To identify common variants that influence sleep traits in the population, we conducted a genome-wide association study of six sleep phenotypes assessed by questionnaire in a sample of 2,323 individuals from the Australian Twin Registry. Genotyping was performed on the Illumina 317, 370, and 610K arrays and the SNPs in common between platforms were used to impute non-genotyped SNPs. We tested for association with more than 2,000,000 common polymorphisms across the genome. While no SNPs reached the genome-wide significance threshold, we identified a number of associations in plausible candidate genes. Most notably, a group of SNPs in the third intron of the CACNA1C gene ranked as most significant in the analysis of sleep latency (P = 1.3 × 10⁻⁶). We attempted to replicate this association in an independent sample from the Chronogen Consortium (n = 2,034), but found no evidence of association (P = 0.73). We have identified several other suggestive associations that await replication in an independent sample. We did not replicate the results from previous genome-wide analyses of self-reported sleep phenotypes after correction for multiple testing.
Belmonte Mahon, Pamela; Pirooznia, Mehdi; Goes, Fernando S; Seifuddin, Fayaz; Steele, Jo; Lee, Phil Hyoun; Huang, Jie; Hamshere, Marian L; Depaulo, J Raymond; Kelsoe, John R; Rietschel, Marcella; Nöthen, Markus; Cichon, Sven; Gurling, Hugh; Purcell, Shaun; Smoller, Jordan W; Craddock, Nick; Schulze, Thomas G; McMahon, Francis J; Potash, James B; Zandi, Peter P
Genome-wide association studies (GWAS) have identified several susceptibility loci for bipolar disorder (BP), most notably ANK3. However, most of the inherited risk for BP remains unexplained. One reason for the limited success may be the genetic heterogeneity of BP. Clinical sub-phenotypes of BP may identify more etiologically homogeneous subsets of patients, which can be studied with increased power to detect genetic variation. Here, we report on a mega-analysis of two widely studied sub-phenotypes of BP, age at onset and psychotic symptoms, which are familial and clinically significant. We combined data from three GWAS: NIMH Bipolar Disorder Genetic Association Information Network (GAIN-BP), NIMH Bipolar Disorder Genome Study (BiGS), and a German sample. The combined sample consisted of 2,836 BP cases with information on sub-phenotypes and 2,744 controls. Imputation was performed, resulting in 2.3 million SNPs available for analysis. No SNP reached genome-wide significance for either sub-phenotype. In addition, no SNP reached genome-wide significance in a meta-analysis with an independent replication sample. We had 80% power to detect associations with a common SNP at an OR of 1.6 for psychotic symptoms and a mean difference of 1.8 years in age at onset. Age at onset and psychotic symptoms in BP may be influenced by many genes of smaller effect sizes or other variants not measured well by SNP arrays, such as rare alleles.
Wang, Zhipeng; Zhang, Hui; Yang, Hua; Wang, Shouzhi; Rong, Enguang; Pei, Wenyu; Li, Hui; Wang, Ning
Genome-wide association studies (GWAS) provide a powerful approach for identifying quantitative trait loci without prior knowledge of location or function. To identify loci associated with wool production traits, we performed a genome-wide association study on a total of 765 Chinese Merino sheep (JunKen type) genotyped with 50 K single nucleotide polymorphisms (SNPs). In the present study, five wool production traits were examined: fiber diameter, fiber diameter coefficient of variation, fineness dispersion, staple length and crimp. We detected 28 genome-wide significant SNPs for fiber diameter, fiber diameter coefficient of variation, fineness dispersion, and crimp trait in the Chinese Merino sheep. About 43% of the significant SNP markers were located within known or predicted genes, including YWHAZ, KRTCAP3, TSPEAR, PIK3R4, KIF16B, PTPN3, GPRC5A, DDX47, TCF9, TPTE2, EPHA5 and NBEA genes. Our results not only confirm the results of previous reports, but also provide a suite of novel SNP markers and candidate genes associated with wool traits. Our findings will be useful for exploring the genetic control of wool traits in sheep.
Xu, Yonghui; Min, Huaqing; Wu, Qingyao; Song, Hengjie; Ye, Bicui
Multi-Instance (MI) learning has been proven to be effective for the genome-wide protein function prediction problems where each training example is associated with multiple instances. Many studies in this literature attempted to find an appropriate Multi-Instance Learning (MIL) method for genome-wide protein function prediction under a usual assumption, the underlying distribution from testing data (target domain, i.e., TD) is the same as that from training data (source domain, i.e., SD). However, this assumption may be violated in real practice. To tackle this problem, in this paper, we propose a Multi-Instance Metric Transfer Learning (MIMTL) approach for genome-wide protein function prediction. In MIMTL, we first transfer the source domain distribution to the target domain distribution by utilizing the bag weights. Then, we construct a distance metric learning method with the reweighted bags. At last, we develop an alternative optimization scheme for MIMTL. Comprehensive experimental evidence on seven real-world organisms verifies the effectiveness and efficiency of the proposed MIMTL approach over several state-of-the-art methods. PMID:28165495
Müller, Stefanie H.; Girard, Simon L.; Hopfner, Franziska; Merner, Nancy D.; Bourassa, Cynthia V.; Lorenz, Delia; Clark, Lorraine N.; Tittmann, Lukas; Soto-Ortolaza, Alexandra I.; Klebe, Stephan; Hallett, Mark; Schneider, Susanne A.; Hodgkinson, Colin A.; Lieb, Wolfgang; Wszolek, Zbigniew K.; Pendziwiat, Manuela; Lorenzo-Betancor, Oswaldo; Poewe, Werner; Ortega-Cubero, Sara; Seppi, Klaus; Rajput, Alex; Hussl, Anna; Rajput, Ali H.; Berg, Daniela; Dion, Patrick A.; Wurster, Isabel; Shulman, Joshua M.; Srulijes, Karin; Haubenberger, Dietrich; Pastor, Pau; Vilariño-Güell, Carles; Postuma, Ronald B.; Bernard, Geneviève; Ladwig, Karl-Heinz; Dupré, Nicolas; Jankovic, Joseph; Strauch, Konstantin; Panisset, Michel; Winkelmann, Juliane; Testa, Claudia M.; Reischl, Eva; Zeuner, Kirsten E.; Ross, Owen A.; Arzberger, Thomas; Chouinard, Sylvain; Deuschl, Günther; Louis, Elan D.; Kuhlenbäumer, Gregor
We conducted a genome-wide association study of essential tremor, a common movement disorder characterized mainly by a postural and kinetic tremor of the upper extremities. Twin and family history studies show a high heritability for essential tremor. The molecular genetic determinants of essential tremor are unknown. We included 2807 patients and 6441 controls of European descent in our two-stage genome-wide association study. The 59 most significantly disease-associated markers of the discovery stage were genotyped in the replication stage. After Bonferroni correction two markers, one (rs10937625) located in the serine/threonine kinase STK32B and one (rs17590046) in the transcriptional coactivator PPARGC1A were associated with essential tremor. Three markers (rs12764057, rs10822974, rs7903491) in the cell-adhesion molecule CTNNA3 were significant in the combined analysis of both stages. The expression of STK32B was increased in the cerebellar cortex of patients and expression quantitative trait loci database mining showed association between the protective minor allele of rs10937625 and reduced expression in cerebellar cortex. We found no expression differences related to disease status or marker genotype for the other two genes. Replication of two lead single nucleotide polymorphisms of previous small genome-wide association studies (rs3794087 in SLC1A2, rs9652490 in LINGO1) did not confirm the association with essential tremor. PMID:27797806
Byrne, Stephen; Czaban, Adrian; Studer, Bruno; Panitz, Frank; Bendixen, Christian; Asp, Torben
Genotyping-by-Sequencing (GBS) is an excellent tool for characterising genetic variation between plant genomes. To date, its use has been reported only for genotyping of single individuals. However, there are many applications where resolving allele frequencies within populations on a genome-wide scale would be very powerful, examples include the breeding of outbreeding species, varietal protection in outbreeding species, monitoring changes in population allele frequencies. This motivated us to test the potential to use GBS to evaluate allele frequencies within populations. Perennial ryegrass is an outbreeding species, and breeding programs are based upon selection on populations. We tested two restriction enzymes for their efficiency in complexity reduction of the perennial ryegrass genome. The resulting profiles have been termed Genome Wide Allele Frequency Fingerprints (GWAFFs), and we have shown how these fingerprints can be used to distinguish between plant populations. Even at current costs and throughput, using sequencing to directly evaluate populations on a genome-wide scale is viable. GWAFFs should find many applications, from varietal development in outbreeding species right through to playing a role in protecting plant breeders’ rights. PMID:23469194
Guo, Wenying; Li, Jinliang; Zhao, Xia; Sun, Yepeng; Hu, Juan; Zhen, Hefu; Zhang, Xiandong; Chen, Chao; Shi, Yujian; Li, Lin; Cao, Hongzhi; Du, Hongli; Li, Jian
Copy-number variations (CNV), loss of heterozygosity (LOH), and uniparental disomy (UPD) are large genomic aberrations leading to many common inherited diseases, cancers, and other complex diseases. An integrated tool to identify these aberrations is essential in understanding diseases and in designing clinical interventions. Previous discovery methods based on whole-genome sequencing (WGS) require very high depth of coverage on the whole genome scale, and are cost-wise inefficient. Another approach, whole exome genome sequencing (WEGS), is limited to discovering variations within exons. Thus, we are lacking efficient methods to detect genomic aberrations on the whole genome scale using next-generation sequencing technology. Here we present a method to identify genome-wide CNV, LOH and UPD for the human genome via selectively sequencing a small portion of genome termed Selected Target Regions (SeTRs). In our experiments, the SeTRs are covered by 99.73%~99.95% with sufficient depth. Our developed bioinformatics pipeline calls genome-wide CNVs with high confidence, revealing 8 credible events of LOH and 3 UPD events larger than 5M from 15 individual samples. We demonstrate that genome-wide CNV, LOH and UPD can be detected using a cost-effective SeTRs sequencing approach, and that LOH and UPD can be identified using just a sample grouping technique, without using a matched sample or familial information. PMID:25919136
Chen, Huan; Gu, Xiao-hong; Zhou, Yuxi; Ge, Zeng; Wang, Bin; Siok, Wai Ting; Wang, Guoqing; Huen, Michael; Jiang, Yuyang; Tan, Li-Hai; Sun, Yimin
Mathematics ability is a complex cognitive trait with polygenic heritability. Genome-wide association study (GWAS) has been an effective approach to investigate genetic components underlying mathematic ability. Although previous studies reported several candidate genetic variants, none of them exceeded genome-wide significant threshold in general populations. Herein, we performed GWAS in Chinese elementary school students to identify potential genetic variants associated with mathematics ability. The discovery stage included 494 and 504 individuals from two independent cohorts respectively. The replication stage included another cohort of 599 individuals. In total, 28 of 81 candidate SNPs that met validation criteria were further replicated. Combined meta-analysis of three cohorts identified four SNPs (rs1012694, rs11743006, rs17778739 and rs17777541) of SPOCK1 gene showing association with mathematics ability (minimum p value 5.67 × 10−10, maximum β −2.43). The SPOCK1 gene is located on chromosome 5q31.2 and encodes a highly conserved glycoprotein testican-1 which was associated with tumor progression and prognosis as well as neurogenesis. This is the first study to report genome-wide significant association of individual SNPs with mathematics ability in general populations. Our preliminary results further supported the role of SPOCK1 during neurodevelopment. The genetic complexities underlying mathematics ability might contribute to explain the basis of human cognition and intelligence at genetic level. PMID:28155865
Marchetti, F; Manohar, C F
The purpose of this technical feasibility study was to develop and evaluate robust microgenomic tools for investigations of genome-wide expression of very small numbers of cells isolated from whole tissue sections. Tissues contain large numbers of cell-types that play varied roles in organ function and responses to endogenous and exogenous toxicants whether bacterial, viral, chemical or radiation. Expression studies of whole tissue biopsy are severely limited because heterogeneous cell-types result in an averaging of molecular signals masking subtle but important changes in gene expression in any one cell type(s) or group of cells. Accurate gene expression analysis requires the study of specific cell types in their tissue environment but without contamination from surrounding cells. Laser capture microdissection (LCM) is a new technology to isolate morphologically distinct cells from tissue sections. Alternative methods are available for isolating single cells but not yet for their reliable genome-wide expression analyses. The tasks of this feasibility project were to: (1) Develop efficient protocols for laser capture microdissection of cells from tissues identified by antibody label, or morphological stain. (2) Develop reproducible gene-transcript analyses techniques for single cell-types and determine the numbers of cells needed for reliable genome-wide analyses. (3) Validate the technology for epithelial and endothelial cells isolated from the gastrointestinal tract of mice.
Choufani, S; Cytrynbaum, C; Chung, B H Y; Turinsky, A L; Grafodatskaya, D; Chen, Y A; Cohen, A S A; Dupuis, L; Butcher, D T; Siu, M T; Luk, H M; Lo, I F M; Lam, S T S; Caluseriu, O; Stavropoulos, D J; Reardon, W; Mendoza-Londono, R; Brudno, M; Gibson, W T; Chitayat, D; Weksberg, R
Sotos syndrome (SS) represents an important human model system for the study of epigenetic regulation; it is an overgrowth/intellectual disability syndrome caused by mutations in a histone methyltransferase, NSD1. As layered epigenetic modifications are often interdependent, we propose that pathogenic NSD1 mutations have a genome-wide impact on the most stable epigenetic mark, DNA methylation (DNAm). By interrogating DNAm in SS patients, we identify a genome-wide, highly significant NSD1(+/-)-specific signature that differentiates pathogenic NSD1 mutations from controls, benign NSD1 variants and the clinically overlapping Weaver syndrome. Validation studies of independent cohorts of SS and controls assigned 100% of these samples correctly. This highly specific and sensitive NSD1(+/-) signature encompasses genes that function in cellular morphogenesis and neuronal differentiation, reflecting cardinal features of the SS phenotype. The identification of SS-specific genome-wide DNAm alterations will facilitate both the elucidation of the molecular pathophysiology of SS and the development of improved diagnostic testing.
Lee, Young; Park, Suyeon; Moon, Sanghoon; Lee, Juyoung; Elston, Robert C; Lee, Woojoo; Won, Sungho
Longitudinal data enables detecting the effect of aging/time, and as a repeated measures design is statistically more efficient compared to cross-sectional data if the correlations between repeated measurements are not large. In particular, when genotyping cost is more expensive than phenotyping cost, the collection of longitudinal data can be an efficient strategy for genetic association analysis. However, in spite of these advantages, genome-wide association studies (GWAS) with longitudinal data have rarely been analyzed taking this into account. In this report, we calculate the required sample size to achieve 80% power at the genome-wide significance level for both longitudinal and cross-sectional data, and compare their statistical efficiency. Furthermore, we analyzed the GWAS of eight phenotypes with three observations on each individual in the Korean Association Resource (KARE). A linear mixed model allowing for the correlations between observations for each individual was applied to analyze the longitudinal data, and linear regression was used to analyze the first observation on each individual as cross-sectional data. We found 12 novel genome-wide significant disease susceptibility loci that were then confirmed in the Health Examination cohort, as well as some significant interactions between age/sex and SNPs.
Rietveld, Cornelius A; Conley, Dalton; Eriksson, Nicholas; Esko, Tõnu; Medland, Sarah E; Vinkhuyzen, Anna A E; Yang, Jian; Boardman, Jason D; Chabris, Christopher F; Dawes, Christopher T; Domingue, Benjamin W; Hinds, David A; Johannesson, Magnus; Kiefer, Amy K; Laibson, David; Magnusson, Patrik K E; Mountain, Joanna L; Oskarsson, Sven; Rostapshova, Olga; Teumer, Alexander; Tung, Joyce Y; Visscher, Peter M; Benjamin, Daniel J; Cesarini, David; Koellinger, Philipp D
A recent genome-wide-association study of educational attainment identified three single-nucleotide polymorphisms (SNPs) whose associations, despite their small effect sizes (each R (2) ≈ 0.02%), reached genome-wide significance (p < 5 × 10(-8)) in a large discovery sample and were replicated in an independent sample (p < .05). The study also reported associations between educational attainment and indices of SNPs called "polygenic scores." In three studies, we evaluated the robustness of these findings. Study 1 showed that the associations with all three SNPs were replicated in another large (N = 34,428) independent sample. We also found that the scores remained predictive (R (2) ≈ 2%) in regressions with stringent controls for stratification (Study 2) and in new within-family analyses (Study 3). Our results show that large and therefore well-powered genome-wide-association studies can identify replicable genetic associations with behavioral traits. The small effect sizes of individual SNPs are likely to be a major contributing factor explaining the striking contrast between our results and the disappointing replication record of most candidate-gene studies.
Cheng, Timothy H T; Thompson, Deborah J; O'Mara, Tracy A; Painter, Jodie N; Glubb, Dylan M; Flach, Susanne; Lewis, Annabelle; French, Juliet D; Freeman-Mills, Luke; Church, David; Gorman, Maggie; Martin, Lynn; Hodgson, Shirley; Webb, Penelope M; Attia, John; Holliday, Elizabeth G; McEvoy, Mark; Scott, Rodney J; Henders, Anjali K; Martin, Nicholas G; Montgomery, Grant W; Nyholt, Dale R; Ahmed, Shahana; Healey, Catherine S; Shah, Mitul; Dennis, Joe; Fasching, Peter A; Beckmann, Matthias W; Hein, Alexander; Ekici, Arif B; Hall, Per; Czene, Kamila; Darabi, Hatef; Li, Jingmei; Dörk, Thilo; Dürst, Matthias; Hillemanns, Peter; Runnebaum, Ingo; Amant, Frederic; Schrauwen, Stefanie; Zhao, Hui; Lambrechts, Diether; Depreeuw, Jeroen; Dowdy, Sean C; Goode, Ellen L; Fridley, Brooke L; Winham, Stacey J; Njølstad, Tormund S; Salvesen, Helga B; Trovik, Jone; Werner, Henrica M J; Ashton, Katie; Otton, Geoffrey; Proietto, Tony; Liu, Tao; Mints, Miriam; Tham, Emma; Li, Mulin Jun; Yip, Shun H; Wang, Junwen; Bolla, Manjeet K; Michailidou, Kyriaki; Wang, Qin; Tyrer, Jonathan P; Dunlop, Malcolm; Houlston, Richard; Palles, Claire; Hopper, John L; Peto, Julian; Swerdlow, Anthony J; Burwinkel, Barbara; Brenner, Hermann; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Chang-Claude, Jenny; Couch, Fergus J; Giles, Graham G; Kristensen, Vessela N; Cox, Angela; Cunningham, Julie M; Pharoah, Paul D P; Dunning, Alison M; Edwards, Stacey L; Easton, Douglas F; Tomlinson, Ian; Spurdle, Amanda B
We conducted a meta-analysis of three endometrial cancer genome-wide association studies (GWAS) and two follow-up phases totaling 7,737 endometrial cancer cases and 37,144 controls of European ancestry. Genome-wide imputation and meta-analysis identified five new risk loci of genome-wide significance at likely regulatory regions on chromosomes 13q22.1 (rs11841589, near KLF5), 6q22.31 (rs13328298, in LOC643623 and near HEY2 and NCOA7), 8q24.21 (rs4733613, telomeric to MYC), 15q15.1 (rs937213, in EIF2AK4, near BMF) and 14q32.33 (rs2498796, in AKT1, near SIVA1). We also found a second independent 8q24.21 signal (rs17232730). Functional studies of the 13q22.1 locus showed that rs9600103 (pairwise r(2) = 0.98 with rs11841589) is located in a region of active chromatin that interacts with the KLF5 promoter region. The rs9600103[T] allele that is protective in endometrial cancer suppressed gene expression in vitro, suggesting that regulation of the expression of KLF5, a gene linked to uterine development, is implicated in tumorigenesis. These findings provide enhanced insight into the genetic and biological basis of endometrial cancer.
Phillips, Mark A.; Long, Anthony D.; Greenspan, Zachary S.; Greer, Lee F.; Burke, Molly K.; Villeponteau, Bryant; Matsagas, Kennedy C.; Rizza, Cristina L.; Mueller, Laurence D.; Rose, Michael R.
Experimental evolutionary genomics now allows biologists to test fundamental theories concerning the genetic basis of adaptation. We have conducted one of the longest laboratory evolution experiments with any sexually-reproducing metazoan, Drosophila melanogaster. We used next-generation resequencing data from this experiment to examine genome-wide patterns of genetic variation over an evolutionary time-scale that approaches 1,000 generations. We also compared measures of variation within and differentiation between our populations to simulations based on a variety of evolutionary scenarios. Our analysis yielded no clear evidence of hard selective sweeps, whereby natural selection acts to increase the frequency of a newly-arising mutation in a population until it becomes fixed. We do find evidence for selection acting on standing genetic variation, as independent replicate populations exhibit similar population-genetic dynamics, without obvious fixation of candidate alleles under selection. A hidden-Markov model test for selection also found widespread evidence for selection. We found more genetic variation genome-wide, and less differentiation between replicate populations genome-wide, than arose in any of our simulated evolutionary scenarios. PMID:28004838
Gottlieb, Daniel J.; Hek, Karin; Chen, Ting-hsu; Watson, Nathaniel F.; Eiriksdottir, Gudny; Byrne, Enda M.; Cornelis, Marilyn; Warby, Simon C.; Bandinelli, Stefania; Cherkas, Lynn; Evans, Daniel S.; Grabe, Hans J.; Lahti, Jari; Li, Man; Lehtimäki, Terho; Lumley, Thomas; Marciante, Kristin D.; Pérusse, Louis; Psaty, Bruce M.; Robbins, John; Tranah, Gregory J.; Vink, Jacqueline M.; Wilk, Jemma B.; Stafford, Jeanette M.; Bellis, Claire; Biffar, Reiner; Bouchard, Claude; Cade, Brian; Curhan, Gary C.; Eriksson, Johan G.; Ewert, Ralf; Ferrucci, Luigi; Fülöp, Tibor; Gehrman, Philip R.; Goodloe, Robert; Harris, Tamara B.; Heath, Andrew C.; Hernandez, Dena; Hofman, Albert; Hottenga, Jouke-Jan; Hunter, David J.; Jensen, Majken K.; Johnson, Andrew D.; Kähönen, Mika; Kao, Linda; Kraft, Peter; Larkin, Emma K.; Lauderdale, Diane S.; Luik, Annemarie I.; Medici, Marco; Montgomery, Grant W.; Palotie, Aarno; Patel, Sanjay R.; Pistis, Giorgio; Porcu, Eleonora; Quaye, Lydia; Raitakari, Olli; Redline, Susan; Rimm, Eric B.; Rotter, Jerome I.; Smith, Albert V.; Spector, Tim D.; Teumer, Alexander; Uitterlinden, André G.; Vohl, Marie-Claude; Widen, Elisabeth; Willemsen, Gonneke; Young, Terry; Zhang, Xiaoling; Liu, Yongmei; Blangero, John; Boomsma, Dorret I.; Gudnason, Vilmundur; Hu, Frank; Mangino, Massimo; Martin, Nicholas G.; O’Connor, George T.; Stone, Katie L.; Tanaka, Toshiko; Viikari, Jorma; Gharib, Sina A.; Punjabi, Naresh M.; Räikkönen, Katri; Völzke, Henry; Mignot, Emmanuel; Tiemeier, Henning
Usual sleep duration is a heritable trait correlated with psychiatric morbidity, cardiometabolic disease and mortality, although little is known about the genetic variants influencing this trait. A genome-wide association study of usual sleep duration was conducted using 18 population-based cohorts totaling 47,180 individuals of European ancestry. Genome-wide significant association was identified at two loci. The strongest is located on chromosome 2, in an intergenic region 35–80 kb upstream from the thyroid-specific transcription factor PAX8 (lowest p=1.1 ×10−9). This finding was replicated in an African-American sample of 4771 individuals (lowest p=9.3 × 10−4). The strongest combined association was at rs1823125 (p=1.5 × 10−10, minor allele frequency 0.26 in the discovery sample, 0.12 in the replication sample), with each copy of the minor allele associated with a sleep duration 3.1 minutes longer per night. The alleles associated with longer sleep duration were associated in previous genome-wide association studies with a more favorable metabolic profile and a lower risk of attention deficit hyperactivity disorder. Understanding the mechanisms underlying these associations may help elucidate biological mechanisms influencing sleep duration and its association with psychiatric, metabolic and cardiovascular disease. PMID:25469926
Müller, Stefanie H; Girard, Simon L; Hopfner, Franziska; Merner, Nancy D; Bourassa, Cynthia V; Lorenz, Delia; Clark, Lorraine N; Tittmann, Lukas; Soto-Ortolaza, Alexandra I; Klebe, Stephan; Hallett, Mark; Schneider, Susanne A; Hodgkinson, Colin A; Lieb, Wolfgang; Wszolek, Zbigniew K; Pendziwiat, Manuela; Lorenzo-Betancor, Oswaldo; Poewe, Werner; Ortega-Cubero, Sara; Seppi, Klaus; Rajput, Alex; Hussl, Anna; Rajput, Ali H; Berg, Daniela; Dion, Patrick A; Wurster, Isabel; Shulman, Joshua M; Srulijes, Karin; Haubenberger, Dietrich; Pastor, Pau; Vilariño-Güell, Carles; Postuma, Ronald B; Bernard, Geneviève; Ladwig, Karl-Heinz; Dupré, Nicolas; Jankovic, Joseph; Strauch, Konstantin; Panisset, Michel; Winkelmann, Juliane; Testa, Claudia M; Reischl, Eva; Zeuner, Kirsten E; Ross, Owen A; Arzberger, Thomas; Chouinard, Sylvain; Deuschl, Günther; Louis, Elan D; Kuhlenbäumer, Gregor; Rouleau, Guy A
We conducted a genome-wide association study of essential tremor, a common movement disorder characterized mainly by a postural and kinetic tremor of the upper extremities. Twin and family history studies show a high heritability for essential tremor. The molecular genetic determinants of essential tremor are unknown. We included 2807 patients and 6441 controls of European descent in our two-stage genome-wide association study. The 59 most significantly disease-associated markers of the discovery stage were genotyped in the replication stage. After Bonferroni correction two markers, one (rs10937625) located in the serine/threonine kinase STK32B and one (rs17590046) in the transcriptional coactivator PPARGC1A were associated with essential tremor. Three markers (rs12764057, rs10822974, rs7903491) in the cell-adhesion molecule CTNNA3 were significant in the combined analysis of both stages. The expression of STK32B was increased in the cerebellar cortex of patients and expression quantitative trait loci database mining showed association between the protective minor allele of rs10937625 and reduced expression in cerebellar cortex. We found no expression differences related to disease status or marker genotype for the other two genes. Replication of two lead single nucleotide polymorphisms of previous small genome-wide association studies (rs3794087 in SLC1A2, rs9652490 in LINGO1) did not confirm the association with essential tremor.
Kanai, Masahiro; Tanaka, Toshihiro; Okada, Yukinori
To assess the statistical significance of associations between variants and traits, genome-wide association studies (GWAS) should employ an appropriate threshold that accounts for the massive burden of multiple testing in the study. Although most studies in the current literature commonly set a genome-wide significance threshold at the level of P=5.0 × 10−8, the adequacy of this value for respective populations has not been fully investigated. To empirically estimate thresholds for different ancestral populations, we conducted GWAS simulations using the 1000 Genomes Phase 3 data set for Africans (AFR), Europeans (EUR), Admixed Americans (AMR), East Asians (EAS) and South Asians (SAS). The estimated empirical genome-wide significance thresholds were Psig=3.24 × 10−8 (AFR), 9.26 × 10−8 (EUR), 1.83 × 10−7 (AMR), 1.61 × 10−7 (EAS) and 9.46 × 10−8 (SAS). We additionally conducted trans-ethnic meta-analyses across all populations (ALL) and all populations except for AFR (ΔAFR), which yielded Psig=3.25 × 10−8 (ALL) and 4.20 × 10−8 (ΔAFR). Our results indicate that the current threshold (P=5.0 × 10−8) is overly stringent for all ancestral populations except for Africans; however, we should employ a more stringent threshold when conducting a meta-analysis, regardless of the presence of African samples. PMID:27305981
Kanai, Masahiro; Tanaka, Toshihiro; Okada, Yukinori
To assess the statistical significance of associations between variants and traits, genome-wide association studies (GWAS) should employ an appropriate threshold that accounts for the massive burden of multiple testing in the study. Although most studies in the current literature commonly set a genome-wide significance threshold at the level of P=5.0 × 10(-8), the adequacy of this value for respective populations has not been fully investigated. To empirically estimate thresholds for different ancestral populations, we conducted GWAS simulations using the 1000 Genomes Phase 3 data set for Africans (AFR), Europeans (EUR), Admixed Americans (AMR), East Asians (EAS) and South Asians (SAS). The estimated empirical