Science.gov

Sample records for genome-wide transcriptional analysis

  1. Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes.

    PubMed

    Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G

    2000-12-15

    The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.

  2. Genome-wide analysis of a Wnt1-regulated transcriptional network implicates neurodegenerative pathways.

    PubMed

    Wexler, Eric M; Rosen, Ezra; Lu, Daning; Osborn, Gregory E; Martin, Elizabeth; Raybould, Helen; Geschwind, Daniel H

    2011-10-04

    Wnt proteins are critical to mammalian brain development and function. The canonical Wnt signaling pathway involves the stabilization and nuclear translocation of β-catenin; however, Wnt also signals through alternative, noncanonical pathways. To gain a systems-level, genome-wide view of Wnt signaling, we analyzed Wnt1-stimulated changes in gene expression by transcriptional microarray analysis in cultured human neural progenitor (hNP) cells at multiple time points over a 72-hour time course. We observed a widespread oscillatory-like pattern of changes in gene expression, involving components of both the canonical and the noncanonical Wnt signaling pathways. A higher-order, systems-level analysis that combined independent component analysis, waveform analysis, and mutual information-based network construction revealed effects on pathways related to cell death and neurodegenerative disease. Wnt effectors were tightly clustered with presenilin1 (PSEN1) and granulin (GRN), which cause dominantly inherited forms of Alzheimer's disease and frontotemporal dementia (FTD), respectively. We further explored a potential link between Wnt1 and GRN and found that Wnt1 decreased GRN expression by hNPs. Conversely, GRN knockdown increased WNT1 expression, demonstrating that Wnt and GRN reciprocally regulate each other. Finally, we provided in vivo validation of the in vitro findings by analyzing gene expression data from individuals with FTD. These unbiased and genome-wide analyses provide evidence for a connection between Wnt signaling and the transcriptional regulation of neurodegenerative disease genes.

  3. Genome-wide analysis of the MYB transcription factor superfamily in soybean

    PubMed Central

    2012-01-01

    Background The MYB superfamily constitutes one of the most abundant groups of transcription factors described in plants. Nevertheless, their functions appear to be highly diverse and remain rather unclear. To date, no genome-wide characterization of this gene family has been conducted in a legume species. Here we report the first genome-wide analysis of the whole MYB superfamily in a legume species, soybean (Glycine max), including the gene structures, phylogeny, chromosome locations, conserved motifs, and expression patterns, as well as a comparative genomic analysis with Arabidopsis. Results A total of 244 R2R3-MYB genes were identified and further classified into 48 subfamilies based on a phylogenetic comparative analysis with their putative orthologs, showed both gene loss and duplication events. The phylogenetic analysis showed that most characterized MYB genes with similar functions are clustered in the same subfamily, together with the identification of orthologs by synteny analysis, functional conservation among subgroups of MYB genes was strongly indicated. The phylogenetic relationships of each subgroup of MYB genes were well supported by the highly conserved intron/exon structures and motifs outside the MYB domain. Synonymous nucleotide substitution (dN/dS) analysis showed that the soybean MYB DNA-binding domain is under strong negative selection. The chromosome distribution pattern strongly indicated that genome-wide segmental and tandem duplication contribute to the expansion of soybean MYB genes. In addition, we found that ~ 4% of soybean R2R3-MYB genes had undergone alternative splicing events, producing a variety of transcripts from a single gene, which illustrated the extremely high complexity of transcriptome regulation. Comparative expression profile analysis of R2R3-MYB genes in soybean and Arabidopsis revealed that MYB genes play conserved and various roles in plants, which is indicative of a divergence in function. Conclusions In this

  4. Genome-wide classification and expression analysis of MYB transcription factor families in rice and Arabidopsis

    PubMed Central

    2012-01-01

    Background The MYB gene family comprises one of the richest groups of transcription factors in plants. Plant MYB proteins are characterized by a highly conserved MYB DNA-binding domain. MYB proteins are classified into four major groups namely, 1R-MYB, 2R-MYB, 3R-MYB and 4R-MYB based on the number and position of MYB repeats. MYB transcription factors are involved in plant development, secondary metabolism, hormone signal transduction, disease resistance and abiotic stress tolerance. A comparative analysis of MYB family genes in rice and Arabidopsis will help reveal the evolution and function of MYB genes in plants. Results A genome-wide analysis identified at least 155 and 197 MYB genes in rice and Arabidopsis, respectively. Gene structure analysis revealed that MYB family genes possess relatively more number of introns in the middle as compared with C- and N-terminal regions of the predicted genes. Intronless MYB-genes are highly conserved both in rice and Arabidopsis. MYB genes encoding R2R3 repeat MYB proteins retained conserved gene structure with three exons and two introns, whereas genes encoding R1R2R3 repeat containing proteins consist of six exons and five introns. The splicing pattern is similar among R1R2R3 MYB genes in Arabidopsis. In contrast, variation in splicing pattern was observed among R1R2R3 MYB members of rice. Consensus motif analysis of 1kb upstream region (5′ to translation initiation codon) of MYB gene ORFs led to the identification of conserved and over-represented cis-motifs in both rice and Arabidopsis. Real-time quantitative RT-PCR analysis showed that several members of MYBs are up-regulated by various abiotic stresses both in rice and Arabidopsis. Conclusion A comprehensive genome-wide analysis of chromosomal distribution, tandem repeats and phylogenetic relationship of MYB family genes in rice and Arabidopsis suggested their evolution via duplication. Genome-wide comparative analysis of MYB genes and their expression analysis

  5. Genome-wide location analysis reveals a role for Sub1 in RNA polymerase III transcription

    PubMed Central

    Tavenet, Arounie; Suleau, Audrey; Dubreuil, Géraldine; Ferrari, Roberto; Ducrot, Cécile; Michaut, Magali; Aude, Jean-Christophe; Dieci, Giorgio; Lefebvre, Olivier; Conesa, Christine; Acker, Joël

    2009-01-01

    Human PC4 and the yeast ortholog Sub1 have multiple functions in RNA polymerase II transcription. Genome-wide mapping revealed that Sub1 is present on Pol III-transcribed genes. Sub1 was found to interact with components of the Pol III transcription system and to stimulate the initiation and reinitiation steps in a system reconstituted with all recombinant factors. Sub1 was required for optimal Pol III gene transcription in exponentially growing cells. PMID:19706510

  6. Genome-wide characterization of transcriptional start sites in humans by integrative transcriptome analysis

    PubMed Central

    Yamashita, Riu; Sathira, Nuankanya P.; Kanai, Akinori; Tanimoto, Kousuke; Arauchi, Takako; Tanaka, Yoshiaki; Hashimoto, Shin-ichi; Sugano, Sumio; Nakai, Kenta; Suzuki, Yutaka

    2011-01-01

    We performed a genome-wide analysis of transcriptional start sites (TSSs) in human genes by multifaceted use of a massively parallel sequencer. By analyzing 800 million sequences that were obtained from various types of transcriptome analyses, we characterized 140 million TSS tags in 12 human cell types. Despite the large number of TSS clusters (TSCs), the number of TSCs was observed to decrease sharply with increasing expression levels. Highly expressed TSCs exhibited several characteristic features: Nucleosome-seq analysis revealed highly ordered nucleosome structures, ChIP-seq analysis detected clear RNA polymerase II binding signals in their surrounding regions, evaluations of previously sequenced and newly shotgun-sequenced complete cDNA sequences showed that they encode preferable transcripts for protein translation, and RNA-seq analysis of polysome-incorporated RNAs yielded direct evidence that those transcripts are actually translated into proteins. We also demonstrate that integrative interpretation of transcriptome data is essential for the selection of putative alternative promoter TSCs, two of which also have protein consequences. Furthermore, discriminative chromatin features that separate TSCs at different expression levels were found for both genic TSCs and intergenic TSCs. The collected integrative information should provide a useful basis for future biological characterization of TSCs. PMID:21372179

  7. Genome Wide Transcriptional Profile Analysis of Vitis amurensis and Vitis vinifera in Response to Cold Stress

    PubMed Central

    Xin, Haiping; Zhu, Wei; Wang, Lina; Xiang, Yue; Fang, Linchuan; Li, Jitao; Sun, Xiaoming; Wang, Nian; Londo, Jason P.; Li, Shaohua

    2013-01-01

    Grape is one of the most important fruit crops worldwide. The suitable geographical locations and productivity of grapes are largely limited by temperature. Vitis amurensis is a wild grapevine species with remarkable cold-tolerance, exceeding that of Vitis vinifera, the dominant cultivated species of grapevine. However, the molecular mechanisms that contribute to the enhanced freezing tolerance of V. amurensis remain unknown. Here we used deep sequencing data from restriction endonuclease-generated cDNA fragments to evaluate the whole genome wide modification of transcriptome of V. amurensis under cold treatment. Vitis vinifera cv. Muscat of Hamburg was used as control to help investigate the distinctive features of V. amruensis in responding to cold stress. Approximately 9 million tags were sequenced from non-cold treatment (NCT) and cold treatment (CT) cDNA libraries in each species of grapevine sampled from shoot apices. Alignment of tags into V. vinifera cv. Pinot noir (PN40024) annotated genome identified over 15,000 transcripts in each library in V. amruensis and more than 16,000 in Muscat of Hamburg. Comparative analysis between NCT and CT libraries indicate that V. amurensis has fewer differential expressed genes (DEGs, 1314 transcripts) than Muscat of Hamburg (2307 transcripts) when exposed to cold stress. Common DEGs (408 transcripts) suggest that some genes provide fundamental roles during cold stress in grapes. The most robust DEGs (more than 20-fold change) also demonstrated significant differences between two kinds of grapevine, indicating that cold stress may trigger species specific pathways in V. amurensis. Functional categories of DEGs indicated that the proportion of up-regulated transcripts related to metabolism, transport, signal transduction and transcription were more abundant in V. amurensis. Several highly expressed transcripts that were found uniquely accumulated in V. amurensis are discussed in detail. This subset of unique candidate

  8. Genome wide transcriptional profile analysis of Vitis amurensis and Vitis vinifera in response to cold stress.

    PubMed

    Xin, Haiping; Zhu, Wei; Wang, Lina; Xiang, Yue; Fang, Linchuan; Li, Jitao; Sun, Xiaoming; Wang, Nian; Londo, Jason P; Li, Shaohua

    2013-01-01

    Grape is one of the most important fruit crops worldwide. The suitable geographical locations and productivity of grapes are largely limited by temperature. Vitis amurensis is a wild grapevine species with remarkable cold-tolerance, exceeding that of Vitis vinifera, the dominant cultivated species of grapevine. However, the molecular mechanisms that contribute to the enhanced freezing tolerance of V. amurensis remain unknown. Here we used deep sequencing data from restriction endonuclease-generated cDNA fragments to evaluate the whole genome wide modification of transcriptome of V. amurensis under cold treatment. Vitis vinifera cv. Muscat of Hamburg was used as control to help investigate the distinctive features of V. amruensis in responding to cold stress. Approximately 9 million tags were sequenced from non-cold treatment (NCT) and cold treatment (CT) cDNA libraries in each species of grapevine sampled from shoot apices. Alignment of tags into V. vinifera cv. Pinot noir (PN40024) annotated genome identified over 15,000 transcripts in each library in V. amruensis and more than 16,000 in Muscat of Hamburg. Comparative analysis between NCT and CT libraries indicate that V. amurensis has fewer differential expressed genes (DEGs, 1314 transcripts) than Muscat of Hamburg (2307 transcripts) when exposed to cold stress. Common DEGs (408 transcripts) suggest that some genes provide fundamental roles during cold stress in grapes. The most robust DEGs (more than 20-fold change) also demonstrated significant differences between two kinds of grapevine, indicating that cold stress may trigger species specific pathways in V. amurensis. Functional categories of DEGs indicated that the proportion of up-regulated transcripts related to metabolism, transport, signal transduction and transcription were more abundant in V. amurensis. Several highly expressed transcripts that were found uniquely accumulated in V. amurensis are discussed in detail. This subset of unique candidate

  9. Genome-wide analysis of plant-specific Dof transcription factor family in tomato.

    PubMed

    Cai, Xiaofeng; Zhang, Yuyang; Zhang, Chanjuan; Zhang, Tingyan; Hu, Tixu; Ye, Jie; Zhang, Junhong; Wang, Taotao; Li, Hanxia; Ye, Zhibiao

    2013-06-01

    The Dof (DNA binding with One Finger) family encoding single zinc finger proteins has been known as a family of plant-specific transcription factors. These transcription factors are involved in a variety of functions of importance for different biological processes in plants. In the current study, we identified 34 Dof family genes in tomato, distributed on 11 chromosomes. A complete overview of SlDof genes in tomato is presented, including the gene structures, chromosome locations, phylogeny, protein motifs and evolution pattern. Phylogenetic analysis of 34 SlDof proteins resulted in four classes constituting six clusters. In addition, a comparative analysis between these genes in tomato, Arabidopsis and rice was also performed. The tomato Dof family expansion has been dated to recent duplication events, and segmental duplication is predominant for the SlDof genes. Furthermore, the SlDof genes displayed differential expression either in their transcript abundance or in their expression patterns under normal growth conditions. This is the first step towards genome-wide analyses of the Dof genes in tomato. Our study provides a very useful reference for cloning and functional analysis of the members of this gene family in tomato and other species.

  10. Genome-Wide Transcriptional Analysis of Genes Associated with Acute Desiccation Stress in Anopheles gambiae

    PubMed Central

    Wang, Mei-Hui; Marinotti, Osvaldo; Vardo-Zalik, Anne; Boparai, Rajni; Yan, Guiyun

    2011-01-01

    Malaria transmission in sub-Saharan Africa varies seasonally in intensity. Outbreaks of malaria occur after the beginning of the rainy season, whereas, during the dry season, reports of the disease are less frequent. Anopheles gambiae mosquitoes, the main malaria vector, are observed all year long but their densities are low during the dry season that generally lasts several months. Aestivation, seasonal migration, and local adaptation have been suggested as mechanisms that enable mosquito populations to persist through the dry season. Studies of chromosomal inversions have shown that inversions 2La, 2Rb, 2Rc, 2Rd, and 2Ru are associated with various physiological changes that confer aridity resistance. However, little is known about how phenotypic plasticity responds to seasonally dry conditions. This study examined the effects of desiccation stress on transcriptional regulation in An. gambiae. We exposed female An. gambiae G3 mosquitoes to acute desiccation and conducted a genome-wide analysis of their transcriptomes using the Affymetrix Plasmodium/Anopheles Genome Array. The transcription of 248 genes (1.7% of all transcripts) was significantly affected in all experimental conditions, including 96 with increased expression and 152 with decreased expression. In general, the data indicate a reduction in the metabolic rate of mosquitoes exposed to desiccation. Transcripts accumulated at higher levels during desiccation are associated with oxygen radical detoxification, DNA repair and stress responses. The proportion of transcripts within 2La and 2Rs (2Rb, 2Rc, 2Rd, and 2Ru) (67/248, or 27%) is similar to the percentage of transcripts located within these inversions (31%). These data may be useful in efforts to elucidate the role of chromosomal inversions in aridity tolerance. The scope of application of the anopheline genome demonstrates that examining transcriptional activity in relation to genotypic adaptations greatly expands the number of candidate regions

  11. Genome-wide identification and expression analysis of TCP transcription factors in Gossypium raimondii.

    PubMed

    Ma, Jun; Wang, Qinglian; Sun, Runrun; Xie, Fuliang; Jones, Don C; Zhang, Baohong

    2014-10-16

    Plant-specific TEOSINTE-BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factors play versatile functions in multiple aspects of plant growth and development. However, no systematical study has been performed in cotton. In this study, we performed for the first time the genome-wide identification and expression analysis of the TCP transcription factor family in Gossypium raimondii. A total of 38 non-redundant cotton TCP encoding genes were identified. The TCP transcription factors were divided into eleven subgroups based on phylogenetic analysis. Most TCP genes within the same subfamily demonstrated similar exon and intron organization and the motif structures were highly conserved among the subfamilies. Additionally, the chromosomal distribution pattern revealed that TCP genes were unevenly distributed across 11 out of the 13 chromosomes; segmental duplication is a predominant duplication event for TCP genes and the major contributor to the expansion of TCP gene family in G. raimondii. Moreover, the expression profiles of TCP genes shed light on their functional divergence.

  12. Genome-wide identification and expression analysis of TCP transcription factors in Gossypium raimondii

    PubMed Central

    Ma, Jun; Wang, Qinglian; Sun, Runrun; Xie, Fuliang; Jones, Don C.; Zhang, Baohong

    2014-01-01

    Plant-specific TEOSINTE-BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factors play versatile functions in multiple aspects of plant growth and development. However, no systematical study has been performed in cotton. In this study, we performed for the first time the genome-wide identification and expression analysis of the TCP transcription factor family in Gossypium raimondii. A total of 38 non-redundant cotton TCP encoding genes were identified. The TCP transcription factors were divided into eleven subgroups based on phylogenetic analysis. Most TCP genes within the same subfamily demonstrated similar exon and intron organization and the motif structures were highly conserved among the subfamilies. Additionally, the chromosomal distribution pattern revealed that TCP genes were unevenly distributed across 11 out of the 13 chromosomes; segmental duplication is a predominant duplication event for TCP genes and the major contributor to the expansion of TCP gene family in G. raimondii. Moreover, the expression profiles of TCP genes shed light on their functional divergence. PMID:25322260

  13. Genome-Wide Analysis of the bZIP Transcription Factors in Cucumber

    PubMed Central

    Baloglu, Mehmet Cengiz; Eldem, Vahap; Hajyzadeh, Mortaza; Unver, Turgay

    2014-01-01

    bZIP proteins are one of the largest transcriptional regulators playing crucial roles in plant development, physiological processes, and biotic/abiotic stress responses. Despite the availability of recently published draft genome sequence of Cucumis sativus, no comprehensive investigation of these family members has been presented for cucumber. We have identified 64 bZIP transcription factor-encoding genes in the cucumber genome. Based on structural features of their encoded proteins, CsbZIP genes could be classified into 6 groups. Cucumber bZIP genes were expanded mainly by segmental duplication rather than tandem duplication. Although segmental duplication rate of the CsbZIP genes was lower than that of Arabidopsis, rice and sorghum, it was observed as a common expansion mechanism. Some orthologous relationships and chromosomal rearrangements were observed according to comparative mapping analysis with other species. Genome-wide expression analysis of bZIP genes indicated that 64 CsbZIP genes were differentially expressed in at least one of the ten sampled tissues. A total of 4 CsbZIP genes displayed higher expression values in leaf, flowers and root tissues. The in silico micro-RNA (miRNA) and target transcript analyses identified that a total of 21 CsbZIP genes were targeted by 38 plant miRNAs. CsbZIP20 and CsbZIP22 are the most targeted by miR165 and miR166 family members, respectively. We also analyzed the expression of ten CsbZIP genes in the root and leaf tissues of drought-stressed cucumber using quantitative RT-PCR. All of the selected CsbZIP genes were measured as increased in root tissue at 24th h upon PEG treatment. Contrarily, the down-regulation was observed in leaf tissues of all analyzed CsbZIP genes. CsbZIP12 and CsbZIP44 genes showed gradual induction of expression in root tissues during time points. This genome-wide identification and expression profiling provides new opportunities for cloning and functional analyses, which may be used in

  14. Metabolic network driven analysis of genome-wide transcription data from Aspergillus nidulans

    PubMed Central

    David, Helga; Hofmann, Gerald; Oliveira, Ana Paula; Jarmer, Hanne; Nielsen, Jens

    2006-01-01

    Background Aspergillus nidulans (the asexual form of Emericella nidulans) is a model organism for aspergilli, which are an important group of filamentous fungi that encompasses human and plant pathogens as well as industrial cell factories. Aspergilli have a highly diversified metabolism and, because of their medical, agricultural and biotechnological importance, it would be valuable to have an understanding of how their metabolism is regulated. We therefore conducted a genome-wide transcription analysis of A. nidulans grown on three different carbon sources (glucose, glycerol, and ethanol) with the objective of identifying global regulatory structures. Furthermore, we reconstructed the complete metabolic network of this organism, which resulted in linking 666 genes to metabolic functions, as well as assigning metabolic roles to 472 genes that were previously uncharacterized. Results Through combination of the reconstructed metabolic network and the transcription data, we identified subnetwork structures that pointed to coordinated regulation of genes that are involved in many different parts of the metabolism. Thus, for a shift from glucose to ethanol, we identified coordinated regulation of the complete pathway for oxidation of ethanol, as well as upregulation of gluconeogenesis and downregulation of glycolysis and the pentose phosphate pathway. Furthermore, on change in carbon source from glucose to ethanol, the cells shift from using the pentose phosphate pathway as the major source of NADPH (nicotinamide adenine dinucleotide phosphatase, reduced form) for biosynthesis to use of the malic enzyme. Conclusion Our analysis indicates that some of the genes are regulated by common transcription factors, making it possible to establish new putative links between known transcription factors and genes through clustering. PMID:17107606

  15. A genome-wide transcription analysis of a fungal riboflavin overproducer.

    PubMed

    Karos, Marvin; Vilariño, Cristina; Bollschweiler, Claus; Revuelta, Jose Luis

    2004-09-30

    The production of many fine chemicals such as vitamins and amino acids is carried out in bioreactors using microorganisms. Usually, these strains are developed from wild-type organisms by classical mutation and selection. After several generations of strain improvement, no further enhancement can be achieved. Therefore, metabolic engineering (ME) is a rational approach to optimise such producer organisms beyond this point, or for starting all over from the beginning. Metabolic Engineering involves detailed analysis of the organism's metabolic and genetic properties, leading to the identification of new target genes. The fungal riboflavin overproducer Ashbya gossypii converts vegetable oil to vitamin B2 in a "one-step reaction". The productivity and selectivity of this microorganism have been optimised significantly over the years, first following a classical approach and now a rational one. The improvement is based on our understanding of vitamin B2 metabolism. We have been able to selectively enhance the pathways that are necessary for the formation of riboflavin and to inhibit those leading to unwanted side products. New targets for further improvements of this process have been found using a genome-wide transcript expression analysis; namely massive parallel signature sequencing (MPSS). With this analysis even completely unknown genes can be used for strain improvement.

  16. Genome-wide analysis of coordinated transcript abundance during seed development in different Brassica rapa morphotypes.

    PubMed

    Basnet, Ram Kumar; Moreno-Pachon, Natalia; Lin, Ke; Bucher, Johan; Visser, Richard G F; Maliepaard, Chris; Bonnema, Guusje

    2013-12-01

    Brassica seeds are important as basic units of plant growth and sources of vegetable oil. Seed development is regulated by many dynamic metabolic processes controlled by complex networks of spatially and temporally expressed genes. We conducted a global microarray gene co-expression analysis by measuring transcript abundance of developing seeds from two diverse B. rapa morphotypes: a pak choi (leafy-type) and a yellow sarson (oil-type), and two of their doubled haploid (DH) progenies, (1) to study the timing of metabolic processes in developing seeds, (2) to explore the major transcriptional differences in developing seeds of the two morphotypes, and (3) to identify the optimum stage for a genetical genomics study in B. rapa seed. Seed developmental stages were similar in developing seeds of pak choi and yellow sarson of B. rapa; however, the colour of embryo and seed coat differed among these two morphotypes. In this study, most transcriptional changes occurred between 25 and 35 DAP, which shows that the timing of seed developmental processes in B. rapa is at later developmental stages than in the related species B. napus. Using a Weighted Gene Co-expression Network Analysis (WGCNA), we identified 47 "gene modules", of which 27 showed a significant association with temporal and/or genotypic variation. An additional hierarchical cluster analysis identified broad spectra of gene expression patterns during seed development. The predominant variation in gene expression was according to developmental stages rather than morphotype differences. Since lipids are the major storage compounds of Brassica seeds, we investigated in more detail the regulation of lipid metabolism. Four co-regulated gene clusters were identified with 17 putative cis-regulatory elements predicted in their 1000 bp upstream region, either specific or common to different lipid metabolic pathways. This is the first study of genome-wide profiling of transcript abundance during seed development in B

  17. Genome wide analysis of transcript levels after perturbation of the EGFR pathway in the Drosophila ovary.

    PubMed

    Jordan, Katherine C; Hatfield, Steven D; Tworoger, Michael; Ward, Ellen J; Fischer, Karin A; Bowers, Stuart; Ruohola-Baker, Hannele

    2005-03-01

    Defects in the epidermal growth factor receptor (EGFR) pathway can lead to aggressive tumor formation. Activation of this pathway during normal development produces multiple outcomes at the cellular level, leading to cellular differentiation and cell cycle activation. To elucidate the downstream events induced by this pathway, we used genome-wide cDNA microarray technology to identify potential EGFR targets in Drosophila oogenesis. We focused on genes for which the transcriptional responses due to EGFR pathway activation and inactivation were in opposite directions, as this is expected for genes that are directly regulated by the pathway in this tissue type. We perturbed the EGFR pathway in epithelial follicle cells using seven different genetic backgrounds. To activate the pathway, we overexpressed an activated form of the EGFR (UAS-caEGFR), and an activated form of the signal transducer Raf (UAS-caRaf); we also over- or ectopically expressed the downstream homeobox transcription factor Mirror (UAS-mirr) and the ligand-activating serine protease Rhomboid (UAS-rho). To reduce pathway activity we used loss-of-function mutations in the ligand (gurken) and receptor (torpedo). From microarrays containing 6,255 genes, we found 454 genes that responded in an opposite manner in gain-of-function and loss-of-function conditions among which are many Wingless signaling pathway components. Further analysis of two such components, sugarless and pangolin, revealed a function for these genes in late follicle cell patterning. Of interest, components of other signaling pathways were also enriched in the EGFR target group, suggesting that one reason for the pleiotropic effects seen with EGFR activity in cancer progression and development may be its ability to regulate many other signaling pathways.

  18. Genome-wide analysis of WRKY transcription factors in Solanum lycopersicum.

    PubMed

    Huang, Shengxiong; Gao, Yongfeng; Liu, Jikai; Peng, Xiaoli; Niu, Xiangli; Fei, Zhangjun; Cao, Shuqing; Liu, Yongsheng

    2012-06-01

    The WRKY transcription factors have been implicated in multiple biological processes in plants, especially in regulating defense against biotic and abiotic stresses. However, little information is available about the WRKYs in tomato (Solanum lycopersicum). The recent release of the whole-genome sequence of tomato allowed us to perform a genome-wide investigation for tomato WRKY proteins, and to compare these positively identified proteins with their orthologs in model plants, such as Arabidopsis and rice. In the present study, based on the recently released tomato whole-genome sequences, we identified 81 SlWRKY genes that were classified into three main groups, with the second group further divided into five subgroups. Depending on WRKY domains' sequences derived from tomato, Arabidopsis and rice, construction of a phylogenetic tree demonstrated distinct clustering and unique gene expansion of WRKY genes among the three species. Genome mapping analysis revealed that tomato WRKY genes were enriched on several chromosomes, especially on chromosome 5, and 16 % of the family members were tandemly duplicated genes. The tomato WRKYs from each group were shown to share similar motif compositions. Furthermore, tomato WRKY genes showed distinct temporal and spatial expression patterns in different developmental processes and in response to various biotic and abiotic stresses. The expression of 18 selected tomato WRKY genes in response to drought and salt stresses and Pseudomonas syringae invasion, respectively, was validated by quantitative RT-PCR. Our results will provide a platform for functional identification and molecular breeding study of WRKY genes in tomato and probably other Solanaceae plants.

  19. Genome-wide transcript analysis of maize hybrids: allelic additive gene expression and yield heterosis.

    PubMed

    Guo, Mei; Rupe, Mary A; Yang, Xiaofeng; Crasta, Oswald; Zinselmeier, Christopher; Smith, Oscar S; Bowen, Ben

    2006-09-01

    Heterosis, or hybrid vigor, has been widely exploited in plant breeding for many decades, but the molecular mechanisms underlying the phenomenon remain unknown. In this study, we applied genome-wide transcript profiling to gain a global picture of the ways in which a large proportion of genes are expressed in the immature ear tissues of a series of 16 maize hybrids that vary in their degree of heterosis. Key observations include: (1) the proportion of allelic additively expressed genes is positively associated with hybrid yield and heterosis; (2) the proportion of genes that exhibit a bias towards the expression level of the paternal parent is negatively correlated with hybrid yield and heterosis; and (3) there is no correlation between the over- or under-expression of specific genes in maize hybrids with either yield or heterosis. The relationship of the expression patterns with hybrid performance is substantiated by analysis of a genetically improved modern hybrid (Pioneer hybrid 3394) versus a less improved older hybrid (Pioneer hybrid 3306) grown at different levels of plant density stress. The proportion of allelic additively expressed genes is positively associated with the modern high yielding hybrid, heterosis and high yielding environments, whereas the converse is true for the paternally biased gene expression. The dynamic changes of gene expression in hybrids responding to genotype and environment may result from differential regulation of the two parental alleles. Our findings suggest that differential allele regulation may play an important role in hybrid yield or heterosis, and provide a new insight to the molecular understanding of the underlying mechanisms of heterosis.

  20. Genome-Wide Analysis of Light- and Temperature-Entrained Circadian Transcripts in Caenorhabditis elegans

    PubMed Central

    van der Linden, Alexander M.; Beverly, Matthew; Kadener, Sebastian; Rodriguez, Joseph; Wasserman, Sara; Rosbash, Michael; Sengupta, Piali

    2010-01-01

    Most organisms have an endogenous circadian clock that is synchronized to environmental signals such as light and temperature. Although circadian rhythms have been described in the nematode Caenorhabditis elegans at the behavioral level, these rhythms appear to be relatively non-robust. Moreover, in contrast to other animal models, no circadian transcriptional rhythms have been identified. Thus, whether this organism contains a bona fide circadian clock remains an open question. Here we use genome-wide expression profiling experiments to identify light- and temperature-entrained oscillating transcripts in C. elegans. These transcripts exhibit rhythmic expression with temperature-compensated 24-h periods. In addition, their expression is sustained under constant conditions, suggesting that they are under circadian regulation. Light and temperature cycles strongly drive gene expression and appear to entrain largely nonoverlapping gene sets. We show that mutations in a cyclic nucleotide-gated channel required for sensory transduction abolish both light- and temperature-entrained gene expression, implying that environmental cues act cell nonautonomously to entrain circadian rhythms. Together, these findings demonstrate circadian-regulated transcriptional rhythms in C. elegans and suggest that further analyses in this organism will provide new information about the evolution and function of this biological clock. PMID:20967231

  1. Genome wide analysis of human genes transcriptionally and post-transcriptionally regulated by the HTLV-I protein p30

    PubMed Central

    Taylor, John M; Ghorbel, Sofiane; Nicot, Christophe

    2009-01-01

    Background Human T-cell leukemia virus type 1 (HTLV-I) is a human retrovirus that is etiologically linked to adult T-cell leukemia (ATL), an aggressive and fatal lymphoproliferative disease. The viral transactivator, Tax, is thought to play an important role during the initial stages of CD4+ T-cell immortalization by HTLV-1. Tax has been shown to activate transcription through CREB/ATF and NF-KB, and to alter numerous signaling pathways. These pleiotropic effects of Tax modify the expression of a wide array of cellular genes. Another viral protein encoded by HTLV-I, p30, has been shown to affect virus replication at the transcriptional and posttranscriptional levels. Little is currently known regarding the effect of p30 on the expression and nuclear export of cellular host mRNA transcripts. Identification of these RNA may reveal new targets and increase our understanding of HTLV-I pathogenesis. In this study, using primary peripheral blood mononuclear cells, we report a genome wide analysis of human genes transcriptionally and post-transcriptionally regulated by the HTLV-I protein p30. Results Using microarray analysis, we analyzed total and cytoplasmic cellular mRNA transcript levels isolated from PBMCs to assess the effect of p30 on cellular RNA transcript expression and their nuclear export. We report p30-dependent transcription resulting in the 2.5 fold up-regulation of 15 genes and the down-regulation of 65 human genes. We further tested nuclear export of cellular mRNA and found that p30 expression also resulted in a 2.5 fold post-transcriptional down-regulation of 90 genes and the up-regulation of 33 genes. Conclusion Overall, our study describes that expression of the HTLV-I protein p30 both positively and negatively alters the expression of cellular transcripts. Our study identifies for the first time the cellular genes for which nuclear export is affected by p30. These results suggest that p30 may possess a more global function with respect to m

  2. A genome-wide transcriptional analysis of morphology determination in Candida albicans

    PubMed Central

    Carlisle, Patricia L.; Kadosh, David

    2013-01-01

    Candida albicans, the most common cause of human fungal infections, undergoes a reversible morphological transition from yeast to pseudohyphal and hyphal filaments, which is required for virulence. For many years, the relationship among global gene expression patterns associated with determination of specific C. albicans morphologies has remained obscure. Using a strain that can be genetically manipulated to sequentially transition from yeast to pseudohyphae to hyphae in the absence of complex environmental cues and upstream signaling pathways, we demonstrate by whole-genome transcriptional profiling that genes associated with pseudohyphae represent a subset of those associated with hyphae and are generally expressed at lower levels. Our results also strongly suggest that in addition to dosage, extended duration of filament-specific gene expression is sufficient to drive the C. albicans yeast-pseudohyphal-hyphal transition. Finally, we describe the first transcriptional profile of the C. albicans reverse hyphal-pseudohyphal-yeast transition and demonstrate that this transition involves not only down-regulation of known hyphal-specific, genes but also differential expression of additional genes that have not previously been associated with the forward transition, including many involved in protein synthesis. These findings provide new insight into genome-wide expression patterns important for determining fungal morphology and suggest that in addition to similarities, there are also fundamental differences in global gene expression as pathogenic filamentous fungi undergo forward and reverse morphological transitions. PMID:23242994

  3. Genome-wide analysis of LXRα activation reveals new transcriptional networks in human atherosclerotic foam cells.

    PubMed

    Feldmann, Radmila; Fischer, Cornelius; Kodelja, Vitam; Behrens, Sarah; Haas, Stefan; Vingron, Martin; Timmermann, Bernd; Geikowski, Anne; Sauer, Sascha

    2013-04-01

    Increased physiological levels of oxysterols are major risk factors for developing atherosclerosis and cardiovascular disease. Lipid-loaded macrophages, termed foam cells, are important during the early development of atherosclerotic plaques. To pursue the hypothesis that ligand-based modulation of the nuclear receptor LXRα is crucial for cell homeostasis during atherosclerotic processes, we analysed genome-wide the action of LXRα in foam cells and macrophages. By integrating chromatin immunoprecipitation-sequencing (ChIP-seq) and gene expression profile analyses, we generated a highly stringent set of 186 LXRα target genes. Treatment with the nanomolar-binding ligand T0901317 and subsequent auto-regulatory LXRα activation resulted in sequence-dependent sharpening of the genome-binding patterns of LXRα. LXRα-binding loci that correlated with differential gene expression revealed 32 novel target genes with potential beneficial effects, which in part explained the implications of disease-associated genetic variation data. These observations identified highly integrated LXRα ligand-dependent transcriptional networks, including the APOE/C1/C4/C2-gene cluster, which contribute to the reversal of cholesterol efflux and the dampening of inflammation processes in foam cells to prevent atherogenesis.

  4. Genome-wide identification, classification and transcriptional analysis of nitrate and ammonium transporters in Coffea.

    PubMed

    Santos, Tiago Benedito Dos; Lima, Joni Esrom; Felicio, Mariane Silva; Soares, João Danillo Moura; Domingues, Douglas Silva

    2017-04-10

    Nitrogen (N) is quantitatively the main nutrient required by coffee plants, with acquisition mainly by the roots and mostly exported to coffee beans. Nitrate (NO3-) and ammonium (NH4+) are the most important inorganic sources for N uptake. Several N transporters encoded by different gene families mediate the uptake of these compounds. They have an important role in source preference for N uptake in the root system. In this study, we performed a genome-wide analysis, including in silico expression and phylogenetic analyses of AMT1, AMT2, NRT1/PTR, and NRT2 transporters in the recently sequenced Coffea canephora genome. We analyzed the expression of six selected transporters in Coffea arabica roots submitted to N deficiency. N source preference was also analyzed in C. arabica using isotopes. C. canephora N transporters follow the patterns observed for most eudicots, where each member of the AMT and NRT families has a particular role in N mobilization, and where some of these are modulated by N deficiency. Despite the prevalence of putative nitrate transporters in the Coffea genome, ammonium was the preferential inorganic N source for N-starved C. arabica roots. This data provides an important basis for fundamental and applied studies to depict molecular mechanisms involved in N uptake in coffee trees.

  5. Genome-wide analysis of transcription factors involved in maize embryonic callus formation.

    PubMed

    Ge, Fei; Luo, Xu; Huang, Xing; Zhang, Yanling; He, Xiujing; Liu, Min; Lin, Haijian; Peng, Huanwei; Li, Lujiang; Zhang, Zhiming; Pan, Guangtang; Shen, Yaou

    2016-12-01

    In this study, a maize inbred line with a strong capacity to induce embryonic callus, 18-599R, was used to analyze the transcription factors expressed during embryonic callus formation. A total of 1180 transcription factors were found to be expressed during three key stages of callus induction. Of these, compared with control, 361, 346 and 328 transcription factors were significantly downregulated during stages I, II and III, respectively. In contrast, 355, 372 and 401 transcription factors (TFs) were upregulated during the respective stages. We constructed a transcription factor-mediated regulatory network and found that plant hormone signal transduction was the pathway most significantly enriched among TFs. This pathway includes 48 TFs regulating cell enlargement, cell differentiation, cell division and cell dedifferentiation via the response to plant hormones. Through real-time polymerase chain reaction (PCR) and degradome sequencing, we identified 23 transcription factors that are regulated by miRNA. Through further analysis, ZmMYB138, a member of the MYB transcription factor family localized in the nucleus, was verified to promote embryonic callus formation in the maize embryo through GA signal transduction.

  6. Genome-Wide Identification and Expression Analysis of the NAC Transcription Factor Family in Cassava.

    PubMed

    Hu, Wei; Wei, Yunxie; Xia, Zhiqiang; Yan, Yan; Hou, Xiaowan; Zou, Meiling; Lu, Cheng; Wang, Wenquan; Peng, Ming

    2015-01-01

    NAC [no apical meristem (NAM), Arabidopsis transcription activation factor [ATAF1/2] and cup-shaped cotyledon (CUC2)] proteins is one of the largest groups of plant specific transcription factors and plays a crucial role in plant growth, development, and adaption to the environment. Currently, no information is known about the NAC family in cassava. In this study, 96 NAC genes (MeNACs) were identified from the cassava genome. Phylogenetic analysis of the NACs from cassava and Arabidopsis showed that MeNAC proteins can be clustered into 16 subgroups. Gene structure analysis found that the number of introns of MeNAC genes varied from 0 to 5, with the majority of MeNAC genes containing two introns, indicating a small gene structure diversity of cassava NAC genes. Conserved motif analysis revealed that all of the identified MeNACs had the conserved NAC domain and/or NAM domain. Global expression analysis suggested that MeNAC genes exhibited different expression profiles in different tissues between wild subspecies and cultivated varieties, indicating their involvement in the functional diversity of different accessions. Transcriptome analysis demonstrated that MeNACs had a widely transcriptional response to drought stress and that they had differential expression profiles in different accessions, implying their contribution to drought stress resistance in cassava. Finally, the expression of twelve MeNAC genes was analyzed under osmotic, salt, cold, ABA, and H2O2 treatments, indicating that cassava NACs may represent convergence points of different signaling pathways. Taken together, this work found some excellent tissue-specific and abiotic stress-responsive candidate MeNAC genes, which would provide a solid foundation for functional investigation of the NAC family, crop improvement and improved understanding of signal transduction in plants. These data bring new insight on the complexity of the transcriptional control of MeNAC genes and support the hypothesis that

  7. Genome-wide analysis reveals regulatory role of G4 DNA in gene transcription.

    PubMed

    Du, Zhuo; Zhao, Yiqiang; Li, Ning

    2008-02-01

    G-quadruplex or G4 DNA, a four-stranded DNA structure formed in G-rich sequences, has been hypothesized to be a structural motif involved in gene regulation. In this study, we examined the regulatory role of potential G4 DNA motifs (PG4Ms) located in the putative transcriptional regulatory region (TRR, -500 to +500) of genes across the human genome. We found that PG4Ms in the 500-bp region downstream of the annotated transcription start site (TSS; PG4M(D500)) are associated with gene expression. Generally, PG4M(D500)-positive genes are expressed at higher levels than PG4M(D500)-negative genes, and an increased number of PG4M(D500) provides a cumulative effect. This observation was validated by controlling for attributes, including gene family, function, and promoter similarity. We also observed an asymmetric pattern of PG4M(D500) distribution between strands, whereby the frequency of PG4M(D500) in the coding strand is generally higher than that in the template strand. Further analysis showed that the presence of PG4M(D500) and its strand asymmetry are associated with significant enrichment of RNAP II at the putative TRR. On the basis of these results, we propose a model of G4 DNA-mediated stimulation of transcription with the hypothesis that PG4M(D500) contributes to gene transcription by maintaining the DNA in an open conformation, while the asymmetric distribution of PG4M(D500) considerably reduces the probability of blocking the progression of the RNA polymerase complex on the template strand. Our findings provide a comprehensive view of the regulatory function of G4 DNA in gene transcription.

  8. Genome-wide analysis of the DNA-binding with one zinc finger (Dof) transcription factor family in bananas.

    PubMed

    Dong, Chen; Hu, Huigang; Xie, Jianghui

    2016-12-01

    DNA-binding with one finger (Dof) domain proteins are a multigene family of plant-specific transcription factors involved in numerous aspects of plant growth and development. In this study, we report a genome-wide search for Musa acuminata Dof (MaDof) genes and their expression profiles at different developmental stages and in response to various abiotic stresses. In addition, a complete overview of the Dof gene family in bananas is presented, including the gene structures, chromosomal locations, cis-regulatory elements, conserved protein domains, and phylogenetic inferences. Based on the genome-wide analysis, we identified 74 full-length protein-coding MaDof genes unevenly distributed on 11 chromosomes. Phylogenetic analysis with Dof members from diverse plant species showed that MaDof genes can be classified into four subgroups (StDof I, II, III, and IV). The detailed genomic information of the MaDof gene homologs in the present study provides opportunities for functional analyses to unravel the exact role of the genes in plant growth and development.

  9. Genome-wide analysis of growth phase-dependent translational and transcriptional regulation in halophilic archaea

    PubMed Central

    2007-01-01

    Background Differential expression of genes can be regulated on many different levels. Most global studies of gene regulation concentrate on transcript level regulation, and very few global analyses of differential translational efficiencies exist. The studies have revealed that in Saccharomyces cerevisiae, Arabidopsis thaliana, and human cell lines translational regulation plays a significant role. Additional species have not been investigated yet. Particularly, until now no global study of translational control with any prokaryotic species was available. Results A global analysis of translational control was performed with two haloarchaeal model species, Halobacterium salinarum and Haloferax volcanii. To identify differentially regulated genes, exponentially growing and stationary phase cells were compared. More than 20% of H. salinarum transcripts are translated with non-average efficiencies. By far the largest group is comprised of genes that are translated with above-average efficiency specifically in exponential phase, including genes for many ribosomal proteins, RNA polymerase subunits, enzymes, and chemotaxis proteins. Translation of 1% of all genes is specifically repressed in either of the two growth phases. For comparison, DNA microarrays were also used to identify differential transcriptional regulation in H. salinarum, and 17% of all genes were found to have non-average transcript levels in exponential versus stationary phase. In H. volcanii, 12% of all genes are translated with non-average efficiencies. The overlap with H. salinarum is negligible. In contrast to H. salinarum, 4.6% of genes have non-average translational efficiency in both growth phases, and thus they might be regulated by other stimuli than growth phase. Conclusion For the first time in any prokaryotic species it was shown that a significant fraction of genes is under differential translational control. Groups of genes with different regulatory patterns were discovered. However

  10. An integrated pipeline for the genome-wide analysis of transcription factor binding sites from ChIP-Seq.

    PubMed

    Mercier, Eloi; Droit, Arnaud; Li, Leping; Robertson, Gordon; Zhang, Xuekui; Gottardo, Raphael

    2011-02-16

    ChIP-Seq has become the standard method for genome-wide profiling DNA association of transcription factors. To simplify analyzing and interpreting ChIP-Seq data, which typically involves using multiple applications, we describe an integrated, open source, R-based analysis pipeline. The pipeline addresses data input, peak detection, sequence and motif analysis, visualization, and data export, and can readily be extended via other R and Bioconductor packages. Using a standard multicore computer, it can be used with datasets consisting of tens of thousands of enriched regions. We demonstrate its effectiveness on published human ChIP-Seq datasets for FOXA1, ER, CTCF and STAT1, where it detected co-occurring motifs that were consistent with the literature but not detected by other methods. Our pipeline provides the first complete set of Bioconductor tools for sequence and motif analysis of ChIP-Seq and ChIP-chip data.

  11. Genome-wide analysis of transcriptional regulators in human HSPCs reveals a densely interconnected network of coding and noncoding genes.

    PubMed

    Beck, Dominik; Thoms, Julie A I; Perera, Dilmi; Schütte, Judith; Unnikrishnan, Ashwin; Knezevic, Kathy; Kinston, Sarah J; Wilson, Nicola K; O'Brien, Tracey A; Göttgens, Berthold; Wong, Jason W H; Pimanda, John E

    2013-10-03

    Genome-wide combinatorial binding patterns for key transcription factors (TFs) have not been reported for primary human hematopoietic stem and progenitor cells (HSPCs), and have constrained analysis of the global architecture of molecular circuits controlling these cells. Here we provide high-resolution genome-wide binding maps for a heptad of key TFs (FLI1, ERG, GATA2, RUNX1, SCL, LYL1, and LMO2) in human CD34(+) HSPCs, together with quantitative RNA and microRNA expression profiles. We catalog binding of TFs at coding genes and microRNA promoters, and report that combinatorial binding of all 7 TFs is favored and associated with differential expression of genes and microRNA in HSPCs. We also uncover a previously unrecognized association between FLI1 and RUNX1 pairing in HSPCs, we establish a correlation between the density of histone modifications that mark active enhancers and the number of overlapping TFs at a peak, we demonstrate bivalent histone marks at promoters of heptad target genes in CD34(+) cells that are poised for later expression, and we identify complex relationships between specific microRNAs and coding genes regulated by the heptad. Taken together, these data reveal the power of integrating multifactor sequencing of chromatin immunoprecipitates with coding and noncoding gene expression to identify regulatory circuits controlling cell identity.

  12. Genome-Wide Transcriptional Analysis of the Phosphate Starvation Stimulon of Bacillus subtilis†

    PubMed Central

    Allenby, Nicholas E. E.; O'Connor, Nicola; Prágai, Zoltán; Ward, Alan C.; Wipat, Anil; Harwood, Colin R.

    2005-01-01

    Bacillus subtilis responds to phosphate starvation stress by inducing the PhoP and SigB regulons. While the PhoP regulon provides a specific response to phosphate starvation stress, maximizing the acquisition of phosphate (Pi) from the environment and reducing the cellular requirement for this essential nutrient, the SigB regulon provides nonspecific resistance to stress by protecting essential cellular components, such as DNA and membranes. We have characterized the phosphate starvation stress response of B. subtilis at a genome-wide level using DNA macroarrays. A combination of outlier and cluster analyses identified putative new members of the PhoP regulon, namely, yfkN (2′,3′ cyclic nucleotide 2′-phosphodiesterase), yurI (RNase), yjdB (unknown), and vpr (extracellular serine protease). YurI is thought to be responsible for the nonspecific degradation of RNA, while the activity of YfkN on various nucleotide phosphates suggests that it could act on substrates liberated by YurI, which produces 3′ or 5′ phosphoribonucleotides. The putative new PhoP regulon members are either known or predicted to be secreted and are likely to be important for the recovery of inorganic phosphate from a variety of organic sources of phosphate in the environment. PMID:16291680

  13. Genome-wide transcriptional analysis of T cell activation reveals differential gene expression associated with psoriasis.

    PubMed

    Palau, Nuria; Julià, Antonio; Ferrándiz, Carlos; Puig, Lluís; Fonseca, Eduardo; Fernández, Emilia; López-Lasanta, María; Tortosa, Raül; Marsal, Sara

    2013-11-23

    Psoriasis is a chronic autoimmune disease in which T cells have a predominant role in initiating and perpetuating the chronic inflammation in skin. However, the mechanisms that regulate T cell activation in psoriasis are still incompletely understood. The objective of the present study was to characterize the main genetic pathways associated with T cell activation in psoriasis. Gene expression profiles from in vitro activated T cells were obtained from 17 psoriasis patients and 7 healthy controls using Illumina HT-12 v4 microarrays. From a total of 47,321 analyzed transcripts, 42 genes were found to be differentially expressed between psoriasis and controls (FDR p-value < 0.1, absolute fold-change > 1.2). Using an independent cohort of 8 patients and 8 healthy controls we validated the overexpression of SPATS2L (p-value =0.0009) and KLF6 (p-value =0.0012) genes in activated T cells from psoriasis patients. Using weighted correlation analysis we identified SPATS2L and KLF6 coexpression networks, which were also significantly associated with psoriasis (p-value < 0.05). Gene Ontology analysis allowed the identification of several biological processes associated with each coexpression network. Finally, using Gene Set Enrichment Analysis over the global T cell transcriptome we also found additional genetic pathways strongly associated with psoriasis (p-value < 0.0001). This study has identified two new genes, SPATS2L and KLF6, strongly associated with T cell activation in psoriasis. Functional analyses of the gene expression profiles also revealed new biological processes and genetic pathways associated with psoriasis. The results of this study provide an important insight into the biology of this common chronic inflammatory disease.

  14. Genome-wide digital transcript analysis of putative fruitlet abscission related genes regulated by ethephon in litchi

    PubMed Central

    Li, Caiqin; Wang, Yan; Ying, Peiyuan; Ma, Wuqiang; Li, Jianguo

    2015-01-01

    The high level of physiological fruitlet abscission in litchi (Litchi chinensis Sonn.) causes severe yield loss. Cell separation occurs at the fruit abscission zone (FAZ) and can be triggered by ethylene. However, a deep knowledge of the molecular events occurring in the FAZ is still unknown. Here, genome-wide digital transcript abundance (DTA) analysis of putative fruit abscission related genes regulated by ethephon in litchi were studied. More than 81 million high quality reads from seven ethephon treated and untreated control libraries were obtained by high-throughput sequencing. Through DTA profile analysis in combination with Gene Ontology and KEGG pathway enrichment analyses, a total of 2730 statistically significant candidate genes were involved in the ethephon-promoted litchi fruitlet abscission. Of these, there were 1867 early-responsive genes whose expressions were up- or down-regulated from 0 to 1 d after treatment. The most affected genes included those related to ethylene biosynthesis and signaling, auxin transport and signaling, transcription factors (TFs), protein ubiquitination, ROS response, calcium signal transduction, and cell wall modification. These genes could be clustered into four groups and 13 subgroups according to their similar expression patterns. qRT-PCR displayed the expression pattern of 41 selected candidate genes, which proved the accuracy of our DTA data. Ethephon treatment significantly increased fruit abscission and ethylene production of fruitlet. The possible molecular events to control the ethephon-promoted litchi fruitlet abscission were prompted out. The increased ethylene evolution in fruitlet would suppress the synthesis and polar transport of auxin and trigger abscission signaling. To the best of our knowledge, it is the first time to monitor the gene expression profile occurring in the FAZ-enriched pedicel during litchi fruit abscission induced by ethephon on the genome-wide level. This study will contribute to a better

  15. Genome-wide digital transcript analysis of putative fruitlet abscission related genes regulated by ethephon in litchi.

    PubMed

    Li, Caiqin; Wang, Yan; Ying, Peiyuan; Ma, Wuqiang; Li, Jianguo

    2015-01-01

    The high level of physiological fruitlet abscission in litchi (Litchi chinensis Sonn.) causes severe yield loss. Cell separation occurs at the fruit abscission zone (FAZ) and can be triggered by ethylene. However, a deep knowledge of the molecular events occurring in the FAZ is still unknown. Here, genome-wide digital transcript abundance (DTA) analysis of putative fruit abscission related genes regulated by ethephon in litchi were studied. More than 81 million high quality reads from seven ethephon treated and untreated control libraries were obtained by high-throughput sequencing. Through DTA profile analysis in combination with Gene Ontology and KEGG pathway enrichment analyses, a total of 2730 statistically significant candidate genes were involved in the ethephon-promoted litchi fruitlet abscission. Of these, there were 1867 early-responsive genes whose expressions were up- or down-regulated from 0 to 1 d after treatment. The most affected genes included those related to ethylene biosynthesis and signaling, auxin transport and signaling, transcription factors (TFs), protein ubiquitination, ROS response, calcium signal transduction, and cell wall modification. These genes could be clustered into four groups and 13 subgroups according to their similar expression patterns. qRT-PCR displayed the expression pattern of 41 selected candidate genes, which proved the accuracy of our DTA data. Ethephon treatment significantly increased fruit abscission and ethylene production of fruitlet. The possible molecular events to control the ethephon-promoted litchi fruitlet abscission were prompted out. The increased ethylene evolution in fruitlet would suppress the synthesis and polar transport of auxin and trigger abscission signaling. To the best of our knowledge, it is the first time to monitor the gene expression profile occurring in the FAZ-enriched pedicel during litchi fruit abscission induced by ethephon on the genome-wide level. This study will contribute to a better

  16. Genome-wide assembly and analysis of alternative transcripts in mouse

    PubMed Central

    Sharov, Alexei A.; Dudekula, Dawood B.; Ko, Minoru S.H.

    2005-01-01

    To build a mouse gene index with the most comprehensive coverage of alternative transcription/splicing (ATS), we developed an algorithm and a fully automated computational pipeline for transcript assembly from expressed sequences aligned to the genome. We identified 191,946 genomic loci, which included 27,497 protein-coding genes and 11,906 additional gene candidates (e.g., nonprotein-coding, but multiexon). Comparison of the resulting gene index with TIGR, UniGene, DoTS, and ESTGenes databases revealed that it had a greater number of transcripts, a greater average number of exons and introns with proper splicing sites per gene, and longer ORFs. The 27,497 protein-coding genes had 77,138 transcripts, i.e., 2.8 transcripts per gene on average. Close examination of transcripts led to a combinatorial table of 23 types of ATS units, only nine of which were previously described, i.e., 14 types of alternative splicing, seven types of alternative starts, and two types of alternative termination. The 47%, 18%, and 14% of 20,323 multiexon protein-coding genes with proper splice sites had alternative splicings, alternative starts, and alternative terminations, respectively. The gene index with the comprehensive ATS will provide a useful platform for analyzing the nature and mechanism of ATS, as well as for designing the accurate exon-based DNA microarrays. PMID:15867436

  17. Genome-wide transcriptional analysis of flagellar regeneration in Chlamydomonas reinhardtii identifies orthologs of ciliary disease genes

    PubMed Central

    Stolc, Viktor; Samanta, Manoj Pratim; Tongprasit, Waraporn; Marshall, Wallace F.

    2005-01-01

    The important role that cilia and flagella play in human disease creates an urgent need to identify genes involved in ciliary assembly and function. The strong and specific induction of flagellar-coding genes during flagellar regeneration in Chlamydomonas reinhardtii suggests that transcriptional profiling of such cells would reveal new flagella-related genes. We have conducted a genome-wide analysis of RNA transcript levels during flagellar regeneration in Chlamydomonas by using maskless photolithography method-produced DNA oligonucleotide microarrays with unique probe sequences for all exons of the 19,803 predicted genes. This analysis represents previously uncharacterized whole-genome transcriptional activity profiling study in this important model organism. Analysis of strongly induced genes reveals a large set of known flagellar components and also identifies a number of important disease-related proteins as being involved with cilia and flagella, including the zebrafish polycystic kidney genes Qilin, Reptin, and Pontin, as well as the testis-expressed tubby-like protein TULP2. PMID:15738400

  18. Genome-wide transcriptional analysis of flagellar regeneration in Chlamydomonas reinhardtii identifies orthologs of ciliary disease genes

    NASA Technical Reports Server (NTRS)

    Stolc, Viktor; Samanta, Manoj Pratim; Tongprasit, Waraporn; Marshall, Wallace F.

    2005-01-01

    The important role that cilia and flagella play in human disease creates an urgent need to identify genes involved in ciliary assembly and function. The strong and specific induction of flagellar-coding genes during flagellar regeneration in Chlamydomonas reinhardtii suggests that transcriptional profiling of such cells would reveal new flagella-related genes. We have conducted a genome-wide analysis of RNA transcript levels during flagellar regeneration in Chlamydomonas by using maskless photolithography method-produced DNA oligonucleotide microarrays with unique probe sequences for all exons of the 19,803 predicted genes. This analysis represents previously uncharacterized whole-genome transcriptional activity profiling study in this important model organism. Analysis of strongly induced genes reveals a large set of known flagellar components and also identifies a number of important disease-related proteins as being involved with cilia and flagella, including the zebrafish polycystic kidney genes Qilin, Reptin, and Pontin, as well as the testis-expressed tubby-like protein TULP2.

  19. Genome-wide transcriptional analysis of flagellar regeneration in Chlamydomonas reinhardtii identifies orthologs of ciliary disease genes

    NASA Technical Reports Server (NTRS)

    Stolc, Viktor; Samanta, Manoj Pratim; Tongprasit, Waraporn; Marshall, Wallace F.

    2005-01-01

    The important role that cilia and flagella play in human disease creates an urgent need to identify genes involved in ciliary assembly and function. The strong and specific induction of flagellar-coding genes during flagellar regeneration in Chlamydomonas reinhardtii suggests that transcriptional profiling of such cells would reveal new flagella-related genes. We have conducted a genome-wide analysis of RNA transcript levels during flagellar regeneration in Chlamydomonas by using maskless photolithography method-produced DNA oligonucleotide microarrays with unique probe sequences for all exons of the 19,803 predicted genes. This analysis represents previously uncharacterized whole-genome transcriptional activity profiling study in this important model organism. Analysis of strongly induced genes reveals a large set of known flagellar components and also identifies a number of important disease-related proteins as being involved with cilia and flagella, including the zebrafish polycystic kidney genes Qilin, Reptin, and Pontin, as well as the testis-expressed tubby-like protein TULP2.

  20. Genome-Wide Phylogenetic Comparative Analysis of Plant Transcriptional Regulation: A Timeline of Loss, Gain, Expansion, and Correlation with Complexity

    PubMed Central

    Lang, Daniel; Weiche, Benjamin; Timmerhaus, Gerrit; Richardt, Sandra; Riaño-Pachón, Diego M.; Corrêa, Luiz G. G.; Reski, Ralf; Mueller-Roeber, Bernd; Rensing, Stefan A.

    2010-01-01

    Evolutionary retention of duplicated genes encoding transcription-associated proteins (TAPs, comprising transcription factors and other transcriptional regulators) has been hypothesized to be positively correlated with increasing morphological complexity and paleopolyploidizations, especially within the plant kingdom. Here, we present the most comprehensive set of classification rules for TAPs and its application for genome-wide analyses of plants and algae. Using a dated species tree and phylogenetic comparative (PC) analyses, we define the timeline of TAP loss, gain, and expansion among Viridiplantae and find that two major bursts of gain/expansion occurred, coinciding with the water-to-land transition and the radiation of flowering plants. For the first time, we provide PC proof for the long-standing hypothesis that TAPs are major driving forces behind the evolution of morphological complexity, the latter in Plantae being shaped significantly by polyploidization and subsequent biased paleolog retention. Principal component analysis incorporating the number of TAPs per genome provides an alternate and significant proxy for complexity, ideally suited for PC genomics. Our work lays the ground for further interrogation of the shaping of gene regulatory networks underlying the evolution of organism complexity. PMID:20644220

  1. Genome-wide cloning, identification, classification and functional analysis of cotton heat shock transcription factors in cotton (Gossypium hirsutum).

    PubMed

    Wang, Jun; Sun, Na; Deng, Ting; Zhang, Lida; Zuo, Kaijing

    2014-11-06

    Heat shock transcriptional factors (Hsfs) play important roles in the processes of biotic and abiotic stresses as well as in plant development. Cotton (Gossypium hirsutum, 2n=4x=(AD)2=52) is an important crop for natural fiber production. Due to continuous high temperature and intermittent drought, heat stress is becoming a handicap to improve cotton yield and lint quality. Recently, the related wild diploid species Gossypium raimondii genome (2n=2x=(D5)2=26) has been fully sequenced. In order to analyze the functions of different Hsfs at the genome-wide level, detailed characterization and analysis of the Hsf gene family in G. hirsutum is indispensable. EST assembly and genome-wide analyses were applied to clone and identify heat shock transcription factor (Hsf) genes in Upland cotton (GhHsf). Forty GhHsf genes were cloned, identified and classified into three main classes (A, B and C) according to the characteristics of their domains. Analysis of gene duplications showed that GhHsfs have occurred more frequently than reported in plant genomes such as Arabidopsis and Populus. Quantitative real-time PCR (qRT-PCR) showed that all GhHsf transcripts are expressed in most cotton plant tissues including roots, stems, leaves and developing fibers, and abundantly in developing ovules. Three expression patterns were confirmed in GhHsfs when cotton plants were exposed to high temperature for 1 h. GhHsf39 exhibited the most immediate response to heat shock. Comparative analysis of Hsfs expression differences between the wild-type and fiberless mutant suggested that Hsfs are involved in fiber development. Comparative genome analysis showed that Upland cotton D-subgenome contains 40 Hsf members, and that the whole genome of Upland cotton contains more than 80 Hsf genes due to genome duplication. The expression patterns in different tissues in response to heat shock showed that GhHsfs are important for heat stress as well as fiber development. These results provide an improved

  2. Genome-wide transcriptional analysis of super-embryogenic Medicago truncatula explant cultures

    PubMed Central

    Imin, Nijat; Goffard, Nicolas; Nizamidin, Mahira; Rolfe, Barry G

    2008-01-01

    Background The Medicago truncatula (M. truncatula) line 2HA has a 500-fold greater capacity to regenerate plants in culture by somatic embryogenesis than its wild type progenitor Jemalong. To understand the molecular basis for the regeneration capacity of this super-embryogenic line 2HA, using Affymetrix GeneChip®, we have compared transcriptomes of explant leaf cultures of these two lines that were grown on media containing the auxin NAA (1-naphthaleneacetic acid) and the cytokinin BAP (6-benzylaminopurine) for two weeks, an early time point for tissue culture proliferation. Results Using Affymetrix GeneChip®, GCRMA normalisation and statistical analysis, we have shown that more than 196 and 49 probe sets were significantly (p < 0.05) up- or down-regulated respectively more than 2 fold in expression. We have utilised GeneBins, a database for classifying gene expression data to distinguish differentially displayed pathways among these two cultures which showed changes in number of biochemical pathways including carbon and flavonoid biosynthesis, phytohormone biosynthesis and signalling. The up-regulated genes in the embryogenic 2HA culture included nodulins, transporters, regulatory genes, embryogenesis related arabinogalactans and genes involved in redox homeostasis, the transition from vegetative growth to reproductive growth and cytokinin signalling. Down-regulated genes included protease inhibitors, wound-induced proteins, and genes involved in biosynthesis and signalling of phytohormones auxin, gibberellin and ethylene. These changes indicate essential differences between the super-embryogenic line 2HA and Jemalong not only in many aspects of biochemical pathways but also in their response to auxin and cytokinin. To validate the GeneChip results, we used quantitative real-time RT-PCR to examine the expression of the genes up-regulated in 2HA such as transposase, RNA-directed DNA polymerase, glycoside hydrolase, RESPONSE REGULATOR 10, AGAMOUS-LIKE 20, flower

  3. Genome-wide identification and expression analysis of the ClTCP transcription factors in Citrullus lanatus.

    PubMed

    Shi, Pibiao; Guy, Kateta Malangisha; Wu, Weifang; Fang, Bingsheng; Yang, Jinghua; Zhang, Mingfang; Hu, Zhongyuan

    2016-04-12

    The plant-specific TCP transcription factor family, which is involved in the regulation of cell growth and proliferation, performs diverse functions in multiple aspects of plant growth and development. However, no comprehensive analysis of the TCP family in watermelon (Citrullus lanatus) has been undertaken previously. A total of 27 watermelon TCP encoding genes distributed on nine chromosomes were identified. Phylogenetic analysis clustered the genes into 11 distinct subgroups. Furthermore, phylogenetic and structural analyses distinguished two homology classes within the ClTCP family, designated Class I and Class II. The Class II genes were differentiated into two subclasses, the CIN subclass and the CYC/TB1 subclass. The expression patterns of all members were determined by semi-quantitative PCR. The functions of two ClTCP genes, ClTCP14a and ClTCP15, in regulating plant height were confirmed by ectopic expression in Arabidopsis wild-type and ortholog mutants. This study represents the first genome-wide analysis of the watermelon TCP gene family, which provides valuable information for understanding the classification and functions of the TCP genes in watermelon.

  4. Genome-Wide Identification and Expression Analysis of WRKY Transcription Factors under Multiple Stresses in Brassica napus.

    PubMed

    He, Yajun; Mao, Shaoshuai; Gao, Yulong; Zhu, Liying; Wu, Daoming; Cui, Yixin; Li, Jiana; Qian, Wei

    2016-01-01

    WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. olearecea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to various stress tolerance. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to the three multiple stresses simultaneously, which suggests that these 3 WRKY may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress related QTL regions

  5. Genome-Wide Identification and Expression Analysis of WRKY Transcription Factors under Multiple Stresses in Brassica napus

    PubMed Central

    He, Yajun; Mao, Shaoshuai; Gao, Yulong; Zhu, Liying; Wu, Daoming; Cui, Yixin; Li, Jiana; Qian, Wei

    2016-01-01

    WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. olearecea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to various stress tolerance. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to the three multiple stresses simultaneously, which suggests that these 3 WRKY may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress related QTL regions

  6. Genome-wide analysis and expression profile of the bZIP transcription factor gene family in grapevine (Vitis vinifera)

    PubMed Central

    2014-01-01

    Background Basic leucine zipper (bZIP) transcription factor gene family is one of the largest and most diverse families in plants. Current studies have shown that the bZIP proteins regulate numerous growth and developmental processes and biotic and abiotic stress responses. Nonetheless, knowledge concerning the specific expression patterns and evolutionary history of plant bZIP family members remains very limited. Results We identified 55 bZIP transcription factor-encoding genes in the grapevine (Vitis vinifera) genome, and divided them into 10 groups according to the phylogenetic relationship with those in Arabidopsis. The chromosome distribution and the collinearity analyses suggest that expansion of the grapevine bZIP (VvbZIP) transcription factor family was greatly contributed by the segment/chromosomal duplications, which may be associated with the grapevine genome fusion events. Nine intron/exon structural patterns within the bZIP domain and the additional conserved motifs were identified among all VvbZIP proteins, and showed a high group-specificity. The predicted specificities on DNA-binding domains indicated that some highly conserved amino acid residues exist across each major group in the tree of land plant life. The expression patterns of VvbZIP genes across the grapevine gene expression atlas, based on microarray technology, suggest that VvbZIP genes are involved in grapevine organ development, especially seed development. Expression analysis based on qRT-PCR indicated that VvbZIP genes are extensively involved in drought- and heat-responses, with possibly different mechanisms. Conclusions The genome-wide identification, chromosome organization, gene structures, evolutionary and expression analyses of grapevine bZIP genes provide an overall insight of this gene family and their potential involvement in growth, development and stress responses. This will facilitate further research on the bZIP gene family regarding their evolutionary history and

  7. Genome-wide location analysis reveals distinct transcriptional circuitry by paralogous regulators Foxa1 and Foxa2.

    PubMed

    Bochkis, Irina M; Schug, Jonathan; Ye, Diana Z; Kurinna, Svitlana; Stratton, Sabrina A; Barton, Michelle C; Kaestner, Klaus H

    2012-01-01

    Gene duplication is a powerful driver of evolution. Newly duplicated genes acquire new roles that are relevant to fitness, or they will be lost over time. A potential path to functional relevance is mutation of the coding sequence leading to the acquisition of novel biochemical properties, as analyzed here for the highly homologous paralogs Foxa1 and Foxa2 transcriptional regulators. We determine by genome-wide location analysis (ChIP-Seq) that, although Foxa1 and Foxa2 share a large fraction of binding sites in the liver, each protein also occupies distinct regulatory elements in vivo. Foxa1-only sites are enriched for p53 binding sites and are frequently found near genes important to cell cycle regulation, while Foxa2-restricted sites show only a limited match to the forkhead consensus and are found in genes involved in steroid and lipid metabolism. Thus, Foxa1 and Foxa2, while redundant during development, have evolved divergent roles in the adult liver, ensuring the maintenance of both genes during evolution.

  8. Genome-Wide Location Analysis Reveals Distinct Transcriptional Circuitry by Paralogous Regulators Foxa1 and Foxa2

    PubMed Central

    Bochkis, Irina M.; Schug, Jonathan; Ye, Diana Z.; Kurinna, Svitlana; Stratton, Sabrina A.; Barton, Michelle C.; Kaestner, Klaus H.

    2012-01-01

    Gene duplication is a powerful driver of evolution. Newly duplicated genes acquire new roles that are relevant to fitness, or they will be lost over time. A potential path to functional relevance is mutation of the coding sequence leading to the acquisition of novel biochemical properties, as analyzed here for the highly homologous paralogs Foxa1 and Foxa2 transcriptional regulators. We determine by genome-wide location analysis (ChIP-Seq) that, although Foxa1 and Foxa2 share a large fraction of binding sites in the liver, each protein also occupies distinct regulatory elements in vivo. Foxa1-only sites are enriched for p53 binding sites and are frequently found near genes important to cell cycle regulation, while Foxa2-restricted sites show only a limited match to the forkhead consensus and are found in genes involved in steroid and lipid metabolism. Thus, Foxa1 and Foxa2, while redundant during development, have evolved divergent roles in the adult liver, ensuring the maintenance of both genes during evolution. PMID:22737085

  9. Comparative Analysis of Regulatory Elements between Escherichia coli and Klebsiella pneumoniae by Genome-Wide Transcription Start Site Profiling

    PubMed Central

    Qiu, Yu; Nagarajan, Harish; Seo, Joo-Hyun; Cho, Byung-Kwan; Tsai, Shih-Feng; Palsson, Bernhard Ø.

    2012-01-01

    Genome-wide transcription start site (TSS) profiles of the enterobacteria Escherichia coli and Klebsiella pneumoniae were experimentally determined through modified 5′ RACE followed by deep sequencing of intact primary mRNA. This identified 3,746 and 3,143 TSSs for E. coli and K. pneumoniae, respectively. Experimentally determined TSSs were then used to define promoter regions and 5′ UTRs upstream of coding genes. Comparative analysis of these regulatory elements revealed the use of multiple TSSs, identical sequence motifs of promoter and Shine-Dalgarno sequence, reflecting conserved gene expression apparatuses between the two species. In both species, over 70% of primary transcripts were expressed from operons having orthologous genes during exponential growth. However, expressed orthologous genes in E. coli and K. pneumoniae showed a strikingly different organization of upstream regulatory regions with only 20% identical promoters with TSSs in both species. Over 40% of promoters had TSSs identified in only one species, despite conserved promoter sequences existing in the other species. 662 conserved promoters having TSSs in both species resulted in the same number of comparable 5′ UTR pairs, and that regulatory element was found to be the most variant region in sequence among promoter, 5′ UTR, and ORF. In K. pneumoniae, 48 sRNAs were predicted and 36 of them were expressed during exponential growth. Among them, 34 orthologous sRNAs between two species were analyzed in depth, and the analysis showed that many sRNAs of K. pneumoniae, including pleiotropic sRNAs such as rprA, arcZ, and sgrS, may work in the same way as in E. coli. These results reveal a new dimension of comparative genomics such that a comparison of two genomes needs to be comprehensive over all levels of genome organization. PMID:22912590

  10. Genome-wide computational analysis reveals cardiomyocyte-specific transcriptional Cis-regulatory motifs that enable efficient cardiac gene therapy.

    PubMed

    Rincon, Melvin Y; Sarcar, Shilpita; Danso-Abeam, Dina; Keyaerts, Marleen; Matrai, Janka; Samara-Kuko, Ermira; Acosta-Sanchez, Abel; Athanasopoulos, Takis; Dickson, George; Lahoutte, Tony; De Bleser, Pieter; VandenDriessche, Thierry; Chuah, Marinee K

    2015-01-01

    Gene therapy is a promising emerging therapeutic modality for the treatment of cardiovascular diseases and hereditary diseases that afflict the heart. Hence, there is a need to develop robust cardiac-specific expression modules that allow for stable expression of the gene of interest in cardiomyocytes. We therefore explored a new approach based on a genome-wide bioinformatics strategy that revealed novel cardiac-specific cis-acting regulatory modules (CS-CRMs). These transcriptional modules contained evolutionary-conserved clusters of putative transcription factor binding sites that correspond to a "molecular signature" associated with robust gene expression in the heart. We then validated these CS-CRMs in vivo using an adeno-associated viral vector serotype 9 that drives a reporter gene from a quintessential cardiac-specific α-myosin heavy chain promoter. Most de novo designed CS-CRMs resulted in a >10-fold increase in cardiac gene expression. The most robust CRMs enhanced cardiac-specific transcription 70- to 100-fold. Expression was sustained and restricted to cardiomyocytes. We then combined the most potent CS-CRM4 with a synthetic heart and muscle-specific promoter (SPc5-12) and obtained a significant 20-fold increase in cardiac gene expression compared to the cytomegalovirus promoter. This study underscores the potential of rational vector design to improve the robustness of cardiac gene therapy.

  11. Genome-wide Computational Analysis Reveals Cardiomyocyte-specific Transcriptional Cis-regulatory Motifs That Enable Efficient Cardiac Gene Therapy

    PubMed Central

    Rincon, Melvin Y; Sarcar, Shilpita; Danso-Abeam, Dina; Keyaerts, Marleen; Matrai, Janka; Samara-Kuko, Ermira; Acosta-Sanchez, Abel; Athanasopoulos, Takis; Dickson, George; Lahoutte, Tony; De Bleser, Pieter; VandenDriessche, Thierry; Chuah, Marinee K

    2015-01-01

    Gene therapy is a promising emerging therapeutic modality for the treatment of cardiovascular diseases and hereditary diseases that afflict the heart. Hence, there is a need to develop robust cardiac-specific expression modules that allow for stable expression of the gene of interest in cardiomyocytes. We therefore explored a new approach based on a genome-wide bioinformatics strategy that revealed novel cardiac-specific cis-acting regulatory modules (CS-CRMs). These transcriptional modules contained evolutionary-conserved clusters of putative transcription factor binding sites that correspond to a “molecular signature” associated with robust gene expression in the heart. We then validated these CS-CRMs in vivo using an adeno-associated viral vector serotype 9 that drives a reporter gene from a quintessential cardiac-specific α-myosin heavy chain promoter. Most de novo designed CS-CRMs resulted in a >10-fold increase in cardiac gene expression. The most robust CRMs enhanced cardiac-specific transcription 70- to 100-fold. Expression was sustained and restricted to cardiomyocytes. We then combined the most potent CS-CRM4 with a synthetic heart and muscle-specific promoter (SPc5-12) and obtained a significant 20-fold increase in cardiac gene expression compared to the cytomegalovirus promoter. This study underscores the potential of rational vector design to improve the robustness of cardiac gene therapy. PMID:25195597

  12. A Genome-Wide Transcriptional Analysis of Yeast-Hyphal Transition in Candida tropicalis by RNA-Seq

    PubMed Central

    Yu, Shuan-bao; Li, Wen-ge; Liu, Xiao-shu; Zhao, Lei; Lu, Jin-xing

    2016-01-01

    Candida tropicalis is considered as the leading pathogen in nosocomial fungemia and hepatosplenic fungal infections in patients with cancer, particularly in leukemia. The yeast-filament transition is required for virulent infection by Candida. Several studies have explored the genome-wide transcription profile of Candida, however, no report on the transcriptional profile of C. tropicalis under yeast-filament transition has been published. In this study, the transcriptomes of three C. tropicalis isolates with different adhesion and biofilm formation abilities, identified in our previous studies, were analyzed in both the yeast and filament states using RNA-Seq. Differentially expressed genes were found for each isolate during the transition. A total of 115 genes were up- or down- regulated in the two hyphal-producing isolates (ZRCT 4 and ZRCT 45). Among these differentially expressed genes, only two were down-regulated during the yeast-filament transition. Furthermore, six filament-associated genes were up-regulated in the hyphae-producing isolates. According to Candida Hypha Growth Database established in this study, 331 hyphae- related genes were discovered in C. tropicalis. ALS1 and ALS3 were down-regulated and up-regulated, respectively, during filamentous growth of C. tropicalis. These findings proved a better understanding of gene expression dynamics during the yeast-filament transition in C. tropicalis. PMID:27851809

  13. Genome-wide analysis of the human p53 transcriptional network unveils a lncRNA tumour suppressor signature.

    PubMed

    Sánchez, Yolanda; Segura, Victor; Marín-Béjar, Oskar; Athie, Alejandro; Marchese, Francesco P; González, Jovanna; Bujanda, Luis; Guo, Shuling; Matheu, Ander; Huarte, Maite

    2014-12-19

    Despite the inarguable relevance of p53 in cancer, genome-wide studies relating endogenous p53 activity to the expression of lncRNAs in human cells are still missing. Here, by integrating RNA-seq with p53 ChIP-seq analyses of a human cancer cell line under DNA damage, we define a high-confidence set of 18 lncRNAs that are p53 transcriptional targets. We demonstrate that two of the p53-regulated lncRNAs are required for the efficient binding of p53 to some of its target genes, modulating the p53 transcriptional network and contributing to apoptosis induction by DNA damage. We also show that the expression of p53-lncRNAs is lowered in colorectal cancer samples, constituting a tumour suppressor signature with high diagnostic power. Thus, p53-regulated lncRNAs establish a positive regulatory feedback loop that enhances p53 tumour suppressor activity. Furthermore, the signature defined by p53-regulated lncRNAs supports their potential use in the clinic as biomarkers and therapeutic targets.

  14. Heat shock transcription factors in banana: genome-wide characterization and expression profile analysis during development and stress response

    PubMed Central

    Wei, Yunxie; Hu, Wei; Xia, Feiyu; Zeng, Hongqiu; Li, Xiaolin; Yan, Yu; He, Chaozu; Shi, Haitao

    2016-01-01

    Banana (Musa acuminata) is one of the most popular fresh fruits. However, the rapid spread of fungal pathogen Fusarium oxysporum f. sp. cubense (Foc) in tropical areas severely affected banana growth and production. Thus, it is very important to identify candidate genes involved in banana response to abiotic stress and pathogen infection, as well as the molecular mechanism and possible utilization for genetic breeding. Heat stress transcription factors (Hsfs) are widely known for their common involvement in various abiotic stresses and plant-pathogen interaction. However, no MaHsf has been identified in banana, as well as its possible role. In this study, genome-wide identification and further analyses of evolution, gene structure and conserved motifs showed closer relationship of them in every subgroup. The comprehensive expression profiles of MaHsfs revealed the tissue- and developmental stage-specific or dependent, as well as abiotic and biotic stress-responsive expressions of them. The common regulation of several MaHsfs by abiotic and biotic stress indicated the possible roles of them in plant stress responses. Taken together, this study extended our understanding of MaHsf gene family and identified some candidate MaHsfs with specific expression profiles, which may be used as potential candidates for genetic breeding in banana. PMID:27857174

  15. Genome-wide identification and transcriptional profiling analysis of auxin response-related gene families in cucumber

    PubMed Central

    2014-01-01

    Background Auxin signaling has a vital function in the regulation of plant growth and development, both which are known to be mediated by auxin-responsive genes. So far, significant progress has been made toward the identification and characterization of auxin-response genes in several model plants, while no systematic analysis for these families was reported in cucumber (Cucumis sativus L.), a reference species for Cucurbitaceae crops. The comprehensive analyses will help design experiments for functional validation of their precise roles in plant development and stress responses. Results A genome-wide search for auxin-response gene homologues identified 16 auxin-response factors (ARFs), 27 auxin/indole acetic acids (Aux/IAAs), 10 Gretchen Hagen 3 (GH3s), 61 small auxin-up mRNAs (SAURs), and 39 lateral organ boundaries (LBDs) in cucumber. Sequence analysis together with the organization of putative motifs indicated the potential diverse functions of these five auxin-related family members. The distribution and density of auxin response-related genes on chromosomes were not uniform. Evolutionary analysis showed that the chromosomal segment duplications mainly contributed to the expansion of the CsARF, CsIAA, CsGH3, and CsLBD gene families. Quantitative real-time RT-PCR analysis demonstrated that many ARFs, AUX/IAAs, GH3s, SAURs, and LBD genes were expressed in diverse patterns within different organs/tissues and during different development stages. They were also implicated in IAA, methyl jasmonic acid, or salicylic acid response, which is consistent with the finding that a great number of diverse cis-elements are present in their promoter regions involving a variety of signaling transduction pathways. Conclusion Genome-wide comparative analysis of auxin response-related family genes and their expression analysis provide new evidence for the potential role of auxin in development and hormone response of plants. Our data imply that the auxin response genes may be

  16. Genome-wide functional analysis of CREB/long-term memory-dependent transcription reveals distinct basal and memory gene expression programs.

    PubMed

    Lakhina, Vanisha; Arey, Rachel N; Kaletsky, Rachel; Kauffman, Amanda; Stein, Geneva; Keyes, William; Xu, Daniel; Murphy, Coleen T

    2015-01-21

    Induced CREB activity is a hallmark of long-term memory, but the full repertoire of CREB transcriptional targets required specifically for memory is not known in any system. To obtain a more complete picture of the mechanisms involved in memory, we combined memory training with genome-wide transcriptional analysis of C. elegans CREB mutants. This approach identified 757 significant CREB/memory-induced targets and confirmed the involvement of known memory genes from other organisms, but also suggested new mechanisms and novel components that may be conserved through mammals. CREB mediates distinct basal and memory transcriptional programs at least partially through spatial restriction of CREB activity: basal targets are regulated primarily in nonneuronal tissues, while memory targets are enriched for neuronal expression, emanating from CREB activity in AIM neurons. This suite of novel memory-associated genes will provide a platform for the discovery of orthologous mammalian long-term memory components.

  17. Genome-wide Functional Analysis of CREB/Long-Term Memory-Dependent Transcription Reveals Distinct Basal and Memory Gene Expression Programs

    PubMed Central

    Lakhina, Vanisha; Arey, Rachel N.; Kaletsky, Rachel; Kauffman, Amanda; Stein, Geneva; Keyes, William; Xu, Daniel; Murphy, Coleen T.

    2014-01-01

    SUMMARY Induced CREB activity is a hallmark of long-term memory, but the full repertoire of CREB transcriptional targets required specifically for memory is not known in any system. To obtain a more complete picture of the mechanisms involved in memory, we combined memory training with genome-wide transcriptional analysis of C. elegans CREB mutants. This approach identified 757 significant CREB/memory-induced targets and confirmed the involvement of known memory genes from other organisms, but also suggested new mechanisms and novel components that may be conserved through mammals. CREB mediates distinct basal and memory transcriptional programs at least partially through spatial restriction of CREB activity: basal targets are regulated primarily in nonneuronal tissues, while memory targets are enriched for neuronal expression, emanating from CREB activity in AIM neurons. This suite of novel memory-associated genes will provide a platform for the discovery of orthologous mammalian long-term memory components. PMID:25611510

  18. Genome-wide analysis of the bHLH transcription factor family in Chinese cabbage (Brassica rapa ssp. pekinensis).

    PubMed

    Song, Xiao-Ming; Huang, Zhi-Nan; Duan, Wei-Ke; Ren, Jun; Liu, Tong-Kun; Li, Ying; Hou, Xi-Lin

    2014-02-01

    Basic helix-loop-helix (bHLH) transcription factors are widely distributed in eukaryotic organisms and are thought to be one of the largest families of regulatory proteins. This important family of transcriptional regulators plays crucial roles in plant development. However, a systematic analysis of the bHLH transcription factor family has not been reported in Chinese cabbage. In this study, 230 bHLH transcription factors were identified from the whole Chinese cabbage genome and compared with proteins from other representative plants, fungi and metazoans. The Chinese cabbage bHLH (BrabHLH) gene family could be classified into 24 subfamilies. Phylogenetic analysis of BrabHLHs along with bHLHs from Arabidopsis and rice indicated 26 subfamilies. The identification, classification, phylogenetic reconstruction, conserved motifs, chromosome distribution, functional annotation, expression patterns and interaction networks of BrabHLHs were analyzed. Distribution mapping showed that BrabHLHs were non-randomly located on the ten Chinese cabbage chromosomes. One hundred and twenty-four orthologous bHLH genes were identified between Chinese cabbage and Arabidopsis, and the interaction networks of the orthologous genes were constructed in Chinese cabbage. Quantitative RT-PCR analysis showed that expressions of BrabHLH genes varied widely under different abiotic stress treatments for different times. Thus, this comprehensive analysis of BrabHLHs represents a rich resource, aiding the elucidation of the roles of bHLH family members in plant growth and development. Furthermore, the comparative genomics analysis deepened our understanding of the evolution of this gene family after a polyploidy event.

  19. Genome-wide analysis of ethylene-responsive element binding factor-associated amphiphilic repression motif-containing transcriptional regulators in Arabidopsis.

    PubMed

    Kagale, Sateesh; Links, Matthew G; Rozwadowski, Kevin

    2010-03-01

    The ethylene-responsive element binding factor-associated amphiphilic repression (EAR) motif is a transcriptional regulatory motif identified in members of the ethylene-responsive element binding factor, C2H2, and auxin/indole-3-acetic acid families of transcriptional regulators. Sequence comparison of the core EAR motif sites from these proteins revealed two distinct conservation patterns: LxLxL and DLNxxP. Proteins containing these motifs play key roles in diverse biological functions by negatively regulating genes involved in developmental, hormonal, and stress signaling pathways. Through a genome-wide bioinformatics analysis, we have identified the complete repertoire of the EAR repressome in Arabidopsis (Arabidopsis thaliana) comprising 219 proteins belonging to 21 different transcriptional regulator families. Approximately 72% of these proteins contain a LxLxL type of EAR motif, 22% contain a DLNxxP type of EAR motif, and the remaining 6% have a motif where LxLxL and DLNxxP are overlapping. Published in vitro and in planta investigations support approximately 40% of these proteins functioning as negative regulators of gene expression. Comparative sequence analysis of EAR motif sites and adjoining regions has identified additional preferred residues and potential posttranslational modification sites that may influence the functionality of the EAR motif. Homology searches against protein databases of poplar (Populus trichocarpa), grapevine (Vitis vinifera), rice (Oryza sativa), and sorghum (Sorghum bicolor) revealed that the EAR motif is conserved across these diverse plant species. This genome-wide analysis represents the most extensive survey of EAR motif-containing proteins in Arabidopsis to date and provides a resource enabling investigations into their biological roles and the mechanism of EAR motif-mediated transcriptional regulation.

  20. Genome-wide analysis of the AP2/ERF transcription factor superfamily in Chinese cabbage (Brassica rapa ssp. pekinensis)

    PubMed Central

    2013-01-01

    Background Chinese cabbage (Brassica rapa ssp. pekinensis) is a member of one of the most important leaf vegetables grown worldwide, which has experienced thousands of years in cultivation and artificial selection. The entire Chinese cabbage genome sequence, and more than forty thousand proteins have been obtained to date. The genome has undergone triplication events since its divergence from Arabidopsis thaliana (13 to 17 Mya), however a high degree of sequence similarity and conserved genome structure remain between the two species. Arabidopsis is therefore a viable reference species for comparative genomics studies. Variation in the number of members in gene families due to genome triplication may contribute to the broad range of phenotypic plasticity, and increased tolerance to environmental extremes observed in Brassica species. Transcription factors are important regulators involved in plant developmental and physiological processes. The AP2/ERF proteins, one of the most important families of transcriptional regulators, play a crucial role in plant growth, and in response to biotic and abiotic stressors. Our analysis will provide resources for understanding the tolerance mechanisms in Brassica rapa ssp. pekinensis. Results In the present study, 291 putative AP2/ERF transcription factor proteins were identified from the Chinese cabbage genome database, and compared with proteins from 15 additional species. The Chinese cabbage AP2/ERF superfamily was classified into four families, including AP2, ERF, RAV, and Soloist. The ERF family was further divided into DREB and ERF subfamilies. The AP2/ERF superfamily was subsequently divided into 15 groups. The identification, classification, phylogenetic reconstruction, conserved motifs, chromosome distribution, functional annotation, expression patterns, and interaction networks of the AP2/ERF transcription factor superfamily were predicted and analyzed. Distribution mapping results showed AP2/ERF superfamily genes were

  1. Genome-wide analysis of the AP2/ERF transcription factor superfamily in Chinese cabbage (Brassica rapa ssp. pekinensis).

    PubMed

    Song, Xiaoming; Li, Ying; Hou, Xilin

    2013-08-23

    Chinese cabbage (Brassica rapa ssp. pekinensis) is a member of one of the most important leaf vegetables grown worldwide, which has experienced thousands of years in cultivation and artificial selection. The entire Chinese cabbage genome sequence, and more than forty thousand proteins have been obtained to date. The genome has undergone triplication events since its divergence from Arabidopsis thaliana (13 to 17 Mya), however a high degree of sequence similarity and conserved genome structure remain between the two species. Arabidopsis is therefore a viable reference species for comparative genomics studies. Variation in the number of members in gene families due to genome triplication may contribute to the broad range of phenotypic plasticity, and increased tolerance to environmental extremes observed in Brassica species. Transcription factors are important regulators involved in plant developmental and physiological processes. The AP2/ERF proteins, one of the most important families of transcriptional regulators, play a crucial role in plant growth, and in response to biotic and abiotic stressors. Our analysis will provide resources for understanding the tolerance mechanisms in Brassica rapa ssp. pekinensis. In the present study, 291 putative AP2/ERF transcription factor proteins were identified from the Chinese cabbage genome database, and compared with proteins from 15 additional species. The Chinese cabbage AP2/ERF superfamily was classified into four families, including AP2, ERF, RAV, and Soloist. The ERF family was further divided into DREB and ERF subfamilies. The AP2/ERF superfamily was subsequently divided into 15 groups. The identification, classification, phylogenetic reconstruction, conserved motifs, chromosome distribution, functional annotation, expression patterns, and interaction networks of the AP2/ERF transcription factor superfamily were predicted and analyzed. Distribution mapping results showed AP2/ERF superfamily genes were localized on the

  2. Genome-wide characterization and analysis of bZIP transcription factor gene family related to abiotic stress in cassava

    PubMed Central

    Hu, Wei; Yang, Hubiao; Yan, Yan; Wei, Yunxie; Tie, Weiwei; Ding, Zehong; Zuo, Jiao; Peng, Ming; Li, Kaimian

    2016-01-01

    The basic leucine zipper (bZIP) transcription factor family plays crucial roles in various aspects of biological processes. Currently, no information is available regarding the bZIP family in the important tropical crop cassava. Herein, 77 bZIP genes were identified from cassava. Evolutionary analysis indicated that MebZIPs could be divided into 10 subfamilies, which was further supported by conserved motif and gene structure analyses. Global expression analysis suggested that MebZIPs showed similar or distinct expression patterns in different tissues between cultivated variety and wild subspecies. Transcriptome analysis of three cassava genotypes revealed that many MebZIP genes were activated by drought in the root of W14 subspecies, indicating the involvement of these genes in the strong resistance of cassava to drought. Expression analysis of selected MebZIP genes in response to osmotic, salt, cold, ABA, and H2O2 suggested that they might participate in distinct signaling pathways. Our systematic analysis of MebZIPs reveals constitutive, tissue-specific and abiotic stress-responsive candidate MebZIP genes for further functional characterization in planta, yields new insights into transcriptional regulation of MebZIP genes, and lays a foundation for understanding of bZIP-mediated abiotic stress response. PMID:26947924

  3. Genome-wide analysis and expression profiling of the ERF transcription factor family in potato (Solanum tuberosum L.).

    PubMed

    Charfeddine, Mariam; Saïdi, Mohamed Najib; Charfeddine, Safa; Hammami, Asma; Gargouri Bouzid, Radhia

    2015-04-01

    The ERF transcription factors belong to the AP2/ERF superfamily, one of the largest transcription factor families in plants. They play important roles in plant development processes, as well as in the response to biotic, abiotic, and hormone signaling. In the present study, 155 putative ERF transcription factor genes were identified from the potato (Solanum tuberosum) genome database, and compared with those from Arabidopsis thaliana. The StERF proteins are divided into ten phylogenetic groups. Expression analyses of five StERFs were carried out by semi-quantitative RT-PCR and compared with published RNA-seq data. These latter analyses were used to distinguish tissue-specific, biotic, and abiotic stress genes as well as hormone-responsive StERF genes. The results are of interest to better understand the role of the AP2/ERF genes in response to diverse types of stress in potatoes. A comprehensive analysis of the physiological functions and biological roles of the ERF family genes in S. tuberosum is required to understand crop stress tolerance mechanisms.

  4. Genome-wide analysis of the Dof transcription factor gene family reveals soybean-specific duplicable and functional characteristics.

    PubMed

    Guo, Yong; Qiu, Li-Juan

    2013-01-01

    The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max). In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs) were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.

  5. Bombyx mori Transcription Factors: Genome-Wide Identification, Expression Profiles and Response to Pathogens by Microarray Analysis

    PubMed Central

    Huang, Lulin; Cheng, Tingcai; Xu, Pingzhen; Fang, Ting; Xia, Qingyou

    2012-01-01

    Transcription factors are present in all living organisms, and play vital roles in a wide range of biological processes. Studies of transcription factors will help reveal the complex regulation mechanism of organisms. So far, hundreds of domains have been identified that show transcription factor activity. Here, 281 reported transcription factor domains were used as seeds to search the transcription factors in genomes of Bombyx mori L. (Lepidoptera: Bombycidae) and four other model insects. Overall, 666 transcription factors including 36 basal factors and 630 other factors were identified in B. mori genome, which accounted for 4.56% of its genome. The silkworm transcription factors' expression profiles were investigated in relation to multiple tissues, developmental stages, sexual dimorphism, and responses to oral infection by pathogens and direct bacterial injection. These all provided rich clues for revealing the transcriptional regulation mechanism of silkworm organ differentiation, growth and development, sexual dimorphism, and response to pathogen infection. PMID:22943524

  6. Tumor-specific usage of alternative transcription start sites in colorectal cancer identified by genome-wide exon array analysis.

    PubMed

    Thorsen, Kasper; Schepeler, Troels; Øster, Bodil; Rasmussen, Mads H; Vang, Søren; Wang, Kai; Hansen, Kristian Q; Lamy, Philippe; Pedersen, Jakob Skou; Eller, Asger; Mansilla, Francisco; Laurila, Kirsti; Wiuf, Carsten; Laurberg, Søren; Dyrskjøt, Lars; Ørntoft, Torben F; Andersen, Claus L

    2011-10-14

    Approximately half of all human genes use alternative transcription start sites (TSSs) to control mRNA levels and broaden the transcriptional output in healthy tissues. Aberrant expression patterns promoting carcinogenesis, however, may arise from alternative promoter usage. By profiling 108 colorectal samples using exon arrays, we identified nine genes (TCF12, OSBPL1A, TRAK1, ANK3, CHEK1, UGP2, LMO7, ACSL5, and SCIN) showing tumor-specific alternative TSS usage in both adenoma and cancer samples relative to normal mucosa. Analysis of independent exon array data sets corroborated these findings. Additionally, we confirmed the observed patterns for selected mRNAs using quantitative real-time reverse-transcription PCR. Interestingly, for some of the genes, the tumor-specific TSS usage was not restricted to colorectal cancer. A comprehensive survey of the nine genes in lung, bladder, liver, prostate, gastric, and brain cancer revealed significantly altered mRNA isoform ratios for CHEK1, OSBPL1A, and TCF12 in a subset of these cancer types.To identify the mechanism responsible for the shift in alternative TSS usage, we antagonized the Wnt-signaling pathway in DLD1 and Ls174T colorectal cancer cell lines, which remarkably led to a shift in the preferred TSS for both OSBPL1A and TRAK1. This indicated a regulatory role of the Wnt pathway in selecting TSS, possibly also involving TP53 and SOX9, as their transcription binding sites were enriched in the promoters of the tumor preferred isoforms together with their mRNA levels being increased in tumor samples. Finally, to evaluate the prognostic impact of the altered TSS usage, immunohistochemistry was used to show deregulation of the total protein levels of both TCF12 and OSBPL1A, corresponding to the mRNA levels observed. Furthermore, the level of nuclear TCF12 had a significant correlation to progression free survival in a cohort of 248 stage II colorectal cancer samples. Alternative TSS usage in colorectal adenoma and

  7. Dynamic Analysis of Gene Expression and Genome-wide Transcription Factor Binding during Lineage Specification of Multipotent Progenitors

    PubMed Central

    May, Gillian; Soneji, Shamit; Tipping, Alex J.; Teles, Jose; McGowan, Simon J.; Wu, Mengchu; Guo, Yanping; Fugazza, Cristina; Brown, John; Karlsson, Göran; Pina, Cristina; Olariu, Victor; Taylor, Stephen; Tenen, Daniel G.; Peterson, Carsten; Enver, Tariq

    2013-01-01

    Summary We used the paradigmatic GATA-PU.1 axis to explore, at the systems level, dynamic relationships between transcription factor (TF) binding and global gene expression programs as multipotent cells differentiate. We combined global ChIP-seq of GATA1, GATA2, and PU.1 with expression profiling during differentiation to erythroid and neutrophil lineages. Our analysis reveals (1) differential complexity of sequence motifs bound by GATA1, GATA2, and PU.1; (2) the scope and interplay of GATA1 and GATA2 programs within, and during transitions between, different cell compartments, and the extent of their hard-wiring by DNA motifs; (3) the potential to predict gene expression trajectories based on global associations between TF-binding data and target gene expression; and (4) how dynamic modeling of DNA-binding and gene expression data can be used to infer regulatory logic of TF circuitry. This rubric exemplifies the utility of this cross-platform resource for deconvoluting the complexity of transcriptional programs controlling stem/progenitor cell fate in hematopoiesis. PMID:24120743

  8. Genome-Wide Transcriptional Profile Analysis of Prunus persica in Response to Low Sink Demand after Fruit Removal

    PubMed Central

    Duan, Wei; Xu, Hongguo; Liu, Guotian; Fan, Peige; Liang, Zhenchang; Li, Shaohua

    2016-01-01

    Prunus persica fruits were removed from 1-year-old shoots to analysis photosynthesis, chlorophyll fluorescence and genes changes in leaves to low sink demand caused by fruit removal (−fruit) during the final stage of rapid fruit growth. A decline in net photosynthesis rate was observed, accompanied with a decrease in stomatal conductance. The intercellular CO2 concentrations and leaf temperature increased as compared with a normal fruit load (+fruit). Moreover, low sink demand significantly inhibited the donor side and the reaction center of photosystem II. 382 genes in leaf with an absolute fold change ≥1 change in expression level, representing 116 up- and 266 down-regulated genes except for unknown transcripts. Among these, 25 genes for photosynthesis were down-regulated, 69 stress and 19 redox related genes up-regulated under the low sink demand. These studies revealed high leaf temperature may result in a decline of net photosynthesis rate through down-regulation in photosynthetic related genes and up-regulation in redox and stress related genes, especially heat shock proteins genes. The complex changes in genes at the transcriptional level under low sink demand provided useful starting points for in-depth analyses of source-sink relationship in P. persica. PMID:27446115

  9. Genome-Wide Transcriptional Profile Analysis of Prunus persica in Response to Low Sink Demand after Fruit Removal.

    PubMed

    Duan, Wei; Xu, Hongguo; Liu, Guotian; Fan, Peige; Liang, Zhenchang; Li, Shaohua

    2016-01-01

    Prunus persica fruits were removed from 1-year-old shoots to analysis photosynthesis, chlorophyll fluorescence and genes changes in leaves to low sink demand caused by fruit removal (-fruit) during the final stage of rapid fruit growth. A decline in net photosynthesis rate was observed, accompanied with a decrease in stomatal conductance. The intercellular CO2 concentrations and leaf temperature increased as compared with a normal fruit load (+fruit). Moreover, low sink demand significantly inhibited the donor side and the reaction center of photosystem II. 382 genes in leaf with an absolute fold change ≥1 change in expression level, representing 116 up- and 266 down-regulated genes except for unknown transcripts. Among these, 25 genes for photosynthesis were down-regulated, 69 stress and 19 redox related genes up-regulated under the low sink demand. These studies revealed high leaf temperature may result in a decline of net photosynthesis rate through down-regulation in photosynthetic related genes and up-regulation in redox and stress related genes, especially heat shock proteins genes. The complex changes in genes at the transcriptional level under low sink demand provided useful starting points for in-depth analyses of source-sink relationship in P. persica.

  10. Dynamic analysis of gene expression and genome-wide transcription factor binding during lineage specification of multipotent progenitors.

    PubMed

    May, Gillian; Soneji, Shamit; Tipping, Alex J; Teles, Jose; McGowan, Simon J; Wu, Mengchu; Guo, Yanping; Fugazza, Cristina; Brown, John; Karlsson, Göran; Pina, Cristina; Olariu, Victor; Taylor, Stephen; Tenen, Daniel G; Peterson, Carsten; Enver, Tariq

    2013-12-05

    We used the paradigmatic GATA-PU.1 axis to explore, at the systems level, dynamic relationships between transcription factor (TF) binding and global gene expression programs as multipotent cells differentiate. We combined global ChIP-seq of GATA1, GATA2, and PU.1 with expression profiling during differentiation to erythroid and neutrophil lineages. Our analysis reveals (1) differential complexity of sequence motifs bound by GATA1, GATA2, and PU.1; (2) the scope and interplay of GATA1 and GATA2 programs within, and during transitions between, different cell compartments, and the extent of their hard-wiring by DNA motifs; (3) the potential to predict gene expression trajectories based on global associations between TF-binding data and target gene expression; and (4) how dynamic modeling of DNA-binding and gene expression data can be used to infer regulatory logic of TF circuitry. This rubric exemplifies the utility of this cross-platform resource for deconvoluting the complexity of transcriptional programs controlling stem/progenitor cell fate in hematopoiesis.

  11. Genome wide identification of Dof transcription factor gene family in sorghum and its comparative phylogenetic analysis with rice and Arabidopsis.

    PubMed

    Kushwaha, Hariom; Gupta, Shubhra; Singh, Vinay Kumar; Rastogi, Smita; Yadav, Dinesh

    2011-11-01

    The Dof (DNA binding with One Finger) family represents a classic zinc-finger transcription factors involved with multifarious roles exclusively in plants. There exists great diversity in terms of number of Dof genes observed in different crops. In current study, a total of 28 putative Dof genes have been predicted in silico from the recently available whole genome shotgun sequence of Sorghum bicolor (L.) Moench (with assigned accession numbers TPA:BK006983-BK007006 and TPA:BK007079-BK007082). The predicted SbDof genes are distributed on nine out of ten chromosomes of sorghum and most of these genes lack introns based on canonical intron/exon structure. Phylogenetic analysis of 28 SbDof proteins resulted in four subgroups constituting six clusters. The comparative phylogenetic analysis of these Dof proteins along with 30 rice and 36 Arabidopsis Dof proteins revealed six major groups similar to what has been observed earlier for rice and Arabidopsis. Motif analysis revealed the presence of conserved 50-52 amino acids Dof domain uniformly distributed across all the 28 Dof proteins of sorghum. The in silico cis-regulatory elements analysis of these SbDof genes suggested its diverse functions associated with light responsiveness, endosperm specific gene expression, hormone responsiveness, meristem specific expression and stress responsiveness.

  12. Genome-wide analysis of WRKY transcription factors in wheat (Triticum aestivum L.) and differential expression under water deficit condition.

    PubMed

    Ning, Pan; Liu, Congcong; Kang, Jingquan; Lv, Jinyin

    2017-01-01

    WRKY proteins, which comprise one of the largest transcription factor (TF) families in the plant kingdom, play crucial roles in plant development and stress responses. Despite several studies on WRKYs in wheat (Triticum aestivum L.), functional annotation information about wheat WRKYs is limited. Here, 171 TaWRKY TFs were identified from the whole wheat genome and compared with proteins from 19 other species representing nine major plant lineages. A phylogenetic analysis, coupled with gene structure analysis and motif determination, divided these TaWRKYs into seven subgroups (Group I, IIa-e, and III). Chromosomal location showed that most TaWRKY genes were enriched on four chromosomes, especially on chromosome 3B. In addition, 85 (49.7%) genes were either tandem (5) or segmental duplication (80), which suggested that though tandem duplication has contributed to the expansion of TaWRKY family, segmental duplication probably played a more pivotal role. Analysis of cis-acting elements revealed putative functions of WRKYs in wheat during development as well as under numerous biotic and abiotic stresses. Finally, the expression of TaWRKY genes in flag leaves, glumes, and lemmas under water-deficit condition were analyzed. Results showed that different TaWRKY genes preferentially express in specific tissue during the grain-filling stage. Our results provide a more extensive insight on WRKY gene family in wheat, and also contribute to the screening of more candidate genes for further investigation on function characterization of WRKYs under various stresses.

  13. Genome-wide analysis of the R2R3-MYB transcription factor gene family in sweet orange (Citrus sinensis).

    PubMed

    Liu, Chaoyang; Wang, Xia; Xu, Yuantao; Deng, Xiuxin; Xu, Qiang

    2014-10-01

    MYB transcription factor represents one of the largest gene families in plant genomes. Sweet orange (Citrus sinensis) is one of the most important fruit crops worldwide, and recently the genome has been sequenced. This provides an opportunity to investigate the organization and evolutionary characteristics of sweet orange MYB genes from whole genome view. In the present study, we identified 100 R2R3-MYB genes in the sweet orange genome. A comprehensive analysis of this gene family was performed, including the phylogeny, gene structure, chromosomal localization and expression pattern analyses. The 100 genes were divided into 29 subfamilies based on the sequence similarity and phylogeny, and the classification was also well supported by the highly conserved exon/intron structures and motif composition. The phylogenomic comparison of MYB gene family among sweet orange and related plant species, Arabidopsis, cacao and papaya suggested the existence of functional divergence during evolution. Expression profiling indicated that sweet orange R2R3-MYB genes exhibited distinct temporal and spatial expression patterns. Our analysis suggested that the sweet orange MYB genes may play important roles in different plant biological processes, some of which may be potentially involved in citrus fruit quality. These results will be useful for future functional analysis of the MYB gene family in sweet orange.

  14. Genome-wide analysis and expression profiling of DNA-binding with one zinc finger (Dof) transcription factor family in potato.

    PubMed

    Venkatesh, Jelli; Park, Se Won

    2015-09-01

    DNA-binding with one finger (Dof) domain proteins are a multigene family of plant-specific transcription factors involved in numerous aspects of plant growth and development. Here, we report a genome-wide search for Solanum tuberosum Dof (StDof) genes and their expression profiles at various developmental stages and in response to various abiotic stresses. In addition, a complete overview of Dof gene family in potato is presented, including the gene structures, chromosomal locations, cis-regulatory elements, conserved protein domains, and phylogenetic inferences. Based on the genome-wide analysis, we identified 35 full-length protein-coding StDof genes, unevenly distributed on 10 chromosomes. Phylogenetic analysis with Dof members from diverse plant species showed that StDof genes can be classified into four subgroups (StDofI, II, III, and IV). qPCR expression analysis of StDof gene transcripts showed the distinct expression patterns of StDof genes in various potato organs, and tuber developmental stages analyzed. Many StDof genes were upregulated in response to drought, salinity, and ABA treatments. Overall, the StDof gene expression pattern and the number of over-represented cis-acting elements in the promoter regions of the StDof genes indicate that most of the StDof genes have redundant functions. The detailed genomic information and expression profiles of the StDof gene homologs in the present study provide opportunities for functional analyses to unravel the genes' exact role in plant growth and development as well as in abiotic stress tolerance.

  15. Genome-wide identification and transcriptional expression analysis of mitogen-activated protein kinase and mitogen-activated protein kinase kinase genes in Capsicum annuum

    PubMed Central

    Liu, Zhiqin; Shi, Lanping; Liu, Yanyan; Tang, Qian; Shen, Lei; Yang, Sheng; Cai, Jinsen; Yu, Huanxin; Wang, Rongzhang; Wen, Jiayu; Lin, Youquan; Hu, Jiong; Liu, Cailing; Zhang, Yangwen; Mou, Shaoliang; He, Shuilin

    2015-01-01

    The tripartite mitogen-activated protein kinase (MAPK) signaling cascades have been implicated in plant growth, development, and environment adaptation, but a comprehensive understanding of MAPK signaling at genome-wide level is limited in Capsicum annuum. Herein, genome-wide identification and transcriptional expression analysis of MAPK and MAPK kinase (MAPKK) were performed in pepper. A total of 19 pepper MAPK (CaMAPKs) genes and five MAPKK (CaMAPKKs) genes were identified. Phylogenetic analysis indicated that CaMAPKs and CaMAPKKs could be classified into four groups and each group contains similar exon-intron structures. However, significant divergences were also found. Notably, five members of the pepper MAPKK family were much less conserved than those found in Arabidopsis, and 9 Arabidopsis MAPKs did not have orthologs in pepper. Additionally, 7 MAPKs in Arabidopsis had either two or three orthologs in the pepper genome, and six pepper MAPKs and one MAPKK differing in sequence were found in three pepper varieties. Quantitative real-time RT-PCR analysis showed that the majority of MAPK and MAPKK genes were ubiquitously expressed and transcriptionally modified in pepper leaves after treatments with heat, salt, and Ralstonia solanacearum inoculation as well as exogenously applied salicylic acid, methyl jasmonate, ethephon, and abscisic acid. The MAPKK-MAPK interactome was tested by yeast two-hybrid assay, the results showed that one MAPKK might interact with multiple MAPKs, one MAPK might also interact with more than one MAPKKs, constituting MAPK signaling networks which may collaborate in transmitting upstream signals into appropriate downstream cellular responses and processes. These results will facilitate future functional characterization of MAPK cascades in pepper. PMID:26442088

  16. Genome-wide analysis reveals conserved transcriptional responses downstream of resting potential change in Xenopus embryos, axolotl regeneration, and human mesenchymal cell differentiation.

    PubMed

    Pai, Vaibhav P; Martyniuk, Christopher J; Echeverri, Karen; Sundelacruz, Sarah; Kaplan, David L; Levin, Michael

    2016-02-01

    Endogenous bioelectric signaling via changes in cellular resting potential (V mem) is a key regulator of patterning during regeneration and embryogenesis in numerous model systems. Depolarization of V mem has been functionally implicated in dedifferentiation, tumorigenesis, anatomical re-specification, and appendage regeneration. However, no unbiased analyses have been performed to understand genome-wide transcriptional responses to V mem change in vivo. Moreover, it is unknown which genes or gene networks represent conserved targets of bioelectrical signaling across different patterning contexts and species. Here, we use microarray analysis to comparatively analyze transcriptional responses to V mem depolarization. We compare the response of the transcriptome during embryogenesis (Xenopus development), regeneration (axolotl regeneration), and stem cell differentiation (human mesenchymal stem cells in culture) to identify common networks across model species that are associated with depolarization. Both subnetwork enrichment and PANTHER analyses identified a number of key genetic modules as targets of V mem change, and also revealed important (well-conserved) commonalities in bioelectric signal transduction, despite highly diverse experimental contexts and species. Depolarization regulates specific transcriptional networks across all three germ layers (ectoderm, mesoderm, and endoderm) such as cell differentiation and apoptosis, and this information will be used for developing mechanistic models of bioelectric regulation of patterning. Moreover, our analysis reveals that V mem change regulates transcripts related to important disease pathways such as cancer and neurodegeneration, which may represent novel targets for emerging electroceutical therapies.

  17. Genome-wide identification, classification, and analysis of heat shock transcription factor family in Chinese cabbage (Brassica rapa pekinensis).

    PubMed

    Huang, X Y; Tao, P; Li, B Y; Wang, W H; Yue, Z C; Lei, J L; Zhong, X M

    2015-03-27

    Chinese cabbage (Brassica rapa ssp. pekinensis) is one of the most important vegetable crops grown worldwide, and various methods exist for selection, propagation, and cultivation. The entire Chinese cabbage genome has been sequenced, and the heat shock transcription factor family (Hsfs) has been found to play a central role in plant growth and development and in the response to biotic and abiotic stress conditions, particularly in acquired thermotolerance. We analyzed heat tolerance mechanisms in Chinese cabbage. In this study, 30 Hsfs were identified from the Chinese cabbage genome database. The classification, phylogenetic reconstruction, chromosome distribution, conserved motifs, expression analysis, and interaction networks of the Hsfs were predicted and analyzed. Thirty BrHsfs were classified into 3 major classes (class A, B, and C) according to their structural characteristics and phylogenetic comparisons, and class A was further subdivided into 8 subclasses. Distribution mapping results showed that Hsf genes were located on 10 Chinese cabbage chromosomes. The expression profile indicated that Hsfs play differential roles in 5 organs in Chinese cabbage, and likely participate in the development of underground parts and regulation of reproductive growth. An orthologous gene interaction network was constructed, and included MBF1C, ROF1, TBP2, CDC2, and HSP70 5 genes, which are closely related to heat stress. Our results contribute to the understanding of the complexity of Hsfs in Chinese cabbage and provide a basis for further functional gene research.

  18. Genome-wide analysis of Dof transcription factors reveals functional characteristics during development and response to biotic stresses in pepper.

    PubMed

    Kang, Won-Hee; Kim, Seungill; Lee, Hyun-Ah; Choi, Doil; Yeom, Seon-In

    2016-09-22

    The DNA-binding with one zinc finger proteins (Dofs) are a plant-specific family of transcription factors. The Dofs are involved in a variety of biological processes such as phytohormone production, seed development, and environmental adaptation. Dofs have been previously identified in several plants, but not in pepper. We identified 33 putative Dof genes in pepper (CaDofs). To gain an overview of the CaDofs, we analyzed phylogenetic relationships, protein motifs, and evolutionary history. We divided the 33 CaDofs, containing 25 motifs, into four major groups distributed on eight chromosomes. We discovered an expansion of the CaDofs dated to a recent duplication event. Segmental duplication that occurred before the speciation of the Solanaceae lineages was predominant among the CaDofs. The global gene-expression profiling of the CaDofs by RNA-seq analysis showed distinct temporal and pathogen-specific variation during development and response to biotic stresses (two TMV strains, PepMoV, and Phytophthora capsici), suggesting functional diversity among the CaDofs. These results will provide the useful clues into the responses of Dofs in biotic stresses and promote a better understanding of their multiple function in pepper and other species.

  19. Genome-wide analysis of Dof transcription factors reveals functional characteristics during development and response to biotic stresses in pepper

    PubMed Central

    Kang, Won-Hee; Kim, Seungill; Lee, Hyun-Ah; Choi, Doil; Yeom, Seon-In

    2016-01-01

    The DNA-binding with one zinc finger proteins (Dofs) are a plant-specific family of transcription factors. The Dofs are involved in a variety of biological processes such as phytohormone production, seed development, and environmental adaptation. Dofs have been previously identified in several plants, but not in pepper. We identified 33 putative Dof genes in pepper (CaDofs). To gain an overview of the CaDofs, we analyzed phylogenetic relationships, protein motifs, and evolutionary history. We divided the 33 CaDofs, containing 25 motifs, into four major groups distributed on eight chromosomes. We discovered an expansion of the CaDofs dated to a recent duplication event. Segmental duplication that occurred before the speciation of the Solanaceae lineages was predominant among the CaDofs. The global gene-expression profiling of the CaDofs by RNA-seq analysis showed distinct temporal and pathogen-specific variation during development and response to biotic stresses (two TMV strains, PepMoV, and Phytophthora capsici), suggesting functional diversity among the CaDofs. These results will provide the useful clues into the responses of Dofs in biotic stresses and promote a better understanding of their multiple function in pepper and other species. PMID:27653666

  20. Genome-wide characterization and expression analysis enables identification of abiotic stress-responsive MYB transcription factors in cassava (Manihot esculenta).

    PubMed

    Ruan, Meng-Bin; Guo, Xin; Wang, Bin; Yang, Yi-Ling; Li, Wen-Qi; Yu, Xiao-Ling; Zhang, Peng; Peng, Ming

    2017-06-15

    The myeloblastosis (MYB) transcription factor superfamily is the largest transcription factor family in plants, playing different roles during stress response. However, abiotic stress-responsive MYB transcription factors have not been systematically studied in cassava (Manihot esculenta), an important tropical tuber root crop. In this study, we used a genome-wide transcriptome analysis to predict 299 putative MeMYB genes in the cassava genome. Under drought and cold stresses, many MeMYB genes exhibited different expression patterns in cassava leaves, indicating that these genes might play a role in abiotic stress responses. We found that several stress-responsive MeMYB genes responded to abscisic acid (ABA) in cassava leaves. We characterize four MeMYBs, namely MeMYB1, MeMYB2, MeMYB4, and MeMYB9, as R2R3-MYB transcription factors. Furthermore, RNAi-driven repression of MeMYB2 resulted in drought and cold tolerance in transgenic cassava. Gene expression assays in wild-type and MeMYB2-RNAi cassava plants revealed that MeMYB2 may affect other MeMYBs as well as MeWRKYs under drought and cold stress, suggesting crosstalk between MYB and WRKY family genes under stress conditions in cassava. © The Author 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  1. Genome-wide identification and expression analysis of SBP-like transcription factor genes in Moso Bamboo (Phyllostachys edulis).

    PubMed

    Pan, Feng; Wang, Yue; Liu, Huanglong; Wu, Min; Chu, Wenyuan; Chen, Danmei; Xiang, Yan

    2017-06-27

    The SQUAMOSA promoter binding protein-like (SPL) proteins are plant-specific transcription factors (TFs) that function in a variety of developmental processes including growth, flower development, and signal transduction. SPL proteins are encoded by a gene family, and these genes have been characterized in two model grass species, Zea mays and Oryza sativa. The SPL gene family has not been well studied in moso bamboo (Phyllostachys edulis), a woody grass species. We identified 32 putative PeSPL genes in the P. edulis genome. Phylogenetic analysis arranged the PeSPL protein sequences in eight groups. Similarly, phylogenetic analysis of the SBP-like and SBP proteins from rice and maize clustered them into eight groups analogous to those from P. edulis. Furthermore, the deduced PeSPL proteins in each group contained very similar conserved sequence motifs. Our analyses indicate that the PeSPL genes experienced a large-scale duplication event ~15 million years ago (MYA), and that divergence between the PeSPL and OsSPL genes occurred 34 MYA. The stress-response expression profiles and tissue-specificity of the putative PeSPL gene promoter regions showed that SPL genes in moso bamboo have potential biological functions in stress resistance as well as in growth and development. We therefore examined PeSPL gene expression in response to different plant hormone and drought (polyethylene glycol-6000; PEG) treatments to mimic biotic and abiotic stresses. Expression of three (PeSPL10, -12, -17), six (PeSPL1, -10, -12, -17, -20, -31), and nine (PeSPL5, -8, -9, -14, -15, -19, -20, -31, -32) genes remained relatively stable after treating with salicylic acid (SA), gibberellic acid (GA), and PEG, respectively, while the expression patterns of other genes changed. In addition, analysis of tissue-specific expression of the moso bamboo SPL genes during development showed differences in their spatiotemporal expression patterns, and many were expressed at high levels in flowers and

  2. Identification and functional prediction of stress responsive AP2/ERF transcription factors in Brassica napus by genome-wide analysis.

    PubMed

    Owji, Hajar; Hajiebrahimi, Ali; Seradj, Hassan; Hemmati, Shiva

    2017-09-14

    Using homology and domain authentication, 321 putative AP2/ERF transcription factors were identified in Brassica napus, called BnAP2/ERF TFs. BnAP2/ERF TFs were classified into five major subfamilies, including DREB, ERF, AP2, RAV, and BnSoloist. This classification is based on phylogenetic analysis, motif identification, gene structure analysis, and physiochemical characterization. These TFs were annotated based on phylogenetic relationship with Brassica rapa. BnAP2/ERF TFs were located on 19 chromosomes of B. napus. Orthologs and paralogs were identified using synteny-based methods Ks calculation within B. napus genome and between B. napus with other species such as B. rapa, Brassica oleracea, and Arabidopsis thaliana indicated that BnAP2/ERF TFs were formed through duplication events occurred before B. napus formation. Kn/Ks values were between 0 and 1, suggesting the purifying selection among BnAP2/ERF TFs. Gene ontology annotation, cis-regulatory elements and functional interaction networks suggested that BnAP2/ERF TFs participate in response to stressors, including drought, high salinity, heat and cold as well as developmental processes particularly organ specification and embryogenesis. The identified cis-regulatory elements in the upstream of BnAP2/ERF TFs were responsive to abscisic acid. Analysis of the expression data derived from Illumina Hiseq 2000 RNA sequencing revealed that BnAP2/ERF genes were highly expressed in the roots comparing to flower buds, leaves, and stems. Also, the ERF subfamily was over-expressed under salt and fungal treatments. BnERF039 and BnERF245 are candidates for salt-tolerant B. napus. BnERF253-256 and BnERF260-277 are potential cytokinin response factors. BnERF227, BnERF228, BnERF234, BnERF134, BnERF132, BnERF176, and BnERF235 were suggested for resistance against Leptosphaeria maculan and Leptosphaeria biglobosa. Copyright © 2017 Elsevier Ltd. All rights reserved.

  3. Genome-wide analysis of expression modes and DNA methylation status at sense-antisense transcript loci in mouse.

    PubMed

    Watanabe, Yutaka; Numata, Koji; Murata, Shinya; Osada, Yuko; Saito, Rintaro; Nakaoka, Hajime; Yamamoto, Naoyuki; Watanabe, Kazufumi; Kato, Hidemasa; Abe, Kuniya; Kiyosawa, Hidenori

    2010-12-01

    The functionality of sense-antisense transcripts (SATs), although widespread throughout the mammalian genome, is largely unknown. Here, we analyzed the SATs expression and its associated promoter DNA methylation status by surveying 12 tissues of mice to gain insights into the relationship between expression and DNA methylation of SATs. We have found that sense and antisense expression positively correlate in most tissues. However, in some SATs with tissue-specific expression, the expression level of a transcript from a CpG island-bearing promoter is low when the promoter DNA methylation is present. In these circumstances, the expression level of its opposite-strand transcript, especially when it is poly(A)-negative was coincidentally higher. These observations suggest that, albeit the general tendency of sense-antisense simultaneous expression, some antisense transcripts have coordinated expression with its counterpart sense gene promoter methylation. This cross-strand relationship is not a privilege of imprinted genes but seems to occur widely in SATs.

  4. RNA-Seq analysis of stuA mutants in Fusarium verticillioides indicates dramatic genomic wide transcriptional reprogramming

    USDA-ARS?s Scientific Manuscript database

    StuA, first discovered in Aspergillus nidulans and a member of the APSES class of transcription factors, regulates several essential developmental stages in fungi such as virulence, sporulation and toxin production in phytopathogenic fungi. Fusarium verticillioides (Fv), a maize phytopathogen, produ...

  5. Genome-Wide Analysis of Hydrogen Peroxide-Regulated Gene Expression in Arabidopsis Reveals a High Light-Induced Transcriptional Cluster Involved in Anthocyanin Biosynthesis1[w

    PubMed Central

    Vanderauwera, Sandy; Zimmermann, Philip; Rombauts, Stéphane; Vandenabeele, Steven; Langebartels, Christian; Gruissem, Wilhelm; Inzé, Dirk; Van Breusegem, Frank

    2005-01-01

    In plants, reactive oxygen species and, more particularly, hydrogen peroxide (H2O2) play a dual role as toxic by-products of normal cell metabolism and as regulatory molecules in stress perception and signal transduction. Peroxisomal catalases are an important sink for photorespiratory H2O2. Using ATH1 Affymetrix microarrays, expression profiles were compared between control and catalase-deficient Arabidopsis (Arabidopsis thaliana) plants. Reduced catalase levels already provoked differences in nuclear gene expression under ambient growth conditions, and these effects were amplified by high light exposure in a sun simulator for 3 and 8 h. This genome-wide expression analysis allowed us to reveal the expression characteristics of complete pathways and functional categories during H2O2 stress. In total, 349 transcripts were significantly up-regulated by high light in catalase-deficient plants and 88 were down-regulated. From this data set, H2O2 was inferred to play a key role in the transcriptional up-regulation of small heat shock proteins during high light stress. In addition, several transcription factors and candidate regulatory genes involved in H2O2 transcriptional gene networks were identified. Comparisons with other publicly available transcriptome data sets of abiotically stressed Arabidopsis revealed an important intersection with H2O2-deregulated genes, positioning elevated H2O2 levels as an important signal within abiotic stress-induced gene expression. Finally, analysis of transcriptional changes in a combination of a genetic (catalase deficiency) and an environmental (high light) perturbation identified a transcriptional cluster that was strongly and rapidly induced by high light in control plants, but impaired in catalase-deficient plants. This cluster comprises the complete known anthocyanin regulatory and biosynthetic pathway, together with genes encoding unknown proteins. PMID:16183842

  6. Evolutionary conservation of otd/Otx2 transcription factor action: a genome-wide microarray analysis in Drosophila

    PubMed Central

    Montalta-He, Haiqiong; Leemans, Ronny; Loop, Thomas; Strahm, Martin; Certa, Ulrich; Primig, Michael; Acampora, Dario; Simeone, Antonio; Reichert, Heinrich

    2002-01-01

    Background Homeobox genes of the orthodenticle (otd)/Otx family have conserved roles in the embryogenesis of head and brain. Gene replacement experiments show that the Drosophila otd gene and orthologous mammalian Otx genes are functionally equivalent, in that overexpression of either gene in null mutants of Drosophila or mouse can restore defects in cephalic and brain development. This suggests that otd and Otx genes control a comparable subset of downstream target genes in either organism. Here we use quantitative transcript imaging to analyze this equivalence of otd and Otx gene action at a genomic level. Results Oligonucleotide arrays representing 13,400 annotated Drosophila genes were used to study differential gene expression in flies in which either the Drosophila otd gene or the human Otx2 gene was overexpressed. Two hundred and eighty-seven identified transcripts showed highly significant changes in expression levels in response to otd overexpression, and 682 identified transcripts showed highly significant changes in expression levels in response to Otx2 overexpression. Among these, 93 showed differential expression changes following overexpression of either otd or Otx2, and for 90 of these, comparable changes were observed under both experimental conditions. We postulate that these transcripts are common downstream targets of the fly otd gene and the human Otx2 gene in Drosophila. Conclusion Our experiments indicate that approximately one third of the otd-regulated transcripts also respond to overexpression of the human Otx2 gene in Drosophila. These common otd/Otx2 downstream genes are likely to represent the molecular basis of the functional equivalence of otd and Otx2 gene action in Drosophila. PMID:11983056

  7. ChIP on chip and ChIP-Seq assays: genome-wide analysis of transcription factor binding and histone modifications.

    PubMed

    Pillai, Smitha; Chellappan, Srikumar P

    2015-01-01

    Deregulation of transcriptional activity of many genes has been causatively linked to human diseases including cancer. Altered patterns of gene expression in normal and cancer cells are the result of inappropriate expression of transcription factors and chromatin modifying proteins. Chromatin immunoprecipitation assay is a well-established tool for investigating the interactions between regulatory proteins and DNA at distinct stages of gene activation. ChIP coupled with DNA microarrays, known as ChIP on chip, or sequencing of DNA associated with the factors (ChIP-Seq) allow us to determine the entire spectrum of in vivo DNA binding sites for a given protein. This has been of immense value because ChIP on chip assays and ChIP-Seq experiments can provide a snapshot of the transcriptional regulatory mechanisms on a genome-wide scale. This chapter outlines the general strategies used to carry out ChIP-chip assays to study the differential recruitment of regulatory molecules based on the studies conducted in our lab as well as other published protocols; these can be easily modified to a ChIP-Seq analysis.

  8. Genome-wide identification and analysis of the basic leucine zipper (bZIP) transcription factor gene family in Ustilaginoidea virens.

    PubMed

    Yin, Weixiao; Cui, Peng; Wei, Wei; Lin, Yang; Luo, Chaoxi

    2017-08-25

    The basic leucine zipper (bZIP) transcription factor (TF) family is one of the largest and most diverse TF families widely distributed across the eukaryotes. The bZIP TF family play important roles in growth, development, and response to abiotic or biotic stresses, which have been well characterized in plants, but not in plant pathogenic fungi. In this study, we performed genome-wide and systematic bioinformatics analysis of bZIP genes in the fungus Ustilaginoidea virens, the causal agent of rice false smut disease. We identified 28 bZIP family members in the U. virens genome by searching for the bZIP domain in predicted genes. The gene structures, motifs and phylogenetic relationships were analyzed for bZIP genes in U. virens (UvbZIPs). Together with bZIPs from other two fungi, the bZIP genes can be divided into eight groups according to the phylogenetic relationship. Based on RNA-Seq data, the expression profiles of UvbZIPs at different infection stages were evaluated. Results showed that 17 of the UvbZIPs were up-regulated expression during the infection period. Furthermore, 11 infection-related UvbZIPs was investigated under H2O2 stress and the expression level of eight genes have changed, which confirmed their roles in stress tolerance and pathogenicity. In summary, our genome wide systematic characterization and expression analysis of UvbZIPs provided insights into the molecular functions of these genes in U. virens and provides a reference for other pathogens.

  9. Genome-wide Expression Analysis and Metabolite Profiling Elucidate Transcriptional Regulation of Flavonoid Biosynthesis and Modulation under Abiotic Stresses in Banana

    PubMed Central

    Pandey, Ashutosh; Alok, Anshu; Lakhwani, Deepika; Singh, Jagdeep; Asif, Mehar H.; Trivedi, Prabodh K.

    2016-01-01

    Flavonoid biosynthesis is largely regulated at the transcriptional level due to the modulated expression of genes related to the phenylpropanoid pathway in plants. Although accumulation of different flavonoids has been reported in banana, a staple fruit crop, no detailed information is available on regulation of the biosynthesis in this important plant. We carried out genome-wide analysis of banana (Musa acuminata, AAA genome) and identified 28 genes belonging to 9 gene families associated with flavonoid biosynthesis. Expression analysis suggested spatial and temporal regulation of the identified genes in different tissues of banana. Analysis revealed enhanced expression of genes related to flavonol and proanthocyanidin (PA) biosynthesis in peel and pulp at the early developmental stages of fruit. Genes involved in anthocyanin biosynthesis were highly expressed during banana fruit ripening. In general, higher accumulation of metabolites was observed in the peel as compared to pulp tissue. A correlation between expression of genes and metabolite content was observed at the early stage of fruit development. Furthermore, this study also suggests regulation of flavonoid biosynthesis, at transcriptional level, under light and dark exposures as well as methyl jasmonate (MJ) treatment in banana. PMID:27539368

  10. Identification of differentially expressed genes associated with flower color in peach using genome-wide transcriptional analysis.

    PubMed

    Zhou, Y; Wu, X X; Zhang, Z; Gao, Z H

    2015-05-11

    Flower color is an important trait of the ornamental peach (Prunus persica L.). However, the mechanism responsible for the different colors that appear in the same genotype remains unclear. In this study, red samples showed higher anthocyanins content (0.122 ± 0.009 mg/g), which was significantly different from that in white samples (0.066 ± 0.010 mg/g). Similarly to carotenoids content, red extract (0.058 ± 0.004 mg/L) was significantly higher in white extract (0.015 ± 0.004 mg/L). We estimated gene expression using Illumina sequencing technology in libraries from white and red flower buds. A total of 3,599,960 and 3,464,141 tags were sequenced from the 2 libraries, respectively. Moreover, we identified 106 significantly differentially expressed genes between the 2 libraries. Among these, 78 and 28 represented transcripts with a higher or lower abundance of more than 2-fold than in the white flower library, respectively. GO annotation indicated that highly ranked genes were involved in the pigment biosynthetic process. Expression patterns of 11 genes were verified using quantitative reverse transcription-polymerase chain reaction assays. The results suggest that hydroxycinnamoyl-coenzyme A shikimate/quinate hydroxycinnamoyltransferase, 2-oxoglutarate-dependent dioxygenase, isoflavone reductase, riboflavin kinase, zeta-carotene desaturase, and ATP binding cassette transporter may be associated with the flower color formation. Our results may be useful for scientists focusing on Prunus persica floral development and biotechnology.

  11. Integrative analysis of SF-1 transcription factor dosage impact on genome-wide binding and gene expression regulation

    PubMed Central

    Doghman, Mabrouka; Figueiredo, Bonald C.; Volante, Marco; Papotti, Mauro; Lalli, Enzo

    2013-01-01

    Steroidogenic Factor-1 (SF-1) is a nuclear receptor that has a pivotal role in the development of adrenal glands and gonads and in the control of steroid hormone production, being also implicated in the pathogenesis of adrenocortical tumors. We have analyzed the mechanisms how SF-1 controls gene expression in adrenocortical cells and showed that it regulates different categories of genes according to its dosage. Significant correlations exist between the localization of SF-1-binding sites in chromatin under different dosage conditions and dosage-dependent regulation of gene expression. Our study revealed unexpected functional interactions between SF-1 and Neuron-Restrictive Silencer Factor/RE1-Silencing Transcription Factor (NRSF/REST), which was first characterized as a repressor of neuronal gene expression in non-neuronal tissues, in the regulation of gene expression in steroidogenic cells. When overexpressed, SF-1 reshapes the repertoire of NRSF/REST—regulated genes, relieving repression of key steroidogenic genes. These data show that NRSF/REST has a novel function in regulating gene expression in steroidogenic cells and suggest that it may have a broad role in regulating tissue-specific gene expression programs. PMID:23907384

  12. Integrative analysis of SF-1 transcription factor dosage impact on genome-wide binding and gene expression regulation.

    PubMed

    Doghman, Mabrouka; Figueiredo, Bonald C; Volante, Marco; Papotti, Mauro; Lalli, Enzo

    2013-10-01

    Steroidogenic Factor-1 (SF-1) is a nuclear receptor that has a pivotal role in the development of adrenal glands and gonads and in the control of steroid hormone production, being also implicated in the pathogenesis of adrenocortical tumors. We have analyzed the mechanisms how SF-1 controls gene expression in adrenocortical cells and showed that it regulates different categories of genes according to its dosage. Significant correlations exist between the localization of SF-1-binding sites in chromatin under different dosage conditions and dosage-dependent regulation of gene expression. Our study revealed unexpected functional interactions between SF-1 and Neuron-Restrictive Silencer Factor/RE1-Silencing Transcription Factor (NRSF/REST), which was first characterized as a repressor of neuronal gene expression in non-neuronal tissues, in the regulation of gene expression in steroidogenic cells. When overexpressed, SF-1 reshapes the repertoire of NRSF/REST-regulated genes, relieving repression of key steroidogenic genes. These data show that NRSF/REST has a novel function in regulating gene expression in steroidogenic cells and suggest that it may have a broad role in regulating tissue-specific gene expression programs.

  13. A genome-wide identification and analysis of basic helix-loop-helix transcription factors in cattle.

    PubMed

    Zhang, Debao; Li, Guanying; Wang, Yong

    2017-08-30

    Basic helix-loop-helix (BHLH) transcription factors comprise a large family of regulatory proteins and play critical roles in the developmental processes of higher organisms. Complete lists of BHLH family members have been identified in about 50 organisms, including fruit fly, zebrafish, mouse, giant panda, worm, yeast, rice and apple. Cattle, Bos taurus, is important for agriculture and animal nutrition, and is also a good model organism for health research. In the present study, 116 putative BHLHs were identified in the cattle genome. Phylogenetic analyses revealed that 111 Bos taurus BHLH (BtBHLH: Bos taurus BHLH) members belong to 44 families, with 48, 26, 16, 4, 13 and 4 members in group A, B, C, D, E and F respectively, and the remaining 5 BtBHLHs are orphan members. All of them were named and assigned into the corresponding BHLH families based on acceptable bootstrap values from in-group phylogenetic analyses with orthologous BHLHs from mouse and other mammalian species. A comparison between annotations deposited in the GenBank and KEGG databases with our analyses indicated that the annotations of 2 of the 116 BtBHLH members were inconsistent with our analytical results. Microarray evidence and expressed sequence tags of only 14 BtBHLH genes was now not available. Chromosomal locations of the BtBHLHs showed that the distribution of the BtBHLHs was uneven and some genes, e.g., BtOligo, BtHes and BtMyf6, may arise from gene duplication. The test of positive selection showed episodic positive selection occurs only in 5 families among the studied mammalian BHLHs. These results provide a solid basis for further studies on BHLH protein regulation of key growth and developmental processes. Copyright © 2017 Elsevier B.V. All rights reserved.

  14. Genome-wide location analysis reveals an important overlap between the targets of the yeast transcriptional regulators Rds2 and Adr1.

    PubMed

    Soontorngun, Nitnipa; Baramee, Sirilak; Tangsombatvichit, Chalinee; Thepnok, Piyasuda; Cheevadhanarak, Supapon; Robert, François; Turcotte, Bernard

    2012-07-13

    Upon glucose depletion, a massive reprogramming of gene expression occurs in the yeast Saccharomyces cerevisiae for the use of alternate carbon sources such as the nonfermentable compounds ethanol and glycerol. This process is mediated by the master kinase Snf1 that controls the activity of various targets including the transcriptional regulators Cat8, Sip4 and Adr1. We have recently identified Rds2 as an additional player in this pathway. Here, we have performed genome-wide location analysis of Rds2 in cells grown in the presence of glycerol. We show that Rds2 binds to promoters of genes involved in gluconeogenesis, the glyoxylate shunt, and the TCA cycle as well as some genes encoding mitochondrial components or some involved in the stress response. Interestingly, we also detected Rds2 at the promoters of SIP4, ADR1 and HAP4 which encodes the limiting subunit of the Hap2/3/4/5 complex, a regulator of respiration. Strikingly, we observed an important overlap between the targets of Rds2 and Adr1. Finally, we provide a model to account for the complex interplay among these transcriptional regulators.

  15. Genome-wide transcriptional analysis suggests hydrogenase- and nitrogenase-mediated hydrogen production in Clostridium butyricum CWBI 1009.

    PubMed

    Calusinska, Magdalena; Hamilton, Christopher; Monsieurs, Pieter; Mathy, Gregory; Leys, Natalie; Franck, Fabrice; Joris, Bernard; Thonart, Philippe; Hiligsmann, Serge; Wilmotte, Annick

    2015-01-01

    Molecular hydrogen, given its pollution-free combustion, has great potential to replace fossil fuels in future transportation and energy production. However, current industrial hydrogen production processes, such as steam reforming of methane, contribute significantly to the greenhouse effect. Therefore alternative methods, in particular the use of fermentative microorganisms, have attracted scientific interest in recent years. However the low overall yield obtained is a major challenge in biological H2 production. Thus, a thorough and detailed understanding of the relationships between genome content, gene expression patterns, pathway utilisation and metabolite synthesis is required to optimise the yield of biohydrogen production pathways. In this study transcriptomic and proteomic analyses of the hydrogen-producing bacterium Clostridium butyricum CWBI 1009 were carried out to provide a biomolecular overview of the changes that occur when the metabolism shifts to H2 production. The growth, H2-production, and glucose-fermentation profiles were monitored in 20 L batch bioreactors under unregulated-pH and fixed-pH conditions (pH 7.3 and 5.2). Conspicuous differences were observed in the bioreactor performances and cellular metabolisms for all the tested metabolites, and they were pH dependent. During unregulated-pH glucose fermentation increased H2 production was associated with concurrent strong up-regulation of the nitrogenase coding genes. However, no such concurrent up-regulation of the [FeFe] hydrogenase genes was observed. During the fixed pH 5.2 fermentation, by contrast, the expression levels for the [FeFe] hydrogenase coding genes were higher than during the unregulated-pH fermentation, while the nitrogenase transcripts were less abundant. The overall results suggest, for the first time, that environmental factors may determine whether H2 production in C. butyricum CWBI 1009 is mediated by the hydrogenases and/or the nitrogenase. This work, contributing to

  16. Genome-wide analysis of AR binding and comparison with transcript expression in primary human fetal prostate fibroblasts and cancer associated fibroblasts.

    PubMed

    Nash, Claire; Boufaied, Nadia; Mills, Ian G; Franco, Omar E; Hayward, Simon W; Thomson, Axel A

    2017-05-05

    The androgen receptor (AR) is a transcription factor, and key regulator of prostate development and cancer, which has discrete functions in stromal versus epithelial cells. AR expressed in mesenchyme is necessary and sufficient for prostate development while loss of stromal AR is predictive of prostate cancer progression. Many studies have characterized genome-wide binding of AR in prostate tumour cells but none have used primary mesenchyme or stroma. We applied ChIPseq to identify genomic AR binding sites in primary human fetal prostate fibroblasts and patient derived cancer associated fibroblasts, as well as the WPMY1 cell line overexpressing AR. We identified AR binding sites that were specific to fetal prostate fibroblasts (7534), cancer fibroblasts (629), WPMY1-AR (2561) as well as those common among all (783). Primary fibroblasts had a distinct AR binding profile versus prostate cancer cell lines and tissue, and showed a localisation to gene promoter binding sites 1 kb upstream of the transcriptional start site, as well as non-classical AR binding sequence motifs. We used RNAseq to define transcribed genes associated with AR binding sites and derived cistromes for embryonic and cancer fibroblasts as well as a cistrome common to both. These were compared to several in vivo ChIPseq and transcript expression datasets; which identified subsets of AR targets that were expressed in vivo and regulated by androgens. This analysis enabled us to deconvolute stromal AR targets active in stroma within tumour samples. Taken together, our data suggest that the AR shows significantly different genomic binding site locations in primary prostate fibroblasts compared to that observed in tumour cells. Validation of our AR binding site data with transcript expression in vitro and in vivo suggests that the AR target genes we have identified in primary fibroblasts may contribute to clinically significant and biologically important AR-regulated changes in prostate tissue

  17. Genome Wide Analysis of the Apple MYB Transcription Factor Family Allows the Identification of MdoMYB121 Gene Confering Abiotic Stress Tolerance in Plants

    PubMed Central

    Wang, Rong-Kai; Zhang, Rui-Fen; Hao, Yu-Jin

    2013-01-01

    The MYB proteins comprise one of the largest families of transcription factors (TFs) in plants. Although several MYB genes have been characterized to play roles in secondary metabolism, the MYB family has not yet been identified in apple. In this study, 229 apple MYB genes were identified through a genome-wide analysis and divided into 45 subgroups. A computational analysis was conducted using the apple genomic database to yield a complete overview of the MYB family, including the intron-exon organizations, the sequence features of the MYB DNA-binding domains, the carboxy-terminal motifs, and the chromosomal locations. Subsequently, the expression of 18 MYB genes, including 12 were chosen from stress-related subgroups, while another 6 ones from other subgroups, in response to various abiotic stresses was examined. It was found that several of these MYB genes, particularly MdoMYB121, were induced by multiple stresses. The MdoMYB121 was then further functionally characterized. Its predicted protein was found to be localized in the nucleus. A transgenic analysis indicated that the overexpression of the MdoMYB121 gene remarkably enhanced the tolerance to high salinity, drought, and cold stresses in transgenic tomato and apple plants. Our results indicate that the MYB genes are highly conserved in plant species and that MdoMYB121 can be used as a target gene in genetic engineering approaches to improve the tolerance of plants to multiple abiotic stresses. PMID:23950843

  18. A Genome-Wide Identification and Analysis of the Basic Helix-Loop-Helix Transcription Factors in Brown Planthopper, Nilaparvata lugens

    PubMed Central

    Wan, Pin-Jun; Yuan, San-Yue; Wang, Wei-Xia; Chen, Xu; Lai, Feng-Xiang; Fu, Qiang

    2016-01-01

    The basic helix-loop-helix (bHLH) transcription factors in insects play essential roles in multiple developmental processes including neurogenesis, sterol metabolism, circadian rhythms, organogenesis and formation of olfactory sensory neurons. The identification and function analysis of bHLH family members of the most destructive insect pest of rice, Nilaparvata lugens, may provide novel tools for pest management. Here, a genome-wide survey for bHLH sequences identified 60 bHLH sequences (NlbHLHs) encoded in the draft genome of N. lugens. Phylogenetic analysis of the bHLH domains successfully classified these genes into 40 bHLH families in group A (25), B (14), C (10), D (1), E (8) and F (2). The number of NlbHLHs with introns is higher than many other insect species, and the average intron length is shorter than those of Acyrthosiphon pisum. High number of ortholog families of NlbHLHs was found suggesting functional conversation for these proteins. Compared to other insect species studied, N. lugens has the highest number of bHLH members. Furthermore, gene duplication events of SREBP, Kn(col), Tap, Delilah, Sim, Ato and Crp were found in N. lugens. In addition, a putative full set of NlbHLH genes is defined and compared with another insect species. Thus, our classification of these NlbHLH members provides a platform for further investigations of bHLH protein functions in the regulation of N. lugens, and of insects in general. PMID:27869716

  19. Genome-wide analysis correlates Ayurveda Prakriti

    PubMed Central

    Govindaraj, Periyasamy; Nizamuddin, Sheikh; Sharath, Anugula; Jyothi, Vuskamalla; Rotti, Harish; Raval, Ritu; Nayak, Jayakrishna; Bhat, Balakrishna K.; Prasanna, B. V.; Shintre, Pooja; Sule, Mayura; Joshi, Kalpana S.; Dedge, Amrish P.; Bharadwaj, Ramachandra; Gangadharan, G. G.; Nair, Sreekumaran; Gopinath, Puthiya M.; Patwardhan, Bhushan; Kondaiah, Paturu; Satyamoorthy, Kapaettu; Valiathan, Marthanda Varma Sankaran; Thangaraj, Kumarasamy

    2015-01-01

    The practice of Ayurveda, the traditional medicine of India, is based on the concept of three major constitutional types (Vata, Pitta and Kapha) defined as “Prakriti”. To the best of our knowledge, no study has convincingly correlated genomic variations with the classification of Prakriti. In the present study, we performed genome-wide SNP (single nucleotide polymorphism) analysis (Affymetrix, 6.0) of 262 well-classified male individuals (after screening 3416 subjects) belonging to three Prakritis. We found 52 SNPs (p ≤ 1 × 10−5) were significantly different between Prakritis, without any confounding effect of stratification, after 106 permutations. Principal component analysis (PCA) of these SNPs classified 262 individuals into their respective groups (Vata, Pitta and Kapha) irrespective of their ancestry, which represent its power in categorization. We further validated our finding with 297 Indian population samples with known ancestry. Subsequently, we found that PGM1 correlates with phenotype of Pitta as described in the ancient text of Caraka Samhita, suggesting that the phenotypic classification of India’s traditional medicine has a genetic basis; and its Prakriti-based practice in vogue for many centuries resonates with personalized medicine. PMID:26511157

  20. Genome-wide analysis of histone methylation reveals chromatin state-based complex regulation of differential gene transcription and function of CD8 memory T cells

    PubMed Central

    Araki, Yasuto; Wang, Zhibin; Zang, Chongzhi; Wood, William H.; Schones, Dustin; Cui, Kairong; Roh, Tae-Young; Lhotsky, Brad; Wersto, Robert P.; Peng, Weiqun; Becker, Kevin G.; Zhao, Keji; Weng, Nan-ping

    2009-01-01

    Summary Memory lymphocytes are characterized by their ability to exhibit a rapid response to the recall antigen, in which differential transcription plays a significant role, yet the underlying mechanism is not understood. We report here a genome-wide analysis of histone methylation on two histone H3 lysine residues (H3K4me3 and H3K27me3) and gene expression profiles in naïve and memory CD8 T cells. We found that a general correlation exists between the levels of gene expression and the levels of H3K4me3 (positive correlation) and H3K27me3 (negative correlation) across the gene body. These correlations display four distinct modes: repressive, active, poised, and bivalent, reflecting different functions of these genes. Furthermore, a permissive chromatin state of each gene is established by a combination of different histone modifications. Our findings reveal a complex regulation by histone methylation in differential gene expression and suggest that histone methylation may be responsible for memory CD8 T cell function. PMID:19523850

  1. Genome-wide transcriptional analysis of apoptosis-related genes and pathways regulated by H2AX in lung cancer A549 cells.

    PubMed

    Lu, Chengrong; Xiong, Min; Luo, Yuan; Li, Jing; Zhang, Yanjun; Dong, Yaqiong; Zhu, Yanjun; Niu, Tianhui; Wang, Zhe; Duan, Lianning

    2013-09-01

    Histone H2AX is a novel tumor suppressor protein and plays an important role in apoptosis of cancer cells. However, the role of H2AX in lung cancer cells is unclear. The detailed mechanism and epigenetic regulation by H2AX remain elusive in cancer cells. We showed that H2AX was involved in apoptosis of lung cancer A549 cells as in other tumor cells. Knockdown of H2AX strongly suppressed apoptosis of A549 cells. We clarified the molecular mechanisms of apoptosis regulated by H2AX based on genome-wide transcriptional analysis. Microarray data analysis demonstrated that H2AX knockdown in A549 cells affected expression of 3,461 genes, including upregulation of 1,435 and downregulation of 2,026. These differentially expressed genes were subjected to bioinformatic analysis for exploring biological processes regulated by H2AX in lung cancer cells. Gene ontology analysis showed that H2AX affected expression of many genes, through which, many important functions including response to stimuli, gene expression, and apoptosis were involved in apoptotic regulation of lung cancer cells. Pathway analysis identified the mitogen-activated protein kinase signaling pathway and apoptosis as the most important pathways targeted by H2AX. Signal transduction pathway networks analysis and chromatin immunoprecipitation assay showed that two core genes, NFKB1 and JUN, were involved in apoptosis regulated by H2AX in lung cancer cells. Taken together, these data provide compelling clues for further exploration of H2AX function in cancer cells.

  2. Genome-wide analysis of N1ICD/RBPJ targets in vivo reveals direct transcriptional regulation of Wnt, SHH, and hippo pathway effectors by Notch1.

    PubMed

    Li, Yaochen; Hibbs, Matthew Aaron; Gard, Ashley Lauren; Shylo, Natalia Aliakseeuna; Yun, Kyuson

    2012-04-01

    The Notch pathway plays a pivotal role in regulating cell fate decisions in many stem cell systems. However, the full repertoire of Notch target genes in vivo and the mechanisms through which this pathway activity is integrated with other signaling pathways are largely unknown. Here, we report a transgenic mouse in which the activation of the Notch pathway massively expands the neural stem cell (NSC) pool in a cell context-dependent manner. Using this in vivo system, we identify direct targets of RBPJ/N1ICD in cortical NSCs at a genome-wide level through combined ChIP-Seq and transcriptome analyses. Through a highly conservative analysis of these datasets, we identified 98 genes that are directly regulated by N1ICD/RPBJ in vivo. These include many transcription factors that are known to be critical for NSC self-renewal (Sox2, Pax6, Tlx, and Id4) and the transcriptional effectors of the Wnt, SHH, and Hippo pathways, TCF4, Gli2, Gli3, Yap1, and Tead2. Since little is known about the function of the Hippo-Yap pathway in NSCs, we analyzed Yap1 expression and function in NSCs. We show that Yap1 expression is restricted to the stem cell compartment in the developing forebrain and that its expression is sufficient to rescue Notch pathway inhibition in NSC self-renewal assays. Together, results of this study reveal a previously underappreciated complexity and breadth of Notch1 targets in vivo and show direct interaction between Notch and Hippo-Yap pathways in NSCs.

  3. Heterosis in early maize ear inflorescence development: a genome-wide transcription analysis for two maize inbred lines and their hybrid.

    PubMed

    Ding, Haiping; Qin, Cheng; Luo, Xirong; Li, Lujiang; Chen, Zhe; Liu, Hongjun; Gao, Jian; Lin, Haijian; Shen, Yaou; Zhao, Maojun; Lübberstedt, Thomas; Zhang, Zhiming; Pan, Guangtang

    2014-08-11

    Heterosis, or hybrid vigor, contributes to superior agronomic performance of hybrids compared to their inbred parents. Despite its importance, little is known about the genetic and molecular basis of heterosis. Early maize ear inflorescences formation affects grain yield, and are thus an excellent model for molecular mechanisms involved in heterosis. To determine the parental contributions and their regulation during maize ear-development-genesis, we analyzed genome-wide digital gene expression profiles in two maize elite inbred lines (B73 and Mo17) and their F1 hybrid using deep sequencing technology. Our analysis revealed 17,128 genes expressed in these three genotypes and 22,789 genes expressed collectively in the present study. Approximately 38% of the genes were differentially expressed in early maize ear inflorescences from heterotic cross, including many transcription factor genes and some presence/absence variations (PAVs) genes, and exhibited multiple modes of gene action. These different genes showing differential expression patterns were mainly enriched in five cellular component categories (organelle, cell, cell part, organelle part and macromolecular complex), five molecular function categories (structural molecule activity, binding, transporter activity, nucleic acid binding transcription factor activity and catalytic activity), and eight biological process categories (cellular process, metabolic process, biological regulation, regulation of biological process, establishment of localization, cellular component organization or biogenesis, response to stimulus and localization). Additionally, a significant number of genes were expressed in only one inbred line or absent in both inbred lines. Comparison of the differences of modes of gene action between previous studies and the present study revealed only a small number of different genes had the same modes of gene action in both maize seedlings and ear inflorescences. This might be an indication that in

  4. Genome-wide analysis and identification of stress-responsive genes of the NAM-ATAF1,2-CUC2 transcription factor family in apple.

    PubMed

    Su, Hongyan; Zhang, Shizhong; Yuan, Xiaowei; Chen, Changtian; Wang, Xiao-Fei; Hao, Yu-Jin

    2013-10-01

    NAC (NAM, ATAF1,2, and CUC2) proteins constitute one of the largest families of plant-specific transcription factors. To date, little is known about the NAC genes in the apple (Malus domestica). In this study, a total of 180 NAC genes were identified in the apple genome and were phylogenetically clustered into six groups (I-VI) with the NAC genes from Arabidopsis and rice. The predicted apple NAC genes were distributed across all of 17 chromosomes at various densities. Additionally, the gene structure and motif compositions of the apple NAC genes were analyzed. Moreover, the expression of 29 selected apple NAC genes was analyzed in different tissues and under different abiotic stress conditions. All of the selected genes, with the exception of four genes, were expressed in at least one of the tissues tested, which indicates that the NAC genes are involved in various aspects of the physiological and developmental processes of the apple. Encouragingly, 17 of the selected genes were found to respond to one or more of the abiotic stress treatments, and these 17 genes included not only the expected 7 genes that were clustered with the well-known stress-related marker genes in group IV but also 10 genes located in other subgroups, none of which contains members that have been reported to be stress-related. To the best of our knowledge, this report describes the first genome-wide analysis of the apple NAC gene family, and the results should provide valuable information for understanding the classification and putative functions of this family.

  5. Genome-wide binding analysis of the transcription activator ideal plant architecture1 reveals a complex network regulating rice plant architecture.

    PubMed

    Lu, Zefu; Yu, Hong; Xiong, Guosheng; Wang, Jing; Jiao, Yongqing; Liu, Guifu; Jing, Yanhui; Meng, Xiangbing; Hu, Xingming; Qian, Qian; Fu, Xiangdong; Wang, Yonghong; Li, Jiayang

    2013-10-01

    Ideal plant architecture1 (IPA1) is critical in regulating rice (Oryza sativa) plant architecture and substantially enhances grain yield. To elucidate its molecular basis, we first confirmed IPA1 as a functional transcription activator and then identified 1067 and 2185 genes associated with IPA1 binding sites in shoot apices and young panicles, respectively, through chromatin immunoprecipitation sequencing assays. The Squamosa promoter binding protein-box direct binding core motif GTAC was highly enriched in IPA1 binding peaks; interestingly, a previously uncharacterized indirect binding motif TGGGCC/T was found to be significantly enriched through the interaction of IPA1 with proliferating cell nuclear antigen promoter binding factor1 or promoter binding factor2. Genome-wide expression profiling by RNA sequencing revealed IPA1 roles in diverse pathways. Moreover, our results demonstrated that IPA1 could directly bind to the promoter of rice teosinte branched1, a negative regulator of tiller bud outgrowth, to suppress rice tillering, and directly and positively regulate dense and erect panicle1, an important gene regulating panicle architecture, to influence plant height and panicle length. The elucidation of target genes of IPA1 genome-wide will contribute to understanding the molecular mechanisms underlying plant architecture and to facilitating the breeding of elite varieties with ideal plant architecture.

  6. Genome-Wide Binding Analysis of the Transcription Activator IDEAL PLANT ARCHITECTURE1 Reveals a Complex Network Regulating Rice Plant Architecture[W

    PubMed Central

    Lu, Zefu; Yu, Hong; Xiong, Guosheng; Wang, Jing; Jiao, Yongqing; Liu, Guifu; Jing, Yanhui; Meng, Xiangbing; Hu, Xingming; Qian, Qian; Fu, Xiangdong; Wang, Yonghong; Li, Jiayang

    2013-01-01

    IDEAL PLANT ARCHITECTURE1 (IPA1) is critical in regulating rice (Oryza sativa) plant architecture and substantially enhances grain yield. To elucidate its molecular basis, we first confirmed IPA1 as a functional transcription activator and then identified 1067 and 2185 genes associated with IPA1 binding sites in shoot apices and young panicles, respectively, through chromatin immunoprecipitation sequencing assays. The SQUAMOSA PROMOTER BINDING PROTEIN-box direct binding core motif GTAC was highly enriched in IPA1 binding peaks; interestingly, a previously uncharacterized indirect binding motif TGGGCC/T was found to be significantly enriched through the interaction of IPA1 with proliferating cell nuclear antigen PROMOTER BINDING FACTOR1 or PROMOTER BINDING FACTOR2. Genome-wide expression profiling by RNA sequencing revealed IPA1 roles in diverse pathways. Moreover, our results demonstrated that IPA1 could directly bind to the promoter of rice TEOSINTE BRANCHED1, a negative regulator of tiller bud outgrowth, to suppress rice tillering, and directly and positively regulate DENSE AND ERECT PANICLE1, an important gene regulating panicle architecture, to influence plant height and panicle length. The elucidation of target genes of IPA1 genome-wide will contribute to understanding the molecular mechanisms underlying plant architecture and to facilitating the breeding of elite varieties with ideal plant architecture. PMID:24170127

  7. Genome-wide analysis of human global and transcription-coupled excision repair of UV damage at single-nucleotide resolution

    PubMed Central

    Hu, Jinchuan; Adar, Sheera; Selby, Christopher P.

    2015-01-01

    We developed a method for genome-wide mapping of DNA excision repair named XR-seq (excision repair sequencing). Human nucleotide excision repair generates two incisions surrounding the site of damage, creating an ∼30-mer. In XR-seq, this fragment is isolated and subjected to high-throughput sequencing. We used XR-seq to produce stranded, nucleotide-resolution maps of repair of two UV-induced DNA damages in human cells: cyclobutane pyrimidine dimers (CPDs) and (6-4) pyrimidine–pyrimidone photoproducts [(6-4)PPs]. In wild-type cells, CPD repair was highly associated with transcription, specifically with the template strand. Experiments in cells defective in either transcription-coupled excision repair or general excision repair isolated the contribution of each pathway to the overall repair pattern and showed that transcription-coupled repair of both photoproducts occurs exclusively on the template strand. XR-seq maps capture transcription-coupled repair at sites of divergent gene promoters and bidirectional enhancer RNA (eRNA) production at enhancers. XR-seq data also uncovered the repair characteristics and novel sequence preferences of CPDs and (6-4)PPs. XR-seq and the resulting repair maps will facilitate studies of the effects of genomic location, chromatin context, transcription, and replication on DNA repair in human cells. PMID:25934506

  8. Genome wide transcription start sites analysis of Xanthomonas campestris pv. campestris B100 with insights into the gum gene cluster directing the biosynthesis of the exopolysaccharide xanthan.

    PubMed

    Alkhateeb, Rabeaa S; Vorhölter, Frank-Jörg; Rückert, Christian; Mentz, Almut; Wibberg, Daniel; Hublik, Gerd; Niehaus, Karsten; Pühler, Alfred

    2016-05-10

    Xanthomonas campestris pv. campestris (Xcc) is the major producer of the exopolysaccharide xanthan, the commercially most important natural polysaccharide of microbial origin. The current work provides deeper insights into the yet uncharacterized transcriptomic features of the xanthan producing strain Xcc-B100. Towards this goal, RNA sequencing of a library based on the selective enrichment of the 5' ends of native transcripts was performed. This approach resulted in the genome wide identification of 3067 transcription start sites (TSSs) that were further classified based on their genomic positions. Among them, 1545 mapped upstream of an actively transcribed CDS and 1363 were classified as novel TSSs representing antisense, internal, and TSSs belonging to previously unidentified genomic features. Analyzing the transcriptional strength of primary and antisense TSSs revealed that in some instances antisense transcription seemed to be initiated at a higher level than its sense counterpart. Mapping the exact positions of TSSs aided in the identification of promoter consensus motifs, ribosomal binding sites, and enhanced the genome annotation of 159 in silico predicted translational start (TLS) sites. The global view on length distribution of the 5' untranslated regions (5'-UTRs) deduced from the data pointed to the occurrence of leaderless transcripts and transcripts with unusually long 5'-UTRs, in addition to identifying seven putative riboswitch elements for Xcc-B100. Concerning the biosynthesis of xanthan, we focused on the transcriptional organization of the gum gene cluster. Under the conditions tested, we present evidence for a complex transcription pattern of the gum genes with multiple TSSs and an obvious considerable role of antisense transcription. The gene gumB, encoding an outer membrane xanthan exporter, is presented here as an example for genes that possessed a strong antisense TSS.

  9. Genome-wide inference of natural selection on human transcription factor binding sites.

    PubMed

    Arbiza, Leonardo; Gronau, Ilan; Aksoy, Bulent A; Hubisz, Melissa J; Gulko, Brad; Keinan, Alon; Siepel, Adam

    2013-07-01

    For decades, it has been hypothesized that gene regulation has had a central role in human evolution, yet much remains unknown about the genome-wide impact of regulatory mutations. Here we use whole-genome sequences and genome-wide chromatin immunoprecipitation and sequencing data to demonstrate that natural selection has profoundly influenced human transcription factor binding sites since the divergence of humans from chimpanzees 4-6 million years ago. Our analysis uses a new probabilistic method, called INSIGHT, for measuring the influence of selection on collections of short, interspersed noncoding elements. We find that, on average, transcription factor binding sites have experienced somewhat weaker selection than protein-coding genes. However, the binding sites of several transcription factors show clear evidence of adaptation. Several measures of selection are strongly correlated with predicted binding affinity. Overall, regulatory elements seem to contribute substantially to both adaptive substitutions and deleterious polymorphisms with key implications for human evolution and disease.

  10. Genome-wide analysis of MEF2 transcriptional program reveals synaptic target genes and neuronal activity-dependent polyadenylation site selection

    PubMed Central

    Flavell, Steven W.; Kim, Tae-Kyung; Gray, Jesse M.; Harmin, David A.; Hemberg, Martin; Hong, Elizabeth J.; Markenscoff-Papadimitriou, Eirene; Bear, Daniel M.; Greenberg, Michael E.

    2009-01-01

    SUMMARY Although many transcription factors are known to control important aspects of neural development, the genome-wide programs that are directly regulated by these factors are not known. We have characterized the genetic program that is activated by MEF2, a key regulator of activity-dependent synapse development. These MEF2 target genes have diverse functions at synapses, revealing a broad role for MEF2 in synapse development. Several of the MEF2 targets are mutated in human neurological disorders including epilepsy and autism-spectrum disorders, suggesting that these disorders may be caused by disruption of an activity-dependent gene program that controls synapse development. Our analyses also reveal that neuronal activity promotes alternative polyadenylation site usage at many of the MEF2 target genes, leading to the production of truncated mRNAs that may have different functions than their full-length counterparts. Taken together, these analyses suggest that the ubiquitously expressed transcription factor MEF2 regulates an intricate transcriptional program in neurons that controls synapse development. PMID:19109909

  11. Genome-Wide Chromosomal Targets of Oncogenic Transcription Factors

    DTIC Science & Technology

    2005-04-01

    cancer. Cancer involves, at least in part, aberrant programs of gene expression often mediated by oncogenic transcription factors activating downstream...networks that underlie complex gene expression programs that are activated in cancer. Indeed, transcription factors have been proposed as targets of...some of the limitations of ChIP-chip analysis and can be applied to transcription factors important in breast cancer such as c-myc and ER ( estrogen

  12. Nanobody®-based chromatin immunoprecipitation/micro-array analysis for genome-wide identification of transcription factor DNA binding sites

    PubMed Central

    Nguyen-Duc, Trong; Peeters, Eveline; Muyldermans, Serge; Charlier, Daniel; Hassanzadeh-Ghassabeh, Gholamreza

    2013-01-01

    Nanobodies® are single-domain antibody fragments derived from camelid heavy-chain antibodies. Because of their small size, straightforward production in Escherichia coli, easy tailoring, high affinity, specificity, stability and solubility, nanobodies® have been exploited in various biotechnological applications. A major challenge in the post-genomics and post-proteomics era is the identification of regulatory networks involving nucleic acid–protein and protein–protein interactions. Here, we apply a nanobody® in chromatin immunoprecipitation followed by DNA microarray hybridization (ChIP-chip) for genome-wide identification of DNA–protein interactions. The Lrp-like regulator Ss-LrpB, arguably one of the best-studied specific transcription factors of the hyperthermophilic archaeon Sulfolobus solfataricus, was chosen for this proof-of-principle nanobody®-assisted ChIP. Three distinct Ss-LrpB-specific nanobodies®, each interacting with a different epitope, were generated for ChIP. Genome-wide ChIP-chip with one of these nanobodies® identified the well-established Ss-LrpB binding sites and revealed several unknown target sequences. Furthermore, these ChIP-chip profiles revealed auxiliary operator sites in the open reading frame of Ss-lrpB. Our work introduces nanobodies® as a novel class of affinity reagents for ChIP. Taking into account the unique characteristics of nanobodies®, in particular, their short generation time, nanobody®-based ChIP is expected to further streamline ChIP-chip and ChIP-Seq experiments, especially in organisms with no (or limited) possibility of genetic manipulation. PMID:23275538

  13. Hematopoietic Transcriptional Mechanisms: From Locus-Specific to Genome-Wide Vantage Points

    PubMed Central

    DeVilbiss, Andrew W.; Sanalkumar, Rajendran; Johnson, Kirby D.; Keles, Sunduz; Bresnick, Emery H.

    2014-01-01

    Hematopoiesis is an exquisitely regulated process in which stem cells in the developing embryo and the adult generate progenitor cells that give rise to all blood lineages. Master regulatory transcription factors control hematopoiesis by integrating signals from the microenvironment and dynamically establishing and maintaining genetic networks. One of the most rudimentary aspects of cell type-specific transcription factor function, how they occupy a highly restricted cohort of cis-elements in chromatin remains poorly understood. Transformative technological advances involving the coupling of next-generation DNA sequencing technology with the chromatin immunoprecipitation assay (ChIP-seq) have enabled genome-wide mapping of factor occupancy patterns. However, formidable problems remain, notably ChIP-seq analysis yields hundreds to thousands of chromatin sites occupied by a given transcription factor, and only a fraction of the sites appear to be endowed with critical, non-redundant function. It has become en vogue to map transcription factor occupancy patterns genome-wide, while utilizing powerful statistical tools to establish correlations to inform biology and mechanisms. With the advent of revolutionary genome editing technologies, one can now reach beyond correlations to conduct definitive hypothesis testing. This review will focus on key discoveries that have emerged during the path from single loci to genome-wide analyses, specifically in the context of hematopoietic transcriptional mechanisms. PMID:24816274

  14. Genome-wide identification of MAPK, MAPKK, and MAPKKK gene families and transcriptional profiling analysis during development and stress response in cucumber.

    PubMed

    Wang, Jie; Pan, Changtian; Wang, Yan; Ye, Lei; Wu, Jian; Chen, Lifei; Zou, Tao; Lu, Gang

    2015-05-15

    The mitogen-activated protein kinase (MAPK) cascade consists of three types of reversibly phosphorylated kinases, namely, MAPK, MAPK kinase (MAPKK/MEK), and MAPK kinase kinase (MAPKKK/MEKK), playing important roles in plant growth, development, and defense response. The MAPK cascade genes have been investigated in detail in model plants, including Arabidopsis, rice, and tomato, but poorly characterized in cucumber (Cucumis sativus L.), a major popular vegetable in Cucurbitaceae crops, which is highly susceptible to environmental stress and pathogen attack. A genome-wide analysis revealed the presence of at least 14 MAPKs, 6 MAPKKs, and 59 MAPKKKs in the cucumber genome. Phylogenetic analyses classified all the CsMAPK and CsMAPKK genes into four groups, whereas the CsMAPKKK genes were grouped into the MEKK, RAF, and ZIK subfamilies. The expansion of these three gene families was mainly contributed by segmental duplication events. Furthermore, the ratios of non-synonymous substitution rates (Ka) and synonymous substitution rates (Ks) implied that the duplicated gene pairs had experienced strong purifying selection. Real-time PCR analysis demonstrated that some MAPK, MAPKK and MAPKKK genes are preferentially expressed in specific organs or tissues. Moreover, the expression levels of most of these genes significantly changed under heat, cold, drought, and Pseudoperonospora cubensis treatments. Exposure to abscisic acid and jasmonic acid markedly affected the expression levels of these genes, thereby implying that they may play important roles in the plant hormone network. A comprehensive genome-wide analysis of gene structure, chromosomal distribution, and evolutionary relationship of MAPK cascade genes in cucumber are present here. Further expression analysis revealed that these genes were involved in important signaling pathways for biotic and abiotic stress responses in cucumber, as well as the response to plant hormones. Our first systematic description of the MAPK

  15. Genome-wide transcription analyses in rice using tiling microarrays.

    PubMed

    Li, Lei; Wang, Xiangfeng; Stolc, Viktor; Li, Xueyong; Zhang, Dongfen; Su, Ning; Tongprasit, Waraporn; Li, Songgang; Cheng, Zhukuan; Wang, Jun; Deng, Xing Wang

    2006-01-01

    Sequencing and computational annotation revealed several features, including high gene numbers, unusual composition of the predicted genes and a large number of genes lacking homology to known genes, that distinguish the rice (Oryza sativa) genome from that of other fully sequenced model species. We report here a full-genome transcription analysis of the indica rice subspecies using high-density oligonucleotide tiling microarrays. Our results provided expression data support for the existence of 35,970 (81.9%) annotated gene models and identified 5,464 unique transcribed intergenic regions that share similar compositional properties with the annotated exons and have significant homology to other plant proteins. Elucidating and mapping of all transcribed regions revealed an association between global transcription and cytological chromosome features, and an overall similarity of transcriptional activity between duplicated segments of the genome. Collectively, our results provide the first whole-genome transcription map useful for further understanding the rice genome.

  16. Genome-wide transcript analysis of early maize leaf development reveals gene cohorts associated with the differentiation of C4 Kranz anatomy.

    PubMed

    Wang, Peng; Kelly, Steven; Fouracre, Jim P; Langdale, Jane A

    2013-08-01

    Photosynthesis underpins the viability of most ecosystems, with C4 plants that exhibit 'Kranz' anatomy being the most efficient primary producers. Kranz anatomy is characterized by closely spaced veins that are encircled by two morphologically distinct photosynthetic cell types. Although Kranz anatomy evolved multiple times, the underlying genetic mechanisms remain largely elusive, with only the maize scarecrow gene so far implicated in Kranz patterning. To provide a broader insight into the regulation of Kranz differentiation, we performed a genome-wide comparative analysis of developmental trajectories in Kranz (foliar leaf blade) and non-Kranz (husk leaf sheath) leaves of the C4 plant maize. Using profile classification of gene expression in early leaf primordia, we identified cohorts of genes associated with procambium initiation and vascular patterning. In addition, we used supervised classification criteria inferred from anatomical and developmental analyses of five developmental stages to identify candidate regulators of cell-type specification. Our analysis supports the suggestion that Kranz anatomy is patterned, at least in part, by a SCARECROW/SHORTROOT regulatory network, and suggests likely components of that network. Furthermore, the data imply a role for additional pathways in the development of Kranz leaves. © 2013 The Authors The Plant Journal © 2013 John Wiley & Sons Ltd.

  17. Genome-wide transcriptional profiling reveals molecular signatures of secondary xylem differentiation in Populus tomentosa.

    PubMed

    Yang, X H; Li, X G; Li, B L; Zhang, D Q

    2014-11-11

    Wood formation occurs via cell division, primary cell wall and secondary wall formation, and programmed cell death in the vascular cambium. Transcriptional profiling of secondary xylem differentiation is essential for understanding the molecular mechanisms underlying wood formation. Differential gene expression in secondary xylem differentiation of Populus has been previously investigated using cDNA microarray analysis. However, little is known about the molecular mechanisms from a genome-wide perspective. In this study, the Affymetrix poplar genome chips containing 61,413 probes were used to investigate the changes in the transcriptome during secondary xylem differentiation in Chinese white poplar (Populus tomentosa). Two xylem tissues (newly formed and lignified) were sampled for genome-wide transcriptional profiling. In total, 6843 genes (~11%) were identified with differential expression in the two xylem tissues. Many genes involved in cell division, primary wall modification, and cellulose synthesis were preferentially expressed in the newly formed xylem. In contrast, many genes, including 4-coumarate:cinnamate-4-hydroxylase (C4H), 4-coumarate:CoA ligase (4CL), cinnamyl alcohol dehydrogenase (CAD), and caffeoyl CoA 3-O-methyltransferase (CCoAOMT), associated with lignin biosynthesis were more transcribed in the lignified xylem. The two xylem tissues also showed differential expression of genes related to various hormones; thus, the secondary xylem differentiation could be regulated by hormone signaling. Furthermore, many transcription factor genes were preferentially expressed in the lignified xylem, suggesting that wood lignification involves extensive transcription regulation. The genome-wide transcriptional profiling of secondary xylem differentiation could provide additional insights into the molecular basis of wood formation in poplar species.

  18. A novel statistic for genome-wide interaction analysis.

    PubMed

    Wu, Xuesen; Dong, Hua; Luo, Li; Zhu, Yun; Peng, Gang; Reveille, John D; Xiong, Momiao

    2010-09-23

    Although great progress in genome-wide association studies (GWAS) has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interactional analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked). The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDR<0.001 and 0.001genome-wide interaction analysis is a valuable tool for finding remaining missing heritability unexplained by the current GWAS, and the developed novel statistic is able to search significant interaction between SNPs across the genome. Real data analysis showed that the results of genome-wide interaction analysis can be replicated in two independent studies.

  19. Genome-wide analysis of rice cis-natural antisense transcription under cadmium exposure using strand-specific RNA-Seq.

    PubMed

    Oono, Youko; Yazawa, Takayuki; Kanamori, Hiroyuki; Sasaki, Harumi; Mori, Satomi; Matsumoto, Takashi

    2017-10-06

    The elucidation of novel transcripts and their expression in response to various stress conditions is necessary to understand the transcriptional network of plants as an adaptation to biotic and abiotic stresses. We performed strand-specific RNA-Seq (ssRNA-Seq) on rice exposed to cadmium (Cd) for 24 h and investigated the expression of cis-natural antisense transcripts (cis-NATs), a class of endogenous coding or non-protein-coding RNAs with sequence complementarity to the opposite strands of RAP transcripts. Many RAP transcripts possessed cis-NATs and these cis-NATs were responsive to some extent. Cis-NATs were upregulated from 26, 266 and 409 RAP gene loci, while 2054, 2501 and 2825 RAP transcripts were upregulated from 38,123 RAP loci under high Cd exposure in roots at 1, 12 and 24 h, respectively. In addition, most of the upregulated cis-NATs showed little upregulation under ABA or cold treatment. A number of cis-NATs were upregulated from less than 35 RAP gene loci in different tissue and time-point combinations under low Cd exposure, suggesting that cis-NATs respond to environmental stress. Furthermore, 409 RAP transcripts with upregulated cis-NATs were classified into three groups based on the expression of the RAP transcripts from the opposite DNA strand, including 138 upregulated, 128 invariable, and 143 downregulated transcripts, although the responses of cis-NATs and RAP transcripts were not always correlated. We have shown that the cis-NATs identified by ssRNA-Seq analysis are novel genes and that some of them are stress-specific and show different responses depending on the degree of stress and tissue. These results improve our understanding of the complete molecular mechanism of plant adaptation to Cd exposure.

  20. Genome-wide transcriptome analysis of human epidermal melanocytes

    PubMed Central

    Haltaufderhyde, Kirk D.; Oancea, Elena

    2015-01-01

    Because human epidermal melanocytes (HEMs) provide critical protection against skin cancer, sunburn, and photoaging, a genome-wide perspective of gene expression in these cells is vital to understanding human skin physiology. In this study we performed high throughput sequencing of HEMs to obtain a complete data set of transcript sizes, abundances, and splicing. As expected, we found that melanocyte specific genes that function in pigmentation were among the highest expressed genes. We analyzed receptor, ion channel and transcription factor gene families to get a better understanding of the cell signalling pathways used by melanocytes. We also performed a comparative transcriptomic analysis of lightly versus darkly pigmented HEMs and found 16 genes differentially expressed in the two pigmentation phenotypes; of those, only one putative melanosomal transporter (SLC45A2) has known function in pigmentation. In addition, we found 166 genes with splice isoforms expressed exclusively in one pigmentation phenotype, 17 of which are genes involved in signal transduction. Our melanocyte transcriptome study provides a comprehensive view and may help identify novel pigmentation genes and potential pharmacological targets. PMID:25451175

  1. Genome wide analysis of protein production load in Trichoderma reesei.

    PubMed

    Pakula, Tiina M; Nygren, Heli; Barth, Dorothee; Heinonen, Markus; Castillo, Sandra; Penttilä, Merja; Arvas, Mikko

    2016-01-01

    The filamentous fungus Trichoderma reesei (teleomorph Hypocrea jecorina) is a widely used industrial host organism for protein production. In industrial cultivations, it can produce over 100 g/l of extracellular protein, mostly constituting of cellulases and hemicellulases. In order to improve protein production of T. reesei the transcriptional regulation of cellulases and secretory pathway factors have been extensively studied. However, the metabolism of T. reesei under protein production conditions has not received much attention. To understand the physiology and metabolism of T. reesei under protein production conditions we carried out a well-controlled bioreactor experiment with extensive analysis. We used minimal media to make the data amenable for modelling and three strain pairs to cover different protein production levels. With RNA-sequencing transcriptomics we detected the concentration of the carbon source as the most important determinant of the transcriptome. As the major transcriptional response concomitant to protein production we detected the induction of selected genes that were putatively regulated by xyr1 and were related to protein transport, amino acid metabolism and transcriptional regulation. We found novel metabolic responses such as production of glycerol and a cellotriose-like compound. We then used this cultivation data for flux balance analysis of T. reesei metabolism and demonstrate for the first time the use of genome wide stoichiometric metabolic modelling for T. reesei. We show that our model can predict protein production rate and provides novel insight into the metabolism of protein production. We also provide this unprecedented cultivation and transcriptomics data set for future modelling efforts. The use of stoichiometric modelling can open a novel path for the improvement of protein production in T. reesei. Based on this we propose sulphur assimilation as a major limiting factor of protein production. As an organism with

  2. Genome-wide analysis of condensin binding in Caenorhabditis elegans

    PubMed Central

    2013-01-01

    Background Condensins are multi-subunit protein complexes that are essential for chromosome condensation during mitosis and meiosis, and play key roles in transcription regulation during interphase. Metazoans contain two condensins, I and II, which perform different functions and localize to different chromosomal regions. Caenorhabditis elegans contains a third condensin, IDC, that is targeted to and represses transcription of the X chromosome for dosage compensation. Results To understand condensin binding and function, we performed ChIP-seq analysis of C. elegans condensins in mixed developmental stage embryos, which contain predominantly interphase nuclei. Condensins bind to a subset of active promoters, tRNA genes and putative enhancers. Expression analysis in kle-2-mutant larvae suggests that the primary effect of condensin II on transcription is repression. A DNA sequence motif, GCGC, is enriched at condensin II binding sites. A sequence extension of this core motif, AGGG, creates the condensin IDC motif. In addition to differences in recruitment that result in X-enrichment of condensin IDC and condensin II binding to all chromosomes, we provide evidence for a shared recruitment mechanism, as condensin IDC recruiter SDC-2 also recruits condensin II to the condensin IDC recruitment sites on the X. In addition, we found that condensin sites overlap extensively with the cohesin loader SCC-2, and that SDC-2 also recruits SCC-2 to the condensin IDC recruitment sites. Conclusions Our results provide the first genome-wide view of metazoan condensin II binding in interphase, define putative recruitment motifs, and illustrate shared loading mechanisms for condensin IDC and condensin II. PMID:24125077

  3. Genome-wide conserved consensus transcription factor binding motifs are hyper-methylated

    PubMed Central

    2010-01-01

    Background DNA methylation can regulate gene expression by modulating the interaction between DNA and proteins or protein complexes. Conserved consensus motifs exist across the human genome ("predicted transcription factor binding sites": "predicted TFBS") but the large majority of these are proven by chromatin immunoprecipitation and high throughput sequencing (ChIP-seq) not to be biological transcription factor binding sites ("empirical TFBS"). We hypothesize that DNA methylation at conserved consensus motifs prevents promiscuous or disorderly transcription factor binding. Results Using genome-wide methylation maps of the human heart and sperm, we found that all conserved consensus motifs as well as the subset of those that reside outside CpG islands have an aggregate profile of hyper-methylation. In contrast, empirical TFBS with conserved consensus motifs have a profile of hypo-methylation. 40% of empirical TFBS with conserved consensus motifs resided in CpG islands whereas only 7% of all conserved consensus motifs were in CpG islands. Finally we further identified a minority subset of TF whose profiles are either hypo-methylated or neutral at their respective conserved consensus motifs implicating that these TF may be responsible for establishing or maintaining an un-methylated DNA state, or whose binding is not regulated by DNA methylation. Conclusions Our analysis supports the hypothesis that at least for a subset of TF, empirical binding to conserved consensus motifs genome-wide may be controlled by DNA methylation. PMID:20875111

  4. Genome-wide identification of citrus ATP-citrate lyase genes and their transcript analysis in fruits reveals their possible role in citrate utilization.

    PubMed

    Hu, Xiao-Mei; Shi, Cai-Yun; Liu, Xiao; Jin, Long-Fei; Liu, Yong-Zhong; Peng, Shu-Ang

    2015-02-01

    ATP-citrate lyase (ACL, EC4.1.3.8) catalyzes citrate to oxaloacetate and acetyl-CoA in the cell cytosol, and has important roles in normal plant growth and in the biosynthesis of some secondary metabolites. We identified three ACL genes, CitACLα1, CitACLα2, and CitACLβ1, in the citrus genome database. Both CitACLα1 and CitACLα2 encode putative ACL α subunits with 82.5 % amino acid identity, whereas CitACLβ1 encodes a putative ACL β subunit. Gene structure analysis showed that CitACLα1 and CitACLα2 had 12 exons and 11 introns, and CitACLβ1 had 16 exons and 15 introns. CitACLα1 and CitACLβ1 were predominantly expressed in flower, and CitACLα2 was predominantly expressed in stem and fibrous roots. As fruits ripen, the transcript levels of CitACLα1, CitACLβ1, and/or CitACLα2 in cultivars 'Niuher' and 'Owari' increased, accompanied by significant decreases in citrate content, while their transcript levels decreased significantly in 'Egan No. 1' and 'Iyokan', although citrate content also decreased. In 'HB pummelo', in which acid content increased as fruit ripened, and in acid-free pummelo, transcript levels of CitACLα2, CitACLβ1, and/or CitACLα1 increased. Moreover, mild drought stress and ABA treatment significantly increased citrate contents in fruits. Transcript levels of the three genes were significantly reduced by mild drought stress, and the transcript level of only CitACLβ1 was significantly reduced by ABA treatment. Taken together, these data indicate that the effects of ACL on citrate use during fruit ripening depends on the cultivar, and the reduction in ACL gene expression may be attributed to citrate increases under mild drought stress or ABA treatment.

  5. Genome-wide analysis of bHLH transcription factor and involvement in the infection by yellow leaf curl virus in tomato (Solanum lycopersicum).

    PubMed

    Wang, Jinyan; Hu, Zhongze; Zhao, Tongmin; Yang, Yuwen; Chen, Tianzi; Yang, Mali; Yu, Wengui; Zhang, Baolong

    2015-02-05

    The basic helix-loop-helix (bHLH) proteins are a superfamily of transcription factors that can bind to specific DNA target sites. They have been well characterized in model plants such as Arabidopsis and rice and have been shown to be important regulatory components in many different biological processes. However, no systemic analysis of the bHLH transcription factor family has yet been reported in tomatoes. Tomato yellow leaf curl virus (TYLCV) threatens tomato production worldwide by causing leaf yellowing, leaf curling, plant stunting and flower abscission. A total of 152 bHLH transcription factors were identified from the entire tomato genome. Phylogenetic analysis of bHLH domain sequences from Arabidopsis and tomato facilitated classification of these genes into 26 subfamilies. The evolutionary and possible functional relationships revealed during this analysis are supported by other criteria, including the chromosomal distribution of these genes, the conservation of motifs and exon/intron structural patterns, and the predicted DNA binding activities within subfamilies. Distribution mapping results showed bHLH genes were localized on the 12 tomato chromosomes. Among the 152 bHLH genes from the tomato genome, 96 bHLH genes were detected in the TYLCV-susceptible and resistant tomato breeding line before (0 dpi) and after TYLCV (357 dpi) infection. As anticipated, gene ontology (GO) analysis indicated that most bHLH genes are related to the regulation of macromolecule metabolic processes and gene expression. Only four bHLH genes were differentially expressed between 0 and 357 dpi. Virus-induced gene silencing (VIGS) of one bHLH genes SlybHLH131 in resistant lines can lead to the cell death. In the present study, 152 bHLH transcription factor genes were identified. One of which bHLH genes, SlybHLH131, was found to be involved in the TYLCV infection through qRT-PCR expression analysis and VIGS validation. The isolation and identification of these bHLH transcription

  6. Genome-Wide Transcriptional Profiling and Metabolic Analysis Uncover Multiple Molecular Responses of the Grass Species Lolium perenne Under Low-Intensity Xenobiotic Stress

    PubMed Central

    Serra, Anne-Antonella; Couée, Ivan; Heijnen, David; Michon-Coudouel, Sophie; Sulmon, Cécile; Gouesbet, Gwenola

    2015-01-01

    Lolium perenne, which is a major component of pastures, lawns, and grass strips, can be exposed to xenobiotic stresses due to diffuse and residual contaminations of soil. L. perenne was recently shown to undergo metabolic adjustments in response to sub-toxic levels of xenobiotics. To gain insight in such chemical stress responses, a de novo transcriptome analysis was carried out on leaves from plants subjected at the root level to low levels of xenobiotics, glyphosate, tebuconazole, and a combination of the two, leading to no adverse physiological effect. Chemical treatments influenced significantly the relative proportions of functional categories and of transcripts related to carbohydrate processes, to signaling, to protein-kinase cascades, such as Serine/Threonine-protein kinases, to transcriptional regulations, to responses to abiotic or biotic stimuli and to responses to phytohormones. Transcriptomics-based expressions of genes encoding different types of SNF1 (sucrose non-fermenting 1)-related kinases involved in sugar and stress signaling or encoding key metabolic enzymes were in line with specific qRT-PCR analysis or with the important metabolic and regulatory changes revealed by metabolomic analysis. The effects of pesticide treatments on metabolites and gene expression strongly suggest that pesticides at low levels, as single molecule or as mixture, affect cell signaling and functioning even in the absence of major physiological impact. This global analysis of L. perenne therefore highlighted the interactions between molecular regulation of responses to xenobiotics, and also carbohydrate dynamics, energy dysfunction, phytohormones and calcium signaling. PMID:26734031

  7. Genome-Wide Transcriptional Profiling and Metabolic Analysis Uncover Multiple Molecular Responses of the Grass Species Lolium perenne Under Low-Intensity Xenobiotic Stress.

    PubMed

    Serra, Anne-Antonella; Couée, Ivan; Heijnen, David; Michon-Coudouel, Sophie; Sulmon, Cécile; Gouesbet, Gwenola

    2015-01-01

    Lolium perenne, which is a major component of pastures, lawns, and grass strips, can be exposed to xenobiotic stresses due to diffuse and residual contaminations of soil. L. perenne was recently shown to undergo metabolic adjustments in response to sub-toxic levels of xenobiotics. To gain insight in such chemical stress responses, a de novo transcriptome analysis was carried out on leaves from plants subjected at the root level to low levels of xenobiotics, glyphosate, tebuconazole, and a combination of the two, leading to no adverse physiological effect. Chemical treatments influenced significantly the relative proportions of functional categories and of transcripts related to carbohydrate processes, to signaling, to protein-kinase cascades, such as Serine/Threonine-protein kinases, to transcriptional regulations, to responses to abiotic or biotic stimuli and to responses to phytohormones. Transcriptomics-based expressions of genes encoding different types of SNF1 (sucrose non-fermenting 1)-related kinases involved in sugar and stress signaling or encoding key metabolic enzymes were in line with specific qRT-PCR analysis or with the important metabolic and regulatory changes revealed by metabolomic analysis. The effects of pesticide treatments on metabolites and gene expression strongly suggest that pesticides at low levels, as single molecule or as mixture, affect cell signaling and functioning even in the absence of major physiological impact. This global analysis of L. perenne therefore highlighted the interactions between molecular regulation of responses to xenobiotics, and also carbohydrate dynamics, energy dysfunction, phytohormones and calcium signaling.

  8. Genome-wide analysis of AP2/ERF transcription factors in carrot (Daucus carota L.) reveals evolution and expression profiles under abiotic stress.

    PubMed

    Li, Meng-Yao; Xu, Zhi-Sheng; Huang, Ying; Tian, Chang; Wang, Feng; Xiong, Ai-Sheng

    2015-12-01

    AP2/ERF is a large transcription factor family that regulates plant physiological processes, such as plant development and stress response. Carrot (Daucus carota L.) is an important economical crop with a genome size of 480 Mb; the draft genome sequencing of this crop has been completed by our group. However, little is known about the AP2/ERF factors in carrot. In this study, a total of 267 putative AP2/ERF factors were identified from the whole-genome sequence of carrot. These AP2/ERF proteins were phylogenetically clustered into five subfamilies based on their similarity to the amino acid sequences from Arabidopsis. The distribution and comparative genome analysis of the AP2/ERF factors among plants showed the AP2/ERF factors had expansion during the evolutionary process, and the AP2 domain was highly conserved during evolution. The number of AP2/ERF factors in land plants expanded during their evolution. A total of 60 orthologous and 145 coorthologous AP2/ERF gene pairs between carrot and Arabidopsis were identified, and the interaction network of orthologous genes was constructed. The expression patterns of eight AP2/ERF family genes from each subfamily (DREB, ERF, AP2, and RAV) were related to abiotic stresses. Yeast one-hybrid and β-galactosidase activity assays confirmed the DRE and GCC box-binding activities of DREB subfamily genes. This study is the first to identify and characterize the AP2/ERF transcription factors in carrot using whole-genome analysis, and the findings may serve as references for future functional research on the transcription factors in carrot.

  9. Genome-wide analysis of the distribution of AP2/ERF transcription factors reveals duplication and CBFs genes elucidate their potential function in Brassica oleracea

    PubMed Central

    2014-01-01

    Background Cabbage (Brassica oleracea) is one of the most important leaf vegetables grown worldwide. The entire cabbage genome sequence and more than fifty thousand proteins have been obtained to date. However a high degree of sequence similarity and conserved genome structure remain between cabbage and Arabidopsis; therefore, Arabidopsis is a viable reference species for comparative genomics studies. Transcription factors (TFs) are important regulators involved in plant development and physiological processes and the AP2/ERF protein family contains transcriptional factors that play a crucial role in plant growth and development, as well as response to biotic and abiotic stress conditions in plants. However, no detailed expression profile of AP2/ERF-like genes is available for B. oleracea. Results In the present study, 226 AP2/ERF TFs were identified from B. oleracea based on the available genome sequence. Based on sequence similarity, the AP2/ERF superfamily was classified into five groups (DREB, ERF, AP2, RAV and Soloist) and 15 subgroups. The identification, classification, phylogenetic construction, conserved motifs, chromosome distribution, functional annotation, expression patterns and interaction network were then predicted and analyzed. AP2/ERF transcription factor expression levels exhibited differences in response to varying abiotic stresses based on expressed sequence tags (ESTs). BoCBF1a, 1b, 2, 3 and 4, which were highly conserved in Arabidopsis and B. rapa CBF/DREB genes families were well characterized. Expression analysis enabled elucidation of the molecular and genetic level expression patterns of cold tolerance (CT) and susceptible lines (CS) of cabbage and indicated that all BoCBF genes responded to abiotic stresses. Conclusions Comprehensive analysis of the physiological functions and biological roles of AP2/ERF superfamily genes and BoCBF family genes in B. oleracea is required to fully elucidate AP2/ERF, which will provide rich resources and

  10. Genome-wide transcriptional reorganization associated with senescence-to-immortality switch during human hepatocellular carcinogenesis.

    PubMed

    Yildiz, Gokhan; Arslan-Ergul, Ayca; Bagislar, Sevgi; Konu, Ozlen; Yuzugullu, Haluk; Gursoy-Yuzugullu, Ozge; Ozturk, Nuri; Ozen, Cigdem; Ozdag, Hilal; Erdal, Esra; Karademir, Sedat; Sagol, Ozgul; Mizrak, Dilsa; Bozkaya, Hakan; Ilk, Hakki Gokhan; Ilk, Ozlem; Bilen, Biter; Cetin-Atalay, Rengul; Akar, Nejat; Ozturk, Mehmet

    2013-01-01

    Senescence is a permanent proliferation arrest in response to cell stress such as DNA damage. It contributes strongly to tissue aging and serves as a major barrier against tumor development. Most tumor cells are believed to bypass the senescence barrier (become "immortal") by inactivating growth control genes such as TP53 and CDKN2A. They also reactivate telomerase reverse transcriptase. Senescence-to-immortality transition is accompanied by major phenotypic and biochemical changes mediated by genome-wide transcriptional modifications. This appears to happen during hepatocellular carcinoma (HCC) development in patients with liver cirrhosis, however, the accompanying transcriptional changes are virtually unknown. We investigated genome-wide transcriptional changes related to the senescence-to-immortality switch during hepatocellular carcinogenesis. Initially, we performed transcriptome analysis of senescent and immortal clones of Huh7 HCC cell line, and identified genes with significant differential expression to establish a senescence-related gene list. Through the analysis of senescence-related gene expression in different liver tissues we showed that cirrhosis and HCC display expression patterns compatible with senescent and immortal phenotypes, respectively; dysplasia being a transitional state. Gene set enrichment analysis revealed that cirrhosis/senescence-associated genes were preferentially expressed in non-tumor tissues, less malignant tumors, and differentiated or senescent cells. In contrast, HCC/immortality genes were up-regulated in tumor tissues, or more malignant tumors and progenitor cells. In HCC tumors and immortal cells genes involved in DNA repair, cell cycle, telomere extension and branched chain amino acid metabolism were up-regulated, whereas genes involved in cell signaling, as well as in drug, lipid, retinoid and glycolytic metabolism were down-regulated. Based on these distinctive gene expression features we developed a 15-gene

  11. Genome-Wide Transcriptional Reorganization Associated with Senescence-to-Immortality Switch during Human Hepatocellular Carcinogenesis

    PubMed Central

    Konu, Ozlen; Yuzugullu, Haluk; Gursoy-Yuzugullu, Ozge; Ozturk, Nuri; Ozen, Cigdem; Ozdag, Hilal; Erdal, Esra; Karademir, Sedat; Sagol, Ozgul; Mizrak, Dilsa; Bozkaya, Hakan; Ilk, Hakki Gokhan; Ilk, Ozlem; Bilen, Biter; Cetin-Atalay, Rengul; Akar, Nejat; Ozturk, Mehmet

    2013-01-01

    Senescence is a permanent proliferation arrest in response to cell stress such as DNA damage. It contributes strongly to tissue aging and serves as a major barrier against tumor development. Most tumor cells are believed to bypass the senescence barrier (become “immortal”) by inactivating growth control genes such as TP53 and CDKN2A. They also reactivate telomerase reverse transcriptase. Senescence-to-immortality transition is accompanied by major phenotypic and biochemical changes mediated by genome-wide transcriptional modifications. This appears to happen during hepatocellular carcinoma (HCC) development in patients with liver cirrhosis, however, the accompanying transcriptional changes are virtually unknown. We investigated genome-wide transcriptional changes related to the senescence-to-immortality switch during hepatocellular carcinogenesis. Initially, we performed transcriptome analysis of senescent and immortal clones of Huh7 HCC cell line, and identified genes with significant differential expression to establish a senescence-related gene list. Through the analysis of senescence-related gene expression in different liver tissues we showed that cirrhosis and HCC display expression patterns compatible with senescent and immortal phenotypes, respectively; dysplasia being a transitional state. Gene set enrichment analysis revealed that cirrhosis/senescence-associated genes were preferentially expressed in non-tumor tissues, less malignant tumors, and differentiated or senescent cells. In contrast, HCC/immortality genes were up-regulated in tumor tissues, or more malignant tumors and progenitor cells. In HCC tumors and immortal cells genes involved in DNA repair, cell cycle, telomere extension and branched chain amino acid metabolism were up-regulated, whereas genes involved in cell signaling, as well as in drug, lipid, retinoid and glycolytic metabolism were down-regulated. Based on these distinctive gene expression features we developed a 15-gene

  12. Genome-wide analysis of EgEVE_1, a transcriptionally active endogenous viral element associated to small RNAs in Eucalyptus genomes.

    PubMed

    Marcon, Helena Sanches; Costa-Silva, Juliana; Lorenzetti, Alan Péricles Rodrigues; Marino, Celso Luis; Domingues, Douglas Silva

    2017-01-01

    Endogenous viral elements (EVEs) are the result of heritable horizontal gene transfer from viruses to hosts. In the last years, several EVE integration events were reported in plants by the exponential availability of sequenced genomes. Eucalyptus grandis is a forest tree species with a sequenced genome that is poorly studied in terms of evolution and mobile genetic elements composition. Here we report the characterization of E. grandis endogenous viral element 1 (EgEVE_1), a transcriptionally active EVE with a size of 5,664 bp. Phylogenetic analysis and genomic distribution demonstrated that EgEVE_1 is a newly described member of the Caulimoviridae family, distinct from the recently characterized plant Florendoviruses. Genomic distribution of EgEVE_1 and Florendovirus is also distinct. EgEVE_1 qPCR quantification in Eucalyptus urophylla suggests that this genome has more EgEVE_1 copies than E. grandis. EgEVE_1 transcriptional activity was demonstrated by RT-qPCR in five Eucalyptus species and one intrageneric hybrid. We also identified that Eucalyptus EVEs can generate small RNAs (sRNAs),that might be involved in de novo DNA methylation and virus resistance. Our data suggest that EVE families in Eucalyptus have distinct properties, and we provide the first comparative analysis of EVEs in Eucalyptus genomes.

  13. Genome-wide analysis of EgEVE_1, a transcriptionally active endogenous viral element associated to small RNAs in Eucalyptus genomes

    PubMed Central

    Marcon, Helena Sanches; Costa-Silva, Juliana; Lorenzetti, Alan Péricles Rodrigues; Marino, Celso Luis; Domingues, Douglas Silva

    2017-01-01

    Abstract Endogenous viral elements (EVEs) are the result of heritable horizontal gene transfer from viruses to hosts. In the last years, several EVE integration events were reported in plants by the exponential availability of sequenced genomes. Eucalyptus grandis is a forest tree species with a sequenced genome that is poorly studied in terms of evolution and mobile genetic elements composition. Here we report the characterization of E. grandis endogenous viral element 1 (EgEVE_1), a transcriptionally active EVE with a size of 5,664 bp. Phylogenetic analysis and genomic distribution demonstrated that EgEVE_1 is a newly described member of the Caulimoviridae family, distinct from the recently characterized plant Florendoviruses. Genomic distribution of EgEVE_1 and Florendovirus is also distinct. EgEVE_1 qPCR quantification in Eucalyptus urophylla suggests that this genome has more EgEVE_1 copies than E. grandis. EgEVE_1 transcriptional activity was demonstrated by RT-qPCR in five Eucalyptus species and one intrageneric hybrid. We also identified that Eucalyptus EVEs can generate small RNAs (sRNAs),that might be involved in de novo DNA methylation and virus resistance. Our data suggest that EVE families in Eucalyptus have distinct properties, and we provide the first comparative analysis of EVEs in Eucalyptus genomes. PMID:28235127

  14. Genome-wide assessment of differential roles for p300 and CBP in transcription regulation

    PubMed Central

    Ramos, Yolande F. M.; Hestand, Matthew S.; Verlaan, Matty; Krabbendam, Elise; Ariyurek, Yavuz; van Galen, Michiel; van Dam, Hans; van Ommen, Gert-Jan B.; den Dunnen, Johan T.; Zantema, Alt; ′t Hoen, Peter A. C.

    2010-01-01

    Despite high levels of homology, transcription coactivators p300 and CREB binding protein (CBP) are both indispensable during embryogenesis. They are largely known to regulate the same genes. To identify genes preferentially regulated by p300 or CBP, we performed an extensive genome-wide survey using the ChIP-seq on cell-cycle synchronized cells. We found that 57% of the tags were within genes or proximal promoters, with an overall preference for binding to transcription start and end sites. The heterogeneous binding patterns possibly reflect the divergent roles of CBP and p300 in transcriptional regulation. Most of the 16 103 genes were bound by both CBP and p300. However, after stimulation 89 and 1944 genes were preferentially bound by CBP or p300, respectively. Target genes were found to be primarily involved in the regulation of metabolic and developmental processes, and transcription, with CBP showing a stronger preference than p300 for genes active in negative regulation of transcription. Analysis of transcription factor binding sites suggest that CBP and p300 have many partners in common, but AP-1 and Serum Response Factor (SRF) appear to be more prominent in CBP-specific sequences, whereas AP-2 and SP1 are enriched in p300-specific targets. Taken together, our findings further elucidate the distinct roles of coactivators p300 and CBP in transcriptional regulation. PMID:20435671

  15. Genome-wide functional analysis in Candida albicans.

    PubMed

    Motaung, Thabiso E; Ells, Ruan; Pohl, Carolina H; Albertyn, Jacobus; Tsilo, Toi J

    2017-02-08

    Candida albicans is an important etiological agent of superficial and life-threatening infections in individuals with compromised immune systems. To date, we know of several overlapping genetic networks that govern virulence attributes in this fungal pathogen. Classical use of deletion mutants has led to the discovery of numerous virulence factors over the years, and genome-wide functional analysis has propelled gene discovery at an even faster pace. Indeed, a number of recent studies using large-scale genetic screens followed by genome-wide functional analysis has allowed for the unbiased discovery of many new genes involved in C. albicans biology. Here we share our perspectives on the role of these studies in analyzing fundamental aspects of C. albicans virulence properties.

  16. Analysis of Heritability Using Genome-Wide Data.

    PubMed

    Hall, Jacob B; Bush, William S

    2016-10-11

    Most analyses of genome-wide association data consider each variant independently without considering or adjusting for the genetic background present in the rest of the genome. New approaches to genome analysis use representations of genomic sharing to better account for confounding factors like population stratification or to directly approximate heritability through the estimated sharing of individuals in a dataset. These approaches use mixed linear models, which relate genotypic sharing to phenotypic sharing, and rely on the efficient computation of genetic sharing among individuals in a dataset. This unit describes the principles and practical application of mixed models for the analysis of genome-wide association study data. © 2016 by John Wiley & Sons, Inc.

  17. Genome-wide transcription responses to synchrotron microbeam radiotherapy.

    PubMed

    Sprung, Carl N; Yang, Yuqing; Forrester, Helen B; Li, Jason; Zaitseva, Marina; Cann, Leonie; Restall, Tina; Anderson, Robin L; Crosbie, Jeffrey C; Rogers, Peter A W

    2012-10-01

    The majority of cancer patients achieve benefit from radiotherapy. A significant limitation of radiotherapy is its relatively low therapeutic index, defined as the maximum radiation dose that causes acceptable normal tissue damage to the minimum dose required to achieve tumor control. Recently, a new radiotherapy modality using synchrotron-generated X-ray microbeam radiotherapy has been demonstrated in animal models to ablate tumors with concurrent sparing of normal tissue. Very little work has been undertaken into the cellular and molecular mechanisms that differentiate microbeam radiotherapy from broad beam. The purpose of this study was to investigate and compare the whole genome transcriptional response of in vivo microbeam radiotherapy versus broad beam irradiated tumors. We hypothesized that gene expression changes after microbeam radiotherapy are different from those seen after broad beam. We found that in EMT6.5 tumors at 4-48 h postirradiation, microbeam radiotherapy differentially regulates a number of genes, including major histocompatibility complex (MHC) class II antigen gene family members, and other immunity-related genes including Ciita, Ifng, Cxcl1, Cxcl9, Indo and Ubd when compared to broad beam. Our findings demonstrate molecular differences in the tumor response to microbeam versus broad beam irradiation and these differences provide insight into the underlying mechanisms of microbeam radiotherapy and broad beam.

  18. Genome-wide transcriptional responses of Nitrosomonas europaea to zinc.

    PubMed

    Park, Sunhwa; Ely, Roger L

    2008-06-01

    Nitrosomonas europaea, a Gram-negative obligate chemolithoautotroph, participates in global nitrogen cycling by carrying out nitrification and derives energy for growth through oxidation of ammonia. In this work, the physiological, proteomic, and transcriptional responses of N. europaea to zinc stress were studied. The nitrite production rate and ammonia-dependent oxygen uptake rate of the cells exposed to 3.4 microM ZnCl2 decreased about 61 and 69% within 30 min, respectively. Two proteins were notably up regulated in zinc treatment and the mRNA levels of their encoding genes started to increase by 1 h after the addition of zinc. A total of 27 genes were up regulated and 30 genes were down regulated. Up-regulated genes included mercury resistance genes (merTPCAD), inorganic ion transport genes, oxidative stress genes, toxin-antitoxin genes, and two-component signal transduction systems genes. merTPCAD was the highest up-regulated operon (46-fold). Down-regulated genes included the RubisCO operon (cbbO), biosynthesis (mrsA), and amino acid transporter.

  19. Genome Wide Identification and Characterization of Apple bHLH Transcription Factors and Expression Analysis in Response to Drought and Salt Stress

    PubMed Central

    Mao, Ke; Dong, Qinglong; Li, Chao; Liu, Changhai; Ma, Fengwang

    2017-01-01

    The bHLH (basic helix-loop-helix) transcription factor family is the second largest in plants. It occurs in all three eukaryotic kingdoms, and plays important roles in regulating growth and development. However, family members have not previously been studied in apple. Here, we identified 188 MdbHLH proteins in apple “Golden Delicious” (Malus × domestica Borkh.), which could be classified into 18 groups. We also investigated the gene structures and 12 conserved motifs in these MdbHLHs. Coupled with expression analysis and protein interaction network prediction, we identified several genes that might be responsible for abiotic stress responses. This study provides insight and rich resources for subsequent investigations of such proteins in apple. PMID:28443104

  20. Genome-Wide Analysis of the AP2/ERF Transcription Factors Family and the Expression Patterns of DREB Genes in Moso Bamboo (Phyllostachys edulis).

    PubMed

    Wu, Huili; Lv, Hao; Li, Long; Liu, Jun; Mu, Shaohua; Li, Xueping; Gao, Jian

    2015-01-01

    The AP2/ERF transcription factor family, one of the largest families unique to plants, performs a significant role in terms of regulation of growth and development, and responses to biotic and abiotic stresses. Moso bamboo (Phyllostachys edulis) is a fast-growing non-timber forest species with the highest ecological, economic and social values of all bamboos in Asia. The draft genome of moso bamboo and the available genomes of other plants provide great opportunities to research global information on the AP2/ERF family in moso bamboo. In total, 116 AP2/ERF transcription factors were identified in moso bamboo. The phylogeny analyses indicated that the 116 AP2/ERF genes could be divided into three subfamilies: AP2, RAV and ERF; and the ERF subfamily genes were divided into 11 groups. The gene structures, exons/introns and conserved motifs of the PeAP2/ERF genes were analyzed. Analysis of the evolutionary patterns and divergence showed the PeAP2/ERF genes underwent a large-scale event around 15 million years ago (MYA) and the division time of AP2/ERF family genes between rice and moso bamboo was 15-23 MYA. We surveyed the putative promoter regions of the PeDREBs and showed that largely stress-related cis-elements existed in these genes. Further analysis of expression patterns of PeDREBs revealed that the most were strongly induced by drought, low-temperature and/or high salinity stresses in roots and, in contrast, most PeDREB genes had negative functions in leaves under the same respective stresses. In this study there were two main interesting points: there were fewer members of the PeDREB subfamily in moso bamboo than in other plants and there were differences in DREB gene expression profiles between leaves and roots triggered in response to abiotic stress. The information produced from this study may be valuable in overcoming challenges in cultivating moso bamboo.

  1. Genome wide copy number analysis of single cells

    PubMed Central

    Baslan, Timour; Kendall, Jude; Rodgers, Linda; Cox, Hilary; Riggs, Mike; Stepansky, Asya; Troge, Jennifer; Ravi, Kandasamy; Esposito, Diane; Lakshmi, B.; Wigler, Michael; Navin, Nicholas; Hicks, James

    2016-01-01

    Summary Copy number variation (CNV) is increasingly recognized as an important contributor to phenotypic variation in health and disease. Most methods for determining CNV rely on admixtures of cells, where information regarding genetic heterogeneity is lost. Here, we present a protocol that allows for the genome wide copy number analysis of single nuclei isolated from mixed populations of cells. Single nucleus sequencing (SNS), combines flow sorting of single nuclei based on DNA content, whole genome amplification (WGA), followed by next generation sequencing to quantize genomic intervals in a genome wide manner. Multiplexing of single cells is discussed. Additionally, we outline informatic approaches that correct for biases inherent in the WGA procedure and allow for accurate determination of copy number profiles. All together, the protocol takes ~3 days from flow cytometry to sequence-ready DNA libraries. PMID:22555242

  2. Genome-Wide Detection and Analysis of Multifunctional Genes

    PubMed Central

    Pritykin, Yuri; Ghersi, Dario; Singh, Mona

    2015-01-01

    Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655

  3. Genome-wide analysis of the R2R3-MYB transcription factor genes in Chinese cabbage (Brassica rapa ssp. pekinensis) reveals their stress and hormone responsive patterns.

    PubMed

    Wang, Zhen; Tang, Jun; Hu, Rong; Wu, Peng; Hou, Xi-Lin; Song, Xiao-Ming; Xiong, Ai-Sheng

    2015-01-23

    The MYB superfamily is one of the most abundant transcription factor (TF) families in plants. MYB proteins include highly conserved N-terminal MYB repeats (1R, R2R3, 3R, and atypical) and various C-terminal sequences that confer extensive functions. However, the functions of most MYB genes are unknown, and have been little studied in Chinese cabbage. Here, we analyzed 256 (55.2% of total MYBs) R2R3-MYB genes from Chinese cabbage (Brassica rapa ssp. pekinensis) and anchored them onto the 10 chromosomes and three subgenomes. The R2R3-, 3R- and atypical MYB proteins in Chinese cabbage formed 45 subgroups based on domain similarity and phylogenetic topology. Organization and syntenic analysis revealed the genomic distribution and collinear relationships of the R2R3-BrMYBs. Synonymous nucleotide substitution (Ka/Ks) analysis showed that the Chinese cabbage MYB DNA-binding domain is under strong purifying selection. Moreover, RNA-seq data revealed tissue-specific and distinct R2R3-BrMYB expression profiles, and quantitative real-time PCR (qPCR) analysis in leaves showed stress responsive expression and crosstalk with ABA-auxin signaling cascades. In this study, we identified the largest MYB gene family in plants to date. Our results indicate that members of this superfamily may be involved in plant development, stress responses and leaf senescence, highlighting their functional diversity.

  4. Genome-wide identification and characterization of WRKY transcriptional factor family in apple and analysis of their responses to waterlogging and drought stress.

    PubMed

    Meng, Dong; Li, Yuanyuan; Bai, Yang; Li, Mingjun; Cheng, Lailiang

    2016-06-01

    As one of the largest transcriptional factor families in plants, WRKY genes play significant roles in various biotic and abiotic stress responses. Although the WRKY gene family has been characterized in a few plant species, the details remain largely unknown in the apple (Malus domestica Borkh.). In this study, we identified a total of 127 MdWRKYs from the apple genome, which were divided into four subgroups according to the WRKY domains and zinc finger motif. Most of them were mapped onto the apple's 17 chromosomes and were expressed in more than one tissue, including shoot tips, mature leaves, fruit and apple calli. We then contrasted WRKY expression patterns between calli grown in solid medium (control) and liquid medium (representing waterlogging stress) and found that 34 WRKY genes were differentially expressed between the two growing conditions. Finally, we determined the expression patterns of 10 selected WRKY genes in an apple rootstock, G41, in response to waterlogging and drought stress, which identified candidate genes involved in responses to water stress for functional analysis. Our data provide interesting candidate MdWRKYs for future functional analysis and demonstrate that apple callus is a useful system for characterizing gene expression and function in apple. Copyright © 2016 Elsevier Masson SAS. All rights reserved.

  5. Genome-wide transcriptional response of a Saccharomyces cerevisiae strain with an altered redox metabolism.

    PubMed

    Bro, Christoffer; Regenberg, Birgitte; Nielsen, Jens

    2004-02-05

    The genome-wide transcriptional response of a Saccharomyces cerevisiae strain deleted in GDH1 that encodes a NADP(+)-dependent glutamate dehydrogenase was compared to a wild-type strain under anaerobic steady-state conditions. The GDH1-deleted strain has a significantly reduced NADPH requirement, and therefore, an altered redox metabolism. Identification of genes with significantly changed expression using a t-test and a Bonferroni correction yielded only 16 transcripts when accepting two false-positives, and 7 of these were Open Reading Frames (ORFs) with unknown function. Among the 16 transcripts the only one with a direct link to redox metabolism was GND1, encoding phosphogluconate dehydrogenase. To extract additional information we analyzed the transcription data for a gene subset consisting of all known genes encoding metabolic enzymes that use NAD(+) or NADP(+). The subset was analyzed for genes with significantly changed expression again with a t-test and correction for multiple testing. This approach was found to enrich the analysis since GND1, ZWF1 and ALD6, encoding the most important enzymes for regeneration of NADPH under anaerobic conditions, were down-regulated together with eight other genes encoding NADP(H)-dependent enzymes. This indicates a possible common redox-dependent regulation of these genes. Furthermore, we showed that it might be necessary to analyze the expression of a subset of genes to extract all available information from global transcription analysis.

  6. Genome-Wide Analysis of the AP2/ERF Transcription Factors Family and the Expression Patterns of DREB Genes in Moso Bamboo (Phyllostachys edulis)

    PubMed Central

    Li, Long; Liu, Jun; Mu, Shaohua; Li, Xueping; Gao, Jian

    2015-01-01

    The AP2/ERF transcription factor family, one of the largest families unique to plants, performs a significant role in terms of regulation of growth and development, and responses to biotic and abiotic stresses. Moso bamboo (Phyllostachys edulis) is a fast-growing non-timber forest species with the highest ecological, economic and social values of all bamboos in Asia. The draft genome of moso bamboo and the available genomes of other plants provide great opportunities to research global information on the AP2/ERF family in moso bamboo. In total, 116 AP2/ERF transcription factors were identified in moso bamboo. The phylogeny analyses indicated that the 116 AP2/ERF genes could be divided into three subfamilies: AP2, RAV and ERF; and the ERF subfamily genes were divided into 11 groups. The gene structures, exons/introns and conserved motifs of the PeAP2/ERF genes were analyzed. Analysis of the evolutionary patterns and divergence showed the PeAP2/ERF genes underwent a large-scale event around 15 million years ago (MYA) and the division time of AP2/ERF family genes between rice and moso bamboo was 15–23 MYA. We surveyed the putative promoter regions of the PeDREBs and showed that largely stress-related cis-elements existed in these genes. Further analysis of expression patterns of PeDREBs revealed that the most were strongly induced by drought, low-temperature and/or high salinity stresses in roots and, in contrast, most PeDREB genes had negative functions in leaves under the same respective stresses. In this study there were two main interesting points: there were fewer members of the PeDREB subfamily in moso bamboo than in other plants and there were differences in DREB gene expression profiles between leaves and roots triggered in response to abiotic stress. The information produced from this study may be valuable in overcoming challenges in cultivating moso bamboo. PMID:25985202

  7. Genome-wide identification and characterization of the Populus WRKY transcription factor family and analysis of their expression in response to biotic and abiotic stresses.

    PubMed

    Jiang, Yuanzhong; Duan, Yanjiao; Yin, Jia; Ye, Shenglong; Zhu, Jingru; Zhang, Faqi; Lu, Wanxiang; Fan, Di; Luo, Keming

    2014-12-01

    WRKY proteins are a large family of regulators involved in various developmental and physiological processes, especially in coping with diverse biotic and abiotic stresses. In this study, 100 putative PtrWRKY genes encoded the proteins contained in the complete WRKY domain in Populus. Phylogenetic analysis revealed that the members of this superfamily among poplar, Arabidopsis, and other species were divided into three groups with several subgroups based on the structures of the WRKY protein sequences. Various cis-acting elements related to stress and defence responses were found in the promoter regions of PtrWRKY genes by promoter analysis. High-throughput transcriptomic analyses identified that 61 of the PtrWRKY genes were induced by biotic and abiotic treatments, such as Marssonina brunnea, salicylic acid (SA), methyl jasmonate (MeJA), wounding, cold, and salinity. Among these PtrWRKY genes, transcripts of 46 selected genes were observed in different tissues, including roots, stems, and leaves. Quantitative RT-PCR analysis further confirmed the induced expression of 18 PtrWRKY genes by one or more stress treatments. The overexpression of an SA-inducible gene, PtrWRKY89, accelerated expression of PR protein genes and improved resistance to pathogens in transgenic poplar, suggesting that PtrWRKY89 is a regulator of an SA-dependent defence-signalling pathway in poplar. Taken together, our results provided significant information for improving the resistance and stress tolerance of woody plants. © The Author 2014. Published by Oxford University Press on behalf of the Society for Experimental Biology.

  8. The peptide semax affects the expression of genes related to the immune and vascular systems in rat brain focal ischemia: genome-wide transcriptional analysis

    PubMed Central

    2014-01-01

    Background The nootropic neuroprotective peptide Semax (Met-Glu-His-Phe-Pro-Gly-Pro) has proved efficient in the therapy of brain stroke; however, the molecular mechanisms underlying its action remain obscure. Our genome-wide study was designed to investigate the response of the transcriptome of ischemized rat brain cortex tissues to the action of Semax in vivo. Results The gene-expression alteration caused by the action of the peptide Semax was compared with the gene expression of the “ischemia” group animals at 3 and 24 h after permanent middle cerebral artery occlusion (pMCAO). The peptide predominantly enhanced the expression of genes related to the immune system. Three hours after pMCAO, Semax influenced the expression of some genes that affect the activity of immune cells, and, 24 h after pMCAO, the action of Semax on the immune response increased considerably. The genes implicated in this response represented over 50% of the total number of genes that exhibited Semax-induced altered expression. Among the immune-response genes, the expression of which was modulated by Semax, genes that encode immunoglobulins and chemokines formed the most notable groups. In response to Semax administration, 24 genes related to the vascular system exhibited altered expression 3 h after pMCAO, whereas 12 genes were changed 24 h after pMCAO. These genes are associated with such processes as the development and migration of endothelial tissue, the migration of smooth muscle cells, hematopoiesis, and vasculogenesis. Conclusions Semax affects several biological processes involved in the function of various systems. The immune response is the process most markedly affected by the drug. Semax altered the expression of genes that modulate the amount and mobility of immune cells and enhanced the expression of genes that encode chemokines and immunoglobulins. In conditions of rat brain focal ischemia, Semax influenced the expression of genes that promote the formation and

  9. The peptide semax affects the expression of genes related to the immune and vascular systems in rat brain focal ischemia: genome-wide transcriptional analysis.

    PubMed

    Medvedeva, Ekaterina V; Dmitrieva, Veronika G; Povarova, Oksana V; Limborska, Svetlana A; Skvortsova, Veronika I; Myasoedov, Nikolay F; Dergunova, Lyudmila V

    2014-03-24

    The nootropic neuroprotective peptide Semax (Met-Glu-His-Phe-Pro-Gly-Pro) has proved efficient in the therapy of brain stroke; however, the molecular mechanisms underlying its action remain obscure. Our genome-wide study was designed to investigate the response of the transcriptome of ischemized rat brain cortex tissues to the action of Semax in vivo. The gene-expression alteration caused by the action of the peptide Semax was compared with the gene expression of the "ischemia" group animals at 3 and 24 h after permanent middle cerebral artery occlusion (pMCAO). The peptide predominantly enhanced the expression of genes related to the immune system. Three hours after pMCAO, Semax influenced the expression of some genes that affect the activity of immune cells, and, 24 h after pMCAO, the action of Semax on the immune response increased considerably. The genes implicated in this response represented over 50% of the total number of genes that exhibited Semax-induced altered expression. Among the immune-response genes, the expression of which was modulated by Semax, genes that encode immunoglobulins and chemokines formed the most notable groups. In response to Semax administration, 24 genes related to the vascular system exhibited altered expression 3 h after pMCAO, whereas 12 genes were changed 24 h after pMCAO. These genes are associated with such processes as the development and migration of endothelial tissue, the migration of smooth muscle cells, hematopoiesis, and vasculogenesis. Semax affects several biological processes involved in the function of various systems. The immune response is the process most markedly affected by the drug. Semax altered the expression of genes that modulate the amount and mobility of immune cells and enhanced the expression of genes that encode chemokines and immunoglobulins. In conditions of rat brain focal ischemia, Semax influenced the expression of genes that promote the formation and functioning of the vascular system

  10. Genome-wide analysis of brain and gonad transcripts reveals changes of key sex reversal-related genes expression and signaling pathways in three stages of Monopterus albus

    PubMed Central

    Hu, Qing; Guo, Wei; Li, Dapeng

    2017-01-01

    Background The natural sex reversal severely affects the sex ratio and thus decreases the productivity of the rice field eel (Monopterus albus). How to understand and manipulate this process is one of the major issues for the rice field eel stocking. So far the genomics and transcriptomics data available for this species are still scarce. Here we provide a comprehensive study of transcriptomes of brain and gonad tissue in three sex stages (female, intersex and male) from the rice field eel to investigate changes in transcriptional level during the sex reversal process. Results Approximately 195 thousand unigenes were generated and over 44.4 thousand were functionally annotated. Comparative study between stages provided multiple differentially expressed genes in brain and gonad tissue. Overall 4668 genes were found to be of unequal abundance between gonad tissues, far more than that of the brain tissues (59 genes). These genes were enriched in several different signaling pathways. A number of 231 genes were found with different levels in gonad in each stage, with several reproduction-related genes included. A total of 19 candidate genes that could be most related to sex reversal were screened out, part of these genes’ expression patterns were validated by RT-qPCR. The expression of spef2, maats1, spag6 and dmc1 were abundant in testis, but was barely detected in females, while the 17β-hsd12, zpsbp3, gal3 and foxn5 were only expressed in ovary. Conclusion This study investigated the complexity of brain and gonad transcriptomes in three sex stages of the rice field eel. Integrated analysis of different gene expression and changes in signaling pathways, such as PI3K-Akt pathway, provided crucial data for further study of sex transformation mechanisms. PMID:28319194

  11. Genome-wide analysis of thiourea-modulated salinity stress-responsive transcripts in seeds of Brassica juncea: identification of signalling and effector components of stress tolerance

    PubMed Central

    Srivastava, A. K.; Ramaswamy, N. K.; Suprasanna, P.; D'Souza, S. F.

    2010-01-01

    Background and Aims Abiotic stresses including salinity are the major constraints to crop production. In this regard, the use of thiourea (TU) in imparting salinity-stress tolerance to Indian mustard (Brassica juncea) has been demonstrated earlier. To gain an insight into the mechanism of TU action, various molecular and biochemical studies were conducted. Methods Microarray analysis was performed in seeds subjected to distilled water (control), 1 m NaCl, 1 m NaCl + 6·5 mm TU and 6·5 mm TU alone for 1 h. Real-time PCR validation of selected genes and biochemical studies were conducted under similar treatments at 1 h and 6 h. Key Results The microarray analysis revealed a differential expression profile of 33 genes in NaCl- and NaCl + TU-treated seeds, most of which are established markers of stress tolerance. The temporal regulation of eight selected genes by real-time PCR indicated their early and co-ordinated induction at 1 h in NaCl + TU only. Besides, NaCl + TU-treated seeds also maintained a higher level of abscisic acid, reduced to oxidized glutathione (GSH : GSSG) ratio and activities of catalase, phenylalanine ammonia lyase and glutathione-S-transferases, as compared with that of NaCl treatment. The addition of LaCl3 (a specific calcium-channel blocker) restricted the responses of TU both at molecular and biochemical level suggesting the possible involvement of a cytosolic calcium burst in the TU-mediated response. The TU-alone treatment was comparable to that of the control; however, it reduced the expression of some transcription factors and heat-shock proteins presumably due to the stabilization of the corresponding proteins. Conclusions The TU treatment co-ordinately regulates different signalling and effector mechanisms at an early stage to alleviate stress even under a high degree of salinity. This also indicates the potential of TU to be used as an effective bioregulator to impart salinity tolerance under field conditions. PMID:20736293

  12. Genome-Wide Analysis of Polyadenylation Events in Schmidtea mediterranea

    PubMed Central

    Lakshmanan, Vairavan; Bansal, Dhiru; Kulkarni, Jahnavi; Poduval, Deepak; Krishna, Srikar; Sasidharan, Vidyanand; Anand, Praveen; Seshasayee, Aswin; Palakodeti, Dasaradhi

    2016-01-01

    In eukaryotes, 3′ untranslated regions (UTRs) play important roles in regulating posttranscriptional gene expression. The 3′UTR is defined by regulated cleavage/polyadenylation of the pre-mRNA. The advent of next-generation sequencing technology has now enabled us to identify these events on a genome-wide scale. In this study, we used poly(A)-position profiling by sequencing (3P-Seq) to capture all poly(A) sites across the genome of the freshwater planarian, Schmidtea mediterranea, an ideal model system for exploring the process of regeneration and stem cell function. We identified the 3′UTRs for ∼14,000 transcripts and thus improved the existing gene annotations. We found 97 transcripts, which are polyadenylated within an internal exon, resulting in the shrinking of the ORF and loss of a predicted protein domain. Around 40% of the transcripts in planaria were alternatively polyadenylated (ApA), resulting either in an altered 3′UTR or a change in coding sequence. We identified specific ApA transcript isoforms that were subjected to miRNA mediated gene regulation using degradome sequencing. In this study, we also confirmed a tissue-specific expression pattern for alternate polyadenylated transcripts. The insights from this study highlight the potential role of ApA in regulating the gene expression essential for planarian regeneration. PMID:27489207

  13. Genome-wide analysis of brain transcriptional changes in honey bee (Apis mellifera L.) queens exposed to carbon dioxide and physical manipulation.

    PubMed

    Niño, E L; Tarpy, D R; Grozinger, C

    2011-06-01

    Mating is a complex process causing many behavioural and physiological changes, but the factors triggering them and the underlying molecular processes are not well characterized. In the present study we examine the effects of CO(2) (a commonly used anaesthetic in instrumental insemination that causes changes similar to those occurring after mating) and physical manipulation (which may mimic certain aspects of copulation) on the behavioural, physiological and brain transcriptional changes in honey bee queens. We show that while CO(2) causes cessation of mating flights and ovary activation, physical manipulation has additional effects on ovary activation and brain transcriptional changes. Comparisons with previous studies of honey bees and female Drosophila indicate that common molecular mechanisms may be responsible for regulating reproductive changes across different mating regimes and insect orders. © 2011 The Authors. Insect Molecular Biology © 2011 The Royal Entomological Society.

  14. Identification of a Salmonella ancillary copper detoxification mechanism by a comparative analysis of the genome-wide transcriptional response to copper and zinc excess.

    PubMed

    Pontel, Lucas B; Scampoli, Nadia L; Porwollik, Steffen; Checa, Susana K; McClelland, Michael; Soncini, Fernando C

    2014-08-01

    Copper and zinc are essential metal ions, but toxic in excess. Bacteria have evolved different strategies to control their intracellular concentrations, ensuring proper supply while avoiding toxicity, including the induction of metal-specific as well as non-specific mechanisms. We compared the transcriptional profiles of Salmonella Typhimurium after exposure to either copper or zinc ions in both rich and minimal media. Besides metal-specific regulatory networks many global stress-response pathways react to an excess of either of these metal ions. Copper excess affects both zinc and iron homeostasis by inducing transcription of these metal-specific regulons. In addition to the control of zinc-specific regulons, zinc excess affects the Cpx regulon and the σ(E) envelope-stress responses. Finally, novel metal-specific upregulated genes were detected including a new copper-detoxification pathway that involves the siderophore enterobactin and the outer-membrane protein TolC. This work sheds light onto the transcriptional landscape of Salmonella after copper or zinc overload, and discloses a new mechanism of copper detoxification.

  15. Genome-wide association interaction analysis for Alzheimer's disease

    PubMed Central

    Gusareva, Elena S.; Carrasquillo, Minerva M.; Bellenguez, Céline; Cuyvers, Elise; Colon, Samuel; Graff-Radford, Neill R.; Petersen, Ronald C.; Dickson, Dennis W.; Mahachie Johna, Jestinah M.; Bessonov, Kyrylo; Van Broeckhoven, Christine; Williams, Julie; Amouyel, Philippe; Sleegers, Kristel; Ertekin-Taner, Nilüfer; Lambert, Jean-Charles; Van Steen, Kristel

    2015-01-01

    We propose a minimal protocol for exhaustive genome-wide association interaction analysis that involves screening for epistasis over large-scale genomic data combining strengths of different methods and statistical tools. The different steps of this protocol are illustrated on a real-life data application for Alzheimer's disease (AD) (2259 patients and 6017 controls from France). Particularly, in the exhaustive genome-wide epistasis screening we identified AD-associated interacting SNPs-pair from chromosome 6q11.1 (rs6455128, the KHDRBS2 gene) and 13q12.11 (rs7989332, the CRYL1 gene) (p = 0.006, corrected for multiple testing). A replication analysis in the independent AD cohort from Germany (555 patients and 824 controls) confirmed the discovered epistasis signal (p = 0.036). This signal was also supported by a meta-analysis approach in 5 independent AD cohorts that was applied in the context of epistasis for the first time. Transcriptome analysis revealed negative correlation between expression levels of KHDRBS2 and CRYL1 in both the temporal cortex (β = −0.19, p = 0.0006) and cerebellum (β = −0.23, p < 0.0001) brain regions. This is the first time a replicable epistasis associated with AD was identified using a hypothesis free screening approach. PMID:24958192

  16. A genome-wide analysis reveals that the Drosophila transcription factor Lola promotes axon growth in part by suppressing expression of the actin nucleation factor Spire

    PubMed Central

    2011-01-01

    Background The phylogenetically conserved transcription factor Lola is essential for many aspects of axon growth and guidance, synapse formation and neural circuit development in Drosophila. To date it has been difficult, however, to obtain an overall view of Lola functions and mechanisms. Results We use expression microarrays to identify the lola-dependent transcriptome in the Drosophila embryo. We find that lola regulates the expression of a large selection of genes that are known to affect each of several lola-dependent developmental processes. Among other loci, we find lola to be a negative regulator of spire, an actin nucleation factor that has been studied for its essential role in oogenesis. We show that spire is expressed in the nervous system and is required for a known lola-dependent axon guidance decision, growth of ISNb motor axons. We further show that reducing spire gene dosage suppresses this aspect of the lola phenotype, verifying that derepression of spire is an important contributor to the axon stalling phenotype of embryonic motor axons in lola mutants. Conclusions These data shed new light on the molecular mechanisms of many lola-dependent processes, and also identify several developmental processes not previously linked to lola that are apt to be regulated by this transcription factor. These data further demonstrate that excessive expression of the actin nucleation factor Spire is as deleterious for axon growth in vivo as is the loss of Spire, thus highlighting the need for a balance in the elementary steps of actin dynamics to achieve effective neuronal morphogenesis. PMID:22129300

  17. Genome-wide identification, classification, and functional analysis of the basic helix-loop-helix transcription factors in the cattle, Bos Taurus.

    PubMed

    Li, Fengmei; Liu, Wuyi

    2017-06-01

    The basic helix-loop-helix (bHLH) transcription factors (TFs) form a huge superfamily and play crucial roles in many essential developmental, genetic, and physiological-biochemical processes of eukaryotes. In total, 109 putative bHLH TFs were identified and categorized successfully in the genomic databases of cattle, Bos Taurus, after removing redundant sequences and merging genetic isoforms. Through phylogenetic analyses, 105 proteins among these bHLH TFs were classified into 44 families with 46, 25, 14, 3, 13, and 4 members in the high-order groups A, B, C, D, E, and F, respectively. The remaining 4 bHLH proteins were sorted out as 'orphans.' Next, these 109 putative bHLH proteins identified were further characterized as significantly enriched in 524 significant Gene Ontology (GO) annotations (corrected P value ≤ 0.05) and 21 significantly enriched pathways (corrected P value ≤ 0.05) that had been mapped by the web server KOBAS 2.0. Furthermore, 95 bHLH proteins were further screened and analyzed together with two uncharacterized proteins in the STRING online database to reconstruct the protein-protein interaction network of cattle bHLH TFs. Ultimately, 89 bHLH proteins were fully mapped in a network with 67 biological process, 13 molecular functions, 5 KEGG pathways, 12 PFAM protein domains, and 25 INTERPRO classified protein domains and features. These results provide much useful information and a good reference for further functional investigations and updated researches on cattle bHLH TFs.

  18. Genome-wide identification of transcript start and end sites by Transcript Isoform Sequencing, TIF-Seq

    PubMed Central

    Pelechano, Vicent; Wei, Wu; Jakob, Petra; Steinmetz, Lars M.

    2014-01-01

    Hundreds of transcript isoforms with varying boundaries and alternative regulatory signals are transcribed from the genome, even in a genetically homogeneous population of cells. To study this transcriptional heterogeneity, we developed Transcript Isoform Sequencing (TIF-Seq), a method that allows the genome-wide profiling of full-length transcript isoforms defined by their exact 5′ and 3′ boundaries. TIF-Seq entails generating full-length cDNA libraries, followed by their circularization and the sequencing of the junction fragments spanning the 5′ and 3′ transcript ends. By determining the respective co-occurrence of start and end sites of individual transcript molecules, TIF-Seq can distinguish variations that conventional approaches for mapping single ends cannot, such as short abortive transcripts, bicistronic messages, and overlapping transcripts that differ in lengths. The TIF-Seq protocol we describe here can be applied to any eukaryotic organism (e.g., yeast, human) and requires 6-10 days to generate TIF-Seq libraries, 10 days for sequencing and 2-3 days for analysis. PMID:24967623

  19. Transcription, epigenetics and ameliorative strategies in Huntington's Disease: a genome-wide perspective.

    PubMed

    Valor, Luis M

    2015-02-01

    Transcriptional dysregulation in Huntington's disease (HD) is an early event that shapes the brain transcriptome by both the depletion and ectopic activation of gene products that eventually affect survival and neuronal functions. Disruption in the activity of gene expression regulators, such as transcription factors, chromatin-remodeling proteins, and noncoding RNAs, accounts for the expression changes observed in multiple animal and cellular models of HD and in samples from patients. Here, I review the recent advances in the study of HD transcriptional dysregulation and its causes to finally discuss the possible implications in ameliorative strategies from a genome-wide perspective. To date, the use of genome-wide approaches, predominantly based on microarray platforms, has been successful in providing an extensive catalog of differentially regulated genes, including biomarkers aimed at monitoring the progress of the pathology. Although still incipient, the introduction of combined next-generation sequencing techniques is enhancing our comprehension of the mechanisms underlying altered transcriptional dysregulation in HD by providing the first genomic landscapes associated with epigenetics and the occupancy of transcription factors. In addition, the use of genome-wide approaches is becoming more and more necessary to evaluate the efficacy and safety of ameliorative strategies and to identify novel mechanisms of amelioration that may help in the improvement of current preclinical therapeutics. Finally, the major conclusions obtained from HD transcriptomics studies have the potential to be extrapolated to other neurodegenerative disorders.

  20. High resolution genome wide binding event finding and motif discovery reveals transcription factor spatial binding constraints.

    PubMed

    Guo, Yuchun; Mahony, Shaun; Gifford, David K

    2012-01-01

    An essential component of genome function is the syntax of genomic regulatory elements that determine how diverse transcription factors interact to orchestrate a program of regulatory control. A precise characterization of in vivo spacing constraints between key transcription factors would reveal key aspects of this genomic regulatory language. To discover novel transcription factor spatial binding constraints in vivo, we developed a new integrative computational method, genome wide event finding and motif discovery (GEM). GEM resolves ChIP data into explanatory motifs and binding events at high spatial resolution by linking binding event discovery and motif discovery with positional priors in the context of a generative probabilistic model of ChIP data and genome sequence. GEM analysis of 63 transcription factors in 214 ENCODE human ChIP-Seq experiments recovers more known factor motifs than other contemporary methods, and discovers six new motifs for factors with unknown binding specificity. GEM's adaptive learning of binding-event read distributions allows it to further improve upon previous methods for processing ChIP-Seq and ChIP-exo data to yield unsurpassed spatial resolution and discovery of closely spaced binding events of the same factor. In a systematic analysis of in vivo sequence-specific transcription factor binding using GEM, we have found hundreds of spatial binding constraints between factors. GEM found 37 examples of factor binding constraints in mouse ES cells, including strong distance-specific constraints between Klf4 and other key regulatory factors. In human ENCODE data, GEM found 390 examples of spatially constrained pair-wise binding, including such novel pairs as c-Fos:c-Jun/USF1, CTCF/Egr1, and HNF4A/FOXA1. The discovery of new factor-factor spatial constraints in ChIP data is significant because it proposes testable models for regulatory factor interactions that will help elucidate genome function and the implementation of combinatorial

  1. Susceptibility to Childhood Pneumonia: A Genome-Wide Analysis.

    PubMed

    Hayden, Lystra P; Cho, Michael H; McDonald, Merry-Lynn N; Crapo, James D; Beaty, Terri H; Silverman, Edwin K; Hersh, Craig P

    2017-01-01

    Previous studies have indicated that in adult smokers, a history of childhood pneumonia is associated with reduced lung function and chronic obstructive pulmonary disease. There have been few previous investigations using genome-wide association studies to investigate genetic predisposition to pneumonia. This study aims to identify the genetic variants associated with the development of pneumonia during childhood and over the course of the lifetime. Study subjects included current and former smokers with and without chronic obstructive pulmonary disease participating in the COPDGene Study. Pneumonia was defined by subject self-report, with childhood pneumonia categorized as having the first episode at <16 years. Genome-wide association studies for childhood pneumonia (843 cases, 9,091 control subjects) and lifetime pneumonia (3,766 cases, 5,659 control subjects) were performed separately in non-Hispanic whites and African Americans. Non-Hispanic white and African American populations were combined in the meta-analysis. Top genetic variants from childhood pneumonia were assessed in network analysis. No single-nucleotide polymorphisms reached genome-wide significance, although we identified potential regions of interest. In the childhood pneumonia analysis, this included variants in NGR1 (P = 6.3 × 10(-8)), PAK6 (P = 3.3 × 10(-7)), and near MATN1 (P = 2.8 × 10(-7)). In the lifetime pneumonia analysis, this included variants in LOC339862 (P = 8.7 × 10(-7)), RAPGEF2 (P = 8.4 × 10(-7)), PHACTR1 (P = 6.1 × 10(-7)), near PRR27 (P = 4.3 × 10(-7)), and near MCPH1 (P = 2.7 × 10(-7)). Network analysis of the genes associated with childhood pneumonia included top networks related to development, blood vessel morphogenesis, muscle contraction, WNT signaling, DNA damage, apoptosis, inflammation, and immune response (P ≤ 0.05). We have identified genes potentially associated with the risk of pneumonia

  2. Genome-wide analysis and expression patterns of ZF-HD transcription factors under different developmental tissues and abiotic stresses in Chinese cabbage.

    PubMed

    Wang, Wenli; Wu, Peng; Li, Ying; Hou, XiLin

    2016-06-01

    The ZF-HD gene family plays an important role in plant developmental processes and stress responses. However, the function of the ZF-HD genes in Chinese cabbage remains largely unknown. Chinese cabbage (Brassica rapa ssp. pekinensis) is a member of one of the most important leaf vegetables grown worldwide. The entire Chinese cabbage genome sequence has been determined, and more than forty thousand proteins have been identified to date. In this study, 31 ZF-HD genes were identified in Chinese cabbage. We show here that the BraZF-HD genes could be categorized into ZHD and MIF subfamilies. Among them, ZHD genes are plant-specific, nearly all intronless, and related to MINI ZINC FINGER genes that possess only the zinc finger. Phylogenetic analysis suggested that ZHDs have expanded considerably during angiosperm evolution. In addition, the ZHD group has 24 members, which is twice as much as the Arabidopsis ZHD group, indicating that the Chinese cabbage ZHD genes have been retained more frequently than other group genes. Real-time PCR analysis showed that most of BraZF-HD genes are preferentially expressed in flower. Furthermore, most of these genes are significantly induced under photoperiod or vernalization conditions, as well as abiotic stresses. Thereby implying that they may play important roles in these processes. This study provides insight into the evolution of ZF-HD genes in Chinese cabbage genome and may aid efforts to further characterize the function of these predicted ZF-HD genes in flowering and resistance.

  3. Genome-Wide Dissection of the Heat Shock Transcription Factor Family Genes in Arachis

    PubMed Central

    Wang, Pengfei; Song, Hui; Li, Changsheng; Li, Pengcheng; Li, Aiqin; Guan, Hongshan; Hou, Lei; Wang, Xingjun

    2017-01-01

    Heat shock transcription factors (Hsfs) are important transcription factors (TFs) in protecting plants from damages caused by various stresses. The released whole genome sequences of wild peanuts make it possible for genome-wide analysis of Hsfs in peanut. In this study, a total of 16 and 17 Hsf genes were identified from Arachis duranensis and A. ipaensis, respectively. We identified 16 orthologous Hsf gene pairs in both peanut species; however HsfXs was only identified from A. ipaensis. Orthologous pairs between two wild peanut species were highly syntenic. Based on phylogenetic relationship, peanut Hsfs were divided into groups A, B, and C. Selection pressure analysis showed that group B Hsf genes mainly underwent positive selection and group A Hsfs were affected by purifying selection. Small scale segmental and tandem duplication may play important roles in the evolution of these genes. Cis-elements, such as ABRE, DRE, and HSE, were found in the promoters of most Arachis Hsf genes. Five AdHsfs and two AiHsfs contained fungal elicitor responsive elements suggesting their involvement in response to fungi infection. These genes were differentially expressed in cultivated peanut under abiotic stress and Aspergillus flavus infection. AhHsf2 and AhHsf14 were significantly up-regulated after inoculation with A. flavus suggesting their possible role in fungal resistance. PMID:28220134

  4. Genome-Wide Dissection of the Heat Shock Transcription Factor Family Genes in Arachis.

    PubMed

    Wang, Pengfei; Song, Hui; Li, Changsheng; Li, Pengcheng; Li, Aiqin; Guan, Hongshan; Hou, Lei; Wang, Xingjun

    2017-01-01

    Heat shock transcription factors (Hsfs) are important transcription factors (TFs) in protecting plants from damages caused by various stresses. The released whole genome sequences of wild peanuts make it possible for genome-wide analysis of Hsfs in peanut. In this study, a total of 16 and 17 Hsf genes were identified from Arachis duranensis and A. ipaensis, respectively. We identified 16 orthologous Hsf gene pairs in both peanut species; however HsfXs was only identified from A. ipaensis. Orthologous pairs between two wild peanut species were highly syntenic. Based on phylogenetic relationship, peanut Hsfs were divided into groups A, B, and C. Selection pressure analysis showed that group B Hsf genes mainly underwent positive selection and group A Hsfs were affected by purifying selection. Small scale segmental and tandem duplication may play important roles in the evolution of these genes. Cis-elements, such as ABRE, DRE, and HSE, were found in the promoters of most Arachis Hsf genes. Five AdHsfs and two AiHsfs contained fungal elicitor responsive elements suggesting their involvement in response to fungi infection. These genes were differentially expressed in cultivated peanut under abiotic stress and Aspergillus flavus infection. AhHsf2 and AhHsf14 were significantly up-regulated after inoculation with A. flavus suggesting their possible role in fungal resistance.

  5. Genome-wide in silico identification of GPI proteins in Mycosphaerella fijiensis and transcriptional analysis of two GPI-anchored β-1,3-glucanosyltransferases.

    PubMed

    Kantún-Moreno, Nuvia; Vázquez-Euán, Roberto; Tzec-Simá, Miguel; Peraza-Echeverría, Leticia; Grijalva-Arango, Rosa; Rodríguez-García, Cecilia; James, Andrew C; Ramírez-Prado, Jorge; Islas-Flores, Ignacio; Canto-Canché, Blondy

    2013-01-01

    The hemibiotrophic fungus Mycosphaerella fijiensis is the causal agent of black Sigatoka (BS), the most devastating foliar disease in banana (Musa spp.) worldwide. Little is known about genes that are important during M. fijiensis-Musa sp. interaction. The fungal cell wall is an attractive area of study because it is essential for maintenance of cellular homeostasis and it is the most external structure in the fungal cell and therefore mediates the interaction of the pathogen with the host. In this manuscript we describe the in silico identification of glycosyl phosphatidylinositol-protein (GPI) family in M. fijiensis, and the analysis of two β-1,3-glucanosyltrans-ferases (Gas), selected by homology with fungal pathogenicity factors. Potential roles in pathogenesis were evaluated through analyzing expression during different stages of black Sigatoka disease, comparing expression data with BS symptoms and fungal biomass inside leaves. Real-time quantitative RT-PCR showed nearly constant expression of MfGAS1 with slightly increases (about threefold) in conidia and at speck-necrotrophic stage during banana-pathogen interaction. Conversely, MfGAS2 expression was increased during biotrophy (about seven times) and reached a maximum at speck (about 23 times) followed by a progressive decrease in next stages, suggesting an active role in M. fijiensis pathogenesis.

  6. Systems-Level Analysis of Genome-Wide Association Data

    PubMed Central

    Farber, Charles R.

    2013-01-01

    Genome-wide association studies (GWAS) have emerged as the method of choice for identifying common variants affecting complex disease. In a GWAS, particular attention is placed, for obvious reasons, on single-nucleotide polymorphisms (SNPs) that exceed stringent genome-wide significance thresholds. However, it is expected that many SNPs with only nominal evidence of association (e.g., P < 0.05) truly influence disease. Efforts to extract additional biological information from entire GWAS datasets have primarily focused on pathway-enrichment analyses. However, these methods suffer from a number of limitations and typically fail to lead to testable hypotheses. To evaluate alternative approaches, we performed a systems-level analysis of GWAS data using weighted gene coexpression network analysis. A weighted gene coexpression network was generated for 1918 genes harboring SNPs that displayed nominal evidence of association (P ≤ 0.05) from a GWAS of bone mineral density (BMD) using microarray data on circulating monocytes isolated from individuals with extremely low or high BMD. Thirteen distinct gene modules were identified, each comprising coexpressed and highly interconnected GWAS genes. Through the characterization of module content and topology, we illustrate how network analysis can be used to discover disease-associated subnetworks and characterize novel interactions for genes with a known role in the regulation of BMD. In addition, we provide evidence that network metrics can be used as a prioritizing tool when selecting genes and SNPs for replication studies. Our results highlight the advantages of using systems-level strategies to add value to and inform GWAS. PMID:23316444

  7. The Csr system regulates genome-wide mRNA stability and transcription and thus gene expression in Escherichia coli.

    PubMed

    Esquerré, Thomas; Bouvier, Marie; Turlan, Catherine; Carpousis, Agamemnon J; Girbal, Laurence; Cocaign-Bousquet, Muriel

    2016-04-26

    Bacterial adaptation requires large-scale regulation of gene expression. We have performed a genome-wide analysis of the Csr system, which regulates many important cellular functions. The Csr system is involved in post-transcriptional regulation, but a role in transcriptional regulation has also been suggested. Two proteins, an RNA-binding protein CsrA and an atypical signaling protein CsrD, participate in the Csr system. Genome-wide transcript stabilities and levels were compared in wildtype E. coli (MG1655) and isogenic mutant strains deficient in CsrA or CsrD activity demonstrating for the first time that CsrA and CsrD are global negative and positive regulators of transcription, respectively. The role of CsrA in transcription regulation may be indirect due to the 4.6-fold increase in csrD mRNA concentration in the CsrA deficient strain. Transcriptional action of CsrA and CsrD on a few genes was validated by transcriptional fusions. In addition to an effect on transcription, CsrA stabilizes thousands of mRNAs. This is the first demonstration that CsrA is a global positive regulator of mRNA stability. For one hundred genes, we predict that direct control of mRNA stability by CsrA might contribute to metabolic adaptation by regulating expression of genes involved in carbon metabolism and transport independently of transcriptional regulation.

  8. The Csr system regulates genome-wide mRNA stability and transcription and thus gene expression in Escherichia coli

    PubMed Central

    Esquerré, Thomas; Bouvier, Marie; Turlan, Catherine; Carpousis, Agamemnon J.; Girbal, Laurence; Cocaign-Bousquet, Muriel

    2016-01-01

    Bacterial adaptation requires large-scale regulation of gene expression. We have performed a genome-wide analysis of the Csr system, which regulates many important cellular functions. The Csr system is involved in post-transcriptional regulation, but a role in transcriptional regulation has also been suggested. Two proteins, an RNA-binding protein CsrA and an atypical signaling protein CsrD, participate in the Csr system. Genome-wide transcript stabilities and levels were compared in wildtype E. coli (MG1655) and isogenic mutant strains deficient in CsrA or CsrD activity demonstrating for the first time that CsrA and CsrD are global negative and positive regulators of transcription, respectively. The role of CsrA in transcription regulation may be indirect due to the 4.6-fold increase in csrD mRNA concentration in the CsrA deficient strain. Transcriptional action of CsrA and CsrD on a few genes was validated by transcriptional fusions. In addition to an effect on transcription, CsrA stabilizes thousands of mRNAs. This is the first demonstration that CsrA is a global positive regulator of mRNA stability. For one hundred genes, we predict that direct control of mRNA stability by CsrA might contribute to metabolic adaptation by regulating expression of genes involved in carbon metabolism and transport independently of transcriptional regulation. PMID:27112822

  9. The CHR site: definition and genome-wide identification of a cell cycle transcriptional element.

    PubMed

    Müller, Gerd A; Wintsche, Axel; Stangner, Konstanze; Prohaska, Sonja J; Stadler, Peter F; Engeland, Kurt

    2014-01-01

    The cell cycle genes homology region (CHR) has been identified as a DNA element with an important role in transcriptional regulation of late cell cycle genes. It has been shown that such genes are controlled by DREAM, MMB and FOXM1-MuvB and that these protein complexes can contact DNA via CHR sites. However, it has not been elucidated which sequence variations of the canonical CHR are functional and how frequent CHR-based regulation is utilized in mammalian genomes. Here, we define the spectrum of functional CHR elements. As the basis for a computational meta-analysis, we identify new CHR sequences and compile phylogenetic motif conservation as well as genome-wide protein-DNA binding and gene expression data. We identify CHR elements in most late cell cycle genes binding DREAM, MMB, or FOXM1-MuvB. In contrast, Myb- and forkhead-binding sites are underrepresented in both early and late cell cycle genes. Our findings support a general mechanism: sequential binding of DREAM, MMB and FOXM1-MuvB complexes to late cell cycle genes requires CHR elements. Taken together, we define the group of CHR-regulated genes in mammalian genomes and provide evidence that the CHR is the central promoter element in transcriptional regulation of late cell cycle genes by DREAM, MMB and FOXM1-MuvB. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  10. Genome-Wide Transcriptional Response of Saccharomyces cerevisiae to Stress-Induced Perturbations

    PubMed Central

    Taymaz-Nikerel, Hilal; Cankorur-Cetinkaya, Ayca; Kirdar, Betul

    2016-01-01

    Cells respond to environmental and/or genetic perturbations in order to survive and proliferate. Characterization of the changes after various stimuli at different -omics levels is crucial to comprehend the adaptation of cells to the changing conditions. Genome-wide quantification and analysis of transcript levels, the genes affected by perturbations, extends our understanding of cellular metabolism by pointing out the mechanisms that play role in sensing the stress caused by those perturbations and related signaling pathways, and in this way guides us to achieve endeavors, such as rational engineering of cells or interpretation of disease mechanisms. Saccharomyces cerevisiae as a model system has been studied in response to different perturbations and corresponding transcriptional profiles were followed either statically or/and dynamically, short and long term. This review focuses on response of yeast cells to diverse stress inducing perturbations, including nutritional changes, ionic stress, salt stress, oxidative stress, osmotic shock, and to genetic interventions such as deletion and overexpression of genes. It is aimed to conclude on common regulatory phenomena that allow yeast to organize its transcriptomic response after any perturbation under different external conditions. PMID:26925399

  11. Genome-wide transcript profiling reveals novel breast cancer-associated intronic sense RNAs.

    PubMed

    Kim, Sang Woo; Fishilevich, Elane; Arango-Argoty, Gustavo; Lin, Yuefeng; Liu, Guodong; Li, Zhihua; Monaghan, A Paula; Nichols, Mark; John, Bino

    2015-01-01

    Non-coding RNAs (ncRNAs) play major roles in development and cancer progression. To identify novel ncRNAs that may identify key pathways in breast cancer development, we performed high-throughput transcript profiling of tumor and normal matched-pair tissue samples. Initial transcriptome profiling using high-density genome-wide tiling arrays revealed changes in over 200 novel candidate genomic regions that map to intronic regions. Sixteen genomic loci were identified that map to the long introns of five key protein-coding genes, CRIM1, EPAS1, ZEB2, RBMS1, and RFX2. Consistent with the known role of the tumor suppressor ZEB2 in the cancer-associated epithelial to mesenchymal transition (EMT), in situ hybridization reveals that the intronic regions deriving from ZEB2 as well as those from RFX2 and EPAS1 are down-regulated in cells of epithelial morphology, suggesting that these regions may be important for maintaining normal epithelial cell morphology. Paired-end deep sequencing analysis reveals a large number of distinct genomic clusters with no coding potential within the introns of these genes. These novel transcripts are only transcribed from the coding strand. A comprehensive search for breast cancer associated genes reveals enrichment for transcribed intronic regions from these loci, pointing to an underappreciated role of introns or mechanisms relating to their biology in EMT and breast cancer.

  12. Genome-Wide Transcript Profiling Reveals Novel Breast Cancer-Associated Intronic Sense RNAs

    PubMed Central

    Lin, Yuefeng; Liu, Guodong; Li, Zhihua; Monaghan, A. Paula; Nichols, Mark; John, Bino

    2015-01-01

    Non-coding RNAs (ncRNAs) play major roles in development and cancer progression. To identify novel ncRNAs that may identify key pathways in breast cancer development, we performed high-throughput transcript profiling of tumor and normal matched-pair tissue samples. Initial transcriptome profiling using high-density genome-wide tiling arrays revealed changes in over 200 novel candidate genomic regions that map to intronic regions. Sixteen genomic loci were identified that map to the long introns of five key protein-coding genes, CRIM1, EPAS1, ZEB2, RBMS1, and RFX2. Consistent with the known role of the tumor suppressor ZEB2 in the cancer-associated epithelial to mesenchymal transition (EMT), in situ hybridization reveals that the intronic regions deriving from ZEB2 as well as those from RFX2 and EPAS1 are down-regulated in cells of epithelial morphology, suggesting that these regions may be important for maintaining normal epithelial cell morphology. Paired-end deep sequencing analysis reveals a large number of distinct genomic clusters with no coding potential within the introns of these genes. These novel transcripts are only transcribed from the coding strand. A comprehensive search for breast cancer associated genes reveals enrichment for transcribed intronic regions from these loci, pointing to an underappreciated role of introns or mechanisms relating to their biology in EMT and breast cancer. PMID:25798919

  13. The CHR site: definition and genome-wide identification of a cell cycle transcriptional element

    PubMed Central

    Müller, Gerd A.; Wintsche, Axel; Stangner, Konstanze; Prohaska, Sonja J.; Stadler, Peter F.; Engeland, Kurt

    2014-01-01

    The cell cycle genes homology region (CHR) has been identified as a DNA element with an important role in transcriptional regulation of late cell cycle genes. It has been shown that such genes are controlled by DREAM, MMB and FOXM1-MuvB and that these protein complexes can contact DNA via CHR sites. However, it has not been elucidated which sequence variations of the canonical CHR are functional and how frequent CHR-based regulation is utilized in mammalian genomes. Here, we define the spectrum of functional CHR elements. As the basis for a computational meta-analysis, we identify new CHR sequences and compile phylogenetic motif conservation as well as genome-wide protein-DNA binding and gene expression data. We identify CHR elements in most late cell cycle genes binding DREAM, MMB, or FOXM1-MuvB. In contrast, Myb- and forkhead-binding sites are underrepresented in both early and late cell cycle genes. Our findings support a general mechanism: sequential binding of DREAM, MMB and FOXM1-MuvB complexes to late cell cycle genes requires CHR elements. Taken together, we define the group of CHR-regulated genes in mammalian genomes and provide evidence that the CHR is the central promoter element in transcriptional regulation of late cell cycle genes by DREAM, MMB and FOXM1-MuvB. PMID:25106871

  14. Where to begin? Mapping transcription start sites genome-wide in Escherichia coli.

    PubMed

    Wade, Joseph T

    2015-01-01

    Recent genome-wide studies of bacterial transcription have revealed large numbers of promoters located inside genes. In this issue of the Journal of Bacteriology, Thomason and colleagues (J. Bacteriol. 197:18-28, 2015, doi:10.1128/JB.02096-14) map transcription start sites in Escherichia coli on an unprecedented scale. This work provides important insights into the regulation of transcripts that initiate inside genes and sources of variability between studies aimed at identifying these RNAs. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  15. Genome-Wide Analysis of Human MicroRNA Stability

    PubMed Central

    Li, Yang; Li, Zhixin; Zhou, Shixin; Wen, Jinhua; Geng, Bin; Yang, Jichun; Cui, Qinghua

    2013-01-01

    Increasing studies have shown that microRNA (miRNA) stability plays important roles in physiology. However, the global picture of miRNA stability remains largely unknown. Here, we had analyzed genome-wide miRNA stability across 10 diverse cell types using miRNA arrays. We found that miRNA stability shows high dynamics and diversity both within individual cells and across cell types. Strikingly, we observed a negative correlation between miRNA stability and miRNA expression level, which is different from current findings on other biological molecules such as proteins and mRNAs that show positive and not negative correlations between stability and expression level. This finding indicates that miRNA has a distinct action mode, which we called “rapid production, rapid turnover; slow production, slow turnover.” This mode further suggests that high expression miRNAs normally degrade fast and may endow the cell with special properties that facilitate cellular status-transition. Moreover, we revealed that the stability of miRNAs is affected by cohorts of factors that include miRNA targets, transcription factors, nucleotide content, evolution, associated disease, and environmental factors. Together, our results provided an extensive description of the global landscape, dynamics, and distinct mode of human miRNA stability, which provide help in investigating their functions in physiology and pathophysiology. PMID:24187663

  16. Weighted SNP set analysis in genome-wide association study.

    PubMed

    Dai, Hui; Zhao, Yang; Qian, Cheng; Cai, Min; Zhang, Ruyang; Chu, Minjie; Dai, Juncheng; Hu, Zhibin; Shen, Hongbing; Chen, Feng

    2013-01-01

    Genome-wide association studies (GWAS) are popular for identifying genetic variants which are associated with disease risk. Many approaches have been proposed to test multiple single nucleotide polymorphisms (SNPs) in a region simultaneously which considering disadvantages of methods in single locus association analysis. Kernel machine based SNP set analysis is more powerful than single locus analysis, which borrows information from SNPs correlated with causal or tag SNPs. Four types of kernel machine functions and principal component based approach (PCA) were also compared. However, given the loss of power caused by low minor allele frequencies (MAF), we conducted an extension work on PCA and used a new method called weighted PCA (wPCA). Comparative analysis was performed for weighted principal component analysis (wPCA), logistic kernel machine based test (LKM) and principal component analysis (PCA) based on SNP set in the case of different minor allele frequencies (MAF) and linkage disequilibrium (LD) structures. We also applied the three methods to analyze two SNP sets extracted from a real GWAS dataset of non-small cell lung cancer in Han Chinese population. Simulation results show that when the MAF of the causal SNP is low, weighted principal component and weighted IBS are more powerful than PCA and other kernel machine functions at different LD structures and different numbers of causal SNPs. Application of the three methods to a real GWAS dataset indicates that wPCA and wIBS have better performance than the linear kernel, IBS kernel and PCA.

  17. Transcription facilitated genome-wide recruitment of topoisomerase I and DNA gyrase.

    PubMed

    Ahmed, Wareed; Sala, Claudia; Hegde, Shubhada R; Jha, Rajiv Kumar; Cole, Stewart T; Nagaraja, Valakunja

    2017-05-01

    Movement of the transcription machinery along a template alters DNA topology resulting in the accumulation of supercoils in DNA. The positive supercoils generated ahead of transcribing RNA polymerase (RNAP) and the negative supercoils accumulating behind impose severe topological constraints impeding transcription process. Previous studies have implied the role of topoisomerases in the removal of torsional stress and the maintenance of template topology but the in vivo interaction of functionally distinct topoisomerases with heterogeneous chromosomal territories is not deciphered. Moreover, how the transcription-induced supercoils influence the genome-wide recruitment of DNA topoisomerases remains to be explored in bacteria. Using ChIP-Seq, we show the genome-wide occupancy profile of both topoisomerase I and DNA gyrase in conjunction with RNAP in Mycobacterium tuberculosis taking advantage of minimal topoisomerase representation in the organism. The study unveils the first in vivo genome-wide interaction of both the topoisomerases with the genomic regions and establishes that transcription-induced supercoils govern their recruitment at genomic sites. Distribution profiles revealed co-localization of RNAP and the two topoisomerases on the active transcriptional units (TUs). At a given locus, topoisomerase I and DNA gyrase were localized behind and ahead of RNAP, respectively, correlating with the twin-supercoiled domains generated. The recruitment of topoisomerases was higher at the genomic loci with higher transcriptional activity and/or at regions under high torsional stress compared to silent genomic loci. Importantly, the occupancy of DNA gyrase, sole type II topoisomerase in Mtb, near the Ter domain of the Mtb chromosome validates its function as a decatenase.

  18. Transcription facilitated genome-wide recruitment of topoisomerase I and DNA gyrase

    PubMed Central

    Ahmed, Wareed; Sala, Claudia; Hegde, Shubhada R.; Jha, Rajiv Kumar

    2017-01-01

    Movement of the transcription machinery along a template alters DNA topology resulting in the accumulation of supercoils in DNA. The positive supercoils generated ahead of transcribing RNA polymerase (RNAP) and the negative supercoils accumulating behind impose severe topological constraints impeding transcription process. Previous studies have implied the role of topoisomerases in the removal of torsional stress and the maintenance of template topology but the in vivo interaction of functionally distinct topoisomerases with heterogeneous chromosomal territories is not deciphered. Moreover, how the transcription-induced supercoils influence the genome-wide recruitment of DNA topoisomerases remains to be explored in bacteria. Using ChIP-Seq, we show the genome-wide occupancy profile of both topoisomerase I and DNA gyrase in conjunction with RNAP in Mycobacterium tuberculosis taking advantage of minimal topoisomerase representation in the organism. The study unveils the first in vivo genome-wide interaction of both the topoisomerases with the genomic regions and establishes that transcription-induced supercoils govern their recruitment at genomic sites. Distribution profiles revealed co-localization of RNAP and the two topoisomerases on the active transcriptional units (TUs). At a given locus, topoisomerase I and DNA gyrase were localized behind and ahead of RNAP, respectively, correlating with the twin-supercoiled domains generated. The recruitment of topoisomerases was higher at the genomic loci with higher transcriptional activity and/or at regions under high torsional stress compared to silent genomic loci. Importantly, the occupancy of DNA gyrase, sole type II topoisomerase in Mtb, near the Ter domain of the Mtb chromosome validates its function as a decatenase. PMID:28463980

  19. Genome-wide transcription-coupled repair in Escherichia coli is mediated by the Mfd translocase

    PubMed Central

    Adebali, Ogun; Chiou, Yi-Ying; Hu, Jinchuan; Sancar, Aziz; Selby, Christopher P.

    2017-01-01

    We used high-throughput sequencing of short, cyclobutane pyrimidine dimer-containing ssDNA oligos generated during repair of UV-induced damage to study that process at both mechanistic and systemic levels in Escherichia coli. Numerous important insights on DNA repair were obtained, bringing clarity to the respective roles of UvrD helicase and Mfd translocase in repair of UV-induced damage. Mechanistically, experiments showed that the predominant role of UvrD in vivo is to unwind the excised 13-mer from dsDNA and that mutation of uvrD results in remarkable protection of that oligo from exonuclease activity as it remains hybridized to the dsDNA. Genome-wide analysis of the transcribed strand/nontranscribed strand (TS/NTS) repair ratio demonstrated that deletion of mfd globally shifts the distribution of TS/NTS ratios downward by a factor of about 2 on average for the most highly transcribed genes. Even for the least transcribed genes, Mfd played a role in preferential repair of the transcribed strand. On the other hand, mutation of uvrD, if anything, slightly pushed the distribution of TS/NTS ratios to higher ratios. These results indicate that Mfd is the transcription repair-coupling factor whereas UvrD plays a role in excision repair by aiding the catalytic turnover of excision repair proteins. PMID:28167766

  20. Genome-wide antisense transcription drives mRNA processing in bacteria

    PubMed Central

    Lasa, Iñigo; Toledo-Arana, Alejandro; Dobin, Alexander; Villanueva, Maite; de los Mozos, Igor Ruiz; Vergara-Irigaray, Marta; Segura, Víctor; Fagegaltier, Delphine; Penadés, José R.; Valle, Jaione; Solano, Cristina; Gingeras, Thomas R.

    2011-01-01

    RNA deep sequencing technologies are revealing unexpected levels of complexity in bacterial transcriptomes with the discovery of abundant noncoding RNAs, antisense RNAs, long 5′ and 3′ untranslated regions, and alternative operon structures. Here, by applying deep RNA sequencing to both the long and short RNA fractions (<50 nucleotides) obtained from the major human pathogen Staphylococcus aureus, we have detected a collection of short RNAs that is generated genome-wide through the digestion of overlapping sense/antisense transcripts by RNase III endoribonuclease. At least 75% of sense RNAs from annotated genes are subject to this mechanism of antisense processing. Removal of RNase III activity reduces the amount of short RNAs and is accompanied by the accumulation of discrete antisense transcripts. These results suggest the production of pervasive but hidden antisense transcription used to process sense transcripts by means of creating double-stranded substrates. This process of RNase III-mediated digestion of overlapping transcripts can be observed in several evolutionarily diverse Gram-positive bacteria and is capable of providing a unique genome-wide posttranscriptional mechanism to adjust mRNA levels. PMID:22123973

  1. Comparative analysis of methods for genome-wide nucleosome cartography.

    PubMed

    Quintales, Luis; Vázquez, Enrique; Antequera, Francisco

    2015-07-01

    Nucleosomes contribute to compacting the genome into the nucleus and regulate the physical access of regulatory proteins to DNA either directly or through the epigenetic modifications of the histone tails. Precise mapping of nucleosome positioning across the genome is, therefore, essential to understanding the genome regulation. In recent years, several experimental protocols have been developed for this purpose that include the enzymatic digestion, chemical cleavage or immunoprecipitation of chromatin followed by next-generation sequencing of the resulting DNA fragments. Here, we compare the performance and resolution of these methods from the initial biochemical steps through the alignment of the millions of short-sequence reads to a reference genome to the final computational analysis to generate genome-wide maps of nucleosome occupancy. Because of the lack of a unified protocol to process data sets obtained through the different approaches, we have developed a new computational tool (NUCwave), which facilitates their analysis, comparison and assessment and will enable researchers to choose the most suitable method for any particular purpose. NUCwave is freely available at http://nucleosome.usal.es/nucwave along with a step-by-step protocol for its use. © The Author 2014. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

  2. Fine mapping by composite genome-wide association analysis.

    PubMed

    Casellas, Joaquim; Cañas-Álvarez, Jhon Jacobo; Fina, Marta; Piedrafita, Jesús; Cecchinato, Alessio

    2017-06-06

    Genome-wide association (GWA) studies play a key role in current genetics research, unravelling genomic regions linked to phenotypic traits of interest in multiple species. Nevertheless, the extent of linkage disequilibrium (LD) may provide confounding results when significant genetic markers span along several contiguous cM. In this study, we have adapted the composite interval mapping approach to the GWA framework (composite GWA), in order to evaluate the impact of including competing (possibly linked) genetic markers when testing for the additive allelic effect inherent to a given genetic marker. We tested model performance on simulated data sets under different scenarios (i.e., qualitative trait loci effects, LD between genetic markers and width of the genomic region involved in the analysis). Our results showed that the genomic region had a small impact on the number of competing single nucleotide polymorphisms (SNPs) as well as on the precision of the composite GWA analysis. A similar conclusion was derived from the preferable range of LD between the tested SNP and competing SNPs, although moderate-to-high LD seemed to attenuate the loss of statistical power. The composite GWA improved specificity and reduced the number of significant genetic markers. The composite GWA model contributes a novel point of view for GWA analyses where testing circumscribed to the genomic region flanking each SNP (delimited by the nearest competing SNPs) and conditioning on linked markers increases the precision to locate causal mutations, but possibly at the expense of power.

  3. Natural selection on functional modules, a genome-wide analysis.

    PubMed

    Serra, François; Arbiza, Leonardo; Dopazo, Joaquín; Dopazo, Hernán

    2011-03-01

    Classically, the functional consequences of natural selection over genomes have been analyzed as the compound effects of individual genes. The current paradigm for large-scale analysis of adaptation is based on the observed significant deviations of rates of individual genes from neutral evolutionary expectation. This approach, which assumed independence among genes, has not been able to identify biological functions significantly enriched in positively selected genes in individual species. Alternatively, pooling related species has enhanced the search for signatures of selection. However, grouping signatures does not allow testing for adaptive differences between species. Here we introduce the Gene-Set Selection Analysis (GSSA), a new genome-wide approach to test for evidences of natural selection on functional modules. GSSA is able to detect lineage specific evolutionary rate changes in a notable number of functional modules. For example, in nine mammal and Drosophilae genomes GSSA identifies hundreds of functional modules with significant associations to high and low rates of evolution. Many of the detected functional modules with high evolutionary rates have been previously identified as biological functions under positive selection. Notably, GSSA identifies conserved functional modules with many positively selected genes, which questions whether they are exclusively selected for fitting genomes to environmental changes. Our results agree with previous studies suggesting that adaptation requires positive selection, but not every mutation under positive selection contributes to the adaptive dynamical process of the evolution of species.

  4. Genome-Wide Analysis of DNA Methylation in Human Amnion

    PubMed Central

    Kim, Jinsil; Pitlick, Mitchell M.; Christine, Paul J.; Schaefer, Amanda R.; Saleme, Cesar; Comas, Belén; Cosentino, Viviana; Gadow, Enrique; Murray, Jeffrey C.

    2013-01-01

    The amnion is a specialized tissue in contact with the amniotic fluid, which is in a constantly changing state. To investigate the importance of epigenetic events in this tissue in the physiology and pathophysiology of pregnancy, we performed genome-wide DNA methylation profiling of human amnion from term (with and without labor) and preterm deliveries. Using the Illumina Infinium HumanMethylation27 BeadChip, we identified genes exhibiting differential methylation associated with normal labor and preterm birth. Functional analysis of the differentially methylated genes revealed biologically relevant enriched gene sets. Bisulfite sequencing analysis of the promoter region of the oxytocin receptor (OXTR) gene detected two CpG dinucleotides showing significant methylation differences among the three groups of samples. Hypermethylation of the CpG island of the solute carrier family 30 member 3 (SLC30A3) gene in preterm amnion was confirmed by methylation-specific PCR. This work provides preliminary evidence that DNA methylation changes in the amnion may be at least partially involved in the physiological process of labor and the etiology of preterm birth and suggests that DNA methylation profiles, in combination with other biological data, may provide valuable insight into the mechanisms underlying normal and pathological pregnancies. PMID:23533356

  5. Genome-wide effects of selenium and translational uncoupling on transcription in the termite gut symbiont Treponema primitia.

    PubMed

    Matson, Eric G; Rosenthal, Adam Z; Zhang, Xinning; Leadbetter, Jared R

    2013-11-12

    noncanonical amino acid selenocysteine is able to tune transcription of an important metabolic gene via translational coupling. Furthermore, a genome-wide analysis reveals that transcriptional decoupling produces a wide-ranging effect and that this effect is not uniform. These results exemplify how growth conditions that impact translational processivity can rapidly feed back on transcriptional productivity of prespecified groups of genes, providing bacteria with an efficient response to environmental changes.

  6. Genome-Wide Analysis of Human Metapneumovirus Evolution

    PubMed Central

    Kim, Jin Il; Park, Sehee; Lee, Ilseob; Park, Kwang Sook; Kwak, Eun Jung; Moon, Kwang Mee; Lee, Chang Kyu; Bae, Joon-Yong; Park, Man-Seong; Song, Ki-Joon

    2016-01-01

    Human metapneumovirus (HMPV) has been described as an important etiologic agent of upper and lower respiratory tract infections, especially in young children and the elderly. Most of school-aged children might be introduced to HMPVs, and exacerbation with other viral or bacterial super-infection is common. However, our understanding of the molecular evolution of HMPVs remains limited. To address the comprehensive evolutionary dynamics of HMPVs, we report a genome-wide analysis of the eight genes (N, P, M, F, M2, SH, G, and L) using 103 complete genome sequences. Phylogenetic reconstruction revealed that the eight genes from one HMPV strain grouped into the same genetic group among the five distinct lineages (A1, A2a, A2b, B1, and B2). A few exceptions of phylogenetic incongruence might suggest past recombination events, and we detected possible recombination breakpoints in the F, SH, and G coding regions. The five genetic lineages of HMPVs shared quite remote common ancestors ranging more than 220 to 470 years of age with the most recent origins for the A2b sublineage. Purifying selection was common, but most protein genes except the F and M2-2 coding regions also appeared to experience episodic diversifying selection. Taken together, these suggest that the five lineages of HMPVs maintain their individual evolutionary dynamics and that recombination and selection forces might work on shaping the genetic diversity of HMPVs. PMID:27046055

  7. Genome-wide analysis of microRNA and mRNA expression signatures in cancer

    PubMed Central

    Li, Ming-hui; Fu, Sheng-bo; Xiao, Hua-sheng

    2015-01-01

    Cancer is an extremely diverse and complex disease that results from various genetic and epigenetic changes such as DNA copy-number variations, mutations, and aberrant mRNA and/or protein expression caused by abnormal transcriptional regulation. The expression profiles of certain microRNAs (miRNAs) and messenger RNAs (mRNAs) are closely related to cancer progression stages. In the past few decades, DNA microarray and next-generation sequencing techniques have been widely applied to identify miRNA and mRNA signatures for cancers on a genome-wide scale and have provided meaningful insights into cancer diagnosis, prognosis and personalized medicine. In this review, we summarize the progress in genome-wide analysis of miRNAs and mRNAs as cancer biomarkers, highlighting their diagnostic and prognostic roles. PMID:26299954

  8. Genome-wide association analysis identifies six new loci associated with forced vital capacity

    PubMed Central

    Loth, Daan W.; Artigas, María Soler; Gharib, Sina A.; Wain, Louise V.; Franceschini, Nora; Koch, Beate; Pottinger, Tess; Smith, Albert Vernon; Duan, Qing; Oldmeadow, Chris; Lee, Mi Kyeong; Strachan, David P.; James, Alan L.; Huffman, Jennifer E.; Vitart, Veronique; Ramasamy, Adaikalavan; Wareham, Nicholas J.; Kaprio, Jaakko; Wang, Xin-Qun; Trochet, Holly; Kähönen, Mika; Flexeder, Claudia; Albrecht, Eva; Lopez, Lorna M.; de Jong, Kim; Thyagarajan, Bharat; Alves, Alexessander Couto; Enroth, Stefan; Omenaas, Ernst; Joshi, Peter K.; Fall, Tove; Viňuela, Ana; Launer, Lenore J.; Loehr, Laura R.; Fornage, Myriam; Li, Guo; Wilk, Jemma B.; Tang, Wenbo; Manichaikul, Ani; Lahousse, Lies; Harris, Tamara B.; North, Kari E.; Rudnicka, Alicja R.; Hui, Jennie; Gu, Xiangjun; Lumley, Thomas; Wright, Alan F.; Hastie, Nicholas D.; Campbell, Susan; Kumar, Rajesh; Pin, Isabelle; Scott, Robert A.; Pietiläinen, Kirsi H.; Surakka, Ida; Liu, Yongmei; Holliday, Elizabeth G.; Schulz, Holger; Heinrich, Joachim; Davies, Gail; Vonk, Judith M.; Wojczynski, Mary; Pouta, Anneli; Johansson, Åsa; Wild, Sarah H.; Ingelsson, Erik; Rivadeneira, Fernando; Völzke, Henry; Hysi, Pirro G.; Eiriksdottir, Gudny; Morrison, Alanna C.; Rotter, Jerome I.; Gao, Wei; Postma, Dirkje S.; White, Wendy B.; Rich, Stephen S.; Hofman, Albert; Aspelund, Thor; Couper, David; Smith, Lewis J.; Psaty, Bruce M.; Lohman, Kurt; Burchard, Esteban G.; Uitterlinden, André G.; Garcia, Melissa; Joubert, Bonnie R.; McArdle, Wendy L.; Musk, A. Bill; Hansel, Nadia; Heckbert, Susan R.; Zgaga, Lina; van Meurs, Joyce B.J.; Navarro, Pau; Rudan, Igor; Oh, Yeon-Mok; Redline, Susan; Jarvis, Deborah; Zhao, Jing Hua; Rantanen, Taina; O’Connor, George T.; Ripatti, Samuli; Scott, Rodney J.; Karrasch, Stefan; Grallert, Harald; Gaddis, Nathan C.; Starr, John M.; Wijmenga, Cisca; Minster, Ryan L.; Lederer, David J.; Pekkanen, Juha; Gyllensten, Ulf; Campbell, Harry; Morris, Andrew P.; Gläser, Sven; Hammond, Christopher J.; Burkart, Kristin M.; Beilby, John; Kritchevsky, Stephen B.; Gudnason, Vilmundur; Hancock, Dana B.; Williams, O. Dale; Polasek, Ozren; Zemunik, Tatijana; Kolcic, Ivana; Petrini, Marcy F.; Wjst, Matthias; Kim, Woo Jin; Porteous, David J.; Scotland, Generation; Smith, Blair H.; Viljanen, Anne; Heliövaara, Markku; Attia, John R.; Sayers, Ian; Hampel, Regina; Gieger, Christian; Deary, Ian J.; Boezen, H. Marike; Newman, Anne; Jarvelin, Marjo-Riitta; Wilson, James F.; Lind, Lars; Stricker, Bruno H.; Teumer, Alexander; Spector, Timothy D.; Melén, Erik; Peters, Marjolein J.; Lange, Leslie A.; Barr, R. Graham; Bracke, Ken R.; Verhamme, Fien M.; Sung, Joohon; Hiemstra, Pieter S.; Cassano, Patricia A.; Sood, Akshay; Hayward, Caroline; Dupuis, Josée; Hall, Ian P.; Brusselle, Guy G.; Tobin, Martin D.; London, Stephanie J.

    2014-01-01

    Forced vital capacity (FVC), a spirometric measure of pulmonary function, reflects lung volume and is used to diagnose and monitor lung diseases. We performed genome-wide association study meta-analysis of FVC in 52,253 individuals from 26 studies and followed up the top associations in 32,917 additional individuals of European ancestry. We found six new regions associated at genome-wide significance (P < 5 × 10−8) with FVC in or near EFEMP1, BMP6, MIR-129-2/HSD17B12, PRDM11, WWOX, and KCNJ2. Two (GSTCD and PTCH1) loci previously associated with spirometric measures were related to FVC. Newly implicated regions were followed-up in samples of African American, Korean, Chinese, and Hispanic individuals. We detected transcripts for all six newly implicated genes in human lung tissue. The new loci may inform mechanisms involved in lung development and pathogenesis of restrictive lung disease. PMID:24929828

  9. Genome-wide association analysis identifies six new loci associated with forced vital capacity.

    PubMed

    Loth, Daan W; Soler Artigas, María; Gharib, Sina A; Wain, Louise V; Franceschini, Nora; Koch, Beate; Pottinger, Tess D; Smith, Albert Vernon; Duan, Qing; Oldmeadow, Chris; Lee, Mi Kyeong; Strachan, David P; James, Alan L; Huffman, Jennifer E; Vitart, Veronique; Ramasamy, Adaikalavan; Wareham, Nicholas J; Kaprio, Jaakko; Wang, Xin-Qun; Trochet, Holly; Kähönen, Mika; Flexeder, Claudia; Albrecht, Eva; Lopez, Lorna M; de Jong, Kim; Thyagarajan, Bharat; Alves, Alexessander Couto; Enroth, Stefan; Omenaas, Ernst; Joshi, Peter K; Fall, Tove; Viñuela, Ana; Launer, Lenore J; Loehr, Laura R; Fornage, Myriam; Li, Guo; Wilk, Jemma B; Tang, Wenbo; Manichaikul, Ani; Lahousse, Lies; Harris, Tamara B; North, Kari E; Rudnicka, Alicja R; Hui, Jennie; Gu, Xiangjun; Lumley, Thomas; Wright, Alan F; Hastie, Nicholas D; Campbell, Susan; Kumar, Rajesh; Pin, Isabelle; Scott, Robert A; Pietiläinen, Kirsi H; Surakka, Ida; Liu, Yongmei; Holliday, Elizabeth G; Schulz, Holger; Heinrich, Joachim; Davies, Gail; Vonk, Judith M; Wojczynski, Mary; Pouta, Anneli; Johansson, Asa; Wild, Sarah H; Ingelsson, Erik; Rivadeneira, Fernando; Völzke, Henry; Hysi, Pirro G; Eiriksdottir, Gudny; Morrison, Alanna C; Rotter, Jerome I; Gao, Wei; Postma, Dirkje S; White, Wendy B; Rich, Stephen S; Hofman, Albert; Aspelund, Thor; Couper, David; Smith, Lewis J; Psaty, Bruce M; Lohman, Kurt; Burchard, Esteban G; Uitterlinden, André G; Garcia, Melissa; Joubert, Bonnie R; McArdle, Wendy L; Musk, A Bill; Hansel, Nadia; Heckbert, Susan R; Zgaga, Lina; van Meurs, Joyce B J; Navarro, Pau; Rudan, Igor; Oh, Yeon-Mok; Redline, Susan; Jarvis, Deborah L; Zhao, Jing Hua; Rantanen, Taina; O'Connor, George T; Ripatti, Samuli; Scott, Rodney J; Karrasch, Stefan; Grallert, Harald; Gaddis, Nathan C; Starr, John M; Wijmenga, Cisca; Minster, Ryan L; Lederer, David J; Pekkanen, Juha; Gyllensten, Ulf; Campbell, Harry; Morris, Andrew P; Gläser, Sven; Hammond, Christopher J; Burkart, Kristin M; Beilby, John; Kritchevsky, Stephen B; Gudnason, Vilmundur; Hancock, Dana B; Williams, O Dale; Polasek, Ozren; Zemunik, Tatijana; Kolcic, Ivana; Petrini, Marcy F; Wjst, Matthias; Kim, Woo Jin; Porteous, David J; Scotland, Generation; Smith, Blair H; Viljanen, Anne; Heliövaara, Markku; Attia, John R; Sayers, Ian; Hampel, Regina; Gieger, Christian; Deary, Ian J; Boezen, H Marike; Newman, Anne; Jarvelin, Marjo-Riitta; Wilson, James F; Lind, Lars; Stricker, Bruno H; Teumer, Alexander; Spector, Timothy D; Melén, Erik; Peters, Marjolein J; Lange, Leslie A; Barr, R Graham; Bracke, Ken R; Verhamme, Fien M; Sung, Joohon; Hiemstra, Pieter S; Cassano, Patricia A; Sood, Akshay; Hayward, Caroline; Dupuis, Josée; Hall, Ian P; Brusselle, Guy G; Tobin, Martin D; London, Stephanie J

    2014-07-01

    Forced vital capacity (FVC), a spirometric measure of pulmonary function, reflects lung volume and is used to diagnose and monitor lung diseases. We performed genome-wide association study meta-analysis of FVC in 52,253 individuals from 26 studies and followed up the top associations in 32,917 additional individuals of European ancestry. We found six new regions associated at genome-wide significance (P < 5 × 10(-8)) with FVC in or near EFEMP1, BMP6, MIR129-2-HSD17B12, PRDM11, WWOX and KCNJ2. Two loci previously associated with spirometric measures (GSTCD and PTCH1) were related to FVC. Newly implicated regions were followed up in samples from African-American, Korean, Chinese and Hispanic individuals. We detected transcripts for all six newly implicated genes in human lung tissue. The new loci may inform mechanisms involved in lung development and the pathogenesis of restrictive lung disease.

  10. Genome-wide analysis of intraspecific transposon diversity in yeast

    PubMed Central

    2013-01-01

    Background In the model organism Saccharomyces cerevisiae, the transposable elements (TEs) consist of LTR (Long Terminal Repeat) retrotransposons called Ty elements belonging to five families, Ty1 to Ty5. They take the form of either full-length coding elements or non-coding solo-LTRs corresponding to remnants of former transposition events. Although the biological features of Ty elements have been studied in detail in S. cerevisiae and the Ty content of the reference strain (S288c) was accurately annotated, the Ty-related intra-specific diversity has not been closely investigated so far. Results In this study, we investigated the Ty contents of 41 available genomes of isolated S. cerevisiae strains of diverse geographical and ecological origins. The strains were compared in terms of the number of Ty copies, the content of the potential transpositionally active elements and the genomic insertion maps. The strain repertoires were also investigated in the closely related Ty1 and Ty2 families and subfamilies. Conclusions This is the first genome-wide analysis of the diversity associated to the Ty elements, carried out for a large set of S. cerevisiae strains. The results of the present analyses suggest that the current Ty-related polymorphism has resulted from multiple causes such as differences between strains, between Ty families and over time, in the recent transpositional activity of Ty elements. Some new Ty1 variants were also identified, and we have established that Ty1 variants have different patterns of distribution among strains, which further contributes to the strain diversity. PMID:23768249

  11. Recruitment of Saccharomyces cerevisiae Cmr1/Ydl156w to Coding Regions Promotes Transcription Genome Wide

    PubMed Central

    Govind, Chhabi K.

    2016-01-01

    Cmr1 (changed mutation rate 1) is a largely uncharacterized nuclear protein that has recently emerged in several global genetic interaction and protein localization studies. It clusters with proteins involved in DNA damage and replication stress response, suggesting a role in maintaining genome integrity. Under conditions of proteasome inhibition or replication stress, this protein localizes to distinct sub-nuclear foci termed as intranuclear quality control (INQ) compartments, which sequester proteins for their subsequent degradation. Interestingly, it also interacts with histones, chromatin remodelers and modifiers, as well as with proteins involved in transcription including subunits of RNA Pol I and Pol III, but not with those of Pol II. It is not known whether Cmr1 plays a role in regulating transcription of Pol II target genes. Here, we show that Cmr1 is recruited to the coding regions of transcribed genes of S. cerevisiae. Cmr1 occupancy correlates with the Pol II occupancy genome-wide, indicating that it is recruited to coding sequences in a transcription-dependent manner. Cmr1-enriched genes include Gcn4 targets and ribosomal protein genes. Furthermore, our results show that Cmr1 recruitment to coding sequences is stimulated by Pol II CTD kinase, Kin28, and the histone deacetylases, Rpd3 and Hos2. Finally, our genome-wide analyses implicate Cmr1 in regulating Pol II occupancy at transcribed coding sequences. However, it is dispensable for maintaining co-transcriptional histone occupancy and histone modification (acetylation and methylation). Collectively, our results show that Cmr1 facilitates transcription by directly engaging with transcribed coding regions. PMID:26848854

  12. Recruitment of Saccharomyces cerevisiae Cmr1/Ydl156w to Coding Regions Promotes Transcription Genome Wide.

    PubMed

    Jones, Jeffery W; Singh, Priyanka; Govind, Chhabi K

    2016-01-01

    Cmr1 (changed mutation rate 1) is a largely uncharacterized nuclear protein that has recently emerged in several global genetic interaction and protein localization studies. It clusters with proteins involved in DNA damage and replication stress response, suggesting a role in maintaining genome integrity. Under conditions of proteasome inhibition or replication stress, this protein localizes to distinct sub-nuclear foci termed as intranuclear quality control (INQ) compartments, which sequester proteins for their subsequent degradation. Interestingly, it also interacts with histones, chromatin remodelers and modifiers, as well as with proteins involved in transcription including subunits of RNA Pol I and Pol III, but not with those of Pol II. It is not known whether Cmr1 plays a role in regulating transcription of Pol II target genes. Here, we show that Cmr1 is recruited to the coding regions of transcribed genes of S. cerevisiae. Cmr1 occupancy correlates with the Pol II occupancy genome-wide, indicating that it is recruited to coding sequences in a transcription-dependent manner. Cmr1-enriched genes include Gcn4 targets and ribosomal protein genes. Furthermore, our results show that Cmr1 recruitment to coding sequences is stimulated by Pol II CTD kinase, Kin28, and the histone deacetylases, Rpd3 and Hos2. Finally, our genome-wide analyses implicate Cmr1 in regulating Pol II occupancy at transcribed coding sequences. However, it is dispensable for maintaining co-transcriptional histone occupancy and histone modification (acetylation and methylation). Collectively, our results show that Cmr1 facilitates transcription by directly engaging with transcribed coding regions.

  13. Diversity of Eukaryotic DNA Replication Origins Revealed by Genome-Wide Analysis of Chromatin Structure

    PubMed Central

    Berbenetz, Nicolas M.; Nislow, Corey; Brown, Grant W.

    2010-01-01

    Eukaryotic DNA replication origins differ both in their efficiency and in the characteristic time during S phase when they become active. The biological basis for these differences remains unknown, but they could be a consequence of chromatin structure. The availability of genome-wide maps of nucleosome positions has led to an explosion of information about how nucleosomes are assembled at transcription start sites, but no similar maps exist for DNA replication origins. Here we combine high-resolution genome-wide nucleosome maps with comprehensive annotations of DNA replication origins to identify patterns of nucleosome occupancy at eukaryotic replication origins. On average, replication origins contain a nucleosome depleted region centered next to the ACS element, flanked on both sides by arrays of well-positioned nucleosomes. Our analysis identified DNA sequence properties that correlate with nucleosome occupancy at replication origins genome-wide and that are correlated with the nucleosome-depleted region. Clustering analysis of all annotated replication origins revealed a surprising diversity of nucleosome occupancy patterns. We provide evidence that the origin recognition complex, which binds to the origin, acts as a barrier element to position and phase nucleosomes on both sides of the origin. Finally, analysis of chromatin reconstituted in vitro reveals that origins are inherently nucleosome depleted. Together our data provide a comprehensive, genome-wide view of chromatin structure at replication origins and suggest a model of nucleosome positioning at replication origins in which the underlying sequence occludes nucleosomes to permit binding of the origin recognition complex, which then (likely in concert with nucleosome modifiers and remodelers) positions nucleosomes adjacent to the origin to promote replication origin function. PMID:20824081

  14. Genome-wide bisulfite sensitivity profiling of yeast suggests bisulfite inhibits transcription.

    PubMed

    Segovia, Romulo; Mathew, Veena; Tam, Annie S; Stirling, Peter C

    2017-09-01

    Bisulfite, in the form of sodium bisulfite or metabisulfite, is used commercially as a food preservative. Bisulfite is used in the laboratory as a single-stranded DNA mutagen in epigenomic analyses of DNA methylation. Recently it has also been used on whole yeast cells to induce mutations in exposed single-stranded regions in vivo. To understand the effects of bisulfite on live cells we conducted a genome-wide screen for bisulfite sensitive mutants in yeast. Screening the deletion mutant array, and collections of essential gene mutants we define a genetic network of bisulfite sensitive mutants. Validation of screen hits revealed hyper-sensitivity of transcription and RNA processing mutants, rather than DNA repair pathways and follow-up analyses support a role in perturbation of RNA transactions. We propose a model in which bisulfite-modified nucleotides may interfere with transcription or RNA metabolism when used in vivo. Copyright © 2017 Elsevier B.V. All rights reserved.

  15. Cooperative Genome-Wide Analysis Shows Increased Homozygosity in Early Onset Parkinson's Disease

    PubMed Central

    Nalls, Michael A.; Martinez, Maria; Schulte, Claudia; Holmans, Peter; Gasser, Thomas; Hardy, John; Singleton, Andrew B.; Wood, Nicholas W.; Brice, Alexis; Heutink, Peter; Williams, Nigel; Morris, Huw R.

    2012-01-01

    Parkinson's disease (PD) occurs in both familial and sporadic forms, and both monogenic and complex genetic factors have been identified. Early onset PD (EOPD) is particularly associated with autosomal recessive (AR) mutations, and three genes, PARK2, PARK7 and PINK1, have been found to carry mutations leading to AR disease. Since mutations in these genes account for less than 10% of EOPD patients, we hypothesized that further recessive genetic factors are involved in this disorder, which may appear in extended runs of homozygosity. We carried out genome wide SNP genotyping to look for extended runs of homozygosity (ROHs) in 1,445 EOPD cases and 6,987 controls. Logistic regression analyses showed an increased level of genomic homozygosity in EOPD cases compared to controls. These differences are larger for ROH of 9 Mb and above, where there is a more than three-fold increase in the proportion of cases carrying a ROH. These differences are not explained by occult recessive mutations at existing loci. Controlling for genome wide homozygosity in logistic regression analyses increased the differences between cases and controls, indicating that in EOPD cases ROHs do not simply relate to genome wide measures of inbreeding. Homozygosity at a locus on chromosome19p13.3 was identified as being more common in EOPD cases as compared to controls. Sequencing analysis of genes and predicted transcripts within this locus failed to identify a novel mutation causing EOPD in our cohort. There is an increased rate of genome wide homozygosity in EOPD, as measured by an increase in ROHs. These ROHs are a signature of inbreeding and do not necessarily harbour disease-causing genetic variants. Although there might be other regions of interest apart from chromosome 19p13.3, we lack the power to detect them with this analysis. PMID:22427796

  16. Pervasive, Genome-Wide Transcription in the Organelle Genomes of Diverse Plastid-Bearing Protists.

    PubMed

    Sanitá Lima, Matheus; Smith, David Roy

    2017-09-21

    Organelle genomes are among the most sequenced kinds of chromosome. This is largely because they are small and widely used in molecular studies, but also because next-generation sequencing (NGS) technologies made sequencing easier, faster, and cheaper. However, studies of organelle RNA have not kept pace with those of DNA, despite huge amounts of freely available eukaryotic RNA-sequencing (RNA-seq) data. Little is known about organelle transcription in non-model species, and most of the available eukaryotic RNA-seq data have not been mined for organelle transcripts. Here, we use publicly available RNA-seq experiments to investigate organelle transcription in 30 diverse plastid-bearing protists with varying organelle genomic architectures. Mapping RNA-seq data to organelle genomes revealed pervasive, genome-wide transcription, regardless of the taxonomic grouping, gene organization, or non-coding content. For every species analyzed, transcripts covered at least 85% of the mitochondrial and/or plastid genomes (all of which were ≤ 105 kb), indicating that most of the organelle DNA-coding and non-coding-is transcriptionally active. These results follow earlier studies of model species showing that organellar transcription is coupled and ubiquitous across the genome, requiring significant downstream processing of polycistronic transcripts. Our findings suggest that non-coding organelle DNA can be transcriptionally active, raising questions about the underlying function of these transcripts and underscoring the utility of publicly available RNA-seq data for recovering complete genome sequences. If pervasive transcription is also found in bigger organelle genomes (>105 kb) across a broader range of eukaryotes, this could indicate that non-coding organelle RNAs are regulating fundamental processes within eukaryotic cells. Copyright © 2017, G3: Genes, Genomes, Genetics.

  17. Acute genome-wide effects of rosiglitazone on PPARγ transcriptional networks in adipocytes.

    PubMed

    Haakonsson, Anders Kristian; Stahl Madsen, Maria; Nielsen, Ronni; Sandelin, Albin; Mandrup, Susanne

    2013-09-01

    Peroxisome proliferator-activated receptor γ (PPARγ) is a master regulator of adipocyte differentiation, and genome-wide studies indicate that it is involved in the induction of most adipocyte genes. Here we report, for the first time, the acute effects of the synthetic PPARγ agonist rosiglitazone on the transcriptional network of PPARγ in adipocytes. Treatment with rosiglitazone for 1 hour leads to acute transcriptional activation as well as repression of a number of genes as determined by genome-wide RNA polymerase II occupancy. Unlike what has been shown for many other nuclear receptors, agonist treatment does not lead to major changes in the occurrence of PPARγ binding sites. However, rosiglitazone promotes PPARγ occupancy at many preexisting sites, and this is paralleled by increased occupancy of the mediator subunit MED1. The increase in PPARγ and MED1 binding is correlated with an increase in transcription of nearby genes, indicating that rosiglitazone, in addition to activating the receptor, also promotes its association with DNA, and that this is causally linked to recruitment of mediator and activation of genes. Notably, both rosiglitazone-activated and -repressed genes are induced during adipogenesis. However, rosiglitazone-activated genes are markedly more associated with PPARγ than repressed genes and are highly dependent on PPARγ for expression in adipocytes. By contrast, repressed genes are associated with the other key adipocyte transcription factor CCAAT-enhancer binding proteinα (C/EBPα), and their expression is more dependent on C/EBPα. This suggests that the relative occupancies of PPARγ and C/EBPα are critical for whether genes will be induced or repressed by PPARγ agonist.

  18. Dynamics of oscillatory phenotypes in S. cerevisiae reveal a network of genome-wide transcriptional oscillators

    PubMed Central

    Chin, Shwe L.; Marcus, Ian M.; Klevecz, Robert R.; Li, Caroline M.

    2012-01-01

    Genetic and environmental factors are well-studied influences on phenotype; however, time is a variable that is rarely considered when studying changes in cellular phenotype. Time-resolved microarray data revealed genome-wide transcriptional oscillation in a yeast continuous culture system with ~2 and ~4 h periods. We mapped the global patterns of transcriptional oscillations into a 3D map to represent different cellular phenotypes of redox cycles. This map shows the dynamic nature of gene expression in that transcripts are ordered and coupled to each other through time and concentration space. Although cells differed in oscillation periods, transcripts involved in certain processes were conserved in a deterministic way. When oscillation period lengthened, the peak to trough ratio of transcripts increased and the fraction of cells in the unbudded (G0/G1) phase of the cell division cycle increased. Decreasing the glucose level in the culture media was one way to increase the redox cycle, possibly from changes in metabolic flux. The period may be responding to lower glucose levels by increasing the fraction of cells in G1 and reducing S-phase gating so that cells can spend more time in catabolic processes. Our results support that gene transcripts are coordinated with metabolic functions and the cell division cycle. PMID:22289124

  19. Genome-Wide Transcriptional Response of the Archaeon Thermococcus gammatolerans to Cadmium

    PubMed Central

    Lagorce, Arnaud; Fourçans, Aude; Dutertre, Murielle; Bouyssiere, Brice; Zivanovic, Yvan; Confalonieri, Fabrice

    2012-01-01

    Thermococcus gammatolerans, the most radioresistant archaeon known to date, is an anaerobic and hyperthermophilic sulfur-reducing organism living in deep-sea hydrothermal vents. Knowledge of mechanisms underlying archaeal metal tolerance in such metal-rich ecosystem is still poorly documented. We showed that T. gammatolerans exhibits high resistance to cadmium (Cd), cobalt (Co) and zinc (Zn), a weaker tolerance to nickel (Ni), copper (Cu) and arsenate (AsO4) and that cells exposed to 1 mM Cd exhibit a cellular Cd concentration of 67 µM. A time-dependent transcriptomic analysis using microarrays was performed at a non-toxic (100 µM) and a toxic (1 mM) Cd dose. The reliability of microarray data was strengthened by real time RT-PCR validations. Altogether, 114 Cd responsive genes were revealed and a substantial subset of genes is related to metal homeostasis, drug detoxification, re-oxidization of cofactors and ATP production. This first genome-wide expression profiling study of archaeal cells challenged with Cd showed that T. gammatolerans withstands induced stress through pathways observed in both prokaryotes and eukaryotes but also through new and original strategies. T. gammatolerans cells challenged with 1 mM Cd basically promote: 1) the induction of several transporter/permease encoding genes, probably to detoxify the cell; 2) the upregulation of Fe transporters encoding genes to likely compensate Cd damages in iron-containing proteins; 3) the induction of membrane-bound hydrogenase (Mbh) and membrane-bound hydrogenlyase (Mhy2) subunits encoding genes involved in recycling reduced cofactors and/or in proton translocation for energy production. By contrast to other organisms, redox homeostasis genes appear constitutively expressed and only a few genes encoding DNA repair proteins are regulated. We compared the expression of 27 Cd responsive genes in other stress conditions (Zn, Ni, heat shock, γ-rays), and showed that the Cd transcriptional pattern is

  20. Genome-wide transcriptional responses to a lipid hydroperoxide: adaptation occurs without induction of oxidant defenses.

    PubMed

    Alic, Nazif; Felder, Thomas; Temple, Mark D; Gloeckner, Christian; Higgins, Vincent J; Briza, Peter; Dawes, Ian W

    2004-07-01

    Free radicals can initiate the oxidation of polyunsaturated fatty acids in cells through the process of lipid peroxidation. The genome-wide transcriptional changes in Saccharomyces cerevisiae after treatment with the toxic lipid peroxidation product linoleic acid hydroperoxide (LoaOOH) were identified. High-dose treatment led to a switch in transcription from biosynthetic to protective functions. This response encompassed a set of genes stimulated predominantly by LoaOOH, and not by other oxidants or heat shock, which contained components of the pleiotropic drug resistance system. The dose dependence of the transcriptional response revealed that large and widespread changes occur only in response to higher doses. Pretreatment of cells with sublethal doses of LoaOOH induces resistance to an otherwise lethal dose through the process of adaptation. Adaptive doses elicited a more subtle transcriptional response affecting metabolic functions, including an increase in the capacity for detoxification and downregulation of the rate of protein synthesis. Surprisingly, the cellular response to adaptive doses did not include induction of oxidative-stress defense enzymes nor of transcripts involved in general cellular defense systems.

  1. Genome-Wide Chromatin Immunoprecipitation Sequencing Analysis Shows that WhiB Is a Transcription Factor That Cocontrols Its Regulon with WhiA To Initiate Developmental Cell Division in Streptomyces.

    PubMed

    Bush, Matthew J; Chandra, Govind; Bibb, Maureen J; Findlay, Kim C; Buttner, Mark J

    2016-04-19

    WhiB is the founding member of a family of proteins (the WhiB-like [Wbl] family) that carry a [4Fe-4S] iron-sulfur cluster and play key roles in diverse aspects of the biology of actinomycetes, including pathogenesis, antibiotic resistance, and the control of development. In Streptomyces, WhiB is essential for the process of developmentally controlled cell division that leads to sporulation. The biochemical function of Wbl proteins has been controversial; here, we set out to determine unambiguously if WhiB functions as a transcription factor using chromatin immunoprecipitation sequencing (ChIP-seq) in Streptomyces venezuelae. In the first demonstration of in vivo genome-wide Wbl binding, we showed that WhiB regulates the expression of key genes required for sporulation by binding upstream of ~240 transcription units. Strikingly, the WhiB regulon is identical to the previously characterized WhiA regulon, providing an explanation for the identical phenotypes of whiA and whiB mutants. Using ChIP-seq, we demonstrated that in vivo DNA binding by WhiA depends on WhiB and vice versa, showing that WhiA and WhiB function cooperatively to control expression of a common set of WhiAB target genes. Finally, we show that mutation of the cysteine residues that coordinate the [4Fe-4S] cluster in WhiB prevents DNA binding by both WhiB and WhiA in vivo. Despite the central importance of WhiB-like (Wbl) proteins in actinomycete biology, a conclusive demonstration of their biochemical function has been elusive, and they have been difficult to study, particularly in vitro, largely because they carry an oxygen-sensitive [4Fe-4S] cluster. Here we used genome-wide ChIP-seq to investigate the function of Streptomyces WhiB, the founding member of the Wbl family. The advantage of this approach is that the oxygen sensitivity of the [4Fe-4S] cluster becomes irrelevant once the protein has been cross-linked to DNA in vivo. Our data provide the most compelling in vivo evidence to date that Whi

  2. Genome-wide analysis of differential RNA editing in epilepsy.

    PubMed

    Srivastava, Prashant Kumar; Bagnati, Marta; Delahaye-Duriez, Andree; Ko, Jeong-Hun; Rotival, Maxime; Langley, Sarah R; Shkura, Kirill; Mazzuferi, Manuela; Danis, Bénédicte; van Eyll, Jonathan; Foerch, Patrik; Behmoaras, Jacques; Kaminski, Rafal M; Petretto, Enrico; Johnson, Michael R

    2017-03-01

    The recoding of genetic information through RNA editing contributes to proteomic diversity, but the extent and significance of RNA editing in disease is poorly understood. In particular, few studies have investigated the relationship between RNA editing and disease at a genome-wide level. Here, we developed a framework for the genome-wide detection of RNA sites that are differentially edited in disease. Using RNA-sequencing data from 100 hippocampi from mice with epilepsy (pilocarpine-temporal lobe epilepsy model) and 100 healthy control hippocampi, we identified 256 RNA sites (overlapping with 87 genes) that were significantly differentially edited between epileptic cases and controls. The degree of differential RNA editing in epileptic mice correlated with frequency of seizures, and the set of genes differentially RNA-edited between case and control mice were enriched for functional terms highly relevant to epilepsy, including "neuron projection" and "seizures." Genes with differential RNA editing were preferentially enriched for genes with a genetic association to epilepsy. Indeed, we found that they are significantly enriched for genes that harbor nonsynonymous de novo mutations in patients with epileptic encephalopathy and for common susceptibility variants associated with generalized epilepsy. These analyses reveal a functional convergence between genes that are differentially RNA-edited in acquired symptomatic epilepsy and those that contribute risk for genetic epilepsy. Taken together, our results suggest a potential role for RNA editing in the epileptic hippocampus in the occurrence and severity of epileptic seizures.

  3. Genome-wide analysis of differential RNA editing in epilepsy

    PubMed Central

    Srivastava, Prashant Kumar; Bagnati, Marta; Delahaye-Duriez, Andree; Ko, Jeong-Hun; Rotival, Maxime; Langley, Sarah R.; Shkura, Kirill; Mazzuferi, Manuela; Danis, Bénédicte; van Eyll, Jonathan; Foerch, Patrik; Behmoaras, Jacques; Kaminski, Rafal M.; Petretto, Enrico; Johnson, Michael R.

    2017-01-01

    The recoding of genetic information through RNA editing contributes to proteomic diversity, but the extent and significance of RNA editing in disease is poorly understood. In particular, few studies have investigated the relationship between RNA editing and disease at a genome-wide level. Here, we developed a framework for the genome-wide detection of RNA sites that are differentially edited in disease. Using RNA-sequencing data from 100 hippocampi from mice with epilepsy (pilocarpine–temporal lobe epilepsy model) and 100 healthy control hippocampi, we identified 256 RNA sites (overlapping with 87 genes) that were significantly differentially edited between epileptic cases and controls. The degree of differential RNA editing in epileptic mice correlated with frequency of seizures, and the set of genes differentially RNA-edited between case and control mice were enriched for functional terms highly relevant to epilepsy, including “neuron projection” and “seizures.” Genes with differential RNA editing were preferentially enriched for genes with a genetic association to epilepsy. Indeed, we found that they are significantly enriched for genes that harbor nonsynonymous de novo mutations in patients with epileptic encephalopathy and for common susceptibility variants associated with generalized epilepsy. These analyses reveal a functional convergence between genes that are differentially RNA-edited in acquired symptomatic epilepsy and those that contribute risk for genetic epilepsy. Taken together, our results suggest a potential role for RNA editing in the epileptic hippocampus in the occurrence and severity of epileptic seizures. PMID:28250018

  4. Genome-wide transcriptional responses of Alteromonas naphthalenivorans SN2 to contaminated seawater and marine tidal flat sediment

    PubMed Central

    Jin, Hyun Mi; Jeong, Hye Im; Kim, Kyung Hyun; Hahn, Yoonsoo; Madsen, Eugene L.; Jeon, Che Ok

    2016-01-01

    A genome-wide transcriptional analysis of Alteromonas naphthalenivorans SN2 was performed to investigate its ecophysiological behavior in contaminated tidal flats and seawater. The experimental design mimicked these habitats that either added naphthalene or pyruvate; tidal flat-naphthalene (TF-N), tidal flat-pyruvate (TF-P), seawater-naphthalene (SW-N), and seawater-pyruvate (SW-P). The transcriptional profiles clustered by habitat (TF-N/TF-P and SW-N/SW-P), rather than carbon source, suggesting that the former may exert a greater influence on genome-wide expression in strain SN2 than the latter. Metabolic mapping of cDNA reads from strain SN2 based on KEGG pathway showed that metabolic and regulatory genes associated with energy metabolism, translation, and cell motility were highly expressed in all four test conditions, probably highlighting the copiotrophic properties of strain SN2 as an opportunistic marine r-strategist. Differential gene expression analysis revealed that strain SN2 displayed specific cellular responses to environmental variables (tidal flat, seawater, naphthalene, and pyruvate) and exhibited certain ecological fitness traits –– its notable PAH degradation capability in seasonally cold tidal flat might be reflected in elevated expression of stress response and chaperone proteins, while fast growth in nitrogen-deficient and aerobic seawater probably correlated with high expression of glutamine synthetase, enzymes utilizing nitrite/nitrate, and those involved in the removal of reactive oxygen species. PMID:26887987

  5. Genome-wide analysis of homeobox gene family in legumes: identification, gene duplication and expression profiling.

    PubMed

    Bhattacharjee, Annapurna; Ghangal, Rajesh; Garg, Rohini; Jain, Mukesh

    2015-01-01

    Homeobox genes encode transcription factors that are known to play a major role in different aspects of plant growth and development. In the present study, we identified homeobox genes belonging to 14 different classes in five legume species, including chickpea, soybean, Medicago, Lotus and pigeonpea. The characteristic differences within homeodomain sequences among various classes of homeobox gene family were quite evident. Genome-wide expression analysis using publicly available datasets (RNA-seq and microarray) indicated that homeobox genes are differentially expressed in various tissues/developmental stages and under stress conditions in different legumes. We validated the differential expression of selected chickpea homeobox genes via quantitative reverse transcription polymerase chain reaction. Genome duplication analysis in soybean indicated that segmental duplication has significantly contributed in the expansion of homeobox gene family. The Ka/Ks ratio of duplicated homeobox genes in soybean showed that several members of this family have undergone purifying selection. Moreover, expression profiling indicated that duplicated genes might have been retained due to sub-functionalization. The genome-wide identification and comprehensive gene expression profiling of homeobox gene family members in legumes will provide opportunities for functional analysis to unravel their exact role in plant growth and development.

  6. Genome-Wide Analysis of Homeobox Gene Family in Legumes: Identification, Gene Duplication and Expression Profiling

    PubMed Central

    Garg, Rohini; Jain, Mukesh

    2015-01-01

    Homeobox genes encode transcription factors that are known to play a major role in different aspects of plant growth and development. In the present study, we identified homeobox genes belonging to 14 different classes in five legume species, including chickpea, soybean, Medicago, Lotus and pigeonpea. The characteristic differences within homeodomain sequences among various classes of homeobox gene family were quite evident. Genome-wide expression analysis using publicly available datasets (RNA-seq and microarray) indicated that homeobox genes are differentially expressed in various tissues/developmental stages and under stress conditions in different legumes. We validated the differential expression of selected chickpea homeobox genes via quantitative reverse transcription polymerase chain reaction. Genome duplication analysis in soybean indicated that segmental duplication has significantly contributed in the expansion of homeobox gene family. The Ka/Ks ratio of duplicated homeobox genes in soybean showed that several members of this family have undergone purifying selection. Moreover, expression profiling indicated that duplicated genes might have been retained due to sub-functionalization. The genome-wide identification and comprehensive gene expression profiling of homeobox gene family members in legumes will provide opportunities for functional analysis to unravel their exact role in plant growth and development. PMID:25745864

  7. Drosophila Genome-Wide RNAi Screen Identifies Multiple Regulators of HIF–Dependent Transcription in Hypoxia

    PubMed Central

    Dekanty, Andrés; Romero, Nuria M.; Bertolin, Agustina P.; Thomas, María G.; Leishman, Claudia C.; Perez-Perri, Joel I.; Boccaccio, Graciela L.; Wappner, Pablo

    2010-01-01

    Hypoxia-inducible factors (HIFs) are a family of evolutionary conserved alpha-beta heterodimeric transcription factors that induce a wide range of genes in response to low oxygen tension. Molecular mechanisms that mediate oxygen-dependent HIF regulation operate at the level of the alpha subunit, controlling protein stability, subcellular localization, and transcriptional coactivator recruitment. We have conducted an unbiased genome-wide RNA interference (RNAi) screen in Drosophila cells aimed to the identification of genes required for HIF activity. After 3 rounds of selection, 30 genes emerged as critical HIF regulators in hypoxia, most of which had not been previously associated with HIF biology. The list of genes includes components of chromatin remodeling complexes, transcription elongation factors, and translational regulators. One remarkable hit was the argonaute 1 (ago1) gene, a central element of the microRNA (miRNA) translational silencing machinery. Further studies confirmed the physiological role of the miRNA machinery in HIF–dependent transcription. This study reveals the occurrence of novel mechanisms of HIF regulation, which might contribute to developing novel strategies for therapeutic intervention of HIF–related pathologies, including heart attack, cancer, and stroke. PMID:20585616

  8. Meta-Analysis of Genome-Wide Association Studies of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Neale, Benjamin M.; Medland, Sarah E.; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V.; Nguyen, Thuy Trang; Schafer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J. L.; Langely, Kate; O'Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P.; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D.; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan

    2010-01-01

    Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association studies (GWAS) have not yielded significant results, we conducted a meta-analysis of…

  9. Meta-Analysis of Genome-Wide Association Studies of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Neale, Benjamin M.; Medland, Sarah E.; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V.; Nguyen, Thuy Trang; Schafer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J. L.; Langely, Kate; O'Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P.; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D.; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan

    2010-01-01

    Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association studies (GWAS) have not yielded significant results, we conducted a meta-analysis of…

  10. Genome-wide modeling of transcription kinetics reveals patterns of RNA production delays.

    PubMed

    Honkela, Antti; Peltonen, Jaakko; Topa, Hande; Charapitsa, Iryna; Matarese, Filomena; Grote, Korbinian; Stunnenberg, Hendrik G; Reid, George; Lawrence, Neil D; Rattray, Magnus

    2015-10-20

    Genes with similar transcriptional activation kinetics can display very different temporal mRNA profiles because of differences in transcription time, degradation rate, and RNA-processing kinetics. Recent studies have shown that a splicing-associated RNA production delay can be significant. To investigate this issue more generally, it is useful to develop methods applicable to genome-wide datasets. We introduce a joint model of transcriptional activation and mRNA accumulation that can be used for inference of transcription rate, RNA production delay, and degradation rate given data from high-throughput sequencing time course experiments. We combine a mechanistic differential equation model with a nonparametric statistical modeling approach allowing us to capture a broad range of activation kinetics, and we use Bayesian parameter estimation to quantify the uncertainty in estimates of the kinetic parameters. We apply the model to data from estrogen receptor α activation in the MCF-7 breast cancer cell line. We use RNA polymerase II ChIP-Seq time course data to characterize transcriptional activation and mRNA-Seq time course data to quantify mature transcripts. We find that 11% of genes with a good signal in the data display a delay of more than 20 min between completing transcription and mature mRNA production. The genes displaying these long delays are significantly more likely to be short. We also find a statistical association between high delay and late intron retention in pre-mRNA data, indicating significant splicing-associated production delays in many genes.

  11. Genome-Wide Analysis of Acute Endurance Exercise-Induced Translational Regulation in Mouse Skeletal Muscle

    PubMed Central

    Sako, Hiroaki; Yada, Koichi; Suzuki, Katsuhiko

    2016-01-01

    Exercise dynamically changes skeletal muscle protein synthesis to respond and adapt to the external and internal stimuli. Many studies have focused on overall protein synthesis to understand how exercise regulates the muscular adaptation. However, despite the probability that each gene transcript may have its own unique translational characteristics and would be differentially regulated at translational level, little attention has been paid to how exercise affects translational regulation of individual genes at a genome-wide scale. Here, we conducted a genome-wide translational analysis using ribosome profiling to investigate the effect of a single bout of treadmill running (20 m/min for 60 min) on mouse gastrocnemius. Global translational profiles largely differed from those in transcription even at a basal resting condition as well as immediately after exercise. As for individual gene, Slc25a25 (Solute carrier family 25, member 25), localized in mitochondrial inner membrane and maintaining ATP homeostasis and endurance performance, showed significant up-regulation at translational level. However, multiple regression analysis suggests that Slc25a25 protein degradation may also have a role in mediating Slc25a25 protein abundance in the basal and early stages after acute endurance exercise. PMID:26845575

  12. A high definition look at the NF-Y regulome reveals genome-wide associations with selected transcription factors

    PubMed Central

    Dolfini, Diletta; Zambelli, Federico; Pedrazzoli, Maurizio; Mantovani, Roberto; Pavesi, Giulio

    2016-01-01

    NF-Y is a trimeric transcription factor (TF), binding the CCAAT box element, for which several results suggest a pioneering role in activation of transcription. In this work, we integrated 380 ENCODE ChIP-Seq experiments for 154 TFs and cofactors with sequence analysis, protein–protein interactions and RNA profiling data, in order to identify genome-wide regulatory modules resulting from the co-association of NF-Y with other TFs. We identified three main degrees of co-association with NF-Y for sequence-specific TFs. In the most relevant one, we found TFs having a significant overlap with NF-Y in their DNA binding loci, some with a precise spacing of binding sites with respect to the CCAAT box, others (FOS, Sp1/2, RFX5, IRF3, PBX3) mostly lacking their canonical binding site and bound to arrays of well spaced CCAAT boxes. As expected, NF-Y binding also correlates with RNA Pol II General TFs and with subunits of complexes involved in the control of H3K4 methylations. Co-association patterns are confirmed by protein–protein interactions, and correspond to specific functional categorizations and expression level changes of target genes following NF-Y inactivation. These data define genome-wide rules for the organization of NF-Y-centered regulatory modules, supporting a model of distinct categorization and synergy with well defined sets of TFs. PMID:26896797

  13. Genome-Wide Identification of the Target Genes of AP2-O, a Plasmodium AP2-Family Transcription Factor

    PubMed Central

    Kaneko, Izumi; Iwanaga, Shiroh; Kato, Tomomi; Kobayashi, Issei; Yuda, Masao

    2015-01-01

    Stage-specific transcription is a fundamental biological process in the life cycle of the Plasmodium parasite. Proteins containing the AP2 DNA-binding domain are responsible for stage-specific transcriptional regulation and belong to the only known family of transcription factors in Plasmodium parasites. Comprehensive identification of their target genes will advance our understanding of the molecular basis of stage-specific transcriptional regulation and stage-specific parasite development. AP2-O is an AP2 family transcription factor that is expressed in the mosquito midgut-invading stage, called the ookinete, and is essential for normal morphogenesis of this stage. In this study, we identified the genome-wide target genes of AP2-O by chromatin immunoprecipitation-sequencing and elucidate how this AP2 family transcription factor contributes to the formation of this motile stage. The analysis revealed that AP2-O binds specifically to the upstream genomic regions of more than 500 genes, suggesting that approximately 10% of the parasite genome is directly regulated by AP2-O. These genes are involved in distinct biological processes such as morphogenesis, locomotion, midgut penetration, protection against mosquito immunity and preparation for subsequent oocyst development. This direct and global regulation by AP2-O provides a model for gene regulation in Plasmodium parasites and may explain how these parasites manage to control their complex life cycle using a small number of sequence-specific AP2 transcription factors. PMID:26018192

  14. Genome-wide analysis of promoter architecture in Drosophila melanogaster

    SciTech Connect

    Hoskins, Roger A.; Landolin, Jane M.; Brown, James B.; Sandler, Jeremy E.; Takahashi, Hazuki; Lassmann, Timo; Yu, Charles; Booth, Benjamin W.; Zhang, Dayu; Wan, Kenneth H.; Yang, Li; Boley, Nathan; Andrews, Justen; Kaufman, Thomas C.; Graveley, Brenton R.; Bickel, Peter J.; Carninci, Piero; Carlson, Joseph W.; Celniker, Susan E.

    2010-10-20

    Core promoters are critical regions for gene regulation in higher eukaryotes. However, the boundaries of promoter regions, the relative rates of initiation at the transcription start sites (TSSs) distributed within them, and the functional significance of promoter architecture remain poorly understood. We produced a high-resolution map of promoters active in the Drosophila melanogaster embryo by integrating data from three independent and complementary methods: 21 million cap analysis of gene expression (CAGE) tags, 1.2 million RNA ligase mediated rapid amplification of cDNA ends (RLMRACE) reads, and 50,000 cap-trapped expressed sequence tags (ESTs). We defined 12,454 promoters of 8037 genes. Our analysis indicates that, due to non-promoter-associated RNA background signal, previous studies have likely overestimated the number of promoter-associated CAGE clusters by fivefold. We show that TSS distributions form a complex continuum of shapes, and that promoters active in the embryo and adult have highly similar shapes in 95% of cases. This suggests that these distributions are generally determined by static elements such as local DNA sequence and are not modulated by dynamic signals such as histone modifications. Transcription factor binding motifs are differentially enriched as a function of promoter shape, and peaked promoter shape is correlated with both temporal and spatial regulation of gene expression. Our results contribute to the emerging view that core promoters are functionally diverse and control patterning of gene expression in Drosophila and mammals.

  15. Genome-wide sequential, evolutionary, organizational and expression analyses of phenylpropanoid biosynthesis associated MYB domain transcription factors in Arabidopsis.

    PubMed

    Mondal, Sunil Kanti; Roy, Sujit

    2017-06-02

    The MYB gene family represents one of the largest groups of transcription factors in plants. Recent evidences have also demonstrated key role of MYB transcription factors in regulating the expression of major genes involved in the biosynthesis of phenylpropanoid compounds which confer biotic and abiotic stress tolerance in plant species. However, no comprehensive genome-wide analysis of the phenylpropanoid pathway-associated MYB transcription factors has been reported thus far. In this study, 11 Arabidopsis MYB proteins, such as MYB3, MYB4, MYB7, MYB11, MYB12, MYB32, MYB75, MYB90, MYB111, MYB113, and MYB114 were initially identified considering their reported regulatory function in phenylpropanoid biosynthesis pathway. Subsequent genome-wide analysis have identified the corresponding homologues from Glycine max, Vigna radiata, Oryza sativa, and Zea mays, while homologous of Arabidopsis MYB75, MYB90, MYB113, and MYB114 were not detected in rice and maize genomes. The identified MYB proteins were classified into three groups (I-III) based on phylogeny. Sequence and domain analysis revealed presence of two conserved DNA binding MYB domains in the selected MYB proteins. Promoter analysis indicated presence of cis-regulatory elements related to light signaling, development, and stress response. Expression analysis of selected Arabidopsis MYB genes revealed their function in plant development and abiotic stress response, consistent with gene ontology annotations. Together, these results provide a useful framework for further experimental studies for the functional characterization of the target MYB genes in the context of regulation of phenylpropanoid biosynthesis and plant stress response.

  16. Genome-wide annotation of microRNA primary transcript structures reveals novel regulatory mechanisms

    PubMed Central

    Chang, Tsung-Cheng; Pertea, Mihaela; Lee, Sungyul; Salzberg, Steven L.; Mendell, Joshua T.

    2015-01-01

    Precise regulation of microRNA (miRNA) expression is critical for diverse physiologic and pathophysiologic processes. Nevertheless, elucidation of the mechanisms through which miRNA expression is regulated has been greatly hindered by the incomplete annotation of primary miRNA (pri-miRNA) transcripts. While a subset of miRNAs are hosted in protein-coding genes, the majority of pri-miRNAs are transcribed as poorly characterized noncoding RNAs that are 10's to 100's of kilobases in length and low in abundance due to efficient processing by the endoribonuclease DROSHA, which initiates miRNA biogenesis. Accordingly, these transcripts are poorly represented in existing RNA-seq data sets and exhibit limited and inaccurate annotation in current transcriptome assemblies. To overcome these challenges, we developed an experimental and computational approach that allows genome-wide detection and mapping of pri-miRNA structures. Deep RNA-seq in cells expressing dominant-negative DROSHA resulted in much greater coverage of pri-miRNA transcripts compared with standard RNA-seq. A computational pipeline was developed that produces highly accurate pri-miRNA assemblies, as confirmed by extensive validation. This approach was applied to a panel of human and mouse cell lines, providing pri-miRNA transcript structures for 1291/1871 human and 888/1181 mouse miRNAs, including 594 human and 425 mouse miRNAs that fall outside protein-coding genes. These new assemblies uncovered unanticipated features and new potential regulatory mechanisms, including links between pri-miRNAs and distant protein-coding genes, alternative pri-miRNA splicing, and transcripts carrying subsets of miRNAs encoded by polycistronic clusters. These results dramatically expand our understanding of the organization of miRNA-encoding genes and provide a valuable resource for the study of mammalian miRNA regulation. PMID:26290535

  17. Genome-wide Analysis of RNA Polymerase II Termination at Protein-Coding Genes.

    PubMed

    Baejen, Carlo; Andreani, Jessica; Torkler, Phillipp; Battaglia, Sofia; Schwalb, Bjoern; Lidschreiber, Michael; Maier, Kerstin C; Boltendahl, Andrea; Rus, Petra; Esslinger, Stephanie; Söding, Johannes; Cramer, Patrick

    2017-03-06

    At the end of protein-coding genes, RNA polymerase (Pol) II undergoes a concerted transition that involves 3'-processing of the pre-mRNA and transcription termination. Here, we present a genome-wide analysis of the 3'-transition in budding yeast. We find that the 3'-transition globally requires the Pol II elongation factor Spt5 and factors involved in the recognition of the polyadenylation (pA) site and in endonucleolytic RNA cleavage. Pol II release from DNA occurs in a narrow termination window downstream of the pA site and requires the "torpedo" exonuclease Rat1 (XRN2 in human). The Rat1-interacting factor Rai1 contributes to RNA degradation downstream of the pA site. Defects in the 3'-transition can result in increased transcription at downstream genes.

  18. Genome Wide Analysis of Chromatin Regulation by Cocaine Reveals a Novel Role for Sirtuins

    PubMed Central

    Renthal, William; Kumar, Arvind; Xiao, Guanghua; Wilkinson, Matthew; Covington, Herbert E.; Maze, Ian; Sikder, Devanjan; Robison, Alfred J.; LaPlant, Quincey; Dietz, David M.; Russo, Scott J.; Vialou, Vincent; Chakravarty, Sumana; Kodadek, Thomas J.; Stack, Ashley; Kabbaj, Mohammed; Nestler, Eric J.

    2009-01-01

    Summary Changes in gene expression contribute to the long-lasting regulation of the brain’s reward circuitry seen in drug addiction, however, the specific genes regulated and the transcriptional mechanisms underlying such regulation remain poorly understood. Here, we used chromatin immunoprecipitation coupled with promoter microarray analysis to characterize genome-wide chromatin changes in the mouse nucleus accumbens, a crucial brain reward region, after repeated cocaine administration. Our findings reveal several interesting principles of gene regulation by cocaine and of the role of ΔFosB and CREB, two prominent cocaine-induced transcription factors, in this brain region. The findings also provide novel and comprehensive insight into the molecular pathways regulated by cocaine – including a new role for sirtuins (Sirt1 and Sirt2) –which are induced in the nucleus accumbens by cocaine and, in turn, dramatically enhance the behavioral effects of the drug. PMID:19447090

  19. Genome-wide analysis of alternative splicing during human heart development

    NASA Astrophysics Data System (ADS)

    Wang, He; Chen, Yanmei; Li, Xinzhong; Chen, Guojun; Zhong, Lintao; Chen, Gangbing; Liao, Yulin; Liao, Wangjun; Bin, Jianping

    2016-10-01

    Alternative splicing (AS) drives determinative changes during mouse heart development. Recent high-throughput technological advancements have facilitated genome-wide AS, while its analysis in human foetal heart transition to the adult stage has not been reported. Here, we present a high-resolution global analysis of AS transitions between human foetal and adult hearts. RNA-sequencing data showed extensive AS transitions occurred between human foetal and adult hearts, and AS events occurred more frequently in protein-coding genes than in long non-coding RNA (lncRNA). A significant difference of AS patterns was found between foetal and adult hearts. The predicted difference in AS events was further confirmed using quantitative reverse transcription-polymerase chain reaction analysis of human heart samples. Functional foetal-specific AS event analysis showed enrichment associated with cell proliferation-related pathways including cell cycle, whereas adult-specific AS events were associated with protein synthesis. Furthermore, 42.6% of foetal-specific AS events showed significant changes in gene expression levels between foetal and adult hearts. Genes exhibiting both foetal-specific AS and differential expression were highly enriched in cell cycle-associated functions. In conclusion, we provided a genome-wide profiling of AS transitions between foetal and adult hearts and proposed that AS transitions and deferential gene expression may play determinative roles in human heart development.

  20. Genome-wide computational prediction and analysis of core promoter elements across plant monocots and dicots

    USDA-ARS?s Scientific Manuscript database

    Transcription initiation, essential to gene expression regulation, involves recruitment of basal transcription factors to the core promoter elements (CPEs). The distribution of currently known CPEs across plant genomes is largely unknown. This is the first large scale genome-wide report on the compu...

  1. Genome-wide transcription factor gene prediction and their expressional tissue-specificities in maize.

    PubMed

    Jiang, Yi; Zeng, Biao; Zhao, Hainan; Zhang, Mei; Xie, Shaojun; Lai, Jinsheng

    2012-09-01

    Transcription factors (TFs) are important regulators of gene expression. To better understand TF-encoding genes in maize (Zea mays L.), a genome-wide TF prediction was performed using the updated B73 reference genome. A total of 2298 TF genes were identified, which can be classified into 56 families. The largest family, known as the MYB superfamily, comprises 322 MYB and MYB-related TF genes. The expression patterns of 2 014 (87.64%) TF genes were examined using RNA-seq data, which resulted in the identification of a subset of TFs that are specifically expressed in particular tissues (including root, shoot, leaf, ear, tassel and kernel). Similarly, 98 kernel-specific TF genes were further analyzed, and it was observed that 29 of the kernel-specific genes were preferentially expressed in the early kernel developmental stage, while 69 of the genes were expressed in the late kernel developmental stage. Identification of these TFs, particularly the tissue-specific ones, provides important information for the understanding of development and transcriptional regulation of maize.

  2. Genome-wide analysis links NFATC2 with asparaginase hypersensitivity

    PubMed Central

    Fernandez, Christian A.; Smith, Colton; Yang, Wenjian; Mullighan, Charles G.; Qu, Chunxu; Larsen, Eric; Bowman, W. Paul; Liu, Chengcheng; Ramsey, Laura B.; Chang, Tamara; Karol, Seth E.; Loh, Mignon L.; Raetz, Elizabeth A.; Winick, Naomi J.; Hunger, Stephen P.; Carroll, William L.; Jeha, Sima; Pui, Ching-Hon; Evans, William E.; Devidas, Meenakshi

    2015-01-01

    Asparaginase is used to treat acute lymphoblastic leukemia (ALL); however, hypersensitivity reactions can lead to suboptimal asparaginase exposure. Our objective was to use a genome-wide approach to identify loci associated with asparaginase hypersensitivity in children with ALL enrolled on St. Jude Children’s Research Hospital (SJCRH) protocols Total XIIIA (n = 154), Total XV (n = 498), and Total XVI (n = 271), or Children’s Oncology Group protocols POG 9906 (n = 222) and AALL0232 (n = 2163). Germline DNA was genotyped using the Affymetrix 500K, Affymetrix 6.0, or the Illumina Exome BeadChip array. In multivariate logistic regression, the intronic rs6021191 variant in nuclear factor of activated T cells 2 (NFATC2) had the strongest association with hypersensitivity (P = 4.1 × 10−8; odds ratio [OR] = 3.11). RNA-seq data available from 65 SJCRH ALL tumor samples and 52 Yoruba HapMap samples showed that samples carrying the rs6021191 variant had higher NFATC2 expression compared with noncarriers (P = 1.1 × 10−3 and 0.03, respectively). The top ranked nonsynonymous polymorphism was rs17885382 in HLA-DRB1 (P = 3.2 × 10−6; OR = 1.63), which is in near complete linkage disequilibrium with the HLA-DRB1*07:01 allele we previously observed in a candidate gene study. The strongest risk factors for asparaginase allergy are variants within genes regulating the immune response. PMID:25987655

  3. Genome-wide identification and characterization of transcription start sites and promoters in the tunicate Ciona intestinalis

    PubMed Central

    Yokomori, Rui; Shimai, Kotaro; Nishitsuji, Koki; Suzuki, Yutaka; Kusakabe, Takehiro G.; Nakai, Kenta

    2016-01-01

    The tunicate Ciona intestinalis, an invertebrate chordate, has recently emerged as a powerful model organism for gene regulation analysis. However, few studies have been conducted to identify and characterize its transcription start sites (TSSs) and promoters at the genome-wide level. Here, using TSS-seq, we identified TSSs at the genome-wide scale and characterized promoters in C. intestinalis. Specifically, we identified TSS clusters (TSCs), high-density regions of TSS-seq tags, each of which appears to originate from an identical promoter. TSCs were found not only at known TSSs but also in other regions, suggesting the existence of many unknown transcription units in the genome. We also identified candidate promoters of 79 ribosomal protein (RP) genes, each of which had the major TSS in a polypyrimidine tract and showed a sharp TSS distribution like human RP gene promoters. Ciona RP gene promoters, however, did not appear to have typical TATA boxes, unlike human RP gene promoters. In Ciona non-RP promoters, two pyrimidine-purine dinucleotides, CA and TA, were frequently used as TSSs. Despite the absence of CpG islands, Ciona TATA-less promoters showed low expression specificity like CpG-associated human TATA-less promoters. By using TSS-seq, we also predicted trans-spliced gene TSSs and found that their downstream regions had higher G+T content than those of non-trans-spliced gene TSSs. Furthermore, we identified many putative alternative promoters, some of which were regulated in a tissue-specific manner. Our results provide valuable information about TSSs and promoter characteristics in C. intestinalis and will be helpful in future analysis of transcriptional regulation in chordates. PMID:26668163

  4. Genome-wide identification and characterization of transcription start sites and promoters in the tunicate Ciona intestinalis.

    PubMed

    Yokomori, Rui; Shimai, Kotaro; Nishitsuji, Koki; Suzuki, Yutaka; Kusakabe, Takehiro G; Nakai, Kenta

    2016-01-01

    The tunicate Ciona intestinalis, an invertebrate chordate, has recently emerged as a powerful model organism for gene regulation analysis. However, few studies have been conducted to identify and characterize its transcription start sites (TSSs) and promoters at the genome-wide level. Here, using TSS-seq, we identified TSSs at the genome-wide scale and characterized promoters in C. intestinalis. Specifically, we identified TSS clusters (TSCs), high-density regions of TSS-seq tags, each of which appears to originate from an identical promoter. TSCs were found not only at known TSSs but also in other regions, suggesting the existence of many unknown transcription units in the genome. We also identified candidate promoters of 79 ribosomal protein (RP) genes, each of which had the major TSS in a polypyrimidine tract and showed a sharp TSS distribution like human RP gene promoters. Ciona RP gene promoters, however, did not appear to have typical TATA boxes, unlike human RP gene promoters. In Ciona non-RP promoters, two pyrimidine-purine dinucleotides, CA and TA, were frequently used as TSSs. Despite the absence of CpG islands, Ciona TATA-less promoters showed low expression specificity like CpG-associated human TATA-less promoters. By using TSS-seq, we also predicted trans-spliced gene TSSs and found that their downstream regions had higher G+T content than those of non-trans-spliced gene TSSs. Furthermore, we identified many putative alternative promoters, some of which were regulated in a tissue-specific manner. Our results provide valuable information about TSSs and promoter characteristics in C. intestinalis and will be helpful in future analysis of transcriptional regulation in chordates.

  5. Genome-wide DNA methylation analysis in permanent atrial fibrillation.

    PubMed

    Zhao, Guochang; Zhou, Jian; Gao, Jie; Liu, Yan; Gu, Song; Zhang, Xitao; Su, Pixiong

    2017-10-01

    Atrial fibrillation (AF) is a highly heterogeneous genetic disease; however, the pathogenesis of AF cannot be explained by genetic variants alone. DNA methylation is a heritable method of gene expression regulation, and may be a potential regulatory mechanism in AF. Therefore, in the present study, the genome‑wide DNA methylation pattern in cells derived from the left atrium of patients with permanent AF (n=7) was compared with that of healthy heart donors (n=4) with a normal sinus rhythm (SR). Enriched biological functions of the differentially methylated genes were assessed. Integrated analysis of genome‑wide methylation and mRNA expression profiles was performed, and reverse transcription quantitative‑polymerase chain reaction (RT‑qPCR) was used to determine the expression levels of four selected genes. A total of 417 differentially methylated CpG sites were identified in the fibrillating atrium (P<0.05; |β|>0.17); the majority of which were located in gene‑body and intergenic regions outside of CpG islands. Aberrantly methylated genes participated in the activation of inflammation, sodium and potassium ion transport, fibrosis and the reduction of lipid metabolism. Hypermethylation in the AF susceptible loci, paired‑like homeodomain transcription factor 2 (chromosome 4q25) and coiled‑coil domain containing 141 (chromosome 2q31), as well as hypomethylation in the calcium voltage‑gated channel subunit α1C (chromosome 12p13) locus, were identified in all patients with AF. Of the 420 upregulated and 567 downregulated genes previously identified in patients with AF relative to those with normal SR (fold‑change >2.0; P≤0.05), 12 genes were hypomethylated and eight genes were hypermethylated in each group, respectively (|β|>0.2: P<0.05). RT‑qPCR analysis of four of these genes supported the modulatory effect of DNA methylation on gene expression. These results suggest that DNA methylation‑mediated regulation of gene expression may serve

  6. Genome-wide analysis of DNA methylation in Arabidopsis using MeDIP-chip.

    PubMed

    Cortijo, Sandra; Wardenaar, René; Colomé-Tatché, Maria; Johannes, Frank; Colot, Vincent

    2014-01-01

    DNA methylation is an epigenetic mark that is essential for preserving genome integrity and normal development in plants and mammals. Although this modification may serve a variety of purposes, it is best known for its role in stable transcriptional silencing of transposable elements and epigenetic regulation of some genes. In addition, it is increasingly recognized that alterations in DNA methylation patterns can sometimes be inherited across multiple generations and thus are a source of heritable phenotypic variation that is independent of any DNA sequence changes. With the advent of genomics, it is now possible to analyze DNA methylation genome-wide with high precision, which is a prerequisite for understanding fully the various functions and phenotypic impact of this modification. Indeed, several so-called epigenomic mapping methods have been developed for the analysis of DNA methylation. Among these, immunoprecipitation of methylated DNA followed by hybridization to genome tiling arrays (MeDIP-chip) arguably offers a reasonable compromise between cost, ease of implementation, and sensitivity to date. Here we describe the application of this method, from DNA extraction to data analysis, to the study of DNA methylation genome-wide in Arabidopsis.

  7. Genome-Wide Chromatin Immunoprecipitation Sequencing Analysis Shows that WhiB Is a Transcription Factor That Cocontrols Its Regulon with WhiA To Initiate Developmental Cell Division in Streptomyces

    PubMed Central

    Chandra, Govind; Bibb, Maureen J.; Findlay, Kim C.; Buttner, Mark J.

    2016-01-01

    ABSTRACT WhiB is the founding member of a family of proteins (the WhiB-like [Wbl] family) that carry a [4Fe-4S] iron-sulfur cluster and play key roles in diverse aspects of the biology of actinomycetes, including pathogenesis, antibiotic resistance, and the control of development. In Streptomyces, WhiB is essential for the process of developmentally controlled cell division that leads to sporulation. The biochemical function of Wbl proteins has been controversial; here, we set out to determine unambiguously if WhiB functions as a transcription factor using chromatin immunoprecipitation sequencing (ChIP-seq) in Streptomyces venezuelae. In the first demonstration of in vivo genome-wide Wbl binding, we showed that WhiB regulates the expression of key genes required for sporulation by binding upstream of ~240 transcription units. Strikingly, the WhiB regulon is identical to the previously characterized WhiA regulon, providing an explanation for the identical phenotypes of whiA and whiB mutants. Using ChIP-seq, we demonstrated that in vivo DNA binding by WhiA depends on WhiB and vice versa, showing that WhiA and WhiB function cooperatively to control expression of a common set of WhiAB target genes. Finally, we show that mutation of the cysteine residues that coordinate the [4Fe-4S] cluster in WhiB prevents DNA binding by both WhiB and WhiA in vivo. PMID:27094333

  8. Assessing statistical significance in multivariable genome wide association analysis

    PubMed Central

    Buzdugan, Laura; Kalisch, Markus; Navarro, Arcadi; Schunk, Daniel; Fehr, Ernst; Bühlmann, Peter

    2016-01-01

    Motivation: Although Genome Wide Association Studies (GWAS) genotype a very large number of single nucleotide polymorphisms (SNPs), the data are often analyzed one SNP at a time. The low predictive power of single SNPs, coupled with the high significance threshold needed to correct for multiple testing, greatly decreases the power of GWAS. Results: We propose a procedure in which all the SNPs are analyzed in a multiple generalized linear model, and we show its use for extremely high-dimensional datasets. Our method yields P-values for assessing significance of single SNPs or groups of SNPs while controlling for all other SNPs and the family wise error rate (FWER). Thus, our method tests whether or not a SNP carries any additional information about the phenotype beyond that available by all the other SNPs. This rules out spurious correlations between phenotypes and SNPs that can arise from marginal methods because the ‘spuriously correlated’ SNP merely happens to be correlated with the ‘truly causal’ SNP. In addition, the method offers a data driven approach to identifying and refining groups of SNPs that jointly contain informative signals about the phenotype. We demonstrate the value of our method by applying it to the seven diseases analyzed by the Wellcome Trust Case Control Consortium (WTCCC). We show, in particular, that our method is also capable of finding significant SNPs that were not identified in the original WTCCC study, but were replicated in other independent studies. Availability and implementation: Reproducibility of our research is supported by the open-source Bioconductor package hierGWAS. Contact: peter.buehlmann@stat.math.ethz.ch Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153677

  9. Genome-wide analysis highlights genetic dilution in Algerian sheep.

    PubMed

    Gaouar, S B S; Lafri, M; Djaout, A; El-Bouyahiaoui, R; Bouri, A; Bouchatal, A; Maftah, A; Ciani, E; Da Silva, A B

    2017-03-01

    Algeria represents a reservoir of genetic diversity with local sheep breeds adapted to a large range of environments and showing specific features necessary to deal with harsh conditions. This remarkable diversity results from the traditional management of dryland by pastoralists over centuries. Most of these breeds are poorly productive, and the economic pressure leads farmers to realize anarchic cross-breeding (that is, not carried out in the framework of selection plans) with the hope to increase animal's conformation. In this study, eight of the nine local Algerian sheep breeds (D'men, Hamra, Ouled-Djellal, Rembi, Sidaoun, Tazegzawt, Berber and Barbarine) were investigated for the first time by genome-wide single-nucleotide polymorphism genotyping. At an international scale, Algerian sheep occupied an original position shaped by relations with African and European (particularly Italian) breeds. The strong genetic proximity with Caribbean and Brazilian breeds confirmed that the genetic make-up of these American breeds was largely influenced by the Atlantic slave trade. At a national scale, an alarming genetic dilution of the Berber (a primitive breed) and the Rembi was observed, as a consequence of uncontrolled mating practices with Ouled-Djellal. A similar, though less pronounced, phenomenon was also detected for the Barbarine, another ancestral breed. Genetic originality appeared to be better preserved in Tazegzawt, Hamra, D'men and Sidaoun. These breeds should be given high priority in the establishment of conservation plans to halt their progressive loss. For Berber and Barbarine that also occur in the bordering neighbor countries, urgent concerted transnational actions are needed.

  10. Genome-Wide Transcriptional Profiling Reveals Connective Tissue Mast Cell Accumulation in Bronchopulmonary Dysplasia

    PubMed Central

    Bhattacharya, Soumyaroop; Go, Diana; Krenitsky, Daria L.; Huyck, Heidi L.; Solleti, Siva Kumar; Lunger, Valerie A.; Metlay, Leon; Srisuma, Sorachai; Wert, Susan E.; Pryhuber, Gloria S.

    2012-01-01

    Rationale: Bronchopulmonary dysplasia (BPD) is a major complication of premature birth. Risk factors for BPD are complex and include prenatal infection and O2 toxicity. BPD pathology is equally complex and characterized by inflammation and dysmorphic airspaces and vasculature. Due to the limited availability of clinical samples, an understanding of the molecular pathogenesis of this disease and its causal mechanisms and associated biomarkers is limited. Objectives: Apply genome-wide expression profiling to define pathways affected in BPD lungs. Methods: Lung tissue was obtained at autopsy from 11 BPD cases and 17 age-matched control subjects without BPD. RNA isolated from these tissue samples was interrogated using microarrays. Standard gene selection and pathway analysis methods were applied to the data set. Abnormal expression patterns were validated by quantitative reverse transcriptase–polymerase chain reaction and immunohistochemistry. Measurements and Main Results: We identified 159 genes differentially expressed in BPD tissues. Pathway analysis indicated previously appreciated (e.g., DNA damage regulation of cell cycle) as well as novel (e.g., B-cell development) biological functions were affected. Three of the five most highly induced genes were mast cell (MC)-specific markers. We confirmed an increased accumulation of connective tissue MCTC (chymase expressing) mast cells in BPD tissues. Increased expression of MCTC markers was also demonstrated in an animal model of BPD-like pathology. Conclusions: We present a unique genome-wide expression data set from human BPD lung tissue. Our data provide information on gene expression patterns associated with BPD and facilitated the discovery that MCTC accumulation is a prominent feature of this disease. These observations have significant clinical and mechanistic implications. PMID:22723293

  11. From Human Monocytes to Genome-Wide Binding Sites - A Protocol for Small Amounts of Blood: Monocyte Isolation/ChIP-Protocol/Library Amplification/Genome Wide Computational Data Analysis

    PubMed Central

    Weiterer, Sebastian; Uhle, Florian; Bhuju, Sabin; Jarek, Michael; Weigand, Markus A.; Bartkuhn, Marek

    2014-01-01

    Chromatin immunoprecipitation in combination with a genome-wide analysis via high-throughput sequencing is the state of the art method to gain genome-wide representation of histone modification or transcription factor binding profiles. However, chromatin immunoprecipitation analysis in the context of human experimental samples is limited, especially in the case of blood cells. The typically extremely low yields of precipitated DNA are usually not compatible with library amplification for next generation sequencing. We developed a highly reproducible protocol to present a guideline from the first step of isolating monocytes from a blood sample to analyse the distribution of histone modifications in a genome-wide manner. Conclusion: The protocol describes the whole work flow from isolating monocytes from human blood samples followed by a high-sensitivity and small-scale chromatin immunoprecipitation assay with guidance for generating libraries compatible with next generation sequencing from small amounts of immunoprecipitated DNA. PMID:24732314

  12. Genome-wide DNA methylation analysis using massively parallel sequencing technologies.

    PubMed

    Suzuki, Masako; Greally, John M

    2013-01-01

    "Epigenetics" refers to a heritable change in transcriptional status without alteration in the primary nucleotide sequence. Epigenetics provides an extra layer of transcriptional control and plays a crucial role in normal development, as well as in pathological conditions. DNA methylation is one of the best known and well-studied epigenetic modifications. Genome-wide DNA methylation profiling has become recognized as a biologically and clinically important epigenomic assay. In this review, we discuss the strengths and weaknesses of the protocols for genome-wide DNA methylation profiling using massively parallel sequencing (MPS) techniques. We will also describe recently discovered DNA modifications, and the protocols to detect these modifications.

  13. Genome-wide analysis of TCP family in tobacco.

    PubMed

    Chen, L; Chen, Y Q; Ding, A M; Chen, H; Xia, F; Wang, W F; Sun, Y H

    2016-05-23

    The TCP family is a transcription factor family, members of which are extensively involved in plant growth and development as well as in signal transduction in the response against many physiological and biochemical stimuli. In the present study, 61 TCP genes were identified in tobacco (Nicotiana tabacum) genome. Bioinformatic methods were employed for predicting and analyzing the gene structure, gene expression, phylogenetic analysis, and conserved domains of TCP proteins in tobacco. The 61 NtTCP genes were divided into three diverse groups, based on the division of TCP genes in tomato and Arabidopsis, and the results of the conserved domain and sequence analyses further confirmed the classification of the NtTCP genes. The expression pattern of NtTCP also demonstrated that majority of these genes play important roles in all the tissues, while some special genes exercise their functions only in specific tissues. In brief, the comprehensive and thorough study of the TCP family in other plants provides sufficient resources for studying the structure and functions of TCPs in tobacco.

  14. Genome-wide transcriptional and physiological responses of Bradyrhizobium japonicum to paraquat-mediated oxidative stress.

    PubMed

    Donati, Andrew J; Jeon, Jeong-Min; Sangurdekar, Dipen; So, Jae-Seong; Chang, Woo-Suk

    2011-06-01

    The rhizobial bacterium Bradyrhizobium japonicum functions as a nitrogen-fixing symbiont of the soybean plant (Glycine max). Plants are capable of producing an oxidative burst, a rapid proliferation of reactive oxygen species (ROS), as a defense mechanism against pathogenic and symbiotic bacteria. Therefore, B. japonicum must be able to resist such a defense mechanism to initiate nodulation. In this study, paraquat, a known superoxide radical-inducing agent, was used to investigate this response. Genome-wide transcriptional profiles were created for both prolonged exposure (PE) and fulminant shock (FS) conditions. These profiles revealed that 190 and 86 genes were up- and downregulated for the former condition, and that 299 and 105 genes were up- and downregulated for the latter condition, respectively (>2.0-fold; P < 0.05). Many genes within putative operons for F(0)F(1)-ATP synthase, chemotaxis, transport, and ribosomal proteins were upregulated during PE. The transcriptional profile for the FS condition strangely resembled that of a bacteroid condition, including the FixK(2) transcription factor and most of its response elements. However, genes encoding canonical ROS scavenging enzymes, such as superoxide dismutase and catalase, were not detected, suggesting constitutive expression of those genes by endogenous ROS. Various physiological tests, including exopolysaccharide (EPS), cellular protein, and motility characterization, were performed to corroborate the gene expression data. The results suggest that B. japonicum responds to tolerable oxidative stress during PE through enhanced motility, increased translational activity, and EPS production, in addition to the expression of genes involved in global stress responses, such as chaperones and sigma factors.

  15. Genome-Wide Identification and Expression Profile of Dof Transcription Factor Gene Family in Pepper (Capsicum annuum L.).

    PubMed

    Wu, Zhiming; Cheng, Jiaowen; Cui, Junjie; Xu, Xiaowan; Liang, Guansheng; Luo, Xirong; Chen, Xiaocui; Tang, Xiangqun; Hu, Kailin; Qin, Cheng

    2016-01-01

    Dof (DNA-binding One Zinc Finger) transcription factor family is unique to plants and has diverse roles associated with plant-specific phenomena, such as light, phytohormone and defense responses as well as seed development and germination. Although, genome-wide analysis of this family has been performed in many species, information regarding Dof genes in the pepper, Capsicum annuum L., is extremely limited. In this study, exhaustive searches of pepper genome revealed 33 potential CaDofs that were phylogenetically clustered into four subgroups. Twenty-nine of the 33 Dof genes could be mapped on 11 chromosomes, except for chromosome 7. The intron/exon organizations and conserved motif compositions of these genes were also analyzed. Additionally, phylogenetic analysis and classification of the Dof transcription factor family in eight plant species revealed that S. lycopersicum and C. annuum as well as O. sativa and S. bicolor Dof proteins may have evolved conservatively. Moreover, comprehensive expression analysis of CaDofs using a RNA-seq atlas and quantitative real-time polymerase chain reaction (qRT-PCR) revealed that these genes exhibit a variety of expression patterns. Most of the CaDofs were expressed in at least one of the tissues tested, whereas several genes were identified as being highly responsive to heat and salt stresses. Overall, this study describes the first genome-wide analysis of the pepper Dof family, whose genes exhibited different expression patterns in all primary fruit developmental stages and tissue types, as in response to abiotic stress. In particular, some Dof genes might be used as biomarkers for heat and salt stress. The results could expand our understanding of the roles of Dof genes in pepper.

  16. Genome-Wide Identification and Expression Profile of Dof Transcription Factor Gene Family in Pepper (Capsicum annuum L.)

    PubMed Central

    Wu, Zhiming; Cheng, Jiaowen; Cui, Junjie; Xu, Xiaowan; Liang, Guansheng; Luo, Xirong; Chen, Xiaocui; Tang, Xiangqun; Hu, Kailin; Qin, Cheng

    2016-01-01

    Dof (DNA-binding One Zinc Finger) transcription factor family is unique to plants and has diverse roles associated with plant-specific phenomena, such as light, phytohormone and defense responses as well as seed development and germination. Although, genome-wide analysis of this family has been performed in many species, information regarding Dof genes in the pepper, Capsicum annuum L., is extremely limited. In this study, exhaustive searches of pepper genome revealed 33 potential CaDofs that were phylogenetically clustered into four subgroups. Twenty-nine of the 33 Dof genes could be mapped on 11 chromosomes, except for chromosome 7. The intron/exon organizations and conserved motif compositions of these genes were also analyzed. Additionally, phylogenetic analysis and classification of the Dof transcription factor family in eight plant species revealed that S. lycopersicum and C. annuum as well as O. sativa and S. bicolor Dof proteins may have evolved conservatively. Moreover, comprehensive expression analysis of CaDofs using a RNA-seq atlas and quantitative real-time polymerase chain reaction (qRT-PCR) revealed that these genes exhibit a variety of expression patterns. Most of the CaDofs were expressed in at least one of the tissues tested, whereas several genes were identified as being highly responsive to heat and salt stresses. Overall, this study describes the first genome-wide analysis of the pepper Dof family, whose genes exhibited different expression patterns in all primary fruit developmental stages and tissue types, as in response to abiotic stress. In particular, some Dof genes might be used as biomarkers for heat and salt stress. The results could expand our understanding of the roles of Dof genes in pepper. PMID:27200047

  17. Genome-wide analysis of mRNAs associated with mouse peroxisomes.

    PubMed

    Yarmishyn, Aliaksandr A; Kremenskoy, Maksym; Batagov, Arsen O; Preuss, Axel; Wong, Jin Huei; Kurochkin, Igor V

    2016-12-22

    RNA is often targeted to be localized to the specific subcellular compartments. Specific localization of mRNA is believed to be an important mechanism for targeting their protein products to the locations, where their function is required. In this study we performed the genome wide transcriptome analysis of peroxisome preparations from the mouse liver using microarrays. We demonstrate that RNA is absent inside peroxisomes, however it is associated at their exterior via the noncovalent contacts with the membrane proteins. We detect enrichment of specific sets of transcripts in two preparations of peroxisomes, purified with different degrees of stringency. Importantly, among these were mRNAs encoding bona fide peroxisomal proteins, such as peroxins and peroxisomal matrix enzymes involved in beta-oxidation of fatty acids and bile acid biosynthesis. The top-most enriched mRNA, whose association with peroxisomes we confirm microscopically was Hmgcs1, encoding 3-hydroxy-3-methylglutaryl-CoA synthase, a crucial enzyme of cholesterol biosynthesis pathway. We observed significant representation of mRNAs encoding mitochondrial and secreted proteins in the peroxisomal fractions. This is a pioneer genome-wide study of localization of mRNAs to peroxisomes that provides foundation for more detailed dissection of mechanisms of RNA targeting to subcellular compartments.

  18. Mapping genome-wide transcription-factor binding sites using DAP-seq.

    PubMed

    Bartlett, Anna; O'Malley, Ronan C; Huang, Shao-Shan Carol; Galli, Mary; Nery, Joseph R; Gallavotti, Andrea; Ecker, Joseph R

    2017-08-01

    To enable low-cost, high-throughput generation of cistrome and epicistrome maps for any organism, we developed DNA affinity purification sequencing (DAP-seq), a transcription factor (TF)-binding site (TFBS) discovery assay that couples affinity-purified TFs with next-generation sequencing of a genomic DNA library. The method is fast, inexpensive, and more easily scaled than chromatin immunoprecipitation sequencing (ChIP-seq). DNA libraries are constructed using native genomic DNA from any source of interest, preserving cell- and tissue-specific chemical modifications that are known to affect TF binding (such as DNA methylation) and providing increased specificity as compared with in silico predictions based on motifs from methods such as protein-binding microarrays (PBMs) and systematic evolution of ligands by exponential enrichment (SELEX). The resulting DNA library is incubated with an affinity-tagged in vitro-expressed TF, and TF-DNA complexes are purified using magnetic separation of the affinity tag. Bound genomic DNA is eluted from the TF and sequenced using next-generation sequencing. Sequence reads are mapped to a reference genome, identifying genome-wide binding locations for each TF assayed, from which sequence motifs can then be derived. A researcher with molecular biology experience should be able to follow this protocol, processing up to 400 samples per week.

  19. Genome-wide histone acetylation correlates with active transcription in maize.

    PubMed

    Zhang, Wei; Garcia, Nelson; Feng, Yaping; Zhao, Han; Messing, Joachim

    2015-10-01

    Gene expression is regulated at many different levels during the life cycle of all plant species. Recent investigations have taken advantage of next-generation sequencing to study the relevance of DNA methylation and sRNAs in controlling tissue-specific gene expression in maize at the genome-wide level. Here, we profiled H3K27ac in maize, which has one of the largest sequenced plant genomes due to the amplification of retrotransposons. Because transcribed genes represent only a small proportion of its genome, gene-specific epigenetic modifications are concentrated in a relatively small percentage of the genome. Indeed, H3K27ac marks are mostly in gene-rich, in contrast to gene-poor regions. A large proportion of those marks are located in transcribed regions of genes, including 111 out of 458 known genetic loci. Moreover, increased transcription correlates with the presence of H3K27ac modification in gene bodies. Using maize as an example, we suggest that H3K27ac marks actively transcribed genes in plants. Copyright © 2015 Elsevier Inc. All rights reserved.

  20. MPE-seq, a new method for the genome-wide analysis of chromatin structure

    PubMed Central

    Ishii, Haruhiko; Kadonaga, James T.; Ren, Bing

    2015-01-01

    The analysis of chromatin structure is essential for the understanding of transcriptional regulation in eukaryotes. Here we describe methidiumpropyl-EDTA sequencing (MPE-seq), a method for the genome-wide characterization of chromatin that involves the digestion of nuclei withMPE-Fe(II) followed by massively parallel sequencing. Like micrococcal nuclease (MNase), MPE-Fe(II) preferentially cleaves the linker DNA between nucleosomes. However, there are differences in the cleavage of nuclear chromatin by MPE-Fe(II) relative to MNase. Most notably, immediately upstream of the transcription start site of active promoters, we frequently observed nucleosome-sized (141–190 bp) and subnucleosome-sized (such as 101–140 bp) peaks of digested chromatin fragments with MPE-seq but not with MNase-seq. These peaks also correlate with the presence of core histones and could thus be due, at least in part, to noncanonical chromatin structures such as labile nucleosome-like particles that have been observed in other contexts. The subnucleosome-sized MPE-seq peaks exhibit a particularly distinct association with active promoters. In addition, unlike MNase, MPE-Fe(II) cleaves nuclear DNA with little sequence bias. In this regard, we found that DNA sequences at RNA splice sites are hypersensitive to digestion by MNase but not by MPE-Fe(II). This phenomenon may have affected the analysis of nucleosome occupancy over exons. These findings collectively indicate that MPE-seq provides a unique and straightforward means for the genome-wide analysis of chromatin structure with minimal DNA sequence bias. In particular, the combined use of MPE-seq and MNase-seq enables the identification of noncanonical chromatin structures that are likely to be important for the regulation of gene expression. PMID:26080409

  1. Genome-wide profiling of RNA polymerase transcription at nucleotide resolution in human cells with native elongating transcript sequencing

    PubMed Central

    Mayer, Andreas; Churchman, L. Stirling

    2017-01-01

    Many features of gene transcription in human cells remain unclear, mainly due to a lack of quantitative approaches to follow genome transcription with nucleotide precision in vivo. Here we present a robust genome-wide approach to study RNA polymerase (Pol) II-mediated transcription in human cells at single-nucleotide resolution by native elongating transcript sequencing (NET-seq). Elongating RNA polymerase and the associated nascent RNA is prepared by cell fractionation, avoiding immunoprecipitation or RNA labeling. The 3′-ends of nascent RNAs are captured through barcode linker ligation and converted into a DNA sequencing library. The identity and abundance of the 3′-ends are determined by high-throughput sequencing, revealing the exact genomic locations of Pol II. Human NET-seq can be applied to study the full spectrum of Pol II transcriptional activities, including the production of unstable RNAs and transcriptional pausing. Using the protocol described here, a NET-seq library can be obtained from human cells in 5 days. PMID:27010758

  2. Genome-wide analysis of Polycomb targets in Drosophila

    SciTech Connect

    Schwartz, Yuri B.; Kahn, Tatyana G.; Nix, David A.; Li,Xiao-Yong; Bourgon, Richard; Biggin, Mark; Pirrotta, Vincenzo

    2006-04-01

    Polycomb Group (PcG) complexes are multiprotein assemblages that bind to chromatin and establish chromatin states leading to epigenetic silencing. PcG proteins regulate homeotic genes in flies and vertebrates but little is known about other PcG targets and the role of the PcG in development, differentiation and disease. We have determined the distribution of the PcG proteins PC, E(Z) and PSC and of histone H3K27 trimethylation in the Drosophila genome. At more than 200 PcG target genes, binding sites for the three PcG proteins colocalize to presumptive Polycomb Response Elements (PREs). In contrast, H3 me3K27 forms broad domains including the entire transcription unit and regulatory regions. PcG targets are highly enriched in genes encoding transcription factors but receptors, signaling proteins, morphogens and regulators representing all major developmental pathways are also included.

  3. A high definition look at the NF-Y regulome reveals genome-wide associations with selected transcription factors.

    PubMed

    Dolfini, Diletta; Zambelli, Federico; Pedrazzoli, Maurizio; Mantovani, Roberto; Pavesi, Giulio

    2016-06-02

    NF-Y is a trimeric transcription factor (TF), binding the CCAAT box element, for which several results suggest a pioneering role in activation of transcription. In this work, we integrated 380 ENCODE ChIP-Seq experiments for 154 TFs and cofactors with sequence analysis, protein-protein interactions and RNA profiling data, in order to identify genome-wide regulatory modules resulting from the co-association of NF-Y with other TFs. We identified three main degrees of co-association with NF-Y for sequence-specific TFs. In the most relevant one, we found TFs having a significant overlap with NF-Y in their DNA binding loci, some with a precise spacing of binding sites with respect to the CCAAT box, others (FOS, Sp1/2, RFX5, IRF3, PBX3) mostly lacking their canonical binding site and bound to arrays of well spaced CCAAT boxes. As expected, NF-Y binding also correlates with RNA Pol II General TFs and with subunits of complexes involved in the control of H3K4 methylations. Co-association patterns are confirmed by protein-protein interactions, and correspond to specific functional categorizations and expression level changes of target genes following NF-Y inactivation. These data define genome-wide rules for the organization of NF-Y-centered regulatory modules, supporting a model of distinct categorization and synergy with well defined sets of TFs. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  4. Genome-wide Scanning and Characterization of Sorghum bicolor L. Heat Shock Transcription Factors

    PubMed Central

    Nagaraju, M.; Reddy, Palakolanu Sudhakar; Kumar, S. Anil; Srivastava, Rakesh K.; Kishor, P. B. Kavi; Rao, D. Manohar

    2015-01-01

    A genome-wide scanning of Sorghum bicolor resulted in the identification of 25 SbHsf genes. Phylogenetic analysis shows the ortholog genes that are clustered with only rice, representing a common ancestor. Promoter analysis revealed the identification of different cis-acting elements that are responsible for abiotic as well as biotic stresses. Hsf domains like DBD, NLS, NES, and AHA have been analyzed for their sequence similarity and functional characterization. Tissue specific expression patterns of Hsfs in different tissues like mature embryo, seedling, root, and panicle were studied using real-time PCR. While Hsfs4 and 22 are highly expressed in panicle, 4 and 9 are expressed in seedlings. Sorghum plants were exposed to different abiotic stress treatments but no expression of any Hsf was observed when seedlings were treated with ABA. High level expression of Hsf1 was noticed during high temperature as well as cold stresses, 4 and 6 during salt and 5, 6, 10, 13, 19, 23 and 25 during drought stress. This comprehensive analysis of SbHsf genes will provide an insight on how these genes are regulated in different tissues and also under different abiotic stresses and help to determine the functions of Hsfs during drought and temperature stress tolerance. PMID:27006630

  5. Mammalian NET-seq analysis defines nascent RNA profiles and associated RNA processing genome-wide

    PubMed Central

    Nojima, Takayuki; Gomes, Tomás; Carmo-Fonseca, Maria; Proudfoot, Nicholas J

    2016-01-01

    The transcription cycle of RNA polymerase II (Pol II) correlates with changes to the phosphorylation state of its large subunit C-terminal domain (CTD). We recently developed Native Elongation Transcript sequencing using mammalian cells (mNET-seq), which generates single-nucleotide–resolution genome-wide profiles of nascent RNA and co-transcriptional RNA processing that are associated with different CTD phosphorylation states. Here we provide a detailed protocol for mNET-seq. First, Pol II elongation complexes are isolated with specific phospho-CTD antibodies from chromatin solubilized by micrococcal nuclease digestion. Next, RNA derived from within the Pol II complex is size fractionated and Illumina sequenced. using mNET-seq, we have previously shown that Pol II pauses at both ends of protein-coding genes but with different CTD phosphorylation patterns, and we have also detected phosphorylation at serine 5 (Ser5-P) CTD-specific splicing intermediates and Pol II accumulation over co-transcriptionally spliced exons. With moderate biochemical and bioinformatic skills, mNET-seq can be completed in ~6 d, not including sequencing and data analysis. PMID:26844429

  6. ChEA: transcription factor regulation inferred from integrating genome-wide ChIP-X experiments

    PubMed Central

    Lachmann, Alexander; Xu, Huilei; Krishnan, Jayanth; Berger, Seth I.; Mazloom, Amin R.; Ma'ayan, Avi

    2010-01-01

    Motivation: Experiments such as ChIP-chip, ChIP-seq, ChIP-PET and DamID (the four methods referred herein as ChIP-X) are used to profile the binding of transcription factors to DNA at a genome-wide scale. Such experiments provide hundreds to thousands of potential binding sites for a given transcription factor in proximity to gene coding regions. Results: In order to integrate data from such studies and utilize it for further biological discovery, we collected interactions from such experiments to construct a mammalian ChIP-X database. The database contains 189 933 interactions, manually extracted from 87 publications, describing the binding of 92 transcription factors to 31 932 target genes. We used the database to analyze mRNA expression data where we perform gene-list enrichment analysis using the ChIP-X database as the prior biological knowledge gene-list library. The system is delivered as a web-based interactive application called ChIP Enrichment Analysis (ChEA). With ChEA, users can input lists of mammalian gene symbols for which the program computes over-representation of transcription factor targets from the ChIP-X database. The ChEA database allowed us to reconstruct an initial network of transcription factors connected based on shared overlapping targets and binding site proximity. To demonstrate the utility of ChEA we present three case studies. We show how by combining the Connectivity Map (CMAP) with ChEA, we can rank pairs of compounds to be used to target specific transcription factor activity in cancer cells. Availability: The ChEA software and ChIP-X database is freely available online at: http://amp.pharm.mssm.edu/lib/chea.jsp Contact: avi.maayan@mssm.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:20709693

  7. Quantitative genome-wide methylation analysis of high-grade non-muscle invasive bladder cancer

    PubMed Central

    Kitchen, Mark O.; Bryan, Richard T.; Emes, Richard D.; Glossop, John R.; Luscombe, Christopher; Cheng, K. K.; Zeegers, Maurice P.; James, Nicholas D.; Devall, Adam J.; Mein, Charles A.; Gommersall, Lyndon; Fryer, Anthony A.; Farrell, William E.

    2016-01-01

    ABSTRACT High-grade non-muscle invasive bladder cancer (HG-NMIBC) is a clinically unpredictable disease with greater risks of recurrence and progression relative to their low-intermediate-grade counterparts. The molecular events, including those affecting the epigenome, that characterize this disease entity in the context of tumor development, recurrence, and progression, are incompletely understood. We therefore interrogated genome-wide DNA methylation using HumanMethylation450 BeadChip arrays in 21 primary HG-NMIBC tumors relative to normal bladder controls. Using strict inclusion-exclusion criteria we identified 1,057 hypermethylated CpGs within gene promoter-associated CpG islands, representing 256 genes. We validated the array data by bisulphite pyrosequencing and examined 25 array-identified candidate genes in an independent cohort of 30 HG-NMIBC and 18 low-intermediate-grade NMIBC. These analyses revealed significantly higher methylation frequencies in high-grade tumors relative to low-intermediate-grade tumors for the ATP5G2, IRX1 and VAX2 genes (P<0.05), and similarly significant increases in mean levels of methylation in high-grade tumors for the ATP5G2, VAX2, INSRR, PRDM14, VSX1, TFAP2b, PRRX1, and HIST1H4F genes (P<0.05). Although inappropriate promoter methylation was not invariantly associated with reduced transcript expression, a significant association was apparent for the ARHGEF4, PON3, STAT5a, and VAX2 gene transcripts (P<0.05). Herein, we present the first genome-wide DNA methylation analysis in a unique HG-NMIBC cohort, showing extensive and discrete methylation changes relative to normal bladder and low-intermediate-grade tumors. The genes we identified hold significant potential as targets for novel therapeutic intervention either alone, or in combination, with more conventional therapeutic options in the treatment of this clinically unpredictable disease. PMID:26929985

  8. The genome-wide binding profile of the Sulfolobus solfataricus transcription factor Ss-LrpB shows binding events beyond direct transcription regulation.

    PubMed

    Nguyen-Duc, Trong; van Oeffelen, Liesbeth; Song, Ningning; Hassanzadeh-Ghassabeh, Gholamreza; Muyldermans, Serge; Charlier, Daniel; Peeters, Eveline

    2013-11-25

    Gene regulatory processes are largely resulting from binding of transcription factors to specific genomic targets. Leucine-responsive Regulatory Protein (Lrp) is a prevalent transcription factor family in prokaryotes, however, little information is available on biological functions of these proteins in archaea. Here, we study genome-wide binding of the Lrp-like transcription factor Ss-LrpB from Sulfolobus solfataricus. Chromatin immunoprecipitation in combination with DNA microarray analysis (ChIP-chip) has revealed that Ss-LrpB interacts with 36 additional loci besides the four previously identified local targets. Only a subset of the newly identified binding targets, concentrated in a highly variable IS-dense genomic region, is also bound in vitro by pure Ss-LrpB. There is no clear relationship between the in vitro measured DNA-binding specificity of Ss-LrpB and the in vivo association suggesting a limited permissivity of the crenarchaeal chromatin for transcription factor binding. Of 37 identified binding regions, 29 are co-bound by LysM, another Lrp-like transcription factor in S. solfataricus. Comparative gene expression analysis in an Ss-lrpB mutant strain shows no significant Ss-LrpB-mediated regulation for most targeted genes, with exception of the CRISPR B cluster, which is activated by Ss-LrpB through binding to a specific motif in the leader region. The genome-wide binding profile presented here implies that Ss-LrpB is associated at additional genomic binding sites besides the local gene targets, but acts as a specific transcription regulator in the tested growth conditions. Moreover, we have provided evidence that two Lrp-like transcription factors in S. solfataricus, Ss-LrpB and LysM, interact in vivo.

  9. Genome-wide analysis of long noncoding RNA (lncRNA) expression in hepatoblastoma tissues.

    PubMed

    Dong, Rui; Jia, Deshui; Xue, Ping; Cui, Ximao; Li, Kai; Zheng, Shan; He, Xianghuo; Dong, Kuiran

    2014-01-01

    Long noncoding RNAs (lncRNAs) have crucial roles in cancer biology. We performed a genome-wide analysis of lncRNA expression in hepatoblastoma tissues to identify novel targets for further study of hepatoblastoma. Hepatoblastoma and normal liver tissue samples were obtained from hepatoblastoma patients. The genome-wide analysis of lncRNA expression in these tissues was performed using a 4×180 K lncRNA microarray and Sureprint G3 Human lncRNA Chips. Quantitative RT-PCR (qRT-PCR) was performed to confirm these results. The differential expressions of lncRNAs and mRNAs were identified through fold-change filtering. Gene Ontology (GO) and pathway analyses were performed using the standard enrichment computation method. Associations between lncRNAs and adjacent protein-coding genes were determined through complex transcriptional loci analysis. We found that 2736 lncRNAs were differentially expressed in hepatoblastoma tissues. Among these, 1757 lncRNAs were upregulated more than two-fold relative to normal tissues and 979 lncRNAs were downregulated. Moreover, in hepatoblastoma there were 420 matched lncRNA-mRNA pairs for 120 differentially expressed lncRNAs, and 167 differentially expressed mRNAs. The co-expression network analysis predicted 252 network nodes and 420 connections between 120 lncRNAs and 132 coding genes. Within this co-expression network, 369 pairs were positive, and 51 pairs were negative. Lastly, qRT-PCR data verified six upregulated and downregulated lncRNAs in hepatoblastoma, plus endothelial cell-specific molecule 1 (ESM1) mRNA. Our results demonstrated that expression of these aberrant lncRNAs could respond to hepatoblastoma development. Further study of these lncRNAs could provide useful insight into hepatoblastoma biology.

  10. Genome-wide screen identifies a novel p97/CDC-48-dependent pathway regulating ER-stress-induced gene transcription.

    PubMed

    Marza, Esther; Taouji, Saïd; Barroso, Kim; Raymond, Anne-Aurélie; Guignard, Léo; Bonneu, Marc; Pallares-Lupon, Néstor; Dupuy, Jean-William; Fernandez-Zapico, Martin E; Rosenbaum, Jean; Palladino, Francesca; Dupuy, Denis; Chevet, Eric

    2015-03-01

    The accumulation of misfolded proteins in the endoplasmic reticulum (ER) activates the Unfolded Protein Response (UPR(ER)) to restore ER homeostasis. The AAA(+) ATPase p97/CDC-48 plays key roles in ER stress by promoting both ER protein degradation and transcription of UPR(ER) genes. Although the mechanisms associated with protein degradation are now well established, the molecular events involved in the regulation of gene transcription by p97/CDC-48 remain unclear. Using a reporter-based genome-wide RNAi screen in combination with quantitative proteomic analysis in Caenorhabditis elegans, we have identified RUVB-2, a AAA(+) ATPase, as a novel repressor of a subset of UPR(ER) genes. We show that degradation of RUVB-2 by CDC-48 enhances expression of ER stress response genes through an XBP1-dependent mechanism. The functional interplay between CDC-48 and RUVB-2 in controlling transcription of select UPR(ER) genes appears conserved in human cells. Together, these results describe a novel role for p97/CDC-48, whereby its role in protein degradation is integrated with its role in regulating expression of ER stress response genes.

  11. Genome-Wide Transcriptome Analysis of Cadmium Stress in Rice

    PubMed Central

    Oono, Youko; Yazawa, Takayuki; Kanamori, Hiroyuki; Sasaki, Harumi; Mori, Satomi; Handa, Hirokazu; Matsumoto, Takashi

    2016-01-01

    Rice growth is severely affected by toxic concentrations of the nonessential heavy metal cadmium (Cd). To elucidate the molecular basis of the response to Cd stress, we performed mRNA sequencing of rice following our previous study on exposure to high concentrations of Cd (Oono et al., 2014). In this study, rice plants were hydroponically treated with low concentrations of Cd and approximately 211 million sequence reads were mapped onto the IRGSP-1.0 reference rice genome sequence. Many genes, including some identified under high Cd concentration exposure in our previous study, were found to be responsive to low Cd exposure, with an average of about 11,000 transcripts from each condition. However, genes expressed constitutively across the developmental course responded only slightly to low Cd concentrations, in contrast to their clear response to high Cd concentration, which causes fatal damage to rice seedlings according to phenotypic changes. The expression of metal ion transporter genes tended to correlate with Cd concentration, suggesting the potential of the RNA-Seq strategy to reveal novel Cd-responsive transporters by analyzing gene expression under different Cd concentrations. This study could help to develop novel strategies for improving tolerance to Cd exposure in rice and other cereal crops. PMID:27034955

  12. Impaired alveolarization and intra-uterine growth restriction in rats: a postnatal genome-wide analysis.

    PubMed

    Zana-Taieb, E; Pham, H; Franco-Montoya, M L; Jacques, S; Letourneur, F; Baud, O; Jarreau, P H; Vaiman, D

    2015-02-01

    Intra-uterine growth restriction (IUGR) dramatically increases the risk of bronchopulmonary dysplasia in preterm babies, a disease characterized by arrested alveolarization and abnormal microvascular angiogenesis. We have previously described a rodent low protein diet (LPD) model of IUGR inducing impaired alveolarization, but failed to demonstrate any modification of the classical factors involved in lung development. We performed a genome-wide microarray analysis in 120 rat pups with LPD-induced IUGR and their controls, at three key time points of the alveolarization process: postnatal day 4 (P4): start of alveolarization; P10: peak of the alveolarization process and P21: end of the alveolarization process. Results were analysed using Arraymining, DAVID and KEGG software and validated by qRT-PCR and western blots. Considering a cut-off of 2:1 as significant, 67 transcripts at P4, 102 transcripts at P10 and 451 transcripts at P21 were up-regulated, and 89 transcripts at P4, 25 transcripts at P10 and 585 transcripts at P21 were down-regulated. Automatic functional classification identified three main modified pathways, 'cell adhesion molecules', 'cardiac muscle contraction' and 'peroxisome proliferator-activated receptor' (PPAR). Protein analysis confirmed involvement of the PPAR pathway, with an increase of FABP4, an activator of this pathway, at P4 and an increase of adiponectin at P21. Other data also suggest involvement of the PPAR pathway in impaired alveolarization. Our results show that deregulation of the PPAR pathway may be an important component of the mechanism inducing impaired alveolarization observed in IUGR. The complete dataset is available as GEO profiles on the Gene Expression Omnibus (GEO) database ( www.ncbi.nih.gov/geo/, GEO Accession No. GSE56956). Copyright © 2014 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd.

  13. Genome-wide microarray analysis of tomato roots showed defined responses to iron deficiency

    PubMed Central

    2012-01-01

    Background Plants react to iron deficiency stress adopting different kind of adaptive responses. Tomato, a Strategy I plant, improves iron uptake through acidification of rhizosphere, reduction of Fe3+ to Fe2+ and transport of Fe2+ into the cells. Large-scale transcriptional analyses of roots under iron deficiency are only available for a very limited number of plant species with particular emphasis for Arabidopsis thaliana. Regarding tomato, an interesting model species for Strategy I plants and an economically important crop, physiological responses to Fe-deficiency have been thoroughly described and molecular analyses have provided evidence for genes involved in iron uptake mechanisms and their regulation. However, no detailed transcriptome analysis has been described so far. Results A genome-wide transcriptional analysis, performed with a chip that allows to monitor the expression of more than 25,000 tomato transcripts, identified 97 differentially expressed transcripts by comparing roots of Fe-deficient and Fe-sufficient tomato plants. These transcripts are related to the physiological responses of tomato roots to the nutrient stress resulting in an improved iron uptake, including regulatory aspects, translocation, root morphological modification and adaptation in primary metabolic pathways, such as glycolysis and TCA cycle. Other genes play a role in flavonoid biosynthesis and hormonal metabolism. Conclusions The transcriptional characterization confirmed the presence of the previously described mechanisms to adapt to iron starvation in tomato, but also allowed to identify other genes potentially playing a role in this process, thus opening new research perspectives to improve the knowledge on the tomato root response to the nutrient deficiency. PMID:22433273

  14. Genome-wide linkage analysis and association study identifies loci for polydactyly in chickens.

    PubMed

    Sun, Yanfa; Liu, Ranran; Zhao, Guiping; Zheng, Maiqing; Sun, Yan; Yu, Xiaoqiong; Li, Peng; Wen, Jie

    2014-04-21

    Polydactyly occurs in some chicken breeds, but the molecular mechanism remains incompletely understood. Combined genome-wide linkage analysis and association study (GWAS) for chicken polydactyly helps identify loci or candidate genes for the trait and potentially provides further mechanistic understanding of this phenotype in chickens and perhaps other species. The linkage analysis and GWAS for polydactyly was conducted using an F2 population derived from Beijing-You chickens and commercial broilers. The results identified two QTLs through linkage analysis and seven single-nucleotide polymorphisms (SNPs) through GWAS, associated with the polydactyly trait. One QTL located at 35 cM on the GGA2 was significant at the 1% genome-wise level and another QTL at the 1% chromosome-wide significance level was detected at 39 cM on GGA19. A total of seven SNPs, four of 5% genome-wide significance (P < 2.98 × 10(-6)) and three of suggestive significance (5.96 × 10(-5)) were identified, including two SNPs (GGaluGA132178 and Gga_rs14135036) in the QTL on GGA2. Of the identified SNPs, the eight nearest genes were sonic hedgehog (SHH), limb region 1 homolog (mouse) (LMBR1), dipeptidyl-peptidase 6, transcript variant 3 (DPP6), thyroid-stimulating hormone, beta (TSHB), sal-like 4 (Drosophila) (SALL4), par-6 partitioning defective 6 homolog beta (Caenorhabditis elegans) (PARD6B), coenzyme Q5 (COQ5), and tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein, etapolypeptide (YWHAH). The GWAS supports earlier reports of the importance of SHH and LMBR1 as regulating genes for polydactyly in chickens and other species, and identified others, most of which have not previously been associated with limb development. The genes and associated SNPs revealed here provide detailed information for further exploring the molecular and developmental mechanisms underlying polydactyly.

  15. The genome-wide transcriptional response to neonatal hyperoxia identifies Ahr as a key regulator

    PubMed Central

    Bhattacharya, Soumyaroop; Zhou, Zhongyang; Yee, Min; Chu, Chin-Yi; Lopez, Ashley M.; Lunger, Valerie A.; Solleti, Siva Kumar; Resseguie, Emily; Buczynski, Bradley; O'Reilly, Michael A.

    2014-01-01

    Premature infants requiring supplemental oxygen are at increased risk for developing bronchopulmonary dysplasia (BPD). Rodent models involving neonatal exposure to excessive oxygen concentrations (hyperoxia) have helped to identify mechanisms of BPD-associated pathology. Genome-wide assessments of the effects of hyperoxia in neonatal mouse lungs could identify novel BPD-related genes and pathways. Newborn C57BL/6 mice were exposed to 100% oxygen for 10 days, and whole lung tissue RNA was used for high-throughput, sequencing-based transcriptomic analysis (RNA-Seq). Significance Analysis of Microarrays and Ingenuity Pathway Analysis were used to identify genes and pathways affected. Expression patterns for selected genes were validated by qPCR. Mechanistic relationships between genes were further tested in cultured mouse lung epithelial cells. We identified 300 genes significantly and substantially affected following acute neonatal hyperoxia. Canonical pathways dysregulated in hyperoxia lungs included nuclear fctor (erythryoid-derived-2)-like 2-mediated oxidative stress signaling, p53 signaling, eNOS signaling, and aryl hydrocarbon receptor (Ahr) pathways. Cluster analysis identified Ccnd1, Cdkn1a, and Ahr as critical regulatory nodes in the response to hyperoxia, with Ahr serving as the major effector node. A mechanistic role for Ahr was assessed in lung epithelial cells, and we confirmed its ability to regulate the expression of multiple hyperoxia markers, including Cdkn1a, Pdgfrb, and A2m. We conclude that a global assessment of gene regulation in the acute neonatal hyperoxia model of BPD-like pathology has identified Ahr as one driver of gene dysregulation. PMID:25150061

  16. Genome-wide determination of transcription start sites reveals new insights into promoter structures in the actinomycete Corynebacterium glutamicum.

    PubMed

    Albersmeier, Andreas; Pfeifer-Sancar, Katharina; Rückert, Christian; Kalinowski, Jörn

    2017-09-10

    The genome-wide identification of transcription start sites, enabled by high-throughput sequencing of a cDNA library enriched for native 5' transcript ends, is ideally suited for the analysis of promoters. Here, the transcriptome of Corynebacterium glutamicum, a non-pathogenic soil bacterium from the actinomycetes branch that is used in industry for the production of amino acids, was analysed by transcriptome sequencing of the 5'-ends of native transcripts. Total RNA samples were harvested from the exponential phase of growth, therefore the study mainly addressed promoters recognized by the main house-keeping sigma factor σ(A). The identification of 2454 transcription start sites (TSS) allowed the detailed analysis of most promoters recognized by σ(A) and furthermore enabled us to form different promoter groups according to their location relative to protein-coding regions. These groups included leaderless transcripts (546 promoters), short-leadered (<500 bases) transcripts (917), and long-leadered (>500 bases) transcripts (173) as well as intragenic (557) and antisense transcripts (261). All promoters and the individual groups were searched for information, e.g. conserved residues and promoter motifs, and general design features as well as group-specific preferences were identified. A purine was found highly favored as TSS, whereas the -1 position was dominated by pyrimidines. The spacer between TSS and -10 region were consistently 6-7 bases and the -10 promoter motif was generally visible, whereas a recognizable -35 region was only occurring in a smaller fraction of promoters (7.5%) and enriched for leadered and antisense transcripts but depleted for leaderless transcripts. Promoters showing an extended -10 region were especially frequent in case of non-canonical -10 motifs (45.5%). Two bases downstream of the -10 core region, a G was conserved, exceeding 40% abundance in most groups. This fraction reached 74.6% for a group of putative σ(B)-dependent promoters

  17. Genome-Wide Binding of Posterior HOXA/D Transcription Factors Reveals Subgrouping and Association with CTCF

    PubMed Central

    Haas, Stefan; Janetzki, Catrin; González Navarrete, Irene; Mundlos, Stefan

    2017-01-01

    Homeotic genes code for key transcription factors (HOX-TFs) that pattern the animal body plan. During embryonic development, Hox genes are expressed in overlapping patterns and function in a partially redundant manner. In vitro biochemical screens probing the HOX-TF sequence specificity revealed largely overlapping sequence preferences, indicating that co-factors might modulate the biological function of HOX-TFs. However, due to their overlapping expression pattern, high protein homology, and insufficiently specific antibodies, little is known about their genome-wide binding preferences. In order to overcome this problem, we virally expressed tagged versions of limb-expressed posterior HOX genes (HOXA9-13, and HOXD9-13) in primary chicken mesenchymal limb progenitor cells (micromass). We determined the effect of each HOX-TF on cellular differentiation (chondrogenesis) and gene expression and found that groups of HOX-TFs induce distinct regulatory programs. We used ChIP-seq to determine their individual genome-wide binding profiles and identified between 12,721 and 28,572 binding sites for each of the nine HOX-TFs. Principal Component Analysis (PCA) of binding profiles revealed that the HOX-TFs are clustered in two subgroups (Group 1: HOXA/D9, HOXA/D10, HOXD12, and HOXA13 and Group 2: HOXA/D11 and HOXD13), which are characterized by differences in their sequence specificity and by the presence of cofactor motifs. Specifically, we identified CTCF binding sites in Group 1, indicating that this subgroup of HOX-proteins cooperates with CTCF. We confirmed this interaction by an independent biological assay (Proximity Ligation Assay) and demonstrated that CTCF is a novel HOX cofactor that specifically associates with Group 1 HOX-TFs, pointing towards a possible interplay between HOX-TFs and chromatin architecture. PMID:28103242

  18. Genome-Wide Identification and Expression Analysis of Calcium-dependent Protein Kinase in Tomato

    PubMed Central

    Hu, Zhangjian; Lv, Xiangzhang; Xia, Xiaojian; Zhou, Jie; Shi, Kai; Yu, Jingquan; Zhou, Yanhong

    2016-01-01

    Calcium-dependent protein kinases (CDPKs) play critical roles in regulating growth, development and stress response in plants. Information about CDPKs in tomato, however, remains obscure although it is one of the most important model crops in the world. In this study, we performed a bioinformatics analysis of the entire tomato genome and identified 29 CDPK genes. These CDPK genes are found to be located in 12 chromosomes, and could be divided into four groups. Analysis of the gene structure and splicing site reflected high structure conservation within different CDPK gene groups both in the exon-intron pattern and mRNA splicing. Transcripts of most CDPK genes varied with plant organs and developmental stages and their transcripts could be differentially induced by abscisic acid (ABA), brassinosteroids (BRs), methyl jasmonate (MeJA), and salicylic acid (SA), as well as after exposure to heat, cold, and drought, respectively. To our knowledge, this is the first report about the genome-wide analysis of the CDPK gene family in tomato, and the findings obtained offer a clue to the elaborated regulatory role of CDPKs in plant growth, development and stress response in tomato. PMID:27092168

  19. Genome-wide mRNA processing in methanogenic archaea reveals post-transcriptional regulation of ribosomal protein synthesis

    PubMed Central

    Qi, Lei; Yue, Lei; Feng, Deqin; Qi, Fengxia

    2017-01-01

    Abstract Unlike stable RNAs that require processing for maturation, prokaryotic cellular mRNAs generally follow an ‘all-or-none’ pattern. Herein, we used a 5΄ monophosphate transcript sequencing (5΄P-seq) that specifically captured the 5΄-end of processed transcripts and mapped the genome-wide RNA processing sites (PSSs) in a methanogenic archaeon. Following statistical analysis and stringent filtration, we identified 1429 PSSs, among which 23.5% and 5.4% were located in 5΄ untranslated region (uPSS) and intergenic region (iPSS), respectively. A predominant uridine downstream PSSs served as a processing signature. Remarkably, 5΄P-seq detected overrepresented uPSS and iPSS in the polycistronic operons encoding ribosomal proteins, and the majority upstream and proximal ribosome binding sites, suggesting a regulatory role of processing on translation initiation. The processed transcripts showed increased stability and translation efficiency. Particularly, processing within the tricistronic transcript of rplA-rplJ-rplL enhanced the translation of rplL, which can provide a driving force for the 1:4 stoichiometry of L10 to L12 in the ribosome. Growth-associated mRNA processing intensities were also correlated with the cellular ribosomal protein levels, thereby suggesting that mRNA processing is involved in tuning growth-dependent ribosome synthesis. In conclusion, our findings suggest that mRNA processing-mediated post-transcriptional regulation is a potential mechanism of ribosomal protein synthesis and stoichiometry. PMID:28520982

  20. Genome wide analysis of blood pressure variability and ischemic stroke

    PubMed Central

    Khan, Muhammad S; Nalls, Michael A; Bevan, Steve; Cheng, Yu-Ching; Chen, Wei-Min; Malik, Rainer; McCarthy, Nina S; Holliday, Elizabeth G; Speed, Douglas; Hasan, Nazeeha; Pucek, Mateusz; Rinne, Paul E.; Sever, Peter; Stanton, Alice; Shields, Denis C; Maguire, Jane M; McEvoy, Mark; Scott, Rodney J; Ferrucci, Luigi; Macleod, Mary J; Attia, John; Markus, Hugh S; Sale, Michele M; Worrall, Bradford B; Mitchell, Braxton D; Dichgans, Martin; Sudlow, Cathy; Meschia, James F; Rothwell, Peter M

    2013-01-01

    Background and Purpose Visit-to-visit variability in BP is associated with ischemic stroke. We sought to determine whether such variability has a genetic aetiology and whether genetic variants associated with BP variability are also associated with ischemic stroke. Methods A GWAS for loci influencing BP variability was undertaken in 3,802 individuals from the Anglo-Scandinavian Cardiac Outcome Trial (ASCOT) study where long-term visit-to-visit and within visit BP measures were available. Since BP variability is strongly associated with ischemic stroke, we genotyped the sentinel SNP in an independent ischemic stroke population comprising of 8,624 cases and 12,722 controls and in 3,900 additional (Scandinavian) participants from the ASCOT study in order to replicate our findings. Results The ASCOT discovery GWAS identified a cluster of 17 correlated SNPs within the NLGN1 gene (3q26.31) associated with BP variability. The strongest association was with rs976683 (p=1.4×10−8). Conditional analysis on rs976683 provided no evidence of additional independent associations at the locus. Analysis of rs976683 in ischemic stroke patients found no association for overall stroke (OR 1.02; 95% CI 0.97-1.07; p=0.52) or its sub-types: CE (OR 1.07; 95% CI 0.97-1.16; p=0.17), LVD (OR 0.98; 95% 0.89-1.07; p=0.60) and SVD (OR 1.07; 95% CI 0.97-1.17; p=0.19). No evidence for association was found between rs976683 and BP variability in the additional (Scandinavian) ASCOT participants (p=0.18). Conclusions We identified a cluster of SNPs at the NLGN1 locus showing significant association with BP variability. Follow up analyses did not support an association with risk of ischemic stroke and its subtypes. PMID:23929743

  1. Genome-wide transcriptional response of silkworm (Bombyx mori) to infection by the microsporidian Nosema bombycis.

    PubMed

    Ma, Zhengang; Li, Chunfeng; Pan, Guoqing; Li, Zhihong; Han, Bing; Xu, Jinshan; Lan, Xiqian; Chen, Jie; Yang, Donglin; Chen, Quanmei; Sang, Qi; Ji, Xiaocun; Li, Tian; Long, Mengxian; Zhou, Zeyang

    2013-01-01

    Microsporidia have attracted much attention because they infect a variety of species ranging from protists to mammals, including immunocompromised patients with AIDS or cancer. Aside from the study on Nosema ceranae, few works have focused on elucidating the mechanism in host response to microsporidia infection. Nosema bombycis is a pathogen of silkworm pébrine that causes great economic losses to the silkworm industry. Detailed understanding of the host (Bombyx mori) response to infection by N. bombycis is helpful for prevention of this disease. A genome-wide survey of the gene expression profile at 2, 4, 6 and 8 days post-infection by N. bombycis was performed and results showed that 64, 244, 1,328, 1,887 genes were induced, respectively. Up to 124 genes, which are involved in basal metabolism pathways, were modulated. Notably, B. mori genes that play a role in juvenile hormone synthesis and metabolism pathways were induced, suggesting that the host may accumulate JH as a response to infection. Interestingly, N. bombycis can inhibit the silkworm serine protease cascade melanization pathway in hemolymph, which may be due to the secretion of serpins in the microsporidia. N. bombycis also induced up-regulation of several cellular immune factors, in which CTL11 has been suggested to be involved in both spore recognition and immune signal transduction. Microarray and real-time PCR analysis indicated the activation of silkworm Toll and JAK/STAT pathways. The notable up-regulation of antimicrobial peptides, including gloverins, lebocins and moricins, strongly indicated that antimicrobial peptide defense mechanisms were triggered to resist the invasive microsporidia. An analysis of N. bombycis-specific response factors suggested their important roles in anti-microsporidia defense. Overall, this study primarily provides insight into the potential molecular mechanisms for the host-parasite interaction between B. mori and N. bombycis and may provide a foundation for

  2. Genome-Wide Transcriptional Response of Silkworm (Bombyx mori) to Infection by the Microsporidian Nosema bombycis

    PubMed Central

    Pan, Guoqing; Li, Zhihong; Han, Bing; Xu, Jinshan; Lan, Xiqian; Chen, Jie; Yang, Donglin; Chen, Quanmei; Sang, Qi; Ji, Xiaocun; Li, Tian; Long, Mengxian; Zhou, Zeyang

    2013-01-01

    Microsporidia have attracted much attention because they infect a variety of species ranging from protists to mammals, including immunocompromised patients with AIDS or cancer. Aside from the study on Nosema ceranae, few works have focused on elucidating the mechanism in host response to microsporidia infection. Nosema bombycis is a pathogen of silkworm pébrine that causes great economic losses to the silkworm industry. Detailed understanding of the host (Bombyx mori) response to infection by N. bombycis is helpful for prevention of this disease. A genome-wide survey of the gene expression profile at 2, 4, 6 and 8 days post-infection by N. bombycis was performed and results showed that 64, 244, 1,328, 1,887 genes were induced, respectively. Up to 124 genes, which are involved in basal metabolism pathways, were modulated. Notably, B. mori genes that play a role in juvenile hormone synthesis and metabolism pathways were induced, suggesting that the host may accumulate JH as a response to infection. Interestingly, N. bombycis can inhibit the silkworm serine protease cascade melanization pathway in hemolymph, which may be due to the secretion of serpins in the microsporidia. N. bombycis also induced up-regulation of several cellular immune factors, in which CTL11 has been suggested to be involved in both spore recognition and immune signal transduction. Microarray and real-time PCR analysis indicated the activation of silkworm Toll and JAK/STAT pathways. The notable up-regulation of antimicrobial peptides, including gloverins, lebocins and moricins, strongly indicated that antimicrobial peptide defense mechanisms were triggered to resist the invasive microsporidia. An analysis of N. bombycis-specific response factors suggested their important roles in anti-microsporidia defense. Overall, this study primarily provides insight into the potential molecular mechanisms for the host-parasite interaction between B. mori and N. bombycis and may provide a foundation for

  3. Identification of Genetic Susceptibility Loci for Colorectal Tumors in a Genome-wide Meta-analysis

    PubMed Central

    Peters, Ulrike; Jiao, Shuo; Schumacher, Fredrick R.; Hutter, Carolyn M.; Aragaki, Aaron K.; Baron, John A.; Berndt, Sonja I.; Bézieau, Stéphane; Brenner, Hermann; Butterbach, Katja; Caan, Bette J.; Campbell, Peter T.; Carlson, Christopher S.; Casey, Graham; Chan, Andrew T.; Chang-Claude, Jenny; Chanock, Stephen J.; Chen, Lin S.; Coetzee, Gerhard A.; Coetzee, Simon G.; Conti, David V.; Curtis, Keith R.; Duggan, David; Edwards, Todd; Fuchs, Charles S.; Gallinger, Steven; Giovannucci, Edward L.; Gogarten, Stephanie M.; Gruber, Stephen B.; Haile, Robert W.; Harrison, Tabitha A.; Hayes, Richard B.; Henderson, Brian E.; Hoffmeister, Michael; Hopper, John L.; Hudson, Thomas J.; Hunter, David J.; Jackson, Rebecca D.; Jee, Sun Ha; Jenkins, Mark A.; Jia, Wei-Hua; Kolonel, Laurence N.; Kooperberg, Charles; Küry, Sébastien; Lacroix, Andrea Z.; Laurie, Cathy C.; Laurie, Cecelia A.; Le Marchand, Loic; Lemire, Mathieu; Levine, David; Lindor, Noralane M.; Liu, Yan; Ma, Jing; Makar, Karen W.; Matsuo, Keitaro; Newcomb, Polly A.; Potter, John D.; Prentice, Ross L.; Qu, Conghui; Rohan, Thomas; Rosse, Stephanie A.; Schoen, Robert E.; Seminara, Daniela; Shrubsole, Martha; Shu, Xiao-Ou; Slattery, Martha L.; Taverna, Darin; Thibodeau, Stephen N.; Ulrich, Cornelia M.; White, Emily; Xiang, Yongbing; Zanke, Brent W.; Zeng, Yi-Xin; Zhang, Ben; Zheng, Wei; Hsu, Li

    2013-01-01

    BACKGROUND & AIMS Heritable factors contribute to the development of colorectal cancer. Identifying the genetic loci associated with colorectal tumor formation could elucidate the mechanisms of pathogenesis. METHODS We conducted a genome-wide association study that included 14 studies, 12,696 cases of colorectal tumors (11,870 cancer, 826 adenoma), and 15,113 controls of European descent. The 10 most statistically significant, previously unreported findings were followed up in 6 studies; these included 3056 colorectal tumor cases (2098 cancer, 958 adenoma) and 6658 controls of European and Asian descent. RESULTS Based on the combined analysis, we identified a locus that reached the conventional genome-wide significance level at less than 5.0 × 10−8: an intergenic region on chromosome 2q32.3, close to nucleic acid binding protein 1 (most significant single nucleotide polymorphism: rs11903757; odds ratio [OR], 1.15 per risk allele; P = 3.7 × 10−8). We also found evidence for 3 additional loci with P values less than 5.0 × 10−7: a locus within the laminin gamma 1 gene on chromosome 1q25.3 (rs10911251; OR, 1.10 per risk allele; P = 9.5 × 10−8), a locus within the cyclin D2 gene on chromosome 12p13.32 (rs3217810 per risk allele; OR, 0.84; P = 5.9 × 10−8), and a locus in the T-box 3 gene on chromosome 12q24.21 (rs59336; OR, 0.91 per risk allele; P = 3.7 × 10−7). CONCLUSIONS In a large genome-wide association study, we associated polymorphisms close to nucleic acid binding protein 1 (which encodes a DNA-binding protein involved in DNA repair) with colorectal tumor risk. We also provided evidence for an association between colorectal tumor risk and polymorphisms in laminin gamma 1 (this is the second gene in the laminin family to be associated with colorectal cancers), cyclin D2 (which encodes for cyclin D2), and T-box 3 (which encodes a T-box transcription factor and is a target of Wnt signaling to β-catenin). The roles of these genes and their products

  4. Identification of Genetic Susceptibility Loci for Colorectal Tumors in a Genome-Wide Meta-analysis.

    PubMed

    Peters, Ulrike; Jiao, Shuo; Schumacher, Fredrick R; Hutter, Carolyn M; Aragaki, Aaron K; Baron, John A; Berndt, Sonja I; Bézieau, Stéphane; Brenner, Hermann; Butterbach, Katja; Caan, Bette J; Campbell, Peter T; Carlson, Christopher S; Casey, Graham; Chan, Andrew T; Chang-Claude, Jenny; Chanock, Stephen J; Chen, Lin S; Coetzee, Gerhard A; Coetzee, Simon G; Conti, David V; Curtis, Keith R; Duggan, David; Edwards, Todd; Fuchs, Charles S; Gallinger, Steven; Giovannucci, Edward L; Gogarten, Stephanie M; Gruber, Stephen B; Haile, Robert W; Harrison, Tabitha A; Hayes, Richard B; Henderson, Brian E; Hoffmeister, Michael; Hopper, John L; Hudson, Thomas J; Hunter, David J; Jackson, Rebecca D; Jee, Sun Ha; Jenkins, Mark A; Jia, Wei-Hua; Kolonel, Laurence N; Kooperberg, Charles; Küry, Sébastien; Lacroix, Andrea Z; Laurie, Cathy C; Laurie, Cecelia A; Le Marchand, Loic; Lemire, Mathieu; Levine, David; Lindor, Noralane M; Liu, Yan; Ma, Jing; Makar, Karen W; Matsuo, Keitaro; Newcomb, Polly A; Potter, John D; Prentice, Ross L; Qu, Conghui; Rohan, Thomas; Rosse, Stephanie A; Schoen, Robert E; Seminara, Daniela; Shrubsole, Martha; Shu, Xiao-Ou; Slattery, Martha L; Taverna, Darin; Thibodeau, Stephen N; Ulrich, Cornelia M; White, Emily; Xiang, Yongbing; Zanke, Brent W; Zeng, Yi-Xin; Zhang, Ben; Zheng, Wei; Hsu, Li

    2013-04-01

    Heritable factors contribute to the development of colorectal cancer. Identifying the genetic loci associated with colorectal tumor formation could elucidate the mechanisms of pathogenesis. We conducted a genome-wide association study that included 14 studies, 12,696 cases of colorectal tumors (11,870 cancer, 826 adenoma), and 15,113 controls of European descent. The 10 most statistically significant, previously unreported findings were followed up in 6 studies; these included 3056 colorectal tumor cases (2098 cancer, 958 adenoma) and 6658 controls of European and Asian descent. Based on the combined analysis, we identified a locus that reached the conventional genome-wide significance level at less than 5.0 × 10(-8): an intergenic region on chromosome 2q32.3, close to nucleic acid binding protein 1 (most significant single nucleotide polymorphism: rs11903757; odds ratio [OR], 1.15 per risk allele; P = 3.7 × 10(-8)). We also found evidence for 3 additional loci with P values less than 5.0 × 10(-7): a locus within the laminin gamma 1 gene on chromosome 1q25.3 (rs10911251; OR, 1.10 per risk allele; P = 9.5 × 10(-8)), a locus within the cyclin D2 gene on chromosome 12p13.32 (rs3217810 per risk allele; OR, 0.84; P = 5.9 × 10(-8)), and a locus in the T-box 3 gene on chromosome 12q24.21 (rs59336; OR, 0.91 per risk allele; P = 3.7 × 10(-7)). In a large genome-wide association study, we associated polymorphisms close to nucleic acid binding protein 1 (which encodes a DNA-binding protein involved in DNA repair) with colorectal tumor risk. We also provided evidence for an association between colorectal tumor risk and polymorphisms in laminin gamma 1 (this is the second gene in the laminin family to be associated with colorectal cancers), cyclin D2 (which encodes for cyclin D2), and T-box 3 (which encodes a T-box transcription factor and is a target of Wnt signaling to β-catenin). The roles of these genes and their products in cancer pathogenesis warrant further

  5. Genome-wide identification and characterization of reference genes with different transcript abundances for Streptomyces coelicolor

    PubMed Central

    Li, Shanshan; Wang, Weishan; Li, Xiao; Fan, Keqiang; Yang, Keqian

    2015-01-01

    The lack of reliable reference genes (RGs) in the genus Streptomyces hampers effort to obtain the precise data of transcript levels. To address this issue, we aimed to identify reliable RGs in the model organism Streptomyces coelicolor. A pool of potential RGs containing 1,471 genes was first identified by determining the intersection of genes with stable transcript levels from four time-series transcriptome microarray datasets of S. coelicolor M145 cultivated in different conditions. Then, following a strict rational selection scheme including homology analysis, disturbance analysis, function analysis and transcript abundance analysis, 13 candidates were selected from the 1,471 genes. Based on real-time quantitative reverse transcription PCR assays, SCO0710, SCO6185, SCO1544, SCO3183 and SCO4758 were identified as the top five genes with the most stable transcript levels among the 13 candidates. Further analyses showed these five genes also maintained stable transcript levels in different S. coelicolor strains, as well as in Streptomyces avermitilis MA-4680 and Streptomyces clavuligerus NRRL 3585, suggesting they could fulfill the requirements of accurate data normalization in streptomycetes. Moreover, the systematic strategy employed in this work could be used for reference in other microorganism to select reliable RGs. PMID:26527303

  6. Cyanobacterial daily life with Kai-based circadian and diurnal genome-wide transcriptional control in Synechococcus elongatus.

    PubMed

    Ito, Hiroshi; Mutsuda, Michinori; Murayama, Yoriko; Tomita, Jun; Hosokawa, Norimune; Terauchi, Kazuki; Sugita, Chieko; Sugita, Mamoru; Kondo, Takao; Iwasaki, Hideo

    2009-08-18

    In the unicellular cyanobacterium Synechococcus elongatus PCC 7942, essentially all promoter activities are under the control of the circadian clock under continuous light (LL) conditions. Here, we used high-density oligonucleotide arrays to explore comprehensive profiles of genome-wide Synechococcus gene expression in wild-type, kaiABC-null, and kaiC-overexpressor strains under LL and continuous dark (DD) conditions. In the wild-type strains, >30% of transcripts oscillated significantly in a circadian fashion, peaking at subjective dawn and dusk. Such circadian control was severely attenuated in kaiABC-null strains. Although it has been proposed that KaiC globally represses gene expression, our analysis revealed that dawn-expressed genes were up-regulated by kaiC-overexpression so that the clock was arrested at subjective dawn. Transfer of cells to DD conditions from LL immediately suppressed expression of most of the genes, while the clock kept even time in the absence of transcriptional feedback. Thus, the Synechococcus genome seems to be primarily regulated by light/dark cycles and is dramatically modified by the protein-based circadian oscillator.

  7. Reconstructing genome-wide regulatory network of E. coli using transcriptome data and predicted transcription factor activities

    PubMed Central

    2011-01-01

    Background Gene regulatory networks play essential roles in living organisms to control growth, keep internal metabolism running and respond to external environmental changes. Understanding the connections and the activity levels of regulators is important for the research of gene regulatory networks. While relevance score based algorithms that reconstruct gene regulatory networks from transcriptome data can infer genome-wide gene regulatory networks, they are unfortunately prone to false positive results. Transcription factor activities (TFAs) quantitatively reflect the ability of the transcription factor to regulate target genes. However, classic relevance score based gene regulatory network reconstruction algorithms use models do not include the TFA layer, thus missing a key regulatory element. Results This work integrates TFA prediction algorithms with relevance score based network reconstruction algorithms to reconstruct gene regulatory networks with improved accuracy over classic relevance score based algorithms. This method is called Gene expression and Transcription factor activity based Relevance Network (GTRNetwork). Different combinations of TFA prediction algorithms and relevance score functions have been applied to find the most efficient combination. When the integrated GTRNetwork method was applied to E. coli data, the reconstructed genome-wide gene regulatory network predicted 381 new regulatory links. This reconstructed gene regulatory network including the predicted new regulatory links show promising biological significances. Many of the new links are verified by known TF binding site information, and many other links can be verified from the literature and databases such as EcoCyc. The reconstructed gene regulatory network is applied to a recent transcriptome analysis of E. coli during isobutanol stress. In addition to the 16 significantly changed TFAs detected in the original paper, another 7 significantly changed TFAs have been detected by

  8. Genome-wide identification and phylogenetic analysis of the ERF gene family in cucumbers

    PubMed Central

    Hu, Lifang; Liu, Shiqiang

    2011-01-01

    Members of the ERF transcription-factor family participate in a number of biological processes, viz., responses to hormones, adaptation to biotic and abiotic stress, metabolism regulation, beneficial symbiotic interactions, cell differentiation and developmental processes. So far, no tissue-expression profile of any cucumber ERF protein has been reported in detail. Recent completion of the cucumber full-genome sequence has come to facilitate, not only genome-wide analysis of ERF family members in cucumbers themselves, but also a comparative analysis with those in Arabidopsis and rice. In this study, 103 hypothetical ERF family genes in the cucumber genome were identified, phylogenetic analysis indicating their classification into 10 groups, designated I to X. Motif analysis further indicated that most of the conserved motifs outside the AP2/ERF domain, are selectively distributed among the specific clades in the phylogenetic tree. From chromosomal localization and genome distribution analysis, it appears that tandem-duplication may have contributed to CsERF gene expansion. Intron/exon structure analysis indicated that a few CsERFs still conserved the former intron-position patterns existent in the common ancestor of monocots and eudicots. Expression analysis revealed the widespread distribution of the cucumber ERF gene family within plant tissues, thereby implying the probability of their performing various roles therein. Furthermore, members of some groups presented mutually similar expression patterns that might be related to their phylogenetic groups. PMID:22215967

  9. Genome-Wide Identification and Expression Analysis of WRKY Gene Family in Capsicum annuum L.

    PubMed

    Diao, Wei-Ping; Snyder, John C; Wang, Shu-Bin; Liu, Jin-Bing; Pan, Bao-Gui; Guo, Guang-Jun; Wei, Ge

    2016-01-01

    The WRKY family of transcription factors is one of the most important families of plant transcriptional regulators with members regulating multiple biological processes, especially in regulating defense against biotic and abiotic stresses. However, little information is available about WRKYs in pepper (Capsicum annuum L.). The recent release of completely assembled genome sequences of pepper allowed us to perform a genome-wide investigation for pepper WRKY proteins. In the present study, a total of 71 WRKY genes were identified in the pepper genome. According to structural features of their encoded proteins, the pepper WRKY genes (CaWRKY) were classified into three main groups, with the second group further divided into five subgroups. Genome mapping analysis revealed that CaWRKY were enriched on four chromosomes, especially on chromosome 1, and 15.5% of the family members were tandemly duplicated genes. A phylogenetic tree was constructed depending on WRKY domain' sequences derived from pepper and Arabidopsis. The expression of 21 selected CaWRKY genes in response to seven different biotic and abiotic stresses (salt, heat shock, drought, Phytophtora capsici, SA, MeJA, and ABA) was evaluated by quantitative RT-PCR; Some CaWRKYs were highly expressed and up-regulated by stress treatment. Our results will provide a platform for functional identification and molecular breeding studies of WRKY genes in pepper.

  10. Genome-Wide Identification and Expression Analysis of WRKY Gene Family in Capsicum annuum L.

    PubMed Central

    Diao, Wei-Ping; Snyder, John C.; Wang, Shu-Bin; Liu, Jin-Bing; Pan, Bao-Gui; Guo, Guang-Jun; Wei, Ge

    2016-01-01

    The WRKY family of transcription factors is one of the most important families of plant transcriptional regulators with members regulating multiple biological processes, especially in regulating defense against biotic and abiotic stresses. However, little information is available about WRKYs in pepper (Capsicum annuum L.). The recent release of completely assembled genome sequences of pepper allowed us to perform a genome-wide investigation for pepper WRKY proteins. In the present study, a total of 71 WRKY genes were identified in the pepper genome. According to structural features of their encoded proteins, the pepper WRKY genes (CaWRKY) were classified into three main groups, with the second group further divided into five subgroups. Genome mapping analysis revealed that CaWRKY were enriched on four chromosomes, especially on chromosome 1, and 15.5% of the family members were tandemly duplicated genes. A phylogenetic tree was constructed depending on WRKY domain' sequences derived from pepper and Arabidopsis. The expression of 21 selected CaWRKY genes in response to seven different biotic and abiotic stresses (salt, heat shock, drought, Phytophtora capsici, SA, MeJA, and ABA) was evaluated by quantitative RT-PCR; Some CaWRKYs were highly expressed and up-regulated by stress treatment. Our results will provide a platform for functional identification and molecular breeding studies of WRKY genes in pepper. PMID:26941768

  11. Genome-Wide Organization and Expression Profiling of the NAC Transcription Factor Family in Potato (Solanum tuberosum L.)

    PubMed Central

    Singh, Anil Kumar; Sharma, Vishal; Pal, Awadhesh Kumar; Acharya, Vishal; Ahuja, Paramvir Singh

    2013-01-01

    NAC [no apical meristem (NAM), Arabidopsis thaliana transcription activation factor [ATAF1/2] and cup-shaped cotyledon (CUC2)] proteins belong to one of the largest plant-specific transcription factor (TF) families and play important roles in plant development processes, response to biotic and abiotic cues and hormone signalling. Our genome-wide analysis identified 110 StNAC genes in potato encoding for 136 proteins, including 14 membrane-bound TFs. The physical map positions of StNAC genes on 12 potato chromosomes were non-random, and 40 genes were found to be distributed in 16 clusters. The StNAC proteins were phylogenetically clustered into 12 subgroups. Phylogenetic analysis of StNACs along with their Arabidopsis and rice counterparts divided these proteins into 18 subgroups. Our comparative analysis has also identified 36 putative TNAC proteins, which appear to be restricted to Solanaceae family. In silico expression analysis, using Illumina RNA-seq transcriptome data, revealed tissue-specific, biotic, abiotic stress and hormone-responsive expression profile of StNAC genes. Several StNAC genes, including StNAC072 and StNAC101that are orthologs of known stress-responsive Arabidopsis RESPONSIVE TO DEHYDRATION 26 (RD26) were identified as highly abiotic stress responsive. Quantitative real-time polymerase chain reaction analysis largely corroborated the expression profile of StNAC genes as revealed by the RNA-seq data. Taken together, this analysis indicates towards putative functions of several StNAC TFs, which will provide blue-print for their functional characterization and utilization in potato improvement. PMID:23649897

  12. Genome-wide organization and expression profiling of the NAC transcription factor family in potato (Solanum tuberosum L.).

    PubMed

    Singh, Anil Kumar; Sharma, Vishal; Pal, Awadhesh Kumar; Acharya, Vishal; Ahuja, Paramvir Singh

    2013-08-01

    NAC [no apical meristem (NAM), Arabidopsis thaliana transcription activation factor [ATAF1/2] and cup-shaped cotyledon (CUC2)] proteins belong to one of the largest plant-specific transcription factor (TF) families and play important roles in plant development processes, response to biotic and abiotic cues and hormone signalling. Our genome-wide analysis identified 110 StNAC genes in potato encoding for 136 proteins, including 14 membrane-bound TFs. The physical map positions of StNAC genes on 12 potato chromosomes were non-random, and 40 genes were found to be distributed in 16 clusters. The StNAC proteins were phylogenetically clustered into 12 subgroups. Phylogenetic analysis of StNACs along with their Arabidopsis and rice counterparts divided these proteins into 18 subgroups. Our comparative analysis has also identified 36 putative TNAC proteins, which appear to be restricted to Solanaceae family. In silico expression analysis, using Illumina RNA-seq transcriptome data, revealed tissue-specific, biotic, abiotic stress and hormone-responsive expression profile of StNAC genes. Several StNAC genes, including StNAC072 and StNAC101that are orthologs of known stress-responsive Arabidopsis RESPONSIVE TO DEHYDRATION 26 (RD26) were identified as highly abiotic stress responsive. Quantitative real-time polymerase chain reaction analysis largely corroborated the expression profile of StNAC genes as revealed by the RNA-seq data. Taken together, this analysis indicates towards putative functions of several StNAC TFs, which will provide blue-print for their functional characterization and utilization in potato improvement.

  13. Sample size and power analysis for sparse signal recovery in genome-wide association studies

    PubMed Central

    Xie, Jichun; Cai, T. Tony; Li, Hongzhe

    2011-01-01

    Genome-wide association studies have successfully identified hundreds of novel genetic variants associated with many complex human diseases. However, there is a lack of rigorous work on evaluating the statistical power for identifying these variants. In this paper, we consider sparse signal identification in genome-wide association studies and present two analytical frameworks for detailed analysis of the statistical power for detecting and identifying the disease-associated variants. We present an explicit sample size formula for achieving a given false non-discovery rate while controlling the false discovery rate based on an optimal procedure. Sparse genetic variant recovery is also considered and a boundary condition is established in terms of sparsity and signal strength for almost exact recovery of both disease-associated variants and nondisease-associated variants. A data-adaptive procedure is proposed to achieve this bound. The analytical results are illustrated with a genome-wide association study of neuroblastoma. PMID:23049128

  14. Genome-wide efficient mixed-model analysis for association studies.

    PubMed

    Zhou, Xiang; Stephens, Matthew

    2012-06-17

    Linear mixed models have attracted considerable attention recently as a powerful and effective tool for accounting for population stratification and relatedness in genetic association tests. However, existing methods for exact computation of standard test statistics are computationally impractical for even moderate-sized genome-wide association studies. To address this issue, several approximate methods have been proposed. Here, we present an efficient exact method, which we refer to as genome-wide efficient mixed-model association (GEMMA), that makes approximations unnecessary in many contexts. This method is approximately n times faster than the widely used exact method known as efficient mixed-model association (EMMA), where n is the sample size, making exact genome-wide association analysis computationally practical for large numbers of individuals.

  15. Genome-wide Identification and Structural, Functional and Evolutionary Analysis of WRKY Components of Mulberry

    PubMed Central

    Baranwal, Vinay Kumar; Negi, Nisha; Khurana, Paramjit

    2016-01-01

    Mulberry is known to be sensitive to several biotic and abiotic stresses, which in turn have a direct impact on the yield of silk, because it is the sole food source for the silk worm. WRKYs are a family of transcription factors, which play an important role in combating various biotic and abiotic stresses. In this study, we identified 54 genes with conserved WRKY motifs in the Morus notabilis genome. Motif searches coupled with a phylogenetic analysis revealed seven sub-groups as well as the absence of members of Group Ib in mulberry. Analyses of the 2K upstream region in addition to a gene ontology terms enrichment analysis revealed putative functions of mulberry WRKYs under biotic and abiotic stresses. An RNA-seq-based analysis showed that several of the identified WRKYs have shown preferential expression in the leaf, bark, root, male flower, and winter bud of M. notabilis. Finally, expression analysis by qPCR under different stress and hormone treatments revealed genotype-specific responses. Taken together, our results briefs about the genome-wide identification of WRKYs as well as their differential response to stresses and hormones. Importantly, these data can also be utilized to identify potential molecular targets for conferring tolerance to various stresses in mulberry. PMID:27477686

  16. Genome-wide Identification and Structural, Functional and Evolutionary Analysis of WRKY Components of Mulberry.

    PubMed

    Baranwal, Vinay Kumar; Negi, Nisha; Khurana, Paramjit

    2016-08-01

    Mulberry is known to be sensitive to several biotic and abiotic stresses, which in turn have a direct impact on the yield of silk, because it is the sole food source for the silk worm. WRKYs are a family of transcription factors, which play an important role in combating various biotic and abiotic stresses. In this study, we identified 54 genes with conserved WRKY motifs in the Morus notabilis genome. Motif searches coupled with a phylogenetic analysis revealed seven sub-groups as well as the absence of members of Group Ib in mulberry. Analyses of the 2K upstream region in addition to a gene ontology terms enrichment analysis revealed putative functions of mulberry WRKYs under biotic and abiotic stresses. An RNA-seq-based analysis showed that several of the identified WRKYs have shown preferential expression in the leaf, bark, root, male flower, and winter bud of M. notabilis. Finally, expression analysis by qPCR under different stress and hormone treatments revealed genotype-specific responses. Taken together, our results briefs about the genome-wide identification of WRKYs as well as their differential response to stresses and hormones. Importantly, these data can also be utilized to identify potential molecular targets for conferring tolerance to various stresses in mulberry.

  17. Genome-Wide Analysis of the Cyclin Gene Family in Tomato

    PubMed Central

    Zhang, Tingyan; Wang, Xin; Lu, Yongen; Cai, Xiaofeng; Ye, Zhibiao; Zhang, Junhong

    2014-01-01

    Cyclins play important roles in cell division and cell expansion. They also interact with cyclin-dependent kinases to control cell cycle progression in plants. Our genome-wide analysis identified 52 expressed cyclin genes in tomato. Phylogenetic analysis of the deduced amino sequences of tomato and Arabidopsis cyclin genes divided them into 10 types, A-, B-, C-, D-, H-, L-, T-, U-, SDS- and J18. Pfam analysis indicated that most tomato cyclins contain a cyclin-N domain. C-, H- and J18 types only contain a cyclin-C domain, and U-type cyclins contain another potential cyclin domain. All of the cyclin genes are distributed throughout the tomato genome except for chromosome 8, and 30 of them were found to be segmentally duplicated; they are found on the duplicate segments of chromosome 1, 2, 3, 4, 5, 6, 10, 11 and 12, suggesting that tomato cyclin genes experienced a mass of segmental duplication. Quantitative real-time polymerase chain reaction analysis indicates that the expression patterns of tomato cyclin genes were significantly different in vegetative and reproductive stages. Transcription of most cyclin genes can be enhanced or repressed by exogenous application of gibberellin, which implies that gibberellin maybe a direct regulator of cyclin genes. The study presented here may be useful as a guide for further functional research on tomato cyclins. PMID:24366066

  18. Genome-Wide Analysis of Branched-Chain Amino Acid Levels in Arabidopsis Seeds[W

    PubMed Central

    Angelovici, Ruthie; Lipka, Alexander E.; Deason, Nicholas; Gonzalez-Jorge, Sabrina; Lin, Haining; Cepela, Jason; Buell, Robin; Gore, Michael A.; DellaPenna, Dean

    2013-01-01

    Branched-chain amino acids (BCAAs) are three of the nine essential amino acids in human and animal diets and are important for numerous processes in development and growth. However, seed BCAA levels in major crops are insufficient to meet dietary requirements, making genetic improvement for increased and balanced seed BCAAs an important nutritional target. Addressing this issue requires a better understanding of the genetics underlying seed BCAA content and composition. Here, a genome-wide association study and haplotype analysis for seed BCAA traits in Arabidopsis thaliana revealed a strong association with a chromosomal interval containing two BRANCHED-CHAIN AMINO ACID TRANSFERASES, BCAT1 and BCAT2. Linkage analysis, reverse genetic approaches, and molecular complementation analysis demonstrated that allelic variation at BCAT2 is responsible for the natural variation of seed BCAAs in this interval. Complementation analysis of a bcat2 null mutant with two significantly different alleles from accessions Bayreuth-0 and Shahdara is consistent with BCAT2 contributing to natural variation in BCAA levels, glutamate recycling, and free amino acid homeostasis in seeds in an allele-dependent manner. The seed-specific phenotype of bcat2 null alleles, its strong transcription induction during late seed development, and its subcellular localization to the mitochondria are consistent with a unique, catabolic role for BCAT2 in BCAA metabolism in seeds. PMID:24368787

  19. Genome-wide identification and phylogenetic analysis of the SBP-box gene family in melons.

    PubMed

    Ma, Y; Guo, J W; Bade, R; Men, Z H; Hasi, A

    2014-10-27

    The SBP-box gene family is specific to plants and encodes a class of zinc finger-containing transcription factors with a broad range of functions. Although SBP-box genes have been identified in numerous plants, including green algae, moss, silver birch, snapdragon, Arabidopsis, rice, and maize, there is little information concerning SBP-box genes, or the corresponding miR156/157, function in melon. Using the highly conserved sequence of the Arabidopsis thaliana SBP-box domain protein as a probe of information sequence, the genome-wide protein database of melon was explored to obtain 13 SBP-box protein sequences, which were further divided into 4 groups, based on phylogenetic analysis. A further analysis centered on the melon SBP-box genetic family's phylogenetic evolution, sequence similarities, gene structure, and miR156 target sequence was also conducted. Analysis of all the expression patterns of melon SBP-box family genes showed that the SBP-box genes were detected in 7 kinds of tissue, and fruit had the highest expression level. CmSBP11 tends to present its specific expression in melon fruit and root. CmSBP09 expression was the highest in flower. Overall, the molecular evolution and expression pattern of the melon SBP-box gene family, revealed by these results, suggest its function differentiation that followed gene duplication.

  20. Genome-wide analysis of the MADS-box gene family in Brachypodium distachyon.

    PubMed

    Wei, Bo; Zhang, Rong-Zhi; Guo, Juan-Juan; Liu, Dan-Mei; Li, Ai-Li; Fan, Ren-Chun; Mao, Long; Zhang, Xiang-Qi

    2014-01-01

    MADS-box genes are important transcription factors for plant development, especially floral organogenesis. Brachypodium distachyon is a model for biofuel plants and temperate grasses such as wheat and barley, but a comprehensive analysis of MADS-box family proteins in Brachypodium is still missing. We report here a genome-wide analysis of the MADS-box gene family in Brachypodium distachyon. We identified 57 MADS-box genes and classified them into 32 MIKC(c)-type, 7 MIKC*-type, 9 Mα, 7 Mβ and 2 Mγ MADS-box genes according to their phylogenetic relationships to the Arabidopsis and rice MADS-box genes. Detailed gene structure and motif distribution were then studied. Investigation of their chromosomal localizations revealed that Brachypodium MADS-box genes distributed evenly across five chromosomes. In addition, five pairs of type II MADS-box genes were found on synteny blocks derived from whole genome duplication blocks. We then performed a systematic expression analysis of Brachypodium MADS-box genes in various tissues, particular floral organs. Further detection under salt, drought, and low-temperature conditions showed that some MADS-box genes may also be involved in abiotic stress responses, including type I genes. Comparative studies of MADS-box genes among Brachypodium, rice and Arabidopsis showed that Brachypodium had fewer gene duplication events. Taken together, this work provides useful data for further functional studies of MADS-box genes in Brachypodium distachyon.

  1. Genome-wide transcription profile of endothelial cells after cardiac transplantation in the rat.

    PubMed

    Mikalsen, B; Fosby, B; Wang, J; Hammarström, C; Bjaerke, H; Lundström, M; Kasprzycka, M; Scott, H; Line, P-D; Haraldsen, G

    2010-07-01

    Transcriptome analyses of organ transplants have until now usually focused on whole tissue samples containing activation profiles from different cell populations. Here, we enriched endothelial cells from rat cardiac allografts and isografts, establishing their activation profile at baseline and on days 2, 3 and 4 after transplantation. Modulated transcripts were assigned to three categories based on their regulation profile in allografts and isografts. Categories A and B contained the majority of transcripts and showed similar regulation in both graft types, appearing to represent responses to surgical trauma. By contrast, category C contained transcripts that were partly allograft-specific and to a large extent associated with interferon-gamma-responsiveness. Several transcripts were verified by immunohistochemical analysis of graft lesions, among them the matricellular protein periostin, which was one of the most highly upregulated transcripts but has not been associated with transplantation previously. In conclusion, the majority of the differentially expressed genes in graft endothelial cells are affected by the transplantation procedure whereas relatively few are associated with allograft rejection.

  2. Signatures of positive selection in East African Shorthorn Zebu: a genome-wide SNP analysis

    USDA-ARS?s Scientific Manuscript database

    The small East African Shorthorn Zebu is the main indigenous cattle across East Africa. A recent genome wide SNPs analysis has revealed their ancient stable African taurine x Asian zebu admixture. Here, we assess the presence of candidate signature of positive selection in their genome, with the aim...

  3. Methods for meta-analysis of genome-wide association studies

    USDA-ARS?s Scientific Manuscript database

    A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. For increasing N, results from different GWA can be combined in a meta-analysis (MA-...

  4. Meta-analysis of genome wide association studies for pork quality traits

    USDA-ARS?s Scientific Manuscript database

    Given the importance of pork quality in the meat processing industry, genome-wide association studies were performed for eight meat quality traits and also, a meta-analysis (MA) of GWA was implemented combining independent results from pig populations. Data from three pig datasets (USMARC, Commercia...

  5. Genome-wide Association Analysis of Kernel Weight in Hard Winter Wheat

    USDA-ARS?s Scientific Manuscript database

    Wheat kernel weight is an important and heritable component of wheat grain yield and a key predictor of flour extraction. Genome-wide association analysis was conducted to identify genomic regions associated with kernel weight and kernel weight environmental response in 8 trials of 299 hard winter ...

  6. Transcription factors expressed in olfactory bulb local progenitor cells revealed by genome-wide transcriptome profiling

    PubMed Central

    Campbell, Gordon R. O.; Baudhuin, Ariane; Vranizan, Karen; Ngai, John

    2011-01-01

    The local progenitor population in the olfactory bulb (OB) gives rise to mitral and tufted projection neurons during embryonic development. In contrast, OB interneurons are derived from sources outside the bulb where neurogenesis continues throughout life. While many of the genes involved in OB interneuron development have been characterized, the genetic pathways driving local progenitor cell differentiation in this tissue are largely unknown. To better understand this process, we used transcriptional profiling to monitor gene expression of whole OB at daily intervals from embryonic day 11 through birth, generating a compendium of gene expression encompassing the major developmental events of this tissue. Through hierarchical clustering, bioinformatics analysis, and validation by RNA in situ hybridizations, we identified a large number of transcription factors, DNA binding proteins, and cell cycle-related genes expressed by the local neural progenitor cells (NPCs) of the embryonic OB. Further in silico analysis of transcription factor binding sites identified an enrichment of genes regulated by the E2F-Rb pathway among those expressed in the local NPC population. Together these results provide initial insights into the molecular identity of the OB local NPC population and the transcription factor networks that may regulate their function. PMID:21194568

  7. Genome-wide analysis of thyroid hormone receptors shared and specific functions in neural cells

    PubMed Central

    Chatonnet, Fabrice; Guyot, Romain; Benoît, Gérard; Flamant, Frederic

    2013-01-01

    TRα1 and TRβ1, the two main thyroid hormone receptors in mammals, are transcription factors that share similar properties. However, their respective functions are very different. This functional divergence might be explained in two ways: it can reflect different expression patterns or result from different intrinsic properties of the receptors. We tested this second hypothesis by comparing the repertoires of 3,3′,5-triiodo-L-thyronine (T3)-responsive genes of two neural cell lines, expressing either TRα1 or TRβ1. Using transcriptome analysis, we found that a substantial fraction of the T3 target genes display a marked preference for one of the two receptors. So when placed alone in identical situations, the two receptors have different repertoires of target genes. Chromatin occupancy analysis, performed at a genome-wide scale, revealed that TRα1 and TRβ1 cistromes were also different. However, receptor-selective regulation of T3 target genes did not result from receptor-selective chromatin occupancy of their promoter regions. We conclude that modification of TRα1 and TRβ1 intrinsic properties contributes in a large part to the divergent evolution of the receptors’ function, at least during neurodevelopment. PMID:23382204

  8. Genome-wide common and rare variant analysis provides novel insights into clozapine-associated neutropenia

    PubMed Central

    Legge, Sophie E; Hamshere, Marian L; Ripke, Stephan; Pardinas, Antonio F; Goldstein, Jacqueline I; Rees, Elliott; Richards, Alexander L; Leonenko, Ganna; Jorskog, L Fredrik; Chambert, Kimberly D; Collier, David A; Genovese, Giulio; Giegling, Ina; Holmans, Peter; Jonasdottir, Adalbjorg; Kirov, George; McCarroll, Steven A; MacCabe, James H; Mantripragada, Kiran; Moran, Jennifer L; Neale, Benjamin M; Stefansson, Hreinn; Rujescu, Dan; Daly, Mark J; Sullivan, Patrick F; Owen, Michael J; O’Donovan, Michael C; Walters, James T R

    2016-01-01

    The antipsychotic clozapine is uniquely effective in the management of schizophrenia, but its use is limited by its potential to induce agranulocytosis. The causes of this, and of its precursor neutropenia, are largely unknown although genetic factors play an important role. We sought risk alleles for clozapine-associated neutropenia in a sample of 66 cases and 5583 clozapine-treated controls, through a genome-wide association study (GWAS), imputed HLA alleles, exome array, and copy number variation analyses. We then combined associated variants in a meta-analysis with data from the Clozapine-Induced Agranulocytosis Consortium (up to 163 cases and 7970 controls). In the largest combined sample to date, we identified a novel association with rs149104283 (OR=4.32, P=1.79×10-8), intronic to transcripts of SLCO1B3 and SLCO1B7, members of a family of hepatic transporter genes previously implicated in adverse drug reactions including simvastatin-induced myopathy and docetaxel-induced neutropenia. Exome array analysis identified gene-wide associations of uncommon non-synonymous variants within UBAP2 and STARD9. We additionally provide independent replication of a previously identified variant in HLA-DQB1 (OR=15.6, P = 0.015, positive predictive value = 35.1%). These results implicate biological pathways through which clozapine may act to cause this serious adverse effect. PMID:27400856

  9. Genome-Wide Analysis of the Lysine Biosynthesis Pathway Network during Maize Seed Development

    PubMed Central

    Liu, Yuwei; Xie, Shaojun; Yu, Jingjuan

    2016-01-01

    Lysine is one of the most limiting essential amino acids for humans and livestock. The nutritional value of maize (Zea mays L.) is reduced by its poor lysine content. To better understand the lysine biosynthesis pathway in maize seed, we conducted a genome-wide analysis of the genes involved in lysine biosynthesis. We identified lysine biosynthesis pathway genes (LBPGs) and investigated whether a diaminopimelate pathway variant exists in maize. We analyzed two genes encoding the key enzyme dihydrodipicolinate synthase, and determined that they contribute differently to lysine synthesis during maize seed development. A coexpression network of LBPGs was constructed using RNA-sequencing data from 21 developmental stages of B73 maize seed. We found a large set of genes encoding ribosomal proteins, elongation factors and zein proteins that were coexpressed with LBPGs. The coexpressed genes were enriched in cellular metabolism terms and protein related terms. A phylogenetic analysis of the LBPGs from different plant species revealed different relationships. Additionally, six transcription factor (TF) families containing 13 TFs were identified as the Hub TFs of the LBPGs modules. Several expression quantitative trait loci of LBPGs were also identified. Our results should help to elucidate the lysine biosynthesis pathway network in maize seed. PMID:26829553

  10. Genome-wide Identification of TCP Family Transcription Factors from Populus euphratica and Their Involvement in Leaf Shape Regulation.

    PubMed

    Ma, Xiaodong; Ma, Jianchao; Fan, Di; Li, Chaofeng; Jiang, Yuanzhong; Luo, Keming

    2016-09-08

    Higher plants have been shown to experience a juvenile vegetative phase, an adult vegetative phase, and a reproductive phase during its postembryonic development and distinct lateral organ morphologies have been observed at the different development stages. Populus euphratica, commonly known as a desert poplar, has developed heteromorphic leaves during its development. The TCP family genes encode a group of plant-specific transcription factors involved in several aspects of plant development. In particular, TCPs have been shown to influence leaf size and shape in many herbaceous plants. However, whether these functions are conserved in woody plants remains unknown. In the present study, we carried out genome-wide identification of TCP genes in P. euphratica and P. trichocarpa, and 33 and 36 genes encoding putative TCP proteins were found, respectively. Phylogenetic analysis of the poplar TCPs together with Arabidopsis TCPs indicated a biased expansion of the TCP gene family via segmental duplications. In addition, our results have also shown a correlation between different expression patterns of several P. euphratica TCP genes and leaf shape variations, indicating their involvement in the regulation of leaf shape development.

  11. Genome-wide impact of Androgen Receptor Trapped Clone-27 Loss on Androgen-regulated Transcription in Prostate Cancer Cells

    PubMed Central

    Nwachukwu, Jerome C.; Mita, Paolo; Ruoff, Rachel; Ha, Susan; Wang, Qianben; Huang, S. Joseph; Taneja, Samir S.; Brown, Myles; Gerald, William L.; Garabedian, Michael J.; Logan, Susan K.

    2009-01-01

    The Androgen Receptor (AR) directs diverse biological processes through interaction with coregulators such as Androgen Receptor Trapped clone-27 (ART-27). Our results demonstrate that ART-27 is recruited to AR-binding sites by ChIP analysis. In addition, the impact of ART-27 on genome wide transcription was examined. The studies indicate that loss of ART-27 enhances expression of many androgen-regulated genes, suggesting that ART-27 inhibits gene expression. Surprisingly, classes of genes that are upregulated upon ART-27 depletion include regulators of DNA damage checkpoint and cell cycle progression, suggesting that ART-27 functions to keep expression levels of these genes low. Consistent with this idea, stable reduction of ART-27 by shRNA enhances LNCaP cell proliferation compared to control cells. The impact of ART-27 loss was also examined in response to the antiandrogen, bicalutamide. Unexpectedly, cells treated with ART-27 siRNA no longer exhibited gene repression in response to bicalutamide. To examine ART-27 loss in prostate cancer progression, immunohistochemistry was conducted on a tissue array containing samples from primary tumors of individuals who were clinically followed and later shown to have either recurrent or non-recurrent disease. Comparison of ART-27 and AR staining indicated that nuclear ART-27 expression was lost in the majority of AR-positive recurrent prostate cancers. Our studies demonstrate that reduction of ART-27 protein levels in prostate cancer may facilitate antiandrogen resistant disease. PMID:19318562

  12. On the analysis of a repeated measure design in genome-wide association analysis.

    PubMed

    Lee, Young; Park, Suyeon; Moon, Sanghoon; Lee, Juyoung; Elston, Robert C; Lee, Woojoo; Won, Sungho

    2014-11-28

    Longitudinal data enables detecting the effect of aging/time, and as a repeated measures design is statistically more efficient compared to cross-sectional data if the correlations between repeated measurements are not large. In particular, when genotyping cost is more expensive than phenotyping cost, the collection of longitudinal data can be an efficient strategy for genetic association analysis. However, in spite of these advantages, genome-wide association studies (GWAS) with longitudinal data have rarely been analyzed taking this into account. In this report, we calculate the required sample size to achieve 80% power at the genome-wide significance level for both longitudinal and cross-sectional data, and compare their statistical efficiency. Furthermore, we analyzed the GWAS of eight phenotypes with three observations on each individual in the Korean Association Resource (KARE). A linear mixed model allowing for the correlations between observations for each individual was applied to analyze the longitudinal data, and linear regression was used to analyze the first observation on each individual as cross-sectional data. We found 12 novel genome-wide significant disease susceptibility loci that were then confirmed in the Health Examination cohort, as well as some significant interactions between age/sex and SNPs.

  13. Genome-wide transcriptome and expression profile analysis of Phalaenopsis during explant browning.

    PubMed

    Xu, Chuanjun; Zeng, Biyu; Huang, Junmei; Huang, Wen; Liu, Yumei

    2015-01-01

    Explant browning presents a major problem for in vitro culture, and can lead to the death of the explant and failure of regeneration. Considerable work has examined the physiological mechanisms underlying Phalaenopsis leaf explant browning, but the molecular mechanisms of browning remain elusive. In this study, we used whole genome RNA sequencing to examine Phalaenopsis leaf explant browning at genome-wide level. We first used Illumina high-throughput technology to sequence the transcriptome of Phalaenopsis and then performed de novo transcriptome assembly. We assembled 79,434,350 clean reads into 31,708 isogenes and generated 26,565 annotated unigenes. We assigned Gene Ontology (GO) terms, Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations, and potential Pfam domains to each transcript. Using the transcriptome data as a reference, we next analyzed the differential gene expression of explants cultured for 0, 3, and 6 d, respectively. We then identified differentially expressed genes (DEGs) before and after Phalaenopsis explant browning. We also performed GO, KEGG functional enrichment and Pfam analysis of all DEGs. Finally, we selected 11 genes for quantitative real-time PCR (qPCR) analysis to confirm the expression profile analysis. Here, we report the first comprehensive analysis of transcriptome and expression profiles during Phalaenopsis explant browning. Our results suggest that Phalaenopsis explant browning may be due in part to gene expression changes that affect the secondary metabolism, such as: phenylpropanoid pathway and flavonoid biosynthesis. Genes involved in photosynthesis and ATPase activity have been found to be changed at transcription level; these changes may perturb energy metabolism and thus lead to the decay of plant cells and tissues. This study provides comprehensive gene expression data for Phalaenopsis browning. Our data constitute an important resource for further functional studies to prevent explant browning.

  14. Genome-Wide Transcriptome and Expression Profile Analysis of Phalaenopsis during Explant Browning

    PubMed Central

    Xu, Chuanjun; Zeng, Biyu; Huang, Junmei; Huang, Wen; Liu, Yumei

    2015-01-01

    Background Explant browning presents a major problem for in vitro culture, and can lead to the death of the explant and failure of regeneration. Considerable work has examined the physiological mechanisms underlying Phalaenopsis leaf explant browning, but the molecular mechanisms of browning remain elusive. In this study, we used whole genome RNA sequencing to examine Phalaenopsis leaf explant browning at genome-wide level. Methodology/Principal Findings We first used Illumina high-throughput technology to sequence the transcriptome of Phalaenopsis and then performed de novo transcriptome assembly. We assembled 79,434,350 clean reads into 31,708 isogenes and generated 26,565 annotated unigenes. We assigned Gene Ontology (GO) terms, Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations, and potential Pfam domains to each transcript. Using the transcriptome data as a reference, we next analyzed the differential gene expression of explants cultured for 0, 3, and 6 d, respectively. We then identified differentially expressed genes (DEGs) before and after Phalaenopsis explant browning. We also performed GO, KEGG functional enrichment and Pfam analysis of all DEGs. Finally, we selected 11 genes for quantitative real-time PCR (qPCR) analysis to confirm the expression profile analysis. Conclusions/Significance Here, we report the first comprehensive analysis of transcriptome and expression profiles during Phalaenopsis explant browning. Our results suggest that Phalaenopsis explant browning may be due in part to gene expression changes that affect the secondary metabolism, such as: phenylpropanoid pathway and flavonoid biosynthesis. Genes involved in photosynthesis and ATPase activity have been found to be changed at transcription level; these changes may perturb energy metabolism and thus lead to the decay of plant cells and tissues. This study provides comprehensive gene expression data for Phalaenopsis browning. Our data constitute an important resource for further

  15. Genome-Wide Meta-Analysis of Sciatica in Finnish Population

    PubMed Central

    Lemmelä, Susanna; Solovieva, Svetlana; Shiri, Rahman; Benner, Christian; Heliövaara, Markku; Kettunen, Johannes; Anttila, Verneri; Ripatti, Samuli; Perola, Markus; Seppälä, Ilkka; Juonala, Markus; Kähönen, Mika; Salomaa, Veikko; Viikari, Jorma; Raitakari, Olli T.; Lehtimäki, Terho; Palotie, Aarno; Viikari-Juntura, Eira; Husgafvel-Pursiainen, Kirsti

    2016-01-01

    Sciatica or the sciatic syndrome is a common and often disabling low back disorder in the working-age population. It has a relatively high heritability but poorly understood molecular mechanisms. The Finnish population is a genetic isolate where small founder population and bottleneck events have led to enrichment of certain rare and low frequency variants. We performed here the first genome-wide association (GWAS) and meta-analysis of sciatica. The meta-analysis was conducted across two GWAS covering 291 Finnish sciatica cases and 3671 controls genotyped and imputed at 7.7 million autosomal variants. The most promising loci (p<1x10-6) were replicated in 776 Finnish sciatica patients and 18,489 controls. We identified five intragenic variants, with relatively low frequencies, at two novel loci associated with sciatica at genome-wide significance. These included chr9:14344410:I (rs71321981) at 9p22.3 (NFIB gene; p = 1.30x10-8, MAF = 0.08) and four variants at 15q21.2: rs145901849, rs80035109, rs190200374 and rs117458827 (MYO5A; p = 1.34x10-8, MAF = 0.06; p = 2.32x10-8, MAF = 0.07; p = 3.85x10-8, MAF = 0.06; p = 4.78x10-8, MAF = 0.07, respectively). The most significant association in the meta-analysis, a single base insertion rs71321981 within the regulatory region of the transcription factor NFIB, replicated in an independent Finnish population sample (p = 0.04). Despite identifying 15q21.2 as a promising locus, we were not able to replicate it. It was differentiated; the lead variants within 15q21.2 were more frequent in Finland (6–7%) than in other European populations (1–2%). Imputation accuracies of the three significantly associated variants (chr9:14344410:I, rs190200374, and rs80035109) were validated by genotyping. In summary, our results suggest a novel locus, 9p22.3 (NFIB), which may be involved in susceptibility to sciatica. In addition, another locus, 15q21.2, emerged as a promising one, but failed to replicate. PMID:27764105

  16. Physiological, biochemical, and genome-wide transcriptional analysis reveals that elevated CO2 mitigates the impact of combined heat wave and drought stress in Arabidopsis thaliana at multiple organizational levels.

    PubMed

    Zinta, Gaurav; AbdElgawad, Hamada; Domagalska, Malgorzata A; Vergauwen, Lucia; Knapen, Dries; Nijs, Ivan; Janssens, Ivan A; Beemster, Gerrit T S; Asard, Han

    2014-12-01

    Climate changes increasingly threaten plant growth and productivity. Such changes are complex and involve multiple environmental factors, including rising CO2 levels and climate extreme events. As the molecular and physiological mechanisms underlying plant responses to realistic future climate extreme conditions are still poorly understood, a multiple organizational level analysis (i.e. eco-physiological, biochemical, and transcriptional) was performed, using Arabidopsis exposed to incremental heat wave and water deficit under ambient and elevated CO2 . The climate extreme resulted in biomass reduction, photosynthesis inhibition, and considerable increases in stress parameters. Photosynthesis was a major target as demonstrated at the physiological and transcriptional levels. In contrast, the climate extreme treatment induced a protective effect on oxidative membrane damage, most likely as a result of strongly increased lipophilic antioxidants and membrane-protecting enzymes. Elevated CO2 significantly mitigated the negative impact of a combined heat and drought, as apparent in biomass reduction, photosynthesis inhibition, chlorophyll fluorescence decline, H2 O2 production, and protein oxidation. Analysis of enzymatic and molecular antioxidants revealed that the stress-mitigating CO2 effect operates through up-regulation of antioxidant defense metabolism, as well as by reduced photorespiration resulting in lowered oxidative pressure. Therefore, exposure to future climate extreme episodes will negatively impact plant growth and production, but elevated CO2 is likely to mitigate this effect. © 2014 John Wiley & Sons Ltd.

  17. Genome-wide analysis of complex wheat gliadins, the dominant carriers of celiac disease epitopes

    PubMed Central

    Wang, Da-Wei; Li, Da; Wang, Junjun; Zhao, Yue; Wang, Zhaojun; Yue, Guidong; Liu, Xin; Qin, Huanju; Zhang, Kunpu; Dong, Lingli; Wang, Daowen

    2017-01-01

    Gliadins, specified by six compound chromosomal loci (Gli-A1/B1/D1 and Gli-A2/B2/D2) in hexaploid bread wheat, are the dominant carriers of celiac disease (CD) epitopes. Because of their complexity, genome-wide characterization of gliadins is a strong challenge. Here, we approached this challenge by combining transcriptomic, proteomic and bioinformatic investigations. Through third-generation RNA sequencing, full-length transcripts were identified for 52 gliadin genes in the bread wheat cultivar Xiaoyan 81. Of them, 42 were active and predicted to encode 25 α-, 11 γ-, one δ- and five ω-gliadins. Comparative proteomic analysis between Xiaoyan 81 and six newly-developed mutants each lacking one Gli locus indicated the accumulation of 38 gliadins in the mature grains. A novel group of α-gliadins (the CSTT group) was recognized to contain very few or no CD epitopes. The δ-gliadins identified here or previously did not carry CD epitopes. Finally, the mutant lacking Gli-D2 showed significant reductions in the most celiac-toxic α-gliadins and derivative CD epitopes. The insights and resources generated here should aid further studies on gliadin functions in CD and the breeding of healthier wheat. PMID:28300172

  18. Genome-wide characterization and comparative analysis of the MLO gene family in cotton.

    PubMed

    Wang, Xiaoyan; Ma, Qifeng; Dou, Lingling; Liu, Zhen; Peng, Renhai; Yu, Shuxun

    2016-06-01

    In plants, MLO (Mildew Locus O) gene encodes a plant-specific seven transmembrane (TM) domain protein involved in several cellular processes, including susceptibility to powdery mildew (PM). In this study, a genome-wide characterization of the MLO gene family in G. raimondii L., G. arboreum L. and G. hirsutum L. was performed. In total, 22, 17 and 38 homologous sequences were identified for each species, respectively. Gene organization, including chromosomal location, gene clustering and gene duplication, was investigated. Homologues related to PM susceptibility in upland cotton were inferred by phylogenetic relationships with functionally characterized MLO proteins. To conduct a comparative analysis between MLO candidate genes from G. raimondii L., G. arboreum L. and G. hirsutum L., orthologous relationships and conserved synteny blocks were constructed. The transcriptional variation of 38 GhMLO genes in response to exogenous application of salt, mannitol (Man), abscisic acid (ABA), ethylene (ETH), jasmonic acid (JA) and salicylic acid (SA) was monitored. Further studies should be conducted to elucidate the functions of MLO genes in PM susceptibility and phytohormone signalling pathways.

  19. Genome-wide association analysis implicates dysregulation of immunity genes in chronic lymphocytic leukaemia.

    PubMed

    Law, Philip J; Berndt, Sonja I; Speedy, Helen E; Camp, Nicola J; Sava, Georgina P; Skibola, Christine F; Holroyd, Amy; Joseph, Vijai; Sunter, Nicola J; Nieters, Alexandra; Bea, Silvia; Monnereau, Alain; Martin-Garcia, David; Goldin, Lynn R; Clot, Guillem; Teras, Lauren R; Quintela, Inés; Birmann, Brenda M; Jayne, Sandrine; Cozen, Wendy; Majid, Aneela; Smedby, Karin E; Lan, Qing; Dearden, Claire; Brooks-Wilson, Angela R; Hall, Andrew G; Purdue, Mark P; Mainou-Fowler, Tryfonia; Vajdic, Claire M; Jackson, Graham H; Cocco, Pierluigi; Marr, Helen; Zhang, Yawei; Zheng, Tongzhang; Giles, Graham G; Lawrence, Charles; Call, Timothy G; Liebow, Mark; Melbye, Mads; Glimelius, Bengt; Mansouri, Larry; Glenn, Martha; Curtin, Karen; Diver, W Ryan; Link, Brian K; Conde, Lucia; Bracci, Paige M; Holly, Elizabeth A; Jackson, Rebecca D; Tinker, Lesley F; Benavente, Yolanda; Boffetta, Paolo; Brennan, Paul; Maynadie, Marc; McKay, James; Albanes, Demetrius; Weinstein, Stephanie; Wang, Zhaoming; Caporaso, Neil E; Morton, Lindsay M; Severson, Richard K; Riboli, Elio; Vineis, Paolo; Vermeulen, Roel C H; Southey, Melissa C; Milne, Roger L; Clavel, Jacqueline; Topka, Sabine; Spinelli, John J; Kraft, Peter; Ennas, Maria Grazia; Summerfield, Geoffrey; Ferri, Giovanni M; Harris, Robert J; Miligi, Lucia; Pettitt, Andrew R; North, Kari E; Allsup, David J; Fraumeni, Joseph F; Bailey, James R; Offit, Kenneth; Pratt, Guy; Hjalgrim, Henrik; Pepper, Chris; Chanock, Stephen J; Fegan, Chris; Rosenquist, Richard; de Sanjose, Silvia; Carracedo, Angel; Dyer, Martin J S; Catovsky, Daniel; Campo, Elias; Cerhan, James R; Allan, James M; Rothman, Nathanial; Houlston, Richard; Slager, Susan

    2017-02-06

    Several chronic lymphocytic leukaemia (CLL) susceptibility loci have been reported; however, much of the heritable risk remains unidentified. Here we perform a meta-analysis of six genome-wide association studies, imputed using a merged reference panel of 1,000 Genomes and UK10K data, totalling 6,200 cases and 17,598 controls after replication. We identify nine risk loci at 1p36.11 (rs34676223, P=5.04 × 10(-13)), 1q42.13 (rs41271473, P=1.06 × 10(-10)), 4q24 (rs71597109, P=1.37 × 10(-10)), 4q35.1 (rs57214277, P=3.69 × 10(-8)), 6p21.31 (rs3800461, P=1.97 × 10(-8)), 11q23.2 (rs61904987, P=2.64 × 10(-11)), 18q21.1 (rs1036935, P=3.27 × 10(-8)), 19p13.3 (rs7254272, P=4.67 × 10(-8)) and 22q13.33 (rs140522, P=2.70 × 10(-9)). These new and established risk loci map to areas of active chromatin and show an over-representation of transcription factor binding for the key determinants of B-cell development and immune response.

  20. Genome-wide analysis of complex wheat gliadins, the dominant carriers of celiac disease epitopes.

    PubMed

    Wang, Da-Wei; Li, Da; Wang, Junjun; Zhao, Yue; Wang, Zhaojun; Yue, Guidong; Liu, Xin; Qin, Huanju; Zhang, Kunpu; Dong, Lingli; Wang, Daowen

    2017-03-16

    Gliadins, specified by six compound chromosomal loci (Gli-A1/B1/D1 and Gli-A2/B2/D2) in hexaploid bread wheat, are the dominant carriers of celiac disease (CD) epitopes. Because of their complexity, genome-wide characterization of gliadins is a strong challenge. Here, we approached this challenge by combining transcriptomic, proteomic and bioinformatic investigations. Through third-generation RNA sequencing, full-length transcripts were identified for 52 gliadin genes in the bread wheat cultivar Xiaoyan 81. Of them, 42 were active and predicted to encode 25 α-, 11 γ-, one δ- and five ω-gliadins. Comparative proteomic analysis between Xiaoyan 81 and six newly-developed mutants each lacking one Gli locus indicated the accumulation of 38 gliadins in the mature grains. A novel group of α-gliadins (the CSTT group) was recognized to contain very few or no CD epitopes. The δ-gliadins identified here or previously did not carry CD epitopes. Finally, the mutant lacking Gli-D2 showed significant reductions in the most celiac-toxic α-gliadins and derivative CD epitopes. The insights and resources generated here should aid further studies on gliadin functions in CD and the breeding of healthier wheat.

  1. Genome-wide association analysis implicates dysregulation of immunity genes in chronic lymphocytic leukaemia

    PubMed Central

    Law, Philip J.; Berndt, Sonja I.; Speedy, Helen E.; Camp, Nicola J.; Sava, Georgina P.; Skibola, Christine F.; Holroyd, Amy; Joseph, Vijai; Sunter, Nicola J.; Nieters, Alexandra; Bea, Silvia; Monnereau, Alain; Martin-Garcia, David; Goldin, Lynn R.; Clot, Guillem; Teras, Lauren R.; Quintela, Inés; Birmann, Brenda M.; Jayne, Sandrine; Cozen, Wendy; Majid, Aneela; Smedby, Karin E.; Lan, Qing; Dearden, Claire; Brooks-Wilson, Angela R.; Hall, Andrew G.; Purdue, Mark P.; Mainou-Fowler, Tryfonia; Vajdic, Claire M.; Jackson, Graham H.; Cocco, Pierluigi; Marr, Helen; Zhang, Yawei; Zheng, Tongzhang; Giles, Graham G.; Lawrence, Charles; Call, Timothy G.; Liebow, Mark; Melbye, Mads; Glimelius, Bengt; Mansouri, Larry; Glenn, Martha; Curtin, Karen; Diver, W Ryan; Link, Brian K.; Conde, Lucia; Bracci, Paige M.; Holly, Elizabeth A.; Jackson, Rebecca D.; Tinker, Lesley F.; Benavente, Yolanda; Boffetta, Paolo; Brennan, Paul; Maynadie, Marc; McKay, James; Albanes, Demetrius; Weinstein, Stephanie; Wang, Zhaoming; Caporaso, Neil E.; Morton, Lindsay M.; Severson, Richard K.; Riboli, Elio; Vineis, Paolo; Vermeulen, Roel C. H.; Southey, Melissa C.; Milne, Roger L.; Clavel, Jacqueline; Topka, Sabine; Spinelli, John J.; Kraft, Peter; Ennas, Maria Grazia; Summerfield, Geoffrey; Ferri, Giovanni M.; Harris, Robert J.; Miligi, Lucia; Pettitt, Andrew R.; North, Kari E.; Allsup, David J.; Fraumeni, Joseph F.; Bailey, James R.; Offit, Kenneth; Pratt, Guy; Hjalgrim, Henrik; Pepper, Chris; Chanock, Stephen J.; Fegan, Chris; Rosenquist, Richard; de Sanjose, Silvia; Carracedo, Angel; Dyer, Martin J. S.; Catovsky, Daniel; Campo, Elias; Cerhan, James R.; Allan, James M.; Rothman, Nathanial; Houlston, Richard; Slager, Susan

    2017-01-01

    Several chronic lymphocytic leukaemia (CLL) susceptibility loci have been reported; however, much of the heritable risk remains unidentified. Here we perform a meta-analysis of six genome-wide association studies, imputed using a merged reference panel of 1,000 Genomes and UK10K data, totalling 6,200 cases and 17,598 controls after replication. We identify nine risk loci at 1p36.11 (rs34676223, P=5.04 × 10−13), 1q42.13 (rs41271473, P=1.06 × 10−10), 4q24 (rs71597109, P=1.37 × 10−10), 4q35.1 (rs57214277, P=3.69 × 10−8), 6p21.31 (rs3800461, P=1.97 × 10−8), 11q23.2 (rs61904987, P=2.64 × 10−11), 18q21.1 (rs1036935, P=3.27 × 10−8), 19p13.3 (rs7254272, P=4.67 × 10−8) and 22q13.33 (rs140522, P=2.70 × 10−9). These new and established risk loci map to areas of active chromatin and show an over-representation of transcription factor binding for the key determinants of B-cell development and immune response. PMID:28165464

  2. MYB Transcription Factors in Chinese Pear (Pyrus bretschneideri Rehd.): Genome-Wide Identification, Classification, and Expression Profiling during Fruit Development.

    PubMed

    Cao, Yunpeng; Han, Yahui; Li, Dahui; Lin, Yi; Cai, Yongping

    2016-01-01

    The MYB family is one of the largest families of transcription factors in plants. Although, some MYBs were reported to play roles in secondary metabolism, no comprehensive study of the MYB family in Chinese pear (Pyrus bretschneideri Rehd.) has been reported. In the present study, we performed genome-wide analysis of MYB genes in Chinese pear, designated as PbMYBs, including analyses of their phylogenic relationships, structures, chromosomal locations, promoter regions, GO annotations, and collinearity. A total of 129 PbMYB genes were identified in the pear genome and were divided into 31 subgroups based on phylogenetic analysis. These PbMYBs were unevenly distributed among 16 chromosomes (total of 17 chromosomes). The occurrence of gene duplication events indicated that whole-genome duplication and segmental duplication likely played key roles in expansion of the PbMYB gene family. Ka/Ks analysis suggested that the duplicated PbMYBs mainly experienced purifying selection with restrictive functional divergence after the duplication events. Interspecies microsynteny analysis revealed maximum orthology between pear and peach, followed by plum and strawberry. Subsequently, the expression patterns of 20 PbMYB genes that may be involved in lignin biosynthesis according to their phylogenetic relationships were examined throughout fruit development. Among the 20 genes examined, PbMYB25 and PbMYB52 exhibited expression patterns consistent with the typical variations in the lignin content previously reported. Moreover, sub-cellular localization analysis revealed that two proteins PbMYB25 and PbMYB52 were localized to the nucleus. All together, PbMYB25 and PbMYB52 were inferred to be candidate genes involved in the regulation of lignin biosynthesis during the development of pear fruit. This study provides useful information for further functional analysis of the MYB gene family in pear.

  3. Genome-wide meta-analysis of cerebral white matter hyperintensities in patients with stroke

    PubMed Central

    Zhang, Cathy R.; Adib-Samii, Poneh; Devan, William J.; Parsons, Owen E.; Lanfranconi, Silvia; Gregory, Sarah; Cloonan, Lisa; Falcone, Guido J.; Radmanesh, Farid; Fitzpatrick, Kaitlin; Kanakis, Allison; Barrick, Thomas R.; Moynihan, Barry; Lewis, Cathryn M.; Boncoraglio, Giorgio B.; Lemmens, Robin; Thijs, Vincent; Sudlow, Cathie; Wardlaw, Joanna; Rothwell, Peter M.; Meschia, James F.; Worrall, Bradford B.; Levi, Christopher; Bevan, Steve; Furie, Karen L.; Dichgans, Martin; Rosand, Jonathan; Markus, Hugh S.; Rost, Natalia

    2016-01-01

    Objective: For 3,670 stroke patients from the United Kingdom, United States, Australia, Belgium, and Italy, we performed a genome-wide meta-analysis of white matter hyperintensity volumes (WMHV) on data imputed to the 1000 Genomes reference dataset to provide insights into disease mechanisms. Methods: We first sought to identify genetic associations with white matter hyperintensities in a stroke population, and then examined whether genetic loci previously linked to WMHV in community populations are also associated in stroke patients. Having established that genetic associations are shared between the 2 populations, we performed a meta-analysis testing which associations with WMHV in stroke-free populations are associated overall when combined with stroke populations. Results: There were no associations at genome-wide significance with WMHV in stroke patients. All previously reported genome-wide significant associations with WMHV in community populations shared direction of effect in stroke patients. In a meta-analysis of the genome-wide significant and suggestive loci (p < 5 × 10−6) from community populations (15 single nucleotide polymorphisms in total) and from stroke patients, 6 independent loci were associated with WMHV in both populations. Four of these are novel associations at the genome-wide level (rs72934505 [NBEAL1], p = 2.2 × 10−8; rs941898 [EVL], p = 4.0 × 10−8; rs962888 [C1QL1], p = 1.1 × 10−8; rs9515201 [COL4A2], p = 6.9 × 10−9). Conclusions: Genetic associations with WMHV are shared in otherwise healthy individuals and patients with stroke, indicating common genetic susceptibility in cerebral small vessel disease. PMID:26674333

  4. Genome-Wide Analysis of Polycistronic MicroRNAs in Cultivated and Wild Rice

    PubMed Central

    Baldrich, Patricia; Hsing, Yue-Ie Caroline; San Segundo, Blanca

    2016-01-01

    MicroRNAs (miRNAs) are small noncoding RNAs that direct posttranscriptional gene silencing in eukaryotes. They are frequently clustered in the genomes of animals and can be independently transcribed or simultaneously transcribed into single polycistronic transcripts. Only a few miRNA clusters have been described in plants, and most of them are generated from independent transcriptional units. Here, we used a combination of bioinformatic tools and experimental analyses to discover new polycistronic miRNAs in rice. A genome-wide analysis of clustering patterns of MIRNA loci in the rice genome was carried out using a criterion of 3 kb as the maximal distance between two miRNAs. This analysis revealed 28 loci with the ability to form the typical hairpin structure of miRNA precursors in which 2 or more mature miRNAs mapped along the same structure. RT-PCR provided evidence for the polycistronic nature of seven miRNA precursors containing homologous or nonhomologous miRNA species. Polycistronic miRNAs and candidate polycistronic miRNAs are located across different rice chromosomes, except chromosome 12, and resided in both duplicated and nonduplicated chromosomal regions. Finally, most polycistronic and candidate polycistronic miRNAs showed a pattern of conservation in the genome of rice species with an AA genome. The diversity in the organization of MIR genes that are transcribed as polycistrons suggests a versatile mechanism for the control of gene expression in different biological processes and supports additional levels of complexity in miRNA functioning in plants. PMID:27190137

  5. Genome-wide Identification and Expression Analysis of the CDPK Gene Family in Grape, Vitis spp.

    PubMed

    Zhang, Kai; Han, Yong-Tao; Zhao, Feng-Li; Hu, Yang; Gao, Yu-Rong; Ma, Yan-Fei; Zheng, Yi; Wang, Yue-Jin; Wen, Ying-Qiang

    2015-06-30

    Calcium-dependent protein kinases (CDPKs) play vital roles in plant growth and development, biotic and abiotic stress responses, and hormone signaling. Little is known about the CDPK gene family in grapevine. In this study, we performed a genome-wide analysis of the 12X grape genome (Vitis vinifera) and identified nineteen CDPK genes. Comparison of the structures of grape CDPK genes allowed us to examine their functional conservation and differentiation. Segmentally duplicated grape CDPK genes showed high structural conservation and contributed to gene family expansion. Additional comparisons between grape and Arabidopsis thaliana demonstrated that several grape CDPK genes occured in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of grapevine and Arabidopsis. Phylogenetic analysis divided the grape CDPK genes into four groups. Furthermore, we examined the expression of the corresponding nineteen homologous CDPK genes in the Chinese wild grape (Vitis pseudoreticulata) under various conditions, including biotic stress, abiotic stress, and hormone treatments. The expression profiles derived from reverse transcription and quantitative PCR suggested that a large number of VpCDPKs responded to various stimuli on the transcriptional level, indicating their versatile roles in the responses to biotic and abiotic stresses. Moreover, we examined the subcellular localization of VpCDPKs by transiently expressing six VpCDPK-GFP fusion proteins in Arabidopsis mesophyll protoplasts; this revealed high variability consistent with potential functional differences. Taken as a whole, our data provide significant insights into the evolution and function of grape CDPKs and a framework for future investigation of grape CDPK genes.

  6. Inclusion of neighboring base interdependencies substantially improves genome-wide prokaryotic transcription factor binding site prediction

    PubMed Central

    Salama, Rafik A.; Stekel, Dov J.

    2010-01-01

    Prediction of transcription factor binding sites is an important challenge in genome analysis. The advent of next generation genome sequencing technologies makes the development of effective computational approaches particularly imperative. We have developed a novel training-based methodology intended for prokaryotic transcription factor binding site prediction. Our methodology extends existing models by taking into account base interdependencies between neighbouring positions using conditional probabilities and includes genomic background weighting. This has been tested against other existing and novel methodologies including position-specific weight matrices, first-order Hidden Markov Models and joint probability models. We have also tested the use of gapped and ungapped alignments and the inclusion or exclusion of background weighting. We show that our best method enhances binding site prediction for all of the 22 Escherichia coli transcription factors with at least 20 known binding sites, with many showing substantial improvements. We highlight the advantage of using block alignments of binding sites over gapped alignments to capture neighbouring position interdependencies. We also show that combining these methods with ChIP-on-chip data has the potential to further improve binding site prediction. Finally we have developed the ungapped likelihood under positional background platform: a user friendly website that gives access to the prediction method devised in this work. PMID:20439311

  7. Genome-wide analysis of WRKY gene family in Cucumis sativus

    PubMed Central

    2011-01-01

    Background WRKY proteins are a large family of transcriptional regulators in higher plant. They are involved in many biological processes, such as plant development, metabolism, and responses to biotic and abiotic stresses. Prior to the present study, only one full-length cucumber WRKY protein had been reported. The recent publication of the draft genome sequence of cucumber allowed us to conduct a genome-wide search for cucumber WRKY proteins, and to compare these positively identified proteins with their homologs in model plants, such as Arabidopsis. Results We identified a total of 55 WRKY genes in the cucumber genome. According to structural features of their encoded proteins, the cucumber WRKY (CsWRKY) genes were classified into three groups (group 1-3). Analysis of expression profiles of CsWRKY genes indicated that 48 WRKY genes display differential expression either in their transcript abundance or in their expression patterns under normal growth conditions, and 23 WRKY genes were differentially expressed in response to at least one abiotic stresses (cold, drought or salinity). The expression profile of stress-inducible CsWRKY genes were correlated with those of their putative Arabidopsis WRKY (AtWRKY) orthologs, except for the group 3 WRKY genes. Interestingly, duplicated group 3 AtWRKY genes appear to have been under positive selection pressure during evolution. In contrast, there was no evidence of recent gene duplication or positive selection pressure among CsWRKY group 3 genes, which may have led to the expressional divergence of group 3 orthologs. Conclusions Fifty-five WRKY genes were identified in cucumber and the structure of their encoded proteins, their expression, and their evolution were examined. Considering that there has been extensive expansion of group 3 WRKY genes in angiosperms, the occurrence of different evolutionary events could explain the functional divergence of these genes. PMID:21955985

  8. Genome-Wide Analysis of Gene Expression in Primate Taste Buds Reveals Links to Diverse Processes

    PubMed Central

    Lu, Min; Gao, Na; White, Evan; Echeverri, Fernando; Kalabat, Dalia; Soto, Hortensia; Laita, Bianca; Li, Cherry; Yeh, Shaoyang Anthony; Zoller, Mark; Zlotnik, Albert

    2009-01-01

    Efforts to unravel the mechanisms underlying taste sensation (gustation) have largely focused on rodents. Here we present the first comprehensive characterization of gene expression in primate taste buds. Our findings reveal unique new insights into the biology of taste buds. We generated a taste bud gene expression database using laser capture microdissection (LCM) procured fungiform (FG) and circumvallate (CV) taste buds from primates. We also used LCM to collect the top and bottom portions of CV taste buds. Affymetrix genome wide arrays were used to analyze gene expression in all samples. Known taste receptors are preferentially expressed in the top portion of taste buds. Genes associated with the cell cycle and stem cells are preferentially expressed in the bottom portion of taste buds, suggesting that precursor cells are located there. Several chemokines including CXCL14 and CXCL8 are among the highest expressed genes in taste buds, indicating that immune system related processes are active in taste buds. Several genes expressed specifically in endocrine glands including growth hormone releasing hormone and its receptor are also strongly expressed in taste buds, suggesting a link between metabolism and taste. Cell type-specific expression of transcription factors and signaling molecules involved in cell fate, including KIT, reveals the taste bud as an active site of cell regeneration, differentiation, and development. IKBKAP, a gene mutated in familial dysautonomia, a disease that results in loss of taste buds, is expressed in taste cells that communicate with afferent nerve fibers via synaptic transmission. This database highlights the power of LCM coupled with transcriptional profiling to dissect the molecular composition of normal tissues, represents the most comprehensive molecular analysis of primate taste buds to date, and provides a foundation for further studies in diverse aspects of taste biology. PMID:19636377

  9. Examination of the genome-wide transcriptional response of Escherichia coli O157:H7 to cinnamaldehyde exposure.

    PubMed

    Visvalingam, Jeyachchandran; Hernandez-Doria, Juan David; Holley, Richard A

    2013-02-01

    Cinnamaldehyde is a natural antimicrobial that has been found to be effective against many food-borne pathogens, including Escherichia coli O157:H7. Although its antimicrobial effects have been well investigated, limited information is available on its effects at the molecular level. Sublethal treatment at 200 mg/liter cinnamaldehyde inhibited growth of E. coli O157:H7 at 37°C and for ≤2 h caused cell elongation, but from 2 to 4 h growth resumed and cells reverted to normal length. To understand this transient behavior, genome-wide transcriptional analysis of E. coli O157:H7 was performed at 2 and 4 h of exposure to cinnamaldehyde in conjunction with reverse-phase high-performance liquid chromatography (RP-HPLC) analysis for cinnamaldehyde and other cinnamic compounds. Drastically different gene expression profiles were obtained at 2 and 4 h. RP-HPLC analysis showed that cinnamaldehyde was structurally stable for at least 2 h. At 2 h of exposure, cinnamaldehyde induced expression of many oxidative stress-related genes and repressed expression of DNA, protein, O-antigen, and fimbrial synthetic genes. At 4 h, many cinnamaldehyde-induced repressive effects on E. coli O157:H7 gene expression were reversed, and cells became more motile and grew at a slightly higher rate. Data indicated that by 4 h, E. coli O157:H7 was able to convert cinnamaldehyde into the less toxic cinnamic alcohol using dehydrogenase/reductase enzymes (YqhD and DkgA). This is the first study to characterize the ability of E. coli O157:H7 to convert cinnamaldehyde into cinnamic alcohol which, in turn, showed that the antimicrobial activity of cinnamaldehyde is mainly attributable to its carbonyl aldehyde group.

  10. Examination of the Genome-Wide Transcriptional Response of Escherichia coli O157:H7 to Cinnamaldehyde Exposure

    PubMed Central

    Visvalingam, Jeyachchandran; Hernandez-Doria, Juan David

    2013-01-01

    Cinnamaldehyde is a natural antimicrobial that has been found to be effective against many food-borne pathogens, including Escherichia coli O157:H7. Although its antimicrobial effects have been well investigated, limited information is available on its effects at the molecular level. Sublethal treatment at 200 mg/liter cinnamaldehyde inhibited growth of E. coli O157:H7 at 37°C and for ≤2 h caused cell elongation, but from 2 to 4 h growth resumed and cells reverted to normal length. To understand this transient behavior, genome-wide transcriptional analysis of E. coli O157:H7 was performed at 2 and 4 h of exposure to cinnamaldehyde in conjunction with reverse-phase high-performance liquid chromatography (RP-HPLC) analysis for cinnamaldehyde and other cinnamic compounds. Drastically different gene expression profiles were obtained at 2 and 4 h. RP-HPLC analysis showed that cinnamaldehyde was structurally stable for at least 2 h. At 2 h of exposure, cinnamaldehyde induced expression of many oxidative stress-related genes and repressed expression of DNA, protein, O-antigen, and fimbrial synthetic genes. At 4 h, many cinnamaldehyde-induced repressive effects on E. coli O157:H7 gene expression were reversed, and cells became more motile and grew at a slightly higher rate. Data indicated that by 4 h, E. coli O157:H7 was able to convert cinnamaldehyde into the less toxic cinnamic alcohol using dehydrogenase/reductase enzymes (YqhD and DkgA). This is the first study to characterize the ability of E. coli O157:H7 to convert cinnamaldehyde into cinnamic alcohol which, in turn, showed that the antimicrobial activity of cinnamaldehyde is mainly attributable to its carbonyl aldehyde group. PMID:23183978

  11. Genome-Wide Transcriptional Profiling and Structural Magnetic Resonance Imaging in the Maternal Immune Activation Model of Neurodevelopmental Disorders.

    PubMed

    Richetto, Juliet; Chesters, Robert; Cattaneo, Annamaria; Labouesse, Marie A; Gutierrez, Ana Maria Carrillo; Wood, Tobias C; Luoni, Alessia; Meyer, Urs; Vernon, Anthony; Riva, Marco A

    2017-06-01

    Prenatal exposure to maternal infection increases the risk of neurodevelopmental disorders, including schizophrenia and autism. The molecular processes underlying this pathological association, however, are only partially understood. Here, we combined unbiased genome-wide transcriptional profiling with follow-up epigenetic analyses and structural magnetic resonance imaging to explore convergent molecular and neuromorphological alterations in corticostriatal areas of adult offspring exposed to prenatal immune activation. Genome-wide transcriptional profiling revealed that prenatal immune activation caused a differential expression of 116 and 251 genes in the medial prefrontal cortex and nucleus accumbens, respectively. A large part of genes that were commonly affected in both brain areas were related to myelin functionality and stability. Subsequent epigenetic analyses indicated that altered DNA methylation of promoter regions might contribute to the differential expression of myelin-related genes. Quantitative relaxometry comparing T1, T2, and myelin water fraction revealed sparse increases in T1 relaxation times and consistent reductions in T2 relaxation times. Together, our multi-system approach demonstrates that prenatal viral-like immune activation causes myelin-related transcriptional and epigenetic changes in corticostriatal areas. Even though these abnormalities do not seem to be associated with overt white matter reduction, they may provide a molecular mechanism whereby prenatal infection can impair myelin functionality and stability. © The Author 2017. Published by Oxford University Press.

  12. Genome-Wide Effects of Selenium and Translational Uncoupling on Transcription in the Termite Gut Symbiont Treponema primitia

    PubMed Central

    Matson, Eric G.; Rosenthal, Adam Z.; Zhang, Xinning; Leadbetter, Jared R.

    2013-01-01

    ABSTRACT When prokaryotic cells acquire mutations, encounter translation-inhibiting substances, or experience adverse environmental conditions that limit their ability to synthesize proteins, transcription can become uncoupled from translation. Such uncoupling is known to suppress transcription of protein-encoding genes in bacteria. Here we show that the trace element selenium controls transcription of the gene for the selenocysteine-utilizing enzyme formate dehydrogenase (fdhFSec) through a translation-coupled mechanism in the termite gut symbiont Treponema primitia, a member of the bacterial phylum Spirochaetes. We also evaluated changes in genome-wide transcriptional patterns caused by selenium limitation and by generally uncoupling translation from transcription via antibiotic-mediated inhibition of protein synthesis. We observed that inhibiting protein synthesis in T. primitia influences transcriptional patterns in unexpected ways. In addition to suppressing transcription of certain genes, the expected consequence of inhibiting protein synthesis, we found numerous examples in which transcription of genes and operons is truncated far downstream from putative promoters, is unchanged, or is even stimulated overall. These results indicate that gene regulation in bacteria allows for specific post-initiation transcriptional responses during periods of limited protein synthesis, which may depend both on translational coupling and on unclassified intrinsic elements of protein-encoding genes. PMID:24222491

  13. Genome-wide identification and analysis of the MADS-box gene family in sesame.

    PubMed

    Wei, Xin; Wang, Linhai; Yu, Jingyin; Zhang, Yanxin; Li, Donghua; Zhang, Xiurong

    2015-09-10

    MADS-box genes encode transcription factors that play crucial roles in plant growth and development. Sesame (Sesamum indicum L.) is an oil crop that contributes to the daily oil and protein requirements of almost half of the world's population; therefore, a genome-wide analysis of the MADS-box gene family is needed. Fifty-seven MADS-box genes were identified from 14 linkage groups of the sesame genome. Analysis of phylogenetic relationships with Arabidopsis thaliana, Utricularia gibba and Solanum lycopersicum MADS-box genes was performed. Sesame MADS-box genes were clustered into four groups: 28 MIKC(c)-type, 5 MIKC(⁎)-type, 14 Mα-type and 10 Mγ-type. Gene structure analysis revealed from 1 to 22 exons of sesame MADS-box genes. The number of exons in type II MADS-box genes greatly exceeded the number in type I genes. Motif distribution analysis of sesame MADS-box genes also indicated that type II MADS-box genes contained more motifs than type I genes. These results suggested that type II sesame MADS-box genes had more complex structures. By analyzing expression profiles of MADS-box genes in seven sesame transcriptomes, we determined that MIKC(C)-type MADS-box genes played significant roles in sesame flower and seed development. Although most MADS-box genes in the same clade showed similar expression features, some gene functions were diversified from the orthologous Arabidopsis genes. This research will contribute to uncovering the role of MADS-box genes in sesame development.

  14. Genome-wide identification, evolutionary and expression analysis of the aspartic protease gene superfamily in grape

    PubMed Central

    2013-01-01

    Background Aspartic proteases (APs) are a large family of proteolytic enzymes found in almost all organisms. In plants, they are involved in many biological processes, such as senescence, stress responses, programmed cell death, and reproduction. Prior to the present study, no grape AP gene(s) had been reported, and their research on woody species was very limited. Results In this study, a total of 50 AP genes (VvAP) were identified in the grape genome, among which 30 contained the complete ASP domain. Synteny analysis within grape indicated that segmental and tandem duplication events contributed to the expansion of the grape AP family. Additional analysis between grape and Arabidopsis demonstrated that several grape AP genes were found in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of grape and Arabidopsis. Phylogenetic relationships of the 30 VvAPs with the complete ASP domain and their Arabidopsis orthologs, as well as their gene and protein features were analyzed and their cellular localization was predicted. Moreover, expression profiles of VvAP genes in six different tissues were determined, and their transcript abundance under various stresses and hormone treatments were measured. Twenty-seven VvAP genes were expressed in at least one of the six tissues examined; nineteen VvAPs responded to at least one abiotic stress, 12 VvAPs responded to powdery mildew infection, and most of the VvAPs responded to SA and ABA treatments. Furthermore, integrated synteny and phylogenetic analysis identified orthologous AP genes between grape and Arabidopsis, providing a unique starting point for investigating the function of grape AP genes. Conclusions The genome-wide identification, evolutionary and expression analyses of grape AP genes provide a framework for future analysis of AP genes in defining their roles during stress response. Integrated synteny and phylogenetic analyses provide novel insight into the

  15. Genome-wide analysis for protein-DNA interaction: ChIP-chip.

    PubMed

    Tong, Yunguang; Falk, Jeff

    2009-01-01

    Chromatin immunoprecipitation (ChIP) is a well-established procedure for protein-DNA interaction research. ChIP-chip, combining chromatin immunoprecipitation (ChIP) and microarray technology (Chip), enables scientists to survey genome-wide DNA binding sites for a given protein. The ChIP-chip technique has been used to identify transcription factor binding sites, explore epigenomic information and investigate factors in DNA replicate/repairs. Here we describe a protocol for ChIP-chip to study Pituitary Tumor Transforming Gene (PTTG1) in mammalian cells.

  16. Genome-wide analysis of SAUR gene family in Solanaceae species.

    PubMed

    Wu, Jian; Liu, Songyu; He, Yanjun; Guan, Xiaoyan; Zhu, Xiangfei; Cheng, Lin; Wang, Jie; Lu, Gang

    2012-11-01

    The plant hormone auxin plays a vital role in regulating many aspects of plant growth and development. Small auxin up-regulated RNAs (SAURs) are primary auxin response genes hypothesized to be involved in auxin signaling pathway, but their functions remain unclear. Here, a genome-wide search for SAUR gene homologues in Solanaceae species identified 99 and 134 members of SAUR gene family from tomato and potato, respectively. Phylogenetic analysis indicated that the SAUR proteins from Arabidopsis, rice, sorghum, tomato and potato were divided into four major groups with 16 subgroups. Among them, 25 histidine-rich SAURs genes with metal-binding characteristics were found in Arabidopsis, sorghum and Solanaceae species, but not in rice. Using tomato as a model, a comprehensive overview of SAUR gene family is presented, including the gene structures, phylogeny and chromosome locations. Quantitative real-time PCR analysis indicated that 11 randomly selected SlSAUR genes in tomato could be expressed at least in one of the tomato organs/tissues tested. However, different SlSAUR genes displayed distinctive expression levels. SlSAUR16 and SlSAUR71 exhibited highly tissue-specific expression patterns. Almost all of the detected SlSAURs showed an accumulating pattern of mRNA along tomato flower and fruit development. Some of them displayed differential response to exogenous IAA treatment. The abiotic (cold, salt and drought) stresses significantly modified transcript levels of SlSAURs genes. Most of them were down-regulated in response to abiotic stresses (drought, heat and salinity), but SlSAUR58, as a histidine-rich SAUR gene, was up-regulated after salt treatment, indicating that it may play a specific role in the salt signaling transduction pathway. Our comparative analysis provides some basic genomic information for the SAUR genes in the Solanaceae species and will pave the way for deciphering their function during plant development.

  17. Rho-dependent transcription termination is essential to prevent excessive genome-wide R-loops in Escherichia coli.

    PubMed

    Leela, J Krishna; Syeda, Aisha H; Anupama, K; Gowrishankar, J

    2013-01-02

    Two pathways of transcription termination, factor-independent and -dependent, exist in bacteria. The latter pathway operates on nascent transcripts that are not simultaneously translated and requires factors Rho, NusG, and NusA, each of which is essential for viability of WT Escherichia coli. NusG and NusA are also involved in antitermination of transcription at the ribosomal RNA operons, as well as in regulating the rates of transcription elongation of all genes. We have used a bisulfite-sensitivity assay to demonstrate genome-wide increase in the occurrence of RNA-DNA hybrids (R-loops), including from antisense and read-through transcripts, in a nusG missense mutant defective for Rho-dependent termination. Lethality associated with complete deficiency of Rho and NusG (but not NusA) was rescued by ectopic expression of an R-loop-helicase UvsW, especially so on defined growth media. Our results suggest that factor-dependent transcription termination subserves a surveillance function to prevent translation-uncoupled transcription from generating R-loops, which would block replication fork progression and therefore be lethal, and that NusA performs additional essential functions as well in E. coli. Prevention of R-loop-mediated transcription-replication conflicts by cotranscriptional protein engagement of nascent RNA is emerging as a unifying theme among both prokaryotes and eukaryotes.

  18. Meta-analysis of 32 genome-wide linkage studies of schizophrenia

    PubMed Central

    Ng, MYM; Levinson, DF; Faraone, SV; Suarez, BK; DeLisi, LE; Arinami, T; Riley, B; Paunio, T; Pulver, AE; Irmansyah; Holmans, PA; Escamilla, M; Wildenauer, DB; Williams, NM; Laurent, C; Mowry, BJ; Brzustowicz, LM; Maziade, M; Sklar, P; Garver, DL; Abecasis, GR; Lerer, B; Fallin, MD; Gurling, HMD; Gejman, PV; Lindholm, E; Moises, HW; Byerley, W; Wijsman, EM; Forabosco, P; Tsuang, MT; Hwu, H-G; Okazaki, Y; Kendler, KS; Wormley, B; Fanous, A; Walsh, D; O’Neill, FA; Peltonen, L; Nestadt, G; Lasseter, VK; Liang, KY; Papadimitriou, GM; Dikeos, DG; Schwab, SG; Owen, MJ; O’Donovan, MC; Norton, N; Hare, E; Raventos, H; Nicolini, H; Albus, M; Maier, W; Nimgaonkar, VL; Terenius, L; Mallet, J; Jay, M; Godard, S; Nertney, D; Alexander, M; Crowe, RR; Silverman, JM; Bassett, AS; Roy, M-A; Mérette, C; Pato, CN; Pato, MT; Roos, J Louw; Kohn, Y; Amann-Zalcenstein, D; Kalsi, G; McQuillin, A; Curtis, D; Brynjolfson, J; Sigmundsson, T; Petursson, H; Sanders, AR; Duan, J; Jazin, E; Myles-Worsley, M; Karayiorgou, M; Lewis, CM

    2009-01-01

    A genome scan meta-analysis (GSMA) was carried out on 32 independent genome-wide linkage scan analyses that included 3255 pedigrees with 7413 genotyped cases affected with schizophrenia (SCZ) or related disorders. The primary GSMA divided the autosomes into 120 bins, rank-ordered the bins within each study according to the most positive linkage result in each bin, summed these ranks (weighted for study size) for each bin across studies and determined the empirical probability of a given summed rank (PSR) by simulation. Suggestive evidence for linkage was observed in two single bins, on chromosomes 5q (142-168 Mb) and 2q (103-134 Mb). Genome-wide evidence for linkage was detected on chromosome 2q (119-152 Mb) when bin boundaries were shifted to the middle of the previous bins. The primary analysis met empirical criteria for ‘aggregate’ genome-wide significance, indicating that some or all of 10 bins are likely to contain loci linked to SCZ, including regions of chromosomes 1, 2q, 3q, 4q, 5q, 8p and 10q. In a secondary analysis of 22 studies of European-ancestry samples, suggestive evidence for linkage was observed on chromosome 8p (16-33 Mb). Although the newer genome-wide association methodology has greater power to detect weak associations to single common DNA sequence variants, linkage analysis can detect diverse genetic effects that segregate in families, including multiple rare variants within one locus or several weakly associated loci in the same region. Therefore, the regions supported by this meta-analysis deserve close attention in future studies. PMID:19349958

  19. Genome-wide meta-analysis identifies five new susceptibility loci for cutaneous malignant melanoma

    PubMed Central

    Law, Matthew H.; Bishop, D. Timothy; Martin, Nicholas G.; Moses, Eric K.; Song, Fengju; Barrett, Jennifer H.; Kumar, Rajiv; Easton, Douglas F.; Pharoah, Paul D. P.; Swerdlow, Anthony J.; Kypreou, Katerina P.; Taylor, John C.; Harland, Mark; Randerson-Moor, Juliette; Akslen, Lars A.; Andresen, Per A.; Avril, Marie-Françoise; Azizi, Esther; Scarrà, Giovanna Bianchi; Brown, Kevin M.; Dębniak, Tadeusz; Duffy, David L.; Elder, David E.; Fang, Shenying; Friedman, Eitan; Galan, Pilar; Ghiorzo, Paola; Gillanders, Elizabeth M.; Goldstein, Alisa M.; Gruis, Nelleke A.; Hansson, Johan; Helsing, Per; Hočevar, Marko; Höiom, Veronica; Ingvar, Christian; Kanetsky, Peter A.; Chen, Wei V.; Landi, Maria Teresa; Lang, Julie; Lathrop, G. Mark; Lubiński, Jan; Mackie, Rona M.; Mann, Graham J.; Molven, Anders; Montgomery, Grant W.; Novaković, Srdjan; Olsson, Håkan; Puig, Susana; Puig-Butille, Joan Anton; Qureshi, Abrar A.; Radford-Smith, Graham L.; van der Stoep, Nienke; van Doorn, Remco; Whiteman, David C.; Craig, Jamie E.; Schadendorf, Dirk; Simms, Lisa A.; Burdon, Kathryn P.; Nyholt, Dale R.; Pooley, Karen A.; Orr, Nick; Stratigos, Alexander J.; Cust, Anne E.; Ward, Sarah V.; Hayward, Nicholas K.; Han, Jiali; Schulze, Hans-Joachim; Dunning, Alison M.; Bishop, Julia A. Newton; MacGregor, Stuart; Iles, Mark M.

    2015-01-01

    Thirteen common susceptibility loci have been reproducibly associated with cutaneous malignant melanoma (CMM). We report the results of an international 2-stage meta-analysis of CMM genome-wide association studies (GWAS). This meta-analysis combines 11 GWAS (5 previously unpublished) and a further three stage 2 data sets, totaling 15,990 CMM cases and 26,409 controls. Five loci not previously associated with CMM risk reached genome-wide significance (P < 5×10–8), as did two previously-reported but un-replicated loci and all thirteen established loci. Novel SNPs fall within putative melanocyte regulatory elements, and bioinformatic and expression quantitative trait locus (eQTL) data highlight candidate genes including one involved in telomere biology. PMID:26237428

  20. Novel R tools for analysis of genome-wide population genetic data with emphasis on clonality

    PubMed Central

    Kamvar, Zhian N.; Brooks, Jonah C.; Grünwald, Niklaus J.

    2015-01-01

    To gain a detailed understanding of how plant microbes evolve and adapt to hosts, pesticides, and other factors, knowledge of the population dynamics and evolutionary history of populations is crucial. Plant pathogen populations are often clonal or partially clonal which requires different analytical tools. With the advent of high throughput sequencing technologies, obtaining genome-wide population genetic data has become easier than ever before. We previously contributed the R package poppr specifically addressing issues with analysis of clonal populations. In this paper we provide several significant extensions to poppr with a focus on large, genome-wide SNP data. Specifically, we provide several new functionalities including the new function mlg.filter to define clone boundaries allowing for inspection and definition of what is a clonal lineage, minimum spanning networks with reticulation, a sliding-window analysis of the index of association, modular bootstrapping of any genetic distance, and analyses across any level of hierarchies. PMID:26113860

  1. Genome-wide transcriptome analysis of soybean primary root under varying water-deficit conditions.

    PubMed

    Song, Li; Prince, Silvas; Valliyodan, Babu; Joshi, Trupti; Maldonado dos Santos, Joao V; Wang, Jiaojiao; Lin, Li; Wan, Jinrong; Wang, Yongqin; Xu, Dong; Nguyen, Henry T

    2016-01-15

    Soybean is a major crop that provides an important source of protein and oil to humans and animals, but its production can be dramatically decreased by the occurrence of drought stress. Soybeans can survive drought stress if there is a robust and deep root system at the early vegetative growth stage. However, little is known about the genome-wide molecular mechanisms contributing to soybean root system architecture. This study was performed to gain knowledge on transcriptome changes and related molecular mechanisms contributing to soybean root development under water limited conditions. The soybean Williams 82 genotype was subjected to very mild stress (VMS), mild stress (MS) and severe stress (SS) conditions, as well as recovery from the severe stress after re-watering (SR). In total, 6,609 genes in the roots showed differential expression patterns in response to different water-deficit stress levels. Genes involved in hormone (Auxin/Ethylene), carbohydrate, and cell wall-related metabolism (XTH/lipid/flavonoids/lignin) pathways were differentially regulated in the soybean root system. Several transcription factors (TFs) regulating root growth and responses under varying water-deficit conditions were identified and the expression patterns of six TFs were found to be common across the stress levels. Further analysis on the whole plant level led to the finding of tissue-specific or water-deficit levels specific regulation of transcription factors. Analysis of the over-represented motif of different gene groups revealed several new cis-elements associated with different levels of water deficit. The expression patterns of 18 genes were confirmed byquantitative reverse transcription polymerase chain reaction method and demonstrated the accuracy and effectiveness of RNA-Seq. The primary root specific transcriptome in soybean can enable a better understanding of the root response to water deficit conditions. The genes detected in root tissues that were associated with

  2. Meta-analysis of sex-specific genome-wide association studies.

    PubMed

    Magi, Reedik; Lindgren, Cecilia M; Morris, Andrew P

    2010-12-01

    Despite the success of genome-wide association studies, much of the genetic contribution to complex human traits is still unexplained. One potential source of genetic variation that may contribute to this "missing heritability" is that which differs in magnitude and/or direction between males and females, which could result from sexual dimorphism in gene expression. Such sex-differentiated effects are common in model organisms, and are becoming increasingly evident in human complex traits through large-scale male- and female-specific meta-analyses. In this article, we review the methodology for meta-analysis of sex-specific genome-wide association studies, and propose a sex-differentiated test of association with quantitative or dichotomous traits, which allows for heterogeneity of allelic effects between males and females. We perform detailed simulations to compare the power of the proposed sex-differentiated meta-analysis with the more traditional "sex-combined" approach, which is ambivalent to gender. The results of this study highlight only a small loss in power for the sex-differentiated meta-analysis when the allelic effects of the causal variant are the same in males and females. However, over a range of models of heterogeneity in allelic effects between genders, our sex-differentiated meta-analysis strategy offers substantial gains in power, and thus has the potential to discover novel loci contributing effects to complex human traits with existing genome-wide association data.

  3. Genome-Wide Transcription Study of Cryptococcus neoformans H99 Clinical Strain versus Environmental Strains

    PubMed Central

    Movahed, Elaheh; Munusamy, Komathy; Tan, Grace Min Yi; Looi, Chung Yeng; Tay, Sun Tee; Wong, Won Fen

    2015-01-01

    The infection of Cryptococcus neoformans is acquired through the inhalation of desiccated yeast cells and basidiospores originated from the environment, particularly from bird’s droppings and decaying wood. Three environmental strains of C. neoformans originated from bird droppings (H4, S48B and S68B) and C. neoformans reference clinical strain (H99) were used for intranasal infection in C57BL/6 mice. We showed that the H99 strain demonstrated higher virulence compared to H4, S48B and S68B strains. To examine if gene expression contributed to the different degree of virulence among these strains, a genome-wide microarray study was performed to inspect the transcriptomic profiles of all four strains. Our results revealed that out of 7,419 genes (22,257 probes) examined, 65 genes were significantly up-or down-regulated in H99 versus H4, S48B and S68B strains. The up-regulated genes in H99 strain include Hydroxymethylglutaryl-CoA synthase (MVA1), Mitochondrial matrix factor 1 (MMF1), Bud-site-selection protein 8 (BUD8), High affinity glucose transporter 3 (SNF3) and Rho GTPase-activating protein 2 (RGA2). Pathway annotation using DAVID bioinformatics resource showed that metal ion binding and sugar transmembrane transporter activity pathways were highly expressed in the H99 strain. We suggest that the genes and pathways identified may possibly play crucial roles in the fungal pathogenesis. PMID:26360021

  4. Genome-Wide Transcription Study of Cryptococcus neoformans H99 Clinical Strain versus Environmental Strains.

    PubMed

    Movahed, Elaheh; Munusamy, Komathy; Tan, Grace Min Yi; Looi, Chung Yeng; Tay, Sun Tee; Wong, Won Fen

    2015-01-01

    The infection of Cryptococcus neoformans is acquired through the inhalation of desiccated yeast cells and basidiospores originated from the environment, particularly from bird's droppings and decaying wood. Three environmental strains of C. neoformans originated from bird droppings (H4, S48B and S68B) and C. neoformans reference clinical strain (H99) were used for intranasal infection in C57BL/6 mice. We showed that the H99 strain demonstrated higher virulence compared to H4, S48B and S68B strains. To examine if gene expression contributed to the different degree of virulence among these strains, a genome-wide microarray study was performed to inspect the transcriptomic profiles of all four strains. Our results revealed that out of 7,419 genes (22,257 probes) examined, 65 genes were significantly up-or down-regulated in H99 versus H4, S48B and S68B strains. The up-regulated genes in H99 strain include Hydroxymethylglutaryl-CoA synthase (MVA1), Mitochondrial matrix factor 1 (MMF1), Bud-site-selection protein 8 (BUD8), High affinity glucose transporter 3 (SNF3) and Rho GTPase-activating protein 2 (RGA2). Pathway annotation using DAVID bioinformatics resource showed that metal ion binding and sugar transmembrane transporter activity pathways were highly expressed in the H99 strain. We suggest that the genes and pathways identified may possibly play crucial roles in the fungal pathogenesis.

  5. Genome-wide identification and characterization of GRAS transcription factors in sacred lotus (Nelumbo nucifera)

    PubMed Central

    Zhou, Ying; Zhou, Yu; Yang, Jie

    2016-01-01

    The GRAS gene family is one of the most important plant-specific gene families, which encodes transcriptional regulators and plays an essential role in plant development and physiological processes. The GRAS gene family has been well characterized in many higher plants such as Arabidopsis, rice, Chinese cabbage, tomato and tobacco. In this study, we identified 38 GRAS genes in sacred lotus (Nelumbo nucifera), analyzed their physical and chemical characteristics and performed phylogenetic analysis using the GRAS genes from eight representative plant species to show the evolution of GRAS genes in Planta. In addition, the gene structures and motifs of the sacred lotus GRAS proteins were characterized in detail. Comparative analysis identified 42 orthologous and 9 co-orthologous gene pairs between sacred lotus and Arabidopsis, and 35 orthologous and 22 co-orthologous gene pairs between sacred lotus and rice. Based on publically available RNA-seq data generated from leaf, petiole, rhizome and root, we found that most of the sacred lotus GRAS genes exhibited a tissue-specific expression pattern. Eight of the ten PAT1-clade GRAS genes, particularly NnuGRAS-05, NnuGRAS-10 and NnuGRAS-25, were preferentially expressed in rhizome and root. In summary, this is the first in silico analysis of the GRAS gene family in sacred lotus, which will provide valuable information for further molecular and biological analyses of this important gene family. PMID:27635351

  6. Genome-wide identification of transcriptional targets of RORA reveals direct regulation of multiple genes associated with autism spectrum disorder

    PubMed Central

    2013-01-01

    Background We have recently identified the nuclear hormone receptor RORA (retinoic acid-related orphan receptor-alpha) as a novel candidate gene for autism spectrum disorder (ASD). Our independent cohort studies have consistently demonstrated the reduction of RORA transcript and/or protein levels in blood-derived lymphoblasts as well as in the postmortem prefrontal cortex and cerebellum of individuals with ASD. Moreover, we have also shown that RORA has the potential to be under negative and positive regulation by androgen and estrogen, respectively, suggesting the possibility that RORA may contribute to the male bias of ASD. However, little is known about transcriptional targets of this nuclear receptor, particularly in humans. Methods Here we identify transcriptional targets of RORA in human neuronal cells on a genome-wide level using chromatin immunoprecipitation (ChIP) with an anti-RORA antibody followed by whole-genome promoter array (chip) analysis. Selected potential targets of RORA were then validated by an independent ChIP followed by quantitative PCR analysis. To further demonstrate that reduced RORA expression results in reduced transcription of RORA targets, we determined the expression levels of the selected transcriptional targets in RORA-deficient human neuronal cells, as well as in postmortem brain tissues from individuals with ASD who exhibit reduced RORA expression. Results The ChIP-on-chip analysis reveals that RORA1, a major isoform of RORA protein in human brain, can be recruited to as many as 2,764 genomic locations corresponding to promoter regions of 2,544 genes across the human genome. Gene ontology analysis of this dataset of genes that are potentially directly regulated by RORA1 reveals statistically significant enrichment in biological functions negatively impacted in individuals with ASD, including neuronal differentiation, adhesion and survival, synaptogenesis, synaptic transmission and plasticity, and axonogenesis, as well as higher

  7. Genome-wide identification of transcriptional targets of RORA reveals direct regulation of multiple genes associated with autism spectrum disorder.

    PubMed

    Sarachana, Tewarit; Hu, Valerie W

    2013-05-22

    We have recently identified the nuclear hormone receptor RORA (retinoic acid-related orphan receptor-alpha) as a novel candidate gene for autism spectrum disorder (ASD). Our independent cohort studies have consistently demonstrated the reduction of RORA transcript and/or protein levels in blood-derived lymphoblasts as well as in the postmortem prefrontal cortex and cerebellum of individuals with ASD. Moreover, we have also shown that RORA has the potential to be under negative and positive regulation by androgen and estrogen, respectively, suggesting the possibility that RORA may contribute to the male bias of ASD. However, little is known about transcriptional targets of this nuclear receptor, particularly in humans. Here we identify transcriptional targets of RORA in human neuronal cells on a genome-wide level using chromatin immunoprecipitation (ChIP) with an anti-RORA antibody followed by whole-genome promoter array (chip) analysis. Selected potential targets of RORA were then validated by an independent ChIP followed by quantitative PCR analysis. To further demonstrate that reduced RORA expression results in reduced transcription of RORA targets, we determined the expression levels of the selected transcriptional targets in RORA-deficient human neuronal cells, as well as in postmortem brain tissues from individuals with ASD who exhibit reduced RORA expression. The ChIP-on-chip analysis reveals that RORA1, a major isoform of RORA protein in human brain, can be recruited to as many as 2,764 genomic locations corresponding to promoter regions of 2,544 genes across the human genome. Gene ontology analysis of this dataset of genes that are potentially directly regulated by RORA1 reveals statistically significant enrichment in biological functions negatively impacted in individuals with ASD, including neuronal differentiation, adhesion and survival, synaptogenesis, synaptic transmission and plasticity, and axonogenesis, as well as higher level functions such as

  8. Genome-wide analysis reveals positional-nucleosome-oriented binding pattern of pioneer factor FOXA1

    PubMed Central

    Ye, Zhenqing; Chen, Zhong; Sunkel, Benjamin; Frietze, Seth; Huang, Tim H.-M.; Wang, Qianben; Jin, Victor X.

    2016-01-01

    The compaction of nucleosomal structures creates a barrier for DNA-binding transcription factors (TFs) to access their cognate cis-regulatory elements. Pioneer factors (PFs) such as FOXA1 are able to directly access these cis-targets within compact chromatin. However, how these PFs interplay with nucleosomes remains to be elucidated, and is critical for us to understand the underlying mechanism of gene regulation. Here, we have conducted a computational analysis on a strand-specific paired-end ChIP-exo (termed as ChIP-ePENS) data of FOXA1 in LNCaP cells by our novel algorithm ePEST. We find that FOXA1 chromatin binding occurs via four distinct border modes (or footprint boundary patterns), with a preferential footprint boundary patterns relative to FOXA1 motif orientation. In addition, from this analysis three fundamental nucleotide positions (oG, oS and oH) emerged as major determinants for blocking exo-digestion and forming these four distinct border modes. By integrating histone MNase-seq data, we found an astonishingly consistent, ‘well-positioned’ configuration occurs between FOXA1 motifs and dyads of nucleosomes genome-wide. We further performed ChIP-seq of eight chromatin remodelers and found an increased occupancy of these remodelers on FOXA1 motifs for all four border modes (or footprint boundary patterns), indicating the full occupancy of FOXA1 complex on the three blocking sites (oG, oS and oH) likely produces an active regulatory status with well-positioned phasing for protein binding events. Together, our results suggest a positional-nucleosome-oriented accessing model for PFs seeking target motifs, in which FOXA1 can examine each underlying DNA nucleotide and is able to sense all potential motifs regardless of whether they face inward or outward from histone octamers along the DNA helix axis. PMID:27458208

  9. Genome-wide common and rare variant analysis provides novel insights into clozapine-associated neutropenia.

    PubMed

    Legge, S E; Hamshere, M L; Ripke, S; Pardinas, A F; Goldstein, J I; Rees, E; Richards, A L; Leonenko, G; Jorskog, L F; Chambert, K D; Collier, D A; Genovese, G; Giegling, I; Holmans, P; Jonasdottir, A; Kirov, G; McCarroll, S A; MacCabe, J H; Mantripragada, K; Moran, J L; Neale, B M; Stefansson, H; Rujescu, D; Daly, M J; Sullivan, P F; Owen, M J; O'Donovan, M C; Walters, J T R

    2016-07-12

    The antipsychotic clozapine is uniquely effective in the management of schizophrenia; however, its use is limited by its potential to induce agranulocytosis. The causes of this, and of its precursor neutropenia, are largely unknown, although genetic factors have an important role. We sought risk alleles for clozapine-associated neutropenia in a sample of 66 cases and 5583 clozapine-treated controls, through a genome-wide association study (GWAS), imputed human leukocyte antigen (HLA) alleles, exome array and copy-number variation (CNV) analyses. We then combined associated variants in a meta-analysis with data from the Clozapine-Induced Agranulocytosis Consortium (up to 163 cases and 7970 controls). In the largest combined sample to date, we identified a novel association with rs149104283 (odds ratio (OR)=4.32, P=1.79 × 10(-8)), intronic to transcripts of SLCO1B3 and SLCO1B7, members of a family of hepatic transporter genes previously implicated in adverse drug reactions including simvastatin-induced myopathy and docetaxel-induced neutropenia. Exome array analysis identified gene-wide associations of uncommon non-synonymous variants within UBAP2 and STARD9. We additionally provide independent replication of a previously identified variant in HLA-DQB1 (OR=15.6, P=0.015, positive predictive value=35.1%). These results implicate biological pathways through which clozapine may act to cause this serious adverse effect.Molecular Psychiatry advance online publication, 12 July 2016; doi:10.1038/mp.2016.97.

  10. Genome-Wide Identification, Characterization and Expression Analysis of the TCP Gene Family in Prunus mume.

    PubMed

    Zhou, Yuzhen; Xu, Zongda; Zhao, Kai; Yang, Weiru; Cheng, Tangren; Wang, Jia; Zhang, Qixiang

    2016-01-01

    TCP proteins, belonging to a plant-specific transcription factors family, are known to have great functions in plant development, especially flower and leaf development. However, there is little information about this gene family in Prunus mume, which is widely cultivated in China as an ornamental and fruit tree. Here a genome-wide analysis of TCP genes was performed to explore their evolution in P. mume. Nineteen PmTCPs were identified and three of them contained putative miR319 target sites. Phylogenetic and comprehensive bioinformatics analyses of these genes revealed that different types of TCP genes had undergone different evolutionary processes and the genes in the same clade had similar chromosomal location, gene structure, and conserved domains. Expression analysis of these PmTCPs indicated that there were diverse expression patterns among different clades. Most TCP genes were predominantly expressed in flower, leaf, and stem, and showed high expression levels in the different stages of flower bud differentiation, especially in petal formation stage and gametophyte development. Genes in TCP-P subfamily had main roles in both flower development and gametophyte development. The CIN genes in double petal cultivars might have key roles in the formation of petal, while they were correlated with gametophyte development in the single petal cultivar. The CYC/TB1 type genes were highly detected in the formation of petal and pistil. The less-complex flower types of P. mume might result from the fact that there were only two CYC type genes present in P. mume and a lack of CYC2 genes to control the identity of flower types. These results lay the foundation for further study on the functions of TCP genes during flower development.

  11. Genome-Wide Identification, Characterization and Expression Analysis of the TCP Gene Family in Prunus mume

    PubMed Central

    Zhou, Yuzhen; Xu, Zongda; Zhao, Kai; Yang, Weiru; Cheng, Tangren; Wang, Jia; Zhang, Qixiang

    2016-01-01

    TCP proteins, belonging to a plant-specific transcription factors family, are known to have great functions in plant development, especially flower and leaf development. However, there is little information about this gene family in Prunus mume, which is widely cultivated in China as an ornamental and fruit tree. Here a genome-wide analysis of TCP genes was performed to explore their evolution in P. mume. Nineteen PmTCPs were identified and three of them contained putative miR319 target sites. Phylogenetic and comprehensive bioinformatics analyses of these genes revealed that different types of TCP genes had undergone different evolutionary processes and the genes in the same clade had similar chromosomal location, gene structure, and conserved domains. Expression analysis of these PmTCPs indicated that there were diverse expression patterns among different clades. Most TCP genes were predominantly expressed in flower, leaf, and stem, and showed high expression levels in the different stages of flower bud differentiation, especially in petal formation stage and gametophyte development. Genes in TCP-P subfamily had main roles in both flower development and gametophyte development. The CIN genes in double petal cultivars might have key roles in the formation of petal, while they were correlated with gametophyte development in the single petal cultivar. The CYC/TB1 type genes were highly detected in the formation of petal and pistil. The less-complex flower types of P. mume might result from the fact that there were only two CYC type genes present in P. mume and a lack of CYC2 genes to control the identity of flower types. These results lay the foundation for further study on the functions of TCP genes during flower development. PMID:27630648

  12. Meta-analysis of genome-wide association studies of attention deficit/hyperactivity disorder

    PubMed Central

    Neale, Benjamin M; Medland, Sarah E.; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V.; Nguyen, Thuy Trang; Schäfer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J.L.; Langely, Kate; O’Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P.; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D.; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan

    2010-01-01

    Objective Although twin and family studies have shown Attention Deficit/Hyperactivity Disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association scans (GWAS) have not yielded significant results, we conducted a meta-analysis of existing studies to boost statistical power. Method We used data from four projects: a) the Children’s Hospital of Philadelphia (CHOP), b) phase I of the International Multicenter ADHD Genetics project (IMAGE), c) phase II of IMAGE (IMAGE II), and d) the Pfizer funded study from the University of California, Los Angeles, Washington University and the Massachusetts General Hospital (PUWMa). The final sample size consisted of 2,064 trios, 896 cases and 2,455 controls. For each study, we imputed HapMap SNPs, computed association test statistics and transformed them to Z-scores, and then combined weighted Z-scores in a meta-analysis. Results No genome-wide significant associations were found, although an analysis of candidate genes suggests they may be involved in the disorder. Conclusions Given that ADHD is a highly heritable disorder, our negative results suggest that the effects of common ADHD risk variants must, individually, be very small or that other types of variants, e.g. rare ones, account for much of the disorder’s heritability. PMID:20732625

  13. Meta-analysis of genome-wide association studies of attention-deficit/hyperactivity disorder.

    PubMed

    Neale, Benjamin M; Medland, Sarah E; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V; Nguyen, Thuy Trang; Schäfer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J L; Langely, Kate; O'Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan

    2010-09-01

    Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association studies (GWAS) have not yielded significant results, we conducted a meta-analysis of existing studies to boost statistical power. We used data from four projects: a) the Children's Hospital of Philadelphia (CHOP); b) phase I of the International Multicenter ADHD Genetics project (IMAGE); c) phase II of IMAGE (IMAGE II); and d) the Pfizer-funded study from the University of California, Los Angeles, Washington University, and Massachusetts General Hospital (PUWMa). The final sample size consisted of 2,064 trios, 896 cases, and 2,455 controls. For each study, we imputed HapMap single nucleotide polymorphisms, computed association test statistics and transformed them to z-scores, and then combined weighted z-scores in a meta-analysis. No genome-wide significant associations were found, although an analysis of candidate genes suggests that they may be involved in the disorder. Given that ADHD is a highly heritable disorder, our negative results suggest that the effects of common ADHD risk variants must, individually, be very small or that other types of variants, e.g., rare ones, account for much of the disorder's heritability. 2010 American Academy of Child and Adolescent Psychiatry. Published by Elsevier Inc. All rights reserved.

  14. Genome-wide association analysis of age at onset and psychotic symptoms in bipolar disorder.

    PubMed

    Belmonte Mahon, Pamela; Pirooznia, Mehdi; Goes, Fernando S; Seifuddin, Fayaz; Steele, Jo; Lee, Phil Hyoun; Huang, Jie; Hamshere, Marian L; Depaulo, J Raymond; Kelsoe, John R; Rietschel, Marcella; Nöthen, Markus; Cichon, Sven; Gurling, Hugh; Purcell, Shaun; Smoller, Jordan W; Craddock, Nick; Schulze, Thomas G; McMahon, Francis J; Potash, James B; Zandi, Peter P

    2011-04-01

    Genome-wide association studies (GWAS) have identified several susceptibility loci for bipolar disorder (BP), most notably ANK3. However, most of the inherited risk for BP remains unexplained. One reason for the limited success may be the genetic heterogeneity of BP. Clinical sub-phenotypes of BP may identify more etiologically homogeneous subsets of patients, which can be studied with increased power to detect genetic variation. Here, we report on a mega-analysis of two widely studied sub-phenotypes of BP, age at onset and psychotic symptoms, which are familial and clinically significant. We combined data from three GWAS: NIMH Bipolar Disorder Genetic Association Information Network (GAIN-BP), NIMH Bipolar Disorder Genome Study (BiGS), and a German sample. The combined sample consisted of 2,836 BP cases with information on sub-phenotypes and 2,744 controls. Imputation was performed, resulting in 2.3 million SNPs available for analysis. No SNP reached genome-wide significance for either sub-phenotype. In addition, no SNP reached genome-wide significance in a meta-analysis with an independent replication sample. We had 80% power to detect associations with a common SNP at an OR of 1.6 for psychotic symptoms and a mean difference of 1.8 years in age at onset. Age at onset and psychotic symptoms in BP may be influenced by many genes of smaller effect sizes or other variants not measured well by SNP arrays, such as rare alleles.

  15. Genome-Wide Spectra of Transcription Insertions and Deletions Reveal That Slippage Depends on RNA:DNA Hybrid Complementarity

    PubMed Central

    Traverse, Charles C.

    2017-01-01

    ABSTRACT Advances in sequencing technologies have enabled direct quantification of genome-wide errors that occur during RNA transcription. These errors occur at rates that are orders of magnitude higher than rates during DNA replication, but due to technical difficulties such measurements have been limited to single-base substitutions and have not yet quantified the scope of transcription insertions and deletions. Previous reporter gene assay findings suggested that transcription indels are produced exclusively by elongation complex slippage at homopolymeric runs, so we enumerated indels across the protein-coding transcriptomes of Escherichia coli and Buchnera aphidicola, which differ widely in their genomic base compositions and incidence of repeat regions. As anticipated from prior assays, transcription insertions prevailed in homopolymeric runs of A and T; however, transcription deletions arose in much more complex sequences and were rarely associated with homopolymeric runs. By reconstructing the relocated positions of the elongation complex as inferred from the sequences inserted or deleted during transcription, we show that continuation of transcription after slippage hinges on the degree of nucleotide complementarity within the RNA:DNA hybrid at the new DNA template location. PMID:28851848

  16. A guide to genome-wide association analysis and post-analytic interrogation.

    PubMed

    Reed, Eric; Nunez, Sara; Kulp, David; Qian, Jing; Reilly, Muredach P; Foulkes, Andrea S

    2015-12-10

    This tutorial is a learning resource that outlines the basic process and provides specific software tools for implementing a complete genome-wide association analysis. Approaches to post-analytic visualization and interrogation of potentially novel findings are also presented. Applications are illustrated using the free and open-source R statistical computing and graphics software environment, Bioconductor software for bioinformatics and the UCSC Genome Browser. Complete genome-wide association data on 1401 individuals across 861,473 typed single nucleotide polymorphisms from the PennCATH study of coronary artery disease are used for illustration. All data and code, as well as additional instructional resources, are publicly available through the Open Resources in Statistical Genomics project: http://www.stat-gen.org.

  17. Meta-Analysis of Genome-Wide Linkage Scans of Attention Deficit Hyperactivity Disorder

    PubMed Central

    Zhou, Kaixin; Dempfle, Astrid; Arcos-Burgos, Mauricio; Bakker, Steven C.; Banaschewski, Tobias; Biederman, Joseph; Buitelaar, Jan; Castellanos, F.Xavier; Doyle, Alysa; Ebstein, Richard P.; Ekholm, Jenny; Forabosco, Paola; Franke, Barbara; Freitag, Christine; Friedel, Susann; Gill, Michael; Hebebrand, Johannes; Hinney, Anke; Jacob, Christian; Lesch, Klaus Peter; Loo, Sandra K.; Lopera, Francisco; McCracken, James T.; McGough, James J.; Meyer, Jobst; Mick, Eric; Miranda, Ana; Muenke, Maximilian; Mulas, Fernando; Nelson, Stanley F.; Nguyen, T.Trang; Oades, Robert D.; Ogdie, Matthew N.; Palacio, Juan David; Pineda, David; Reif, Andreas; Renner, Tobias J.; Roeyers, Herbert; Romanos, Marcel; Rothenberger, Aribert; Schäfer, Helmut; Sergeant, Joseph; Sinke, Richard J.; Smalley, Susan L.; Sonuga-Barke, Edmund; Steinhausen, Hans-Christoph; van der Meulen, Emma; Walitza, Susanne; Warnke, Andreas; Lewis, Cathryn M; Faraone, Stephen V.; Asherson, Philip

    2010-01-01

    Genetic contribution to the development of attention deficit hyperactivity disorder (ADHD) is well established. Seven independent genome-wide linkage scans have been performed to map loci that increase the risk for ADHD. Although significant linkage signals were identified in some of the studies, there has been limited replications between the various independent datasets. The current study gathered the results from all seven of the ADHD linkage scans and performed a Genome Scan Meta Analysis (GSMA) to identify the genomic region with most consistent linkage evidence across the studies. Genome-wide significant linkage (PSR=0.00034, POR=0.04) was identified on chromosome 16 between 64 and 83 Mb. In addition there are nine other genomic regions from the GSMA showing nominal or suggestive evidence of linkage. All these linkage results may be informative and focus the search for novel ADHD susceptibility genes. PMID:18988193

  18. Genome-wide identification of the potato WRKY transcription factor family.

    PubMed

    Zhang, Chao; Wang, Dongdong; Yang, Chenghui; Kong, Nana; Shi, Zheng; Zhao, Peng; Nan, Yunyou; Nie, Tengkun; Wang, Ruoqiu; Ma, Haoli; Chen, Qin

    2017-01-01

    WRKY transcription factors play pivotal roles in regulation of stress responses. This study identified 79 WRKY genes in potato (Solanum tuberosum). Based on multiple sequence alignment and phylogenetic relationships, WRKY genes were classified into three major groups. The majority of WRKY genes belonged to Group II (52 StWRKYs), Group III had 14 and Group I consisted of 13. The phylogenetic tree further classified Group II into five sub-groups. All StWRKY genes except StWRKY79 were mapped on potato chromosomes, with eight tandem duplication gene pairs and seven segmental duplication gene pairs found from StWRKY family genes. The expression analysis of 22 StWRKYs showed their differential expression levels under various stress conditions. Cis-element prediction showed that a large number of elements related to drought, heat and salicylic acid were present in the promotor regions of StWRKY genes. The expression analysis indicated that seven StWRKYs seemed to respond to stress (heat, drought and salinity) and salicylic acid treatment. These genes are candidates for abiotic stress signaling for further research.

  19. Genome-wide characterization of human L1 antisense promoter-driven transcripts.

    PubMed

    Criscione, Steven W; Theodosakis, Nicholas; Micevic, Goran; Cornish, Toby C; Burns, Kathleen H; Neretti, Nicola; Rodić, Nemanja

    2016-06-14

    Long INterspersed Element-1 (LINE-1 or L1) is the only autonomously active, transposable element in the human genome. L1 sequences comprise approximately 17 % of the human genome, but only the evolutionarily recent, human-specific subfamily is retrotransposition competent. The L1 promoter has a bidirectional orientation containing a sense promoter that drives the transcription of two proteins required for retrotransposition and an antisense promoter. The L1 antisense promoter can drive transcription of chimeric transcripts: 5' L1 antisense sequences spliced to the exons of neighboring genes. The impact of L1 antisense promoter activity on cellular transcriptomes is poorly understood. To investigate this, we analyzed GenBank ESTs for messenger RNAs that initiate in the L1 antisense promoter. We identified 988 putative L1 antisense chimeric transcripts, 911 of which have not been previously reported. These appear to be alternative genic transcripts, sense-oriented with respect to gene and initiating near, but typically downstream of, the gene transcriptional start site. In multiple cell lines, L1 antisense promoters display enrichment for YY1 transcription factor and histone modifications associated with active promoters. Global run-on sequencing data support the activity of the L1 antisense promoter. We independently detected 124 L1 antisense chimeric transcripts using long read Pacific Biosciences RNA-seq data. Furthermore, we validated four chimeric transcripts by quantitative RT-PCR and Sanger sequencing and demonstrated that they are readily detectable in many normal human tissues. We present a comprehensive characterization of human L1 antisense promoter-driven transcripts and provide substantial evidence that they are transcribed in a variety of human cell-types. Our findings reveal a new wide-reaching aspect of L1 biology by identifying antisense transcripts affecting as many as 4 % of all human genes.

  20. Genome-Wide Meta-Analysis of Longitudinal Alcohol Consumption Across Youth and Early Adulthood.

    PubMed

    Adkins, Daniel E; Clark, Shaunna L; Copeland, William E; Kennedy, Martin; Conway, Kevin; Angold, Adrian; Maes, Hermine; Liu, Youfang; Kumar, Gaurav; Erkanli, Alaattin; Patkar, Ashwin A; Silberg, Judy; Brown, Tyson H; Fergusson, David M; Horwood, L John; Eaves, Lindon; van den Oord, Edwin J C G; Sullivan, Patrick F; Costello, E J

    2015-08-01

    The public health burden of alcohol is unevenly distributed across the life course, with levels of use, abuse, and dependence increasing across adolescence and peaking in early adulthood. Here, we leverage this temporal patterning to search for common genetic variants predicting developmental trajectories of alcohol consumption. Comparable psychiatric evaluations measuring alcohol consumption were collected in three longitudinal community samples (N=2,126, obs=12,166). Consumption-repeated measurements spanning adolescence and early adulthood were analyzed using linear mixed models, estimating individual consumption trajectories, which were then tested for association with Illumina 660W-Quad genotype data (866,099 SNPs after imputation and QC). Association results were combined across samples using standard meta-analysis methods. Four meta-analysis associations satisfied our pre-determined genome-wide significance criterion (FDR<0.1) and six others met our 'suggestive' criterion (FDR<0.2). Genome-wide significant associations were highly biological plausible, including associations within GABA transporter 1, SLC6A1 (solute carrier family 6, member 1), and exonic hits in LOC100129340 (mitofusin-1-like). Pathway analyses elaborated single marker results, indicating significant enriched associations to intuitive biological mechanisms, including neurotransmission, xenobiotic pharmacodynamics, and nuclear hormone receptors (NHR). These findings underscore the value of combining longitudinal behavioral data and genome-wide genotype information in order to study developmental patterns and improve statistical power in genomic studies.

  1. Five endometrial cancer risk loci identified through genome-wide association analysis.

    PubMed

    Cheng, Timothy H T; Thompson, Deborah J; O'Mara, Tracy A; Painter, Jodie N; Glubb, Dylan M; Flach, Susanne; Lewis, Annabelle; French, Juliet D; Freeman-Mills, Luke; Church, David; Gorman, Maggie; Martin, Lynn; Hodgson, Shirley; Webb, Penelope M; Attia, John; Holliday, Elizabeth G; McEvoy, Mark; Scott, Rodney J; Henders, Anjali K; Martin, Nicholas G; Montgomery, Grant W; Nyholt, Dale R; Ahmed, Shahana; Healey, Catherine S; Shah, Mitul; Dennis, Joe; Fasching, Peter A; Beckmann, Matthias W; Hein, Alexander; Ekici, Arif B; Hall, Per; Czene, Kamila; Darabi, Hatef; Li, Jingmei; Dörk, Thilo; Dürst, Matthias; Hillemanns, Peter; Runnebaum, Ingo; Amant, Frederic; Schrauwen, Stefanie; Zhao, Hui; Lambrechts, Diether; Depreeuw, Jeroen; Dowdy, Sean C; Goode, Ellen L; Fridley, Brooke L; Winham, Stacey J; Njølstad, Tormund S; Salvesen, Helga B; Trovik, Jone; Werner, Henrica M J; Ashton, Katie; Otton, Geoffrey; Proietto, Tony; Liu, Tao; Mints, Miriam; Tham, Emma; Li, Mulin Jun; Yip, Shun H; Wang, Junwen; Bolla, Manjeet K; Michailidou, Kyriaki; Wang, Qin; Tyrer, Jonathan P; Dunlop, Malcolm; Houlston, Richard; Palles, Claire; Hopper, John L; Peto, Julian; Swerdlow, Anthony J; Burwinkel, Barbara; Brenner, Hermann; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Chang-Claude, Jenny; Couch, Fergus J; Giles, Graham G; Kristensen, Vessela N; Cox, Angela; Cunningham, Julie M; Pharoah, Paul D P; Dunning, Alison M; Edwards, Stacey L; Easton, Douglas F; Tomlinson, Ian; Spurdle, Amanda B

    2016-06-01

    We conducted a meta-analysis of three endometrial cancer genome-wide association studies (GWAS) and two follow-up phases totaling 7,737 endometrial cancer cases and 37,144 controls of European ancestry. Genome-wide imputation and meta-analysis identified five new risk loci of genome-wide significance at likely regulatory regions on chromosomes 13q22.1 (rs11841589, near KLF5), 6q22.31 (rs13328298, in LOC643623 and near HEY2 and NCOA7), 8q24.21 (rs4733613, telomeric to MYC), 15q15.1 (rs937213, in EIF2AK4, near BMF) and 14q32.33 (rs2498796, in AKT1, near SIVA1). We also found a second independent 8q24.21 signal (rs17232730). Functional studies of the 13q22.1 locus showed that rs9600103 (pairwise r(2) = 0.98 with rs11841589) is located in a region of active chromatin that interacts with the KLF5 promoter region. The rs9600103[T] allele that is protective in endometrial cancer suppressed gene expression in vitro, suggesting that regulation of the expression of KLF5, a gene linked to uterine development, is implicated in tumorigenesis. These findings provide enhanced insight into the genetic and biological basis of endometrial cancer.

  2. Genome-wide gene-environment interaction analysis for asbestos exposure in lung cancer susceptibility.

    PubMed

    Wei, Sheng; Wang, Li-E; McHugh, Michelle K; Han, Younghun; Xiong, Momiao; Amos, Christopher I; Spitz, Margaret R; Wei, Qingyi Wei

    2012-08-01

    Asbestos exposure is a known risk factor for lung cancer. Although recent genome-wide association studies (GWASs) have identified some novel loci for lung cancer risk, few addressed genome-wide gene-environment interactions. To determine gene-asbestos interactions in lung cancer risk, we conducted genome-wide gene-environment interaction analyses at levels of single nucleotide polymorphisms (SNPs), genes and pathways, using our published Texas lung cancer GWAS dataset. This dataset included 317 498 SNPs from 1154 lung cancer cases and 1137 cancer-free controls. The initial SNP-level P-values for interactions between genetic variants and self-reported asbestos exposure were estimated by unconditional logistic regression models with adjustment for age, sex, smoking status and pack-years. The P-value for the most significant SNP rs13383928 was 2.17×10(-6), which did not reach the genome-wide statistical significance. Using a versatile gene-based test approach, we found that the top significant gene was C7orf54, located on 7q32.1 (P = 8.90×10(-5)). Interestingly, most of the other significant genes were located on 11q13. When we used an improved gene-set-enrichment analysis approach, we found that the Fas signaling pathway and the antigen processing and presentation pathway were most significant (nominal P < 0.001; false discovery rate < 0.05) among 250 pathways containing 17 572 genes. We believe that our analysis is a pilot study that first describes the gene-asbestos interaction in lung cancer risk at levels of SNPs, genes and pathways. Our findings suggest that immune function regulation-related pathways may be mechanistically involved in asbestos-associated lung cancer risk.

  3. Genome-wide DNA methylation analysis in obsessive-compulsive disorder patients.

    PubMed

    Yue, Weihua; Cheng, Weiqiu; Liu, Zhaorui; Tang, Yi; Lu, Tianlan; Zhang, Dai; Tang, Muni; Huang, Yueqin

    2016-08-16

    Literatures have suggested that not only genetic but also environmental factors, interactively accounted for susceptibility of obsessive-compulsive disorder (OCD). DNA methylation may regulate expression of genes as the heritable epigenetic modification. The examination for genome-wide DNA methylation was performed on blood samples from 65 patients with OCD, as well as 96 healthy control subjects. The DNA methylation was examined at over 485,000 CpG sites using the Illumina Infinium Human Methylation450 BeadChip. As a result, 8,417 probes corresponding to 2,190 unique genes were found to be differentially methylated between OCD and healthy control subjects. Of those genes, 4,013 loci were located in CpG islands and 2,478 were in promoter regions. These included BCYRN1, BCOR, FGF13, HLA-DRB1, ARX, etc., which have previously been reported to be associated with OCD. Pathway analyses indicated that regulation of actin cytoskeleton, cell adhesion molecules (CAMs), actin binding, transcription regulator activity, and other pathways might be further associated with risk of OCD. Unsupervised clustering analysis of the top 3,000 most variable probes revealed two distinct groups with significantly more people with OCD in cluster one compared with controls (67.74% of cases v.s. 27.13% of controls, Chi-square = 26.011, df = 1, P = 3.41E-07). These results strongly suggested that differential DNA methylation might play an important role in etiology of OCD.

  4. Genome-wide DNA methylation analysis in obsessive-compulsive disorder patients

    PubMed Central

    Yue, Weihua; Cheng, Weiqiu; Liu, Zhaorui; Tang, Yi; Lu, Tianlan; Zhang, Dai; Tang, Muni; Huang, Yueqin

    2016-01-01

    Literatures have suggested that not only genetic but also environmental factors, interactively accounted for susceptibility of obsessive-compulsive disorder (OCD). DNA methylation may regulate expression of genes as the heritable epigenetic modification. The examination for genome-wide DNA methylation was performed on blood samples from 65 patients with OCD, as well as 96 healthy control subjects. The DNA methylation was examined at over 485,000 CpG sites using the Illumina Infinium Human Methylation450 BeadChip. As a result, 8,417 probes corresponding to 2,190 unique genes were found to be differentially methylated between OCD and healthy control subjects. Of those genes, 4,013 loci were located in CpG islands and 2,478 were in promoter regions. These included BCYRN1, BCOR, FGF13, HLA-DRB1, ARX, etc., which have previously been reported to be associated with OCD. Pathway analyses indicated that regulation of actin cytoskeleton, cell adhesion molecules (CAMs), actin binding, transcription regulator activity, and other pathways might be further associated with risk of OCD. Unsupervised clustering analysis of the top 3,000 most variable probes revealed two distinct groups with significantly more people with OCD in cluster one compared with controls (67.74% of cases v.s. 27.13% of controls, Chi-square = 26.011, df = 1, P = 3.41E-07). These results strongly suggested that differential DNA methylation might play an important role in etiology of OCD. PMID:27527274

  5. Quantitative Models of the Mechanisms That Control Genome-Wide Patterns of Transcription Factor Binding during Early Drosophila Development

    PubMed Central

    Kaplan, Tommy; Li, Xiao-Yong; Sabo, Peter J.; Thomas, Sean; Stamatoyannopoulos, John A.; Biggin, Mark D.; Eisen, Michael B.

    2011-01-01

    Transcription factors that drive complex patterns of gene expression during animal development bind to thousands of genomic regions, with quantitative differences in binding across bound regions mediating their activity. While we now have tools to characterize the DNA affinities of these proteins and to precisely measure their genome-wide distribution in vivo, our understanding of the forces that determine where, when, and to what extent they bind remains primitive. Here we use a thermodynamic model of transcription factor binding to evaluate the contribution of different biophysical forces to the binding of five regulators of early embryonic anterior-posterior patterning in Drosophila melanogaster. Predictions based on DNA sequence and in vitro protein-DNA affinities alone achieve a correlation of ∼0.4 with experimental measurements of in vivo binding. Incorporating cooperativity and competition among the five factors, and accounting for spatial patterning by modeling binding in every nucleus independently, had little effect on prediction accuracy. A major source of error was the prediction of binding events that do not occur in vivo, which we hypothesized reflected reduced accessibility of chromatin. To test this, we incorporated experimental measurements of genome-wide DNA accessibility into our model, effectively restricting predicted binding to regions of open chromatin. This dramatically improved our predictions to a correlation of 0.6–0.9 for various factors across known target genes. Finally, we used our model to quantify the roles of DNA sequence, accessibility, and binding competition and cooperativity. Our results show that, in regions of open chromatin, binding can be predicted almost exclusively by the sequence specificity of individual factors, with a minimal role for protein interactions. We suggest that a combination of experimentally determined chromatin accessibility data and simple computational models of transcription factor binding may be

  6. Genetic determinants of common epilepsies: a meta-analysis of genome-wide association studies

    PubMed Central

    2014-01-01

    Summary Background The epilepsies are a clinically heterogeneous group of neurological disorders. Despite strong evidence for heritability, genome-wide association studies have had little success in identification of risk loci associated with epilepsy, probably because of relatively small sample sizes and insufficient power. We aimed to identify risk loci through meta-analyses of genome-wide association studies for all epilepsy and the two largest clinical subtypes (genetic generalised epilepsy and focal epilepsy). Methods We combined genome-wide association data from 12 cohorts of individuals with epilepsy and controls from population-based datasets. Controls were ethnically matched with cases. We phenotyped individuals with epilepsy into categories of genetic generalised epilepsy, focal epilepsy, or unclassified epilepsy. After standardised filtering for quality control and imputation to account for different genotyping platforms across sites, investigators at each site conducted a linear mixed-model association analysis for each dataset. Combining summary statistics, we conducted fixed-effects meta-analyses of all epilepsy, focal epilepsy, and genetic generalised epilepsy. We set the genome-wide significance threshold at p<1·66 × 10−8. Findings We included 8696 cases and 26 157 controls in our analysis. Meta-analysis of the all-epilepsy cohort identified loci at 2q24.3 (p=8·71 × 10−10), implicating SCN1A, and at 4p15.1 (p=5·44 × 10−9), harbouring PCDH7, which encodes a protocadherin molecule not previously implicated in epilepsy. For the cohort of genetic generalised epilepsy, we noted a single signal at 2p16.1 (p=9·99 × 10−9), implicating VRK2 or FANCL. No single nucleotide polymorphism achieved genome-wide significance for focal epilepsy. Interpretation This meta-analysis describes a new locus not previously implicated in epilepsy and provides further evidence about the genetic architecture of these disorders, with the

  7. TRANSIENT GENOME-WIDE TRANSCRIPTIONAL RESPONSE TO LOW-DOSE IONIZING RADIATION IN VIVO IN HUMANS

    PubMed Central

    Berglund, Susanne R.; Rocke, David M.; Dai, Jian; Schwietert, Chad W.; Santana, Alison; Stern, Robin L.; Lehmann, Joerg; Hartmann Siantar, Christine L.; Goldberg, Zelanna

    2009-01-01

    Purpose The in vivo effects of low-dose low linear energy transfer ionizing radiation on healthy human skin are largely unknown. Using a patient-based tissue acquisition protocol, we have performed a series of genomic analyses on the temporal dynamics over a 24-hour period to determine the radiation response after a single exposure of 10 cGy. Methods and Materials RNA from each patient tissue sample was hybridized to an Affymetrix Human Genome U133 Plus 2.0 array. Data analysis was performed on selected gene groups and pathways. Results Nineteen gene groups and seven gene pathways that had been shown to be radiation responsive were analyzed. Of these, nine gene groups showed significant transient transcriptional changes in the human tissue samples, which returned to baseline by 24 hours postexposure. Conclusions Low doses of ionizing radiation on full-thickness human skin produce a definable temporal response out to 24 hours postexposure. Genes involved in DNA and tissue remodeling, cell cycle transition, and inflammation show statistically significant changes in expression, despite variability between patients. These data serve as a reference for the temporal dynamics of ionizing radiation response following low-dose exposure in healthy full-thickness human skin. PMID:17996396

  8. Growth Stimulatory Effects and Genome-Wide Transcriptional Changes Produced by Protein Hydrolysates in Maize Seedlings

    PubMed Central

    Santi, Chiara; Zamboni, Anita; Varanini, Zeno; Pandolfini, Tiziana

    2017-01-01

    Protein hydrolysates are an emerging class of crop management products utilized for improving nutrient assimilation and mitigating crop stress. They generally consist of a mixture of peptides and free amino acids derived from the hydrolysis of plant or animal sources. The present work was aimed at studying the effects and the action mechanisms of a protein hydrolysate derived from animal residues on maize root growth and physiology in comparison with the effects induced by either free amino acids or inorganic N supply. The application of the protein hydrolysate caused a remarkable enhancement of root growth. In particular, in the protein hydrolysate-treated plants the length and surface area of lateral roots were about 7 and 1.5 times higher than in plants treated with inorganic N or free amino acids, respectively. The root growth promoting effect of the protein hydrolysate was associated with an increased root accumulation of K, Zn, Cu, and Mn when compared with inorganic N and amino acids treatments. A microarray analysis allowed to dissect the transcriptional changes induced by the different treatments demonstrating treatment-specific effects principally on cell wall organization, transport processes, stress responses and hormone metabolism. PMID:28424716

  9. Transient Genome-Wide Transcriptional Response to Low-Dose Ionizing Radiation In Vivo in Humans

    SciTech Connect

    Berglund, Susanne R.; Rocke, David M.; Dai Jian; Schwietert, Chad W.; Santana, Alison; Stern, Robin L.; Lehmann, Joerg; Hartmann Siantar, Christine L.; Goldberg, Zelanna

    2008-01-01

    Purpose: The in vivo effects of low-dose low linear energy transfer ionizing radiation on healthy human skin are largely unknown. Using a patient-based tissue acquisition protocol, we have performed a series of genomic analyses on the temporal dynamics over a 24-hour period to determine the radiation response after a single exposure of 10 cGy. Methods and Materials: RNA from each patient tissue sample was hybridized to an Affymetrix Human Genome U133 Plus 2.0 array. Data analysis was performed on selected gene groups and pathways. Results: Nineteen gene groups and seven gene pathways that had been shown to be radiation responsive were analyzed. Of these, nine gene groups showed significant transient transcriptional changes in the human tissue samples, which returned to baseline by 24 hours postexposure. Conclusions: Low doses of ionizing radiation on full-thickness human skin produce a definable temporal response out to 24 hours postexposure. Genes involved in DNA and tissue remodeling, cell cycle transition, and inflammation show statistically significant changes in expression, despite variability between patients. These data serve as a reference for the temporal dynamics of ionizing radiation response following low-dose exposure in healthy full-thickness human skin.

  10. Genome-wide characterization of monomeric transcriptional regulators in Mycobacterium tuberculosis.

    PubMed

    Feng, Lipeng; Chen, Zhenkang; Wang, Zhongwei; Hu, Yangbo; Chen, Shiyun

    2016-05-01

    Gene transcription catalysed by RNA polymerase is regulated by transcriptional regulators, which play central roles in the control of gene transcription in both eukaryotes and prokaryotes. In regulating gene transcription, many regulators form dimers that bind to DNA with repeated motifs. However, some regulators function as monomers, but their mechanisms of gene expression control are largely uncharacterized. Here we systematically characterized monomeric versus dimeric regulators in the tuberculosis causative agent Mycobacterium tuberculosis. Of the >160 transcriptional regulators annotated in M. tuberculosis, 154 transcriptional regulators were tested, 22 % probably act as monomers and most are annotated as hypothetical regulators. Notably, all members of the WhiB-like protein family are classified as monomers. To further investigate mechanisms of monomeric regulators, we analysed the actions of these WhiB proteins and found that the majority interact with the principal sigma factor σA, which is also a monomeric protein within the RNA polymerase holoenzyme. Taken together, our study for the first time globally classified monomeric regulators in M. tuberculosis and suggested a mechanism for monomeric regulators in controlling gene transcription through interacting with monomeric sigma factors.

  11. Genome-wide analysis of long-term evolutionary domestication in Drosophila melanogaster

    PubMed Central

    Phillips, Mark A.; Long, Anthony D.; Greenspan, Zachary S.; Greer, Lee F.; Burke, Molly K.; Villeponteau, Bryant; Matsagas, Kennedy C.; Rizza, Cristina L.; Mueller, Laurence D.; Rose, Michael R.

    2016-01-01

    Experimental evolutionary genomics now allows biologists to test fundamental theories concerning the genetic basis of adaptation. We have conducted one of the longest laboratory evolution experiments with any sexually-reproducing metazoan, Drosophila melanogaster. We used next-generation resequencing data from this experiment to examine genome-wide patterns of genetic variation over an evolutionary time-scale that approaches 1,000 generations. We also compared measures of variation within and differentiation between our populations to simulations based on a variety of evolutionary scenarios. Our analysis yielded no clear evidence of hard selective sweeps, whereby natural selection acts to increase the frequency of a newly-arising mutation in a population until it becomes fixed. We do find evidence for selection acting on standing genetic variation, as independent replicate populations exhibit similar population-genetic dynamics, without obvious fixation of candidate alleles under selection. A hidden-Markov model test for selection also found widespread evidence for selection. We found more genetic variation genome-wide, and less differentiation between replicate populations genome-wide, than arose in any of our simulated evolutionary scenarios. PMID:28004838

  12. Genome-wide association analysis of age-at-onset in Alzheimer’s disease

    PubMed Central

    Kamboh, M. Ilyas; Barmada, M. Michael; Demirci, F. Yesim; Minster, Ryan L.; Carrasquillo, Minerva M.; Pankratz, V. Shane; Younkin, Steven G.; Saykin, Andrew J.; Sweet, Robert A.; Feingold, Eleanor; DeKosky, Steven T.; Lopez, Oscar L.

    2011-01-01

    The risk of Alzheimer’s disease (AD) is strongly determined by genetic factors and recent genome-wide association studies (GWAS) have identified several genes for the disease risk. In addition to the disease risk, age-at-onset (AAO) of AD has also strong genetic component with an estimated heritability of 42%. Identification of AAO genes may help to understand the biological mechanisms that regulate the onset of the disease. Here we report the first GWAS focused on identifying genes for the AAO of AD. We performed a genome-wide meta analysis on 3 samples comprising a total of 2,222 AD cases. A total of ~2.5 million directly genotyped or imputed SNPs were analyzed in relation to AAO of AD. As expected, the most significant associations were observed in the APOE region on chromosome 19 where several SNPs surpassed the conservative genome-wide significant threshold (P<5E-08). The most significant SNP outside the APOE region was located in the DCHS2 gene on chromosome 4q31.3 (rs1466662; P=4.95E-07). There were 19 additional significant SNPs in this region at P<1E-04 and the DCHS2 gene is expressed in the cerebral cortex and thus is a potential candidate for affecting AAO in AD. These findings need to be confirmed in additional well-powered samples. PMID:22005931

  13. Host cell factors in HIV replication: meta-analysis of genome-wide studies.

    PubMed

    Bushman, Frederic D; Malani, Nirav; Fernandes, Jason; D'Orso, Iván; Cagney, Gerard; Diamond, Tracy L; Zhou, Honglin; Hazuda, Daria J; Espeseth, Amy S; König, Renate; Bandyopadhyay, Sourav; Ideker, Trey; Goff, Stephen P; Krogan, Nevan J; Frankel, Alan D; Young, John A T; Chanda, Sumit K

    2009-05-01

    We have analyzed host cell genes linked to HIV replication that were identified in nine genome-wide studies, including three independent siRNA screens. Overlaps among the siRNA screens were very modest (<7% for any pairwise combination), and similarly, only modest overlaps were seen in pairwise comparisons with other types of genome-wide studies. Combining all genes from the genome-wide studies together with genes reported in the literature to affect HIV yields 2,410 protein-coding genes, or fully 9.5% of all human genes (though of course some of these are false positive calls). Here we report an "encyclopedia" of all overlaps between studies (available at http://www.hostpathogen.org), which yielded a more extensively corroborated set of host factors assisting HIV replication. We used these genes to calculate refined networks that specify cellular subsystems recruited by HIV to assist in replication, and present additional analysis specifying host cell genes that are attractive as potential therapeutic targets.

  14. Genome-wide Association Analysis Identifies 14 New Risk Loci for Schizophrenia

    PubMed Central

    Ripke, Stephan; O'Dushlaine, Colm; Chambert, Kimberly; Moran, Jennifer L; Kähler, Anna K; Akterin, Susanne; Bergen, Sarah; Collins, Ann L; Crowley, James J; Fromer, Menachem; Kim, Yunjung; Lee, Sang Hong; Magnusson, Patrik KE; Sanchez, Nick; Stahl, Eli A; Williams, Stephanie; Wray, Naomi R; Xia, Kai; Bettella, Francesco; Børglum, Anders D; Bulik-Sullivan, Brendan K; Cormican, Paul; Craddock, Nick; de Leeuw, Christiaan; Durmishi, Naser; Gill, Michael; Golimbet, Vera; Hamshere, Marian L; Holmans, Peter; Hougaard, David M; Kendler, Kenneth S; Lin, Kuang; Morris, Derek W; Mors, Ole; Mortensen, Preben B; Neale, Benjamin M; O'Neill, Francis A; Owen, Michael J; Milovancevic, MilicaPejovic; Posthuma, Danielle; Powell, John; Richards, Alexander L; Riley, Brien P; Ruderfer, Douglas; Rujescu, Dan; Sigurdsson, Engilbert; Silagadze, Teimuraz; Smit, August B; Stefansson, Hreinn; Steinberg, Stacy; Suvisaari, Jaana; Tosato, Sarah; Verhage, Matthijs; Walters, James T; Bramon, Elvira; Corvin, Aiden P; O'Donovan, Michael C; Stefansson, Kari; Scolnick, Edward; Purcell, Shaun; McCarroll, Steve; Sklar, Pamela; Hultman, Christina M; Sullivan, Patrick F

    2013-01-01

    Schizophrenia is a heritable disorder with substantial public health impact. We conducted a multi-stage genome-wide association study (GWAS) for schizophrenia beginning with a Swedish national sample (5,001 cases, 6,243 controls) followed by meta-analysis with prior schizophrenia GWAS (8,832 cases, 12,067 controls) and finally by replication of SNPs in 168 genomic regions in independent samples (7,413 cases, 19,762 controls, and 581 trios). In total, 22 regions met genome-wide significance (14 novel and one previously implicated in bipolar disorder). The results strongly implicate calcium signaling in the etiology of schizophrenia, and include genome-wide significant results for CACNA1C and CACNB2 whose protein products interact. We estimate that ∼8,300 independent and predominantly common SNPs contribute to risk for schizophrenia and that these collectively account for most of its heritability. Common genetic variation plays an important role in the etiology of schizophrenia, and larger studies will allow more detailed understanding of this devastating disorder. PMID:23974872

  15. Genome-Wide Association between Transcription Factor Expression and Chromatin Accessibility Reveals Regulators of Chromatin Accessibility.

    PubMed

    Lamparter, David; Marbach, Daniel; Rueedi, Rico; Bergmann, Sven; Kutalik, Zoltán

    2017-01-01

    To better understand genome regulation, it is important to uncover the role of transcription factors in the process of chromatin structure establishment and maintenance. Here we present a data-driven approach to systematically characterise transcription factors that are relevant for this process. Our method uses a linear mixed modelling approach to combine datasets of transcription factor binding motif enrichments in open chromatin and gene expression across the same set of cell lines. Applying this approach to the ENCODE dataset, we confirm already known and imply numerous novel transcription factors that play a role in the establishment or maintenance of open chromatin. In particular, our approach rediscovers many factors that have been annotated as pioneer factors.

  16. Genome-Wide Association between Transcription Factor Expression and Chromatin Accessibility Reveals Regulators of Chromatin Accessibility

    PubMed Central

    Rueedi, Rico

    2017-01-01

    To better understand genome regulation, it is important to uncover the role of transcription factors in the process of chromatin structure establishment and maintenance. Here we present a data-driven approach to systematically characterise transcription factors that are relevant for this process. Our method uses a linear mixed modelling approach to combine datasets of transcription factor binding motif enrichments in open chromatin and gene expression across the same set of cell lines. Applying this approach to the ENCODE dataset, we confirm already known and imply numerous novel transcription factors that play a role in the establishment or maintenance of open chromatin. In particular, our approach rediscovers many factors that have been annotated as pioneer factors. PMID:28118358

  17. The Genetic Architecture of the Genome-Wide Transcriptional Response to ER Stress in the Mouse

    PubMed Central

    Chow, Clement Y; Wang, Xu; Riccardi, David; Wolfner, Mariana F.; Clark, Andrew G.

    2015-01-01

    Endoplasmic reticulum (ER) stress occurs when misfolded proteins accumulate in the ER. The cellular response to ER stress involves complex transcriptional and translational changes, important to the survival of the cell. ER stress is a primary cause and a modifier of many human diseases. A first step to understanding how the ER stress response impacts human disease is to determine how the transcriptional response to ER stress varies among individuals. The genetic diversity of the eight mouse Collaborative Cross (CC) founder strains allowed us to determine how genetic variation impacts the ER stress transcriptional response. We used tunicamycin, a drug commonly used to induce ER stress, to elicit an ER stress response in mouse embryonic fibroblasts (MEFs) derived from the CC founder strains and measured their transcriptional responses. We identified hundreds of genes that differed in response to ER stress across these genetically diverse strains. Strikingly, inflammatory response genes differed most between strains; major canonical ER stress response genes showed relatively invariant responses across strains. To uncover the genetic architecture underlying these strain differences in ER stress response, we measured the transcriptional response to ER stress in MEFs derived from a subset of F1 crosses between the CC founder strains. We found a unique layer of regulatory variation that is only detectable under ER stress conditions. Over 80% of the regulatory variation under ER stress derives from cis-regulatory differences. This is the first study to characterize the genetic variation in ER stress transcriptional response in the laboratory mouse. Our findings indicate that the ER stress transcriptional response is highly variable among strains and arises from genetic variation in individual downstream response genes, rather than major signaling transcription factors. These results have important implications for understanding how genetic variation impacts the ER stress

  18. Genome-Wide Identification and Function Analyses of Heat Shock Transcription Factors in Potato

    PubMed Central

    Tang, Ruimin; Zhu, Wenjiao; Song, Xiaoyan; Lin, Xingzhong; Cai, Jinghui; Wang, Man; Yang, Qing

    2016-01-01

    Heat shock transcription factors (Hsfs) play vital roles in the regulation of tolerance to various stresses in living organisms. To dissect the mechanisms of the Hsfs in potato adaptation to abiotic stresses, genome and transcriptome analyses of Hsf gene family were investigated in Solanum tuberosum L. Twenty-seven StHsf members were identified by bioinformatics and phylogenetic analyses and were classified into A, B, and C groups according to their structural and phylogenetic features. StHsfs in the same class shared similar gene structures and conserved motifs. The chromosomal location analysis showed that 27 Hsfs were located in 10 of 12 chromosomes (except chromosome 1 and chromosome 5) and that 18 of these genes formed 9 paralogous pairs. Expression profiles of StHsfs in 12 different organs and tissues uncovered distinct spatial expression patterns of these genes and their potential roles in the process of growth and development. Promoter and quantitative real-time polymerase chain reaction (qRT-PCR) detections of StHsfs were conducted and demonstrated that these genes were all responsive to various stresses. StHsf004, StHsf007, StHsf009, StHsf014, and StHsf019 were constitutively expressed under non-stress conditions, and some specific Hsfs became the predominant Hsfs in response to different abiotic stresses, indicating their important and diverse regulatory roles in adverse conditions. A co-expression network between StHsfs and StHsf -co-expressed genes was generated based on the publicly-available potato transcriptomic databases and identified key candidate StHsfs for further functional studies. PMID:27148315

  19. Genome-wide transcriptional profiling of the Bacillus subtilis cold-shock response.

    PubMed

    Kaan, Tanja; Homuth, Georg; Mäder, Ulrike; Bandow, Julia; Schweder, Thomas

    2002-11-01

    The transcriptome of Bacillus subtilis was analysed at different time points (30, 60 and 90 min) after a temperature downshift from 37 to 18 degrees C using DNA macroarrays. This approach allowed the identification of around 50 genes exhibiting an increased mRNA level and around 50 genes exhibiting a decreased mRNA level under cold-shock conditions. Many of the repressed genes encode enzymes involved in the biosynthesis of amino acids, nucleotides and coenzymes, indicating metabolic adaptation of the cells to the decreased growth rate at the lower temperature. The strongest cold-inducible gene encodes fatty acid desaturase, which forms unsaturated fatty acids from saturated phospholipid precursors, thereby increasing membrane fluidity. The cold-shock-induced increase of mRNA levels of the classical cold-shock genes cspB, cspC and cspD could be verified. Furthermore, besides many genes encoding proteins of unknown function, some genes encoding ribosomal proteins were transcriptionally up-regulated, which points to an adaptive reprogramming of the ribosomes under cold-shock conditions. Interestingly, the amount of mRNA specified by the operon ptb-bcd-buk-lpd-bkdA1-bkdA2-bkdB, which encodes enzymes involved in degradation of branched-chain amino acids, also increases after a temperature downshift. As cells utilize the isoleucine and valine degradation intermediates alpha-methylbutyryl-CoA and isobutyryl-CoA for synthesis of branched-chain fatty acids, this finding reflects the adaptation of membrane lipid composition, ensuring the maintenance of appropriate membrane fluidity at low temperatures. The results of the DNA array analyses were verified for several selected genes by RNA slot-blot analysis and compared with two-dimensional PAGE analyses.

  20. A resource for characterizing genome-wide binding and putative target genes of transcription factors expressed during secondary growth and wood formation in Populus

    Treesearch

    Lijun Liu; Trevor Ramsay; Matthew S. Zinkgraf; David Sundell; Nathaniel Robert Street; Vladimir Filkov; Andrew Groover

    2015-01-01

    Identifying transcription factor target genes is essential for modeling the transcriptional networks underlying developmental processes. Here we report a chromatin immunoprecipitation sequencing (ChIP-seq) resource consisting of genome-wide binding regions and associated putative target genes for four Populus homeodomain transcription factors...

  1. Comprehensive analysis of genome-wide DNA methylation across human polycystic ovary syndrome ovary granulosa cell

    PubMed Central

    Peng, Zhaofeng; Wang, Linlin; Du, Linqing; Niu, Wenbin; Sun, Yingpu

    2016-01-01

    Polycystic ovary syndrome (PCOS) affects approximately 7% of the reproductive-age women. A growing body of evidence indicated that epigenetic mechanisms contributed to the development of PCOS. The role of DNA modification in human PCOS ovary granulosa cell is still unknown in PCOS progression. Global DNA methylation and hydroxymethylation were detected between PCOS’ and controls’ granulosa cell. Genome-wide DNA methylation was profiled to investigate the putative function of DNA methylaiton. Selected genes expressions were analyzed between PCOS’ and controls’ granulosa cell. Our results showed that the granulosa cell global DNA methylation of PCOS patients was significant higher than the controls’. The global DNA hydroxymethylation showed low level and no statistical difference between PCOS and control. 6936 differentially methylated CpG sites were identified between control and PCOS-obesity. 12245 differential methylated CpG sites were detected between control and PCOS-nonobesity group. 5202 methylated CpG sites were significantly differential between PCOS-obesity and PCOS-nonobesity group. Our results showed that DNA methylation not hydroxymethylation altered genome-wide in PCOS granulosa cell. The different methylation genes were enriched in development protein, transcription factor activity, alternative splicing, sequence-specific DNA binding and embryonic morphogenesis. YWHAQ, NCF2, DHRS9 and SCNA were up-regulation in PCOS-obesity patients with no significance different between control and PCOS-nonobesity patients, which may be activated by lower DNA methylaiton. Global and genome-wide DNA methylation alteration may contribute to different genes expression and PCOS clinical pathology. PMID:27056885

  2. Genome-Wide Investigation and Expression Profiling of AP2/ERF Transcription Factor Superfamily in Foxtail Millet (Setaria italica L.)

    PubMed Central

    Lata, Charu; Mishra, Awdhesh Kumar; Muthamilarasan, Mehanathan; Bonthala, Venkata Suresh; Khan, Yusuf; Prasad, Manoj

    2014-01-01

    The APETALA2/ethylene-responsive element binding factor (AP2/ERF) family is one of the largest transcription factor (TF) families in plants that includes four major sub-families, namely AP2, DREB (dehydration responsive element binding), ERF (ethylene responsive factors) and RAV (Related to ABI3/VP). AP2/ERFs are known to play significant roles in various plant processes including growth and development and biotic and abiotic stress responses. Considering this, a comprehensive genome-wide study was conducted in foxtail millet (Setaria italica L.). A total of 171 AP2/ERF genes were identified by systematic sequence analysis and were physically mapped onto nine chromosomes. Phylogenetic analysis grouped AP2/ERF genes into six classes (I to VI). Duplication analysis revealed that 12 (∼7%) SiAP2/ERF genes were tandem repeated and 22 (∼13%) were segmentally duplicated. Comparative physical mapping between foxtail millet AP2/ERF genes and its orthologs of sorghum (18 genes), maize (14 genes), rice (9 genes) and Brachypodium (6 genes) showed the evolutionary insights of AP2/ERF gene family and also the decrease in orthology with increase in phylogenetic distance. The evolutionary significance in terms of gene-duplication and divergence was analyzed by estimating synonymous and non-synonymous substitution rates. Expression profiling of candidate AP2/ERF genes against drought, salt and phytohormones revealed insights into their precise and/or overlapping expression patterns which could be responsible for their functional divergence in foxtail millet. The study showed that the genes SiAP2/ERF-069, SiAP2/ERF-103 and SiAP2/ERF-120 may be considered as potential candidate genes for further functional validation as well for utilization in crop improvement programs for stress resistance since these genes were up-regulated under drought and salinity stresses in ABA dependent manner. Altogether the present study provides new insights into evolution, divergence and systematic

  3. Genome-wide characterization of JASMONATE-ZIM DOMAIN transcription repressors in wheat (Triticum aestivum L.).

    PubMed

    Wang, Yukun; Qiao, Linyi; Bai, Jianfang; Wang, Peng; Duan, Wenjing; Yuan, Shaohua; Yuan, Guoliang; Zhang, Fengting; Zhang, Liping; Zhao, Changping

    2017-02-13

    The JASMONATE-ZIM DOMAIN (JAZ) repressor family proteins are jasmonate co-receptors and transcriptional repressor in jasmonic acid (JA) signaling pathway, and they play important roles in regulating the growth and development of plants. Recently, more and more researches on JAZ gene family are reported in many plants. Although the genome sequencing of common wheat (Triticum aestivum L.) and its relatives is complete, our knowledge about this gene family remains vacant. Fourteen JAZ genes were identified in the wheat genome. Structural analysis revealed that the TaJAZ proteins in wheat were as conserved as those in other plants, but had structural characteristics. By phylogenetic analysis, all JAZ proteins from wheat and other plants were clustered into 11 sub-groups (G1-G11), and TaJAZ proteins shared a high degree of similarity with some JAZ proteins from Aegliops tauschii, Brachypodium distachyon and Oryza sativa. The Ka/Ks ratios of TaJAZ genes ranged from 0.0016 to 0.6973, suggesting that the TaJAZ family had undergone purifying selection in wheat. Gene expression patterns obtained by quantitative real-time PCR (qRT-PCR) revealed differential temporal and spatial regulation of TaJAZ genes under multifarious abiotic stress treatments of high salinity, drought, cold and phytohormone. Among these, TaJAZ7, 8 and 12 were specifically expressed in the anther tissues of the thermosensitive genic male sterile (TGMS) wheat line BS366 and normal control wheat line Jing411. Compared with the gene expression patterns in the normal wheat line Jing411, TaJAZ7, 8 and 12 had different expression patterns in abnormally dehiscent anthers of BS366 at the heading stage 6, suggesting that specific up- or down-regulation of these genes might be associated with the abnormal anther dehiscence in TGMS wheat line. This study analyzed the size and composition of the JAZ gene family in wheat, and investigated stress responsive and differential tissue-specific expression profiles of each

  4. Genome-Wide Transcriptional Regulation and Chromosome Structural Arrangement by GalR in E. coli

    PubMed Central

    Qian, Zhong; Trostel, Andrei; Lewis, Dale E. A.; Lee, Sang Jun; He, Ximiao; Stringer, Anne M.; Wade, Joseph T.; Schneider, Thomas D.; Durfee, Tim; Adhya, Sankar

    2016-01-01

    The regulatory protein, GalR, is known for controlling transcription of genes related to D-galactose metabolism in Escherichia coli. Here, using a combination of experimental and bioinformatic approaches, we identify novel GalR binding sites upstream of several genes whose function is not directly related to D-galactose metabolism. Moreover, we do not observe regulation of these genes by GalR under standard growth conditions. Thus, our data indicate a broader regulatory role for GalR, and suggest that regulation by GalR is modulated by other factors. Surprisingly, we detect regulation of 158 transcripts by GalR, with few regulated genes being associated with a nearby GalR binding site. Based on our earlier observation of long-range interactions between distally bound GalR dimers, we propose that GalR indirectly regulates the transcription of many genes by inducing large-scale restructuring of the chromosome. PMID:27900321

  5. Refining genome-wide linkage intervals using a meta-analysis of genome-wide association studies identifies loci influencing personality dimensions

    PubMed Central

    Amin, Najaf; Hottenga, Jouke-Jan; Hansell, Narelle K; Janssens, A Cecile JW; de Moor, Marleen HM; Madden, Pamela AF; Zorkoltseva, Irina V; Penninx, Brenda W; Terracciano, Antonio; Uda, Manuela; Tanaka, Toshiko; Esko, Tonu; Realo, Anu; Ferrucci, Luigi; Luciano, Michelle; Davies, Gail; Metspalu, Andres; Abecasis, Goncalo R; Deary, Ian J; Raikkonen, Katri; Bierut, Laura J; Costa, Paul T; Saviouk, Viatcheslav; Zhu, Gu; Kirichenko, Anatoly V; Isaacs, Aaron; Aulchenko, Yurii S; Willemsen, Gonneke; Heath, Andrew C; Pergadia, Michele L; Medland, Sarah E; Axenovich, Tatiana I; de Geus, Eco; Montgomery, Grant W; Wright, Margaret J; Oostra, Ben A; Martin, Nicholas G; Boomsma, Dorret I; van Duijn, Cornelia M

    2013-01-01

    Personality traits are complex phenotypes related to psychosomatic health. Individually, various gene finding methods have not achieved much success in finding genetic variants associated with personality traits. We performed a meta-analysis of four genome-wide linkage scans (N=6149 subjects) of five basic personality traits assessed with the NEO Five-Factor Inventory. We compared the significant regions from the meta-analysis of linkage scans with the results of a meta-analysis of genome-wide association studies (GWAS) (N∼17 000). We found significant evidence of linkage of neuroticism to chromosome 3p14 (rs1490265, LOD=4.67) and to chromosome 19q13 (rs628604, LOD=3.55); of extraversion to 14q32 (ATGG002, LOD=3.3); and of agreeableness to 3p25 (rs709160, LOD=3.67) and to two adjacent regions on chromosome 15, including 15q13 (rs970408, LOD=4.07) and 15q14 (rs1055356, LOD=3.52) in the individual scans. In the meta-analysis, we found strong evidence of linkage of extraversion to 4q34, 9q34, 10q24 and 11q22, openness to 2p25, 3q26, 9p21, 11q24, 15q26 and 19q13 and agreeableness to 4q34 and 19p13. Significant evidence of association in the GWAS was detected between openness and rs677035 at 11q24 (P-value=2.6 × 10−06, KCNJ1). The findings of our linkage meta-analysis and those of the GWAS suggest that 11q24 is a susceptible locus for openness, with KCNJ1 as the possible candidate gene. PMID:23211697

  6. Refining genome-wide linkage intervals using a meta-analysis of genome-wide association studies identifies loci influencing personality dimensions.

    PubMed

    Amin, Najaf; Hottenga, Jouke-Jan; Hansell, Narelle K; Janssens, A Cecile J W; de Moor, Marleen H M; Madden, Pamela A F; Zorkoltseva, Irina V; Penninx, Brenda W; Terracciano, Antonio; Uda, Manuela; Tanaka, Toshiko; Esko, Tonu; Realo, Anu; Ferrucci, Luigi; Luciano, Michelle; Davies, Gail; Metspalu, Andres; Abecasis, Goncalo R; Deary, Ian J; Raikkonen, Katri; Bierut, Laura J; Costa, Paul T; Saviouk, Viatcheslav; Zhu, Gu; Kirichenko, Anatoly V; Isaacs, Aaron; Aulchenko, Yurii S; Willemsen, Gonneke; Heath, Andrew C; Pergadia, Michele L; Medland, Sarah E; Axenovich, Tatiana I; de Geus, Eco; Montgomery, Grant W; Wright, Margaret J; Oostra, Ben A; Martin, Nicholas G; Boomsma, Dorret I; van Duijn, Cornelia M

    2013-08-01

    Personality traits are complex phenotypes related to psychosomatic health. Individually, various gene finding methods have not achieved much success in finding genetic variants associated with personality traits. We performed a meta-analysis of four genome-wide linkage scans (N=6149 subjects) of five basic personality traits assessed with the NEO Five-Factor Inventory. We compared the significant regions from the meta-analysis of linkage scans with the results of a meta-analysis of genome-wide association studies (GWAS) (N∼17 000). We found significant evidence of linkage of neuroticism to chromosome 3p14 (rs1490265, LOD=4.67) and to chromosome 19q13 (rs628604, LOD=3.55); of extraversion to 14q32 (ATGG002, LOD=3.3); and of agreeableness to 3p25 (rs709160, LOD=3.67) and to two adjacent regions on chromosome 15, including 15q13 (rs970408, LOD=4.07) and 15q14 (rs1055356, LOD=3.52) in the individual scans. In the meta-analysis, we found strong evidence of linkage of extraversion to 4q34, 9q34, 10q24 and 11q22, openness to 2p25, 3q26, 9p21, 11q24, 15q26 and 19q13 and agreeableness to 4q34 and 19p13. Significant evidence of association in the GWAS was detected between openness and rs677035 at 11q24 (P-value=2.6 × 10(-06), KCNJ1). The findings of our linkage meta-analysis and those of the GWAS suggest that 11q24 is a susceptible locus for openness, with KCNJ1 as the possible candidate gene.

  7. Genome-wide analysis of the TPX2 family proteins in Eucalyptus grandis.

    PubMed

    Du, Pingzhou; Kumar, Manoj; Yao, Yuan; Xie, Qiaoli; Wang, Jinyan; Zhang, Baolong; Gan, Siming; Wang, Yuqi; Wu, Ai-Min

    2016-11-24

    The Xklp2 (TPX2) proteins belong to the microtubule-associated (MAP) family of proteins. All members of the family contain the conserved TPX2 motif, which can interact with microtubules, regulate microtubule dynamics or assist with different microtubule functions, for example, maintenance of cell morphology or regulation of cell growth and development. However, the role of members of the TPX family have not been studied in the model tree species Eucalyptus to date. Here, we report the identification of the members of the TPX2 family in Eucalyptus grandis (Eg) and analyse the expression patterns and functions of these genes. In present study, a comprehensive analysis of the plant TPX2 family proteins was performed. Phylogenetic analyses indicated that the genes can be classified into 6 distinct subfamilies. A genome-wide survey identified 12 members of the TPX2 family in the sequenced genome of Eucalyptus grandis. The basic genetic properties of the TPX2 family in Eucalyptus were analysed. Our results suggest that the TPX2 family proteins within different sub-groups are relatively conserved but there are important differences between groups. Quantitative real-time PCR (qRT-PCR) was performed to confirm the expression levels of the genes in different tissues. The results showed that in the whole plant, the levels of EgWDL5 transcript are the highest, followed by those of EgWDL4. Compared with other tissues, the level of the EgMAP20 transcript is the highest in the root. Over-expression of EgMAP20 in Arabidopsis resulted in organ twisting. The cotyledon petioles showed left-handed twisting while the hypocotyl epidermal cells produced right-handed helical twisting. Finally, EgMAP20, EgWDL3 and EgWDL3L were all able to decorate microtubules. Plant TPX2 family proteins were systematically analysed using bioinformatics methods. There are 12 TPX2 family proteins in Eucalyptus. We have performed an initial characterization of the functions of several members of the TPX2

  8. Comparative analysis of genome-wide divergence, domestication footprints and genome-wide association study of root traits for Gossypium hirsutum and Gossypium barbadense

    USDA-ARS?s Scientific Manuscript database

    Use of 10,129 singleton SNPs of known genomic location in tetraploid cotton provided unique opportunities to characterize genome-wide diversity among 440 Gossypium hirsutum and 219 G. barbadense cultivars and landrace accessions of widespread origin. Using genome-wide distributed SNPs, we examined ...

  9. FVGWAS: Fast Voxelwise Genome Wide Association Analysis of Large-scale Imaging Genetic Data 1

    PubMed Central

    Huang, Meiyan; Nichols, Thomas; Huang, Chao; Yang, Yu; Lu, Zhaohua; Feng, Qianjing; Knickmeyer, Rebecca C; Zhu, Hongtu

    2015-01-01

    More and more large-scale imaging genetic studies are being widely conducted to collect a rich set of imaging, genetic, and clinical data to detect putative genes for complexly inherited neuropsychiatric and neurodegenerative disorders. Several major big-data challenges arise from testing genome-wide (NC > 12 million known variants) associations with signals at millions of locations (NV ~ 106) in the brain from thousands of subjects (n ~ 103). The aim of this paper is to develop a Fast Voxelwise Genome Wide Association analysiS (FVGWAS) framework to e ciently carry out whole-genome analyses of whole-brain data. FVGWAS consists of three components including a heteroscedastic linear model, a global sure independence screening (G-SIS) procedure, and a detection procedure based on wild bootstrap methods. Specifically, for standard linear association, the computational complexity is O(nNV NC) for voxelwise genome wide association analysis (VGWAS) method compared with O((NC + NV)n2) for FVGWAS. Simulation studies show that FVGWAS is an effcient method of searching sparse signals in an extremely large search space, while controlling for the family-wise error rate. Finally, we have successfully applied FVGWAS to a large-scale imaging genetic data analysis of ADNI data with 708 subjects, 193,275 voxels in RAVENS maps, and 501,584 SNPs, and the total processing time was 203,645 seconds for a single CPU. Our FVG-WAS may be a valuable statistical toolbox for large-scale imaging genetic analysis as the field is rapidly advancing with ultra-high-resolution imaging and whole-genome sequencing. PMID:26025292

  10. A functional genome-wide genetic screening identifies new pathways controlling the G1/S transcriptional wave.

    PubMed

    Gaspa, Laura; González-Medina, Alberto; Hidalgo, Elena; Ayté, José

    2016-01-01

    The Schizosaccharomyces pombe MBF complex activates the transcription of genes required for DNA synthesis and S phase. The MBF complex contains several proteins, including the core components Cdc10, Res1 and Res2, the co-repressor proteins Yox1 and Nrm1 and the co-activator Rep2. It has recently been shown how MBF is regulated when either the DNA damage or the DNA synthesis checkpoints are activated. However, how MBF is regulated in a normal unperturbed cell cycle is still not well understood. We have set up a genome-wide genomic screen searching for global regulators of MBF. We have crossed our knock-out collection library with a reporter strain that allows the measurement of MBF activity in live cells by flow cytometry. We confirm previously known regulators of MBF and show that COP9/signalosome and tRNA methyltransferases also regulate MBF activity.

  11. No Promoter Left Behind (NPLB): learn de novo promoter architectures from genome-wide transcription start sites

    PubMed Central

    Mitra, Sneha; Narlikar, Leelavati

    2016-01-01

    Summary: Promoters have diverse regulatory architectures and thus activate genes differently. For example, some have a TATA-box, many others do not. Even the ones with it can differ in its position relative to the transcription start site (TSS). No Promoter Left Behind (NPLB) is an efficient, organism-independent method for characterizing such diverse architectures directly from experimentally identified genome-wide TSSs, without relying on known promoter elements. As a test case, we show its application in identifying novel architectures in the fly genome. Availability and implementation: Web-server at http://nplb.ncl.res.in. Standalone also at https://github.com/computationalBiology/NPLB/ (Mac OSX/Linux). Contact: l.narlikar@ncl.res.in Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26530723

  12. Genome-wide analyses identify transcription factors required for proper morphogenesis of Drosophila sensory neuron dendrites

    PubMed Central

    Parrish, Jay Z.; Kim, Michael D.; Jan, Lily Yeh; Jan, Yuh Nung

    2006-01-01

    Dendrite arborization patterns are critical determinants of neuronal function. To explore the basis of transcriptional regulation in dendrite pattern formation, we used RNA interference (RNAi) to screen 730 transcriptional regulators and identified 78 genes involved in patterning the stereotyped dendritic arbors of class I da neurons in Drosophila. Most of these transcriptional regulators affect dendrite morphology without altering the number of class I dendrite arborization (da) neurons and fall primarily into three groups. Group A genes control both primary dendrite extension and lateral branching, hence the overall dendritic field. Nineteen genes within group A act to increase arborization, whereas 20 other genes restrict dendritic coverage. Group B genes appear to balance dendritic outgrowth and branching. Nineteen group B genes function to promote branching rather than outgrowth, and two others have the opposite effects. Finally, 10 group C genes are critical for the routing of the dendritic arbors of individual class I da neurons. Thus, multiple genetic programs operate to calibrate dendritic coverage, to coordinate the elaboration of primary versus secondary branches, and to lay out these dendritic branches in the proper orientation. PMID:16547170

  13. Genome-Wide Transcriptional Responses to Carbon Starvation in Nongrowing Lactococcus lactis

    PubMed Central

    Ercan, Onur; Wels, Michiel; Smid, Eddy J.

    2015-01-01

    This paper describes the transcriptional adaptations of nongrowing, retentostat cultures of Lactococcus lactis to starvation. Near-zero-growth cultures (μ = 0.0001 h−1) obtained by extended retentostat cultivation were exposed to starvation by termination of the medium supply for 24 h, followed by a recovery period of another 24 h by reinitiating the medium supply to the retentostat culture. During starvation, the viability of the culture was largely retained, and the expression of genes involved in transcription and translational machineries, cell division, and cell membrane energy metabolism was strongly repressed. Expression of these genes was largely recovered following the reinitiation of the medium supply. Starvation triggered the elevated expression of genes associated with synthesis of branched-chain amino acids, histidine, purine, and riboflavin. The expression of these biosynthesis genes was found to remain at an elevated level after reinitiation of the medium supply. In addition, starvation induced the complete gene set predicted to be involved in natural competence in L. lactis KF147, and the elevated expression of these genes was sustained during the subsequent recovery period, but our attempts to experimentally demonstrate natural transformation in these cells failed. Mining the starvation response gene set identified a conserved cis-acting element that resembles the lactococcal CodY motif in the upstream regions of genes associated with transcription and translational machineries, purine biosynthesis, and natural transformation in L. lactis, suggesting a role for CodY in the observed transcriptome adaptations to starvation in nongrowing cells. PMID:25636846

  14. Genome-wide features of neuroendocrine regulation in Drosophila by the basic helix-loop-helix transcription factor DIMMED

    PubMed Central

    Hadžić, Tarik; Park, Dongkook; Abruzzi, Katharine C.; Yang, Lin; Trigg, Jennifer S.; Rohs, Remo; Rosbash, Michael; Taghert, Paul H.

    2015-01-01

    Neuroendocrine (NE) cells use large dense core vesicles (LDCVs) to traffic, process, store and secrete neuropeptide hormones through the regulated secretory pathway. The dimmed (DIMM) basic helix-loop-helix transcription factor of Drosophila controls the level of regulated secretory activity in NE cells. To pursue its mechanisms, we have performed two independent genome-wide analyses of DIMM's activities: (i) in vivo chromatin immunoprecipitation (ChIP) to define genomic sites of DIMM occupancy and (ii) deep sequencing of purified DIMM neurons to characterize their transcriptional profile. By this combined approach, we showed that DIMM binds to conserved E-boxes in enhancers of 212 genes whose expression is enriched in DIMM-expressing NE cells. DIMM binds preferentially to certain E-boxes within first introns of specific gene isoforms. Statistical machine learning revealed that flanking regions of putative DIMM binding sites contribute to its DNA binding specificity. DIMM's transcriptional repertoire features at least 20 LDCV constituents. In addition, DIMM notably targets the pro-secretory transcription factor, creb-A, but significantly, DIMM does not target any neuropeptide genes. DIMM therefore prescribes the scale of secretory activity in NE neurons, by a systematic control of both proximal and distal points in the regulated secretory pathway. PMID:25634895

  15. An efficient hierarchical generalized linear mixed model for pathway analysis of genome-wide association studies.

    PubMed

    Wang, Lily; Jia, Peilin; Wolfinger, Russell D; Chen, Xi; Grayson, Britney L; Aune, Thomas M; Zhao, Zhongming

    2011-03-01

    In genome-wide association studies (GWAS) of complex diseases, genetic variants having real but weak associations often fail to be detected at the stringent genome-wide significance level. Pathway analysis, which tests disease association with combined association signals from a group of variants in the same pathway, has become increasingly popular. However, because of the complexities in genetic data and the large sample sizes in typical GWAS, pathway analysis remains to be challenging. We propose a new statistical model for pathway analysis of GWAS. This model includes a fixed effects component that models mean disease association for a group of genes, and a random effects component that models how each gene's association with disease varies about the gene group mean, thus belongs to the class of mixed effects models. The proposed model is computationally efficient and uses only summary statistics. In addition, it corrects for the presence of overlapping genes and linkage disequilibrium (LD). Via simulated and real GWAS data, we showed our model improved power over currently available pathway analysis methods while preserving type I error rate. Furthermore, using the WTCCC Type 1 Diabetes (T1D) dataset, we demonstrated mixed model analysis identified meaningful biological processes that agreed well with previous reports on T1D. Therefore, the proposed methodology provides an efficient statistical modeling framework for systems analysis of GWAS. The software code for mixed models analysis is freely available at http://biostat.mc.vanderbilt.edu/LilyWang.

  16. Genome-wide DNA binding pattern of the homeodomain transcription factor Sine oculis (So) in the developing eye of Drosophila melanogaster.

    PubMed

    Jusiak, Barbara; Wang, Feng; Karandikar, Umesh C; Kwak, Su-Jin; Wang, Hui; Chen, Rui; Mardon, Graeme

    2014-12-01

    The eye of the fruit fly Drosophila melanogaster provides a highly tractable genetic model system for the study of animal development, and many genes that regulate Drosophila eye formation have homologs implicated in human development and disease. Among these is the homeobox gene sine oculis (so), which encodes a homeodomain transcription factor (TF) that is both necessary for eye development and sufficient to reprogram a subset of cells outside the normal eye field toward an eye fate. We have performed a genome-wide analysis of So binding to DNA prepared from developing Drosophila eye tissue in order to identify candidate direct targets of So-mediated transcriptional regulation, as described in our recent article [1]. The data are available from NCBI Gene Expression Omnibus (GEO) with the accession number GSE52943. Here we describe the methods, data analysis, and quality control of our So ChIP-seq dataset.

  17. Genome-Wide Analysis and Characterization of Aux/IAA Family Genes in Brassica rapa

    PubMed Central

    Rameneni, Jana Jeevan; Li, Xiaonan; Sivanandhan, Ganesan; Choi, Su Ryun; Pang, Wenxing; Im, Subin; Lim, Yong Pyo

    2016-01-01

    Auxins are the key players in plant growth development involving leaf formation, phototropism, root, fruit and embryo development. Auxin/Indole-3-Acetic Acid (Aux/IAA) are early auxin response genes noted as transcriptional repressors in plant auxin signaling. However, many studies focus on Aux/ARF gene families and much less is known about the Aux/IAA gene family in Brassica rapa (B. rapa). Here we performed a comprehensive genome-wide analysis and identified 55 Aux/IAA genes in B. rapa using four conserved motifs of Aux/IAA family (PF02309). Chromosomal mapping of the B. rapa Aux/IAA (BrIAA) genes facilitated understanding cluster rearrangement of the crucifer building blocks in the genome. Phylogenetic analysis of BrIAA with Arabidopsis thaliana, Oryza sativa and Zea mays identified 51 sister pairs including 15 same species (BrIAA—BrIAA) and 36 cross species (BrIAA—AtIAA) IAA genes. Among the 55 BrIAA genes, expression of 43 and 45 genes were verified using Genebank B. rapa ESTs and in home developed microarray data from mature leaves of Chiifu and RcBr lines. Despite their huge morphological difference, tissue specific expression analysis of BrIAA genes between the parental lines Chiifu and RcBr showed that the genes followed a similar pattern of expression during leaf development and a different pattern during bud, flower and siliqua development stages. The response of the BrIAA genes to abiotic and auxin stress at different time intervals revealed their involvement in stress response. Single Nucleotide Polymorphisms between IAA genes of reference genome Chiifu and RcBr were focused and identified. Our study examines the scope of conservation and divergence of Aux/IAA genes and their structures in B. rapa. Analyzing the expression and structural variation between two parental lines will significantly contribute to functional genomics of Brassica crops and we belive our study would provide a foundation in understanding the Aux/IAA genes in B. rapa. PMID

  18. Genome-Wide Analysis and Characterization of Aux/IAA Family Genes in Brassica rapa.

    PubMed

    Paul, Parameswari; Dhandapani, Vignesh; Rameneni, Jana Jeevan; Li, Xiaonan; Sivanandhan, Ganesan; Choi, Su Ryun; Pang, Wenxing; Im, Subin; Lim, Yong Pyo

    2016-01-01

    Auxins are the key players in plant growth development involving leaf formation, phototropism, root, fruit and embryo development. Auxin/Indole-3-Acetic Acid (Aux/IAA) are early auxin response genes noted as transcriptional repressors in plant auxin signaling. However, many studies focus on Aux/ARF gene families and much less is known about the Aux/IAA gene family in Brassica rapa (B. rapa). Here we performed a comprehensive genome-wide analysis and identified 55 Aux/IAA genes in B. rapa using four conserved motifs of Aux/IAA family (PF02309). Chromosomal mapping of the B. rapa Aux/IAA (BrIAA) genes facilitated understanding cluster rearrangement of the crucifer building blocks in the genome. Phylogenetic analysis of BrIAA with Arabidopsis thaliana, Oryza sativa and Zea mays identified 51 sister pairs including 15 same species (BrIAA-BrIAA) and 36 cross species (BrIAA-AtIAA) IAA genes. Among the 55 BrIAA genes, expression of 43 and 45 genes were verified using Genebank B. rapa ESTs and in home developed microarray data from mature leaves of Chiifu and RcBr lines. Despite their huge morphological difference, tissue specific expression analysis of BrIAA genes between the parental lines Chiifu and RcBr showed that the genes followed a similar pattern of expression during leaf development and a different pattern during bud, flower and siliqua development stages. The response of the BrIAA genes to abiotic and auxin stress at different time intervals revealed their involvement in stress response. Single Nucleotide Polymorphisms between IAA genes of reference genome Chiifu and RcBr were focused and identified. Our study examines the scope of conservation and divergence of Aux/IAA genes and their structures in B. rapa. Analyzing the expression and structural variation between two parental lines will significantly contribute to functional genomics of Brassica crops and we belive our study would provide a foundation in understanding the Aux/IAA genes in B. rapa.

  19. Meta-analysis of heterogeneous Down Syndrome data reveals consistent genome-wide dosage effects related to neurological processes

    PubMed Central

    2011-01-01

    Background Down syndrome (DS; trisomy 21) is the most common genetic cause of mental retardation in the human population and key molecular networks dysregulated in DS are still unknown. Many different experimental techniques have been applied to analyse the effects of dosage imbalance at the molecular and phenotypical level, however, currently no integrative approach exists that attempts to extract the common information. Results We have performed a statistical meta-analysis from 45 heterogeneous publicly available DS data sets in order to identify consistent dosage effects from these studies. We identified 324 genes with significant genome-wide dosage effects, including well investigated genes like SOD1, APP, RUNX1 and DYRK1A as well as a large proportion of novel genes (N = 62). Furthermore, we characterized these genes using gene ontology, molecular interactions and promoter sequence analysis. In order to judge relevance of the 324 genes for more general cerebral pathologies we used independent publicly available microarry data from brain studies not related with DS and identified a subset of 79 genes with potential impact for neurocognitive processes. All results have been made available through a web server under http://ds-geneminer.molgen.mpg.de/. Conclusions Our study represents a comprehensive integrative analysis of heterogeneous data including genome-wide transcript levels in the domain of trisomy 21. The detected dosage effects build a resource for further studies of DS pathology and the development of new therapies. PMID:21569303

  20. Meta-analysis of genome-wide association studies of anxiety disorders

    PubMed Central

    Otowa, Takeshi; Hek, Karin; Lee, Minyoung; Byrne, Enda M.; Mirza, Saira S.; Nivard, Michel G.; Bigdeli, Timothy; Aggen, Steven H.; Adkins, Daniel; Wolen, Aaron; Fanous, Ayman; Keller, Matthew C.; Castelao, Enrique; Kutalik, Zoltan; Van der Auwera, Sandra; Homuth, Georg; Nauck, Matthias; Teumer, Alexander; Milaneschi, Yuri; Hottenga, Jouke-Jan; Direk, Nese; Hofman, Albert; Uitterlinden, Andre; Mulder, Cornelis L.; Henders, Anjali K.; Medland, Sarah E.; Gordon, Scott; Heath, Andrew C.; Madden, Pamela A.F.; Pergadia, Michelle; van der Most, Peter J.; Nolte, Ilja M.; van Oort, Floor V.A.; Hartman, Catharina A.; Oldehinkel, Albertine J.; Preisig, Martin; Grabe, Hans Jörgen; Middeldorp, Christel M.; Penninx, Brenda WJH; Boomsma, Dorret; Martin, Nicholas G.; Montgomery, Grant; Maher, Brion S.; van den Oord, Edwin J.; Wray, Naomi R.; Tiemeier, Henning; Hettema, John M.

    2015-01-01

    Anxiety disorders, namely generalized anxiety disorder, panic disorder, and phobias, are common, etiologically complex conditions with a partially genetic basis. Despite differing on diagnostic definitions based upon clinical presentation, anxiety disorders likely represent various expressions of an underlying common diathesis of abnormal regulation of basic threat-response systems. We conducted genome-wide association analyses in nine samples of European ancestry from seven large, independent studies. To identify genetic variants contributing to genetic susceptibility shared across interview-generated DSM-based anxiety disorders, we applied two phenotypic approaches: (1) comparisons between categorical anxiety disorder cases and super-normal controls, and (2) quantitative phenotypic factor scores derived from a multivariate analysis combining information across the clinical phenotypes. We used logistic and linear regression, respectively, to analyze the association between these phenotypes and genome-wide single nucleotide polymorphisms. Meta-analysis for each phenotype combined results across the nine samples for over 18 000 unrelated individuals. Each meta-analysis identified a different genome-wide significant region, with the following markers showing the strongest association: for case-control contrasts, rs1709393 located in an uncharacterized non-coding RNA locus on chromosomal band 3q12.3 (P=1.65×10−8); for factor scores, rs1067327 within CAMKMT encoding the calmodulin-lysine N-methyltransferase on chromosomal band 2p21 (P=2.86×10−9). Independent replication and further exploration of these findings are needed to more fully understand the role of these variants in risk and expression of anxiety disorders. PMID:26754954

  1. Power comparison of admixture mapping and direct association analysis in genome-wide association studies.

    PubMed

    Qin, Huaizhen; Zhu, Xiaofeng

    2012-04-01

    When dense markers are available, one can interrogate almost every common variant across the genome via imputation and single nucleotide polymorphism (SNP) test, which has become a routine in current genome-wide association studies (GWASs). As a complement, admixture mapping exploits the long-range linkage disequilibrium (LD) generated by admixture between genetically distinct ancestral populations. It is then questionable whether admixture mapping analysis is still necessary in detecting the disease associated variants in admixed populations. We argue that admixture mapping is able to reduce the burden of massive comparisons in GWASs; it therefore can be a powerful tool to locate the disease variants with substantial allele frequency differences between ancestral populations. In this report we studied a two-stage approach, where candidate regions are defined by conducting admixture mapping at stage 1, and single SNP association tests are followed at stage 2 within the candidate regions defined at stage 1. We first established the genome-wide significance levels corresponding to the criteria to define the candidate regions at stage 1 by simulations. We next compared the power of the two-stage approach with direct association analysis. Our simulations suggest that the two-stage approach can be more powerful than the standard genome-wide association analysis when the allele frequency difference of a causal variant in ancestral populations, is larger than 0.4. Our conclusion is consistent with a theoretical prediction by Risch and Tang ([2006] Am J Hum Genet 79:S254). Surprisingly, our study also suggests that power can be improved when we use less strict criteria to define the candidate regions at stage 1.

  2. The complex genetics of gait speed: genome-wide meta-analysis approach

    PubMed Central

    Lunetta, Kathryn L.; Smith, Jennifer A.; Eicher, John D.; Vered, Rotem; Deelen, Joris; Arnold, Alice M.; Buchman, Aron S.; Tanaka, Toshiko; Faul, Jessica D.; Nethander, Maria; Fornage, Myriam; Adams, Hieab H.; Matteini, Amy M.; Callisaya, Michele L.; Smith, Albert V.; Yu, Lei; De Jager, Philip L.; Evans, Denis A.; Gudnason, Vilmundur; Hofman, Albert; Pattie, Alison; Corley, Janie; Launer, Lenore J.; Knopman, Davis S.; Parimi, Neeta; Turner, Stephen T.; Bandinelli, Stefania; Beekman, Marian; Gutman, Danielle; Sharvit, Lital; Mooijaart, Simon P.; Liewald, David C.; Houwing-Duistermaat, Jeanine J.; Ohlsson, Claes; Moed, Matthijs; Verlinden, Vincent J.; Mellström, Dan; van der Geest, Jos N.; Karlsson, Magnus; Hernandez, Dena; McWhirter, Rebekah; Liu, Yongmei; Thomson, Russell; Tranah, Gregory J.; Uitterlinden, Andre G.; Weir, David R.; Zhao, Wei; Starr, John M.; Johnson, Andrew D.; Ikram, M. Arfan; Bennett, David A.; Cummings, Steven R.; Deary, Ian J.; Harris, Tamara B.; Kardia, Sharon L. R.; Mosley, Thomas H.; Srikanth, Velandai K.; Windham, Beverly G.; Newman, Ann B.; Walston, Jeremy D.; Davies, Gail; Evans, Daniel S.; Slagboom, Eline P.; Ferrucci, Luigi; Kiel, Douglas P.; Murabito, Joanne M.; Atzmon, Gil

    2017-01-01

    Emerging evidence suggests that the basis for variation in late-life mobility is attributable, in part, to genetic factors, which may become increasingly important with age. Our objective was to systematically assess the contribution of genetic variation to gait speed in older individuals. We conducted a meta-analysis of gait speed GWASs in 31,478 older adults from 17 cohorts of the CHARGE consortium, and validated our results in 2,588 older adults from 4 independent studies. We followed our initial discoveries with network and eQTL analysis of candidate signals in tissues. The meta-analysis resulted in a list of 536 suggestive genome wide significant SNPs in or near 69 genes. Further interrogation with Pathway Analysis placed gait speed as a polygenic complex trait in five major networks. Subsequent eQTL analysis revealed several SNPs significantly associated with the expression of PRSS16, WDSUB1 and PTPRT, which in addition to the meta-analysis and pathway suggested that genetic effects on gait speed may occur through synaptic function and neuronal development pathways. No genome-wide significant signals for gait speed were identified from this moderately large sample of older adults, suggesting that more refined physical function phenotypes will be needed to identify the genetic basis of gait speed in aging. PMID:28077804

  3. Genome-wide linkage analysis of blood pressure under locus heterogeneity

    PubMed Central

    Yang, Xinqun; Wang, Kai; Huang, Jian; Vieland, Veronica J

    2003-01-01

    We describe a method for mapping quantitative trait loci that allows for locus heterogeneity. A genome-wide linkage analysis of blood pressure was performed using sib-pair data from the Framingham Heart Study. Evidence of linkage was found on four markers (GATA89G08, GATA23D06, GATA14E09, and 049xd2) at a significance level of 0.01. Two of them (GATA14E09 and 049xd2) seem to overlap with linkage signals reported previously, while the other two are not linked to any known signals. PMID:14975146

  4. Pharmacogenetic meta-analysis of genome-wide association studies of LDL cholesterol response to statins.

    PubMed

    Postmus, Iris; Trompet, Stella; Deshmukh, Harshal A; Barnes, Michael R; Li, Xiaohui; Warren, Helen R; Chasman, Daniel I; Zhou, Kaixin; Arsenault, Benoit J; Donnelly, Louise A; Wiggins, Kerri L; Avery, Christy L; Griffin, Paula; Feng, QiPing; Taylor, Kent D; Li, Guo; Evans, Daniel S; Smith, Albert V; de Keyser, Catherine E; Johnson, Andrew D; de Craen, Anton J M; Stott, David J; Buckley, Brendan M; Ford, Ian; Westendorp, Rudi G J; Slagboom, P Eline; Sattar, Naveed; Munroe, Patricia B; Sever, Peter; Poulter, Neil; Stanton, Alice; Shields, Denis C; O'Brien, Eoin; Shaw-Hawkins, Sue; Chen, Y-D Ida; Nickerson, Deborah A; Smith, Joshua D; Dubé, Marie Pierre; Boekholdt, S Matthijs; Hovingh, G Kees; Kastelein, John J P; McKeigue, Paul M; Betteridge, John; Neil, Andrew; Durrington, Paul N; Doney, Alex; Carr, Fiona; Morris, Andrew; McCarthy, Mark I; Groop, Leif; Ahlqvist, Emma; Bis, Joshua C; Rice, Kenneth; Smith, Nicholas L; Lumley, Thomas; Whitsel, Eric A; Stürmer, Til; Boerwinkle, Eric; Ngwa, Julius S; O'Donnell, Christopher J; Vasan, Ramachandran S; Wei, Wei-Qi; Wilke, Russell A; Liu, Ching-Ti; Sun, Fangui; Guo, Xiuqing; Heckbert, Susan R; Post, Wendy; Sotoodehnia, Nona; Arnold, Alice M; Stafford, Jeanette M; Ding, Jingzhong; Herrington, David M; Kritchevsky, Stephen B; Eiriksdottir, Gudny; Launer, Leonore J; Harris, Tamara B; Chu, Audrey Y; Giulianini, Franco; MacFadyen, Jean G; Barratt, Bryan J; Nyberg, Fredrik; Stricker, Bruno H; Uitterlinden, André G; Hofman, Albert; Rivadeneira, Fernando; Emilsson, Valur; Franco, Oscar H; Ridker, Paul M; Gudnason, Vilmundur; Liu, Yongmei; Denny, Joshua C; Ballantyne, Christie M; Rotter, Jerome I; Adrienne Cupples, L; Psaty, Bruce M; Palmer, Colin N A; Tardif, Jean-Claude; Colhoun, Helen M; Hitman, Graham; Krauss, Ronald M; Wouter Jukema, J; Caulfield, Mark J

    2014-10-28

    Statins effectively lower LDL cholesterol levels in large studies and the observed interindividual response variability may be partially explained by genetic variation. Here we perform a pharmacogenetic meta-analysis of genome-wide association studies (GWAS) in studies addressing the LDL cholesterol response to statins, including up to 18,596 statin-treated subjects. We validate the most promising signals in a further 22,318 statin recipients and identify two loci, SORT1/CELSR2/PSRC1 and SLCO1B1, not previously identified in GWAS. Moreover, we confirm the previously described associations with APOE and LPA. Our findings advance the understanding of the pharmacogenetic architecture of statin response.

  5. Principal components analysis corrects for stratification in genome-wide association studies.

    PubMed

    Price, Alkes L; Patterson, Nick J; Plenge, Robert M; Weinblatt, Michael E; Shadick, Nancy A; Reich, David

    2006-08-01

    Population stratification--allele frequency differences between cases and controls due to systematic ancestry differences-can cause spurious associations in disease studies. We describe a method that enables explicit detection and correction of population stratification on a genome-wide scale. Our method uses principal components analysis to explicitly model ancestry differences between cases and controls. The resulting correction is specific to a candidate marker's variation in frequency across ancestral populations, minimizing spurious associations while maximizing power to detect true associations. Our simple, efficient approach can easily be applied to disease studies with hundreds of thousands of markers.

  6. Histone deacetylase inhibition modulates histone acetylation at gene promoter regions and affects genome-wide gene transcription in Schistosoma mansoni

    PubMed Central

    Anderson, Letícia; Gomes, Monete Rajão; daSilva, Lucas Ferreira; Pereira, Adriana da Silva Andrade; Mourão, Marina M.; Romier, Christophe; Pierce, Raymond

    2017-01-01

    Background Schistosomiasis is a parasitic disease infecting hundreds of millions of people worldwide. Treatment depends on a single drug, praziquantel, which kills the Schistosoma spp. parasite only at the adult stage. HDAC inhibitors (HDACi) such as Trichostatin A (TSA) induce parasite mortality in vitro (schistosomula and adult worms), however the downstream effects of histone hyperacetylation on the parasite are not known. Methodology/Principal findings TSA treatment of adult worms in vitro increased histone acetylation at H3K9ac and H3K14ac, which are transcription activation marks, not affecting the unrelated transcription repression mark H3K27me3. We investigated the effect of TSA HDACi on schistosomula gene expression at three different time points, finding a marked genome-wide change in the transcriptome profile. Gene transcription activity was correlated with changes on the chromatin acetylation mark at gene promoter regions. Moreover, combining expression data with ChIP-Seq public data for schistosomula, we found that differentially expressed genes having the H3K4me3 mark at their promoter region in general showed transcription activation upon HDACi treatment, compared with those without the mark, which showed transcription down-regulation. Affected genes are enriched for DNA replication processes, most of them being up-regulated. Twenty out of 22 genes encoding proteins involved in reducing reactive oxygen species accumulation were down-regulated. Dozens of genes encoding proteins with histone reader motifs were changed, including SmEED from the PRC2 complex. We targeted SmEZH2 methyltransferase PRC2 component with a new EZH2 inhibitor (GSK343) and showed a synergistic effect with TSA, significantly increasing schistosomula mortality. Conclusions/Significance Genome-wide gene expression analyses have identified important pathways and cellular functions that were affected and may explain the schistosomicidal effect of TSA HDACi. The change in expression

  7. Histone deacetylase inhibition modulates histone acetylation at gene promoter regions and affects genome-wide gene transcription in Schistosoma mansoni.

    PubMed

    Anderson, Letícia; Gomes, Monete Rajão; daSilva, Lucas Ferreira; Pereira, Adriana da Silva Andrade; Mourão, Marina M; Romier, Christophe; Pierce, Raymond; Verjovski-Almeida, Sergio

    2017-04-01

    Schistosomiasis is a parasitic disease infecting hundreds of millions of people worldwide. Treatment depends on a single drug, praziquantel, which kills the Schistosoma spp. parasite only at the adult stage. HDAC inhibitors (HDACi) such as Trichostatin A (TSA) induce parasite mortality in vitro (schistosomula and adult worms), however the downstream effects of histone hyperacetylation on the parasite are not known. TSA treatment of adult worms in vitro increased histone acetylation at H3K9ac and H3K14ac, which are transcription activation marks, not affecting the unrelated transcription repression mark H3K27me3. We investigated the effect of TSA HDACi on schistosomula gene expression at three different time points, finding a marked genome-wide change in the transcriptome profile. Gene transcription activity was correlated with changes on the chromatin acetylation mark at gene promoter regions. Moreover, combining expression data with ChIP-Seq public data for schistosomula, we found that differentially expressed genes having the H3K4me3 mark at their promoter region in general showed transcription activation upon HDACi treatment, compared with those without the mark, which showed transcription down-regulation. Affected genes are enriched for DNA replication processes, most of them being up-regulated. Twenty out of 22 genes encoding proteins involved in reducing reactive oxygen species accumulation were down-regulated. Dozens of genes encoding proteins with histone reader motifs were changed, including SmEED from the PRC2 complex. We targeted SmEZH2 methyltransferase PRC2 component with a new EZH2 inhibitor (GSK343) and showed a synergistic effect with TSA, significantly increasing schistosomula mortality. Genome-wide gene expression analyses have identified important pathways and cellular functions that were affected and may explain the schistosomicidal effect of TSA HDACi. The change in expression of dozens of histone reader genes involved in regulation of the

  8. Genome-wide Meta-analysis on the Sense of Smell Among US Older Adults.

    PubMed

    Dong, Jing; Yang, Jingyun; Tranah, Greg; Franceschini, Nora; Parimi, Neeta; Alkorta-Aranburu, Gorka; Xu, Zongli; Alonso, Alvaro; Cummings, Steven R; Fornage, Myriam; Huang, Xuemei; Kritchevsky, Stephen; Liu, Yongmei; London, Stephanie; Niu, Liang; Wilson, Robert S; De Jager, Philip L; Yu, Lei; Singleton, Andrew B; Harris, Tamara; Mosley, Thomas H; Pinto, Jayant M; Bennett, David A; Chen, Honglei

    2015-11-01

    Olfactory dysfunction is common among older adults and affects their safety, nutrition, quality of life, and mortality. More importantly, the decreased sense of smell is an early symptom of neurodegenerative diseases such as Parkinson disease (PD) and Alzheimer disease. However, the genetic determinants for the sense of smell have been poorly investigated. We here performed the first genome-wide meta-analysis on the sense of smell among 6252 US older adults of European descent from the Atherosclerosis Risk in Communities (ARIC) study, the Health, Aging, and Body Composition (Health ABC) study, and the Religious Orders Study and the Rush Memory and Aging Project (ROS/MAP). Genome-wide association study analysis was performed first by individual cohorts and then meta-analyzed using fixed-effect models with inverse variance weights. Although no SNPs reached genome-wide statistical significance, we identified 13 loci with suggestive evidence for an association with the sense of smell (Pmeta < 1 × 10). Of these, 2 SNPs at chromosome 17q21.31 (rs199443 in NSF, P = 3.02 × 10; and rs2732614 in KIAA1267-LRRC37A, P = 6.65 × 10) exhibited cis effects on the expression of microtubule-associated protein tau (MAPT, 17q21.31) in 447 frontal-cortex samples obtained postmortem and profiled by RNA-seq (P < 1 × 10). Gene-based and pathway-enrichment analyses further implicated MAPT in regulating the sense of smell in older adults. Similar results were obtained after excluding participants who reported a physician-diagnosed PD or use of PD medications. In conclusion, we provide preliminary evidence that the MAPT locus may play a role in regulating the sense of smell in older adults and therefore offer a potential genetic link between poor sense of smell and major neurodegenerative diseases.

  9. Genome-Wide Transcriptional Analysis Reveals the Protection against Hypoxia-Induced Oxidative Injury in the Intestine of Tibetans via the Inhibition of GRB2/EGFR/PTPN11 Pathways

    PubMed Central

    Gesang, Luobu; Dan, Zeng; Gusang, Lamu

    2016-01-01

    The molecular mechanisms for hypoxic environment causing the injury of intestinal mucosal barrier (IMB) are widely unknown. To address the issue, Han Chinese from 100 m altitude and Tibetans from high altitude (more than 3650 m) were recruited. Histological and transcriptome analyses were performed. The results showed intestinal villi were reduced and appeared irregular, and glandular epithelium was destroyed in the IMB of Tibetans when compared with Han Chinese. Transcriptome analysis revealed 2573 genes with altered expression. The levels of 1137 genes increased and 1436 genes decreased in Tibetans when compared with Han Chinese. Gene ontology (GO) analysis indicated most immunological responses were reduced in the IMB of Tibetans when compared with Han Chinese. Gene microarray showed that there were 25-, 22-, and 18-fold downregulation for growth factor receptor-bound protein 2 (GRB2), epidermal growth factor receptor (EGFR), and tyrosine-protein phosphatase nonreceptor type 11 (PTPN11) in the IMB of Tibetans when compared with Han Chinese. The downregulation of EGFR, GRB2, and PTPN11 will reduce the production of reactive oxygen species and protect against oxidative stress-induced injury for intestine. Thus, the transcriptome analysis showed the protecting functions of IMB patients against hypoxia-induced oxidative injury in the intestine of Tibetans via affecting GRB2/EGFR/PTPN11 pathways. PMID:27594973

  10. Genome-wide transcriptional response of Silurana (Xenopus) tropicalis to infection with the deadly chytrid fungus.

    PubMed

    Rosenblum, Erica Bree; Poorten, Thomas J; Settles, Matthew; Murdoch, Gordon K; Robert, Jacques; Maddox, Nicole; Eisen, Michael B

    2009-08-04

    Emerging infectious diseases are of great concern for both wildlife and humans. Several highly virulent fungal pathogens have recently been discovered in natural populations, highlighting the need for a better understanding of fungal-vertebrate host-pathogen interactions. Because most fungal pathogens are not fatal in the absence of other predisposing conditions, host-pathogen dynamics for deadly fungal pathogens are of particular interest. The chytrid fungus Batrachochytrium dendrobatidis (hereafter Bd) infects hundreds of species of frogs in the wild. It is found worldwide and is a significant contributor to the current global amphibian decline. However, the mechanism by which Bd causes death in amphibians, and the response of the host to Bd infection, remain largely unknown. Here we use whole-genome microarrays to monitor the transcriptional responses to Bd infection in the model frog species, Silurana (Xenopus) tropicalis, which is susceptible to chytridiomycosis. To elucidate the immune response to Bd and evaluate the physiological effects of chytridiomycosis, we measured gene expression changes in several tissues (liver, skin, spleen) following exposure to Bd. We detected a strong transcriptional response for genes involved in physiological processes that can help explain some clinical symptoms of chytridiomycosis at the organismal level. However, we detected surprisingly little evidence of an immune response to Bd exposure, suggesting that this susceptible species may not be mounting efficient innate and adaptive immune responses against Bd. The weak immune response may be partially explained by the thermal conditions of the experiment, which were optimal for Bd growth. However, many immune genes exhibited decreased expression in Bd-exposed frogs compared to control frogs, suggesting a more complex effect of Bd on the immune system than simple temperature-mediated immune suppression. This study generates important baseline data for ongoing efforts to

  11. Genome-wide DNA methylation measurements in prostate tissues uncovers novel prostate cancer diagnostic biomarkers and transcription factor binding patterns.

    PubMed

    Kirby, Marie K; Ramaker, Ryne C; Roberts, Brian S; Lasseigne, Brittany N; Gunther, David S; Burwell, Todd C; Davis, Nicholas S; Gulzar, Zulfiqar G; Absher, Devin M; Cooper, Sara J; Brooks, James D; Myers, Richard M

    2017-04-17

    Current diagnostic tools for prostate cancer lack specificity and sensitivity for detecting very early lesions. DNA methylation is a stable genomic modification that is detectable in peripheral patient fluids such as urine and blood plasma that could serve as a non-invasive diagnostic biomarker for prostate cancer. We measured genome-wide DNA methylation patterns in 73 clinically annotated fresh-frozen prostate cancers and 63 benign-adjacent prostate tissues using the Illumina Infinium HumanMethylation450 BeadChip array. We overlaid the most significantly differentially methylated sites in the genome with transcription factor binding sites measured by the Encyclopedia of DNA Elements consortium. We used logistic regression and receiver operating characteristic curves to assess the performance of candidate diagnostic models. We identified methylation patterns that have a high predictive power for distinguishing malignant prostate tissue from benign-adjacent prostate tissue, and these methylation signatures were validated using data from The Cancer Genome Atlas Project. Furthermore, by overlaying ENCODE transcription factor binding data, we observed an enrichment of enhancer of zeste homolog 2 binding in gene regulatory regions with higher DNA methylation in malignant prostate tissues. DNA methylation patterns are greatly altered in prostate cancer tissue in comparison to benign-adjacent tissue. We have discovered patterns of DNA methylation marks that can distinguish prostate cancers with high specificity and sensitivity in multiple patient tissue cohorts, and we have identified transcription factors binding in these differentially methylated regions that may play important roles in prostate cancer development.

  12. Integrated genome-wide chromatin occupancy and expression analyses identify key myeloid pro-differentiation transcription factors repressed by Myb.

    PubMed

    Zhao, Liang; Glazov, Evgeny A; Pattabiraman, Diwakar R; Al-Owaidi, Faisal; Zhang, Ping; Brown, Matthew A; Leo, Paul J; Gonda, Thomas J

    2011-06-01

    To gain insight into the mechanisms by which the Myb transcription factor controls normal hematopoiesis and particularly, how it contributes to leukemogenesis, we mapped the genome-wide occupancy of Myb by chromatin immunoprecipitation followed by massively parallel sequencing (ChIP-Seq) in ERMYB myeloid progenitor cells. By integrating the genome occupancy data with whole genome expression profiling data, we identified a Myb-regulated transcriptional program. Gene signatures for leukemia stem cells, normal hematopoietic stem/progenitor cells and myeloid development were overrepresented in 2368 Myb regulated genes. Of these, Myb bound directly near or within 793 genes. Myb directly activates some genes known critical in maintaining hematopoietic stem cells, such as Gfi1 and Cited2. Importantly, we also show that, despite being usually considered as a transactivator, Myb also functions to repress approximately half of its direct targets, including several key regulators of myeloid differentiation, such as Sfpi1 (also known as Pu.1), Runx1, Junb and Cebpb. Furthermore, our results demonstrate that interaction with p300, an established coactivator for Myb, is unexpectedly required for Myb-mediated transcriptional repression. We propose that the repression of the above mentioned key pro-differentiation factors may contribute essentially to Myb's ability to suppress differentiation and promote self-renewal, thus maintaining progenitor cells in an undifferentiated state and promoting leukemic transformation.

  13. Genome-wide mapping of transcription factor binding reveals developmental process integration and a fresh look at evolutionary dynamics.

    PubMed

    Yant, Levi

    2012-02-01

    How does evolution forge adaptive responses? Are many changes required or few? Just how complex are the transcriptional networks that control development? Diverse questions like these are being newly addressed by next-generation sequencing-based techniques. Facilitating a mechanistic understanding, these approaches reveal the direct in vivo interactions between transcription factors and their physical targets, combined with genome-scale readouts to comprehensively map adaptive gene regulatory networks (GRNs). Here I focus on pioneering work from the last 3 years that has leveraged these data to investigate diverse aspects of GRN circuitry controlling the reproductive transition in plants. These approaches have revealed surprising new functions for long-investigated key players in developmental programs and laid bare the basis for pleiotropy in many others, suggesting widespread process integration at the transcriptional level. Evolutionary questions begged by the recent deluge of GRN mapping data are being assessed anew, both by emerging work outside Arabidopsis thaliana and novel analyses within. These studies have swiftly exposed the distinctive power and adaptability of genome-wide GRN mapping and illustrate that this unique data type holds tremendous promise for plant biology.

  14. Genome-wide association analysis reveals new targets for carotenoid biofortification in maize.

    PubMed

    Suwarno, Willy B; Pixley, Kevin V; Palacios-Rojas, Natalia; Kaeppler, Shawn M; Babu, Raman

    2015-05-01

    Genome-wide association analysis in CIMMYT's association panel revealed new favorable native genomic variations in/nearby important genes such as hydroxylases and CCD1 that have potential for carotenoid biofortification in maize. Genome-wide association studies (GWAS) have been used extensively to identify allelic variation for genes controlling important agronomic and nutritional traits in plants. Provitamin A (proVA) enhancing alleles of lycopene epsilon cyclase (LCYE) and β-carotene hydroxylase 1 (CRTRB1), previously identified through candidate-gene based GWAS, are currently used in CIMMYT's maize breeding program. The objective of this study was to identify genes or genomic regions controlling variation for carotenoid concentrations in grain for CIMMYT's carotenoid association mapping panel of 380 inbred maize lines, using high-density genome-wide platforms with ~476,000 SNP markers. Population structure effects were minimized by adjustments using principal components and kinship matrix with mixed models. Genome-wide linkage disequilibrium (LD) analysis indicated faster LD decay (3.9 kb; r (2) = 0.1) than commonly reported for temperate germplasm, and therefore the possibility of achieving higher mapping resolution with our mostly tropical diversity panel. GWAS for various carotenoids identified CRTRB1, LCYE and other key genes or genomic regions that govern rate-critical steps in the upstream pathway, such as DXS1, GGPS1, and GGPS2 that are known to play important roles in the accumulation of precursor isoprenoids as well as downstream genes HYD5, CCD1, and ZEP1, which are involved in hydroxylation and carotenoid degradation. SNPs at or near all of these regions were identified and may be useful target regions for carotenoid biofortification breeding efforts in maize; for example a genomic region on chromosome 2 explained ~16% of the phenotypic variance for β-carotene independently of CRTRB1, and a variant of CCD1 that resulted in reduced

  15. Genome-Wide Analysis of DNA Methylation and Cigarette Smoking in a Chinese Population.

    PubMed

    Zhu, Xiaoyan; Li, Jun; Deng, Siyun; Yu, Kuai; Liu, Xuezhen; Deng, Qifei; Sun, Huizhen; Zhang, Xiaomin; He, Meian; Guo, Huan; Chen, Weihong; Yuan, Jing; Zhang, Bing; Kuang, Dan; He, Xiaosheng; Bai, Yansen; Han, Xu; Liu, Bing; Li, Xiaoliang; Yang, Liangle; Jiang, Haijing; Zhang, Yizhi; Hu, Jie; Cheng, Longxian; Luo, Xiaoting; Mei, Wenhua; Zhou, Zhiming; Sun, Shunchang; Zhang, Liyun; Liu, Chuanyao; Guo, Yanjun; Zhang, Zhihong; Hu, Frank B; Liang, Liming; Wu, Tangchun

    2016-07-01

    Smoking is a risk factor for many human diseases. DNA methylation has been related to smoking, but genome-wide methylation data for smoking in Chinese populations is limited. We aimed to investigate epigenome-wide methylation in relation to smoking in a Chinese population. We measured the methylation levels at > 485,000 CpG sites (CpGs) in DNA from leukocytes using a methylation array and conducted a genome-wide meta-analysis of DNA methylation and smoking in a total of 596 Chinese participants. We further evaluated the associations of smoking-related CpGs with internal polycyclic aromatic hydrocarbon (PAH) biomarkers and their correlations with the expression of corresponding genes. We identified 318 CpGs whose methylation levels were associated with smoking at a genome-wide significance level (false discovery rate < 0.05), among which 161 CpGs annotated to 123 genes were not associated with smoking in recent studies of Europeans and African Americans. Of these smoking-related CpGs, methylation levels at 80 CpGs showed significant correlations with the expression of corresponding genes (including RUNX3, IL6R, PTAFR, ANKRD11, CEP135 and CDH23), and methylation at 15 CpGs was significantly associated with urinary 2-hydroxynaphthalene, the most representative internal monohydroxy-PAH biomarker for smoking. We identified DNA methylation markers associated with smoking in a Chinese population, including some markers that were also correlated with gene expression. Exposure to naphthalene, a byproduct of tobacco smoke, may contribute to smoking-related methylation. Zhu X, Li J, Deng S, Yu K, Liu X, Deng Q, Sun H, Zhang X, He M, Guo H, Chen W, Yuan J, Zhang B, Kuang D, He X, Bai Y, Han X, Liu B, Li X, Yang L, Jiang H, Zhang Y, Hu J, Cheng L, Luo X, Mei W, Zhou Z, Sun S, Zhang L, Liu C, Guo Y, Zhang Z, Hu FB, Liang L, Wu T. 2016. Genome-wide analysis of DNA methylation and cigarette smoking in Chinese. Environ Health Perspect 124:966-973; http://dx.doi.org/10.1289/ehp

  16. Genome-wide association analysis identifies multiple loci related to resting heart rate

    PubMed Central

    Eijgelsheim, Mark; Newton-Cheh, Christopher; Sotoodehnia, Nona; de Bakker, Paul I.W.; Müller, Martina; Morrison, Alanna C.; Smith, Albert V.; Isaacs, Aaron; Sanna, Serena; Dörr, Marcus; Navarro, Pau; Fuchsberger, Christian; Nolte, Ilja M.; de Geus, Eco J.C.; Estrada, Karol; Hwang, Shih-Jen; Bis, Joshua C.; Rückert, Ina-Maria; Alonso, Alvaro; Launer, Lenore J.; Hottenga, Jouke Jan; Rivadeneira, Fernando; Noseworthy, Peter A.; Rice, Kenneth M.; Perz, Siegfried; Arking, Dan E.; Spector, Tim D.; Kors, Jan A.; Aulchenko, Yurii S.; Tarasov, Kirill V.; Homuth, Georg; Wild, Sarah H.; Marroni, Fabio; Gieger, Christian; Licht, Carmilla M.; Prineas, Ronald J.; Hofman, Albert; Rotter, Jerome I.; Hicks, Andrew A.; Ernst, Florian; Najjar, Samer S.; Wright, Alan F.; Peters, Annette; Fox, Ervin R.; Oostra, Ben A.; Kroemer, Heyo K.; Couper, David; Völzke, Henry; Campbell, Harry; Meitinger, Thomas; Uda, Manuela; Witteman, Jacqueline C.M.; Psaty, Bruce M.; Wichmann, H-Erich; Harris, Tamara B.; Kääb, Stefan; Siscovick, David S.; Jamshidi, Yalda; Uitterlinden, André G.; Folsom, Aaron R.; Larson, Martin G.; Wilson, James F.; Penninx, Brenda W.; Snieder, Harold; Pramstaller, Peter P.; van Duijn, Cornelia M.; Lakatta, Edward G.; Felix, Stephan B.; Gudnason, Vilmundur; Pfeufer, Arne; Heckbert, Susan R.; Stricker, Bruno H.Ch.; Boerwinkle, Eric; O'Donnell, Christopher J.

    2010-01-01

    Higher resting heart rate is associated with increased cardiovascular disease and mortality risk. Though heritable factors play a substantial role in population variation, little is known about specific genetic determinants. This knowledge can impact clinical care by identifying novel factors that influence pathologic heart rate states, modulate heart rate through cardiac structure and function or by improving our understanding of the physiology of heart rate regulation. To identify common genetic variants associated with heart rate, we performed a meta-analysis of 15 genome-wide association studies (GWAS), including 38 991 subjects of European ancestry, estimating the association between age-, sex- and body mass-adjusted RR interval (inverse heart rate) and ∼2.5 million markers. Results with P < 5 × 10−8 were considered genome-wide significant. We constructed regression models with multiple markers to assess whether results at less stringent thresholds were likely to be truly associated with RR interval. We identified six novel associations with resting heart rate at six loci: 6q22 near GJA1; 14q12 near MYH7; 12p12 near SOX5, c12orf67, BCAT1, LRMP and CASC1; 6q22 near SLC35F1, PLN and c6orf204; 7q22 near SLC12A9 and UfSp1; and 11q12 near FADS1. Associations at 6q22 400 kb away from GJA1, at 14q12 MYH6 and at 1q32 near CD34 identified in previously published GWAS were confirmed. In aggregate, these variants explain ∼0.7% of RR interval variance. A multivariant regression model including 20 variants with P < 10−5 increased the explained variance to 1.6%, suggesting that some loci falling short of genome-wide significance are likely truly associated. Future research is warranted to elucidate underlying mechanisms that may impact clinical care. PMID:20639392

  17. Genome-wide association analysis of red blood cell traits in African Americans: the COGENT Network

    PubMed Central

    Chen, Zhao; Tang, Hua; Qayyum, Rehan; Schick, Ursula M.; Nalls, Michael A.; Handsaker, Robert; Li, Jin; Lu, Yingchang; Yanek, Lisa R.; Keating, Brendan; Meng, Yan; van Rooij, Frank J.A.; Okada, Yukinori; Kubo, Michiaki; Rasmussen-Torvik, Laura; Keller, Margaux F.; Lange, Leslie; Evans, Michele; Bottinger, Erwin P.; Linderman, Michael D.; Ruderfer, Douglas M.; Hakonarson, Hakon; Papanicolaou, George; Zonderman, Alan B.; Gottesman, Omri; Thomson, Cynthia; Ziv, Elad; Singleton, Andrew B.; Loos, Ruth J.F.; Sleiman, Patrick M.A.; Ganesh, Santhi; McCarroll, Steven; Becker, Diane M.; Wilson, James G.; Lettre, Guillaume; Reiner, Alexander P.

    2013-01-01

    Laboratory red blood cell (RBC) measurements are clinically important, heritable and differ among ethnic groups. To identify genetic variants that contribute to RBC phenotypes in African Americans (AAs), we conducted a genome-wide association study in up to ∼16 500 AAs. The alpha-globin locus on chromosome 16pter [lead SNP rs13335629 in ITFG3 gene; P < 1E−13 for hemoglobin (Hgb), RBC count, mean corpuscular volume (MCV), MCH and MCHC] and the G6PD locus on Xq28 [lead SNP rs1050828; P < 1E − 13 for Hgb, hematocrit (Hct), MCV, RBC count and red cell distribution width (RDW)] were each associated with multiple RBC traits. At the alpha-globin region, both the common African 3.7 kb deletion and common single nucleotide polymorphisms (SNPs) appear to contribute independently to RBC phenotypes among AAs. In the 2p21 region, we identified a novel variant of PRKCE distinctly associated with Hct in AAs. In a genome-wide admixture mapping scan, local European ancestry at the 6p22 region containing HFE and LRRC16A was associated with higher Hgb. LRRC16A has been previously associated with the platelet count and mean platelet volume in AAs, but not with Hgb. Finally, we extended to AAs the findings of association of erythrocyte traits with several loci previously reported in Europeans and/or Asians, including CD164 and HBS1L-MYB. In summary, this large-scale genome-wide analysis in AAs has extended the importance of several RBC-associated genetic loci to AAs and identified allelic heterogeneity and pleiotropy at several previously known genetic loci associated with blood cell traits in AAs. PMID:23446634

  18. Genome-wide DNA methylation analysis of pseudohypoparathyroidism patients with GNAS imprinting defects.

    PubMed

    Rochtus, Anne; Martin-Trujillo, Alejandro; Izzi, Benedetta; Elli, Francesca; Garin, Intza; Linglart, Agnes; Mantovani, Giovanna; Perez de Nanclares, Guiomar; Thiele, Suzanne; Decallonne, Brigitte; Van Geet, Chris; Monk, David; Freson, Kathleen

    2016-01-01

    Pseudohypoparathyroidism (PHP) is caused by (epi)genetic defects in the imprinted GNAS cluster. Current classification of PHP patients is hampered by clinical and molecular diagnostic overlaps. The European Consortium for the study of PHP designed a genome-wide methylation study to improve molecular diagnosis. The HumanMethylation 450K BeadChip was used to analyze genome-wide methylation in 24 PHP patients with parathyroid hormone resistance and 20 age- and gender-matched controls. Patients were previously diagnosed with GNAS-specific differentially methylated regions (DMRs) and include 6 patients with known STX16 deletion (PHP(Δstx16)) and 18 without deletion (PHP(neg)). The array demonstrated that PHP patients do not show DNA methylation differences at the whole-genome level. Unsupervised clustering of GNAS-specific DMRs divides PHP(Δstx16) versus PHP(neg) patients. Interestingly, in contrast to the notion that all PHP patients share methylation defects in the A/B DMR while only PHP(Δstx16) patients have normal NESP, GNAS-AS1 and XL methylation, we found a novel DMR (named GNAS-AS2) in the GNAS-AS1 region that is significantly different in both PHP(Δstx16) and PHP(neg), as validated by Sequenom EpiTYPER in a larger PHP cohort. The analysis of 58 DMRs revealed that 8/18 PHP(neg) and 1/6 PHP(Δstx16) patients have multi-locus methylation defects. Validation was performed for FANCC and SVOPL DMRs. This is the first genome-wide methylation study for PHP patients that confirmed that GNAS is the most significant DMR, and the presence of STX16 deletion divides PHP patients in two groups. Moreover, a novel GNAS-AS2 DMR affects all PHP patients, and PHP patients seem sensitive to multi-locus methylation defects.

  19. A genome-wide analysis of gene-caffeine consumption interaction on basal cell carcinoma.

    PubMed

    Li, Xin; Cornelis, Marilyn C; Liang, Liming; Song, Fengju; De Vivo, Immaculata; Giovannucci, Edward; Tang, Jean Y; Han, Jiali

    2016-12-01

    Animal models have suggested that oral or topical administration of caffeine could inhibit ultraviolet-induced carcinogenesis via the ataxia telangiectasia and rad3 (ATR)-related apoptosis. Previous epidemiological studies have demonstrated that increased caffeine consumption is associated with reduced risk of basal cell carcinoma (BCC). To identify common genetic markers that may modify this association, we tested gene-caffeine intake interaction on BCC risk in a genome-wide analysis. We included 3383 BCC cases and 8528 controls of European ancestry from the Nurses' Health Study and Health Professionals Follow-up Study. Single nucleotide polymorphism (SNP) rs142310826 near the NEIL3 gene showed a genome-wide significant interaction with caffeine consumption (P = 1.78 × 10(-8) for interaction) on BCC risk. There was no gender difference for this interaction (P = 0.64 for heterogeneity). NEIL3, a gene belonging to the base excision DNA repair pathway, encodes a DNA glycosylase that recognizes and removes lesions produced by oxidative stress. In addition, we identified several loci with P value for interaction <5 × 10(-7) in gender-specific analyses (P for heterogeneity between genders < 0.001) including those mapping to the genes LRRTM4, ATF3 and DCLRE1C in women and POTEA in men. Finally, we tested the associations between caffeine consumption-related SNPs reported by previous genome-wide association studies and risk of BCC, both individually and jointly, but found no significant association. In sum, we identified a DNA repair gene that could be involved in caffeine-mediated skin tumor inhibition. Further studies are warranted to confirm these findings. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  20. Using genome-wide complex trait analysis to quantify ‘missing heritability’ in Parkinson's disease

    PubMed Central

    Keller, Margaux F.; Saad, Mohamad; Bras, Jose; Bettella, Francesco; Nicolaou, Nayia; Simón-Sánchez, Javier; Mittag, Florian; Büchel, Finja; Sharma, Manu; Gibbs, J. Raphael; Schulte, Claudia; Moskvina, Valentina; Durr, Alexandra; Holmans, Peter; Kilarski, Laura L.; Guerreiro, Rita; Hernandez, Dena G.; Brice, Alexis; Ylikotila, Pauli; Stefánsson, Hreinn; Majamaa, Kari; Morris, Huw R.; Williams, Nigel; Gasser, Thomas; Heutink, Peter; Wood, Nicholas W.; Hardy, John; Martinez, Maria; Singleton, Andrew B.; Nalls, Michael A.

    2012-01-01

    Genome-wide association studies (GWASs) have been successful at identifying single-nucleotide polymorphisms (SNPs) highly associated with common traits; however, a great deal of the heritable variation associated with common traits remains unaccounted for within the genome. Genome-wide complex trait analysis (GCTA) is a statistical method that applies a linear mixed model to estimate phenotypic variance of complex traits explained by genome-wide SNPs, including those not associated with the trait in a GWAS. We applied GCTA to 8 cohorts containing 7096 case and 19 455 control individuals of European ancestry in order to examine the missing heritability present in Parkinson's disease (PD). We meta-analyzed our initial results to produce robust heritability estimates for PD types across cohorts. Our results identify 27% (95% CI 17–38, P = 8.08E − 08) phenotypic variance associated with all types of PD, 15% (95% CI −0.2 to 33, P = 0.09) phenotypic variance associated with early-onset PD and 31% (95% CI 17–44, P = 1.34E − 05) phenotypic variance associated with late-onset PD. This is a substantial increase from the genetic variance identified by top GWAS hits alone (between 3 and 5%) and indicates there are substantially more risk loci to be identified. Our results suggest that although GWASs are a useful tool in identifying the most common variants associated with complex disease, a great deal of common variants of small effect remain to be discovered. PMID:22892372

  1. Genome-wide analysis of zygotic linkage disequilibrium and its components in crossbred cattle

    PubMed Central

    2012-01-01

    Background Linkage disequilibrium (LD) between genes at linked or independent loci can occur at gametic and zygotic levels known asgametic LD and zygotic LD, respectively. Gametic LD is well known for its roles in fine-scale mapping of quantitative trait loci, genomic selection and evolutionary inference. The less-well studied is the zygotic LD and its components that can be also estimated directly from the unphased SNPs. Results This study was set up to investigate the genome-wide extent and patterns of zygotic LD and its components in a crossbred cattle population using the genomic data from the Illumina BovineSNP50 beadchip. The animal population arose from repeated crossbreeding of multiple breeds and selection for growth and cow reproduction. The study showed that similar genomic structures in gametic and zygotic LD were observed, with zygotic LD decaying faster than gametic LD over marker distance. The trigenic and quadrigenic disequilibria were generally two- to three-fold smaller than the usual digenic disequilibria (gametic or composite LD). There was less power of testing for these high-order genic disequilibria than for the digenic disequilibria. The power estimates decreased with the marker distance between markers though the decay trend is more obvious for the digenic disequilibria than for high-order disequilibria. Conclusions This study is the first major genome-wide survey of all non-allelic associations between pairs of SNPs in a cattle population. Such analysis allows us to assess the relative importance of gametic LD vs. all other non-allelic genic LDs regardless of whether or not the population is in HWE. The observed predominance of digenic LD (gametic or composite LD) coupled with insignificant high-order trigenic and quadrigenic disequilibria supports the current intensive focus on the use of high-density SNP markers for genome-wide association studies and genomic selection activities in the cattle population. PMID:22827586

  2. A genome-wide analysis of putative functional and exonic variation associated with extremely high intelligence.

    PubMed

    Spain, S L; Pedroso, I; Kadeva, N; Miller, M B; Iacono, W G; McGue, M; Stergiakouli, E; Smith, G D; Putallaz, M; Lubinski, D; Meaburn, E L; Plomin, R; Simpson, M A

    2016-08-01

    Although individual differences in intelligence (general cognitive ability) are highly heritable, molecular genetic analyses to date have had limited success in identifying specific loci responsible for its heritability. This study is the first to investigate exome variation in individuals of extremely high intelligence. Under the quantitative genetic model, sampling from the high extreme of the distribution should provide increased power to detect associations. We therefore performed a case-control association analysis with 1409 individuals drawn from the top 0.0003 (IQ >170) of the population distribution of intelligence and 3253 unselected population-based controls. Our analysis focused on putative functional exonic variants assayed on the Illumina HumanExome BeadChip. We did not observe any individual protein-altering variants that are reproducibly associated with extremely high intelligence and within the entire distribution of intelligence. Moreover, no significant associations were found for multiple rare alleles within individual genes. However, analyses using genome-wide similarity between unrelated individuals (genome-wide complex trait analysis) indicate that the genotyped functional protein-altering variation yields a heritability estimate of 17.4% (s.e. 1.7%) based on a liability model. In addition, investigation of nominally significant associations revealed fewer rare alleles associated with extremely high intelligence than would be expected under the null hypothesis. This observation is consistent with the hypothesis that rare functional alleles are more frequently detrimental than beneficial to intelligence.

  3. Genome-wide genetic diversity, population structure and admixture analysis in African and Asian cattle breeds.

    PubMed

    Edea, Z; Bhuiyan, M S A; Dessie, T; Rothschild, M F; Dadi, H; Kim, K S

    2015-02-01

    Knowledge about genetic diversity and population structure is useful for designing effective strategies to improve the production, management and conservation of farm animal genetic resources. Here, we present a comprehensive genome-wide analysis of genetic diversity, population structure and admixture based on 244 animals sampled from 10 cattle populations in Asia and Africa and genotyped for 69,903 autosomal single-nucleotide polymorphisms (SNPs) mainly derived from the indicine breed. Principal component analysis, STRUCTURE and distance analysis from high-density SNP data clearly revealed that the largest genetic difference occurred between the two domestic lineages (taurine and indicine), whereas Ethiopian cattle populations represent a mosaic of the humped zebu and taurine. Estimation of the genetic influence of zebu and taurine revealed that Ethiopian cattle were characterized by considerable levels of introgression from South Asian zebu, whereas Bangladeshi populations shared very low taurine ancestry. The relationships among Ethiopian cattle populations reflect their history of origin and admixture rather than phenotype-based distinctions. The high within-individual genetic variability observed in Ethiopian cattle represents an untapped opportunity for adaptation to changing environments and for implementation of within-breed genetic improvement schemes. Our results provide a basis for future applications of genome-wide SNP data to exploit the unique genetic makeup of indigenous cattle breeds and to facilitate their improvement and conservation.

  4. [Analysis of population stratification using random SNPs in genome-wide association studies].

    PubMed

    Cao, Zong-Fu; Ma, Chuan-Xiang; Wang, Lei; Cai, Bin

    2010-09-01

    Since population genetic STRUCTURE can increase false-positive rate in genome-wide association studies (GWAS) for complex diseases, the effect of population stratification should be taken into account in GWAS. However, the effect of randomly selected SNPs in population stratification analysis is underdetermined. In this study, based on the genotype data generated on Genome-Wide Human SNP Array 6.0 from unrelated individuals of HapMap Phase2, we randomly selected SNPs that were evenly distributed across the whole-genome, and acquired Ancestry Informative Markers (AIMs) by the method of f value and allelic Fisher exact test. F-statistics and STRUCTURE analysis based on the select different sets of SNPs were used to evaluate the effect of distinguishing the populations from HapMap Phase3. We found that randomly selected SNPs that were evenly distributed across the whole-genome were able to be used to identify the population structure. This study further indicated that more than 3 000 randomly selected SNPs that were evenly distributed across the whole-genome were substituted for AIMs in population stratification analysis, when there were no available AIMs for spe-cific populations.

  5. Meta-analysis of genome-wide linkage studies across autoimmune diseases

    PubMed Central

    Forabosco, Paola; Bouzigon, Emmanuelle; Ng, Mandy Y; Hermanowski, Jane; Fisher, Sheila A; Criswell, Lindsey A; Lewis, Cathryn M

    2009-01-01

    Autoimmune diseases are chronic disorders initiated by a loss of immunologic tolerance to self-antigens. They cluster within families, and patients may be diagnosed with more than one disease, suggesting pleiotropic genes are involved in the aetiology of different diseases. To identify potential loci, which confer susceptibility to autoimmunity independent of disease phenotype, we pooled results from genome-wide linkage studies, using the genome scan meta-analysis method (GSMA). The meta-analysis included 42 independent studies for 11 autoimmune diseases, using 7350 families with 18 291 affected individuals. In addition to the HLA region, which showed highly significant genome-wide evidence for linkage, we obtained suggestive evidence for linkage on chromosome 16, with peak evidence at 10.0–19.8 Mb. This region may harbour a pleiotropic gene (or genes) conferring risk for several diseases, although no such gene has been identified through association studies. We did not identify evidence for linkage at several genes known to confer increased risk to different autoimmune diseases (PTPN22, CTLA4), even in subgroups of diseases consistently found to be associated with these genes. The relative risks conferred by variants in these genes are modest (<1.5 in most cases), and even a large study like this meta-analysis lacks power to detect linkage. This study illustrates the concept that linkage and association studies have power to identify very different types of disease-predisposing variants. PMID:18781189

  6. dSLAM analysis of genome-wide genetic interactions in Saccharomyces cerevisiae

    PubMed Central

    Pan, Xuewen; Yuan, Daniel S.; Ooi, Siew-Loon; Wang, Xiaoling; Sookhai-Mahadeo, Sharon; Meluh, Pamela; Boeke, Jef D.

    2007-01-01

    Analysis of genetic interactions has been extensively exploited to study gene functions and to dissect pathway structures. One such genetic interaction is synthetic lethality, in which the combination of two non-lethal mutations leads to loss of organism viability. We have developed a dSLAM (heterozygote diploid-based synthetic lethality analysis with microarrays) technology that effectively studies synthetic lethality interactions on a genome-wide scale in the budding yeast Saccharomyces cerevisiae. Typically, a query mutation is introduced en masse into a population of ~6,000 haploid-convertible heterozygote diploid Yeast Knockout (YKO) mutants via integrative transformation. Haploid pools of single and double mutants are freshly generated from the resultant heterozygote diploid double mutant pool after meiosis and haploid selection and studied for potential growth defects of each double mutant combination by microarray analysis of the “molecular barcodes” representing each YKO. This technology has been effectively adapted to study other types of genome-wide genetic interactions including gene-compound synthetic lethality, secondary mutation suppression, dosage-dependent synthetic lethality and suppression. PMID:17189863

  7. Genome-wide association analysis demonstrates the highly polygenic character of age-related hearing impairment

    PubMed Central

    Fransen, Erik; Bonneux, Sarah; Corneveaux, Jason J; Schrauwen, Isabelle; Di Berardino, Federica; White, Cory H; Ohmen, Jeffrey D; Van de Heyning, Paul; Ambrosetti, Umberto; Huentelman, Matthew J; Van Camp, Guy; Friedman, Rick A

    2015-01-01

    We performed a genome-wide association study (GWAS) to identify the genes responsible for age-related hearing impairment (ARHI), the most common form of hearing impairment in the elderly. Analysis of common variants, with and without adjustment for stratification and environmental covariates, rare variants and interactions, as well as gene-set enrichment analysis, showed no variants with genome-wide significance. No evidence for replication of any previously reported genes was found. A study of the genetic architecture indicates for the first time that ARHI is highly polygenic in nature, with probably no major genes involved. The phenotype depends on the aggregated effect of a large number of SNPs, of which the individual effects are undetectable in a modestly powered GWAS. We estimated that 22% of the variance in our data set can be explained by the collective effect of all genotyped SNPs. A score analysis showed a modest enrichment in causative SNPs among the SNPs with a P-value below 0.01. PMID:24939585