Gazestani, Vahid H; Salavati, Reza
2015-01-01
Trypanosoma brucei is a vector-borne parasite with intricate life cycle that can cause serious diseases in humans and animals. This pathogen relies on fine regulation of gene expression to respond and adapt to variable environments, with implications in transmission and infectivity. However, the involved regulatory elements and their mechanisms of actions are largely unknown. Here, benefiting from a new graph-based approach for finding functional regulatory elements in RNA (GRAFFER), we have predicted 88 new RNA regulatory elements that are potentially involved in the gene regulatory network of T. brucei. We show that many of these newly predicted elements are responsive to both transcriptomic and proteomic changes during the life cycle of the parasite. Moreover, we found that 11 of predicted elements strikingly resemble previously identified regulatory elements for the parasite. Additionally, comparison with previously predicted motifs on T. brucei suggested the superior performance of our approach based on the current limited knowledge of regulatory elements in T. brucei.
2011-01-01
Background Phytohormones organize plant development and environmental adaptation through cell-to-cell signal transduction, and their action involves transcriptional activation. Recent international efforts to establish and maintain public databases of Arabidopsis microarray data have enabled the utilization of this data in the analysis of various phytohormone responses, providing genome-wide identification of promoters targeted by phytohormones. Results We utilized such microarray data for prediction of cis-regulatory elements with an octamer-based approach. Our test prediction of a drought-responsive RD29A promoter with the aid of microarray data for response to drought, ABA and overexpression of DREB1A, a key regulator of cold and drought response, provided reasonable results that fit with the experimentally identified regulatory elements. With this succession, we expanded the prediction to various phytohormone responses, including those for abscisic acid, auxin, cytokinin, ethylene, brassinosteroid, jasmonic acid, and salicylic acid, as well as for hydrogen peroxide, drought and DREB1A overexpression. Totally 622 promoters that are activated by phytohormones were subjected to the prediction. In addition, we have assigned putative functions to 53 octamers of the Regulatory Element Group (REG) that have been extracted as position-dependent cis-regulatory elements with the aid of their feature of preferential appearance in the promoter region. Conclusions Our prediction of Arabidopsis cis-regulatory elements for phytohormone responses provides guidance for experimental analysis of promoters to reveal the basis of the transcriptional network of phytohormone responses. PMID:21349196
Scanning sequences after Gibbs sampling to find multiple occurrences of functional elements
Tharakaraman, Kannan; Mariño-Ramírez, Leonardo; Sheetlin, Sergey L; Landsman, David; Spouge, John L
2006-01-01
Background Many DNA regulatory elements occur as multiple instances within a target promoter. Gibbs sampling programs for finding DNA regulatory elements de novo can be prohibitively slow in locating all instances of such an element in a sequence set. Results We describe an improvement to the A-GLAM computer program, which predicts regulatory elements within DNA sequences with Gibbs sampling. The improvement adds an optional "scanning step" after Gibbs sampling. Gibbs sampling produces a position specific scoring matrix (PSSM). The new scanning step resembles an iterative PSI-BLAST search based on the PSSM. First, it assigns an "individual score" to each subsequence of appropriate length within the input sequences using the initial PSSM. Second, it computes an E-value from each individual score, to assess the agreement between the corresponding subsequence and the PSSM. Third, it permits subsequences with E-values falling below a threshold to contribute to the underlying PSSM, which is then updated using the Bayesian calculus. A-GLAM iterates its scanning step to convergence, at which point no new subsequences contribute to the PSSM. After convergence, A-GLAM reports predicted regulatory elements within each sequence in order of increasing E-values, so users have a statistical evaluation of the predicted elements in a convenient presentation. Thus, although the Gibbs sampling step in A-GLAM finds at most one regulatory element per input sequence, the scanning step can now rapidly locate further instances of the element in each sequence. Conclusion Datasets from experiments determining the binding sites of transcription factors were used to evaluate the improvement to A-GLAM. Typically, the datasets included several sequences containing multiple instances of a regulatory motif. The improvements to A-GLAM permitted it to predict the multiple instances. PMID:16961919
Yao, Shi; Guo, Yan; Dong, Shan-Shan; Hao, Ruo-Han; Chen, Xiao-Feng; Chen, Yi-Xiao; Chen, Jia-Bin; Tian, Qing; Deng, Hong-Wen; Yang, Tie-Lin
2017-08-01
Despite genome-wide association studies (GWASs) have identified many susceptibility genes for osteoporosis, it still leaves a large part of missing heritability to be discovered. Integrating regulatory information and GWASs could offer new insights into the biological link between the susceptibility SNPs and osteoporosis. We generated five machine learning classifiers with osteoporosis-associated variants and regulatory features data. We gained the optimal classifier and predicted genome-wide SNPs to discover susceptibility regulatory variants. We further utilized Genetic Factors for Osteoporosis Consortium (GEFOS) and three in-house GWASs samples to validate the associations for predicted positive SNPs. The random forest classifier performed best among all machine learning methods with the F1 score of 0.8871. Using the optimized model, we predicted 37,584 candidate SNPs for osteoporosis. According to the meta-analysis results, a list of regulatory variants was significantly associated with osteoporosis after multiple testing corrections and contributed to the expression of known osteoporosis-associated protein-coding genes. In summary, combining GWASs and regulatory elements through machine learning could provide additional information for understanding the mechanism of osteoporosis. The regulatory variants we predicted will provide novel targets for etiology research and treatment of osteoporosis.
Global reorganisation of cis-regulatory units upon lineage commitment of human embryonic stem cells
Freire-Pritchett, Paula; Schoenfelder, Stefan; Várnai, Csilla; Wingett, Steven W; Cairns, Jonathan; Collier, Amanda J; García-Vílchez, Raquel; Furlan-Magaril, Mayra; Osborne, Cameron S; Fraser, Peter; Rugg-Gunn, Peter J; Spivakov, Mikhail
2017-01-01
Long-range cis-regulatory elements such as enhancers coordinate cell-specific transcriptional programmes by engaging in DNA looping interactions with target promoters. Deciphering the interplay between the promoter connectivity and activity of cis-regulatory elements during lineage commitment is crucial for understanding developmental transcriptional control. Here, we use Promoter Capture Hi-C to generate a high-resolution atlas of chromosomal interactions involving ~22,000 gene promoters in human pluripotent and lineage-committed cells, identifying putative target genes for known and predicted enhancer elements. We reveal extensive dynamics of cis-regulatory contacts upon lineage commitment, including the acquisition and loss of promoter interactions. This spatial rewiring occurs preferentially with predicted changes in the activity of cis-regulatory elements and is associated with changes in target gene expression. Our results provide a global and integrated view of promoter interactome dynamics during lineage commitment of human pluripotent cells. DOI: http://dx.doi.org/10.7554/eLife.21926.001 PMID:28332981
Exploring the read-write genome: mobile DNA and mammalian adaptation.
Shapiro, James A
2017-02-01
The read-write genome idea predicts that mobile DNA elements will act in evolution to generate adaptive changes in organismal DNA. This prediction was examined in the context of mammalian adaptations involving regulatory non-coding RNAs, viviparous reproduction, early embryonic and stem cell development, the nervous system, and innate immunity. The evidence shows that mobile elements have played specific and sometimes major roles in mammalian adaptive evolution by generating regulatory sites in the DNA and providing interaction motifs in non-coding RNA. Endogenous retroviruses and retrotransposons have been the predominant mobile elements in mammalian adaptive evolution, with the notable exception of bats, where DNA transposons are the major agents of RW genome inscriptions. A few examples of independent but convergent exaptation of mobile DNA elements for similar regulatory rewiring functions are noted.
A cis-regulatory logic simulator.
Zeigler, Robert D; Gertz, Jason; Cohen, Barak A
2007-07-27
A major goal of computational studies of gene regulation is to accurately predict the expression of genes based on the cis-regulatory content of their promoters. The development of computational methods to decode the interactions among cis-regulatory elements has been slow, in part, because it is difficult to know, without extensive experimental validation, whether a particular method identifies the correct cis-regulatory interactions that underlie a given set of expression data. There is an urgent need for test expression data in which the interactions among cis-regulatory sites that produce the data are known. The ability to rapidly generate such data sets would facilitate the development and comparison of computational methods that predict gene expression patterns from promoter sequence. We developed a gene expression simulator which generates expression data using user-defined interactions between cis-regulatory sites. The simulator can incorporate additive, cooperative, competitive, and synergistic interactions between regulatory elements. Constraints on the spacing, distance, and orientation of regulatory elements and their interactions may also be defined and Gaussian noise can be added to the expression values. The simulator allows for a data transformation that simulates the sigmoid shape of expression levels from real promoters. We found good agreement between sets of simulated promoters and predicted regulatory modules from real expression data. We present several data sets that may be useful for testing new methodologies for predicting gene expression from promoter sequence. We developed a flexible gene expression simulator that rapidly generates large numbers of simulated promoters and their corresponding transcriptional output based on specified interactions between cis-regulatory sites. When appropriate rule sets are used, the data generated by our simulator faithfully reproduces experimentally derived data sets. We anticipate that using simulated gene expression data sets will facilitate the direct comparison of computational strategies to predict gene expression from promoter sequence. The source code is available online and as additional material. The test sets are available as additional material.
Visootsat, Akasit; Payungporn, Sunchai; T-Thienprasert, Nattanan P
2015-12-01
Hepatitis B virus (HBV) infection is a primary cause of hepatocellular carcinoma and liver cirrhosis worldwide. To develop novel antiviral drugs, a better understanding of HBV gene expression regulation is vital. One important aspect is to understand how HBV hijacks the cellular machinery to export unspliced RNA from the nucleus. The HBV post-transcriptional regulatory element (HBV PRE) has been proposed to be the HBV RNA nuclear export element. However, the function remains controversial, and the core element is unclear. This study, therefore, aimed to identify functional regulatory elements within the HBV PRE and investigate their functions. Using bioinformatics programs based on sequence conservation and conserved RNA secondary structures, three regulatory elements were predicted, namely PRE 1151-1410, PRE 1520-1620 and PRE 1650-1684. PRE 1151-1410 significantly increased intronless and unspliced luciferase activity in both HepG2 and COS-7 cells. Likewise, PRE 1151-1410 significantly elevated intronless and unspliced HBV surface transcripts in liver cancer cells. Moreover, motif analysis predicted that PRE 1151-1410 contains several regulatory motifs. This study reported the roles of PRE 1151-1410 in intronless transcript nuclear export and the splicing mechanism. Additionally, these results provide knowledge in the field of HBV RNA regulation. Moreover, PRE 1151-1410 may be used to enhance the expression of other mRNAs in intronless reporter plasmids.
Ibraheem, Omodele; Botha, Christiaan E J; Bradley, Graeme
2010-12-01
The regulation of gene expression involves a multifarious regulatory system. Each gene contains a unique combination of cis-acting regulatory sequence elements in the 5' regulatory region that determines its temporal and spatial expression. Cis-acting regulatory elements are essential transcriptional gene regulatory units; they control many biological processes and stress responses. Thus a full understanding of the transcriptional gene regulation system will depend on successful functional analyses of cis-acting elements. Cis-acting regulatory elements present within the 5' regulatory region of the sucrose transporter gene families in rice (Oryza sativa Japonica cultivar-group) and Arabidopsis thaliana, were identified using a bioinformatics approach. The possible cis-acting regulatory elements were predicted by scanning 1.5kbp of 5' regulatory regions of the sucrose transporter genes translational start sites, using Plant CARE, PLACE and Genomatix Matinspector professional databases. Several cis-acting regulatory elements that are associated with plant development, plant hormonal regulation and stress response were identified, and were present in varying frequencies within the 1.5kbp of 5' regulatory region, among which are; A-box, RY, CAT, Pyrimidine-box, Sucrose-box, ABRE, ARF, ERE, GARE, Me-JA, ARE, DRE, GA-motif, GATA, GT-1, MYC, MYB, W-box, and I-box. This result reveals the probable cis-acting regulatory elements that possibly are involved in the expression and regulation of sucrose transporter gene families in rice and Arabidopsis thaliana during cellular development or environmental stress conditions. Copyright © 2010 Elsevier Ltd. All rights reserved.
Di Giacomo, Daniela; Gaildrat, Pascaline; Abuli, Anna; Abdat, Julie; Frébourg, Thierry; Tosi, Mario; Martins, Alexandra
2013-11-01
Exonic variants can alter pre-mRNA splicing either by changing splice sites or by modifying splicing regulatory elements. Often these effects are difficult to predict and are only detected by performing RNA analyses. Here, we analyzed, in a minigene assay, 26 variants identified in the exon 7 of BRCA2, a cancer predisposition gene. Our results revealed eight new exon skipping mutations in this exon: one directly altering the 5' splice site and seven affecting potential regulatory elements. This brings the number of splicing regulatory mutations detected in BRCA2 exon 7 to a total of 11, a remarkably high number considering the total number of variants reported in this exon (n = 36), all tested in our minigene assay. We then exploited this large set of splicing data to test the predictive value of splicing regulator hexamers' scores recently established by Ke et al. (). Comparisons of hexamer-based predictions with our experimental data revealed high sensitivity in detecting variants that increased exon skipping, an important feature for prescreening variants before RNA analysis. In conclusion, hexamer scores represent a promising tool for predicting the biological consequences of exonic variants and may have important applications for the interpretation of variants detected by high-throughput sequencing. © 2013 WILEY PERIODICALS, INC.
TRACTOR_DB: a database of regulatory networks in gamma-proteobacterial genomes
González, Abel D.; Espinosa, Vladimir; Vasconcelos, Ana T.; Pérez-Rueda, Ernesto; Collado-Vides, Julio
2005-01-01
Experimental data on the Escherichia coli transcriptional regulatory system has been used in the past years to predict new regulatory elements (promoters, transcription factors (TFs), TFs' binding sites and operons) within its genome. As more genomes of gamma-proteobacteria are being sequenced, the prediction of these elements in a growing number of organisms has become more feasible, as a step towards the study of how different bacteria respond to environmental changes at the level of transcriptional regulation. In this work, we present TRACTOR_DB (TRAnscription FaCTORs' predicted binding sites in prokaryotic genomes), a relational database that contains computational predictions of new members of 74 regulons in 17 gamma-proteobacterial genomes. For these predictions we used a comparative genomics approach regarding which several proof-of-principle articles for large regulons have been published. TRACTOR_DB may be currently accessed at http://www.bioinfo.cu/Tractor_DB, http://www.tractor.lncc.br/ or at http://www.cifn.unam.mx/Computational_Genomics/tractorDB. Contact Email id is tractor@cifn.unam.mx. PMID:15608293
Fujibuchi, Wataru; Anderson, John S. J.; Landsman, David
2001-01-01
Consensus pattern and matrix-based searches designed to predict cis-acting transcriptional regulatory sequences have historically been subject to large numbers of false positives. We sought to decrease false positives by incorporating expression profile data into a consensus pattern-based search method. We have systematically analyzed the expression phenotypes of over 6000 yeast genes, across 121 expression profile experiments, and correlated them with the distribution of 14 known regulatory elements over sequences upstream of the genes. Our method is based on a metric we term probabilistic element assessment (PEA), which is a ranking of potential sites based on sequence similarity in the upstream regions of genes with similar expression phenotypes. For eight of the 14 known elements that we examined, our method had a much higher selectivity than a naïve consensus pattern search. Based on our analysis, we have developed a web-based tool called PROSPECT, which allows consensus pattern-based searching of gene clusters obtained from microarray data. PMID:11574681
DiRE: identifying distant regulatory elements of co-expressed genes
Gotea, Valer; Ovcharenko, Ivan
2008-01-01
Regulation of gene expression in eukaryotic genomes is established through a complex cooperative activity of proximal promoters and distant regulatory elements (REs) such as enhancers, repressors and silencers. We have developed a web server named DiRE, based on the Enhancer Identification (EI) method, for predicting distant regulatory elements in higher eukaryotic genomes, namely for determining their chromosomal location and functional characteristics. The server uses gene co-expression data, comparative genomics and profiles of transcription factor binding sites (TFBSs) to determine TFBS-association signatures that can be used for discriminating specific regulatory functions. DiRE's unique feature is its ability to detect REs outside of proximal promoter regions, as it takes advantage of the full gene locus to conduct the search. DiRE can predict common REs for any set of input genes for which the user has prior knowledge of co-expression, co-function or other biologically meaningful grouping. The server predicts function-specific REs consisting of clusters of specifically-associated TFBSs and it also scores the association of individual transcription factors (TFs) with the biological function shared by the group of input genes. Its integration with the Array2BIO server allows users to start their analysis with raw microarray expression data. The DiRE web server is freely available at http://dire.dcode.org. PMID:18487623
Explaining the disease phenotype of intergenic SNP through predicted long range regulation
Chen, Jingqi; Tian, Weidong
2016-01-01
Thousands of disease-associated SNPs (daSNPs) are located in intergenic regions (IGR), making it difficult to understand their association with disease phenotypes. Recent analysis found that non-coding daSNPs were frequently located in or approximate to regulatory elements, inspiring us to try to explain the disease phenotypes of IGR daSNPs through nearby regulatory sequences. Hence, after locating the nearest distal regulatory element (DRE) to a given IGR daSNP, we applied a computational method named INTREPID to predict the target genes regulated by the DRE, and then investigated their functional relevance to the IGR daSNP's disease phenotypes. 36.8% of all IGR daSNP-disease phenotype associations investigated were possibly explainable through the predicted target genes, which were enriched with, were functionally relevant to, or consisted of the corresponding disease genes. This proportion could be further increased to 60.5% if the LD SNPs of daSNPs were also considered. Furthermore, the predicted SNP-target gene pairs were enriched with known eQTL/mQTL SNP-gene relationships. Overall, it's likely that IGR daSNPs may contribute to disease phenotypes by interfering with the regulatory function of their nearby DREs and causing abnormal expression of disease genes. PMID:27280978
Chappell, James; Jensen, Kirsten; Freemont, Paul S.
2013-01-01
A bottleneck in our capacity to rationally and predictably engineer biological systems is the limited number of well-characterized genetic elements from which to build. Current characterization methods are tied to measurements in living systems, the transformation and culturing of which are inherently time-consuming. To address this, we have validated a completely in vitro approach for the characterization of DNA regulatory elements using Escherichia coli extract cell-free systems. Importantly, we demonstrate that characterization in cell-free systems correlates and is reflective of performance in vivo for the most frequently used DNA regulatory elements. Moreover, we devise a rapid and completely in vitro method to generate DNA templates for cell-free systems, bypassing the need for DNA template generation and amplification from living cells. This in vitro approach is significantly quicker than current characterization methods and is amenable to high-throughput techniques, providing a valuable tool for rapidly prototyping libraries of DNA regulatory elements for synthetic biology. PMID:23371936
Taylor, James; Tyekucheva, Svitlana; King, David C; Hardison, Ross C; Miller, Webb; Chiaromonte, Francesca
2006-12-01
Genomic sequence signals - such as base composition, presence of particular motifs, or evolutionary constraint - have been used effectively to identify functional elements. However, approaches based only on specific signals known to correlate with function can be quite limiting. When training data are available, application of computational learning algorithms to multispecies alignments has the potential to capture broader and more informative sequence and evolutionary patterns that better characterize a class of elements. However, effective exploitation of patterns in multispecies alignments is impeded by the vast number of possible alignment columns and by a limited understanding of which particular strings of columns may characterize a given class. We have developed a computational method, called ESPERR (evolutionary and sequence pattern extraction through reduced representations), which uses training examples to learn encodings of multispecies alignments into reduced forms tailored for the prediction of chosen classes of functional elements. ESPERR produces a greatly improved Regulatory Potential score, which can discriminate regulatory regions from neutral sites with excellent accuracy ( approximately 94%). This score captures strong signals (GC content and conservation), as well as subtler signals (with small contributions from many different alignment patterns) that characterize the regulatory elements in our training set. ESPERR is also effective for predicting other classes of functional elements, as we show for DNaseI hypersensitive sites and highly conserved regions with developmental enhancer activity. Our software, training data, and genome-wide predictions are available from our Web site (http://www.bx.psu.edu/projects/esperr).
Loots, Gabriela G
2008-01-01
Despite remarkable recent advances in genomics that have enabled us to identify most of the genes in the human genome, comparable efforts to define transcriptional cis-regulatory elements that control gene expression are lagging behind. The difficulty of this task stems from two equally important problems: our knowledge of how regulatory elements are encoded in genomes remains elementary, and there is a vast genomic search space for regulatory elements, since most of mammalian genomes are noncoding. Comparative genomic approaches are having a remarkable impact on the study of transcriptional regulation in eukaryotes and currently represent the most efficient and reliable methods of predicting noncoding sequences likely to control the patterns of gene expression. By subjecting eukaryotic genomic sequences to computational comparisons and subsequent experimentation, we are inching our way toward a more comprehensive catalog of common regulatory motifs that lie behind fundamental biological processes. We are still far from comprehending how the transcriptional regulatory code is encrypted in the human genome and providing an initial global view of regulatory gene networks, but collectively, the continued development of comparative and experimental approaches will rapidly expand our knowledge of the transcriptional regulome.
Nguyen, Quan H; Tellam, Ross L; Naval-Sanchez, Marina; Porto-Neto, Laercio R; Barendse, William; Reverter, Antonio; Hayes, Benjamin; Kijas, James; Dalrymple, Brian P
2018-01-01
Abstract Genome sequences for hundreds of mammalian species are available, but an understanding of their genomic regulatory regions, which control gene expression, is only beginning. A comprehensive prediction of potential active regulatory regions is necessary to functionally study the roles of the majority of genomic variants in evolution, domestication, and animal production. We developed a computational method to predict regulatory DNA sequences (promoters, enhancers, and transcription factor binding sites) in production animals (cows and pigs) and extended its broad applicability to other mammals. The method utilizes human regulatory features identified from thousands of tissues, cell lines, and experimental assays to find homologous regions that are conserved in sequences and genome organization and are enriched for regulatory elements in the genome sequences of other mammalian species. Importantly, we developed a filtering strategy, including a machine learning classification method, to utilize a very small number of species-specific experimental datasets available to select for the likely active regulatory regions. The method finds the optimal combination of sensitivity and accuracy to unbiasedly predict regulatory regions in mammalian species. Furthermore, we demonstrated the utility of the predicted regulatory datasets in cattle for prioritizing variants associated with multiple production and climate change adaptation traits and identifying potential genome editing targets. PMID:29618048
Nguyen, Quan H; Tellam, Ross L; Naval-Sanchez, Marina; Porto-Neto, Laercio R; Barendse, William; Reverter, Antonio; Hayes, Benjamin; Kijas, James; Dalrymple, Brian P
2018-03-01
Genome sequences for hundreds of mammalian species are available, but an understanding of their genomic regulatory regions, which control gene expression, is only beginning. A comprehensive prediction of potential active regulatory regions is necessary to functionally study the roles of the majority of genomic variants in evolution, domestication, and animal production. We developed a computational method to predict regulatory DNA sequences (promoters, enhancers, and transcription factor binding sites) in production animals (cows and pigs) and extended its broad applicability to other mammals. The method utilizes human regulatory features identified from thousands of tissues, cell lines, and experimental assays to find homologous regions that are conserved in sequences and genome organization and are enriched for regulatory elements in the genome sequences of other mammalian species. Importantly, we developed a filtering strategy, including a machine learning classification method, to utilize a very small number of species-specific experimental datasets available to select for the likely active regulatory regions. The method finds the optimal combination of sensitivity and accuracy to unbiasedly predict regulatory regions in mammalian species. Furthermore, we demonstrated the utility of the predicted regulatory datasets in cattle for prioritizing variants associated with multiple production and climate change adaptation traits and identifying potential genome editing targets.
Zhang, Weixiong; Ruan, Jianhua; Ho, Tuan-Hua David; You, Youngsook; Yu, Taotao; Quatrano, Ralph S
2005-07-15
A fundamental problem of computational genomics is identifying the genes that respond to certain endogenous cues and environmental stimuli. This problem can be referred to as targeted gene finding. Since gene regulation is mainly determined by the binding of transcription factors and cis-regulatory DNA sequences, most existing gene annotation methods, which exploit the conservation of open reading frames, are not effective in finding target genes. A viable approach to targeted gene finding is to exploit the cis-regulatory elements that are known to be responsible for the transcription of target genes. Given such cis-elements, putative target genes whose promoters contain the elements can be identified. As a case study, we apply the above approach to predict the genes in model plant Arabidopsis thaliana which are inducible by a phytohormone, abscisic acid (ABA), and abiotic stress, such as drought, cold and salinity. We first construct and analyze two ABA specific cis-elements, ABA-responsive element (ABRE) and its coupling element (CE), in A.thaliana, based on their conservation in rice and other cereal plants. We then use the ABRE-CE module to identify putative ABA-responsive genes in A.thaliana. Based on RT-PCR verification and the results from literature, this method has an accuracy rate of 67.5% for the top 40 predictions. The cis-element based targeted gene finding approach is expected to be widely applicable since a large number of cis-elements in many species are available.
Explaining the disease phenotype of intergenic SNP through predicted long range regulation.
Chen, Jingqi; Tian, Weidong
2016-10-14
Thousands of disease-associated SNPs (daSNPs) are located in intergenic regions (IGR), making it difficult to understand their association with disease phenotypes. Recent analysis found that non-coding daSNPs were frequently located in or approximate to regulatory elements, inspiring us to try to explain the disease phenotypes of IGR daSNPs through nearby regulatory sequences. Hence, after locating the nearest distal regulatory element (DRE) to a given IGR daSNP, we applied a computational method named INTREPID to predict the target genes regulated by the DRE, and then investigated their functional relevance to the IGR daSNP's disease phenotypes. 36.8% of all IGR daSNP-disease phenotype associations investigated were possibly explainable through the predicted target genes, which were enriched with, were functionally relevant to, or consisted of the corresponding disease genes. This proportion could be further increased to 60.5% if the LD SNPs of daSNPs were also considered. Furthermore, the predicted SNP-target gene pairs were enriched with known eQTL/mQTL SNP-gene relationships. Overall, it's likely that IGR daSNPs may contribute to disease phenotypes by interfering with the regulatory function of their nearby DREs and causing abnormal expression of disease genes. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Giorgetti, Luca; Galupa, Rafael; Nora, Elphège P.; Piolot, Tristan; Lam, France; Dekker, Job; Tiana, Guido; Heard, Edith
2015-01-01
Summary A new level of chromosome organization, Topologically Associating Domains (TADs), was recently uncovered by chromosome-confirmation-capture (3C) techniques. To explore TAD structure and function, we developed a polymer model that can extract the full repertoire of chromatin conformations within TADs from population-based 3C data. This model predicts actual physical distances and to what extent chromosomal contacts vary between cells. It also identifies interactions within single TADs that stabilize boundaries between TADs and allows us to identify and genetically validate key structural elements within TADs. Combining the model’s predictions with high-resolution DNA FISH and quantitative RNA FISH for TADs within the X-inactivation center (Xic), we dissect the relationship between transcription and spatial proximity to cis-regulatory elements. We demonstrate that contacts between potential regulatory elements occur in the context of fluctuating structures rather than stable loops and propose that such fluctuations may contribute to asymmetric expression in the Xic during X inactivation. PMID:24813616
Turatsinze, Jean-Valery; Thomas-Chollier, Morgane; Defrance, Matthieu; van Helden, Jacques
2008-01-01
This protocol shows how to detect putative cis-regulatory elements and regions enriched in such elements with the regulatory sequence analysis tools (RSAT) web server (http://rsat.ulb.ac.be/rsat/). The approach applies to known transcription factors, whose binding specificity is represented by position-specific scoring matrices, using the program matrix-scan. The detection of individual binding sites is known to return many false predictions. However, results can be strongly improved by estimating P value, and by searching for combinations of sites (homotypic and heterotypic models). We illustrate the detection of sites and enriched regions with a study case, the upstream sequence of the Drosophila melanogaster gene even-skipped. This protocol is also tested on random control sequences to evaluate the reliability of the predictions. Each task requires a few minutes of computation time on the server. The complete protocol can be executed in about one hour.
PlantTFDB 4.0: toward a central hub for transcription factors and regulatory interactions in plants.
Jin, Jinpu; Tian, Feng; Yang, De-Chang; Meng, Yu-Qi; Kong, Lei; Luo, Jingchu; Gao, Ge
2017-01-04
With the goal of providing a comprehensive, high-quality resource for both plant transcription factors (TFs) and their regulatory interactions with target genes, we upgraded plant TF database PlantTFDB to version 4.0 (http://planttfdb.cbi.pku.edu.cn/). In the new version, we identified 320 370 TFs from 165 species, presenting a more comprehensive genomic TF repertoires of green plants. Besides updating the pre-existing abundant functional and evolutionary annotation for identified TFs, we generated three new types of annotation which provide more directly clues to investigate functional mechanisms underlying: (i) a set of high-quality, non-redundant TF binding motifs derived from experiments; (ii) multiple types of regulatory elements identified from high-throughput sequencing data; (iii) regulatory interactions curated from literature and inferred by combining TF binding motifs and regulatory elements. In addition, we upgraded previous TF prediction server, and set up four novel tools for regulation prediction and functional enrichment analyses. Finally, we set up a novel companion portal PlantRegMap (http://plantregmap.cbi.pku.edu.cn) for users to access the regulation resource and analysis tools conveniently. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Uhl, Juli D.; Cook, Tiffany A.; Gebelein, Brian
2010-01-01
Hox transcription factors specify numerous cell fates along the anterior-posterior axis by regulating the expression of downstream target genes. While expression analysis has uncovered large numbers of de-regulated genes in cells with altered Hox activity, determining which are direct versus indirect targets has remained a significant challenge. Here, we characterize the DNA binding activity of Hox transcription factor complexes on eight experimentally verified cis-regulatory elements. Hox factors regulate the activity of each element by forming protein complexes with two cofactor proteins, Extradenticle (Exd) and Homothorax (Hth). Using comparative DNA binding assays, we found that a number of flexible arrangements of Hox, Exd, and Hth binding sites mediate cooperative transcription factor complexes. Moreover, analysis of a Distal-less regulatory element (DMXR) that is repressed by abdominal Hox factors revealed that suboptimal binding sites can be combined to form high affinity transcription complexes. Lastly, we determined that the anterior Hox factors are more dependent upon Exd and Hth for complex formation than posterior Hox factors. Based upon these findings, we suggest a general set of guidelines to serve as a basis for designing bioinformatics algorithms aimed at identifying Hox regulatory elements using the wealth of recently sequenced genomes. PMID:20398649
Chappell, James; Freemont, Paul
2013-01-01
The characterization of DNA regulatory elements such as ribosome binding sites and transcriptional promoters is a fundamental aim of synthetic biology. Characterization of such DNA regulatory elements by monitoring the synthesis of fluorescent proteins is a commonly used technique to resolve the relative or absolute strengths. These measurements can be used in combination with mathematical models and computer simulation to rapidly assess performance of DNA regulatory elements both in isolation and in combination, to assist predictable and efficient engineering of complex novel biological devices and systems. Here we describe the construction and relative characterization of Escherichia coli (E. coli) σ(70) transcriptional promoters by monitoring the synthesis of green fluorescent protein (GFP) both in vivo in E. coli and in vitro in a E. coli cell-free transcription and translation reaction.
The identification of cis-regulatory elements: A review from a machine learning perspective.
Li, Yifeng; Chen, Chih-Yu; Kaye, Alice M; Wasserman, Wyeth W
2015-12-01
The majority of the human genome consists of non-coding regions that have been called junk DNA. However, recent studies have unveiled that these regions contain cis-regulatory elements, such as promoters, enhancers, silencers, insulators, etc. These regulatory elements can play crucial roles in controlling gene expressions in specific cell types, conditions, and developmental stages. Disruption to these regions could contribute to phenotype changes. Precisely identifying regulatory elements is key to deciphering the mechanisms underlying transcriptional regulation. Cis-regulatory events are complex processes that involve chromatin accessibility, transcription factor binding, DNA methylation, histone modifications, and the interactions between them. The development of next-generation sequencing techniques has allowed us to capture these genomic features in depth. Applied analysis of genome sequences for clinical genetics has increased the urgency for detecting these regions. However, the complexity of cis-regulatory events and the deluge of sequencing data require accurate and efficient computational approaches, in particular, machine learning techniques. In this review, we describe machine learning approaches for predicting transcription factor binding sites, enhancers, and promoters, primarily driven by next-generation sequencing data. Data sources are provided in order to facilitate testing of novel methods. The purpose of this review is to attract computational experts and data scientists to advance this field. Crown Copyright © 2015. Published by Elsevier Ireland Ltd. All rights reserved.
Mohamed Salleh, Faridah Hani; Arif, Shereena Mohd; Zainudin, Suhaila; Firdaus-Raih, Mohd
2015-12-01
A gene regulatory network (GRN) is a large and complex network consisting of interacting elements that, over time, affect each other's state. The dynamics of complex gene regulatory processes are difficult to understand using intuitive approaches alone. To overcome this problem, we propose an algorithm for inferring the regulatory interactions from knock-out data using a Gaussian model combines with Pearson Correlation Coefficient (PCC). There are several problems relating to GRN construction that have been outlined in this paper. We demonstrated the ability of our proposed method to (1) predict the presence of regulatory interactions between genes, (2) their directionality and (3) their states (activation or suppression). The algorithm was applied to network sizes of 10 and 50 genes from DREAM3 datasets and network sizes of 10 from DREAM4 datasets. The predicted networks were evaluated based on AUROC and AUPR. We discovered that high false positive values were generated by our GRN prediction methods because the indirect regulations have been wrongly predicted as true relationships. We achieved satisfactory results as the majority of sub-networks achieved AUROC values above 0.5. Copyright © 2015 Elsevier Ltd. All rights reserved.
FARME DB: a functional antibiotic resistance element database
Wallace, James C.; Port, Jesse A.; Smith, Marissa N.; Faustman, Elaine M.
2017-01-01
Antibiotic resistance (AR) is a major global public health threat but few resources exist that catalog AR genes outside of a clinical context. Current AR sequence databases are assembled almost exclusively from genomic sequences derived from clinical bacterial isolates and thus do not include many microbial sequences derived from environmental samples that confer resistance in functional metagenomic studies. These environmental metagenomic sequences often show little or no similarity to AR sequences from clinical isolates using standard classification criteria. In addition, existing AR databases provide no information about flanking sequences containing regulatory or mobile genetic elements. To help address this issue, we created an annotated database of DNA and protein sequences derived exclusively from environmental metagenomic sequences showing AR in laboratory experiments. Our Functional Antibiotic Resistant Metagenomic Element (FARME) database is a compilation of publically available DNA sequences and predicted protein sequences conferring AR as well as regulatory elements, mobile genetic elements and predicted proteins flanking antibiotic resistant genes. FARME is the first database to focus on functional metagenomic AR gene elements and provides a resource to better understand AR in the 99% of bacteria which cannot be cultured and the relationship between environmental AR sequences and antibiotic resistant genes derived from cultured isolates. Database URL: http://staff.washington.edu/jwallace/farme PMID:28077567
Genome-wide inference of regulatory networks in Streptomyces coelicolor.
Castro-Melchor, Marlene; Charaniya, Salim; Karypis, George; Takano, Eriko; Hu, Wei-Shou
2010-10-18
The onset of antibiotics production in Streptomyces species is co-ordinated with differentiation events. An understanding of the genetic circuits that regulate these coupled biological phenomena is essential to discover and engineer the pharmacologically important natural products made by these species. The availability of genomic tools and access to a large warehouse of transcriptome data for the model organism, Streptomyces coelicolor, provides incentive to decipher the intricacies of the regulatory cascades and develop biologically meaningful hypotheses. In this study, more than 500 samples of genome-wide temporal transcriptome data, comprising wild-type and more than 25 regulatory gene mutants of Streptomyces coelicolor probed across multiple stress and medium conditions, were investigated. Information based on transcript and functional similarity was used to update a previously-predicted whole-genome operon map and further applied to predict transcriptional networks constituting modules enriched in diverse functions such as secondary metabolism, and sigma factor. The predicted network displays a scale-free architecture with a small-world property observed in many biological networks. The networks were further investigated to identify functionally-relevant modules that exhibit functional coherence and a consensus motif in the promoter elements indicative of DNA-binding elements. Despite the enormous experimental as well as computational challenges, a systems approach for integrating diverse genome-scale datasets to elucidate complex regulatory networks is beginning to emerge. We present an integrated analysis of transcriptome data and genomic features to refine a whole-genome operon map and to construct regulatory networks at the cistron level in Streptomyces coelicolor. The functionally-relevant modules identified in this study pose as potential targets for further studies and verification.
Improved regulatory element prediction based on tissue-specific local epigenomic signatures
He, Yupeng; Gorkin, David U.; Dickel, Diane E.; Nery, Joseph R.; Castanon, Rosa G.; Lee, Ah Young; Shen, Yin; Visel, Axel; Pennacchio, Len A.; Ren, Bing; Ecker, Joseph R.
2017-01-01
Accurate enhancer identification is critical for understanding the spatiotemporal transcriptional regulation during development as well as the functional impact of disease-related noncoding genetic variants. Computational methods have been developed to predict the genomic locations of active enhancers based on histone modifications, but the accuracy and resolution of these methods remain limited. Here, we present an algorithm, regulatory element prediction based on tissue-specific local epigenetic marks (REPTILE), which integrates histone modification and whole-genome cytosine DNA methylation profiles to identify the precise location of enhancers. We tested the ability of REPTILE to identify enhancers previously validated in reporter assays. Compared with existing methods, REPTILE shows consistently superior performance across diverse cell and tissue types, and the enhancer locations are significantly more refined. We show that, by incorporating base-resolution methylation data, REPTILE greatly improves upon current methods for annotation of enhancers across a variety of cell and tissue types. REPTILE is available at https://github.com/yupenghe/REPTILE/. PMID:28193886
2011-01-01
Background Gene regulatory networks play essential roles in living organisms to control growth, keep internal metabolism running and respond to external environmental changes. Understanding the connections and the activity levels of regulators is important for the research of gene regulatory networks. While relevance score based algorithms that reconstruct gene regulatory networks from transcriptome data can infer genome-wide gene regulatory networks, they are unfortunately prone to false positive results. Transcription factor activities (TFAs) quantitatively reflect the ability of the transcription factor to regulate target genes. However, classic relevance score based gene regulatory network reconstruction algorithms use models do not include the TFA layer, thus missing a key regulatory element. Results This work integrates TFA prediction algorithms with relevance score based network reconstruction algorithms to reconstruct gene regulatory networks with improved accuracy over classic relevance score based algorithms. This method is called Gene expression and Transcription factor activity based Relevance Network (GTRNetwork). Different combinations of TFA prediction algorithms and relevance score functions have been applied to find the most efficient combination. When the integrated GTRNetwork method was applied to E. coli data, the reconstructed genome-wide gene regulatory network predicted 381 new regulatory links. This reconstructed gene regulatory network including the predicted new regulatory links show promising biological significances. Many of the new links are verified by known TF binding site information, and many other links can be verified from the literature and databases such as EcoCyc. The reconstructed gene regulatory network is applied to a recent transcriptome analysis of E. coli during isobutanol stress. In addition to the 16 significantly changed TFAs detected in the original paper, another 7 significantly changed TFAs have been detected by using our reconstructed network. Conclusions The GTRNetwork algorithm introduces the hidden layer TFA into classic relevance score-based gene regulatory network reconstruction processes. Integrating the TFA biological information with regulatory network reconstruction algorithms significantly improves both detection of new links and reduces that rate of false positives. The application of GTRNetwork on E. coli gene transcriptome data gives a set of potential regulatory links with promising biological significance for isobutanol stress and other conditions. PMID:21668997
Identification of trans-acting factors regulating SamDC expression in Oryza sativa
DOE Office of Scientific and Technical Information (OSTI.GOV)
Basu, Supratim, E-mail: supratim_genetics@yahoo.co.in; Division of Plant Biology, Bose Institute, Kolkata; Roychoudhury, Aryadeep
2014-03-07
Highlights: • Identification of cis elements responsible for SamDC expression by in silico analysis. • qPCR analysis of SamDC expression to abiotic and biotic stress treatments. • Detection of SamDC regulators using identified cis-elements as probe by EMSA. • Southwestern Blot analysis to predict the size of the trans-acting factors. - Abstract: Abiotic stress affects the growth and productivity of crop plants; to cope with the adverse environmental conditions, plants have developed efficient defense machinery comprising of antioxidants like phenolics and flavonoids, and osmolytes like polyamines. SamDC is a key enzyme in the polyamine biosynthesis pathway in plants. In ourmore » present communication we have done in silico analysis of the promoter region of SamDC to look for the presence of different cis-regulatory elements contributing to its expression. Based on the presence of different cis-regulatory elements we completed comparative analysis of SamDC gene expression in rice lamina of IR-29 and Nonabokra by qPCR in response to the abiotic stress treatments of salinity, drought, cold and the biotic stress treatments of ABA and light. Additionally, to explore the role of the cis-regulatory elements in regulating the expression of SamDC gene in plants we comparatively analyzed the binding of rice nuclear proteins prepared from IR-29 and Nonabokra undergoing various stress treatments. The intensity of the complex formed was low and inducible in IR-29 in contrast to Nonabokra. Southwestern blot analysis helped in predicting the size of the trans-acting factors binding to these cis-elements. To our knowledge this is the first report on the comprehensive analysis of SamDC gene expression in rice and identification of the trans-acting factors regulating its expression.« less
Computational methods in sequence and structure prediction
NASA Astrophysics Data System (ADS)
Lang, Caiyi
This dissertation is organized into two parts. In the first part, we will discuss three computational methods for cis-regulatory element recognition in three different gene regulatory networks as the following: (a) Using a comprehensive "Phylogenetic Footprinting Comparison" method, we will investigate the promoter sequence structures of three enzymes (PAL, CHS and DFR) that catalyze sequential steps in the pathway from phenylalanine to anthocyanins in plants. Our result shows there exists a putative cis-regulatory element "AC(C/G)TAC(C)" in the upstream of these enzyme genes. We propose this cis-regulatory element to be responsible for the genetic regulation of these three enzymes and this element, might also be the binding site for MYB class transcription factor PAP1. (b) We will investigate the role of the Arabidopsis gene glutamate receptor 1.1 (AtGLR1.1) in C and N metabolism by utilizing the microarray data we obtained from AtGLR1.1 deficient lines (antiAtGLR1.1). We focus our investigation on the putatively co-regulated transcript profile of 876 genes we have collected in antiAtGLR1.1 lines. By (a) scanning the occurrence of several groups of known abscisic acid (ABA) related cisregulatory elements in the upstream regions of 876 Arabidopsis genes; and (b) exhaustive scanning of all possible 6-10 bps motif occurrence in the upstream regions of the same set of genes, we are able to make a quantative estimation on the enrichment level of each of the cis-regulatory element candidates. We finally conclude that one specific cis-regulatory element group, called "ABRE" elements, are statistically highly enriched within the 876-gene group as compared to their occurrence within the genome. (c) We will introduce a new general purpose algorithm, called "fuzzy REDUCE1", which we have developed recently for automated cis-regulatory element identification. In the second part, we will discuss our newly devised protein design framework. With this framework we have developed a software package which is capable of designing novel protein structures at the atomic resolution. This software package allows us to perform protein structure design with a flexible backbone. The backbone flexibility includes loop region relaxation as well as a secondary structure collective mode relaxation scheme. (Abstract shortened by UMI.)
Smith, Robin P; Riesenfeld, Samantha J; Holloway, Alisha K; Li, Qiang; Murphy, Karl K; Feliciano, Natalie M; Orecchia, Lorenzo; Oksenberg, Nir; Pollard, Katherine S; Ahituv, Nadav
2013-07-18
Large-scale annotation efforts have improved our ability to coarsely predict regulatory elements throughout vertebrate genomes. However, it is unclear how complex spatiotemporal patterns of gene expression driven by these elements emerge from the activity of short, transcription factor binding sequences. We describe a comprehensive promoter extension assay in which the regulatory potential of all 6 base-pair (bp) sequences was tested in the context of a minimal promoter. To enable this large-scale screen, we developed algorithms that use a reverse-complement aware decomposition of the de Bruijn graph to design a library of DNA oligomers incorporating every 6-bp sequence exactly once. Our library multiplexes all 4,096 unique 6-mers into 184 double-stranded 15-bp oligomers, which is sufficiently compact for in vivo testing. We injected each multiplexed construct into zebrafish embryos and scored GFP expression in 15 tissues at two developmental time points. Twenty-seven constructs produced consistent expression patterns, with the majority doing so in only one tissue. Functional sequences are enriched near biologically relevant genes, match motifs for developmental transcription factors, and are required for enhancer activity. By concatenating tissue-specific functional sequences, we generated completely synthetic enhancers for the notochord, epidermis, spinal cord, forebrain and otic lateral line, and show that short regulatory sequences do not always function modularly. This work introduces a unique in vivo catalog of short, functional regulatory sequences and demonstrates several important principles of regulatory element organization. Furthermore, we provide resources for designing compact, reverse-complement aware k-mer libraries.
Tsou, Ann-Ping; Sun, Yi-Ming; Liu, Chia-Lin; Huang, Hsien-Da; Horng, Jorng-Tzong; Tsai, Meng-Feng; Liu, Baw-Juine
2006-07-01
Identification of transcriptional regulatory sites plays an important role in the investigation of gene regulation. For this propose, we designed and implemented a data warehouse to integrate multiple heterogeneous biological data sources with data types such as text-file, XML, image, MySQL database model, and Oracle database model. The utility of the biological data warehouse in predicting transcriptional regulatory sites of coregulated genes was explored using a synexpression group derived from a microarray study. Both of the binding sites of known transcription factors and predicted over-represented (OR) oligonucleotides were demonstrated for the gene group. The potential biological roles of both known nucleotides and one OR nucleotide were demonstrated using bioassays. Therefore, the results from the wet-lab experiments reinforce the power and utility of the data warehouse as an approach to the genome-wide search for important transcription regulatory elements that are the key to many complex biological systems.
Zhang, Li-Feng; Li, Wan-Feng; Han, Su-Ying; Yang, Wen-Hua; Qi, Li-Wang
2013-10-15
A full-length cDNA and genomic sequences of a translationally controlled tumor protein (TCTP) gene were isolated from Japanese larch (Larix leptolepis) and designated LaTCTP. The length of the cDNA was 1, 043 bp and contained a 504 bp open reading frame that encodes a predicted protein of 167 amino acids, characterized by two signature sequences of the TCTP protein family. Analysis of the LaTCTP gene structure indicated four introns and five exons, and it is the largest of all currently known TCTP genes in plants. The 5'-flanking promoter region of LaTCTP was cloned using an improved TAIL-PCR technique. In this region we identified many important potential cis-acting elements, such as a Box-W1 (fungal elicitor responsive element), a CAT-box (cis-acting regulatory element related to meristem expression), a CGTCA-motif (cis-acting regulatory element involved in MeJA-responsiveness), a GT1-motif (light responsive element), a Skn-1-motif (cis-acting regulatory element required for endosperm expression) and a TGA-element (auxin-responsive element), suggesting that expression of LaTCTP is highly regulated. Expression analysis demonstrated ubiquitous localization of LaTCTP mRNA in the roots, stems and needles, high mRNA levels in the embryonal-suspensor mass (ESM), browning embryogenic cultures and mature somatic embryos, and low levels of mRNA at day five during somatic embryogenesis. We suggest that LaTCTP might participate in the regulation of somatic embryo development. These results provide a theoretical basis for understanding the molecular regulatory mechanism of LaTCTP and lay the foundation for artificial regulation of somatic embryogenesis. © 2013.
Majoros, William H; Ohler, Uwe
2010-12-16
The computational detection of regulatory elements in DNA is a difficult but important problem impacting our progress in understanding the complex nature of eukaryotic gene regulation. Attempts to utilize cross-species conservation for this task have been hampered both by evolutionary changes of functional sites and poor performance of general-purpose alignment programs when applied to non-coding sequence. We describe a new and flexible framework for modeling binding site evolution in multiple related genomes, based on phylogenetic pair hidden Markov models which explicitly model the gain and loss of binding sites along a phylogeny. We demonstrate the value of this framework for both the alignment of regulatory regions and the inference of precise binding-site locations within those regions. As the underlying formalism is a stochastic, generative model, it can also be used to simulate the evolution of regulatory elements. Our implementation is scalable in terms of numbers of species and sequence lengths and can produce alignments and binding-site predictions with accuracy rivaling or exceeding current systems that specialize in only alignment or only binding-site prediction. We demonstrate the validity and power of various model components on extensive simulations of realistic sequence data and apply a specific model to study Drosophila enhancers in as many as ten related genomes and in the presence of gain and loss of binding sites. Different models and modeling assumptions can be easily specified, thus providing an invaluable tool for the exploration of biological hypotheses that can drive improvements in our understanding of the mechanisms and evolution of gene regulation.
Mechanism of Chromosomal Boundary Action: Roadblock, Sink, or Loop?
Gohl, Daryl; Aoki, Tsutomu; Blanton, Jason; Shanower, Greg; Kappes, Gretchen; Schedl, Paul
2011-01-01
Boundary elements or insulators subdivide eukaryotic chromosomes into a series of structurally and functionally autonomous domains. They ensure that the action of enhancers and silencers is restricted to the domain in which these regulatory elements reside. Three models, the roadblock, sink/decoy, and topological loop, have been proposed to explain the insulating activity of boundary elements. Strong predictions about how boundaries will function in different experimental contexts can be drawn from these models. In the studies reported here, we have designed assays that test these predictions. The results of our assays are inconsistent with the expectations of the roadblock and sink models. Instead, they support the topological loop model. PMID:21196526
DEEP: a general computational framework for predicting enhancers
Kleftogiannis, Dimitrios; Kalnis, Panos; Bajic, Vladimir B.
2015-01-01
Transcription regulation in multicellular eukaryotes is orchestrated by a number of DNA functional elements located at gene regulatory regions. Some regulatory regions (e.g. enhancers) are located far away from the gene they affect. Identification of distal regulatory elements is a challenge for the bioinformatics research. Although existing methodologies increased the number of computationally predicted enhancers, performance inconsistency of computational models across different cell-lines, class imbalance within the learning sets and ad hoc rules for selecting enhancer candidates for supervised learning, are some key questions that require further examination. In this study we developed DEEP, a novel ensemble prediction framework. DEEP integrates three components with diverse characteristics that streamline the analysis of enhancer's properties in a great variety of cellular conditions. In our method we train many individual classification models that we combine to classify DNA regions as enhancers or non-enhancers. DEEP uses features derived from histone modification marks or attributes coming from sequence characteristics. Experimental results indicate that DEEP performs better than four state-of-the-art methods on the ENCODE data. We report the first computational enhancer prediction results on FANTOM5 data where DEEP achieves 90.2% accuracy and 90% geometric mean (GM) of specificity and sensitivity across 36 different tissues. We further present results derived using in vivo-derived enhancer data from VISTA database. DEEP-VISTA, when tested on an independent test set, achieved GM of 80.1% and accuracy of 89.64%. DEEP framework is publicly available at http://cbrc.kaust.edu.sa/deep/. PMID:25378307
In silico analysis of high affinity potassium transporter (HKT) isoforms in different plants
2014-01-01
Background High affinity potassium transporters (HKTs) are located in the plasma membrane of the vessels and have significant influence on salt tolerance in some plants. They exclude Na+ from the parenchyma cells to reduce Na+ concentration. Despite many studies, the underlying regulatory mechanisms and the exact functions of HKTs within different genomic backgrounds are relatively unknown. In this study, various bioinformatics techniques, including promoter analysis, identification of HKT-surrounding genes, and construction of gene networks, were applied to investigate the HKT regulatory mechanism. Results Promoter analysis showed that rice HKTs carry ABA response elements. Additionally, jasmonic acid response elements were detected on promoter region of TmHKT1;5. In silico synteny highlighted several unknown and new loci near rice, Arabidopsis thaliana and Physcomitrella patent HKTs, which may play a significant role in salt stress tolerance in concert with HKTs. Gene network prediction unravelled that crosstalk between jasmonate and ethylene reduces AtHKT1;1 expression. Furthermore, antiporter and transferase proteins were found in AtHKT1;1 gene network. Interestingly, regulatory elements on the promoter region of HKT in wild genotype (TmHKT1;5) were more frequent and variable than the ones in cultivated wheat (TaHKT1;5) which provides the possibility of rapid response and better understanding of environmental conditions for wild genotype. Conclusion Detecting ABA and jasmonic acid response elements on promoter regions of HKTs provide valuable clues on underlying regulatory mechanisms of HKTs. In silico synteny and pathway discovery indicated several candidates which act in concert with HKTs in stress condition. We highlighted different arrangement of regulatory elements on promoter region of wild wheat (TmHKT1;5) compared to bread wheat (TaHKT1;5) in this study. PMID:25279141
Transcriptomic analysis of rice aleurone cells identified a novel abscisic acid response element.
Watanabe, Kenneth A; Homayouni, Arielle; Gu, Lingkun; Huang, Kuan-Ying; Ho, Tuan-Hua David; Shen, Qingxi J
2017-09-01
Seeds serve as a great model to study plant responses to drought stress, which is largely mediated by abscisic acid (ABA). The ABA responsive element (ABRE) is a key cis-regulatory element in ABA signalling. However, its consensus sequence (ACGTG(G/T)C) is present in the promoters of only about 40% of ABA-induced genes in rice aleurone cells, suggesting other ABREs may exist. To identify novel ABREs, RNA sequencing was performed on aleurone cells of rice seeds treated with 20 μM ABA. Gibbs sampling was used to identify enriched elements, and particle bombardment-mediated transient expression studies were performed to verify the function. Gene ontology analysis was performed to predict the roles of genes containing the novel ABREs. This study revealed 2443 ABA-inducible genes and a novel ABRE, designated as ABREN, which was experimentally verified to mediate ABA signalling in rice aleurone cells. Many of the ABREN-containing genes are predicted to be involved in stress responses and transcription. Analysis of other species suggests that the ABREN may be monocot specific. This study also revealed interesting expression patterns of genes involved in ABA metabolism and signalling. Collectively, this study advanced our understanding of diverse cis-regulatory sequences and the transcriptomes underlying ABA responses in rice aleurone cells. © 2017 John Wiley & Sons Ltd.
Identification of an evolutionarily conserved regulatory element of the zebrafish col2a1a gene.
Dale, Rodney M; Topczewski, Jacek
2011-09-15
Zebrafish (Danio rerio) is an excellent model organism for the study of vertebrate development including skeletogenesis. Studies of mammalian cartilage formation were greatly advanced through the use of a cartilage specific regulatory element of the Collagen type II alpha 1 (Col2a1) gene. In an effort to isolate such an element in zebrafish, we compared the expression of two col2a1 homologues and found that expression of col2a1b, a previously uncharacterized zebrafish homologue, only partially overlaps with col2a1a. We focused our analysis on col2a1a, as it is expressed in both the stacked chondrocytes and the perichondrium. By comparing the genomic sequence surrounding the predicted transcriptional start site of col2a1a among several species of teleosts we identified a small highly conserved sequence (R2) located 1.7 kb upstream of the presumptive transcriptional initiation site. Interestingly, neither the sequence nor location of this element is conserved between teleost and mammalian Col2a1. We generated transient and stable transgenic lines with just the R2 element or the entire 1.7 kb fragment 5' of the transcriptional initiation site. The identified regulatory elements enable the tracking of cellular development in various tissues by driving robust reporter expression in craniofacial cartilage, ear, notochord, floor plate, hypochord and fins in a pattern similar to the expression of endogenous col2a1a. Using a reporter gene driven by the R2 regulatory element, we analyzed the morphogenesis of the notochord sheath cells as they withdraw from the stack of initially uniform cells and encase the inflating vacuolated notochord cells. Finally, we show that like endogenous col2a1a, craniofacial expression of these reporter constructs depends on Sox9a transcription factor activity. At the same time, notochord expression is maintained after Sox9a knockdown, suggesting that other factors can activate expression through the identified regulatory element in this tissue. Copyright © 2011 Elsevier Inc. All rights reserved.
Identification of an evolutionarily conserved regulatory element of the zebrafish col2a1a gene
Dale, Rodney M.; Topczewski, Jacek
2011-01-01
Zebrafish (Danio rerio) is an excellent model organism for the study of vertebrate development including skeletogenesis. Studies of mammalian cartilage formation were greatly advanced through the use of a cartilage specific regulatory element of the Collagen type II alpha 1 (Col2a1) gene. In an effort to isolate such an element in zebrafish, we compared the expression of two col2a1 homologues and found that expression of col2a1b, a previously uncharacterized zebrafish homologue, only partially overlaps with col2a1a. We focused our analysis on col2a1a, as it is expressed in both the stacked chondrocytes and the perichondrium. By comparing the genomic sequence surrounding the predicted transcriptional start site of col2a1a among several species of teleosts we identified a small highly conserved sequence (R2) located 1.7 kb upstream of the presumptive transcriptional initiation site. Interestingly, neither the sequence nor location of this element is conserved between teleost and mammalian Col2a1. We generated transient and stable transgenic lines with just the R2 element or the entire 1.7 kb fragment 5’ of the transcriptional initiation site. The identified regulatory elements enable the tracking of cellular development in various tissues by driving robust reporter expression in craniofacial cartilage, ear, notochord, floor plate, hypochord and fins in a pattern similar to the expression of endogenous col2a1a. Using a reporter gene driven by the R2 regulatory element, we analyzed the morphogenesis of the notochord sheath cells as they withdraw from the stack of initially uniform cells and encase the inflating vacuolated notochord cells. Finally, we show that like endogenous col2a1a, craniofacial expression of these reporter constructs depends on Sox9a transcription factor activity. At the same time, notochord expression is maintained after Sox9a knockdown, suggesting that other factors can activate expression through the identified regulatory element in this tissue. PMID:21723274
Burzynski, Grzegorz M.; Reed, Xylena; Taher, Leila; Stine, Zachary E.; Matsui, Takeshi; Ovcharenko, Ivan; McCallion, Andrew S.
2012-01-01
Illuminating the primary sequence encryption of enhancers is central to understanding the regulatory architecture of genomes. We have developed a machine learning approach to decipher motif patterns of hindbrain enhancers and identify 40,000 sequences in the human genome that we predict display regulatory control that includes the hindbrain. Consistent with their roles in hindbrain patterning, MEIS1, NKX6-1, as well as HOX and POU family binding motifs contributed strongly to this enhancer model. Predicted hindbrain enhancers are overrepresented at genes expressed in hindbrain and associated with nervous system development, and primarily reside in the areas of open chromatin. In addition, 77 (0.2%) of these predictions are identified as hindbrain enhancers on the VISTA Enhancer Browser, and 26,000 (60%) overlap enhancer marks (H3K4me1 or H3K27ac). To validate these putative hindbrain enhancers, we selected 55 elements distributed throughout our predictions and six low scoring controls for evaluation in a zebrafish transgenic assay. When assayed in mosaic transgenic embryos, 51/55 elements directed expression in the central nervous system. Furthermore, 30/34 (88%) predicted enhancers analyzed in stable zebrafish transgenic lines directed expression in the larval zebrafish hindbrain. Subsequent analysis of sequence fragments selected based upon motif clustering further confirmed the critical role of the motifs contributing to the classifier. Our results demonstrate the existence of a primary sequence code characteristic to hindbrain enhancers. This code can be accurately extracted using machine-learning approaches and applied successfully for de novo identification of hindbrain enhancers. This study represents a critical step toward the dissection of regulatory control in specific neuronal subtypes. PMID:22759862
Woznica, Arielle; Haeussler, Maximilian; Starobinska, Ella; Jemmett, Jessica; Li, Younan; Mount, David; Davidson, Brad
2012-08-01
The complex, partially redundant gene regulatory architecture underlying vertebrate heart formation has been difficult to characterize. Here, we dissect the primary cardiac gene regulatory network in the invertebrate chordate, Ciona intestinalis. The Ciona heart progenitor lineage is first specified by Fibroblast Growth Factor/Map Kinase (FGF/MapK) activation of the transcription factor Ets1/2 (Ets). Through microarray analysis of sorted heart progenitor cells, we identified the complete set of primary genes upregulated by FGF/Ets shortly after heart progenitor emergence. Combinatorial sequence analysis of these co-regulated genes generated a hypothetical regulatory code consisting of Ets binding sites associated with a specific co-motif, ATTA. Through extensive reporter analysis, we confirmed the functional importance of the ATTA co-motif in primary heart progenitor gene regulation. We then used the Ets/ATTA combination motif to successfully predict a number of additional heart progenitor gene regulatory elements, including an intronic element driving expression of the core conserved cardiac transcription factor, GATAa. This work significantly advances our understanding of the Ciona heart gene network. Furthermore, this work has begun to elucidate the precise regulatory architecture underlying the conserved, primary role of FGF/Ets in chordate heart lineage specification. Copyright © 2012 Elsevier Inc. All rights reserved.
Janky, Rekin's; van Helden, Jacques
2008-01-23
The detection of conserved motifs in promoters of orthologous genes (phylogenetic footprints) has become a common strategy to predict cis-acting regulatory elements. Several software tools are routinely used to raise hypotheses about regulation. However, these tools are generally used as black boxes, with default parameters. A systematic evaluation of optimal parameters for a footprint discovery strategy can bring a sizeable improvement to the predictions. We evaluate the performances of a footprint discovery approach based on the detection of over-represented spaced motifs. This method is particularly suitable for (but not restricted to) Bacteria, since such motifs are typically bound by factors containing a Helix-Turn-Helix domain. We evaluated footprint discovery in 368 Escherichia coli K12 genes with annotated sites, under 40 different combinations of parameters (taxonomical level, background model, organism-specific filtering, operon inference). Motifs are assessed both at the levels of correctness and significance. We further report a detailed analysis of 181 bacterial orthologs of the LexA repressor. Distinct motifs are detected at various taxonomical levels, including the 7 previously characterized taxon-specific motifs. In addition, we highlight a significantly stronger conservation of half-motifs in Actinobacteria, relative to Firmicutes, suggesting an intermediate state in specificity switching between the two Gram-positive phyla, and thereby revealing the on-going evolution of LexA auto-regulation. The footprint discovery method proposed here shows excellent results with E. coli and can readily be extended to predict cis-acting regulatory signals and propose testable hypotheses in bacterial genomes for which nothing is known about regulation.
Parker, Brian J; Moltke, Ida; Roth, Adam; Washietl, Stefan; Wen, Jiayu; Kellis, Manolis; Breaker, Ronald; Pedersen, Jakob Skou
2011-11-01
Regulatory RNA structures are often members of families with multiple paralogous instances across the genome. Family members share functional and structural properties, which allow them to be studied as a whole, facilitating both bioinformatic and experimental characterization. We have developed a comparative method, EvoFam, for genome-wide identification of families of regulatory RNA structures, based on primary sequence and secondary structure similarity. We apply EvoFam to a 41-way genomic vertebrate alignment. Genome-wide, we identify 220 human, high-confidence families outside protein-coding regions comprising 725 individual structures, including 48 families with known structural RNA elements. Known families identified include both noncoding RNAs, e.g., miRNAs and the recently identified MALAT1/MEN β lincRNA family; and cis-regulatory structures, e.g., iron-responsive elements. We also identify tens of new families supported by strong evolutionary evidence and other statistical evidence, such as GO term enrichments. For some of these, detailed analysis has led to the formulation of specific functional hypotheses. Examples include two hypothesized auto-regulatory feedback mechanisms: one involving six long hairpins in the 3'-UTR of MAT2A, a key metabolic gene that produces the primary human methyl donor S-adenosylmethionine; the other involving a tRNA-like structure in the intron of the tRNA maturation gene POP1. We experimentally validate the predicted MAT2A structures. Finally, we identify potential new regulatory networks, including large families of short hairpins enriched in immunity-related genes, e.g., TNF, FOS, and CTLA4, which include known transcript destabilizing elements. Our findings exemplify the diversity of post-transcriptional regulation and provide a resource for further characterization of new regulatory mechanisms and families of noncoding RNAs.
Mars, Ruben A T; Nicolas, Pierre; Denham, Emma L; van Dijl, Jan Maarten
2016-12-01
Bacteria can employ widely diverse RNA molecules to regulate their gene expression. Such molecules include trans-acting small regulatory RNAs, antisense RNAs, and a variety of transcriptional attenuation mechanisms in the 5' untranslated region. Thus far, most regulatory RNA research has focused on Gram-negative bacteria, such as Escherichia coli and Salmonella. Hence, there is uncertainty about whether the resulting insights can be extrapolated directly to other bacteria, such as the Gram-positive soil bacterium Bacillus subtilis. A recent study identified 1,583 putative regulatory RNAs in B. subtilis, whose expression was assessed across 104 conditions. Here, we review the current understanding of RNA-based regulation in B. subtilis, and we categorize the newly identified putative regulatory RNAs on the basis of their conservation in other bacilli and the stability of their predicted secondary structures. Our present evaluation of the publicly available data indicates that RNA-mediated gene regulation in B. subtilis mostly involves elements at the 5' ends of mRNA molecules. These can include 5' secondary structure elements and metabolite-, tRNA-, or protein-binding sites. Importantly, sense-independent segments are identified as the most conserved and structured potential regulatory RNAs in B. subtilis. Altogether, the present survey provides many leads for the identification of new regulatory RNA functions in B. subtilis. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Mars, Ruben A. T.; Nicolas, Pierre; Denham, Emma L.
2016-01-01
SUMMARY Bacteria can employ widely diverse RNA molecules to regulate their gene expression. Such molecules include trans-acting small regulatory RNAs, antisense RNAs, and a variety of transcriptional attenuation mechanisms in the 5′ untranslated region. Thus far, most regulatory RNA research has focused on Gram-negative bacteria, such as Escherichia coli and Salmonella. Hence, there is uncertainty about whether the resulting insights can be extrapolated directly to other bacteria, such as the Gram-positive soil bacterium Bacillus subtilis. A recent study identified 1,583 putative regulatory RNAs in B. subtilis, whose expression was assessed across 104 conditions. Here, we review the current understanding of RNA-based regulation in B. subtilis, and we categorize the newly identified putative regulatory RNAs on the basis of their conservation in other bacilli and the stability of their predicted secondary structures. Our present evaluation of the publicly available data indicates that RNA-mediated gene regulation in B. subtilis mostly involves elements at the 5′ ends of mRNA molecules. These can include 5′ secondary structure elements and metabolite-, tRNA-, or protein-binding sites. Importantly, sense-independent segments are identified as the most conserved and structured potential regulatory RNAs in B. subtilis. Altogether, the present survey provides many leads for the identification of new regulatory RNA functions in B. subtilis. PMID:27784798
Guo, Liyuan; Wang, Jing
2018-01-04
Here, we present the updated rSNPBase 3.0 database (http://rsnp3.psych.ac.cn), which provides human SNP-related regulatory elements, element-gene pairs and SNP-based regulatory networks. This database is the updated version of the SNP regulatory annotation database rSNPBase and rVarBase. In comparison to the last two versions, there are both structural and data adjustments in rSNPBase 3.0: (i) The most significant new feature is the expansion of analysis scope from SNP-related regulatory elements to include regulatory element-target gene pairs (E-G pairs), therefore it can provide SNP-based gene regulatory networks. (ii) Web function was modified according to data content and a new network search module is provided in the rSNPBase 3.0 in addition to the previous regulatory SNP (rSNP) search module. The two search modules support data query for detailed information (related-elements, element-gene pairs, and other extended annotations) on specific SNPs and SNP-related graphic networks constructed by interacting transcription factors (TFs), miRNAs and genes. (3) The type of regulatory elements was modified and enriched. To our best knowledge, the updated rSNPBase 3.0 is the first data tool supports SNP functional analysis from a regulatory network prospective, it will provide both a comprehensive understanding and concrete guidance for SNP-related regulatory studies. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Ibarra-Arellano, Miguel A.; Campos-González, Adrián I.; Treviño-Quintanilla, Luis G.; Tauch, Andreas; Freyre-González, Julio A.
2016-01-01
The availability of databases electronically encoding curated regulatory networks and of high-throughput technologies and methods to discover regulatory interactions provides an invaluable source of data to understand the principles underpinning the organization and evolution of these networks responsible for cellular regulation. Nevertheless, data on these sources never goes beyond the regulon level despite the fact that regulatory networks are complex hierarchical-modular structures still challenging our understanding. This brings the necessity for an inventory of systems across a large range of organisms, a key step to rendering feasible comparative systems biology approaches. In this work, we take the first step towards a global understanding of the regulatory networks organization by making a cartography of the functional architectures of diverse bacteria. Abasy (Across-bacteria systems) Atlas provides a comprehensive inventory of annotated functional systems, global network properties and systems-level elements (global regulators, modular genes shaping functional systems, basal machinery genes and intermodular genes) predicted by the natural decomposition approach for reconstructed and meta-curated regulatory networks across a large range of bacteria, including pathogenically and biotechnologically relevant organisms. The meta-curation of regulatory datasets provides the most complete and reliable set of regulatory interactions currently available, which can even be projected into subsets by considering the force or weight of evidence supporting them or the systems that they belong to. Besides, Abasy Atlas provides data enabling large-scale comparative systems biology studies aimed at understanding the common principles and particular lifestyle adaptions of systems across bacteria. Abasy Atlas contains systems and system-level elements for 50 regulatory networks comprising 78 649 regulatory interactions covering 42 bacteria in nine taxa, containing 3708 regulons and 1776 systems. All this brings together a large corpus of data that will surely inspire studies to generate hypothesis regarding the principles governing the evolution and organization of systems and the functional architectures controlling them. Database URL: http://abasy.ccg.unam.mx PMID:27242034
Transcription factor trapping by RNA in gene regulatory elements.
Sigova, Alla A; Abraham, Brian J; Ji, Xiong; Molinie, Benoit; Hannett, Nancy M; Guo, Yang Eric; Jangi, Mohini; Giallourakis, Cosmas C; Sharp, Phillip A; Young, Richard A
2015-11-20
Transcription factors (TFs) bind specific sequences in promoter-proximal and -distal DNA elements to regulate gene transcription. RNA is transcribed from both of these DNA elements, and some DNA binding TFs bind RNA. Hence, RNA transcribed from regulatory elements may contribute to stable TF occupancy at these sites. We show that the ubiquitously expressed TF Yin-Yang 1 (YY1) binds to both gene regulatory elements and their associated RNA species across the entire genome. Reduced transcription of regulatory elements diminishes YY1 occupancy, whereas artificial tethering of RNA enhances YY1 occupancy at these elements. We propose that RNA makes a modest but important contribution to the maintenance of certain TFs at gene regulatory elements and suggest that transcription of regulatory elements produces a positive-feedback loop that contributes to the stability of gene expression programs. Copyright © 2015, American Association for the Advancement of Science.
Characterization of noncoding regulatory DNA in the human genome.
Elkon, Ran; Agami, Reuven
2017-08-08
Genetic variants associated with common diseases are usually located in noncoding parts of the human genome. Delineation of the full repertoire of functional noncoding elements, together with efficient methods for probing their biological roles, is therefore of crucial importance. Over the past decade, DNA accessibility and various epigenetic modifications have been associated with regulatory functions. Mapping these features across the genome has enabled researchers to begin to document the full complement of putative regulatory elements. High-throughput reporter assays to probe the functions of regulatory regions have also been developed but these methods separate putative regulatory elements from the chromosome so that any effects of chromatin context and long-range regulatory interactions are lost. Definitive assignment of function(s) to putative cis-regulatory elements requires perturbation of these elements. Genome-editing technologies are now transforming our ability to perturb regulatory elements across entire genomes. Interpretation of high-throughput genetic screens that incorporate genome editors might enable the construction of an unbiased map of functional noncoding elements in the human genome.
Novel green tissue-specific synthetic promoters and cis-regulatory elements in rice.
Wang, Rui; Zhu, Menglin; Ye, Rongjian; Liu, Zuoxiong; Zhou, Fei; Chen, Hao; Lin, Yongjun
2015-12-11
As an important part of synthetic biology, synthetic promoter has gradually become a hotspot in current biology. The purposes of the present study were to synthesize green tissue-specific promoters and to discover green tissue-specific cis-elements. We first assembled several regulatory sequences related to tissue-specific expression in different combinations, aiming to obtain novel green tissue-specific synthetic promoters. GUS assays of the transgenic plants indicated 5 synthetic promoters showed green tissue-specific expression patterns and different expression efficiencies in various tissues. Subsequently, we scanned and counted the cis-elements in different tissue-specific promoters based on the plant cis-elements database PLACE and the rice cDNA microarray database CREP for green tissue-specific cis-element discovery, resulting in 10 potential cis-elements. The flanking sequence of one potential core element (GEAT) was predicted by bioinformatics. Then, the combination of GEAT and its flanking sequence was functionally identified with synthetic promoter. GUS assays of the transgenic plants proved its green tissue-specificity. Furthermore, the function of GEAT flanking sequence was analyzed in detail with site-directed mutagenesis. Our study provides an example for the synthesis of rice tissue-specific promoters and develops a feasible method for screening and functional identification of tissue-specific cis-elements with their flanking sequences at the genome-wide level in rice.
In Silico Detection of Sequence Variations Modifying Transcriptional Regulation
Andersen, Malin C; Engström, Pär G; Lithwick, Stuart; Arenillas, David; Eriksson, Per; Lenhard, Boris; Wasserman, Wyeth W; Odeberg, Jacob
2008-01-01
Identification of functional genetic variation associated with increased susceptibility to complex diseases can elucidate genes and underlying biochemical mechanisms linked to disease onset and progression. For genes linked to genetic diseases, most identified causal mutations alter an encoded protein sequence. Technological advances for measuring RNA abundance suggest that a significant number of undiscovered causal mutations may alter the regulation of gene transcription. However, it remains a challenge to separate causal genetic variations from linked neutral variations. Here we present an in silico driven approach to identify possible genetic variation in regulatory sequences. The approach combines phylogenetic footprinting and transcription factor binding site prediction to identify variation in candidate cis-regulatory elements. The bioinformatics approach has been tested on a set of SNPs that are reported to have a regulatory function, as well as background SNPs. In the absence of additional information about an analyzed gene, the poor specificity of binding site prediction is prohibitive to its application. However, when additional data is available that can give guidance on which transcription factor is involved in the regulation of the gene, the in silico binding site prediction improves the selection of candidate regulatory polymorphisms for further analyses. The bioinformatics software generated for the analysis has been implemented as a Web-based application system entitled RAVEN (regulatory analysis of variation in enhancers). The RAVEN system is available at http://www.cisreg.ca for all researchers interested in the detection and characterization of regulatory sequence variation. PMID:18208319
2018-01-01
Abstract Here, we present the updated rSNPBase 3.0 database (http://rsnp3.psych.ac.cn), which provides human SNP-related regulatory elements, element-gene pairs and SNP-based regulatory networks. This database is the updated version of the SNP regulatory annotation database rSNPBase and rVarBase. In comparison to the last two versions, there are both structural and data adjustments in rSNPBase 3.0: (i) The most significant new feature is the expansion of analysis scope from SNP-related regulatory elements to include regulatory element–target gene pairs (E–G pairs), therefore it can provide SNP-based gene regulatory networks. (ii) Web function was modified according to data content and a new network search module is provided in the rSNPBase 3.0 in addition to the previous regulatory SNP (rSNP) search module. The two search modules support data query for detailed information (related-elements, element-gene pairs, and other extended annotations) on specific SNPs and SNP-related graphic networks constructed by interacting transcription factors (TFs), miRNAs and genes. (3) The type of regulatory elements was modified and enriched. To our best knowledge, the updated rSNPBase 3.0 is the first data tool supports SNP functional analysis from a regulatory network prospective, it will provide both a comprehensive understanding and concrete guidance for SNP-related regulatory studies. PMID:29140525
Computational Approaches to Identify Promoters and cis-Regulatory Elements in Plant Genomes1
Rombauts, Stephane; Florquin, Kobe; Lescot, Magali; Marchal, Kathleen; Rouzé, Pierre; Van de Peer, Yves
2003-01-01
The identification of promoters and their regulatory elements is one of the major challenges in bioinformatics and integrates comparative, structural, and functional genomics. Many different approaches have been developed to detect conserved motifs in a set of genes that are either coregulated or orthologous. However, although recent approaches seem promising, in general, unambiguous identification of regulatory elements is not straightforward. The delineation of promoters is even harder, due to its complex nature, and in silico promoter prediction is still in its infancy. Here, we review the different approaches that have been developed for identifying promoters and their regulatory elements. We discuss the detection of cis-acting regulatory elements using word-counting or probabilistic methods (so-called “search by signal” methods) and the delineation of promoters by considering both sequence content and structural features (“search by content” methods). As an example of search by content, we explored in greater detail the association of promoters with CpG islands. However, due to differences in sequence content, the parameters used to detect CpG islands in humans and other vertebrates cannot be used for plants. Therefore, a preliminary attempt was made to define parameters that could possibly define CpG and CpNpG islands in Arabidopsis, by exploring the compositional landscape around the transcriptional start site. To this end, a data set of more than 5,000 gene sequences was built, including the promoter region, the 5′-untranslated region, and the first introns and coding exons. Preliminary analysis shows that promoter location based on the detection of potential CpG/CpNpG islands in the Arabidopsis genome is not straightforward. Nevertheless, because the landscape of CpG/CpNpG islands differs considerably between promoters and introns on the one side and exons (whether coding or not) on the other, more sophisticated approaches can probably be developed for the successful detection of “putative” CpG and CpNpG islands in plants. PMID:12857799
Bisognin, Andrea; Sales, Gabriele; Coppe, Alessandro; Bortoluzzi, Stefania; Romualdi, Chiara
2012-01-01
MAGIA2 (http://gencomp.bio.unipd.it/magia2) is an update, extension and evolution of the MAGIA web tool. It is dedicated to the integrated analysis of in silico target prediction, microRNA (miRNA) and gene expression data for the reconstruction of post-transcriptional regulatory networks. miRNAs are fundamental post-transcriptional regulators of several key biological and pathological processes. As miRNAs act prevalently through target degradation, their expression profiles are expected to be inversely correlated to those of the target genes. Low specificity of target prediction algorithms makes integration approaches an interesting solution for target prediction refinement. MAGIA2 performs this integrative approach supporting different association measures, multiple organisms and almost all target predictions algorithms. Nevertheless, miRNAs activity should be viewed as part of a more complex scenario where regulatory elements and their interactors generate a highly connected network and where gene expression profiles are the result of different levels of regulation. The updated MAGIA2 tries to dissect this complexity by reconstructing mixed regulatory circuits involving either miRNA or transcription factor (TF) as regulators. Two types of circuits are identified: (i) a TF that regulates both a miRNA and its target and (ii) a miRNA that regulates both a TF and its target. PMID:22618880
Genome-wide prediction of cis-regulatory regions using supervised deep learning methods.
Li, Yifeng; Shi, Wenqiang; Wasserman, Wyeth W
2018-05-31
In the human genome, 98% of DNA sequences are non-protein-coding regions that were previously disregarded as junk DNA. In fact, non-coding regions host a variety of cis-regulatory regions which precisely control the expression of genes. Thus, Identifying active cis-regulatory regions in the human genome is critical for understanding gene regulation and assessing the impact of genetic variation on phenotype. The developments of high-throughput sequencing and machine learning technologies make it possible to predict cis-regulatory regions genome wide. Based on rich data resources such as the Encyclopedia of DNA Elements (ENCODE) and the Functional Annotation of the Mammalian Genome (FANTOM) projects, we introduce DECRES based on supervised deep learning approaches for the identification of enhancer and promoter regions in the human genome. Due to their ability to discover patterns in large and complex data, the introduction of deep learning methods enables a significant advance in our knowledge of the genomic locations of cis-regulatory regions. Using models for well-characterized cell lines, we identify key experimental features that contribute to the predictive performance. Applying DECRES, we delineate locations of 300,000 candidate enhancers genome wide (6.8% of the genome, of which 40,000 are supported by bidirectional transcription data), and 26,000 candidate promoters (0.6% of the genome). The predicted annotations of cis-regulatory regions will provide broad utility for genome interpretation from functional genomics to clinical applications. The DECRES model demonstrates potentials of deep learning technologies when combined with high-throughput sequencing data, and inspires the development of other advanced neural network models for further improvement of genome annotations.
Structure and Regulatory Interactions of the Cytoplasmic Terminal Domains of Serotonin Transporter
2014-01-01
Uptake of neurotransmitters by sodium-coupled monoamine transporters of the NSS family is required for termination of synaptic transmission. Transport is tightly regulated by protein–protein interactions involving the small cytoplasmic segments at the amino- and carboxy-terminal ends of the transporter. Although structures of homologues provide information about the transmembrane regions of these transporters, the structural arrangement of the terminal domains remains largely unknown. Here, we combined molecular modeling, biochemical, and biophysical approaches in an iterative manner to investigate the structure of the 82-residue N-terminal and 30-residue C-terminal domains of human serotonin transporter (SERT). Several secondary structures were predicted in these domains, and structural models were built using the Rosetta fragment-based methodology. One-dimensional 1H nuclear magnetic resonance and circular dichroism spectroscopy supported the presence of helical elements in the isolated SERT N-terminal domain. Moreover, introducing helix-breaking residues within those elements altered the fluorescence resonance energy transfer signal between terminal cyan fluorescent protein and yellow fluorescent protein tags attached to full-length SERT, consistent with the notion that the fold of the terminal domains is relatively well-defined. Full-length models of SERT that are consistent with these and published experimental data were generated. The resultant models predict confined loci for the terminal domains and predict that they move apart during the transport-related conformational cycle, as predicted by structures of homologues and by the “rocking bundle” hypothesis, which is consistent with spectroscopic measurements. The models also suggest the nature of binding to regulatory interaction partners. This study provides a structural context for functional and regulatory mechanisms involving SERT terminal domains. PMID:25093911
Schmidt, Ellen M; Zhang, Ji; Zhou, Wei; Chen, Jin; Mohlke, Karen L; Chen, Y Eugene; Willer, Cristen J
2015-08-15
The majority of variation identified by genome wide association studies falls in non-coding genomic regions and is hypothesized to impact regulatory elements that modulate gene expression. Here we present a statistically rigorous software tool GREGOR (Genomic Regulatory Elements and Gwas Overlap algoRithm) for evaluating enrichment of any set of genetic variants with any set of regulatory features. Using variants from five phenotypes, we describe a data-driven approach to determine the tissue and cell types most relevant to a trait of interest and to identify the subset of regulatory features likely impacted by these variants. Last, we experimentally evaluate six predicted functional variants at six lipid-associated loci and demonstrate significant evidence for allele-specific impact on expression levels. GREGOR systematically evaluates enrichment of genetic variation with the vast collection of regulatory data available to explore novel biological mechanisms of disease and guide us toward the functional variant at trait-associated loci. GREGOR, including source code, documentation, examples, and executables, is available at http://genome.sph.umich.edu/wiki/GREGOR. cristen@umich.edu Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Kaplan, Oktay I; Berber, Burak; Hekim, Nezih; Doluca, Osman
2016-11-02
Many studies show that short non-coding sequences are widely conserved among regulatory elements. More and more conserved sequences are being discovered since the development of next generation sequencing technology. A common approach to identify conserved sequences with regulatory roles relies on topological changes such as hairpin formation at the DNA or RNA level. G-quadruplexes, non-canonical nucleic acid topologies with little established biological roles, are increasingly considered for conserved regulatory element discovery. Since the tertiary structure of G-quadruplexes is strongly dependent on the loop sequence which is disregarded by the generally accepted algorithm, we hypothesized that G-quadruplexes with similar topology and, indirectly, similar interaction patterns, can be determined using phylogenetic clustering based on differences in the loop sequences. Phylogenetic analysis of 52 G-quadruplex forming sequences in the Escherichia coli genome revealed two conserved G-quadruplex motifs with a potential regulatory role. Further analysis revealed that both motifs tend to form hairpins and G quadruplexes, as supported by circular dichroism studies. The phylogenetic analysis as described in this work can greatly improve the discovery of functional G-quadruplex structures and may explain unknown regulatory patterns. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Suciu, Maria C.; Telenius, Jelena
2017-01-01
In the era of genome-wide association studies (GWAS) and personalized medicine, predicting the impact of single nucleotide polymorphisms (SNPs) in regulatory elements is an important goal. Current approaches to determine the potential of regulatory SNPs depend on inadequate knowledge of cell-specific DNA binding motifs. Here, we present Sasquatch, a new computational approach that uses DNase footprint data to estimate and visualize the effects of noncoding variants on transcription factor binding. Sasquatch performs a comprehensive k-mer-based analysis of DNase footprints to determine any k-mer's potential for protein binding in a specific cell type and how this may be changed by sequence variants. Therefore, Sasquatch uses an unbiased approach, independent of known transcription factor binding sites and motifs. Sasquatch only requires a single DNase-seq data set per cell type, from any genotype, and produces consistent predictions from data generated by different experimental procedures and at different sequence depths. Here we demonstrate the effectiveness of Sasquatch using previously validated functional SNPs and benchmark its performance against existing approaches. Sasquatch is available as a versatile webtool incorporating publicly available data, including the human ENCODE collection. Thus, Sasquatch provides a powerful tool and repository for prioritizing likely regulatory SNPs in the noncoding genome. PMID:28904015
USDA-ARS?s Scientific Manuscript database
Transcription factors (TFs) are proteins that regulate the expression of target genes by binding to specific elements in their regulatory regions. Transcriptional regulators (TRs) also regulate the expression of target genes; however, they operate indirectly via interaction with the basal transcript...
Lenka, Sangram K; Lohia, Bikash; Kumar, Abhay; Chinnusamy, Viswanathan; Bansal, Kailash C
2009-02-01
Abscisic acid (ABA), the popular plant stress hormone, plays a key role in regulation of sub-set of stress responsive genes. These genes respond to ABA through specific transcription factors which bind to cis-regulatory elements present in their promoters. We discovered the ABA Responsive Element (ABRE) core (ACGT) containing CGMCACGTGB motif as over-represented motif among the promoters of ABA responsive co-expressed genes in rice. Targeted gene prediction strategy using this motif led to the identification of 402 protein coding genes potentially regulated by ABA-dependent molecular genetic network. RT-PCR analysis of arbitrarily chosen 45 genes from the predicted 402 genes confirmed 80% accuracy of our prediction. Plant Gene Ontology (GO) analysis of ABA responsive genes showed enrichment of signal transduction and stress related genes among diverse functional categories.
Torres, Matthew P; Dewhurst, Henry; Sundararaman, Niveda
2016-11-01
Post-translational modifications (PTMs) regulate protein behavior through modulation of protein-protein interactions, enzymatic activity, and protein stability essential in the translation of genotype to phenotype in eukaryotes. Currently, less than 4% of all eukaryotic PTMs are reported to have biological function - a statistic that continues to decrease with an increasing rate of PTM detection. Previously, we developed SAPH-ire (Structural Analysis of PTM Hotspots) - a method for the prioritization of PTM function potential that has been used effectively to reveal novel PTM regulatory elements in discrete protein families (Dewhurst et al., 2015). Here, we apply SAPH-ire to the set of eukaryotic protein families containing experimental PTM and 3D structure data - capturing 1,325 protein families with 50,839 unique PTM sites organized into 31,747 modified alignment positions (MAPs), of which 2010 (∼6%) possess known biological function. Here, we show that using an artificial neural network model (SAPH-ire NN) trained to identify MAP hotspots with biological function results in prediction outcomes that far surpass the use of single hotspot features, including nearest neighbor PTM clustering methods. We find the greatest enhancement in prediction for positions with PTM counts of five or less, which represent 98% of all MAPs in the eukaryotic proteome and 90% of all MAPs found to have biological function. Analysis of the top 1092 MAP hotspots revealed 267 of truly unknown function (containing 5443 distinct PTMs). Of these, 165 hotspots could be mapped to human KEGG pathways for normal and/or disease physiology. Many high-ranking hotspots were also found to be disease-associated pathogenic sites of amino acid substitution despite the lack of observable PTM in the human protein family member. Taken together, these experiments demonstrate that the functional relevance of a PTM can be predicted very effectively by neural network models, revealing a large but testable body of potential regulatory elements that impact hundreds of different biological processes important in eukaryotic biology and human health. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Dewhurst, Henry; Sundararaman, Niveda
2016-01-01
Post-translational modifications (PTMs) regulate protein behavior through modulation of protein-protein interactions, enzymatic activity, and protein stability essential in the translation of genotype to phenotype in eukaryotes. Currently, less than 4% of all eukaryotic PTMs are reported to have biological function - a statistic that continues to decrease with an increasing rate of PTM detection. Previously, we developed SAPH-ire (Structural Analysis of PTM Hotspots) - a method for the prioritization of PTM function potential that has been used effectively to reveal novel PTM regulatory elements in discrete protein families (Dewhurst et al., 2015). Here, we apply SAPH-ire to the set of eukaryotic protein families containing experimental PTM and 3D structure data - capturing 1,325 protein families with 50,839 unique PTM sites organized into 31,747 modified alignment positions (MAPs), of which 2010 (∼6%) possess known biological function. Here, we show that using an artificial neural network model (SAPH-ire NN) trained to identify MAP hotspots with biological function results in prediction outcomes that far surpass the use of single hotspot features, including nearest neighbor PTM clustering methods. We find the greatest enhancement in prediction for positions with PTM counts of five or less, which represent 98% of all MAPs in the eukaryotic proteome and 90% of all MAPs found to have biological function. Analysis of the top 1092 MAP hotspots revealed 267 of truly unknown function (containing 5443 distinct PTMs). Of these, 165 hotspots could be mapped to human KEGG pathways for normal and/or disease physiology. Many high-ranking hotspots were also found to be disease-associated pathogenic sites of amino acid substitution despite the lack of observable PTM in the human protein family member. Taken together, these experiments demonstrate that the functional relevance of a PTM can be predicted very effectively by neural network models, revealing a large but testable body of potential regulatory elements that impact hundreds of different biological processes important in eukaryotic biology and human health. PMID:27697855
Ibarra-Arellano, Miguel A; Campos-González, Adrián I; Treviño-Quintanilla, Luis G; Tauch, Andreas; Freyre-González, Julio A
2016-01-01
The availability of databases electronically encoding curated regulatory networks and of high-throughput technologies and methods to discover regulatory interactions provides an invaluable source of data to understand the principles underpinning the organization and evolution of these networks responsible for cellular regulation. Nevertheless, data on these sources never goes beyond the regulon level despite the fact that regulatory networks are complex hierarchical-modular structures still challenging our understanding. This brings the necessity for an inventory of systems across a large range of organisms, a key step to rendering feasible comparative systems biology approaches. In this work, we take the first step towards a global understanding of the regulatory networks organization by making a cartography of the functional architectures of diverse bacteria. Abasy ( A: cross- BA: cteria SY: stems) Atlas provides a comprehensive inventory of annotated functional systems, global network properties and systems-level elements (global regulators, modular genes shaping functional systems, basal machinery genes and intermodular genes) predicted by the natural decomposition approach for reconstructed and meta-curated regulatory networks across a large range of bacteria, including pathogenically and biotechnologically relevant organisms. The meta-curation of regulatory datasets provides the most complete and reliable set of regulatory interactions currently available, which can even be projected into subsets by considering the force or weight of evidence supporting them or the systems that they belong to. Besides, Abasy Atlas provides data enabling large-scale comparative systems biology studies aimed at understanding the common principles and particular lifestyle adaptions of systems across bacteria. Abasy Atlas contains systems and system-level elements for 50 regulatory networks comprising 78 649 regulatory interactions covering 42 bacteria in nine taxa, containing 3708 regulons and 1776 systems. All this brings together a large corpus of data that will surely inspire studies to generate hypothesis regarding the principles governing the evolution and organization of systems and the functional architectures controlling them.Database URL: http://abasy.ccg.unam.mx. © The Author(s) 2016. Published by Oxford University Press.
A Machine Learning Approach to Predict Gene Regulatory Networks in Seed Development in Arabidopsis
Ni, Ying; Aghamirzaie, Delasa; Elmarakeby, Haitham; Collakova, Eva; Li, Song; Grene, Ruth; Heath, Lenwood S.
2016-01-01
Gene regulatory networks (GRNs) provide a representation of relationships between regulators and their target genes. Several methods for GRN inference, both unsupervised and supervised, have been developed to date. Because regulatory relationships consistently reprogram in diverse tissues or under different conditions, GRNs inferred without specific biological contexts are of limited applicability. In this report, a machine learning approach is presented to predict GRNs specific to developing Arabidopsis thaliana embryos. We developed the Beacon GRN inference tool to predict GRNs occurring during seed development in Arabidopsis based on a support vector machine (SVM) model. We developed both global and local inference models and compared their performance, demonstrating that local models are generally superior for our application. Using both the expression levels of the genes expressed in developing embryos and prior known regulatory relationships, GRNs were predicted for specific embryonic developmental stages. The targets that are strongly positively correlated with their regulators are mostly expressed at the beginning of seed development. Potential direct targets were identified based on a match between the promoter regions of these inferred targets and the cis elements recognized by specific regulators. Our analysis also provides evidence for previously unknown inhibitory effects of three positive regulators of gene expression. The Beacon GRN inference tool provides a valuable model system for context-specific GRN inference and is freely available at https://github.com/BeaconProjectAtVirginiaTech/beacon_network_inference.git. PMID:28066488
De novo mutations in regulatory elements in neurodevelopmental disorders
Short, Patrick J.; McRae, Jeremy F.; Gallone, Giuseppe; Sifrim, Alejandro; Won, Hyejung; Geschwind, Daniel H.; Wright, Caroline F.; Firth, Helen V; FitzPatrick, David R.; Barrett, Jeffrey C.; Hurles, Matthew E.
2018-01-01
We previously estimated that 42% of patients with severe developmental disorders carry pathogenic de novo mutations in coding sequences. The role of de novo mutations in regulatory elements affecting genes associated with developmental disorders, or other genes, has been essentially unexplored. We identified de novo mutations in three classes of putative regulatory elements in almost 8,000 patients with developmental disorders. Here we show that de novo mutations in highly evolutionarily conserved fetal brain-active elements are significantly and specifically enriched in neurodevelopmental disorders. We identified a significant twofold enrichment of recurrently mutated elements. We estimate that, genome-wide, 1-3% of patients without a diagnostic coding variant carry pathogenic de novo mutations in fetal brain-active regulatory elements and that only 0.15% of all possible mutations within highly conserved fetal brain-active elements cause neurodevelopmental disorders with a dominant mechanism. Our findings represent a robust estimate of the contribution of de novo mutations in regulatory elements to this genetically heterogeneous set of disorders, and emphasize the importance of combining functional and evolutionary evidence to identify regulatory causes of genetic disorders. PMID:29562236
Cheng, Chao; Ung, Matthew; Grant, Gavin D.; Whitfield, Michael L.
2013-01-01
Cell cycle is a complex and highly supervised process that must proceed with regulatory precision to achieve successful cellular division. Despite the wide application, microarray time course experiments have several limitations in identifying cell cycle genes. We thus propose a computational model to predict human cell cycle genes based on transcription factor (TF) binding and regulatory motif information in their promoters. We utilize ENCODE ChIP-seq data and motif information as predictors to discriminate cell cycle against non-cell cycle genes. Our results show that both the trans- TF features and the cis- motif features are predictive of cell cycle genes, and a combination of the two types of features can further improve prediction accuracy. We apply our model to a complete list of GENCODE promoters to predict novel cell cycle driving promoters for both protein-coding genes and non-coding RNAs such as lincRNAs. We find that a similar percentage of lincRNAs are cell cycle regulated as protein-coding genes, suggesting the importance of non-coding RNAs in cell cycle division. The model we propose here provides not only a practical tool for identifying novel cell cycle genes with high accuracy, but also new insights on cell cycle regulation by TFs and cis-regulatory elements. PMID:23874175
Zhao, Ming-Tao; Shao, Ning-Yi; Hu, Shijun; Ma, Ning; Srinivasan, Rajini; Jahanbani, Fereshteh; Lee, Jaecheol; Zhang, Sophia L; Snyder, Michael P; Wu, Joseph C
2017-11-10
Regulatory DNA elements in the human genome play important roles in determining the transcriptional abundance and spatiotemporal gene expression during embryonic heart development and somatic cell reprogramming. It is not well known how chromatin marks in regulatory DNA elements are modulated to establish cell type-specific gene expression in the human heart. We aimed to decipher the cell type-specific epigenetic signatures in regulatory DNA elements and how they modulate heart-specific gene expression. We profiled genome-wide transcriptional activity and a variety of epigenetic marks in the regulatory DNA elements using massive RNA-seq (n=12) and ChIP-seq (chromatin immunoprecipitation combined with high-throughput sequencing; n=84) in human endothelial cells (CD31 + CD144 + ), cardiac progenitor cells (Sca-1 + ), fibroblasts (DDR2 + ), and their respective induced pluripotent stem cells. We uncovered 2 classes of regulatory DNA elements: class I was identified with ubiquitous enhancer (H3K4me1) and promoter (H3K4me3) marks in all cell types, whereas class II was enriched with H3K4me1 and H3K4me3 in a cell type-specific manner. Both class I and class II regulatory elements exhibited stimulatory roles in nearby gene expression in a given cell type. However, class I promoters displayed more dominant regulatory effects on transcriptional abundance regardless of distal enhancers. Transcription factor network analysis indicated that human induced pluripotent stem cells and somatic cells from the heart selected their preferential regulatory elements to maintain cell type-specific gene expression. In addition, we validated the function of these enhancer elements in transgenic mouse embryos and human cells and identified a few enhancers that could possibly regulate the cardiac-specific gene expression. Given that a large number of genetic variants associated with human diseases are located in regulatory DNA elements, our study provides valuable resources for deciphering the epigenetic modulation of regulatory DNA elements that fine-tune spatiotemporal gene expression in human cardiac development and diseases. © 2017 American Heart Association, Inc.
Evidence of reduced recombination rate in human regulatory domains.
Liu, Yaping; Sarkar, Abhishek; Kheradpour, Pouya; Ernst, Jason; Kellis, Manolis
2017-10-20
Recombination rate is non-uniformly distributed across the human genome. The variation of recombination rate at both fine and large scales cannot be fully explained by DNA sequences alone. Epigenetic factors, particularly DNA methylation, have recently been proposed to influence the variation in recombination rate. We study the relationship between recombination rate and gene regulatory domains, defined by a gene and its linked control elements. We define these links using expression quantitative trait loci (eQTLs), methylation quantitative trait loci (meQTLs), chromatin conformation from publicly available datasets (Hi-C and ChIA-PET), and correlated activity links that we infer across cell types. Each link type shows a "recombination rate valley" of significantly reduced recombination rate compared to matched control regions. This recombination rate valley is most pronounced for gene regulatory domains of early embryonic development genes, housekeeping genes, and constitutive regulatory elements, which are known to show increased evolutionary constraint across species. Recombination rate valleys show increased DNA methylation, reduced doublestranded break initiation, and increased repair efficiency, specifically in the lineage leading to the germ line. Moreover, by using only the overlap of functional links and DNA methylation in germ cells, we are able to predict the recombination rate with high accuracy. Our results suggest the existence of a recombination rate valley at regulatory domains and provide a potential molecular mechanism to interpret the interplay between genetic and epigenetic variations.
Martinez, Carlos A.; Barr, Kenneth; Kim, Ah-Ram; Reinitz, John
2013-01-01
Synthetic biology offers novel opportunities for elucidating transcriptional regulatory mechanisms and enhancer logic. Complex cis-regulatory sequences—like the ones driving expression of the Drosophila even-skipped gene—have proven difficult to design from existing knowledge, presumably due to the large number of protein-protein interactions needed to drive the correct expression patterns of genes in multicellular organisms. This work discusses two novel computational methods for the custom design of enhancers that employ a sophisticated, empirically validated transcriptional model, optimization algorithms, and synthetic biology. These synthetic elements have both utilitarian and academic value, including improving existing regulatory models as well as evolutionary questions. The first method involves the use of simulated annealing to explore the sequence space for synthetic enhancers whose expression output fit a given search criterion. The second method uses a novel optimization algorithm to find functionally accessible pathways between two enhancer sequences. These paths describe a set of mutations wherein the predicted expression pattern does not significantly vary at any point along the path. Both methods rely on a predictive mathematical framework that maps the enhancer sequence space to functional output. PMID:23732772
Gómez-Porras, Judith L; Riaño-Pachón, Diego Mauricio; Dreyer, Ingo; Mayer, Jorge E; Mueller-Roeber, Bernd
2007-01-01
Background In plants, complex regulatory mechanisms are at the core of physiological and developmental processes. The phytohormone abscisic acid (ABA) is involved in the regulation of various such processes, including stomatal closure, seed and bud dormancy, and physiological responses to cold, drought and salinity stress. The underlying tissue or plant-wide control circuits often include combinatorial gene regulatory mechanisms and networks that we are only beginning to unravel with the help of new molecular tools. The increasing availability of genomic sequences and gene expression data enables us to dissect ABA regulatory mechanisms at the individual gene expression level. In this paper we used an in-silico-based approach directed towards genome-wide prediction and identification of specific features of ABA-responsive elements. In particular we analysed the genome-wide occurrence and positional arrangements of two well-described ABA-responsive cis-regulatory elements (CREs), ABRE and CE3, in thale cress (Arabidopsis thaliana) and rice (Oryza sativa). Results Our results show that Arabidopsis and rice use the ABA-responsive elements ABRE and CE3 distinctively. Earlier reports for various monocots have identified CE3 as a coupling element (CE) associated with ABRE. Surprisingly, we found that while ABRE is equally abundant in both species, CE3 is practically absent in Arabidopsis. ABRE-ABRE pairs are common in both genomes, suggesting that these can form functional ABA-responsive complexes (ABRCs) in Arabidopsis and rice. Furthermore, we detected distinct combinations, orientation patterns and DNA strand preferences of ABRE and CE3 motifs in rice gene promoters. Conclusion Our computational analyses revealed distinct recruitment patterns of ABA-responsive CREs in upstream sequences of Arabidopsis and rice. The apparent absence of CE3s in Arabidopsis suggests that another CE pairs with ABRE to establish a functional ABRC capable of interacting with transcription factors. Further studies will be needed to test whether the observed differences are extrapolatable to monocots and dicots in general, and to understand how they contribute to the fine-tuning of the hormonal response. The outcome of our investigation can now be used to direct future experimentation designed to further dissect the ABA-dependent regulatory networks. PMID:17672917
Gómez-Porras, Judith L; Riaño-Pachón, Diego Mauricio; Dreyer, Ingo; Mayer, Jorge E; Mueller-Roeber, Bernd
2007-08-01
In plants, complex regulatory mechanisms are at the core of physiological and developmental processes. The phytohormone abscisic acid (ABA) is involved in the regulation of various such processes, including stomatal closure, seed and bud dormancy, and physiological responses to cold, drought and salinity stress. The underlying tissue or plant-wide control circuits often include combinatorial gene regulatory mechanisms and networks that we are only beginning to unravel with the help of new molecular tools. The increasing availability of genomic sequences and gene expression data enables us to dissect ABA regulatory mechanisms at the individual gene expression level. In this paper we used an in-silico-based approach directed towards genome-wide prediction and identification of specific features of ABA-responsive elements. In particular we analysed the genome-wide occurrence and positional arrangements of two well-described ABA-responsive cis-regulatory elements (CREs), ABRE and CE3, in thale cress (Arabidopsis thaliana) and rice (Oryza sativa). Our results show that Arabidopsis and rice use the ABA-responsive elements ABRE and CE3 distinctively. Earlier reports for various monocots have identified CE3 as a coupling element (CE) associated with ABRE. Surprisingly, we found that while ABRE is equally abundant in both species, CE3 is practically absent in Arabidopsis. ABRE-ABRE pairs are common in both genomes, suggesting that these can form functional ABA-responsive complexes (ABRCs) in Arabidopsis and rice. Furthermore, we detected distinct combinations, orientation patterns and DNA strand preferences of ABRE and CE3 motifs in rice gene promoters. Our computational analyses revealed distinct recruitment patterns of ABA-responsive CREs in upstream sequences of Arabidopsis and rice. The apparent absence of CE3s in Arabidopsis suggests that another CE pairs with ABRE to establish a functional ABRC capable of interacting with transcription factors. Further studies will be needed to test whether the observed differences are extrapolatable to monocots and dicots in general, and to understand how they contribute to the fine-tuning of the hormonal response. The outcome of our investigation can now be used to direct future experimentation designed to further dissect the ABA-dependent regulatory networks.
Schwessinger, Ron; Suciu, Maria C; McGowan, Simon J; Telenius, Jelena; Taylor, Stephen; Higgs, Doug R; Hughes, Jim R
2017-10-01
In the era of genome-wide association studies (GWAS) and personalized medicine, predicting the impact of single nucleotide polymorphisms (SNPs) in regulatory elements is an important goal. Current approaches to determine the potential of regulatory SNPs depend on inadequate knowledge of cell-specific DNA binding motifs. Here, we present Sasquatch, a new computational approach that uses DNase footprint data to estimate and visualize the effects of noncoding variants on transcription factor binding. Sasquatch performs a comprehensive k -mer-based analysis of DNase footprints to determine any k -mer's potential for protein binding in a specific cell type and how this may be changed by sequence variants. Therefore, Sasquatch uses an unbiased approach, independent of known transcription factor binding sites and motifs. Sasquatch only requires a single DNase-seq data set per cell type, from any genotype, and produces consistent predictions from data generated by different experimental procedures and at different sequence depths. Here we demonstrate the effectiveness of Sasquatch using previously validated functional SNPs and benchmark its performance against existing approaches. Sasquatch is available as a versatile webtool incorporating publicly available data, including the human ENCODE collection. Thus, Sasquatch provides a powerful tool and repository for prioritizing likely regulatory SNPs in the noncoding genome. © 2017 Schwessinger et al.; Published by Cold Spring Harbor Laboratory Press.
Simon, Jeremy M.; Giresi, Paul G.; Davis, Ian J.; Lieb, Jason D.
2013-01-01
Eviction or destabilization of nucleosomes from chromatin is a hallmark of functional regulatory elements of the eukaryotic genome. Historically identified by nuclease hypersensitivity, these regulatory elements are typically bound by transcription factors or other regulatory proteins. FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements) is an alternative approach to identify these genomic regions and has proven successful in a multitude of eukaryotic cell and tissue types. Cells or dissociated tissues are crosslinked briefly with formaldehyde, lysed, and sonicated. Sheared chromatin is subjected to phenol-chloroform extraction and the isolated DNA, typically encompassing 1–3% of the human genome, is purified. We provide guidelines for quantitative analysis by PCR, microarrays, or next-generation sequencing. Regulatory elements enriched by FAIRE display high concordance with those identified by nuclease hypersensitivity or ChIP, and the entire procedure can be completed in three days. FAIRE exhibits low technical variability, which allows its use in large-scale studies of chromatin from normal or diseased tissues. PMID:22262007
A liver enhancer in the fibrinogen gene cluster.
Fort, Alexandre; Fish, Richard J; Attanasio, Catia; Dosch, Roland; Visel, Axel; Neerman-Arbez, Marguerite
2011-01-06
The plasma concentration of fibrinogen varies in the healthy human population between 1.5 and 3.5 g/L. Understanding the basis of this variability has clinical importance because elevated fibrinogen levels are associated with increased cardiovascular disease risk. To identify novel regulatory elements involved in the control of fibrinogen expression, we used sequence conservation and in silico-predicted regulatory potential to select 14 conserved noncoding sequences (CNCs) within the conserved block of synteny containing the fibrinogen locus. The regulatory potential of each CNC was tested in vitro using a luciferase reporter gene assay in fibrinogen-expressing hepatoma cell lines (HuH7 and HepG2). 4 potential enhancers were tested for their ability to direct enhanced green fluorescent protein expression in zebrafish embryos. CNC12, a sequence equidistant from the human fibrinogen alpha and beta chain genes, activates strong liver enhanced green fluorescent protein expression in injected embryos and their transgenic progeny. A transgenic assay in embryonic day 14.5 mouse embryos confirmed the ability of CNC12 to activate transcription in the liver. While additional experiments are necessary to prove the role of CNC12 in the regulation of fibrinogen, our study reveals a novel regulatory element in the fibrinogen locus that is active in the liver and may contribute to variable fibrinogen expression in humans.
Improved regulatory element prediction based on tissue-specific local epigenomic signatures
DOE Office of Scientific and Technical Information (OSTI.GOV)
He, Yupeng; Gorkin, David U.; Dickel, Diane E.
Accurate enhancer identification is critical for understanding the spatiotemporal transcriptional regulation during development as well as the functional impact of disease-related noncoding genetic variants. Computational methods have been developed to predict the genomic locations of active enhancers based on histone modifications, but the accuracy and resolution of these methods remain limited. Here, we present an algorithm, regulator y element prediction based on tissue-specific local epigenetic marks (REPTILE), which integrates histone modification and whole-genome cytosine DNA methylation profiles to identify the precise location of enhancers. We tested the ability of REPTILE to identify enhancers previously validated in reporter assays. Compared withmore » existing methods, REPTILE shows consistently superior performance across diverse cell and tissue types, and the enhancer locations are significantly more refined. We show that, by incorporating base-resolution methylation data, REPTILE greatly improves upon current methods for annotation of enhancers across a variety of cell and tissue types.« less
Improved regulatory element prediction based on tissue-specific local epigenomic signatures
He, Yupeng; Gorkin, David U.; Dickel, Diane E.; ...
2017-02-13
Accurate enhancer identification is critical for understanding the spatiotemporal transcriptional regulation during development as well as the functional impact of disease-related noncoding genetic variants. Computational methods have been developed to predict the genomic locations of active enhancers based on histone modifications, but the accuracy and resolution of these methods remain limited. Here, we present an algorithm, regulator y element prediction based on tissue-specific local epigenetic marks (REPTILE), which integrates histone modification and whole-genome cytosine DNA methylation profiles to identify the precise location of enhancers. We tested the ability of REPTILE to identify enhancers previously validated in reporter assays. Compared withmore » existing methods, REPTILE shows consistently superior performance across diverse cell and tissue types, and the enhancer locations are significantly more refined. We show that, by incorporating base-resolution methylation data, REPTILE greatly improves upon current methods for annotation of enhancers across a variety of cell and tissue types.« less
Discriminative prediction of mammalian enhancers from DNA sequence
Lee, Dongwon; Karchin, Rachel; Beer, Michael A.
2011-01-01
Accurately predicting regulatory sequences and enhancers in entire genomes is an important but difficult problem, especially in large vertebrate genomes. With the advent of ChIP-seq technology, experimental detection of genome-wide EP300/CREBBP bound regions provides a powerful platform to develop predictive tools for regulatory sequences and to study their sequence properties. Here, we develop a support vector machine (SVM) framework which can accurately identify EP300-bound enhancers using only genomic sequence and an unbiased set of general sequence features. Moreover, we find that the predictive sequence features identified by the SVM classifier reveal biologically relevant sequence elements enriched in the enhancers, but we also identify other features that are significantly depleted in enhancers. The predictive sequence features are evolutionarily conserved and spatially clustered, providing further support of their functional significance. Although our SVM is trained on experimental data, we also predict novel enhancers and show that these putative enhancers are significantly enriched in both ChIP-seq signal and DNase I hypersensitivity signal in the mouse brain and are located near relevant genes. Finally, we present results of comparisons between other EP300/CREBBP data sets using our SVM and uncover sequence elements enriched and/or depleted in the different classes of enhancers. Many of these sequence features play a role in specifying tissue-specific or developmental-stage-specific enhancer activity, but our results indicate that some features operate in a general or tissue-independent manner. In addition to providing a high confidence list of enhancer targets for subsequent experimental investigation, these results contribute to our understanding of the general sequence structure of vertebrate enhancers. PMID:21875935
A Predictive Approach to Network Reverse-Engineering
NASA Astrophysics Data System (ADS)
Wiggins, Chris
2005-03-01
A central challenge of systems biology is the ``reverse engineering" of transcriptional networks: inferring which genes exert regulatory control over which other genes. Attempting such inference at the genomic scale has only recently become feasible, via data-intensive biological innovations such as DNA microrrays (``DNA chips") and the sequencing of whole genomes. In this talk we present a predictive approach to network reverse-engineering, in which we integrate DNA chip data and sequence data to build a model of the transcriptional network of the yeast S. cerevisiae capable of predicting the response of genes in unseen experiments. The technique can also be used to extract ``motifs,'' sequence elements which act as binding sites for regulatory proteins. We validate by a number of approaches and present comparison of theoretical prediction vs. experimental data, along with biological interpretations of the resulting model. En route, we will illustrate some basic notions in statistical learning theory (fitting vs. over-fitting; cross- validation; assessing statistical significance), highlighting ways in which physicists can make a unique contribution in data- driven approaches to reverse engineering.
Naturally occurring deletions of hunchback binding sites in the even-skipped stripe 3+7 enhancer.
Palsson, Arnar; Wesolowska, Natalia; Reynisdóttir, Sigrún; Ludwig, Michael Z; Kreitman, Martin
2014-01-01
Changes in regulatory DNA contribute to phenotypic differences within and between taxa. Comparative studies show that many transcription factor binding sites (TFBS) are conserved between species whereas functional studies reveal that some mutations segregating within species alter TFBS function. Consistently, in this analysis of 13 regulatory elements in Drosophila melanogaster populations, single base and insertion/deletion polymorphism are rare in characterized regulatory elements. Experimentally defined TFBS are nearly devoid of segregating mutations and, as has been shown before, are quite conserved. For instance 8 of 11 Hunchback binding sites in the stripe 3+7 enhancer of even-skipped are conserved between D. melanogaster and Drosophila virilis. Oddly, we found a 72 bp deletion that removes one of these binding sites (Hb8), segregating within D. melanogaster. Furthermore, a 45 bp deletion polymorphism in the spacer between the stripe 3+7 and stripe 2 enhancers, removes another predicted Hunchback site. These two deletions are separated by ∼250 bp, sit on distinct haplotypes, and segregate at appreciable frequency. The Hb8Δ is at 5 to 35% frequency in the new world, but also shows cosmopolitan distribution. There is depletion of sequence variation on the Hb8Δ-carrying haplotype. Quantitative genetic tests indicate that Hb8Δ affects developmental time, but not viability of offspring. The Eve expression pattern differs between inbred lines, but the stripe 3 and 7 boundaries seem unaffected by Hb8Δ. The data reveal segregating variation in regulatory elements, which may reflect evolutionary turnover of characterized TFBS due to drift or co-evolution.
N-3 polyunsaturated fatty acid regulation of hepatic gene transcription
Jump, Donald B.
2009-01-01
Purpose of review The liver plays a central role in whole body lipid metabolism and adapts rapidly to changes in dietary fat composition. This adaption involves changes in the expression of genes involved in glycolysis, de-novo lipogenesis, fatty acid elongation, desaturation and oxidation. This review brings together metabolic and molecular studies that help explain n-3 (omega-3) polyunsaturated fatty acid regulation of hepatic gene transcription. Recent findings Dietary n-3 polyunsaturated fatty acid regulates hepatic gene expression by targeting three major transcriptional regulatory networks: peroxisome proliferator-activated receptor α, sterol regulatory element binding protein-1 and the carbohydrate regulatory element binding protein/Max-like factor X heterodimer. 22 : 6,n-3, the most prominent n-3 polyunsaturated fatty acid in tissues, is a weak activator of peroxisome proliferator-activated receptor α. Hepatic metabolism of 22 : 6,n-3, however, generates 20 : 5,n-3, a strong peroxisome proliferator-activated receptor α activator. In contrast to peroxisome proliferator-activated receptor α, 22 : 6,n-3 is the most potent fatty acid regulator of hepatic sterol regulatory element binding protein-1. 22 : 6,n-3 suppresses sterol regulatory element binding protein-1 gene expression while enhancing degradation of nuclear sterol regulatory element binding protein-1 through 26S proteasome and Erk1/2-dependent mechanisms. Both n-3 and n-6 polyunsaturated fatty acid suppress carbohydrate regulatory element binding protein and Max-like factor X nuclear abundance and interfere with glucose-regulated hepatic metabolism. Summary These studies have revealed unique mechanisms by which specific polyunsaturated fatty acids control peroxisome proliferator activated receptor α, sterol regulatory element binding protein-1 and carbohydrate regulatory element binding protein/Max-like factor X function. As such, specific metabolic and signal transduction pathways contribute significantly to the fatty acid regulation of these transcription factors and their corresponding regulatory networks. PMID:18460914
Xu, Zheng; Zhang, Guosheng; Duan, Qing; Chai, Shengjie; Zhang, Baqun; Wu, Cong; Jin, Fulai; Yue, Feng; Li, Yun; Hu, Ming
2016-03-11
Genome-wide association studies (GWAS) have identified thousands of genetic variants associated with complex traits and diseases. However, most of them are located in the non-protein coding regions, and therefore it is challenging to hypothesize the functions of these non-coding GWAS variants. Recent large efforts such as the ENCODE and Roadmap Epigenomics projects have predicted a large number of regulatory elements. However, the target genes of these regulatory elements remain largely unknown. Chromatin conformation capture based technologies such as Hi-C can directly measure the chromatin interactions and have generated an increasingly comprehensive catalog of the interactome between the distal regulatory elements and their potential target genes. Leveraging such information revealed by Hi-C holds the promise of elucidating the functions of genetic variants in human diseases. In this work, we present HiView, the first integrative genome browser to leverage Hi-C results for the interpretation of GWAS variants. HiView is able to display Hi-C data and statistical evidence for chromatin interactions in genomic regions surrounding any given GWAS variant, enabling straightforward visualization and interpretation. We believe that as the first GWAS variants-centered Hi-C genome browser, HiView is a useful tool guiding post-GWAS functional genomics studies. HiView is freely accessible at: http://www.unc.edu/~yunmli/HiView .
A HLA class I cis-regulatory element whose activity can be modulated by hormones.
Sim, B C; Hui, K M
1994-12-01
To elucidate the basis of the down-regulation in major histocompatibility complex (MHC) class I gene expression and to identify possible DNA-binding regulatory elements that have the potential to interact with class I MHC genes, we have studied the transcriptional regulation of class I HLA genes in human breast carcinoma cells. A 9 base pair (bp) negative cis-regulatory element (NRE) has been identified using band-shift assays employing DNA sequences derived from the 5'-flanking region of HLA class I genes. This 9-bp element, GTCATGGCG, located within exon I of the HLA class I gene, can potently inhibit the expression of a heterologous thymidine kinase (TK) gene promoter and the HLA enhancer element. Furthermore, this regulatory element can exert its suppressive function in either the sense or anti-sense orientation. More interestingly, NRE can suppress dexamethasone-mediated gene activation in the context of the reported glucocorticoid-responsive element (GRE) in MCF-7 cells but has no influence on the estrogen-mediated transcriptional activation of MCF-7 cells in the context of the reported estrogen-responsive element (ERE). Furthermore, the presence of such a regulatory element within the HLA class I gene whose activity can be modulated by hormones correlates well with our observation that the level of HLA class I gene expression can be down-regulated by hormones in human breast carcinoma cells. Such interactions between negative regulatory elements and specific hormone trans-activators are novel and suggest a versatile form of transcriptional control.
Brody, Thomas; Yavatkar, Amarendra S; Kuzin, Alexander; Kundu, Mukta; Tyson, Leonard J; Ross, Jermaine; Lin, Tzu-Yang; Lee, Chi-Hon; Awasaki, Takeshi; Lee, Tzumin; Odenwald, Ward F
2012-01-01
Background: Phylogenetic footprinting has revealed that cis-regulatory enhancers consist of conserved DNA sequence clusters (CSCs). Currently, there is no systematic approach for enhancer discovery and analysis that takes full-advantage of the sequence information within enhancer CSCs. Results: We have generated a Drosophila genome-wide database of conserved DNA consisting of >100,000 CSCs derived from EvoPrints spanning over 90% of the genome. cis-Decoder database search and alignment algorithms enable the discovery of functionally related enhancers. The program first identifies conserved repeat elements within an input enhancer and then searches the database for CSCs that score highly against the input CSC. Scoring is based on shared repeats as well as uniquely shared matches, and includes measures of the balance of shared elements, a diagnostic that has proven to be useful in predicting cis-regulatory function. To demonstrate the utility of these tools, a temporally-restricted CNS neuroblast enhancer was used to identify other functionally related enhancers and analyze their structural organization. Conclusions: cis-Decoder reveals that co-regulating enhancers consist of combinations of overlapping shared sequence elements, providing insights into the mode of integration of multiple regulating transcription factors. The database and accompanying algorithms should prove useful in the discovery and analysis of enhancers involved in any developmental process. Developmental Dynamics 241:169–189, 2012. © 2011 Wiley Periodicals, Inc. Key findings A genome-wide catalog of Drosophila conserved DNA sequence clusters. cis-Decoder discovers functionally related enhancers. Functionally related enhancers share balanced sequence element copy numbers. Many enhancers function during multiple phases of development. PMID:22174086
Dong, S-S; Guo, Y; Zhu, D-L; Chen, X-F; Wu, X-M; Shen, H; Chen, X-D; Tan, L-J; Tian, Q; Deng, H-W; Yang, T-L
2016-07-01
With ENCODE epigenomic data and results from published genome-wide association studies (GWASs), we aimed to find regulatory signatures of obesity genes and discover novel susceptibility genes. Obesity genes were obtained from public GWAS databases and their promoters were annotated based on the regulatory element information. Significantly enriched or depleted epigenomic elements in the promoters of obesity genes were evaluated and all human genes were then prioritized according to the existence of the selected elements to predict new candidate genes. Top-ranked genes were subsequently applied to validate their associations with obesity-related traits in three independent in-house GWAS samples. We identified RAD21 and EZH2 as over-represented, and STAT2 (signal transducer and activator of transcription 2) and IRF3 (interferon regulatory transcription factor 3) as depleted transcription factors. Histone modification of H3K9me3 and chromatin state segmentation of 'poised promoter' and 'repressed' were over-represented. All genes were prioritized and we selected the top five genes for validation at the population level. Combining results from the three GWAS samples, rs7522101 in ESRRG (estrogen-related receptor-γ) remained significantly associated with body mass index after multiple testing corrections (P=7.25 × 10(-5)). It was also associated with β-cell function (P=1.99 × 10(-3)) and fasting glucose level (P<0.05) in the meta-analyses of glucose and insulin-related traits consortium (MAGIC) data set.Cnoclusions:In summary, we identified epigenomic characteristics for obesity genes and suggested ESRRG as a novel obesity-susceptibility gene.
RSAT: regulatory sequence analysis tools.
Thomas-Chollier, Morgane; Sand, Olivier; Turatsinze, Jean-Valéry; Janky, Rekin's; Defrance, Matthieu; Vervisch, Eric; Brohée, Sylvain; van Helden, Jacques
2008-07-01
The regulatory sequence analysis tools (RSAT, http://rsat.ulb.ac.be/rsat/) is a software suite that integrates a wide collection of modular tools for the detection of cis-regulatory elements in genome sequences. The suite includes programs for sequence retrieval, pattern discovery, phylogenetic footprint detection, pattern matching, genome scanning and feature map drawing. Random controls can be performed with random gene selections or by generating random sequences according to a variety of background models (Bernoulli, Markov). Beyond the original word-based pattern-discovery tools (oligo-analysis and dyad-analysis), we recently added a battery of tools for matrix-based detection of cis-acting elements, with some original features (adaptive background models, Markov-chain estimation of P-values) that do not exist in other matrix-based scanning tools. The web server offers an intuitive interface, where each program can be accessed either separately or connected to the other tools. In addition, the tools are now available as web services, enabling their integration in programmatic workflows. Genomes are regularly updated from various genome repositories (NCBI and EnsEMBL) and 682 organisms are currently supported. Since 1998, the tools have been used by several hundreds of researchers from all over the world. Several predictions made with RSAT were validated experimentally and published.
Uncovering drug-responsive regulatory elements
Luizon, Marcelo R; Ahituv, Nadav
2015-01-01
Nucleotide changes in gene regulatory elements can have a major effect on interindividual differences in drug response. For example, by reviewing all published pharmacogenomic genome-wide association studies, we show here that 96.4% of the associated single nucleotide polymorphisms reside in noncoding regions. We discuss how sequencing technologies are improving our ability to identify drug response-associated regulatory elements genome-wide and to annotate nucleotide variants within them. We highlight specific examples of how nucleotide changes in these elements can affect drug response and illustrate the techniques used to find them and functionally characterize them. Finally, we also discuss challenges in the field of drug-responsive regulatory elements that need to be considered in order to translate these findings into the clinic. PMID:26555224
Levy, Nitzan; Tatomer, Dierdre; Herber, Candice B.; Zhao, Xiaoyue; Tang, Hui; Sargeant, Toby; Ball, Lonnele J.; Summers, Jonathan; Speed, Terence P.; Leitman, Dale C.
2008-01-01
Estrogen receptors (ERs) regulate gene transcription by interacting with regulatory elements. Most information regarding how ER activates genes has come from studies using a small set of target genes or simple consensus sequences such as estrogen response element, activator protein 1, and Sp1 elements. However, these elements cannot explain the differences in gene regulation patterns and clinical effects observed with estradiol (E2) and selective estrogen receptor modulators. To obtain a greater understanding of how E2 and selective estrogen receptor modulators differentially regulate genes, it is necessary to investigate their action on a more comprehensive set of native regulatory elements derived from ER target genes. Here we used chromatin immunoprecipitation-cloning and sequencing to isolate 173 regulatory elements associated with ERα. Most elements were found in the introns (38%) and regions greater than 10 kb upstream of the transcription initiation site (38%); 24% of the elements were found in the proximal promoter region (<10 kb). Only 11% of the elements contained a classical estrogen response element; 23% of the elements did not have any known response elements, including one derived from the naked cuticle homolog gene, which was associated with the recruitment of p160 coactivators. Transfection studies found that 80% of the 173 elements were regulated by E2, raloxifene, or tamoxifen with ERα or ERβ. Tamoxifen was more effective than raloxifene at activating the elements with ERα, whereas raloxifene was superior with ERβ. Our findings demonstrate that E2, tamoxifen, and raloxifene differentially regulate native ER-regulatory elements isolated by chromatin immunoprecipitation with ERα and ERβ. PMID:17962382
Biswas, Ambarish; Brown, Chris M
2014-06-08
Gene expression in vertebrate cells may be controlled post-transcriptionally through regulatory elements in mRNAs. These are usually located in the untranslated regions (UTRs) of mRNA sequences, particularly the 3'UTRs. Scan for Motifs (SFM) simplifies the process of identifying a wide range of regulatory elements on alignments of vertebrate 3'UTRs. SFM includes identification of both RNA Binding Protein (RBP) sites and targets of miRNAs. In addition to searching pre-computed alignments, the tool provides users the flexibility to search their own sequences or alignments. The regulatory elements may be filtered by expected value cutoffs and are cross-referenced back to their respective sources and literature. The output is an interactive graphical representation, highlighting potential regulatory elements and overlaps between them. The output also provides simple statistics and links to related resources for complementary analyses. The overall process is intuitive and fast. As SFM is a free web-application, the user does not need to install any software or databases. Visualisation of the binding sites of different classes of effectors that bind to 3'UTRs will facilitate the study of regulatory elements in 3' UTRs.
Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures
Stark, Alexander; Lin, Michael F.; Kheradpour, Pouya; Pedersen, Jakob S.; Parts, Leopold; Carlson, Joseph W.; Crosby, Madeline A.; Rasmussen, Matthew D.; Roy, Sushmita; Deoras, Ameya N.; Ruby, J. Graham; Brennecke, Julius; Hodges, Emily; Hinrichs, Angie S.; Caspi, Anat; Paten, Benedict; Park, Seung-Won; Han, Mira V.; Maeder, Morgan L.; Polansky, Benjamin J.; Robson, Bryanne E.; Aerts, Stein; van Helden, Jacques; Hassan, Bassem; Gilbert, Donald G.; Eastman, Deborah A.; Rice, Michael; Weir, Michael; Hahn, Matthew W.; Park, Yongkyu; Dewey, Colin N.; Pachter, Lior; Kent, W. James; Haussler, David; Lai, Eric C.; Bartel, David P.; Hannon, Gregory J.; Kaufman, Thomas C.; Eisen, Michael B.; Clark, Andrew G.; Smith, Douglas; Celniker, Susan E.; Gelbart, William M.; Kellis, Manolis
2008-01-01
Sequencing of multiple related species followed by comparative genomics analysis constitutes a powerful approach for the systematic understanding of any genome. Here, we use the genomes of 12 Drosophila species for the de novo discovery of functional elements in the fly. Each type of functional element shows characteristic patterns of change, or ‘evolutionary signatures’, dictated by its precise selective constraints. Such signatures enable recognition of new protein-coding genes and exons, spurious and incorrect gene annotations, and numerous unusual gene structures, including abundant stop-codon readthrough. Similarly, we predict non-protein-coding RNA genes and structures, and new microRNA (miRNA) genes. We provide evidence of miRNA processing and functionality from both hairpin arms and both DNA strands. We identify several classes of pre- and post-transcriptional regulatory motifs, and predict individual motif instances with high confidence. We also study how discovery power scales with the divergence and number of species compared, and we provide general guidelines for comparative studies. PMID:17994088
Ross, Christian; Shen, Qingxi J
2006-09-01
Abscisic acid (ABA) is one of the central plant hormones, responsible for controlling both maturation and germination in seeds, as well as mediating adaptive responses to desiccation, injury, and pathogen infection in vegetative tissues. Thorough analyses of two barley genes, HVA1 and HVA22, indicate that their response to ABA relies on the interaction of two cis-acting elements in their promoters, an ABA response element (ABRE) and a coupling element (CE). Together, they form an ABA response promoter complex (ABRC). Comparison of promoters of barley HVA1 and it rice orthologue indicates that the structures and sequences of their ABRCs are highly similar. Prediction of ABA responsive genes in the rice genome is then tractable to a bioinformatics approach based on the structures of the well-defined barley ABRCs. Here we describe a model developed based on the consensus, inter-element spacing and orientations of experimentally determined ABREs and CEs. Our search of the rice promoter database for promoters that fit the model has generated a partial list of genes in rice that have a high likelihood of being involved in the ABA signaling network. The ABA inducibility of some of the rice genes identified was validated with quantitative reverse transcription PCR (QPCR). By limiting our input data to known enhancer modules and experimentally derived rules, we have generated a high confidence subset of ABA-regulated genes. The results suggest that the pathways by which cereals respond to biotic and abiotic stresses overlap significantly, and that regulation is not confined to the level transcription. The large fraction of putative regulatory genes carrying HVA1-like enhancer modules in their promoters suggests the ABA signal enters at multiple points into a complex regulatory network that remains largely unmapped.
Deciphering the transcriptional cis-regulatory code.
Yáñez-Cuna, J Omar; Kvon, Evgeny Z; Stark, Alexander
2013-01-01
Information about developmental gene expression resides in defined regulatory elements, called enhancers, in the non-coding part of the genome. Although cells reliably utilize enhancers to orchestrate gene expression, a cis-regulatory code that would allow their interpretation has remained one of the greatest challenges of modern biology. In this review, we summarize studies from the past three decades that describe progress towards revealing the properties of enhancers and discuss how recent approaches are providing unprecedented insights into regulatory elements in animal genomes. Over the next years, we believe that the functional characterization of regulatory sequences in entire genomes, combined with recent computational methods, will provide a comprehensive view of genomic regulatory elements and their building blocks and will enable researchers to begin to understand the sequence basis of the cis-regulatory code. Copyright © 2012 Elsevier Ltd. All rights reserved.
Optimized mixed Markov models for motif identification
Huang, Weichun; Umbach, David M; Ohler, Uwe; Li, Leping
2006-01-01
Background Identifying functional elements, such as transcriptional factor binding sites, is a fundamental step in reconstructing gene regulatory networks and remains a challenging issue, largely due to limited availability of training samples. Results We introduce a novel and flexible model, the Optimized Mixture Markov model (OMiMa), and related methods to allow adjustment of model complexity for different motifs. In comparison with other leading methods, OMiMa can incorporate more than the NNSplice's pairwise dependencies; OMiMa avoids model over-fitting better than the Permuted Variable Length Markov Model (PVLMM); and OMiMa requires smaller training samples than the Maximum Entropy Model (MEM). Testing on both simulated and actual data (regulatory cis-elements and splice sites), we found OMiMa's performance superior to the other leading methods in terms of prediction accuracy, required size of training data or computational time. Our OMiMa system, to our knowledge, is the only motif finding tool that incorporates automatic selection of the best model. OMiMa is freely available at [1]. Conclusion Our optimized mixture of Markov models represents an alternative to the existing methods for modeling dependent structures within a biological motif. Our model is conceptually simple and effective, and can improve prediction accuracy and/or computational speed over other leading methods. PMID:16749929
Schroeder, Mark D.; Greer, Christina; Gaul, Ulrike
2011-01-01
The generation of metameric body plans is a key process in development. In Drosophila segmentation, periodicity is established rapidly through the complex transcriptional regulation of the pair-rule genes. The ‘primary’ pair-rule genes generate their 7-stripe expression through stripe-specific cis-regulatory elements controlled by the preceding non-periodic maternal and gap gene patterns, whereas ‘secondary’ pair-rule genes are thought to rely on 7-stripe elements that read off the already periodic primary pair-rule patterns. Using a combination of computational and experimental approaches, we have conducted a comprehensive systems-level examination of the regulatory architecture underlying pair-rule stripe formation. We find that runt (run), fushi tarazu (ftz) and odd skipped (odd) establish most of their pattern through stripe-specific elements, arguing for a reclassification of ftz and odd as primary pair-rule genes. In the case of run, we observe long-range cis-regulation across multiple intervening genes. The 7-stripe elements of run, ftz and odd are active concurrently with the stripe-specific elements, indicating that maternal/gap-mediated control and pair-rule gene cross-regulation are closely integrated. Stripe-specific elements fall into three distinct classes based on their principal repressive gap factor input; stripe positions along the gap gradients correlate with the strength of predicted input. The prevalence of cis-elements that generate two stripes and their genomic organization suggest that single-stripe elements arose by splitting and subfunctionalization of ancestral dual-stripe elements. Overall, our study provides a greatly improved understanding of how periodic patterns are established in the Drosophila embryo. PMID:21693522
Characterization of new regulatory elements within the Drosophila bithorax complex.
Pérez-Lluch, Sílvia; Cuartero, Sergi; Azorín, Fernando; Espinàs, M Lluïsa
2008-12-01
The homeotic Abdominal-B (Abd-B) gene expression depends on a modular cis-regulatory region divided into discrete functional domains (iab) that control the expression of the gene in a particular segment of the fly. These domains contain regulatory elements implicated in both initiation and maintenance of homeotic gene expression and elements that separate the different domains. In this paper we have performed an extensive analysis of the iab-6 regulatory region, which regulates Abd-B expression at abdominal segment A6 (PS11), and we have characterized two new polycomb response elements (PREs) within this domain. We report that PREs at Abd-B cis-regulatory domains present a particular chromatin structure which is nuclease accessible all along Drosophila development and both in active and repressed states. We also show that one of these regions contains a dCTCF and CP190 dependent activity in transgenic enhancer-blocking assays, suggesting that it corresponds to the Fab-6 boundary element of the Drosophila bithorax complex.
Interplay between DMD Point Mutations and Splicing Signals in Dystrophinopathy Phenotypes
Juan-Mateu, Jonàs; González-Quereda, Lidia; Rodríguez, Maria José; Verdura, Edgard; Lázaro, Kira; Jou, Cristina; Nascimento, Andrés; Jiménez-Mallebrera, Cecilia; Colomer, Jaume; Monges, Soledad; Lubieniecki, Fabiana; Foncuberta, Maria Eugenia; Pascual-Pascual, Samuel Ignacio; Molano, Jesús; Baiget, Montserrat; Gallano, Pia
2013-01-01
DMD nonsense and frameshift mutations lead to severe Duchenne muscular dystrophy while in-frame mutations lead to milder Becker muscular dystrophy. Exceptions are found in 10% of cases and the production of alternatively spliced transcripts is considered a key modifier of disease severity. Several exonic mutations have been shown to induce exon-skipping, while splice site mutations result in exon-skipping or activation of cryptic splice sites. However, factors determining the splicing pathway are still unclear. Point mutations provide valuable information regarding the regulation of pre-mRNA splicing and elements defining exon identity in the DMD gene. Here we provide a comprehensive analysis of 98 point mutations related to clinical phenotype and their effect on muscle mRNA and dystrophin expression. Aberrant splicing was found in 27 mutations due to alteration of splice sites or splicing regulatory elements. Bioinformatics analysis was performed to test the ability of the available algorithms to predict consequences on mRNA and to investigate the major factors that determine the splicing pathway in mutations affecting splicing signals. Our findings suggest that the splicing pathway is highly dependent on the interplay between splice site strength and density of regulatory elements. PMID:23536893
Baillie, J Kenneth; Bretherick, Andrew; Haley, Christopher S; Clohisey, Sara; Gray, Alan; Neyton, Lucile P A; Barrett, Jeffrey; Stahl, Eli A; Tenesa, Albert; Andersson, Robin; Brown, J Ben; Faulkner, Geoffrey J; Lizio, Marina; Schaefer, Ulf; Daub, Carsten; Itoh, Masayoshi; Kondo, Naoto; Lassmann, Timo; Kawai, Jun; Mole, Damian; Bajic, Vladimir B; Heutink, Peter; Rehli, Michael; Kawaji, Hideya; Sandelin, Albin; Suzuki, Harukazu; Satsangi, Jack; Wells, Christine A; Hacohen, Nir; Freeman, Thomas C; Hayashizaki, Yoshihide; Carninci, Piero; Forrest, Alistair R R; Hume, David A
2018-03-01
Genetic variants underlying complex traits, including disease susceptibility, are enriched within the transcriptional regulatory elements, promoters and enhancers. There is emerging evidence that regulatory elements associated with particular traits or diseases share similar patterns of transcriptional activity. Accordingly, shared transcriptional activity (coexpression) may help prioritise loci associated with a given trait, and help to identify underlying biological processes. Using cap analysis of gene expression (CAGE) profiles of promoter- and enhancer-derived RNAs across 1824 human samples, we have analysed coexpression of RNAs originating from trait-associated regulatory regions using a novel quantitative method (network density analysis; NDA). For most traits studied, phenotype-associated variants in regulatory regions were linked to tightly-coexpressed networks that are likely to share important functional characteristics. Coexpression provides a new signal, independent of phenotype association, to enable fine mapping of causative variants. The NDA coexpression approach identifies new genetic variants associated with specific traits, including an association between the regulation of the OCT1 cation transporter and genetic variants underlying circulating cholesterol levels. NDA strongly implicates particular cell types and tissues in disease pathogenesis. For example, distinct groupings of disease-associated regulatory regions implicate two distinct biological processes in the pathogenesis of ulcerative colitis; a further two separate processes are implicated in Crohn's disease. Thus, our functional analysis of genetic predisposition to disease defines new distinct disease endotypes. We predict that patients with a preponderance of susceptibility variants in each group are likely to respond differently to pharmacological therapy. Together, these findings enable a deeper biological understanding of the causal basis of complex traits.
Gray, Alan; Neyton, Lucile P. A.; Barrett, Jeffrey; Stahl, Eli A.; Tenesa, Albert; Andersson, Robin; Brown, J. Ben; Faulkner, Geoffrey J.; Lizio, Marina; Schaefer, Ulf; Daub, Carsten; Kondo, Naoto; Lassmann, Timo; Kawai, Jun; Kawaji, Hideya; Suzuki, Harukazu; Satsangi, Jack; Wells, Christine A.; Hacohen, Nir; Freeman, Thomas C.; Hayashizaki, Yoshihide; Forrest, Alistair R. R.; Hume, David A.
2018-01-01
Genetic variants underlying complex traits, including disease susceptibility, are enriched within the transcriptional regulatory elements, promoters and enhancers. There is emerging evidence that regulatory elements associated with particular traits or diseases share similar patterns of transcriptional activity. Accordingly, shared transcriptional activity (coexpression) may help prioritise loci associated with a given trait, and help to identify underlying biological processes. Using cap analysis of gene expression (CAGE) profiles of promoter- and enhancer-derived RNAs across 1824 human samples, we have analysed coexpression of RNAs originating from trait-associated regulatory regions using a novel quantitative method (network density analysis; NDA). For most traits studied, phenotype-associated variants in regulatory regions were linked to tightly-coexpressed networks that are likely to share important functional characteristics. Coexpression provides a new signal, independent of phenotype association, to enable fine mapping of causative variants. The NDA coexpression approach identifies new genetic variants associated with specific traits, including an association between the regulation of the OCT1 cation transporter and genetic variants underlying circulating cholesterol levels. NDA strongly implicates particular cell types and tissues in disease pathogenesis. For example, distinct groupings of disease-associated regulatory regions implicate two distinct biological processes in the pathogenesis of ulcerative colitis; a further two separate processes are implicated in Crohn’s disease. Thus, our functional analysis of genetic predisposition to disease defines new distinct disease endotypes. We predict that patients with a preponderance of susceptibility variants in each group are likely to respond differently to pharmacological therapy. Together, these findings enable a deeper biological understanding of the causal basis of complex traits. PMID:29494619
Regulatory network involving miRNAs and genes in serous ovarian carcinoma
Zhao, Haiyan; Xu, Hao; Xue, Luchen
2017-01-01
Serous ovarian carcinoma (SOC) is one of the most life-threatening types of gynecological malignancy, but the pathogenesis of SOC remains unknown. Previous studies have indicated that differentially expressed genes and microRNAs (miRNAs) serve important functions in SOC. However, genes and miRNAs are identified in a disperse form, and limited information is known about the regulatory association between miRNAs and genes in SOC. In the present study, three regulatory networks were hierarchically constructed, including a differentially-expressed network, a related network and a global network to reveal associations between each factor. In each network, there were three types of factors, which were genes, miRNAs and transcription factors that interact with each other. Focus was placed on the differentially-expressed network, in which all genes and miRNAs were differentially expressed and therefore may have affected the development of SOC. Following the comparison and analysis between the three networks, a number of signaling pathways which demonstrated differentially expressed elements were highlighted. Subsequently, the upstream and downstream elements of differentially expressed miRNAs and genes were listed, and a number of key elements (differentially expressed miRNAs, genes and TFs predicted using the P-match method) were analyzed. The differentially expressed network partially illuminated the pathogenesis of SOC. It was hypothesized that if there was no differential expression of miRNAs and genes, SOC may be prevented and treatment may be identified. The present study provided a theoretical foundation for gene therapy for SOC. PMID:29113276
Bucsenez, M; Rüping, B; Behrens, S; Twyman, R M; Noll, G A; Prüfer, D
2012-09-01
The sieve element occlusion (SEO) gene family includes several members that are expressed specifically in immature sieve elements (SEs) in the developing phloem of dicotyledonous plants. To determine how this restricted expression profile is achieved, we analysed the SE-specific Medicago truncatula SEO-F1 promoter (PMtSEO-F1) by constructing deletion, substitution and hybrid constructs and testing them in transgenic tobacco plants using green fluorescent protein as a reporter. This revealed four promoter regions, each containing cis-regulatory elements that activate transcription in SEs. One of these segments also contained sufficient information to suppress PMtSEO-F1 transcription in the phloem companion cells (CCs). Subsequent in silico analysis revealed several candidate cis-regulatory elements that PMtSEO-F1 shares with other SEO promoters. These putative sieve element boxes (PSE boxes) are promising candidates for cis-regulatory elements controlling the SE-specific expression of PMtSEO-F1. © 2012 German Botanical Society and The Royal Botanical Society of the Netherlands.
The role of heterologous chloroplast sequence elements in transgene integration and expression.
Ruhlman, Tracey; Verma, Dheeraj; Samson, Nalapalli; Daniell, Henry
2010-04-01
Heterologous regulatory elements and flanking sequences have been used in chloroplast transformation of several crop species, but their roles and mechanisms have not yet been investigated. Nucleotide sequence identity in the photosystem II protein D1 (psbA) upstream region is 59% across all taxa; similar variation was consistent across all genes and taxa examined. Secondary structure and predicted Gibbs free energy values of the psbA 5' untranslated region (UTR) among different families reflected this variation. Therefore, chloroplast transformation vectors were made for tobacco (Nicotiana tabacum) and lettuce (Lactuca sativa), with endogenous (Nt-Nt, Ls-Ls) or heterologous (Nt-Ls, Ls-Nt) psbA promoter, 5' UTR and 3' UTR, regulating expression of the anthrax protective antigen (PA) or human proinsulin (Pins) fused with the cholera toxin B-subunit (CTB). Unique lettuce flanking sequences were completely eliminated during homologous recombination in the transplastomic tobacco genomes but not unique tobacco sequences. Nt-Ls or Ls-Nt transplastomic lines showed reduction of 80% PA and 97% CTB-Pins expression when compared with endogenous psbA regulatory elements, which accumulated up to 29.6% total soluble protein PA and 72.0% total leaf protein CTB-Pins, 2-fold higher than Rubisco. Transgene transcripts were reduced by 84% in Ls-Nt-CTB-Pins and by 72% in Nt-Ls-PA lines. Transcripts containing endogenous 5' UTR were stabilized in nonpolysomal fractions. Stromal RNA-binding proteins were preferentially associated with endogenous psbA 5' UTR. A rapid and reproducible regeneration system was developed for lettuce commercial cultivars by optimizing plant growth regulators. These findings underscore the need for sequencing complete crop chloroplast genomes, utilization of endogenous regulatory elements and flanking sequences, as well as optimization of plant growth regulators for efficient chloroplast transformation.
Ruhlman, Tracey; Verma, Dheeraj; Samson, Nalapalli; Daniell, Henry
2010-01-01
Heterologous regulatory elements and flanking sequences have been used in chloroplast transformation of several crop species, but their roles and mechanisms have not yet been investigated. Nucleotide sequence identity in the photosystem II protein D1 (psbA) upstream region is 59% across all taxa; similar variation was consistent across all genes and taxa examined. Secondary structure and predicted Gibbs free energy values of the psbA 5′ untranslated region (UTR) among different families reflected this variation. Therefore, chloroplast transformation vectors were made for tobacco (Nicotiana tabacum) and lettuce (Lactuca sativa), with endogenous (Nt-Nt, Ls-Ls) or heterologous (Nt-Ls, Ls-Nt) psbA promoter, 5′ UTR and 3′ UTR, regulating expression of the anthrax protective antigen (PA) or human proinsulin (Pins) fused with the cholera toxin B-subunit (CTB). Unique lettuce flanking sequences were completely eliminated during homologous recombination in the transplastomic tobacco genomes but not unique tobacco sequences. Nt-Ls or Ls-Nt transplastomic lines showed reduction of 80% PA and 97% CTB-Pins expression when compared with endogenous psbA regulatory elements, which accumulated up to 29.6% total soluble protein PA and 72.0% total leaf protein CTB-Pins, 2-fold higher than Rubisco. Transgene transcripts were reduced by 84% in Ls-Nt-CTB-Pins and by 72% in Nt-Ls-PA lines. Transcripts containing endogenous 5′ UTR were stabilized in nonpolysomal fractions. Stromal RNA-binding proteins were preferentially associated with endogenous psbA 5′ UTR. A rapid and reproducible regeneration system was developed for lettuce commercial cultivars by optimizing plant growth regulators. These findings underscore the need for sequencing complete crop chloroplast genomes, utilization of endogenous regulatory elements and flanking sequences, as well as optimization of plant growth regulators for efficient chloroplast transformation. PMID:20130101
Regulatory activities of transposable elements: from conflicts to benefits
Chuong, Edward B.; Elde, Nels C.; Feschotte, Cédric
2017-01-01
Transposable elements (TEs) are a prolific source of tightly regulated, biochemically active non-coding elements, such as transcription factor binding sites and non-coding RNAs. A wealth of recent studies reinvigorates the idea that these elements are pervasively co-opted for the regulation of host genes. We argue that the inherent genetic properties of TEs and conflicting relationships with their hosts facilitate their recruitment for regulatory functions in diverse genomes. We review recent findings supporting the long-standing hypothesis that the waves of TE invasions endured by organisms for eons have catalyzed the evolution of gene regulatory networks. We also discuss the challenges of dissecting and interpreting the phenotypic impact of regulatory activities encoded by TEs in health and disease. PMID:27867194
Vischi Winck, Flavia; Arvidsson, Samuel; Riaño-Pachón, Diego Mauricio; Hempel, Sabrina; Koseska, Aneta; Nikoloski, Zoran; Urbina Gomez, David Alejandro; Rupprecht, Jens; Mueller-Roeber, Bernd
2013-01-01
The unicellular green alga Chlamydomonas reinhardtii is a long-established model organism for studies on photosynthesis and carbon metabolism-related physiology. Under conditions of air-level carbon dioxide concentration [CO2], a carbon concentrating mechanism (CCM) is induced to facilitate cellular carbon uptake. CCM increases the availability of carbon dioxide at the site of cellular carbon fixation. To improve our understanding of the transcriptional control of the CCM, we employed FAIRE-seq (formaldehyde-assisted Isolation of Regulatory Elements, followed by deep sequencing) to determine nucleosome-depleted chromatin regions of algal cells subjected to carbon deprivation. Our FAIRE data recapitulated the positions of known regulatory elements in the promoter of the periplasmic carbonic anhydrase (Cah1) gene, which is upregulated during CCM induction, and revealed new candidate regulatory elements at a genome-wide scale. In addition, time series expression patterns of 130 transcription factor (TF) and transcription regulator (TR) genes were obtained for cells cultured under photoautotrophic condition and subjected to a shift from high to low [CO2]. Groups of co-expressed genes were identified and a putative directed gene-regulatory network underlying the CCM was reconstructed from the gene expression data using the recently developed IOTA (inner composition alignment) method. Among the candidate regulatory genes, two members of the MYB-related TF family, Lcr1 (Low-CO 2 response regulator 1) and Lcr2 (Low-CO 2 response regulator 2), may play an important role in down-regulating the expression of a particular set of TF and TR genes in response to low [CO2]. The results obtained provide new insights into the transcriptional control of the CCM and revealed more than 60 new candidate regulatory genes. Deep sequencing of nucleosome-depleted genomic regions indicated the presence of new, previously unknown regulatory elements in the C. reinhardtii genome. Our work can serve as a basis for future functional studies of transcriptional regulator genes and genomic regulatory elements in Chlamydomonas. PMID:24224019
Identification of functional elements and regulatory circuits by Drosophila modENCODE
DOE Office of Scientific and Technical Information (OSTI.GOV)
Roy, Sushmita; Ernst, Jason; Kharchenko, Peter V.
2010-12-22
To gain insight into how genomic information is translated into cellular and developmental programs, the Drosophila model organism Encyclopedia of DNA Elements (modENCODE) project is comprehensively mapping transcripts, histone modifications, chromosomal proteins, transcription factors, replication proteins and intermediates, and nucleosome properties across a developmental time course and in multiple cell lines. We have generated more than 700 data sets and discovered protein-coding, noncoding, RNA regulatory, replication, and chromatin elements, more than tripling the annotated portion of the Drosophila genome. Correlated activity patterns of these elements reveal a functional regulatory network, which predicts putative new functions for genes, reveals stage- andmore » tissue-specific regulators, and enables gene-expression prediction. Our results provide a foundation for directed experimental and computational studies in Drosophila and related species and also a model for systematic data integration toward comprehensive genomic and functional annotation. Several years after the complete genetic sequencing of many species, it is still unclear how to translate genomic information into a functional map of cellular and developmental programs. The Encyclopedia of DNA Elements (ENCODE) (1) and model organism ENCODE (modENCODE) (2) projects use diverse genomic assays to comprehensively annotate the Homo sapiens (human), Drosophila melanogaster (fruit fly), and Caenorhabditis elegans (worm) genomes, through systematic generation and computational integration of functional genomic data sets. Previous genomic studies in flies have made seminal contributions to our understanding of basic biological mechanisms and genome functions, facilitated by genetic, experimental, computational, and manual annotation of the euchromatic and heterochromatic genome (3), small genome size, short life cycle, and a deep knowledge of development, gene function, and chromosome biology. The functions of {approx}40% of the protein and nonprotein-coding genes [FlyBase 5.12 (4)] have been determined from cDNA collections (5, 6), manual curation of gene models (7), gene mutations and comprehensive genome-wide RNA interference screens (8-10), and comparative genomic analyses (11, 12). The Drosophila modENCODE project has generated more than 700 data sets that profile transcripts, histone modifications and physical nucleosome properties, general and specific transcription factors (TFs), and replication programs in cell lines, isolated tissues, and whole organisms across several developmental stages (Fig. 1). Here, we computationally integrate these data sets and report (i) improved and additional genome annotations, including full-length proteincoding genes and peptides as short as 21 amino acids; (ii) noncoding transcripts, including 132 candidate structural RNAs and 1608 nonstructural transcripts; (iii) additional Argonaute (Ago)-associated small RNA genes and pathways, including new microRNAs (miRNAs) encoded within protein-coding exons and endogenous small interfering RNAs (siRNAs) from 3-inch untranslated regions; (iv) chromatin 'states' defined by combinatorial patterns of 18 chromatin marks that are associated with distinct functions and properties; (v) regions of high TF occupancy and replication activity with likely epigenetic regulation; (vi)mixed TF and miRNA regulatory networks with hierarchical structure and enriched feed-forward loops; (vii) coexpression- and co-regulation-based functional annotations for nearly 3000 genes; (viii) stage- and tissue-specific regulators; and (ix) predictive models of gene expression levels and regulator function.« less
TrawlerWeb: an online de novo motif discovery tool for next-generation sequencing datasets.
Dang, Louis T; Tondl, Markus; Chiu, Man Ho H; Revote, Jerico; Paten, Benedict; Tano, Vincent; Tokolyi, Alex; Besse, Florence; Quaife-Ryan, Greg; Cumming, Helen; Drvodelic, Mark J; Eichenlaub, Michael P; Hallab, Jeannette C; Stolper, Julian S; Rossello, Fernando J; Bogoyevitch, Marie A; Jans, David A; Nim, Hieu T; Porrello, Enzo R; Hudson, James E; Ramialison, Mirana
2018-04-05
A strong focus of the post-genomic era is mining of the non-coding regulatory genome in order to unravel the function of regulatory elements that coordinate gene expression (Nat 489:57-74, 2012; Nat 507:462-70, 2014; Nat 507:455-61, 2014; Nat 518:317-30, 2015). Whole-genome approaches based on next-generation sequencing (NGS) have provided insight into the genomic location of regulatory elements throughout different cell types, organs and organisms. These technologies are now widespread and commonly used in laboratories from various fields of research. This highlights the need for fast and user-friendly software tools dedicated to extracting cis-regulatory information contained in these regulatory regions; for instance transcription factor binding site (TFBS) composition. Ideally, such tools should not require prior programming knowledge to ensure they are accessible for all users. We present TrawlerWeb, a web-based version of the Trawler_standalone tool (Nat Methods 4:563-5, 2007; Nat Protoc 5:323-34, 2010), to allow for the identification of enriched motifs in DNA sequences obtained from next-generation sequencing experiments in order to predict their TFBS composition. TrawlerWeb is designed for online queries with standard options common to web-based motif discovery tools. In addition, TrawlerWeb provides three unique new features: 1) TrawlerWeb allows the input of BED files directly generated from NGS experiments, 2) it automatically generates an input-matched biologically relevant background, and 3) it displays resulting conservation scores for each instance of the motif found in the input sequences, which assists the researcher in prioritising the motifs to validate experimentally. Finally, to date, this web-based version of Trawler_standalone remains the fastest online de novo motif discovery tool compared to other popular web-based software, while generating predictions with high accuracy. TrawlerWeb provides users with a fast, simple and easy-to-use web interface for de novo motif discovery. This will assist in rapidly analysing NGS datasets that are now being routinely generated. TrawlerWeb is freely available and accessible at: http://trawler.erc.monash.edu.au .
Organization of cis-acting regulatory elements in osmotic- and cold-stress-responsive promoters.
Yamaguchi-Shinozaki, Kazuko; Shinozaki, Kazuo
2005-02-01
cis-Acting regulatory elements are important molecular switches involved in the transcriptional regulation of a dynamic network of gene activities controlling various biological processes, including abiotic stress responses, hormone responses and developmental processes. In particular, understanding regulatory gene networks in stress response cascades depends on successful functional analyses of cis-acting elements. The ever-improving accuracy of transcriptome expression profiling has led to the identification of various combinations of cis-acting elements in the promoter regions of stress-inducible genes involved in stress and hormone responses. Here we discuss major cis-acting elements, such as the ABA-responsive element (ABRE) and the dehydration-responsive element/C-repeat (DRE/CRT), that are a vital part of ABA-dependent and ABA-independent gene expression in osmotic and cold stress responses.
Kang, Shin-Young; Kim, Yeon-Gu; Kang, Seunghee; Lee, Hong Weon; Lee, Eun Gyo
2016-05-01
Vectors flanked by regulatory DNA elements have been used to generate stable cell lines with high productivity and transgene stability; however, regulatory elements in Chinese hamster ovary (CHO) cells, which are the most widely used mammalian cells in biopharmaceutical production, are still poorly understood. We isolated a novel gene regulatory element from CHO-K1 cells, designated E77, which was found to enhance the stable expression of a transgene. A genomic library was constructed by combining CHO-K1 genomic DNA fragments with a CMV promoter-driven GFP expression vector, and the E77 element was isolated by screening. The incorporation of the E77 regulatory element resulted in the generation of an increased number of clones with high expression, thereby enhancing the expression level of the transgene in the stable transfectant cell pool. Interestingly, the E77 element was found to consist of two distinct fragments derived from different locations in the CHO genome shotgun sequence. High and stable transgene expression was obtained in transfected CHO cells by combining these fragments. Additionally, the function of E77 was found to be dependent on its site of insertion and specific orientation in the vector construct. Our findings demonstrate that stable gene expression mediated by the CMV promoter in CHO cells may be improved by the isolated novel gene regulatory element E77 identified in the present study. © 2016 The Authors. Biotechnology Journal published by WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Song, Lingyun; Zhang, Zhancheng; Grasfeder, Linda L.; Boyle, Alan P.; Giresi, Paul G.; Lee, Bum-Kyu; Sheffield, Nathan C.; Gräf, Stefan; Huss, Mikael; Keefe, Damian; Liu, Zheng; London, Darin; McDaniell, Ryan M.; Shibata, Yoichiro; Showers, Kimberly A.; Simon, Jeremy M.; Vales, Teresa; Wang, Tianyuan; Winter, Deborah; Zhang, Zhuzhu; Clarke, Neil D.; Birney, Ewan; Iyer, Vishwanath R.; Crawford, Gregory E.; Lieb, Jason D.; Furey, Terrence S.
2011-01-01
The human body contains thousands of unique cell types, each with specialized functions. Cell identity is governed in large part by gene transcription programs, which are determined by regulatory elements encoded in DNA. To identify regulatory elements active in seven cell lines representative of diverse human cell types, we used DNase-seq and FAIRE-seq (Formaldehyde Assisted Isolation of Regulatory Elements) to map “open chromatin.” Over 870,000 DNaseI or FAIRE sites, which correspond tightly to nucleosome-depleted regions, were identified across the seven cell lines, covering nearly 9% of the genome. The combination of DNaseI and FAIRE is more effective than either assay alone in identifying likely regulatory elements, as judged by coincidence with transcription factor binding locations determined in the same cells. Open chromatin common to all seven cell types tended to be at or near transcription start sites and to be coincident with CTCF binding sites, while open chromatin sites found in only one cell type were typically located away from transcription start sites and contained DNA motifs recognized by regulators of cell-type identity. We show that open chromatin regions bound by CTCF are potent insulators. We identified clusters of open regulatory elements (COREs) that were physically near each other and whose appearance was coordinated among one or more cell types. Gene expression and RNA Pol II binding data support the hypothesis that COREs control gene activity required for the maintenance of cell-type identity. This publicly available atlas of regulatory elements may prove valuable in identifying noncoding DNA sequence variants that are causally linked to human disease. PMID:21750106
TEMPLE: analysing population genetic variation at transcription factor binding sites.
Litovchenko, Maria; Laurent, Stefan
2016-11-01
Genetic variation occurring at the level of regulatory sequences can affect phenotypes and fitness in natural populations. This variation can be analysed in a population genetic framework to study how genetic drift and selection affect the evolution of these functional elements. However, doing this requires a good understanding of the location and nature of regulatory regions and has long been a major hurdle. The current proliferation of genomewide profiling experiments of transcription factor occupancies greatly improves our ability to identify genomic regions involved in specific DNA-protein interactions. Although software exists for predicting transcription factor binding sites (TFBS), and the effects of genetic variants on TFBS specificity, there are no tools currently available for inferring this information jointly with the genetic variation at TFBS in natural populations. We developed the software Transcription Elements Mapping at the Population LEvel (TEMPLE), which predicts TFBS, evaluates the effects of genetic variants on TFBS specificity and summarizes the genetic variation occurring at TFBS in intraspecific sequence alignments. We demonstrate that TEMPLE's TFBS prediction algorithms gives identical results to PATSER, a software distribution commonly used in the field. We also illustrate the unique features of TEMPLE by analysing TFBS diversity for the TF Senseless (SENS) in one ancestral and one cosmopolitan population of the fruit fly Drosophila melanogaster. TEMPLE can be used to localize TFBS that are characterized by strong genetic differentiation across natural populations. This will be particularly useful for studies aiming to identify adaptive mutations. TEMPLE is a java-based cross-platform software that easily maps the genetic diversity at predicted TFBSs using a graphical interface, or from the Unix command line. © 2016 John Wiley & Sons Ltd.
An internal regulatory element controls troponin I gene expression
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yutzey, K.E.; Kline, R.L.; Konieczmy, S.F.
1989-04-01
During skeletal myogenesis, approximately 20 contractile proteins and related gene products temporally accumulate as the cells fuse to form multinucleated muscle fibers. In most instances, the contractile protein genes are regulated transcriptionally, which suggests that a common molecular mechanism may coordinate the expression of this diverse and evolutionarily unrelated gene set. Recent studies have examined the muscle-specific cis-acting elements associated with numerous contractile protein genes. All of the identified regulatory elements are positioned in the 5'-flanking regions, usually within 1,500 base pairs of the transcription start site. Surprisingly, a DNA consensus sequence that is common to each contractile protein genemore » has not been identified. In contrast to the results of these earlier studies, the authors have found that the 5'-flanking region of the quail troponin I (TnI) gene is not sufficient to permit the normal myofiber transcriptional activation of the gene. Instead, the TnI gene utilizes a unique internal regulatory element that is responsible for the correct myofiber-specific expression pattern associated with the TnI gene. This is the first example in which a contractile protein gene has been shown to rely primarily on an internal regulatory element to elicit transcriptional activation during myogenesis. The diversity of regulatory elements associated with the contractile protein genes suggests that the temporal expression of the genes may involve individual cis-trans regulatory components specific for each gene.« less
An internal regulatory element controls troponin I gene expression.
Yutzey, K E; Kline, R L; Konieczny, S F
1989-01-01
During skeletal myogenesis, approximately 20 contractile proteins and related gene products temporally accumulate as the cells fuse to form multinucleated muscle fibers. In most instances, the contractile protein genes are regulated transcriptionally, which suggests that a common molecular mechanism may coordinate the expression of this diverse and evolutionarily unrelated gene set. Recent studies have examined the muscle-specific cis-acting elements associated with numerous contractile protein genes. All of the identified regulatory elements are positioned in the 5'-flanking regions, usually within 1,500 base pairs of the transcription start site. Surprisingly, a DNA consensus sequence that is common to each contractile protein gene has not been identified. In contrast to the results of these earlier studies, we have found that the 5'-flanking region of the quail troponin I (TnI) gene is not sufficient to permit the normal myofiber transcriptional activation of the gene. Instead, the TnI gene utilizes a unique internal regulatory element that is responsible for the correct myofiber-specific expression pattern associated with the TnI gene. This is the first example in which a contractile protein gene has been shown to rely primarily on an internal regulatory element to elicit transcriptional activation during myogenesis. The diversity of regulatory elements associated with the contractile protein genes suggests that the temporal expression of the genes may involve individual cis-trans regulatory components specific for each gene. Images PMID:2725509
Mapping cis-Regulatory Domains in the Human Genome UsingMulti-Species Conservation of Synteny
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ahituv, Nadav; Prabhakar, Shyam; Poulin, Francis
2005-06-13
Our inability to associate distant regulatory elements with the genes that they regulate has largely precluded their examination for sequence alterations contributing to human disease. One major obstacle is the large genomic space surrounding targeted genes in which such elements could potentially reside. In order to delineate gene regulatory boundaries we used whole-genome human-mouse-chicken (HMC) and human-mouse-frog (HMF) multiple alignments to compile conserved blocks of synteny (CBS), under the hypothesis that these blocks have been kept intact throughout evolution at least in part by the requirement of regulatory elements to stay linked to the genes that they regulate. A totalmore » of 2,116 and 1,942 CBS>200 kb were assembled for HMC and HMF respectively, encompassing 1.53 and 0.86 Gb of human sequence. To support the existence of complex long-range regulatory domains within these CBS we analyzed the prevalence and distribution of chromosomal aberrations leading to position effects (disruption of a genes regulatory environment), observing a clear bias not only for mapping onto CBS but also for longer CBS size. Our results provide a genome wide data set characterizing the regulatory domains of genes and the conserved regulatory elements within them.« less
Incorporating incorporating economic models into seasonal pool conservation planning
Freeman, Robert C.; Bell, Kathleen P.; Calhoun, Aram J.K.; Loftin, Cyndy
2012-01-01
Massachusetts, New Jersey, Connecticut, and Maine have adopted regulatory zones around seasonal (vernal) pools to conserve terrestrial habitat for pool-breeding amphibians. Most amphibians require access to distinct seasonal habitats in both terrestrial and aquatic ecosystems because of their complex life histories. These habitat requirements make them particularly vulnerable to land uses that destroy habitat or limit connectivity (or permeability) among habitats. Regulatory efforts focusing on breeding pools without consideration of terrestrial habitat needs will not ensure the persistence of pool-breeding amphibians. We used GIS to combine a discrete-choice, parcel-scale economic model of land conversion with a landscape permeability model based on known habitat requirements of wood frogs (Lithobates sylvaticus) in Maine (USA) to examine permeability among habitat elements for alternative future scenarios. The economic model predicts future landscapes under different subdivision open space and vernal pool regulatory requirements. Our model showed that even “no build” permit zones extending 76 m (250 ft) outward from the pool edge were insufficient to assure permeability among required habitat elements. Furthermore, effectiveness of permit zones may be inconsistent due to interactions with other growth management policies, highlighting the need for local and state planning for the long-term persistence of pool-breeding amphibians in developing landscapes.
Hoballa, Mohamad Hussein; Soltani, Bahram M; Mowla, Seyed Javad; Sheikhpour, Mojgan; Kay, Maryam
2018-07-01
Frequent abnormalities in 7p12 locus in different tumors like lung cancer candidate this region for novel regulatory elements. MiRNAs as novel regulatory elements encoded within the human genome are potentially oncomiRs or miR suppressors. Here, we have used bioinformatics tools to search for the novel miRNAs embedded within human chromosome 7p12. A bona fide stem loop (named mirZa precursor) had the features of producing a real miRNA (named miRZa) which was detected through RT-qPCR following the overexpression of its precursor. Then, endogenous miRZa was detected in human cell lines and tissues and sequenced. Consistent to the bioinformatics prediction, RT-qPCR as well as dual luciferase assay indicated that SMAD3 and IGF1R genes were targeted by miRZa. MiRZa-3p and miRZa-5p were downregulated in lung tumor tissue samples detected by RT-qPCR, and mirZa precursor overexpression in SW480 cells resulted in increased sub-G1 cell population. Overall, here we introduced a novel miRNA which is capable of targeting SMAD3 and IGF1R regulatory genes and increases the cell population in sub-G1 stage.
Ribo-attenuators: novel elements for reliable and modular riboswitch engineering.
Folliard, Thomas; Mertins, Barbara; Steel, Harrison; Prescott, Thomas P; Newport, Thomas; Jones, Christopher W; Wadhams, George; Bayer, Travis; Armitage, Judith P; Papachristodoulou, Antonis; Rothschild, Lynn J
2017-07-04
Riboswitches are structural genetic regulatory elements that directly couple the sensing of small molecules to gene expression. They have considerable potential for applications throughout synthetic biology and bio-manufacturing as they are able to sense a wide range of small molecules and regulate gene expression in response. Despite over a decade of research they have yet to reach this considerable potential as they cannot yet be treated as modular components. This is due to several limitations including sensitivity to changes in genetic context, low tunability, and variability in performance. To overcome the associated difficulties with riboswitches, we have designed and introduced a novel genetic element called a ribo-attenuator in Bacteria. This genetic element allows for predictable tuning, insulation from contextual changes, and a reduction in expression variation. Ribo-attenuators allow riboswitches to be treated as truly modular and tunable components, thus increasing their reliability for a wide range of applications.
Shafiee, Mohamad N; Mongan, Nigel; Seedhouse, Claire; Chapman, Caroline; Deen, Suha; Abu, Jafaru; Atiomo, William
2017-05-01
Women with polycystic ovary syndrome have a three-fold higher risk of endometrial cancer. Insulin resistance and hyperlipidemia may be pertinent factors in the pathogenesis of both conditions. The aim of this study was to investigate endometrial sterol regulatory element binding protein-1 gene expression in polycystic ovary syndrome and endometrial cancer endometrium, and to correlate endometrial sterol regulatory element binding protein-1 gene expression with serum lipid profiles. A cross-sectional study was performed at Nottingham University Hospital, UK. A total of 102 women (polycystic ovary syndrome, endometrial cancer and controls; 34 participants in each group) were recruited. Clinical and biochemical assessments were performed before endometrial biopsies were obtained from all participants. Taqman real-time polymerase chain reaction for endometrial sterol regulatory element binding protein-1 gene and its systemic protein expression were analyzed. The body mass indices of women with polycystic ovary syndrome (29.28 ± 2.91 kg/m 2 ) and controls (28.58 ± 2.62 kg/m 2 ) were not significantly different. Women with endometrial cancer had a higher mean body mass index (32.22 ± 5.70 kg/m 2 ). Sterol regulatory element binding protein-1 gene expression was significantly increased in polycystic ovary syndrome and endometrial cancer endometrium compared with controls (p < 0.0001). Sterol regulatory element binding protein-1 gene expression was positively correlated with body mass index (r = 0.017, p = 0.921) and waist-hip ratio (r = 0.023, p = 0.544) in polycystic ovary syndrome, but this was not statistically significant. Similarly, statistically insignificant positive correlations were found between endometrial sterol regulatory element binding protein-1 gene expression and body mass index in endometrial cancer (r = 0.643, p = 0.06) and waist-hip ratio (r = 0.096, p = 0.073). Sterol regulatory element binding protein-1 gene expression was significantly positively correlated with triglyceride in both polycystic ovary syndrome and endometrial cancer (p = 0.028 and p = 0.027, respectively). Quantitative serum sterol regulatory element binding protein-1 gene correlated with endometrial gene expression (p < 0.05). Sterol regulatory element binding protein-1 gene expression is significantly increased in the endometrium of women with polycystic ovary syndrome and women with endometrial cancer compared with controls and positively correlates with serum triglyceride in both polycystic ovary syndrome and endometrial cancer. © 2017 Nordic Federation of Societies of Obstetrics and Gynecology.
Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters
DOE Office of Scientific and Technical Information (OSTI.GOV)
Santini, Simona; Boore, Jeffrey L.; Meyer, Axel
2003-12-31
Due to their high degree of conservation, comparisons of DNA sequences among evolutionarily distantly-related genomes permit to identify functional regions in noncoding DNA. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involvedmore » in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either A or A clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.« less
Mechanisms and Evolution of Control Logic in Prokaryotic Transcriptional Regulation
van Hijum, Sacha A. F. T.; Medema, Marnix H.; Kuipers, Oscar P.
2009-01-01
Summary: A major part of organismal complexity and versatility of prokaryotes resides in their ability to fine-tune gene expression to adequately respond to internal and external stimuli. Evolution has been very innovative in creating intricate mechanisms by which different regulatory signals operate and interact at promoters to drive gene expression. The regulation of target gene expression by transcription factors (TFs) is governed by control logic brought about by the interaction of regulators with TF binding sites (TFBSs) in cis-regulatory regions. A factor that in large part determines the strength of the response of a target to a given TF is motif stringency, the extent to which the TFBS fits the optimal TFBS sequence for a given TF. Advances in high-throughput technologies and computational genomics allow reconstruction of transcriptional regulatory networks in silico. To optimize the prediction of transcriptional regulatory networks, i.e., to separate direct regulation from indirect regulation, a thorough understanding of the control logic underlying the regulation of gene expression is required. This review summarizes the state of the art of the elements that determine the functionality of TFBSs by focusing on the molecular biological mechanisms and evolutionary origins of cis-regulatory regions. PMID:19721087
NASA Technical Reports Server (NTRS)
Wan, B.; Moreadith, R. W.; Blomqvist, C. G. (Principal Investigator)
1995-01-01
In order to investigate the mechanism(s) governing the striated muscle-specific expression of cytochrome c oxidase VIaH we have characterized the murine gene and analyzed its transcriptional regulatory elements in skeletal myogenic cell lines. The gene is single copy, spans 689 base pairs (bp), and is comprised of three exons. The 5'-ends of transcripts from the gene are heterogeneous, but the most abundant transcript includes a 5'-untranslated region of 30 nucleotides. When fused to the luciferase reporter gene, the 3.5-kilobase 5'-flanking region of the gene directed the expression of the heterologous protein selectively in differentiated Sol8 cells and transgenic mice, recapitulating the pattern of expression of the endogenous gene. Deletion analysis identified a 300-bp fragment sufficient to direct the myotube-specific expression of luciferase in Sol8 cells. The region lacks an apparent TATA element, and sequence motifs predicted to bind NRF-1, NRF-2, ox-box, or PPAR factors known to regulate other nuclear genes encoding mitochondrial proteins are not evident. Mutational analysis, however, identified two cis-elements necessary for the high level expression of the reporter protein: a MEF2 consensus element at -90 to -81 bp and an E-box element at -147 to -142 bp. Additional E-box motifs at closely located positions were mutated without loss of transcriptional activity. The dependence of transcriptional activation of cytochrome c oxidase VIaH on cis-elements similar to those found in contractile protein genes suggests that the striated muscle-specific expression is coregulated by mechanisms that control the lineage-specific expression of several contractile and cytosolic proteins.
Brachyury, Foxa2 and the cis-Regulatory Origins of the Notochord
José-Edwards, Diana S.; Oda-Ishii, Izumi; Kugler, Jamie E.; Passamaneck, Yale J.; Katikala, Lavanya; Nibu, Yutaka; Di Gregorio, Anna
2015-01-01
A main challenge of modern biology is to understand how specific constellations of genes are activated to differentiate cells and give rise to distinct tissues. This study focuses on elucidating how gene expression is initiated in the notochord, an axial structure that provides support and patterning signals to embryos of humans and all other chordates. Although numerous notochord genes have been identified, the regulatory DNAs that orchestrate development and propel evolution of this structure by eliciting notochord gene expression remain mostly uncharted, and the information on their configuration and recurrence is still quite fragmentary. Here we used the simple chordate Ciona for a systematic analysis of notochord cis-regulatory modules (CRMs), and investigated their composition, architectural constraints, predictive ability and evolutionary conservation. We found that most Ciona notochord CRMs relied upon variable combinations of binding sites for the transcription factors Brachyury and/or Foxa2, which can act either synergistically or independently from one another. Notably, one of these CRMs contains a Brachyury binding site juxtaposed to an (AC) microsatellite, an unusual arrangement also found in Brachyury-bound regulatory regions in mouse. In contrast, different subsets of CRMs relied upon binding sites for transcription factors of widely diverse families. Surprisingly, we found that neither intra-genomic nor interspecific conservation of binding sites were reliably predictive hallmarks of notochord CRMs. We propose that rather than obeying a rigid sequence-based cis-regulatory code, most notochord CRMs are rather unique. Yet, this study uncovered essential elements recurrently used by divergent chordates as basic building blocks for notochord CRMs. PMID:26684323
Brachyury, Foxa2 and the cis-Regulatory Origins of the Notochord.
José-Edwards, Diana S; Oda-Ishii, Izumi; Kugler, Jamie E; Passamaneck, Yale J; Katikala, Lavanya; Nibu, Yutaka; Di Gregorio, Anna
2015-12-01
A main challenge of modern biology is to understand how specific constellations of genes are activated to differentiate cells and give rise to distinct tissues. This study focuses on elucidating how gene expression is initiated in the notochord, an axial structure that provides support and patterning signals to embryos of humans and all other chordates. Although numerous notochord genes have been identified, the regulatory DNAs that orchestrate development and propel evolution of this structure by eliciting notochord gene expression remain mostly uncharted, and the information on their configuration and recurrence is still quite fragmentary. Here we used the simple chordate Ciona for a systematic analysis of notochord cis-regulatory modules (CRMs), and investigated their composition, architectural constraints, predictive ability and evolutionary conservation. We found that most Ciona notochord CRMs relied upon variable combinations of binding sites for the transcription factors Brachyury and/or Foxa2, which can act either synergistically or independently from one another. Notably, one of these CRMs contains a Brachyury binding site juxtaposed to an (AC) microsatellite, an unusual arrangement also found in Brachyury-bound regulatory regions in mouse. In contrast, different subsets of CRMs relied upon binding sites for transcription factors of widely diverse families. Surprisingly, we found that neither intra-genomic nor interspecific conservation of binding sites were reliably predictive hallmarks of notochord CRMs. We propose that rather than obeying a rigid sequence-based cis-regulatory code, most notochord CRMs are rather unique. Yet, this study uncovered essential elements recurrently used by divergent chordates as basic building blocks for notochord CRMs.
Ronsseray, S.; Lehmann, M.; Nouaud, D.; Anxolabehere, D.
1996-01-01
Genetic recombination was used in Drosophila melanogaster to isolate P elements, inserted at the telomeres of X chromosomes (cytological site 1A) from natural populations, in a genetic background devoid of other P elements. We show that complete maternally inherited P repression in the germline (P cytotype) can be elicited by only two autonomous P elements at 1A and that a single element at this site has partial regulatory properties. The analysis of the surrounding chromosomal regions of the P elements at 1A shows that in all cases these elements are flanked by Telomeric Associated Sequences, tandemly repetitive noncoding sequences that have properties of heterochromatin. In addition, we show that the regulatory properties of P elements at 1A can be inhibited by some of the mutant alleles of the Su(var)205 gene and by a deficiency of this gene. However, the regulatory properties of reference P strains (Harwich and Texas 007) are not impaired by Su(var)205 mutations. Su(var)205 encodes Heterochromatin Protein 1 (HP1). These results suggest that the HP1 dosage effect on the P element properties is site-dependent and could involve the structure of the chromatin. PMID:8844154
Two distinct auto-regulatory loops operate at the PU.1 locus in B cells and myeloid cells
Leddin, Mathias; Perrod, Chiara; Hoogenkamp, Maarten; Ghani, Saeed; Assi, Salam; Heinz, Sven; Wilson, Nicola K.; Follows, George; Schönheit, Jörg; Vockentanz, Lena; Mosammam, Ali M.; Chen, Wei; Tenen, Daniel G.; Westhead, David R.; Göttgens, Berthold
2011-01-01
The transcription factor PU.1 occupies a central role in controlling myeloid and early B-cell development, and its correct lineage-specific expression is critical for the differentiation choice of hematopoietic progenitors. However, little is known of how this tissue-specific pattern is established. We previously identified an upstream regulatory cis element whose targeted deletion in mice decreases PU.1 expression and causes leukemia. We show here that the upstream regulatory cis element alone is insufficient to confer physiologic PU.1 expression in mice but requires the cooperation with other, previously unidentified elements. Using a combination of transgenic studies, global chromatin assays, and detailed molecular analyses we present evidence that PU.1 is regulated by a novel mechanism involving cross talk between different cis elements together with lineage-restricted autoregulation. In this model, PU.1 regulates its expression in B cells and macrophages by differentially associating with cell type–specific transcription factors at one of its cis-regulatory elements to establish differential activity patterns at other elements. PMID:21239694
BiRen: predicting enhancers with a deep-learning-based model using the DNA sequence alone.
Yang, Bite; Liu, Feng; Ren, Chao; Ouyang, Zhangyi; Xie, Ziwei; Bo, Xiaochen; Shu, Wenjie
2017-07-01
Enhancer elements are noncoding stretches of DNA that play key roles in controlling gene expression programmes. Despite major efforts to develop accurate enhancer prediction methods, identifying enhancer sequences continues to be a challenge in the annotation of mammalian genomes. One of the major issues is the lack of large, sufficiently comprehensive and experimentally validated enhancers for humans or other species. Thus, the development of computational methods based on limited experimentally validated enhancers and deciphering the transcriptional regulatory code encoded in the enhancer sequences is urgent. We present a deep-learning-based hybrid architecture, BiRen, which predicts enhancers using the DNA sequence alone. Our results demonstrate that BiRen can learn common enhancer patterns directly from the DNA sequence and exhibits superior accuracy, robustness and generalizability in enhancer prediction relative to other state-of-the-art enhancer predictors based on sequence characteristics. Our BiRen will enable researchers to acquire a deeper understanding of the regulatory code of enhancer sequences. Our BiRen method can be freely accessed at https://github.com/wenjiegroup/BiRen . shuwj@bmi.ac.cn or boxc@bmi.ac.cn. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
RegPrecise 3.0--a resource for genome-scale exploration of transcriptional regulation in bacteria.
Novichkov, Pavel S; Kazakov, Alexey E; Ravcheev, Dmitry A; Leyn, Semen A; Kovaleva, Galina Y; Sutormin, Roman A; Kazanov, Marat D; Riehl, William; Arkin, Adam P; Dubchak, Inna; Rodionov, Dmitry A
2013-11-01
Genome-scale prediction of gene regulation and reconstruction of transcriptional regulatory networks in prokaryotes is one of the critical tasks of modern genomics. Bacteria from different taxonomic groups, whose lifestyles and natural environments are substantially different, possess highly diverged transcriptional regulatory networks. The comparative genomics approaches are useful for in silico reconstruction of bacterial regulons and networks operated by both transcription factors (TFs) and RNA regulatory elements (riboswitches). RegPrecise (http://regprecise.lbl.gov) is a web resource for collection, visualization and analysis of transcriptional regulons reconstructed by comparative genomics. We significantly expanded a reference collection of manually curated regulons we introduced earlier. RegPrecise 3.0 provides access to inferred regulatory interactions organized by phylogenetic, structural and functional properties. Taxonomy-specific collections include 781 TF regulogs inferred in more than 160 genomes representing 14 taxonomic groups of Bacteria. TF-specific collections include regulogs for a selected subset of 40 TFs reconstructed across more than 30 taxonomic lineages. Novel collections of regulons operated by RNA regulatory elements (riboswitches) include near 400 regulogs inferred in 24 bacterial lineages. RegPrecise 3.0 provides four classifications of the reference regulons implemented as controlled vocabularies: 55 TF protein families; 43 RNA motif families; ~150 biological processes or metabolic pathways; and ~200 effectors or environmental signals. Genome-wide visualization of regulatory networks and metabolic pathways covered by the reference regulons are available for all studied genomes. A separate section of RegPrecise 3.0 contains draft regulatory networks in 640 genomes obtained by an conservative propagation of the reference regulons to closely related genomes. RegPrecise 3.0 gives access to the transcriptional regulons reconstructed in bacterial genomes. Analytical capabilities include exploration of: regulon content, structure and function; TF binding site motifs; conservation and variations in genome-wide regulatory networks across all taxonomic groups of Bacteria. RegPrecise 3.0 was selected as a core resource on transcriptional regulation of the Department of Energy Systems Biology Knowledgebase, an emerging software and data environment designed to enable researchers to collaboratively generate, test and share new hypotheses about gene and protein functions, perform large-scale analyses, and model interactions in microbes, plants, and their communities.
Qiu, Zhengkun; Li, Ren; Zhang, Shuaibin; Wang, Ketao; Xu, Meng; Li, Jiayang; Du, Yongchen; Yu, Hong; Cui, Xia
2016-08-01
Development and ripening of tomato fruit are precisely controlled by transcriptional regulation, which depends on the orchestrated accessibility of regulatory proteins to promoters and other cis-regulatory DNA elements. This accessibility and its effect on gene expression play a major role in defining the developmental process. To understand the regulatory mechanism and functional elements modulating morphological and anatomical changes during fruit development, we generated genome-wide high-resolution maps of DNase I hypersensitive sites (DHSs) from the fruit tissues of the tomato cultivar "Moneymaker" at 20 days post anthesis as well as break stage. By exploring variation of DHSs across fruit development stages, we pinpointed the most likely hypersensitive sites related to development-specific genes. By detecting binding motifs on DHSs of these development-specific genes or genes in the ascorbic acid biosynthetic pathway, we revealed the common regulatory elements contributing to coordinating gene transcription of plant ripening and specialized metabolic pathways. Our results contribute to a better understanding of the regulatory dynamics of genes involved in tomato fruit development and ripening. Copyright © 2016 The Author. Published by Elsevier Inc. All rights reserved.
Karnik, Rahul; Beer, Michael A.
2015-01-01
The generation of genomic binding or accessibility data from massively parallel sequencing technologies such as ChIP-seq and DNase-seq continues to accelerate. Yet state-of-the-art computational approaches for the identification of DNA binding motifs often yield motifs of weak predictive power. Here we present a novel computational algorithm called MotifSpec, designed to find predictive motifs, in contrast to over-represented sequence elements. The key distinguishing feature of this algorithm is that it uses a dynamic search space and a learned threshold to find discriminative motifs in combination with the modeling of motifs using a full PWM (position weight matrix) rather than k-mer words or regular expressions. We demonstrate that our approach finds motifs corresponding to known binding specificities in several mammalian ChIP-seq datasets, and that our PWMs classify the ChIP-seq signals with accuracy comparable to, or marginally better than motifs from the best existing algorithms. In other datasets, our algorithm identifies novel motifs where other methods fail. Finally, we apply this algorithm to detect motifs from expression datasets in C. elegans using a dynamic expression similarity metric rather than fixed expression clusters, and find novel predictive motifs. PMID:26465884
Karnik, Rahul; Beer, Michael A
2015-01-01
The generation of genomic binding or accessibility data from massively parallel sequencing technologies such as ChIP-seq and DNase-seq continues to accelerate. Yet state-of-the-art computational approaches for the identification of DNA binding motifs often yield motifs of weak predictive power. Here we present a novel computational algorithm called MotifSpec, designed to find predictive motifs, in contrast to over-represented sequence elements. The key distinguishing feature of this algorithm is that it uses a dynamic search space and a learned threshold to find discriminative motifs in combination with the modeling of motifs using a full PWM (position weight matrix) rather than k-mer words or regular expressions. We demonstrate that our approach finds motifs corresponding to known binding specificities in several mammalian ChIP-seq datasets, and that our PWMs classify the ChIP-seq signals with accuracy comparable to, or marginally better than motifs from the best existing algorithms. In other datasets, our algorithm identifies novel motifs where other methods fail. Finally, we apply this algorithm to detect motifs from expression datasets in C. elegans using a dynamic expression similarity metric rather than fixed expression clusters, and find novel predictive motifs.
A new method for enhancer prediction based on deep belief network.
Bu, Hongda; Gan, Yanglan; Wang, Yang; Zhou, Shuigeng; Guan, Jihong
2017-10-16
Studies have shown that enhancers are significant regulatory elements to play crucial roles in gene expression regulation. Since enhancers are unrelated to the orientation and distance to their target genes, it is a challenging mission for scholars and researchers to accurately predicting distal enhancers. In the past years, with the high-throughout ChiP-seq technologies development, several computational techniques emerge to predict enhancers using epigenetic or genomic features. Nevertheless, the inconsistency of computational models across different cell-lines and the unsatisfactory prediction performance call for further research in this area. Here, we propose a new Deep Belief Network (DBN) based computational method for enhancer prediction, which is called EnhancerDBN. This method combines diverse features, composed of DNA sequence compositional features, DNA methylation and histone modifications. Our computational results indicate that 1) EnhancerDBN outperforms 13 existing methods in prediction, and 2) GC content and DNA methylation can serve as relevant features for enhancer prediction. Deep learning is effective in boosting the performance of enhancer prediction.
Hoermann, Astrid; Cicin-Sain, Damjan; Jaeger, Johannes
2016-03-15
Understanding eukaryotic transcriptional regulation and its role in development and pattern formation is one of the big challenges in biology today. Most attempts at tackling this problem either focus on the molecular details of transcription factor binding, or aim at genome-wide prediction of expression patterns from sequence through bioinformatics and mathematical modelling. Here we bridge the gap between these two complementary approaches by providing an integrative model of cis-regulatory elements governing the expression of the gap gene giant (gt) in the blastoderm embryo of Drosophila melanogaster. We use a reverse-engineering method, where mathematical models are fit to quantitative spatio-temporal reporter gene expression data to infer the regulatory mechanisms underlying gt expression in its anterior and posterior domains. These models are validated through prediction of gene expression in mutant backgrounds. A detailed analysis of our data and models reveals that gt is regulated by domain-specific CREs at early stages, while a late element drives expression in both the anterior and the posterior domains. Initial gt expression depends exclusively on inputs from maternal factors. Later, gap gene cross-repression and gt auto-activation become increasingly important. We show that auto-regulation creates a positive feedback, which mediates the transition from early to late stages of regulation. We confirm the existence and role of gt auto-activation through targeted mutagenesis of Gt transcription factor binding sites. In summary, our analysis provides a comprehensive picture of spatio-temporal gene regulation by different interacting enhancer elements for an important developmental regulator. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Global Organization of a Positive-strand RNA Virus Genome
Wu, Baodong; Grigull, Jörg; Ore, Moriam O.; Morin, Sylvie; White, K. Andrew
2013-01-01
The genomes of plus-strand RNA viruses contain many regulatory sequences and structures that direct different viral processes. The traditional view of these RNA elements are as local structures present in non-coding regions. However, this view is changing due to the discovery of regulatory elements in coding regions and functional long-range intra-genomic base pairing interactions. The ∼4.8 kb long RNA genome of the tombusvirus tomato bushy stunt virus (TBSV) contains these types of structural features, including six different functional long-distance interactions. We hypothesized that to achieve these multiple interactions this viral genome must utilize a large-scale organizational strategy and, accordingly, we sought to assess the global conformation of the entire TBSV genome. Atomic force micrographs of the genome indicated a mostly condensed structure composed of interconnected protrusions extending from a central hub. This configuration was consistent with the genomic secondary structure model generated using high-throughput selective 2′-hydroxyl acylation analysed by primer extension (i.e. SHAPE), which predicted different sized RNA domains originating from a central region. Known RNA elements were identified in both domain and inter-domain regions, and novel structural features were predicted and functionally confirmed. Interestingly, only two of the six long-range interactions known to form were present in the structural model. However, for those interactions that did not form, complementary partner sequences were positioned relatively close to each other in the structure, suggesting that the secondary structure level of viral genome structure could provide a basic scaffold for the formation of different long-range interactions. The higher-order structural model for the TBSV RNA genome provides a snapshot of the complex framework that allows multiple functional components to operate in concert within a confined context. PMID:23717202
Castresana, C; Garcia-Luque, I; Alonso, E; Malik, V S; Cashmore, A R
1988-01-01
We have analyzed promoter regulatory elements from a photoregulated CAB gene (Cab-E) isolated from Nicotiana plumbaginifolia. These studies have been performed by introducing chimeric gene constructs into tobacco cells via Agrobacterium tumefaciens-mediated transformation. Expression studies on the regenerated transgenic plants have allowed us to characterize three positive and one negative cis-acting elements that influence photoregulated expression of the Cab-E gene. Within the upstream sequences we have identified two positive regulatory elements (PRE1 and PRE2) which confer maximum levels of photoregulated expression. These sequences contain multiple repeated elements related to the sequence-ACCGGCCCACTT-. We have also identified within the upstream region a negative regulatory element (NRE) extremely rich in AT sequences, which reduces the level of gene expression in the light. We have defined a light regulatory element (LRE) within the promoter region extending from -396 to -186 bp which confers photoregulated expression when fused to a constitutive nopaline synthase ('nos') promoter. Within this region there is a 132-bp element, extending from -368 to -234 bp, which on deletion from the Cab-E promoter reduces gene expression from high levels to undetectable levels. Finally, we have demonstrated for a full length Cab-E promoter conferring high levels of photoregulated expression, that sequences proximal to the Cab-E TATA box are not replaceable by corresponding sequences from a 'nos' promoter. This contrasts with the apparent equivalence of these Cab-E and 'nos' TATA box-proximal sequences in truncated promoters conferring low levels of photoregulated expression. Images PMID:2901343
Yue, Jia-Xing; Kozmikova, Iryna; Ono, Hiroki; Nossa, Carlos W.; Kozmik, Zbynek; Putnam, Nicholas H.; Yu, Jr-Kai; Holland, Linda Z.
2016-01-01
Cephalochordates, the sister group of vertebrates + tunicates, are evolving particularly slowly. Therefore, genome comparisons between two congeners of Branchiostoma revealed so many conserved noncoding elements (CNEs), that it was not clear how many are functional regulatory elements. To more effectively identify CNEs with potential regulatory functions, we compared noncoding sequences of genomes of the most phylogenetically distant cephalochordate genera, Asymmetron and Branchiostoma, which diverged approximately 120–160 million years ago. We found 113,070 noncoding elements conserved between the two species, amounting to 3.3% of the genome. The genomic distribution, target gene ontology, and enriched motifs of these CNEs all suggest that many of them are probably cis-regulatory elements. More than 90% of previously verified amphioxus regulatory elements were re-captured in this study. A search of the cephalochordate CNEs around 50 developmental genes in several vertebrate genomes revealed eight CNEs conserved between cephalochordates and vertebrates, indicating sequence conservation over >500 million years of divergence. The function of five CNEs was tested in reporter assays in zebrafish, and one was also tested in amphioxus. All five CNEs proved to be tissue-specific enhancers. Taken together, these findings indicate that even though Branchiostoma and Asymmetron are distantly related, as they are evolving slowly, comparisons between them are likely optimal for identifying most of their tissue-specific cis-regulatory elements laying the foundation for functional characterizations and a better understanding of the evolution of developmental regulation in cephalochordates. PMID:27412606
Decoding the role of regulatory element polymorphisms in complex disease.
Vockley, Christopher M; Barrera, Alejandro; Reddy, Timothy E
2017-04-01
Genetic variation in gene regulatory elements contributes to diverse human diseases, ranging from rare and severe developmental defects to common and complex diseases such as obesity and diabetes. Early examples of regulatory mechanisms of human diseases involve large chromosomal rearrangements that change the regulatory connections within the genome. Single nucleotide variants in regulatory elements can also contribute to disease, potentially via demonstrated associations with changes in transcription factor binding, enhancer activity, post-translational histone modifications, long-range enhancer-promoter interactions, or RNA polymerase recruitment. Establishing causality between non-coding genetic variants, gene regulation, and disease has recently become more feasible with advances in genome-editing and epigenome-editing technologies. As establishing causal regulatory mechanisms of diseases becomes routine, functional annotation of target genes is likely to emerge as a major bottleneck for translation into patient benefits. In this review, we discuss the history and recent advances in understanding the regulatory mechanisms of human disease, and new challenges likely to be encountered once establishing those mechanisms becomes rote. Copyright © 2016 Elsevier Ltd. All rights reserved.
Feather Development Genes and Associated Regulatory Innovation Predate the Origin of Dinosauria
Lowe, Craig B.; Clarke, Julia A.; Baker, Allan J.; Haussler, David; Edwards, Scott V.
2015-01-01
The evolution of avian feathers has recently been illuminated by fossils and the identification of genes involved in feather patterning and morphogenesis. However, molecular studies have focused mainly on protein-coding genes. Using comparative genomics and more than 600,000 conserved regulatory elements, we show that patterns of genome evolution in the vicinity of feather genes are consistent with a major role for regulatory innovation in the evolution of feathers. Rates of innovation at feather regulatory elements exhibit an extended period of innovation with peaks in the ancestors of amniotes and archosaurs. We estimate that 86% of such regulatory elements and 100% of the nonkeratin feather gene set were present prior to the origin of Dinosauria. On the branch leading to modern birds, we detect a strong signal of regulatory innovation near insulin-like growth factor binding protein (IGFBP) 2 and IGFBP5, which have roles in body size reduction, and may represent a genomic signature for the miniaturization of dinosaurian body size preceding the origin of flight. PMID:25415961
Genome-wide colonization of gene regulatory elements by G4 DNA motifs
Du, Zhuo; Zhao, Yiqiang; Li, Ning
2009-01-01
G-quadruplex (or G4 DNA), a stable four-stranded structure found in guanine-rich regions, is implicated in the transcriptional regulation of genes involved in growth and development. Previous studies on the role of G4 DNA in gene regulation mostly focused on genomic regions proximal to transcription start sites (TSSs). To gain a more comprehensive understanding of the regulatory role of G4 DNA, we examined the landscape of potential G4 DNA (PG4Ms) motifs in the human genome and found that G4 motifs, not restricted to those found in the TSS-proximal regions, are bias toward gene-associated regions. Significantly, analyses of G4 motifs in seven types of well-known gene regulatory elements revealed a constitutive enrichment pattern and the clusters of G4 motifs tend to be colocalized with regulatory elements. Considering our analysis from a genome evolutionary perspective, we found evidence that the occurrence and accumulation of certain progenitors and canonical G4 DNA motifs within regulatory regions were progressively favored by natural selection. Our results suggest that G4 DNA motifs are ‘colonized’ in regulatory regions, supporting a likely genome-wide role of G4 DNA in gene regulation. We hypothesize that G4 DNA is a regulatory apparatus situated in regulatory elements, acting as a molecular switch that can modulate the role of the host functional regions, by transition in DNA structure. PMID:19759215
Hughes, David; Vincent-Jones, Peter
2008-12-01
Since devolution, the four countries of the United Kingdom have pursued strikingly different National Health Service (NHS) reforms. While England created a supply-side market more radical than the previous internal market system, Wales moved to a softer version of the purchaser/provider split emphasizing localism. This article deploys institutional theory to analyze the forces shaping change, and describes the hybrid forms of economic organization emerging, including the economic regulation model implemented in England. The schism that has resulted in separate NHS subsystems warrants a different analysis from the more familiar phenomenon of infield divergence. We argue that schism was triggered by political-regulatory influences rather than economic or other social institutional forces, and predict that other decentralized public health care systems may follow a similar path. While political-regulatory, normative, and cognitive institutional influences push in the same direction in Wales, the misalignment of political-regulatory and normative elements in England looks set to result in a period of organizational turbulence.
Cis-regulatory somatic mutations and gene-expression alteration in B-cell lymphomas.
Mathelier, Anthony; Lefebvre, Calvin; Zhang, Allen W; Arenillas, David J; Ding, Jiarui; Wasserman, Wyeth W; Shah, Sohrab P
2015-04-23
With the rapid increase of whole-genome sequencing of human cancers, an important opportunity to analyze and characterize somatic mutations lying within cis-regulatory regions has emerged. A focus on protein-coding regions to identify nonsense or missense mutations disruptive to protein structure and/or function has led to important insights; however, the impact on gene expression of mutations lying within cis-regulatory regions remains under-explored. We analyzed somatic mutations from 84 matched tumor-normal whole genomes from B-cell lymphomas with accompanying gene expression measurements to elucidate the extent to which these cancers are disrupted by cis-regulatory mutations. We characterize mutations overlapping a high quality set of well-annotated transcription factor binding sites (TFBSs), covering a similar portion of the genome as protein-coding exons. Our results indicate that cis-regulatory mutations overlapping predicted TFBSs are enriched in promoter regions of genes involved in apoptosis or growth/proliferation. By integrating gene expression data with mutation data, our computational approach culminates with identification of cis-regulatory mutations most likely to participate in dysregulation of the gene expression program. The impact can be measured along with protein-coding mutations to highlight key mutations disrupting gene expression and pathways in cancer. Our study yields specific genes with disrupted expression triggered by genomic mutations in either the coding or the regulatory space. It implies that mutated regulatory components of the genome contribute substantially to cancer pathways. Our analyses demonstrate that identifying genomically altered cis-regulatory elements coupled with analysis of gene expression data will augment biological interpretation of mutational landscapes of cancers.
Dong, Shan-Shan; Guo, Yan; Zhu, Dong-Li; Chen, Xiao-Feng; Wu, Xiao-Ming; Shen, Hui; Chen, Xiang-Ding; Tan, Li-Jun; Tian, Qing; Deng, Hong-Wen; Yang, Tie-Lin
2016-01-01
OBJECTIVES With ENCODE epigenomic data and results from published genome-wide association studies (GWASs), we aimed to find regulatory signatures of obesity genes and discover novel susceptibility genes. METHODS Obesity genes were obtained from public GWASs databases and their promoters were annotated based on the regulatory elements information. Significantly enriched or depleted epigenomic elements in the promoters of obesity genes were evaluated and all human genes were then prioritized according to the existence of the selected elements to predict new candidate genes. Top ranked genes were subsequently applied to validate their associations with obesity-related traits in three independent in-house GWASs samples. RESULTS We identified RAD21 and EZH2 as over-represented, STAT2 and IRF3 as depleted transcription factors. Histone modification of H3K9me3 and chromatin state segmentation of “poised promoter” and “repressed” were overrepresented. All genes were prioritized and we selected the top five genes for validation at population level. Combined results from the three GWASs samples, rs7522101 in ESRRG remained significantly associated with BMI after multiple testing corrections (P = 7.25 × 10−5). It was also associated with β-cell function (P = 1.99 × 10−3) and fasting glucose level (P < 0.05) in the meta-analyses of glucose and insulin-related traits consortium (MAGIC) dataset. CONCLUSIONS In summary, we identified epigenomic characteristics for obesity genes and suggested ESRRG as a novel obesity susceptibility gene. PMID:27113491
Jin, Erqing; Wong, Lynn; Jiao, Yun; Engel, Jake; Holdridge, Benjamin; Xu, Peng
2017-12-01
Engineering cell factories for producing biofuels and pharmaceuticals has spurred great interests to develop rapid and efficient synthetic biology tools customized for modular pathway engineering. Along the way, combinatorial gene expression control through modification of regulatory element offered tremendous opportunity for fine-tuning gene expression and generating digital-like genetic circuits. In this report, we present an efficient evolutionary approach to build a range of regulatory control elements. The reported method allows for rapid construction of promoter, 5'UTR, terminator and trans -activating RNA libraries. Synthetic overlapping oligos with high portion of degenerate nucleotides flanking the regulatory element could be efficiently assembled to a vector expressing fluorescence reporter. This approach combines high mutation rate of the synthetic DNA with the high assembly efficiency of Gibson Mix. Our constructed library demonstrates broad range of transcriptional or translational gene expression dynamics. Specifically, both the promoter library and 5'UTR library exhibits gene expression dynamics spanning across three order of magnitude. The terminator library and trans -activating RNA library displays relatively narrowed gene expression pattern. The reported study provides a versatile toolbox for rapidly constructing a large family of prokaryotic regulatory elements. These libraries also facilitate the implementation of combinatorial pathway engineering principles and the engineering of more efficient microbial cell factory for various biomanufacturing applications.
Unraveling transcriptional control and cis-regulatory codes using the software suite GeneACT
Cheung, Tom Hiu; Kwan, Yin Lam; Hamady, Micah; Liu, Xuedong
2006-01-01
Deciphering gene regulatory networks requires the systematic identification of functional cis-acting regulatory elements. We present a suite of web-based bioinformatics tools, called GeneACT , that can rapidly detect evolutionarily conserved transcription factor binding sites or microRNA target sites that are either unique or over-represented in differentially expressed genes from DNA microarray data. GeneACT provides graphic visualization and extraction of common regulatory sequence elements in the promoters and 3'-untranslated regions that are conserved across multiple mammalian species. PMID:17064417
ReNE: A Cytoscape Plugin for Regulatory Network Enhancement
Politano, Gianfranco; Benso, Alfredo; Savino, Alessandro; Di Carlo, Stefano
2014-01-01
One of the biggest challenges in the study of biological regulatory mechanisms is the integration, americanmodeling, and analysis of the complex interactions which take place in biological networks. Despite post transcriptional regulatory elements (i.e., miRNAs) are widely investigated in current research, their usage and visualization in biological networks is very limited. Regulatory networks are commonly limited to gene entities. To integrate networks with post transcriptional regulatory data, researchers are therefore forced to manually resort to specific third party databases. In this context, we introduce ReNE, a Cytoscape 3.x plugin designed to automatically enrich a standard gene-based regulatory network with more detailed transcriptional, post transcriptional, and translational data, resulting in an enhanced network that more precisely models the actual biological regulatory mechanisms. ReNE can automatically import a network layout from the Reactome or KEGG repositories, or work with custom pathways described using a standard OWL/XML data format that the Cytoscape import procedure accepts. Moreover, ReNE allows researchers to merge multiple pathways coming from different sources. The merged network structure is normalized to guarantee a consistent and uniform description of the network nodes and edges and to enrich all integrated data with additional annotations retrieved from genome-wide databases like NCBI, thus producing a pathway fully manageable through the Cytoscape environment. The normalized network is then analyzed to include missing transcription factors, miRNAs, and proteins. The resulting enhanced network is still a fully functional Cytoscape network where each regulatory element (transcription factor, miRNA, gene, protein) and regulatory mechanism (up-regulation/down-regulation) is clearly visually identifiable, thus enabling a better visual understanding of its role and the effect in the network behavior. The enhanced network produced by ReNE is exportable in multiple formats for further analysis via third party applications. ReNE can be freely installed from the Cytoscape App Store (http://apps.cytoscape.org/apps/rene) and the full source code is freely available for download through a SVN repository accessible at http://www.sysbio.polito.it/tools_svn/BioInformatics/Rene/releases/. ReNE enhances a network by only integrating data from public repositories, without any inference or prediction. The reliability of the introduced interactions only depends on the reliability of the source data, which is out of control of ReNe developers. PMID:25541727
Transcripts with in silico predicted RNA structure are enriched everywhere in the mouse brain
2012-01-01
Background Post-transcriptional control of gene expression is mostly conducted by specific elements in untranslated regions (UTRs) of mRNAs, in collaboration with specific binding proteins and RNAs. In several well characterized cases, these RNA elements are known to form stable secondary structures. RNA secondary structures also may have major functional implications for long noncoding RNAs (lncRNAs). Recent transcriptional data has indicated the importance of lncRNAs in brain development and function. However, no methodical efforts to investigate this have been undertaken. Here, we aim to systematically analyze the potential for RNA structure in brain-expressed transcripts. Results By comprehensive spatial expression analysis of the adult mouse in situ hybridization data of the Allen Mouse Brain Atlas, we show that transcripts (coding as well as non-coding) associated with in silico predicted structured probes are highly and significantly enriched in almost all analyzed brain regions. Functional implications of these RNA structures and their role in the brain are discussed in detail along with specific examples. We observe that mRNAs with a structure prediction in their UTRs are enriched for binding, transport and localization gene ontology categories. In addition, after manual examination we observe agreement between RNA binding protein interaction sites near the 3’ UTR structures and correlated expression patterns. Conclusions Our results show a potential use for RNA structures in expressed coding as well as noncoding transcripts in the adult mouse brain, and describe the role of structured RNAs in the context of intracellular signaling pathways and regulatory networks. Based on this data we hypothesize that RNA structure is widely involved in transcriptional and translational regulatory mechanisms in the brain and ultimately plays a role in brain function. PMID:22651826
MusTRD can regulate postnatal fiber-specific expression.
Issa, Laura L; Palmer, Stephen J; Guven, Kim L; Santucci, Nicole; Hodgson, Vanessa R M; Popovic, Kata; Joya, Josephine E; Hardeman, Edna C
2006-05-01
Human MusTRD1alpha1 was isolated as a result of its ability to bind a critical element within the Troponin I slow upstream enhancer (TnIslow USE) and was predicted to be a regulator of slow fiber-specific genes. To test this hypothesis in vivo, we generated transgenic mice expressing hMusTRD1alpha1 in skeletal muscle. Adult transgenic mice show a complete loss of slow fibers and a concomitant replacement by fast IIA fibers, resulting in postural muscle weakness. However, developmental analysis demonstrates that transgene expression has no impact on embryonic patterning of slow fibers but causes a gradual postnatal slow to fast fiber conversion. This conversion was underpinned by a demonstrable repression of many slow fiber-specific genes, whereas fast fiber-specific gene expression was either unchanged or enhanced. These data are consistent with our initial predictions for hMusTRD1alpha1 and suggest that slow fiber genes contain a specific common regulatory element that can be targeted by MusTRD proteins.
Status of VICTORIA: NRC peer review and recent code applications
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bixler, N.E.; Schaperow, J.H.
1997-12-01
VICTORIA is a mechanistic computer code designed to analyze fission product behavior within a nuclear reactor coolant system (RCS) during a severe accident. It provides detailed predictions of the release of radioactive and nonradioactive materials from the reactor core and transport and deposition of these materials within the RCS. A summary of the results and recommendations of an independent peer review of VICTORIA by the US Nuclear Regulatory Commission (NRC) is presented, along with recent applications of the code. The latter include analyses of a temperature-induced steam generator tube rupture sequence and post-test analyses of the Phebus FPT-1 test. Themore » next planned Phebus test, FTP-4, will focus on fission product releases from a rubble bed, especially those of the less-volatile elements, and on the speciation of the released elements. Pretest analyses using VICTORIA to estimate the magnitude and timing of releases are presented. The predicted release of uranium is a matter of particular importance because of concern about filter plugging during the test.« less
Insights into Structural and Mechanistic Features of Viral IRES Elements
Martinez-Salas, Encarnacion; Francisco-Velilla, Rosario; Fernandez-Chamorro, Javier; Embarek, Azman M.
2018-01-01
Internal ribosome entry site (IRES) elements are cis-acting RNA regions that promote internal initiation of protein synthesis using cap-independent mechanisms. However, distinct types of IRES elements present in the genome of various RNA viruses perform the same function despite lacking conservation of sequence and secondary RNA structure. Likewise, IRES elements differ in host factor requirement to recruit the ribosomal subunits. In spite of this diversity, evolutionarily conserved motifs in each family of RNA viruses preserve sequences impacting on RNA structure and RNA–protein interactions important for IRES activity. Indeed, IRES elements adopting remarkable different structural organizations contain RNA structural motifs that play an essential role in recruiting ribosomes, initiation factors and/or RNA-binding proteins using different mechanisms. Therefore, given that a universal IRES motif remains elusive, it is critical to understand how diverse structural motifs deliver functions relevant for IRES activity. This will be useful for understanding the molecular mechanisms beyond cap-independent translation, as well as the evolutionary history of these regulatory elements. Moreover, it could improve the accuracy to predict IRES-like motifs hidden in genome sequences. This review summarizes recent advances on the diversity and biological relevance of RNA structural motifs for viral IRES elements. PMID:29354113
Transcriptional regulation of mammalian selenoprotein expression
Stoytcheva, Zoia R.; Berry, Marla J.
2009-01-01
Background Selenoproteins contain the twenty-first amino acid, selenocysteine, and are involved in cellular defenses against oxidative damage, important metabolic and developmental pathways, and responses to environmental challenges. Elucidating the mechanisms regulating selenoprotein expression at the transcriptional level is key to understanding how these mechanisms are called into play to respond to the changing environment. Methods This review summarizes published studies on transcriptional regulation of selenoprotein genes, focused primarily on genes whose encoded protein functions are at least partially understood. This is followed by in silico analysis of predicted regulatory elements in selenoprotein genes, including those in the aforementioned category as well as the genes whose functions are not known. Results Our findings reveal regulatory pathways common to many selenoprotein genes, including several involved in stress-responses. In addition, tissue-specific regulatory factors are implicated in regulating many selenoprotein genes. Conclusions These studies provide new insights into how selenoprotein genes respond to environmental and other challenges, and the roles these proteins play in allowing cells to adapt to these changes. General Significance Elucidating the regulatory mechanisms affecting selenoprotein expression is essential for understanding their roles in human diseases, and for developing diagnostic and potential therapeutic approaches to address dysregulation of members of this gene family. PMID:19465084
Nikolić, Miloš; Papantonis, Argyris
2017-01-01
Abstract Genome-wide association studies (GWAS) have emerged as a powerful tool to uncover the genetic basis of human common diseases, which often show a complex, polygenic and multi-factorial aetiology. These studies have revealed that 70–90% of all single nucleotide polymorphisms (SNPs) associated with common complex diseases do not occur within genes (i.e. they are non-coding), making the discovery of disease-causative genetic variants and the elucidation of the underlying pathological mechanisms far from straightforward. Based on emerging evidences suggesting that disease-associated SNPs are frequently found within cell type-specific regulatory sequences, here we present GARLIC (GWAS-based Prediction Toolkit for Connecting Diseases and Cell Types), a user-friendly, multi-purpose software with an associated database and online viewer that, using global maps of cis-regulatory elements, can aetiologically connect human diseases with relevant cell types. Additionally, GARLIC can be used to retrieve potential disease-causative genetic variants overlapping regulatory sequences of interest. Overall, GARLIC can satisfy several important needs within the field of medical genetics, thus potentially assisting in the ultimate goal of uncovering the elusive and complex genetic basis of common human disorders. PMID:28007912
Feather development genes and associated regulatory innovation predate the origin of Dinosauria.
Lowe, Craig B; Clarke, Julia A; Baker, Allan J; Haussler, David; Edwards, Scott V
2015-01-01
The evolution of avian feathers has recently been illuminated by fossils and the identification of genes involved in feather patterning and morphogenesis. However, molecular studies have focused mainly on protein-coding genes. Using comparative genomics and more than 600,000 conserved regulatory elements, we show that patterns of genome evolution in the vicinity of feather genes are consistent with a major role for regulatory innovation in the evolution of feathers. Rates of innovation at feather regulatory elements exhibit an extended period of innovation with peaks in the ancestors of amniotes and archosaurs. We estimate that 86% of such regulatory elements and 100% of the nonkeratin feather gene set were present prior to the origin of Dinosauria. On the branch leading to modern birds, we detect a strong signal of regulatory innovation near insulin-like growth factor binding protein (IGFBP) 2 and IGFBP5, which have roles in body size reduction, and may represent a genomic signature for the miniaturization of dinosaurian body size preceding the origin of flight. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hong, R. L., Hamaguchi, L., Busch, M. A., and Weigel, D.
2003-06-01
OAK-B135 In Arabidopsis thaliana, cis-regulatory sequences of the floral homeotic gene AGAMOUS (AG) are located in the second intron. This 3 kb intron contains binding sites for two direct activators of AG, LEAFY (LFY) and WUSCHEL (WUS), along with other putative regulatory elements. We have used phylogenetic footprinting and the related technique of phylogenetic shadowing to identify putative cis-regulatory elements in this intron. Among 29 Brassicaceae, several other motifs, but not the LFY and WUS binding sites previously identified, are largely invariant. Using reporter gene analyses, we tested six of these motifs and found that they are all functionally importantmore » for activity of AG regulatory sequences in A. thaliana. Although there is little obvious sequence similarity outside the Brassicaceae, the intron from cucumber AG has at least partial activity in A. thaliana. Our studies underscore the value of the comparative approach as a tool that complements gene-by-gene promoter dissection, but also highlight that sequence-based studies alone are insufficient for a complete identification of cis-regulatory sites.« less
Ma, Meng; Ru, Ying; Chuang, Ling-Shiang; Hsu, Nai-Yun; Shi, Li-Song; Hakenberg, Jörg; Cheng, Wei-Yi; Uzilov, Andrew; Ding, Wei; Glicksberg, Benjamin S; Chen, Rong
2015-01-01
The invention of high throughput sequencing technologies has led to the discoveries of hundreds of thousands of genetic variants associated with thousands of human diseases. Many of these genetic variants are located outside the protein coding regions, and as such, it is challenging to interpret the function of these genetic variants by traditional genetic approaches. Recent genome-wide functional genomics studies, such as FANTOM5 and ENCODE have uncovered a large number of regulatory elements across hundreds of different tissues or cell lines in the human genome. These findings provide an opportunity to study the interaction between regulatory elements and disease-associated genetic variants. Identifying these diseased-related regulatory elements will shed light on understanding the mechanisms of how these variants regulate gene expression and ultimately result in disease formation and progression. In this study, we curated and categorized 27,558 Mendelian disease variants, 20,964 complex disease variants, 5,809 cancer predisposing germline variants, and 43,364 recurrent cancer somatic mutations. Compared against nine different types of regulatory regions from FANTOM5 and ENCODE projects, we found that different types of disease variants show distinctive propensity for particular regulatory elements. Mendelian disease variants and recurrent cancer somatic mutations are 22-fold and 10- fold significantly enriched in promoter regions respectively (q<0.001), compared with allele-frequency-matched genomic background. Separate from these two categories, cancer predisposing germline variants are 27-fold enriched in histone modification regions (q<0.001), 10-fold enriched in chromatin physical interaction regions (q<0.001), and 6-fold enriched in transcription promoters (q<0.001). Furthermore, Mendelian disease variants and recurrent cancer somatic mutations share very similar distribution across types of functional effects. We further found that regulatory regions are located within over 50% coding exon regions. Transcription promoters, methylation regions, and transcription insulators have the highest density of disease variants, with 472, 239, and 72 disease variants per one million base pairs, respectively. Disease-associated variants in different disease categories are preferentially located in particular regulatory elements. These results will be useful for an overall understanding about the differences among the pathogenic mechanisms of various disease-associated variants.
2015-01-01
Background The invention of high throughput sequencing technologies has led to the discoveries of hundreds of thousands of genetic variants associated with thousands of human diseases. Many of these genetic variants are located outside the protein coding regions, and as such, it is challenging to interpret the function of these genetic variants by traditional genetic approaches. Recent genome-wide functional genomics studies, such as FANTOM5 and ENCODE have uncovered a large number of regulatory elements across hundreds of different tissues or cell lines in the human genome. These findings provide an opportunity to study the interaction between regulatory elements and disease-associated genetic variants. Identifying these diseased-related regulatory elements will shed light on understanding the mechanisms of how these variants regulate gene expression and ultimately result in disease formation and progression. Results In this study, we curated and categorized 27,558 Mendelian disease variants, 20,964 complex disease variants, 5,809 cancer predisposing germline variants, and 43,364 recurrent cancer somatic mutations. Compared against nine different types of regulatory regions from FANTOM5 and ENCODE projects, we found that different types of disease variants show distinctive propensity for particular regulatory elements. Mendelian disease variants and recurrent cancer somatic mutations are 22-fold and 10- fold significantly enriched in promoter regions respectively (q<0.001), compared with allele-frequency-matched genomic background. Separate from these two categories, cancer predisposing germline variants are 27-fold enriched in histone modification regions (q<0.001), 10-fold enriched in chromatin physical interaction regions (q<0.001), and 6-fold enriched in transcription promoters (q<0.001). Furthermore, Mendelian disease variants and recurrent cancer somatic mutations share very similar distribution across types of functional effects. We further found that regulatory regions are located within over 50% coding exon regions. Transcription promoters, methylation regions, and transcription insulators have the highest density of disease variants, with 472, 239, and 72 disease variants per one million base pairs, respectively. Conclusions Disease-associated variants in different disease categories are preferentially located in particular regulatory elements. These results will be useful for an overall understanding about the differences among the pathogenic mechanisms of various disease-associated variants. PMID:26110593
kmer-SVM: a web server for identifying predictive regulatory sequence features in genomic data sets
Fletez-Brant, Christopher; Lee, Dongwon; McCallion, Andrew S.; Beer, Michael A.
2013-01-01
Massively parallel sequencing technologies have made the generation of genomic data sets a routine component of many biological investigations. For example, Chromatin immunoprecipitation followed by sequence assays detect genomic regions bound (directly or indirectly) by specific factors, and DNase-seq identifies regions of open chromatin. A major bottleneck in the interpretation of these data is the identification of the underlying DNA sequence code that defines, and ultimately facilitates prediction of, these transcription factor (TF) bound or open chromatin regions. We have recently developed a novel computational methodology, which uses a support vector machine (SVM) with kmer sequence features (kmer-SVM) to identify predictive combinations of short transcription factor-binding sites, which determine the tissue specificity of these genomic assays (Lee, Karchin and Beer, Discriminative prediction of mammalian enhancers from DNA sequence. Genome Res. 2011; 21:2167–80). This regulatory information can (i) give confidence in genomic experiments by recovering previously known binding sites, and (ii) reveal novel sequence features for subsequent experimental testing of cooperative mechanisms. Here, we describe the development and implementation of a web server to allow the broader research community to independently apply our kmer-SVM to analyze and interpret their genomic datasets. We analyze five recently published data sets and demonstrate how this tool identifies accessory factors and repressive sequence elements. kmer-SVM is available at http://kmersvm.beerlab.org. PMID:23771147
footprintDB: a database of transcription factors with annotated cis elements and binding interfaces.
Sebastian, Alvaro; Contreras-Moreira, Bruno
2014-01-15
Traditional and high-throughput techniques for determining transcription factor (TF) binding specificities are generating large volumes of data of uneven quality, which are scattered across individual databases. FootprintDB integrates some of the most comprehensive freely available libraries of curated DNA binding sites and systematically annotates the binding interfaces of the corresponding TFs. The first release contains 2422 unique TF sequences, 10 112 DNA binding sites and 3662 DNA motifs. A survey of the included data sources, organisms and TF families was performed together with proprietary database TRANSFAC, finding that footprintDB has a similar coverage of multicellular organisms, while also containing bacterial regulatory data. A search engine has been designed that drives the prediction of DNA motifs for input TFs, or conversely of TF sequences that might recognize input regulatory sequences, by comparison with database entries. Such predictions can also be extended to a single proteome chosen by the user, and results are ranked in terms of interface similarity. Benchmark experiments with bacterial, plant and human data were performed to measure the predictive power of footprintDB searches, which were able to correctly recover 10, 55 and 90% of the tested sequences, respectively. Correctly predicted TFs had a higher interface similarity than the average, confirming its diagnostic value. Web site implemented in PHP,Perl, MySQL and Apache. Freely available from http://floresta.eead.csic.es/footprintdb.
kmer-SVM: a web server for identifying predictive regulatory sequence features in genomic data sets.
Fletez-Brant, Christopher; Lee, Dongwon; McCallion, Andrew S; Beer, Michael A
2013-07-01
Massively parallel sequencing technologies have made the generation of genomic data sets a routine component of many biological investigations. For example, Chromatin immunoprecipitation followed by sequence assays detect genomic regions bound (directly or indirectly) by specific factors, and DNase-seq identifies regions of open chromatin. A major bottleneck in the interpretation of these data is the identification of the underlying DNA sequence code that defines, and ultimately facilitates prediction of, these transcription factor (TF) bound or open chromatin regions. We have recently developed a novel computational methodology, which uses a support vector machine (SVM) with kmer sequence features (kmer-SVM) to identify predictive combinations of short transcription factor-binding sites, which determine the tissue specificity of these genomic assays (Lee, Karchin and Beer, Discriminative prediction of mammalian enhancers from DNA sequence. Genome Res. 2011; 21:2167-80). This regulatory information can (i) give confidence in genomic experiments by recovering previously known binding sites, and (ii) reveal novel sequence features for subsequent experimental testing of cooperative mechanisms. Here, we describe the development and implementation of a web server to allow the broader research community to independently apply our kmer-SVM to analyze and interpret their genomic datasets. We analyze five recently published data sets and demonstrate how this tool identifies accessory factors and repressive sequence elements. kmer-SVM is available at http://kmersvm.beerlab.org.
Moorthy, Sakthi D.; Davidson, Scott; Shchuka, Virlana M.; Singh, Gurdeep; Malek-Gilani, Nakisa; Langroudi, Lida; Martchenko, Alexandre; So, Vincent; Macpherson, Neil N.; Mitchell, Jennifer A.
2017-01-01
Transcriptional enhancers are critical for maintaining cell-type–specific gene expression and driving cell fate changes during development. Highly transcribed genes are often associated with a cluster of individual enhancers such as those found in locus control regions. Recently, these have been termed stretch enhancers or super-enhancers, which have been predicted to regulate critical cell identity genes. We employed a CRISPR/Cas9-mediated deletion approach to study the function of several enhancer clusters (ECs) and isolated enhancers in mouse embryonic stem (ES) cells. Our results reveal that the effect of deleting ECs, also classified as ES cell super-enhancers, is highly variable, resulting in target gene expression reductions ranging from 12% to as much as 92%. Partial deletions of these ECs which removed only one enhancer or a subcluster of enhancers revealed partially redundant control of the regulated gene by multiple enhancers within the larger cluster. Many highly transcribed genes in ES cells are not associated with a super-enhancer; furthermore, super-enhancer predictions ignore 81% of the potentially active regulatory elements predicted by cobinding of five or more pluripotency-associated transcription factors. Deletion of these additional enhancer regions revealed their robust regulatory role in gene transcription. In addition, select super-enhancers and enhancers were identified that regulated clusters of paralogous genes. We conclude that, whereas robust transcriptional output can be achieved by an isolated enhancer, clusters of enhancers acting on a common target gene act in a partially redundant manner to fine tune transcriptional output of their target genes. PMID:27895109
Decoding the non-coding genome: elucidating genetic risk outside the coding genome.
Barr, C L; Misener, V L
2016-01-01
Current evidence emerging from genome-wide association studies indicates that the genetic underpinnings of complex traits are likely attributable to genetic variation that changes gene expression, rather than (or in combination with) variation that changes protein-coding sequences. This is particularly compelling with respect to psychiatric disorders, as genetic changes in regulatory regions may result in differential transcriptional responses to developmental cues and environmental/psychosocial stressors. Until recently, however, the link between transcriptional regulation and psychiatric genetic risk has been understudied. Multiple obstacles have contributed to the paucity of research in this area, including challenges in identifying the positions of remote (distal from the promoter) regulatory elements (e.g. enhancers) and their target genes and the underrepresentation of neural cell types and brain tissues in epigenome projects - the availability of high-quality brain tissues for epigenetic and transcriptome profiling, particularly for the adolescent and developing brain, has been limited. Further challenges have arisen in the prediction and testing of the functional impact of DNA variation with respect to multiple aspects of transcriptional control, including regulatory-element interaction (e.g. between enhancers and promoters), transcription factor binding and DNA methylation. Further, the brain has uncommon DNA-methylation marks with unique genomic distributions not found in other tissues - current evidence suggests the involvement of non-CG methylation and 5-hydroxymethylation in neurodevelopmental processes but much remains unknown. We review here knowledge gaps as well as both technological and resource obstacles that will need to be overcome in order to elucidate the involvement of brain-relevant gene-regulatory variants in genetic risk for psychiatric disorders. © 2015 John Wiley & Sons Ltd and International Behavioural and Neural Genetics Society.
Fedrigo, Olivier; Babbitt, Courtney C.; Wortham, Matthew; Tewari, Alok K.; London, Darin; Song, Lingyun; Lee, Bum-Kyu; Iyer, Vishwanath R.; Parker, Stephen C. J.; Margulies, Elliott H.; Wray, Gregory A.; Furey, Terrence S.; Crawford, Gregory E.
2012-01-01
Understanding the molecular basis for phenotypic differences between humans and other primates remains an outstanding challenge. Mutations in non-coding regulatory DNA that alter gene expression have been hypothesized as a key driver of these phenotypic differences. This has been supported by differential gene expression analyses in general, but not by the identification of specific regulatory elements responsible for changes in transcription and phenotype. To identify the genetic source of regulatory differences, we mapped DNaseI hypersensitive (DHS) sites, which mark all types of active gene regulatory elements, genome-wide in the same cell type isolated from human, chimpanzee, and macaque. Most DHS sites were conserved among all three species, as expected based on their central role in regulating transcription. However, we found evidence that several hundred DHS sites were gained or lost on the lineages leading to modern human and chimpanzee. Species-specific DHS site gains are enriched near differentially expressed genes, are positively correlated with increased transcription, show evidence of branch-specific positive selection, and overlap with active chromatin marks. Species-specific sequence differences in transcription factor motifs found within these DHS sites are linked with species-specific changes in chromatin accessibility. Together, these indicate that the regulatory elements identified here are genetic contributors to transcriptional and phenotypic differences among primate species. PMID:22761590
Enhanced Regulatory Sequence Prediction Using Gapped k-mer Features
Mohammad-Noori, Morteza; Beer, Michael A.
2014-01-01
Abstract Oligomers of length k, or k-mers, are convenient and widely used features for modeling the properties and functions of DNA and protein sequences. However, k-mers suffer from the inherent limitation that if the parameter k is increased to resolve longer features, the probability of observing any specific k-mer becomes very small, and k-mer counts approach a binary variable, with most k-mers absent and a few present once. Thus, any statistical learning approach using k-mers as features becomes susceptible to noisy training set k-mer frequencies once k becomes large. To address this problem, we introduce alternative feature sets using gapped k-mers, a new classifier, gkm-SVM, and a general method for robust estimation of k-mer frequencies. To make the method applicable to large-scale genome wide applications, we develop an efficient tree data structure for computing the kernel matrix. We show that compared to our original kmer-SVM and alternative approaches, our gkm-SVM predicts functional genomic regulatory elements and tissue specific enhancers with significantly improved accuracy, increasing the precision by up to a factor of two. We then show that gkm-SVM consistently outperforms kmer-SVM on human ENCODE ChIP-seq datasets, and further demonstrate the general utility of our method using a Naïve-Bayes classifier. Although developed for regulatory sequence analysis, these methods can be applied to any sequence classification problem. PMID:25033408
Enhanced regulatory sequence prediction using gapped k-mer features.
Ghandi, Mahmoud; Lee, Dongwon; Mohammad-Noori, Morteza; Beer, Michael A
2014-07-01
Oligomers of length k, or k-mers, are convenient and widely used features for modeling the properties and functions of DNA and protein sequences. However, k-mers suffer from the inherent limitation that if the parameter k is increased to resolve longer features, the probability of observing any specific k-mer becomes very small, and k-mer counts approach a binary variable, with most k-mers absent and a few present once. Thus, any statistical learning approach using k-mers as features becomes susceptible to noisy training set k-mer frequencies once k becomes large. To address this problem, we introduce alternative feature sets using gapped k-mers, a new classifier, gkm-SVM, and a general method for robust estimation of k-mer frequencies. To make the method applicable to large-scale genome wide applications, we develop an efficient tree data structure for computing the kernel matrix. We show that compared to our original kmer-SVM and alternative approaches, our gkm-SVM predicts functional genomic regulatory elements and tissue specific enhancers with significantly improved accuracy, increasing the precision by up to a factor of two. We then show that gkm-SVM consistently outperforms kmer-SVM on human ENCODE ChIP-seq datasets, and further demonstrate the general utility of our method using a Naïve-Bayes classifier. Although developed for regulatory sequence analysis, these methods can be applied to any sequence classification problem.
Transterm: a database to aid the analysis of regulatory sequences in mRNAs
Jacobs, Grant H.; Chen, Augustine; Stevens, Stewart G.; Stockwell, Peter A.; Black, Michael A.; Tate, Warren P.; Brown, Chris M.
2009-01-01
Messenger RNAs, in addition to coding for proteins, may contain regulatory elements that affect how the protein is translated. These include protein and microRNA-binding sites. Transterm (http://mRNA.otago.ac.nz/Transterm.html) is a database of regions and elements that affect translation with two major unique components. The first is integrated results of analysis of general features that affect translation (initiation, elongation, termination) for species or strains in Genbank, processed through a standard pipeline. The second is curated descriptions of experimentally determined regulatory elements that function as translational control elements in mRNAs. Transterm focuses on protein binding sites, particularly those in 3′-untranslated regions (3′-UTR). For this release the interface has been extensively updated based on user feedback. The data is now accessible by strain rather than species, for example there are 10 Escherichia coli strains (genomes) analysed separately. In addition to providing a repository of data, the database also provides tools for users to query their own mRNA sequences. Users can search sequences for Transterm or user defined regulatory elements, including protein or miRNA targets. Transterm also provides a central core of links to related resources for complementary analyses. PMID:18984623
NASA Astrophysics Data System (ADS)
Murugan, R.
2010-10-01
In this paper, we develop a theory on the mechanism of distal action of the transcription factors, which are bound at their respective cis-regulatory enhancer modules on the promoter-RNA polymerase II (PR) complexes to initiate the transcription event in eukaryotes. We consider both the looping and tracking modes of their distal communication and calculate the mean first passage time that is required for the distal interactions of the complex of enhancer and transcription factor with the PR via both these modes. We further investigate how this mean first passage time is dependent on the length of the DNA segment (L, base-pairs) that connects the cis-regulatory binding site and the respective promoter. When the radius of curvature of this connecting segment of DNA is R that was induced upon binding of the transcription factor at the cis-acting element and RNAPII at the promoter in cis-positions, our calculations indicate that the looping mode of distal action will dominate when L is such that L > 2πR and the tracking mode of distal action will be favored when L < 2πR. The time required for the distal action will be minimum when L = 2πR where the typical value of R for the binding of histones will be R ~ 16 bps and L ~ 102 bps. It seems that the free energy associated with the binding of the transcription factor with its cis-acting element and the distance of this cis-acting element from the corresponding promoter of the gene of interest is negatively correlated. Our results suggest that the looping and tracking modes of distal action are concurrently operating on the transcription activation and the physics that determines the timescales associated with the looping/tracking in the mechanism of action of these transcription factors on the initiation of the transcription event must put a selection pressure on the distribution of the distances of cis-regulatory modules from their respective promoters of the genes. The computational analysis of the upstream sequences of promoters of various genes in the human and mouse genomes for the presence of putative cis-regulatory elements for a set of known transcription factors using the position weight matrices available with the JASPAR database indicates the presence of cis-acting elements with maximum probability at a distance of ~102 bps from the promoters which substantiates our theoretical predictions.
Kumar, Vibhor; Rayan, Nirmala Arul; Muratani, Masafumi; Lim, Stefan; Elanggovan, Bavani; Xin, Lixia; Lu, Tess; Makhija, Harshyaa; Poschmann, Jeremie; Lufkin, Thomas; Ng, Huck Hui; Prabhakar, Shyam
2016-05-01
Although over 35 different histone acetylation marks have been described, the overwhelming majority of regulatory genomics studies focus exclusively on H3K27ac and H3K9ac. In order to identify novel epigenomic traits of regulatory elements, we constructed a benchmark set of validated enhancers by performing 140 enhancer assays in human T cells. We tested 40 chromatin signatures on this unbiased enhancer set and identified H2BK20ac, a little-studied histone modification, as the most predictive mark of active enhancers. Notably, we detected a novel class of functionally distinct enhancers enriched in H2BK20ac but lacking H3K27ac, which was present in all examined cell lines and also in embryonic forebrain tissue. H2BK20ac was also unique in highlighting cell-type-specific promoters. In contrast, other acetylation marks were present in all active promoters, regardless of cell-type specificity. In stimulated microglial cells, H2BK20ac was more correlated with cell-state-specific expression changes than H3K27ac, with TGF-beta signaling decoupling the two acetylation marks at a subset of regulatory elements. In summary, our study reveals a previously unknown connection between histone acetylation and cell-type-specific gene regulation and indicates that H2BK20ac profiling can be used to uncover new dimensions of gene regulation. © 2016 Kumar et al.; Published by Cold Spring Harbor Laboratory Press.
Kumar, Vibhor; Rayan, Nirmala Arul; Muratani, Masafumi; Lim, Stefan; Elanggovan, Bavani; Xin, Lixia; Lu, Tess; Makhija, Harshyaa; Poschmann, Jeremie; Lufkin, Thomas; Ng, Huck Hui; Prabhakar, Shyam
2016-01-01
Although over 35 different histone acetylation marks have been described, the overwhelming majority of regulatory genomics studies focus exclusively on H3K27ac and H3K9ac. In order to identify novel epigenomic traits of regulatory elements, we constructed a benchmark set of validated enhancers by performing 140 enhancer assays in human T cells. We tested 40 chromatin signatures on this unbiased enhancer set and identified H2BK20ac, a little-studied histone modification, as the most predictive mark of active enhancers. Notably, we detected a novel class of functionally distinct enhancers enriched in H2BK20ac but lacking H3K27ac, which was present in all examined cell lines and also in embryonic forebrain tissue. H2BK20ac was also unique in highlighting cell-type-specific promoters. In contrast, other acetylation marks were present in all active promoters, regardless of cell-type specificity. In stimulated microglial cells, H2BK20ac was more correlated with cell-state-specific expression changes than H3K27ac, with TGF-beta signaling decoupling the two acetylation marks at a subset of regulatory elements. In summary, our study reveals a previously unknown connection between histone acetylation and cell-type-specific gene regulation and indicates that H2BK20ac profiling can be used to uncover new dimensions of gene regulation. PMID:26957309
Li, Yang Eric; Xiao, Mu; Shi, Binbin; Yang, Yu-Cheng T; Wang, Dong; Wang, Fei; Marcia, Marco; Lu, Zhi John
2017-09-08
Crosslinking immunoprecipitation sequencing (CLIP-seq) technologies have enabled researchers to characterize transcriptome-wide binding sites of RNA-binding protein (RBP) with high resolution. We apply a soft-clustering method, RBPgroup, to various CLIP-seq datasets to group together RBPs that specifically bind the same RNA sites. Such combinatorial clustering of RBPs helps interpret CLIP-seq data and suggests functional RNA regulatory elements. Furthermore, we validate two RBP-RBP interactions in cell lines. Our approach links proteins and RNA motifs known to possess similar biochemical and cellular properties and can, when used in conjunction with additional experimental data, identify high-confidence RBP groups and their associated RNA regulatory elements.
Low-income minority fathers' control strategies and children's regulatory skills
Malin, Jenessa L.; Cabrera, Natasha J.; Karberg, Elizabeth; Aldoney, Daniela; Rowe, Meredith
2015-01-01
The current study explored the bidirectional association of children's individual characteristics, fathers' control strategies at 24-months and children's regulatory skills at pre-kindergarten (pre-K). Using a sample of low-income minority families with 2-year-olds from the Early Head Start Evaluation Research Program (n = 71) we assessed the association between child gender and vocabulary skills, fathers' control strategies at 24-months (e.g., regulatory behavior and regulatory language), and children's sustained attention and emotion regulation at pre-kindergarten. There were three main findings. First, fathers' overwhelmingly use commands (e.g., do that) to promote compliance in their 24-month old children. Second, children's vocabulary skills predict fathers' regulatory behaviors during a father-child interaction, whereas children's gender predicts fathers' regulatory language during an interaction. Third, controlling for maternal supportiveness, fathers' regulatory behaviors at 24-months predict children's sustained attention at pre-kindergarten whereas fathers' regulatory language at 24-months predicts children's emotion regulation at pre-kindergarten. Our findings highlight the importance of examining paternal contributions to children's regulatory skills. PMID:25798496
Low-income, minority fathers' control strategies and their children's regulatory skills.
Malin, Jenessa L; Cabrera, Natasha J; Karberg, Elizabeth; Aldoney, Daniela; Rowe, Meredith L
2014-01-01
The current study explored the bidirectional association of children's individual characteristics, fathers' control strategies at 24 months, and children's regulatory skills at prekindergarten (pre-K). Using a sample of low-income, minority families with 2-year-olds from the Early Head Start Research and Evaluation Project (n = 71), we assessed the association between child gender and vocabulary skills, fathers' control strategies at 24 months (e.g., regulatory behavior and regulatory language), and children's sustained attention and emotion regulation at prekindergarten. There were three main findings. First, fathers overwhelmingly used commands (e.g., "Do that.") to promote compliance in their 24-month-old children. Second, children's vocabulary skills predicted fathers' regulatory behaviors during a father-child interaction whereas children's gender predicted fathers' regulatory language during an interaction. Third, controlling for maternal supportiveness, fathers' regulatory behaviors at 24 months predicted children's sustained attention at pre-K whereas fathers' regulatory language at 24 months predicted children's emotion regulation at pre-K. Our findings highlight the importance of examining paternal contributions to children's regulatory skills. © 2014 Michigan Association for Infant Mental Health.
Giresi, Paul G.; Kim, Jonghwan; McDaniell, Ryan M.; Iyer, Vishwanath R.; Lieb, Jason D.
2007-01-01
DNA segments that actively regulate transcription in vivo are typically characterized by eviction of nucleosomes from chromatin and are experimentally identified by their hypersensitivity to nucleases. Here we demonstrate a simple procedure for the isolation of nucleosome-depleted DNA from human chromatin, termed FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements). To perform FAIRE, chromatin is crosslinked with formaldehyde in vivo, sheared by sonication, and phenol-chloroform extracted. The DNA recovered in the aqueous phase is fluorescently labeled and hybridized to a DNA microarray. FAIRE performed in human cells strongly enriches DNA coincident with the location of DNaseI hypersensitive sites, transcriptional start sites, and active promoters. Evidence for cell-type–specific patterns of FAIRE enrichment is also presented. FAIRE has utility as a positive selection for genomic regions associated with regulatory activity, including regions traditionally detected by nuclease hypersensitivity assays. PMID:17179217
Genetic and epigenetic variation in the lineage specification of regulatory T cells
Arvey, Aaron; van der Veeken, Joris; Plitas, George; Rich, Stephen S; Concannon, Patrick; Rudensky, Alexander Y
2015-01-01
Regulatory T (Treg) cells, which suppress autoimmunity and other inflammatory states, are characterized by a distinct set of genetic elements controlling their gene expression. However, the extent of genetic and associated epigenetic variation in the Treg cell lineage and its possible relation to disease states in humans remain unknown. We explored evolutionary conservation of regulatory elements and natural human inter-individual epigenetic variation in Treg cells to identify the core transcriptional control program of lineage specification. Analysis of single nucleotide polymorphisms in core lineage-specific enhancers revealed disease associations, which were further corroborated by high-resolution genotyping to fine map causal polymorphisms in lineage-specific enhancers. Our findings suggest that a small set of regulatory elements specify the Treg lineage and that genetic variation in Treg cell-specific enhancers may alter Treg cell function contributing to polygenic disease. DOI: http://dx.doi.org/10.7554/eLife.07571.001 PMID:26510014
Keyter, Andrea; Gouws, Joey; Salek, Sam; Walker, Stuart
2018-01-01
The aims of this study were to assess the regulatory review process in South Africa from 2015 to 2017, identify the key milestones and timelines; evaluate the effectiveness of measures to ensure consistency, transparency, timeliness, and predictability in the review process; and to provide recommendations for enhanced regulatory practices. A questionnaire was completed by the Medicines Control Council (MCC) to describe the organization of the authority, record key milestones and timelines in the review process and to identify good review practices (GRevPs). Currently, the MCC conducts a full assessment of quality, efficacy, and safety data in the review of all applications. The overall regulatory median approval time decreased by 14% in 2017 (1411 calendar days) compared with that of 2016, despite the 27% increase in the number of applications. However, the MCC has no target for overall approval time of new active substance applications and no targets for key review milestones. Guidelines, standard operating procedures, and review templates are in place, while the formal implementation of GRevPs and the application of an electronic document management system are planned for the near future. As the MCC transitions to the newly established South Africa Health Products Regulatory Authority, it would be crucial for the authority to recognize the opportunities for an enhanced regulatory review and should consider models such as abridged assessment, which encompass elements of risk stratification and reliance. It is hoped that resource constraints may then be alleviated and capacity developed to meet target timelines.
Barsi, Julius C; Davidson, Eric H
2016-01-01
Specification of the ciliated band (CB) of echinoid embryos executes three spatial functions essential for postgastrular organization. These are establishment of a band about 5 cells wide which delimits and bounds other embryonic territories; definition of a neurogenic domain within this band; and generation within it of arrays of ciliary cells that bear the special long cilia from which the structure derives its name. In Strongylocentrotus purpuratus the spatial coordinates of the future ciliated band are initially and exactly determined by the disposition of a ring of cells that transcriptionally activate the onecut homeodomain regulatory gene, beginning in blastula stage, long before the appearance of the CB per se. Thus the cis-regulatory apparatus that governs onecut expression in the blastula directly reveals the genomic sequence code by which these aspects of the spatial organization of the embryo are initially determined. We screened the entire onecut locus and its flanking region for transcriptionally active cis-regulatory elements, and by means of BAC recombineered deletions identified three separated and required cis-regulatory modules that execute different functions. The operating logic of the crucial spatial control module accounting for the spectacularly precise and beautiful early onecut expression domain depends on spatial repression. Previously predicted oral ectoderm and aboral ectoderm repressors were identified by cis-regulatory mutation as the products of goosecoid and irxa genes respectively, while the pan-ectodermal activator SoxB1 supplies a transcriptional driver function. Copyright © 2015. Published by Elsevier Inc.
Conserved Non-Coding Regulatory Signatures in Arabidopsis Co-Expressed Gene Modules
Spangler, Jacob B.; Ficklin, Stephen P.; Luo, Feng; Freeling, Michael; Feltus, F. Alex
2012-01-01
Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome. PMID:23024789
Conserved non-coding regulatory signatures in Arabidopsis co-expressed gene modules.
Spangler, Jacob B; Ficklin, Stephen P; Luo, Feng; Freeling, Michael; Feltus, F Alex
2012-01-01
Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome.
Two regulatory RNA elements affect TisB-dependent depolarization and persister formation.
Berghoff, Bork A; Hoekzema, Mirthe; Aulbach, Lena; Wagner, E Gerhart H
2017-03-01
Bacterial survival strategies involve phenotypic diversity which is generated by regulatory factors and noisy expression of effector proteins. The question of how bacteria exploit regulatory RNAs to make decisions between phenotypes is central to a general understanding of these universal regulators. We investigated the TisB/IstR-1 toxin-antitoxin system of Escherichia coli to appreciate the role of the RNA antitoxin IstR-1 in TisB-dependent depolarization of the inner membrane and persister formation. Persisters are phenotypic variants that have become transiently drug-tolerant by arresting growth. The RNA antitoxin IstR-1 sets a threshold for TisB-dependent depolarization under DNA-damaging conditions, resulting in two sub-populations: polarized and depolarized cells. Furthermore, our data indicate that an inhibitory 5' UTR structure in the tisB mRNA serves as a regulatory RNA element that delays TisB translation to avoid inappropriate depolarization when DNA damage is low. Investigation of the persister sub-population further revealed that both regulatory RNA elements affect persister levels as well as persistence time. This work provides an intriguing example of how bacteria exploit regulatory RNAs to control phenotypic heterogeneity. © 2016 John Wiley & Sons Ltd.
Regulatory principles governing Salmonella and Yersinia virulence
Erhardt, Marc; Dersch, Petra
2015-01-01
Enteric pathogens such as Salmonella and Yersinia evolved numerous strategies to survive and proliferate in different environmental reservoirs and mammalian hosts. Deciphering common and pathogen-specific principles for how these bacteria adjust and coordinate spatiotemporal expression of virulence determinants, stress adaptation, and metabolic functions is fundamental to understand microbial pathogenesis. In order to manage sudden environmental changes, attacks by the host immune systems and microbial competition, the pathogens employ a plethora of transcriptional and post-transcriptional control elements, including transcription factors, sensory and regulatory RNAs, RNAses, and proteases, to fine-tune and control complex gene regulatory networks. Many of the contributing global regulators and the molecular mechanisms of regulation are frequently conserved between Yersinia and Salmonella. However, the interplay, arrangement, and composition of the control elements vary between these closely related enteric pathogens, which generate phenotypic differences leading to distinct pathogenic properties. In this overview we present common and different regulatory networks used by Salmonella and Yersinia to coordinate the expression of crucial motility, cell adhesion and invasion determinants, immune defense strategies, and metabolic adaptation processes. We highlight evolutionary changes of the gene regulatory circuits that result in different properties of the regulatory elements and how this influences the overall outcome of the infection process. PMID:26441883
Giresi, Paul G.; Lieb, Jason D.
2009-01-01
The binding of sequence-specific regulatory factors and the recruitment of chromatin remodeling activities cause nucleosomes to be evicted from chromatin in eukaryotic cells. Traditionally, these active sites have been identified experimentally through their sensitivity to nucleases. Here we describe the details of a simple procedure for the genome-wide isolation of nucleosome-depleted DNA from human chromatin, termed FAIRE (Formaldehyde Assisted Isolation of Regulatory Elements). We also provide protocols for different methods of detecting FAIRE-enriched DNA, including use of PCR, DNA microarrays, and next-generation sequencing. FAIRE works on all eukaryotic chromatin tested to date. To perform FAIRE, chromatin is crosslinked with formaldehyde, sheared by sonication, and phenol-chloroform extracted. Most genomic DNA is crosslinked to nucleosomes and is sequestered to the interphase, whereas DNA recovered in the aqueous phase corresponds to nucleosome-depleted regions of the genome. The isolated regions are largely coincident with the location of DNaseI hypersensitive sites, transcriptional start sites, enhancers, insulators, and active promoters. Given its speed and simplicity, FAIRE has utility in establishing chromatin profiles of diverse cell types in health and disease, isolating DNA regulatory elements en masse for further characterization, and as a screening assay for the effects of small molecules on chromatin organization. PMID:19303047
Huber, M C; Bosch, F X; Sippel, A E; Bonifer, C
1994-01-01
The complete chicken lysozyme gene locus is expressed copy number dependently and at a high level in macrophages of transgenic mice. Gene expression independent of genomic position can only be achieved by the concerted action of all cis regulatory elements located on the lysozyme gene domain. Position independency of expression is lost if one essential cis regulatory region is deleted. Here we compared the DNase I hypersensitive site (DHS) pattern formed on the chromatin of position independently and position dependently expressed transgenes in order to assess the influence of deletions within the gene domain on active chromatin formation. We demonstrate, that in position independently expressed transgene all DHSs are formed with the authentic relative frequency on all genes. This is not the case for position dependently expressed transgenes. Our results show that the formation of a DHS during cellular differentiation does not occur autonomously. In case essential regulatory elements of the chicken lysozyme gene domain are lacking, the efficiency of DHS formation on remaining cis regulatory elements during myeloid differentiation is reduced and influenced by the chromosomal position. Hence, no individual regulatory element on the lysozyme domain is capable of organizing the chromatin structure of the whole locus in a dominant fashion. Images PMID:7937145
Arenas-Mena, Cesar; Coffman, James A.
2016-01-01
Summary It is proposed that the evolution of complex animals required repressive genetic mechanisms for controlling the transcriptional and proliferative potency of cells. Unicellular organisms are transcriptionally potent, able to express their full genetic complement as the need arises through their life cycle, whereas differentiated cells of multicellular organisms can only express a fraction of their genomic potential. Likewise, whereas cell proliferation in unicellular organisms is primarily limited by nutrient availability, cell proliferation in multicellular organisms is developmentally regulated. Repressive genetic controls limiting the potency of cells at the end of ontogeny would have stabilized the gene expression states of differentiated cells and prevented disruptive proliferation, allowing the emergence of diverse cell types and functional shapes. We propose that distal cis-regulatory elements represent the primary innovations that set the stage for the evolution of developmental gene regulatory networks and the repressive control of key multipotency and cell-cycle control genes. The testable prediction of this model is that the genomes of extant animals, unlike those of our unicellular relatives, encode gene regulatory circuits dedicated to the developmental control of transcriptional and proliferative potency. PMID:26173445
McBride, David J.; Buckle, Adam; van Heyningen, Veronica; Kleinjan, Dirk A.
2011-01-01
The PAX6 gene plays a crucial role in development of the eye, brain, olfactory system and endocrine pancreas. Consistent with its pleiotropic role the gene exhibits a complex developmental expression pattern which is subject to strict spatial, temporal and quantitative regulation. Control of expression depends on a large array of cis-elements residing in an extended genomic domain around the coding region of the gene. The minimal essential region required for proper regulation of this complex locus has been defined through analysis of human aniridia-associated breakpoints and YAC transgenic rescue studies of the mouse smalleye mutant. We have carried out a systematic DNase I hypersensitive site (HS) analysis across 200 kb of this critical region of mouse chromosome 2E3 to identify putative regulatory elements. Mapping the identified HSs onto a percent identity plot (PIP) shows many HSs correspond to recognisable genomic features such as evolutionarily conserved sequences, CpG islands and retrotransposon derived repeats. We then focussed on a region previously shown to contain essential long range cis-regulatory information, the Pax6 downstream regulatory region (DRR), allowing comparison of mouse HS data with previous human HS data for this region. Reporter transgenic mice for two of the HS sites, HS5 and HS6, show that they function as tissue specific regulatory elements. In addition we have characterised enhancer activity of an ultra-conserved cis-regulatory region located near Pax6, termed E60. All three cis-elements exhibit multiple spatio-temporal activities in the embryo that overlap between themselves and other elements in the locus. Using a deletion set of YAC reporter transgenic mice we demonstrate functional interdependence of the elements. Finally, we use the HS6 enhancer as a marker for the migration of precerebellar neuro-epithelium cells to the hindbrain precerebellar nuclei along the posterior and anterior extramural streams allowing visualisation of migratory defects in both pathways in Pax6Sey/Sey mice. PMID:22220192
A role for circadian evening elements in cold-regulated gene expression in Arabidopsis.
Mikkelsen, Michael D; Thomashow, Michael F
2009-10-01
The plant transcriptome is dramatically altered in response to low temperature. The cis-acting DNA regulatory elements and trans-acting factors that regulate the majority of cold-regulated genes are unknown. Previous bioinformatic analysis has indicated that the promoters of cold-induced genes are enriched in the Evening Element (EE), AAAATATCT, a DNA regulatory element that has a role in circadian-regulated gene expression. Here we tested the role of EE and EE-like (EEL) elements in cold-induced expression of two Arabidopsis genes, CONSTANS-like 1 (COL1; At5g54470) and a gene encoding a 27-kDa protein of unknown function that we designated COLD-REGULATED GENE 27 (COR27; At5g42900). Mutational analysis indicated that the EE/EEL elements were required for cold induction of COL1 and COR27, and that their action was amplified through coupling with ABA response element (ABRE)-like (ABREL) motifs. An artificial promoter consisting solely of four EE motifs interspersed with three ABREL motifs was sufficient to impart cold-induced gene expression. Both COL1 and COR27 were found to be regulated by the circadian clock at warm growth temperatures and cold-induction of COR27 was gated by the clock. These results suggest that cold- and clock-regulated gene expression are integrated through regulatory proteins that bind to EE and EEL elements supported by transcription factors acting at ABREL sequences. Bioinformatic analysis indicated that the coupling of EE and EEL motifs with ABREL motifs is highly enriched in cold-induced genes and thus may constitute a DNA regulatory element pair with a significant role in configuring the low-temperature transcriptome.
Parallel evolution of chordate cis-regulatory code for development.
Doglio, Laura; Goode, Debbie K; Pelleri, Maria C; Pauls, Stefan; Frabetti, Flavia; Shimeld, Sebastian M; Vavouri, Tanya; Elgar, Greg
2013-11-01
Urochordates are the closest relatives of vertebrates and at the larval stage, possess a characteristic bilateral chordate body plan. In vertebrates, the genes that orchestrate embryonic patterning are in part regulated by highly conserved non-coding elements (CNEs), yet these elements have not been identified in urochordate genomes. Consequently the evolution of the cis-regulatory code for urochordate development remains largely uncharacterised. Here, we use genome-wide comparisons between C. intestinalis and C. savignyi to identify putative urochordate cis-regulatory sequences. Ciona conserved non-coding elements (ciCNEs) are associated with largely the same key regulatory genes as vertebrate CNEs. Furthermore, some of the tested ciCNEs are able to activate reporter gene expression in both zebrafish and Ciona embryos, in a pattern that at least partially overlaps that of the gene they associate with, despite the absence of sequence identity. We also show that the ability of a ciCNE to up-regulate gene expression in vertebrate embryos can in some cases be localised to short sub-sequences, suggesting that functional cross-talk may be defined by small regions of ancestral regulatory logic, although functional sub-sequences may also be dispersed across the whole element. We conclude that the structure and organisation of cis-regulatory modules is very different between vertebrates and urochordates, reflecting their separate evolutionary histories. However, functional cross-talk still exists because the same repertoire of transcription factors has likely guided their parallel evolution, exploiting similar sets of binding sites but in different combinations.
The G-Box Transcriptional Regulatory Code in Arabidopsis1[OPEN
Shepherd, Samuel J.K.; Brestovitsky, Anna; Dickinson, Patrick; Biswas, Surojit
2017-01-01
Plants have significantly more transcription factor (TF) families than animals and fungi, and plant TF families tend to contain more genes; these expansions are linked to adaptation to environmental stressors. Many TF family members bind to similar or identical sequence motifs, such as G-boxes (CACGTG), so it is difficult to predict regulatory relationships. We determined that the flanking sequences near G-boxes help determine in vitro specificity but that this is insufficient to predict the transcription pattern of genes near G-boxes. Therefore, we constructed a gene regulatory network that identifies the set of bZIPs and bHLHs that are most predictive of the expression of genes downstream of perfect G-boxes. This network accurately predicts transcriptional patterns and reconstructs known regulatory subnetworks. Finally, we present Ara-BOX-cis (araboxcis.org), a Web site that provides interactive visualizations of the G-box regulatory network, a useful resource for generating predictions for gene regulatory relations. PMID:28864470
The 3’-Jα Region of the TCRα Locus Bears Gene Regulatory Activity in Thymic and Peripheral T Cells
Kučerová-Levisohn, Martina; Knirr, Stefan; Mejia, Rosa I.; Ortiz, Benjamin D.
2015-01-01
Much progress has been made in understanding the important cis-mediated controls on mouse TCRα gene function, including identification of the Eα enhancer and TCRα locus control region (LCR). Nevertheless, previous data have suggested that other cis-regulatory elements may reside in the locus outside of the Eα/LCR. Based on prior findings, we hypothesized the existence of gene regulatory elements in a 3.9-kb region 5’ of the Cα exons. Using DNase hypersensitivity assays and TCRα BAC reporter transgenes in mice, we detected gene regulatory activity within this 3.9-kb region. This region is active in both thymic and peripheral T cells, and selectively affects upstream, but not downstream, gene expression. Together, these data indicate the existence of a novel cis-acting regulatory complex that contributes to TCRα transgene expression in vivo. The active chromatin sites we discovered within this region would remain in the locus after TCRα gene rearrangement, and thus may contribute to endogenous TCRα gene activity, particularly in peripheral T cells, where the Eα element has been found to be inactive. PMID:26177549
A provisional regulatory gene network for specification of endomesoderm in the sea urchin embryo
NASA Technical Reports Server (NTRS)
Davidson, Eric H.; Rast, Jonathan P.; Oliveri, Paola; Ransick, Andrew; Calestani, Cristina; Yuh, Chiou-Hwa; Minokawa, Takuya; Amore, Gabriele; Hinman, Veronica; Arenas-Mena, Cesar;
2002-01-01
We present the current form of a provisional DNA sequence-based regulatory gene network that explains in outline how endomesodermal specification in the sea urchin embryo is controlled. The model of the network is in a continuous process of revision and growth as new genes are added and new experimental results become available; see http://www.its.caltech.edu/mirsky/endomeso.htm (End-mes Gene Network Update) for the latest version. The network contains over 40 genes at present, many newly uncovered in the course of this work, and most encoding DNA-binding transcriptional regulatory factors. The architecture of the network was approached initially by construction of a logic model that integrated the extensive experimental evidence now available on endomesoderm specification. The internal linkages between genes in the network have been determined functionally, by measurement of the effects of regulatory perturbations on the expression of all relevant genes in the network. Five kinds of perturbation have been applied: (1) use of morpholino antisense oligonucleotides targeted to many of the key regulatory genes in the network; (2) transformation of other regulatory factors into dominant repressors by construction of Engrailed repressor domain fusions; (3) ectopic expression of given regulatory factors, from genetic expression constructs and from injected mRNAs; (4) blockade of the beta-catenin/Tcf pathway by introduction of mRNA encoding the intracellular domain of cadherin; and (5) blockade of the Notch signaling pathway by introduction of mRNA encoding the extracellular domain of the Notch receptor. The network model predicts the cis-regulatory inputs that link each gene into the network. Therefore, its architecture is testable by cis-regulatory analysis. Strongylocentrotus purpuratus and Lytechinus variegatus genomic BAC recombinants that include a large number of the genes in the network have been sequenced and annotated. Tests of the cis-regulatory predictions of the model are greatly facilitated by interspecific computational sequence comparison, which affords a rapid identification of likely cis-regulatory elements in advance of experimental analysis. The network specifies genomically encoded regulatory processes between early cleavage and gastrula stages. These control the specification of the micromere lineage and of the initial veg(2) endomesodermal domain; the blastula-stage separation of the central veg(2) mesodermal domain (i.e., the secondary mesenchyme progenitor field) from the peripheral veg(2) endodermal domain; the stabilization of specification state within these domains; and activation of some downstream differentiation genes. Each of the temporal-spatial phases of specification is represented in a subelement of the network model, that treats regulatory events within the relevant embryonic nuclei at particular stages. (c) 2002 Elsevier Science (USA).
Long-Range Control of Gene Expression: Emerging Mechanisms and Disruption in Disease
Kleinjan, Dirk A.; van Heyningen, Veronica
2005-01-01
Transcriptional control is a major mechanism for regulating gene expression. The complex machinery required to effect this control is still emerging from functional and evolutionary analysis of genomic architecture. In addition to the promoter, many other regulatory elements are required for spatiotemporally and quantitatively correct gene expression. Enhancer and repressor elements may reside in introns or up- and downstream of the transcription unit. For some genes with highly complex expression patterns—often those that function as key developmental control genes—the cis-regulatory domain can extend long distances outside the transcription unit. Some of the earliest hints of this came from disease-associated chromosomal breaks positioned well outside the relevant gene. With the availability of wide-ranging genome sequence comparisons, strong conservation of many noncoding regions became obvious. Functional studies have shown many of these conserved sites to be transcriptional regulatory elements that sometimes reside inside unrelated neighboring genes. Such sequence-conserved elements generally harbor sites for tissue-specific DNA-binding proteins. Developmentally variable chromatin conformation can control protein access to these sites and can regulate transcription. Disruption of these finely tuned mechanisms can cause disease. Some regulatory element mutations will be associated with phenotypes distinct from any identified for coding-region mutations. PMID:15549674
Cis-regulatory RNA elements that regulate specialized ribosome activity.
Xue, Shifeng; Barna, Maria
2015-01-01
Recent evidence has shown that the ribosome itself can play a highly regulatory role in the specialized translation of specific subpools of mRNAs, in particular at the level of ribosomal proteins (RP). However, the mechanism(s) by which this selection takes place has remained poorly understood. In our recent study, we discovered a combination of unique RNA elements in the 5'UTRs of mRNAs that allows for such control by the ribosome. These mRNAs contain a Translation Inhibitory Element (TIE) that inhibits general cap-dependent translation, and an Internal Ribosome Entry Site (IRES) that relies on a specific RP for activation. The unique combination of an inhibitor of general translation and an activator of specialized translation is key to ribosome-mediated control of gene expression. Here we discuss how these RNA regulatory elements provide a new level of control to protein expression and their implications for gene expression, organismal development and evolution.
Sano, R; Kuboya, E; Nakajima, T; Takahashi, Y; Takahashi, K; Kubo, R; Kominato, Y; Takeshita, H; Yamao, H; Kishida, T; Isa, K; Ogasawara, K; Uchikawa, M
2015-04-01
We developed a sequence-specific primer PCR (SSP-PCR) for detection of a 5.8-kb deletion (B(m) 5.8) involving an erythroid cell-specific regulatory element in intron 1 of the ABO blood group gene. Using this SSP-PCR, we performed genetic analysis of 382 individuals with Bm or ABm. The 5.8-kb deletion was found in 380 individuals, and disruption of the GATA motif in the regulatory element was found in one individual. Furthermore, a novel 3.0-kb deletion involving the element (B(m) 3.0) was demonstrated in the remaining individual. Comparisons of single-nucleotide polymorphisms and microsatellites in intron 1 between B(m) 5.8 and B(m) 3.0 suggested that these deletions occurred independently. © 2014 International Society of Blood Transfusion.
Identification of germline transcriptional regulatory elements in Aedes aegypti.
Akbari, Omar S; Papathanos, Philippos A; Sandler, Jeremy E; Kennedy, Katie; Hay, Bruce A
2014-02-04
The mosquito Aedes aegypti is the principal vector for the yellow fever and dengue viruses, and is also responsible for recent outbreaks of the alphavirus chikungunya. Vector control strategies utilizing engineered gene drive systems are being developed as a means of replacing wild, pathogen transmitting mosquitoes with individuals refractory to disease transmission, or bringing about population suppression. Several of these systems, including Medea, UD(MEL), and site-specific nucleases, which can be used to drive genes into populations or bring about population suppression, utilize transcriptional regulatory elements that drive germline-specific expression. Here we report the identification of multiple regulatory elements able to drive gene expression specifically in the female germline, or in the male and female germline, in the mosquito Aedes aegypti. These elements can also be used as tools with which to probe the roles of specific genes in germline function and in the early embryo, through overexpression or RNA interference.
Identification of germline transcriptional regulatory elements in Aedes aegypti
NASA Astrophysics Data System (ADS)
Akbari, Omar S.; Papathanos, Philippos A.; Sandler, Jeremy E.; Kennedy, Katie; Hay, Bruce A.
2014-02-01
The mosquito Aedes aegypti is the principal vector for the yellow fever and dengue viruses, and is also responsible for recent outbreaks of the alphavirus chikungunya. Vector control strategies utilizing engineered gene drive systems are being developed as a means of replacing wild, pathogen transmitting mosquitoes with individuals refractory to disease transmission, or bringing about population suppression. Several of these systems, including Medea, UDMEL, and site-specific nucleases, which can be used to drive genes into populations or bring about population suppression, utilize transcriptional regulatory elements that drive germline-specific expression. Here we report the identification of multiple regulatory elements able to drive gene expression specifically in the female germline, or in the male and female germline, in the mosquito Aedes aegypti. These elements can also be used as tools with which to probe the roles of specific genes in germline function and in the early embryo, through overexpression or RNA interference.
Zandvakili, Arya; Campbell, Ian; Weirauch, Matthew T.
2018-01-01
Cells use thousands of regulatory sequences to recruit transcription factors (TFs) and produce specific transcriptional outcomes. Since TFs bind degenerate DNA sequences, discriminating functional TF binding sites (TFBSs) from background sequences represents a significant challenge. Here, we show that a Drosophila regulatory element that activates Epidermal Growth Factor signaling requires overlapping, low-affinity TFBSs for competing TFs (Pax2 and Senseless) to ensure cell- and segment-specific activity. Testing available TF binding models for Pax2 and Senseless, however, revealed variable accuracy in predicting such low-affinity TFBSs. To better define parameters that increase accuracy, we developed a method that systematically selects subsets of TFBSs based on predicted affinity to generate hundreds of position-weight matrices (PWMs). Counterintuitively, we found that degenerate PWMs produced from datasets depleted of high-affinity sequences were more accurate in identifying both low- and high-affinity TFBSs for the Pax2 and Senseless TFs. Taken together, these findings reveal how TFBS arrangement can be constrained by competition rather than cooperativity and that degenerate models of TF binding preferences can improve identification of biologically relevant low affinity TFBSs. PMID:29617378
Characterization of promoter of EgPAL1, a novel PAL gene from the oil palm Elaeis guineensis Jacq.
Yusuf, Chong Yu Lok; Abdullah, Janna Ong; Shaharuddin, Noor Azmi; Abu Seman, Idris; Abdullah, Mohd Puad
2018-02-01
The oil palm EgPAL1 gene promoter and its regulatory region were functional as a promoter in the heterologous system of Arabidopsis according to the cis-acting elements present in that region. The promoter was developmentally regulated, vascular tissue specific and responsive to water stress agents. Phenylalanine ammonia lyase (PAL, EC 4.3.1.24) is the key enzyme of the phenylpropanoid pathway which plays important roles in plant development and adaptation. To date, there is no report on the study of PAL from oil palm (Elaeis guineensis), an economically important oil crop. In this study, the 5' regulatory sequence of a highly divergent oil palm PAL gene (EgPAL1) was isolated and fused with GUS in Arabidopsis to create two transgenic plants carrying the minimal promoter with (2302 bp) and without its regulatory elements (139 bp). The regulatory sequence contained cis-acting elements known to be important for plant development and stress response including the AC-II element for lignin biosynthesis and several stress responsive elements. The promoter and its regulatory region were fully functional in Arabidopsis. Its activities were characterised by two common fundamental features of PAL which are responsive to plant internal developmental programme and external factors. The promoter was developmentally regulated in certain organs; highly active in young organs but less active or inactive in mature organs. The presence of the AC elements and global activity of the EgPAL1 promoter in all organs resembled the property of lignin-related genes. The existence of the MBS element and enhancement of the promoter activity by PEG reflected the behaviour of drought-responsive genes. Our findings provide a platform for evaluating oil palm gene promoters in the heterologous system of Arabidopsis and give insights into the activities of EgPAL1 promoter in oil palm.
Gordon, Kacy L.; Arthur, Robert K.; Ruvinsky, Ilya
2015-01-01
Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements. PMID:26020930
Early Evolution of Conserved Regulatory Sequences Associated with Development in Vertebrates
McEwen, Gayle K.; Goode, Debbie K.; Parker, Hugo J.; Woolfe, Adam; Callaway, Heather; Elgar, Greg
2009-01-01
Comparisons between diverse vertebrate genomes have uncovered thousands of highly conserved non-coding sequences, an increasing number of which have been shown to function as enhancers during early development. Despite their extreme conservation over 500 million years from humans to cartilaginous fish, these elements appear to be largely absent in invertebrates, and, to date, there has been little understanding of their mode of action or the evolutionary processes that have modelled them. We have now exploited emerging genomic sequence data for the sea lamprey, Petromyzon marinus, to explore the depth of conservation of this type of element in the earliest diverging extant vertebrate lineage, the jawless fish (agnathans). We searched for conserved non-coding elements (CNEs) at 13 human gene loci and identified lamprey elements associated with all but two of these gene regions. Although markedly shorter and less well conserved than within jawed vertebrates, identified lamprey CNEs are able to drive specific patterns of expression in zebrafish embryos, which are almost identical to those driven by the equivalent human elements. These CNEs are therefore a unique and defining characteristic of all vertebrates. Furthermore, alignment of lamprey and other vertebrate CNEs should permit the identification of persistent sequence signatures that are responsible for common patterns of expression and contribute to the elucidation of the regulatory language in CNEs. Identifying the core regulatory code for development, common to all vertebrates, provides a foundation upon which regulatory networks can be constructed and might also illuminate how large conserved regulatory sequence blocks evolve and become fixed in genomic DNA. PMID:20011110
A powerful approach reveals numerous expression quantitative trait haplotypes in multiple tissues.
Ying, Dingge; Li, Mulin Jun; Sham, Pak Chung; Li, Miaoxin
2018-04-26
Recently many studies showed single nucleotide polymorphisms (SNPs) affect gene expression and contribute to development of complex traits/diseases in a tissue context-dependent manner. However, little is known about haplotype's influence on gene expression and complex traits, which reflects the interaction effect between SNPs. In the present study, we firstly proposed a regulatory region guided eQTL haplotype association analysis approach, and then systematically investigate the expression quantitative trait loci (eQTL) haplotypes in 20 different tissues by the approach. The approach has a powerful design of reducing computational burden by the utilization of regulatory predictions for candidate SNP selection and multiple testing corrections on non-independent haplotypes. The application results in multiple tissues showed that haplotype-based eQTLs not only increased the number of eQTL genes in a tissue specific manner, but were also enriched in loci that associated with complex traits in a tissue-matched manner. In addition, we found that tag SNPs of eQTL haplotypes from whole blood were selectively enriched in certain combination of regulatory elements (e.g. promoters and enhancers) according to predicted chromatin states. In summary, this eQTL haplotype detection approach, together with the application results, shed insights into synergistic effect of sequence variants on gene expression and their susceptibility to complex diseases. The executable application "eHaplo" is implemented in Java and is publicly available at http://grass.cgs.hku.hk/limx/ehaplo/. jonsonfox@gmail.com, limiaoxin@mail.sysu.edu.cn. Supplementary data are available at Bioinformatics online.
41 CFR 102-2.140 - What elements of plain language appear in the FMR?
Code of Federal Regulations, 2011 CFR
2011-01-01
... language appear in the FMR? 102-2.140 Section 102-2.140 Public Contracts and Property Management Federal... MANAGEMENT REGULATION SYSTEM Plain Language Regulatory Style § 102-2.140 What elements of plain language appear in the FMR? The FMR is written in a “plain language” regulatory style. This style is easy to read...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nelson, J.A.; Reynolds-Kohler, C.; Smith, B.A.
1987-11-01
To analyze the significance of inducible DNase I-hypersensitive sites occurring in the 5'-flanking sequence of the major immediate-early gene of human cytomegalovirus (HCMV), various deleted portions of the HCMV immediate-early promoter regulatory region were attached to the chloramphenicol acetyltransferase (CAT) gene and assayed for activity in transiently transfected undifferentiated and differentiated human teratocarcinoma cells, Tera-2. Assays of progressive deletions in the promoter regulatory region indicated that removal of a 395-base-pair portion of this element (nucleotides -750 to -1145) containing two inducible DNase I sites which correlate with gene expression resulted in a 7.5-fold increase in CAT activity in undifferentiated cells.more » However, in permissive differentiated Tera-2, human foreskin fibroblast, and HeLa cells, removal of this regulatory region resulted in decreased activity. In addition, attachment of this HCMV upstream element to a homologous or heterologous promoter increased activity three-to fivefold in permissive cells. Therefore, a cis regulatory element exists 5' to the enhancer of the major immediate-early gene of HCMV. This element negatively modulates expression in nonpermissive cells but positively influences expression in permissive cells.« less
A methodology for post-EIS (environmental impact statement) monitoring
Marcus, Linda Graves
1979-01-01
A methodology for monitoring the impacts predicted in environmental impact statements (EIS's) was developed using the EIS on phosphate development in southeastern Idaho as a case study. A monitoring system based on this methodology: (1) coordinates a comprehensive, intergovernmental monitoring effort; (2) documents the major impacts that result, thereby improving the accuracy of impact predictions in future EIS's; (3) helps agencies control impacts by warning them when critical impact levels are reached and by providing feedback on the success of mitigating measures; and (4) limits monitoring data to the essential information that agencies need to carry out their regulatory and environmental protection responsibilities. The methodology is presented as flow charts accompanied by tables that describe the objectives, tasks, and products for each work element in the flow chart.
Freyre-González, Julio A; Alonso-Pavón, José A; Treviño-Quintanilla, Luis G; Collado-Vides, Julio
2008-10-27
Previous studies have used different methods in an effort to extract the modular organization of transcriptional regulatory networks. However, these approaches are not natural, as they try to cluster strongly connected genes into a module or locate known pleiotropic transcription factors in lower hierarchical layers. Here, we unravel the transcriptional regulatory network of Escherichia coli by separating it into its key elements, thus revealing its natural organization. We also present a mathematical criterion, based on the topological features of the transcriptional regulatory network, to classify the network elements into one of two possible classes: hierarchical or modular genes. We found that modular genes are clustered into physiologically correlated groups validated by a statistical analysis of the enrichment of the functional classes. Hierarchical genes encode transcription factors responsible for coordinating module responses based on general interest signals. Hierarchical elements correlate highly with the previously studied global regulators, suggesting that this could be the first mathematical method to identify global regulators. We identified a new element in transcriptional regulatory networks never described before: intermodular genes. These are structural genes that integrate, at the promoter level, signals coming from different modules, and therefore from different physiological responses. Using the concept of pleiotropy, we have reconstructed the hierarchy of the network and discuss the role of feedforward motifs in shaping the hierarchical backbone of the transcriptional regulatory network. This study sheds new light on the design principles underpinning the organization of transcriptional regulatory networks, showing a novel nonpyramidal architecture composed of independent modules globally governed by hierarchical transcription factors, whose responses are integrated by intermodular genes.
RNA sequencing uncovers antisense RNAs and novel small RNAs in Streptococcus pyogenes.
Le Rhun, Anaïs; Beer, Yan Yan; Reimegård, Johan; Chylinski, Krzysztof; Charpentier, Emmanuelle
2016-01-01
Streptococcus pyogenes is a human pathogen responsible for a wide spectrum of diseases ranging from mild to life-threatening infections. During the infectious process, the temporal and spatial expression of pathogenicity factors is tightly controlled by a complex network of protein and RNA regulators acting in response to various environmental signals. Here, we focus on the class of small RNA regulators (sRNAs) and present the first complete analysis of sRNA sequencing data in S. pyogenes. In the SF370 clinical isolate (M1 serotype), we identified 197 and 428 putative regulatory RNAs by visual inspection and bioinformatics screening of the sequencing data, respectively. Only 35 from the 197 candidates identified by visual screening were assigned a predicted function (T-boxes, ribosomal protein leaders, characterized riboswitches or sRNAs), indicating how little is known about sRNA regulation in S. pyogenes. By comparing our list of predicted sRNAs with previous S. pyogenes sRNA screens using bioinformatics or microarrays, 92 novel sRNAs were revealed, including antisense RNAs that are for the first time shown to be expressed in this pathogen. We experimentally validated the expression of 30 novel sRNAs and antisense RNAs. We show that the expression profile of 9 sRNAs including 2 predicted regulatory elements is affected by the endoribonucleases RNase III and/or RNase Y, highlighting the critical role of these enzymes in sRNA regulation.
The standard operating procedure of the DOE-JGI Microbial Genome Annotation Pipeline (MGAP v.4).
Huntemann, Marcel; Ivanova, Natalia N; Mavromatis, Konstantinos; Tripp, H James; Paez-Espino, David; Palaniappan, Krishnaveni; Szeto, Ernest; Pillay, Manoj; Chen, I-Min A; Pati, Amrita; Nielsen, Torben; Markowitz, Victor M; Kyrpides, Nikos C
2015-01-01
The DOE-JGI Microbial Genome Annotation Pipeline performs structural and functional annotation of microbial genomes that are further included into the Integrated Microbial Genome comparative analysis system. MGAP is applied to assembled nucleotide sequence datasets that are provided via the IMG submission site. Dataset submission for annotation first requires project and associated metadata description in GOLD. The MGAP sequence data processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNA features, as well as CRISPR elements. Structural annotation is followed by assignment of protein product names and functions.
Marbach, Daniel; Roy, Sushmita; Ay, Ferhat; Meyer, Patrick E.; Candeias, Rogerio; Kahveci, Tamer; Bristow, Christopher A.; Kellis, Manolis
2012-01-01
Gaining insights on gene regulation from large-scale functional data sets is a grand challenge in systems biology. In this article, we develop and apply methods for transcriptional regulatory network inference from diverse functional genomics data sets and demonstrate their value for gene function and gene expression prediction. We formulate the network inference problem in a machine-learning framework and use both supervised and unsupervised methods to predict regulatory edges by integrating transcription factor (TF) binding, evolutionarily conserved sequence motifs, gene expression, and chromatin modification data sets as input features. Applying these methods to Drosophila melanogaster, we predict ∼300,000 regulatory edges in a network of ∼600 TFs and 12,000 target genes. We validate our predictions using known regulatory interactions, gene functional annotations, tissue-specific expression, protein–protein interactions, and three-dimensional maps of chromosome conformation. We use the inferred network to identify putative functions for hundreds of previously uncharacterized genes, including many in nervous system development, which are independently confirmed based on their tissue-specific expression patterns. Last, we use the regulatory network to predict target gene expression levels as a function of TF expression, and find significantly higher predictive power for integrative networks than for motif or ChIP-based networks. Our work reveals the complementarity between physical evidence of regulatory interactions (TF binding, motif conservation) and functional evidence (coordinated expression or chromatin patterns) and demonstrates the power of data integration for network inference and studies of gene regulation at the systems level. PMID:22456606
The genome in three dimensions: a new frontier in human brain research.
Mitchell, Amanda C; Bharadwaj, Rahul; Whittle, Catheryne; Krueger, Winfried; Mirnics, Karoly; Hurd, Yasmin; Rasmussen, Theodore; Akbarian, Schahram
2014-06-15
Less than 1.5% of the human genome encodes protein. However, vast portions of the human genome are subject to transcriptional and epigenetic regulation, and many noncoding regulatory DNA elements are thought to regulate the spatial organization of interphase chromosomes. For example, chromosomal "loopings" are pivotal for the orderly process of gene expression, by enabling distal regulatory enhancer or silencer elements to directly interact with proximal promoter and transcription start sites, potentially bypassing hundreds of kilobases of interspersed sequence on the linear genome. To date, however, epigenetic studies in the human brain are mostly limited to the exploration of DNA methylation and posttranslational modifications of the nucleosome core histones. In contrast, very little is known about the regulation of supranucleosomal structures. Here, we show that chromosome conformation capture, a widely used approach to study higher-order chromatin, is applicable to tissue collected postmortem, thereby informing about genome organization in the human brain. We introduce chromosome conformation capture protocols for brain and compare higher-order chromatin structures at the chromosome 6p22.2-22.1 schizophrenia and bipolar disorder susceptibility locus, and additional neurodevelopmental risk genes, (DPP10, MCPH1) in adult prefrontal cortex and various cell culture systems, including neurons derived from reprogrammed skin cells. We predict that the exploration of three-dimensional genome architectures and function will open up new frontiers in human brain research and psychiatric genetics and provide novel insights into the epigenetic risk architectures of regulatory noncoding DNA. Copyright © 2014 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
Functionally conserved cis-regulatory elements of COL18A1 identified through zebrafish transgenesis.
Kague, Erika; Bessling, Seneca L; Lee, Josephine; Hu, Gui; Passos-Bueno, Maria Rita; Fisher, Shannon
2010-01-15
Type XVIII collagen is a component of basement membranes, and expressed prominently in the eye, blood vessels, liver, and the central nervous system. Homozygous mutations in COL18A1 lead to Knobloch Syndrome, characterized by ocular defects and occipital encephalocele. However, relatively little has been described on the role of type XVIII collagen in development, and nothing is known about the regulation of its tissue-specific expression pattern. We have used zebrafish transgenesis to identify and characterize cis-regulatory sequences controlling expression of the human gene. Candidate enhancers were selected from non-coding sequence associated with COL18A1 based on sequence conservation among mammals. Although these displayed no overt conservation with orthologous zebrafish sequences, four regions nonetheless acted as tissue-specific transcriptional enhancers in the zebrafish embryo, and together recapitulated the major aspects of col18a1 expression. Additional post-hoc computational analysis on positive enhancer sequences revealed alignments between mammalian and teleost sequences, which we hypothesize predict the corresponding zebrafish enhancers; for one of these, we demonstrate functional overlap with the orthologous human enhancer sequence. Our results provide important insight into the biological function and regulation of COL18A1, and point to additional sequences that may contribute to complex diseases involving COL18A1. More generally, we show that combining functional data with targeted analyses for phylogenetic conservation can reveal conserved cis-regulatory elements in the large number of cases where computational alignment alone falls short. Copyright 2009 Elsevier Inc. All rights reserved.
AtmiRNET: a web-based resource for reconstructing regulatory networks of Arabidopsis microRNAs.
Chien, Chia-Hung; Chiang-Hsieh, Yi-Fan; Chen, Yi-An; Chow, Chi-Nga; Wu, Nai-Yun; Hou, Ping-Fu; Chang, Wen-Chi
2015-01-01
Compared with animal microRNAs (miRNAs), our limited knowledge of how miRNAs involve in significant biological processes in plants is still unclear. AtmiRNET is a novel resource geared toward plant scientists for reconstructing regulatory networks of Arabidopsis miRNAs. By means of highlighted miRNA studies in target recognition, functional enrichment of target genes, promoter identification and detection of cis- and trans-elements, AtmiRNET allows users to explore mechanisms of transcriptional regulation and miRNA functions in Arabidopsis thaliana, which are rarely investigated so far. High-throughput next-generation sequencing datasets from transcriptional start sites (TSSs)-relevant experiments as well as five core promoter elements were collected to establish the support vector machine-based prediction model for Arabidopsis miRNA TSSs. Then, high-confidence transcription factors participate in transcriptional regulation of Arabidopsis miRNAs are provided based on statistical approach. Furthermore, both experimentally verified and putative miRNA-target interactions, whose validity was supported by the correlations between the expression levels of miRNAs and their targets, are elucidated for functional enrichment analysis. The inferred regulatory networks give users an intuitive insight into the pivotal roles of Arabidopsis miRNAs through the crosstalk between miRNA transcriptional regulation (upstream) and miRNA-mediate (downstream) gene circuits. The valuable information that is visually oriented in AtmiRNET recruits the scant understanding of plant miRNAs and will be useful (e.g. ABA-miR167c-auxin signaling pathway) for further research. Database URL: http://AtmiRNET.itps.ncku.edu.tw/ © The Author(s) 2015. Published by Oxford University Press.
Potter, Adam W; Blanchard, Laurie A; Friedl, Karl E; Cadarette, Bruce S; Hoyt, Reed W
2017-02-01
Physiological models provide useful summaries of complex interrelated regulatory functions. These can often be reduced to simple input requirements and simple predictions for pragmatic applications. This paper demonstrates this modeling efficiency by tracing the development of one such simple model, the Heat Strain Decision Aid (HSDA), originally developed to address Army needs. The HSDA, which derives from the Givoni-Goldman equilibrium body core temperature prediction model, uses 16 inputs from four elements: individual characteristics, physical activity, clothing biophysics, and environmental conditions. These inputs are used to mathematically predict core temperature (T c ) rise over time and can estimate water turnover from sweat loss. Based on a history of military applications such as derivation of training and mission planning tools, we conclude that the HSDA model is a robust integration of physiological rules that can guide a variety of useful predictions. The HSDA model is limited to generalized predictions of thermal strain and does not provide individualized predictions that could be obtained from physiological sensor data-driven predictive models. This fully transparent physiological model should be improved and extended with new findings and new challenging scenarios. Published by Elsevier Ltd.
Sanges, Remo; Hadzhiev, Yavor; Gueroult-Bellone, Marion; Roure, Agnes; Ferg, Marco; Meola, Nicola; Amore, Gabriele; Basu, Swaraj; Brown, Euan R.; De Simone, Marco; Petrera, Francesca; Licastro, Danilo; Strähle, Uwe; Banfi, Sandro; Lemaire, Patrick; Birney, Ewan; Müller, Ferenc; Stupka, Elia
2013-01-01
Co-option of cis-regulatory modules has been suggested as a mechanism for the evolution of expression sites during development. However, the extent and mechanisms involved in mobilization of cis-regulatory modules remains elusive. To trace the history of non-coding elements, which may represent candidate ancestral cis-regulatory modules affirmed during chordate evolution, we have searched for conserved elements in tunicate and vertebrate (Olfactores) genomes. We identified, for the first time, 183 non-coding sequences that are highly conserved between the two groups. Our results show that all but one element are conserved in non-syntenic regions between vertebrate and tunicate genomes, while being syntenic among vertebrates. Nevertheless, in all the groups, they are significantly associated with transcription factors showing specific functions fundamental to animal development, such as multicellular organism development and sequence-specific DNA binding. The majority of these regions map onto ultraconserved elements and we demonstrate that they can act as functional enhancers within the organism of origin, as well as in cross-transgenesis experiments, and that they are transcribed in extant species of Olfactores. We refer to the elements as ‘Olfactores conserved non-coding elements’. PMID:23393190
Wnt-mediated activation of NeuroD1 and retro-elements during adult neurogenesis.
Kuwabara, Tomoko; Hsieh, Jenny; Muotri, Alysson; Yeo, Gene; Warashina, Masaki; Lie, Dieter Chichung; Moore, Lynne; Nakashima, Kinichi; Asashima, Makoto; Gage, Fred H
2009-09-01
In adult hippocampus, new neurons are continuously generated from neural stem cells (NSCs), but the molecular mechanisms regulating adult neurogenesis remain elusive. We found that Wnt signaling, together with the removal of Sox2, triggered the expression of NeuroD1 in mice. This transcriptional regulatory mechanism was dependent on a DNA element containing overlapping Sox2 and T-cell factor/lymphoid enhancer factor (TCF/LEF)-binding sites (Sox/LEF) in the promoter. Notably, Sox/LEF sites were also found in long interspersed nuclear element 1 (LINE-1) elements, consistent with their critical roles in the transition of NSCs to proliferating neuronal progenitors. Our results describe a previously unknown Wnt-mediated regulatory mechanism that simultaneously coordinates activation of NeuroD1 and LINE-1, which is important for adult neurogenesis and survival of neuronal progenitors. Moreover, the discovery that LINE-1 retro-elements embedded in the mammalian genome can function as bi-directional promoters suggests that Sox/LEF regulatory sites may represent a general mechanism, at least in part, for relaying environmental signals to other nearby loci to promote adult hippocampal neurogenesis.
Zhang, Monica; Song, Lingyun; Lee, Bum-Kyu; Iyer, Vishwanath R.; Furey, Terrence S.; Crawford, Gregory E.; Yan, Hai; He, Yiping
2014-01-01
Despite an emerging understanding of the genetic alterations giving rise to various tumors, the mechanisms whereby most oncogenes are overexpressed remain unclear. Here we have utilized an integrated approach of genomewide regulatory element mapping via DNase-seq followed by conventional reporter assays and transcription factor binding site discovery to characterize the transcriptional regulation of the medulloblastoma oncogene Orthodenticle Homeobox 2 (OTX2). Through these studies we have revealed that OTX2 is differentially regulated in medulloblastoma at the level of chromatin accessibility, which is in part mediated by DNA methylation. In cell lines exhibiting chromatin accessibility of OTX2 regulatory regions, we found that autoregulation maintains OTX2 expression. Comparison of medulloblastoma regulatory elements with those of the developing brain reveals that these tumors engage a developmental regulatory program to drive OTX2 transcription. Finally, we have identified a transcriptional regulatory element mediating retinoid-induced OTX2 repression in these tumors. This work characterizes for the first time the mechanisms of OTX2 overexpression in medulloblastoma. Furthermore, this study establishes proof of principle for applying ENCODE datasets towards the characterization of upstream trans-acting factors mediating expression of individual genes. PMID:25198066
Modular arrangement of regulatory RNA elements.
Roßmanith, Johanna; Narberhaus, Franz
2017-03-04
Due to their simple architecture and control mechanism, regulatory RNA modules are attractive building blocks in synthetic biology. This is especially true for riboswitches, which are natural ligand-binding regulators of gene expression. The discovery of various tandem riboswitches inspired the design of combined RNA modules with activities not yet found in nature. Riboswitches were placed in tandem or in combination with a ribozyme or temperature-responsive RNA thermometer resulting in new functionalities. Here, we compare natural examples of tandem riboswitches with recently designed artificial RNA regulators suggesting substantial modularity of regulatory RNA elements. Challenges associated with modular RNA design are discussed.
Investigating the transcriptional control of cardiovascular development
Kathiriya, Irfan S.; Nora, Elphege P.; Bruneau, Benoit G.
2015-01-01
Transcriptional regulation of thousands of genes instructs complex morphogenetic and molecular events for heart development. Cardiac transcription factors (TFs) choreograph gene expression at each stage of differentiation by interacting with co-factors, including chromatin-modifying enzymes, and by binding to a constellation of regulatory DNA elements. Here, we present salient examples relevant to cardiovascular development and heart disease and review techniques that can sharpen our understanding of cardiovascular biology. We discuss the interplay between cardiac TFs, cis-regulatory elements and chromatin as dynamic regulatory networks, to orchestrate sequential deployment of the cardiac gene expression program. PMID:25677518
DOE Office of Scientific and Technical Information (OSTI.GOV)
Banerjee, Poulabi; Bahlo, Melanie; Schwartz, Jody R.
2002-01-01
Genome wide disease association analysis using SNPs is being explored as a method for dissecting complex genetic traits and a vast number of SNPs have been generated for this purpose. As there are cost and throughput limitations of genotyping large numbers of SNPs and statistical issues regarding the large number of dependent tests on the same data set, to make association analysis practical it has been proposed that SNPs should be prioritized based on likely functional importance. The most easily identifiable functional SNPs are coding SNPs (cSNPs) and accordingly cSNPs have been screened in a number of studies. SNPs inmore » gene regulatory sequences embedded in noncoding DNA are another class of SNPs suggested for prioritization due to their predicted quantitative impact on gene expression. The main challenge in evaluating these SNPs, in contrast to cSNPs is a lack of robust algorithms and databases for recognizing regulatory sequences in noncoding DNA. Approaches that have been previously used to delineate noncoding sequences with gene regulatory activity include cross-species sequence comparisons and the search for sequences recognized by transcription factors. We combined these two methods to sift through mouse human genomic sequences to identify putative gene regulatory elements and subsequently localized SNPs within these sequences in a 1 Megabase (Mb) region of human chromosome 5q31, orthologous to mouse chromosome 11 containing the Interleukin cluster.« less
Dynamics and function of distal regulatory elements during neurogenesis and neuroplasticity
Thakurela, Sudhir; Sahu, Sanjeeb Kumar; Garding, Angela; Tiwari, Vijay K.
2015-01-01
Gene regulation in mammals involves a complex interplay between promoters and distal regulatory elements that function in concert to drive precise spatiotemporal gene expression programs. However, the dynamics of the distal gene regulatory landscape and its function in the transcriptional reprogramming that underlies neurogenesis and neuronal activity remain largely unknown. Here, we performed a combinatorial analysis of genome-wide data sets for chromatin accessibility (FAIRE-seq) and the enhancer mark H3K27ac, revealing the highly dynamic nature of distal gene regulation during neurogenesis, which gets progressively restricted to distinct genomic regions as neurons acquire a post-mitotic, terminally differentiated state. We further find that the distal accessible and active regions serve as target sites for distinct transcription factors that function in a stage-specific manner to contribute to the transcriptional program underlying neuronal commitment and maturation. Mature neurons respond to a sustained activity of NMDA receptors by epigenetic reprogramming at a large number of distal regulatory regions as well as dramatic reorganization of super-enhancers. Such massive remodeling of the distal regulatory landscape in turn results in a transcriptome that confers a transient loss of neuronal identity and gain of cellular plasticity. Furthermore, NMDA receptor activity also induces many novel prosurvival genes that function in neuroprotective pathways. Taken together, these findings reveal the dynamics of the distal regulatory landscape during neurogenesis and uncover novel regulatory elements that function in concert with epigenetic mechanisms and transcription factors to generate the transcriptome underlying neuronal development and activity. PMID:26170447
Evolution of UCP1 Transcriptional Regulatory Elements Across the Mammalian Phylogeny
Gaudry, Michael J.; Campbell, Kevin L.
2017-01-01
Uncoupling protein 1 (UCP1) permits non-shivering thermogenesis (NST) when highly expressed in brown adipose tissue (BAT) mitochondria. Exclusive to placental mammals, BAT has commonly been regarded to be advantageous for thermoregulation in hibernators, small-bodied species, and the neonates of larger species. While numerous regulatory control motifs associated with UCP1 transcription have been proposed for murid rodents, it remains unclear whether these are conserved across the eutherian mammal phylogeny and hence essential for UCP1 expression. To address this shortcoming, we conducted a broad comparative survey of putative UCP1 transcriptional regulatory elements in 139 mammals (135 eutherians). We find no evidence for presence of a UCP1 enhancer in monotremes and marsupials, supporting the hypothesis that this control region evolved in a stem eutherian ancestor. We additionally reveal that several putative promoter elements (e.g., CRE-4, CCAAT) identified in murid rodents are not conserved among BAT-expressing eutherians, and together with the putative regulatory region (PRR) and CpG island do not appear to be crucial for UCP1 expression. The specificity and importance of the upTRE, dnTRE, URE1, CRE-2, RARE-2, NBRE, BRE-1, and BRE-2 enhancer elements first described from rats and mice are moreover uncertain as these motifs differ substantially—but generally remain highly conserved—in other BAT-expressing eutherians. Other UCP1 enhancer motifs (CRE-3, PPRE, and RARE-3) as well as the TATA box are also highly conserved in nearly all eutherian lineages with an intact UCP1. While these transcriptional regulatory motifs are generally also maintained in species where this gene is pseudogenized, the loss or degeneration of key basal promoter (e.g., TATA box) and enhancer elements in other UCP1-lacking lineages make it unlikely that the enhancer region is pleiotropic (i.e., co-regulates additional genes). Importantly, differential losses of (or mutations within) putative regulatory elements among the eutherian lineages with an intact UCP1 suggests that the transcriptional control of gene expression is not highly conserved in this mammalian clade. PMID:28979209
Intrinsic limits to gene regulation by global crosstalk
Friedlander, Tamar; Prizak, Roshan; Guet, Călin C.; Barton, Nicholas H.; Tkačik, Gašper
2016-01-01
Gene regulation relies on the specificity of transcription factor (TF)–DNA interactions. Limited specificity may lead to crosstalk: a regulatory state in which a gene is either incorrectly activated due to noncognate TF–DNA interactions or remains erroneously inactive. As each TF can have numerous interactions with noncognate cis-regulatory elements, crosstalk is inherently a global problem, yet has previously not been studied as such. We construct a theoretical framework to analyse the effects of global crosstalk on gene regulation. We find that crosstalk presents a significant challenge for organisms with low-specificity TFs, such as metazoans. Crosstalk is not easily mitigated by known regulatory schemes acting at equilibrium, including variants of cooperativity and combinatorial regulation. Our results suggest that crosstalk imposes a previously unexplored global constraint on the functioning and evolution of regulatory networks, which is qualitatively distinct from the known constraints that act at the level of individual gene regulatory elements. PMID:27489144
Florman, H M; First, N L
1988-08-01
The effects of accessory sex gland secretions on the zona pellucida-induced acrosome reaction of bovine spermatozoa were investigated. Soluble extracts of zonae pellucidae initiated exocytosis in ejaculated spermatozoa. This process had an ED50 of 20 ng/microliter zona pellucida protein and saturated at 50 ng/microliter (Florman and First, 1988. Dev. Biol. 128, 453-463). In epididymal sperm this dose-response relationship was shifted toward greater agonist concentrations by at least a factor of 10(3). Reconstitution of high potency agonist response was achieved in vitro by incubation of epididymal sperm with bovine seminal plasma. Reconstitution was dependent on the seminal plasma protein concentration. The ED50 of this process was 62 micrograms protein/10(8) sperm and saturation was observed with 124 micrograms protein/10(8) sperm. Agonist responses in reconstituted epididymal sperm and in ejaculated sperm were indistinguishable with regard to dependence on the zona pellucida protein concentration and the kinetics of induced acrosome reactions. Kinetic studies suggest that reconstitution is due to adsorption of regulatory factors from seminal plasma. In addition to the positive regulatory elements responsible for reconstituting activity, seminal plasma also contains negative regulatory elements which inhibit agonist response. These negative factors are inactivated during sperm capacitation, permitting the expression of positive regulators. Acting together, these regulatory elements could coordinate high affinity agonist response with the availability of eggs in vivo.
Lung evolution as a cipher for physiology
Torday, J. S.; Rehan, V. K.
2009-01-01
In the postgenomic era, we need an algorithm to readily translate genes into physiologic principles. The failure to advance biomedicine is due to the false hope raised in the wake of the Human Genome Project (HGP) by the promise of systems biology as a ready means of reconstructing physiology from genes. like the atom in physics, the cell, not the gene, is the smallest completely functional unit of biology. Trying to reassemble gene regulatory networks without accounting for this fundamental feature of evolution will result in a genomic atlas, but not an algorithm for functional genomics. For example, the evolution of the lung can be “deconvoluted” by applying cell-cell communication mechanisms to all aspects of lung biology development, homeostasis, and regeneration/repair. Gene regulatory networks common to these processes predict ontogeny, phylogeny, and the disease-related consequences of failed signaling. This algorithm elucidates characteristics of vertebrate physiology as a cascade of emergent and contingent cellular adaptational responses. By reducing complex physiological traits to gene regulatory networks and arranging them hierarchically in a self-organizing map, like the periodic table of elements in physics, the first principles of physiology will emerge. PMID:19366785
Epigenetic functions enriched in transcription factors binding to mouse recombination hotspots.
Wu, Min; Kwoh, Chee-Keong; Przytycka, Teresa M; Li, Jing; Zheng, Jie
2012-06-21
The regulatory mechanism of recombination is a fundamental problem in genomics, with wide applications in genome-wide association studies, birth-defect diseases, molecular evolution, cancer research, etc. In mammalian genomes, recombination events cluster into short genomic regions called "recombination hotspots". Recently, a 13-mer motif enriched in hotspots is identified as a candidate cis-regulatory element of human recombination hotspots; moreover, a zinc finger protein, PRDM9, binds to this motif and is associated with variation of recombination phenotype in human and mouse genomes, thus is a trans-acting regulator of recombination hotspots. However, this pair of cis and trans-regulators covers only a fraction of hotspots, thus other regulators of recombination hotspots remain to be discovered. In this paper, we propose an approach to predicting additional trans-regulators from DNA-binding proteins by comparing their enrichment of binding sites in hotspots. Applying this approach on newly mapped mouse hotspots genome-wide, we confirmed that PRDM9 is a major trans-regulator of hotspots. In addition, a list of top candidate trans-regulators of mouse hotspots is reported. Using GO analysis we observed that the top genes are enriched with function of histone modification, highlighting the epigenetic regulatory mechanisms of recombination hotspots.
Epigenetic functions enriched in transcription factors binding to mouse recombination hotspots
2012-01-01
The regulatory mechanism of recombination is a fundamental problem in genomics, with wide applications in genome-wide association studies, birth-defect diseases, molecular evolution, cancer research, etc. In mammalian genomes, recombination events cluster into short genomic regions called "recombination hotspots". Recently, a 13-mer motif enriched in hotspots is identified as a candidate cis-regulatory element of human recombination hotspots; moreover, a zinc finger protein, PRDM9, binds to this motif and is associated with variation of recombination phenotype in human and mouse genomes, thus is a trans-acting regulator of recombination hotspots. However, this pair of cis and trans-regulators covers only a fraction of hotspots, thus other regulators of recombination hotspots remain to be discovered. In this paper, we propose an approach to predicting additional trans-regulators from DNA-binding proteins by comparing their enrichment of binding sites in hotspots. Applying this approach on newly mapped mouse hotspots genome-wide, we confirmed that PRDM9 is a major trans-regulator of hotspots. In addition, a list of top candidate trans-regulators of mouse hotspots is reported. Using GO analysis we observed that the top genes are enriched with function of histone modification, highlighting the epigenetic regulatory mechanisms of recombination hotspots. PMID:22759569
Benner, Christopher; Hutt, Kasey R.; Stunnenberg, Rieka; Garcia-Bassets, Ivan
2013-01-01
Genome-wide maps of DNase I hypersensitive sites (DHSs) reveal that most human promoters contain perpetually active cis-regulatory elements between −150 bp and +50 bp (−150/+50 bp) relative to the transcription start site (TSS). Transcription factors (TFs) recruit cofactors (chromatin remodelers, histone/protein-modifying enzymes, and scaffold proteins) to these elements in order to organize the local chromatin structure and coordinate the balance of post-translational modifications nearby, contributing to the overall regulation of transcription. However, the rules of TF-mediated cofactor recruitment to the −150/+50 bp promoter regions remain poorly understood. Here, we provide evidence for a general model in which a series of cis-regulatory elements (here termed ‘cardinal’ motifs) prefer acting individually, rather than in fixed combinations, within the −150/+50 bp regions to recruit TFs that dictate cofactor signatures distinctive of specific promoter subsets. Subsequently, human promoters can be subclassified based on the presence of cardinal elements and their associated cofactor signatures. In this study, furthermore, we have focused on promoters containing the nuclear respiratory factor 1 (NRF1) motif as the cardinal cis-regulatory element and have identified the pervasive association of NRF1 with the cofactor lysine-specific demethylase 1 (LSD1/KDM1A). This signature might be distinctive of promoters regulating nuclear-encoded mitochondrial and other particular genes in at least some cells. Together, we propose that decoding a signature-based, expanded model of control at proximal promoter regions should lead to a better understanding of coordinated regulation of gene transcription. PMID:24244184
Discrete Dynamics Model for the Speract-Activated Ca2+ Signaling Network Relevant to Sperm Motility
Espinal, Jesús; Aldana, Maximino; Guerrero, Adán; Wood, Christopher
2011-01-01
Understanding how spermatozoa approach the egg is a central biological issue. Recently a considerable amount of experimental evidence has accumulated on the relation between oscillations in intracellular calcium ion concentration ([Ca]) in the sea urchin sperm flagellum, triggered by peptides secreted from the egg, and sperm motility. Determination of the structure and dynamics of the signaling pathway leading to these oscillations is a fundamental problem. However, a biochemically based formulation for the comprehension of the molecular mechanisms operating in the axoneme as a response to external stimulus is still lacking. Based on experiments on the S. purpuratus sea urchin spermatozoa, we propose a signaling network model where nodes are discrete variables corresponding to the pathway elements and the signal transmission takes place at discrete time intervals according to logical rules. The validity of this model is corroborated by reproducing previous empirically determined signaling features. Prompted by the model predictions we performed experiments which identified novel characteristics of the signaling pathway. We uncovered the role of a high voltage-activated channel as a regulator of the delay in the onset of fluctuations after activation of the signaling cascade. This delay time has recently been shown to be an important regulatory factor for sea urchin sperm reorientation. Another finding is the participation of a voltage-dependent calcium-activated channel in the determination of the period of the fluctuations. Furthermore, by analyzing the spread of network perturbations we find that it operates in a dynamically critical regime. Our work demonstrates that a coarse-grained approach to the dynamics of the signaling pathway is capable of revealing regulatory sperm navigation elements and provides insight, in terms of criticality, on the concurrence of the high robustness and adaptability that the reproduction processes are predicted to have developed throughout evolution. PMID:21857937
The impact of transposable elements on mammalian development
Garcia-Perez, Jose L.; Widmann, Thomas J.; Adams, Ian R.
2018-01-01
Summary Despite often being classified as selfish or junk DNA, transposable elements (TEs) are a group of abundant genetic sequences that significantly impact on mammalian development and genome regulation. In recent years, our understanding of how pre-existing TEs affect genome architecture, gene regulatory networks and protein function during mammalian embryogenesis has dramatically expanded. In addition, the mobilization of active TEs in selected cell types has been shown to generate genetic variation during development and in fully differentiated tissues. Importantly, the ongoing domestication and evolution of TEs appears to provide a rich source of regulatory elements, functional modules and genetic variation that fuels the evolution of mammalian developmental processes. Here, we review the functional impact that TEs exert on mammalian developmental processes and how the somatic activity of TEs can influence gene regulatory networks. PMID:27875251
Disrupted auto-regulation of the spliceosomal gene SNRPB causes cerebro–costo–mandibular syndrome
Lynch, Danielle C.; Revil, Timothée; Schwartzentruber, Jeremy; Bhoj, Elizabeth J.; Innes, A. Micheil; Lamont, Ryan E.; Lemire, Edmond G.; Chodirker, Bernard N.; Taylor, Juliet P.; Zackai, Elaine H.; McLeod, D. Ross; Kirk, Edwin P.; Hoover-Fong, Julie; Fleming, Leah; Savarirayan, Ravi; Boycott, Kym; MacKenzie, Alex; Brudno, Michael; Bulman, Dennis; Dyment, David; Majewski, Jacek; Jerome-Majewska, Loydie A.; Parboosingh, Jillian S.; Bernier, Francois P.
2014-01-01
Elucidating the function of highly conserved regulatory sequences is a significant challenge in genomics today. Certain intragenic highly conserved elements have been associated with regulating levels of core components of the spliceosome and alternative splicing of downstream genes. Here we identify mutations in one such element, a regulatory alternative exon of SNRPB as the cause of cerebro–costo–mandibular syndrome. This exon contains a premature termination codon that triggers nonsense-mediated mRNA decay when included in the transcript. These mutations cause increased inclusion of the alternative exon and decreased overall expression of SNRPB. We provide evidence for the functional importance of this conserved intragenic element in the regulation of alternative splicing and development, and suggest that the evolution of such a regulatory mechanism has contributed to the complexity of mammalian development. PMID:25047197
Disrupted auto-regulation of the spliceosomal gene SNRPB causes cerebro-costo-mandibular syndrome.
Lynch, Danielle C; Revil, Timothée; Schwartzentruber, Jeremy; Bhoj, Elizabeth J; Innes, A Micheil; Lamont, Ryan E; Lemire, Edmond G; Chodirker, Bernard N; Taylor, Juliet P; Zackai, Elaine H; McLeod, D Ross; Kirk, Edwin P; Hoover-Fong, Julie; Fleming, Leah; Savarirayan, Ravi; Majewski, Jacek; Jerome-Majewska, Loydie A; Parboosingh, Jillian S; Bernier, Francois P
2014-07-22
Elucidating the function of highly conserved regulatory sequences is a significant challenge in genomics today. Certain intragenic highly conserved elements have been associated with regulating levels of core components of the spliceosome and alternative splicing of downstream genes. Here we identify mutations in one such element, a regulatory alternative exon of SNRPB as the cause of cerebro-costo-mandibular syndrome. This exon contains a premature termination codon that triggers nonsense-mediated mRNA decay when included in the transcript. These mutations cause increased inclusion of the alternative exon and decreased overall expression of SNRPB. We provide evidence for the functional importance of this conserved intragenic element in the regulation of alternative splicing and development, and suggest that the evolution of such a regulatory mechanism has contributed to the complexity of mammalian development.
Kumar, V; Wong, D T; Pasion, S G; Biswas, D K
1987-12-08
The prolactin-nonproducing (PRL-) GH cell strains (rat pituitary tumor cells in culture). GH12C1 and F1BGH12C1, do not respond to steroid hormones estradiol or hydrocortisone (HC). However, the stimulatory effect of estradiol and the inhibitory effect of hydrocortisone on prolactin synthesis can be demonstrated in the prolactin-producing GH cell strain, GH4C1. In this investigation we have examined the 5' end flanking region of rat prolactin (rat PRL) gene of steroid-responsive, GH4C1 cells to identify the positive and negative regulatory elements and to verify the status of these elements in steroid-nonresponsive F1BGH12C1 cells. Results presented in this report demonstrate that the basel level expression of the co-transferred Neo gene (neomycin phosphoribosyl transferase) is modulated by the distal upstream regulatory elements of rat PRL gene in response to steroid hormones. The expression of adjacent Neo gene is inhibited by dexamethasone and is stimulated by estradiol in transfectants carrying distal regulatory elements (SRE) of steroid-responsive cells. These responses are not observed in transfectants with the rat PRL upstream sequences derived from steroid-nonresponsive cells. The basal level expression of the host cell alpha-2 tubulin gene is not affected by dexamethasone. We report here the identification of the distal steroid regulatory element (SRE) located between 3.8 and 7.8 kb upstream of the transcription initiation site of rat PRL gene. Both the positive and the negative effects of steroid hormones can be identified within this upstream sequence. This distal SRE appears to be nonfunctional in steroid-nonresponsive cells. Though the proximal SRE is functional, the defect in the distal SRE makes the GH substrain nonresponsive to steroid hormones. These results suggest that both the proximal and the distal SREs are essential for the mediation of action of steroid hormones in GH cells.
Manimaran, P; Raghurami Reddy, M; Bhaskar Rao, T; Mangrauthia, Satendra K; Sundaram, R M; Balachandran, S M
2015-12-01
Pollen-specific expression. Promoters comprise of various cis-regulatory elements which control development and physiology of plants by regulating gene expression. To understand the promoter specificity and also identification of functional cis-acting elements, progressive 5' deletion analysis of the promoter fragments is widely used. We have evaluated the activity of regulatory elements of 5' promoter deletion sequences of anther-specific gene OSIPP3, viz. OSIPP3-∆1 (1504 bp), OSIPP3-∆2 (968 bp), OSIPP3-∆3 (388 bp) and OSIPP3-∆4 (286 bp) through the expression of transgene GUS in rice. In silico analysis of 1504-bp sequence harboring different copy number of cis-acting regulatory elements such as POLLENLELAT52, GTGANTG10, enhancer element of LAT52 and LAT56 indicated that they were essential for high level of expression in pollen. Histochemical GUS analysis of the transgenic plants revealed that 1504- and 968-bp fragments directed GUS expression in roots and anthers, while the 388- and 286-bp fragments restricted the GUS expression to only pollen, of which 388 bp conferred strong GUS expression. Further, GUS staining analysis of different panicle development stages (P1-P6) confirmed that the GUS gene was preferentially expressed only at P6 stage (late pollen stage). The qRT-PCR analysis of GUS transcript revealed 23-fold higher expression of GUS transcript in OSIPP3-Δ1 followed by OSIPP3-Δ2 (eightfold) and OSIPP3-Δ3 (threefold) when compared to OSIPP3-Δ4. Based on our results, we proposed that among the two smaller fragments, the 388-bp upstream regulatory region could be considered as a promising candidate for pollen-specific expression of agronomically important transgenes in rice.
AFO Manure Management - Minnesota: Feedlot Registration
Compendium of State Approaches for Manure Management, Part A -- Example of program features for manure management that have a regulatory basis, such as permit provisions and other regulatory program elements.
Deciphering RNA regulatory elements in trypanosomatids: one piece at a time or genome-wide?
Gazestani, Vahid H; Lu, Zhiquan; Salavati, Reza
2014-05-01
Morphological and metabolic changes in the life cycle of Trypanosoma brucei are accomplished by precise regulation of hundreds of genes. In the absence of transcriptional control, RNA-binding proteins (RBPs) shape the structure of gene regulatory maps in this organism, but our knowledge about their target RNAs, binding sites, and mechanisms of action is far from complete. Although recent technological advances have revolutionized the RBP-based approaches, the main framework for the RNA regulatory element (RRE)-based approaches has not changed over the last two decades in T. brucei. In this Opinion, after highlighting the current challenges in RRE inference, we explain some genome-wide solutions that can significantly boost our current understanding about gene regulatory networks in T. brucei. Copyright © 2014 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tessum, C. W.; Hill, J. D.; Marshall, J. D.
We present results from and evaluate the performance of a 12-month, 12 km horizontal resolution year 2005 air pollution simulation for the contiguous United States using the WRF-Chem (Weather Research and Forecasting with Chemistry) meteorology and chemical transport model (CTM). We employ the 2005 US National Emissions Inventory, the Regional Atmospheric Chemistry Mechanism (RACM), and the Modal Aerosol Dynamics Model for Europe (MADE) with a volatility basis set (VBS) secondary aerosol module. Overall, model performance is comparable to contemporary modeling efforts used for regulatory and health-effects analysis, with an annual average daytime ozone (O 3) mean fractional bias (MFB) ofmore » 12% and an annual average fine particulate matter (PM 2.5) MFB of −1%. WRF-Chem, as configured here, tends to overpredict total PM 2.5 at some high concentration locations and generally overpredicts average 24 h O 3 concentrations. Performance is better at predicting daytime-average and daily peak O 3 concentrations, which are more relevant for regulatory and health effects analyses relative to annual average values. Predictive performance for PM 2.5 subspecies is mixed: the model overpredicts particulate sulfate (MFB = 36%), underpredicts particulate nitrate (MFB = −110%) and organic carbon (MFB = −29%), and relatively accurately predicts particulate ammonium (MFB = 3%) and elemental carbon (MFB = 3%), so that the accuracy in total PM 2.5 predictions is to some extent a function of offsetting over- and underpredictions of PM 2.5 subspecies. Model predictive performance for PM 2.5 and its subspecies is in general worse in winter and in the western US than in other seasons and regions, suggesting spatial and temporal opportunities for future WRF-Chem model development and evaluation.« less
Tessum, C. W.; Hill, J. D.; Marshall, J. D.
2015-04-07
We present results from and evaluate the performance of a 12-month, 12 km horizontal resolution year 2005 air pollution simulation for the contiguous United States using the WRF-Chem (Weather Research and Forecasting with Chemistry) meteorology and chemical transport model (CTM). We employ the 2005 US National Emissions Inventory, the Regional Atmospheric Chemistry Mechanism (RACM), and the Modal Aerosol Dynamics Model for Europe (MADE) with a volatility basis set (VBS) secondary aerosol module. Overall, model performance is comparable to contemporary modeling efforts used for regulatory and health-effects analysis, with an annual average daytime ozone (O 3) mean fractional bias (MFB) ofmore » 12% and an annual average fine particulate matter (PM 2.5) MFB of −1%. WRF-Chem, as configured here, tends to overpredict total PM 2.5 at some high concentration locations and generally overpredicts average 24 h O 3 concentrations. Performance is better at predicting daytime-average and daily peak O 3 concentrations, which are more relevant for regulatory and health effects analyses relative to annual average values. Predictive performance for PM 2.5 subspecies is mixed: the model overpredicts particulate sulfate (MFB = 36%), underpredicts particulate nitrate (MFB = −110%) and organic carbon (MFB = −29%), and relatively accurately predicts particulate ammonium (MFB = 3%) and elemental carbon (MFB = 3%), so that the accuracy in total PM 2.5 predictions is to some extent a function of offsetting over- and underpredictions of PM 2.5 subspecies. Model predictive performance for PM 2.5 and its subspecies is in general worse in winter and in the western US than in other seasons and regions, suggesting spatial and temporal opportunities for future WRF-Chem model development and evaluation.« less
PreCisIon: PREdiction of CIS-regulatory elements improved by gene's positION.
Elati, Mohamed; Nicolle, Rémy; Junier, Ivan; Fernández, David; Fekih, Rim; Font, Julio; Képès, François
2013-02-01
Conventional approaches to predict transcriptional regulatory interactions usually rely on the definition of a shared motif sequence on the target genes of a transcription factor (TF). These efforts have been frustrated by the limited availability and accuracy of TF binding site motifs, usually represented as position-specific scoring matrices, which may match large numbers of sites and produce an unreliable list of target genes. To improve the prediction of binding sites, we propose to additionally use the unrelated knowledge of the genome layout. Indeed, it has been shown that co-regulated genes tend to be either neighbors or periodically spaced along the whole chromosome. This study demonstrates that respective gene positioning carries significant information. This novel type of information is combined with traditional sequence information by a machine learning algorithm called PreCisIon. To optimize this combination, PreCisIon builds a strong gene target classifier by adaptively combining weak classifiers based on either local binding sequence or global gene position. This strategy generically paves the way to the optimized incorporation of any future advances in gene target prediction based on local sequence, genome layout or on novel criteria. With the current state of the art, PreCisIon consistently improves methods based on sequence information only. This is shown by implementing a cross-validation analysis of the 20 major TFs from two phylogenetically remote model organisms. For Bacillus subtilis and Escherichia coli, respectively, PreCisIon achieves on average an area under the receiver operating characteristic curve of 70 and 60%, a sensitivity of 80 and 70% and a specificity of 60 and 56%. The newly predicted gene targets are demonstrated to be functionally consistent with previously known targets, as assessed by analysis of Gene Ontology enrichment or of the relevant literature and databases.
Potvin, Eric; Beuret, Laurent; Cadrin-Girard, Jean-François; Carter, Marcelle; Roy, Sophie; Tremblay, Michel; Charron, Jean
2010-11-01
The precise expression of the N-myc proto-oncogene is essential for normal mammalian development, whereas altered N-myc gene regulation is known to be a determinant factor in tumor formation. Using transgenic mouse embryos, we show that N-myc sequences from kb -8.7 to kb +7.2 are sufficient to reproduce the N-myc embryonic expression profile in developing branchial arches and limb buds. These sequences encompass several regulatory elements dispersed throughout the N-myc locus, including an upstream limb bud enhancer, a downstream somite enhancer, a branchial arch enhancer in the second intron, and a negative regulatory element in the first intron. N-myc expression in the limb buds is under the dominant control of the limb bud enhancer. The expression in the branchial arches necessitates the interplay of three regulatory domains. The branchial arch enhancer cooperates with the somite enhancer region to prevent an inhibitory activity contained in the first intron. The characterization of the branchial arch enhancer has revealed a specific role of the transcription factor GATA3 in the regulation of N-myc expression. Together, these data demonstrate that correct N-myc developmental expression is achieved via cooperation of multiple positive and negative regulatory elements.
Zerenturk, Eser J; Sharpe, Laura J; Brown, Andrew J
2012-10-01
3β-Hydroxysterol Δ24-reductase (DHCR24) catalyzes a final step in cholesterol synthesis, and has been ascribed diverse functions, such as being anti-apoptotic and anti-inflammatory. How this enzyme is regulated transcriptionally by sterols is currently unclear. Some studies have suggested that its expression is regulated by Sterol Regulatory Element Binding Proteins (SREBPs) while another suggests it is through the Liver X Receptor (LXR). However, these transcription factors have opposing effects on cellular sterol levels, so it is likely that one predominates. Here we establish that sterol regulation of DHCR24 occurs predominantly through SREBP-2, and identify the particular region of the DHCR24 promoter to which SREBP-2 binds. We demonstrate that sterol regulation is mediated by two sterol regulatory elements (SREs) in the promoter of the gene, assisted by two nearby NF-Y binding sites. Moreover, we present evidence that the dual SREs work cooperatively to regulate DHCR24 expression by comparison to two known SREBP target genes, the LDL receptor with one SRE, and farnesyl-diphosphate farnesyltransferase 1, with two SREs. Copyright © 2012 Elsevier B.V. All rights reserved.
12 CFR 324.63 - Disclosures by FDIC-supervised institutions described in § 324.61.
Code of Federal Regulations, 2014 CFR
2014-01-01
..., tier 2 capital, tier 1 and total capital ratios, including the regulatory capital elements and all the regulatory adjustments and deductions needed to calculate the numerator of such ratios; (2) Total risk... risk-weighted assets; (3) Regulatory capital ratios during any transition periods, including a...
12 CFR 217.63 - Disclosures by Board-regulated institutions described in § 217.61.
Code of Federal Regulations, 2014 CFR
2014-01-01
... and total capital ratios, including the regulatory capital elements and all the regulatory adjustments and deductions needed to calculate the numerator of such ratios; (2) Total risk-weighted assets...; (3) Regulatory capital ratios during any transition periods, including a description of all the...
Developmental Control of NRAMP1 (SLC11A1) Expression in Professional Phagocytes.
Cellier, Mathieu F M
2017-05-03
NRAMP1 (SLC11A1) is a professional phagocyte membrane importer of divalent metals that contributes to iron recycling at homeostasis and to nutritional immunity against infection. Analyses of data generated by several consortia and additional studies were integrated to hypothesize mechanisms restricting NRAMP1 expression to mature phagocytes. Results from various epigenetic and transcriptomic approaches were collected for mesodermal and hematopoietic cell types and compiled for combined analysis with results of genetic studies associating single nucleotide polymorphisms (SNPs) with variations in NRAMP1 expression (eQTLs). Analyses establish that NRAMP1 is part of an autonomous topologically associated domain delimited by ubiquitous CCCTC-binding factor (CTCF) sites. NRAMP1 locus contains five regulatory regions: a predicted super-enhancer (S-E) key to phagocyte-specific expression; the proximal promoter; two intronic areas, including 3' inhibitory elements that restrict expression during development; and a block of upstream sites possibly extending the S-E domain. Also the downstream region adjacent to the 3' CTCF locus boundary may regulate expression during hematopoiesis. Mobilization of the locus 14 predicted transcriptional regulatory elements occurs in three steps, beginning with hematopoiesis; at the onset of myelopoiesis and through myelo-monocytic differentiation. Basal expression level in mature phagocytes is further influenced by genetic variation, tissue environment, and in response to infections that induce various epigenetic memories depending on microorganism nature. Constitutively associated transcription factors (TFs) include CCAAT enhancer binding protein beta (C/EBPb), purine rich DNA binding protein (PU.1), early growth response 2 (EGR2) and signal transducer and activator of transcription 1 (STAT1) while hypoxia-inducible factors (HIFs) and interferon regulatory factor 1 (IRF1) may stimulate iron acquisition in pro-inflammatory conditions. Mouse orthologous locus is generally conserved; chromatin patterns typify a de novo myelo-monocytic gene whose expression is tightly controlled by TFs Pu.1, C/ebps and Irf8; Irf3 and nuclear factor NF-kappa-B p 65 subunit (RelA) regulate expression in inflammatory conditions. Functional differences in the determinants identified at these orthologous loci imply that species-specific mechanisms control gene expression.
AFO Manure Management - Michigan: Manure Transfer Requirements
Compendium of State Approaches for Manure Management, Part A -- Example of program features for manure management that have a regulatory basis, such as permit provisions and other regulatory program elements.
Satheesh, Viswanathan; Jagannadham, P Tej Kumar; Chidambaranathan, Parameswaran; Jain, P K; Srinivasan, R
2014-12-01
The NAC (NAM, ATAF and CUC) proteins are plant-specific transcription factors implicated in development and stress responses. In the present study 88 pigeonpea NAC genes were identified from the recently published draft genome of pigeonpea by using homology based and de novo prediction programmes. These sequences were further subjected to phylogenetic, motif and promoter analyses. In motif analysis, highly conserved motifs were identified in the NAC domain and also in the C-terminal region of the NAC proteins. A phylogenetic reconstruction using pigeonpea, Arabidopsis and soybean NAC genes revealed 33 putative stress-responsive pigeonpea NAC genes. Several stress-responsive cis-elements were identified through in silico analysis of the promoters of these putative stress-responsive genes. This analysis is the first report of NAC gene family in pigeonpea and will be useful for the identification and selection of candidate genes associated with stress tolerance.
Infante, Carlos R; Mihala, Alexandra G; Park, Sungdae; Wang, Jialiang S; Johnson, Kenji K; Lauderdale, James D; Menke, Douglas B
2015-10-12
The amniote phallus and limbs differ dramatically in their morphologies but share patterns of signaling and gene expression in early development. Thus far, the extent to which genital and limb transcriptional networks also share cis-regulatory elements has remained unexplored. We show that many limb enhancers are retained in snake genomes, suggesting that these elements may function in non-limb tissues. Consistent with this, our analysis of cis-regulatory activity in mice and Anolis lizards reveals that patterns of enhancer activity in embryonic limbs and genitalia overlap heavily. In mice, deletion of HLEB, an enhancer of Tbx4, produces defects in hindlimbs and genitalia, establishing the importance of this limb-genital enhancer for development of these different appendages. Further analyses demonstrate that the HLEB of snakes has lost hindlimb enhancer function while retaining genital activity. Our findings identify roles for Tbx4 in genital development and highlight deep similarities in cis-regulatory activity between limbs and genitalia. Copyright © 2015 Elsevier Inc. All rights reserved.
Screening of MITF and SOX10 regulatory regions in Waardenburg syndrome type 2.
Baral, Viviane; Chaoui, Asma; Watanabe, Yuli; Goossens, Michel; Attie-Bitach, Tania; Marlin, Sandrine; Pingault, Veronique; Bondurand, Nadege
2012-01-01
Waardenburg syndrome (WS) is a rare auditory-pigmentary disorder that exhibits varying combinations of sensorineural hearing loss and pigmentation defects. Four subtypes are clinically defined based on the presence or absence of additional symptoms. WS type 2 (WS2) can result from mutations within the MITF or SOX10 genes; however, 70% of WS2 cases remain unexplained at the molecular level, suggesting that other genes might be involved and/or that mutations within the known genes escaped previous screenings. The recent identification of a deletion encompassing three of the SOX10 regulatory elements in a patient presenting with another WS subtype, WS4, defined by its association with Hirschsprung disease, led us to search for deletions and point mutations within the MITF and SOX10 regulatory elements in 28 yet unexplained WS2 cases. Two nucleotide variations were identified: one in close proximity to the MITF distal enhancer (MDE) and one within the U1 SOX10 enhancer. Functional analyses argued against a pathogenic effect of these variations, suggesting that mutations within regulatory elements of WS genes are not a major cause of this neurocristopathy.
Pocock, Ginger M.; Zimdars, Laraine L.; Yuan, Ming; Eliceiri, Kevin W.; Ahlquist, Paul; Sherer, Nathan M.
2017-01-01
Cis-acting RNA structural elements govern crucial aspects of viral gene expression. How these structures and other posttranscriptional signals affect RNA trafficking and translation in the context of single cells is poorly understood. Herein we describe a multicolor, long-term (>24 h) imaging strategy for measuring integrated aspects of viral RNA regulatory control in individual cells. We apply this strategy to demonstrate differential mRNA trafficking behaviors governed by RNA elements derived from three retroviruses (HIV-1, murine leukemia virus, and Mason-Pfizer monkey virus), two hepadnaviruses (hepatitis B virus and woodchuck hepatitis virus), and an intron-retaining transcript encoded by the cellular NXF1 gene. Striking behaviors include “burst” RNA nuclear export dynamics regulated by HIV-1’s Rev response element and the viral Rev protein; transient aggregations of RNAs into discrete foci at or near the nuclear membrane triggered by multiple elements; and a novel, pulsiform RNA export activity regulated by the hepadnaviral posttranscriptional regulatory element. We incorporate single-cell tracking and a data-mining algorithm into our approach to obtain RNA element–specific, high-resolution gene expression signatures. Together these imaging assays constitute a tractable, systems-based platform for studying otherwise difficult to access spatiotemporal features of viral and cellular gene regulation. PMID:27903772
Deep conservation of cis-regulatory elements in metazoans
Maeso, Ignacio; Irimia, Manuel; Tena, Juan J.; Casares, Fernando; Gómez-Skarmeta, José Luis
2013-01-01
Despite the vast morphological variation observed across phyla, animals share multiple basic developmental processes orchestrated by a common ancestral gene toolkit. These genes interact with each other building complex gene regulatory networks (GRNs), which are encoded in the genome by cis-regulatory elements (CREs) that serve as computational units of the network. Although GRN subcircuits involved in ancient developmental processes are expected to be at least partially conserved, identification of CREs that are conserved across phyla has remained elusive. Here, we review recent studies that revealed such deeply conserved CREs do exist, discuss the difficulties associated with their identification and describe new approaches that will facilitate this search. PMID:24218633
2009-01-01
Background Tardigrades represent an animal phylum with extraordinary resistance to environmental stress. Results To gain insights into their stress-specific adaptation potential, major clusters of related and similar proteins are identified, as well as specific functional clusters delineated comparing all tardigrades and individual species (Milnesium tardigradum, Hypsibius dujardini, Echiniscus testudo, Tulinus stephaniae, Richtersius coronifer) and functional elements in tardigrade mRNAs are analysed. We find that 39.3% of the total sequences clustered in 58 clusters of more than 20 proteins. Among these are ten tardigrade specific as well as a number of stress-specific protein clusters. Tardigrade-specific functional adaptations include strong protein, DNA- and redox protection, maintenance and protein recycling. Specific regulatory elements regulate tardigrade mRNA stability such as lox P DICE elements whereas 14 other RNA elements of higher eukaryotes are not found. Further features of tardigrade specific adaption are rapidly identified by sequence and/or pattern search on the web-tool tardigrade analyzer http://waterbear.bioapps.biozentrum.uni-wuerzburg.de. The work-bench offers nucleotide pattern analysis for promotor and regulatory element detection (tardigrade specific; nrdb) as well as rapid COG search for function assignments including species-specific repositories of all analysed data. Conclusion Different protein clusters and regulatory elements implicated in tardigrade stress adaptations are analysed including unpublished tardigrade sequences. PMID:19821996
Förster, Frank; Liang, Chunguang; Shkumatov, Alexander; Beisser, Daniela; Engelmann, Julia C; Schnölzer, Martina; Frohme, Marcus; Müller, Tobias; Schill, Ralph O; Dandekar, Thomas
2009-10-12
Tardigrades represent an animal phylum with extraordinary resistance to environmental stress. To gain insights into their stress-specific adaptation potential, major clusters of related and similar proteins are identified, as well as specific functional clusters delineated comparing all tardigrades and individual species (Milnesium tardigradum, Hypsibius dujardini, Echiniscus testudo, Tulinus stephaniae, Richtersius coronifer) and functional elements in tardigrade mRNAs are analysed. We find that 39.3% of the total sequences clustered in 58 clusters of more than 20 proteins. Among these are ten tardigrade specific as well as a number of stress-specific protein clusters. Tardigrade-specific functional adaptations include strong protein, DNA- and redox protection, maintenance and protein recycling. Specific regulatory elements regulate tardigrade mRNA stability such as lox P DICE elements whereas 14 other RNA elements of higher eukaryotes are not found. Further features of tardigrade specific adaption are rapidly identified by sequence and/or pattern search on the web-tool tardigrade analyzer http://waterbear.bioapps.biozentrum.uni-wuerzburg.de. The work-bench offers nucleotide pattern analysis for promotor and regulatory element detection (tardigrade specific; nrdb) as well as rapid COG search for function assignments including species-specific repositories of all analysed data. Different protein clusters and regulatory elements implicated in tardigrade stress adaptations are analysed including unpublished tardigrade sequences.
Oti, Martin; Dutilh, Bas E.; Alonso, M. Eva; de la Calle-Mustienes, Elisa; Smeenk, Leonie; Rinne, Tuula; Parsaulian, Lilian; Bolat, Emine; Jurgelenaite, Rasa; Huynen, Martijn A.; Hoischen, Alexander; Veltman, Joris A.; Brunner, Han G.; Roscioli, Tony; Oates, Emily; Wilson, Meredith; Manzanares, Miguel; Gómez-Skarmeta, José Luis; Stunnenberg, Hendrik G.; Lohrum, Marion; van Bokhoven, Hans; Zhou, Huiqing
2010-01-01
Heterozygous mutations in p63 are associated with split hand/foot malformations (SHFM), orofacial clefting, and ectodermal abnormalities. Elucidation of the p63 gene network that includes target genes and regulatory elements may reveal new genes for other malformation disorders. We performed genome-wide DNA–binding profiling by chromatin immunoprecipitation (ChIP), followed by deep sequencing (ChIP–seq) in primary human keratinocytes, and identified potential target genes and regulatory elements controlled by p63. We show that p63 binds to an enhancer element in the SHFM1 locus on chromosome 7q and that this element controls expression of DLX6 and possibly DLX5, both of which are important for limb development. A unique micro-deletion including this enhancer element, but not the DLX5/DLX6 genes, was identified in a patient with SHFM. Our study strongly indicates disruption of a non-coding cis-regulatory element located more than 250 kb from the DLX5/DLX6 genes as a novel disease mechanism in SHFM1. These data provide a proof-of-concept that the catalogue of p63 binding sites identified in this study may be of relevance to the studies of SHFM and other congenital malformations that resemble the p63-associated phenotypes. PMID:20808887
AFO Manure Management - Nevada: CAFO Drainage Collection Requirements
Compendium of State Approaches for Manure Management, Part A -- Example of program features for manure management that have a regulatory basis, such as permit provisions and other regulatory program elements.
AFO Manure Management - Virginia: Nutrient Management Inspector Qualifications
Compendium of State Approaches for Manure Management, Part A -- Example of program features for manure management that have a regulatory basis, such as permit provisions and other regulatory program elements.
AFO Manure Management - California: Implementing TMDL Wasteload Allocations
Compendium of State Approaches for Manure Management, Part A -- Example of program features for manure management that have a regulatory basis, such as permit provisions and other regulatory program elements.
Overview Article: Identifying transcriptional cis-regulatory modules in animal genomes
Suryamohan, Kushal; Halfon, Marc S.
2014-01-01
Gene expression is regulated through the activity of transcription factors and chromatin modifying proteins acting on specific DNA sequences, referred to as cis-regulatory elements. These include promoters, located at the transcription initiation sites of genes, and a variety of distal cis-regulatory modules (CRMs), the most common of which are transcriptional enhancers. Because regulated gene expression is fundamental to cell differentiation and acquisition of new cell fates, identifying, characterizing, and understanding the mechanisms of action of CRMs is critical for understanding development. CRM discovery has historically been challenging, as CRMs can be located far from the genes they regulate, have few readily-identifiable sequence characteristics, and for many years were not amenable to high-throughput discovery methods. However, the recent availability of complete genome sequences and the development of next-generation sequencing methods has led to an explosion of both computational and empirical methods for CRM discovery in model and non-model organisms alike. Experimentally, CRMs can be identified through chromatin immunoprecipitation directed against transcription factors or histone post-translational modifications, identification of nucleosome-depleted “open” chromatin regions, or sequencing-based high-throughput functional screening. Computational methods include comparative genomics, clustering of known or predicted transcription factor binding sites, and supervised machine-learning approaches trained on known CRMs. All of these methods have proven effective for CRM discovery, but each has its own considerations and limitations, and each is subject to a greater or lesser number of false-positive identifications. Experimental confirmation of predictions is essential, although shortcomings in current methods suggest that additional means of validation need to be developed. PMID:25704908
Regulatory Fit Improves Fitness for People With Low Exercise Experience.
Kay, Sophie A; Grimm, Lisa R
2017-04-01
Considering only 20.8% of American adults meet current physical activity recommendations, it is important to examine the psychological processes that affect exercise motivation and behavior. Drawing from regulatory fit theory, this study examined how manipulating regulatory focus and reward structures would affect exercise performance, with a specific interest in investigating whether exercise experience would moderate regulatory fit effects. We predicted that regulatory fit effects would appear only for participants with low exercise experience. One hundred and sixty-five young adults completed strength training exercise tasks (i.e., sit-ups, squats, plank, and wall-sit) in regulatory match or mismatch conditions. Consistent with predictions, only participants low in experience in regulatory match conditions exercised more compared with those in regulatory mismatch conditions. Although this is the first study manipulating regulatory fit in a controlled setting to examine exercise behavior, findings suggest that generating regulatory fit could positively influence those low in exercise experience.
The impact of transposable elements on mammalian development.
Garcia-Perez, Jose L; Widmann, Thomas J; Adams, Ian R
2016-11-15
Despite often being classified as selfish or junk DNA, transposable elements (TEs) are a group of abundant genetic sequences that have a significant impact on mammalian development and genome regulation. In recent years, our understanding of how pre-existing TEs affect genome architecture, gene regulatory networks and protein function during mammalian embryogenesis has dramatically expanded. In addition, the mobilization of active TEs in selected cell types has been shown to generate genetic variation during development and in fully differentiated tissues. Importantly, the ongoing domestication and evolution of TEs appears to provide a rich source of regulatory elements, functional modules and genetic variation that fuels the evolution of mammalian developmental processes. Here, we review the functional impact that TEs exert on mammalian developmental processes and discuss how the somatic activity of TEs can influence gene regulatory networks. © 2016. Published by The Company of Biologists Ltd.
An Autonomous BMP2 Regulatory Element in Mesenchymal Cells
Kruithof, Boudewijn P.T.; Fritz, David T.; Liu, Yijun; Garsetti, Diane E.; Frank, David B.; Pregizer, Steven K.; Gaussin, Vinciane; Mortlock, Douglas P.; Rogers, Melissa B.
2014-01-01
BMP2 is a morphogen that controls mesenchymal cell differentiation and behavior. For example, BMP2 concentration controls the differentiation of mesenchymal precursors into myocytes, adipocytes, chondrocytes, and osteoblasts. Sequences within the 3′untranslated region (UTR) of the Bmp2 mRNA mediate a post-transcriptional block of protein synthesis. Interaction of cell and developmental stage-specific trans-regulatory factors with the 3′UTR is a nimble and versatile mechanism for modulating this potent morphogen in different cell types. We show here, that an ultra-conserved sequence in the 3′UTR functions independently of promoter, coding region, and 3′UTR context in primary and immortalized tissue culture cells and in transgenic mice. Our findings indicate that the ultra-conserved sequence is an autonomously functioning post-transcriptional element that may be used to modulate the level of BMP2 and other proteins while retaining tissue specific regulatory elements. PMID:21268088
Mapping and analysis of Caenorhabditis elegans transcription factor sequence specificities
Narasimhan, Kamesh; Lambert, Samuel A; Yang, Ally WH; Riddell, Jeremy; Mnaimneh, Sanie; Zheng, Hong; Albu, Mihai; Najafabadi, Hamed S; Reece-Hoyes, John S; Fuxman Bass, Juan I; Walhout, Albertha JM; Weirauch, Matthew T; Hughes, Timothy R
2015-01-01
Caenorhabditis elegans is a powerful model for studying gene regulation, as it has a compact genome and a wealth of genomic tools. However, identification of regulatory elements has been limited, as DNA-binding motifs are known for only 71 of the estimated 763 sequence-specific transcription factors (TFs). To address this problem, we performed protein binding microarray experiments on representatives of canonical TF families in C. elegans, obtaining motifs for 129 TFs. Additionally, we predict motifs for many TFs that have DNA-binding domains similar to those already characterized, increasing coverage of binding specificities to 292 C. elegans TFs (∼40%). These data highlight the diversification of binding motifs for the nuclear hormone receptor and C2H2 zinc finger families and reveal unexpected diversity of motifs for T-box and DM families. Motif enrichment in promoters of functionally related genes is consistent with known biology and also identifies putative regulatory roles for unstudied TFs. DOI: http://dx.doi.org/10.7554/eLife.06967.001 PMID:25905672
The standard operating procedure of the DOE-JGI Microbial Genome Annotation Pipeline (MGAP v.4)
Huntemann, Marcel; Ivanova, Natalia N.; Mavromatis, Konstantinos; ...
2015-10-26
The DOE-JGI Microbial Genome Annotation Pipeline performs structural and functional annotation of microbial genomes that are further included into the Integrated Microbial Genome comparative analysis system. MGAP is applied to assembled nucleotide sequence datasets that are provided via the IMG submission site. Dataset submission for annotation first requires project and associated metadata description in GOLD. The MGAP sequence data processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNA features, as well as CRISPR elements. In conclusion, structural annotation is followed by assignment of protein product names and functions.
The standard operating procedure of the DOE-JGI Microbial Genome Annotation Pipeline (MGAP v.4)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Huntemann, Marcel; Ivanova, Natalia N.; Mavromatis, Konstantinos
The DOE-JGI Microbial Genome Annotation Pipeline performs structural and functional annotation of microbial genomes that are further included into the Integrated Microbial Genome comparative analysis system. MGAP is applied to assembled nucleotide sequence datasets that are provided via the IMG submission site. Dataset submission for annotation first requires project and associated metadata description in GOLD. The MGAP sequence data processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNA features, as well as CRISPR elements. In conclusion, structural annotation is followed by assignment of protein product names and functions.
Contamination in food from packaging material.
Lau, O W; Wong, S K
2000-06-16
Packaging has become an indispensible element in the food manufacturing process, and different types of additives, such as antioxidants, stabilizers, lubricants, anti-static and anti-blocking agents, have also been developed to improve the performance of polymeric packaging materials. Recently the packaging has been found to represent a source of contamination itself through the migration of substances from the packaging into food. Various analytical methods have been developed to analyze the migrants in the foodstuff, and migration evaluation procedures based on theoretical prediction of migration from plastic food contact material were also introduced recently. In this paper, the regulatory control, analytical methodology, factors affecting the migration and migration evaluation are reviewed.
12 CFR 3.63 - Disclosures by national banks or Federal savings associations described in § 3.61.
Code of Federal Regulations, 2014 CFR
2014-01-01
... tier 1 capital, tier 2 capital, tier 1 and total capital ratios, including the regulatory capital elements and all the regulatory adjustments and deductions needed to calculate the numerator of such ratios... to calculate total risk-weighted assets; (3) Regulatory capital ratios during any transition periods...
Zhu, Changfu; Yang, Qingjie; Ni, Xiuzhen; Bai, Chao; Sheng, Yanmin; Shi, Lianxuan; Capell, Teresa; Sandmann, Gerhard; Christou, Paul
2014-04-01
Over the last two decades, many carotenogenic genes have been cloned and used to generate metabolically engineered plants producing higher levels of carotenoids. However, comparatively little is known about the regulation of endogenous carotenogenic genes in higher plants, and this restricts our ability to predict how engineered plants will perform in terms of carotenoid content and composition. During petal development in the Great Yellow Gentian (Gentiana lutea), carotenoid accumulation, the formation of chromoplasts and the upregulation of several carotenogenic genes are temporally coordinated. We investigated the regulatory mechanisms responsible for this coordinated expression by isolating five G. lutea carotenogenic gene (GlPDS, GlZDS, GlLYCB, GlBCH and GlLYCE) promoters by inverse polymerase chain reaction (PCR). Each promoter was sufficient for developmentally regulated expression of the gusA reporter gene following transient expression in tomato (Solanum lycopersicum cv. Micro-Tom). Interestingly, the GlLYCB and GlBCH promoters drove high levels of gusA expression in chromoplast-containing mature green fruits, but low levels in chloroplast-containing immature green fruits, indicating a strict correlation between promoter activity, tomato fruit development and chromoplast differentiation. As well as core promoter elements such as TATA and CAAT boxes, all five promoters together with previously characterized GlZEP promoter contained three common cis-regulatory motifs involved in the response to methyl jasmonate (CGTCA) and ethylene (ATCTA), and required for endosperm expression (Skn-1_motif, GTCAT). These shared common cis-acting elements may represent binding sites for transcription factors responsible for co-regulation. Our data provide insight into the regulatory basis of the coordinated upregulation of carotenogenic gene expression during flower development in G. lutea. © 2013 Scandinavian Plant Physiology Society.
RNA sequencing uncovers antisense RNAs and novel small RNAs in Streptococcus pyogenes
Le Rhun, Anaïs; Beer, Yan Yan; Reimegård, Johan; Chylinski, Krzysztof; Charpentier, Emmanuelle
2016-01-01
ABSTRACT Streptococcus pyogenes is a human pathogen responsible for a wide spectrum of diseases ranging from mild to life-threatening infections. During the infectious process, the temporal and spatial expression of pathogenicity factors is tightly controlled by a complex network of protein and RNA regulators acting in response to various environmental signals. Here, we focus on the class of small RNA regulators (sRNAs) and present the first complete analysis of sRNA sequencing data in S. pyogenes. In the SF370 clinical isolate (M1 serotype), we identified 197 and 428 putative regulatory RNAs by visual inspection and bioinformatics screening of the sequencing data, respectively. Only 35 from the 197 candidates identified by visual screening were assigned a predicted function (T-boxes, ribosomal protein leaders, characterized riboswitches or sRNAs), indicating how little is known about sRNA regulation in S. pyogenes. By comparing our list of predicted sRNAs with previous S. pyogenes sRNA screens using bioinformatics or microarrays, 92 novel sRNAs were revealed, including antisense RNAs that are for the first time shown to be expressed in this pathogen. We experimentally validated the expression of 30 novel sRNAs and antisense RNAs. We show that the expression profile of 9 sRNAs including 2 predicted regulatory elements is affected by the endoribonucleases RNase III and/or RNase Y, highlighting the critical role of these enzymes in sRNA regulation. PMID:26580233
Imai, S; Fujino, T; Nishibayashi, S; Manabe, T; Takano, T
1994-01-01
Dramatic changes occur in expression of the type I collagenase gene during the process of immortalization in simian virus 40 large T antigen-transformed human fibroblasts (S. Imai and T. Takano, Biochem. Biophys. Res. Commun. 189:148-153, 1992). From transient transfection assays, it was determined that these changes involved the functions of two immortalization-susceptible cis-acting elements, ISE1 and ISE2, located in a 100-bp region about 1.7 kb upstream. The profiles of binding of an activator, Proserpine, to the enhancer ISE1 were similar in the extracts of young, senescent preimmortalized and immortalized cells. ISE2 contained both negative and positive regulatory elements located adjacent to each other. The positive regulatory element consisted of a tandem array of putative Ets family- and AP-1-binding sites. An activator, Pluto, interacted with this positive regulatory element and had an AP-1-related component as a complex. The binding activity of Pluto was predominantly detected only in the extract from senescent preimmortalized cells. In contrast, a repressor, Orpheus, which bound to the ATG-rich negative regulatory element of ISE2, was prominently detected in extracts from both young preimmortalized and immortalized cells and appeared to suppress transcription in an orientation-dependent manner. Thus, the interplay of Pluto and Orpheus was suggested to be crucial for regulation of the collagenase gene accompanying in vitro aging and immortalization. Proserpine seemed to interact with Pluto to mediate strong expression of the collagenase gene in cellular senescence. On the basis of these results, we propose a model for regulation of the collagenase gene during in vitro aging and immortalization. Images PMID:7935433
Predictive computation of genomic logic processing functions in embryonic development
Peter, Isabelle S.; Faure, Emmanuel; Davidson, Eric H.
2012-01-01
Gene regulatory networks (GRNs) control the dynamic spatial patterns of regulatory gene expression in development. Thus, in principle, GRN models may provide system-level, causal explanations of developmental process. To test this assertion, we have transformed a relatively well-established GRN model into a predictive, dynamic Boolean computational model. This Boolean model computes spatial and temporal gene expression according to the regulatory logic and gene interactions specified in a GRN model for embryonic development in the sea urchin. Additional information input into the model included the progressive embryonic geometry and gene expression kinetics. The resulting model predicted gene expression patterns for a large number of individual regulatory genes each hour up to gastrulation (30 h) in four different spatial domains of the embryo. Direct comparison with experimental observations showed that the model predictively computed these patterns with remarkable spatial and temporal accuracy. In addition, we used this model to carry out in silico perturbations of regulatory functions and of embryonic spatial organization. The model computationally reproduced the altered developmental functions observed experimentally. Two major conclusions are that the starting GRN model contains sufficiently complete regulatory information to permit explanation of a complex developmental process of gene expression solely in terms of genomic regulatory code, and that the Boolean model provides a tool with which to test in silico regulatory circuitry and developmental perturbations. PMID:22927416
Sequence-based model of gap gene regulatory network.
Kozlov, Konstantin; Gursky, Vitaly; Kulakovskiy, Ivan; Samsonova, Maria
2014-01-01
The detailed analysis of transcriptional regulation is crucially important for understanding biological processes. The gap gene network in Drosophila attracts large interest among researches studying mechanisms of transcriptional regulation. It implements the most upstream regulatory layer of the segmentation gene network. The knowledge of molecular mechanisms involved in gap gene regulation is far less complete than that of genetics of the system. Mathematical modeling goes beyond insights gained by genetics and molecular approaches. It allows us to reconstruct wild-type gene expression patterns in silico, infer underlying regulatory mechanism and prove its sufficiency. We developed a new model that provides a dynamical description of gap gene regulatory systems, using detailed DNA-based information, as well as spatial transcription factor concentration data at varying time points. We showed that this model correctly reproduces gap gene expression patterns in wild type embryos and is able to predict gap expression patterns in Kr mutants and four reporter constructs. We used four-fold cross validation test and fitting to random dataset to validate the model and proof its sufficiency in data description. The identifiability analysis showed that most model parameters are well identifiable. We reconstructed the gap gene network topology and studied the impact of individual transcription factor binding sites on the model output. We measured this impact by calculating the site regulatory weight as a normalized difference between the residual sum of squares error for the set of all annotated sites and for the set with the site of interest excluded. The reconstructed topology of the gap gene network is in agreement with previous modeling results and data from literature. We showed that 1) the regulatory weights of transcription factor binding sites show very weak correlation with their PWM score; 2) sites with low regulatory weight are important for the model output; 3) functional important sites are not exclusively located in cis-regulatory elements, but are rather dispersed through regulatory region. It is of importance that some of the sites with high functional impact in hb, Kr and kni regulatory regions coincide with strong sites annotated and verified in Dnase I footprint assays.
Tang, Guiying; Xu, Pingli; Liu, Wei; Liu, Zhanji; Shan, Lei
2015-01-01
LEAFY COTYLEDON1 (LEC1) is a B subunit of Nuclear Factor Y (NF-YB) transcription factor that mainly accumulates during embryo development. We cloned the 5′ flanking regulatory sequence of AhLEC1B gene, a homolog of Arabidopsis LEC1, and analyzed its regulatory elements using online software. To identify the crucial regulatory region, we generated a series of GUS expression frameworks driven by different length promoters with 5′ terminal and/or 3′ terminal deletion. We further characterized the GUS expression patterns in the transgenic Arabidopsis lines. Our results show that both the 65bp proximal promoter region and the 52bp 5′ UTR of AhLEC1B contain the key motifs required for the essential promoting activity. Moreover, AhLEC1B is preferentially expressed in the embryo and is co-regulated by binding of its upstream genes with both positive and negative corresponding cis-regulatory elements. PMID:26426444
2011-01-01
Background Transcription factors (TFs) play a central role in regulating gene expression by interacting with cis-regulatory DNA elements associated with their target genes. Recent surveys have examined the DNA binding specificities of most Saccharomyces cerevisiae TFs, but a comprehensive evaluation of their data has been lacking. Results We analyzed in vitro and in vivo TF-DNA binding data reported in previous large-scale studies to generate a comprehensive, curated resource of DNA binding specificity data for all characterized S. cerevisiae TFs. Our collection comprises DNA binding site motifs and comprehensive in vitro DNA binding specificity data for all possible 8-bp sequences. Investigation of the DNA binding specificities within the basic leucine zipper (bZIP) and VHT1 regulator (VHR) TF families revealed unexpected plasticity in TF-DNA recognition: intriguingly, the VHR TFs, newly characterized by protein binding microarrays in this study, recognize bZIP-like DNA motifs, while the bZIP TF Hac1 recognizes a motif highly similar to the canonical E-box motif of basic helix-loop-helix (bHLH) TFs. We identified several TFs with distinct primary and secondary motifs, which might be associated with different regulatory functions. Finally, integrated analysis of in vivo TF binding data with protein binding microarray data lends further support for indirect DNA binding in vivo by sequence-specific TFs. Conclusions The comprehensive data in this curated collection allow for more accurate analyses of regulatory TF-DNA interactions, in-depth structural studies of TF-DNA specificity determinants, and future experimental investigations of the TFs' predicted target genes and regulatory roles. PMID:22189060
Vazquez-Anderson, Jorge; Mihailovic, Mia K.; Baldridge, Kevin C.; Reyes, Kristofer G.; Haning, Katie; Cho, Seung Hee; Amador, Paul; Powell, Warren B.
2017-01-01
Abstract Current approaches to design efficient antisense RNAs (asRNAs) rely primarily on a thermodynamic understanding of RNA–RNA interactions. However, these approaches depend on structure predictions and have limited accuracy, arguably due to overlooking important cellular environment factors. In this work, we develop a biophysical model to describe asRNA–RNA hybridization that incorporates in vivo factors using large-scale experimental hybridization data for three model RNAs: a group I intron, CsrB and a tRNA. A unique element of our model is the estimation of the availability of the target region to interact with a given asRNA using a differential entropic consideration of suboptimal structures. We showcase the utility of this model by evaluating its prediction capabilities in four additional RNAs: a group II intron, Spinach II, 2-MS2 binding domain and glgC 5΄ UTR. Additionally, we demonstrate the applicability of this approach to other bacterial species by predicting sRNA–mRNA binding regions in two newly discovered, though uncharacterized, regulatory RNAs. PMID:28334800
Creating and validating cis-regulatory maps of tissue-specific gene expression regulation
O'Connor, Timothy R.; Bailey, Timothy L.
2014-01-01
Predicting which genomic regions control the transcription of a given gene is a challenge. We present a novel computational approach for creating and validating maps that associate genomic regions (cis-regulatory modules–CRMs) with genes. The method infers regulatory relationships that explain gene expression observed in a test tissue using widely available genomic data for ‘other’ tissues. To predict the regulatory targets of a CRM, we use cross-tissue correlation between histone modifications present at the CRM and expression at genes within 1 Mbp of it. To validate cis-regulatory maps, we show that they yield more accurate models of gene expression than carefully constructed control maps. These gene expression models predict observed gene expression from transcription factor binding in the CRMs linked to that gene. We show that our maps are able to identify long-range regulatory interactions and improve substantially over maps linking genes and CRMs based on either the control maps or a ‘nearest neighbor’ heuristic. Our results also show that it is essential to include CRMs predicted in multiple tissues during map-building, that H3K27ac is the most informative histone modification, and that CAGE is the most informative measure of gene expression for creating cis-regulatory maps. PMID:25200088
Glinsky, Gennadi V
2018-03-01
Transposable elements have made major evolutionary impacts on creation of primate-specific and human-specific genomic regulatory loci and species-specific genomic regulatory networks (GRNs). Molecular and genetic definitions of human-specific changes to GRNs contributing to development of unique to human phenotypes remain a highly significant challenge. Genome-wide proximity placement analysis of diverse families of human-specific genomic regulatory loci (HSGRL) identified topologically associating domains (TADs) that are significantly enriched for HSGRL and designated rapidly evolving in human TADs. Here, the analysis of HSGRL, hESC-enriched enhancers, super-enhancers (SEs), and specific sub-TAD structures termed super-enhancer domains (SEDs) has been performed. In the hESC genome, 331 of 504 (66%) of SED-harboring TADs contain HSGRL and 68% of SEDs co-localize with HSGRL, suggesting that emergence of HSGRL may have rewired SED-associated GRNs within specific TADs by inserting novel and/or erasing existing non-coding regulatory sequences. Consequently, markedly distinct features of the principal regulatory structures of interphase chromatin evolved in the hESC genome compared to mouse: the SED quantity is 3-fold higher and the median SED size is significantly larger. Concomitantly, the overall TAD quantity is increased by 42% while the median TAD size is significantly decreased (p = 9.11E-37) in the hESC genome. Present analyses illustrate a putative global role for transposable elements and HSGRL in shaping the human-specific features of the interphase chromatin organization and functions, which are facilitated by accelerated creation of novel transcription factor binding sites and new enhancers driven by targeted placement of HSGRL at defined genomic coordinates. A trend toward the convergence of TAD and SED architectures of interphase chromatin in the hESC genome may reflect changes of 3D-folding patterns of linear chromatin fibers designed to enhance both regulatory complexity and functional precision of GRNs by creating predominantly a single gene (or a set of functionally linked genes) per regulatory domain structures. Collectively, present analyses reveal critical evolutionary contributions of transposable elements and distal enhancers to creation of thousands primate- and human-specific elements of a chromatin folding code, which defines the 3D context of interphase chromatin both restricting and facilitating biological functions of GRNs.
Structural imprints in vivo decode RNA regulatory mechanisms
Spitale, Robert C.; Flynn, Ryan A.; Zhang, Qiangfeng Cliff; Crisalli, Pete; Lee, Byron; Jung, Jong-Wha; Kuchelmeister, Hannes Y.; Batista, Pedro J.; Torre, Eduardo A.; Kool, Eric T.; Chang, Howard Y.
2015-01-01
Visualizing the physical basis for molecular behavior inside living cells is a grand challenge in biology. RNAs are central to biological regulation, and RNA’s ability to adopt specific structures intimately controls every step of the gene expression program1. However, our understanding of physiological RNA structures is limited; current in vivo RNA structure profiles view only two of four nucleotides that make up RNA2,3. Here we present a novel biochemical approach, In Vivo Click SHAPE (icSHAPE), that enables the first global view of RNA secondary structures of all four bases in living cells. icSHAPE of mouse embryonic stem cell transcriptome versus purified RNA folded in vitro shows that the structural dynamics of RNA in the cellular environment distinguishes different classes of RNAs and regulatory elements. Structural signatures at translational start sites and ribosome pause sites are conserved from in vitro, suggesting that these RNA elements are programmed by sequence. In contrast, focal structural rearrangements in vivo reveal precise interfaces of RNA with RNA binding proteins or RNA modification sites that are consistent with atomic-resolution structural data. Such dynamic structural footprints enable accurate prediction of RNA-protein interactions and N6-methyladenosine (m6A) modification genome-wide. These results open the door for structural genomics of RNA in living cells and reveal key physiological structures controlling gene expression. PMID:25799993
Structural imprints in vivo decode RNA regulatory mechanisms.
Spitale, Robert C; Flynn, Ryan A; Zhang, Qiangfeng Cliff; Crisalli, Pete; Lee, Byron; Jung, Jong-Wha; Kuchelmeister, Hannes Y; Batista, Pedro J; Torre, Eduardo A; Kool, Eric T; Chang, Howard Y
2015-03-26
Visualizing the physical basis for molecular behaviour inside living cells is a great challenge for biology. RNAs are central to biological regulation, and the ability of RNA to adopt specific structures intimately controls every step of the gene expression program. However, our understanding of physiological RNA structures is limited; current in vivo RNA structure profiles include only two of the four nucleotides that make up RNA. Here we present a novel biochemical approach, in vivo click selective 2'-hydroxyl acylation and profiling experiment (icSHAPE), which enables the first global view, to our knowledge, of RNA secondary structures in living cells for all four bases. icSHAPE of the mouse embryonic stem cell transcriptome versus purified RNA folded in vitro shows that the structural dynamics of RNA in the cellular environment distinguish different classes of RNAs and regulatory elements. Structural signatures at translational start sites and ribosome pause sites are conserved from in vitro conditions, suggesting that these RNA elements are programmed by sequence. In contrast, focal structural rearrangements in vivo reveal precise interfaces of RNA with RNA-binding proteins or RNA-modification sites that are consistent with atomic-resolution structural data. Such dynamic structural footprints enable accurate prediction of RNA-protein interactions and N(6)-methyladenosine (m(6)A) modification genome wide. These results open the door for structural genomics of RNA in living cells and reveal key physiological structures controlling gene expression.
Huang, Daosheng; Guo, Guoji; Yuan, Ping; Ralston, Amy; Sun, Lingang; Huss, Mikael; Mistri, Tapan; Pinello, Luca; Ng, Huck Hui; Yuan, Guocheng; Ji, Junfeng; Rossant, Janet; Robson, Paul; Han, Xiaoping
2017-12-07
The first cellular differentiation event in mouse development leads to the formation of the blastocyst consisting of the inner cell mass (ICM) and trophectoderm (TE). The transcription factor CDX2 is required for proper TE specification, where it promotes expression of TE genes, and represses expression of Pou5f1 (OCT4). However its downstream network in the developing embryo is not fully characterized. Here, we performed high-throughput single embryo qPCR analysis in Cdx2 null embryos to identify CDX2-regulated targets in vivo. To identify genes likely to be regulated by CDX2 directly, we performed CDX2 ChIP-Seq on trophoblast stem (TS) cells. In addition, we examined the dynamics of gene expression changes using inducible CDX2 embryonic stem (ES) cells, so that we could predict which CDX2-bound genes are activated or repressed by CDX2 binding. By integrating these data with observations of chromatin modifications, we identify putative novel regulatory elements that repress gene expression in a lineage-specific manner. Interestingly, we found CDX2 binding sites within regulatory elements of key pluripotent genes such as Pou5f1 and Nanog, pointing to the existence of a novel mechanism by which CDX2 maintains repression of OCT4 in trophoblast. Our study proposes a general mechanism in regulating lineage segregation during mammalian development.
An Enhancer Near ISL1 and an Ultraconserved Exon of PCBP2 areDerived from a Retroposon
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bejerano, Gill; Lowe, Craig; Ahituv, Nadav
2005-11-27
Hundreds of highly conserved distal cis-regulatory elementshave been characterized to date in vertebrate genomes1. Many thousandsmore are predicted based on comparative genomics2,3. Yet, in starkcontrast to the genes they regulate, virtually none of these regions canbe traced using sequence similarity in invertebrates, leaving theirevolutionary origin obscure. Here we show that a class of conserved,primarily non-coding regions in tetrapods originated from a novel shortinterspersed repetitive element (SINE) retroposon family that was activein Sarcopterygii (lobe-finned fishes and terrestrial vertebrates) in theSilurian at least 410 Mya4, and, remarkably, appears to be recentlyactive in the "living fossil" Indonesian coelacanth, Latimeriamenadoensis. We show that onemore » copy is a distal enhancer, located 500kbfrom the neuro-developmental gene ISL1. Several others represent new,possibly regulatory, alternatively spliced exons in the middle ofpre-existing Sarcopterygian genes. One of these is the>200bpultraconserved region5, 100 percent identical in mammals, and 80 percentidentical to the coelacanth SINE, that contains a 31aa alternativelyspliced exon of the mRNA processing gene PCBP26. These add to a growinglist of examples7 in which relics of transposable elements have acquireda function that serves their host, a process termed "exaptation"8, andprovide an origin for at least some of the highly-conservedvertebrate-specific genomic sequences recently discovered usingcomparative genomics.« less
AFO Manure Management - Oregon: Plan Review and Public Notice of Substantial Changes
Compendium of State Approaches for Manure Management, Part A -- Example of program features for manure management that have a regulatory basis, such as permit provisions and other regulatory program elements.
Smith, Emily M.; Lajoie, Bryan R.; Jain, Gaurav; Dekker, Job
2016-01-01
Three-dimensional genome structure plays an important role in gene regulation. Globally, chromosomes are organized into active and inactive compartments while, at the gene level, looping interactions connect promoters to regulatory elements. Topologically associating domains (TADs), typically several hundred kilobases in size, form an intermediate level of organization. Major questions include how TADs are formed and how they are related to looping interactions between genes and regulatory elements. Here we performed a focused 5C analysis of a 2.8 Mb chromosome 7 region surrounding CFTR in a panel of cell types. We find that the same TAD boundaries are present in all cell types, indicating that TADs represent a universal chromosome architecture. Furthermore, we find that these TAD boundaries are present irrespective of the expression and looping of genes located between them. In contrast, looping interactions between promoters and regulatory elements are cell-type specific and occur mostly within TADs. This is exemplified by the CFTR promoter that in different cell types interacts with distinct sets of distal cell-type-specific regulatory elements that are all located within the same TAD. Finally, we find that long-range associations between loci located in different TADs are also detected, but these display much lower interaction frequencies than looping interactions within TADs. Interestingly, interactions between TADs are also highly cell-type-specific and often involve loci clustered around TAD boundaries. These data point to key roles of invariant TAD boundaries in constraining as well as mediating cell-type-specific long-range interactions and gene regulation. PMID:26748519
Ashworth, Justin; Plaisier, Christopher L.; Lo, Fang Yin; Reiss, David J.; Baliga, Nitin S.
2014-01-01
Widespread microbial genome sequencing presents an opportunity to understand the gene regulatory networks of non-model organisms. This requires knowledge of the binding sites for transcription factors whose DNA-binding properties are unknown or difficult to infer. We adapted a protein structure-based method to predict the specificities and putative regulons of homologous transcription factors across diverse species. As a proof-of-concept we predicted the specificities and transcriptional target genes of divergent archaeal feast/famine regulatory proteins, several of which are encoded in the genome of Halobacterium salinarum. This was validated by comparison to experimentally determined specificities for transcription factors in distantly related extremophiles, chromatin immunoprecipitation experiments, and cis-regulatory sequence conservation across eighteen related species of halobacteria. Through this analysis we were able to infer that Halobacterium salinarum employs a divergent local trans-regulatory strategy to regulate genes (carA and carB) involved in arginine and pyrimidine metabolism, whereas Escherichia coli employs an operon. The prediction of gene regulatory binding sites using structure-based methods is useful for the inference of gene regulatory relationships in new species that are otherwise difficult to infer. PMID:25255272
Ashworth, Justin; Plaisier, Christopher L; Lo, Fang Yin; Reiss, David J; Baliga, Nitin S
2014-01-01
Widespread microbial genome sequencing presents an opportunity to understand the gene regulatory networks of non-model organisms. This requires knowledge of the binding sites for transcription factors whose DNA-binding properties are unknown or difficult to infer. We adapted a protein structure-based method to predict the specificities and putative regulons of homologous transcription factors across diverse species. As a proof-of-concept we predicted the specificities and transcriptional target genes of divergent archaeal feast/famine regulatory proteins, several of which are encoded in the genome of Halobacterium salinarum. This was validated by comparison to experimentally determined specificities for transcription factors in distantly related extremophiles, chromatin immunoprecipitation experiments, and cis-regulatory sequence conservation across eighteen related species of halobacteria. Through this analysis we were able to infer that Halobacterium salinarum employs a divergent local trans-regulatory strategy to regulate genes (carA and carB) involved in arginine and pyrimidine metabolism, whereas Escherichia coli employs an operon. The prediction of gene regulatory binding sites using structure-based methods is useful for the inference of gene regulatory relationships in new species that are otherwise difficult to infer.
[Bacteriophage λ: electrostatic properties of the genome and its elements].
Krutinina, G G; Krutinin, E A; Kamzolova, S G; Osypov, A A
2015-01-01
Bacteriophage λ is a classical model object in molecular biology, but little is still known on the physical properties of its DNA and regulatory elements. A study was made of the electrostatic properties of phage λ DNA and regulatory elements. A global electrostatic potential distribution along the phage genome was found to be nonuniform with main regulatory elements being located in a limited region with a high potential. The RNA polymerase binding frequency on the linearized phage chromosome directly correlates with its local potential. Strong promoters of the phage and its host Escherichia coli have distinct electrostatic upstream elements, which differ in nucleotide sequence. Attachment and recombination sites of phage λ and its host have a higher potential, which possibly facilitates their recognition by integrase. Phage λ and host Rho-independent terminators have a symmetrical M-shaped potential profile, which only slightly depends on the annotated terminator palindrome length, and occur in a region with a substantially higher potential, which may cause polymerase retention, facilitating the formation of a terminator hairpin in RNA. It was concluded that virtually all elements of phage λ genome have potential distribution specifics, which are related to their structural properties and may play a role in their biological function. The global potential distribution along the phage genome reflects the architecture of the regulation of its transcription and integration in the host genome.
CORECLUST: identification of the conserved CRM grammar together with prediction of gene regulation.
Nikulova, Anna A; Favorov, Alexander V; Sutormin, Roman A; Makeev, Vsevolod J; Mironov, Andrey A
2012-07-01
Identification of transcriptional regulatory regions and tracing their internal organization are important for understanding the eukaryotic cell machinery. Cis-regulatory modules (CRMs) of higher eukaryotes are believed to possess a regulatory 'grammar', or preferred arrangement of binding sites, that is crucial for proper regulation and thus tends to be evolutionarily conserved. Here, we present a method CORECLUST (COnservative REgulatory CLUster STructure) that predicts CRMs based on a set of positional weight matrices. Given regulatory regions of orthologous and/or co-regulated genes, CORECLUST constructs a CRM model by revealing the conserved rules that describe the relative location of binding sites. The constructed model may be consequently used for the genome-wide prediction of similar CRMs, and thus detection of co-regulated genes, and for the investigation of the regulatory grammar of the system. Compared with related methods, CORECLUST shows better performance at identification of CRMs conferring muscle-specific gene expression in vertebrates and early-developmental CRMs in Drosophila.
Cronin, Mark T D; Walker, John D; Jaworska, Joanna S; Comber, Michael H I; Watts, Christopher D; Worth, Andrew P
2003-01-01
This article is a review of the use, by regulatory agencies and authorities, of quantitative structure-activity relationships (QSARs) to predict ecologic effects and environmental fate of chemicals. For many years, the U.S. Environmental Protection Agency has been the most prominent regulatory agency using QSARs to predict the ecologic effects and environmental fate of chemicals. However, as increasing numbers of standard QSAR methods are developed and validated to predict ecologic effects and environmental fate of chemicals, it is anticipated that more regulatory agencies and authorities will find them to be acceptable alternatives to chemical testing. PMID:12896861
Genome-Wide Discovery of Drug-Dependent Human Liver Regulatory Elements
Morrissey, Kari M.; Luizon, Marcelo R.; Hoffmann, Thomas J.; Sun, Xuefeng; Jones, Stacy L.; Force Aldred, Shelley; Ramamoorthy, Anuradha; Desta, Zeruesenay; Liu, Yunlong; Skaar, Todd C.; Trinklein, Nathan D.; Giacomini, Kathleen M.; Ahituv, Nadav
2014-01-01
Inter-individual variation in gene regulatory elements is hypothesized to play a causative role in adverse drug reactions and reduced drug activity. However, relatively little is known about the location and function of drug-dependent elements. To uncover drug-associated elements in a genome-wide manner, we performed RNA-seq and ChIP-seq using antibodies against the pregnane X receptor (PXR) and three active regulatory marks (p300, H3K4me1, H3K27ac) on primary human hepatocytes treated with rifampin or vehicle control. Rifampin and PXR were chosen since they are part of the CYP3A4 pathway, which is known to account for the metabolism of more than 50% of all prescribed drugs. We selected 227 proximal promoters for genes with rifampin-dependent expression or nearby PXR/p300 occupancy sites and assayed their ability to induce luciferase in rifampin-treated HepG2 cells, finding only 10 (4.4%) that exhibited drug-dependent activity. As this result suggested a role for distal enhancer modules, we searched more broadly to identify 1,297 genomic regions bearing a conditional PXR occupancy as well as all three active regulatory marks. These regions are enriched near genes that function in the metabolism of xenobiotics, specifically members of the cytochrome P450 family. We performed enhancer assays in rifampin-treated HepG2 cells for 42 of these sequences as well as 7 sequences that overlap linkage-disequilibrium blocks defined by lead SNPs from pharmacogenomic GWAS studies, revealing 15/42 and 4/7 to be functional enhancers, respectively. A common African haplotype in one of these enhancers in the GSTA locus was found to exhibit potential rifampin hypersensitivity. Combined, our results further suggest that enhancers are the predominant targets of rifampin-induced PXR activation, provide a genome-wide catalog of PXR targets and serve as a model for the identification of drug-responsive regulatory elements. PMID:25275310
An ant colony optimization based algorithm for identifying gene regulatory elements.
Liu, Wei; Chen, Hanwu; Chen, Ling
2013-08-01
It is one of the most important tasks in bioinformatics to identify the regulatory elements in gene sequences. Most of the existing algorithms for identifying regulatory elements are inclined to converge into a local optimum, and have high time complexity. Ant Colony Optimization (ACO) is a meta-heuristic method based on swarm intelligence and is derived from a model inspired by the collective foraging behavior of real ants. Taking advantage of the ACO in traits such as self-organization and robustness, this paper designs and implements an ACO based algorithm named ACRI (ant-colony-regulatory-identification) for identifying all possible binding sites of transcription factor from the upstream of co-expressed genes. To accelerate the ants' searching process, a strategy of local optimization is presented to adjust the ants' start positions on the searched sequences. By exploiting the powerful optimization ability of ACO, the algorithm ACRI can not only improve precision of the results, but also achieve a very high speed. Experimental results on real world datasets show that ACRI can outperform other traditional algorithms in the respects of speed and quality of solutions. Copyright © 2013 Elsevier Ltd. All rights reserved.
Screening of MITF and SOX10 Regulatory Regions in Waardenburg Syndrome Type 2
Baral, Viviane; Chaoui, Asma; Watanabe, Yuli; Goossens, Michel; Attie-Bitach, Tania; Marlin, Sandrine; Pingault, Veronique; Bondurand, Nadege
2012-01-01
Waardenburg syndrome (WS) is a rare auditory-pigmentary disorder that exhibits varying combinations of sensorineural hearing loss and pigmentation defects. Four subtypes are clinically defined based on the presence or absence of additional symptoms. WS type 2 (WS2) can result from mutations within the MITF or SOX10 genes; however, 70% of WS2 cases remain unexplained at the molecular level, suggesting that other genes might be involved and/or that mutations within the known genes escaped previous screenings. The recent identification of a deletion encompassing three of the SOX10 regulatory elements in a patient presenting with another WS subtype, WS4, defined by its association with Hirschsprung disease, led us to search for deletions and point mutations within the MITF and SOX10 regulatory elements in 28 yet unexplained WS2 cases. Two nucleotide variations were identified: one in close proximity to the MITF distal enhancer (MDE) and one within the U1 SOX10 enhancer. Functional analyses argued against a pathogenic effect of these variations, suggesting that mutations within regulatory elements of WS genes are not a major cause of this neurocristopathy. PMID:22848661
Federal Register 2010, 2011, 2012, 2013, 2014
2012-03-22
... Fuel Elements for Use in Research and Test Reactors AGENCY: Nuclear Regulatory Commission. ACTION... Plate-Type Uranium-Aluminum Fuel Elements for Use in Research and Test Reactors.'' This guide describes... plate-type uranium-aluminum fuel elements used in research and test reactors (RTRs). DATES: Submit...
Retroviruses facilitate the rapid evolution of the mammalian placenta
Chuong, Edward B.
2015-01-01
The mammalian placenta exhibits elevated expression of endogenous retroviruses (ERVs), but the evolutionary significance of this feature remains unclear. I propose that ERV-mediated regulatory evolution was, and continues to be, an important mechanism underlying the evolution of placenta development. Many recent studies have focused on the co-option of ERV-derived genes for specific functional adaptations in the placenta. However, the co-option of ERV-derived regulatory elements has the potential to co-opt entire gene regulatory networks, which, I argue, would facilitate relatively rapid developmental evolution of the placenta. I suggest a model in which an ancient retroviral infection led to the establishment of the ancestral placental developmental gene network through the co-option of ERV-derived regulatory elements. Consequently, placenta development would require elevated tolerance to ERV activity, which in turn would expose a continuous stream of novel ERV mutations that may have catalyzed the developmental diversification of the mammalian placenta. PMID:23873343
Naval-Sanchez, Marina; Nguyen, Quan; McWilliam, Sean; Porto-Neto, Laercio R; Tellam, Ross; Vuocolo, Tony; Reverter, Antonio; Perez-Enciso, Miguel; Brauning, Rudiger; Clarke, Shannon; McCulloch, Alan; Zamani, Wahid; Naderi, Saeid; Rezaei, Hamid Reza; Pompanon, Francois; Taberlet, Pierre; Worley, Kim C; Gibbs, Richard A; Muzny, Donna M; Jhangiani, Shalini N; Cockett, Noelle; Daetwyler, Hans; Kijas, James
2018-02-28
Domestication fundamentally reshaped animal morphology, physiology and behaviour, offering the opportunity to investigate the molecular processes driving evolutionary change. Here we assess sheep domestication and artificial selection by comparing genome sequence from 43 modern breeds (Ovis aries) and their Asian mouflon ancestor (O. orientalis) to identify selection sweeps. Next, we provide a comparative functional annotation of the sheep genome, validated using experimental ChIP-Seq of sheep tissue. Using these annotations, we evaluate the impact of selection and domestication on regulatory sequences and find that sweeps are significantly enriched for protein coding genes, proximal regulatory elements of genes and genome features associated with active transcription. Finally, we find individual sites displaying strong allele frequency divergence are enriched for the same regulatory features. Our data demonstrate that remodelling of gene expression is likely to have been one of the evolutionary forces that drove phenotypic diversification of this common livestock species.
Light water reactor lower head failure analysis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rempe, J.L.; Chavez, S.A.; Thinnes, G.L.
1993-10-01
This document presents the results from a US Nuclear Regulatory Commission-sponsored research program to investigate the mode and timing of vessel lower head failure. Major objectives of the analysis were to identify plausible failure mechanisms and to develop a method for determining which failure mode would occur first in different light water reactor designs and accident conditions. Failure mechanisms, such as tube ejection, tube rupture, global vessel failure, and localized vessel creep rupture, were studied. Newly developed models and existing models were applied to predict which failure mechanism would occur first in various severe accident scenarios. So that a broadermore » range of conditions could be considered simultaneously, calculations relied heavily on models with closed-form or simplified numerical solution techniques. Finite element techniques-were employed for analytical model verification and examining more detailed phenomena. High-temperature creep and tensile data were obtained for predicting vessel and penetration structural response.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chris Amemiya
2003-04-01
The goals of this project were to isolate, characterize, and sequence the Dlx3/Dlx7 bigene cluster from twelve different species of mammals. The Dlx3 and Dlx7 genes are known to encode homeobox transcription factors involved in patterning of structures in the vertebrate jaw as well as vertebrate limbs. Genomic sequences from the respective taxa will subsequently be compared in order to identify conserved non-coding sequences that are potential cis-regulatory elements. Based on the comparisons they will fashion transgenic mouse experiments to functionally test the strength of the potential cis-regulatory elements. A goal of the project is to attempt to identify thosemore » elements that may function in coordinately regulating both Dlx3 and Dlx7 functions.« less
Gomes, S; Civetta, A
2014-09-01
Hybrid male sterility is a common outcome of crosses between different species. Gene expression studies have found that a number of spermatogenesis genes are differentially expressed in sterile hybrid males, compared with parental species. Late-stage sperm development genes are particularly likely to be misexpressed, with fewer early-stage genes affected. Thus, a link has been posited between misexpression and sterility. A more recent alternative explanation for hybrid gene misexpression has been that it is independent of sterility and driven by divergent evolution of male-specific regulatory elements between species (faster male hypothesis). The faster male hypothesis predicts that misregulation of spermatogenesis genes should be independent of sterility and approximately the same in both hybrids, whereas sterility should only affect gene expression in sterile hybrids. To test the faster male hypothesis vs. the effect of sterility on gene misexpression, we analyse spermatogenesis gene expression in different species pairs of the Drosophila phylogeny, where hybrid male sterility occurs in only one direction of the interspecies cross (i.e. unidirectional sterility). We find significant differences among genes in misexpression with effects that are lineage-specific and caused by sterility or fast male regulatory divergence. © 2014 The Authors. Journal of Evolutionary Biology © 2014 European Society For Evolutionary Biology.
Yorio, Patrick L; Willmer, Dana R; Haight, Joel M
2014-08-01
Since the late 1980s, the U.S. Department of Labor has considered regulating a systems approach to occupational health and safety management. Recently, a health and safety management systems (HSMS) standard has returned to the regulatory agenda of both the Occupational Safety and Health Administration (OSHA) and the Mine Safety and Health Administration (MSHA). Because a mandated standard has implications for both industry and regulating bodies alike, it is imperative to gain a greater understanding of the potential effects that an HSMS regulatory approach can have on establishment-level injuries and illnesses. Through the lens of MSHA's regulatory framework, we first explore how current enforcement activities align with HSMS elements. Using MSHA data for the years 2003-2010, we then analyze the relationship between various types of enforcement activities (e.g., total number of citations, total penalty amount, and HSMS-aligned citations) and mine reportable injuries. Our findings show that the reduction in mine reportable injuries predicted by increases in MSHA enforcement ranges from negligible to 18%. The results suggest that the type and focus of the enforcement activity may be more important for accident reduction than the total number of citations issued and the associated penalty amount. © 2014 Society for Risk Analysis.
Lucidi, F; Pica, G; Mallia, L; Castrucci, E; Manganelli, S; Bélanger, J J; Pierro, A
2016-06-01
A prospective field study conducted with runners training for an upcoming marathon (Marathon of Rome 2013) examined the relation between regulatory modes, locomotion and assessment, and stress. Integrating regulatory mode theory and the dualistic model of passion, we hypothesized that the relation between regulatory modes (evaluated 3 months before the race) and the experience of stress approaching the marathon, is mediated by the type of passion (harmonious vs obsessive) athletes experience with regard to marathoning. Results revealed that (a) locomotion positively predicted harmonious passion, which in turn reduced athletes' experience of stress; and (b) assessment positively predicted obsessive passion, which in turn enhanced athletes' experience of stress. Overall, the present results suggest that proximal psychological mechanisms such as basic regulatory mode orientations can predict distal outcomes such as stress indirectly through their relation with motivational phenomena such as passion. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
76 FR 38213 - Notice of Issuance of Regulatory Guide
Federal Register 2010, 2011, 2012, 2013, 2014
2011-06-29
... quality standards for using Portland Cement grout to protect prestressing steel from corrosion. The prestressing tendon system of a prestressed concrete containment structure is a principal strength element of... strength elements. Thus, any significant deterioration of the prestressing elements caused by corrosion may...
Jenjaroenpun, Piroon; Chew, Chee Siang; Yong, Tai Pang; Choowongkomon, Kiattawee; Thammasorn, Wimada; Kuznetsov, Vladimir A
2015-01-01
A triplex target DNA site (TTS), a stretch of DNA that is composed of polypurines, is able to form a triple-helix (triplex) structure with triplex-forming oligonucleotides (TFOs) and is able to influence the site-specific modulation of gene expression and/or the modification of genomic DNA. The co-localization of a genomic TTS with gene regulatory signals and functional genome structures suggests that TFOs could potentially be exploited in antigene strategies for the therapy of cancers and other genetic diseases. Here, we present the TTS Mapping and Integration (TTSMI; http://ttsmi.bii.a-star.edu.sg) database, which provides a catalog of unique TTS locations in the human genome and tools for analyzing the co-localization of TTSs with genomic regulatory sequences and signals that were identified using next-generation sequencing techniques and/or predicted by computational models. TTSMI was designed as a user-friendly tool that facilitates (i) fast searching/filtering of TTSs using several search terms and criteria associated with sequence stability and specificity, (ii) interactive filtering of TTSs that co-localize with gene regulatory signals and non-B DNA structures, (iii) exploration of dynamic combinations of the biological signals of specific TTSs and (iv) visualization of a TTS simultaneously with diverse annotation tracks via the UCSC genome browser. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
THE ROLES OF METAL IONS IN REGULATION BY RIBOSWITCHES
2012-01-01
Metal ions are required by all organisms in order to execute an array of essential molecular functions. They play a critical role in many catalytic mechanisms and structural properties. Proper homeostasis of ions is critical; levels that are aberrantly low or high are deleterious to cellular physiology. To maintain stable intracellular pools, metal ion-sensing regulatory (metalloregulatory) proteins couple metal ion concentration fluctuations with expression of genes encoding for cation transport or sequestration. However, these transcriptional-based regulatory strategies are not the only mechanisms by which organisms coordinate metal ions with gene expression. Intriguingly, a few classes of signal-responsive RNA elements have also been discovered to function as metalloregulatory agents. This suggests that RNA-based regulatory strategies can be precisely tuned to intracellular metal ion pools, functionally akin to metalloregulatory proteins. In addition to these metal-sensing regulatory RNAs, there is a yet broader role for metal ions in directly assisting the structural integrity of other signal-responsive regulatory RNA elements. In this chapter, we discuss how the intimate physicochemical relationship between metal ions and nucleic acids is important for the structure and function of metal ion- and metabolite-sensing regulatory RNAs. PMID:22010271
RNA-Seq Based Transcriptional Map of Bovine Respiratory Disease Pathogen “Histophilus somni 2336”
Kumar, Ranjit; Lawrence, Mark L.; Watt, James; Cooksey, Amanda M.; Burgess, Shane C.; Nanduri, Bindu
2012-01-01
Genome structural annotation, i.e., identification and demarcation of the boundaries for all the functional elements in a genome (e.g., genes, non-coding RNAs, proteins and regulatory elements), is a prerequisite for systems level analysis. Current genome annotation programs do not identify all of the functional elements of the genome, especially small non-coding RNAs (sRNAs). Whole genome transcriptome analysis is a complementary method to identify “novel” genes, small RNAs, regulatory regions, and operon structures, thus improving the structural annotation in bacteria. In particular, the identification of non-coding RNAs has revealed their widespread occurrence and functional importance in gene regulation, stress and virulence. However, very little is known about non-coding transcripts in Histophilus somni, one of the causative agents of Bovine Respiratory Disease (BRD) as well as bovine infertility, abortion, septicemia, arthritis, myocarditis, and thrombotic meningoencephalitis. In this study, we report a single nucleotide resolution transcriptome map of H. somni strain 2336 using RNA-Seq method. The RNA-Seq based transcriptome map identified 94 sRNAs in the H. somni genome of which 82 sRNAs were never predicted or reported in earlier studies. We also identified 38 novel potential protein coding open reading frames that were absent in the current genome annotation. The transcriptome map allowed the identification of 278 operon (total 730 genes) structures in the genome. When compared with the genome sequence of a non-virulent strain 129Pt, a disproportionate number of sRNAs (∼30%) were located in genomic region unique to strain 2336 (∼18% of the total genome). This observation suggests that a number of the newly identified sRNAs in strain 2336 may be involved in strain-specific adaptations. PMID:22276113
RNA-seq based transcriptional map of bovine respiratory disease pathogen "Histophilus somni 2336".
Kumar, Ranjit; Lawrence, Mark L; Watt, James; Cooksey, Amanda M; Burgess, Shane C; Nanduri, Bindu
2012-01-01
Genome structural annotation, i.e., identification and demarcation of the boundaries for all the functional elements in a genome (e.g., genes, non-coding RNAs, proteins and regulatory elements), is a prerequisite for systems level analysis. Current genome annotation programs do not identify all of the functional elements of the genome, especially small non-coding RNAs (sRNAs). Whole genome transcriptome analysis is a complementary method to identify "novel" genes, small RNAs, regulatory regions, and operon structures, thus improving the structural annotation in bacteria. In particular, the identification of non-coding RNAs has revealed their widespread occurrence and functional importance in gene regulation, stress and virulence. However, very little is known about non-coding transcripts in Histophilus somni, one of the causative agents of Bovine Respiratory Disease (BRD) as well as bovine infertility, abortion, septicemia, arthritis, myocarditis, and thrombotic meningoencephalitis. In this study, we report a single nucleotide resolution transcriptome map of H. somni strain 2336 using RNA-Seq method.The RNA-Seq based transcriptome map identified 94 sRNAs in the H. somni genome of which 82 sRNAs were never predicted or reported in earlier studies. We also identified 38 novel potential protein coding open reading frames that were absent in the current genome annotation. The transcriptome map allowed the identification of 278 operon (total 730 genes) structures in the genome. When compared with the genome sequence of a non-virulent strain 129Pt, a disproportionate number of sRNAs (∼30%) were located in genomic region unique to strain 2336 (∼18% of the total genome). This observation suggests that a number of the newly identified sRNAs in strain 2336 may be involved in strain-specific adaptations.
Favorable genomic environments for cis-regulatory evolution: A novel theoretical framework.
Maeso, Ignacio; Tena, Juan J
2016-09-01
Cis-regulatory changes are arguably the primary evolutionary source of animal morphological diversity. With the recent explosion of genome-wide comparisons of the cis-regulatory content in different animal species is now possible to infer general principles underlying enhancer evolution. However, these studies have also revealed numerous discrepancies and paradoxes, suggesting that the mechanistic causes and modes of cis-regulatory evolution are still not well understood and are probably much more complex than generally appreciated. Here, we argue that the mutational mechanisms and genomic regions generating new regulatory activities must comply with the constraints imposed by the molecular properties of cis-regulatory elements (CREs) and the organizational features of long-range chromatin interactions. Accordingly, we propose a new integrative evolutionary framework for cis-regulatory evolution based on two major premises for the origin of novel enhancer activity: (i) an accessible chromatin environment and (ii) compatibility with the 3D structure and interactions of pre-existing CREs. Mechanisms and DNA sequences not fulfilling these premises, will be less likely to have a measurable impact on gene expression and as such, will have a minor contribution to the evolution of gene regulation. Finally, we discuss current comparative cis-regulatory data under the light of this new evolutionary model, and propose that the two most prominent mechanisms for the evolution of cis-regulatory changes are the overprinting of ancestral CREs and the exaptation of transposable elements. Copyright © 2015 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mitchell, Hugh D.; Eisfeld, Amie J.; Sims, Amy
Respiratory infections stemming from influenza viruses and the Severe Acute Respiratory Syndrome corona virus (SARS-CoV) represent a serious public health threat as emerging pandemics. Despite efforts to identify the critical interactions of these viruses with host machinery, the key regulatory events that lead to disease pathology remain poorly targeted with therapeutics. Here we implement an integrated network interrogation approach, in which proteome and transcriptome datasets from infection of both viruses in human lung epithelial cells are utilized to predict regulatory genes involved in the host response. We take advantage of a novel “crowd-based” approach to identify and combine ranking metricsmore » that isolate genes/proteins likely related to the pathogenicity of SARS-CoV and influenza virus. Subsequently, a multivariate regression model is used to compare predicted lung epithelial regulatory influences with data derived from other respiratory virus infection models. We predicted a small set of regulatory factors with conserved behavior for consideration as important components of viral pathogenesis that might also serve as therapeutic targets for intervention. Our results demonstrate the utility of integrating diverse ‘omic datasets to predict and prioritize regulatory features conserved across multiple pathogen infection models.« less
2010-01-01
Background Cinnamoyl CoA reductase (CCR) and cinnamyl alcohol dehydrogenase (CAD) catalyze the final steps in the biosynthesis of monolignols, the monomeric units of the phenolic lignin polymers which confer rigidity, imperviousness and resistance to biodegradation to cell walls. We have previously shown that the Eucalyptus gunnii CCR and CAD2 promoters direct similar expression patterns in vascular tissues suggesting that monolignol production is controlled, at least in part, by the coordinated transcriptional regulation of these two genes. Although consensus motifs for MYB transcription factors occur in most gene promoters of the whole phenylpropanoid pathway, functional evidence for their contribution to promoter activity has only been demonstrated for a few of them. Here, in the lignin-specific branch, we studied the functional role of MYB elements as well as other cis-elements identified in the regulatory regions of EgCAD2 and EgCCR promoters, in the transcriptional activity of these gene promoters. Results By using promoter deletion analysis and in vivo footprinting, we identified an 80 bp regulatory region in the Eucalyptus gunnii EgCAD2 promoter that contains two MYB elements, each arranged in a distinct module with newly identified cis-elements. A directed mutagenesis approach was used to introduce block mutations in all putative cis-elements of the EgCAD2 promoter and in those of the 50 bp regulatory region previously delineated in the EgCCR promoter. We showed that the conserved MYB elements in EgCAD2 and EgCCR promoters are crucial both for the formation of DNA-protein complexes in EMSA experiments and for the transcriptional activation of EgCAD2 and EgCCR promoters in vascular tissues in planta. In addition, a new regulatory cis-element that modulates the balance between two DNA-protein complexes in vitro was found to be important for EgCAD2 expression in the cambial zone. Conclusions Our assignment of functional roles to the identified cis-elements clearly demonstrates the importance of MYB cis-elements in the transcriptional regulation of two genes of the lignin-specific pathway and support the hypothesis that MYB elements serve as a common means for the coordinated regulation of genes in the entire lignin biosynthetic pathway. PMID:20584286
Cis-regulatory Elements and Human Evolution
Siepel, Adam
2014-01-01
Modification of gene regulation has long been considered an important force in human evolution, particularly through changes to cis-regulatory elements (CREs) that function in transcriptional regulation. For decades, however, the study of cis-regulatory evolution was severely limited by the available data. New data sets describing the locations of CREs and genetic variation within and between species have now made it possible to study CRE evolution much more directly on a genome-wide scale. Here, we review recent research on the evolution of CREs in humans based on large-scale genomic data sets. We consider inferences based on primate divergence, human polymorphism, and combinations of divergence and polymorphism. We then consider “new frontiers” in this field stemming from recent research on transcriptional regulation. PMID:25218861
Lactase non-persistence is directed by DNA variation-dependent epigenetic aging
Labrie, Viviane; Buske, Orion J; Oh, Edward; Jeremian, Richie; Ptak, Carolyn; Gasiūnas, Giedrius; Maleckas, Almantas; Petereit, Rūta; Žvirbliene, Aida; Adamonis, Kęstutis; Kriukienė, Edita; Koncevičius, Karolis; Gordevičius, Juozas; Nair, Akhil; Zhang, Aiping; Ebrahimi, Sasha; Oh, Gabriel; Šikšnys, Virginijus; Kupčinskas, Limas; Brudno, Michael; Petronis, Arturas
2016-01-01
Inability to digest lactose due to lactase non-persistence is a common trait in adult mammals, with the exception of certain human populations that exhibit lactase persistence. It is not clear how the lactase gene can be dramatically downregulated with age in most individuals, but remains active in some. We performed a comprehensive epigenetic study of the human and mouse intestine using chromosome-wide DNA modification profiling and targeted bisulfite sequencing. Epigenetically-controlled regulatory elements were found to account for the differences in lactase mRNA levels between individuals, intestinal cell types and species. The importance of these regulatory elements in modulating lactase mRNA levels was confirmed by CRISPR-Cas9-induced deletions. Genetic factors contribute to epigenetic changes occurring with age at the regulatory elements, as lactase persistence- and non-persistence-DNA haplotypes demonstrated markedly different epigenetic aging. Thus, genetic factors facilitate a gradual accumulation of epigenetic changes with age to affect phenotypic outcome. PMID:27159559
Laurette, Patrick; Strub, Thomas; Koludrovic, Dana; Keime, Céline; Le Gras, Stéphanie; Seberg, Hannah; Van Otterloo, Eric; Imrichova, Hana; Siddaway, Robert; Aerts, Stein; Cornell, Robert A; Mengus, Gabrielle; Davidson, Irwin
2015-03-24
Microphthalmia-associated transcription factor (MITF) is the master regulator of the melanocyte lineage. To understand how MITF regulates transcription, we used tandem affinity purification and mass spectrometry to define a comprehensive MITF interactome identifying novel cofactors involved in transcription, DNA replication and repair, and chromatin organisation. We show that MITF interacts with a PBAF chromatin remodelling complex comprising BRG1 and CHD7. BRG1 is essential for melanoma cell proliferation in vitro and for normal melanocyte development in vivo. MITF and SOX10 actively recruit BRG1 to a set of MITF-associated regulatory elements (MAREs) at active enhancers. Combinations of MITF, SOX10, TFAP2A, and YY1 bind between two BRG1-occupied nucleosomes thus defining both a signature of transcription factors essential for the melanocyte lineage and a specific chromatin organisation of the regulatory elements they occupy. BRG1 also regulates the dynamics of MITF genomic occupancy. MITF-BRG1 interplay thus plays an essential role in transcription regulation in melanoma.
Genomic deletion of a long-range bone enhancer misregulatessclerostin in Van Buchem disease
DOE Office of Scientific and Technical Information (OSTI.GOV)
Loots, Gabriela G.; Kneissel, Michaela; Keller, Hansjoerg
2005-04-15
Mutations in distant regulatory elements can negatively impact human development and health, yet due to the difficulty of detecting these critical sequences we predominantly focus on coding sequences for diagnostic purposes. We have undertaken a comparative sequence-based approach to characterize a large noncoding region deleted in patients affected by Van Buchem disease (VB), a severe sclerosing bone dysplasia. Using BAC recombination and transgenesis we characterized the expression of human sclerostin (sost) from normal (hSOSTwt) or Van Buchem(hSOSTvb D) alleles. Only the hSOSTwt allele faithfully expressed high levels of human sost in the adult bone and impacted bone metabolism, consistent withmore » the model that the VB noncoding deletion removes a sost specific regulatory element. By exploiting cross-species sequence comparisons with in vitro and in vivo enhancer assays we were able to identify a candidate enhancer element that drives human sost expression in osteoblast-like cell lines in vitro and in the skeletal anlage of the E14.5 mouse embryo, and discovered a novel function for sclerostin during limb development. Our approach represents a framework for characterizing distant regulatory elements associated with abnormal human phenotypes.« less
Functional autonomy of distant-acting human enhancers
DOE Office of Scientific and Technical Information (OSTI.GOV)
Visel, Axel; Akiyama, Jennifer A.; Shoukry, Malak
2009-02-19
Many human genes are associated with dispersed arrays of transcriptional enhancers that regulate their expression in time and space. Studies in invertebrate model systems have suggested that these elements function as discrete and independent regulatory units, but the in vivo combinatorial properties of vertebrate enhancers remain poorly understood. To explore the modularity and regulatory autonomy of human developmental enhancers, we experimentally concatenated up to four enhancers from different genes and used a transgenic mouse assay to compare the in vivo activity of these compound elements with that of the single modules. In all of the six different combinations of elementsmore » tested, the reporter gene activity patterns were additive without signs of interference between the individual modules, indicating that regulatory specificity was maintained despite the presence of closely-positioned heterologous enhancers. Even in cases where two elements drove expression in close anatomical proximity, such as within neighboring subregions of the developing limb bud, the compound patterns did not show signs of cross-inhibition between individual elements or novel expression sites. These data indicate that human developmental enhancers are highly modular and functionally autonomous and suggest that genomic enhancer shuffling may have contributed to the evolution of complex gene expression patterns in vertebrates« less
Mink, S; Härtig, E; Jennewein, P; Doppler, W; Cato, A C
1992-01-01
Mouse mammary tumor virus (MMTV) is a milk-transmitted retrovirus involved in the neoplastic transformation of mouse mammary gland cells. The expression of this virus is regulated by mammary cell type-specific factors, steroid hormones, and polypeptide growth factors. Sequences for mammary cell-specific expression are located in an enhancer element in the extreme 5' end of the long terminal repeat region of this virus. This enhancer, when cloned in front of the herpes simplex thymidine kinase promoter, endows the promoter with mammary cell-specific response. Using functional and DNA-protein-binding studies with constructs mutated in the MMTV long terminal repeat enhancer, we have identified two main regulatory elements necessary for the mammary cell-specific response. These elements consist of binding sites for a transcription factor in the family of CTF/NFI proteins and the transcription factor mammary cell-activating factor (MAF) that recognizes the sequence G Pu Pu G C/G A A G G/T. Combinations of CTF/NFI- and MAF-binding sites or multiple copies of either one of these binding sites but not solitary binding sites mediate mammary cell-specific expression. The functional activities of these two regulatory elements are enhanced by another factor that binds to the core sequence ACAAAG. Interdigitated binding sites for CTF/NFI, MAF, and/or the ACAAAG factor are also found in the 5' upstream regions of genes encoding whey milk proteins from different species. These findings suggest that mammary cell-specific regulation is achieved by a concerted action of factors binding to multiple regulatory sites. Images PMID:1328867
Wong, S W; Schaffer, P A
1991-05-01
Like other DNA-containing viruses, the three origins of herpes simplex virus type 1 (HSV-1) DNA replication are flanked by sequences containing transcriptional regulatory elements. In a transient plasmid replication assay, deletion of sequences comprising the transcriptional regulatory elements of ICP4 and ICP22/47, which flank oriS, resulted in a greater than 80-fold decrease in origin function compared with a plasmid, pOS-822, which retains these sequences. In an effort to identify specific cis-acting elements responsible for this effect, we conducted systematic deletion analysis of the flanking region with plasmid pOS-822 and tested the resulting mutant plasmids for origin function. Stimulation by cis-acting elements was shown to be both distance and orientation dependent, as changes in either parameter resulted in a decrease in oriS function. Additional evidence for the stimulatory effect of flanking sequences on origin function was demonstrated by replacement of these sequences with the cytomegalovirus immediate-early promoter, resulting in nearly wild-type levels of oriS function. In competition experiments, cotransfection of cells with the test plasmid, pOS-822, and increasing molar concentrations of a competitor plasmid which contained the ICP4 and ICP22/47 transcriptional regulatory regions but lacked core origin sequences resulted in a significant reduction in the replication efficiency of pOS-822, demonstrating that factors which bind specifically to the oriS-flanking sequences are likely involved as auxiliary proteins in oriS function. Together, these studies demonstrate that trans-acting factors and the sites to which they bind play a critical role in the efficiency of HSV-1 DNA replication from oriS in transient-replication assays.
Volkmann, Bethany A.; Zinkevich, Natalya S.; Mustonen, Aki; Schilter, Kala F.; Bosenko, Dmitry V.; Reis, Linda M.; Broeckel, Ulrich; Link, Brian A.
2011-01-01
Purpose. Mutations in PITX2 are associated with Axenfeld-Rieger syndrome (ARS), which involves ocular, dental, and umbilical abnormalities. Identification of cis-regulatory elements of PITX2 is important to better understand the mechanisms of disease. Methods. Conserved noncoding elements surrounding PITX2/pitx2 were identified and examined through transgenic analysis in zebrafish; expression pattern was studied by in situ hybridization. Patient samples were screened for deletion/duplication of the PITX2 upstream region using arrays and probes. Results. Zebrafish pitx2 demonstrates conserved expression during ocular and craniofacial development. Thirteen conserved noncoding sequences positioned within a gene desert as far as 1.1 Mb upstream of the human PITX2 gene were identified; 11 have enhancer activities consistent with pitx2 expression. Ten elements mediated expression in the developing brain, four regions were active during eye formation, and two sequences were associated with craniofacial expression. One region, CE4, located approximately 111 kb upstream of PITX2, directed a complex pattern including expression in the developing eye and craniofacial region, the classic sites affected in ARS. Screening of ARS patients identified an approximately 7600-kb deletion that began 106 to 108 kb upstream of the PITX2 gene, leaving PITX2 intact while removing regulatory elements CE4 to CE13. Conclusions. These data suggest the presence of a complex distant regulatory matrix within the gene desert located upstream of PITX2 with an essential role in its activity and provides a possible mechanism for the previous reports of ARS in patients with balanced translocations involving the 4q25 region upstream of PITX2 and the current patient with an upstream deletion. PMID:20881290
Oelze, I; Rittner, K; Sczakiel, G
1994-01-01
Adeno-associated virus type 2 (AAV-2), a human parvovirus which is apathogenic in adults, inhibits replication and gene expression of human immunodeficiency virus type 1 (HIV-1) in human cells. The rep gene of AAV-2, which was shown earlier to be sufficient for this negative interference, also down-regulated the expression of heterologous sequences driven by the long terminal repeat (LTR) of HIV-1. This effect was observed in the absence of the HIV-1 transactivator Tat, i.e., at basal levels of LTR-driven transcription. In this work, we studied the involvement of functional subsequences of the HIV-1 LTR in rep-mediated inhibition in the absence of Tat. Mutated LTRs driving an indicator gene (cat) were cointroduced into human SW480 cells together with rep alone or with double-stranded DNA fragments or RNA containing sequences of the HIV-1 LTR. The results indicate that rep strongly enhances the function of negative regulatory elements of the LTR. In addition, the experiments revealed a transcribed sequence element located within the TAR-coding sequence termed AHHH (AAV-HIV homology element derived from HIV-1) which is involved in rep-mediated inhibition. The AHHH element is also involved in down-regulation of basal expression levels in the absence of rep, suggesting that AHHH also contributes to negative regulatory functions of the LTR of HIV-1. In contrast, positive regulatory elements of the HIV-1 LTR such as the NF kappa B and SP1 binding sites have no significant influence on the rep-mediated inhibition. Images PMID:8289357
Smith, Emily M; Lajoie, Bryan R; Jain, Gaurav; Dekker, Job
2016-01-07
Three-dimensional genome structure plays an important role in gene regulation. Globally, chromosomes are organized into active and inactive compartments while, at the gene level, looping interactions connect promoters to regulatory elements. Topologically associating domains (TADs), typically several hundred kilobases in size, form an intermediate level of organization. Major questions include how TADs are formed and how they are related to looping interactions between genes and regulatory elements. Here we performed a focused 5C analysis of a 2.8 Mb chromosome 7 region surrounding CFTR in a panel of cell types. We find that the same TAD boundaries are present in all cell types, indicating that TADs represent a universal chromosome architecture. Furthermore, we find that these TAD boundaries are present irrespective of the expression and looping of genes located between them. In contrast, looping interactions between promoters and regulatory elements are cell-type specific and occur mostly within TADs. This is exemplified by the CFTR promoter that in different cell types interacts with distinct sets of distal cell-type-specific regulatory elements that are all located within the same TAD. Finally, we find that long-range associations between loci located in different TADs are also detected, but these display much lower interaction frequencies than looping interactions within TADs. Interestingly, interactions between TADs are also highly cell-type-specific and often involve loci clustered around TAD boundaries. These data point to key roles of invariant TAD boundaries in constraining as well as mediating cell-type-specific long-range interactions and gene regulation. Copyright © 2016 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Pocock, Ginger M; Zimdars, Laraine L; Yuan, Ming; Eliceiri, Kevin W; Ahlquist, Paul; Sherer, Nathan M
2017-02-01
Cis-acting RNA structural elements govern crucial aspects of viral gene expression. How these structures and other posttranscriptional signals affect RNA trafficking and translation in the context of single cells is poorly understood. Herein we describe a multicolor, long-term (>24 h) imaging strategy for measuring integrated aspects of viral RNA regulatory control in individual cells. We apply this strategy to demonstrate differential mRNA trafficking behaviors governed by RNA elements derived from three retroviruses (HIV-1, murine leukemia virus, and Mason-Pfizer monkey virus), two hepadnaviruses (hepatitis B virus and woodchuck hepatitis virus), and an intron-retaining transcript encoded by the cellular NXF1 gene. Striking behaviors include "burst" RNA nuclear export dynamics regulated by HIV-1's Rev response element and the viral Rev protein; transient aggregations of RNAs into discrete foci at or near the nuclear membrane triggered by multiple elements; and a novel, pulsiform RNA export activity regulated by the hepadnaviral posttranscriptional regulatory element. We incorporate single-cell tracking and a data-mining algorithm into our approach to obtain RNA element-specific, high-resolution gene expression signatures. Together these imaging assays constitute a tractable, systems-based platform for studying otherwise difficult to access spatiotemporal features of viral and cellular gene regulation. © 2017 Pocock et al. This article is distributed by The American Society for Cell Biology under license from the author(s). Two months after publication it is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).
Evolutionary divergence of vertebrate Hoxb2 expression patterns and transcriptional regulatory loci.
Scemama, Jean-Luc; Hunter, Michael; McCallum, Jeff; Prince, Victoria; Stellwag, Edmund
2002-10-15
Hox gene expression is regulated by a complex array of cis-acting elements that control spatial and temporal gene expression in developing embryos. Here, we report the isolation of the striped bass Hoxb2a gene, comparison of its expression to the orthologous gene from zebrafish, and comparative genomic analysis of the upstream regulatory region to that of other vertebrates. Comparison of the Hoxb2a gene expression patterns from striped bass to zebrafish revealed similar expression patterns within rhombomeres 3, 4, and 5 of the hindbrain but a notable absence of expression in neural crest tissues of striped bass while neural crest expression is observed in zebrafish and common to other vertebrates. Comparative genomic analysis of the striped bass Hoxb2a-b3a intergenic region to those from zebrafish, pufferfish, human, and mouse demonstrated the presence of common Meis, Hox/Pbx, Krox-20, and Box 1 elements, which are necessary for rhombomere 3, 4, and 5 expression. Despite their common occurrence, the location and orientation of these transcription elements differed among the five species analyzed, such that Krox-20 and Box 1 elements are located 3' to the Meis, Hox/Pbx elements in striped bass, pufferfish, and human while they are located 5' of this r4 enhancer in zebrafish and mouse. Our results suggest that the plasticity exhibited in the organization of key regulatory elements responsible for rhombomere-specific Hoxb2a expression may reflect the effects of stabilizing selection in the evolution cis-acting elements. Copyright 2002 Wiley-Liss, Inc.
vonHoldt, Bridgett M; Ji, Sarah S; Aardema, Matthew L; Stahler, Daniel; Udell, Monique A R; Sinsheimer, Janet S
2018-06-01
In canines, transposon dynamics have been associated with a hyper-social behavioral syndrome, although the functional mechanism has yet to be described. We investigate the epigenetic and transcriptional consequences of these behavior-associated mobile element insertions in dogs and Yellowstone wolves. We posit that the transposons themselves may not be the causative feature; rather, their transcriptional regulation may exert the functional impact. We survey four outlier transposons associated with hyper-sociability, with the expectation that they are targeted for epigenetic silencing. We predict hyper-methylation of mobile element insertions (MEIs), suggestive that the epigenetic silencing of and not the MEIs themselves may be driving dysregulation of nearby genes. We found that transposon-derived sequences are significantly hyper-methylated, regardless of their copy number or species. Further, we have assessed transcriptome sequence data and found evidence that mobile element insertions impact the expression levels of six genes (WBSCR17, LIMK1, GTF2I, WBSCR27, BAZ1B, and BCL7B), all of which have known roles in human Williams-Beuren syndrome due to changes in copy number, typically hemizygosity. Although further evidence is needed, our results suggest that a few insertions alter local expression at multiple genes, likely through a cis-regulatory mechanism that excludes proximal methylation.
Six Degree-of-Freedom Measurements of Human Mild Traumatic Brain Injury.
Hernandez, Fidel; Wu, Lyndia C; Yip, Michael C; Laksari, Kaveh; Hoffman, Andrew R; Lopez, Jaime R; Grant, Gerald A; Kleiven, Svein; Camarillo, David B
2015-08-01
This preliminary study investigated whether direct measurement of head rotation improves prediction of mild traumatic brain injury (mTBI). Although many studies have implicated rotation as a primary cause of mTBI, regulatory safety standards use 3 degree-of-freedom (3DOF) translation-only kinematic criteria to predict injury. Direct 6DOF measurements of human head rotation (3DOF) and translation (3DOF) have not been previously available to examine whether additional DOFs improve injury prediction. We measured head impacts in American football, boxing, and mixed martial arts using 6DOF instrumented mouthguards, and predicted clinician-diagnosed injury using 12 existing kinematic criteria and 6 existing brain finite element (FE) criteria. Among 513 measured impacts were the first two 6DOF measurements of clinically diagnosed mTBI. For this dataset, 6DOF criteria were the most predictive of injury, more than 3DOF translation-only and 3DOF rotation-only criteria. Peak principal strain in the corpus callosum, a 6DOF FE criteria, was the strongest predictor, followed by two criteria that included rotation measurements, peak rotational acceleration magnitude and Head Impact Power (HIP). These results suggest head rotation measurements may improve injury prediction. However, more 6DOF data is needed to confirm this evaluation of existing injury criteria, and to develop new criteria that considers directional sensitivity to injury.
Integrative analyses shed new light on human ribosomal protein gene regulation
Li, Xin; Zheng, Yiyu; Hu, Haiyan; Li, Xiaoman
2016-01-01
Ribosomal protein genes (RPGs) are important house-keeping genes that are well-known for their coordinated expression. Previous studies on RPGs are largely limited to their promoter regions. Recent high-throughput studies provide an unprecedented opportunity to study how human RPGs are transcriptionally modulated and how such transcriptional regulation may contribute to the coordinate gene expression in various tissues and cell types. By analyzing the DNase I hypersensitive sites under 349 experimental conditions, we predicted 217 RPG regulatory regions in the human genome. More than 86.6% of these computationally predicted regulatory regions were partially corroborated by independent experimental measurements. Motif analyses on these predicted regulatory regions identified 31 DNA motifs, including 57.1% of experimentally validated motifs in literature that regulate RPGs. Interestingly, we observed that the majority of the predicted motifs were shared by the predicted distal and proximal regulatory regions of the same RPGs, a likely general mechanism for enhancer-promoter interactions. We also found that RPGs may be differently regulated in different cells, indicating that condition-specific RPG regulatory regions still need to be discovered and investigated. Our study advances the understanding of how RPGs are coordinately modulated, which sheds light to the general principles of gene transcriptional regulation in mammals. PMID:27346035
Integrative analyses shed new light on human ribosomal protein gene regulation.
Li, Xin; Zheng, Yiyu; Hu, Haiyan; Li, Xiaoman
2016-06-27
Ribosomal protein genes (RPGs) are important house-keeping genes that are well-known for their coordinated expression. Previous studies on RPGs are largely limited to their promoter regions. Recent high-throughput studies provide an unprecedented opportunity to study how human RPGs are transcriptionally modulated and how such transcriptional regulation may contribute to the coordinate gene expression in various tissues and cell types. By analyzing the DNase I hypersensitive sites under 349 experimental conditions, we predicted 217 RPG regulatory regions in the human genome. More than 86.6% of these computationally predicted regulatory regions were partially corroborated by independent experimental measurements. Motif analyses on these predicted regulatory regions identified 31 DNA motifs, including 57.1% of experimentally validated motifs in literature that regulate RPGs. Interestingly, we observed that the majority of the predicted motifs were shared by the predicted distal and proximal regulatory regions of the same RPGs, a likely general mechanism for enhancer-promoter interactions. We also found that RPGs may be differently regulated in different cells, indicating that condition-specific RPG regulatory regions still need to be discovered and investigated. Our study advances the understanding of how RPGs are coordinately modulated, which sheds light to the general principles of gene transcriptional regulation in mammals.
Identification of cyanobacterial non-coding RNAs by comparative genome analysis.
Axmann, Ilka M; Kensche, Philip; Vogel, Jörg; Kohl, Stefan; Herzel, Hanspeter; Hess, Wolfgang R
2005-01-01
Whole genome sequencing of marine cyanobacteria has revealed an unprecedented degree of genomic variation and streamlining. With a size of 1.66 megabase-pairs, Prochlorococcus sp. MED4 has the most compact of these genomes and it is enigmatic how the few identified regulatory proteins efficiently sustain the lifestyle of an ecologically successful marine microorganism. Small non-coding RNAs (ncRNAs) control a plethora of processes in eukaryotes as well as in bacteria; however, systematic searches for ncRNAs are still lacking for most eubacterial phyla outside the enterobacteria. Based on a computational prediction we show the presence of several ncRNAs (cyanobacterial functional RNA or Yfr) in several different cyanobacteria of the Prochlorococcus-Synechococcus lineage. Some ncRNA genes are present only in two or three of the four strains investigated, whereas the RNAs Yfr2 through Yfr5 are structurally highly related and are encoded by a rapidly evolving gene family as their genes exist in different copy numbers and at different sites in the four investigated genomes. One ncRNA, Yfr7, is present in at least seven other cyanobacteria. In addition, control elements for several ribosomal operons were predicted as well as riboswitches for thiamine pyrophosphate and cobalamin. This is the first genome-wide and systematic screen for ncRNAs in cyanobacteria. Several ncRNAs were both computationally predicted and their presence was biochemically verified. These RNAs may have regulatory functions and each shows a distinct phylogenetic distribution. Our approach can be applied to any group of microorganisms for which more than one total genome sequence is available for comparative analysis.
Wang, Yan; Wang, Lei; Cui, Xianghua; Fang, Yuan; Chen, Qianqiu; Wang, Ya; Qiang, Yao
2015-12-01
Self-regulatory resources and trait self-control have been found to moderate the impulse-behavior relationship. The current study investigated whether the interaction of self-regulatory resources and trait self-control moderates the association between implicit attitudes and food consumption. One hundred twenty female participants were randomly assigned to either a depletion condition in which their self-regulatory resources were reduced or a no-depletion condition. Participants' implicit attitudes for chocolate were measured with the Single Category Implicit Association Test and self-report measures of trait self-control were collected. The dependent variable was chocolate consumption in an ostensible taste and rate task. Implicit attitudes predicted chocolate consumption in depleted participants but not in non-depleted participants. However, this predictive power of implicit attitudes on eating in depleted condition disappeared in participants with high trait self-control. Thus, trait self-control and self-regulatory resources interact to moderate the prediction of implicit attitude on eating behavior. Results suggest that high trait self-control buffers the effect of self-regulatory depletion on impulsive eating. Copyright © 2015 Elsevier Ltd. All rights reserved.
Tarlochan, Faris; Mehboob, Hassan; Mehboob, Ali; Chang, Seung-Hwan
2018-06-01
Cementless hip prostheses with porous outer coating are commonly used to repair the proximally damaged femurs. It has been demonstrated that stability of prosthesis is also highly dependent on the bone ingrowth into the porous texture. Bone ingrowth is influenced by the mechanical environment produced in the callus. In this study, bone ingrowth into the porous structure was predicted by using a mechano-regulatory model. Homogenously distributed pores (200 and 800 [Formula: see text]m in diameter) and functionally graded pores along the length of the prosthesis were introduced as a porous coating. Bone ingrowth was simulated using 25 and 12 [Formula: see text]m micromovements. Load control simulations were carried out instead of traditionally used displacement control. Spatial and temporal distributions of tissues were predicted in all cases. Functionally graded pore decreasing models gave the most homogenous bone distribution, the highest bone ingrowth (98%) with highest average Young's modulus of all tissue phenotypes approximately 4.1 GPa. Besides this, the volume of the initial callus increased to 8.33% in functionally graded pores as compared to the 200 [Formula: see text]m pore size models which increased the bone volume. These findings indicate that functionally graded porous surface promote bone ingrowth efficiently which can be considered to design of surface texture of hip prosthesis.
[The ENCODE project and functional genomics studies].
Ding, Nan; Qu, Hongzhu; Fang, Xiangdong
2014-03-01
Upon the completion of the Human Genome Project, scientists have been trying to interpret the underlying genomic code for human biology. Since 2003, National Human Genome Research Institute (NHGRI) has invested nearly $0.3 billion and gathered over 440 scientists from more than 32 institutions in the United States, China, United Kingdom, Japan, Spain and Singapore to initiate the Encyclopedia of DNA Elements (ENCODE) project, aiming to identify and analyze all regulatory elements in the human genome. Taking advantage of the development of next-generation sequencing technologies and continuous improvement of experimental methods, ENCODE had made remarkable achievements: identified methylation and histone modification of DNA sequences and their regulatory effects on gene expression through altering chromatin structures, categorized binding sites of various transcription factors and constructed their regulatory networks, further revised and updated database for pseudogenes and non-coding RNA, and identified SNPs in regulatory sequences associated with diseases. These findings help to comprehensively understand information embedded in gene and genome sequences, the function of regulatory elements as well as the molecular mechanism underlying the transcriptional regulation by noncoding regions, and provide extensive data resource for life sciences, particularly for translational medicine. We re-viewed the contributions of high-throughput sequencing platform development and bioinformatical technology improve-ment to the ENCODE project, the association between epigenetics studies and the ENCODE project, and the major achievement of the ENCODE project. We also provided our prospective on the role of the ENCODE project in promoting the development of basic and clinical medicine.
Developmental gene regulatory network architecture across 500 million years of echinoderm evolution
NASA Technical Reports Server (NTRS)
Hinman, Veronica F.; Nguyen, Albert T.; Cameron, R. Andrew; Davidson, Eric H.
2003-01-01
Evolutionary change in morphological features must depend on architectural reorganization of developmental gene regulatory networks (GRNs), just as true conservation of morphological features must imply retention of ancestral developmental GRN features. Key elements of the provisional GRN for embryonic endomesoderm development in the sea urchin are here compared with those operating in embryos of a distantly related echinoderm, a starfish. These animals diverged from their common ancestor 520-480 million years ago. Their endomesodermal fate maps are similar, except that sea urchins generate a skeletogenic cell lineage that produces a prominent skeleton lacking entirely in starfish larvae. A relevant set of regulatory genes was isolated from the starfish Asterina miniata, their expression patterns determined, and effects on the other genes of perturbing the expression of each were demonstrated. A three-gene feedback loop that is a fundamental feature of the sea urchin GRN for endoderm specification is found in almost identical form in the starfish: a detailed element of GRN architecture has been retained since the Cambrian Period in both echinoderm lineages. The significance of this retention is highlighted by the observation of numerous specific differences in the GRN connections as well. A regulatory gene used to drive skeletogenesis in the sea urchin is used entirely differently in the starfish, where it responds to endomesodermal inputs that do not affect it in the sea urchin embryo. Evolutionary changes in the GRNs since divergence are limited sharply to certain cis-regulatory elements, whereas others have persisted unaltered.
Validating regulatory predictions from diverse bacteria with mutant fitness data
Sagawa, Shiori; Price, Morgan N.; Deutschbauer, Adam M.; ...
2017-05-24
Although transcriptional regulation is fundamental to understanding bacterial physiology, the targets of most bacterial transcription factors are not known. Comparative genomics has been used to identify likely targets of some of these transcription factors, but these predictions typically lack experimental support. Here, we used mutant fitness data, which measures the importance of each gene for a bacterium's growth across many conditions, to test regulatory predictions from RegPrecise, a curated collection of comparative genomics predictions. Because characterized transcription factors often have correlated fitness with one of their targets (either positively or negatively), correlated fitness patterns provide support for the comparative genomicsmore » predictions. At a false discovery rate of 3%, we identified significant cofitness for at least one target of 158 TFs in 107 ortholog groups and from 24 bacteria. Thus, high-throughput genetics can be used to identify a high-confidence subset of the sequence-based regulatory predictions.« less
Validating regulatory predictions from diverse bacteria with mutant fitness data
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sagawa, Shiori; Price, Morgan N.; Deutschbauer, Adam M.
Although transcriptional regulation is fundamental to understanding bacterial physiology, the targets of most bacterial transcription factors are not known. Comparative genomics has been used to identify likely targets of some of these transcription factors, but these predictions typically lack experimental support. Here, we used mutant fitness data, which measures the importance of each gene for a bacterium's growth across many conditions, to test regulatory predictions from RegPrecise, a curated collection of comparative genomics predictions. Because characterized transcription factors often have correlated fitness with one of their targets (either positively or negatively), correlated fitness patterns provide support for the comparative genomicsmore » predictions. At a false discovery rate of 3%, we identified significant cofitness for at least one target of 158 TFs in 107 ortholog groups and from 24 bacteria. Thus, high-throughput genetics can be used to identify a high-confidence subset of the sequence-based regulatory predictions.« less
Hamaji, Takashi; Lopez, David; Pellegrini, Matteo; ...
2016-03-26
Upon fertilization Chlamydomonas reinhardtii zygotes undergo a program of differentiation into a diploid zygospore that is accompanied by transcription of hundreds of zygote-specific genes. We identified a distinct sequence motif we term a zygotic response element (ZYRE) that is highly enriched in promoter regions of C. reinhardtii early zygotic genes. A luciferase reporter assay was used to show that native ZYRE motifs within the promoter of zygotic gene ZYS3 or intron of zygotic gene DMT4 are necessary for zygotic induction. A synthetic luciferase reporter with a minimal promoter was used to show that ZYRE motifs introduced upstream are sufficient tomore » confer zygotic upregulation, and that ZYRE-controlled zygotic transcription is dependent on the homeodomain transcription factor GSP1. Furthermore, we predict that ZYRE motifs will correspond to binding sites for the homeodomain proteins GSP1-GSM1 that heterodimerize and activate zygotic gene expression in early zygotes.« less
Repression of enhancer II activity by a negative regulatory element in the hepatitis B virus genome.
Lo, W Y; Ting, L P
1994-01-01
Enhancer II of human hepatitis B virus has dual functions in vivo. Located at nucleotides (nt) 1646 to 1741, it can stimulate the surface and X promoters from a downstream position. Moreover, the same sequence can also function as upstream regulatory element that activates the core promoter in a position- and orientation-dependent manner. In this study, we report the identification and characterization of a negative regulatory element (NRE) upstream of enhancer II (nt 1613 to 1636) which can repress both the enhancer and upstream stimulatory function of the enhancer II sequence in differentiated liver cells. This NRE has marginal inhibitory effect by itself but a strong repressive function in the presence of a functional enhancer II. Mutational analysis reveals that sequence from nt 1616 to 1621 is required for repression of enhancer activity by the NRE. Gel shift analysis reveals that this negative regulatory region can be recognized by a specific protein factor(s) present at the 0.4 M NaCl fraction of HepG2 nuclear extracts. The discovery of the NRE indicates that HBV gene transcription is controlled by combined effects of both positive and negative regulation. It also provides a unique system with which to study the mechanism of negative regulation of gene expression. Images PMID:8107237
Gordon, Christopher T.; Attanasio, Catia; Bhatia, Shipra; Benko, Sabina; Ansari, Morad; Tan, Tiong Y.; Munnich, Arnold; Pennacchio, Len A.; Abadie, Véronique; Temple, I. Karen; Goldenberg, Alice; van Heyningen, Veronica; Amiel, Jeanne; FitzPatrick, David; Kleinjan, Dirk A.; Visel, Axel; Lyonnet, Stanislas
2015-01-01
Mutations in the coding sequence of SOX9 cause campomelic dysplasia (CD), a disorder of skeletal development associated with 46,XY disorders of sex development (DSDs). Translocations, deletions and duplications within a ~2 Mb region upstream of SOX9 can recapitulate the CD-DSD phenotype fully or partially, suggesting the existence of an unusually large cis-regulatory control region. Pierre Robin sequence (PRS) is a craniofacial disorder that is frequently an endophenotype of CD and a locus for isolated PRS at ~1.2-1.5 Mb upstream of SOX9 has been previously reported. The craniofacial regulatory potential within this locus, and within the greater genomic domain surrounding SOX9, remains poorly defined. We report two novel deletions upstream of SOX9 in families with PRS, allowing refinement of the regions harbouring candidate craniofacial regulatory elements. In parallel, ChIP-Seq for p300 binding sites in mouse craniofacial tissue led to the identification of several novel craniofacial enhancers at the SOX9 locus, which were validated in transgenic reporter mice and zebrafish. Notably, some of the functionally validated elements fall within the PRS deletions. These studies suggest that multiple non-coding elements contribute to the craniofacial regulation of SOX9 expression, and that their disruption results in PRS. PMID:24934569
2010-01-01
Background Regulatory elements that control expression of specific genes during development have been shown in many cases to contain functionally-conserved modules that can be transferred between species and direct gene expression in a comparable developmental pattern. An example of such a module has been identified at the rat myosin light chain (MLC) 1/3 locus, which has been well characterised in transgenic mouse studies. This locus contains two promoters encoding two alternatively spliced isoforms of alkali myosin light chain. These promoters are differentially regulated during development through the activity of two enhancer elements. The MLC3 promoter alone has been shown to confer expression of a reporter gene in skeletal and cardiac muscle in transgenic mice and the addition of the downstream MLC enhancer increased expression levels in skeletal muscle. We asked whether this regulatory module, sufficient for striated muscle gene expression in the mouse, would drive expression in similar domains in the chicken. Results We have observed that a conserved downstream MLC enhancer is present in the chicken MLC locus. We found that the rat MLC1/3 regulatory elements were transcriptionally active in chick skeletal muscle primary cultures. We observed that a single copy lentiviral insert containing this regulatory cassette was able to drive expression of a lacZ reporter gene in the fast-fibres of skeletal muscle in chicken in three independent transgenic chicken lines in a pattern similar to the endogenous MLC locus. Reporter gene expression in cardiac muscle tissues was not observed for any of these lines. Conclusions From these results we conclude that skeletal expression from this regulatory module is conserved in a genomic context between rodents and chickens. This transgenic module will be useful in future investigations of muscle development in avian species. PMID:20184756
Genome-wide comparative analysis reveals human-mouse regulatory landscape and evolution.
Denas, Olgert; Sandstrom, Richard; Cheng, Yong; Beal, Kathryn; Herrero, Javier; Hardison, Ross C; Taylor, James
2015-02-14
Because species-specific gene expression is driven by species-specific regulation, understanding the relationship between sequence and function of the regulatory regions in different species will help elucidate how differences among species arise. Despite active experimental and computational research, relationships among sequence, conservation, and function are still poorly understood. We compared transcription factor occupied segments (TFos) for 116 human and 35 mouse TFs in 546 human and 125 mouse cell types and tissues from the Human and the Mouse ENCODE projects. We based the map between human and mouse TFos on a one-to-one nucleotide cross-species mapper, bnMapper, that utilizes whole genome alignments (WGA). Our analysis shows that TFos are under evolutionary constraint, but a substantial portion (25.1% of mouse and 25.85% of human on average) of the TFos does not have a homologous sequence on the other species; this portion varies among cell types and TFs. Furthermore, 47.67% and 57.01% of the homologous TFos sequence shows binding activity on the other species for human and mouse respectively. However, 79.87% and 69.22% is repurposed such that it binds the same TF in different cells or different TFs in the same cells. Remarkably, within the set of repurposed TFos, the corresponding genome regions in the other species are preferred locations of novel TFos. These events suggest exaptation of some functional regulatory sequences into new function. Despite TFos repurposing, we did not find substantial changes in their predicted target genes, suggesting that CRMs buffer evolutionary events allowing little or no change in the TFos - target gene associations. Thus, the small portion of TFos with strictly conserved occupancy underestimates the degree of conservation of regulatory interactions. We mapped regulatory sequences from an extensive number of TFs and cell types between human and mouse using WGA. A comparative analysis of this correspondence unveiled the extent of the shared regulatory sequence across TFs and cell types under study. Importantly, a large part of the shared regulatory sequence is repurposed on the other species. This sequence, fueled by turnover events, provides a strong case for exaptation in regulatory elements.
CisMapper: predicting regulatory interactions from transcription factor ChIP-seq data
O'Connor, Timothy; Bodén, Mikael
2017-01-01
Abstract Identifying the genomic regions and regulatory factors that control the transcription of genes is an important, unsolved problem. The current method of choice predicts transcription factor (TF) binding sites using chromatin immunoprecipitation followed by sequencing (ChIP-seq), and then links the binding sites to putative target genes solely on the basis of the genomic distance between them. Evidence from chromatin conformation capture experiments shows that this approach is inadequate due to long-distance regulation via chromatin looping. We present CisMapper, which predicts the regulatory targets of a TF using the correlation between a histone mark at the TF's bound sites and the expression of each gene across a panel of tissues. Using both chromatin conformation capture and differential expression data, we show that CisMapper is more accurate at predicting the target genes of a TF than the distance-based approaches currently used, and is particularly advantageous for predicting the long-range regulatory interactions typical of tissue-specific gene expression. CisMapper also predicts which TF binding sites regulate a given gene more accurately than using genomic distance. Unlike distance-based methods, CisMapper can predict which transcription start site of a gene is regulated by a particular binding site of the TF. PMID:28204599
Della Gatta, Giusy; Palomero, Teresa; Perez-Garcia, Arianne; Ambesi-Impiombato, Alberto; Bansal, Mukesh; Carpenter, Zachary W; De Keersmaecker, Kim; Sole, Xavier; Xu, Luyao; Paietta, Elisabeth; Racevskis, Janis; Wiernik, Peter H; Rowe, Jacob M; Meijerink, Jules P; Califano, Andrea; Ferrando, Adolfo A
2012-02-26
The TLX1 and TLX3 transcription factor oncogenes have a key role in the pathogenesis of T cell acute lymphoblastic leukemia (T-ALL). Here we used reverse engineering of global transcriptional networks to decipher the oncogenic regulatory circuit controlled by TLX1 and TLX3. This systems biology analysis defined T cell leukemia homeobox 1 (TLX1) and TLX3 as master regulators of an oncogenic transcriptional circuit governing T-ALL. Notably, a network structure analysis of this hierarchical network identified RUNX1 as a key mediator of the T-ALL induced by TLX1 and TLX3 and predicted a tumor-suppressor role for RUNX1 in T cell transformation. Consistent with these results, we identified recurrent somatic loss-of-function mutations in RUNX1 in human T-ALL. Overall, these results place TLX1 and TLX3 at the top of an oncogenic transcriptional network controlling leukemia development, show the power of network analyses to identify key elements in the regulatory circuits governing human cancer and identify RUNX1 as a tumor-suppressor gene in T-ALL.
77 FR 8072 - Semiannual Regulatory Flexibility Agenda
Federal Register 2010, 2011, 2012, 2013, 2014
2012-02-13
... six months. ADDRESSES: Comments should be addressed to Jennifer J. Johnson, Secretary of the Board... Regulatory and Deregulatory Actions, which is coordinated by the Office of Management and Budget under... pledged; and certain other elements including a strategic analysis of the company's plans for maintaining...
Canver, Matthew C; Lessard, Samuel; Pinello, Luca; Wu, Yuxuan; Ilboudo, Yann; Stern, Emily N; Needleman, Austen J; Galactéros, Frédéric; Brugnara, Carlo; Kutlar, Abdullah; McKenzie, Colin; Reid, Marvin; Chen, Diane D; Das, Partha Pratim; A Cole, Mitchel; Zeng, Jing; Kurita, Ryo; Nakamura, Yukio; Yuan, Guo-Cheng; Lettre, Guillaume; Bauer, Daniel E; Orkin, Stuart H
2017-04-01
Cas9-mediated, high-throughput, saturating in situ mutagenesis permits fine-mapping of function across genomic segments. Disease- and trait-associated variants identified in genome-wide association studies largely cluster at regulatory loci. Here we demonstrate the use of multiple designer nucleases and variant-aware library design to interrogate trait-associated regulatory DNA at high resolution. We developed a computational tool for the creation of saturating-mutagenesis libraries with single or multiple nucleases with incorporation of variants. We applied this methodology to the HBS1L-MYB intergenic region, which is associated with red-blood-cell traits, including fetal hemoglobin levels. This approach identified putative regulatory elements that control MYB expression. Analysis of genomic copy number highlighted potential false-positive regions, thus emphasizing the importance of off-target analysis in the design of saturating-mutagenesis experiments. Together, these data establish a widely applicable high-throughput and high-resolution methodology to identify minimal functional sequences within large disease- and trait-associated regions.
Accurate SHAPE-directed RNA secondary structure modeling, including pseudoknots.
Hajdin, Christine E; Bellaousov, Stanislav; Huggins, Wayne; Leonard, Christopher W; Mathews, David H; Weeks, Kevin M
2013-04-02
A pseudoknot forms in an RNA when nucleotides in a loop pair with a region outside the helices that close the loop. Pseudoknots occur relatively rarely in RNA but are highly overrepresented in functionally critical motifs in large catalytic RNAs, in riboswitches, and in regulatory elements of viruses. Pseudoknots are usually excluded from RNA structure prediction algorithms. When included, these pairings are difficult to model accurately, especially in large RNAs, because allowing this structure dramatically increases the number of possible incorrect folds and because it is difficult to search the fold space for an optimal structure. We have developed a concise secondary structure modeling approach that combines SHAPE (selective 2'-hydroxyl acylation analyzed by primer extension) experimental chemical probing information and a simple, but robust, energy model for the entropic cost of single pseudoknot formation. Structures are predicted with iterative refinement, using a dynamic programming algorithm. This melded experimental and thermodynamic energy function predicted the secondary structures and the pseudoknots for a set of 21 challenging RNAs of known structure ranging in size from 34 to 530 nt. On average, 93% of known base pairs were predicted, and all pseudoknots in well-folded RNAs were identified.
Vazquez-Anderson, Jorge; Mihailovic, Mia K; Baldridge, Kevin C; Reyes, Kristofer G; Haning, Katie; Cho, Seung Hee; Amador, Paul; Powell, Warren B; Contreras, Lydia M
2017-05-19
Current approaches to design efficient antisense RNAs (asRNAs) rely primarily on a thermodynamic understanding of RNA-RNA interactions. However, these approaches depend on structure predictions and have limited accuracy, arguably due to overlooking important cellular environment factors. In this work, we develop a biophysical model to describe asRNA-RNA hybridization that incorporates in vivo factors using large-scale experimental hybridization data for three model RNAs: a group I intron, CsrB and a tRNA. A unique element of our model is the estimation of the availability of the target region to interact with a given asRNA using a differential entropic consideration of suboptimal structures. We showcase the utility of this model by evaluating its prediction capabilities in four additional RNAs: a group II intron, Spinach II, 2-MS2 binding domain and glgC 5΄ UTR. Additionally, we demonstrate the applicability of this approach to other bacterial species by predicting sRNA-mRNA binding regions in two newly discovered, though uncharacterized, regulatory RNAs. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Kwon, Andrew T.; Chou, Alice Yi; Arenillas, David J.; Wasserman, Wyeth W.
2011-01-01
We performed a genome-wide scan for muscle-specific cis-regulatory modules (CRMs) using three computational prediction programs. Based on the predictions, 339 candidate CRMs were tested in cell culture with NIH3T3 fibroblasts and C2C12 myoblasts for capacity to direct selective reporter gene expression to differentiated C2C12 myotubes. A subset of 19 CRMs validated as functional in the assay. The rate of predictive success reveals striking limitations of computational regulatory sequence analysis methods for CRM discovery. Motif-based methods performed no better than predictions based only on sequence conservation. Analysis of the properties of the functional sequences relative to inactive sequences identifies nucleotide sequence composition can be an important characteristic to incorporate in future methods for improved predictive specificity. Muscle-related TFBSs predicted within the functional sequences display greater sequence conservation than non-TFBS flanking regions. Comparison with recent MyoD and histone modification ChIP-Seq data supports the validity of the functional regions. PMID:22144875
Kimura-Yoshida, Chiharu; Yan, Kuo; Bormuth, Olga; Ding, Qiong; Nakanishi, Akiko; Sasaki, Takeshi; Hirakawa, Mika; Sumiyama, Kenta; Furuta, Yasuhide; Tarabykin, Victor; Matsuo, Isao; Okada, Norihiro
2016-01-01
Acquisition of cis-regulatory elements is a major driving force of evolution, and there are several examples of developmental enhancers derived from transposable elements (TEs). However, it remains unclear whether one enhancer element could have been produced via cooperation among multiple, yet distinct, TEs during evolution. Here we show that an evolutionarily conserved genomic region named AS3_9 comprises three TEs (AmnSINE1, X6b_DNA and MER117), inserted side-by-side, and functions as a distal enhancer for wnt5a expression during morphogenesis of the mammalian secondary palate. Functional analysis of each TE revealed step-by-step retroposition/transposition and co-option together with acquisition of a binding site for Msx1 for its full enhancer function during mammalian evolution. The present study provides a new perspective suggesting that a huge variety of TEs, in combination, could have accelerated the diversity of cis-regulatory elements involved in morphological evolution. PMID:27741242
Characterization of "cis"-regulatory elements ("c"RE) associated with mammary gland function
USDA-ARS?s Scientific Manuscript database
The Bos taurus genome assembly has propelled dairy science into a new era; still, most of the information encoded in the genome has not yet been decoded. The human Encyclopedia of DNA Elements (ENCODE) project has spearheaded the identification and annotation of functional genomic elements in the hu...
NASA Astrophysics Data System (ADS)
Omar, Aimi Farehah; Ismail, Ismanizan
2016-11-01
Sesquiterpene synthase (SS) catalyzes the formation of sesquiterpenes from farnesyl diphosphate (FDP) via carbocation intermediates. In this study, the promoter region of sesquiterpene synthase was isolated from Persicaria minor to identify possible cis-acting elements in the promoter. The full-length PmSS promoter of P. minor is 1824-bp sequences. The sequence was analyzed and several putative cis-acting regulatory elements were identified. Three cis-acting regulatory elements were selected for deletion analysis which are cis-acting element involved in wound responsiveness (WUN), cis - acting element involved in defense and stress responsiveness (TC) and cis-acting element involved in ABA responsiveness (ABRE). Series of deletions were conducted to assess the promoter activity producing three truncated fragments promoter; Prom 2 1606-bp, Prom 3 1144- bp, and Prom 4 921-bp. The full-length promoter and its deletion series were cloned into the pBGWFS7 vector which contain β-glucuronidase (GUS) gene and green fluorescent protein (GFP) as the reporter gene. All constructs were successfully transformed into Arabidopsis thaliana based on PCR of positive BASTA resistance plants.
Evolution of Hsp70 Gene Expression: A Role for Changes in AT-Richness within Promoters
Ma, Ronghui; Zhang, Bo; Kang, Le
2011-01-01
In disparate organisms adaptation to thermal stress has been linked to changes in the expression of genes encoding heat-shock proteins (Hsp). The underlying genetics, however, remain elusive. We show here that two AT-rich sequence elements in the promoter region of the hsp70 gene of the fly Liriomyza sativae that are absent in the congeneric species, Liriomyza huidobrensis, have marked cis-regulatory consequences. We studied the cis-regulatory consequences of these elements (called ATRS1 and ATRS2) by measuring the constitutive and heat-shock-induced luciferase luminescence that they drive in cells transfected with constructs carrying them modified, deleted, or intact, in the hsp70 promoter fused to the luciferase gene. The elements affected expression level markedly and in different ways: Deleting ATRS1 augmented both the constitutive and the heat-shock-induced luminescence, suggesting that this element represses transcription. Interestingly, replacing the element with random sequences of the same length and A+T content delivered the wild-type luminescence pattern, proving that the element's high A+T content is crucial for its effects. Deleting ATRS2 decreased luminescence dramatically and almost abolished heat-shock inducibility and so did replacing the element with random sequences matching the element's length and A+T content, suggesting that ATRS2's effects on transcription and heat-shock inducibility involve a common mechanism requiring at least in part the element's specific primary structure. Finally, constitutive and heat-shock luminescence were reduced strongly when two putative binding sites for the Zeste transcription factor identified within ATRS2 were altered through site-directed mutagenesis, and the heat-shock-induced luminescence increased when Zeste was over-expressed, indicating that Zeste participates in the effects mapped to ATRS2 at least in part. AT-rich sequences are common in promoters and our results suggest that they should play important roles in regulatory evolution since they can affect expression markedly and constrain promoter DNA in at least two different ways. PMID:21655251
De novo design of a synthetic riboswitch that regulates transcription termination
Wachsmuth, Manja; Findeiß, Sven; Weissheimer, Nadine; Stadler, Peter F.; Mörl, Mario
2013-01-01
Riboswitches are regulatory RNA elements typically located in the 5′-untranslated region of certain mRNAs and control gene expression at the level of transcription or translation. These elements consist of a sensor and an adjacent actuator domain. The sensor usually is an aptamer that specifically interacts with a ligand. The actuator contains an intrinsic terminator or a ribosomal binding site for transcriptional or translational regulation, respectively. Ligand binding leads to structural rearrangements of the riboswitch and to presentation or masking of these regulatory elements. Based on this modular organization, riboswitches are an ideal target for constructing synthetic regulatory systems for gene expression. Although riboswitches for translational control have been designed successfully, attempts to construct synthetic elements regulating transcription have failed so far. Here, we present an in silico pipeline for the rational design of synthetic riboswitches that regulate gene expression at the transcriptional level. Using the well-characterized theophylline aptamer as sensor, we designed the actuator part as RNA sequences that can fold into functional intrinsic terminator structures. In the biochemical characterization, several of the designed constructs show ligand-dependent control of gene expression in Escherichia coli, demonstrating that it is possible to engineer riboswitches not only for translational but also for transcriptional regulation. PMID:23275562
A novel E2 box-GATA element modulates Cdc6 transcription during human cells polyploidization
Vilaboa, Nuria; Bermejo, Rodrigo; Martinez, Pilar; Bornstein, Rafael; Calés, Carmela
2004-01-01
Cdc6 is a key regulator of the strict alternation of S and M phases during the mitotic cell cycle. In mammalian and plant cells that physiologically become polyploid, cdc6 is transcriptionally and post-translationally regulated. We have recently reported that Cdc6 levels are maintained in megakaryoblastic HEL cells, but severely downregulated by ectopic expression of transcriptional repressor Drosophila melanogaster escargot. Here, we show that cdc6 promoter activity is upregulated during megakaryocytic differentiation of HEL endoreplicating cells, and that Escargot interferes with such activation. Transactivation experiments showed that a 1.7 kb region located at 2800 upstream cdc6 transcription initiation site behaved as a potent enhancer in endoreplicating cells only. This activity was mainly dependent on a novel cis-regulatory element composed by an E2 box overlapping a GATA motif. Ectopic Escargot could bind this regulatory element in vitro and endogenous GATA-1 and E2A formed specific complexes in megakaryoblastic cells as well as in primary megakaryocytes. Chromatin Immunoprecipitation analysis revealed that both transcription factors were occupying the E2 box/GATA site in vivo. Altogether, these data suggest that cdc6 expression could be actively maintained during megakaryocytic differentiation through transcriptional mechanisms involving specific cis- and trans-regulatory elements. PMID:15590906
Weischenfeldt, Joachim; Dubash, Taronish; Drainas, Alexandros P.; Mardin, Balca R.; Chen, Yuanyuan; Stütz, Adrian M.; Waszak, Sebastian M.; Bosco, Graziella; Halvorsen, Ann Rita; Raeder, Benjamin; Efthymiopoulos, Theocharis; Erkek, Serap; Siegl, Christine; Brenner, Hermann; Brustugun, Odd Terje; Dieter, Sebastian M.; Northcott, Paul A.; Petersen, Iver; Pfister, Stefan M.; Schneider, Martin; Solberg, Steinar K.; Thunissen, Erik; Weichert, Wilko; Zichner, Thomas; Thomas, Roman; Peifer, Martin; Helland, Aslaug; Ball, Claudia R.; Jechlinger, Martin; Sotillo, Rocio; Glimm, Hanno; Korbel, Jan O.
2018-01-01
Extensive prior research has focused on somatic copy-number alterations (SCNAs) affecting cancer genes, yet the extent to which recurrent SCNAs exert their influence through rearranging cis-regulatory elements remains unclear. Here, we present a framework for inferring cancer-related gene overexpression resulting from cis-regulatory element reorganization (e.g., enhancer hijacking), by integrating SCNAs, gene expression data, and information on chromatin interaction domains. Analysis of 7,416 cancer genomes uncovered several pan-cancer candidate genes, including IRS4, SMARCA1 and TERT. We demonstrate that IRS4 overexpression in lung cancer associates with recurrent deletions in cis, and present evidence supporting a tumor-promoting role. We additionally pursued cancer type-specific analyses, uncovering IGF2 as a target for enhancer hijacking in colorectal cancer. IGF2-containing tandem duplications result in the de novo formation of a 3D contact domain comprising IGF2 and a lineage-specific super-enhancer, which mediates high-level gene activation. Our framework enables systematic inference of cis-regulatory element rearrangements mediating dysregulation in cancer. PMID:27869826
Gene transfer strategies in animal transgenesis.
Montoliu, Lluís
2002-01-01
Position effects in animal transgenesis have prevented the reproducible success and limited the initial expectations of this technique in many biotechnological projects. Historically, several strategies have been devised to overcome such position effects, including the progressive addition of regulatory elements belonging to the same or to a heterologous expression domain. An expression domain is thought to contain all regulatory elements that are needed to specifically control the expression of a given gene in time and space. The lack of profound knowledge on the chromatin structure of expression domains of biotechnological interest, such as mammary gland-specific genes, explains why most standard expression vectors have failed to drive high-level, position-independent, and copy-number-dependent expression of transgenes in a reproducible manner. In contrast, the application of artificial chromosome-type constructs to animal transgenesis usually ensures optimal expression levels. YACs, BACs, and PACs have become crucial tools in animal transgenesis, allowing the inclusion of distant key regulatory sequences, previously unknown, that are characteristic for each expression domain. These elements contribute to insulating the artificial chromosome-type constructs from chromosomal position effects and are fundamental in order to guarantee the correct expression of transgenes.
Neuman, Sarah D.; Bashirullah, Arash; Kumar, Justin P.
2016-01-01
The eyes absent (eya) gene of the fruit fly, Drosophila melanogaster, is a member of an evolutionarily conserved gene regulatory network that controls eye formation in all seeing animals. The loss of eya leads to the complete elimination of the compound eye while forced expression of eya in non-retinal tissues is sufficient to induce ectopic eye formation. Within the developing retina eya is expressed in a dynamic pattern and is involved in tissue specification/determination, cell proliferation, apoptosis, and cell fate choice. In this report we explore the mechanisms by which eya expression is spatially and temporally governed in the developing eye. We demonstrate that multiple cis-regulatory elements function cooperatively to control eya transcription and that spacing between a pair of enhancer elements is important for maintaining correct gene expression. Lastly, we show that the loss of eya expression in sine oculis (so) mutants is the result of massive cell death and a progressive homeotic transformation of retinal progenitor cells into head epidermis. PMID:27930646
Design and testing of regulatory cassettes for optimal activity in skeletal and cardiac muscles.
Himeda, Charis L; Chen, Xiaolan; Hauschka, Stephen D
2011-01-01
Gene therapy for muscular dystrophies requires efficient gene delivery to the striated musculature and specific, high-level expression of the therapeutic gene in a physiologically diverse array of muscles. This can be achieved by the use of recombinant adeno-associated virus vectors in conjunction with muscle-specific regulatory cassettes. We have constructed several generations of regulatory cassettes based on the enhancer and promoter of the muscle creatine kinase gene, some of which include heterologous enhancers and individual elements from other muscle genes. Since the relative importance of many control elements varies among different anatomical muscles, we are aiming to tailor these cassettes for high-level expression in cardiac muscle, and in fast and slow skeletal muscles. With the achievement of efficient intravascular gene delivery to isolated limbs, selected muscle groups, and heart in large animal models, the design of cassettes optimized for activity in different muscle types is now a practical goal. In this protocol, we outline the key steps involved in the design of regulatory cassettes for optimal activity in skeletal and cardiac muscle, and testing in mature muscle fiber cultures. The basic principles described here can also be applied to engineering tissue-specific regulatory cassettes for other cell types.
Efforts are underway to transform regulatory toxicology and chemical safety assessment from a largely empirical science based on direct observation of apical toxicity outcomes in whole organism toxicity tests to a predictive one in which outcomes and risk are inferred from accumu...
Negi, Pooja; Rai, Archana N; Suprasanna, Penna
2016-01-01
The recognition of a positive correlation between organism genome size with its transposable element (TE) content, represents a key discovery of the field of genome biology. Considerable evidence accumulated since then suggests the involvement of TEs in genome structure, evolution and function. The global genome reorganization brought about by transposon activity might play an adaptive/regulatory role in the host response to environmental challenges, reminiscent of McClintock's original 'Controlling Element' hypothesis. This regulatory aspect of TEs is also garnering support in light of the recent evidences, which project TEs as "distributed genomic control modules." According to this view, TEs are capable of actively reprogramming host genes circuits and ultimately fine-tuning the host response to specific environmental stimuli. Moreover, the stress-induced changes in epigenetic status of TE activity may allow TEs to propagate their stress responsive elements to host genes; the resulting genome fluidity can permit phenotypic plasticity and adaptation to stress. Given their predominating presence in the plant genomes, nested organization in the genic regions and potential regulatory role in stress response, TEs hold unexplored potential for crop improvement programs. This review intends to present the current information about the roles played by TEs in plant genome organization, evolution, and function and highlight the regulatory mechanisms in plant stress responses. We will also briefly discuss the connection between TE activity, host epigenetic response and phenotypic plasticity as a critical link for traversing the translational bridge from a purely basic study of TEs, to the applied field of stress adaptation and crop improvement.
Diverse patterns of genomic targeting by transcriptional regulators in Drosophila melanogaster.
Slattery, Matthew; Ma, Lijia; Spokony, Rebecca F; Arthur, Robert K; Kheradpour, Pouya; Kundaje, Anshul; Nègre, Nicolas; Crofts, Alex; Ptashkin, Ryan; Zieba, Jennifer; Ostapenko, Alexander; Suchy, Sarah; Victorsen, Alec; Jameel, Nader; Grundstad, A Jason; Gao, Wenxuan; Moran, Jennifer R; Rehm, E Jay; Grossman, Robert L; Kellis, Manolis; White, Kevin P
2014-07-01
Annotation of regulatory elements and identification of the transcription-related factors (TRFs) targeting these elements are key steps in understanding how cells interpret their genetic blueprint and their environment during development, and how that process goes awry in the case of disease. One goal of the modENCODE (model organism ENCyclopedia of DNA Elements) Project is to survey a diverse sampling of TRFs, both DNA-binding and non-DNA-binding factors, to provide a framework for the subsequent study of the mechanisms by which transcriptional regulators target the genome. Here we provide an updated map of the Drosophila melanogaster regulatory genome based on the location of 84 TRFs at various stages of development. This regulatory map reveals a variety of genomic targeting patterns, including factors with strong preferences toward proximal promoter binding, factors that target intergenic and intronic DNA, and factors with distinct chromatin state preferences. The data also highlight the stringency of the Polycomb regulatory network, and show association of the Trithorax-like (Trl) protein with hotspots of DNA binding throughout development. Furthermore, the data identify more than 5800 instances in which TRFs target DNA regions with demonstrated enhancer activity. Regions of high TRF co-occupancy are more likely to be associated with open enhancers used across cell types, while lower TRF occupancy regions are associated with complex enhancers that are also regulated at the epigenetic level. Together these data serve as a resource for the research community in the continued effort to dissect transcriptional regulatory mechanisms directing Drosophila development. © 2014 Slattery et al.; Published by Cold Spring Harbor Laboratory Press.
Equity Access Plans: A Regulatory and Educational State Response Model.
ERIC Educational Resources Information Center
DeLisle, James
1984-01-01
Introduces the basic notion of equity access plans as property-based solutions to the cash flow needs of elderly homeowners and then proposes a normative response model that states can adopt to help manage the risk exposures. The recommended model incorporates regulatory, information dissemination, and educational elements. (BH)
Federal Register 2010, 2011, 2012, 2013, 2014
2013-06-11
... circumstances where dissemination may mislead or confuse investors and other market participants. In addition... data elements are updated.\\12\\ In disseminated data, market participants will cross-reference the RDID... are rounded or truncated. The dissemination protocol to provide an RDID that market participants will...
AP1 Keeps Chromatin Poised for Action | Center for Cancer Research
The human genome harbors gene-encoding DNA, the blueprint for building proteins that regulate cellular function. Embedded across the genome, in non-coding regions, are DNA elements to which regulatory factors bind. The interaction of regulatory factors with DNA at these sites modifies gene expression to modulate cell activity. In cells, DNA exists in a complex with proteins called chromatin that compacts the DNA in the nucleus, strongly restricting access to DNA sequences. As a result, regulatory factors only interact with a small subset of their potential binding elements in a given cell to regulate genes. How factors recognize and select sites in chromatin across the genome is not well understood -- but several discoveries in CCR’s Laboratory of Receptor Biology and Gene Expression (LRBGE) have shed light on the mechanisms that direct factors to DNA.
Lobo, Daniel; Morokuma, Junji; Levin, Michael
2016-09-01
Automated computational methods can infer dynamic regulatory network models directly from temporal and spatial experimental data, such as genetic perturbations and their resultant morphologies. Recently, a computational method was able to reverse-engineer the first mechanistic model of planarian regeneration that can recapitulate the main anterior-posterior patterning experiments published in the literature. Validating this comprehensive regulatory model via novel experiments that had not yet been performed would add in our understanding of the remarkable regeneration capacity of planarian worms and demonstrate the power of this automated methodology. Using the Michigan Molecular Interactions and STRING databases and the MoCha software tool, we characterized as hnf4 an unknown regulatory gene predicted to exist by the reverse-engineered dynamic model of planarian regeneration. Then, we used the dynamic model to predict the morphological outcomes under different single and multiple knock-downs (RNA interference) of hnf4 and its predicted gene pathway interactors β-catenin and hh Interestingly, the model predicted that RNAi of hnf4 would rescue the abnormal regenerated phenotype (tailless) of RNAi of hh in amputated trunk fragments. Finally, we validated these predictions in vivo by performing the same surgical and genetic experiments with planarian worms, obtaining the same phenotypic outcomes predicted by the reverse-engineered model. These results suggest that hnf4 is a regulatory gene in planarian regeneration, validate the computational predictions of the reverse-engineered dynamic model, and demonstrate the automated methodology for the discovery of novel genes, pathways and experimental phenotypes. michael.levin@tufts.edu. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Detecting Disease Specific Pathway Substructures through an Integrated Systems Biology Approach
Alaimo, Salvatore; Marceca, Gioacchino Paolo; Ferro, Alfredo; Pulvirenti, Alfredo
2017-01-01
In the era of network medicine, pathway analysis methods play a central role in the prediction of phenotype from high throughput experiments. In this paper, we present a network-based systems biology approach capable of extracting disease-perturbed subpathways within pathway networks in connection with expression data taken from The Cancer Genome Atlas (TCGA). Our system extends pathways with missing regulatory elements, such as microRNAs, and their interactions with genes. The framework enables the extraction, visualization, and analysis of statistically significant disease-specific subpathways through an easy to use web interface. Our analysis shows that the methodology is able to fill the gap in current techniques, allowing a more comprehensive analysis of the phenomena underlying disease states. PMID:29657291
Short interspersed DNA elements and miRNAs: a novel hidden gene regulation layer in zebrafish?
Scarpato, Margherita; Angelini, Claudia; Cocca, Ennio; Pallotta, Maria M; Morescalchi, Maria A; Capriglione, Teresa
2015-09-01
In this study, we investigated by in silico analysis the possible correlation between microRNAs (miRNAs) and Anamnia V-SINEs (a superfamily of short interspersed nuclear elements), which belong to those retroposon families that have been preserved in vertebrate genomes for millions of years and are actively transcribed because they are embedded in the 3' untranslated region (UTR) of several genes. We report the results of the analysis of the genomic distribution of these mobile elements in zebrafish (Danio rerio) and discuss their involvement in generating miRNA gene loci. The computational study showed that the genes predicted to bear V-SINEs can be targeted by miRNAs with a very high hybridization E-value. Gene ontology analysis indicates that these genes are mainly involved in metabolic, membrane, and cytoplasmic signaling pathways. Nearly all the miRNAs that were predicted to target the V-SINEs of these genes, i.e., miR-338, miR-9, miR-181, miR-724, miR-735, and miR-204, have been validated in similar regulatory roles in mammals. The large number of genes bearing a V-SINE involved in metabolic and cellular processes suggests that V-SINEs may play a role in modulating cell responses to different stimuli and in preserving the metabolic balance during cell proliferation and differentiation. Although they need experimental validation, these preliminary results suggest that in the genome of D. rerio, as in other TE families in vertebrates, the preservation of V-SINE retroposons may also have been favored by their putative role in gene network modulation.
Gene expression systems in corynebacteria.
Srivastava, Preeti; Deb, J K
2005-04-01
Corynebacterium belongs to a group of gram-positive bacteria having moderate to high G+C content, the other members being Mycobacterium, Nocardia, and Rhodococcus. Considerable information is now available on the plasmids, gene regulatory elements, and gene expression in corynebacteria, especially in soil corynebacteria such as Corynebacterium glutamicum. These bacteria are non-pathogenic and, unlike Bacillus and Streptomyces, are low in proteolytic activity and thus have the potential of becoming attractive systems for expression of heterologous proteins. This review discusses recent advances in our understanding of the organization of various regulatory elements, such as promoters, transcription terminators, and development of vectors for cloning and gene expression.
Kosteli, Maria-Christina; Cumming, Jennifer; Williams, Sarah E
2018-01-01
Limited research has investigated exercise imagery use in middle-aged and older adults and its relationship with affective and behavioral correlates. The study examined the association between self-regulatory imagery and physical activity (PA) through key social cognitive variables. Middle-aged and older adults (N = 299; M age = 59.73 years, SD = 7.73, range = 50 to 80) completed self-report measures assessing self-regulatory imagery use, self-efficacy, outcome expectations, perceived barriers, self-regulatory behavior, enjoyment, and PA levels. Path analysis supported a model (χ² [14] = 21.76, p = .08, CFI = .99, TLI = .97, SRMR = .03, RMSEA = .04) whereby self-regulatory imagery positively predicted self-efficacy, outcome expectations, and self-regulatory behaviors. Furthermore, self-regulatory imagery indirectly predicted barriers, outcome expectations, self-regulation, enjoyment, and PA. This research highlights self-regulatory imagery as an effective strategy in modifying exercise-related cognitions and behaviors. Incorporating social cognitive constructs into the design of imagery interventions may increase PA engagement.
Kwan, C T; Tsang, S L; Krumlauf, R; Sham, M H
2001-04-01
The expression pattern of the mouse Hoxb3 gene is exceptionally complex and dynamic compared with that of other members of the Hoxb cluster. There are multiple types of transcripts for Hoxb3 gene, and the anterior boundaries of its expression vary at different stages of development. Two enhancers flanking Hoxb3 on the 3' and 5' sides regulate Hoxb2 and Hoxb4, respectively, and these control regions define the two ends of a 28-kb interval in and around the Hoxb3 locus. To assay the regulatory potential of DNA fragments in this interval we have used transgenic analysis with a lacZ reporter gene to locate cis-elements for directing the dynamic patterns of Hoxb3 expression. Our detailed analysis has identified four new and widely spaced cis-acting regulatory regions that can together account for major aspects of the Hoxb3 expression pattern. Elements Ib, IIIa, and IVb control gene expression in neural and mesodermal tissues; element Va controls mesoderm-specific gene expression. The most anterior neural expression domain of Hoxb3 is controlled by an r5 enhancer (element IVa); element IIIa directs reporter expression in the anterior spinal cord and hindbrain up to r6, and the region A enhancer (in element I) mediates posterior neural expression. Hence, the regulation of segmental expression of Hoxb3 in the hindbrain is different from that of Hoxa3, as two separate enhancer elements contribute to expression in r5 and r6. The mesoderm-specific element (Va) directs reporter expression to prevertebra C1 at 12.5 dpc, which is the anterior limit of paraxial mesoderm expression for Hoxb3. When tested in combinations, these cis-elements appear to work as modules in an additive manner to recapitulate the major endogenous expression patterns of Hoxb3 during embryogenesis. Together our study shows that multiple control elements direct reporter gene expression in diverse tissue-, temporal-, and spatially restricted subset of the endogenous Hoxb3 expression domains and work in concert to control the neural and mesodermal patterns of expression. Copyright 2001 Academic Press.
Federal Register 2010, 2011, 2012, 2013, 2014
2013-06-03
... Fuel Elements for Use in Research and Test Reactors AGENCY: Nuclear Regulatory Commission. ACTION... Research and Test Reactors.'' This guide describes a method that the staff of the NRC considers acceptable... assurance program for verifying the quality of plate-type uranium-aluminum fuel elements used in research...
Federal Register 2010, 2011, 2012, 2013, 2014
2012-04-09
... allocates market data fees among Subscribers based on the data elements consumed, including top-of-book,\\3... apply to any Subscriber that accesses any data elements included in the TotalView entitlement, including the TotalView, OpenView, or Level 2 data elements. Professional Subscribers that access Depth-of-Book...
41 CFR 102-2.140 - What elements of plain language appear in the FMR?
Code of Federal Regulations, 2010 CFR
2010-07-01
... MANAGEMENT REGULATION SYSTEM Plain Language Regulatory Style § 102-2.140 What elements of plain language... 41 Public Contracts and Property Management 3 2010-07-01 2010-07-01 false What elements of plain language appear in the FMR? 102-2.140 Section 102-2.140 Public Contracts and Property Management Federal...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kwon, Deug-Nam; Park, Mi-Ryung; Park, Jong-Yi
Highlights: {yields} The sequences of -604 to -84 bp of the pUPII promoter contained the region of a putative negative cis-regulatory element. {yields} The core promoter was located in the 5F-1. {yields} Transcription factor HNF4 can directly bind in the pUPII core promoter region, which plays a critical role in controlling promoter activity. {yields} These features of the pUPII promoter are fundamental to development of a target-specific vector. -- Abstract: Uroplakin II (UPII) is a one of the integral membrane proteins synthesized as a major differentiation product of mammalian urothelium. UPII gene expression is bladder specific and differentiation dependent, butmore » little is known about its transcription response elements and molecular mechanism. To identify the cis-regulatory elements in the pig UPII (pUPII) gene promoter region, we constructed pUPII 5' upstream region deletion mutants and demonstrated that each of the deletion mutants participates in controlling the expression of the pUPII gene in human bladder carcinoma RT4 cells. We also identified a new core promoter region and putative negative cis-regulatory element within a minimal promoter region. In addition, we showed that hepatocyte nuclear factor 4 (HNF4) can directly bind in the pUPII core promoter (5F-1) region, which plays a critical role in controlling promoter activity. Transient cotransfection experiments showed that HNF4 positively regulates pUPII gene promoter activity. Thus, the binding element and its binding protein, HNF4 transcription factor, may be involved in the mechanism that specifically regulates pUPII gene transcription.« less
Regulation of expression of transgenes in developing fish.
Moav, B; Liu, Z; Caldovic, L D; Gross, M L; Faras, A J; Hackett, P B
1993-05-01
The transcriptional regulatory elements of the beta-actin gene of carp (Cyprinus carpio) have been examined in zebrafish and goldfish harbouring transgenes. The high sequence conservation of the putative regulatory elements in the beta-actin genes of animals suggested that their function would be conserved, so that transgenic constructs with the same transcriptional control elements would promote similar levels of transgene expression in different species of transgenic animals. To test this assumption, we analysed the temporal expression of a reporter gene under the control of transcriptional control sequences from the carp beta-actin gene in zebrafish (Brachydanio rerio) and goldfish (Carrasius auratus). Our results indicated that, contrary to expectations, combinations of different transcriptional control elements affected the level, duration, and onset of gene expression differently in developing zebrafish and goldfish. The major differences in expression of beta-actin/CAT (chloramphenicol acetyltransferase) constructs in zebrafish and goldfish were: (1) overall expression was almost 100-fold higher in goldfish than in zebrafish embryos, (2) the first intron had an enhancing effect on gene expression in zebrafish but not in goldfish, and (3) the serum-responsive/CArG-containing regulatory element in the proximal promoter was not always required for maximal CAT activity in goldfish, but was required in zebrafish. These results suggest that in the zebrafish, but not in the goldfish, there may be interactions between motifs in the proximal promoter and the first intron which appear to be required for maximal enhancement of transcription.
Kikuta, Hiroshi; Laplante, Mary; Navratilova, Pavla; Komisarczuk, Anna Z.; Engström, Pär G.; Fredman, David; Akalin, Altuna; Caccamo, Mario; Sealy, Ian; Howe, Kerstin; Ghislain, Julien; Pezeron, Guillaume; Mourrain, Philippe; Ellingsen, Staale; Oates, Andrew C.; Thisse, Christine; Thisse, Bernard; Foucher, Isabelle; Adolf, Birgit; Geling, Andrea; Lenhard, Boris; Becker, Thomas S.
2007-01-01
We report evidence for a mechanism for the maintenance of long-range conserved synteny across vertebrate genomes. We found the largest mammal-teleost conserved chromosomal segments to be spanned by highly conserved noncoding elements (HCNEs), their developmental regulatory target genes, and phylogenetically and functionally unrelated “bystander” genes. Bystander genes are not specifically under the control of the regulatory elements that drive the target genes and are expressed in patterns that are different from those of the target genes. Reporter insertions distal to zebrafish developmental regulatory genes pax6.1/2, rx3, id1, and fgf8 and miRNA genes mirn9-1 and mirn9-5 recapitulate the expression patterns of these genes even if located inside or beyond bystander genes, suggesting that the regulatory domain of a developmental regulatory gene can extend into and beyond adjacent transcriptional units. We termed these chromosomal segments genomic regulatory blocks (GRBs). After whole genome duplication in teleosts, GRBs, including HCNEs and target genes, were often maintained in both copies, while bystander genes were typically lost from one GRB, strongly suggesting that evolutionary pressure acts to keep the single-copy GRBs of higher vertebrates intact. We show that loss of bystander genes and other mutational events suffered by duplicated GRBs in teleost genomes permits target gene identification and HCNE/target gene assignment. These findings explain the absence of evolutionary breakpoints from large vertebrate chromosomal segments and will aid in the recognition of position effect mutations within human GRBs. PMID:17387144
NASA Astrophysics Data System (ADS)
Basyuni, M.; Wati, R.; Sulistiyono, N.; Sumardi; Oku, H.; Baba, S.; Sagami, H.
2018-03-01
Molecular cloning of Kandelia candel KcMS gene has previously been cloned and encoded a multifunctional triterpene synthase. In this study, the KcMS gene promoter was cloned through Genome walking, sequenced, and analyzed. A 1,358 bp genomic DNA fragment of KcMS promoter was obtained. PLACE and PlantCARE analysis of the KcMS promoter revealed that there was some regulatory elements in response to environmental signals and involved in the regulation of gene expression. Results showed that four kinds of elements are regulated by hormone binding, namely 2 MeJA-responsiveness elements (CGTCA-motif and TGACG-motif), the ABRE (TACGTG) involved in abscisic acid responsiveness, gibberellin-related GARE-motif (AAACAGA), and the TGA-element (AACGAC) as an auxin-responsive element. Several elements in the KcMS have been shown in other plants to be responsive to abiotic stress. These motifs were MBS (CAACTG), TC-rich repeats, and eight light responsive elements. The KcMS promoter was also involved in the activation of defense genes in plants such as HSE (AAAAAATTC) and four circadian control elements (CAANNNNATC). The presence of multipotential regulatory motifs suggested that KcMS may be involved in regulation of plant tolerance to several types of stresses.
Molecular mechanisms of system responses to novel stimuli are predictable from public data
Danziger, Samuel A.; Ratushny, Alexander V.; Smith, Jennifer J.; Saleem, Ramsey A.; Wan, Yakun; Arens, Christina E.; Armstrong, Abraham M.; Sitko, Katherine; Chen, Wei-Ming; Chiang, Jung-Hsien; Reiss, David J.; Baliga, Nitin S.; Aitchison, John D.
2014-01-01
Systems scale models provide the foundation for an effective iterative cycle between hypothesis generation, experiment and model refinement. Such models also enable predictions facilitating the understanding of biological complexity and the control of biological systems. Here, we demonstrate the reconstruction of a globally predictive gene regulatory model from public data: a model that can drive rational experiment design and reveal new regulatory mechanisms underlying responses to novel environments. Specifically, using ∼1500 publically available genome-wide transcriptome data sets from Saccharomyces cerevisiae, we have reconstructed an environment and gene regulatory influence network that accurately predicts regulatory mechanisms and gene expression changes on exposure of cells to completely novel environments. Focusing on transcriptional networks that induce peroxisomes biogenesis, the model-guided experiments allow us to expand a core regulatory network to include novel transcriptional influences and linkage across signaling and transcription. Thus, the approach and model provides a multi-scalar picture of gene dynamics and are powerful resources for exploiting extant data to rationally guide experimentation. The techniques outlined here are generally applicable to any biological system, which is especially important when experimental systems are challenging and samples are difficult and expensive to obtain—a common problem in laboratory animal and human studies. PMID:24185701
From Binding-Induced Dynamic Effects in SH3 Structures to Evolutionary Conserved Sectors.
Zafra Ruano, Ana; Cilia, Elisa; Couceiro, José R; Ruiz Sanz, Javier; Schymkowitz, Joost; Rousseau, Frederic; Luque, Irene; Lenaerts, Tom
2016-05-01
Src Homology 3 domains are ubiquitous small interaction modules known to act as docking sites and regulatory elements in a wide range of proteins. Prior experimental NMR work on the SH3 domain of Src showed that ligand binding induces long-range dynamic changes consistent with an induced fit mechanism. The identification of the residues that participate in this mechanism produces a chart that allows for the exploration of the regulatory role of such domains in the activity of the encompassing protein. Here we show that a computational approach focusing on the changes in side chain dynamics through ligand binding identifies equivalent long-range effects in the Src SH3 domain. Mutation of a subset of the predicted residues elicits long-range effects on the binding energetics, emphasizing the relevance of these positions in the definition of intramolecular cooperative networks of signal transduction in this domain. We find further support for this mechanism through the analysis of seven other publically available SH3 domain structures of which the sequences represent diverse SH3 classes. By comparing the eight predictions, we find that, in addition to a dynamic pathway that is relatively conserved throughout all SH3 domains, there are dynamic aspects specific to each domain and homologous subgroups. Our work shows for the first time from a structural perspective, which transduction mechanisms are common between a subset of closely related and distal SH3 domains, while at the same time highlighting the differences in signal transduction that make each family member unique. These results resolve the missing link between structural predictions of dynamic changes and the domain sectors recently identified for SH3 domains through sequence analysis.
From Binding-Induced Dynamic Effects in SH3 Structures to Evolutionary Conserved Sectors
Ruiz Sanz, Javier; Schymkowitz, Joost; Rousseau, Frederic
2016-01-01
Src Homology 3 domains are ubiquitous small interaction modules known to act as docking sites and regulatory elements in a wide range of proteins. Prior experimental NMR work on the SH3 domain of Src showed that ligand binding induces long-range dynamic changes consistent with an induced fit mechanism. The identification of the residues that participate in this mechanism produces a chart that allows for the exploration of the regulatory role of such domains in the activity of the encompassing protein. Here we show that a computational approach focusing on the changes in side chain dynamics through ligand binding identifies equivalent long-range effects in the Src SH3 domain. Mutation of a subset of the predicted residues elicits long-range effects on the binding energetics, emphasizing the relevance of these positions in the definition of intramolecular cooperative networks of signal transduction in this domain. We find further support for this mechanism through the analysis of seven other publically available SH3 domain structures of which the sequences represent diverse SH3 classes. By comparing the eight predictions, we find that, in addition to a dynamic pathway that is relatively conserved throughout all SH3 domains, there are dynamic aspects specific to each domain and homologous subgroups. Our work shows for the first time from a structural perspective, which transduction mechanisms are common between a subset of closely related and distal SH3 domains, while at the same time highlighting the differences in signal transduction that make each family member unique. These results resolve the missing link between structural predictions of dynamic changes and the domain sectors recently identified for SH3 domains through sequence analysis. PMID:27213566
Huang, Kezhen; Wang, Yue-Hao; Brown, Alex; Sun, Gongqin
2009-01-01
Csk and Src protein tyrosine kinases are structurally homologous, but use opposite regulatory strategies. The isolated catalytic domain of Csk is intrinsically inactive and is activated by interactions with the regulatory SH3 and SH2 domains, while the isolated catalytic domain of Src is intrinsically active and is suppressed by interactions with the regulatory SH3 and SH2 domains. The structural basis for why one isolated catalytic domain is intrinsically active while the other is inactive is not clear. In this current study, we identify the structural elements in the N-terminal lobe of the catalytic domain that render the Src catalytic domain active. These structural elements include the α-helix C region, a β-turn between the β-4 and β-5 strands, and an Arg residue at the beginning of the catalytic domain. These three motifs interact with each other to activate the Src catalytic domain, but the equivalent motifs in Csk directly interact with the regulatory domains that are important for Csk activation. The Src motifs can be grafted to the Csk catalytic domain to obtain an active Csk catalytic domain. These results, together with available Src and Csk tertiary structures, reveal an important structural switch that determines the kinase activity of a catalytic domain and dictates the regulatory strategy of a kinase. PMID:19244618
Regulatory T cells in the control of host-microorganism interactions (*).
Belkaid, Yasmine; Tarbell, Kristin
2009-01-01
Each microenvironment requires a specific set of regulatory elements that are finely and constantly tuned to maintain local homeostasis. Various populations of regulatory T cells contribute to the maintenance of this equilibrium and establishment of controlled immune responses. In particular, regulatory T cells limit the magnitude of effector responses, which may result in failure to adequately control infection. However, regulatory T cells also help limit collateral tissue damage caused by vigorous antimicrobial immune responses against pathogenic microbes as well as commensals. In this review, we describe various situations in which the balance between regulatory T cells and effector immune functions influence the outcome of host-microorganism coexistence and discuss current hypotheses and points of polemic associated with the origin, target, and antigen specificity of both endogenous and induced regulatory T cells during these interactions.
Fang, Xin; Sastry, Anand; Mih, Nathan; Kim, Donghyuk; Tan, Justin; Lloyd, Colton J.; Gao, Ye; Yang, Laurence; Palsson, Bernhard O.
2017-01-01
Transcriptional regulatory networks (TRNs) have been studied intensely for >25 y. Yet, even for the Escherichia coli TRN—probably the best characterized TRN—several questions remain. Here, we address three questions: (i) How complete is our knowledge of the E. coli TRN; (ii) how well can we predict gene expression using this TRN; and (iii) how robust is our understanding of the TRN? First, we reconstructed a high-confidence TRN (hiTRN) consisting of 147 transcription factors (TFs) regulating 1,538 transcription units (TUs) encoding 1,764 genes. The 3,797 high-confidence regulatory interactions were collected from published, validated chromatin immunoprecipitation (ChIP) data and RegulonDB. For 21 different TF knockouts, up to 63% of the differentially expressed genes in the hiTRN were traced to the knocked-out TF through regulatory cascades. Second, we trained supervised machine learning algorithms to predict the expression of 1,364 TUs given TF activities using 441 samples. The algorithms accurately predicted condition-specific expression for 86% (1,174 of 1,364) of the TUs, while 193 TUs (14%) were predicted better than random TRNs. Third, we identified 10 regulatory modules whose definitions were robust against changes to the TRN or expression compendium. Using surrogate variable analysis, we also identified three unmodeled factors that systematically influenced gene expression. Our computational workflow comprehensively characterizes the predictive capabilities and systems-level functions of an organism’s TRN from disparate data types. PMID:28874552
Brown, Kerry K.; Reiss, Jacob A.; Crow, Kate; Ferguson, Heather L.; Kelly, Chantal; Fritzsch, Bernd; Morton, Cynthia C.
2010-01-01
Precisely regulated temporal and spatial patterns of gene expression are essential for proper human development. Cis-acting regulatory elements, some located at large distances from their corresponding genes, play a critical role in transcriptional control of key developmental genes and disruption of these regulatory elements can lead to disease. We report a three generation family with five affected members, all of whom have hearing loss, craniofacial defects, and a paracentric inversion of the long arm of chromosome 7, inv(7)(q21.3q35). High resolution mapping of the inversion showed that the 7q21.3 breakpoint is located 65 and 80 kb centromeric of DLX6 and DLX5, respectively. Further analysis revealed a 5115 bp deletion at the 7q21.3 breakpoint. While the breakpoint does not disrupt either DLX5 or DLX6, the syndrome present in the family is similar to that observed in Dlx5 knockout mice and includes a subset of the features observed in individuals with DLX5 and DLX6 deletions, implicating dysregulation of DLX5 and DLX6 in the family’s phenotype. Bioinformatic analysis indicates that the 5115 bp deletion at the 7q21.3 breakpoint could contain regulatory elements necessary for DLX5 and DLX6 expression. Using a transgenic mouse reporter assay, we show that the deleted sequence can drive expression in the ear and developing bones of E12.5 embryos. Consequently, the observed familial syndrome is likely caused by dysregulation of DLX5 and/or DLX6 in specific tissues due to deletion of an enhancer and possibly separation from other regulatory elements by the chromosomal inversion. PMID:19707792
75 FR 62893 - Draft Regulatory Guide: Issuance, Availability
Federal Register 2010, 2011, 2012, 2013, 2014
2010-10-13
... for using portland cement grout to protect prestressing steel from corrosion. The prestressing tendon system of a prestressed concrete containment structure is a principal strength element of the structure... of the structure depends on the functional reliability of the structure's principal strength elements...
Federal Register 2010, 2011, 2012, 2013, 2014
2013-08-30
... Organizations; BOX Options Exchange LLC; Notice of Filing and Immediate Effectiveness of a Proposed Rule Change... Proprietary Trader Program (S501) Continuing Education Regulatory Element Session on the BOX Market LLC (``BOX'') options facility. While changes to the fee schedule pursuant to this proposal will be effective upon...
Federal Register 2010, 2011, 2012, 2013, 2014
2010-12-08
... NUCLEAR REGULATORY COMMISSION [Docket No. 70-143; NRC-2010-0379] Nuclear Fuel Services, Inc.; Environmental Assessment and Finding of No Significant Impact for Proposed Exemption From a Requirement To Measure the Uranium Element and Isotopic Content of Special Nuclear Material AGENCY: Nuclear Regulatory Commission. ACTION: Environmental...
Federal Register 2010, 2011, 2012, 2013, 2014
2013-09-11
... Change To Amend Rule 640, Continuing Education for Registered Persons and Adopt a Corresponding Fee... Substance of the Proposed Rule Change The Exchange proposes to amend Rule 640, Continuing Education for... 640. Continuing Education for Registered Persons (a) Regulatory Element (1) Requirements--No member...
Defining Transcriptional Regulatory Mechanisms for Primary let-7 miRNAs
Gaeta, Xavier; Le, Luat; Lin, Ying; Xie, Yuan; Lowry, William E.
2017-01-01
The let-7 family of miRNAs have been shown to control developmental timing in organisms from C. elegans to humans; their function in several essential cell processes throughout development is also well conserved. Numerous studies have defined several steps of post-transcriptional regulation of let-7 production; from pri-miRNA through pre-miRNA, to the mature miRNA that targets endogenous mRNAs for degradation or translational inhibition. Less-well defined are modes of transcriptional regulation of the pri-miRNAs for let-7. let-7 pri-miRNAs are expressed in polycistronic fashion, in long transcripts newly annotated based on chromatin-associated RNA-sequencing. Upon differentiation, we found that some let-7 pri-miRNAs are regulated at the transcriptional level, while others appear to be constitutively transcribed. Using the Epigenetic Roadmap database, we further annotated regulatory elements of each polycistron identified putative promoters and enhancers. Probing these regulatory elements for transcription factor binding sites identified factors that regulate transcription of let-7 in both promoter and enhancer regions, and identified novel regulatory mechanisms for this important class of miRNAs. PMID:28052101
A system-level model for the microbial regulatory genome.
Brooks, Aaron N; Reiss, David J; Allard, Antoine; Wu, Wei-Ju; Salvanha, Diego M; Plaisier, Christopher L; Chandrasekaran, Sriram; Pan, Min; Kaur, Amardeep; Baliga, Nitin S
2014-07-15
Microbes can tailor transcriptional responses to diverse environmental challenges despite having streamlined genomes and a limited number of regulators. Here, we present data-driven models that capture the dynamic interplay of the environment and genome-encoded regulatory programs of two types of prokaryotes: Escherichia coli (a bacterium) and Halobacterium salinarum (an archaeon). The models reveal how the genome-wide distributions of cis-acting gene regulatory elements and the conditional influences of transcription factors at each of those elements encode programs for eliciting a wide array of environment-specific responses. We demonstrate how these programs partition transcriptional regulation of genes within regulons and operons to re-organize gene-gene functional associations in each environment. The models capture fitness-relevant co-regulation by different transcriptional control mechanisms acting across the entire genome, to define a generalized, system-level organizing principle for prokaryotic gene regulatory networks that goes well beyond existing paradigms of gene regulation. An online resource (http://egrin2.systemsbiology.net) has been developed to facilitate multiscale exploration of conditional gene regulation in the two prokaryotes. © 2014 The Authors. Published under the terms of the CC BY 4.0 license.
Discovery of functional non-coding conserved regions in the α-synuclein gene locus
Sterling, Lori; Walter, Michael; Ting, Dennis; Schüle, Birgitt
2014-01-01
Several single nucleotide polymorphisms (SNPs) and the Rep-1 microsatellite marker of the α-synuclein ( SNCA) gene have consistently been shown to be associated with Parkinson’s disease, but the functional relevance is unclear. Based on these findings we hypothesized that conserved cis-regulatory elements in the SNCA genomic region regulate expression of SNCA, and that SNPs in these regions could be functionally modulating the expression of SNCA, thus contributing to neuronal demise and predisposing to Parkinson’s disease. In a pair-wise comparison of a 206kb genomic region encompassing the SNCA gene, we revealed 34 evolutionary conserved DNA sequences between human and mouse. All elements were cloned into reporter vectors and assessed for expression modulation in dual luciferase reporter assays. We found that 12 out of 34 elements exhibited either an enhancement or reduction of the expression of the reporter gene. Three elements upstream of the SNCA gene displayed an approximately 1.5 fold (p<0.009) increase in expression. Of the intronic regions, three showed a 1.5 fold increase and two others indicated a 2 and 2.5 fold increase in expression (p<0.002). Three elements downstream of the SNCA gene showed 1.5 fold and 2.5 fold increase (p<0.0009). One element downstream of SNCA had a reduced expression of the reporter gene of 0.35 fold (p<0.0009) of normal activity. Our results demonstrate that the SNCA gene contains cis-regulatory regions that might regulate the transcription and expression of SNCA. Further studies in disease-relevant tissue types will be important to understand the functional impact of regulatory regions and specific Parkinson’s disease-associated SNPs and its function in the disease process. PMID:25566351
Transposable elements and G-quadruplexes.
Kejnovsky, Eduard; Tokan, Viktor; Lexa, Matej
2015-09-01
A significant part of eukaryotic genomes is formed by transposable elements (TEs) containing not only genes but also regulatory sequences. Some of the regulatory sequences located within TEs can form secondary structures like hairpins or three-stranded (triplex DNA) and four-stranded (quadruplex DNA) conformations. This review focuses on recent evidence showing that G-quadruplex-forming sequences in particular are often present in specific parts of TEs in plants and humans. We discuss the potential role of these structures in the TE life cycle as well as the impact of G-quadruplexes on replication, transcription, translation, chromatin status, and recombination. The aim of this review is to emphasize that TEs may serve as vehicles for the genomic spread of G-quadruplexes. These non-canonical DNA structures and their conformational switches may constitute another regulatory system that, together with small and long non-coding RNA molecules and proteins, contribute to the complex cellular network resulting in the large diversity of eukaryotes.
Hogan, Daniel J; Riordan, Daniel P; Gerber, André P; Herschlag, Daniel; Brown, Patrick O
2008-10-28
RNA-binding proteins (RBPs) have roles in the regulation of many post-transcriptional steps in gene expression, but relatively few RBPs have been systematically studied. We searched for the RNA targets of 40 proteins in the yeast Saccharomyces cerevisiae: a selective sample of the approximately 600 annotated and predicted RBPs, as well as several proteins not annotated as RBPs. At least 33 of these 40 proteins, including three of the four proteins that were not previously known or predicted to be RBPs, were reproducibly associated with specific sets of a few to several hundred RNAs. Remarkably, many of the RBPs we studied bound mRNAs whose protein products share identifiable functional or cytotopic features. We identified specific sequences or predicted structures significantly enriched in target mRNAs of 16 RBPs. These potential RNA-recognition elements were diverse in sequence, structure, and location: some were found predominantly in 3'-untranslated regions, others in 5'-untranslated regions, some in coding sequences, and many in two or more of these features. Although this study only examined a small fraction of the universe of yeast RBPs, 70% of the mRNA transcriptome had significant associations with at least one of these RBPs, and on average, each distinct yeast mRNA interacted with three of the RBPs, suggesting the potential for a rich, multidimensional network of regulation. These results strongly suggest that combinatorial binding of RBPs to specific recognition elements in mRNAs is a pervasive mechanism for multi-dimensional regulation of their post-transcriptional fate.
Flanagan, Talia; Van Peer, Achiel; Lindahl, Anders
2016-08-25
Regulatory interactions are an important part of the drug development and licensing process. A survey on the use of biopharmaceutical tools for regulatory purposes has been carried out within the industry community of the EU project OrBiTo within Innovative Medicines Initiative (IMI). The aim was to capture current practice and experience in using in vitro and in silico biopharmaceutics tools at various stages of development, what barriers exist or are perceived, and to understand the current gaps in regulatory biopharmaceutics. The survey indicated that biorelevant dissolution testing and physiologically based modelling and simulation are widely applied throughout development to address a number of biopharmaceutics issues. However, data from these in vitro and in silico predictive biopharmaceutics tools are submitted to regulatory authorities far less often than they are used for internal risk assessment and decision making. This may prevent regulators from becoming familiar with these tools and how they are applied in industry, and limits the opportunities for biopharmaceutics scientists working in industry to understand the acceptability of these tools in the regulatory environment. It is anticipated that the advanced biopharmaceutics tools and understanding delivered in the next years by OrBiTo and other initiatives in the area of predictive tools will also be of value in the regulatory setting, and provide a basis for more informed and confident biopharmaceutics risk assessment and regulatory decision making. To enable the regulatory potential of predictive biopharmaceutics tools to be realized, further scientific dialogue is needed between industry, regulators and scientists in academia, and more examples need to be published to demonstrate the applicability of these tools. Copyright © 2016 Elsevier B.V. All rights reserved.
Li, S; Zhang, P; Zhang, M; Fu, C; Yu, L
2013-01-01
Although the regulation of taxol biosynthesis at the transcriptional level remains unclear, 10-deacetylbaccatin III-10 β-O-acetyl transferase (DBAT) is a critical enzyme in the biosynthesis of taxol. The 1740 bp fragment 5'-flanking sequence of the dbat gene was cloned from Taxus chinensis cells. Important regulatory elements needed for activity of the dbat promoter were located by deletion analyses in T. chinensis cells. A novel WRKY transcription factor, TcWRKY1, was isolated with the yeast one-hybrid system from a T. chinensis cell cDNA library using the important regulatory elements as bait. The gene expression of TcWRKY1 in T. chinensis suspension cells was specifically induced by methyl jasmonate (MeJA). Biochemical analysis indicated that TcWRKY1 protein specifically interacts with the two W-box (TGAC) cis-elements among the important regulatory elements. Overexpression of TcWRKY1 enhanced dbat expression in T. chinensis suspension cells, and RNA interference (RNAi) reduced the level of transcripts of dbat. These results suggest that TcWRKY1 participates in regulation of taxol biosynthesis in T. chinensis cells, and that dbat is a target gene of this transcription factor. This research also provides a potential candidate gene for engineering increased taxol accumulation in Taxus cell cultures. © 2012 German Botanical Society and The Royal Botanical Society of the Netherlands.
Richards, Stephen; Liu, Yue; Bettencourt, Brian R.; Hradecky, Pavel; Letovsky, Stan; Nielsen, Rasmus; Thornton, Kevin; Hubisz, Melissa J.; Chen, Rui; Meisel, Richard P.; Couronne, Olivier; Hua, Sujun; Smith, Mark A.; Zhang, Peili; Liu, Jing; Bussemaker, Harmen J.; van Batenburg, Marinus F.; Howells, Sally L.; Scherer, Steven E.; Sodergren, Erica; Matthews, Beverly B.; Crosby, Madeline A.; Schroeder, Andrew J.; Ortiz-Barrientos, Daniel; Rives, Catharine M.; Metzker, Michael L.; Muzny, Donna M.; Scott, Graham; Steffen, David; Wheeler, David A.; Worley, Kim C.; Havlak, Paul; Durbin, K. James; Egan, Amy; Gill, Rachel; Hume, Jennifer; Morgan, Margaret B.; Miner, George; Hamilton, Cerissa; Huang, Yanmei; Waldron, Lenée; Verduzco, Daniel; Clerc-Blankenburg, Kerstin P.; Dubchak, Inna; Noor, Mohamed A.F.; Anderson, Wyatt; White, Kevin P.; Clark, Andrew G.; Schaeffer, Stephen W.; Gelbart, William; Weinstock, George M.; Gibbs, Richard A.
2005-01-01
We have sequenced the genome of a second Drosophila species, Drosophila pseudoobscura, and compared this to the genome sequence of Drosophila melanogaster, a primary model organism. Throughout evolution the vast majority of Drosophila genes have remained on the same chromosome arm, but within each arm gene order has been extensively reshuffled, leading to a minimum of 921 syntenic blocks shared between the species. A repetitive sequence is found in the D. pseudoobscura genome at many junctions between adjacent syntenic blocks. Analysis of this novel repetitive element family suggests that recombination between offset elements may have given rise to many paracentric inversions, thereby contributing to the shuffling of gene order in the D. pseudoobscura lineage. Based on sequence similarity and synteny, 10,516 putative orthologs have been identified as a core gene set conserved over 25–55 million years (Myr) since the pseudoobscura/melanogaster divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome-wide average, consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than random and nearby sequences between the species—but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a pattern of repeat-mediated chromosomal rearrangement, and high coadaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence between these species of Drosophila. PMID:15632085
Gonzalez, S M; Ferland, L H; Robert, B; Abdelhay, E
1998-06-01
Vertebrate Msx genes are related to one of the most divergent homeobox genes of Drosophila, the muscle segment homeobox (msh) gene, and are expressed in a well-defined pattern at sites of tissue interactions. This pattern of expression is conserved in vertebrates as diverse as quail, zebrafish, and mouse in a range of sites including neural crest, appendages, and craniofacial structures. In the present work, we performed structural and functional analyses in order to identify potential cis-acting elements that may be regulating Msx1 gene expression. To this end, a 4.9-kb segment of the 5'-flanking region was sequenced and analyzed for transcription-factor binding sites. Four regions showing a high concentration of these sites were identified. Transfection assays with fragments of regulatory sequences driving the expression of the bacterial lacZ reporter gene showed that a region of 4 kb upstream of the transcription start site contains positive and negative elements responsible for controlling gene expression. Interestingly, a fragment of 130 bp seems to contain the minimal elements necessary for gene expression, as its removal completely abolishes gene expression in cultured cells. These results are reinforced by comparison of this region with the human Msx1 gene promoter, which shows extensive conservation, including many consensus binding sites, suggesting a regulatory role for them.
Coetzee, Simon G; Shen, Howard C; Hazelett, Dennis J; Lawrenson, Kate; Kuchenbaecker, Karoline; Tyrer, Jonathan; Rhie, Suhn K; Levanon, Keren; Karst, Alison; Drapkin, Ronny; Ramus, Susan J; Couch, Fergus J; Offit, Kenneth; Chenevix-Trench, Georgia; Monteiro, Alvaro N A; Antoniou, Antonis; Freedman, Matthew; Coetzee, Gerhard A; Pharoah, Paul D P; Noushmehr, Houtan; Gayther, Simon A
2015-07-01
Understanding the regulatory landscape of the human genome is a central question in complex trait genetics. Most single-nucleotide polymorphisms (SNPs) associated with cancer risk lie in non-protein-coding regions, implicating regulatory DNA elements as functional targets of susceptibility variants. Here, we describe genome-wide annotation of regions of open chromatin and histone modification in fallopian tube and ovarian surface epithelial cells (FTSECs, OSECs), the debated cellular origins of high-grade serous ovarian cancers (HGSOCs) and in endometriosis epithelial cells (EECs), the likely precursor of clear cell ovarian carcinomas (CCOCs). The regulatory architecture of these cell types was compared with normal human mammary epithelial cells and LNCaP prostate cancer cells. We observed similar positional patterns of global enhancer signatures across the three different ovarian cancer precursor cell types, and evidence of tissue-specific regulatory signatures compared to non-gynecological cell types. We found significant enrichment for risk-associated SNPs intersecting regulatory biofeatures at 17 known HGSOC susceptibility loci in FTSECs (P = 3.8 × 10(-30)), OSECs (P = 2.4 × 10(-23)) and HMECs (P = 6.7 × 10(-15)) but not for EECs (P = 0.45) or LNCaP cells (P = 0.88). Hierarchical clustering of risk SNPs conditioned on the six different cell types indicates FTSECs and OSECs are highly related (96% of samples using multi-scale bootstrapping) suggesting both cell types may be precursors of HGSOC. These data represent the first description of regulatory catalogues of normal precursor cells for different ovarian cancer subtypes, and provide unique insights into the tissue specific regulatory variation with respect to the likely functional targets of germline genetic susceptibility variants for ovarian cancer. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Kusters, Elske; Della Pina, Serena; Castel, Rob; Souer, Erik; Koes, Ronald
2015-08-15
Higher plant species diverged extensively with regard to the moment (flowering time) and position (inflorescence architecture) at which flowers are formed. This seems largely caused by variation in the expression patterns of conserved genes that specify floral meristem identity (FMI), rather than changes in the encoded proteins. Here, we report a functional comparison of the promoters of homologous FMI genes from Arabidopsis, petunia, tomato and Antirrhinum. Analysis of promoter-reporter constructs in petunia and Arabidopsis, as well as complementation experiments, showed that the divergent expression of leafy (LFY) and the petunia homolog aberrant leaf and flower (ALF) results from alterations in the upstream regulatory network rather than cis-regulatory changes. The divergent expression of unusual floral organs (UFO) from Arabidopsis, and the petunia homolog double top (DOT), however, is caused by the loss or gain of cis-regulatory promoter elements, which respond to trans-acting factors that are expressed in similar patterns in both species. Introduction of pUFO:UFO causes no obvious defects in Arabidopsis, but in petunia it causes the precocious and ectopic formation of flowers. This provides an example of how a change in a cis-regulatory region can account for a change in the plant body plan. © 2015. Published by The Company of Biologists Ltd.
King, Lanikea B.; Walum, Hasse; Inoue, Kiyoshi; Eyrich, Nicholas W.; Young, Larry J.
2015-01-01
Background Oxytocin (OXT) modulates several aspects of social behavior. Intranasal OXT is a leading candidate for treating social deficits in autism spectrum disorder (ASD) and common genetic variants in the human oxytocin receptor (OXTR) are associated with emotion recognition, relationship quality and ASD. Animal models have revealed that individual differences in Oxtr expression in the brain drive social behavior variation. Our understanding of how genetic variation contributes to brain OXTR expression is very limited. Methods We investigated Oxtr expression in monogamous prairie voles, which have a well characterized OXT system. We quantified brain region-specific levels of Oxtr mRNA and OXTR protein with established neuroanatomical methods. We used pyrosequencing to investigate allelic imbalance of Oxtr mRNA, a molecular signature of polymorphic genetic regulatory elements. We performed next-generation sequencing to discover variants in and near the Oxtr gene. We investigated social attachment using the partner preference test. Results Our allelic imbalance data demonstrates that genetic variants contribute to individual differences in Oxtr expression, but only in particular brain regions, including the nucleus accumbens (NAcc), where OXTR signaling facilitates social attachment. Next-generation sequencing identified one polymorphism in the Oxtr intron, near a putative cis-regulatory element, explaining 74% of the variance in striatal Oxtr expression specifically. Males homozygous for the high expressing allele display enhanced social attachment. Discussion Taken together, these findings provide convincing evidence for robust genetic influence on Oxtr expression and provide novel insights into how non-coding polymorphisms in the OXTR might influence individual differences in human social cognition and behavior PMID:26893121
Man, Michal; Epel, Bernard L
2004-06-01
A replicon based on Tobacco mosaic virus that was engineered to express the open reading frame (ORF) of the green fluorescent protein (GFP) gene in place of the native coat protein (CP) gene from a minimal CP subgenomic (sg) RNA promoter was found to accumulate very low levels of GFP. Regulatory regions within the CP ORF were identified that, when presented as untranslated regions flanking the GFP ORF, enhanced or inhibited sg transcription and GFP expression. Full GFP expression from the CP sgRNA promoter required more than the first 20 nt of the CP ORF but not beyond the first 56 nt. Further analysis indicated the presence of an enhancer element between nt +25 and +55 with respect to the CP translation start site. The inclusion of this enhancer sequence upstream of the GFP ORF led to elevated sg transcription and to a 50-fold increase in GFP accumulation in comparison with a minimal CP promoter in which the entire CP ORF was displaced by the GFP ORF. Inclusion of the 3'-terminal 22 nt had a minor positive effect on GFP accumulation, but the addition of extended untranslated sequences from the 3' terminus of the CP ORF downstream of the GFP ORF was basically found to inhibit sg transcription. Secondary structure analysis programs predicted the CP sgRNA promoter to reside within two stable stem-loop structures, which are followed by an enhancer region.
2010-01-01
Background The large amount of high-throughput genomic data has facilitated the discovery of the regulatory relationships between transcription factors and their target genes. While early methods for discovery of transcriptional regulation relationships from microarray data often focused on the high-throughput experimental data alone, more recent approaches have explored the integration of external knowledge bases of gene interactions. Results In this work, we develop an algorithm that provides improved performance in the prediction of transcriptional regulatory relationships by supplementing the analysis of microarray data with a new method of integrating information from an existing knowledge base. Using a well-known dataset of yeast microarrays and the Yeast Proteome Database, a comprehensive collection of known information of yeast genes, we show that knowledge-based predictions demonstrate better sensitivity and specificity in inferring new transcriptional interactions than predictions from microarray data alone. We also show that comprehensive, direct and high-quality knowledge bases provide better prediction performance. Comparison of our results with ChIP-chip data and growth fitness data suggests that our predicted genome-wide regulatory pairs in yeast are reasonable candidates for follow-up biological verification. Conclusion High quality, comprehensive, and direct knowledge bases, when combined with appropriate bioinformatic algorithms, can significantly improve the discovery of gene regulatory relationships from high throughput gene expression data. PMID:20122245
Seok, Junhee; Kaushal, Amit; Davis, Ronald W; Xiao, Wenzhong
2010-01-18
The large amount of high-throughput genomic data has facilitated the discovery of the regulatory relationships between transcription factors and their target genes. While early methods for discovery of transcriptional regulation relationships from microarray data often focused on the high-throughput experimental data alone, more recent approaches have explored the integration of external knowledge bases of gene interactions. In this work, we develop an algorithm that provides improved performance in the prediction of transcriptional regulatory relationships by supplementing the analysis of microarray data with a new method of integrating information from an existing knowledge base. Using a well-known dataset of yeast microarrays and the Yeast Proteome Database, a comprehensive collection of known information of yeast genes, we show that knowledge-based predictions demonstrate better sensitivity and specificity in inferring new transcriptional interactions than predictions from microarray data alone. We also show that comprehensive, direct and high-quality knowledge bases provide better prediction performance. Comparison of our results with ChIP-chip data and growth fitness data suggests that our predicted genome-wide regulatory pairs in yeast are reasonable candidates for follow-up biological verification. High quality, comprehensive, and direct knowledge bases, when combined with appropriate bioinformatic algorithms, can significantly improve the discovery of gene regulatory relationships from high throughput gene expression data.
Polak, Marta E; Ung, Chuin Ying; Masapust, Joanna; Freeman, Tom C; Ardern-Jones, Michael R
2017-04-06
Langerhans cells (LCs) are able to orchestrate adaptive immune responses in the skin by interpreting the microenvironmental context in which they encounter foreign substances, but the regulatory basis for this has not been established. Utilising systems immunology approaches combining in silico modelling of a reconstructed gene regulatory network (GRN) with in vitro validation of the predictions, we sought to determine the mechanisms of regulation of immune responses in human primary LCs. The key role of Interferon regulatory factors (IRFs) as controllers of the human Langerhans cell response to epidermal cytokines was revealed by whole transcriptome analysis. Applying Boolean logic we assembled a Petri net-based model of the IRF-GRN which provides molecular pathway predictions for the induction of different transcriptional programmes in LCs. In silico simulations performed after model parameterisation with transcription factor expression values predicted that human LC activation of antigen-specific CD8 T cells would be differentially regulated by epidermal cytokine induction of specific IRF-controlled pathways. This was confirmed by in vitro measurement of IFN-γ production by activated T cells. As a proof of concept, this approach shows that stochastic modelling of a specific immune networks renders transcriptome data valuable for the prediction of functional outcomes of immune responses.
Ma, AyeAye; Margolis, Mathew S.
2013-01-01
Herpes simplex virus 1 (HSV-1) and HSV-2 establish latency in different neuronal subtypes (A5+ and KH10+) in murine trigeminal ganglia, results which correlate with restricted productive infection in these neurons in vitro. HSV-2 latency-associated transcript (LAT) contains a cis-acting regulatory element near the transcription start site that promotes productive infection in A5+ neurons and a second element in exon 1 that inhibits productive infection in KH10+ neurons. HSV-1 contains no such regulatory sequences, demonstrating different mechanisms for regulating productive HSV infection in neurons. PMID:23514893
Primate-Specific Evolution of an LDLR Enhancer
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Qian-fei; Prabhakar, Shyam; Wang, Qianben
2006-06-28
Sequence changes in regulatory regions have often beeninvoked to explain phenotypic divergence among species, but molecularexamples of this have been difficult to obtain. In this study, weidentified an anthropoid primate specific sequence element thatcontributed to the regulatory evolution of the LDL receptor. Using acombination of close and distant species genomic sequence comparisonscoupled with in vivo and in vitro studies, we show that a functionalcholesterol-sensing sequence motif arose and was fixed within apre-existing enhancer in the common ancestor of anthropoid primates. Ourstudy demonstrates one molecular mechanism by which ancestral mammalianregulatory elements can evolve to perform new functions in the primatelineage leadingmore » to human.« less
Nakajima, T; Sano, R; Takahashi, Y; Watanabe, K; Kubo, R; Kobayashi, M; Takahashi, K; Takeshita, H; Kominato, Y
2016-01-01
Recent investigation of transcriptional regulation of the ABO genes has identified a candidate erythroid cell-specific regulatory element, named the +5·8-kb site, in the first intron of ABO. Six haplotypes of the site have been reported previously. The present genetic population study demonstrated that each haplotype was mostly linked with specific ABO alleles with a few exceptions, possibly as a result of hybrid formation between common ABO alleles. Thus, investigation of these haplotypes could provide a clue to further elucidation of ABO alleles. © 2015 International Society of Blood Transfusion.
Functional dissection of drought-responsive gene expression patterns in Cynodon dactylon L.
Kim, Changsoo; Lemke, Cornelia; Paterson, Andrew H
2009-05-01
Water deficit is one of the main abiotic factors that affect plant productivity in subtropical regions. To identify genes induced during the water stress response in Bermudagrass (Cynodon dactylon), cDNA macroarrays were used. The macroarray analysis identified 189 drought-responsive candidate genes from C. dactylon, of which 120 were up-regulated and 69 were down-regulated. The candidate genes were classified into seven groups by cluster analysis of expression levels across two intensities and three durations of imposed stress. Annotation using BLASTX suggested that up-regulated genes may be involved in proline biosynthesis, signal transduction pathways, protein repair systems, and removal of toxins, while down-regulated genes were mostly related to basic plant metabolism such as photosynthesis and glycolysis. The functional classification of gene ontology (GO) was consistent with the BLASTX results, also suggesting some crosstalk between abiotic and biotic stress. Comparative analysis of cis-regulatory elements from the candidate genes implicated specific elements in drought response in Bermudagrass. Although only a subset of genes was studied, Bermudagrass shared many drought-responsive genes and cis-regulatory elements with other botanical models, supporting a strategy of cross-taxon application of drought-responsive genes, regulatory cues, and physiological-genetic information.
Liu, Chune; Yang, Zhihong; Wu, Jianguo; Zhang, Li; Lee, Sangmin; Shin, Dong-Ju; Tran, Melanie; Wang, Li
2018-05-01
H19 is an imprinted long noncoding RNA abundantly expressed in embryonic liver and repressed after birth. We show that H19 serves as a lipid sensor by synergizing with the RNA-binding polypyrimidine tract-binding protein 1 (PTBP1) to modulate hepatic metabolic homeostasis. H19 RNA interacts with PTBP1 to facilitate its association with sterol regulatory element-binding protein 1c mRNA and protein, leading to increased stability and nuclear transcriptional activity. H19 and PTBP1 are up-regulated by fatty acids in hepatocytes and in diet-induced fatty liver, which further augments lipid accumulation. Ectopic expression of H19 induces steatosis and pushes the liver into a "pseudo-fed" state in response to fasting by promoting sterol regulatory element-binding protein 1c protein cleavage and nuclear translocation. Deletion of H19 or knockdown of PTBP1 abolishes high-fat and high-sucrose diet-induced steatosis. Our study unveils an H19/PTBP1/sterol regulatory element-binding protein 1 feedforward amplifying signaling pathway to exacerbate the development of fatty liver. (Hepatology 2018;67:1768-1783). © 2017 by the American Association for the Study of Liver Diseases.
Boulay, Gaylor; Awad, Mary E.; Riggi, Nicolo; Archer, Tenley C.; Iyer, Sowmya; Boonseng, Wannaporn E.; Rossetti, Nikki E; Naigles, Beverly; Rengarajan, Shruthi; Volorio, Angela; Kim, James C.; Mesirov, Jill P.; Tamayo, Pablo; Pomeroy, Scott L.; Aryee, Martin J.; Rivera, Miguel N.
2017-01-01
Medulloblastoma is the most frequent malignant pediatric brain tumor and is divided into at least four subgroups known as Wnt, SHH, Group 3 and Group 4. Here we characterized gene regulation mechanisms in the most aggressive subtype, Group 3 tumors, through genome-wide chromatin and expression profiling. Our results show that most active distal sites in these tumors are occupied by the transcription factor OTX2. Highly active OTX2 bound enhancers are often arranged as clusters of adjacent peaks and are also bound by the transcription factor NEUROD1. These sites are responsive to OTX2 and NEUROD1 knockdown and could also be generated de novo upon ectopic OTX2 expression in primary cells, showing that OTX2 cooperates with NEUROD1 and plays a major role in maintaining and possibly establishing regulatory elements as a pioneer factor. Among OTX2 target genes we identified the kinase NEK2, whose knockdown and pharmacological inhibition decreased cell viability. Our studies thus show that OTX2 controls the regulatory landscape of Group 3 medulloblastoma through cooperative activity at enhancer elements and contributes to the expression of critical target genes. PMID:28213356
Kovina, A P; Petrova, N V; Razin, S V; Yarovaia, O V
2016-01-01
In warm-blooded vertebrates, the α- and β-globin genes are organized in domains of different types and are regulated in different fashion. In cold-blooded vertebrates and, in particular, the tropical fish Danio rerio, the α- and β-globin genes form two gene clusters. A major D. rerio globin gene cluster is in chromosome 3 and includes the α- and β-globin genes of embryonic-larval and adult types. The region upstream of the cluster contains c16orf35, harbors the main regulatory element (MRE) of the α-globin gene domain in warm-blooded vertebrates. In this study, transient transfection of erythroid cells with genetic constructs containing a reporter gene under the control of potential regulatory elements of the domain was performed to characterize the promoters of the embryonic-larval and adult α- and β-globin genes of the major cluster. Also, in the 5th intron of c16orf35 in Danio reriowas detected a functional analog of the warm-blooded vertebrate MRE. This enhancer stimulated activity of the promoters of both adult and embryonic-larval α- and β-globin genes.
Genetic evidence for conserved non-coding element function across species–the ears have it
Turner, Eric E.; Cox, Timothy C.
2014-01-01
Comparison of genomic sequences from diverse vertebrate species has revealed numerous highly conserved regions that do not appear to encode proteins or functional RNAs. Often these “conserved non-coding elements,” or CNEs, can direct gene expression to specific tissues in transgenic models, demonstrating they have regulatory function. CNEs are frequently found near “developmental” genes, particularly transcription factors, implying that these elements have essential regulatory roles in development. However, actual examples demonstrating CNE regulatory functions across species have been few, and recent loss-of-function studies of several CNEs in mice have shown relatively minor effects. In this Perspectives article, we discuss new findings in “fancy” rats and Highland cattle demonstrating that function of a CNE near the Hmx1 gene is crucial for normal external ear development and when disrupted can mimic loss-of function Hmx1 coding mutations in mice and humans. These findings provide important support for conserved developmental roles of CNEs in divergent species, and reinforce the concept that CNEs should be examined systematically in the ongoing search for genetic causes of human developmental disorders in the era of genome-scale sequencing. PMID:24478720
Germline EMSY sequence alterations in hereditary breast cancer and ovarian cancer families.
Määttä, Kirsi M; Nurminen, Riikka; Kankuri-Tammilehto, Minna; Kallioniemi, Anne; Laasanen, Satu-Leena; Schleutker, Johanna
2017-07-24
BRCA1 and BRCA2 mutations explain approximately one-fifth of the inherited susceptibility in high-risk Finnish hereditary breast and ovarian cancer (HBOC) families. EMSY is located in the breast cancer-associated chromosomal region 11q13. The EMSY gene encodes a BRCA2-interacting protein that has been implicated in DNA damage repair and genomic instability. We analysed the role of germline EMSY variation in breast/ovarian cancer predisposition. The present study describes the first EMSY screening in patients with high familial risk for this disease. Index individuals from 71 high-risk, BRCA1/2-negative HBOC families were screened for germline EMSY sequence alterations in protein coding regions and exon-intron boundaries using Sanger sequencing and TaqMan assays. The identified variants were further screened in 36 Finnish HBOC patients and 904 controls. Moreover, one novel intronic deletion was screened in a cohort of 404 breast cancer patients unselected for family history. Haplotype block structure and the association of haplotypes with breast/ovarian cancer were analysed using Haploview. The functionality of the identified variants was predicted using Haploreg, RegulomeDB, Human Splicing Finder, and Pathogenic-or-Not-Pipeline 2. Altogether, 12 germline EMSY variants were observed. Two alterations were located in the coding region, five alterations were intronic, and five alterations were located in the 3'untranslated region (UTR). Variant frequencies did not significantly differ between cases and controls. The novel variant, c.2709 + 122delT, was detected in 1 out of 107 (0.9%) breast cancer patients, and the carrier showed a bilateral form of the disease. The deletion was absent in 897 controls (OR = 25.28; P = 0.1) and in 404 breast cancer patients unselected for family history. No haplotype was identified to increase the risk of breast/ovarian cancer. Functional analyses suggested that variants, particularly in the 3'UTR, were located within regulatory elements. The novel deletion was predicted to affect splicing regulatory elements. These results suggest that the identified EMSY variants are likely neutral at the population level. However, these variants may contribute to breast/ovarian cancer risk in single families. Additional analyses are warranted for rare novel intronic deletions and the 3'UTR variants predicted to have functional roles.
Mouse regulatory DNA landscapes reveal global principles of cis-regulatory evolution.
Vierstra, Jeff; Rynes, Eric; Sandstrom, Richard; Zhang, Miaohua; Canfield, Theresa; Hansen, R Scott; Stehling-Sun, Sandra; Sabo, Peter J; Byron, Rachel; Humbert, Richard; Thurman, Robert E; Johnson, Audra K; Vong, Shinny; Lee, Kristen; Bates, Daniel; Neri, Fidencio; Diegel, Morgan; Giste, Erika; Haugen, Eric; Dunn, Douglas; Wilken, Matthew S; Josefowicz, Steven; Samstein, Robert; Chang, Kai-Hsin; Eichler, Evan E; De Bruijn, Marella; Reh, Thomas A; Skoultchi, Arthur; Rudensky, Alexander; Orkin, Stuart H; Papayannopoulou, Thalia; Treuting, Piper M; Selleri, Licia; Kaul, Rajinder; Groudine, Mark; Bender, M A; Stamatoyannopoulos, John A
2014-11-21
To study the evolutionary dynamics of regulatory DNA, we mapped >1.3 million deoxyribonuclease I-hypersensitive sites (DHSs) in 45 mouse cell and tissue types, and systematically compared these with human DHS maps from orthologous compartments. We found that the mouse and human genomes have undergone extensive cis-regulatory rewiring that combines branch-specific evolutionary innovation and loss with widespread repurposing of conserved DHSs to alternative cell fates, and that this process is mediated by turnover of transcription factor (TF) recognition elements. Despite pervasive evolutionary remodeling of the location and content of individual cis-regulatory regions, within orthologous mouse and human cell types the global fraction of regulatory DNA bases encoding recognition sites for each TF has been strictly conserved. Our findings provide new insights into the evolutionary forces shaping mammalian regulatory DNA landscapes. Copyright © 2014, American Association for the Advancement of Science.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jaynes, J.B.; Johnson, J.E.; Buskin, J.N.
1988-01-01
Muscle creatine kinase (MCK) is induced to high levels during skeletal muscle differentiation. The authors examined the upstream regulatory elements of the mouse MCK gene which specify its activation during myogenesis in culture. Fusion genes containing up to 3,300 nucleotides (nt) of MCK 5' flanking DNA in various positions and orientations relative to the bacterial chloramphenicol acetyltransferase (CAT) structural gene were transfected into cultured cells. Transient expression of CAT was compared between proliferating and differentiated MM14 mouse myoblasts and with nonmyogenic mouse L cells. The major effector of high-level expression was found to have the properties of a transcriptional enhancer.more » This element, located between 1,050 and 1,256 nt upstream of the transcription start site, was also found to have a major influence on the tissue and differentiation specificity of MCK expression; it activated either the MCK promoter or heterologous promoters only in differentiated muscle cells. Comparisons of viral and cellular enhancer sequences with the MCK enhancer revealed some similarities to essential regions of the simian virus 40 enhancer as well as to a region of the immunoglobulin heavy-chain enhancer, which has been implicated in tissue-specific protein binding. Even in the absence of the enhancer, low-level expression from a 776-nt MCK promoter retained differentiation specificity. In addition to positive regulatory elements, our data provide some evidence for negative regulatory elements with activity in myoblasts. These may contribute to the cell type and differentiation specificity of MCK expression.« less
Benito-Sanz, Sara; Aza-Carmona, Miriam; Rodríguez-Estevez, Amaya; Rica-Etxebarria, Ixaso; Gracia, Ricardo; Campos-Barros, Angel; Heath, Karen E
2012-01-01
Short stature homeobox-containing gene, MIM 312865 (SHOX) is located within the pseudoautosomal region 1 (PAR1) of the sex chromosomes. Mutations in SHOX or its downstream transcriptional regulatory elements represent the underlying molecular defect in ~60% of Léri-Weill dyschondrosteosis (LWD) and ~5-15% of idiopathic short stature (ISS) patients. Recently, three novel enhancer elements have been identified upstream of SHOX but to date, no PAR1 deletions upstream of SHOX have been observed that only encompass these enhancers in LWD or ISS patients. We set out to search for genetic alterations of the upstream SHOX regulatory elements in 63 LWD and 100 ISS patients with no known alteration in SHOX or the downstream enhancer regions using a specifically designed MLPA assay, which covers the PAR1 upstream of SHOX. An upstream SHOX deletion was identified in an ISS proband and her affected father. The deletion was confirmed and delimited by array-CGH, to extend ~286 kb. The deletion included two of the upstream SHOX enhancers without affecting SHOX. The 13.3-year-old proband had proportionate short stature with normal GH and IGF-I levels. In conclusion, we have identified the first PAR1 deletion encompassing only the upstream SHOX transcription regulatory elements in a family with ISS. The loss of these elements may result in SHOX haploinsufficiency because of decreased SHOX transcription. Therefore, this upstream region should be included in the routine analysis of PAR1 in patients with LWD, LMD and ISS.
Benito-Sanz, Sara; Aza-Carmona, Miriam; Rodríguez-Estevez, Amaya; Rica-Etxebarria, Ixaso; Gracia, Ricardo; Campos-Barros, Ángel; Heath, Karen E
2012-01-01
Short stature homeobox-containing gene, MIM 312865 (SHOX) is located within the pseudoautosomal region 1 (PAR1) of the sex chromosomes. Mutations in SHOX or its downstream transcriptional regulatory elements represent the underlying molecular defect in ∼60% of Léri-Weill dyschondrosteosis (LWD) and ∼5–15% of idiopathic short stature (ISS) patients. Recently, three novel enhancer elements have been identified upstream of SHOX but to date, no PAR1 deletions upstream of SHOX have been observed that only encompass these enhancers in LWD or ISS patients. We set out to search for genetic alterations of the upstream SHOX regulatory elements in 63 LWD and 100 ISS patients with no known alteration in SHOX or the downstream enhancer regions using a specifically designed MLPA assay, which covers the PAR1 upstream of SHOX. An upstream SHOX deletion was identified in an ISS proband and her affected father. The deletion was confirmed and delimited by array-CGH, to extend ∼286 kb. The deletion included two of the upstream SHOX enhancers without affecting SHOX. The 13.3-year-old proband had proportionate short stature with normal GH and IGF-I levels. In conclusion, we have identified the first PAR1 deletion encompassing only the upstream SHOX transcription regulatory elements in a family with ISS. The loss of these elements may result in SHOX haploinsufficiency because of decreased SHOX transcription. Therefore, this upstream region should be included in the routine analysis of PAR1 in patients with LWD, LMD and ISS. PMID:22071895
D-MATRIX: A web tool for constructing weight matrix of conserved DNA motifs
Sen, Naresh; Mishra, Manoj; Khan, Feroz; Meena, Abha; Sharma, Ashok
2009-01-01
Despite considerable efforts to date, DNA motif prediction in whole genome remains a challenge for researchers. Currently the genome wide motif prediction tools required either direct pattern sequence (for single motif) or weight matrix (for multiple motifs). Although there are known motif pattern databases and tools for genome level prediction but no tool for weight matrix construction. Considering this, we developed a D-MATRIX tool which predicts the different types of weight matrix based on user defined aligned motif sequence set and motif width. For retrieval of known motif sequences user can access the commonly used databases such as TFD, RegulonDB, DBTBS, Transfac. DMATRIX program uses a simple statistical approach for weight matrix construction, which can be converted into different file formats according to user requirement. It provides the possibility to identify the conserved motifs in the coregulated genes or whole genome. As example, we successfully constructed the weight matrix of LexA transcription factor binding site with the help of known sosbox cisregulatory elements in Deinococcus radiodurans genome. The algorithm is implemented in C-Sharp and wrapped in ASP.Net to maintain a user friendly web interface. DMATRIX tool is accessible through the CIMAP domain network. Availability http://203.190.147.116/dmatrix/ PMID:19759861
D-MATRIX: a web tool for constructing weight matrix of conserved DNA motifs.
Sen, Naresh; Mishra, Manoj; Khan, Feroz; Meena, Abha; Sharma, Ashok
2009-07-27
Despite considerable efforts to date, DNA motif prediction in whole genome remains a challenge for researchers. Currently the genome wide motif prediction tools required either direct pattern sequence (for single motif) or weight matrix (for multiple motifs). Although there are known motif pattern databases and tools for genome level prediction but no tool for weight matrix construction. Considering this, we developed a D-MATRIX tool which predicts the different types of weight matrix based on user defined aligned motif sequence set and motif width. For retrieval of known motif sequences user can access the commonly used databases such as TFD, RegulonDB, DBTBS, Transfac. D-MATRIX program uses a simple statistical approach for weight matrix construction, which can be converted into different file formats according to user requirement. It provides the possibility to identify the conserved motifs in the co-regulated genes or whole genome. As example, we successfully constructed the weight matrix of LexA transcription factor binding site with the help of known sos-box cis-regulatory elements in Deinococcus radiodurans genome. The algorithm is implemented in C-Sharp and wrapped in ASP.Net to maintain a user friendly web interface. D-MATRIX tool is accessible through the CIMAP domain network. http://203.190.147.116/dmatrix/
Manavalan, Balachandran; Shin, Tae Hwan; Lee, Gwang
2018-01-05
DNase I hypersensitive sites (DHSs) are genomic regions that provide important information regarding the presence of transcriptional regulatory elements and the state of chromatin. Therefore, identifying DHSs in uncharacterized DNA sequences is crucial for understanding their biological functions and mechanisms. Although many experimental methods have been proposed to identify DHSs, they have proven to be expensive for genome-wide application. Therefore, it is necessary to develop computational methods for DHS prediction. In this study, we proposed a support vector machine (SVM)-based method for predicting DHSs, called DHSpred (DNase I Hypersensitive Site predictor in human DNA sequences), which was trained with 174 optimal features. The optimal combination of features was identified from a large set that included nucleotide composition and di- and trinucleotide physicochemical properties, using a random forest algorithm. DHSpred achieved a Matthews correlation coefficient and accuracy of 0.660 and 0.871, respectively, which were 3% higher than those of control SVM predictors trained with non-optimized features, indicating the efficiency of the feature selection method. Furthermore, the performance of DHSpred was superior to that of state-of-the-art predictors. An online prediction server has been developed to assist the scientific community, and is freely available at: http://www.thegleelab.org/DHSpred.html.
Manavalan, Balachandran; Shin, Tae Hwan; Lee, Gwang
2018-01-01
DNase I hypersensitive sites (DHSs) are genomic regions that provide important information regarding the presence of transcriptional regulatory elements and the state of chromatin. Therefore, identifying DHSs in uncharacterized DNA sequences is crucial for understanding their biological functions and mechanisms. Although many experimental methods have been proposed to identify DHSs, they have proven to be expensive for genome-wide application. Therefore, it is necessary to develop computational methods for DHS prediction. In this study, we proposed a support vector machine (SVM)-based method for predicting DHSs, called DHSpred (DNase I Hypersensitive Site predictor in human DNA sequences), which was trained with 174 optimal features. The optimal combination of features was identified from a large set that included nucleotide composition and di- and trinucleotide physicochemical properties, using a random forest algorithm. DHSpred achieved a Matthews correlation coefficient and accuracy of 0.660 and 0.871, respectively, which were 3% higher than those of control SVM predictors trained with non-optimized features, indicating the efficiency of the feature selection method. Furthermore, the performance of DHSpred was superior to that of state-of-the-art predictors. An online prediction server has been developed to assist the scientific community, and is freely available at: http://www.thegleelab.org/DHSpred.html PMID:29416743
Summerfield, Taryn L.; Yu, Lianbo; Gulati, Parul; Zhang, Jie; Huang, Kun; Romero, Roberto; Kniss, Douglas A.
2011-01-01
A majority of the studies examining the molecular regulation of human labor have been conducted using single gene approaches. While the technology to produce multi-dimensional datasets is readily available, the means for facile analysis of such data are limited. The objective of this study was to develop a systems approach to infer regulatory mechanisms governing global gene expression in cytokine-challenged cells in vitro, and to apply these methods to predict gene regulatory networks (GRNs) in intrauterine tissues during term parturition. To this end, microarray analysis was applied to human amnion mesenchymal cells (AMCs) stimulated with interleukin-1β, and differentially expressed transcripts were subjected to hierarchical clustering, temporal expression profiling, and motif enrichment analysis, from which a GRN was constructed. These methods were then applied to fetal membrane specimens collected in the absence or presence of spontaneous term labor. Analysis of cytokine-responsive genes in AMCs revealed a sterile immune response signature, with promoters enriched in response elements for several inflammation-associated transcription factors. In comparison to the fetal membrane dataset, there were 34 genes commonly upregulated, many of which were part of an acute inflammation gene expression signature. Binding motifs for nuclear factor-κB were prominent in the gene interaction and regulatory networks for both datasets; however, we found little evidence to support the utilization of pathogen-associated molecular pattern (PAMP) signaling. The tissue specimens were also enriched for transcripts governed by hypoxia-inducible factor. The approach presented here provides an uncomplicated means to infer global relationships among gene clusters involved in cellular responses to labor-associated signals. PMID:21655103
Assessment of composite motif discovery methods.
Klepper, Kjetil; Sandve, Geir K; Abul, Osman; Johansen, Jostein; Drablos, Finn
2008-02-26
Computational discovery of regulatory elements is an important area of bioinformatics research and more than a hundred motif discovery methods have been published. Traditionally, most of these methods have addressed the problem of single motif discovery - discovering binding motifs for individual transcription factors. In higher organisms, however, transcription factors usually act in combination with nearby bound factors to induce specific regulatory behaviours. Hence, recent focus has shifted from single motifs to the discovery of sets of motifs bound by multiple cooperating transcription factors, so called composite motifs or cis-regulatory modules. Given the large number and diversity of methods available, independent assessment of methods becomes important. Although there have been several benchmark studies of single motif discovery, no similar studies have previously been conducted concerning composite motif discovery. We have developed a benchmarking framework for composite motif discovery and used it to evaluate the performance of eight published module discovery tools. Benchmark datasets were constructed based on real genomic sequences containing experimentally verified regulatory modules, and the module discovery programs were asked to predict both the locations of these modules and to specify the single motifs involved. To aid the programs in their search, we provided position weight matrices corresponding to the binding motifs of the transcription factors involved. In addition, selections of decoy matrices were mixed with the genuine matrices on one dataset to test the response of programs to varying levels of noise. Although some of the methods tested tended to score somewhat better than others overall, there were still large variations between individual datasets and no single method performed consistently better than the rest in all situations. The variation in performance on individual datasets also shows that the new benchmark datasets represents a suitable variety of challenges to most methods for module discovery.
Interrogating the topological robustness of gene regulatory circuits by randomization
Levine, Herbert; Onuchic, Jose N.
2017-01-01
One of the most important roles of cells is performing their cellular tasks properly for survival. Cells usually achieve robust functionality, for example, cell-fate decision-making and signal transduction, through multiple layers of regulation involving many genes. Despite the combinatorial complexity of gene regulation, its quantitative behavior has been typically studied on the basis of experimentally verified core gene regulatory circuitry, composed of a small set of important elements. It is still unclear how such a core circuit operates in the presence of many other regulatory molecules and in a crowded and noisy cellular environment. Here we report a new computational method, named random circuit perturbation (RACIPE), for interrogating the robust dynamical behavior of a gene regulatory circuit even without accurate measurements of circuit kinetic parameters. RACIPE generates an ensemble of random kinetic models corresponding to a fixed circuit topology, and utilizes statistical tools to identify generic properties of the circuit. By applying RACIPE to simple toggle-switch-like motifs, we observed that the stable states of all models converge to experimentally observed gene state clusters even when the parameters are strongly perturbed. RACIPE was further applied to a proposed 22-gene network of the Epithelial-to-Mesenchymal Transition (EMT), from which we identified four experimentally observed gene states, including the states that are associated with two different types of hybrid Epithelial/Mesenchymal phenotypes. Our results suggest that dynamics of a gene circuit is mainly determined by its topology, not by detailed circuit parameters. Our work provides a theoretical foundation for circuit-based systems biology modeling. We anticipate RACIPE to be a powerful tool to predict and decode circuit design principles in an unbiased manner, and to quantitatively evaluate the robustness and heterogeneity of gene expression. PMID:28362798
mRNA Regulation of Cardiac Iron Transporters and Ferritin Subunits in a Mouse Model of Iron Overload
Brewer, Casey J.; Wood, Ruth I.; Wood, John C.
2014-01-01
Iron cardiomyopathy is the leading cause of death in iron overload. Men have twice the mortality rate of women, though the cause is unknown. In hemojuvelin-knockout mice, a model of the disease, males load more cardiac iron than females. We postulated that sex differences in cardiac iron import cause differences in cardiac iron concentration. RT-PCR was used to measure mRNA of cardiac iron transporters in hemojuvelin-knockout mice. No sex differences were discovered among putative importers of non-transferrin bound iron (L-type and T-type calcium channels, ZRT/IRT-like protein 14 zinc channels). Transferrin-bound iron transporters were also analyzed; these are controlled by the iron regulatory element/iron regulatory protein (IRE/IRP) system. There was a positive relationship between cardiac iron and ferroportin mRNA in both sexes, but it was significantly steeper in females (p<0.05). Transferrin receptor 1 and divalent metal transporter 1 were more highly expressed in females than males (p<0.01 and p<0.0001, respectively), consistent with their lower cardiac iron levels, as predicted by IRE/IRP regulatory pathways. Light-chain (L) ferritin showed a positive correlation with cardiac iron that was nearly identical in males and females (R2=0.41, p<0.01 and R2=0.56, p<0.05, respectively), while heavy-chain (H) ferritin was constitutively expressed in both sexes. This represents the first report of IRE/IRP regulatory pathways in the heart. Transcriptional regulation of ferroportin was suggested in both sexes, creating a potential mechanism for differential set points for iron export. Constitutive H-ferritin expression suggests a logical limit to cardiac iron buffering capacity at levels known to produce heart failure in humans. PMID:25220979
CRX ChIP-seq reveals the cis-regulatory architecture of mouse photoreceptors
Corbo, Joseph C.; Lawrence, Karen A.; Karlstetter, Marcus; Myers, Connie A.; Abdelaziz, Musa; Dirkes, William; Weigelt, Karin; Seifert, Martin; Benes, Vladimir; Fritsche, Lars G.; Weber, Bernhard H.F.; Langmann, Thomas
2010-01-01
Approximately 98% of mammalian DNA is noncoding, yet we understand relatively little about the function of this enigmatic portion of the genome. The cis-regulatory elements that control gene expression reside in noncoding regions and can be identified by mapping the binding sites of tissue-specific transcription factors. Cone-rod homeobox (CRX) is a key transcription factor in photoreceptor differentiation and survival, but its in vivo targets are largely unknown. Here, we used chromatin immunoprecipitation with massively parallel sequencing (ChIP-seq) on CRX to identify thousands of cis-regulatory regions around photoreceptor genes in adult mouse retina. CRX directly regulates downstream photoreceptor transcription factors and their target genes via a network of spatially distributed regulatory elements around each locus. CRX-bound regions act in a synergistic fashion to activate transcription and contain multiple CRX binding sites which interact in a spacing- and orientation-dependent manner to fine-tune transcript levels. CRX ChIP-seq was also performed on Nrl−/− retinas, which represent an enriched source of cone photoreceptors. Comparison with the wild-type ChIP-seq data set identified numerous rod- and cone-specific CRX-bound regions as well as many shared elements. Thus, CRX combinatorially orchestrates the transcriptional networks of both rods and cones by coordinating the expression of photoreceptor genes including most retinal disease genes. In addition, this study pinpoints thousands of noncoding regions of relevance to both Mendelian and complex retinal disease. PMID:20693478
Assessment of the TRACE Reactor Analysis Code Against Selected PANDA Transient Data
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zavisca, M.; Ghaderi, M.; Khatib-Rahbar, M.
2006-07-01
The TRACE (TRAC/RELAP Advanced Computational Engine) code is an advanced, best-estimate thermal-hydraulic program intended to simulate the transient behavior of light-water reactor systems, using a two-fluid (steam and water, with non-condensable gas), seven-equation representation of the conservation equations and flow-regime dependent constitutive relations in a component-based model with one-, two-, or three-dimensional elements, as well as solid heat structures and logical elements for the control system. The U.S. Nuclear Regulatory Commission is currently supporting the development of the TRACE code and its assessment against a variety of experimental data pertinent to existing and evolutionary reactor designs. This paper presents themore » results of TRACE post-test prediction of P-series of experiments (i.e., tests comprising the ISP-42 blind and open phases) conducted at the PANDA large-scale test facility in 1990's. These results show reasonable agreement with the reported test results, indicating good performance of the code and relevant underlying thermal-hydraulic and heat transfer models. (authors)« less
Resolving Heart Regeneration by Replacement Histone Profiling.
Goldman, Joseph Aaron; Kuzu, Guray; Lee, Nutishia; Karasik, Jaclyn; Gemberling, Matthew; Foglia, Matthew J; Karra, Ravi; Dickson, Amy L; Sun, Fei; Tolstorukov, Michael Y; Poss, Kenneth D
2017-02-27
Chromatin regulation is a principal mechanism governing animal development, yet it is unclear to what extent structural changes in chromatin underlie tissue regeneration. Non-mammalian vertebrates such as zebrafish activate cardiomyocyte (CM) division after tissue damage to regenerate lost heart muscle. Here, we generated transgenic zebrafish expressing a biotinylatable H3.3 histone variant in CMs and derived cell-type-specific profiles of histone replacement. We identified an emerging program of putative enhancers that revise H3.3 occupancy during regeneration, overlaid upon a genome-wide reduction of H3.3 from promoters. In transgenic reporter lines, H3.3-enriched elements directed gene expression in subpopulations of CMs. Other elements increased H3.3 enrichment and displayed enhancer activity in settings of injury- and/or Neuregulin1-elicited CM proliferation. Dozens of consensus sequence motifs containing predicted transcription factor binding sites were enriched in genomic regions with regeneration-responsive H3.3 occupancy. Thus, cell-type-specific regulatory programs of tissue regeneration can be revealed by genome-wide H3.3 profiling. Copyright © 2017 Elsevier Inc. All rights reserved.
Lum, Thomas E.; Merritt, Thomas J. S.
2011-01-01
Regulation of transcription can be a complex process in which many cis- and trans-interactions determine the final pattern of expression. Among these interactions are trans-interactions mediated by the pairing of homologous chromosomes. These trans-effects are wide ranging, affecting gene regulation in many species and creating complex possibilities in gene regulation. Here we describe a novel case of trans-interaction between alleles of the Malic enzyme (Men) locus in Drosophila melanogaster that results in allele-specific, non-additive gene expression. Using both empirical biochemical and predictive bioinformatic approaches, we show that the regulatory elements of one allele are capable of interacting in trans with, and modifying the expression of, the second allele. Furthermore, we show that nonlocal factors—different genetic backgrounds—are capable of significant interactions with individual Men alleles, suggesting that these trans-effects can be modified by both locally and distantly acting elements. In sum, these results emphasize the complexity of gene regulation and the need to understand both small- and large-scale interactions as more complete models of the role of trans-interactions in gene regulation are developed. PMID:21900270
Federal Register 2010, 2011, 2012, 2013, 2014
2012-12-26
... SECURITIES AND EXCHANGE COMMISSION [Release No. 34-68468; File No. SR-FINRA-2012-055] Self...-Element Continuing Education Program To Qualify To Engage in a Security Futures Business December 19, 2012. Pursuant to Section 19(b)(1) of the Securities Exchange Act of 1934 (``Act'')\\1\\ and Rule 19b-4 thereunder...
Maternal Prenatal Stress and Infant Regulatory Capacity in Mexican Americans
Lin, Betty; Crnic, Keith A.; Luecken, Linda J.; Gonzales, Nancy A.
2014-01-01
The early postpartum period lays important groundwork for later self-regulation as infants' dispositional traits interact with caregivers' co-regulatory behaviors to produce the earliest forms of self-regulation. Although emerging literature suggests that fetal exposure to maternal stress may be integral in determining child self-regulatory capacity, the complex pathways that characterize these early developmental processes remain unclear. The current study considers these complex, transactional processes in a low income, Mexican American sample. Data were collected from 295 Mexican American infants and their mothers during prenatal, 6- and 12-week postpartum home interviews. Mother reports of stress were obtained prenatally, and mother reports of infant temperament were obtained at 6 weeks. Observer ratings of maternal sensitivity and infant regulatory behaviors were obtained at the 6- and 12-week time points. Study results indicate that prenatal stress predicts higher levels of infant negativity and surgency, both of which directly or interactively predict later engagement in regulatory behaviors. Unexpectedly, prenatal stress also predicted more engagement in orienting, but not self-comforting behaviors. Advancing understandings about the nature of these developmental pathways may have significant implications for targets of early intervention in this high risk population. PMID:25113917
Association analysis identifies 65 new breast cancer risk loci
Lemaçon, Audrey; Soucy, Penny; Glubb, Dylan; Rostamianfar, Asha; Bolla, Manjeet K.; Wang, Qin; Tyrer, Jonathan; Dicks, Ed; Lee, Andrew; Wang, Zhaoming; Allen, Jamie; Keeman, Renske; Eilber, Ursula; French, Juliet D.; Chen, Xiao Qing; Fachal, Laura; McCue, Karen; McCart Reed, Amy E.; Ghoussaini, Maya; Carroll, Jason; Jiang, Xia; Finucane, Hilary; Adams, Marcia; Adank, Muriel A.; Ahsan, Habibul; Aittomäki, Kristiina; Anton-Culver, Hoda; Antonenkova, Natalia N.; Arndt, Volker; Aronson, Kristan J.; Arun, Banu; Auer, Paul L.; Bacot, François; Barrdahl, Myrto; Baynes, Caroline; Beckmann, Matthias W.; Behrens, Sabine; Benitez, Javier; Bermisheva, Marina; Bernstein, Leslie; Blomqvist, Carl; Bogdanova, Natalia V.; Bojesen, Stig E.; Bonanni, Bernardo; Børresen-Dale, Anne-Lise; Brand, Judith S.; Brauch, Hiltrud; Brennan, Paul; Brenner, Hermann; Brinton, Louise; Broberg, Per; Brock, Ian W.; Broeks, Annegien; Brooks-Wilson, Angela; Brucker, Sara Y.; Brüning, Thomas; Burwinkel, Barbara; Butterbach, Katja; Cai, Qiuyin; Cai, Hui; Caldés, Trinidad; Canzian, Federico; Carracedo, Angel; Carter, Brian D.; Castelao, Jose E.; Chan, Tsun L.; Cheng, Ting-Yuan David; Chia, Kee Seng; Choi, Ji-Yeob; Christiansen, Hans; Clarke, Christine L.; Collée, Margriet; Conroy, Don M.; Cordina-Duverger, Emilie; Cornelissen, Sten; Cox, David G; Cox, Angela; Cross, Simon S.; Cunningham, Julie M.; Czene, Kamila; Daly, Mary B.; Devilee, Peter; Doheny, Kimberly F.; Dörk, Thilo; dos-Santos-Silva, Isabel; Dumont, Martine; Durcan, Lorraine; Dwek, Miriam; Eccles, Diana M.; Ekici, Arif B.; Eliassen, A. Heather; Ellberg, Carolina; Elvira, Mingajeva; Engel, Christoph; Eriksson, Mikael; Fasching, Peter A.; Figueroa, Jonine; Flesch-Janys, Dieter; Fletcher, Olivia; Flyger, Henrik; Fritschi, Lin; Gaborieau, Valerie; Gabrielson, Marike; Gago-Dominguez, Manuela; Gao, Yu-Tang; Gapstur, Susan M.; García-Sáenz, José A.; Gaudet, Mia M.; Georgoulias, Vassilios; Giles, Graham G.; Glendon, Gord; Goldberg, Mark S.; Goldgar, David E.; González-Neira, Anna; Grenaker Alnæs, Grethe I.; Grip, Mervi; Gronwald, Jacek; Grundy, Anne; Guénel, Pascal; Haeberle, Lothar; Hahnen, Eric; Haiman, Christopher A.; Håkansson, Niclas; Hamann, Ute; Hamel, Nathalie; Hankinson, Susan; Harrington, Patricia; Hart, Steven N.; Hartikainen, Jaana M.; Hartman, Mikael; Hein, Alexander; Heyworth, Jane; Hicks, Belynda; Hillemanns, Peter; Ho, Dona N.; Hollestelle, Antoinette; Hooning, Maartje J.; Hoover, Robert N.; Hopper, John L.; Hou, Ming-Feng; Hsiung, Chia-Ni; Huang, Guanmengqian; Humphreys, Keith; Ishiguro, Junko; Ito, Hidemi; Iwasaki, Motoki; Iwata, Hiroji; Jakubowska, Anna; Janni, Wolfgang; John, Esther M.; Johnson, Nichola; Jones, Kristine; Jones, Michael; Jukkola-Vuorinen, Arja; Kaaks, Rudolf; Kabisch, Maria; Kaczmarek, Katarzyna; Kang, Daehee; Kasuga, Yoshio; Kerin, Michael J.; Khan, Sofia; Khusnutdinova, Elza; Kiiski, Johanna I.; Kim, Sung-Won; Knight, Julia A.; Kosma, Veli-Matti; Kristensen, Vessela N.; Krüger, Ute; Kwong, Ava; Lambrechts, Diether; Marchand, Loic Le; Lee, Eunjung; Lee, Min Hyuk; Lee, Jong Won; Lee, Chuen Neng; Lejbkowicz, Flavio; Li, Jingmei; Lilyquist, Jenna; Lindblom, Annika; Lissowska, Jolanta; Lo, Wing-Yee; Loibl, Sibylle; Long, Jirong; Lophatananon, Artitaya; Lubinski, Jan; Luccarini, Craig; Lux, Michael P.; Ma, Edmond S.K.; MacInnis, Robert J.; Maishman, Tom; Makalic, Enes; Malone, Kathleen E; Kostovska, Ivana Maleva; Mannermaa, Arto; Manoukian, Siranoush; Manson, JoAnn E.; Margolin, Sara; Mariapun, Shivaani; Martinez, Maria Elena; Matsuo, Keitaro; Mavroudis, Dimitrios; McKay, James; McLean, Catriona; Meijers-Heijboer, Hanne; Meindl, Alfons; Menéndez, Primitiva; Menon, Usha; Meyer, Jeffery; Miao, Hui; Miller, Nicola; Mohd Taib, Nur Aishah; Muir, Kenneth; Mulligan, Anna Marie; Mulot, Claire; Neuhausen, Susan L.; Nevanlinna, Heli; Neven, Patrick; Nielsen, Sune F.; Noh, Dong-Young; Nordestgaard, Børge G.; Norman, Aaron; Olopade, Olufunmilayo I.; Olson, Janet E.; Olsson, Håkan; Olswold, Curtis; Orr, Nick; Pankratz, V. Shane; Park, Sue K.; Park-Simon, Tjoung-Won; Lloyd, Rachel; Perez, Jose I.A.; Peterlongo, Paolo; Peto, Julian; Phillips, Kelly-Anne; Pinchev, Mila; Plaseska-Karanfilska, Dijana; Prentice, Ross; Presneau, Nadege; Prokofieva, Darya; Pugh, Elizabeth; Pylkäs, Katri; Rack, Brigitte; Radice, Paolo; Rahman, Nazneen; Rennert, Gadi; Rennert, Hedy S.; Rhenius, Valerie; Romero, Atocha; Romm, Jane; Ruddy, Kathryn J; Rüdiger, Thomas; Rudolph, Anja; Ruebner, Matthias; Rutgers, Emiel J. Th.; Saloustros, Emmanouil; Sandler, Dale P.; Sangrajrang, Suleeporn; Sawyer, Elinor J.; Schmidt, Daniel F.; Schmutzler, Rita K.; Schneeweiss, Andreas; Schoemaker, Minouk J.; Schumacher, Fredrick; Schürmann, Peter; Scott, Rodney J.; Scott, Christopher; Seal, Sheila; Seynaeve, Caroline; Shah, Mitul; Sharma, Priyanka; Shen, Chen-Yang; Sheng, Grace; Sherman, Mark E.; Shrubsole, Martha J.; Shu, Xiao-Ou; Smeets, Ann; Sohn, Christof; Southey, Melissa C.; Spinelli, John J.; Stegmaier, Christa; Stewart-Brown, Sarah; Stone, Jennifer; Stram, Daniel O.; Surowy, Harald; Swerdlow, Anthony; Tamimi, Rulla; Taylor, Jack A.; Tengström, Maria; Teo, Soo H.; Terry, Mary Beth; Tessier, Daniel C.; Thanasitthichai, Somchai; Thöne, Kathrin; Tollenaar, Rob A.E.M.; Tomlinson, Ian; Tong, Ling; Torres, Diana; Truong, Thérèse; Tseng, Chiu-chen; Tsugane, Shoichiro; Ulmer, Hans-Ulrich; Ursin, Giske; Untch, Michael; Vachon, Celine; van Asperen, Christi J.; Van Den Berg, David; van den Ouweland, Ans M.W.; van der Kolk, Lizet; van der Luijt, Rob B.; Vincent, Daniel; Vollenweider, Jason; Waisfisz, Quinten; Wang-Gohrke, Shan; Weinberg, Clarice R.; Wendt, Camilla; Whittemore, Alice S.; Wildiers, Hans; Willett, Walter; Winqvist, Robert; Wolk, Alicja; Wu, Anna H.; Xia, Lucy; Yamaji, Taiki; Yang, Xiaohong R.; Yip, Cheng Har; Yoo, Keun-Young; Yu, Jyh-Cherng; Zheng, Wei; Zheng, Ying; Zhu, Bin; Ziogas, Argyrios; Ziv, Elad; Lakhani, Sunil R.; Antoniou, Antonis C.; Droit, Arnaud; Andrulis, Irene L.; Amos, Christopher I.; Couch, Fergus J.; Pharoah, Paul D.P.; Chang-Claude, Jenny; Hall, Per; Hunter, David J.; Milne, Roger L.; García-Closas, Montserrat; Schmidt, Marjanka K.; Chanock, Stephen J.; Dunning, Alison M.; Edwards, Stacey L.; Bader, Gary D.; Chenevix-Trench, Georgia; Simard, Jacques; Kraft, Peter; Easton, Douglas F.
2017-01-01
Breast cancer risk is influenced by rare coding variants in susceptibility genes such as BRCA1 and many common, mainly non-coding variants. However, much of the genetic contribution to breast cancer risk remains unknown. We report results from a genome-wide association study (GWAS) of breast cancer in 122,977 cases and 105,974 controls of European ancestry and 14,068 cases and 13,104 controls of East Asian ancestry1. We identified 65 new loci associated with overall breast cancer at p<5x10-8. The majority of credible risk SNPs in the new loci fall in distal regulatory elements, and by integrating in-silico data to predict target genes in breast cells at each locus, we demonstrate a strong overlap between candidate target genes and somatic driver genes in breast tumours. We also find that heritability of breast cancer due to all SNPs in regulatory features was 2-5-fold enriched relative to the genome-wide average, with strong enrichment for particular transcription factor binding sites. These results provide further insight into genetic susceptibility to breast cancer and will improve the utility of genetic risk scores for individualized screening and prevention. PMID:29059683
Schaefke, Bernhard; Wang, Tzi-Yuan; Wang, Chuen-Yi; Li, Wen-Hsiung
2015-01-01
Gene expression evolution occurs through changes in cis- or trans-regulatory elements or both. Interactions between transcription factors (TFs) and their binding sites (TFBSs) constitute one of the most important points where these two regulatory components intersect. In this study, we investigated the evolution of TFBSs in the promoter regions of different Saccharomyces strains and species. We divided the promoter of a gene into the proximal region and the distal region, which are defined, respectively, as the 200-bp region upstream of the transcription starting site and as the 200-bp region upstream of the proximal region. We found that the predicted TFBSs in the proximal promoter regions tend to be evolutionarily more conserved than those in the distal promoter regions. Additionally, Saccharomyces cerevisiae strains used in the fermentation of alcoholic drinks have experienced more TFBS losses than gains compared with strains from other environments (wild strains, laboratory strains, and clinical strains). We also showed that differences in TFBSs correlate with the cis component of gene expression evolution between species (comparing S. cerevisiae and its sister species Saccharomyces paradoxus) and within species (comparing two closely related S. cerevisiae strains). PMID:26220934
Association analysis identifies 65 new breast cancer risk loci.
Michailidou, Kyriaki; Lindström, Sara; Dennis, Joe; Beesley, Jonathan; Hui, Shirley; Kar, Siddhartha; Lemaçon, Audrey; Soucy, Penny; Glubb, Dylan; Rostamianfar, Asha; Bolla, Manjeet K; Wang, Qin; Tyrer, Jonathan; Dicks, Ed; Lee, Andrew; Wang, Zhaoming; Allen, Jamie; Keeman, Renske; Eilber, Ursula; French, Juliet D; Qing Chen, Xiao; Fachal, Laura; McCue, Karen; McCart Reed, Amy E; Ghoussaini, Maya; Carroll, Jason S; Jiang, Xia; Finucane, Hilary; Adams, Marcia; Adank, Muriel A; Ahsan, Habibul; Aittomäki, Kristiina; Anton-Culver, Hoda; Antonenkova, Natalia N; Arndt, Volker; Aronson, Kristan J; Arun, Banu; Auer, Paul L; Bacot, François; Barrdahl, Myrto; Baynes, Caroline; Beckmann, Matthias W; Behrens, Sabine; Benitez, Javier; Bermisheva, Marina; Bernstein, Leslie; Blomqvist, Carl; Bogdanova, Natalia V; Bojesen, Stig E; Bonanni, Bernardo; Børresen-Dale, Anne-Lise; Brand, Judith S; Brauch, Hiltrud; Brennan, Paul; Brenner, Hermann; Brinton, Louise; Broberg, Per; Brock, Ian W; Broeks, Annegien; Brooks-Wilson, Angela; Brucker, Sara Y; Brüning, Thomas; Burwinkel, Barbara; Butterbach, Katja; Cai, Qiuyin; Cai, Hui; Caldés, Trinidad; Canzian, Federico; Carracedo, Angel; Carter, Brian D; Castelao, Jose E; Chan, Tsun L; David Cheng, Ting-Yuan; Seng Chia, Kee; Choi, Ji-Yeob; Christiansen, Hans; Clarke, Christine L; Collée, Margriet; Conroy, Don M; Cordina-Duverger, Emilie; Cornelissen, Sten; Cox, David G; Cox, Angela; Cross, Simon S; Cunningham, Julie M; Czene, Kamila; Daly, Mary B; Devilee, Peter; Doheny, Kimberly F; Dörk, Thilo; Dos-Santos-Silva, Isabel; Dumont, Martine; Durcan, Lorraine; Dwek, Miriam; Eccles, Diana M; Ekici, Arif B; Eliassen, A Heather; Ellberg, Carolina; Elvira, Mingajeva; Engel, Christoph; Eriksson, Mikael; Fasching, Peter A; Figueroa, Jonine; Flesch-Janys, Dieter; Fletcher, Olivia; Flyger, Henrik; Fritschi, Lin; Gaborieau, Valerie; Gabrielson, Marike; Gago-Dominguez, Manuela; Gao, Yu-Tang; Gapstur, Susan M; García-Sáenz, José A; Gaudet, Mia M; Georgoulias, Vassilios; Giles, Graham G; Glendon, Gord; Goldberg, Mark S; Goldgar, David E; González-Neira, Anna; Grenaker Alnæs, Grethe I; Grip, Mervi; Gronwald, Jacek; Grundy, Anne; Guénel, Pascal; Haeberle, Lothar; Hahnen, Eric; Haiman, Christopher A; Håkansson, Niclas; Hamann, Ute; Hamel, Nathalie; Hankinson, Susan; Harrington, Patricia; Hart, Steven N; Hartikainen, Jaana M; Hartman, Mikael; Hein, Alexander; Heyworth, Jane; Hicks, Belynda; Hillemanns, Peter; Ho, Dona N; Hollestelle, Antoinette; Hooning, Maartje J; Hoover, Robert N; Hopper, John L; Hou, Ming-Feng; Hsiung, Chia-Ni; Huang, Guanmengqian; Humphreys, Keith; Ishiguro, Junko; Ito, Hidemi; Iwasaki, Motoki; Iwata, Hiroji; Jakubowska, Anna; Janni, Wolfgang; John, Esther M; Johnson, Nichola; Jones, Kristine; Jones, Michael; Jukkola-Vuorinen, Arja; Kaaks, Rudolf; Kabisch, Maria; Kaczmarek, Katarzyna; Kang, Daehee; Kasuga, Yoshio; Kerin, Michael J; Khan, Sofia; Khusnutdinova, Elza; Kiiski, Johanna I; Kim, Sung-Won; Knight, Julia A; Kosma, Veli-Matti; Kristensen, Vessela N; Krüger, Ute; Kwong, Ava; Lambrechts, Diether; Le Marchand, Loic; Lee, Eunjung; Lee, Min Hyuk; Lee, Jong Won; Neng Lee, Chuen; Lejbkowicz, Flavio; Li, Jingmei; Lilyquist, Jenna; Lindblom, Annika; Lissowska, Jolanta; Lo, Wing-Yee; Loibl, Sibylle; Long, Jirong; Lophatananon, Artitaya; Lubinski, Jan; Luccarini, Craig; Lux, Michael P; Ma, Edmond S K; MacInnis, Robert J; Maishman, Tom; Makalic, Enes; Malone, Kathleen E; Kostovska, Ivana Maleva; Mannermaa, Arto; Manoukian, Siranoush; Manson, JoAnn E; Margolin, Sara; Mariapun, Shivaani; Martinez, Maria Elena; Matsuo, Keitaro; Mavroudis, Dimitrios; McKay, James; McLean, Catriona; Meijers-Heijboer, Hanne; Meindl, Alfons; Menéndez, Primitiva; Menon, Usha; Meyer, Jeffery; Miao, Hui; Miller, Nicola; Taib, Nur Aishah Mohd; Muir, Kenneth; Mulligan, Anna Marie; Mulot, Claire; Neuhausen, Susan L; Nevanlinna, Heli; Neven, Patrick; Nielsen, Sune F; Noh, Dong-Young; Nordestgaard, Børge G; Norman, Aaron; Olopade, Olufunmilayo I; Olson, Janet E; Olsson, Håkan; Olswold, Curtis; Orr, Nick; Pankratz, V Shane; Park, Sue K; Park-Simon, Tjoung-Won; Lloyd, Rachel; Perez, Jose I A; Peterlongo, Paolo; Peto, Julian; Phillips, Kelly-Anne; Pinchev, Mila; Plaseska-Karanfilska, Dijana; Prentice, Ross; Presneau, Nadege; Prokofyeva, Darya; Pugh, Elizabeth; Pylkäs, Katri; Rack, Brigitte; Radice, Paolo; Rahman, Nazneen; Rennert, Gadi; Rennert, Hedy S; Rhenius, Valerie; Romero, Atocha; Romm, Jane; Ruddy, Kathryn J; Rüdiger, Thomas; Rudolph, Anja; Ruebner, Matthias; Rutgers, Emiel J T; Saloustros, Emmanouil; Sandler, Dale P; Sangrajrang, Suleeporn; Sawyer, Elinor J; Schmidt, Daniel F; Schmutzler, Rita K; Schneeweiss, Andreas; Schoemaker, Minouk J; Schumacher, Fredrick; Schürmann, Peter; Scott, Rodney J; Scott, Christopher; Seal, Sheila; Seynaeve, Caroline; Shah, Mitul; Sharma, Priyanka; Shen, Chen-Yang; Sheng, Grace; Sherman, Mark E; Shrubsole, Martha J; Shu, Xiao-Ou; Smeets, Ann; Sohn, Christof; Southey, Melissa C; Spinelli, John J; Stegmaier, Christa; Stewart-Brown, Sarah; Stone, Jennifer; Stram, Daniel O; Surowy, Harald; Swerdlow, Anthony; Tamimi, Rulla; Taylor, Jack A; Tengström, Maria; Teo, Soo H; Beth Terry, Mary; Tessier, Daniel C; Thanasitthichai, Somchai; Thöne, Kathrin; Tollenaar, Rob A E M; Tomlinson, Ian; Tong, Ling; Torres, Diana; Truong, Thérèse; Tseng, Chiu-Chen; Tsugane, Shoichiro; Ulmer, Hans-Ulrich; Ursin, Giske; Untch, Michael; Vachon, Celine; van Asperen, Christi J; Van Den Berg, David; van den Ouweland, Ans M W; van der Kolk, Lizet; van der Luijt, Rob B; Vincent, Daniel; Vollenweider, Jason; Waisfisz, Quinten; Wang-Gohrke, Shan; Weinberg, Clarice R; Wendt, Camilla; Whittemore, Alice S; Wildiers, Hans; Willett, Walter; Winqvist, Robert; Wolk, Alicja; Wu, Anna H; Xia, Lucy; Yamaji, Taiki; Yang, Xiaohong R; Har Yip, Cheng; Yoo, Keun-Young; Yu, Jyh-Cherng; Zheng, Wei; Zheng, Ying; Zhu, Bin; Ziogas, Argyrios; Ziv, Elad; Lakhani, Sunil R; Antoniou, Antonis C; Droit, Arnaud; Andrulis, Irene L; Amos, Christopher I; Couch, Fergus J; Pharoah, Paul D P; Chang-Claude, Jenny; Hall, Per; Hunter, David J; Milne, Roger L; García-Closas, Montserrat; Schmidt, Marjanka K; Chanock, Stephen J; Dunning, Alison M; Edwards, Stacey L; Bader, Gary D; Chenevix-Trench, Georgia; Simard, Jacques; Kraft, Peter; Easton, Douglas F
2017-11-02
Breast cancer risk is influenced by rare coding variants in susceptibility genes, such as BRCA1, and many common, mostly non-coding variants. However, much of the genetic contribution to breast cancer risk remains unknown. Here we report the results of a genome-wide association study of breast cancer in 122,977 cases and 105,974 controls of European ancestry and 14,068 cases and 13,104 controls of East Asian ancestry. We identified 65 new loci that are associated with overall breast cancer risk at P < 5 × 10 -8 . The majority of credible risk single-nucleotide polymorphisms in these loci fall in distal regulatory elements, and by integrating in silico data to predict target genes in breast cells at each locus, we demonstrate a strong overlap between candidate target genes and somatic driver genes in breast tumours. We also find that heritability of breast cancer due to all single-nucleotide polymorphisms in regulatory features was 2-5-fold enriched relative to the genome-wide average, with strong enrichment for particular transcription factor binding sites. These results provide further insight into genetic susceptibility to breast cancer and will improve the use of genetic risk scores for individualized screening and prevention.
Rim, Jong S; Kozak, Leslie P
2002-09-13
Thermogenesis against cold exposure in mammals occurs in brown adipose tissue (BAT) through mitochondrial uncoupling protein (UCP1). Expression of the Ucp1 gene is unique in brown adipocytes and is regulated tightly. The 5'-flanking region of the mouse Ucp1 gene contains cis-acting elements including PPRE, TRE, and four half-site cAMP-responsive elements (CRE) with BAT-specific enhancer elements. In the course of analyzing how these half-site CREs are involved in Ucp1 expression, we found that a DNA regulatory element for NF-E2 overlaps CRE2. Electrophoretic mobility shift assay and competition assays with the CRE2 element indicates that nuclear proteins from BAT, inguinal fat, and retroperitoneal fat tissue interact with the CRE2 motif (CGTCA) in a specific manner. A supershift assay using an antibody against the CRE-binding protein (CREB) shows specific affinity to the complex from CRE2 and nuclear extract of BAT. Additionally, Western blot analysis for phospho-CREB/ATF1 shows an increase in phosphorylation of CREB/ATF1 in HIB-1B cells after norepinephrine treatment. Transient transfection assay using luciferase reporter constructs also indicates that the two half-site CREs are involved in transcriptional regulation of Ucp1 in response to norepinephrine and cAMP. We also show that a second DNA regulatory element for NF-E2 is located upstream of the CRE2 region. This element, which is found in a similar location in the 5'-flanking region of the human and rodent Ucp1 genes, shows specific binding to rat and human NF-E2 by electrophoretic mobility shift assay with nuclear extracts from brown fat. Co-transfections with an Nfe2l2 expression vector and a luciferase reporter construct of the Ucp1 enhancer region provide additional evidence that Nfe2l2 is involved in the regulation of Ucp1 by cAMP-mediated signaling.
Regulators of gene expression as biomarkers for prostate cancer
Willard, Stacey S; Koochekpour, Shahriar
2012-01-01
Recent technological advancements in gene expression analysis have led to the discovery of a promising new group of prostate cancer (PCa) biomarkers that have the potential to influence diagnosis and the prediction of disease severity. The accumulation of deleterious changes in gene expression is a fundamental mechanism of prostate carcinogenesis. Aberrant gene expression can arise from changes in epigenetic regulation or mutation in the genome affecting either key regulatory elements or gene sequences themselves. At the epigenetic level, a myriad of abnormal histone modifications and changes in DNA methylation are found in PCa patients. In addition, many mutations in the genome have been associated with higher PCa risk. Finally, over- or underexpression of key genes involved in cell cycle regulation, apoptosis, cell adhesion and regulation of transcription has been observed. An interesting group of biomarkers are emerging from these studies which may prove more predictive than the standard prostate specific antigen (PSA) serum test. In this review, we discuss recent results in the field of gene expression analysis in PCa including the most promising biomarkers in the areas of epigenetics, genomics and the transcriptome, some of which are currently under investigation as clinical tests for early detection and better prognostic prediction of PCa. PMID:23226612
The standard operating procedure of the DOE-JGI Metagenome Annotation Pipeline (MAP v.4)
Huntemann, Marcel; Ivanova, Natalia N.; Mavromatis, Konstantinos; ...
2016-02-24
The DOE-JGI Metagenome Annotation Pipeline (MAP v.4) performs structural and functional annotation for metagenomic sequences that are submitted to the Integrated Microbial Genomes with Microbiomes (IMG/M) system for comparative analysis. The pipeline runs on nucleotide sequences provide d via the IMG submission site. Users must first define their analysis projects in GOLD and then submit the associated sequence datasets consisting of scaffolds/contigs with optional coverage information and/or unassembled reads in fasta and fastq file formats. The MAP processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNAs, as well as CRISPR elements. Structural annotation ismore » followed by functional annotation including assignment of protein product names and connection to various protein family databases.« less
The standard operating procedure of the DOE-JGI Metagenome Annotation Pipeline (MAP v.4)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Huntemann, Marcel; Ivanova, Natalia N.; Mavromatis, Konstantinos
The DOE-JGI Metagenome Annotation Pipeline (MAP v.4) performs structural and functional annotation for metagenomic sequences that are submitted to the Integrated Microbial Genomes with Microbiomes (IMG/M) system for comparative analysis. The pipeline runs on nucleotide sequences provide d via the IMG submission site. Users must first define their analysis projects in GOLD and then submit the associated sequence datasets consisting of scaffolds/contigs with optional coverage information and/or unassembled reads in fasta and fastq file formats. The MAP processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNAs, as well as CRISPR elements. Structural annotation ismore » followed by functional annotation including assignment of protein product names and connection to various protein family databases.« less
An approach for reduction of false predictions in reverse engineering of gene regulatory networks.
Khan, Abhinandan; Saha, Goutam; Pal, Rajat Kumar
2018-05-14
A gene regulatory network discloses the regulatory interactions amongst genes, at a particular condition of the human body. The accurate reconstruction of such networks from time-series genetic expression data using computational tools offers a stiff challenge for contemporary computer scientists. This is crucial to facilitate the understanding of the proper functioning of a living organism. Unfortunately, the computational methods produce many false predictions along with the correct predictions, which is unwanted. Investigations in the domain focus on the identification of as many correct regulations as possible in the reverse engineering of gene regulatory networks to make it more reliable and biologically relevant. One way to achieve this is to reduce the number of incorrect predictions in the reconstructed networks. In the present investigation, we have proposed a novel scheme to decrease the number of false predictions by suitably combining several metaheuristic techniques. We have implemented the same using a dataset ensemble approach (i.e. combining multiple datasets) also. We have employed the proposed methodology on real-world experimental datasets of the SOS DNA Repair network of Escherichia coli and the IMRA network of Saccharomyces cerevisiae. Subsequently, we have experimented upon somewhat larger, in silico networks, namely, DREAM3 and DREAM4 Challenge networks, and 15-gene and 20-gene networks extracted from the GeneNetWeaver database. To study the effect of multiple datasets on the quality of the inferred networks, we have used four datasets in each experiment. The obtained results are encouraging enough as the proposed methodology can reduce the number of false predictions significantly, without using any supplementary prior biological information for larger gene regulatory networks. It is also observed that if a small amount of prior biological information is incorporated here, the results improve further w.r.t. the prediction of true positives. Copyright © 2018 Elsevier Ltd. All rights reserved.
Schoor, Michael; Mortlock, Doug P.; Reddi, A. Hari; Kingsley, David M.
2016-01-01
Synovial joints are crucial for support and locomotion in vertebrates, and are the frequent site of serious skeletal defects and degenerative diseases in humans. Growth and differentiation factor 5 (Gdf5) is one of the earliest markers of joint formation, is required for normal joint development in both mice and humans, and has been genetically linked to risk of common osteoarthritis in Eurasian populations. Here, we systematically survey the mouse Gdf5 gene for regulatory elements controlling expression in synovial joints. We identify separate regions of the locus that control expression in axial tissues, in proximal versus distal joints in the limbs, and in remarkably specific sub-sets of composite joints like the elbow. Predicted transcription factor binding sites within Gdf5 regulatory enhancers are required for expression in particular joints. The multiple enhancers that control Gdf5 expression in different joints are distributed over a hundred kilobases of DNA, including regions both upstream and downstream of Gdf5 coding exons. Functional rescue tests in mice confirm that the large flanking regions are required to restore normal joint formation and patterning. Orthologs of these enhancers are located throughout the large genomic region previously associated with common osteoarthritis risk in humans. The large array of modular enhancers for Gdf5 provide a new foundation for studying the spatial specificity of joint patterning in vertebrates, as well as new candidates for regulatory regions that may also influence osteoarthritis risk in human populations. PMID:27902701
USDA-ARS?s Scientific Manuscript database
To balance the demand for uptake of essential elements with their potential toxicity living cells have complex regulatory mechanisms. Here, we describe a genome-wide screen to identify genes that impact the elemental composition (‘ionome’) of yeast Saccharomyces cerevisiae. Using inductively coupled...
Federal Register 2010, 2011, 2012, 2013, 2014
2013-10-23
... appropriate limits for impurities, and emphasizes control of supply chains and risk assessments. It is... expectations for test requirements and regulatory filings, and a global policy for limiting elemental... written comments to the Division of Dockets Management (HFA-305), Food and Drug Administration, 5630...
Federal Register 2010, 2011, 2012, 2013, 2014
2010-08-23
... decisions. Data elements with respect to the SHORT subscription service that would be provided through the... information about technical data elements to support transmission and data-integrity processes between the... for making well-informed investment decisions. Broad access to the information collected by the SHORT...
Federal Register 2010, 2011, 2012, 2013, 2014
2010-09-30
... decisions. Data elements with respect to the SHORT subscription service that would be provided through the... information about technical data elements to support transmission and data-integrity processes between the... Securities and Exchange Commission (``Commission''), pursuant to Section 19(b)(1) of the Securities [[Page...
Monteiro, Pedro Tiago; Pais, Pedro; Costa, Catarina; Manna, Sauvagya; Sá-Correia, Isabel; Teixeira, Miguel Cacho
2017-01-04
We present the PATHOgenic YEAst Search for Transcriptional Regulators And Consensus Tracking (PathoYeastract - http://pathoyeastract.org) database, a tool for the analysis and prediction of transcription regulatory associations at the gene and genomic levels in the pathogenic yeasts Candida albicans and C. glabrata Upon data retrieval from hundreds of publications, followed by curation, the database currently includes 28 000 unique documented regulatory associations between transcription factors (TF) and target genes and 107 DNA binding sites, considering 134 TFs in both species. Following the structure used for the YEASTRACT database, PathoYeastract makes available bioinformatics tools that enable the user to exploit the existing information to predict the TFs involved in the regulation of a gene or genome-wide transcriptional response, while ranking those TFs in order of their relative importance. Each search can be filtered based on the selection of specific environmental conditions, experimental evidence or positive/negative regulatory effect. Promoter analysis tools and interactive visualization tools for the representation of TF regulatory networks are also provided. The PathoYeastract database further provides simple tools for the prediction of gene and genomic regulation based on orthologous regulatory associations described for other yeast species, a comparative genomics setup for the study of cross-species evolution of regulatory networks. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
2014-01-01
Background Plant secondary metabolites are critical to various biological processes. However, the regulations of these metabolites are complex because of regulatory rewiring or crosstalk. To unveil how regulatory behaviors on secondary metabolism reshape biological processes, we constructed and analyzed a dynamic regulatory network of secondary metabolic pathways in Arabidopsis. Results The dynamic regulatory network was constructed through integrating co-expressed gene pairs and regulatory interactions. Regulatory interactions were either predicted by conserved transcription factor binding sites (TFBSs) or proved by experiments. We found that integrating two data (co-expression and predicted regulatory interactions) enhanced the number of highly confident regulatory interactions by over 10% compared with using single data. The dynamic changes of regulatory network systematically manifested regulatory rewiring to explain the mechanism of regulation, such as in terpenoids metabolism, the regulatory crosstalk of RAV1 (AT1G13260) and ATHB1 (AT3G01470) on HMG1 (hydroxymethylglutaryl-CoA reductase, AT1G76490); and regulation of RAV1 on epoxysqualene biosynthesis and sterol biosynthesis. Besides, we investigated regulatory rewiring with expression, network topology and upstream signaling pathways. Regulatory rewiring was revealed by the variability of genes’ expression: pathway genes and transcription factors (TFs) were significantly differentially expressed under different conditions (such as terpenoids biosynthetic genes in tissue experiments and E2F/DP family members in genotype experiments). Both network topology and signaling pathways supported regulatory rewiring. For example, we discovered correlation among the numbers of pathway genes, TFs and network topology: one-gene pathways (such as δ-carotene biosynthesis) were regulated by a fewer TFs, and were not critical to metabolic network because of their low degrees in topology. Upstream signaling pathways of 50 TFs were identified to comprehend the underlying mechanism of TFs’ regulatory rewiring. Conclusion Overall, this dynamic regulatory network largely improves the understanding of perplexed regulatory rewiring in secondary metabolism in Arabidopsis. PMID:24993737
Experimental validation of boundary element methods for noise prediction
NASA Technical Reports Server (NTRS)
Seybert, A. F.; Oswald, Fred B.
1992-01-01
Experimental validation of methods to predict radiated noise is presented. A combined finite element and boundary element model was used to predict the vibration and noise of a rectangular box excited by a mechanical shaker. The predicted noise was compared to sound power measured by the acoustic intensity method. Inaccuracies in the finite element model shifted the resonance frequencies by about 5 percent. The predicted and measured sound power levels agree within about 2.5 dB. In a second experiment, measured vibration data was used with a boundary element model to predict noise radiation from the top of an operating gearbox. The predicted and measured sound power for the gearbox agree within about 3 dB.
Identification of genetic elements in metabolism by high-throughput mouse phenotyping.
Rozman, Jan; Rathkolb, Birgit; Oestereicher, Manuela A; Schütt, Christine; Ravindranath, Aakash Chavan; Leuchtenberger, Stefanie; Sharma, Sapna; Kistler, Martin; Willershäuser, Monja; Brommage, Robert; Meehan, Terrence F; Mason, Jeremy; Haselimashhadi, Hamed; Hough, Tertius; Mallon, Ann-Marie; Wells, Sara; Santos, Luis; Lelliott, Christopher J; White, Jacqueline K; Sorg, Tania; Champy, Marie-France; Bower, Lynette R; Reynolds, Corey L; Flenniken, Ann M; Murray, Stephen A; Nutter, Lauryl M J; Svenson, Karen L; West, David; Tocchini-Valentini, Glauco P; Beaudet, Arthur L; Bosch, Fatima; Braun, Robert B; Dobbie, Michael S; Gao, Xiang; Herault, Yann; Moshiri, Ala; Moore, Bret A; Kent Lloyd, K C; McKerlie, Colin; Masuya, Hiroshi; Tanaka, Nobuhiko; Flicek, Paul; Parkinson, Helen E; Sedlacek, Radislav; Seong, Je Kyung; Wang, Chi-Kuang Leo; Moore, Mark; Brown, Steve D; Tschöp, Matthias H; Wurst, Wolfgang; Klingenspor, Martin; Wolf, Eckhard; Beckers, Johannes; Machicao, Fausto; Peter, Andreas; Staiger, Harald; Häring, Hans-Ulrich; Grallert, Harald; Campillos, Monica; Maier, Holger; Fuchs, Helmut; Gailus-Durner, Valerie; Werner, Thomas; Hrabe de Angelis, Martin
2018-01-18
Metabolic diseases are a worldwide problem but the underlying genetic factors and their relevance to metabolic disease remain incompletely understood. Genome-wide research is needed to characterize so-far unannotated mammalian metabolic genes. Here, we generate and analyze metabolic phenotypic data of 2016 knockout mouse strains under the aegis of the International Mouse Phenotyping Consortium (IMPC) and find 974 gene knockouts with strong metabolic phenotypes. 429 of those had no previous link to metabolism and 51 genes remain functionally completely unannotated. We compared human orthologues of these uncharacterized genes in five GWAS consortia and indeed 23 candidate genes are associated with metabolic disease. We further identify common regulatory elements in promoters of candidate genes. As each regulatory element is composed of several transcription factor binding sites, our data reveal an extensive metabolic phenotype-associated network of co-regulated genes. Our systematic mouse phenotype analysis thus paves the way for full functional annotation of the genome.
Altruistic functions for selfish DNA.
Faulkner, Geoffrey J; Carninci, Piero
2009-09-15
Mammalian genomes are comprised of 30-50% transposed elements (TEs). The vast majority of these TEs are truncated and mutated fragments of retrotransposons that are no longer capable of transposition. Although initially regarded as important factors in the evolution of gene regulatory networks, TEs are now commonly perceived as neutrally evolving and non-functional genomic elements. In a major development, recent works have strongly contradicted this "selfish DNA" or "junk DNA" dogma by demonstrating that TEs use a host of novel promoters to generate RNA on a massive scale across most eukaryotic cells. This transcription frequently functions to control the expression of protein-coding genes via alternative promoters, cis regulatory non protein-coding RNAs and the formation of double stranded short RNAs. If considered in sum, these findings challenge the designation of TEs as selfish and neutrally evolving genomic elements. Here, we will expand upon these themes and discuss challenges in establishing novel TE functions in vivo.
Delimiting regulatory sequences of the Drosophila melanogaster Ddc gene.
Hirsh, J; Morgan, B A; Scholnick, S B
1986-01-01
We delimited sequences necessary for in vivo expression of the Drosophila melanogaster dopa decarboxylase gene Ddc. The expression of in vitro-altered genes was assayed following germ line integration via P-element vectors. Sequences between -209 and -24 were necessary for normally regulated expression, although genes lacking these sequences could be expressed at 10 to 50% of wild-type levels at specific developmental times. These genes showed components of normal developmental expression, which suggests that they retain some regulatory elements. All Ddc genes lacking the normal immediate 5'-flanking sequences were grossly deficient in larval central nervous system expression. Thus, this upstream region must contain at least one element necessary for this expression. A mutated Ddc gene without a normal TATA boxlike sequence used the normal RNA start points, indicating that this sequences is not required for start point specificity. Images PMID:3099170
Tissue-Specific Enrichment of Lymphoma Risk Loci in Regulatory Elements
Hayes, James E.; Trynka, Gosia; Vijai, Joseph; Offit, Kenneth; Raychaudhuri, Soumya; Klein, Robert J.
2015-01-01
Though numerous polymorphisms have been associated with risk of developing lymphoma, how these variants function to promote tumorigenesis is poorly understood. Here, we report that lymphoma risk SNPs, especially in the non-Hodgkin’s lymphoma subtype chronic lymphocytic leukemia, are significantly enriched for co-localization with epigenetic marks of active gene regulation. These enrichments were seen in a lymphoid-specific manner for numerous ENCODE datasets, including DNase-hypersensitivity as well as multiple segmentation-defined enhancer regions. Furthermore, we identify putatively functional SNPs that are both in regulatory elements in lymphocytes and are associated with gene expression changes in blood. We developed an algorithm, UES, that uses a Monte Carlo simulation approach to calculate the enrichment of previously identified risk SNPs in various functional elements. This multiscale approach integrating multiple datasets helps disentangle the underlying biology of lymphoma, and more broadly, is generally applicable to GWAS results from other diseases as well. PMID:26422229
Genome-scale cold stress response regulatory networks in ten Arabidopsis thaliana ecotypes
2013-01-01
Background Low temperature leads to major crop losses every year. Although several studies have been conducted focusing on diversity of cold tolerance level in multiple phenotypically divergent Arabidopsis thaliana (A. thaliana) ecotypes, genome-scale molecular understanding is still lacking. Results In this study, we report genome-scale transcript response diversity of 10 A. thaliana ecotypes originating from different geographical locations to non-freezing cold stress (10°C). To analyze the transcriptional response diversity, we initially compared transcriptome changes in all 10 ecotypes using Arabidopsis NimbleGen ATH6 microarrays. In total 6061 transcripts were significantly cold regulated (p < 0.01) in 10 ecotypes, including 498 transcription factors and 315 transposable elements. The majority of the transcripts (75%) showed ecotype specific expression pattern. By using sequence data available from Arabidopsis thaliana 1001 genome project, we further investigated sequence polymorphisms in the core cold stress regulon genes. Significant numbers of non-synonymous amino acid changes were observed in the coding region of the CBF regulon genes. Considering the limited knowledge about regulatory interactions between transcription factors and their target genes in the model plant A. thaliana, we have adopted a powerful systems genetics approach- Network Component Analysis (NCA) to construct an in-silico transcriptional regulatory network model during response to cold stress. The resulting regulatory network contained 1,275 nodes and 7,720 connections, with 178 transcription factors and 1,331 target genes. Conclusions A. thaliana ecotypes exhibit considerable variation in transcriptome level responses to non-freezing cold stress treatment. Ecotype specific transcripts and related gene ontology (GO) categories were identified to delineate natural variation of cold stress regulated differential gene expression in the model plant A. thaliana. The predicted regulatory network model was able to identify new ecotype specific transcription factors and their regulatory interactions, which might be crucial for their local geographic adaptation to cold temperature. Additionally, since the approach presented here is general, it could be adapted to study networks regulating biological process in any biological systems. PMID:24148294
AP1 Keeps Chromatin Poised for Action | Center for Cancer Research
The human genome harbors gene-encoding DNA, the blueprint for building proteins that regulate cellular function. Embedded across the genome, in non-coding regions, are DNA elements to which regulatory factors bind. The interaction of regulatory factors with DNA at these sites modifies gene expression to modulate cell activity. In cells, DNA exists in a complex with proteins
Preclinical Development of Cell-Based Products: a European Regulatory Science Perspective.
McBlane, James W; Phul, Parvinder; Sharpe, Michaela
2018-06-25
This article describes preclinical development of cell-based medicinal products for European markets and discusses European regulatory mechanisms open to developers to aid successful product development. Cell-based medicinal products are diverse, including cells that are autologous or allogeneic, have been genetically modified, or not, or expanded ex vivo, and applied systemically or to an anatomical site different to that of their origin; comments applicable to one product may not be applicable to others, so bespoke development is needed, for all elements - quality, preclinical and clinical. After establishing how the product is produced, proof of potential for therapeutic efficacy, and then safety, of the product need to be determined. This includes understanding biodistribution, persistence and toxicity, including potential for malignant transformation. These elements need to be considered in the context of the intended clinical development. This article describes regulatory mechanisms available to developers to support product development that aim to resolve scientific issues prior to marketing authorization application, to enable patients to have faster access to the product than would otherwise be the case. Developers are encouraged to be aware of both the scientific issues and regulatory mechanisms to ensure patients can be supplied with these products.
Bhatia, Shipra; Gordon, Christopher T.; Foster, Robert G.; Melin, Lucie; Abadie, Véronique; Baujat, Geneviève; Vazquez, Marie-Paule; Amiel, Jeanne; Lyonnet, Stanislas; van Heyningen, Veronica; Kleinjan, Dirk A.
2015-01-01
Disruption of gene regulation by sequence variation in non-coding regions of the genome is now recognised as a significant cause of human disease and disease susceptibility. Sequence variants in cis-regulatory elements (CREs), the primary determinants of spatio-temporal gene regulation, can alter transcription factor binding sites. While technological advances have led to easy identification of disease-associated CRE variants, robust methods for discerning functional CRE variants from background variation are lacking. Here we describe an efficient dual-colour reporter transgenesis approach in zebrafish, simultaneously allowing detailed in vivo comparison of spatio-temporal differences in regulatory activity between putative CRE variants and assessment of altered transcription factor binding potential of the variant. We validate the method on known disease-associated elements regulating SHH, PAX6 and IRF6 and subsequently characterise novel, ultra-long-range SOX9 enhancers implicated in the craniofacial abnormality Pierre Robin Sequence. The method provides a highly cost-effective, fast and robust approach for simultaneously unravelling in a single assay whether, where and when in embryonic development a disease-associated CRE-variant is affecting its regulatory function. PMID:26030420
Functional analysis of two sterol regulatory element binding proteins in Penicillium digitatum
Ruan, Ruoxin; Wang, Mingshuang; Liu, Xin; Sun, Xuepeng; Chung, Kuang-Ren
2017-01-01
The sterol regulatory element binding proteins (SREBPs) are key regulators for sterol homeostasis in most fungi. In the citrus postharvest pathogen Penicillium digitatum, the SREBP homolog is required for fungicide resistance and regulation of CYP51 expression. In this study, we identified another SREBP transcription factor PdSreB in P. digitatum, and the biological functions of both SREBPs were characterized and compared. Inactivation of PdsreA, PdsreB or both genes in P. digitatum reduced ergosterol contents and increased sensitivities to sterol 14-α-demethylation inhibitors (DMIs) and cobalt chloride. Fungal strains impaired at PdsreA but not PdsreB increased sensitivity to tridemorph and an iron chelator 2,2’-dipyridyl. Virulence assays on citrus fruit revealed that fungal strains impaired at PdsreA, PdsreB or both induce maceration lesions similar to those induced by wild-type. However, ΔPdsreA, ΔPdsreB or the double mutant strain rarely produce aerial mycelia on infected citrus fruit peels. RNA-Seq analysis showed the broad regulatory functions of both SREBPs in biosynthesis, transmembrane transportation and stress responses. Our results provide new insights into the conserved and differentiated regulatory functions of SREBP homologs in plant pathogenic fungi. PMID:28467453
Sun, Eric I; Leyn, Semen A; Kazanov, Marat D; Saier, Milton H; Novichkov, Pavel S; Rodionov, Dmitry A
2013-09-02
In silico comparative genomics approaches have been efficiently used for functional prediction and reconstruction of metabolic and regulatory networks. Riboswitches are metabolite-sensing structures often found in bacterial mRNA leaders controlling gene expression on transcriptional or translational levels.An increasing number of riboswitches and other cis-regulatory RNAs have been recently classified into numerous RNA families in the Rfam database. High conservation of these RNA motifs provides a unique advantage for their genomic identification and comparative analysis. A comparative genomics approach implemented in the RegPredict tool was used for reconstruction and functional annotation of regulons controlled by RNAs from 43 Rfam families in diverse taxonomic groups of Bacteria. The inferred regulons include ~5200 cis-regulatory RNAs and more than 12000 target genes in 255 microbial genomes. All predicted RNA-regulated genes were classified into specific and overall functional categories. Analysis of taxonomic distribution of these categories allowed us to establish major functional preferences for each analyzed cis-regulatory RNA motif family. Overall, most RNA motif regulons showed predictable functional content in accordance with their experimentally established effector ligands. Our results suggest that some RNA motifs (including thiamin pyrophosphate and cobalamin riboswitches that control the cofactor metabolism) are widespread and likely originated from the last common ancestor of all bacteria. However, many more analyzed RNA motifs are restricted to a narrow taxonomic group of bacteria and likely represent more recent evolutionary innovations. The reconstructed regulatory networks for major known RNA motifs substantially expand the existing knowledge of transcriptional regulation in bacteria. The inferred regulons can be used for genetic experiments, functional annotations of genes, metabolic reconstruction and evolutionary analysis. The obtained genome-wide collection of reference RNA motif regulons is available in the RegPrecise database (http://regprecise.lbl.gov/).
Advancing Drug Safety Through Prospective Pharmacovigilance.
Pitts, Peter J; Le Louet, Hervé
2018-01-01
Much has changed in a relatively short period of time. There is a raging debate over the level of evidence expected to first introduce a treatment to patients based on smaller, more adaptive data sets. Some argue for less data followed by postapproval follow-up, others for more adaptive clinical trial designs and end-point modification driven by patient-focused drug development and use of real-world evidence. The transition in both the review and postmarketing regulatory framework is happening in front of our eyes in real time. To improve the ability of patients to receive high-quality, safe, effective, and timely care, better information via pharmacovigilance must be a priority as the world's many regulatory systems build the capacity to harness electronic health information to improve health, care quality, and safety. Globally, the widely variable ability of nations to build reliable regulatory systems (from precise review to robust pharmacovigilance) is a dangerous source of health care inequality. Developing validated tools and techniques for "predictive pharmacovigilance" will assist all health systems in better understanding the risks and benefits of the medicines they regulate by understanding what should be happening once a new medicine moves from risk-benefit regulatory efficacy to real-world risk-effectiveness. This will be of particular utility for smaller regulatory agencies with fewer resources. By comparing preapproval predictive pharmacovigilance data, developing regulatory authorities will be able to better understand the potential gap between what was predicted and what was actually measured (via more traditional pharmacovigilance methodologies). Predictive pharmacovigilance recognizes the value of understanding the imperfect reporting of real-world clinical use and that the absence of reporting is, in itself, an important postmarketing signal.
A prior-based integrative framework for functional transcriptional regulatory network inference
Siahpirani, Alireza F.
2017-01-01
Abstract Transcriptional regulatory networks specify regulatory proteins controlling the context-specific expression levels of genes. Inference of genome-wide regulatory networks is central to understanding gene regulation, but remains an open challenge. Expression-based network inference is among the most popular methods to infer regulatory networks, however, networks inferred from such methods have low overlap with experimentally derived (e.g. ChIP-chip and transcription factor (TF) knockouts) networks. Currently we have a limited understanding of this discrepancy. To address this gap, we first develop a regulatory network inference algorithm, based on probabilistic graphical models, to integrate expression with auxiliary datasets supporting a regulatory edge. Second, we comprehensively analyze our and other state-of-the-art methods on different expression perturbation datasets. Networks inferred by integrating sequence-specific motifs with expression have substantially greater agreement with experimentally derived networks, while remaining more predictive of expression than motif-based networks. Our analysis suggests natural genetic variation as the most informative perturbation for network inference, and, identifies core TFs whose targets are predictable from expression. Multiple reasons make the identification of targets of other TFs difficult, including network architecture and insufficient variation of TF mRNA level. Finally, we demonstrate the utility of our inference algorithm to infer stress-specific regulatory networks and for regulator prioritization. PMID:27794550
The twilight zone of cis element alignments.
Sebastian, Alvaro; Contreras-Moreira, Bruno
2013-02-01
Sequence alignment of proteins and nucleic acids is a routine task in bioinformatics. Although the comparison of complete peptides, genes or genomes can be undertaken with a great variety of tools, the alignment of short DNA sequences and motifs entails pitfalls that have not been fully addressed yet. Here we confront the structural superposition of transcription factors with the sequence alignment of their recognized cis elements. Our goals are (i) to test TFcompare (http://floresta.eead.csic.es/tfcompare), a structural alignment method for protein-DNA complexes; (ii) to benchmark the pairwise alignment of regulatory elements; (iii) to define the confidence limits and the twilight zone of such alignments and (iv) to evaluate the relevance of these thresholds with elements obtained experimentally. We find that the structure of cis elements and protein-DNA interfaces is significantly more conserved than their sequence and measures how this correlates with alignment errors when only sequence information is considered. Our results confirm that DNA motifs in the form of matrices produce better alignments than individual sequences. Finally, we report that empirical and theoretically derived twilight thresholds are useful for estimating the natural plasticity of regulatory sequences, and hence for filtering out unreliable alignments.
The twilight zone of cis element alignments
Sebastian, Alvaro; Contreras-Moreira, Bruno
2013-01-01
Sequence alignment of proteins and nucleic acids is a routine task in bioinformatics. Although the comparison of complete peptides, genes or genomes can be undertaken with a great variety of tools, the alignment of short DNA sequences and motifs entails pitfalls that have not been fully addressed yet. Here we confront the structural superposition of transcription factors with the sequence alignment of their recognized cis elements. Our goals are (i) to test TFcompare (http://floresta.eead.csic.es/tfcompare), a structural alignment method for protein–DNA complexes; (ii) to benchmark the pairwise alignment of regulatory elements; (iii) to define the confidence limits and the twilight zone of such alignments and (iv) to evaluate the relevance of these thresholds with elements obtained experimentally. We find that the structure of cis elements and protein–DNA interfaces is significantly more conserved than their sequence and measures how this correlates with alignment errors when only sequence information is considered. Our results confirm that DNA motifs in the form of matrices produce better alignments than individual sequences. Finally, we report that empirical and theoretically derived twilight thresholds are useful for estimating the natural plasticity of regulatory sequences, and hence for filtering out unreliable alignments. PMID:23268451
Widespread promoter-mediated coordination of transcription and mRNA degradation
2012-01-01
Background Previous work showed that mRNA degradation is coordinated with transcription in yeast, and in several genes the control of mRNA degradation was linked to promoter elements through two different mechanisms. Here we show at the genomic scale that the coordination of transcription and mRNA degradation is promoter-dependent in yeast and is also observed in humans. Results We first demonstrate that swapping upstream cis-regulatory sequences between two yeast species affects both transcription and mRNA degradation and suggest that while some cis-regulatory elements control either transcription or degradation, multiple other elements enhance both processes. Second, we show that adjacent yeast genes that share a promoter (through divergent orientation) have increased similarity in their patterns of mRNA degradation, providing independent evidence for the promoter-mediated coupling of transcription to mRNA degradation. Finally, analysis of the differences in mRNA degradation rates between mammalian cell types or mammalian species suggests a similar coordination between transcription and mRNA degradation in humans. Conclusions Our results extend previous studies and suggest a pervasive promoter-mediated coordination between transcription and mRNA degradation in yeast. The diverse genes and regulatory elements associated with this coordination suggest that it is generated by a global mechanism of gene regulation and modulated by gene-specific mechanisms. The observation of a similar coupling in mammals raises the possibility that coupling of transcription and mRNA degradation may reflect an evolutionarily conserved phenomenon in gene regulation. PMID:23237624
Henry, Kelli F.; Kawashima, Tomokazu; Goldberg, Robert B.
2015-03-22
Little is known about the molecular mechanisms by which the embryo proper and suspensor of plant embryos activate specific gene sets shortly after fertilization. We analyzed the upstream region of the Scarlet Runner Bean ( Phaseolus coccineus) G564 gene in order to understand how genes are activated specifically in the suspensor during early embryo development. Previously, we showed that a 54-bp fragment of the G564 upstream region is sufficient for suspensor transcription and contains at least three required cis-regulatory sequences, including the 10-bp motif (5'-GAAAAGCGAA-3'), the 10 bp-like motif (5'-GAAAAACGAA-3'), and Region 2 motif (partial sequence 5'-TTGGT-3'). Here, we usemore » site-directed mutagenesis experiments in transgenic tobacco globularstage embryos to identify two additional cis-regulatory elements within the 54-bp cis-regulatory module that are required for G564 suspensor transcription: the Fifth motif (5'-GAGTTA-3') and a third 10-bp-related sequence (5'-GAAAACCACA-3'). Further deletion of the 54-bp fragment revealed that a 47-bp fragment containing the five motifs (the 10-bp, 10-bp-like, 10-bp-related, Region 2 and Fifth motifs) is sufficient for suspensor transcription, and represents a cis-regulatory module. A consensus sequence for each type of motif was determined by comparing motif sequences shown to activate suspensor transcription. Phylogenetic analyses suggest that the regulation of G564 is evolutionarily conserved. Lastly, a homologous cis-regulatory module was found upstream of the G564 ortholog in the Common Bean (Phaseolus vulgaris), indicating that the regulation of G564 is evolutionarily conserved in closely related bean species.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Henry, Kelli F.; Kawashima, Tomokazu; Goldberg, Robert B.
Little is known about the molecular mechanisms by which the embryo proper and suspensor of plant embryos activate specific gene sets shortly after fertilization. We analyzed the upstream region of the Scarlet Runner Bean ( Phaseolus coccineus) G564 gene in order to understand how genes are activated specifically in the suspensor during early embryo development. Previously, we showed that a 54-bp fragment of the G564 upstream region is sufficient for suspensor transcription and contains at least three required cis-regulatory sequences, including the 10-bp motif (5'-GAAAAGCGAA-3'), the 10 bp-like motif (5'-GAAAAACGAA-3'), and Region 2 motif (partial sequence 5'-TTGGT-3'). Here, we usemore » site-directed mutagenesis experiments in transgenic tobacco globularstage embryos to identify two additional cis-regulatory elements within the 54-bp cis-regulatory module that are required for G564 suspensor transcription: the Fifth motif (5'-GAGTTA-3') and a third 10-bp-related sequence (5'-GAAAACCACA-3'). Further deletion of the 54-bp fragment revealed that a 47-bp fragment containing the five motifs (the 10-bp, 10-bp-like, 10-bp-related, Region 2 and Fifth motifs) is sufficient for suspensor transcription, and represents a cis-regulatory module. A consensus sequence for each type of motif was determined by comparing motif sequences shown to activate suspensor transcription. Phylogenetic analyses suggest that the regulation of G564 is evolutionarily conserved. Lastly, a homologous cis-regulatory module was found upstream of the G564 ortholog in the Common Bean (Phaseolus vulgaris), indicating that the regulation of G564 is evolutionarily conserved in closely related bean species.« less
In silico modeling of epigenetic-induced changes in photoreceptor cis-regulatory elements.
Hossain, Reafa A; Dunham, Nicholas R; Enke, Raymond A; Berndsen, Christopher E
2018-01-01
DNA methylation is a well-characterized epigenetic repressor of mRNA transcription in many plant and vertebrate systems. However, the mechanism of this repression is not fully understood. The process of transcription is controlled by proteins that regulate recruitment and activity of RNA polymerase by binding to specific cis-regulatory sequences. Cone-rod homeobox (CRX) is a well-characterized mammalian transcription factor that controls photoreceptor cell-specific gene expression. Although much is known about the functions and DNA binding specificity of CRX, little is known about how DNA methylation modulates CRX binding affinity to genomic cis-regulatory elements. We used bisulfite pyrosequencing of human ocular tissues to measure DNA methylation levels of the regulatory regions of RHO , PDE6B, PAX6 , and LINE1 retrotransposon repeats. To describe the molecular mechanism of repression, we used molecular modeling to illustrate the effect of DNA methylation on human RHO regulatory sequences. In this study, we demonstrate an inverse correlation between DNA methylation in regulatory regions adjacent to the human RHO and PDE6B genes and their subsequent transcription in human ocular tissues. Docking of CRX to the DNA models shows that CRX interacts with the grooves of these sequences, suggesting changes in groove structure could regulate binding. Molecular dynamics simulations of the RHO promoter and enhancer regions show changes in the flexibility and groove width upon epigenetic modification. Models also demonstrate changes in the local dynamics of CRX binding sites within RHO regulatory sequences which may account for the repression of CRX-dependent transcription. Collectively, these data demonstrate epigenetic regulation of CRX binding sites in human retinal tissue and provide insight into the mechanism of this mode of epigenetic regulation to be tested in future experiments.
Henry, Kelli F; Kawashima, Tomokazu; Goldberg, Robert B
2015-06-01
Little is known about the molecular mechanisms by which the embryo proper and suspensor of plant embryos activate specific gene sets shortly after fertilization. We analyzed the upstream region of the Scarlet Runner Bean (Phaseolus coccineus) G564 gene in order to understand how genes are activated specifically in the suspensor during early embryo development. Previously, we showed that a 54-bp fragment of the G564 upstream region is sufficient for suspensor transcription and contains at least three required cis-regulatory sequences, including the 10-bp motif (5'-GAAAAGCGAA-3'), the 10 bp-like motif (5'-GAAAAACGAA-3'), and Region 2 motif (partial sequence 5'-TTGGT-3'). Here, we use site-directed mutagenesis experiments in transgenic tobacco globular-stage embryos to identify two additional cis-regulatory elements within the 54-bp cis-regulatory module that are required for G564 suspensor transcription: the Fifth motif (5'-GAGTTA-3') and a third 10-bp-related sequence (5'-GAAAACCACA-3'). Further deletion of the 54-bp fragment revealed that a 47-bp fragment containing the five motifs (the 10-bp, 10-bp-like, 10-bp-related, Region 2 and Fifth motifs) is sufficient for suspensor transcription, and represents a cis-regulatory module. A consensus sequence for each type of motif was determined by comparing motif sequences shown to activate suspensor transcription. Phylogenetic analyses suggest that the regulation of G564 is evolutionarily conserved. A homologous cis-regulatory module was found upstream of the G564 ortholog in the Common Bean (Phaseolus vulgaris), indicating that the regulation of G564 is evolutionarily conserved in closely related bean species.
Suzuki, Masaharu; Ketterling, Matthew G; McCarty, Donald R
2005-09-01
We have developed a simple quantitative computational approach for objective analysis of cis-regulatory sequences in promoters of coregulated genes. The program, designated MotifFinder, identifies oligo sequences that are overrepresented in promoters of coregulated genes. We used this approach to analyze promoter sequences of Viviparous1 (VP1)/abscisic acid (ABA)-regulated genes and cold-regulated genes, respectively, of Arabidopsis (Arabidopsis thaliana). We detected significantly enriched sequences in up-regulated genes but not in down-regulated genes. This result suggests that gene activation but not repression is mediated by specific and common sequence elements in promoters. The enriched motifs include several known cis-regulatory sequences as well as previously unidentified motifs. With respect to known cis-elements, we dissected the flanking nucleotides of the core sequences of Sph element, ABA response elements (ABREs), and the C repeat/dehydration-responsive element. This analysis identified the motif variants that may correlate with qualitative and quantitative differences in gene expression. While both VP1 and cold responses are mediated in part by ABA signaling via ABREs, these responses correlate with unique ABRE variants distinguished by nucleotides flanking the ACGT core. ABRE and Sph motifs are tightly associated uniquely in the coregulated set of genes showing a strict dependence on VP1 and ABA signaling. Finally, analysis of distribution of the enriched sequences revealed a striking concentration of enriched motifs in a proximal 200-base region of VP1/ABA and cold-regulated promoters. Overall, each class of coregulated genes possesses a discrete set of the enriched motifs with unique distributions in their promoters that may account for the specificity of gene regulation.
Database construction for PromoterCAD: synthetic promoter design for mammals and plants.
Nishikata, Koro; Cox, Robert Sidney; Shimoyama, Sayoko; Yoshida, Yuko; Matsui, Minami; Makita, Yuko; Toyoda, Tetsuro
2014-03-21
Synthetic promoters can control a gene's timing, location, and expression level. The PromoterCAD web server ( http://promotercad.org ) allows the design of synthetic promoters to control plant gene expression, by novel arrangement of cis-regulatory elements. Recently, we have expanded PromoterCAD's scope with additional plant and animal data: (1) PLACE (Plant Cis-acting Regulatory DNA Elements), including various sized sequence motifs; (2) PEDB (Mammalian Promoter/Enhancer Database), including gene expression data for mammalian tissues. The plant PromoterCAD data now contains 22 000 Arabidopsis thaliana genes, 2 200 000 microarray measurements in 20 growth conditions and 79 tissue organs and developmental stages, while the new mammalian PromoterCAD data contains 679 Mus musculus genes and 65 000 microarray measurements in 96 tissue organs and cell types ( http://promotercad.org/mammal/ ). This work presents step-by-step instructions for adding both regulatory motif and gene expression data to PromoterCAD, to illustrate how users can expand PromoterCAD functionality for their own applications and organisms.
BET Bromodomain Inhibition Releases the Mediator Complex from Select cis-Regulatory Elements.
Bhagwat, Anand S; Roe, Jae-Seok; Mok, Beverly Y L; Hohmann, Anja F; Shi, Junwei; Vakoc, Christopher R
2016-04-19
The bromodomain and extraterminal (BET) protein BRD4 can physically interact with the Mediator complex, but the relevance of this association to the therapeutic effects of BET inhibitors in cancer is unclear. Here, we show that BET inhibition causes a rapid release of Mediator from a subset of cis-regulatory elements in the genome of acute myeloid leukemia (AML) cells. These sites of Mediator eviction were highly correlated with transcriptional suppression of neighboring genes, which are enriched for targets of the transcription factor MYB and for functions related to leukemogenesis. A shRNA screen of Mediator in AML cells identified the MED12, MED13, MED23, and MED24 subunits as performing a similar regulatory function to BRD4 in this context, including a shared role in sustaining a block in myeloid maturation. These findings suggest that the interaction between BRD4 and Mediator has functional importance for gene-specific transcriptional activation and for AML maintenance. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Su, Zhaoming; Wu, Chao; Shi, Liuqing; Luthra, Priya; Pintilie, Grigore D.; Johnson, Britney; Porter, Justin R.; Ge, Peng; Chen, Muyuan; Liu, Gai; Frederick, Thomas E.; Binning, Jennifer M.; Bowman, Gregory R.; Zhou, Z. Hong; Basler, Christopher F.; Gross, Michael L.; Leung, Daisy W.
2018-01-01
Summary Ebola virus nucleoprotein (eNP) assembles into higher-ordered structures that form the viral nucleocapsid (NC) and serve as the scaffold for viral RNA synthesis. However, molecular insights into the NC assembly process are lacking. Using a hybrid approach, we characterized the NC-like assembly of eNP, identified novel regulatory elements, and described how these elements impact function. We generated a three-dimensional structure of the eNP NC-like assembly at 5.8 Å using electron cryo-microscopy and identified a new regulatory role for eNP helices α22–α23. Biochemical, biophysical, and mutational analysis revealed inter-eNP contacts within α22–α23 are critical for viral NC-assembly and regulate viral RNA synthesis. These observations suggest that the N-terminus and α22–α23 of eNP function as context dependent regulatory modules (CDRMs). Our current study provides a framework for a structural mechanism for NC-like assembly and a new therapeutic target. PMID:29474922
Forecasting PM10 in metropolitan areas: Efficacy of neural networks.
Fernando, H J S; Mammarella, M C; Grandoni, G; Fedele, P; Di Marco, R; Dimitrova, R; Hyde, P
2012-04-01
Deterministic photochemical air quality models are commonly used for regulatory management and planning of urban airsheds. These models are complex, computer intensive, and hence are prohibitively expensive for routine air quality predictions. Stochastic methods are becoming increasingly popular as an alternative, which relegate decision making to artificial intelligence based on Neural Networks that are made of artificial neurons or 'nodes' capable of 'learning through training' via historic data. A Neural Network was used to predict particulate matter concentration at a regulatory monitoring site in Phoenix, Arizona; its development, efficacy as a predictive tool and performance vis-à-vis a commonly used regulatory photochemical model are described in this paper. It is concluded that Neural Networks are much easier, quicker and economical to implement without compromising the accuracy of predictions. Neural Networks can be used to develop rapid air quality warning systems based on a network of automated monitoring stations. Copyright © 2011 Elsevier Ltd. All rights reserved.
Crepaldi, Luca; Policarpi, Cristina; Coatti, Alessandro; Sherlock, William T; Jongbloets, Bart C; Down, Thomas A; Riccio, Antonella
2013-01-01
In neurons, the timely and accurate expression of genes in response to synaptic activity relies on the interplay between epigenetic modifications of histones, recruitment of regulatory proteins to chromatin and changes to nuclear structure. To identify genes and regulatory elements responsive to synaptic activation in vivo, we performed a genome-wide ChIPseq analysis of acetylated histone H3 using somatosensory cortex of mice exposed to novel enriched environmental (NEE) conditions. We discovered that Short Interspersed Elements (SINEs) located distal to promoters of activity-dependent genes became acetylated following exposure to NEE and were bound by the general transcription factor TFIIIC. Importantly, under depolarizing conditions, inducible genes relocated to transcription factories (TFs), and this event was controlled by TFIIIC. Silencing of the TFIIIC subunit Gtf3c5 in non-stimulated neurons induced uncontrolled relocation to TFs and transcription of activity-dependent genes. Remarkably, in cortical neurons, silencing of Gtf3c5 mimicked the effects of chronic depolarization, inducing a dramatic increase of both dendritic length and branching. These findings reveal a novel and essential regulatory function of both SINEs and TFIIIC in mediating gene relocation and transcription. They also suggest that TFIIIC may regulate the rearrangement of nuclear architecture, allowing the coordinated expression of activity-dependent neuronal genes.
de Souza, Flávio S.J.; Franchini, Lucía F.; Rubinstein, Marcelo
2013-01-01
Transposable elements (TEs) are mobile genetic sequences that can jump around the genome from one location to another, behaving as genomic parasites. TEs have been particularly effective in colonizing mammalian genomes, and such heavy TE load is expected to have conditioned genome evolution. Indeed, studies conducted both at the gene and genome levels have uncovered TE insertions that seem to have been co-opted—or exapted—by providing transcription factor binding sites (TFBSs) that serve as promoters and enhancers, leading to the hypothesis that TE exaptation is a major factor in the evolution of gene regulation. Here, we critically review the evidence for exaptation of TE-derived sequences as TFBSs, promoters, enhancers, and silencers/insulators both at the gene and genome levels. We classify the functional impact attributed to TE insertions into four categories of increasing complexity and argue that so far very few studies have conclusively demonstrated exaptation of TEs as transcriptional regulatory regions. We also contend that many genome-wide studies dealing with TE exaptation in recent lineages of mammals are still inconclusive and that the hypothesis of rapid transcriptional regulatory rewiring mediated by TE mobilization must be taken with caution. Finally, we suggest experimental approaches that may help attributing higher-order functions to candidate exapted TEs. PMID:23486611
Stiers, Pieter-Jan; van Gastel, Nick; Moermans, Karen; Stockmans, Ingrid; Carmeliet, Geert
2017-12-01
To improve bone healing or regeneration more insight in the fate and role of the different skeletal cell types is required. Mouse models for fate mapping and lineage tracing of skeletal cells, using stage-specific promoters, have advanced our understanding of bone development, a process that is largely recapitulated during bone repair. However, validation of these models is often only performed during development, whereas proof of the activity and specificity of the used promoters during the bone regenerative process is limited. Here, we show that the regulatory elements of the 6kb collagen type II promoter are not adequate to drive gene expression during bone repair. Similarly, the 2.3kb promoter of collagen type I lacks activity in adult mice, but the 3.2kb promoter is suitable. Furthermore, Cre-mediated fate mapping allows the visualization of progeny, but this label retention may hinder to distinguish these cells from ones with active expression of the marker at later time points. Together, our results show that the lineage-specific regulatory elements driving gene expression during bone development differ from those required later in life and during bone repair, and justify validation of lineage-specific cell tracing and gene silencing strategies during fracture healing and bone regenerative applications. Copyright © 2017 Elsevier Inc. All rights reserved.
Inaba, Takehito; Nagano, Yukio; Sakakibara, Toshihiro; Sasaki, Yukiko
1999-01-01
The pra2 gene encodes a pea (Pisum sativum) small GTPase belonging to the YPT/rab family, and its expression is down-regulated by light, mediated by phytochrome. We have isolated and characterized a genomic clone of this gene and constructed a fusion DNA of its 5′-upstream region in front of the gene for firefly luciferase. Using this construct in a transient assay, we determined a pra2 cis-regulatory region sufficient to direct the light down-regulation of the luciferase reporter gene. Both 5′- and internal deletion analyses revealed that the 93-bp sequence between −734 and −642 from the transcriptional start site was important for phytochrome down-regulation. Gain-of-function analysis showed that this 93-bp region could confer light down-regulation when fused to the cauliflower mosaic virus 35S promoter. Furthermore, linker-scanning analysis showed that a 12-bp sequence within the 93-bp region mediated phytochrome down-regulation. Gel-retardation analysis showed the presence of a nuclear factor that was specifically bound to the 12-bp sequence in vitro. These results indicate that this element is a cis-regulatory element involved in phytochrome down-regulated expression. PMID:10364400
Sun, Gao-Fei; He, Shou-Pu; Du, Xiong-Ming
2013-10-01
Cotton genomic studies have boomed since the release of Gossypium raimondii draft genome. In this study, cis-regulatory element (CRE) in 1 kb length sequence upstream 5' UTR of annotated genes were selected and scanned in the Arabidopsis thaliana (At) and Gossypium raimondii (Gr) genomes, based on the database of PLACE (Plant cis-acting Regulatory DNA Elements). According to the definition of this study, 44 (12.3%) and 57 (15.5%) CREs presented "peak-like" distribution in the 1 kb selected sequences of both genomes, respectively. Thirty-four of them were peak-like distributed in both genomes, which could be further categorized into 4 types based on their core sequences. The coincidence of TATABOX peak position and their actual position ((-) -30 bp) indicated that the position of a common CRE was conservative in different genes, which suggested that the peak position of these CREs was their possible actual position of transcription factors. The position of a common CRE was also different between the two genomes due to stronger length variation of 5' UTR in Gr than At. Furthermore, most of the peak-like CREs were located in the region of -110 bp-0 bp, which suggested that concentrated distribution might be conductive to the interaction of transcription factors, and then regulate the gene expression in downstream.
Crepaldi, Luca; Policarpi, Cristina; Coatti, Alessandro; Sherlock, William T.; Jongbloets, Bart C.; Down, Thomas A.; Riccio, Antonella
2013-01-01
In neurons, the timely and accurate expression of genes in response to synaptic activity relies on the interplay between epigenetic modifications of histones, recruitment of regulatory proteins to chromatin and changes to nuclear structure. To identify genes and regulatory elements responsive to synaptic activation in vivo, we performed a genome-wide ChIPseq analysis of acetylated histone H3 using somatosensory cortex of mice exposed to novel enriched environmental (NEE) conditions. We discovered that Short Interspersed Elements (SINEs) located distal to promoters of activity-dependent genes became acetylated following exposure to NEE and were bound by the general transcription factor TFIIIC. Importantly, under depolarizing conditions, inducible genes relocated to transcription factories (TFs), and this event was controlled by TFIIIC. Silencing of the TFIIIC subunit Gtf3c5 in non-stimulated neurons induced uncontrolled relocation to TFs and transcription of activity-dependent genes. Remarkably, in cortical neurons, silencing of Gtf3c5 mimicked the effects of chronic depolarization, inducing a dramatic increase of both dendritic length and branching. These findings reveal a novel and essential regulatory function of both SINEs and TFIIIC in mediating gene relocation and transcription. They also suggest that TFIIIC may regulate the rearrangement of nuclear architecture, allowing the coordinated expression of activity-dependent neuronal genes. PMID:23966877
Evolutionary Novelty in a Butterfly Wing Pattern through Enhancer Shuffling
Pardo-Diaz, Carolina; Hanly, Joseph J.; Martin, Simon H.; Mallet, James; Dasmahapatra, Kanchon K.; Salazar, Camilo; Joron, Mathieu; Nadeau, Nicola; McMillan, W. Owen; Jiggins, Chris D.
2016-01-01
An important goal in evolutionary biology is to understand the genetic changes underlying novel morphological structures. We investigated the origins of a complex wing pattern found among Amazonian Heliconius butterflies. Genome sequence data from 142 individuals across 17 species identified narrow regions associated with two distinct red colour pattern elements, dennis and ray. We hypothesise that these modules in non-coding sequence represent distinct cis-regulatory loci that control expression of the transcription factor optix, which in turn controls red pattern variation across Heliconius. Phylogenetic analysis of the two elements demonstrated that they have distinct evolutionary histories and that novel adaptive morphological variation was created by shuffling these cis-regulatory modules through recombination between divergent lineages. In addition, recombination of modules into different combinations within species further contributes to diversity. Analysis of the timing of diversification in these two regions supports the hypothesis of introgression moving regulatory modules between species, rather than shared ancestral variation. The dennis phenotype introgressed into Heliconius melpomene at about the same time that ray originated in this group, while ray introgressed back into H. elevatus much more recently. We show that shuffling of existing enhancer elements both within and between species provides a mechanism for rapid diversification and generation of novel morphological combinations during adaptive radiation. PMID:26771987
Kyrchanova, Olga; Chetverina, Darya; Maksimenko, Oksana; Kullyev, Andrey; Georgiev, Pavel
2008-12-01
Insulators are defined as a class of regulatory elements that delimit independent transcriptional domains within eukaryotic genomes. According to previous data, an interaction (pairing) between some Drosophila insulators can support distant activation of a promoter by an enhancer. Here, we have demonstrated that pairs of well-studied insulators such as scs-scs, scs'-scs', 1A2-1A2 and Wari-Wari support distant activation of the white promoter by the yeast GAL4 activator in an orientation-dependent manner. The same is true for the efficiency of the enhancer that stimulates white expression in the eyes. In all insulator pairs tested, stimulation of the white gene was stronger when insulators were inserted between the eye enhancer or GAL4 and the white promoter in opposite orientations relative to each other. As shown previously, Zw5, Su(Hw) and dCTCF proteins are required for the functioning of different insulators that do not interact with each other. Here, strong functional interactions have been revealed between DNA fragments containing binding sites for either Zw5 or Su(Hw) or dCTCF protein but not between heterologous binding sites [Zw5-Su(Hw), dCTCF-Su(Hw), or dCTCF-Zw5]. These results suggest that insulator proteins can support selective interactions between distant regulatory elements.
Evolutionary Novelty in a Butterfly Wing Pattern through Enhancer Shuffling.
Wallbank, Richard W R; Baxter, Simon W; Pardo-Diaz, Carolina; Hanly, Joseph J; Martin, Simon H; Mallet, James; Dasmahapatra, Kanchon K; Salazar, Camilo; Joron, Mathieu; Nadeau, Nicola; McMillan, W Owen; Jiggins, Chris D
2016-01-01
An important goal in evolutionary biology is to understand the genetic changes underlying novel morphological structures. We investigated the origins of a complex wing pattern found among Amazonian Heliconius butterflies. Genome sequence data from 142 individuals across 17 species identified narrow regions associated with two distinct red colour pattern elements, dennis and ray. We hypothesise that these modules in non-coding sequence represent distinct cis-regulatory loci that control expression of the transcription factor optix, which in turn controls red pattern variation across Heliconius. Phylogenetic analysis of the two elements demonstrated that they have distinct evolutionary histories and that novel adaptive morphological variation was created by shuffling these cis-regulatory modules through recombination between divergent lineages. In addition, recombination of modules into different combinations within species further contributes to diversity. Analysis of the timing of diversification in these two regions supports the hypothesis of introgression moving regulatory modules between species, rather than shared ancestral variation. The dennis phenotype introgressed into Heliconius melpomene at about the same time that ray originated in this group, while ray introgressed back into H. elevatus much more recently. We show that shuffling of existing enhancer elements both within and between species provides a mechanism for rapid diversification and generation of novel morphological combinations during adaptive radiation.
Decoding transcriptional enhancers: Evolving from annotation to functional interpretation
Engel, Krysta L.; Mackiewicz, Mark; Hardigan, Andrew A.; Myers, Richard M.; Savic, Daniel
2016-01-01
Deciphering the intricate molecular processes that orchestrate the spatial and temporal regulation of genes has become an increasingly major focus of biological research. The differential expression of genes by diverse cell types with a common genome is a hallmark of complex cellular functions, as well as the basis for multicellular life. Importantly, a more coherent understanding of gene regulation is critical for defining developmental processes, evolutionary principles and disease etiologies. Here we present our current understanding of gene regulation by focusing on the role of enhancer elements in these complex processes. Although functional genomic methods have provided considerable advances to our understanding of gene regulation, these assays, which are usually performed on a genome-wide scale, typically provide correlative observations that lack functional interpretation. Recent innovations in genome editing technologies have placed gene regulatory studies at an exciting crossroads, as systematic, functional evaluation of enhancers and other transcriptional regulatory elements can now be performed in a coordinated, high-throughput manner across the entire genome. This review provides insights on transcriptional enhancer function, their role in development and disease, and catalogues experimental tools commonly used to study these elements. Additionally, we discuss the crucial role of novel techniques in deciphering the complex gene regulatory landscape and how these studies will shape future research. PMID:27224938
Decoding transcriptional enhancers: Evolving from annotation to functional interpretation.
Engel, Krysta L; Mackiewicz, Mark; Hardigan, Andrew A; Myers, Richard M; Savic, Daniel
2016-09-01
Deciphering the intricate molecular processes that orchestrate the spatial and temporal regulation of genes has become an increasingly major focus of biological research. The differential expression of genes by diverse cell types with a common genome is a hallmark of complex cellular functions, as well as the basis for multicellular life. Importantly, a more coherent understanding of gene regulation is critical for defining developmental processes, evolutionary principles and disease etiologies. Here we present our current understanding of gene regulation by focusing on the role of enhancer elements in these complex processes. Although functional genomic methods have provided considerable advances to our understanding of gene regulation, these assays, which are usually performed on a genome-wide scale, typically provide correlative observations that lack functional interpretation. Recent innovations in genome editing technologies have placed gene regulatory studies at an exciting crossroads, as systematic, functional evaluation of enhancers and other transcriptional regulatory elements can now be performed in a coordinated, high-throughput manner across the entire genome. This review provides insights on transcriptional enhancer function, their role in development and disease, and catalogues experimental tools commonly used to study these elements. Additionally, we discuss the crucial role of novel techniques in deciphering the complex gene regulatory landscape and how these studies will shape future research. Copyright © 2016 Elsevier Ltd. All rights reserved.
40 CFR 79.56 - Fuel and fuel additive grouping system.
Code of Federal Regulations, 2010 CFR
2010-07-01
... further testing under the provisions of Tier 3 or to support regulatory decisions affecting that fuel or... elements or classes of compounds other than those permitted in the base fuel for the respective fuel family... all of the following criteria: (1) Contain no elements other than carbon, hydrogen, oxygen, nitrogen...
USDA-ARS?s Scientific Manuscript database
A transient in vivo P element excision assay was used to test the regulatory properties of putative repressor-encoding plasmids in Drosophila melanogaster embryos. The somatic expression of an unmodified transposase transcription unit under the control of a heat shock gene promoter (phsn) effectivel...
Syed, Mustafa H; Karpinets, Tatiana V; Leuze, Michael R; Kora, Guruprasad H; Romine, Margaret R; Uberbacher, Edward C
2009-01-01
Shewanella oneidensis MR-1 is an important model organism for environmental research as it has an exceptional metabolic and respiratory versatility regulated by a complex regulatory network. We have developed a database to collect experimental and computational data relating to regulation of gene and protein expression, and, a visualization environment that enables integration of these data types. The regulatory information in the database includes predictions of DNA regulator binding sites, sigma factor binding sites, transcription units, operons, promoters, and RNA regulators including non-coding RNAs, riboswitches, and different types of terminators. Availability http://shewanella-knowledgebase.org:8080/Shewanella/gbrowserLanding.jsp PMID:20198195
Massive contribution of transposable elements to mammalian regulatory sequences.
Rayan, Nirmala Arul; Del Rosario, Ricardo C H; Prabhakar, Shyam
2016-09-01
Barbara McClintock discovered the existence of transposable elements (TEs) in the late 1940s and initially proposed that they contributed to the gene regulatory program of higher organisms. This controversial idea gained acceptance only much later in the 1990s, when the first examples of TE-derived promoter sequences were uncovered. It is now known that half of the human genome is recognizably derived from TEs. It is thus important to understand the scope and nature of their contribution to gene regulation. Here, we provide a timeline of major discoveries in this area and discuss how transposons have revolutionized our understanding of mammalian genomes, with a special emphasis on the massive contribution of TEs to primate evolution. Our analysis of primate-specific functional elements supports a simple model for the rate at which new functional elements arise in unique and TE-derived DNA. Finally, we discuss some of the challenges and unresolved questions in the field, which need to be addressed in order to fully characterize the impact of TEs on gene regulation, evolution and disease processes. Copyright © 2016 Elsevier Ltd. All rights reserved.
Identification of Regulatory Elements That Control PPARγ Expression in Adipocyte Progenitors
Chou, Wen-Ling; Galmozzi, Andrea; Partida, David; Kwan, Kevin; Yeung, Hui; Su, Andrew I.; Saez, Enrique
2013-01-01
Adipose tissue renewal and obesity-driven expansion of fat cell number are dependent on proliferation and differentiation of adipose progenitors that reside in the vasculature that develops in coordination with adipose depots. The transcriptional events that regulate commitment of progenitors to the adipose lineage are poorly understood. Because expression of the nuclear receptor PPARγ defines the adipose lineage, isolation of elements that control PPARγ expression in adipose precursors may lead to discovery of transcriptional regulators of early adipocyte determination. Here, we describe the identification and validation in transgenic mice of 5 highly conserved non-coding sequences from the PPARγ locus that can drive expression of a reporter gene in a manner that recapitulates the tissue-specific pattern of PPARγ expression. Surprisingly, these 5 elements appear to control PPARγ expression in adipocyte precursors that are associated with the vasculature of adipose depots, but not in mature adipocytes. Characterization of these five PPARγ regulatory sequences may enable isolation of the transcription factors that bind these cis elements and provide insight into the molecular regulation of adipose tissue expansion in normal and pathological states. PMID:24009687
QuIN: A Web Server for Querying and Visualizing Chromatin Interaction Networks.
Thibodeau, Asa; Márquez, Eladio J; Luo, Oscar; Ruan, Yijun; Menghi, Francesca; Shin, Dong-Guk; Stitzel, Michael L; Vera-Licona, Paola; Ucar, Duygu
2016-06-01
Recent studies of the human genome have indicated that regulatory elements (e.g. promoters and enhancers) at distal genomic locations can interact with each other via chromatin folding and affect gene expression levels. Genomic technologies for mapping interactions between DNA regions, e.g., ChIA-PET and HiC, can generate genome-wide maps of interactions between regulatory elements. These interaction datasets are important resources to infer distal gene targets of non-coding regulatory elements and to facilitate prioritization of critical loci for important cellular functions. With the increasing diversity and complexity of genomic information and public ontologies, making sense of these datasets demands integrative and easy-to-use software tools. Moreover, network representation of chromatin interaction maps enables effective data visualization, integration, and mining. Currently, there is no software that can take full advantage of network theory approaches for the analysis of chromatin interaction datasets. To fill this gap, we developed a web-based application, QuIN, which enables: 1) building and visualizing chromatin interaction networks, 2) annotating networks with user-provided private and publicly available functional genomics and interaction datasets, 3) querying network components based on gene name or chromosome location, and 4) utilizing network based measures to identify and prioritize critical regulatory targets and their direct and indirect interactions. QuIN's web server is available at http://quin.jax.org QuIN is developed in Java and JavaScript, utilizing an Apache Tomcat web server and MySQL database and the source code is available under the GPLV3 license available on GitHub: https://github.com/UcarLab/QuIN/.
Patel, Hardip; Forêt, Sylvain; Karlsen, Bård Ove; Jørgensen, Tor Erik; Hall-Spencer, Jason M
2018-01-01
Abstract Cnidarians harbor a variety of small regulatory RNAs that include microRNAs (miRNAs) and PIWI-interacting RNAs (piRNAs), but detailed information is limited. Here, we report the identification and expression of novel miRNAs and putative piRNAs, as well as their genomic loci, in the symbiotic sea anemone Anemonia viridis. We generated a draft assembly of the A. viridis genome with putative size of 313 Mb that appeared to be composed of about 36% repeats, including known transposable elements. We detected approximately equal fractions of DNA transposons and retrotransposons. Deep sequencing of small RNA libraries constructed from A. viridis adults sampled at a natural CO2 gradient off Vulcano Island, Italy, identified 70 distinct miRNAs. Eight were homologous to previously reported miRNAs in cnidarians, whereas 62 appeared novel. Nine miRNAs were recognized as differentially expressed along the natural seawater pH gradient. We found a highly abundant and diverse population of piRNAs, with a substantial fraction showing ping–pong signatures. We identified nearly 22% putative piRNAs potentially targeting transposable elements within the A. viridis genome. The A. viridis genome appeared similar in size to that of other hexacorals with a very high divergence of transposable elements resembling that of the sea anemone genus Exaiptasia. The genome encodes and expresses a high number of small regulatory RNAs, which include novel miRNAs and piRNAs. Differentially expressed small RNAs along the seawater pH gradient indicated regulatory gene responses to environmental stressors. PMID:29385567
Puhl, Henry L.; Ikeda, Stephen R.
2008-01-01
Voltage-gated sodium channels (VGSC) are critical membrane components that participate in the electrical activity of excitable cells. The type one VGSC family includes the tetrodotoxin insensitive sodium channel, Nav1.8, encoded by the Scn10a gene. Nav1.8 expression is restricted to small and medium diameter nociceptive sensory neurons of the dorsal root (DRG) and cranial sensory ganglia. In order to understand the stringent transcriptional regulation of the Scn10a gene, the sensory neuron specific promoter was functionally identified. While identifying the mRNA 5’ end, alternative splicing within the 5’ UTR was observed to create heterogeneity in the RNA transcript. Four kilobases of upstream genomic DNA was cloned and the presence of tissue specific promoter activity was tested by microinjection and adenoviral infection of fluorescent protein reporter constructs into primary mouse and rat neurons, and cell lines. The region contained many putative transcription factor binding sites and strong homology with the predicted rat ortholog. Homology to the predicted human ortholog was limited to the proximal end and several conserved cis elements were noted. Two regulatory modules were identified by microinjection of reporter constructs into DRG and superior cervical ganglia neurons: a neuron specific proximal promoter region between −1.6 and −0.2kb of the transcription start site cluster, and a distal sensory neuron switch region beyond −1.6kb that restricted fluorescent protein expression to a subset of primary sensory neurons. PMID:18466327
Wang, Zhuo; Danziger, Samuel A; Heavner, Benjamin D; Ma, Shuyi; Smith, Jennifer J; Li, Song; Herricks, Thurston; Simeonidis, Evangelos; Baliga, Nitin S; Aitchison, John D; Price, Nathan D
2017-05-01
Gene regulatory and metabolic network models have been used successfully in many organisms, but inherent differences between them make networks difficult to integrate. Probabilistic Regulation Of Metabolism (PROM) provides a partial solution, but it does not incorporate network inference and underperforms in eukaryotes. We present an Integrated Deduced And Metabolism (IDREAM) method that combines statistically inferred Environment and Gene Regulatory Influence Network (EGRIN) models with the PROM framework to create enhanced metabolic-regulatory network models. We used IDREAM to predict phenotypes and genetic interactions between transcription factors and genes encoding metabolic activities in the eukaryote, Saccharomyces cerevisiae. IDREAM models contain many fewer interactions than PROM and yet produce significantly more accurate growth predictions. IDREAM consistently outperformed PROM using any of three popular yeast metabolic models and across three experimental growth conditions. Importantly, IDREAM's enhanced accuracy makes it possible to identify subtle synthetic growth defects. With experimental validation, these novel genetic interactions involving the pyruvate dehydrogenase complex suggested a new role for fatty acid-responsive factor Oaf1 in regulating acetyl-CoA production in glucose grown cells.
2012 Global Summit on Regulatory Science (GSRS-2012)--modernizing toxicology.
Miller, Margaret A; Tong, Weida; Fan, Xiaohui; Slikker, William
2013-01-01
Regulatory science encompasses the tools, models, techniques, and studies needed to assess and evaluate product safety, efficacy, quality, and performance. Several recent publications have emphasized the role of regulatory science in improving global health, supporting economic development and fostering innovation. As for other scientific disciplines, research in regulatory science is the critical element underpinning the development and advancement of regulatory science as a modern scientific discipline. As a regulatory agency in the 21st century, the Food and Drug Administration (FDA) has an international component that underpins its domestic mission; foods, drugs, and devices are developed and imported to the United States from across the world. The Global Summit on Regulatory Science, an international conference for discussing innovative technologies, approaches, and partnerships that enhance the translation of basic science into regulatory applications, is providing leadership for the advancement of regulatory sciences within the global context. Held annually, this international conference provides a platform where regulators, policy makers, and bench scientists from various countries can exchange views on how to develop, apply, and implement innovative methodologies into regulatory assessments in their respective countries, as well as developing a harmonized strategy to improve global public health through global collaboration.
2014-01-01
Background Cis-regulatory modules (CRMs), or the DNA sequences required for regulating gene expression, play the central role in biological researches on transcriptional regulation in metazoan species. Nowadays, the systematic understanding of CRMs still mainly resorts to computational methods due to the time-consuming and small-scale nature of experimental methods. But the accuracy and reliability of different CRM prediction tools are still unclear. Without comparative cross-analysis of the results and combinatorial consideration with extra experimental information, there is no easy way to assess the confidence of the predicted CRMs. This limits the genome-wide understanding of CRMs. Description It is known that transcription factor binding and epigenetic profiles tend to determine functions of CRMs in gene transcriptional regulation. Thus integration of the genome-wide epigenetic profiles with systematically predicted CRMs can greatly help researchers evaluate and decipher the prediction confidence and possible transcriptional regulatory functions of these potential CRMs. However, these data are still fragmentary in the literatures. Here we performed the computational genome-wide screening for potential CRMs using different prediction tools and constructed the pioneer database, cisMEP (cis-regulatory module epigenetic profile database), to integrate these computationally identified CRMs with genomic epigenetic profile data. cisMEP collects the literature-curated TFBS location data and nine genres of epigenetic data for assessing the confidence of these potential CRMs and deciphering the possible CRM functionality. Conclusions cisMEP aims to provide a user-friendly interface for researchers to assess the confidence of different potential CRMs and to understand the functions of CRMs through experimentally-identified epigenetic profiles. The deposited potential CRMs and experimental epigenetic profiles for confidence assessment provide experimentally testable hypotheses for the molecular mechanisms of metazoan gene regulation. We believe that the information deposited in cisMEP will greatly facilitate the comparative usage of different CRM prediction tools and will help biologists to study the modular regulatory mechanisms between different TFs and their target genes. PMID:25521507
Denz, Christopher R; Zhang, Chi; Jia, Pingping; Du, Jianfeng; Huang, Xupei; Dube, Syamalima; Thomas, Anish; Poiesz, Bernard J; Dube, Dipak K
2011-09-01
Tropomyosins are a family of actin-binding proteins that show cell-specific diversity by a combination of multiple genes and alternative RNA splicing. Of the 4 different tropomyosin genes, TPM4 plays a pivotal role in myofibrillogenesis as well as cardiac contractility in amphibians. In this study, we amplified and sequenced the upstream regulatory region of the TPM4 gene from both normal and mutant axolotl hearts. To identify the cis-elements that are essential for the expression of the TPM4, we created various deletion mutants of the TPM4 promoter DNA, inserted the deleted segments into PGL3 vector, and performed promoter-reporter assay using luciferase as the reporter gene. Comparison of sequences of the promoter region of the TPM4 gene from normal and mutant axolotl revealed no mutations in the promoter sequence of the mutant TPM4 gene. CArG box elements that are generally involved in controlling the expression of several other muscle-specific gene promoters were not found in the upstream regulatory region of the TPM4 gene. In deletion experiments, loss of activity of the reporter gene was noted upon deletion which was then restored upon further deletion suggesting the presence of both positive and negative cis-elements in the upstream regulatory region of the TPM4 gene. We believe that this is the first axolotl promoter that has ever been cloned and studied with clear evidence that it functions in mammalian cell lines. Although striated muscle-specific cis-acting elements are absent from the promoter region of TPM4 gene, our results suggest the presence of positive and negative cis-elements in the promoter region, which in conjunction with positive and negative trans-elements may be involved in regulating the expression of TPM4 gene in a tissue-specific manner.
Hutton, John J; Jegga, Anil G; Kong, Sue; Gupta, Ashima; Ebert, Catherine; Williams, Sarah; Katz, Jonathan D; Aronow, Bruce J
2004-01-01
Background In this study we have built and mined a gene expression database composed of 65 diverse mouse tissues for genes preferentially expressed in immune tissues and cell types. Using expression pattern criteria, we identified 360 genes with preferential expression in thymus, spleen, peripheral blood mononuclear cells, lymph nodes (unstimulated or stimulated), or in vitro activated T-cells. Results Gene clusters, formed based on similarity of expression-pattern across either all tissues or the immune tissues only, had highly significant associations both with immunological processes such as chemokine-mediated response, antigen processing, receptor-related signal transduction, and transcriptional regulation, and also with more general processes such as replication and cell cycle control. Within-cluster gene correlations implicated known associations of known genes, as well as immune process-related roles for poorly described genes. To characterize regulatory mechanisms and cis-elements of genes with similar patterns of expression, we used a new version of a comparative genomics-based cis-element analysis tool to identify clusters of cis-elements with compositional similarity among multiple genes. Several clusters contained genes that shared 5–6 cis-elements that included ETS and zinc-finger binding sites. cis-Elements AP2 EGRF ETSF MAZF SP1F ZF5F and AREB ETSF MZF1 PAX5 STAT were shared in a thymus-expressed set; AP4R E2FF EBOX ETSF MAZF SP1F ZF5F and CREB E2FF MAZF PCAT SP1F STAT cis-clusters occurred in activated T-cells; CEBP CREB NFKB SORY and GATA NKXH OCT1 RBIT occurred in stimulated lymph nodes. Conclusion This study demonstrates a series of analytic approaches that have allowed the implication of genes and regulatory elements that participate in the differentiation, maintenance, and function of the immune system. Polymorphism or mutation of these could adversely impact immune system functions. PMID:15504237
FK506 biosynthesis is regulated by two positive regulatory elements in Streptomyces tsukubaensis
2012-01-01
Background FK506 (Tacrolimus) is an important immunosuppressant, produced by industrial biosynthetic processes using various Streptomyces species. Considering the complex structure of FK506, it is reasonable to expect complex regulatory networks controlling its biosynthesis. Regulatory elements, present in gene clusters can have a profound influence on the final yield of target product and can play an important role in development of industrial bioprocesses. Results Three putative regulatory elements, namely fkbR, belonging to the LysR-type family, fkbN, a large ATP-binding regulator of the LuxR family (LAL-type) and allN, a homologue of AsnC family regulatory proteins, were identified in the FK506 gene cluster from Streptomyces tsukubaensis NRRL 18488, a progenitor of industrial strains used for production of FK506. Inactivation of fkbN caused a complete disruption of FK506 biosynthesis, while inactivation of fkbR resulted in about 80% reduction of FK506 yield. No functional role in the regulation of the FK506 gene cluster has been observed for the allN gene. Using RT-PCR and a reporter system based on a chalcone synthase rppA, we demonstrated, that in the wild type as well as in fkbN- and fkbR-inactivated strains, fkbR is transcribed in all stages of cultivation, even before the onset of FK506 production, whereas fkbN expression is initiated approximately with the initiation of FK506 production. Surprisingly, inactivation of fkbN (or fkbR) does not abolish the transcription of the genes in the FK506 gene cluster in general, but may reduce expression of some of the tested biosynthetic genes. Finally, introduction of a second copy of the fkbR or fkbN genes under the control of the strong ermE* promoter into the wild type strain resulted in 30% and 55% of yield improvement, respectively. Conclusions Our results clearly demonstrate the positive regulatory role of fkbR and fkbN genes in FK506 biosynthesis in S. tsukubaensis NRRL 18488. We have shown that regulatory mechanisms can differ substantially from other, even apparently closely similar FK506-producing strains, reported in literature. Finally, we have demonstrated the potential of these genetically modified strains of S. tsukubaensis for improving the yield of fermentative processes for production of FK506. PMID:23083511
Abscisic-acid-dependent basic leucine zipper (bZIP) transcription factors in plant abiotic stress.
Banerjee, Aditya; Roychoudhury, Aryadeep
2017-01-01
One of the major causes of significant crop loss throughout the world is the myriad of environmental stresses including drought, salinity, cold, heavy metal toxicity, and ultraviolet-B (UV-B) rays. Plants as sessile organisms have evolved various effective mechanism which enable them to withstand this plethora of stresses. Most of such regulatory mechanisms usually follow the abscisic-acid (ABA)-dependent pathway. In this review, we have primarily focussed on the basic leucine zipper (bZIP) transcription factors (TFs) activated by the ABA-mediated signalosome. Upon perception of ABA by specialized receptors, the signal is transduced via various groups of Ser/Thr kinases, which phosphorylate the bZIP TFs. Following such post-translational modification of TFs, they are activated so that they bind to specific cis-acting sequences called abscisic-acid-responsive elements (ABREs) or GC-rich coupling elements (CE), thereby influencing the expression of their target downstream genes. Several in silico techniques have been adopted so far to predict the structural features, recognize the regulatory modification sites, undergo phylogenetic analyses, and facilitate genome-wide survey of TF under multiple stresses. Current investigations on the epigenetic regulation that controls greater accessibility of the inducible regions of DNA of the target gene to the bZIP TFs exclusively under stress situations, along with the evolved stress memory responses via genomic imprinting mechanism, have been highlighted. The potentiality of overexpression of bZIP TFs, either in a homologous or in a heterologous background, in generating transgenic plants tolerant to various abiotic stressors have also been addressed by various groups. The present review will provide a coherent documentation on the functional characterization and regulation of bZIP TFs under multiple environmental stresses, with the major goal of generating multiple-stress-tolerant plant cultivars in near future.
Oyarzún, Javiera P; Morís, Joaquín; Luque, David; de Diego-Balaguer, Ruth; Fuentemilla, Lluís
2017-08-09
System memory consolidation is conceptualized as an active process whereby newly encoded memory representations are strengthened through selective memory reactivation during sleep. However, our learning experience is highly overlapping in content (i.e., shares common elements), and memories of these events are organized in an intricate network of overlapping associated events. It remains to be explored whether and how selective memory reactivation during sleep has an impact on these overlapping memories acquired during awake time. Here, we test in a group of adult women and men the prediction that selective memory reactivation during sleep entails the reactivation of associated events and that this may lead the brain to adaptively regulate whether these associated memories are strengthened or pruned from memory networks on the basis of their relative associative strength with the shared element. Our findings demonstrate the existence of efficient regulatory neural mechanisms governing how complex memory networks are shaped during sleep as a function of their associative memory strength. SIGNIFICANCE STATEMENT Numerous studies have demonstrated that system memory consolidation is an active, selective, and sleep-dependent process in which only subsets of new memories become stabilized through their reactivation. However, the learning experience is highly overlapping in content and thus events are encoded in an intricate network of related memories. It remains to be explored whether and how memory reactivation has an impact on overlapping memories acquired during awake time. Here, we show that sleep memory reactivation promotes strengthening and weakening of overlapping memories based on their associative memory strength. These results suggest the existence of an efficient regulatory neural mechanism that avoids the formation of cluttered memory representation of multiple events and promotes stabilization of complex memory networks. Copyright © 2017 the authors 0270-6474/17/377748-11$15.00/0.
Federal Register 2010, 2011, 2012, 2013, 2014
2013-04-03
... Regulatory Research, U.S. Nuclear Regulatory Commission, Washington DC 20555-0001; telephone: 301-251-7445... relevant modeling factors to accompany descriptive material for the one or more models submitted by an..., Division of Engineering, Office of Nuclear Regulatory Research. [FR Doc. 2013-07702 Filed 4-2-13; 8:45 am...
Analytical Prediction of the Seismic Response of a Reinforced Concrete Containment Vessel
DOE Office of Scientific and Technical Information (OSTI.GOV)
James, R.J.; Rashid, Y.R.; Cherry, J.L.
Under the sponsorship of the Ministry of International Trade and Industry (MITI) of Japan, the Nuclear Power Engineering Corporation (NUPEC) is investigating the seismic behavior of a Reinforced Concrete Containment Vessel (RCCV) through scale-model testing using the high-performance shaking table at the Tadotsu Engineering Laboratory. A series of tests representing design-level seismic ground motions was initially conducted to gather valuable experimental measurements for use in design verification. Additional tests will be conducted with increasing amplifications of the seismic input until a structural failure of the test model occurs. In a cooperative program with NUPEC, the US Nuclear Regulatory Commission (USNRC),more » through Sandia National Laboratories (SNL), is conducting analytical research on the seismic behavior of RCCV structures. As part of this program, pretest analytical predictions of the model tests are being performed. The dynamic time-history analysis utilizes a highly detailed concrete constitutive model applied to a three-dimensional finite element representation of the test structure. This paper describes the details of the analysis model and provides analysis results.« less
Transcription Factor Map Alignment of Promoter Regions
Blanco, Enrique; Messeguer, Xavier; Smith, Temple F; Guigó, Roderic
2006-01-01
We address the problem of comparing and characterizing the promoter regions of genes with similar expression patterns. This remains a challenging problem in sequence analysis, because often the promoter regions of co-expressed genes do not show discernible sequence conservation. In our approach, thus, we have not directly compared the nucleotide sequence of promoters. Instead, we have obtained predictions of transcription factor binding sites, annotated the predicted sites with the labels of the corresponding binding factors, and aligned the resulting sequences of labels—to which we refer here as transcription factor maps (TF-maps). To obtain the global pairwise alignment of two TF-maps, we have adapted an algorithm initially developed to align restriction enzyme maps. We have optimized the parameters of the algorithm in a small, but well-curated, collection of human–mouse orthologous gene pairs. Results in this dataset, as well as in an independent much larger dataset from the CISRED database, indicate that TF-map alignments are able to uncover conserved regulatory elements, which cannot be detected by the typical sequence alignments. PMID:16733547
Wilding, Craig S; Smith, Ian; Lynd, Amy; Yawson, Alexander Egyir; Weetman, David; Paine, Mark J I; Donnelly, Martin J
2012-09-01
Although cytochrome P450 (CYP450) enzymes are frequently up-regulated in mosquitoes resistant to insecticides, no regulatory motifs driving these expression differences with relevance to wild populations have been identified. Transposable elements (TEs) are often enriched upstream of those CYP450s involved in insecticide resistance, leading to the assumption that they contribute regulatory motifs that directly underlie the resistance phenotype. A partial CuRE1 (Culex Repetitive Element 1) transposable element is found directly upstream of CYP9M10, a cytochrome P450 implicated previously in larval resistance to permethrin in the ISOP450 strain of Culex quinquefasciatus, but is absent from the equivalent genomic region of a susceptible strain. Via expression of CYP9M10 in Escherichia coli we have now demonstrated time- and NADPH-dependant permethrin metabolism, prerequisites for confirmation of a role in metabolic resistance, and through qPCR shown that CYP9M10 is >20-fold over-expressed in ISOP450 compared to a susceptible strain. In a fluorescent reporter assay the region upstream of CYP9M10 from ISOP450 drove 10× expression compared to the equivalent region (lacking CuRE1) from the susceptible strain. Close correspondence with the gene expression fold-change implicates the upstream region including CuRE1 as a cis-regulatory element involved in resistance. Only a single CuRE1 bearing allele, identical to the CuRE1 bearing allele in the resistant strain, is found throughout Sub-Saharan Africa, in contrast to the diversity encountered in non-CuRE1 alleles. This suggests a single origin and subsequent spread due to selective advantage. CuRE1 is detectable using a simple diagnostic. When applied to C. quinquefasciatus larvae from Ghana we have demonstrated a significant association with permethrin resistance in multiple field sites (mean Odds Ratio = 3.86) suggesting this marker has relevance to natural populations of vector mosquitoes. However, when CuRE1 was excised from the allele used in the reporter assay through fusion PCR, expression was unaffected, indicating that the TE has no direct role in resistance and hence that CuRE1 is acting only as a marker of an as yet unidentified regulatory motif in the association analysis. This suggests that a re-evaluation of the assumption that TEs contribute regulatory motifs involved in gene expression may be necessary. Copyright © 2012 Elsevier Ltd. All rights reserved.
Reddy, Palakolanu Sudhakar; Kavi Kishor, Polavarapu B.; Seiler, Christiane; Kuhlmann, Markus; Eschen-Lippold, Lennart; Lee, Justin; Reddy, Malireddy K.; Sreenivasulu, Nese
2014-01-01
The rapid increase in heat shock proteins upon exposure to damaging stresses and during plant development related to desiccation events reveal their dual importance in plant development and stress tolerance. Genome-wide sequence survey identified 20 non-redundant small heat shock proteins (sHsp) and 22 heat shock factor (Hsf) genes in barley. While all three major classes (A, B, C) of Hsfs are localized in nucleus, the 20 sHsp gene family members are localized in different cell organelles like cytoplasm, mitochondria, plastid and peroxisomes. Hsf and sHsp members are differentially regulated during drought and at different seed developmental stages suggesting the importance of chaperone role under drought as well as seed development. In silico cis-regulatory motif analysis of Hsf promoters showed an enrichment with abscisic acid responsive cis-elements (ABRE), implying regulatory role of ABA in mediating transcriptional response of HvsHsf genes. Gene regulatory network analysis identified HvHsfB2c as potential central regulator of the seed-specific expression of several HvsHsps including 17.5CI sHsp. These results indicate that HvHsfB2c is co-expressed in the central hub of small Hsps and therefore it may be regulating the expression of several HvsHsp subclasses HvHsp16.88-CI, HvHsp17.5-CI and HvHsp17.7-CI. The in vivo relevance of binding specificity of HvHsfB2C transcription factor to HSE-element present in the promoter of HvSHP17.5-CI under heat stress exposure is confirmed by gel shift and LUC-reporter assays. Further, we isolated 477 bp cDNA from barley encoding a 17.5 sHsp polypeptide, which was predominantly upregulated under drought stress treatments and also preferentially expressed in developing seeds. Recombinant HvsHsp17.5-CI protein was expressed in E. coli and purified to homogeneity, which displayed in vitro chaperone activity. The predicted structural model of HvsHsp-17.5-CI protein suggests that the α-crystallin domain is evolutionarily highly conserved. PMID:24594978
Reddy, Palakolanu Sudhakar; Kavi Kishor, Polavarapu B; Seiler, Christiane; Kuhlmann, Markus; Eschen-Lippold, Lennart; Lee, Justin; Reddy, Malireddy K; Sreenivasulu, Nese
2014-01-01
The rapid increase in heat shock proteins upon exposure to damaging stresses and during plant development related to desiccation events reveal their dual importance in plant development and stress tolerance. Genome-wide sequence survey identified 20 non-redundant small heat shock proteins (sHsp) and 22 heat shock factor (Hsf) genes in barley. While all three major classes (A, B, C) of Hsfs are localized in nucleus, the 20 sHsp gene family members are localized in different cell organelles like cytoplasm, mitochondria, plastid and peroxisomes. Hsf and sHsp members are differentially regulated during drought and at different seed developmental stages suggesting the importance of chaperone role under drought as well as seed development. In silico cis-regulatory motif analysis of Hsf promoters showed an enrichment with abscisic acid responsive cis-elements (ABRE), implying regulatory role of ABA in mediating transcriptional response of HvsHsf genes. Gene regulatory network analysis identified HvHsfB2c as potential central regulator of the seed-specific expression of several HvsHsps including 17.5CI sHsp. These results indicate that HvHsfB2c is co-expressed in the central hub of small Hsps and therefore it may be regulating the expression of several HvsHsp subclasses HvHsp16.88-CI, HvHsp17.5-CI and HvHsp17.7-CI. The in vivo relevance of binding specificity of HvHsfB2C transcription factor to HSE-element present in the promoter of HvSHP17.5-CI under heat stress exposure is confirmed by gel shift and LUC-reporter assays. Further, we isolated 477 bp cDNA from barley encoding a 17.5 sHsp polypeptide, which was predominantly upregulated under drought stress treatments and also preferentially expressed in developing seeds. Recombinant HvsHsp17.5-CI protein was expressed in E. coli and purified to homogeneity, which displayed in vitro chaperone activity. The predicted structural model of HvsHsp-17.5-CI protein suggests that the α-crystallin domain is evolutionarily highly conserved.
Basnet, Ram Kumar; Moreno-Pachon, Natalia; Lin, Ke; Bucher, Johan; Visser, Richard G F; Maliepaard, Chris; Bonnema, Guusje
2013-12-01
Brassica seeds are important as basic units of plant growth and sources of vegetable oil. Seed development is regulated by many dynamic metabolic processes controlled by complex networks of spatially and temporally expressed genes. We conducted a global microarray gene co-expression analysis by measuring transcript abundance of developing seeds from two diverse B. rapa morphotypes: a pak choi (leafy-type) and a yellow sarson (oil-type), and two of their doubled haploid (DH) progenies, (1) to study the timing of metabolic processes in developing seeds, (2) to explore the major transcriptional differences in developing seeds of the two morphotypes, and (3) to identify the optimum stage for a genetical genomics study in B. rapa seed. Seed developmental stages were similar in developing seeds of pak choi and yellow sarson of B. rapa; however, the colour of embryo and seed coat differed among these two morphotypes. In this study, most transcriptional changes occurred between 25 and 35 DAP, which shows that the timing of seed developmental processes in B. rapa is at later developmental stages than in the related species B. napus. Using a Weighted Gene Co-expression Network Analysis (WGCNA), we identified 47 "gene modules", of which 27 showed a significant association with temporal and/or genotypic variation. An additional hierarchical cluster analysis identified broad spectra of gene expression patterns during seed development. The predominant variation in gene expression was according to developmental stages rather than morphotype differences. Since lipids are the major storage compounds of Brassica seeds, we investigated in more detail the regulation of lipid metabolism. Four co-regulated gene clusters were identified with 17 putative cis-regulatory elements predicted in their 1000 bp upstream region, either specific or common to different lipid metabolic pathways. This is the first study of genome-wide profiling of transcript abundance during seed development in B. rapa. The identification of key physiological events, major expression patterns, and putative cis-regulatory elements provides useful information to construct gene regulatory networks in B. rapa developing seeds and provides a starting point for a genetical genomics study of seed quality traits.
Transiently disordered tails accelerate folding of globular proteins.
Mallik, Saurav; Ray, Tanaya; Kundu, Sudip
2017-07-01
Numerous biological proteins exhibit intrinsic disorder at their termini, which are associated with multifarious functional roles. Here, we show the surprising result that an increased percentage of terminal short transiently disordered regions with enhanced flexibility (TstDREF) is associated with accelerated folding rates of globular proteins. Evolutionary conservation of predicted disorder at TstDREFs and drastic alteration of folding rates upon point-mutations suggest critical regulatory role(s) of TstDREFs in shaping the folding kinetics. TstDREFs are associated with long-range intramolecular interactions and the percentage of native secondary structural elements physically contacted by TstDREFs exhibit another surprising positive correlation with folding kinetics. These results allow us to infer probable molecular mechanisms behind the TstDREF-mediated regulation of folding kinetics that challenge protein biochemists to assess by direct experimental testing. © 2017 Federation of European Biochemical Societies.
A model of self-regulation for control of chronic disease.
Clark, Noreen M; Gong, Molly; Kaciroti, Niko
2014-10-01
Chronic disease poses increasing threat to individual and community health. The day-to-day manager of disease is the patient who undertakes actions with the guidance of a clinician. The ability of the patient to control the illness through an effective therapeutic plan is significantly influenced by social and behavioral factors. This article presents a model of patient management of chronic disease that accounts for intrapersonal and external influences on management and emphasizes the central role of self-regulatory processes in disease control. Asthma serves as a case for exploration of the model. Findings from a 5-year study of 637 children with asthma and their care-taking parents supported that the self-regulation elements of the model were reasonably stable over time and baseline values were predictive of important disease management outcomes. © 2014 Society for Public Health Education.
Predicting gene regulatory networks of soybean nodulation from RNA-Seq transcriptome data.
Zhu, Mingzhu; Dahmen, Jeremy L; Stacey, Gary; Cheng, Jianlin
2013-09-22
High-throughput RNA sequencing (RNA-Seq) is a revolutionary technique to study the transcriptome of a cell under various conditions at a systems level. Despite the wide application of RNA-Seq techniques to generate experimental data in the last few years, few computational methods are available to analyze this huge amount of transcription data. The computational methods for constructing gene regulatory networks from RNA-Seq expression data of hundreds or even thousands of genes are particularly lacking and urgently needed. We developed an automated bioinformatics method to predict gene regulatory networks from the quantitative expression values of differentially expressed genes based on RNA-Seq transcriptome data of a cell in different stages and conditions, integrating transcriptional, genomic and gene function data. We applied the method to the RNA-Seq transcriptome data generated for soybean root hair cells in three different development stages of nodulation after rhizobium infection. The method predicted a soybean nodulation-related gene regulatory network consisting of 10 regulatory modules common for all three stages, and 24, 49 and 70 modules separately for the first, second and third stage, each containing both a group of co-expressed genes and several transcription factors collaboratively controlling their expression under different conditions. 8 of 10 common regulatory modules were validated by at least two kinds of validations, such as independent DNA binding motif analysis, gene function enrichment test, and previous experimental data in the literature. We developed a computational method to reliably reconstruct gene regulatory networks from RNA-Seq transcriptome data. The method can generate valuable hypotheses for interpreting biological data and designing biological experiments such as ChIP-Seq, RNA interference, and yeast two hybrid experiments.
Dynamic modelling of microRNA regulation during mesenchymal stem cell differentiation.
Weber, Michael; Sotoca, Ana M; Kupfer, Peter; Guthke, Reinhard; van Zoelen, Everardus J
2013-11-12
Network inference from gene expression data is a typical approach to reconstruct gene regulatory networks. During chondrogenic differentiation of human mesenchymal stem cells (hMSCs), a complex transcriptional network is active and regulates the temporal differentiation progress. As modulators of transcriptional regulation, microRNAs (miRNAs) play a critical role in stem cell differentiation. Integrated network inference aimes at determining interrelations between miRNAs and mRNAs on the basis of expression data as well as miRNA target predictions. We applied the NetGenerator tool in order to infer an integrated gene regulatory network. Time series experiments were performed to measure mRNA and miRNA abundances of TGF-beta1+BMP2 stimulated hMSCs. Network nodes were identified by analysing temporal expression changes, miRNA target gene predictions, time series correlation and literature knowledge. Network inference was performed using NetGenerator to reconstruct a dynamical regulatory model based on the measured data and prior knowledge. The resulting model is robust against noise and shows an optimal trade-off between fitting precision and inclusion of prior knowledge. It predicts the influence of miRNAs on the expression of chondrogenic marker genes and therefore proposes novel regulatory relations in differentiation control. By analysing the inferred network, we identified a previously unknown regulatory effect of miR-524-5p on the expression of the transcription factor SOX9 and the chondrogenic marker genes COL2A1, ACAN and COL10A1. Genome-wide exploration of miRNA-mRNA regulatory relationships is a reasonable approach to identify miRNAs which have so far not been associated with the investigated differentiation process. The NetGenerator tool is able to identify valid gene regulatory networks on the basis of miRNA and mRNA time series data.
Critcher, Clayton R; Ferguson, Melissa J
2016-06-01
To effectively self-regulate, people must persevere on tasks that they deem important, regardless of whether those tasks are enjoyable. Building on past work that has noted the fundamental role of implicit cognition in guiding effective self-regulation, the present paper tests whether an implicit association between goal means and importance predicts self-regulatory persistence and success. Implicit importance predicted markers of effective self-regulation-better grades, more studying and exercise, and stronger standardized testing performance-over and above, and often better than, explicit beliefs about the importance of that self-regulation, as well as implicit evaluations of those means. In particular, those for whom tasks were fairly taxing to complete (i.e., those for whom this self-regulation required effortful self-control) were those who most benefitted from the implicit association between means and importance. Moreover, when participants were reminded of recent self-regulatory failure that they believed could be overcome through hard work, implicit importance toward the means increased as if to prepare them to achieve self-regulatory persistence. A final study sought to reconcile the present findings with previous work showing the key role that implicit evaluations play in effective self-regulation. We reasoned that means are important precisely because they are associated with valued end-states. Consistent with this account, implicit evaluations of end-states predicted the implicit importance of means, which in turn predicted effective self-regulation. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Regulatory gene networks and the properties of the developmental process
NASA Technical Reports Server (NTRS)
Davidson, Eric H.; McClay, David R.; Hood, Leroy
2003-01-01
Genomic instructions for development are encoded in arrays of regulatory DNA. These specify large networks of interactions among genes producing transcription factors and signaling components. The architecture of such networks both explains and predicts developmental phenomenology. Although network analysis is yet in its early stages, some fundamental commonalities are already emerging. Two such are the use of multigenic feedback loops to ensure the progressivity of developmental regulatory states and the prevalence of repressive regulatory interactions in spatial control processes. Gene regulatory networks make it possible to explain the process of development in causal terms and eventually will enable the redesign of developmental regulatory circuitry to achieve different outcomes.