Science.gov

Sample records for regulatory sequence common

  1. Coordinate cytokine regulatory sequences

    DOEpatents

    Frazer, Kelly A.; Rubin, Edward M.; Loots, Gabriela G.

    2005-05-10

    The present invention provides CNS sequences that regulate the cytokine gene expression, expression cassettes and vectors comprising or lacking the CNS sequences, host cells and non-human transgenic animals comprising the CNS sequences or lacking the CNS sequences. The present invention also provides methods for identifying compounds that modulate the functions of CNS sequences as well as methods for diagnosing defects in the CNS sequences of patients.

  2. Distinct gene expression patterns in skeletal and cardiac muscle are dependent on common regulatory sequences in the MLC1/3 locus.

    PubMed Central

    McGrew, M J; Bogdanova, N; Hasegawa, K; Hughes, S H; Kitsis, R N; Rosenthal, N

    1996-01-01

    The myosin light-chain 1/3 locus (MLC1/3) is regulated by two promoters and a downstream enhancer element which produce two protein isoforms in fast skeletal muscle at distinct stages of mouse embryogenesis. We have analyzed the expression of transcripts from the internal MLC3 promoter and determined that it is also expressed in the atria of the heart. Expression from the MLC3 promoter in these striated muscle lineages is differentially regulated during development. In transgenic mice, the MLC3 promoter is responsible for cardiac-specific reporter gene expression while the downstream enhancer augments expression in skeletal muscle. Examination of the methylation status of endogenous and transgenic promoter and enhancer elements indicates that the internal promoter is not regulated in a manner similar to that of the MLC1 promoter or the downstream enhancer. A GATA protein consensus sequence in the proximal MLC3 promoter but not the MLC1 promoter binds with high affinity to GATA-4, a cardiac muscle- and gut-specific transcription factor. Mutation of either the MEF2 or GATA motifs in the MLC3 promoter attenuates its activity in both heart and skeletal muscles, demonstrating that MLC3 expression in these two diverse muscle types is dependent on common regulatory elements. PMID:8754853

  3. RSAT: regulatory sequence analysis tools.

    PubMed

    Thomas-Chollier, Morgane; Sand, Olivier; Turatsinze, Jean-Valéry; Janky, Rekin's; Defrance, Matthieu; Vervisch, Eric; Brohée, Sylvain; van Helden, Jacques

    2008-07-01

    The regulatory sequence analysis tools (RSAT, http://rsat.ulb.ac.be/rsat/) is a software suite that integrates a wide collection of modular tools for the detection of cis-regulatory elements in genome sequences. The suite includes programs for sequence retrieval, pattern discovery, phylogenetic footprint detection, pattern matching, genome scanning and feature map drawing. Random controls can be performed with random gene selections or by generating random sequences according to a variety of background models (Bernoulli, Markov). Beyond the original word-based pattern-discovery tools (oligo-analysis and dyad-analysis), we recently added a battery of tools for matrix-based detection of cis-acting elements, with some original features (adaptive background models, Markov-chain estimation of P-values) that do not exist in other matrix-based scanning tools. The web server offers an intuitive interface, where each program can be accessed either separately or connected to the other tools. In addition, the tools are now available as web services, enabling their integration in programmatic workflows. Genomes are regularly updated from various genome repositories (NCBI and EnsEMBL) and 682 organisms are currently supported. Since 1998, the tools have been used by several hundreds of researchers from all over the world. Several predictions made with RSAT were validated experimentally and published. PMID:18495751

  4. RSAT 2015: Regulatory Sequence Analysis Tools.

    PubMed

    Medina-Rivera, Alejandra; Defrance, Matthieu; Sand, Olivier; Herrmann, Carl; Castro-Mondragon, Jaime A; Delerce, Jeremy; Jaeger, Sébastien; Blanchet, Christophe; Vincens, Pierre; Caron, Christophe; Staines, Daniel M; Contreras-Moreira, Bruno; Artufel, Marie; Charbonnier-Khamvongsa, Lucie; Hernandez, Céline; Thieffry, Denis; Thomas-Chollier, Morgane; van Helden, Jacques

    2015-07-01

    RSAT (Regulatory Sequence Analysis Tools) is a modular software suite for the analysis of cis-regulatory elements in genome sequences. Its main applications are (i) motif discovery, appropriate to genome-wide data sets like ChIP-seq, (ii) transcription factor binding motif analysis (quality assessment, comparisons and clustering), (iii) comparative genomics and (iv) analysis of regulatory variations. Nine new programs have been added to the 43 described in the 2011 NAR Web Software Issue, including a tool to extract sequences from a list of coordinates (fetch-sequences from UCSC), novel programs dedicated to the analysis of regulatory variants from GWAS or population genomics (retrieve-variation-seq and variation-scan), a program to cluster motifs and visualize the similarities as trees (matrix-clustering). To deal with the drastic increase of sequenced genomes, RSAT public sites have been reorganized into taxon-specific servers. The suite is well-documented with tutorials and published protocols. The software suite is available through Web sites, SOAP/WSDL Web services, virtual machines and stand-alone programs at http://www.rsat.eu/. PMID:25904632

  5. RSAT 2015: Regulatory Sequence Analysis Tools

    PubMed Central

    Medina-Rivera, Alejandra; Defrance, Matthieu; Sand, Olivier; Herrmann, Carl; Castro-Mondragon, Jaime A.; Delerce, Jeremy; Jaeger, Sébastien; Blanchet, Christophe; Vincens, Pierre; Caron, Christophe; Staines, Daniel M.; Contreras-Moreira, Bruno; Artufel, Marie; Charbonnier-Khamvongsa, Lucie; Hernandez, Céline; Thieffry, Denis; Thomas-Chollier, Morgane; van Helden, Jacques

    2015-01-01

    RSAT (Regulatory Sequence Analysis Tools) is a modular software suite for the analysis of cis-regulatory elements in genome sequences. Its main applications are (i) motif discovery, appropriate to genome-wide data sets like ChIP-seq, (ii) transcription factor binding motif analysis (quality assessment, comparisons and clustering), (iii) comparative genomics and (iv) analysis of regulatory variations. Nine new programs have been added to the 43 described in the 2011 NAR Web Software Issue, including a tool to extract sequences from a list of coordinates (fetch-sequences from UCSC), novel programs dedicated to the analysis of regulatory variants from GWAS or population genomics (retrieve-variation-seq and variation-scan), a program to cluster motifs and visualize the similarities as trees (matrix-clustering). To deal with the drastic increase of sequenced genomes, RSAT public sites have been reorganized into taxon-specific servers. The suite is well-documented with tutorials and published protocols. The software suite is available through Web sites, SOAP/WSDL Web services, virtual machines and stand-alone programs at http://www.rsat.eu/. PMID:25904632

  6. Integrating sequence, evolution and functional genomics in regulatory genomics

    PubMed Central

    Vingron, Martin; Brazma, Alvis; Coulson, Richard; van Helden, Jacques; Manke, Thomas; Palin, Kimmo; Sand, Olivier; Ukkonen, Esko

    2009-01-01

    With genome analysis expanding from the study of genes to the study of gene regulation, 'regulatory genomics' utilizes sequence information, evolution and functional genomics measurements to unravel how regulatory information is encoded in the genome. PMID:19226437

  7. Conventional and Regulatory CD4+ T Cells That Share Identical TCRs Are Derived from Common Clones

    PubMed Central

    Emerson, Ryan O.; Pingel, Jeanette; Buller, R. Mark; DiPaolo, Richard J.

    2016-01-01

    Results from studies comparing the diversity and specificity of the TCR repertoires expressed by conventional (Tconv) and regulatory (Treg) CD4+ T cell have varied depending on the experimental system employed. We developed a new model in which T cells express a single fixed TCRα chain, randomly rearranged endogenous TCRβ chains, and a Foxp3-GFP reporter. We purified CD4+Foxp3- and CD4+Foxp3+ cells, then performed biased controlled multiplex PCR and high throughput sequencing of endogenous TCRβ chains. We identified >7,000 different TCRβ sequences in the periphery of 5 individual mice. On average, ~12% of TCR sequences were expressed by both conventional and regulatory populations within individual mice. The CD4+ T cells that expressed shared TCR sequences were present at higher frequencies compared to T cells expressing non-shared TCRs. Furthermore, nearly all (>90%) of the TCR sequences that were shared within mice were identical at the DNA sequence level, indicating that conventional and regulatory T cells that express shared TCRs are derived from common clones. Analysis of TCR repertoire overlap in the thymus reveals that a large proportion of Tconv and Treg sharing observed in the periphery is due to clonal expansion in the thymus. Together these data show that there are a limited number of TCR sequences shared between Tconv and Tregs. Also, Tconv and Tregs sharing identical TCRs are found at relatively high frequencies and are derived from common progenitors, of which a large portion are generated in the thymus. PMID:27100298

  8. Inference of Splicing Regulatory Activities by Sequence Neighborhood Analysis

    PubMed Central

    Stadler, Michael B; Shomron, Noam; Yeo, Gene W; Schneider, Aniket; Xiao, Xinshu; Burge, Christopher B

    2006-01-01

    Sequence-specific recognition of nucleic-acid motifs is critical to many cellular processes. We have developed a new and general method called Neighborhood Inference (NI) that predicts sequences with activity in regulating a biochemical process based on the local density of known sites in sequence space. Applied to the problem of RNA splicing regulation, NI was used to predict hundreds of new exonic splicing enhancer (ESE) and silencer (ESS) hexanucleotides from known human ESEs and ESSs. These predictions were supported by cross-validation analysis, by analysis of published splicing regulatory activity data, by sequence-conservation analysis, and by measurement of the splicing regulatory activity of 24 novel predicted ESEs, ESSs, and neutral sequences using an in vivo splicing reporter assay. These results demonstrate the ability of NI to accurately predict splicing regulatory activity and show that the scope of exonic splicing regulatory elements is substantially larger than previously anticipated. Analysis of orthologous exons in four mammals showed that the NI score of ESEs, a measure of function, is much more highly conserved above background than ESE primary sequence. This observation indicates a high degree of selection for ESE activity in mammalian exons, with surprisingly frequent interchangeability between ESE sequences. PMID:17121466

  9. Detection of Weakly Conserved Ancestral Mammalian RegulatorySequences by Primate Comparisons

    SciTech Connect

    Wang, Qian-fei; Prabhakar, Shyam; Chanan, Sumita; Cheng,Jan-Fang; Rubin, Edward M.; Boffelli, Dario

    2006-06-01

    Genomic comparisons between human and distant, non-primatemammals are commonly used to identify cis-regulatory elements based onconstrained sequence evolution. However, these methods fail to detectcryptic functional elements, which are too weakly conserved among mammalsto distinguish from nonfunctional DNA. To address this problem, weexplored the potential of deep intra-primate sequence comparisons. Wesequenced the orthologs of 558 kb of human genomic sequence, coveringmultiple loci involved in cholesterol homeostasis, in 6 nonhumanprimates. Our analysis identified 6 noncoding DNA elements displayingsignificant conservation among primates, but undetectable in more distantcomparisons. In vitro and in vivo tests revealed that at least three ofthese 6 elements have regulatory function. Notably, the mouse orthologsof these three functional human sequences had regulatory activity despitetheir lack of significant sequence conservation, indicating that they arecryptic ancestral cis-regulatory elements. These regulatory elementscould still be detected in a smaller set of three primate speciesincluding human, rhesus and marmoset. Since the human and rhesus genomesequences are already available, and the marmoset genome is activelybeing sequenced, the primate-specific conservation analysis describedhere can be applied in the near future on a whole-genome scale, tocomplement the annotation provided by more distant speciescomparisons.

  10. Modeling DNA sequence-based cis-regulatory gene networks.

    PubMed

    Bolouri, Hamid; Davidson, Eric H

    2002-06-01

    Gene network analysis requires computationally based models which represent the functional architecture of regulatory interactions, and which provide directly testable predictions. The type of model that is useful is constrained by the particular features of developmentally active cis-regulatory systems. These systems function by processing diverse regulatory inputs, generating novel regulatory outputs. A computational model which explicitly accommodates this basic concept was developed earlier for the cis-regulatory system of the endo16 gene of the sea urchin. This model represents the genetically mandated logic functions that the system executes, but also shows how time-varying kinetic inputs are processed in different circumstances into particular kinetic outputs. The same basic design features can be utilized to construct models that connect the large number of cis-regulatory elements constituting developmental gene networks. The ultimate aim of the network models discussed here is to represent the regulatory relationships among the genomic control systems of the genes in the network, and to state their functional meaning. The target site sequences of the cis-regulatory elements of these genes constitute the physical basis of the network architecture. Useful models for developmental regulatory networks must represent the genetic logic by which the system operates, but must also be capable of explaining the real time dynamics of cis-regulatory response as kinetic input and output data become available. Most importantly, however, such models must display in a direct and transparent manner fundamental network design features such as intra- and intercellular feedback circuitry; the sources of parallel inputs into each cis-regulatory element; gene battery organization; and use of repressive spatial inputs in specification and boundary formation. Successful network models lead to direct tests of key architectural features by targeted cis-regulatory analysis. PMID

  11. Identification of DVA Interneuron Regulatory Sequences in Caenorhabditis elegans

    PubMed Central

    Puckett Robinson, Carmie; Schwarz, Erich M.; Sternberg, Paul W.

    2013-01-01

    Background The identity of each neuron is determined by the expression of a distinct group of genes comprising its terminal gene battery. The regulatory sequences that control the expression of such terminal gene batteries in individual neurons is largely unknown. The existence of a complete genome sequence for C. elegans and draft genomes of other nematodes let us use comparative genomics to identify regulatory sequences directing expression in the DVA interneuron. Methodology/Principal Findings Using phylogenetic comparisons of multiple Caenorhabditis species, we identified conserved non-coding sequences in 3 of 10 genes (fax-1, nmr-1, and twk-16) that direct expression of reporter transgenes in DVA and other neurons. The conserved region and flanking sequences in an 85-bp intronic region of the twk-16 gene directs highly restricted expression in DVA. Mutagenesis of this 85 bp region shows that it has at least four regions. The central 53 bp region contains a 29 bp region that represses expression and a 24 bp region that drives broad neuronal expression. Two short flanking regions restrict expression of the twk-16 gene to DVA. A shared GA-rich motif was identified in three of these genes but had opposite effects on expression when mutated in the nmr-1 and twk-16 DVA regulatory elements. Conclusions/Significance We identified by multi-species conservation regulatory regions within three genes that direct expression in the DVA neuron. We identified four contiguous regions of sequence of the twk-16 gene enhancer with positive and negative effects on expression, which combined to restrict expression to the DVA neuron. For this neuron a single binding site may thus not achieve sufficient specificity for cell specific expression. One of the positive elements, an 8-bp sequence required for expression was identified in silico by sequence comparisons of seven nematode species, demonstrating the potential resolution of expanded multi-species phylogenetic comparisons. PMID

  12. Using deep sequencing to characterize the biophysical mechanism of a transcriptional regulatory sequence

    PubMed Central

    Kinney, Justin B.; Murugan, Anand; Callan, Curtis G.; Cox, Edward C.

    2010-01-01

    Cells use protein-DNA and protein-protein interactions to regulate transcription. A biophysical understanding of this process has, however, been limited by the lack of methods for quantitatively characterizing the interactions that occur at specific promoters and enhancers in living cells. Here we show how such biophysical information can be revealed by a simple experiment in which a library of partially mutated regulatory sequences are partitioned according to their in vivo transcriptional activities and then sequenced en masse. Computational analysis of the sequence data produced by this experiment can provide precise quantitative information about how the regulatory proteins at a specific arrangement of binding sites work together to regulate transcription. This ability to reliably extract precise information about regulatory biophysics in the face of experimental noise is made possible by a recently identified relationship between likelihood and mutual information. Applying our experimental and computational techniques to the Escherichia coli lac promoter, we demonstrate the ability to identify regulatory protein binding sites de novo, determine the sequence-dependent binding energy of the proteins that bind these sites, and, importantly, measure the in vivo interaction energy between RNA polymerase and a DNA-bound transcription factor. Our approach provides a generally applicable method for characterizing the biophysical basis of transcriptional regulation by a specified regulatory sequence. The principles of our method can also be applied to a wide range of other problems in molecular biology. PMID:20439748

  13. Genome-wide identification of conserved regulatory function in diverged sequences

    PubMed Central

    Taher, Leila; McGaughey, David M.; Maragh, Samantha; Aneas, Ivy; Bessling, Seneca L.; Miller, Webb; Nobrega, Marcelo A.; McCallion, Andrew S.; Ovcharenko, Ivan

    2011-01-01

    Plasticity of gene regulatory encryption can permit DNA sequence divergence without loss of function. Functional information is preserved through conservation of the composition of transcription factor binding sites (TFBS) in a regulatory element. We have developed a method that can accurately identify pairs of functional noncoding orthologs at evolutionarily diverged loci by searching for conserved TFBS arrangements. With an estimated 5% false-positive rate (FPR) in approximately 3000 human and zebrafish syntenic loci, we detected approximately 300 pairs of diverged elements that are likely to share common ancestry and have similar regulatory activity. By analyzing a pool of experimentally validated human enhancers, we demonstrated that 7/8 (88%) of their predicted functional orthologs retained in vivo regulatory control. Moreover, in 5/7 (71%) of assayed enhancer pairs, we observed concordant expression patterns. We argue that TFBS composition is often necessary to retain and sufficient to predict regulatory function in the absence of overt sequence conservation, revealing an entire class of functionally conserved, evolutionarily diverged regulatory elements that we term “covert.” PMID:21628450

  14. Identification of potential regulatory motifs in odorant receptor genes by analysis of promoter sequences

    PubMed Central

    Michaloski, Jussara S.; Galante, Pedro A.F.

    2006-01-01

    Mouse odorant receptors (ORs) are encoded by >1000 genes dispersed throughout the genome. Each olfactory neuron expresses one single OR gene, while the rest of the genes remain silent. The mechanisms underlying OR gene expression are poorly understood. Here, we investigated if OR genes share common cis-regulatory sequences in their promoter regions. We carried out a comprehensive analysis in which the upstream regions of a large number of OR genes were compared. First, using RLM-RACE, we generated cDNAs containing the complete 5′-untranslated regions (5′-UTRs) for a total number of 198 mouse OR genes. Then, we aligned these cDNA sequences to the mouse genome so that the 5′ structure and transcription start sites (TSSs) of the OR genes could be precisely determined. Sequences upstream of the TSSs were retrieved and browsed for common elements. We found DNA sequence motifs that are overrepresented in the promoter regions of the OR genes. Most motifs resemble O/E-like sites and are preferentially localized within 200 bp upstream of the TSSs. Finally, we show that these motifs specifically interact with proteins extracted from nuclei prepared from the olfactory epithelium, but not from brain or liver. Our results show that the OR genes share common promoter elements. The present strategy should provide information on the role played by cis-regulatory sequences in OR gene regulation. PMID:16902085

  15. Discovering common stem–loop motifs in unaligned RNA sequences

    PubMed Central

    Gorodkin, Jan; Stricklin, Shawn L.; Stormo, Gary D.

    2001-01-01

    Post-transcriptional regulation of gene expression is often accomplished by proteins binding to specific sequence motifs in mRNA molecules, to affect their translation or stability. The motifs are often composed of a combination of sequence and structural constraints such that the overall structure is preserved even though much of the primary sequence is variable. While several methods exist to discover transcriptional regulatory sites in the DNA sequences of coregulated genes, the RNA motif discovery problem is much more difficult because of covariation in the positions. We describe the combined use of two approaches for RNA structure prediction, FOLDALIGN and COVE, that together can discover and model stem–loop RNA motifs in unaligned sequences, such as UTRs from post-transcriptionally coregulated genes. We evaluate the method on two datasets, one a section of rRNA genes with randomly truncated ends so that a global alignment is not possible, and the other a hyper-variable collection of IRE-like elements that were inserted into randomized UTR sequences. In both cases the combined method identified the motifs correctly, and in the rRNA example we show that it is capable of determining the structure, which includes bulge and internal loops as well as a variable length hairpin loop. Those automated results are quantitatively evaluated and found to agree closely with structures contained in curated databases, with correlation coefficients up to 0.9. A basic server, Stem–Loop Align SearcH (SLASH), which will perform stem–loop searches in unaligned RNA sequences, is available at http://www.bioinf.au.dk/slash/. PMID:11353083

  16. Complete MHC haplotype sequencing for common disease gene mapping.

    PubMed

    Stewart, C Andrew; Horton, Roger; Allcock, Richard J N; Ashurst, Jennifer L; Atrazhev, Alexey M; Coggill, Penny; Dunham, Ian; Forbes, Simon; Halls, Karen; Howson, Joanna M M; Humphray, Sean J; Hunt, Sarah; Mungall, Andrew J; Osoegawa, Kazutoyo; Palmer, Sophie; Roberts, Anne N; Rogers, Jane; Sims, Sarah; Wang, Yu; Wilming, Laurens G; Elliott, John F; de Jong, Pieter J; Sawcer, Stephen; Todd, John A; Trowsdale, John; Beck, Stephan

    2004-06-01

    The future systematic mapping of variants that confer susceptibility to common diseases requires the construction of a fully informative polymorphism map. Ideally, every base pair of the genome would be sequenced in many individuals. Here, we report 4.75 Mb of contiguous sequence for each of two common haplotypes of the major histocompatibility complex (MHC), to which susceptibility to >100 diseases has been mapped. The autoimmune disease-associated-haplotypes HLA-A3-B7-Cw7-DR15 and HLA-A1-B8-Cw7-DR3 were sequenced in their entirety through a bacterial artificial chromosome (BAC) cloning strategy using the consanguineous cell lines PGF and COX, respectively. The two sequences were annotated to encompass all described splice variants of expressed genes. We defined the complete variation content of the two haplotypes, revealing >18,000 variations between them. Average SNP densities ranged from less than one SNP per kilobase to >60. Acquisition of complete and accurate sequence data over polymorphic regions such as the MHC from large-insert cloned DNA provides a definitive resource for the construction of informative genetic maps, and avoids the limitation of chromosome regions that are refractory to PCR amplification. PMID:15140828

  17. Interrogating Transcriptional Regulatory Sequences in Tol2-Mediated Xenopus Transgenics

    PubMed Central

    Loots, Gabriela G.; Bergmann, Anne; Hum, Nicholas R.; Oldenburg, Catherine E.; Wills, Andrea E.; Hu, Na; Ovcharenko, Ivan; Harland, Richard M.

    2013-01-01

    Identifying gene regulatory elements and their target genes in vertebrates remains a significant challenge. It is now recognized that transcriptional regulatory sequences are critical in orchestrating dynamic controls of tissue-specific gene expression during vertebrate development and in adult tissues, and that these elements can be positioned at great distances in relation to the promoters of the genes they control. While significant progress has been made in mapping DNA binding regions by combining chromatin immunoprecipitation and next generation sequencing, functional validation remains a limiting step in improving our ability to correlate in silico predictions with biological function. We recently developed a computational method that synergistically combines genome-wide gene-expression profiling, vertebrate genome comparisons, and transcription factor binding-site analysis to predict tissue-specific enhancers in the human genome. We applied this method to 270 genes highly expressed in skeletal muscle and predicted 190 putative cis-regulatory modules. Furthermore, we optimized Tol2 transgenic constructs in Xenopus laevis to interrogate 20 of these elements for their ability to function as skeletal muscle-specific transcriptional enhancers during embryonic development. We found 45% of these elements expressed only in the fast muscle fibers that are oriented in highly organized chevrons in the Xenopus laevis tadpole. Transcription factor binding site analysis identified >2 Mef2/MyoD sites within ∼200 bp regions in 6 of the validated enhancers, and systematic mutagenesis of these sites revealed that they are critical for the enhancer function. The data described herein introduces a new reporter system suitable for interrogating tissue-specific cis-regulatory elements which allows monitoring of enhancer activity in real time, throughout early stages of embryonic development, in Xenopus. PMID:23874664

  18. Organisation of regulatory elements in two closely spaced Drosophila genes with common expression characteristics.

    PubMed

    Gigliotti, S; Balz, V; Malva, C; Schäfer, M A

    1997-11-01

    Sperm tail proteins that are components of a specific structure formed late during spermatid elongation have been found to be encoded by the Mst(3)CGP gene family. These genes have been demonstrated to be regulated both at the transcriptional as well as at the translational level. We report here on the dissection of the regulatory regions for two members of the gene family, Mst84Da and Mst84Db. While high level transcription and negative translational control of Mst84Da is mediated by a short gene segment of 205 nt (-152/+53), Mst84Db expression is controlled by a number of distinct regulatory elements with different effects that all reside within the gene itself. We identify a transcriptional control element between +154 and +216, a translational repression element around +216 to +275 and an RNA stability element within the 3'UTR. Irrespective of the final common expression characteristics, correct regulation for any individual member of the gene family seems to be achieved by very different means. This confirms earlier observations that did not detect any other sequence elements in common apart from the TCE (translational control element). PMID:9431808

  19. Properties of Sequence Conservation in Upstream Regulatory and Protein Coding Sequences among Paralogs in Arabidopsis thaliana

    NASA Astrophysics Data System (ADS)

    Richardson, Dale N.; Wiehe, Thomas

    Whole genome duplication (WGD) has catalyzed the formation of new species, genes with novel functions, altered expression patterns, complexified signaling pathways and has provided organisms a level of genetic robustness. We studied the long-term evolution and interrelationships of 5’ upstream regulatory sequences (URSs), protein coding sequences (CDSs) and expression correlations (EC) of duplicated gene pairs in Arabidopsis. Three distinct methods revealed significant evolutionary conservation between paralogous URSs and were highly correlated with microarray-based expression correlation of the respective gene pairs. Positional information on exact matches between sequences unveiled the contribution of micro-chromosomal rearrangements on expression divergence. A three-way rank analysis of URS similarity, CDS divergence and EC uncovered specific gene functional biases. Transcription factor activity was associated with gene pairs exhibiting conserved URSs and divergent CDSs, whereas a broad array of metabolic enzymes was found to be associated with gene pairs showing diverged URSs but conserved CDSs.

  20. Sex determination strategies in 2012: towards a common regulatory model?

    PubMed Central

    2012-01-01

    Sex determination is a complicated process involving large-scale modifications in gene expression affecting virtually every tissue in the body. Although the evolutionary origin of sex remains controversial, there is little doubt that it has developed as a process of optimizing metabolic control, as well as developmental and reproductive functions within a given setting of limited resources and environmental pressure. Evidence from various model organisms supports the view that sex determination may occur as a result of direct environmental induction or genetic regulation. The first process has been well documented in reptiles and fish, while the second is the classic case for avian species and mammals. Both of the latter have developed a variety of sex-specific/sex-related genes, which ultimately form a complete chromosome pair (sex chromosomes/gonosomes). Interestingly, combinations of environmental and genetic mechanisms have been described among different classes of animals, thus rendering the possibility of a unidirectional continuous evolutionary process from the one type of mechanism to the other unlikely. On the other hand, common elements appear throughout the animal kingdom, with regard to a) conserved key genes and b) a central role of sex steroid control as a prerequisite for ultimately normal sex differentiation. Studies in invertebrates also indicate a role of epigenetic chromatin modification, particularly with regard to alternative splicing options. This review summarizes current evidence from research in this hot field and signifies the need for further study of both normal hormonal regulators of sexual phenotype and patterns of environmental disruption. PMID:22357269

  1. Massive contribution of transposable elements to mammalian regulatory sequences.

    PubMed

    Rayan, Nirmala Arul; Del Rosario, Ricardo C H; Prabhakar, Shyam

    2016-09-01

    Barbara McClintock discovered the existence of transposable elements (TEs) in the late 1940s and initially proposed that they contributed to the gene regulatory program of higher organisms. This controversial idea gained acceptance only much later in the 1990s, when the first examples of TE-derived promoter sequences were uncovered. It is now known that half of the human genome is recognizably derived from TEs. It is thus important to understand the scope and nature of their contribution to gene regulation. Here, we provide a timeline of major discoveries in this area and discuss how transposons have revolutionized our understanding of mammalian genomes, with a special emphasis on the massive contribution of TEs to primate evolution. Our analysis of primate-specific functional elements supports a simple model for the rate at which new functional elements arise in unique and TE-derived DNA. Finally, we discuss some of the challenges and unresolved questions in the field, which need to be addressed in order to fully characterize the impact of TEs on gene regulation, evolution and disease processes. PMID:27174439

  2. Noninvasive prenatal diagnosis of common aneuploidies by semiconductor sequencing.

    PubMed

    Liao, Can; Yin, Ai-hua; Peng, Chun-fang; Fu, Fang; Yang, Jie-xia; Li, Ru; Chen, Yang-yi; Luo, Dong-hong; Zhang, Yong-ling; Ou, Yan-mei; Li, Jian; Wu, Jing; Mai, Ming-qin; Hou, Rui; Wu, Frances; Luo, Hongrong; Li, Dong-zhi; Liu, Hai-liang; Zhang, Xiao-zhuang; Zhang, Kang

    2014-05-20

    Massively parallel sequencing (MPS) of cell-free fetal DNA from maternal plasma has revolutionized our ability to perform noninvasive prenatal diagnosis. This approach avoids the risk of fetal loss associated with more invasive diagnostic procedures. The present study developed an effective method for noninvasive prenatal diagnosis of common chromosomal aneuploidies using a benchtop semiconductor sequencing platform (SSP), which relies on the MPS platform but offers advantages over existing noninvasive screening techniques. A total of 2,275 pregnant subjects was included in the study; of these, 515 subjects who had full karyotyping results were used in a retrospective analysis, and 1,760 subjects without karyotyping were analyzed in a prospective study. In the retrospective study, all 55 fetal trisomy 21 cases were identified using the SSP with a sensitivity and specificity of 99.94% and 99.46%, respectively. The SSP also detected 16 trisomy 18 cases with 100% sensitivity and 99.24% specificity and 3 trisomy 13 cases with 100% sensitivity and 100% specificity. Furthermore, 15 fetuses with sex chromosome aneuploidies (10 45,X, 2 47,XYY, 2 47,XXX, and 1 47,XXY) were detected. In the prospective study, nine fetuses with trisomy 21, three with trisomy 18, three with trisomy 13, and one with 45,X were detected. To our knowledge, this is the first large-scale clinical study to systematically identify chromosomal aneuploidies based on cell-free fetal DNA using the SSP and provides an effective strategy for large-scale noninvasive screening for chromosomal aneuploidies in a clinical setting. PMID:24799683

  3. Genetic Diagnosis Using Whole Exome Sequencing in Common Variable Immunodeficiency

    PubMed Central

    Maffucci, Patrick; Filion, Charles A.; Boisson, Bertrand; Itan, Yuval; Shang, Lei; Casanova, Jean-Laurent; Cunningham-Rundles, Charlotte

    2016-01-01

    Whole exome sequencing (WES) has proven an effective tool for the discovery of genetic defects in patients with primary immunodeficiencies (PIDs). However, success in dissecting the genetic etiology of common variable immunodeficiency (CVID) has been limited. We outline a practical framework for using WES to identify causative genetic defects in these subjects. WES was performed on 50 subjects diagnosed with CVID who had at least one of the following criteria: early onset, autoimmune/inflammatory manifestations, low B lymphocytes, and/or familial history of hypogammaglobulinemia. Following alignment and variant calling, exomes were screened for mutations in 269 PID-causing genes. Variants were filtered based on the mode of inheritance and reported frequency in the general population. Each variant was assessed by study of familial segregation and computational predictions of deleteriousness. Out of 433 variations in PID-associated genes, we identified 17 probable disease-causing mutations in 15 patients (30%). These variations were rare or private and included monoallelic mutations in NFKB1, STAT3, CTLA4, PIK3CD, and IKZF1, and biallelic mutations in LRBA and STXBP2. Forty-two other damaging variants were found but were not considered likely disease-causing based on the mode of inheritance and/or patient phenotype. WES combined with analysis of PID-associated genes is a cost-effective approach to identify disease-causing mutations in CVID patients with severe phenotypes and was successful in 30% of our cohort. As targeted therapeutics are becoming the mainstay of treatment for non-infectious manifestations in CVID, this approach will improve management of patients with more severe phenotypes. PMID:27379089

  4. Genetic Diagnosis Using Whole Exome Sequencing in Common Variable Immunodeficiency.

    PubMed

    Maffucci, Patrick; Filion, Charles A; Boisson, Bertrand; Itan, Yuval; Shang, Lei; Casanova, Jean-Laurent; Cunningham-Rundles, Charlotte

    2016-01-01

    Whole exome sequencing (WES) has proven an effective tool for the discovery of genetic defects in patients with primary immunodeficiencies (PIDs). However, success in dissecting the genetic etiology of common variable immunodeficiency (CVID) has been limited. We outline a practical framework for using WES to identify causative genetic defects in these subjects. WES was performed on 50 subjects diagnosed with CVID who had at least one of the following criteria: early onset, autoimmune/inflammatory manifestations, low B lymphocytes, and/or familial history of hypogammaglobulinemia. Following alignment and variant calling, exomes were screened for mutations in 269 PID-causing genes. Variants were filtered based on the mode of inheritance and reported frequency in the general population. Each variant was assessed by study of familial segregation and computational predictions of deleteriousness. Out of 433 variations in PID-associated genes, we identified 17 probable disease-causing mutations in 15 patients (30%). These variations were rare or private and included monoallelic mutations in NFKB1, STAT3, CTLA4, PIK3CD, and IKZF1, and biallelic mutations in LRBA and STXBP2. Forty-two other damaging variants were found but were not considered likely disease-causing based on the mode of inheritance and/or patient phenotype. WES combined with analysis of PID-associated genes is a cost-effective approach to identify disease-causing mutations in CVID patients with severe phenotypes and was successful in 30% of our cohort. As targeted therapeutics are becoming the mainstay of treatment for non-infectious manifestations in CVID, this approach will improve management of patients with more severe phenotypes. PMID:27379089

  5. Novel players in the AP2-miR172 regulatory network for common bean nodulation

    PubMed Central

    Íñiguez, Luis P; Nova-Franco, Bárbara; Hernández, Georgina

    2015-01-01

    The intricate regulatory network for floral organogenesis in plants that includes AP2/ERF, SPL and AGL transcription factors, miR172 and miR156 along with other components is well documented, though its complexity and size keep increasing. The miR172/AP2 node was recently proposed as essential regulator in the legume-rhizobia nitrogen-fixing symbiosis. Research from our group contributed to demonstrate the control of common bean (Phaseolus vulgaris) nodulation by miR172c/AP2-1, however no other components of such regulatory network have been reported. Here we propose AGLs as new protagonists in the regulation of common bean nodulation and discuss the relevance of future deeper analysis of the complex AP2 regulatory network for nodule organogenesis in legumes. PMID:26211831

  6. Boosting heterologous protein production in transgenic dicotyledonous seeds using Phaseolus vulgaris regulatory sequences.

    PubMed

    De Jaeger, Geert; Scheffer, Stanley; Jacobs, Anni; Zambre, Mukund; Zobell, Oliver; Goossens, Alain; Depicker, Ann; Angenon, Geert

    2002-12-01

    Over the past decade, several high value proteins have been produced in different transgenic plant tissues such as leaves, tubers, and seeds. Despite recent advances, many heterologous proteins accumulate to low concentrations, and the optimization of expression cassettes to make in planta production and purification economically feasible remains critical. Here, the regulatory sequences of the seed storage protein gene arcelin 5-I (arc5-I) of common bean (Phaseolus vulgaris) were evaluated for producing heterologous proteins in dicotyledonous seeds. The murine single chain variable fragment (scFv) G4 (ref. 4) was chosen as model protein because of the current industrial interest in producing antibodies and derived fragments in crops. In transgenic Arabidopsis thaliana seed stocks, the scFv under control of the 35S promoter of the cauliflower mosaic virus (CaMV) accumulated to approximately 1% of total soluble protein (TSP). However, a set of seed storage promoter constructs boosted the scFv accumulation to exceptionally high concentrations, reaching no less than 36.5% of TSP in homozygous seeds. Even at these high concentrations, the scFv proteins had antigen-binding activity and affinity similar to those produced in Escherichia coli. The feasibility of heterologous protein production under control of arc5-I regulatory sequences was also demonstrated in Phaseolus acutifolius, a promising crop for large scale production. PMID:12415287

  7. A genome-wide cis-regulatory element discovery method based on promoter sequences and gene co-expression networks

    PubMed Central

    2013-01-01

    Background Deciphering cis-regulatory networks has become an attractive yet challenging task. This paper presents a simple method for cis-regulatory network discovery which aims to avoid some of the common problems of previous approaches. Results Using promoter sequences and gene expression profiles as input, rather than clustering the genes by the expression data, our method utilizes co-expression neighborhood information for each individual gene, thereby overcoming the disadvantages of current clustering based models which may miss specific information for individual genes. In addition, rather than using a motif database as an input, it implements a simple motif count table for each enumerated k-mer for each gene promoter sequence. Thus, it can be used for species where previous knowledge of cis-regulatory motifs is unknown and has the potential to discover new transcription factor binding sites. Applications on Saccharomyces cerevisiae and Arabidopsis have shown that our method has a good prediction accuracy and outperforms a phylogenetic footprinting approach. Furthermore, the top ranked gene-motif regulatory clusters are evidently functionally co-regulated, and the regulatory relationships between the motifs and the enriched biological functions can often be confirmed by literature. Conclusions Since this method is simple and gene-specific, it can be readily utilized for insufficiently studied species or flexibly used as an additional step or data source for previous transcription regulatory networks discovery models. PMID:23368633

  8. Molecular sled sequences are common in mammalian proteins

    PubMed Central

    Xiong, Kan; Blainey, Paul C.

    2016-01-01

    Recent work revealed a new class of molecular machines called molecular sleds, which are small basic molecules that bind and slide along DNA with the ability to carry cargo along DNA. Here, we performed biochemical and single-molecule flow stretching assays to investigate the basis of sliding activity in molecular sleds. In particular, we identified the functional core of pVIc, the first molecular sled characterized; peptide functional groups that control sliding activity; and propose a model for the sliding activity of molecular sleds. We also observed widespread DNA binding and sliding activity among basic polypeptide sequences that implicate mammalian nuclear localization sequences and many cell penetrating peptides as molecular sleds. These basic protein motifs exhibit weak but physiologically relevant sequence-nonspecific DNA affinity. Our findings indicate that many mammalian proteins contain molecular sled sequences and suggest the possibility that substantial undiscovered sliding activity exists among nuclear mammalian proteins. PMID:26857546

  9. Molecular sled sequences are common in mammalian proteins.

    PubMed

    Xiong, Kan; Blainey, Paul C

    2016-03-18

    Recent work revealed a new class of molecular machines called molecular sleds, which are small basic molecules that bind and slide along DNA with the ability to carry cargo along DNA. Here, we performed biochemical and single-molecule flow stretching assays to investigate the basis of sliding activity in molecular sleds. In particular, we identified the functional core of pVIc, the first molecular sled characterized; peptide functional groups that control sliding activity; and propose a model for the sliding activity of molecular sleds. We also observed widespread DNA binding and sliding activity among basic polypeptide sequences that implicate mammalian nuclear localization sequences and many cell penetrating peptides as molecular sleds. These basic protein motifs exhibit weak but physiologically relevant sequence-nonspecific DNA affinity. Our findings indicate that many mammalian proteins contain molecular sled sequences and suggest the possibility that substantial undiscovered sliding activity exists among nuclear mammalian proteins. PMID:26857546

  10. PAZAR: a framework for collection and dissemination of cis-regulatory sequence annotation

    PubMed Central

    Portales-Casamar, Elodie; Kirov, Stefan; Lim, Jonathan; Lithwick, Stuart; Swanson, Magdalena I; Ticoll, Amy; Snoddy, Jay; Wasserman, Wyeth W

    2007-01-01

    PAZAR is an open-access and open-source database of transcription factor and regulatory sequence annotation with associated web interface and programming tools for data submission and extraction. Curated boutique data collections can be maintained and disseminated through the unified schema of the mall-like PAZAR repository. The Pleiades Promoter Project collection of brain-linked regulatory sequences is introduced to demonstrate the depth of annotation possible within PAZAR. PAZAR, located at , is open for business. PMID:17916232

  11. On the Concept of Cis-regulatory Information: From Sequence Motifs to Logic Functions

    NASA Astrophysics Data System (ADS)

    Tarpine, Ryan; Istrail, Sorin

    The regulatory genome is about the “system level organization of the core genomic regulatory apparatus, and how this is the locus of causality underlying the twin phenomena of animal development and animal evolution” (E.H. Davidson. The Regulatory Genome: Gene Regulatory Networks in Development and Evolution, Academic Press, 2006). Information processing in the regulatory genome is done through regulatory states, defined as sets of transcription factors (sequence-specific DNA binding proteins which determine gene expression) that are expressed and active at the same time. The core information processing machinery consists of modular DNA sequence elements, called cis-modules, that interact with transcription factors. The cis-modules “read” the information contained in the regulatory state of the cell through transcription factor binding, “process” it, and directly or indirectly communicate with the basal transcription apparatus to determine gene expression. This endowment of each gene with the information-receiving capacity through their cis-regulatory modules is essential for the response to every possible regulatory state to which it might be exposed during all phases of the life cycle and in all cell types. We present here a set of challenges addressed by our CYRENE research project aimed at studying the cis-regulatory code of the regulatory genome. The CYRENE Project is devoted to (1) the construction of a database, the cis-Lexicon, containing comprehensive information across species about experimentally validated cis-regulatory modules; and (2) the software development of a next-generation genome browser, the cis-Browser, specialized for the regulatory genome. The presentation is anchored on three main computational challenges: the Gene Naming Problem, the Consensus Sequence Bottleneck Problem, and the Logic Function Inference Problem.

  12. The genetic architecture of autism spectrum disorders (ASDs) and the potential importance of common regulatory genetic variants.

    PubMed

    Saffen, David

    2015-10-01

    Currently, there is great interest in identifying genetic variants that contribute to the risk of developing autism spectrum disorders (ASDs), due in part to recent increases in the frequency of diagnosis of these disorders worldwide. While there is nearly universal agreement that ASDs are complex diseases, with multiple genetic and environmental contributing factors, there is less agreement concerning the relative importance of common vs rare genetic variants in ASD liability. Recent observations that rare mutations and copy number variants (CNVs) are frequently associated with ASDs, combined with reduced fecundity of individuals with these disorders, has led to the hypothesis that ASDs are caused primarily by de novo or rare genetic mutations. Based on this model, large-scale whole-genome DNA sequencing has been proposed as the most appropriate method for discovering ASD liability genes. While this approach will undoubtedly identify many novel candidate genes and produce important new insights concerning the genetic causes of these disorders, a full accounting of the genetics of ASDs will be incomplete absent an understanding of the contributions of common regulatory variants, which are likely to influence ASD liability by modifying the effects of rare variants or, by assuming unfavorable combinations, directly produce these disorders. Because it is not yet possible to identify regulatory genetic variants by examination of DNA sequences alone, their identification will require experimentation. In this essay, I discuss these issues and describe the advantages of measurements of allelic expression imbalance (AEI) of mRNA expression for identifying cis-acting regulatory variants that contribute to ASDs. PMID:26335735

  13. Genotyping common and rare variation using overlapping pool sequencing

    PubMed Central

    2011-01-01

    Background Recent advances in sequencing technologies set the stage for large, population based studies, in which the ANA or RNA of thousands of individuals will be sequenced. Currently, however, such studies are still infeasible using a straightforward sequencing approach; as a result, recently a few multiplexing schemes have been suggested, in which a small number of ANA pools are sequenced, and the results are then deconvoluted using compressed sensing or similar approaches. These methods, however, are limited to the detection of rare variants. Results In this paper we provide a new algorithm for the deconvolution of DNA pools multiplexing schemes. The presented algorithm utilizes a likelihood model and linear programming. The approach allows for the addition of external data, particularly imputation data, resulting in a flexible environment that is suitable for different applications. Conclusions Particularly, we demonstrate that both low and high allele frequency SNPs can be accurately genotyped when the DNA pooling scheme is performed in conjunction with microarray genotyping and imputation. Additionally, we demonstrate the use of our framework for the detection of cancer fusion genes from RNA sequences. PMID:21989232

  14. Sequence evidence for common ancestry of eukaryotic endomembrane coatomers

    PubMed Central

    Promponas, Vasilis J.; Katsani, Katerina R.; Blencowe, Benjamin J.; Ouzounis, Christos A.

    2016-01-01

    Eukaryotic cells are defined by compartments through which the trafficking of macromolecules is mediated by large complexes, such as the nuclear pore, transport vesicles and intraflagellar transport. The assembly and maintenance of these complexes is facilitated by endomembrane coatomers, long suspected to be divergently related on the basis of structural and more recently phylogenomic analysis. By performing supervised walks in sequence space across coatomer superfamilies, we uncover subtle sequence patterns that have remained elusive to date, ultimately unifying eukaryotic coatomers by divergent evolution. The conserved residues shared by 3,502 endomembrane coatomer components are mapped onto the solenoid superhelix of nucleoporin and COPII protein structures, thus determining the invariant elements of coatomer architecture. This ancient structural motif can be considered as a universal signature connecting eukaryotic coatomers involved in multiple cellular processes across cell physiology and human disease. PMID:26931514

  15. Sequence evidence for common ancestry of eukaryotic endomembrane coatomers.

    PubMed

    Promponas, Vasilis J; Katsani, Katerina R; Blencowe, Benjamin J; Ouzounis, Christos A

    2016-01-01

    Eukaryotic cells are defined by compartments through which the trafficking of macromolecules is mediated by large complexes, such as the nuclear pore, transport vesicles and intraflagellar transport. The assembly and maintenance of these complexes is facilitated by endomembrane coatomers, long suspected to be divergently related on the basis of structural and more recently phylogenomic analysis. By performing supervised walks in sequence space across coatomer superfamilies, we uncover subtle sequence patterns that have remained elusive to date, ultimately unifying eukaryotic coatomers by divergent evolution. The conserved residues shared by 3,502 endomembrane coatomer components are mapped onto the solenoid superhelix of nucleoporin and COPII protein structures, thus determining the invariant elements of coatomer architecture. This ancient structural motif can be considered as a universal signature connecting eukaryotic coatomers involved in multiple cellular processes across cell physiology and human disease. PMID:26931514

  16. RNA-ID, a Powerful Tool for Identifying and Characterizing Regulatory Sequences.

    PubMed

    Brule, C E; Dean, K M; Grayhack, E J

    2016-01-01

    The identification and analysis of sequences that regulate gene expression is critical because regulated gene expression underlies biology. RNA-ID is an efficient and sensitive method to discover and investigate regulatory sequences in the yeast Saccharomyces cerevisiae, using fluorescence-based assays to detect green fluorescent protein (GFP) relative to a red fluorescent protein (RFP) control in individual cells. Putative regulatory sequences can be inserted either in-frame or upstream of a superfolder GFP fusion protein whose expression, like that of RFP, is driven by the bidirectional GAL1,10 promoter. In this chapter, we describe the methodology to identify and study cis-regulatory sequences in the RNA-ID system, explaining features and variations of the RNA-ID reporter, as well as some applications of this system. We describe in detail the methods to analyze a single regulatory sequence, from construction of a single GFP variant to assay of variants by flow cytometry, as well as modifications required to screen libraries of different strains simultaneously. We also describe subsequent analyses of regulatory sequences. PMID:27241757

  17. Retrieval of Branching Sequences in an Associative Memory Model with Common External Input and Bias Input

    NASA Astrophysics Data System (ADS)

    Katahira, Kentaro; Kawamura, Masaki; Okanoya, Kazuo; Okada, Masato

    2007-04-01

    We investigate a recurrent neural network model with common external and bias inputs that can retrieve branching sequences. Retrieval of memory sequences is one of the most important functions of the brain. A lot of research has been done on neural networks that process memory sequences. Most of it has focused on fixed memory sequences. However, many animals can remember and recall branching sequences. Therefore, we propose an associative memory model that can retrieve branching sequences. Our model has bias input and common external input. Kawamura and Okada reported that common external input enables sequential memory retrieval in an associative memory model with auto- and weak cross-correlation connections. We show that retrieval processes along branching sequences are controllable with both the bias input and the common external input. To analyze the behaviors of our model, we derived the macroscopic dynamical description as a probability density function. The results obtained by our theory agree with those obtained by computer simulations.

  18. Synthetic muscle promoters: activities exceeding naturally occurring regulatory sequences

    NASA Technical Reports Server (NTRS)

    Li, X.; Eastman, E. M.; Schwartz, R. J.; Draghia-Akli, R.

    1999-01-01

    Relatively low levels of expression from naturally occurring promoters have limited the use of muscle as a gene therapy target. Myogenic restricted gene promoters display complex organization usually involving combinations of several myogenic regulatory elements. By random assembly of E-box, MEF-2, TEF-1, and SRE sites into synthetic promoter recombinant libraries, and screening of hundreds of individual clones for transcriptional activity in vitro and in vivo, several artificial promoters were isolated whose transcriptional potencies greatly exceed those of natural myogenic and viral gene promoters.

  19. A sequence pattern common to T cell epitopes.

    PubMed Central

    Rothbard, J B; Taylor, W R

    1988-01-01

    An analysis of the known cytotoxic and helper T cell epitopes has revealed similarity within their primary sequences. These similar motifs, characteristic of the known determinants, have been incorporated into predictive templates that have been used successfully to define eight helper and three cytotoxic epitopes in four different proteins. When the defined epitopes are segregated by restriction element, allele specific subpatterns emerge centering around the general pattern. The presence of similarities argues that the binding of peptide antigens to class I and class II is similar in nature. In addition, these motifs can be used to predict accurately areas within proteins capable of being recognized by individual MHC class I and class II molecules. PMID:2452085

  20. Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence

    PubMed Central

    Gordon, Kacy L.; Arthur, Robert K.; Ruvinsky, Ilya

    2015-01-01

    Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements. PMID:26020930

  1. Bioinformatic identification of novel regulatory DNA sequence motifs in Streptomyces coelicolor

    PubMed Central

    Studholme, David J; Bentley, Stephen D; Kormanec, Jan

    2004-01-01

    Background Streptomyces coelicolor is a bacterium with a vast repertoire of metabolic functions and complex systems of cellular development. Its genome sequence is rich in genes that encode regulatory proteins to control these processes in response to its changing environment. We wished to apply a recently published bioinformatic method for identifying novel regulatory sequence signals to gain new insights into regulation in S. coelicolor. Results The method involved production of position-specific weight matrices from alignments of over-represented words of DNA sequence. We generated 2497 weight matrices, each representing a candidate regulatory DNA sequence motif. We scanned the genome sequence of S. coelicolor against each of these matrices. A DNA sequence motif represented by one of the matrices was found preferentially in non-coding sequences immediately upstream of genes involved in polysaccharide degradation, including several that encode chitinases. This motif (TGGTCTAGACCA) was also found upstream of genes encoding components of the phosphoenolpyruvate phosphotransfer system (PTS). We hypothesise that this DNA sequence motif represents a regulatory element that is responsive to availability of carbon-sources. Other motifs of potential biological significance were found upstream of genes implicated in secondary metabolism (TTAGGTtAGgCTaACCTAA), sigma factors (TGACN19TGAC), DNA replication and repair (ttgtCAGTGN13TGGA), nucleotide conversions (CTACgcNCGTAG), and ArsR (TCAGN12TCAG). A motif found upstream of genes involved in chromosome replication (TGTCagtgcN7Tagg) was similar to a previously described motif found in UV-responsive promoters. Conclusions We successfully applied a recently published in silico method to identify conserved sequence motifs in S. coelicolor that may be biologically significant as regulatory elements. Our data are broadly consistent with and further extend data from previously published studies. We invite experimental testing of

  2. Poster — Thur Eve — 50: Common Regulatory Non-Compliances and How to Avoid Them

    SciTech Connect

    Heimann, M.

    2014-08-15

    The Accelerators and Class II Facilities Division (ACFD) of the Canadian Nuclear Safety Commission (CNSC), is responsible for the oversight of radiotherapy facilities containing Class II prescribed equipment in Canada. Over the past several years, ACFD has been performing compliance inspections of Class II nuclear facilities across the country (medical and otherwise), and in that time, has issued several hundred corrective actions to licensees due to non-compliance with regulatory requirements. Recently, a study was done to determine the most common regulatory non-compliances. The purpose of this poster presentation is to disseminate information to the licensee community about the nature of these non-compliances, and how they can be avoided by licensees in the future.

  3. Common Warm Dust Temperatures Around Main Sequence Stars

    NASA Technical Reports Server (NTRS)

    Morales, Farisa; Rieke, George; Werner, Michael; Stapelfeldt, Karl; Bryden, Geoffrey; Su, Kate

    2011-01-01

    We compare the properties of warm dust emission from a sample of main-sequence A-type stars (B8-A7) to those of dust around solar-type stars (F5-KO) with similar Spitzer Space Telescope Infrared Spectrograph/MIPS data and similar ages. Both samples include stars with sources with infrared spectral energy distributions that show evidence of multiple components. Over the range of stellar types considered, we obtain nearly the same characteristic dust temperatures (∼ 190 K and ∼60 K for the inner and outer dust components, respectively)-slightly above the ice evaporation temperature for the inner belts. The warm inner dust temperature is readily explained if populations of small grains are being released by sublimation of ice from icy planetesimals. Evaporation of low-eccentricity icy bodies at ∼ 150 K can deposit particles into an inner/warm belt, where the small grains are heated to dust Temperatures of -190 K. Alternatively, enhanced collisional processing of an asteroid belt-like system of parent planetesimals just interior to the snow line may account for the observed uniformity in dust temperature. The similarity in temperature of the warmer dust across our B8-KO stellar sample strongly suggests that dust-producing planetesimals are not found at similar radial locations around all stars, but that dust production is favored at a characteristic temperature horizon.

  4. Marine organism cell biology and regulatory sequence discoveryin comparative functional genomics.

    PubMed

    Barnes, David W; Mattingly, Carolyn J; Parton, Angela; Dowell, Lori M; Bayne, Christopher J; Forrest, John N

    2004-10-01

    The use of bioinformatics to integrate phenotypic and genomic data from mammalian models is well established as a means of understanding human biology and disease. Beyond direct biomedical applications of these approaches in predicting structure-function relationships between coding sequences and protein activities, comparative studies also promote understanding of molecular evolution and the relationship between genomic sequence and morphological and physiological specialization. Recently recognized is the potential of comparative studies to identify functionally significant regulatory regions and to generate experimentally testable hypotheses that contribute to understanding mechanisms that regulate gene expression, including transcriptional activity, alternative splicing and transcript stability. Functional tests of hypotheses generated by computational approaches require experimentally tractable in vitro systems, including cell cultures. Comparative sequence analysis strategies that use genomic sequences from a variety of evolutionarily diverse organisms are critical for identifying conserved regulatory motifs in the 5'-upstream, 3'-downstream and introns of genes. Genomic sequences and gene orthologues in the first aquatic vertebrate and protovertebrate organisms to be fully sequenced (Fugu rubripes, Ciona intestinalis, Tetraodon nigroviridis, Danio rerio) as well as in the elasmobranchs, spiny dogfish shark (Squalus acanthias) and little skate (Raja erinacea), and marine invertebrate models such as the sea urchin (Strongylocentrotus purpuratus) are valuable in the prediction of putative genomic regulatory regions. Cell cultures have been derived for these and other model species. Data and tools resulting from these kinds of studies will contribute to understanding transcriptional regulation of biomedically important genes and provide new avenues for medical therapeutics and disease prevention. PMID:19003267

  5. The Intolerance of Regulatory Sequence to Genetic Variation Predicts Gene Dosage Sensitivity.

    PubMed

    Petrovski, Slavé; Gussow, Ayal B; Wang, Quanli; Halvorsen, Matt; Han, Yujun; Weir, William H; Allen, Andrew S; Goldstein, David B

    2015-09-01

    Noncoding sequence contains pathogenic mutations. Yet, compared with mutations in protein-coding sequence, pathogenic regulatory mutations are notoriously difficult to recognize. Most fundamentally, we are not yet adept at recognizing the sequence stretches in the human genome that are most important in regulating the expression of genes. For this reason, it is difficult to apply to the regulatory regions the same kinds of analytical paradigms that are being successfully applied to identify mutations among protein-coding regions that influence risk. To determine whether dosage sensitive genes have distinct patterns among their noncoding sequence, we present two primary approaches that focus solely on a gene's proximal noncoding regulatory sequence. The first approach is a regulatory sequence analogue of the recently introduced residual variation intolerance score (RVIS), termed noncoding RVIS, or ncRVIS. The ncRVIS compares observed and predicted levels of standing variation in the regulatory sequence of human genes. The second approach, termed ncGERP, reflects the phylogenetic conservation of a gene's regulatory sequence using GERP++. We assess how well these two approaches correlate with four gene lists that use different ways to identify genes known or likely to cause disease through changes in expression: 1) genes that are known to cause disease through haploinsufficiency, 2) genes curated as dosage sensitive in ClinGen's Genome Dosage Map, 3) genes judged likely to be under purifying selection for mutations that change expression levels because they are statistically depleted of loss-of-function variants in the general population, and 4) genes judged unlikely to cause disease based on the presence of copy number variants in the general population. We find that both noncoding scores are highly predictive of dosage sensitivity using any of these criteria. In a similar way to ncGERP, we assess two ensemble-based predictors of regional noncoding importance, nc

  6. The Intolerance of Regulatory Sequence to Genetic Variation Predicts Gene Dosage Sensitivity

    PubMed Central

    Wang, Quanli; Halvorsen, Matt; Han, Yujun; Weir, William H.; Allen, Andrew S.; Goldstein, David B.

    2015-01-01

    Noncoding sequence contains pathogenic mutations. Yet, compared with mutations in protein-coding sequence, pathogenic regulatory mutations are notoriously difficult to recognize. Most fundamentally, we are not yet adept at recognizing the sequence stretches in the human genome that are most important in regulating the expression of genes. For this reason, it is difficult to apply to the regulatory regions the same kinds of analytical paradigms that are being successfully applied to identify mutations among protein-coding regions that influence risk. To determine whether dosage sensitive genes have distinct patterns among their noncoding sequence, we present two primary approaches that focus solely on a gene’s proximal noncoding regulatory sequence. The first approach is a regulatory sequence analogue of the recently introduced residual variation intolerance score (RVIS), termed noncoding RVIS, or ncRVIS. The ncRVIS compares observed and predicted levels of standing variation in the regulatory sequence of human genes. The second approach, termed ncGERP, reflects the phylogenetic conservation of a gene’s regulatory sequence using GERP++. We assess how well these two approaches correlate with four gene lists that use different ways to identify genes known or likely to cause disease through changes in expression: 1) genes that are known to cause disease through haploinsufficiency, 2) genes curated as dosage sensitive in ClinGen’s Genome Dosage Map, 3) genes judged likely to be under purifying selection for mutations that change expression levels because they are statistically depleted of loss-of-function variants in the general population, and 4) genes judged unlikely to cause disease based on the presence of copy number variants in the general population. We find that both noncoding scores are highly predictive of dosage sensitivity using any of these criteria. In a similar way to ncGERP, we assess two ensemble-based predictors of regional noncoding importance

  7. [Identification and mapping of cis-regulatory elements within long genomic sequences].

    PubMed

    Akopov, S B; Chernov, I P; Vetchinova, A S; Bulanenkova, S S; Nikolaev, L G

    2007-01-01

    The publication of the human and other metazoan genome sequences opened up the possibility for mapping and analysis of genomic regulatory elements. Unfortunately, experimental data on genomic positions of such sequences as enhancers, silencers, insulators, transcription terminators, and replication origins are very limited, especially at the whole genome level. As most genomic regulatory elements (e.g., enhancers) are generally gene-, tissue-, or cell-specific, the prediction of these elements in silico is often ambiguous. Therefore, the development of high-throughput experimental approaches for identification and mapping of genomic functional elements is highly desirable. In this review we discuss novel approaches to high-throughput experimental identification of mammalian genomes cis-regulatory elements which is a necessary step toward the complete genome annotation. PMID:18240562

  8. Close Sequence Comparisons are Sufficient to Identify Humancis-Regulatory Elements

    SciTech Connect

    Prabhakar, Shyam; Poulin, Francis; Shoukry, Malak; Afzal, Veena; Rubin, Edward M.; Couronne, Olivier; Pennacchio, Len A.

    2005-12-01

    Cross-species DNA sequence comparison is the primary method used to identify functional noncoding elements in human and other large genomes. However, little is known about the relative merits of evolutionarily close and distant sequence comparisons, due to the lack of a universal metric for sequence conservation, and also the paucity of empirically defined benchmark sets of cis-regulatory elements. To address this problem, we developed a general-purpose algorithm (Gumby) that detects slowly-evolving regions in primate, mammalian and more distant comparisons without requiring adjustment of parameters, and ranks conserved elements by P-value using Karlin-Altschul statistics. We benchmarked Gumby predictions against previously identified cis-regulatory elements at diverse genomic loci, and also tested numerous extremely conserved human-rodent sequences for transcriptional enhancer activity using reporter-gene assays in transgenic mice. Human regulatory elements were identified with acceptable sensitivity and specificity by comparison with 1-5 other eutherian mammals or 6 other simian primates. More distant comparisons (marsupial, avian, amphibian and fish) failed to identify many of the empirically defined functional noncoding elements. We derived an intuitive relationship between ancient and recent noncoding sequence conservation from whole genome comparative analysis, which explains some of these findings. Lastly, we determined that, in addition to strength of conservation, genomic location and/or density of surrounding conserved elements must also be considered in selecting candidate enhancers for testing at embryonic time points.

  9. Appropriate selection of MRI sequences for common scenarios in clinical practice.

    PubMed

    Chavhan, Govind B

    2016-05-01

    Knowledge about sequence properties is essential to plan and acquire a diagnostic MRI examination. The broad four categories of sequences include spin echo (SE), gradient echo (GRE), inversion recovery (IR) and echoplanar imaging (EPI). Varieties of sequences from these four categories are available for clinical application. They have different contrast mechanisms, spatial and contrast resolution and speed of acquisition. Choice of sequence differs in various scenarios in clinical practice such as solid organ imaging, moving target imaging, bone and bone marrow imaging, cartilage imaging and vessel imaging, taking into consideration properties of sequences to answer the clinical question. Broad classification of sequences and differences in their contrast, spatial and contrast resolution, and speed of acquisition are discussed in this review. A few common clinical scenarios of MRI imaging are illustrated, along with reasons for the given sequence choices. PMID:27229493

  10. A Parallel Non-Alignment Based Approach to Efficient Sequence Comparison using Longest Common Subsequences

    NASA Astrophysics Data System (ADS)

    Bhowmick, S.; Shafiullah, M.; Rai, H.; Bastola, D.

    2010-11-01

    Biological sequence comparison programs have revolutionized the practice of biochemistry, and molecular and evolutionary biology. Pairwise comparison of genomic sequences is a popular method of choice for analyzing genetic sequence data. However the quality of results from most sequence comparison methods are significantly affected by small perturbations in the data and furthermore, there is a dearth of computational tools to compare sequences beyond a certain length. In this paper, we describe a parallel algorithm for comparing genetic sequences using an alignment free-method based on computing the Longest Common Subsequence (LCS) between genetic sequences. We validate the quality of our results by comparing the phylogenetic tress obtained from ClustalW and LCS. We also show through complexity analysis of the isoefficiency and by empirical measurement of the running time that our algorithm is very scalable.

  11. Sequence of the phosphothreonyl regulatory site peptide from inactive maize leaf pyruvate, orthophosphate dikinase

    SciTech Connect

    Roeske, C.A.; Kutny, R.M.; Budde, R.J.A.; Chollet, R.

    1988-05-15

    The regulatory site peptide sequence of phosphorylated inactive pyruvate, orthophosphate dikinase from maize leaf tissue was determined by automated Edman degradation analysis of /sup 32/P-labeled peptides purified by reversed-phase high performance liquid chromatography. The overlapping phosphopeptides were products of a digestion of the (..beta..-/sup 32/P)ADP-inactivated dikinase with either trypsin or Pronase E. The sequence is Thr-Glu-Arg-Gly-Gly-Met-Thr(P)-Ser-His-Ala-Ala-Val-Val-Ala-Arg. The phosphothreonine residue, which appeared as either an anomalous proline or an unidentifiable phenylthiohydantoin derivative during sequencing, was verified by two-dimensional phosphoamino acid analysis of the phosphopeptides and by resequencing the tryptic peptide after dephosphorylation with exogenous alkaline phosphatase. This sequence, starting at position 4, is completely homologous to the previously published sequence of the tryptic dodecapeptide harboring the catalytically essential (phospho)histidyl residue in the active-site domain of the dikinase from the nonphotosynthetic bacterium, Bacteroides symbiosus. These comparative results indicate that the regulatory phosphothreonine causing complete inactivation of maize leaf dikinase is separated from the critical active-site (phospho)histidine by just one intervening residue in the primary sequence.

  12. BayesPI-BAR: a new biophysical model for characterization of regulatory sequence variations.

    PubMed

    Wang, Junbai; Batmanov, Kirill

    2015-12-01

    Sequence variations in regulatory DNA regions are known to cause functionally important consequences for gene expression. DNA sequence variations may have an essential role in determining phenotypes and may be linked to disease; however, their identification through analysis of massive genome-wide sequencing data is a great challenge. In this work, a new computational pipeline, a Bayesian method for protein-DNA interaction with binding affinity ranking (BayesPI-BAR), is proposed for quantifying the effect of sequence variations on protein binding. BayesPI-BAR uses biophysical modeling of protein-DNA interactions to predict single nucleotide polymorphisms (SNPs) that cause significant changes in the binding affinity of a regulatory region for transcription factors (TFs). The method includes two new parameters (TF chemical potentials or protein concentrations and direct TF binding targets) that are neglected by previous methods. The new method is verified on 67 known human regulatory SNPs, of which 47 (70%) have predicted true TFs ranked in the top 10. Importantly, the performance of BayesPI-BAR, which uses principal component analysis to integrate multiple predictions from various TF chemical potentials, is found to be better than that of existing programs, such as sTRAP and is-rSNP, when evaluated on the same SNPs. BayesPI-BAR is a publicly available tool and is able to carry out parallelized computation, which helps to investigate a large number of TFs or SNPs and to detect disease-associated regulatory sequence variations in the sea of genome-wide noncoding regions. PMID:26202972

  13. Partitioning heritability of regulatory and cell-type-specific variants across 11 common diseases.

    PubMed

    Gusev, Alexander; Lee, S Hong; Trynka, Gosia; Finucane, Hilary; Vilhjálmsson, Bjarni J; Xu, Han; Zang, Chongzhi; Ripke, Stephan; Bulik-Sullivan, Brendan; Stahl, Eli; Kähler, Anna K; Hultman, Christina M; Purcell, Shaun M; McCarroll, Steven A; Daly, Mark; Pasaniuc, Bogdan; Sullivan, Patrick F; Neale, Benjamin M; Wray, Naomi R; Raychaudhuri, Soumya; Price, Alkes L

    2014-11-01

    Regulatory and coding variants are known to be enriched with associations identified by genome-wide association studies (GWASs) of complex disease, but their contributions to trait heritability are currently unknown. We applied variance-component methods to imputed genotype data for 11 common diseases to partition the heritability explained by genotyped SNPs (hg(2)) across functional categories (while accounting for shared variance due to linkage disequilibrium). Extensive simulations showed that in contrast to current estimates from GWAS summary statistics, the variance-component approach partitions heritability accurately under a wide range of complex-disease architectures. Across the 11 diseases DNaseI hypersensitivity sites (DHSs) from 217 cell types spanned 16% of imputed SNPs (and 24% of genotyped SNPs) but explained an average of 79% (SE = 8%) of hg(2) from imputed SNPs (5.1× enrichment; p = 3.7 × 10(-17)) and 38% (SE = 4%) of hg(2) from genotyped SNPs (1.6× enrichment, p = 1.0 × 10(-4)). Further enrichment was observed at enhancer DHSs and cell-type-specific DHSs. In contrast, coding variants, which span 1% of the genome, explained <10% of hg(2) despite having the highest enrichment. We replicated these findings but found no significant contribution from rare coding variants in independent schizophrenia cohorts genotyped on GWAS and exome chips. Our results highlight the value of analyzing components of heritability to unravel the functional architecture of common disease. PMID:25439723

  14. Partitioning Heritability of Regulatory and Cell-Type-Specific Variants across 11 Common Diseases

    PubMed Central

    Gusev, Alexander; Lee, S. Hong; Trynka, Gosia; Finucane, Hilary; Vilhjálmsson, Bjarni J.; Xu, Han; Zang, Chongzhi; Ripke, Stephan; Bulik-Sullivan, Brendan; Stahl, Eli; Ripke, Stephan; Neale, Benjamin M.; Corvin, Aiden; Walters, James T.R.; Farh, Kai-How; Holmans, Peter A.; Lee, Phil; Bulik-Sullivan, Brendan; Collier, David A.; Huang, Hailiang; Pers, Tune H.; Agartz, Ingrid; Agerbo, Esben; Albus, Margot; Alexander, Madeline; Amin, Farooq; Bacanu, Silviu A.; Begemann, Martin; Belliveau, Richard A.; Bene, Judit; Bergen, Sarah E.; Bevilacqua, Elizabeth; Bigdeli, Tim B.; Black, Donald W.; Børglum, Anders D.; Bruggeman, Richard; Buccola, Nancy G.; Buckner, Randy L.; Byerley, William; Cahn, Wiepke; Cai, Guiqing; Campion, Dominique; Cantor, Rita M.; Carr, Vaughan J.; Carrera, Noa; Catts, Stanley V.; Chambert, Kimberly D.; Chan, Raymond C.K.; Chen, Ronald Y.L.; Chen, Eric Y.H.; Cheng, Wei; Cheung, Eric F.C.; Chong, Siow Ann; Cloninger, C. Robert; Cohen, David; Cohen, Nadine; Cormican, Paul; Craddock, Nick; Crowley, James J.; Curtis, David; Davidson, Michael; Davis, Kenneth L.; Degenhardt, Franziska; Del Favero, Jurgen; DeLisi, Lynn E.; Demontis, Ditte; Dikeos, Dimitris; Dinan, Timothy; Djurovic, Srdjan; Donohoe, Gary; Drapeau, Elodie; Duan, Jubao; Dudbridge, Frank; Durmishi, Naser; Eichhammer, Peter; Eriksson, Johan; Escott-Price, Valentina; Essioux, Laurent; Fanous, Ayman H.; Farrell, Martilias S.; Frank, Josef; Franke, Lude; Freedman, Robert; Freimer, Nelson B.; Friedl, Marion; Friedman, Joseph I.; Fromer, Menachem; Genovese, Giulio; Georgieva, Lyudmila; Gershon, Elliot S.; Giegling, Ina; Giusti-Rodrguez, Paola; Godard, Stephanie; Goldstein, Jacqueline I.; Golimbet, Vera; Gopal, Srihari; Gratten, Jacob; Grove, Jakob; de Haan, Lieuwe; Hammer, Christian; Hamshere, Marian L.; Hansen, Mark; Hansen, Thomas; Haroutunian, Vahram; Hartmann, Annette M.; Henskens, Frans A.; Herms, Stefan; Hirschhorn, Joel N.; Hoffmann, Per; Hofman, Andrea; Hollegaard, Mads V.; Hougaard, David M.; Ikeda, Masashi; Joa, Inge; Julià, Antonio; Kahn, René S.; Kalaydjieva, Luba; Karachanak-Yankova, Sena; Karjalainen, Juha; Kavanagh, David; Keller, Matthew C.; Kelly, Brian J.; Kennedy, James L.; Khrunin, Andrey; Kim, Yunjung; Klovins, Janis; Knowles, James A.; Konte, Bettina; Kucinskas, Vaidutis; Kucinskiene, Zita Ausrele; Kuzelova-Ptackova, Hana; Kähler, Anna K.; Laurent, Claudine; Keong, Jimmy Lee Chee; Lee, S. Hong; Legge, Sophie E.; Lerer, Bernard; Li, Miaoxin; Li, Tao; Liang, Kung-Yee; Lieberman, Jeffrey; Limborska, Svetlana; Loughland, Carmel M.; Lubinski, Jan; Lnnqvist, Jouko; Macek, Milan; Magnusson, Patrik K.E.; Maher, Brion S.; Maier, Wolfgang; Mallet, Jacques; Marsal, Sara; Mattheisen, Manuel; Mattingsdal, Morten; McCarley, Robert W.; McDonald, Colm; McIntosh, Andrew M.; Meier, Sandra; Meijer, Carin J.; Melegh, Bela; Melle, Ingrid; Mesholam-Gately, Raquelle I.; Metspalu, Andres; Michie, Patricia T.; Milani, Lili; Milanova, Vihra; Mokrab, Younes; Morris, Derek W.; Mors, Ole; Mortensen, Preben B.; Murphy, Kieran C.; Murray, Robin M.; Myin-Germeys, Inez; Mller-Myhsok, Bertram; Nelis, Mari; Nenadic, Igor; Nertney, Deborah A.; Nestadt, Gerald; Nicodemus, Kristin K.; Nikitina-Zake, Liene; Nisenbaum, Laura; Nordin, Annelie; O’Callaghan, Eadbhard; O’Dushlaine, Colm; O’Neill, F. Anthony; Oh, Sang-Yun; Olincy, Ann; Olsen, Line; Van Os, Jim; Pantelis, Christos; Papadimitriou, George N.; Papiol, Sergi; Parkhomenko, Elena; Pato, Michele T.; Paunio, Tiina; Pejovic-Milovancevic, Milica; Perkins, Diana O.; Pietilinen, Olli; Pimm, Jonathan; Pocklington, Andrew J.; Powell, John; Price, Alkes; Pulver, Ann E.; Purcell, Shaun M.; Quested, Digby; Rasmussen, Henrik B.; Reichenberg, Abraham; Reimers, Mark A.; Richards, Alexander L.; Roffman, Joshua L.; Roussos, Panos; Ruderfer, Douglas M.; Salomaa, Veikko; Sanders, Alan R.; Schall, Ulrich; Schubert, Christian R.; Schulze, Thomas G.; Schwab, Sibylle G.; Scolnick, Edward M.; Scott, Rodney J.; Seidman, Larry J.; Shi, Jianxin; Sigurdsson, Engilbert; Silagadze, Teimuraz; Silverman, Jeremy M.; Sim, Kang; Slominsky, Petr; Smoller, Jordan W.; So, Hon-Cheong; Spencer, Chris C.A.; Stahl, Eli A.; Stefansson, Hreinn; Steinberg, Stacy; Stogmann, Elisabeth; Straub, Richard E.; Strengman, Eric; Strohmaier, Jana; Stroup, T. Scott; Subramaniam, Mythily; Suvisaari, Jaana; Svrakic, Dragan M.; Szatkiewicz, Jin P.; Sderman, Erik; Thirumalai, Srinivas; Toncheva, Draga; Tooney, Paul A.; Tosato, Sarah; Veijola, Juha; Waddington, John; Walsh, Dermot; Wang, Dai; Wang, Qiang; Webb, Bradley T.; Weiser, Mark; Wildenauer, Dieter B.; Williams, Nigel M.; Williams, Stephanie; Witt, Stephanie H.; Wolen, Aaron R.; Wong, Emily H.M.; Wormley, Brandon K.; Wu, Jing Qin; Xi, Hualin Simon; Zai, Clement C.; Zheng, Xuebin; Zimprich, Fritz; Wray, Naomi R.; Stefansson, Kari; Visscher, Peter M.; Adolfsson, Rolf; Andreassen, Ole A.; Blackwood, Douglas H.R.; Bramon, Elvira; Buxbaum, Joseph D.; Brglum, Anders D.; Cichon, Sven; Darvasi, Ariel; Domenici, Enrico; Ehrenreich, Hannelore; Esko, Tõnu; Gejman, Pablo V.; Gill, Michael; Gurling, Hugh; Hultman, Christina M.; Iwata, Nakao; Jablensky, Assen V.; Jönsson, Erik G.; Kendler, Kenneth S.; Kirov, George; Knight, Jo; Lencz, Todd; Levinson, Douglas F.; Li, Qingqin S.; Liu, Jianjun; Malhotra, Anil K.; McCarroll, Steven A.; McQuillin, Andrew; Moran, Jennifer L.; Mortensen, Preben B.; Mowry, Bryan J.; Nthen, Markus M.; Ophoff, Roel A.; Owen, Michael J.; Palotie, Aarno; Pato, Carlos N.; Petryshen, Tracey L.; Posthuma, Danielle; Rietschel, Marcella; Riley, Brien P.; Rujescu, Dan; Sham, Pak C.; Sklar, Pamela; St. Clair, David; Weinberger, Daniel R.; Wendland, Jens R.; Werge, Thomas; Daly, Mark J.; Sullivan, Patrick F.; O’Donovan, Michael C.; Ripke, Stephan; O’Dushlaine, Colm; Chambert, Kimberly; Moran, Jennifer L.; Kähler, Anna K.; Akterin, Susanne; Bergen, Sarah; Magnusson, Patrik K.E.; Neale, Benjamin M.; Ruderfer, Douglas; Scolnick, Edward; Purcell, Shaun; McCarroll, Steve; Sklar, Pamela; Hultman, Christina M.; Sullivan, Patrick F.; Kähler, Anna K.; Hultman, Christina M.; Purcell, Shaun M.; McCarroll, Steven A.; Daly, Mark; Pasaniuc, Bogdan; Sullivan, Patrick F.; Neale, Benjamin M.; Wray, Naomi R.; Raychaudhuri, Soumya; Price, Alkes L.

    2014-01-01

    Regulatory and coding variants are known to be enriched with associations identified by genome-wide association studies (GWASs) of complex disease, but their contributions to trait heritability are currently unknown. We applied variance-component methods to imputed genotype data for 11 common diseases to partition the heritability explained by genotyped SNPs (hg2) across functional categories (while accounting for shared variance due to linkage disequilibrium). Extensive simulations showed that in contrast to current estimates from GWAS summary statistics, the variance-component approach partitions heritability accurately under a wide range of complex-disease architectures. Across the 11 diseases DNaseI hypersensitivity sites (DHSs) from 217 cell types spanned 16% of imputed SNPs (and 24% of genotyped SNPs) but explained an average of 79% (SE = 8%) of hg2 from imputed SNPs (5.1× enrichment; p = 3.7 × 10−17) and 38% (SE = 4%) of hg2 from genotyped SNPs (1.6× enrichment, p = 1.0 × 10−4). Further enrichment was observed at enhancer DHSs and cell-type-specific DHSs. In contrast, coding variants, which span 1% of the genome, explained <10% of hg2 despite having the highest enrichment. We replicated these findings but found no significant contribution from rare coding variants in independent schizophrenia cohorts genotyped on GWAS and exome chips. Our results highlight the value of analyzing components of heritability to unravel the functional architecture of common disease. PMID:25439723

  15. Time Delayed Causal Gene Regulatory Network Inference with Hidden Common Causes

    PubMed Central

    Lo, Leung-Yau; Wong, Man-Leung; Lee, Kin-Hong; Leung, Kwong-Sak

    2015-01-01

    Inferring the gene regulatory network (GRN) is crucial to understanding the working of the cell. Many computational methods attempt to infer the GRN from time series expression data, instead of through expensive and time-consuming experiments. However, existing methods make the convenient but unrealistic assumption of causal sufficiency, i.e. all the relevant factors in the causal network have been observed and there are no unobserved common cause. In principle, in the real world, it is impossible to be certain that all relevant factors or common causes have been observed, because some factors may not have been conceived of, and therefore are impossible to measure. In view of this, we have developed a novel algorithm named HCC-CLINDE to infer an GRN from time series data allowing the presence of hidden common cause(s). We assume there is a sparse causal graph (possibly with cycles) of interest, where the variables are continuous and each causal link has a delay (possibly more than one time step). A small but unknown number of variables are not observed. Each unobserved variable has only observed variables as children and parents, with at least two children, and the children are not linked to each other. Since it is difficult to obtain very long time series, our algorithm is also capable of utilizing multiple short time series, which is more realistic. To our knowledge, our algorithm is far less restrictive than previous works. We have performed extensive experiments using synthetic data on GRNs of size up to 100, with up to 10 hidden nodes. The results show that our algorithm can adequately recover the true causal GRN and is robust to slight deviation from Gaussian distribution in the error terms. We have also demonstrated the potential of our algorithm on small YEASTRACT subnetworks using limited real data. PMID:26394325

  16. Distinct Functional Constraints Partition Sequence Conservation in a cis-Regulatory Element

    PubMed Central

    Ruvinsky, Ilya

    2011-01-01

    Different functional constraints contribute to different evolutionary rates across genomes. To understand why some sequences evolve faster than others in a single cis-regulatory locus, we investigated function and evolutionary dynamics of the promoter of the Caenorhabditis elegans unc-47 gene. We found that this promoter consists of two distinct domains. The proximal promoter is conserved and is largely sufficient to direct appropriate spatial expression. The distal promoter displays little if any conservation between several closely related nematodes. Despite this divergence, sequences from all species confer robustness of expression, arguing that this function does not require substantial sequence conservation. We showed that even unrelated sequences have the ability to promote robust expression. A prominent feature shared by all of these robustness-promoting sequences is an AT-enriched nucleotide composition consistent with nucleosome depletion. Because general sequence composition can be maintained despite sequence turnover, our results explain how different functional constraints can lead to vastly disparate rates of sequence divergence within a promoter. PMID:21655084

  17. Close sequence comparisons are sufficient to identify human cis-regulatory elements.

    PubMed

    Prabhakar, Shyam; Poulin, Francis; Shoukry, Malak; Afzal, Veena; Rubin, Edward M; Couronne, Olivier; Pennacchio, Len A

    2006-07-01

    Cross-species DNA sequence comparison is the primary method used to identify functional noncoding elements in human and other large genomes. However, little is known about the relative merits of evolutionarily close and distant sequence comparisons. To address this problem, we identified evolutionarily conserved noncoding regions in primate, mammalian, and more distant comparisons using a uniform approach (Gumby) that facilitates unbiased assessment of the impact of evolutionary distance on predictive power. We benchmarked computational predictions against previously identified cis-regulatory elements at diverse genomic loci and also tested numerous extremely conserved human-rodent sequences for transcriptional enhancer activity using an in vivo enhancer assay in transgenic mice. Human regulatory elements were identified with acceptable sensitivity (53%-80%) and true-positive rate (27%-67%) by comparison with one to five other eutherian mammals or six other simian primates. More distant comparisons (marsupial, avian, amphibian, and fish) failed to identify many of the empirically defined functional noncoding elements. Our results highlight the practical utility of close sequence comparisons, and the loss of sensitivity entailed by more distant comparisons. We derived an intuitive relationship between ancient and recent noncoding sequence conservation from whole-genome comparative analysis that explains most of the observations from empirical benchmarking. Lastly, we determined that, in addition to strength of conservation, genomic location and/or density of surrounding conserved elements must also be considered in selecting candidate enhancers for in vivo testing at embryonic time points. PMID:16769978

  18. Resequencing of the common marmoset genome improves genome assemblies and gene-coding sequence analysis

    PubMed Central

    Sato, Kengo; Kuroki, Yoko; Kumita, Wakako; Fujiyama, Asao; Toyoda, Atsushi; Kawai, Jun; Iriki, Atsushi; Sasaki, Erika; Okano, Hideyuki; Sakakibara, Yasubumi

    2015-01-01

    The first draft of the common marmoset (Callithrix jacchus) genome was published by the Marmoset Genome Sequencing and Analysis Consortium. The draft was based on whole-genome shotgun sequencing, and the current assembly version is Callithrix_jacches-3.2.1, but there still exist 187,214 undetermined gap regions and supercontigs and relatively short contigs that are unmapped to chromosomes in the draft genome. We performed resequencing and assembly of the genome of common marmoset by deep sequencing with high-throughput sequencing technology. Several different sequence runs using Illumina sequencing platforms were executed, and 181 Gbp of high-quality bases including mate-pairs with long insert lengths of 3, 8, 20, and 40 Kbp were obtained, that is, approximately 60× coverage. The resequencing significantly improved the MGSAC draft genome sequence. The N50 of the contigs, which is a statistical measure used to evaluate assembly quality, doubled. As a result, 51% of the contigs (total length: 299 Mbp) that were unmapped to chromosomes in the MGSAC draft were merged with chromosomal contigs, and the improved genome sequence helped to detect 5,288 new genes that are homologous to human cDNAs and the gaps in 5,187 transcripts of the Ensembl gene annotations were completely filled. PMID:26586576

  19. Whole-genome sequencing to understand the genetic architecture of common gene expression and biomarker phenotypes

    PubMed Central

    Wood, Andrew R.; Tuke, Marcus A.; Nalls, Mike; Hernandez, Dena; Gibbs, J. Raphael; Lin, Haoxiang; Xu, Christopher S.; Li, Qibin; Shen, Juan; Jun, Goo; Almeida, Marcio; Tanaka, Toshiko; Perry, John R. B.; Gaulton, Kyle; Rivas, Manny; Pearson, Richard; Curran, Joanne E.; Johnson, Matthew P.; Göring, Harald H. H.; Duggirala, Ravindranath; Blangero, John; Mccarthy, Mark I.; Bandinelli, Stefania; Murray, Anna; Weedon, Michael N.; Singleton, Andrew; Melzer, David; Ferrucci, Luigi; Frayling, Timothy M

    2015-01-01

    Initial results from sequencing studies suggest that there are relatively few low-frequency (<5%) variants associated with large effects on common phenotypes. We performed low-pass whole-genome sequencing in 680 individuals from the InCHIANTI study to test two primary hypotheses: (i) that sequencing would detect single low-frequency–large effect variants that explained similar amounts of phenotypic variance as single common variants, and (ii) that some common variant associations could be explained by low-frequency variants. We tested two sets of disease-related common phenotypes for which we had statistical power to detect large numbers of common variant–common phenotype associations—11 132 cis-gene expression traits in 450 individuals and 93 circulating biomarkers in all 680 individuals. From a total of 11 657 229 high-quality variants of which 6 129 221 and 5 528 008 were common and low frequency (<5%), respectively, low frequency–large effect associations comprised 7% of detectable cis-gene expression traits [89 of 1314 cis-eQTLs at P < 1 × 10−06 (false discovery rate ∼5%)] and one of eight biomarker associations at P < 8 × 10−10. Very few (30 of 1232; 2%) common variant associations were fully explained by low-frequency variants. Our data show that whole-genome sequencing can identify low-frequency variants undetected by genotyping based approaches when sample sizes are sufficiently large to detect substantial numbers of common variant associations, and that common variant associations are rarely explained by single low-frequency variants of large effect. PMID:25378555

  20. A Pan-Cancer Modular Regulatory Network Analysis to Identify Common and Cancer-Specific Network Components

    PubMed Central

    Knaack, Sara A; Siahpirani, Alireza Fotuhi; Roy, Sushmita

    2014-01-01

    Many human diseases including cancer are the result of perturbations to transcriptional regulatory networks that control context-specific expression of genes. A comparative approach across multiple cancer types is a powerful approach to illuminate the common and specific network features of this family of diseases. Recent efforts from The Cancer Genome Atlas (TCGA) have generated large collections of functional genomic data sets for multiple types of cancers. An emerging challenge is to devise computational approaches that systematically compare these genomic data sets across different cancer types that identify common and cancer-specific network components. We present a module- and network-based characterization of transcriptional patterns in six different cancers being studied in TCGA: breast, colon, rectal, kidney, ovarian, and endometrial. Our approach uses a recently developed regulatory network reconstruction algorithm, modular regulatory network learning with per gene information (MERLIN), within a stability selection framework to predict regulators for individual genes and gene modules. Our module-based analysis identifies a common theme of immune system processes in each cancer study, with modules statistically enriched for immune response processes as well as targets of key immune response regulators from the interferon regulatory factor (IRF) and signal transducer and activator of transcription (STAT) families. Comparison of the inferred regulatory networks from each cancer type identified a core regulatory network that included genes involved in chromatin remodeling, cell cycle, and immune response. Regulatory network hubs included genes with known roles in specific cancer types as well as genes with potentially novel roles in different cancer types. Overall, our integrated module and network analysis recapitulated known themes in cancer biology and additionally revealed novel regulatory hubs that suggest a complex interplay of immune response, cell

  1. Evaluating a common semi-mechanistic mathematical model of gene-regulatory networks

    PubMed Central

    2015-01-01

    Modeling and simulation of gene-regulatory networks (GRNs) has become an important aspect of modern systems biology investigations into mechanisms underlying gene regulation. A key challenge in this area is the automated inference (reverse-engineering) of dynamic, mechanistic GRN models from gene expression time-course data. Common mathematical formalisms for representing such models capture two aspects simultaneously within a single parameter: (1) Whether or not a gene is regulated, and if so, the type of regulator (activator or repressor), and (2) the strength of influence of the regulator (if any) on the target or effector gene. To accommodate both roles, "generous" boundaries or limits for possible values of this parameter are commonly allowed in the reverse-engineering process. This approach has several important drawbacks. First, in the absence of good guidelines, there is no consensus on what limits are reasonable. Second, because the limits may vary greatly among different reverse-engineering experiments, the concrete values obtained for the models may differ considerably, and thus it is difficult to compare models. Third, if high values are chosen as limits, the search space of the model inference process becomes very large, adding unnecessary computational load to the already complex reverse-engineering process. In this study, we demonstrate that restricting the limits to the [−1, +1] interval is sufficient to represent the essential features of GRN systems and offers a reduction of the search space without loss of quality in the resulting models. To show this, we have carried out reverse-engineering studies on data generated from artificial and experimentally determined from real GRN systems. PMID:26356485

  2. kmer-SVM: a web server for identifying predictive regulatory sequence features in genomic data sets

    PubMed Central

    Fletez-Brant, Christopher; Lee, Dongwon; McCallion, Andrew S.; Beer, Michael A.

    2013-01-01

    Massively parallel sequencing technologies have made the generation of genomic data sets a routine component of many biological investigations. For example, Chromatin immunoprecipitation followed by sequence assays detect genomic regions bound (directly or indirectly) by specific factors, and DNase-seq identifies regions of open chromatin. A major bottleneck in the interpretation of these data is the identification of the underlying DNA sequence code that defines, and ultimately facilitates prediction of, these transcription factor (TF) bound or open chromatin regions. We have recently developed a novel computational methodology, which uses a support vector machine (SVM) with kmer sequence features (kmer-SVM) to identify predictive combinations of short transcription factor-binding sites, which determine the tissue specificity of these genomic assays (Lee, Karchin and Beer, Discriminative prediction of mammalian enhancers from DNA sequence. Genome Res. 2011; 21:2167–80). This regulatory information can (i) give confidence in genomic experiments by recovering previously known binding sites, and (ii) reveal novel sequence features for subsequent experimental testing of cooperative mechanisms. Here, we describe the development and implementation of a web server to allow the broader research community to independently apply our kmer-SVM to analyze and interpret their genomic datasets. We analyze five recently published data sets and demonstrate how this tool identifies accessory factors and repressive sequence elements. kmer-SVM is available at http://kmersvm.beerlab.org. PMID:23771147

  3. Identification of cis-acting repressive sequences within the negative regulatory element of human immunodeficiency virus type 1.

    PubMed Central

    Lu, Y C; Touzjian, N; Stenzel, M; Dorfman, T; Sodroski, J G; Haseltine, W A

    1990-01-01

    The negative regulatory element of human immunodeficiency virus type 1 is a 260-nucleotide-long sequence that decreases the rate of RNA transcription initiation specified by the long terminal repeat. This region has the potential to bind several cellular transcription factors. Here it is shown that sequences which recognize the NFAT-1 and USF cellular transcription factors contribute to this negative regulatory effect. The sequences within the negative regulatory element which resemble the AP-1 site and the URS do not negatively regulate human immunodeficiency virus long terminal repeat transcription initiation. PMID:2398545

  4. Heme binds to a short sequence that serves a regulatory function in diverse proteins.

    PubMed Central

    Zhang, L; Guarente, L

    1995-01-01

    Heme is a prosthetic group for numerous enzymes, cytochromes and globins, and it binds tightly, sometimes covalently, to these proteins. Interestingly, heme also potentiates binding of the yeast transcriptional activator HAP1 to DNA and inhibits mitochondrial import of the mammalian delta-aminolevulinate synthase (ALAS) and the catalytic activity of the reticulocyte kinase, HRI. All three of these proteins contain a short sequence, the heme regulatory motif (HRM), that occurs six times adjacent to the HAP1 DNA binding domain, twice in the leader targeting sequence of ALAS and twice near the catalytic domain of the HRI kinase. Here we show that a 10 amino acid peptide containing the HRM consensus binds to heme in the micromolar range, and shifts the heme absorption spectrum to a longer wavelength, a direction opposite to the change caused by cytochromes or globins. Further, we show that a single HRM regulates the acidic activation domains of HAP1 and GAL4 independently of regulation of DNA binding of the transcription factors. These findings thus establish a novel heme binding sequence which is structurally distinct from sequences in globins or cytochromes and which has a regulatory function. Images PMID:7835342

  5. Cloning and nucleotide sequence of luxR, a regulatory gene controlling bioluminescence in Vibrio harveyi.

    PubMed Central

    Showalter, R E; Martin, M O; Silverman, M R

    1990-01-01

    Mutagenesis with transposon mini-Mulac was used previously to identify a regulatory locus necessary for expression of bioluminescence genes, lux, in Vibrio harveyi (M. Martin, R. Showalter, and M. Silverman, J. Bacteriol. 171:2406-2414, 1989). Mutants with transposon insertions in this regulatory locus were used to construct a hybridization probe which was used in this study to detect recombinants in a cosmid library containing the homologous DNA. Recombinant cosmids with this DNA stimulated expression of the genes encoding enzymes for luminescence, i.e., the luxCDABE operon, which were positioned in trans on a compatible replicon in Escherichia coli. Transposon mutagenesis and analysis of the DNA sequence of the cloned DNA indicated that regulatory function resided in a single gene of about 0.6-kilobases named luxR. Expression of bioluminescence in V. harveyi and in the fish light-organ symbiont Vibrio fischeri is controlled by density-sensing mechanisms involving the accumulation of small signal molecules called autoinducers, but similarity of the two luminescence systems at the molecular level was not apparent in this study. The amino acid sequence of the LuxR product of V. harveyi, which indicates a structural relationship to some DNA-binding proteins, is not similar to the sequence of the protein that regulates expression of luminescence in V. fischeri. In addition, reconstitution of autoinducer-controlled luminescence in recombinant E. coli, already achieved with lux genes cloned from V. fischeri, was not accomplished with the isolation of luxR from V. harveyi, suggesting a requirement for an additional regulatory component. PMID:2160932

  6. IL-10-Producing Regulatory B Cells Are Decreased in Patients with Common Variable Immunodeficiency

    PubMed Central

    Costa, Priscilla Ramos; Barros, Myrthes Toledo; Kalil, Jorge; Kokron, Cristina Maria

    2016-01-01

    Common variable immunodeficiency (CVID) is the most prevalent symptomatic primary immunodeficiency in adults. CVID patients often present changes in the frequency and function of B lymphocytes, reduced number of Treg cells, chronic immune activation, recurrent infections, high incidence of autoimmunity and increased risk for malignancies. We hypothesized that the frequency of B10 cells would be diminished in CVID patients because these cells play an important role in the development of Treg cells and in the control of T cell activation and autoimmunity. Therefore, we evaluated the frequency of B10 cells in CVID patients and correlated it with different clinical and immunological characteristics of this disease. Forty-two CVID patients and 17 healthy controls were recruited for this study. Cryopreserved PBMCs were used for analysis of T cell activation, frequency of Treg cells and characterization of B10 cells by flow cytometry. IL-10 production by sorted B cells culture and plasma sCD14 were determined by ELISA. We found that CVID patients presented decreased frequency of IL-10-producing CD24hiCD38hi B cells in different cell culture conditions and decreased frequency of IL-10-producing CD24hiCD27+ B cells stimulated with CpG+PIB. Moreover, we found that CVID patients presented lower secretion of IL-10 by sorting-purified B cells when compared to healthy controls. The frequency of B10 cells had no correlation with autoimmunity, immune activation and Treg cells in CVID patients. This work suggests that CVID patients have a compromised regulatory B cell compartment which is not correlated with clinical and immunological characteristics presented by these individuals. PMID:26991898

  7. Insights from exome sequencing in common and rare human endocrine disorders

    PubMed Central

    Dauber, Andrew; de Bruin, Christiaan

    2016-01-01

    Exome sequencing has emerged in recent years as a rapid and effective tool for the elucidation of genetic defects underlying both rare and common human disease. Increased availability and decreased costs of next generation sequencing has enabled researchers worldwide to use this approach not only in individual patients with rare diseases, but also to screen larger cohorts or populations for genetic determinants of disease. Within the field of endocrinology, exome sequencing has led to significant advancements in our understanding of numerous disorders including adrenal disease, growth and pubertal disorders, type 2 diabetes, as well as a multitude of rare genetic syndromes with prominent endocrine involvement. In this review, we aim to provide an overview of these recent new insights and discuss the role that exome sequencing is expected to play in endocrine research and clinical practice in the coming years. PMID:25963271

  8. Conserved Noncoding Sequences Highlight Shared Components of Regulatory Networks in Dicotyledonous Plants[W

    PubMed Central

    Baxter, Laura; Jironkin, Aleksey; Hickman, Richard; Moore, Jay; Barrington, Christopher; Krusche, Peter; Dyer, Nigel P.; Buchanan-Wollaston, Vicky; Tiskin, Alexander; Beynon, Jim; Denby, Katherine; Ott, Sascha

    2012-01-01

    Conserved noncoding sequences (CNSs) in DNA are reliable pointers to regulatory elements controlling gene expression. Using a comparative genomics approach with four dicotyledonous plant species (Arabidopsis thaliana, papaya [Carica papaya], poplar [Populus trichocarpa], and grape [Vitis vinifera]), we detected hundreds of CNSs upstream of Arabidopsis genes. Distinct positioning, length, and enrichment for transcription factor binding sites suggest these CNSs play a functional role in transcriptional regulation. The enrichment of transcription factors within the set of genes associated with CNS is consistent with the hypothesis that together they form part of a conserved transcriptional network whose function is to regulate other transcription factors and control development. We identified a set of promoters where regulatory mechanisms are likely to be shared between the model organism Arabidopsis and other dicots, providing areas of focus for further research. PMID:23110901

  9. Genome Sequencing of Autism-Affected Families Reveals Disruption of Putative Noncoding Regulatory DNA

    PubMed Central

    Turner, Tychele N.; Hormozdiari, Fereydoun; Duyzend, Michael H.; McClymont, Sarah A.; Hook, Paul W.; Iossifov, Ivan; Raja, Archana; Baker, Carl; Hoekzema, Kendra; Stessman, Holly A.; Zody, Michael C.; Nelson, Bradley J.; Huddleston, John; Sandstrom, Richard; Smith, Joshua D.; Hanna, David; Swanson, James M.; Faustman, Elaine M.; Bamshad, Michael J.; Stamatoyannopoulos, John; Nickerson, Deborah A.; McCallion, Andrew S.; Darnell, Robert; Eichler, Evan E.

    2016-01-01

    We performed whole-genome sequencing (WGS) of 208 genomes from 53 families affected by simplex autism. For the majority of these families, no copy-number variant (CNV) or candidate de novo gene-disruptive single-nucleotide variant (SNV) had been detected by microarray or whole-exome sequencing (WES). We integrated multiple CNV and SNV analyses and extensive experimental validation to identify additional candidate mutations in eight families. We report that compared to control individuals, probands showed a significant (p = 0.03) enrichment of de novo and private disruptive mutations within fetal CNS DNase I hypersensitive sites (i.e., putative regulatory regions). This effect was only observed within 50 kb of genes that have been previously associated with autism risk, including genes where dosage sensitivity has already been established by recurrent disruptive de novo protein-coding mutations (ARID1B, SCN2A, NR3C2, PRKCA, and DSCAM). In addition, we provide evidence of gene-disruptive CNVs (in DISC1, WNT7A, RBFOX1, and MBD5), as well as smaller de novo CNVs and exon-specific SNVs missed by exome sequencing in neurodevelopmental genes (e.g., CANX, SAE1, and PIK3CA). Our results suggest that the detection of smaller, often multiple CNVs affecting putative regulatory elements might help explain additional risk of simplex autism. PMID:26749308

  10. Two Lamprey Hedgehog Genes Share Non-Coding Regulatory Sequences and Expression Patterns with Gnathostome Hedgehogs

    PubMed Central

    Ekker, Marc; Hadzhiev, Yavor; Müller, Ferenc; Casane, Didier; Magdelenat, Ghislaine; Rétaux, Sylvie

    2010-01-01

    Hedgehog (Hh) genes play major roles in animal development and studies of their evolution, expression and function point to major differences among chordates. Here we focused on Hh genes in lampreys in order to characterize the evolution of Hh signalling at the emergence of vertebrates. Screening of a cosmid library of the river lamprey Lampetra fluviatilis and searching the preliminary genome assembly of the sea lamprey Petromyzon marinus indicate that lampreys have two Hh genes, named Hha and Hhb. Phylogenetic analyses suggest that Hha and Hhb are lamprey-specific paralogs closely related to Sonic/Indian Hh genes. Expression analysis indicates that Hha and Hhb are expressed in a Sonic Hh-like pattern. The two transcripts are expressed in largely overlapping but not identical domains in the lamprey embryonic brain, including a newly-described expression domain in the nasohypophyseal placode. Global alignments of genomic sequences and local alignment with known gnathostome regulatory motifs show that lamprey Hhs share conserved non-coding elements (CNE) with gnathostome Hhs albeit with sequences that have significantly diverged and dispersed. Functional assays using zebrafish embryos demonstrate gnathostome-like midline enhancer activity for CNEs contained in intron2. We conclude that lamprey Hh genes are gnathostome Shh-like in terms of expression and regulation. In addition, they show some lamprey-specific features, including duplication and structural (but not functional) changes in the intronic/regulatory sequences. PMID:20967201

  11. Genome Sequencing of Autism-Affected Families Reveals Disruption of Putative Noncoding Regulatory DNA.

    PubMed

    Turner, Tychele N; Hormozdiari, Fereydoun; Duyzend, Michael H; McClymont, Sarah A; Hook, Paul W; Iossifov, Ivan; Raja, Archana; Baker, Carl; Hoekzema, Kendra; Stessman, Holly A; Zody, Michael C; Nelson, Bradley J; Huddleston, John; Sandstrom, Richard; Smith, Joshua D; Hanna, David; Swanson, James M; Faustman, Elaine M; Bamshad, Michael J; Stamatoyannopoulos, John; Nickerson, Deborah A; McCallion, Andrew S; Darnell, Robert; Eichler, Evan E

    2016-01-01

    We performed whole-genome sequencing (WGS) of 208 genomes from 53 families affected by simplex autism. For the majority of these families, no copy-number variant (CNV) or candidate de novo gene-disruptive single-nucleotide variant (SNV) had been detected by microarray or whole-exome sequencing (WES). We integrated multiple CNV and SNV analyses and extensive experimental validation to identify additional candidate mutations in eight families. We report that compared to control individuals, probands showed a significant (p = 0.03) enrichment of de novo and private disruptive mutations within fetal CNS DNase I hypersensitive sites (i.e., putative regulatory regions). This effect was only observed within 50 kb of genes that have been previously associated with autism risk, including genes where dosage sensitivity has already been established by recurrent disruptive de novo protein-coding mutations (ARID1B, SCN2A, NR3C2, PRKCA, and DSCAM). In addition, we provide evidence of gene-disruptive CNVs (in DISC1, WNT7A, RBFOX1, and MBD5), as well as smaller de novo CNVs and exon-specific SNVs missed by exome sequencing in neurodevelopmental genes (e.g., CANX, SAE1, and PIK3CA). Our results suggest that the detection of smaller, often multiple CNVs affecting putative regulatory elements might help explain additional risk of simplex autism. PMID:26749308

  12. Phylogenetic Relationships and the Evolution of Regulatory Gene Sequences in the Parrotfishes

    PubMed Central

    Smith, Lydia L.; Fessler, Jennifer L.; Alfaro, Michael E.; Streelman, J. Todd; Westneat, Mark W.

    2008-01-01

    Regulatory genes control the expression of other genes and are key components of developmental processes such as segmentation and embryonic construction of the skull in vertebrates. Here we examine the variability and evolution of three vertebrate regulatory genes, addressing issues of their utility for phylogenetics and comparing the rates of genetic change seen in regulatory loci to the rates seen in other genes in the parrotfishes. The parrotfishes are a diverse group of colorful fishes from coral reefs and seagrasses worldwide and have been placed phylogenetically within the family Labridae. We tested phylogenetic hypotheses among the parrotfishes, with a focus on the genera Chlorurus and Scarus, by analyzing eight gene fragments for 42 parrotfishes and eight outgroup species. We sequenced mitochondrial 12s rRNA (967 bp), 16s rRNA (577 bp), and cytochrome b (477 bp). From the nuclear genome, we sequenced part of the protein-coding genes rag2 (715 bp), tmo4c4 (485 bp), and the developmental regulatory genes otx1 (672 bp), bmp4 (488 bp), and dlx2 (522 bp). Bayesian, likelihood, and parsimony analyses on the resulting 4903 bp of DNA sequence produced similar topologies that confirm the monophyly of the scarines and provide a phylogeny at the species level for portions of the genera Scarus and Chlorurus. Four major clades of Scarus were recovered, with three distributed in the Indo-Pacific and one containing Caribbean/Atlantic taxa. Molecular rates suggest a Miocene origin of the parrotfishes (22 mya) and a recent divergence of species within Scarus and Chlorurus, within the past 5 million years. Developmentally important genes made a significant contribution to phylogenetic structure, and rates of genetic evolution were high in bmp4, similar to other coding nuclear genes, but low in otx1 and the dlx2 exons. Synonymous and nonsynonymous substitution patterns in developmental regulatory genes support the hypothesis of stabilizing selection during the history of

  13. Functional effects of a natural polymorphism in the transcriptional regulatory sequence of HLA-DQB1.

    PubMed Central

    Beaty, J S; West, K A; Nepom, G T

    1995-01-01

    DNA sequence polymorphism in the genes encoding HLA class II proteins accounts for allelic diversity in antigen recognition and presentation and, thus, in the role of these cell surface glycoproteins as determinants of the scope of the T-cell repertoire. In addition, sequence polymorphism in the promoter-proximal transcriptional regulatory regions of these genes has been described, particularly for the HLA-DQB1 locus, where these differences may contribute to variation in locus- and allele-specific expression. In this study, we measured the effect of such regulatory sequence polymorphism on the expression of endogenous alleles of DQB1 in heterozygous cells. Quantitative reverse transcriptase-mediated PCR analysis showed that expression of the DQB1*0301 allele responded more rapidly to gamma interferon induction than that of DQB1*0302. We have analyzed functional effects of a prominent allelic polymorphism that consists of a TG dinucleotide present between the W and X1 consensus elements in the DQB1*0302 allele but missing in the DQB1*0301 allele. The dominant effect of this polymorphism was to introduce a variation in the spacing between the W and X1 elements of these two alleles. A secondary compensatory effect was specific for the TG dinucleotide itself, which was essential for the binding of a nuclear protein complex to the *0302 regulatory region immediately 5' of the X1 element. Derivatives of the DQB1 5' regulatory region were used to drive expression of the chloramphenicol acetyltransferase gene in transient transfections of human B-lymphoblastoid and gamma interferon-treated melanoma cell lines, demonstrating that the additional spacing between the W and X1 elements caused by the presence of the TG dinucleotide in the *0302 allele resulted in reduced expression compared with that driven by the *0301 fragment; this difference overshadowed an up-regulating effect on expression which corresponded to the binding of the TG-dependent nuclear protein complex. The

  14. Rapid evolution of cis-regulatory sequences via local point mutations

    NASA Technical Reports Server (NTRS)

    Stone, J. R.; Wray, G. A.

    2001-01-01

    Although the evolution of protein-coding sequences within genomes is well understood, the same cannot be said of the cis-regulatory regions that control transcription. Yet, changes in gene expression are likely to constitute an important component of phenotypic evolution. We simulated the evolution of new transcription factor binding sites via local point mutations. The results indicate that new binding sites appear and become fixed within populations on microevolutionary timescales under an assumption of neutral evolution. Even combinations of two new binding sites evolve very quickly. We predict that local point mutations continually generate considerable genetic variation that is capable of altering gene expression.

  15. Development of taxon-specific sequences of common wheat for the detection of genetically modified wheat.

    PubMed

    Iida, Mayu; Yamashiro, Satomi; Yamakawa, Hirohito; Hayakawa, Katsuyuki; Kuribara, Hideo; Kodama, Takashi; Furui, Satoshi; Akiyama, Hiroshi; Maitani, Tamio; Hino, Akihiro

    2005-08-10

    Qualitative and quantitative Polymerase Chain Reaction (PCR) systems aimed at the specific detection and quantification of common wheat DNA are described. Many countries have issued regulations to label foods that include genetically modified organisms (GMOs). PCR technology is widely recognized as a reliable and useful technique for the qualitative and quantitative detection of GMOs. Detection methods are needed to amplify a target GM gene, and the amplified results should be compared with those of the corresponding taxon-specific reference gene to obtain reliable results. This paper describes the development of a specific DNA sequence in the waxy-D1 gene for common wheat (Triticum aestivum L.) and the design of a specific primer pair and TaqMan probe on the waxy-D1 gene for PCR analysis. The primers amplified a product (Wx012) of 102 bp. It is indicated that the Wx012 DNA sequence is specific to common wheat, showing homogeneity in qualitative PCR results and very similar quantification accuracy along 19 distantly related common wheat varieties. In Southern blot and real-time PCR analyses, this sequence showed either a single or a low number of copy genes. In addition, by qualitative and quantitative PCR using wx012 primers and a wx012-T probe, the limits of detection of the common wheat genome were found to be about 15 copies, and the reproducibility was reliable. In consequence, the PCR system using wx012 primers and wx012-T probe is considered to be suitable for use as a common wheat-specific taxon-specific reference gene in DNA analyses, including GMO tests. PMID:16076109

  16. Variable Genome Sequences of the Murine Pneumotropic Virus (Polyomaviridae) Regulatory Region Isolated from an Infected Mouse Tissue Viral Suspension

    PubMed Central

    Libbey, Jane E.

    2016-01-01

    The murine pneumotropic virus genome, isolated from an infected murine tissue homogenate, was sequenced to completion. The lungs, liver, spleen, and kidneys were the source of the tissue homogenate in order to mirror the heterogeneity of the virus population in vivo. The regulatory region sequence was found to be highly variable. PMID:27231357

  17. Variable Genome Sequences of the Murine Pneumotropic Virus (Polyomaviridae) Regulatory Region Isolated from an Infected Mouse Tissue Viral Suspension.

    PubMed

    Libbey, Jane E; Fujinami, Robert S

    2016-01-01

    The murine pneumotropic virus genome, isolated from an infected murine tissue homogenate, was sequenced to completion. The lungs, liver, spleen, and kidneys were the source of the tissue homogenate in order to mirror the heterogeneity of the virus population in vivo The regulatory region sequence was found to be highly variable. PMID:27231357

  18. Origins of Transcriptional Transition: Balance between Upstream and Downstream Regulatory Gene Sequences

    PubMed Central

    Sala, Adrien; Shoaib, Muhammad; Anufrieva, Olga; Mutharasu, Gnanavel; Yli-Harja, Olli

    2015-01-01

    ABSTRACT By measuring individual mRNA production at the single-cell level, we investigated the lac promoter’s transcriptional transition during cell growth phases. In exponential phase, variation in transition rates generates two mixed phenotypes, low and high numbers of mRNAs, by modulating their burst frequency and sizes. Independent activation of the regulatory-gene sequence does not produce bimodal populations at the mRNA level, but bimodal populations are produced when the regulatory gene is activated coordinately with the upstream and downstream region promoter sequence (URS and DRS, respectively). Time-lapse microscopy of mRNAs for lac and a variant lac promoter confirm this observation. Activation of the URS/DRS elements of the promoter reveals a counterplay behavior during cell phases. The promoter transition rate coupled with cell phases determines the mRNA and transcriptional noise. We further show that bias in partitioning of RNA does not lead to phenotypic switching. Our results demonstrate that the balance between the URS and the DRS in transcriptional regulation determines population diversity. PMID:25626902

  19. Common and small molecules as the ultimate regulatory and effector mediators of antigen-specific transplantation reactions

    PubMed Central

    Holan, Vladimir; Krulova, Magdalena

    2013-01-01

    In spite of intensive research, the molecular basis of allograft and xenograft rejection still remains not fully understood. The acute rejection of an allograft is associated with the intragraft Th1 cytokine response, while tolerance of an allograft or xenograft rejection is accompanied by a higher production of the Th2 cytokines interleukin (IL)-4 and IL-10. Nevertheless, these cytokines are not the final regulatory and effector molecules mediating transplantation reactions. Data indicate that the functioning of common molecules with enzymatic activities, such are inducible nitric oxide synthase (iNOS), arginase, heme oxygenase-1 (HO-1) or indoleamine-2,3-dioxygenase (IDO), the bioavailability of their substrates (L-arginine, tryptophan, heme) and the cytotoxic and regulatory actions of their small gaseous products (NO, CO) can be the ultimate mechanisms responsible for effector or regulatory reactions. Using models of transplantation immunity and tolerance we show that T cell receptor-mediated recognition of allogeneic or xenogeneic antigens as well as the balance between immunity/tolerance induces distinct cytokine production profiles. The ratio between Th1 and Th2 cytokines efficiently regulates the expression of genes for common enzymes, such as iNOS, arginase, HO-1 and IDO. These enzymes may compete for substrates, such as L-arginine or tryptophan, and the final product of their activity are small molecules (NO, CO) displaying effector or regulatory functions of the immune system. Thus, it is suggested that in spite of the high immunological specificity of transplatation reaction, the ultimate players in regulatory and effector functions could be small and common molecules. PMID:24392309

  20. Sequencing rare and common APOL1 coding variants to determine kidney disease risk.

    PubMed

    Limou, Sophie; Nelson, George W; Lecordier, Laurence; An, Ping; O'hUigin, Colm S; David, Victor A; Binns-Roemer, Elizabeth A; Guiblet, Wilfried M; Oleksyk, Taras K; Pays, Etienne; Kopp, Jeffrey B; Winkler, Cheryl A

    2015-10-01

    A third of African Americans with sporadic focal segmental glomerulosclerosis (FSGS) or HIV-associated nephropathy (HIVAN) do not carry APOL1 renal risk genotypes. This raises the possibility that other APOL1 variants may contribute to kidney disease. To address this question, we sequenced all APOL1 exons in 1437 Americans of African and European descent, including 464 patients with biopsy-proven FSGS/HIVAN. Testing for association with 33 common and rare variants with FSGS/HIVAN revealed no association independent of strong recessive G1 and G2 effects. Seeking additional variants that might have been under selection by pathogens and could represent candidates for kidney disease risk, we also sequenced an additional 1112 individuals representing 53 global populations. Except for G1 and G2, none of the 7 common codon-altering variants showed evidence of selection or could restore lysis against trypanosomes causing human African trypanosomiasis. Thus, only APOL1 G1 and G2 confer renal risk, and other common and rare APOL1 missense variants, including the archaic G3 haplotype, do not contribute to sporadic FSGS and HIVAN in the US population. Hence, in most potential clinical or screening applications, our study suggests that sequencing APOL1 exons is unlikely to bring additional information compared to genotyping only APOL1 G1 and G2 risk alleles. PMID:25993319

  1. Expanding the nitrogen regulatory protein superfamily: Homology detection at below random sequence identity.

    PubMed

    Kinch, Lisa N; Grishin, Nick V

    2002-07-01

    Nitrogen regulatory (PII) proteins are signal transduction molecules involved in controlling nitrogen metabolism in prokaryots. PII proteins integrate the signals of intracellular nitrogen and carbon status into the control of enzymes involved in nitrogen assimilation. Using elaborate sequence similarity detection schemes, we show that five clusters of orthologs (COGs) and several small divergent protein groups belong to the PII superfamily and predict their structure to be a (betaalphabeta)(2) ferredoxin-like fold. Proteins from the newly emerged PII superfamily are present in all major phylogenetic lineages. The PII homologs are quite diverse, with below random (as low as 1%) pairwise sequence identities between some members of distant groups. Despite this sequence diversity, evidence suggests that the different subfamilies retain the PII trimeric structure important for ligand-binding site formation and maintain a conservation of conservations at residue positions important for PII function. Because most of the orthologous groups within the PII superfamily are composed entirely of hypothetical proteins, our remote homology-based structure prediction provides the only information about them. Analogous to structural genomics efforts, such prediction gives clues to the biological roles of these proteins and allows us to hypothesize about locations of functional sites on model structures or rationalize about available experimental information. For instance, conserved residues in one of the families map in close proximity to each other on PII structure, allowing for a possible metal-binding site in the proteins coded by the locus known to affect sensitivity to divalent metal ions. Presented analysis pushes the limits of sequence similarity searches and exemplifies one of the extreme cases of reliable sequence-based structure prediction. In conjunction with structural genomics efforts to shed light on protein function, our strategies make it possible to detect

  2. Detecting Functional Divergence after Gene Duplication through Evolutionary Changes in Posttranslational Regulatory Sequences

    PubMed Central

    Nguyen Ba, Alex N.; Strome, Bob; Hua, Jun Jie; Desmond, Jonathan; Gagnon-Arsenault, Isabelle; Weiss, Eric L.; Landry, Christian R.; Moses, Alan M.

    2014-01-01

    Gene duplication is an important evolutionary mechanism that can result in functional divergence in paralogs due to neo-functionalization or sub-functionalization. Consistent with functional divergence after gene duplication, recent studies have shown accelerated evolution in retained paralogs. However, little is known in general about the impact of this accelerated evolution on the molecular functions of retained paralogs. For example, do new functions typically involve changes in enzymatic activities, or changes in protein regulation? Here we study the evolution of posttranslational regulation by examining the evolution of important regulatory sequences (short linear motifs) in retained duplicates created by the whole-genome duplication in budding yeast. To do so, we identified short linear motifs whose evolutionary constraint has relaxed after gene duplication with a likelihood-ratio test that can account for heterogeneity in the evolutionary process by using a non-central chi-squared null distribution. We find that short linear motifs are more likely to show changes in evolutionary constraints in retained duplicates compared to single-copy genes. We examine changes in constraints on known regulatory sequences and show that for the Rck1/Rck2, Fkh1/Fkh2, Ace2/Swi5 paralogs, they are associated with previously characterized differences in posttranslational regulation. Finally, we experimentally confirm our prediction that for the Ace2/Swi5 paralogs, Cbk1 regulated localization was lost along the lineage leading to SWI5 after gene duplication. Our analysis suggests that changes in posttranslational regulation mediated by short regulatory motifs systematically contribute to functional divergence after gene duplication. PMID:25474245

  3. Sequencing and Analysis of Common Bean ESTs. Building a Foundation for Functional Genomics1[w

    PubMed Central

    Ramírez, Mario; Graham, Michelle A.; Blanco-López, Lourdes; Silvente, Sonia; Medrano-Soto, Arturo; Blair, Matthew W.; Hernández, Georgina; Vance, Carroll P.; Lara, Miguel

    2005-01-01

    Although common bean (Phaseolus vulgaris) is the most important grain legume in the developing world for human consumption, few genomic resources exist for this species. The objectives of this research were to develop expressed sequence tag (EST) resources for common bean and assess nodule gene expression through high-density macroarrays. We sequenced a total of 21,026 ESTs derived from 5 different cDNA libraries, including nitrogen-fixing root nodules, phosphorus-deficient roots, developing pods, and leaves of the Mesoamerican genotype, Negro Jamapa 81. The fifth source of ESTs was a leaf cDNA library derived from the Andean genotype, G19833. Of the total high-quality sequences, 5,703 ESTs were classified as singletons, while 10,078 were assembled into 2,226 contigs producing a nonredundant set of 7,969 different transcripts. Sequences were grouped according to 4 main categories, metabolism (34%), cell cycle and plant development (11%), interaction with the environment (19%), and unknown function (36%), and further subdivided into 15 subcategories. Comparisons to other legume EST projects suggest that an entirely different repertoire of genes is expressed in common bean nodules. Phaseolus-specific contigs, gene families, and single nucleotide polymorphisms were also identified from the EST collection. Functional aspects of individual bean organs were reflected by the 20 contigs from each library composed of the most redundant ESTs. The abundance of transcripts corresponding to selected contigs was evaluated by RNA blots to determine whether gene expression determined by laboratory methods correlated with in silico expression. Evaluation of root nodule gene expression by macroarrays and RNA blots showed that genes related to nitrogen and carbon metabolism are integrated for ureide production. Resources developed in this project provide genetic and genomic tools for an international consortium devoted to bean improvement. PMID:15824284

  4. Development of polymorphic expressed sequence tag-single sequence repeat markers in the common Chinese cuttlefish, Sepiella maindroni.

    PubMed

    Li, R H; Lu, S K; Zhang, C L; Song, W W; Mu, C K; Wang, C L

    2014-01-01

    The common Chinese cuttlefish (Sepiella maindroni) is one of the popular edible cephalopod consumed across Asia. To facilitate the population genetic investigation of this species, we developed fourteen polymorphic microsatellite makers from expressed sequence tags of S. maindroni. The number of alleles at each locus ranged from 6 to 10 with an average of 7.9 alleles per locus. The ranges of observed and expected heterozygosity were from 0.615 to 0.962 and 0.685 to 0.888, respectively. Four loci were found deviated significantly from Hardy-Weinberg equilibrium. The polymorphism information content ranged from 0.638 to 0.833. These polymorphic microsatellite loci will be helpful for the population genetic, genetic linkage map, and other genetic studies of S. maindroni. PMID:25117305

  5. Upstream regulatory regions required to stabilize binding to the TATA sequence in an adenovirus early promoter.

    PubMed

    Garcia, J; Wu, F; Gaynor, R

    1987-10-26

    Of the five early adenovirus promoters, the early region 3 (E3) promoter is one of the most strongly induced by the E1A protein. To identify cellular proteins involved in both the basal and E1A-induced transcriptional regulation of the E3 promoter, DNase I footprinting using partially purified Hela cell extracts was performed. Four regions of the E3 promoter serve as binding domains for cellular proteins. These regions are found between -156 to -179 (site IV), -83 to -103 (site III), -47 to -67 (site II), and -16 to -37 (site I), relative to the start of transcription. Examination of the DNA sequences in each binding domain suggests that site III likely serves as a binding site for activator protein 1 (AP-1), site II for the cyclic AMP regulatory element binding protein (CREB), and site I for a TATA binding factor. The factors binding to either site II or III were sufficient to stabilize binding to the TATA sequence (site I). Mutagenesis studies indicated that both sites II and III, in addition to site I, are needed for complete basal and E1A-induced transcription. These results suggest that multiple cellular factors are involved in both the basal and E1A-induced transcriptional regulation of the E3 promoter, and that either of two upstream regions are capable of stabilizing factor binding to the TATA sequence. PMID:2959908

  6. A transcriptional regulatory element in the coding sequence of the human Bcl-2 gene

    PubMed Central

    Lang, Georgina; Gombert, Wendy M; Gould, Hannah J

    2005-01-01

    We investigated the protein-binding sites in a DNAse I hypersensitive site associated with bcl-2 gene expression in human B cells. We mapped this hypersensitive site to the coding sequence of exon 2 of the bcl-2 gene in the bcl-2-expressing REH B-cell line. Electrophoretic mobility shift assays (EMSAs) with extracts from REH cells revealed three previously unrecognized B-Myb-binding sites in this sequence. The protein was identified as B-Myb by using a specific antibody and EMSAs. Accordingly, the levels of B-Myb and bcl-2 proteins, and of Myb EMSA activity, were correlated over a wide range of cell lines, representing different stages of B-cell development. Transfection of REH cells with antisense B-myb down-regulated EMSA activity and the level of bcl-2, and led to the apoptosis of REH cells. Transfection of the bcl-2-non-expressing RPMI 8226 cell line with a B-Myb expression vector induced B-Myb EMSA activity and the expression of bcl-2. Reporter assays indicated that the HSS8 sequence containing the three B-Myb sites may act as an enhancer when it is linked to the bcl-2 gene promoter. Interaction of B-Myb with HSS8 may enhance bcl-2 gene expression by co-operating with positive regulatory elements (e.g. previously identified B-Myb response elements) or silencing negative response elements in the bcl-2 gene promoter. PMID:15606792

  7. Upstream regulatory regions required to stabilize binding to the TATA sequence in an adenovirus early promoter.

    PubMed Central

    Garcia, J; Wu, F; Gaynor, R

    1987-01-01

    Of the five early adenovirus promoters, the early region 3 (E3) promoter is one of the most strongly induced by the E1A protein. To identify cellular proteins involved in both the basal and E1A-induced transcriptional regulation of the E3 promoter, DNase I footprinting using partially purified Hela cell extracts was performed. Four regions of the E3 promoter serve as binding domains for cellular proteins. These regions are found between -156 to -179 (site IV), -83 to -103 (site III), -47 to -67 (site II), and -16 to -37 (site I), relative to the start of transcription. Examination of the DNA sequences in each binding domain suggests that site III likely serves as a binding site for activator protein 1 (AP-1), site II for the cyclic AMP regulatory element binding protein (CREB), and site I for a TATA binding factor. The factors binding to either site II or III were sufficient to stabilize binding to the TATA sequence (site I). Mutagenesis studies indicated that both sites II and III, in addition to site I, are needed for complete basal and E1A-induced transcription. These results suggest that multiple cellular factors are involved in both the basal and E1A-induced transcriptional regulation of the E3 promoter, and that either of two upstream regions are capable of stabilizing factor binding to the TATA sequence. Images PMID:2959908

  8. Mitochondrial DNA and nuclear DNA from normal rat liver have a common sequence.

    PubMed Central

    Hadler, H I; Dimitrijevic, B; Mahalingam, R

    1983-01-01

    Although Pst I does not cut the circular mitochondrial genome of the rat, BamHI generates from this genome two unequal fragments of DNA. Each of these fragments was cloned in pBR322. Nuclear DNA was digested from rat liver singly or doubly with Pst I and BamHI, and it was demonstrated that nuclear DNA shared a common sequence with the larger mitochondrial DNA BamHI fragment. The cloned larger mitochondrial DNA fragment was further subdivided with HindIII into four pieces that were labeled and then used to probe the double-digested nuclear DNA. The hybridization data showed that the common sequence is less than 3 kilobase pairs long and lies within the part of the mitochondrial genome containing the D-loop and a portion of the rRNA genes. It therefore appears that, as in lower eukaryotes, there are shared sequences between the nuclear and mitochondrial genomes in mammals. Images PMID:6579536

  9. Sequencing of SCN5A identifies rare and common variants associated with cardiac conduction

    PubMed Central

    Magnani, Jared W.; Brody, Jennifer A.; Prins, Bram P.; Arking, Dan E.; Lin, Honghuang; Yin, Xiaoyan; Liu, Ching-Ti; Morrison, Alanna C.; Zhang, Feng; Spector, Tim D.; Alonso, Alvaro; Bis, Joshua C.; Heckbert, Susan R.; Lumley, Thomas; Sitlani, Colleen M.; Cupples, L. Adrienne; Lubitz, Steven A.; Soliman, Elsayed Z.; Pulit, Sara L.; Newton-Cheh, Christopher; O'Donnell, Christopher J.; Ellinor, Patrick T.; Benjamin, Emelia J.; Muzny, Donna M.; Gibbs, Richard A.; Santibanez, Jireh; Taylor, Herman A.; Rotter, Jerome I.; Lange, Leslie A.; Psaty, Bruce M.; Jackson, Rebecca; Rich, Stephen S.; Boerwinkle, Eric; Jamshidi, Yalda; Sotoodehnia, Nona

    2014-01-01

    Background The cardiac sodium channel SCN5A regulates atrioventricular and ventricular conduction. Genetic variants in this gene are associated with PR and QRS intervals. We sought to further characterize the contribution of rare and common coding variation in SCN5A to cardiac conduction. Methods and Results In the Cohorts for Heart and Aging Research in Genomic Epidemiology Targeted Sequencing Study (CHARGE), we performed targeted exonic sequencing of SCN5A (n=3699, European-ancestry individuals) and identified 4 common (minor allele frequency >1%) and 157 rare variants. Common and rare SCN5A coding variants were examined for association with PR and QRS intervals through meta-analysis of European ancestry participants from CHARGE, NHLBI’s Exome Sequencing Project (ESP, n=607) and the UK10K (n=1275) and by examining ESP African-ancestry participants (N=972). Rare coding SCN5A variants in aggregate were associated with PR interval in European and African-ancestry participants (P=1.3×10−3). Three common variants were associated with PR and/or QRS interval duration among European-ancestry participants and one among African-ancestry participants. These included two well-known missense variants; rs1805124 (H558R) was associated with PR and QRS shortening in European-ancestry participants (P=6.25×10−4 and P=5.2×10−3 respectively) and rs7626962 (S1102Y) was associated with PR shortening in those of African ancestry (P=2.82×10−3). Among European-ancestry participants, two novel synonymous variants, rs1805126 and rs6599230, were associated with cardiac conduction. Our top signal, rs1805126 was associated with PR and QRS lengthening (P=3.35×10−7 and P=2.69×10−4 respectively), and rs6599230 was associated with PR shortening (P=2.67×10−5). Conclusions By sequencing SCN5A, we identified novel common and rare coding variants associated with cardiac conduction. PMID:24951663

  10. Mining the LIPG allelic spectrum reveals the contribution of rare and common regulatory variants to HDL cholesterol.

    PubMed

    Khetarpal, Sumeet A; Edmondson, Andrew C; Raghavan, Avanthi; Neeli, Hemanth; Jin, Weijun; Badellino, Karen O; Demissie, Serkalem; Manning, Alisa K; DerOhannessian, Stephanie L; Wolfe, Megan L; Cupples, L Adrienne; Li, Mingyao; Kathiresan, Sekar; Rader, Daniel J

    2011-12-01

    Genome-wide association studies (GWAS) have successfully identified loci associated with quantitative traits, such as blood lipids. Deep resequencing studies are being utilized to catalogue the allelic spectrum at GWAS loci. The goal of these studies is to identify causative variants and missing heritability, including heritability due to low frequency and rare alleles with large phenotypic impact. Whereas rare variant efforts have primarily focused on nonsynonymous coding variants, we hypothesized that noncoding variants in these loci are also functionally important. Using the HDL-C gene LIPG as an example, we explored the effect of regulatory variants identified through resequencing of subjects at HDL-C extremes on gene expression, protein levels, and phenotype. Resequencing a portion of the LIPG promoter and 5' UTR in human subjects with extreme HDL-C, we identified several rare variants in individuals from both extremes. Luciferase reporter assays were used to measure the effect of these rare variants on LIPG expression. Variants conferring opposing effects on gene expression were enriched in opposite extremes of the phenotypic distribution. Minor alleles of a common regulatory haplotype and noncoding GWAS SNPs were associated with reduced plasma levels of the LIPG gene product endothelial lipase (EL), consistent with its role in HDL-C catabolism. Additionally, we found that a common nonfunctional coding variant associated with HDL-C (rs2000813) is in linkage disequilibrium with a 5' UTR variant (rs34474737) that decreases LIPG promoter activity. We attribute the gene regulatory role of rs34474737 to the observed association of the coding variant with plasma EL levels and HDL-C. Taken together, the findings show that both rare and common noncoding regulatory variants are important contributors to the allelic spectrum in complex trait loci. PMID:22174694

  11. Mining the LIPG Allelic Spectrum Reveals the Contribution of Rare and Common Regulatory Variants to HDL Cholesterol

    PubMed Central

    Raghavan, Avanthi; Neeli, Hemanth; Jin, Weijun; Badellino, Karen O.; Demissie, Serkalem; Manning, Alisa K.; DerOhannessian, Stephanie L.; Wolfe, Megan L.; Cupples, L. Adrienne; Li, Mingyao; Kathiresan, Sekar; Rader, Daniel J.

    2011-01-01

    Genome-wide association studies (GWAS) have successfully identified loci associated with quantitative traits, such as blood lipids. Deep resequencing studies are being utilized to catalogue the allelic spectrum at GWAS loci. The goal of these studies is to identify causative variants and missing heritability, including heritability due to low frequency and rare alleles with large phenotypic impact. Whereas rare variant efforts have primarily focused on nonsynonymous coding variants, we hypothesized that noncoding variants in these loci are also functionally important. Using the HDL-C gene LIPG as an example, we explored the effect of regulatory variants identified through resequencing of subjects at HDL-C extremes on gene expression, protein levels, and phenotype. Resequencing a portion of the LIPG promoter and 5′ UTR in human subjects with extreme HDL-C, we identified several rare variants in individuals from both extremes. Luciferase reporter assays were used to measure the effect of these rare variants on LIPG expression. Variants conferring opposing effects on gene expression were enriched in opposite extremes of the phenotypic distribution. Minor alleles of a common regulatory haplotype and noncoding GWAS SNPs were associated with reduced plasma levels of the LIPG gene product endothelial lipase (EL), consistent with its role in HDL-C catabolism. Additionally, we found that a common nonfunctional coding variant associated with HDL-C (rs2000813) is in linkage disequilibrium with a 5′ UTR variant (rs34474737) that decreases LIPG promoter activity. We attribute the gene regulatory role of rs34474737 to the observed association of the coding variant with plasma EL levels and HDL-C. Taken together, the findings show that both rare and common noncoding regulatory variants are important contributors to the allelic spectrum in complex trait loci. PMID:22174694

  12. Sequence variation in the androgen receptor gene is not a common determinant of male sexual orientation.

    PubMed Central

    Macke, J P; Hu, N; Hu, S; Bailey, M; King, V L; Brown, T; Hamer, D; Nathans, J

    1993-01-01

    To test the hypothesis that DNA sequence variation in the androgen receptor gene plays a causal role in the development of male sexual orientation, we have (1) measured the degree of concordance of androgen receptor alleles in 36 pairs of homosexual brothers, (2) compared the lengths of polyglutamine and polyglycine tracts in the amino-terminal domain of the androgen receptor in a sample of 197 homosexual males and 213 unselected subjects, and (3) screened the the entire androgen receptor coding region for sequence variation by PCR and denaturing gradient-gel electrophoresis (DGGE) and/or single-strand conformation polymorphism analysis in 20 homosexual males with homosexual or bisexual brothers and one homosexual male with no homosexual brothers, and screened the amino-terminal domain of the receptor for sequence variation in an additional 44 homosexual males, 37 of whom had one or more first- or second-degree male relatives who were either homosexual or bisexual. These analyses show that (1) homosexual brothers are as likely to be discordant as concordant for androgen receptor alleles; (2) there are no large-scale differences between the distributions of polyglycine or polyglutamine tract lengths in the homosexual and control groups; and (3) coding region sequence variation is not commonly found within the androgen receptor gene of homosexual men. The DGGE screen identified two rare amino acid substitutions, ser205-to-arg and glu793-to-asp, the biological significance of which is unknown. Images Figure 2 PMID:8213813

  13. Sequence variation in the androgen receptor gene is not a common determinant of male sexual orientation

    SciTech Connect

    Macke, J.P.; Nathans, J.; King, V.L. ); Hu, N.; Hu, S.; Hamer, D.; Bailey, M. ); Brown, T. )

    1993-10-01

    To test the hypothesis that DNA sequence variation in the androgen receptor gene plays a causal role in the development of male sexual orientation, the authors have (1) measured the degree of concordance of androgen receptor alleles in 36 pairs of homosexual brothers, (2) compared the lengths of polyglutamine and polyglycine tracts in the amino-terminal domain of the androgen receptor in a sample of 197 homosexual males and 213 unselected subjects, and (3) screened the entire androgen receptor coding region for sequence variation by PCR and denaturing gradient-gel electrophoresis (DGGE) and/or single-strand conformation polymorphism analysis in 20 homosexual males with homosexual or bisexual brothers and one homosexual male with no homosexual brothers, and screened the amino-terminal domain of the receptor for sequence variation in an additional 44 homosexual males, 37 of whom had one or more first- or second-degree male relatives who were either homosexual or bisexual. These analyses show that (1) homosexual brothers are as likely to be discordant as concordant for androgen receptor alleles; (2) there are no large-scale differences between the distributions of polyglycine or polyglutamine tract lengths in the homosexual and control groups; and (3) coding region sequence variation is not commonly found within the androgen receptor gene of homosexual men. The DGGE screen identified two rare amino acid substitutions, ser[sup 205] -to-arg and glu[sup 793]-to-asp, the biological significance of which is unknown. 32 refs., 2 figs., 2 tabs.

  14. Regulatory sequences of Arabidopsis drive reporter gene expression in nematode feeding structures.

    PubMed Central

    Barthels, N; van der Lee, F M; Klap, J; Goddijn, O J; Karimi, M; Puzio, P; Grundler, F M; Ohl, S A; Lindsey, K; Robertson, L; Robertson, W M; Van Montagu, M; Gheysen, G; Sijmons, P C

    1997-01-01

    In the quest for plant regulatory sequences capable of driving nematode-triggered effector gene expression in feeding structures, we show that promoter tagging is a valuable tool. A large collection of transgenic Arabidopsis plants was generated. They were transformed with a beta-glucuronidase gene functioning as a promoter tag. Three T-DNA constructs, pGV1047, p delta gusBin19, and pMOG553, were used. Early responses to nematode invasion were of primary interest. Six lines exhibiting beta-glucuronidase activity in syncytia induced by the beet cyst nematode were studied. Reporter gene activation was also identified in galls induced by root knot and ectoparasitic nematodes. Time-course studies revealed that all six tags were differentially activated during the development of the feeding structure. T-DNA-flanking regions responsible for the observed responses after nematode infection were isolated and characterized for promoter activity. PMID:9437858

  15. Stress-induced endogenous siRNAs targeting regulatory intron sequences in Brachypodium

    PubMed Central

    Wang, Hsiao-Lin V.; Dinwiddie, Brandon L.; Lee, Herman

    2015-01-01

    Exposure to abiotic stresses triggers global changes in the expression of thousands of eukaryotic genes at the transcriptional and post-transcriptional levels. Small RNA (smRNA) pathways and splicing both function as crucial mechanisms regulating stress-responsive gene expression. However, examples of smRNAs regulating gene expression remain largely limited to effects on mRNA stability, translation, and epigenetic regulation. Also, our understanding of the networks controlling plant gene expression in response to environmental changes, and examples of these regulatory pathways intersecting, remains limited. Here, to investigate the role of smRNAs in stress responses we examined smRNA transcriptomes of Brachypodium distachyon plants subjected to various abiotic stresses. We found that exposure to different abiotic stresses specifically induced a group of novel, endogenous small interfering RNAs (stress-induced, UTR-derived siRNAs, or sutr-siRNAs) that originate from the 3′ UTRs of a subset of coding genes. Our bioinformatics analyses predicted that sutr-siRNAs have potential regulatory functions and that over 90% of sutr-siRNAs target intronic regions of many mRNAs in trans. Importantly, a subgroup of these sutr-siRNAs target the important intron regulatory regions, such as branch point sequences, that could affect splicing. Our study indicates that in Brachypodium, sutr-siRNAs may affect splicing by masking or changing accessibility of specific cis-elements through base-pairing interactions to mediate gene expression in response to stresses. We hypothesize that this mode of regulation of gene expression may also serve as a general mechanism for regulation of gene expression in plants and potentially in other eukaryotes. PMID:25480817

  16. Exceptionally high heterologous protein levels in transgenic dicotyledonous seeds using Phaseolus vulgaris regulatory sequences.

    PubMed

    De Jaeger, Geert; Angenon, Geert; Depicker, Ann

    2003-01-01

    Seeds are concentrated sources of protein and thus may be ideal 'bioreactors' for the production of heterologous proteins. For this application, strong seed-specific expression signals are required. A set of expression cassettes were designed using 5' and 3' regulatory sequences of the seed storage protein gene arcelin 5-I (arc5-I) from Phaseolus vulgaris, and evaluated for the production of heterologous proteins in dicotyledonous plant species. A murine single-chain variable fragment (scFv) was chosen as model protein because of the current industrial interest to produce antibodies and derived fragments in crops. Because the highest scFv accumulation in seed had previously been achieved in the endoplasmic reticulum (ER), the scFv-encoding sequence was provided with signal sequences for accumulation in the ER. Transgenic Arabidopsis seed stocks, expressing the scFv under control of the 35S promoter, contained scFv accumulation levels in the range of 1% of total soluble protein (TSP). However, the seed storage promoter constructs boosted the scFv to exceptionally high levels. Maximum scFv levels were obtained in homozygous seed stocks, being 12.5% of TSP under control of the arc5-I regulatory sequences and even up to 36.5% of TSP upon replacing the arc5-I promoter by the beta-phaseolin promoter of Phaseolus vulgaris. Even at such very high levels, the scFv proteins retain their full antigen-binding activity. Moreover, the presence of very high scFv levels has only minory effects on seed germination and no effect on seed production. These results demonstrate that the expression levels of arcelin 5-I and beta-phaseolin seed storage protein genes can be transferred to heterologous proteins, giving exceptionally high levels of heterologous proteins, which can be of great value for the molecular farming industry by raising production yield and lowering bio-mass production and purification costs. Finally, the feasibility of heterologous protein production using the

  17. Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing

    PubMed Central

    2011-01-01

    Background Milkweeds (Asclepias L.) have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and compliment these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L.) could be useful in characterizing the genome of a plant without prior genomic information and for development of genomic resources as a step toward further developing A. syriaca as a model in ecology and evolution. Results A 0.5× genome of A. syriaca was produced using Illumina sequencing. A virtually complete chloroplast genome of 158,598 bp was assembled, revealing few repeats and loss of three genes: accD, clpP, and ycf1. A nearly complete rDNA cistron (18S-5.8S-26S; 7,541 bp) and 5S rDNA (120 bp) sequence were obtained. Assessment of polymorphism revealed that the rDNA cistron and 5S rDNA had 0.3% and 26.7% polymorphic sites, respectively. A partial mitochondrial genome sequence (130,764 bp), with identical gene content to tobacco, was also assembled. An initial characterization of repeat content indicated that Ty1/copia-like retroelements are the most common repeat type in the milkweed genome. At least one A. syriaca microread hit 88% of Catharanthus roseus (Apocynaceae) unigenes (median coverage of 0.29×) and 66% of single copy orthologs (COSII) in asterids (median coverage of 0.14×). From this partial characterization of the A. syriaca genome, markers for population genetics (microsatellites) and phylogenetics (low-copy nuclear genes) studies were developed. Conclusions The results highlight the promise of next generation sequencing for development of genomic resources for any organism. Low coverage genome sequencing allows characterization of the high copy fraction of the genome and exploration of the low copy fraction of the genome, which facilitate the development of molecular tools for further study of a target species and its relatives

  18. Sequence analysis of the myosin regulatory light chain gene of the vestimentiferan Riftia pachyptila.

    PubMed

    Ravaux, J; Hassanin, A; Deutsch, J; Gaill, F; Markmann-Mulisch, U

    2001-01-24

    We have isolated and characterized a cDNA (DNA complementary to RNA) clone (Rf69) from the vestimentiferan Riftia pachyptila. The cDNA insert consists of 1169 base pairs. The aminoacid sequence deduced from the longest reading frame is 193 residues in length, and clearly characterized it as a myosin regulatory light chain (RLC). The RLC primary structure is described in relation to its function in muscle contraction. The comparison with other RLCs suggested that Riftia myosin is probably regulated through its RLC either by phosphorylation like the vertebrate smooth muscle myosins, and/or by Ca2+-binding like the mollusk myosins. Riftia RLC possesses a N-terminal extension lacking in all other species besides the earthworm Lumbricus terrestris. Aminoacid sequence comparisons with a number of RLCs from vertebrates and invertebrates revealed a relatively high identity score (64%) between Riftia RLC and the homologous gene from Lumbricus. The relationships between the members of the myosin RLCs were examined by two phylogenetic methods, i.e. distance matrix and maximum parsimony. The resulting trees depict the grouping of the RLCs according to their role in myosin activity regulation. In all trees, Riftia RLC groups with RLCs that depend on Ca2+-binding for myosin activity regulation. PMID:11223252

  19. Stable intronic sequence RNAs have possible regulatory roles in Drosophila melanogaster

    PubMed Central

    Osman, Ismail; Tay, Mandy Li-Ian; Zheng, Ruther Teo

    2015-01-01

    Stable intronic sequence RNAs (sisRNAs) have been found in Xenopus tropicalis, human cell lines, and Epstein-Barr virus; however, the biological significance of sisRNAs remains poorly understood. We identify sisRNAs in Drosophila melanogaster by deep sequencing, reverse transcription polymerase chain reaction, and Northern blotting. We characterize a sisRNA (sisR-1) from the regena (rga) locus and show that it can be processed from the precursor messenger RNA (pre-mRNA). We also document a cis-natural antisense transcript (ASTR) from the rga locus, which is highly expressed in early embryos. During embryogenesis, ASTR promotes robust rga pre-mRNA expression. Interestingly, sisR-1 represses ASTR, with consequential effects on rga pre-mRNA expression. Our results suggest a model in which sisR-1 modulates its host gene expression by repressing ASTR during embryogenesis. We propose that sisR-1 belongs to a class of sisRNAs with probable regulatory activities in Drosophila. PMID:26504165

  20. Recent common ancestry of human Y chromosomes: Evidence from DNA sequence data

    PubMed Central

    Thomson, Russell; Pritchard, Jonathan K.; Shen, Peidong; Oefner, Peter J.; Feldman, Marcus W.

    2000-01-01

    We consider a data set of DNA sequence variation at three Y chromosome genes (SMCY, DBY, and DFFRY) in a worldwide sample of human Y chromosomes. Between 53 and 70 chromosomes were fully screened for sequence variation at each locus by using the method of denaturing high-performance liquid chromatography. The sum of the lengths of the three genes is 64,120 bp. We have used these data to study the ancestral genealogy of human Y chromosomes. In particular, we focused on estimating the expected time to the most recent common ancestor and the expected ages of certain mutations with interesting geographic distributions. Although the geographic structure of the inferred haplotype tree is reminiscent of that obtained for other loci (the root is in Africa, and most of the oldest non-African lineages are Asian), the expected time to the most recent common ancestor is remarkably short, on the order of 50,000 years. Thus, although previous studies have noted that Y chromosome variation shows extreme geographic structure, we estimate that the spread of Y chromosomes out of Africa is much more recent than previously was thought. We also show that our data indicate substantial population growth in the effective number of human Y chromosomes. PMID:10861004

  1. Isolation of expressed sequences from the region commonly deleted in Velo-cardio-facial syndrome

    SciTech Connect

    Sirotkin, H.; Morrow, B.; DasGupta, R.

    1994-09-01

    Velo-cardio-facial syndrome (VCFS) is a relatively common autosomal dominant genetic disorder characterized by cleft palate, cardiac abnormalities, learning disabilities and a characteristic facial dysmorphology. Most VCFS patients have interstitial deletions of 22q11 of 1-2 mb. In an effort to isolate the gene(s) responsible for VCFS we have utilized a hybrid selection protocol to recover expressed sequences from three non-overlapping YACs comprising almost 1 mb of the commonly deleted region. Total yeast genomic DNA or isolated YAC DNA was immobilized on Hybond-N filters, blocked with yeast and human ribosomal and human repetitive sequences and hybridized with a mixture of random primed short fragment cDNA libraries. Six human short fragment libraries derived from total fetus, fetal brain, adult brain, testes, thymus and spleen have been used for the selections. Short fragment cDNAs retained on the filter were passed through a second round of selection and cloned into lambda gt10. cDNAs shown to originate from the YACs and from chromosome 22 are being used to isolate full length cDNAs. Three genes known to be present on these YACs, catechol-O-methyltransferase, tuple 1 and clathrin heavy chain have been recovered. Additionally, a gene related to the murine p120 gene and a number of novel short cDNAs have been isolated. The role of these genes in VCFS is being investigated.

  2. Predisposition gene identification in common cancers by exome sequencing: insights from familial breast cancer

    PubMed Central

    Snape, Katie; Ruark, Elise; Tarpey, Patrick; Renwick, Anthony; Turnbull, Clare; Seal, Sheila; Murray, Anne; Hanks, Sandra; Douglas, Jenny; Stratton, Michael R.; Rahman, Nazneen

    2013-01-01

    The genetic component of breast cancer predisposition remains largely unexplained. Candidate-gene case-control resequencing has identified predisposition genes characterised by rare, protein truncating mutations that confer moderate risks of disease. In theory, exome sequencing should yield additional genes of this class. Here, we explore the feasibility and design considerations of this approach. We performed exome sequencing in 50 individuals with familial breast cancer, applying frequency and protein function filters to identify variants most likely to be pathogenic. We identified 867,378 variants that passed the call quality filters of which 1,296 variants passed the frequency and protein truncation filters. The median number of validated, rare, protein truncating variants (PTVs) was 10 in individuals with, and without, mutations in known genes. The functional candidacy of mutated genes was similar in both groups. Without prior knowledge, the known genes would not have been recognisable as breast cancer predisposition genes. Everyone carries multiple rare mutations that are plausibly related to disease. Exome sequencing in common conditions will therefore require intelligent sample and variant prioritisation strategies in large case-control studies to deliver robust genetic evidence of disease association. PMID:22527104

  3. Understanding the Effects of Users' Behaviors on Effectiveness of Different Exogenous Regulatory Common Pool Resource Management Institutions

    NASA Astrophysics Data System (ADS)

    Madani, K.; Dinar, A.

    2013-12-01

    Tragedy of the commons is generally recognized as one of the possible destinies for common pool resources (CPRs). To avoid the tragedy of the commons and prolonging the life of CPRs, users may show different behavioral characteristics and use different rationales for CPR planning and management. Furthermore, regulators may adopt different strategies for sustainable management of CPRs. The effectiveness of different regulatory exogenous management institutions cannot be evaluated through conventional CPR models since they assume that either users base their behavior on individual rationality and adopt a selfish behavior (Nash behavior), or that the users seek the system's optimal solution without giving priority to their own interests. Therefore, conventional models fail to reliably predict the outcome of CPR problems in which parties may have a range of behavioral characteristics, putting them somewhere in between the two types of behaviors traditionally considered. This work examines the effectiveness of different regulatory exogenous CPR management institutions through a user-based model (as opposed to a system-based model). The new modeling framework allows for consideration of sensitivity of the results to different behavioral characteristics of interacting CPR users. The suggested modeling approach is applied to a benchmark groundwater management problem. Results indicate that some well-known exogenous management institutions (e.g. taxing) are ineffective in sustainable management of CPRs in most cases. Bankruptcy-based management can be helpful, but determination of the fair level of cutbacks remains challenging under this type of institution. Furthermore, some bankruptcy rules such as the Constrained Equal Award (CEA) method are more beneficial to wealthier users, failing to establish social justice. Quota-based and CPR status-based management perform as the most promising and robust regulatory exogenous institutions in prolonging the CPR's life and

  4. Identification of expressed resistance gene-like sequences by data mining in 454-derived transcriptomic sequences of common bean (Phaseolus vulgaris L.)

    PubMed Central

    2012-01-01

    Background Common bean (Phaseolus vulgaris L.) is one of the most important legumes in the world. Several diseases severely reduce bean production and quality; therefore, it is very important to better understand disease resistance in common bean in order to prevent these losses. More than 70 resistance (R) genes which confer resistance against various pathogens have been cloned from diverse plant species. Most R genes share highly conserved domains which facilitates the identification of new candidate R genes from the same species or other species. The goals of this study were to isolate expressed R gene-like sequences (RGLs) from 454-derived transcriptomic sequences and expressed sequence tags (ESTs) of common bean, and to develop RGL-tagged molecular markers. Results A data-mining approach was used to identify tentative P. vulgaris R gene-like sequences from approximately 1.69 million 454-derived sequences and 116,716 ESTs deposited in GenBank. A total of 365 non-redundant sequences were identified and named as common bean (P. vulgaris = Pv) resistance gene-like sequences (PvRGLs). Among the identified PvRGLs, about 60% (218 PvRGLs) were from 454-derived sequences. Reverse transcriptase-polymerase chain reaction (RT-PCR) analysis confirmed that PvRGLs were actually expressed in the leaves of common bean. Upon comparison to P. vulgaris genomic sequences, 105 (28.77%) of the 365 tentative PvRGLs could be integrated into the existing common bean physical map. Based on the syntenic blocks between common bean and soybean, 237 (64.93%) PvRGLs were anchored on the P. vulgaris genetic map and will need to be mapped to determine order. In addition, 11 sequence-tagged-site (STS) and 19 cleaved amplified polymorphic sequence (CAPS) molecular markers were developed for 25 unique PvRGLs. Conclusions In total, 365 PvRGLs were successfully identified from 454-derived transcriptomic sequences and ESTs available in GenBank and about 65% of PvRGLs were integrated into the common

  5. The 5' regulatory sequence of the PMP22 in the patients with Charcot-Marie-Tooth disease.

    PubMed

    Sinkiewicz-Darol, Elena; Kabzińska, Dagmara; Moszyńska, Izabela; Kochański, Andrzej

    2010-01-01

    Little is known about the molecular background of clinical variability of Charcot-Marie-Tooth type 1A (CMT1A) disease and hereditary neuropathy with liability to pressure palsies (HNPP). The CMT1A and HNPP disorders result from duplication and deletion of the PMP22 gene respectively. In a series of studies performed on affected animal transgenic models of CMT1A disease, expression of the PMP22 gene (gene dosage) was shown to correlete with severity of CMT course (gene dosage effect). In this study we hypothesized that single nucleotide polymorphisms (SNPs) located within the 5' regulatory sequence of PMP22 gene may be responsible for the CMT1A/HNPP clinical variability. We have sequenced the PMP22 5' upstream regulatory sequence in a group of 45 CMT1A/HNPP patients harboring the PMP22 duplication (37) /deletion (8). We have identified five SNPs in the regulatory sequence of the PMP22 gene. Three of them i.e. -819C>T, -4785G>T, -4800C>T were detected both in the patients and in the control group. Thus, their pathogenic role in the regulation of the expression of the PMP22 gene seems not to be significant. Two SNPs i.e. -4210T>C and -4759T>A were found only in the CMT patients. Their role in the regulation of the PMP22 gene expression can not be excluded. Additionally we have detected the Thr118Met variant in exon 4 of the PMP22 gene, which was previously reported by other authors, in one patient. We conclude that the 5' regulatory sequence of the PMP22 gene is conserved at the nucleotiode level, however rarely occurring SNPs variant in the PMP22 regulatory sequence may be associated with the gene dosage effect. PMID:20842290

  6. IgH sequences in common variable immune deficiency reveal altered B cell development and selection**

    PubMed Central

    Roskin, Krishna M.; Simchoni, Noa; Liu, Yi; Lee, Ji-Yeun; Seo, Katie; Hoh, Ramona A.; Pham, Tho; Park, Joon H.; Furman, David; Dekker, Cornelia L.; Davis, Mark M.; James, Judith A.; Nadeau, Kari C.; Cunningham-Rundles, Charlotte; Boyd, Scott D.

    2015-01-01

    Common variable immune deficiency (CVID) is the most common symptomatic primary immune deficiency, affecting ∼1 in 25,000 persons. These patients suffer from impaired antibody responses, autoimmunity, and susceptibility to lymphoid cancers. To explore the cellular basis for these clinical phenotypes, we conducted high-throughput DNA sequencing of immunoglobulin heavy chain gene rearrangements from 93 CVID patients and 105 control subjects and sorted naïve and memory B cells from 13 of the CVID patients and 10 of the control subjects. CVID patients showed abnormal VDJ rearrangement and abnormal formation of complementarity determining region 3 (CDR3). We observed decreased selection against antibodies with long CDR3 regions in memory repertoires and decreased V gene replacement, offering possible mechanisms for increased patient autoreactivity. Our data indicate that patient immunodeficiency might derive both from decreased diversity of the naïve B cell pool and decreased somatic hypermutation in memory repertoires. CVID patients also exhibited abnormal clonal expansion of unmutated B cells relative to controls. Although impaired B cell germinal center activation is commonly viewed as causative in CVID, these data indicate that CVID B cells diverge from controls as early as the pro-B cell stage and suggest possible explanations for the increased incidence of autoimmunity, immunodeficiency, and lymphoma CVID patients. PMID:26311730

  7. IgH sequences in common variable immune deficiency reveal altered B cell development and selection.

    PubMed

    Roskin, Krishna M; Simchoni, Noa; Liu, Yi; Lee, Ji-Yeun; Seo, Katie; Hoh, Ramona A; Pham, Tho; Park, Joon H; Furman, David; Dekker, Cornelia L; Davis, Mark M; James, Judith A; Nadeau, Kari C; Cunningham-Rundles, Charlotte; Boyd, Scott D

    2015-08-26

    Common variable immune deficiency (CVID) is the most common symptomatic primary immune deficiency, affecting ~1 in 25,000 persons. These patients suffer from impaired antibody responses, autoimmunity, and susceptibility to lymphoid cancers. To explore the cellular basis for these clinical phenotypes, we conducted high-throughput DNA sequencing of immunoglobulin heavy chain gene rearrangements from 93 CVID patients and 105 control subjects and sorted naïve and memory B cells from 13 of the CVID patients and 10 of the control subjects. The CVID patients showed abnormal VDJ rearrangement and abnormal formation of complementarity-determining region 3 (CDR3). We observed a decreased selection against antibodies with long CDR3s in memory repertoires and decreased variable gene replacement, offering possible mechanisms for increased patient autoreactivity. Our data indicate that patient immunodeficiency might derive from both decreased diversity of the naïve B cell pool and decreased somatic hypermutation in memory repertoires. The CVID patients also exhibited an abnormal clonal expansion of unmutated B cells relative to the controls. Although impaired B cell germinal center activation is commonly viewed as causative in CVID, these data indicate that CVID B cells diverge from controls as early as the pro-B stage, cell and suggest possible explanations for the increased incidence of autoimmunity, immunodeficiency, and lymphoma CVID patients. PMID:26311730

  8. A Monte Carlo-based framework enhances the discovery and interpretation of regulatory sequence motifs

    PubMed Central

    2012-01-01

    Background Discovery of functionally significant short, statistically overrepresented subsequence patterns (motifs) in a set of sequences is a challenging problem in bioinformatics. Oftentimes, not all sequences in the set contain a motif. These non-motif-containing sequences complicate the algorithmic discovery of motifs. Filtering the non-motif-containing sequences from the larger set of sequences while simultaneously determining the identity of the motif is, therefore, desirable and a non-trivial problem in motif discovery research. Results We describe MotifCatcher, a framework that extends the sensitivity of existing motif-finding tools by employing random sampling to effectively remove non-motif-containing sequences from the motif search. We developed two implementations of our algorithm; each built around a commonly used motif-finding tool, and applied our algorithm to three diverse chromatin immunoprecipitation (ChIP) data sets. In each case, the motif finder with the MotifCatcher extension demonstrated improved sensitivity over the motif finder alone. Our approach organizes candidate functionally significant discovered motifs into a tree, which allowed us to make additional insights. In all cases, we were able to support our findings with experimental work from the literature. Conclusions Our framework demonstrates that additional processing at the sequence entry level can significantly improve the performance of existing motif-finding tools. For each biological data set tested, we were able to propose novel biological hypotheses supported by experimental work from the literature. Specifically, in Escherichia coli, we suggested binding site motifs for 6 non-traditional LexA protein binding sites; in Saccharomyces cerevisiae, we hypothesize 2 disparate mechanisms for novel binding sites of the Cse4p protein; and in Halobacterium sp. NRC-1, we discoverd subtle differences in a general transcription factor (GTF) binding site motif across several data sets. We

  9. Rare FOXC1 variants in congenital glaucoma: identification of translation regulatory sequences.

    PubMed

    Medina-Trillo, Cristina; Aroca-Aguilar, José-Daniel; Méndez-Hernández, Carmen-Dora; Morales, Laura; García-Antón, Maite; García-Feijoo, Julián; Escribano, Julio

    2016-05-01

    Primary congenital glaucoma (PCG) is the cause of a significant proportion of inherited visual loss in children, but the underlying mechanism is poorly understood. In this study, we assessed the relationship between PCG and FOXC1 variants by Sanger sequencing the proximal promoter and transcribed sequence of FOXC1 from a cohort of 133 PCG families with no known CYP1B1 or MYOC mutations. The pathogenicity of the identified variants was evaluated by functional analyses. Ten patients (7.5%) with no family history of glaucoma carried five different rare heterozygous FOXC1 variants with both increased (rs77888940:C>G, c.-429C>G, rs730882054:c.1134_144del(CGGCGGCGCGG), p.(G380Rfs*144) and rs35717904:A>T, c.*734A>T) and decreased (rs185790394: C>T, c.-244C>T and rs79691946:C>T, p.(P297S)) transactivation, ranging from 50 to 180% of the wild-type activity. The five variants did not show monogenic segregation, and four of them were absent in a control group (n=233). To the best of our knowledge, one of these variants (p.(G380Rfs*144)) has not previously been described. One of the FOXC1 variant carriers (p.(P297S)) also coinherited a functionally altered rare PITX2 heterozygous variant (rs6533526:C>T, c.*454C>T). Bioinformatics and functional analyses provided novel information on three of these variants. c.-429C>G potentially disrupts a consensus sequence for a terminal oligopyrimidine tract, whereas c.-244C>T may alter the RNA secondary structure in the 5'-untranslated region (UTR) that affects mRNA translation. In addition, p.(G380Rfs*144) led to increased protein stability. In summary, these data reveal the presence of translation regulatory sequences in the UTRs of FOXC1 and provide evidence for a possible role of rare FOXC1 variants as modifying factors of goniodysgenesis in PCG. PMID:26220699

  10. CLAMMS: a scalable algorithm for calling common and rare copy number variants from exome sequencing data

    PubMed Central

    Packer, Jonathan S.; Maxwell, Evan K.; O’Dushlaine, Colm; Lopez, Alexander E.; Dewey, Frederick E.; Chernomorsky, Rostislav; Baras, Aris; Overton, John D.; Habegger, Lukas; Reid, Jeffrey G.

    2016-01-01

    Motivation: Several algorithms exist for detecting copy number variants (CNVs) from human exome sequencing read depth, but previous tools have not been well suited for large population studies on the order of tens or hundreds of thousands of exomes. Their limitations include being difficult to integrate into automated variant-calling pipelines and being ill-suited for detecting common variants. To address these issues, we developed a new algorithm—Copy number estimation using Lattice-Aligned Mixture Models (CLAMMS)—which is highly scalable and suitable for detecting CNVs across the whole allele frequency spectrum. Results: In this note, we summarize the methods and intended use-case of CLAMMS, compare it to previous algorithms and briefly describe results of validation experiments. We evaluate the adherence of CNV calls from CLAMMS and four other algorithms to Mendelian inheritance patterns on a pedigree; we compare calls from CLAMMS and other algorithms to calls from SNP genotyping arrays for a set of 3164 samples; and we use TaqMan quantitative polymerase chain reaction to validate CNVs predicted by CLAMMS at 39 loci (95% of rare variants validate; across 19 common variant loci, the mean precision and recall are 99% and 94%, respectively). In the Supplementary Materials (available at the CLAMMS Github repository), we present our methods and validation results in greater detail. Availability and implementation: https://github.com/rgcgithub/clamms (implemented in C). Contact: jeffrey.reid@regeneron.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26382196

  11. The effectiveness of common thermo-regulatory behaviours in a cool temperate grasshopper.

    PubMed

    Harris, Rebecca M B; McQuillan, Peter; Hughes, Lesley

    2015-08-01

    Behavioural thermoregulation has the potential to alleviate the short-term impacts of climate change on some small ectotherms, without the need for changes to species distributions or genetic adaptation. We illustrate this by measuring the effect of behaviour in a cool temperate species of grasshopper (Phaulacridium vittatum) over a range of spatial and temporal scales in laboratory and natural field experiments. Microhabitat selection at the site scale was tested in free-ranging grasshoppers and related to changing thermal quality over a daily period. Artificial warming experiments were then used to measure the temperature at which common thermoregulatory behaviours are initiated and the subsequent reductions in body temperature. Behavioural means such as timing of activity, choice of substrates with optimum surface temperatures, shade seeking and postural adjustments (e.g. stilting, vertical orientation) were found to be highly effective at maintaining preferred body temperature. The maximum voluntarily tolerated temperature (MVT) was determined to be 44°C±0.4°C, indicating the upper bounds of thermal flexibility in this species. Behavioural thermoregulation effectively enables small ectotherms to regulate exposure to changing environmental temperatures and utilize the spatially and temporally heterogeneous environments they occupy. Species such as the wingless grasshopper, although adapted to cool temperate conditions, are likely to be well equipped to respond successfully to coarse scale climate change. PMID:26267501

  12. Reprint of: The effectiveness of common thermo-regulatory behaviours in a cool temperate grasshopper.

    PubMed

    Harris, Rebecca M B; McQuillan, Peter; Hughes, Lesley

    2015-12-01

    Behavioural thermoregulation has the potential to alleviate the short-term impacts of climate change on some small ectotherms, without the need for changes to species distributions or genetic adaptation. We illustrate this by measuring the effect of behaviour in a cool temperate species of grasshopper (Phaulacridium vittatum) over a range of spatial and temporal scales in laboratory and natural field experiments. Microhabitat selection at the site scale was tested in free-ranging grasshoppers and related to changing thermal quality over a daily period. Artificial warming experiments were then used to measure the temperature at which common thermoregulatory behaviours are initiated and the subsequent reductions in body temperature. Behavioural means such as timing of activity, choice of substrates with optimum surface temperatures, shade seeking and postural adjustments (e.g. stilting, vertical orientation) were found to be highly effective at maintaining preferred body temperature. The maximum voluntarily tolerated temperature (MVT) was determined to be 44 °C ± 0.4 °C, indicating the upper bounds of thermal flexibility in this species. Behavioural thermoregulation effectively enables small ectotherms to regulate exposure to changing environmental temperatures and utilize the spatially and temporally heterogeneous environments they occupy. Species such as the wingless grasshopper, although adapted to cool temperate conditions, are likely to be well equipped to respond successfully to coarse scale climate change. PMID:26615722

  13. Highly recurring sequence elements identified in eukaryotic DNAs by computer analysis are often homologous to regulatory sequences or protein binding sites.

    PubMed Central

    Bodnar, J W; Ward, D C

    1987-01-01

    We have used computer assisted dot matrix and oligonucleotide frequency analyses to identify highly recurring sequence elements of 7-11 base pairs in eukaryotic genes and viral DNAs. Such elements are found much more frequently than expected, often with an average spacing of a few hundred base pairs. Furthermore, the most abundant repetitive elements observed in the ovalbumin locus, the beta-globin gene cluster, the metallothionein gene and the viral genomes of SV40, polyoma, Herpes simplex-1 and Mouse Mammary Tumor Virus were sequences shown previously to be protein binding sites or sequences important for regulating gene expression. These sequences were present in both exons and introns as well as promoter regions. These observations suggest that such sequences are often highly overrepresented within the specific gene segments with which they are associated. Computer analysis of other genetic units, including viral genomes and oncogenes, has identified a number of highly recurring sequence elements that could serve similar regulatory or protein-binding functions. A model for the role of such reiterated sequence elements in DNA organization and function is presented. PMID:3822840

  14. GSEL version 2, an online genome-wide query system of operon organization and regulatory sequence elements of Geobacter sulfurreducens.

    PubMed

    Qu, Yanhua; Brown, Peter; Barbe, Jose F; Puljic, Marko; Merino, Enrique; Adkins, Ronald M; Lovley, Derek R; Krushkal, Julia

    2009-10-01

    Geobacter sulfurreducens is a model organism within the delta-Proteobacterial family Geobacteraceae, members of which can participate in environmental bioremediation of metal and organic waste contaminants and in production of bioenergy. In this report, we describe a new, significantly expanded and updated, version 2 of the GSEL (Geobacter Sequence Elements) database ( http://geobacter.org/research/gsel2/ and http://geobacter.org/refs/gsel2/ ) and its accompanying online query system, which compiles information on operon organization and regulatory sequence elements in the genome of G. sulfurreducens. It incorporates a new online graphical browser, provides novel search capabilities, and includes updated operon predictions along with new information on predicted and experimentally validated genome regulatory sites. The GSEL database and online search system provides a unique and comprehensive tool cataloging information about gene regulation in G. sulfurreducens, aiding in investigation of mechanisms that regulate its ability to generate electric power, bioremediate environmental waste, and adapt to environmental changes. PMID:19792871

  15. The nucleotide sequences of several tRNA genes from rat mitochondria: common features and relatedness to homologous species.

    PubMed Central

    Cantatore, P; De Benedetto, C; Gadaleta, G; Gallerani, R; Kroon, A M; Holtrop, M; Lanave, C; Pepe, G; Quagliariello, C; Saccone, C; Sbisa, E

    1982-01-01

    We have determined the nucleotide sequences of thirteen rat mt tRNA genes. The features of the primary and secondary structures of these tRNAs show that those for Gln, Ser, and f-Met resemble, while those for Lys, Cys, and Trp depart strikingly from the universal type. The remainder are slightly abnormal. Among many mammalian mt DNA sequences, those of mt tRNA genes are highly conserved, thus suggesting for those genes an additional, perhaps regulatory, function. A simple evolutionary relationship between the tRNAs of animal mitochondria and those of eukaryotic cytoplasm, of lower eukaryotic mitochondria or of prokaryotes, is not evident owing to the extreme divergence of the tRNA sequences in the two groups. However, a slightly higher homology does exist between a few animal mt tRNAs and those from prokaryotes or from lower eukaryotic mitochondria. PMID:7099963

  16. Prediction of Regulatory Interactions from Genome Sequences Using a Biophysical Model for the Arabidopsis LEAFY Transcription Factor[C][W

    PubMed Central

    Moyroud, Edwige; Minguet, Eugenio Gómez; Ott, Felix; Yant, Levi; Posé, David; Monniaux, Marie; Blanchet, Sandrine; Bastien, Olivier; Thévenon, Emmanuel; Weigel, Detlef; Schmid, Markus; Parcy, François

    2011-01-01

    Despite great advances in sequencing technologies, generating functional information for nonmodel organisms remains a challenge. One solution lies in an improved ability to predict genetic circuits based on primary DNA sequence in combination with detailed knowledge of regulatory proteins that have been characterized in model species. Here, we focus on the LEAFY (LFY) transcription factor, a conserved master regulator of floral development. Starting with biochemical and structural information, we built a biophysical model describing LFY DNA binding specificity in vitro that accurately predicts in vivo LFY binding sites in the Arabidopsis thaliana genome. Applying the model to other plant species, we could follow the evolution of the regulatory relationship between LFY and the AGAMOUS (AG) subfamily of MADS box genes and show that this link predates the divergence between monocots and eudicots. Remarkably, our model succeeds in detecting the connection between LFY and AG homologs despite extensive variation in binding sites. This demonstrates that the cis-element fluidity recently observed in animals also exists in plants, but the challenges it poses can be overcome with predictions grounded in a biophysical model. Therefore, our work opens new avenues to deduce the structure of regulatory networks from mere inspection of genomic sequences. PMID:21515819

  17. Monte Carlo Simulations of the Post-Common-Envelope White-Dwarf Main-Sequence Binary Population

    SciTech Connect

    Camacho, Judit; Torres, Santiago; Garcia-Berro, Enrique; Schreiber, Matthias R.

    2010-12-22

    We present a detailed Monte Carlo simulator of the population of binary systems within the solar neighborhood. We have used the most up-to-date stellar evolutionary models, a complete treatment of the Roche lobe overflow episode, as well as a full implementation of the orbital evolution of the binary system. Preliminary results are presented for the population of white-dwarf main-sequence binaries, resulting from a common envelope episode. We also study the role played by the binding energy parameter, {lambda}, and by the common envelope efficiency, {alpha}{sub CE}. Finally, results are compared with the population of identified white-dwarf main-sequence binaries.

  18. Different regulatory sequences control creatine kinase-M gene expression in directly injected skeletal and cardiac muscle.

    PubMed Central

    Vincent, C K; Gualberto, A; Patel, C V; Walsh, K

    1993-01-01

    Regulatory sequences of the M isozyme of the creatine kinase (MCK) gene have been extensively mapped in skeletal muscle, but little is known about the sequences that control cardiac-specific expression. The promoter and enhancer sequences required for MCK gene expression were assayed by the direct injection of plasmid DNA constructs into adult rat cardiac and skeletal muscle. A 700-nucleotide fragment containing the enhancer and promoter of the rabbit MCK gene activated the expression of a downstream reporter gene in both muscle tissues. Deletion of the enhancer significantly decreased expression in skeletal muscle but had no detectable effect on expression in cardiac muscle. Further deletions revealed a CArG sequence motif at position -179 within the promoter that was essential for cardiac-specific expression. The CArG element of the MCK promoter bound to the recombinant serum response factor and YY1, transcription factors which control expression from structurally similar elements in the skeletal actin and c-fos promoters. MCK-CArG-binding activities that were similar or identical to serum response factor and YY1 were also detected in extracts from adult cardiac muscle. These data suggest that the MCK gene is controlled by different regulatory programs in adult cardiac and skeletal muscle. Images PMID:8423791

  19. The Effects of Sequence Variation on Genome-wide NRF2 Binding—New Target Genes and Regulatory SNPs

    PubMed Central

    Kuosmanen, Suvi M.; Viitala, Sari; Laitinen, Tuomo; Peräkylä, Mikael; Pölönen, Petri; Kansanen, Emilia; Leinonen, Hanna; Raju, Suresh; Wienecke-Baldacchino, Anke; Närvänen, Ale; Poso, Antti; Heinäniemi, Merja; Heikkinen, Sami; Levonen, Anna-Liisa

    2016-01-01

    Transcription factor binding specificity is crucial for proper target gene regulation. Motif discovery algorithms identify the main features of the binding patterns, but the accuracy on the lower affinity sites is often poor. Nuclear factor E2-related factor 2 (NRF2) is a ubiquitous redox-activated transcription factor having a key protective role against endogenous and exogenous oxidant and electrophile stress. Herein, we decipher the effects of sequence variation on the DNA binding sequence of NRF2, in order to identify both genome-wide binding sites for NRF2 and disease-associated regulatory SNPs (rSNPs) with drastic effects on NRF2 binding. Interactions between NRF2 and DNA were studied using molecular modelling, and NRF2 chromatin immunoprecipitation-sequence datasets together with protein binding microarray measurements were utilized to study binding sequence variation in detail. The binding model thus generated was used to identify genome-wide binding sites for NRF2, and genomic binding sites with rSNPs that have strong effects on NRF2 binding and reside on active regulatory elements in human cells. As a proof of concept, miR-126–3p and -5p were identified as NRF2 target microRNAs, and a rSNP (rs113067944) residing on NRF2 target gene (Ferritin, light polypeptide, FTL) promoter was experimentally verified to decrease NRF2 binding and result in decreased transcriptional activity. PMID:26826707

  20. Mining expressed sequence tags of rapeseed (Brassica napus L.) to predict the drought responsive regulatory network.

    PubMed

    Shamloo-Dashtpagerdi, Roohollah; Razi, Hooman; Ebrahimie, Esmaeil

    2015-07-01

    It is of great significance to understand the regulatory mechanisms by which plants deal with drought stress. Two EST libraries derived from rapeseed (Brassica napus) leaves in non-stressed and drought stress conditions were analyzed in order to obtain the transcriptomic landscape of drought-exposed B. napus plants, and also to identify and characterize significant drought responsive regulatory genes and microRNAs. The functional ontology analysis revealed a substantial shift in the B. napus transcriptome to govern cellular drought responsiveness via different stress-activated mechanisms. The activity of transcription factor and protein kinase modules generally increased in response to drought stress. The 26 regulatory genes consisting of 17 transcription factor genes, eight protein kinase genes and one protein phosphatase gene were identified showing significant alterations in their expressions in response to drought stress. We also found the six microRNAs which were differentially expressed during drought stress supporting the involvement of a post-transcriptional level of regulation for B. napus drought response. The drought responsive regulatory network shed light on the significance of some regulatory components involved in biosynthesis and signaling of various plant hormones (abscisic acid, auxin and brassinosteroids), ubiquitin proteasome system, and signaling through Reactive Oxygen Species (ROS). Our findings suggested a complex and multi-level regulatory system modulating response to drought stress in B. napus. PMID:26261397

  1. Sequence analysis of mouse vomeronasal receptor gene clusters reveals common promoter motifs and a history of recent expansion

    PubMed Central

    Lane, Robert P.; Cutforth, Tyler; Axel, Richard; Hood, Leroy; Trask, Barbara J.

    2002-01-01

    We have analyzed the organization and sequence of 73 V1R genes encoding putative pheromone receptors to identify regulatory features and characterize the evolutionary history of the V1R family. The 73 V1Rs arose from seven ancestral genes around the time of mouse–rat speciation through large local duplications, and this expansion may contribute to speciation events. Orthologous V1R genes appear to have been lost during primate evolution. Exceptional noncoding homology is observed across four V1R subfamilies at one cluster and thus may be important for locus-specific transcriptional regulation. PMID:11752409

  2. Identification of novel craniofacial regulatory domains located far upstream of SOX9 and disrupted in Pierre Robin sequence

    PubMed Central

    Gordon, Christopher T.; Attanasio, Catia; Bhatia, Shipra; Benko, Sabina; Ansari, Morad; Tan, Tiong Y.; Munnich, Arnold; Pennacchio, Len A.; Abadie, Véronique; Temple, I. Karen; Goldenberg, Alice; van Heyningen, Veronica; Amiel, Jeanne; FitzPatrick, David; Kleinjan, Dirk A.; Visel, Axel; Lyonnet, Stanislas

    2015-01-01

    Mutations in the coding sequence of SOX9 cause campomelic dysplasia (CD), a disorder of skeletal development associated with 46,XY disorders of sex development (DSDs). Translocations, deletions and duplications within a ~2 Mb region upstream of SOX9 can recapitulate the CD-DSD phenotype fully or partially, suggesting the existence of an unusually large cis-regulatory control region. Pierre Robin sequence (PRS) is a craniofacial disorder that is frequently an endophenotype of CD and a locus for isolated PRS at ~1.2-1.5 Mb upstream of SOX9 has been previously reported. The craniofacial regulatory potential within this locus, and within the greater genomic domain surrounding SOX9, remains poorly defined. We report two novel deletions upstream of SOX9 in families with PRS, allowing refinement of the regions harbouring candidate craniofacial regulatory elements. In parallel, ChIP-Seq for p300 binding sites in mouse craniofacial tissue led to the identification of several novel craniofacial enhancers at the SOX9 locus, which were validated in transgenic reporter mice and zebrafish. Notably, some of the functionally validated elements fall within the PRS deletions. These studies suggest that multiple non-coding elements contribute to the craniofacial regulation of SOX9 expression, and that their disruption results in PRS. PMID:24934569

  3. Complete Chloroplast Genome Sequence of Tartary Buckwheat (Fagopyrum tataricum) and Comparative Analysis with Common Buckwheat (F. esculentum)

    PubMed Central

    Cho, Kwang-Soo; Yun, Bong-Kyoung; Yoon, Young-Ho; Hong, Su-Young; Mekapogu, Manjulatha; Kim, Kyung-Hee; Yang, Tae-Jin

    2015-01-01

    We report the chloroplast (cp) genome sequence of tartary buckwheat (Fagopyrum tataricum) obtained by next-generation sequencing technology and compared this with the previously reported common buckwheat (F. esculentum ssp. ancestrale) cp genome. The cp genome of F. tataricum has a total sequence length of 159,272 bp, which is 327 bp shorter than the common buckwheat cp genome. The cp gene content, order, and orientation are similar to those of common buckwheat, but with some structural variation at tandem and palindromic repeat frequencies and junction areas. A total of seven InDels (around 100 bp) were found within the intergenic sequences and the ycf1 gene. Copy number variation of the 21-bp tandem repeat varied in F. tataricum (four repeats) and F. esculentum (one repeat), and the InDel of the ycf1 gene was 63 bp long. Nucleotide and amino acid have highly conserved coding sequence with about 98% homology and four genes—rpoC2, ycf3, accD, and clpP—have high synonymous (Ks) value. PCR based InDel markers were applied to diverse genetic resources of F. tataricum and F. esculentum, and the amplicon size was identical to that expected in silico. Therefore, these InDel markers are informative biomarkers to practically distinguish raw or processed buckwheat products derived from F. tataricum and F. esculentum. PMID:25966355

  4. Complete Chloroplast Genome Sequence of Tartary Buckwheat (Fagopyrum tataricum) and Comparative Analysis with Common Buckwheat (F. esculentum).

    PubMed

    Cho, Kwang-Soo; Yun, Bong-Kyoung; Yoon, Young-Ho; Hong, Su-Young; Mekapogu, Manjulatha; Kim, Kyung-Hee; Yang, Tae-Jin

    2015-01-01

    We report the chloroplast (cp) genome sequence of tartary buckwheat (Fagopyrum tataricum) obtained by next-generation sequencing technology and compared this with the previously reported common buckwheat (F. esculentum ssp. ancestrale) cp genome. The cp genome of F. tataricum has a total sequence length of 159,272 bp, which is 327 bp shorter than the common buckwheat cp genome. The cp gene content, order, and orientation are similar to those of common buckwheat, but with some structural variation at tandem and palindromic repeat frequencies and junction areas. A total of seven InDels (around 100 bp) were found within the intergenic sequences and the ycf1 gene. Copy number variation of the 21-bp tandem repeat varied in F. tataricum (four repeats) and F. esculentum (one repeat), and the InDel of the ycf1 gene was 63 bp long. Nucleotide and amino acid have highly conserved coding sequence with about 98% homology and four genes--rpoC2, ycf3, accD, and clpP--have high synonymous (Ks) value. PCR based InDel markers were applied to diverse genetic resources of F. tataricum and F. esculentum, and the amplicon size was identical to that expected in silico. Therefore, these InDel markers are informative biomarkers to practically distinguish raw or processed buckwheat products derived from F. tataricum and F. esculentum. PMID:25966355

  5. Levels of dendritic cell populations and regulatory T cells vary significantly between two commonly used mouse strains.

    PubMed

    Vogelsang, Petra; Hovden, Arnt-Ove; Jonsson, Roland; Appel, Silke

    2009-12-01

    Dendritic cells (DC) are a heterogeneous group of professional antigen-presenting cells (APC) involved in both initiating immune responses and maintaining tolerance. Roughly, DC can be divided into plasmacytoid DC (pDC) and conventional DC (cDC). By controlling regulatory T cells (Treg), DC can influence the outcome of both immunity and autoimmunity. Since the use of mice as in vivo models became a practical tool for researchers studying pathological events in all kind of human diseases, we decided to compare levels of cDC, pDC and Treg in both spleen and blood between two inbred mouse strains. Here we show that two commonly used mouse strains, BALB/c and C57BL/10J mice, have significantly different levels of distinct CD11c(+)/CD4(-)/CD8a(+), CD11c(+)/CD4(+)/CD8a(-) and CD11c(+)/CD4(-)/CD8a(-) cDC populations, pDC and Treg. Therefore, we emphasize the importance of considering the proper model when comparing data sets from different mouse strains. PMID:19906196

  6. Small RNA deep sequencing revealed that mixed infection of known and unknown viruses were common in field collected vegetable samples

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In an effort to characterize the causal agents for plant diseases in field collected samples using the small RNA deep sequencing technology, numerous known or novel viruses and viroids were identified. In many cases, a mixed infection with multiple pathogen species was common. Such situation compl...

  7. The Effects of Common Knowledge Construction Model Sequence of Lessons on Science Achievement and Relational Conceptual Change

    ERIC Educational Resources Information Center

    Ebenezer, Jazlin; Chacko, Sheela; Kaya, Osman Nafiz; Koya, Satya Kiran; Ebenezer, Devairakkam Luke

    2010-01-01

    The purpose of this study was to investigate the effects of the Common Knowledge Construction Model (CKCM) lesson sequence, an intervention based both in conceptual change theory and in Phenomenography, a subset of conceptual change theory. A mixed approach was used to investigate whether this model had a significant effect on 7th grade students'…

  8. Cloning and Characterization of 5′ Flanking Regulatory Sequences of AhLEC1B Gene from Arachis Hypogaea L.

    PubMed Central

    Tang, Guiying; Xu, Pingli; Liu, Wei; Liu, Zhanji; Shan, Lei

    2015-01-01

    LEAFY COTYLEDON1 (LEC1) is a B subunit of Nuclear Factor Y (NF-YB) transcription factor that mainly accumulates during embryo development. We cloned the 5′ flanking regulatory sequence of AhLEC1B gene, a homolog of Arabidopsis LEC1, and analyzed its regulatory elements using online software. To identify the crucial regulatory region, we generated a series of GUS expression frameworks driven by different length promoters with 5′ terminal and/or 3′ terminal deletion. We further characterized the GUS expression patterns in the transgenic Arabidopsis lines. Our results show that both the 65bp proximal promoter region and the 52bp 5′ UTR of AhLEC1B contain the key motifs required for the essential promoting activity. Moreover, AhLEC1B is preferentially expressed in the embryo and is co-regulated by binding of its upstream genes with both positive and negative corresponding cis-regulatory elements. PMID:26426444

  9. Common 5S rRNA variants are likely to be accepted in many sequence contexts

    NASA Technical Reports Server (NTRS)

    Zhang, Zhengdong; D'Souza, Lisa M.; Lee, Youn-Hyung; Fox, George E.

    2003-01-01

    Over evolutionary time RNA sequences which are successfully fixed in a population are selected from among those that satisfy the structural and chemical requirements imposed by the function of the RNA. These sequences together comprise the structure space of the RNA. In principle, a comprehensive understanding of RNA structure and function would make it possible to enumerate which specific RNA sequences belong to a particular structure space and which do not. We are using bacterial 5S rRNA as a model system to attempt to identify principles that can be used to predict which sequences do or do not belong to the 5S rRNA structure space. One promising idea is the very intuitive notion that frequently seen sequence changes in an aligned data set of naturally occurring 5S rRNAs would be widely accepted in many other 5S rRNA sequence contexts. To test this hypothesis, we first developed well-defined operational definitions for a Vibrio region of the 5S rRNA structure space and what is meant by a highly variable position. Fourteen sequence variants (10 point changes and 4 base-pair changes) were identified in this way, which, by the hypothesis, would be expected to incorporate successfully in any of the known sequences in the Vibrio region. All 14 of these changes were constructed and separately introduced into the Vibrio proteolyticus 5S rRNA sequence where they are not normally found. Each variant was evaluated for its ability to function as a valid 5S rRNA in an E. coli cellular context. It was found that 93% (13/14) of the variants tested are likely valid 5S rRNAs in this context. In addition, seven variants were constructed that, although present in the Vibrio region, did not meet the stringent criteria for a highly variable position. In this case, 86% (6/7) are likely valid. As a control we also examined seven variants that are seldom or never seen in the Vibrio region of 5S rRNA sequence space. In this case only two of seven were found to be potentially valid. The

  10. Draft Genome Sequence of Catellicoccus marimammalium, a Novel Species Commonly Found in Gull Feces

    EPA Science Inventory

    Catellicoccus marimammalium is a relatively uncharacterized Gram-positive, facultative anaerobe with potential utility as an indicator of waterfowl fecal contamination. Here we report an annotated draft genome sequence that suggests this organism may be a symbiotic gut microbe.

  11. Use of a Drosophila Genome-Wide Conserved Sequence Database to Identify Functionally Related cis-Regulatory Enhancers

    PubMed Central

    Brody, Thomas; Yavatkar, Amarendra S; Kuzin, Alexander; Kundu, Mukta; Tyson, Leonard J; Ross, Jermaine; Lin, Tzu-Yang; Lee, Chi-Hon; Awasaki, Takeshi; Lee, Tzumin; Odenwald, Ward F

    2012-01-01

    Background: Phylogenetic footprinting has revealed that cis-regulatory enhancers consist of conserved DNA sequence clusters (CSCs). Currently, there is no systematic approach for enhancer discovery and analysis that takes full-advantage of the sequence information within enhancer CSCs. Results: We have generated a Drosophila genome-wide database of conserved DNA consisting of >100,000 CSCs derived from EvoPrints spanning over 90% of the genome. cis-Decoder database search and alignment algorithms enable the discovery of functionally related enhancers. The program first identifies conserved repeat elements within an input enhancer and then searches the database for CSCs that score highly against the input CSC. Scoring is based on shared repeats as well as uniquely shared matches, and includes measures of the balance of shared elements, a diagnostic that has proven to be useful in predicting cis-regulatory function. To demonstrate the utility of these tools, a temporally-restricted CNS neuroblast enhancer was used to identify other functionally related enhancers and analyze their structural organization. Conclusions: cis-Decoder reveals that co-regulating enhancers consist of combinations of overlapping shared sequence elements, providing insights into the mode of integration of multiple regulating transcription factors. The database and accompanying algorithms should prove useful in the discovery and analysis of enhancers involved in any developmental process. Developmental Dynamics 241:169–189, 2012. © 2011 Wiley Periodicals, Inc. Key findings A genome-wide catalog of Drosophila conserved DNA sequence clusters. cis-Decoder discovers functionally related enhancers. Functionally related enhancers share balanced sequence element copy numbers. Many enhancers function during multiple phases of development. PMID:22174086

  12. Massively parallel decoding of mammalian regulatory sequences supports a flexible organizational model.

    PubMed

    Smith, Robin P; Taher, Leila; Patwardhan, Rupali P; Kim, Mee J; Inoue, Fumitaka; Shendure, Jay; Ovcharenko, Ivan; Ahituv, Nadav

    2013-09-01

    Despite continual progress in the cataloging of vertebrate regulatory elements, little is known about their organization and regulatory architecture. Here we describe a massively parallel experiment to systematically test the impact of copy number, spacing, combination and order of transcription factor binding sites on gene expression. A complex library of ∼5,000 synthetic regulatory elements containing patterns from 12 liver-specific transcription factor binding sites was assayed in mice and in HepG2 cells. We find that certain transcription factors act as direct drivers of gene expression in homotypic clusters of binding sites, independent of spacing between sites, whereas others function only synergistically. Heterotypic enhancers are stronger than their homotypic analogs and favor specific transcription factor binding site combinations, mimicking putative native enhancers. Exhaustive testing of binding site permutations suggests that there is flexibility in binding site order. Our findings provide quantitative support for a flexible model of regulatory element activity and suggest a framework for the design of synthetic tissue-specific enhancers. PMID:23892608

  13. The nucleosome landscape of Plasmodium falciparum reveals chromatin architecture and dynamics of regulatory sequences

    PubMed Central

    Kensche, Philip Reiner; Hoeijmakers, Wieteke Anna Maria; Toenhake, Christa Geeke; Bras, Maaike; Chappell, Lia; Berriman, Matthew; Bártfai, Richárd

    2016-01-01

    In eukaryotes, the chromatin architecture has a pivotal role in regulating all DNA-associated processes and it is central to the control of gene expression. For Plasmodium falciparum, a causative agent of human malaria, the nucleosome positioning profile of regulatory regions deserves particular attention because of their extreme AT-content. With the aid of a highly controlled MNase-seq procedure we reveal how positioning of nucleosomes provides a structural and regulatory framework to the transcriptional unit by demarcating landmark sites (transcription/translation start and end sites). In addition, our analysis provides strong indications for the function of positioned nucleosomes in splice site recognition. Transcription start sites (TSSs) are bordered by a small nucleosome-depleted region, but lack the stereotypic downstream nucleosome arrays, highlighting a key difference in chromatin organization compared to model organisms. Furthermore, we observe transcription-coupled eviction of nucleosomes on strong TSSs during intraerythrocytic development and demonstrate that nucleosome positioning and dynamics can be predictive for the functionality of regulatory DNA elements. Collectively, the strong nucleosome positioning over splice sites and surrounding putative transcription factor binding sites highlights the regulatory capacity of the nucleosome landscape in this deadly human pathogen. PMID:26578577

  14. Multilocus sequence analysis of Brazilian Rhizobium microsymbionts of common bean (Phaseolus vulgaris L.) reveals unexpected taxonomic diversity.

    PubMed

    Ribeiro, Renan Augusto; Barcellos, Fernando Gomes; Thompson, Fabiano L; Hungria, Mariangela

    2009-05-01

    The diazotrophic bacteria collectively known as "rhizobia" are important for establishing symbiotic N(2)-fixing associations with many legumes. These microbes have been used for over a century as an environmentally beneficial and cost-effective means of ensuring acceptable yields of agricultural legumes. The most widely used phylogenetic marker for identification and classification of rhizobia has been the 16S rRNA gene; however, this marker fails to discriminate some closely related species. In this study, we established the first multilocus sequence analysis (MLSA) scheme for the identification and classification of rhizobial microsymbionts of common bean (Phaseolus vulgaris L.). We analyzed 12 Brazilian strains representative of a collection of over 850 isolates in addition to type and reference rhizobial strains, by sequencing recA, dnaK, gltA, glnII and rpoA genes. Gene sequence similarities among the five type/reference Rhizobium strains which are symbionts of common bean ranged from 95 to 100% for 16S rRNA, and from 83 to 99% for the other five genes. Rhizobial species described as symbionts of common bean also formed separate groups upon analysis of single and concatenated gene sequences, and clusters formed in each tree were in good mutual agreement. The five additional loci may thus be considered useful markers of the genus Rhizobium; in addition, MLSA also revealed broad genetic diversity among strains classified as Rhizobium tropici, providing evidence of new species. PMID:19403105

  15. FINDING REGULATORY ELEMENTS USING JOINT LIKELIHOODS FOR SEQUENCE AND EXPRESSION PROFILE DATA.

    SciTech Connect

    IAN HOLMES, UC BERKELEY, CA, WILLIAM J. BRUNO, LANL

    2000-08-20

    A recent, popular method of finding promoter sequences is to look for conserved motifs up-stream of genes clustered on the basis of expression data. This method presupposes that the clustering is correct. Theoretically, one should be better able to find promoter sequences and create more relevant gene clusters by taking a unified approach to these two problems. We present a likelihood function for a sequence-expression model giving a joint likelihood for a promoter sequence and its corresponding expression levels. An algorithm to estimate sequence-expression model parameters using Gibbs sampling and Expectation/Maximization is described. A program, called kimono, that implements this algorithm has been developed and the source code is freely available over the internet.

  16. An approach to delineate primers for a group of poorly conserved sequences incorporating the common motif region.

    PubMed

    Sahu, Mousumi; Sahu, Jagajjit; Sahoo, Smita; Dehury, Budheswar; Sarma, Kishore; Sarmah, Ranjan; Sen, Priyabrata; Modi, Mahendra Kumar; Barooah, Madhumita

    2012-01-01

    Glutathione synthetase (gshB) has previously been reported to confer tolerance to acidic soil condition in Rhizobium species. Cloning the gene coding for this enzyme necessitates the designing of proper primer sets which in turn depends on the identification of high quality sequence similarity in multiple global alignments. In this experiment, a group of homologous gene sequences related to gshB gene (accession no: gi-86355669:327589-328536) of Rhizobium etli CFN 42, were extracted from NCBI nucleotide sequence databases using BLASTN and were analyzed for designing degenerate primers. However, the T-coffee multiple global alignment results did not show any block of conserved region for the above sequence set to design the primers. Therefore, we attempted to identify the location of common motif region based on multiple local alignments employing the MEME algorithm supported with MAST and Primer3. The results revealed some common motif regions that enabled us to design the primer sets for related gshB gene sequences. The result will be validated in wet lab. PMID:22419837

  17. Impaired fasting tolerance among Alaska Native Children with a common Carnitine Palmitoyltransferase 1A sequence variant

    PubMed Central

    Gillingham, Melanie B.; Hirschfeld, Matthew; Lowe, Sarah; Matern, Dietrich; Shoemaker, James; Lambert, William E.; Koeller, David M.

    2011-01-01

    A high prevalence of the sequence variant c.1436C>T in the CPT1A gene has been identified among Alaska Native newborns but the clinical implications of this variant are unknown. We conducted medically supervised fasts in 5 children homozygous for the c.1436C>T variant. Plasma free fatty acids increased normally in these children but their long-chain acylcarnitine and ketone production was significantly blunted. The fast was terminated early in two subjects due to symptoms of hypoglycemia. Homozygosity for the c.1436C>T sequence variant of CPT1A impairs fasting ketogenesis, and can cause hypoketotic hypoglycemia in young children. PMID:21763168

  18. RNA Sequencing of Mouse Sinoatrial Node Reveals an Upstream Regulatory Role for Islet-1 in Cardiac Pacemaker Cells

    PubMed Central

    Vedantham, Vasanth; Galang, Giselle; Evangelista, Melissa; Deo, Rahul C.; Srivastava, Deepak

    2015-01-01

    Rationale Treatment of sinus node disease with regenerative or cell-based therapies will require a detailed understanding of gene regulatory networks in cardiac pacemaker cells (PCs). Objective To characterize the transcriptome of PCs using RNA sequencing, and to identify transcriptional networks responsible for PC gene expression. Methods and Results We used laser capture micro-dissection (LCM) on a sinus node reporter mouse line to isolate RNA from PCs for RNA sequencing (RNA-Seq). Differential expression and network analysis identified novel SAN-enriched genes, and predicted that the transcription factor Islet-1 (Isl1) is active in developing pacemaker cells. RNA-Seq on SAN tissue lacking Isl1 established that Isl1 is an important transcriptional regulator within the developing SAN. Conclusions (1) The PC transcriptome diverges sharply from other cardiomyocytes; (2) Isl1 is a positive transcriptional regulator of the PC gene expression program. PMID:25623957

  19. Development and characterization of new single nucleotide polymorphism markers from expressed sequence tags in common carp (Cyprinus carpio).

    PubMed

    Zhu, Chuankun; Cheng, Lei; Tong, Jingou; Yu, Xiaomu

    2012-01-01

    The common carp (Cyprinus carpio) is an important aquaculture fish worldwide but only limited single nucleotide polymorphism (SNP) markers are characterized from expressed sequence tags (ESTs) in this species. In this study, 1487 putative SNPs were bioinformatically mined from 14,066 online ESTs mainly from the European common carp, with the occurrence rate of about one SNP every 173 bp. One hundred and twenty-one of these SNPs were selected for validation using PCR fragment sequencing, and 48 out of 81 primers could amplify the expected fragments in the Chinese common carp genome. Only 26 (21.5%) putative SNPs were validated, however, 508 new SNPs and 68 indels were identified. The ratios of transitions to transversions were 1.77 for exon SNPs and 1.05 for intron SNPs. All the 23 SNPs selected for population tests were polymorphic, with the observed heterozygosity (Ho) ranging from 0.053 to 0.526 (mean 0.262), polymorphism information content (PIC) from 0.095 to 0.357 (mean 0.246), and 21 SNPs were in Hardy-Weinberg equilibrium. These results suggest that different common carp populations with geographic isolation have significant genetic variation at the SNP level, and these new EST-SNP markers are readily available for genetics and breeding studies in common carp. PMID:22837697

  20. Mapping the DNA-binding domain and target sequences of the Streptomyces peucetius daunorubicin biosynthesis regulatory protein, DnrI.

    PubMed

    Sheldon, Paul J; Busarow, Sara B; Hutchinson, C Richard

    2002-04-01

    Streptomyces antibiotic regulatory proteins (SARPs) constitute a novel family of transcriptional activators that control the expression of several diverse anti-biotic biosynthetic gene clusters. The Streptomyces peucetius DnrI protein, one of only a handful of these proteins yet discovered, controls the biosynthesis of the polyketide antitumour antibiotics daunorubicin and doxorubicin. Recently, comparative analyses have revealed significant similarities among the predicted DNA-binding domains of the SARPs and the C-terminal DNA-binding domain of the OmpR family of regulatory proteins. Using the crystal structure of the OmpR-binding domain as a template, DnrI was mapped by truncation and site-directed mutagenesis. Several highly conserved residues within the N-terminus are crucial for DNA binding and protein function. Tandemly arranged heptameric imperfect repeat sequences are found within the -35 promoter regions of target genes. Substitutions for each nucleotide within the repeats of the dnrG-dpsABCD promoter were performed by site-directed mutagenesis. The mutant promoter fragments were found to have modified binding characteristics in gel mobility shift assays. The spacing between the repeat target sequences is also critical for successful occupation by DnrI and, therefore, competent transcriptional activation of the dnrG-dpsABCD operon. PMID:11972782

  1. Application of whole genome and RNA sequencing to investigate the genomic landscape of common variable immunodeficiency disorders

    PubMed Central

    van Schouwenburg, Pauline A.; Davenport, Emma E.; Kienzler, Anne-Kathrin; Marwah, Ishita; Wright, Benjamin; Lucas, Mary; Malinauskas, Tomas; Martin, Hilary C.; Lockstone, Helen E.; Cazier, Jean-Baptiste; Chapel, Helen M.; Knight, Julian C.; Patel, Smita Y.

    2015-01-01

    Common Variable Immunodeficiency Disorders (CVIDs) are the most prevalent cause of primary antibody failure. CVIDs are highly variable and a genetic causes have been identified in <5% of patients. Here, we performed whole genome sequencing (WGS) of 34 CVID patients (94% sporadic) and combined them with transcriptomic profiling (RNA-sequencing of B cells) from three patients and three healthy controls. We identified variants in CVID disease genes TNFRSF13B, TNFRSF13C, LRBA and NLRP12 and enrichment of variants in known and novel disease pathways. The pathways identified include B-cell receptor signalling, non-homologous end-joining, regulation of apoptosis, T cell regulation and ICOS signalling. Our data confirm the polygenic nature of CVID and suggest individual-specific aetiologies in many cases. Together our data show that WGS in combination with RNA-sequencing allows for a better understanding of CVIDs and the identification of novel disease associated pathways. PMID:26122175

  2. Application of whole genome and RNA sequencing to investigate the genomic landscape of common variable immunodeficiency disorders.

    PubMed

    van Schouwenburg, Pauline A; Davenport, Emma E; Kienzler, Anne-Kathrin; Marwah, Ishita; Wright, Benjamin; Lucas, Mary; Malinauskas, Tomas; Martin, Hilary C; Lockstone, Helen E; Cazier, Jean-Baptiste; Chapel, Helen M; Knight, Julian C; Patel, Smita Y

    2015-10-01

    Common Variable Immunodeficiency Disorders (CVIDs) are the most prevalent cause of primary antibody failure. CVIDs are highly variable and a genetic causes have been identified in <5% of patients. Here, we performed whole genome sequencing (WGS) of 34 CVID patients (94% sporadic) and combined them with transcriptomic profiling (RNA-sequencing of B cells) from three patients and three healthy controls. We identified variants in CVID disease genes TNFRSF13B, TNFRSF13C, LRBA and NLRP12 and enrichment of variants in known and novel disease pathways. The pathways identified include B-cell receptor signalling, non-homologous end-joining, regulation of apoptosis, T cell regulation and ICOS signalling. Our data confirm the polygenic nature of CVID and suggest individual-specific aetiologies in many cases. Together our data show that WGS in combination with RNA-sequencing allows for a better understanding of CVIDs and the identification of novel disease associated pathways. PMID:26122175

  3. [Examination of processed vegetable foods for the presence of common DNA sequences of genetically modified tomatoes].

    PubMed

    Kitagawa, Mamiko; Nakamura, Kosuke; Kondo, Kazunari; Ubukata, Shoji; Akiyama, Hiroshi

    2014-01-01

    The contamination of processed vegetable foods with genetically modified tomatoes was investigated by the use of qualitative PCR methods to detect the cauliflower mosaic virus 35S promoter (P35S) and the kanamycin resistance gene (NPTII). DNA fragments of P35S and NPTII were detected in vegetable juice samples, possibly due to contamination with the genomes of cauliflower mosaic virus infecting juice ingredients of Brassica species and soil bacteria, respectively. Therefore, to detect the transformation construct sequences of GM tomatoes, primer pairs were designed for qualitative PCR to specifically detect the border region between P35S and NPTII, and the border region between nopaline synthase gene promoter and NPTII. No amplification of the targeted sequences was observed using genomic DNA purified from the juice ingredients. The developed qualitative PCR method is considered to be a reliable tool to check contamination of products with GM tomatoes. PMID:25743587

  4. Impaired fasting tolerance among Alaska native children with a common carnitine palmitoyltransferase 1A sequence variant.

    PubMed

    Gillingham, Melanie B; Hirschfeld, Matthew; Lowe, Sarah; Matern, Dietrich; Shoemaker, James; Lambert, William E; Koeller, David M

    2011-11-01

    A high prevalence of the sequence variant c.1436C→T in the CPT1A gene has been identified among Alaska Native newborns but the clinical implications of this variant are unknown. We conducted medically supervised fasts in 5 children homozygous for the c.1436C→T variant. Plasma free fatty acids increased normally in these children but their long-chain acylcarnitine and ketone production was significantly blunted. The fast was terminated early in two subjects due to symptoms of hypoglycemia. Homozygosity for the c.1436C→T sequence variant of CPT1A impairs fasting ketogenesis, and can cause hypoketotic hypoglycemia in young children. Trial registration www.clinical trials.gov NCT00653666 "Metabolic Consequences of CPT1A Deficiency" PMID:21763168

  5. Novel phytochrome sequences in Arabidopsis thaliana: Structure, evolution, and differential expression of a plant regulatory photoreceptor family

    SciTech Connect

    Sharrock, R.A.; Quail, P.H. )

    1989-01-01

    Phytochrome is a plant regulatory photoreceptor that mediates red light effects on a wide variety of physiological and molecular responses. DNA blot analysis indicates that the Arabidopsis thaliana genome contains four to five phytochrome-related gene sequences. The authors have isolated and sequenced cDNA clones corresponding to three of these genes and have deduced the amino acid sequence of the full-length polypeptide encoded in each case. One of these proteins (phyA) shows 65-80% amino acid sequence identity with the major, etiolated-tissue phytochrome apoproteins described previously in other plant species. The other two polypeptides (phyB and phyC) are unique in that they have low sequence identity with each other, with phyA, and with all previously described phytochromes. The phyA, phyB, and phyC proteins are of similar molecular mass, have related hydropathic profiles, and contain a conserved chromophore attachment region. However, the sequence comparison data indicate that the three phy genes diverged early in plant evolution, well before the divergence of the two major groups of angiosperms, the monocots and dicots. The steady-state level of the phyA transcript is high in dark-grown A. thaliana seedlings and is down-regulated by light. In contrast, the phyB and phyC transcripts are present at lower levels and are not strongly light-regulated. These findings indicate that the red/far red light-responsive phytochrome photoreceptor system in A. thaliana, and perhaps in all higher plants, consists of a family of chromoproteins that are heterogeneous in structure and regulation.

  6. Delineation of the regulatory region sequences of Agrobacterium tumefaciens virB operon.

    PubMed Central

    Das, A; Pazour, G J

    1989-01-01

    A virB-lacZ translational fusion was constructed to monitor expression of the Agrobacterium tumefaciens virB operon. Expression of the fusion gene was dependent on the presence of pTiA6 virA, virG, and a plant factor acetosyringone. Analysis of deletion mutants, constructed by exonuclease Bal31 digestion, showed that 68 residues upstream of the virB transcription initiation site was necessary for its expression. A TT----CC substitution at positions -62 and -61 led to a 7 fold reduction in virB expression. The virB upstream region contains a tetradecameric sequence, dPuT/ATDCAATGHAAPy (D = A, G or T; H = A, C or T), that is conserved in the non-transcribed regions of all vir genes. Alteration of the position of this sequence relative to the promoter region sequences had a drastic negative effect on virB expression. PMID:2748333

  7. Sequence-based introgression mapping identifies candidate white mold tolerance genes in common bean

    Technology Transfer Automated Retrieval System (TEKTRAN)

    White mold disease, caused by the necrotrophic fungus Sclerotinia sclerotiorum (Lib.) de Bary, is a major pathogen of common bean (Phaseolus vulgaris L.). More than 20 QTL were reported using multiple bi-parental populations. To study the disease in more detail, advanced back-cross populations seg...

  8. Spatio-temporal sequence of cross-regulatory events in root meristem growth

    PubMed Central

    Scacchi, Emanuele; Salinas, Paula; Gujas, Bojan; Santuari, Luca; Krogan, Naden; Ragni, Laura; Berleth, Thomas; Hardtke, Christian S.

    2010-01-01

    A central question in developmental biology is how multicellular organisms coordinate cell division and differentiation to determine organ size. In Arabidopsis roots, this balance is controlled by cytokinin-induced expression of SHORT HYPOCOTYL 2 (SHY2) in the so-called transition zone of the meristem, where SHY2 negatively regulates auxin response factors (ARFs) by protein–protein interaction. The resulting down-regulation of PIN-FORMED (PIN) auxin efflux carriers is considered the key event in promoting differentiation of meristematic cells. Here we show that this regulation involves additional, intermediary factors and is spatio-temporally constrained. We found that the described cytokinin–auxin crosstalk antagonizes BREVIS RADIX (BRX) activity in the developing protophloem. BRX is an auxin-responsive target of the prototypical ARF MONOPTEROS (MP), a key promoter of vascular development, and transiently enhances PIN3 expression to promote meristem growth in young roots. At later stages, cytokinin induction of SHY2 in the vascular transition zone restricts BRX expression to down-regulate PIN3 and thus limit meristem growth. Interestingly, proper SHY2 expression requires BRX, which could reflect feedback on the auxin responsiveness of SHY2 because BRX protein can directly interact with MP, likely acting as a cofactor. Thus, cross-regulatory antagonism between BRX and SHY2 could determine ARF activity in the protophloem. Our data suggest a model in which the regulatory interactions favor BRX expression in the early proximal meristem and SHY2 prevails because of supplementary cytokinin induction in the later distal meristem. The complex equilibrium of this regulatory module might represent a universal switch in the transition toward differentiation in various developmental contexts. PMID:21149702

  9. The genome sequence of Geobacter metallireducens: features of metabolism, physiology and regulation common and dissimilar to Geobacter sulfurreducens

    SciTech Connect

    Aklujkar, Muktak; Krushkal, Julia; DiBartolo, Genevieve; Lapidus, Alla L.; Land, Miriam L; Lovley, Derek

    2009-01-01

    Background. The genome sequence of Geobacter metallireducens is the second to be completed from the metal-respiring genus Geobacter, and is compared in this report to that of Geobacter sulfurreducens in order to understand their metabolic, physiological and regulatory similarities and differences. Results. The experimentally observed greater metabolic versatility of G. metallireducens versus G. sulfurreducens is borne out by the presence of more numerous genes for metabolism of organic acids including acetate, propionate, and pyruvate. Although G. metallireducens lacks a dicarboxylic acid transporter, it has acquired a second succinate dehydrogenase/fumarate reductase complex, suggesting that respiration of fumarate was important until recently in its evolutionary history. Vestiges of the molybdate (ModE) regulon of G. sulfurreducens can be detected in G. metallireducens, which has lost the global regulatory protein ModE but retained some putative ModE-binding sites and multiplied certain genes of molybdenum cofactor biosynthesis. Several enzymes of amino acid metabolism are of different origin in the two species, but significant patterns of gene organization are conserved. Whereas most Geobacteraceae are predicted to obtain biosynthetic reducing equivalents from electron transfer pathways via a ferredoxin oxidoreductase, G. metallireducens can derive them from the oxidative pentose phosphate pathway. In addition to the evidence of greater metabolic versatility, the G. metallireducens genome is also remarkable for the abundance of multicopy nucleotide sequences found in intergenic regions and even within genes. Conclusion. The genomic evidence suggests that metabolism, physiology Background. The genome sequence of Geobacter metallireducens is the second to be completed from the metal-respiring genus Geobacter, and is compared in this report to that of Geobacter sulfurreducens in order to understand their metabolic, physiological and regulatory similarities and

  10. Oxytocin receptor gene sequences in owl monkeys and other primates show remarkable interspecific regulatory and protein coding variation.

    PubMed

    Babb, Paul L; Fernandez-Duque, Eduardo; Schurr, Theodore G

    2015-10-01

    The oxytocin (OT) hormone pathway is involved in numerous physiological processes, and one of its receptor genes (OXTR) has been implicated in pair bonding behavior in mammalian lineages. This observation is important for understanding social monogamy in primates, which occurs in only a small subset of taxa, including Azara's owl monkey (Aotus azarae). To examine the potential relationship between social monogamy and OXTR variation, we sequenced its 5' regulatory (4936bp) and coding (1167bp) regions in 25 owl monkeys from the Argentinean Gran Chaco, and examined OXTR sequences from 1092 humans from the 1000 Genomes Project. We also assessed interspecific variation of OXTR in 25 primate and rodent species that represent a set of phylogenetically and behaviorally disparate taxa. Our analysis revealed substantial variation in the putative 5' regulatory region of OXTR, with marked structural differences across primate taxa, particularly for humans and chimpanzees, which exhibited unique patterns of large motifs of dinucleotide A+T repeats upstream of the OXTR 5' UTR. In addition, we observed a large number of amino acid substitutions in the OXTR CDS region among New World primate taxa that distinguish them from Old World primates. Furthermore, primate taxa traditionally defined as socially monogamous (e.g., gibbons, owl monkeys, titi monkeys, and saki monkeys) all exhibited different amino acid motifs for their respective OXTR protein coding sequences. These findings support the notion that monogamy has evolved independently in Old World and New World primates, and that it has done so through different molecular mechanisms, not exclusively through the oxytocin pathway. PMID:26025428

  11. Massively parallel sequencing of 17 commonly used forensic autosomal STRs and amelogenin with small amplicons.

    PubMed

    Kim, Eun Hye; Lee, Hwan Young; Yang, In Seok; Jung, Sang-Eun; Yang, Woo Ick; Shin, Kyoung-Jin

    2016-05-01

    The next-generation sequencing (NGS) method has been utilized to analyze short tandem repeat (STR) markers, which are routinely used for human identification purposes in the forensic field. Some researchers have demonstrated the successful application of the NGS system to STR typing, suggesting that NGS technology may be an alternative or additional method to overcome limitations of capillary electrophoresis (CE)-based STR profiling. However, there has been no available multiplex PCR system that is optimized for NGS analysis of forensic STR markers. Thus, we constructed a multiplex PCR system for the NGS analysis of 18 markers (13CODIS STRs, D2S1338, D19S433, Penta D, Penta E and amelogenin) by designing amplicons in the size range of 77-210 base pairs. Then, PCR products were generated from two single-sources, mixed samples and artificially degraded DNA samples using a multiplex PCR system, and were prepared for sequencing on the MiSeq system through construction of a subsequent barcoded library. By performing NGS and analyzing the data, we confirmed that the resultant STR genotypes were consistent with those of CE-based typing. Moreover, sequence variations were detected in targeted STR regions. Through the use of small-sized amplicons, the developed multiplex PCR system enables researchers to obtain successful STR profiles even from artificially degraded DNA as well as STR loci which are analyzed with large-sized amplicons in the CE-based commercial kits. In addition, successful profiles can be obtained from mixtures up to a 1:19 ratio. Consequently, the developed multiplex PCR system, which produces small size amplicons, can be successfully applied to STR NGS analysis of forensic casework samples such as mixtures and degraded DNA samples. PMID:26799314

  12. The genome sequence of Geobacter metallireducens: features of metabolism, physiology and regulation common and dissimilar to Geobacter sulfurreducens

    SciTech Connect

    Aklujkar, Muktak; Krushkal, Julia; DiBartolo, Genevieve; Lapidus, Alla; Land, Miriam L.; Lovley, Derek R.

    2008-12-01

    Background: The genome sequence of Geobacter metallireducens is the second to be completed from the metal-respiring genus Geobacter, and is compared in this report to that of Geobacter sulfurreducens in order to understand their metabolic, physiological and regulatory similarities and differences. Results: The experimentally observed greater metabolic versatility of G. metallireducens versus G. sulfurreducens is borne out by the presence of more numerous genes for metabolism of organic acids including acetate, propionate, and pyruvate. Although G. metallireducens lacks a dicarboxylic acid transporter, it has acquired a second succinate dehydrogenase/fumarate reductase complex, suggesting that respiration of fumarate was important until recently in its evolutionary history. Vestiges of the molybdate (ModE) regulon of G. sulfurreducens can be detected in G. metallireducens, which has lost the global regulatory protein ModE but retained some putative ModE-binding sites and multiplied certain genes of molybdenum cofactor biosynthesis. Several enzymes of amino acid metabolism are of different origin in the two species, but significant patterns of gene organization are conserved. Whereas most Geobacteraceae are predicted to obtain biosynthetic reducing equivalents from electron transfer pathways via a ferredoxin oxidoreductase, G. metallireducens can derive them from the oxidative pentose phosphate pathway. In addition to the evidence of greater metabolic versatility, the G. metallireducens genome is also remarkable for the abundance of multicopy nucleotide sequences found in intergenic regions and even within genes. Conclusion: The genomic evidence suggests that metabolism, physiology and regulation of gene expression in G. metallireducens may be dramatically different from other Geobacteraceae.

  13. The genome sequence of Geobacter metallireducens: features of metabolism, physiology and regulation common and dissimilar to Geobacter sulfurreducens

    PubMed Central

    2009-01-01

    Background The genome sequence of Geobacter metallireducens is the second to be completed from the metal-respiring genus Geobacter, and is compared in this report to that of Geobacter sulfurreducens in order to understand their metabolic, physiological and regulatory similarities and differences. Results The experimentally observed greater metabolic versatility of G. metallireducens versus G. sulfurreducens is borne out by the presence of more numerous genes for metabolism of organic acids including acetate, propionate, and pyruvate. Although G. metallireducens lacks a dicarboxylic acid transporter, it has acquired a second putative succinate dehydrogenase/fumarate reductase complex, suggesting that respiration of fumarate was important until recently in its evolutionary history. Vestiges of the molybdate (ModE) regulon of G. sulfurreducens can be detected in G. metallireducens, which has lost the global regulatory protein ModE but retained some putative ModE-binding sites and multiplied certain genes of molybdenum cofactor biosynthesis. Several enzymes of amino acid metabolism are of different origin in the two species, but significant patterns of gene organization are conserved. Whereas most Geobacteraceae are predicted to obtain biosynthetic reducing equivalents from electron transfer pathways via a ferredoxin oxidoreductase, G. metallireducens can derive them from the oxidative pentose phosphate pathway. In addition to the evidence of greater metabolic versatility, the G. metallireducens genome is also remarkable for the abundance of multicopy nucleotide sequences found in intergenic regions and even within genes. Conclusion The genomic evidence suggests that metabolism, physiology and regulation of gene expression in G. metallireducens may be dramatically different from other Geobacteraceae. PMID:19473543

  14. Next-generation sequencing of common osteogenesis imperfecta-related genes in clinical practice.

    PubMed

    Árvai, Kristóf; Horváth, Péter; Balla, Bernadett; Tobiás, Bálint; Kató, Karina; Kirschner, Gyöngyi; Klujber, Valéria; Lakatos, Péter; Kósa, János P

    2016-01-01

    Next generation sequencing (NGS) is a rapidly developing area in genetics. Utilizing this technology in the management of disorders with complex genetic background and not recurrent mutation hot spots can be extremely useful. In this study, we applied NGS, namely semiconductor sequencing to determine the most significant osteogenesis imperfecta-related genetic variants in the clinical practice. We selected genes coding collagen type I alpha-1 and-2 (COL1A1, COL1A2) which are responsible for more than 90% of all cases. CRTAP and LEPRE1/P3H1 genes involved in the background of the recessive forms with relatively high frequency (type VII and VIII) represent less than 10% of the disease. In our six patients (1-41 years), we identified 23 different variants. We found a total of 14 single nucleotide variants (SNV) in COL1A1 and COL1A2, 5 in CRTAP and 4 in LEPRE1. Two novel and two already well-established pathogenic SNVs have been identified. Among the newly recognized mutations, one results in an amino acid change and one of them is a stop codon. We have shown that a new full-scale cost-effective NGS method can be developed and utilized to supplement diagnostic process of osteogenesis imperfecta with molecular genetic data in clinical practice. PMID:27335225

  15. Next-generation sequencing of common osteogenesis imperfecta-related genes in clinical practice

    PubMed Central

    Árvai, Kristóf; Horváth, Péter; Balla, Bernadett; Tobiás, Bálint; Kató, Karina; Kirschner, Gyöngyi; Klujber, Valéria; Lakatos, Péter; Kósa, János P.

    2016-01-01

    Next generation sequencing (NGS) is a rapidly developing area in genetics. Utilizing this technology in the management of disorders with complex genetic background and not recurrent mutation hot spots can be extremely useful. In this study, we applied NGS, namely semiconductor sequencing to determine the most significant osteogenesis imperfecta-related genetic variants in the clinical practice. We selected genes coding collagen type I alpha-1 and-2 (COL1A1, COL1A2) which are responsible for more than 90% of all cases. CRTAP and LEPRE1/P3H1 genes involved in the background of the recessive forms with relatively high frequency (type VII and VIII) represent less than 10% of the disease. In our six patients (1–41 years), we identified 23 different variants. We found a total of 14 single nucleotide variants (SNV) in COL1A1 and COL1A2, 5 in CRTAP and 4 in LEPRE1. Two novel and two already well-established pathogenic SNVs have been identified. Among the newly recognized mutations, one results in an amino acid change and one of them is a stop codon. We have shown that a new full-scale cost-effective NGS method can be developed and utilized to supplement diagnostic process of osteogenesis imperfecta with molecular genetic data in clinical practice. PMID:27335225

  16. SNP discovery in common bean by restriction-associated DNA (RAD) sequencing for genetic diversity and population structure analysis.

    PubMed

    Valdisser, Paula Arielle M R; Pappas, Georgios J; de Menezes, Ivandilson P P; Müller, Bárbara S F; Pereira, Wendell J; Narciso, Marcelo G; Brondani, Claudio; Souza, Thiago L P O; Borba, Tereza C O; Vianello, Rosana P

    2016-06-01

    Researchers have made great advances into the development and application of genomic approaches for common beans, creating opportunities to driving more real and applicable strategies for sustainable management of the genetic resource towards plant breeding. This work provides useful polymorphic single-nucleotide polymorphisms (SNPs) for high-throughput common bean genotyping developed by RAD (restriction site-associated DNA) sequencing. The RAD tags were generated from DNA pooled from 12 common bean genotypes, including breeding lines of different gene pools and market classes. The aligned sequences identified 23,748 putative RAD-SNPs, of which 3357 were adequate for genotyping; 1032 RAD-SNPs with the highest ADT (assay design tool) score are presented in this article. The RAD-SNPs were structurally annotated in different coding (47.00 %) and non-coding (53.00 %) sequence components of genes. A subset of 384 RAD-SNPs with broad genome distribution was used to genotype a diverse panel of 95 common bean germplasms and revealed a successful amplification rate of 96.6 %, showing 73 % of polymorphic SNPs within the Andean group and 83 % in the Mesoamerican group. A slightly increased He (0.161, n = 21) value was estimated for the Andean gene pool, compared to the Mesoamerican group (0.156, n = 74). For the linkage disequilibrium (LD) analysis, from a group of 580 SNPs (289 RAD-SNPs and 291 BARC-SNPs) genotyped for the same set of genotypes, 70.2 % were in LD, decreasing to 0.10 %in the Andean group and 0.77 % in the Mesoamerican group. Haplotype patterns spanning 310 Mb of the genome (60 %) were characterized in samples from different origins. However, the haplotype frameworks were under-represented for the Andean (7.85 %) and Mesoamerican (5.55 %) gene pools separately. In conclusion, RAD sequencing allowed the discovery of hundreds of useful SNPs for broad genetic analysis of common bean germplasm. From now, this approach provides an excellent panel

  17. RefNetBuilder: a platform for construction of integrated reference gene regulatory networks from expressed sequence tags

    PubMed Central

    2011-01-01

    Background Gene Regulatory Networks (GRNs) provide integrated views of gene interactions that control biological processes. Many public databases contain biological interactions extracted from experimentally validated literature reports, but most furnish only information for a few genetic model organisms. In order to provide a bioinformatic tool for researchers who work with non-model organisms, we developed RefNetBuilder, a new platform that allows construction of putative reference pathways or GRNs from expressed sequence tags (ESTs). Results RefNetBuilder was designed to have the flexibility to extract and archive pathway or GRN information from public databases such as the Kyoto Encyclopedia of Genes and Genomes (KEGG). It features sequence alignment tools such as BLAST to allow mapping ESTs to pathways and GRNs in model organisms. A scoring algorithm was incorporated to rank and select the best match for each query EST. We validated RefNetBuilder using DNA sequences of Caenorhabditis elegans, a model organism having manually curated KEGG pathways. Using the earthworm Eisenia fetida as an example, we demonstrated the functionalities and features of RefNetBuilder. Conclusions The RefNetBuilder provides a standalone application for building reference GRNs for non-model organisms on a number of operating system platforms with standard desktop computer hardware. As a new bioinformatic tool aimed for constructing putative GRNs for non-model organisms that have only ESTs available, RefNetBuilder is especially useful to explore pathway- or network-related information in these organisms. PMID:22166047

  18. Nucleotide sequence of the regulatory locus controlling expression of bacterial genes for bioluminescence.

    PubMed Central

    Engebrecht, J; Silverman, M

    1987-01-01

    Production of light by the marine bacterium Vibrio fischeri and by recombinant hosts containing cloned lux genes is controlled by the density of the culture. Density-dependent regulation of lux gene expression has been shown to require a locus consisting of the luxR and luxI genes and two closely linked divergent promoters. As part of a genetic analysis to understand the regulation of bioluminescence, we have sequenced the region of DNA containing this control circuit. Open reading frames corresponding to luxR and luxI were identified; transcription start sites were defined by S1 nuclease mapping and sequences resembling promoter elements were located. Images PMID:3697093

  19. Co-regulation of invected and engrailed by a complex array of regulatory sequences in Drosophila

    PubMed Central

    Cheng, Yuzhong; Brunner, Alayne L.; Kremer, Stefanie; DeVido, Sarah K.; Stefaniuk, Catherine M.; Kassis, Judith A.

    2014-01-01

    invected (inv) and engrailed (en) form a gene complex that extends about 115kb. These two genes encode highly related homeodomain proteins that are co-regulated in a complex manner throughout development. Our dissection of inv/en regulatory DNA shows that most enhancers are spread throughout a 62kb region. We used two types of constructs to analyze the function of this DNA: P-element based reporter constructs with small pieces of DNA fused to the en promoter driving lacZ expression and large constructs with HA-tagged en and inv inserted in the genome with the phiC31 system. In addition, we generated deletions of inv and en DNA in situ and assayed their effects on inv/en expression. Our results support and extend our knowledge of inv/en regulation. First, inv and en share regulatory DNA, most of which is flanking the en transcription unit. In support of this, a 79-kb HA-en transgene can rescue inv en double mutants to viable, fertile adults. In contrast, an 84-kb HA-inv transgene lacks most of the enhancers for inv/en expression. Second, there are multiple enhancers for inv/en stripes in embryos; some of these may be redundant but others play discrete roles at different stages of embryonic development. Finally, no small reporter construct gave expression in the posterior compartment of imaginal discs, a hallmark of inv/en expression. Robust expression of HA-en in the posterior compartment of imaginal discs is evident from the 79-kb HA-en transgene, while a 45-kb HA-en transgene gives weaker, variable imaginal disc expression. We suggest that the activity of the imaginal disc enhancer(s) is dependent on the chromatin structure of the inv/en domain. PMID:25172431

  20. A variant in the sonic hedgehog regulatory sequence (ZRS) is associated with triphalangeal thumb and deregulates expression in the developing limb

    PubMed Central

    Furniss, Dominic; Lettice, Laura A.; Taylor, Indira B.; Critchley, Paul S.; Giele, Henk; Hill, Robert E.; Wilkie, Andrew O.M.

    2008-01-01

    A locus for triphalangeal thumb, variably associated with pre-axial polydactyly, was previously identified in the zone of polarizing activity regulatory sequence (ZRS), a long range limb-specific enhancer of the Sonic Hedgehog (SHH) gene at human chromosome 7q36.3. Here, we demonstrate that a 295T>C variant in the human ZRS, previously thought to represent a neutral polymorphism, acts as a dominant allele with reduced penetrance. We found this variant in three independently ascertained probands from southern England with triphalangeal thumb, demonstrated significant linkage of the phenotype to the variant (LOD = 4.1), and identified a shared microsatellite haplotype around the ZRS, suggesting that the probands share a common ancestor. An individual homozygous for the 295C allele presented with isolated bilateral triphalangeal thumb resembling the heterozygous phenotype, suggesting that the variant is largely dominant to the wild-type allele. As a functional test of the pathogenicity of the 295C allele, we utilized a mutated ZRS construct to demonstrate that it can drive ectopic anterior expression of a reporter gene in the developing mouse forelimb. We conclude that the 295T>C variant is in fact pathogenic and, in southern England, appears to be the most common cause of triphalangeal thumb. Depending on the dispersal of the founding mutation, it may play a wider role in the aetiology of this disorder. PMID:18463159

  1. Training in High-Throughput Sequencing: Common Guidelines to Enable Material Sharing, Dissemination, and Reusability

    PubMed Central

    Schiffthaler, Bastian

    2016-01-01

    The advancement of high-throughput sequencing (HTS) technologies and the rapid development of numerous analysis algorithms and pipelines in this field has resulted in an unprecedentedly high demand for training scientists in HTS data analysis. Embarking on developing new training materials is challenging for many reasons. Trainers often do not have prior experience in preparing or delivering such materials and struggle to keep them up to date. A repository of curated HTS training materials would support trainers in materials preparation, reduce the duplication of effort by increasing the usage of existing materials, and allow for the sharing of teaching experience among the HTS trainers’ community. To achieve this, we have developed a strategy for materials’ curation and dissemination. Standards for describing training materials have been proposed and applied to the curation of existing materials. A Git repository has been set up for sharing annotated materials that can now be reused, modified, or incorporated into new courses. This repository uses Git; hence, it is decentralized and self-managed by the community and can be forked/built-upon by all users. The repository is accessible at http://bioinformatics.upsc.se/htmr. PMID:27309738

  2. Training in High-Throughput Sequencing: Common Guidelines to Enable Material Sharing, Dissemination, and Reusability.

    PubMed

    Schiffthaler, Bastian; Kostadima, Myrto; Delhomme, Nicolas; Rustici, Gabriella

    2016-06-01

    The advancement of high-throughput sequencing (HTS) technologies and the rapid development of numerous analysis algorithms and pipelines in this field has resulted in an unprecedentedly high demand for training scientists in HTS data analysis. Embarking on developing new training materials is challenging for many reasons. Trainers often do not have prior experience in preparing or delivering such materials and struggle to keep them up to date. A repository of curated HTS training materials would support trainers in materials preparation, reduce the duplication of effort by increasing the usage of existing materials, and allow for the sharing of teaching experience among the HTS trainers' community. To achieve this, we have developed a strategy for materials' curation and dissemination. Standards for describing training materials have been proposed and applied to the curation of existing materials. A Git repository has been set up for sharing annotated materials that can now be reused, modified, or incorporated into new courses. This repository uses Git; hence, it is decentralized and self-managed by the community and can be forked/built-upon by all users. The repository is accessible at http://bioinformatics.upsc.se/htmr. PMID:27309738

  3. CTCF binding site sequence differences are associated with unique regulatory and functional trends during embryonic stem cell differentiation.

    PubMed

    Plasschaert, Robert N; Vigneau, Sébastien; Tempera, Italo; Gupta, Ravi; Maksimoska, Jasna; Everett, Logan; Davuluri, Ramana; Mamorstein, Ronen; Lieberman, Paul M; Schultz, David; Hannenhalli, Sridhar; Bartolomei, Marisa S

    2014-01-01

    CTCF (CCCTC-binding factor) is a highly conserved multifunctional DNA-binding protein with thousands of binding sites genome-wide. Our previous work suggested that differences in CTCF's binding site sequence may affect the regulation of CTCF recruitment and its function. To investigate this possibility, we characterized changes in genome-wide CTCF binding and gene expression during differentiation of mouse embryonic stem cells. After separating CTCF sites into three classes (LowOc, MedOc and HighOc) based on similarity to the consensus motif, we found that developmentally regulated CTCF binding occurs preferentially at LowOc sites, which have lower similarity to the consensus. By measuring the affinity of CTCF for selected sites, we show that sites lost during differentiation are enriched in motifs associated with weaker CTCF binding in vitro. Specifically, enrichment for T at the 18(th) position of the CTCF binding site is associated with regulated binding in the LowOc class and can predictably reduce CTCF affinity for binding sites. Finally, by comparing changes in CTCF binding with changes in gene expression during differentiation, we show that LowOc and HighOc sites are associated with distinct regulatory functions. Our results suggest that the regulatory control of CTCF is dependent in part on specific motifs within its binding site. PMID:24121688

  4. CTCF binding site sequence differences are associated with unique regulatory and functional trends during embryonic stem cell differentiation

    PubMed Central

    Plasschaert, Robert N.; Vigneau, Sébastien; Tempera, Italo; Gupta, Ravi; Maksimoska, Jasna; Everett, Logan; Davuluri, Ramana; Mamorstein, Ronen; Lieberman, Paul M.; Schultz, David; Hannenhalli, Sridhar; Bartolomei, Marisa S.

    2014-01-01

    CTCF (CCCTC-binding factor) is a highly conserved multifunctional DNA-binding protein with thousands of binding sites genome-wide. Our previous work suggested that differences in CTCF’s binding site sequence may affect the regulation of CTCF recruitment and its function. To investigate this possibility, we characterized changes in genome-wide CTCF binding and gene expression during differentiation of mouse embryonic stem cells. After separating CTCF sites into three classes (LowOc, MedOc and HighOc) based on similarity to the consensus motif, we found that developmentally regulated CTCF binding occurs preferentially at LowOc sites, which have lower similarity to the consensus. By measuring the affinity of CTCF for selected sites, we show that sites lost during differentiation are enriched in motifs associated with weaker CTCF binding in vitro. Specifically, enrichment for T at the 18th position of the CTCF binding site is associated with regulated binding in the LowOc class and can predictably reduce CTCF affinity for binding sites. Finally, by comparing changes in CTCF binding with changes in gene expression during differentiation, we show that LowOc and HighOc sites are associated with distinct regulatory functions. Our results suggest that the regulatory control of CTCF is dependent in part on specific motifs within its binding site. PMID:24121688

  5. Common and distinct DNA-binding and regulatory activities of the BEN-solo transcription factor family

    PubMed Central

    Dai, Qi; Ren, Aiming; Westholm, Jakub O.; Duan, Hong; Patel, Dinshaw J.

    2015-01-01

    Recently, the BEN (BANP, E5R, and NAC1) domain was recognized as a new class of conserved DNA-binding domain. The fly genome encodes three proteins that bear only a single BEN domain (“BEN-solo” factors); namely, Insensitive (Insv), Bsg25A (Elba1), and CG9883 (Elba2). Insv homodimers preferentially bind CCAATTGG palindromes throughout the genome to mediate transcriptional repression, whereas Bsg25A and Elba2 heterotrimerize with their obligate adaptor, Elba3 (i.e., the ELBA complex), to recognize a CCAATAAG motif in the Fab-7 insulator. While these data suggest distinct DNA-binding properties of BEN-solo proteins, we performed reporter assays that indicate that both Bsg25A and Elba2 can individually recognize Insv consensus sites efficiently. We confirmed this by solving the structure of Bsg25A complexed to the Insv site, which showed that key aspects of the BEN:DNA recognition strategy are similar between these proteins. We next show that both Insv and ELBA proteins are competent to mediate transcriptional repression via Insv consensus sequences but that the ELBA complex appears to be selective for the ELBA site. Reciprocally, genome-wide analysis reveals that Insv exhibits significant cobinding to class I insulator elements, indicating that it may also contribute to insulator function. Indeed, we observed abundant Insv binding within the Hox complexes with substantial overlaps with class I insulators, many of which bear Insv consensus sites. Moreover, Insv coimmunoprecipitates with the class I insulator factor CP190. Finally, we observed that Insv harbors exclusive activity among fly BEN-solo factors with respect to regulation of Notch-mediated cell fate choices in the peripheral nervous system. This in vivo activity is recapitulated by BEND6, a mammalian BEN-solo factor that conserves the Notch corepressor function of Insv but not its capacity to bind Insv consensus sites. Altogether, our data define an array of common and distinct biochemical and functional

  6. Common and distinct DNA-binding and regulatory activities of the BEN-solo transcription factor family.

    PubMed

    Dai, Qi; Ren, Aiming; Westholm, Jakub O; Duan, Hong; Patel, Dinshaw J; Lai, Eric C

    2015-01-01

    Recently, the BEN (BANP, E5R, and NAC1) domain was recognized as a new class of conserved DNA-binding domain. The fly genome encodes three proteins that bear only a single BEN domain ("BEN-solo" factors); namely, Insensitive (Insv), Bsg25A (Elba1), and CG9883 (Elba2). Insv homodimers preferentially bind CCAATTGG palindromes throughout the genome to mediate transcriptional repression, whereas Bsg25A and Elba2 heterotrimerize with their obligate adaptor, Elba3 (i.e., the ELBA complex), to recognize a CCAATAAG motif in the Fab-7 insulator. While these data suggest distinct DNA-binding properties of BEN-solo proteins, we performed reporter assays that indicate that both Bsg25A and Elba2 can individually recognize Insv consensus sites efficiently. We confirmed this by solving the structure of Bsg25A complexed to the Insv site, which showed that key aspects of the BEN:DNA recognition strategy are similar between these proteins. We next show that both Insv and ELBA proteins are competent to mediate transcriptional repression via Insv consensus sequences but that the ELBA complex appears to be selective for the ELBA site. Reciprocally, genome-wide analysis reveals that Insv exhibits significant cobinding to class I insulator elements, indicating that it may also contribute to insulator function. Indeed, we observed abundant Insv binding within the Hox complexes with substantial overlaps with class I insulators, many of which bear Insv consensus sites. Moreover, Insv coimmunoprecipitates with the class I insulator factor CP190. Finally, we observed that Insv harbors exclusive activity among fly BEN-solo factors with respect to regulation of Notch-mediated cell fate choices in the peripheral nervous system. This in vivo activity is recapitulated by BEND6, a mammalian BEN-solo factor that conserves the Notch corepressor function of Insv but not its capacity to bind Insv consensus sites. Altogether, our data define an array of common and distinct biochemical and functional

  7. Molecular identification of the traditional herbal medicines, Arisaematis Rhizoma and Pinelliae Tuber, and common adulterants via universal DNA barcode sequences.

    PubMed

    Moon, B C; Kim, W J; Ji, Y; Lee, Y M; Kang, Y M; Choi, G

    2016-01-01

    Methods to identify Pinelliae Tuber and Arisaematis Rhizoma are required because of frequent reciprocal substitution between these two herbal medicines and the existence of several closely related plant materials. As a result of the morphological similarity of dried tubers, correct discrimination of authentic herbal medicines is difficult by conventional methods. Therefore, we analyzed DNA barcode sequences to identify each herbal medicine and the common adulterants at a species level. To verify the identity of these herbal medicines, we collected five authentic species (Pinellia ternata for Pinelliae Tuber, and Arisaema amurense, A. amurense var. serratum, A. erubescens, and A. heterophyllum for Arisaematis Rhizoma) and six common adulterant plant species. Maturase K (matK) and ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit (rbcL) genes were then amplified using universal primers. In comparative analyses of two DNA barcode sequences, we obtained 45 species-specific nucleotides sufficient to identify each species (except A. erubescens with matK) and 28 marker nucleotides for each species (except P. pedatisecta with rbcL). Sequence differences at corresponding positions of the two combined DNA barcodes provided genetic marker nucleotides that could be used to identify specimens of the correct species among the analyzed medicinal plants. Furthermore, we generated a phylogenetic tree showing nine distinct groups depending on the species. These results can be used to authenticate Pinelliae Tuber and Arisaematis Rhizoma from their adulterants and to identify each species. Thus, comparative analyses of plant DNA barcode sequences identified useful genetic markers for the authentication of Pinelliae Tuber and Arisaematis Rhizoma from several adulterant herbal materials. PMID:26909979

  8. 'In silico expression analysis', a novel PathoPlant web tool to identify abiotic and biotic stress conditions associated with specific cis-regulatory sequences.

    PubMed

    Bolívar, Julio C; Machens, Fabian; Brill, Yuri; Romanov, Artyom; Bülow, Lorenz; Hehl, Reinhard

    2014-01-01

    Using bioinformatics, putative cis-regulatory sequences can be easily identified using pattern recognition programs on promoters of specific gene sets. The abundance of predicted cis-sequences is a major challenge to associate these sequences with a possible function in gene expression regulation. To identify a possible function of the predicted cis-sequences, a novel web tool designated 'in silico expression analysis' was developed that correlates submitted cis-sequences with gene expression data from Arabidopsis thaliana. The web tool identifies the A. thaliana genes harbouring the sequence in a defined promoter region and compares the expression of these genes with microarray data. The result is a hierarchy of abiotic and biotic stress conditions to which these genes are most likely responsive. When testing the performance of the web tool, known cis-regulatory sequences were submitted to the 'in silico expression analysis' resulting in the correct identification of the associated stress conditions. When using a recently identified novel elicitor-responsive sequence, a WT-box (CGACTTTT), the 'in silico expression analysis' predicts that genes harbouring this sequence in their promoter are most likely Botrytis cinerea induced. Consistent with this prediction, the strongest induction of a reporter gene harbouring this sequence in the promoter is observed with B. cinerea in transgenic A. thaliana. DATABASE URL: http://www.pathoplant.de/expression_analysis.php. PMID:24727366

  9. ‘In silico expression analysis’, a novel PathoPlant web tool to identify abiotic and biotic stress conditions associated with specific cis-regulatory sequences

    PubMed Central

    Machens, Fabian; Brill, Yuri; Romanov, Artyom; Bülow, Lorenz; Hehl, Reinhard

    2014-01-01

    Using bioinformatics, putative cis-regulatory sequences can be easily identified using pattern recognition programs on promoters of specific gene sets. The abundance of predicted cis-sequences is a major challenge to associate these sequences with a possible function in gene expression regulation. To identify a possible function of the predicted cis-sequences, a novel web tool designated ‘in silico expression analysis’ was developed that correlates submitted cis-sequences with gene expression data from Arabidopsis thaliana. The web tool identifies the A. thaliana genes harbouring the sequence in a defined promoter region and compares the expression of these genes with microarray data. The result is a hierarchy of abiotic and biotic stress conditions to which these genes are most likely responsive. When testing the performance of the web tool, known cis-regulatory sequences were submitted to the ‘in silico expression analysis’ resulting in the correct identification of the associated stress conditions. When using a recently identified novel elicitor-responsive sequence, a WT-box (CGACTTTT), the ‘in silico expression analysis’ predicts that genes harbouring this sequence in their promoter are most likely Botrytis cinerea induced. Consistent with this prediction, the strongest induction of a reporter gene harbouring this sequence in the promoter is observed with B. cinerea in transgenic A. thaliana. Database URL: http://www.pathoplant.de/expression_analysis.php. PMID:24727366

  10. Identification of lignin genes and regulatory sequences involved in secondary cell wall formation in Acacia auriculiformis and Acacia mangium via de novo transcriptome sequencing

    PubMed Central

    2011-01-01

    useful markers for population genetics studies and marker-assisted selection. Conclusion We have produced the first comprehensive transcriptome-wide analysis in A. auriculiformis and A. mangium using de novo assembly techniques. Our high quality and comprehensive assemblies allowed the identification of many genes in the lignin biosynthesis and secondary cell wall formation in Acacia hybrids. Our results demonstrated that Next Generation Sequencing is a cost-effective method for gene discovery, identification of regulatory sequences, and informative markers in a non-model plant. PMID:21729267

  11. Opossum carboxylesterases: sequences, phylogeny and evidence for CES gene duplication events predating the marsupial-eutherian common ancestor

    PubMed Central

    2008-01-01

    Background Carboxylesterases (CES) perform diverse metabolic roles in mammalian organisms in the detoxification of a broad range of drugs and xenobiotics and may also serve in specific roles in lipid, cholesterol, pheromone and lung surfactant metabolism. Five CES families have been reported in mammals with human CES1 and CES2 the most extensively studied. Here we describe the genetics, expression and phylogeny of CES isozymes in the opossum and report on the sequences and locations of CES1, CES2 and CES6 'like' genes within two gene clusters on chromosome one. We also discuss the likely sequence of gene duplication events generating multiple CES genes during vertebrate evolution. Results We report a cDNA sequence for an opossum CES and present evidence for CES1 and CES2 like genes expressed in opossum liver and intestine and for distinct gene locations of five opossum CES genes,CES1, CES2.1, CES2.2, CES2.3 and CES6, on chromosome 1. Phylogenetic and sequence alignment studies compared the predicted amino acid sequences for opossum CES with those for human, mouse, chicken, frog, salmon and Drosophila CES gene products. Phylogenetic analyses produced congruent phylogenetic trees depicting a rapid early diversification into at least five distinct CES gene family clusters: CES2, CES1, CES7, CES3, and CES6. Molecular divergence estimates based on a Bayesian relaxed clock approach revealed an origin for the five mammalian CES gene families between 328–378 MYA. Conclusion The deduced amino acid sequence for an opossum cDNA was consistent with its identity as a mammalian CES2 gene product (designated CES2.1). Distinct gene locations for opossum CES1 (1: 446,222,550–446,274,850), three CES2 genes (1: 677,773,395–677,927,030) and a CES6 gene (1: 677,585,520–677,730,419) were observed on chromosome 1. Opossum CES1 and multiple CES2 genes were expressed in liver and intestine. Amino acid sequences for opossum CES1 and three CES2 gene products revealed conserved

  12. Sequence determinants of DNA bending in the ilvlH promoter and regulatory region of Escherichia coli.

    PubMed Central

    Wang, Q; Albert, F G; Fitzgerald, D J; Calvo, J M; Anderson, J N

    1994-01-01

    Previous studies have shown that the promoter/regulatory region of the ilvlH operon displays intrinsic curvature, with the bend center located at position -120 relative to the transcription start site. In this report, a 57 bp sequence spanning the bend center was mutagenized in vitro in order to study the relationship between nucleotide sequence and curvature measured by electrophoresis. The strategy used for analyzing the results consisted of determining the strengths of the relationships between electrophoretic anomaly and predicted curvature calculated by computer programs that differ in wedge angle composition. The results revealed that programs which assume that bending occurs only at AA/TT display good predictive value, with correlation coefficients between electrophoretic anomaly and predicted curvature as high as 0.93. In contrast, a program which assumes that bending occurs at all 16 dinucleotide steps exhibited lower predictive value, while there were no significant relationships between the experimental data and curvature calculated by a program that was based on all non-AA/TT wedge values. These results show that the complete wedge model which incorporates values for all dinucleotide steps does not adequately describe the electrophoretic data in this report. PMID:7838732

  13. Integrated microRNA-mRNA analyses reveal OPLL specific microRNA regulatory network using high-throughput sequencing.

    PubMed

    Xu, Chen; Chen, Yu; Zhang, Hao; Chen, Yuanyuan; Shen, Xiaolong; Shi, Changgui; Liu, Yang; Yuan, Wen

    2016-01-01

    Ossification of the posterior longitudinal ligament (OPLL) is a genetic disorder which involves pathological heterotopic ossification of the spinal ligaments. Although studies have identified several genes that correlated with OPLL, the underlying regulation network is far from clear. Through small RNA sequencing, we compared the microRNA expressions of primary posterior longitudinal ligament cells form OPLL patients with normal patients (PLL) and identified 218 dysregulated miRNAs (FDR < 0.01). Furthermore, assessing the miRNA profiling data of multiple cell types, we found these dysregulated miRNAs were mostly OPLL specific. In order to decipher the regulation network of these OPLL specific miRNAs, we integrated mRNA expression profiling data with miRNA sequencing data. Through computational approaches, we showed the pivotal roles of these OPLL specific miRNAs in heterotopic ossification of longitudinal ligament by discovering highly correlated miRNA/mRNA pairs that associated with skeletal system development, collagen fibril organization, and extracellular matrix organization. The results of which provide strong evidence that the miRNA regulatory networks we established may indeed play vital roles in OPLL onset and progression. To date, this is the first systematic analysis of the micronome in OPLL, and thus may provide valuable resources in finding novel treatment and diagnostic targets of OPLL. PMID:26868491

  14. RNA-ID, a highly sensitive and robust method to identify cis-regulatory sequences using superfolder GFP and a fluorescence-based assay

    PubMed Central

    Dean, Kimberly M.; Grayhack, Elizabeth J.

    2012-01-01

    We have developed a robust and sensitive method, called RNA-ID, to screen for cis-regulatory sequences in RNA using fluorescence-activated cell sorting (FACS) of yeast cells bearing a reporter in which expression of both superfolder green fluorescent protein (GFP) and yeast codon-optimized mCherry red fluorescent protein (RFP) is driven by the bidirectional GAL1,10 promoter. This method recapitulates previously reported progressive inhibition of translation mediated by increasing numbers of CGA codon pairs, and restoration of expression by introduction of a tRNA with an anticodon that base pairs exactly with the CGA codon. This method also reproduces effects of paromomycin and context on stop codon read-through. Five key features of this method contribute to its effectiveness as a selection for regulatory sequences: The system exhibits greater than a 250-fold dynamic range, a quantitative and dose-dependent response to known inhibitory sequences, exquisite resolution that allows nearly complete physical separation of distinct populations, and a reproducible signal between different cells transformed with the identical reporter, all of which are coupled with simple methods involving ligation-independent cloning, to create large libraries. Moreover, we provide evidence that there are sequences within a 9-nt library that cause reduced GFP fluorescence, suggesting that there are novel cis-regulatory sequences to be found even in this short sequence space. This method is widely applicable to the study of both RNA-mediated and codon-mediated effects on expression. PMID:23097427

  15. Control of extracellular polysaccharide synthesis in Erwinia stewartii and Escherichia coli K-12: a common regulatory function.

    PubMed Central

    Torres-Cabassa, A; Gottesman, S; Frederick, R D; Dolph, P J; Coplin, D L

    1987-01-01

    A primary determinant of pathogenicity in Erwinia stewartii is the production of extracellular polysaccharide (EPS). A single mutation can abolish both EPS synthesis and pathogenicity; both properties are restored by a single cosmid clone. Subcloning and insertion analysis have defined a single positive regulatory function which shares a number of similarities with the rcsA function of Escherichia coli K-12, a positive regulator for capsular polysaccharide synthesis. In E. stewartii, the gene promotes the transcription of at least two operons (cps) involved in EPS synthesis; we have previously demonstrated a similar function for rcsA in E. coli. Both genes code for proteins of 25 to 27 kilodaltons; both proteins are unstable in E. coli. The E. stewartii RcsA protein was stabilized in E. coli lon mutants, as the RcsA product from E. coli is. The E. stewartii function complemented E. coli rcsA mutants, and the E. coli RcsA function increased cps expression and restored virulence in E. stewartii mutants. Therefore, these two gram-negative organisms share a similar component of their regulatory circuitry for the control of capsular polysaccharide synthesis. Images PMID:2820930

  16. Whole Genome Shotgun Sequencing Shows Selection on Leptospira Regulatory Proteins during in vitro Culture Attenuation

    PubMed Central

    Lehmann, Jason S.; Corey, Victoria C.; Ricaldi, Jessica N.; Vinetz, Joseph M.; Winzeler, Elizabeth A.; Matthias, Michael A.

    2016-01-01

    Leptospirosis is the most common zoonotic disease worldwide with an estimated 500,000 severe cases reported annually, and case fatality rates of 12–25%, due primarily to acute kidney and lung injuries. Despite its prevalence, the molecular mechanisms underlying leptospirosis pathogenesis remain poorly understood. To identify virulence-related genes in Leptospira interrogans, we delineated cumulative genome changes that occurred during serial in vitro passage of a highly virulent strain of L. interrogans serovar Lai into a nearly avirulent isogenic derivative. Comparison of protein coding and computationally predicted noncoding RNA (ncRNA) genes between these two polyclonal strains identified 15 nonsynonymous single nucleotide variant (nsSNV) alleles that increased in frequency and 19 that decreased, whereas no changes in allelic frequency were observed among the ncRNA genes. Some of the nsSNV alleles were in six genes shown previously to be transcriptionally upregulated during exposure to in vivo-like conditions. Five of these nsSNVs were in evolutionarily conserved positions in genes related to signal transduction and metabolism. Frequency changes of minor nsSNV alleles identified in this study likely contributed to the loss of virulence during serial in vitro culture. The identification of new virulence-associated genes should spur additional experimental inquiry into their potential role in Leptospira pathogenesis. PMID:26711524

  17. Whole Genome Shotgun Sequencing Shows Selection on Leptospira Regulatory Proteins During in vitro Culture Attenuation.

    PubMed

    Lehmann, Jason S; Corey, Victoria C; Ricaldi, Jessica N; Vinetz, Joseph M; Winzeler, Elizabeth A; Matthias, Michael A

    2016-02-01

    Leptospirosis is the most common zoonotic disease worldwide with an estimated 500,000 severe cases reported annually, and case fatality rates of 12-25%, due primarily to acute kidney and lung injuries. Despite its prevalence, the molecular mechanisms underlying leptospirosis pathogenesis remain poorly understood. To identify virulence-related genes in Leptospira interrogans, we delineated cumulative genome changes that occurred during serial in vitro passage of a highly virulent strain of L. interrogans serovar Lai into a nearly avirulent isogenic derivative. Comparison of protein coding and computationally predicted noncoding RNA (ncRNA) genes between these two polyclonal strains identified 15 nonsynonymous single nucleotide variant (nsSNV) alleles that increased in frequency and 19 that decreased, whereas no changes in allelic frequency were observed among the ncRNA genes. Some of the nsSNV alleles were in six genes shown previously to be transcriptionally upregulated during exposure to in vivo-like conditions. Five of these nsSNVs were in evolutionarily conserved positions in genes related to signal transduction and metabolism. Frequency changes of minor nsSNV alleles identified in this study likely contributed to the loss of virulence during serial in vitro culture. The identification of new virulence-associated genes should spur additional experimental inquiry into their potential role in Leptospira pathogenesis. PMID:26711524

  18. Coffee and tomato share common gene repertoires as revealed by deep sequencing of seed and cherry transcripts.

    PubMed

    Lin, Chenwei; Mueller, Lukas A; Mc Carthy, James; Crouzillat, Dominique; Pétiard, Vincent; Tanksley, Steven D

    2005-12-01

    An EST database has been generated for coffee based on sequences from approximately 47,000 cDNA clones derived from five different stages/tissues, with a special focus on developing seeds. When computationally assembled, these sequences correspond to 13,175 unigenes, which were analyzed with respect to functional annotation, expression profile and evolution. Compared with Arabidopsis, the coffee unigenes encode a higher proportion of proteins related to protein modification/turnover and metabolism-an observation that may explain the high diversity of metabolites found in coffee and related species. Several gene families were found to be either expanded or unique to coffee when compared with Arabidopsis. A high proportion of these families encode proteins assigned to functions related to disease resistance. Such families may have expanded and evolved rapidly under the intense pathogen pressure experienced by a tropical, perennial species like coffee. Finally, the coffee gene repertoire was compared with that of Arabidopsis and Solanaceous species (e.g. tomato). Unlike Arabidopsis, tomato has a nearly perfect gene-for-gene match with coffee. These results are consistent with the facts that coffee and tomato have a similar genome size, chromosome karyotype (tomato, n=12; coffee n=11) and chromosome architecture. Moreover, both belong to the Asterid I clade of dicot plant families. Thus, the biology of coffee (family Rubiacaeae) and tomato (family Solanaceae) may be united into one common network of shared discoveries, resources and information. PMID:16273343

  19. Coffee and tomato share common gene repertoires as revealed by deep sequencing of seed and cherry transcripts

    PubMed Central

    Lin, Chenwei; Mueller, Lukas A.; Carthy, James Mc; Crouzillat, Dominique; Pétiard, Vincent

    2005-01-01

    An EST database has been generated for coffee based on sequences from approximately 47,000 cDNA clones derived from five different stages/tissues, with a special focus on developing seeds. When computationally assembled, these sequences correspond to 13,175 unigenes, which were analyzed with respect to functional annotation, expression profile and evolution. Compared with Arabidopsis, the coffee unigenes encode a higher proportion of proteins related to protein modification/turnover and metabolism—an observation that may explain the high diversity of metabolites found in coffee and related species. Several gene families were found to be either expanded or unique to coffee when compared with Arabidopsis. A high proportion of these families encode proteins assigned to functions related to disease resistance. Such families may have expanded and evolved rapidly under the intense pathogen pressure experienced by a tropical, perennial species like coffee. Finally, the coffee gene repertoire was compared with that of Arabidopsis and Solanaceous species (e.g. tomato). Unlike Arabidopsis, tomato has a nearly perfect gene-for-gene match with coffee. These results are consistent with the facts that coffee and tomato have a similar genome size, chromosome karyotype (tomato, n=12; coffee n=11) and chromosome architecture. Moreover, both belong to the Asterid I clade of dicot plant families. Thus, the biology of coffee (family Rubiacaeae) and tomato (family Solanaceae) may be united into one common network of shared discoveries, resources and information. PMID:16273343

  20. A cis-regulatory sequence from a short intergenic region gives rise to a strong microbe-associated molecular pattern-responsive synthetic promoter.

    PubMed

    Lehmeyer, Mona; Hanko, Erik K R; Roling, Lena; Gonzalez, Lilian; Wehrs, Maren; Hehl, Reinhard

    2016-06-01

    The high gene density in Arabidopsis thaliana leaves only relatively short intergenic regions for potential cis-regulatory sequences. To learn more about the regulation of genes harbouring only very short upstream intergenic regions, this study investigates a recently identified novel microbe-associated molecular pattern (MAMP)-responsive cis-sequence located within the 101 bp long intergenic region upstream of the At1g13990 gene. It is shown that the cis-regulatory sequence is sufficient for MAMP-responsive reporter gene activity in the context of its native promoter. The 3' UTR of the upstream gene has a quantitative effect on gene expression. In context of a synthetic promoter, the cis-sequence is shown to achieve a strong increase in reporter gene activity as a monomer, dimer and tetramer. Mutation analysis of the cis-sequence determined the specific nucleotides required for gene expression activation. In transgenic A. thaliana the synthetic promoter harbouring a tetramer of the cis-sequence not only drives strong pathogen-responsive reporter gene expression but also shows a high background activity. The results of this study contribute to our understanding how genes with very short upstream intergenic regions are regulated and how these regions can serve as a source for MAMP-responsive cis-sequences for synthetic promoter design. PMID:26833485

  1. The complete sequence of the human CD79b (Ig{beta}/B29) gene: Identification of a conserved exon/intron organization, immunoglobulin-like regulatory regions, and allelic polymorphism

    SciTech Connect

    Hashimoto, S.; Chiorazzi, N.; Gregersen, P.K. |

    1994-12-31

    We determined the complete genomic sequence of the human CD79b (Ig{beta}/B29) gene. The CD79b gene product is associated with the membrane immunoglobulin signaling complex which is composed of immunoglobulin (Ig) itself, associated in a noncovalent fashion with CD79b and a second polypeptide chain, CD79a (Ig{alpha}/mb1). The sequence and exon/intron organization of the human and mouse CD79b genes are highly similar. The gene organization suggests that some variant forms of CD79b may arise by virtue of alternative splicing of mRNA. In addition, a number of conserved regulatory sequences commonly found in Ig genes are present in sequences which flank the human CD79b gene. Some of these sequences are distinct from those found in the CD79a promoter. These differences may explain why transcription of CD79b, but not CD79a, is observed in plasma cells. A new Taq 1 restriction fragment length polymorphism is described that is not associated with any structural polymorphisms of the expressed CD79b polypeptide. 13 refs., 3 figs., 1 tab.

  2. Immunity related genes in dipterans share common enrichment of AT-rich motifs in their 5' regulatory regions that are potentially involved in nucleosome formation

    PubMed Central

    Hernandez-Romano, Jesus; Carlos-Rivera, Francisco J; Salgado, Heladia; Lamadrid-Figueroa, Hector; Valverde-Garduño, Veronica; Rodriguez, Mario H; Martinez-Barnetche, Jesus

    2008-01-01

    Background Understanding the transcriptional regulation mechanisms in response to environmental challenges is of fundamental importance in biology. Transcription factors associated to response elements and the chromatin structure had proven to play important roles in gene expression regulation. We have analyzed promoter regions of dipteran genes induced in response to immune challenge, in search for particular sequence patterns involved in their transcriptional regulation. Results 5' upstream regions of D. melanogaster and A. gambiae immunity-induced genes and their corresponding orthologous genes in 11 non-melanogaster drosophilid species and Ae. aegypti share enrichment in AT-rich short motifs. AT-rich motifs are associated with nucleosome formation as predicted by two different algorithms. In A. gambiae and D. melanogaster, many immunity genes 5' upstream sequences also showed NFκB response elements, located within 500 bp from the transcription start site. In A. gambiae, the frequency of ATAA motif near the NFκB response elements was increased, suggesting a functional link between nucleosome formation/remodelling and NFκB regulation of transcription. Conclusion AT-rich motif enrichment in 5' upstream sequences in A. gambiae, Ae. aegypti and the Drosophila genus immunity genes suggests a particular pattern of nucleosome formation/chromatin organization. The co-occurrence of such motifs with the NFκB response elements suggests that these sequence signatures may be functionally involved in transcriptional activation during dipteran immune response. AT-rich motif enrichment in regulatory regions in this group of co-regulated genes could represent an evolutionary constrained signature in dipterans and perhaps other distantly species. PMID:18613977

  3. Large-Scale Evaluation of Common Variation in Regulatory T Cell-Related Genes and Ovarian Cancer Outcome

    PubMed Central

    Charbonneau, Bridget; Moysich, Kirsten B.; Kalli, Kimberly R.; Oberg, Ann L.; Vierkant, Robert A.; Fogarty, Zachary C.; Block, Matthew S.; Maurer, Matthew J.; Goergen, Krista M.; Fridley, Brooke L.; Cunningham, Julie M.; Rider, David N.; Preston, Claudia; Hartmann, Lynn C.; Lawrenson, Kate; Wang, Chen; Tyrer, Jonathan; Song, Honglin; deFazio, Anna; Johnatty, Sharon E.; Doherty, Jennifer A.; Phelan, Catherine M.; Sellers, Thomas A.; Ramirez, Starr M.; Vitonis, Allison F.; Terry, Kathryn L.; Van Den Berg, David; Pike, Malcolm C.; Wu, Anna H.; Berchuck, Andrew; Gentry-Maharaj, Aleksandra; Ramus, Susan J.; Diergaarde, Brenda; Shen, Howard; Jensen, Allan; Menkiszak, Janusz; Cybulski, Cezary; Lubiński, Jan; Ziogas, Argyrios; Rothstein, Joseph H.; McGuire, Valerie; Sieh, Weiva; Lester, Jenny; Walsh, Christine; Vergote, Ignace; Lambrechts, Sandrina; Despierre, Evelyn; Garcia-Closas, Montserrat; Yang, Hannah; Brinton, Louise A.; Spiewankiewicz, Beata; Rzepecka, Iwona K.; Dansonka-Mieszkowska, Agnieszka; Seibold, Petra; Rudolph, Anja; Paddock, Lisa E.; Orlow, Irene; Lundvall, Lene; Olson, Sara H.; Hogdall, Claus K.; Schwaab, Ira; du Bois, Andreas; Harter, Philipp; Flanagan, James M.; Brown, Robert; Paul, James; Ekici, Arif B.; Beckmann, Matthias W.; Hein, Alexander; Eccles, Diana; Lurie, Galina; Hays, Laura E.; Bean, Yukie T.; Pejovic, Tanja; Goodman, Marc T.; Campbell, Ian; Fasching, Peter A.; Konecny, Gottfried; Kaye, Stanley B.; Heitz, Florian; Hogdall, Estrid; Bandera, Elisa V.; Chang-Claude, Jenny; Kupryjanczyk, Jolanta; Wentzensen, Nicolas; Lambrechts, Diether; Karlan, Beth Y.; Whittemore, Alice S.; Culver, Hoda Anton; Gronwald, Jacek; Levine, Douglas A.; Kjaer, Susanne K.; Menon, Usha; Schildkraut, Joellen M.; Pearce, Celeste Leigh; Cramer, Daniel W.; Rossing, Mary Anne; Chenevix-Trench, Georgia; Pharoah, Paul D.P.; Gayther, Simon A.; Ness, Roberta B.; Odunsi, Kunle; Sucheston, Lara E.; Knutson, Keith L.; Goode, Ellen L.

    2014-01-01

    The presence of regulatory T cells (Tregs) in solid tumors is known to play a role in patient survival in ovarian cancer and other malignancies. We assessed inherited genetic variations via 749 tag SNPs in 25 Treg-associated genes (CD28, CTLA4, FOXP3, IDO1, IL10, IL10RA, IL15, 1L17RA, IL23A, IL23R, IL2RA, IL6, IL6R, IL8, LGALS1, LGALS9, MAP3K8, STAT5A, STAT5B, TGFB1, TGFB2, TGFB3, TGFBR1, TGRBR2, and TGFBR3) in relation to ovarian cancer survival. We analyzed genotype and overall survival in 10,084 women with invasive epithelial ovarian cancer, including 5,248 high-grade serous, 1,452 endometrioid, 795 clear cell, and 661 mucinous carcinoma cases of European descent across 28 studies from the Ovarian Cancer Association Consortium (OCAC). The strongest associations were found for endometrioid carcinoma and IL2RA SNPs rs11256497 [HR=1.42, 95% CI: 1.22–1.64; p=5.7 × 10−6], rs791587 [HR=1.36, 95% CI:1.17–1.57; p=6.2 × 10−5], rs2476491 [HR=1.40, 95% CI: 1.191.64; p=5.6 × 10−5], and rs10795763 [HR=1.35, 95% CI: 1.17–1.57; p=7.9 × 10−5], and for clear cell carcinoma and CTLA4 SNP rs231775 [HR=0.67, 95% CI: 0.54–0.82; p=9.3 × 10−5] after adjustment for age, study site, population stratification, stage, grade, and oral contraceptive use. The rs231775 allele associated with improved survival in our study also results in an amino acid change in CTLA4 and previously has been reported to be associated with autoimmune conditions. Thus, we found evidence that SNPs in genes related to Tregs appear to play a role in ovarian cancer survival, particularly in patients with clear cell and endometrioid EOC. PMID:24764580

  4. Sequencing the GRHL3 Coding Region Reveals Rare Truncating Mutations and a Common Susceptibility Variant for Nonsyndromic Cleft Palate.

    PubMed

    Mangold, Elisabeth; Böhmer, Anne C; Ishorst, Nina; Hoebel, Ann-Kathrin; Gültepe, Pinar; Schuenke, Hannah; Klamt, Johanna; Hofmann, Andrea; Gölz, Lina; Raff, Ruth; Tessmann, Peter; Nowak, Stefanie; Reutter, Heiko; Hemprich, Alexander; Kreusch, Thomas; Kramer, Franz-Josef; Braumann, Bert; Reich, Rudolf; Schmidt, Gül; Jäger, Andreas; Reiter, Rudolf; Brosch, Sibylle; Stavusis, Janis; Ishida, Miho; Seselgyte, Rimante; Moore, Gudrun E; Nöthen, Markus M; Borck, Guntram; Aldhorae, Khalid A; Lace, Baiba; Stanier, Philip; Knapp, Michael; Ludwig, Kerstin U

    2016-04-01

    Nonsyndromic cleft lip with/without cleft palate (nsCL/P) and nonsyndromic cleft palate only (nsCPO) are the most frequent subphenotypes of orofacial clefts. A common syndromic form of orofacial clefting is Van der Woude syndrome (VWS) where individuals have CL/P or CPO, often but not always associated with lower lip pits. Recently, ∼5% of VWS-affected individuals were identified with mutations in the grainy head-like 3 gene (GRHL3). To investigate GRHL3 in nonsyndromic clefting, we sequenced its coding region in 576 Europeans with nsCL/P and 96 with nsCPO. Most strikingly, nsCPO-affected individuals had a higher minor allele frequency for rs41268753 (0.099) than control subjects (0.049; p = 1.24 × 10(-2)). This association was replicated in nsCPO/control cohorts from Latvia, Yemen, and the UK (pcombined = 2.63 × 10(-5); ORallelic = 2.46 [95% CI 1.6-3.7]) and reached genome-wide significance in combination with imputed data from a GWAS in nsCPO triads (p = 2.73 × 10(-9)). Notably, rs41268753 is not associated with nsCL/P (p = 0.45). rs41268753 encodes the highly conserved p.Thr454Met (c.1361C>T) (GERP = 5.3), which prediction programs denote as deleterious, has a CADD score of 29.6, and increases protein binding capacity in silico. Sequencing also revealed four novel truncating GRHL3 mutations including two that were de novo in four families, where all nine individuals harboring mutations had nsCPO. This is important for genetic counseling: given that VWS is rare compared to nsCPO, our data suggest that dominant GRHL3 mutations are more likely to cause nonsyndromic than syndromic CPO. Thus, with rare dominant mutations and a common risk variant in the coding region, we have identified an important contribution for GRHL3 in nsCPO. PMID:27018475

  5. Regulatory Mechanisms That Prevent Re-initiation of DNA Replication Can Be Locally Modulated at Origins by Nearby Sequence Elements

    PubMed Central

    Richardson, Christopher D.; Li, Joachim J.

    2014-01-01

    Eukaryotic cells must inhibit re-initiation of DNA replication at each of the thousands of origins in their genome because re-initiation can generate genomic alterations with extraordinary frequency. To minimize the probability of re-initiation from so many origins, cells use a battery of regulatory mechanisms that reduce the activity of replication initiation proteins. Given the global nature of these mechanisms, it has been presumed that all origins are inhibited identically. However, origins re-initiate with diverse efficiencies when these mechanisms are disabled, and this diversity cannot be explained by differences in the efficiency or timing of origin initiation during normal S phase replication. This observation raises the possibility of an additional layer of replication control that can differentially regulate re-initiation at distinct origins. We have identified novel genetic elements that are necessary for preferential re-initiation of two origins and sufficient to confer preferential re-initiation on heterologous origins when the control of re-initiation is partially deregulated. The elements do not enhance the S phase timing or efficiency of adjacent origins and thus are specifically acting as re-initiation promoters (RIPs). We have mapped the two RIPs to ∼60 bp AT rich sequences that act in a distance- and sequence-dependent manner. During the induction of re-replication, Mcm2-7 reassociates both with origins that preferentially re-initiate and origins that do not, suggesting that the RIP elements can overcome a block to re-initiation imposed after Mcm2-7 associates with origins. Our findings identify a local level of control in the block to re-initiation. This local control creates a complex genomic landscape of re-replication potential that is revealed when global mechanisms preventing re-replication are compromised. Hence, if re-replication does contribute to genomic alterations, as has been speculated for cancer cells, some regions of the genome

  6. Transcriptional activation of the fra-1 gene by AP-1 is mediated by regulatory sequences in the first intron.

    PubMed Central

    Bergers, G; Graninger, P; Braselmann, S; Wrighton, C; Busslinger, M

    1995-01-01

    Constitutive expression of c-Fos, FosB, Fra-1, or c-Jun in rat fibroblasts leads to up-regulation of the immediate-early gene fra-1. Using the posttranslational FosER induction system, we demonstrate that this AP-1-dependent stimulation of fra-1 expression is rapid, depends on a functional DNA-binding domain of FosER, and is a general phenomenon observed in different cell types. In vitro mutagenesis and functional analysis of the rat fra-1 gene in stably transfected Rat-1A-FosER fibroblasts indicated that basal and AP-1-regulated expression of the fra-1 gene depends on regulatory sequences in the first intron which comprise a consensus AP-1 site and two AP-1-like elements. We have also investigated the transactivating and transforming properties of the Fra-1 protein to address the significance of fra-1 up-regulation. The entire Fra-1 protein fused to the DNA-binding domain of Ga14 is shown to lack any transactivation function, and yet it possesses oncogenic potential, as overexpression of Fra-1 in established rat fibroblasts results in anchorage-independent growth in vitro and tumor development in athymic mice, fra-1 is therefore not only induced by members of the Fos family, but its gene product may also contribute to cellular transformation by these proteins. Together, these data identify fra-1 as a unique member of the fos gene family which is under positive control by AP-1 activity. PMID:7791782

  7. A screening method to identify genes commonly overexpressed in carcinomas and the identification of a novel complementary DNA sequence.

    PubMed

    Byrne, J A; Tomasetto, C; Garnier, J M; Rouyer, N; Mattei, M G; Bellocq, J P; Rio, M C; Basset, P

    1995-07-01

    We describe a differential screening method for cDNA libraries which used a combination of subtracted and PCR-amplified cDNA probes, and which can be applied to the selection of genes expressed in multiple tissues. This technique was used to identify genes commonly overexpressed in breast and basal cell carcinomas. These represent stromally dependent, invasive tumors with and without metastatic capacity. Thus, this screening sought to identify genes involved in the early stages of tumor progression. We identified a total of 16 genes, including c-erbB-2 and tissue inhibitor of metalloproteinases 3 whose products have been implicated in tumorigenesis or invasion. We also identified a novel sequence (D52) showing little homology with others described in any species, which maps to the human chromosomal band 8q21. In situ RNA hybridizations of breast carcinoma sections indicated that the D52 gene was expressed in cancer cells, whereas other genes identified in the differential screening were expressed in fibroblastic or inflammatory cells within the tumor stroma. Thus, the procedure developed in this study selected genes expressed in a diversity of cell types, indicating its potential usefulness in other systems. PMID:7796418

  8. Single-cell sequencing analysis characterizes common and cell-lineage-specific mutations in a muscle-invasive bladder cancer

    PubMed Central

    2012-01-01

    Background Cancers arise through an evolutionary process in which cell populations are subjected to selection; however, to date, the process of bladder cancer, which is one of the most common cancers in the world, remains unknown at a single-cell level. Results We carried out single-cell exome sequencing of 66 individual tumor cells from a muscle-invasive bladder transitional cell carcinoma (TCC). Analyses of the somatic mutant allele frequency spectrum and clonal structure revealed that the tumor cells were derived from a single ancestral cell, but that subsequent evolution occurred, leading to two distinct tumor cell subpopulations. By analyzing recurrently mutant genes in an additional cohort of 99 TCC tumors, we identified genes that might play roles in the maintenance of the ancestral clone and in the muscle-invasive capability of subclones of this bladder cancer, respectively. Conclusions This work provides a new approach of investigating the genetic details of bladder tumoral changes at the single-cell level and a new method for assessing bladder cancer evolution at a cell-population level. PMID:23587365

  9. Relation between mRNA expression and sequence information in Desulfovibrio vulgaris: Combinatorial contributions of upstream regulatory motifs and coding sequence features to variations in mRNA abundance

    SciTech Connect

    Wu, Gang; Nie, Lei; Zhang, Weiwen

    2006-05-26

    ABSTRACT-The context-dependent expression of genes is the core for biological activities, and significant attention has been given to identification of various factors contributing to gene expression at genomic scale. However, so far this type of analysis has been focused whether on relation between mRNA expression and non-coding sequence features such as upstream regulatory motifs or on correlation between mRN abundance and non-random features in coding sequences (e.g. codon usage and amino acid usage). In this study multiple regression analyses of the mRNA abundance and all sequence information in Desulfovibrio vulgaris were performed, with the goal to investigate how much coding and non-coding sequence features contribute to the variations in mRNA expression, and in what manner they act together...

  10. Analyses of Methylomes Derived from Meso-American Common Bean (Phaseolus vulgaris L.) Using MeDIP-Seq and Whole Genome Sodium Bisulfite-Sequencing

    PubMed Central

    Crampton, Mollee; Sripathi, Venkateswara R.; Hossain, Khwaja; Kalavacharla, Venu

    2016-01-01

    Common bean (Phaseolus vulgaris L.) is economically important for its high protein, fiber, and micronutrient contents, with a relatively small genome size of ∼587 Mb. Common bean is genetically diverse with two major gene pools, Meso-American and Andean. The phenotypic variability within common bean is partly attributed to the genetic diversity and epigenetic changes that are largely influenced by environmental factors. It is well established that an important epigenetic regulator of gene expression is DNA methylation. Here, we present results generated from two high-throughput sequencing technologies, methylated DNA immunoprecipitation-sequencing (MeDIP-seq) and whole genome bisulfite-sequencing (BS-Seq). Our analyses revealed that this Meso-American common bean displays similar methylation patterns as other previously published plant methylomes, with CG ∼50%, CHG ∼30%, and CHH ∼2.7% methylation, however, these differ from the common bean reference methylome of Andean origin. We identified higher CG methylation levels in both promoter and genic regions than CHG and CHH contexts. Moreover, we found relatively higher CG methylation levels in genes than in promoters. Conversely, the CHG and CHH methylation levels were highest in promoters than in genes. This is the first genome-wide DNA methylation profiling study in a Meso-American common bean cultivar (“Sierra”) using NGS approaches. Our long-term goal is to generate genome-wide epigenomic maps in common bean focusing on chromatin accessibility, histone modifications, and DNA methylation. PMID:27199997

  11. Analyses of Methylomes Derived from Meso-American Common Bean (Phaseolus vulgaris L.) Using MeDIP-Seq and Whole Genome Sodium Bisulfite-Sequencing.

    PubMed

    Crampton, Mollee; Sripathi, Venkateswara R; Hossain, Khwaja; Kalavacharla, Venu

    2016-01-01

    Common bean (Phaseolus vulgaris L.) is economically important for its high protein, fiber, and micronutrient contents, with a relatively small genome size of ∼587 Mb. Common bean is genetically diverse with two major gene pools, Meso-American and Andean. The phenotypic variability within common bean is partly attributed to the genetic diversity and epigenetic changes that are largely influenced by environmental factors. It is well established that an important epigenetic regulator of gene expression is DNA methylation. Here, we present results generated from two high-throughput sequencing technologies, methylated DNA immunoprecipitation-sequencing (MeDIP-seq) and whole genome bisulfite-sequencing (BS-Seq). Our analyses revealed that this Meso-American common bean displays similar methylation patterns as other previously published plant methylomes, with CG ∼50%, CHG ∼30%, and CHH ∼2.7% methylation, however, these differ from the common bean reference methylome of Andean origin. We identified higher CG methylation levels in both promoter and genic regions than CHG and CHH contexts. Moreover, we found relatively higher CG methylation levels in genes than in promoters. Conversely, the CHG and CHH methylation levels were highest in promoters than in genes. This is the first genome-wide DNA methylation profiling study in a Meso-American common bean cultivar ("Sierra") using NGS approaches. Our long-term goal is to generate genome-wide epigenomic maps in common bean focusing on chromatin accessibility, histone modifications, and DNA methylation. PMID:27199997

  12. Sequence variants in SLC16A11 are a common risk factor for type 2 diabetes in Mexico

    PubMed Central

    2014-01-01

    Performing genetic studies in multiple human populations can identify disease risk alleles that are common in one population but rare in others1, with the potential to illuminate pathophysiology, health disparities, and the population genetic origins of disease alleles. We analyzed 9.2 million single nucleotide polymorphisms (SNPs) in each of 8,214 Mexicans and Latin Americans: 3,848 with type 2 diabetes (T2D) and 4,366 non-diabetic controls. In addition to replicating previous findings2–4, we identified a novel locus associated with T2D at genome-wide significance spanning the solute carriers SLC16A11 and SLC16A13 (P=3.9×10−13; odds ratio (OR)=1.29). The association was stronger in younger, leaner people with T2D, and replicated in independent samples (P=1.1×10−4; OR=1.20). The risk haplotype carries four amino acid substitutions, all in SLC16A11; it is present at ≈50% frequency in Native American samples and ≈10% in East Asian, but rare in European and African samples. Analysis of an archaic genome sequence indicated the risk haplotype introgressed into modern humans via admixture with Neandertals. The SLC16A11 mRNA is expressed in liver, and V5-tagged SLC16A11 protein localizes to the endoplasmic reticulum. Expression of SLC16A11 in heterologous cells alters lipid metabolism, most notably causing an increase in intracellular triacylglycerol levels. Despite T2D having been well studied by genome-wide association studies (GWAS) in other populations, analysis in Mexican and Latin American individuals identified SLC16A11 as a novel candidate gene for T2D with a possible role in triacylglycerol metabolism. PMID:24390345

  13. Common nucleotide sequence of structural gene encoding fibroblast growth factor 4 in eight cattle derived from three breeds.

    PubMed

    Sato, Sho; Takahashi, Toshikiyo; Nishinomiya, Hiroshi; Katoh, Makiko; Itoh, Ryu; Yokoo, Masaki; Yokoo, Mari; Iha, Momoe; Mori, Yuki; Kasuga, Kano; Kojima, Ikuo; Kobayashi, Masayuki

    2012-03-01

    Fibroblast growth factor 4 (FGF4) is considered as a crucial gene for the proper development of bovine embryos. However, the complete nucleotide sequences of the structural genes encoding FGF4 in identified breeds are still unknown. In the present study, direct sequencing of PCR products derived from genomic DNA samples obtained from three Japanese Black, two Japanese Shorthorn and three Holstein cattle, revealed that the nucleotide sequences of the structural gene encoding FGF4 matched completely among these eight cattle. On the other hand, differences in the nucleotide sequences, leading to substitutions, insertions or deletions of amino acid residues were detected when compared with the already reported sequence from unidentified breeds. We cannot rule out a possibility that the structural gene elucidated in the present study is widely distributed in cattle. To the best of our knowledge, this is the first determination of the complete nucleotide sequence of the structural gene encoding bovine FGF4 in identified breeds. PMID:22435631

  14. [Cloning and function identification of gene 'admA' and up-stream regulatory sequence related to antagonistic activity of Enterobacter cloacae B8].

    PubMed

    Zhu, Jun-Li; Li, De-Bao; Yu, Xu-Ping

    2012-04-01

    To reveal the antagonistic mechanism of B8 strain to Xanthomonas oryzae pv. oryzae, transposon tagging method and chromosome walking were deployed to clone antagonistic related fragments around Tn5 insertion site in the mutant strain B8B. The function of up-stream regulatory sequence of gene 'admA' involved in the antagonistic activity was further identified by gene knocking out technique. An antagonistic related left fragment of Tn5 insertion site, 2 608 bp in length, was obtained by tagging with Kan resistance gene of Tn5. A 2 354 bp right fragment of Tn5 insertion site was amplified with 2 rounds of chromosome walking. The length of the B contig around the Tn5 insertion site was 4 611 bp, containing 7 open reading frames (ORFs). Bioinformatic analysis revealed that these ORFs corresponded to the partial coding regions of glyceraldehyde-3-phosphate dehydrogenase, two LysR family transcriptional regulators, hypothetical protein VSWAT3-20465 of Vibrionales and admA, admB, and partial sequence of admC gene of Pantoea agglomerans biosynthetic gene cluster, respectively. Tn5 was inserted in the up-stream of 200 bp or 894 bp of the sequence corresponding to anrP ORF or admA gene on B8B, respectively. The B-1 and B-2 mutants that lost antagonistic activity were selected by homeologuous recombination technology in association with knocking out plasmid pMB-BG. These results suggested that the transcription and expression of anrP gene might be disrupted as a result of the knocking out of up-stream regulatory sequence by Tn5 in B8B strain, further causing biosythesis regulation of the antagonistic related gene cluster. Thus, the antagonistic related genes in B8 strain is a gene family similar as andrimid biosynthetic gene cluster, and the upstream regulatory region appears to be critical for the antibiotics biosynthesis. PMID:22522167

  15. On universal common ancestry, sequence similarity, and phylogenetic structure: the sins of P-values and the virtues of Bayesian evidence

    PubMed Central

    2011-01-01

    Background The universal common ancestry (UCA) of all known life is a fundamental component of modern evolutionary theory, supported by a wide range of qualitative molecular evidence. Nevertheless, recently both the status and nature of UCA has been questioned. In earlier work I presented a formal, quantitative test of UCA in which model selection criteria overwhelmingly choose common ancestry over independent ancestry, based on a dataset of universally conserved proteins. These model-based tests are founded in likelihoodist and Bayesian probability theory, in opposition to classical frequentist null hypothesis tests such as Karlin-Altschul E-values for sequence similarity. In a recent comment, Koonin and Wolf (K&W) claim that the model preference for UCA is "a trivial consequence of significant sequence similarity". They support this claim with a computational simulation, derived from universally conserved proteins, which produces similar sequences lacking phylogenetic structure. The model selection tests prefer common ancestry for this artificial data set. Results For the real universal protein sequences, hierarchical phylogenetic structure (induced by genealogical history) is the overriding reason for why the tests choose UCA; sequence similarity is a relatively minor factor. First, for cases of conflicting phylogenetic structure, the tests choose independent ancestry even with highly similar sequences. Second, certain models, like star trees and K&W's profile model (corresponding to their simulation), readily explain sequence similarity yet lack phylogenetic structure. However, these are extremely poor models for the real proteins, even worse than independent ancestry models, though they explain K&W's artificial data well. Finally, K&W's simulation is an implementation of a well-known phylogenetic model, and it produces sequences that mimic homologous proteins. Therefore the model selection tests work appropriately with the artificial data. Conclusions For K

  16. A re-sequencing based assessment of genomic heterogeneity and fast neutron-induced deletions in a common bean cultivar

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A small fast neutron mutant population has been established from Phaseolus vulgaris cv. Red Hawk. We leveraged the available P. vulgaris genome sequence and high throughput next generation DNA sequencing to examine the genomic structure of five Phaseolus vulgaris cv. Red Hawk fast neutron mutants wi...

  17. The upstream regulatory sequence of the light harvesting complex Lhcf2 gene of the marine diatom Phaeodactylum tricornutum enhances transcription in an orientation- and distance-independent fashion.

    PubMed

    Russo, Monia Teresa; Annunziata, Rossella; Sanges, Remo; Ferrante, Maria Immacolata; Falciatore, Angela

    2015-12-01

    Diatoms are a key phytoplankton group in the contemporary ocean, showing extraordinary adaptation capacities to rapidly changing environments. The recent availability of whole genome sequences from representative species has revealed distinct features in their genomes, like novel combinations of genes encoding distinct metabolisms and a significant number of diatom-specific genes. However, the regulatory mechanisms driving diatom gene expression are still largely uncharacterized. Considering the wide variety of fields of study orbiting diatoms, ranging from ecology, evolutionary biology to biotechnology, it is thus essential to increase our understanding of fundamental gene regulatory processes such as transcriptional regulation. To this aim, we explored the functional properties of the 5'-flanking region of the Phaeodatylum tricornutum Lhcf2 gene, encoding a member of the Light Harvesting Complex superfamily and we showed that this region enhances transcription of a GUS reporter gene in an orientation- and distance-independent fashion. This represents the first example of a cis-regulatory sequence with enhancer-like features discovered in diatoms and it is instrumental for the generation of novel genetic tools and diatom exploitation in different areas of study. PMID:26117181

  18. Application of high throughout sequencing to measure performance of commonly used selective cultivation methods for the food-borne pathogen campylobacter

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Campylobacter is an important food-borne human pathogen which has traditionally been studied using a variety of selective cultivation methods. Here we use next-generation sequencing to ask: 1) How selective are commonly-used Campylobacter cultivation methods relative to the initial community being ...

  19. Common variable immunodeficiency, impaired neurological development and reduced numbers of T regulatory cells in a 10-year-old boy with a STAT1 gain-of-function mutation.

    PubMed

    Kobbe, Robin; Kolster, Manuela; Fuchs, Sebastian; Schulze-Sturm, Ulf; Jenderny, Jutta; Kochhan, Lothar; Staab, Julia; Tolosa, Eva; Grimbacher, Bodo; Meyer, Thomas

    2016-07-25

    Recently, gain-of-function (GOF) mutations in the gene encoding signal transducer and activator of transcription 1 (STAT1) have been associated with chronic mucocutaneous candidiasis (CMC). This case report describes a 10-year-old boy presenting with signs of common variable immunodeficiency (CVID), failure to thrive, impaired neurological development, and a history of recurrent mucocutaneous Candida infections. Sequencing of the STAT1 gene identified a heterozygous missense mutation in exon 7 encoding the STAT1 coiled-coil domain (c.514T>C, p.Phe172Leu). In addition to hypogammaglobulinemia with B-cell deficiency, and a low percentage of Th17 cells, immunological analysis of the patient revealed a marked depletion of forkhead-box P3(+)-expressing regulatory T cells (Tregs). In vitro stimulation of T cells from the patient with interferon-α (IFNα) and/or IFNɣ resulted in a significantly increased expression of STAT1-regulated target genes such as MIG1, IRF1, MX1, MCP1/CCL2, IFI-56K, and CXCL10 as compared to IFN-treated cells from a healthy control, while no IFNα/ɣ-mediated up-regulation of the FOXP3 gene was found. These data demonstrate that the STAT1 GOF mutation F172L, which results in impaired stability of the antiparallel STAT1 dimer conformation, is associated with inhibited Treg cell development and neurological symptoms. PMID:27063510

  20. GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions.

    PubMed

    Besemer, J; Lomsadze, A; Borodovsky, M

    2001-06-15

    Improving the accuracy of prediction of gene starts is one of a few remaining open problems in computer prediction of prokaryotic genes. Its difficulty is caused by the absence of relatively strong sequence patterns identifying true translation initiation sites. In the current paper we show that the accuracy of gene start prediction can be improved by combining models of protein-coding and non-coding regions and models of regulatory sites near gene start within an iterative Hidden Markov model based algorithm. The new gene prediction method, called GeneMarkS, utilizes a non-supervised training procedure and can be used for a newly sequenced prokaryotic genome with no prior knowledge of any protein or rRNA genes. The GeneMarkS implementation uses an improved version of the gene finding program GeneMark.hmm, heuristic Markov models of coding and non-coding regions and the Gibbs sampling multiple alignment program. GeneMarkS predicted precisely 83.2% of the translation starts of GenBank annotated Bacillus subtilis genes and 94.4% of translation starts in an experimentally validated set of Escherichia coli genes. We have also observed that GeneMarkS detects prokaryotic genes, in terms of identifying open reading frames containing real genes, with an accuracy matching the level of the best currently used gene detection methods. Accurate translation start prediction, in addition to the refinement of protein sequence N-terminal data, provides the benefit of precise positioning of the sequence region situated upstream to a gene start. Therefore, sequence motifs related to transcription and translation regulatory sites can be revealed and analyzed with higher precision. These motifs were shown to possess a significant variability, the functional and evolutionary connections of which are discussed. PMID:11410670

  1. Rapid Sanger Sequencing of the 16S rRNA Gene for Identification of Some Common Pathogens

    PubMed Central

    Zhou, Guangbiao; Shi, Xiaojun; Su, Jianhui; Chen, Guanwu; Lin, Kun

    2014-01-01

    Conventional Sanger sequencing remains time-consuming and laborious. In this study, we developed a rapid improved sequencing protocol of 16S rRNA for pathogens identification by using a new combination of SYBR Green I real-time PCR and Sanger sequencing with FTA® cards. To compare the sequencing quality of this method with conventional Sanger sequencing, 12 strains, including three kinds of strains (1 reference strain and 3 clinical strains, which were previously identified by biochemical tests), which have 4 Pseudomonas aeruginosa, 4 Staphyloccocus aureus and 4 Escherichia coli, were targeted. Additionally, to validate the sequencing results and bacteria identification, expanded specimens with 90 clinical strains, also comprised of the three kinds of strains which included 30 samples respectively, were performed as just described. The results showed that although statistical differences (P<0.05) were found in sequencing quality between the two methods, their identification results were all correct and consistent. The workload, the time consumption and the cost per batch were respectively light versus heavy, 8 h versus 11 h and $420 versus $400. In the 90 clinical strains, all of the Pseudomonas aeruginosa and Staphyloccocus aureus strains were correctly identified, but only 26.7% of the Escherichia coli strains were recognized as Escherichia coli, while 33.3% as Shigella sonnei and 40% as Shigella dysenteriae. The protocol described here is a rapid, reliable, stable and convenient method for 16S rRNA sequencing, and can be used for Pseudomonas aeruginosa and Staphyloccocus aureus identification, yet it is not completely suitable for discriminating Escherichia coli and Shigella strains. PMID:24551186

  2. Inhibition of protein kinase C catalytic activity by additional regions within the human protein kinase Calpha-regulatory domain lying outside of the pseudosubstrate sequence.

    PubMed

    Kirwan, Angie F; Bibby, Ashley C; Mvilongo, Thierry; Riedel, Heimo; Burke, Thomas; Millis, Sherri Z; Parissenti, Amadeo M

    2003-07-15

    The N-terminal pseudosubstrate site within the protein kinase Calpha (PKCalpha)-regulatory domain has long been regarded as the major determinant for autoinhibition of catalytic domain activity. Previously, we observed that the PKC-inhibitory capacity of the human PKCalpha-regulatory domain was only reduced partially on removal of the pseudosubstrate sequence [Parissenti, Kirwan, Kim, Colantonio and Schimmer (1998) J. Biol. Chem. 273, 8940-8945]. This finding suggested that one or more additional region(s) contributes to the inhibition of catalytic domain activity. To assess this hypothesis, we first examined the PKC-inhibitory capacity of a smaller fragment of the PKCalpha-regulatory domain consisting of the C1a, C1b and V2 regions [GST-Ralpha(39-177): this protein contained the full regulatory domain of human PKCalpha fused to glutathione S-transferase (GST), but lacked amino acids 1-38 (including the pseudosubstrate sequence) and amino acids 178-270 (including the C2 region)]. GST-Ralpha(39-177) significantly inhibited PKC in a phorbol-independent manner and could not bind the peptide substrate used in our assays. These results suggested that a region within C1/V2 directly inhibits catalytic domain activity. Providing further in vivo support for this hypothesis, we found that expression of N-terminally truncated pseudosubstrate-less bovine PKCalpha holoenzymes in yeast was capable of inhibiting cell growth in a phorbol-dependent manner. This suggested that additional autoinhibitory force(s) remained within the truncated holoenzymes that could be relieved by phorbol ester. Using tandem PCR-mediated mutagenesis, we observed that mutation of amino acids 33-86 within GST-Ralpha(39-177) dramatically reduced its PKC-inhibitory capacity when protamine was used as substrate. Mutagenesis of a broad range of sequences within C2 (amino acids 159-242) also significantly reduced PKC-inhibitory capacity. Taken together, these observations support strongly the existence of

  3. A systematic computational analysis of the rRNA-3' UTR sequence complementarity suggests a regulatory mechanism influencing post-termination events in metazoan translation.

    PubMed

    Pánek, Josef; Kolář, Michal; Herrmannová, Anna; Valášek, Leoš Shivaya

    2016-07-01

    Nucleic acid sequence complementarity underlies many fundamental biological processes. Although first noticed a long time ago, sequence complementarity between mRNAs and ribosomal RNAs still lacks a meaningful biological interpretation. Here we used statistical analysis of large-scale sequence data sets and high-throughput computing to explore complementarity between 18S and 28S rRNAs and mRNA 3' UTR sequences. By the analysis of 27,646 full-length 3' UTR sequences from 14 species covering both protozoans and metazoans, we show that the computed 18S rRNA complementarity creates an evolutionarily conserved localization pattern centered around the ribosomal mRNA entry channel, suggesting its biological relevance and functionality. Based on this specific pattern and earlier data showing that post-termination 80S ribosomes are not stably anchored at the stop codon and can migrate in both directions to codons that are cognate to the P-site deacylated tRNA, we propose that the 18S rRNA-mRNA complementarity selectively stabilizes post-termination ribosomal complexes to facilitate ribosome recycling. We thus demonstrate that the complementarity between 18S rRNA and 3' UTRs has a non-random nature and very likely carries information with a regulatory potential for translational control. PMID:27190231

  4. RegTransBase - A Database Of Regulatory Sequences and Interactionsin a Wide Range of Prokaryotic Genomes

    SciTech Connect

    Kazakov, Alexei E.; Cipriano, Michael J.; Novichkov, Pavel S.; Minovitsky, Simon; Vinogradov, Dmitry V.; Arkin, Adam; Mironov, AndreyA.; Gelfand, Mikhail S.; Dubchak, Inna

    2006-07-01

    RegTransBase, a manually curated database of regulatoryinteractions in prokaryotes, captures the knowledge in publishedscientific literature using a controlled vocabulary. Although a number ofdatabases describing interactions between regulatory proteins and theirbinding sites are currently being maintained, they focus mostly on themodel organisms Escherichia coli and Bacillus subtilis, or are entirelycomputationally derived. RegTransBase describes a large number ofregulatory interactions reported in many organisms and contains varioustypes of experimental data, in particular: the activation or repressionof transcription by an identified direct regulator; determining thetranscriptional regulatory function of a protein (or RNA) directlybinding to DNA (RNA); mapping or prediction of binding site for aregulatory protein; characterization of regulatory mutations. Currently,the RegTransBase content is derived from about 3000 relevant articlesdescribing over 7000 experiments in relation to 128 microbes. It containsdata on the regulation of about 7500 genes and evidence for 6500interactions with 650 regulators. RegTransBase also contains manuallycreated position weight matrices (PWM) that can be used to identifycandidate regulatory sites in over 60 species. RegTransBase is availableat http://regtransbase.lbl.gov.

  5. Propionibacterium acnes: Disease-Causing Agent or Common Contaminant? Detection in Diverse Patient Samples by Next-Generation Sequencing.

    PubMed

    Mollerup, Sarah; Friis-Nielsen, Jens; Vinner, Lasse; Hansen, Thomas Arn; Richter, Stine Raith; Fridholm, Helena; Herrera, Jose Alejandro Romero; Lund, Ole; Brunak, Søren; Izarzugaza, Jose M G; Mourier, Tobias; Nielsen, Lars Peter; Hansen, Anders Johannes

    2016-04-01

    Propionibacterium acnesis the most abundant bacterium on human skin, particularly in sebaceous areas.P. acnesis suggested to be an opportunistic pathogen involved in the development of diverse medical conditions but is also a proven contaminant of human clinical samples and surgical wounds. Its significance as a pathogen is consequently a matter of debate. In the present study, we investigated the presence ofP. acnesDNA in 250 next-generation sequencing data sets generated from 180 samples of 20 different sample types, mostly of cancerous origin. The samples were subjected to either microbial enrichment, involving nuclease treatment to reduce the amount of host nucleic acids, or shotgun sequencing. We detected high proportions ofP. acnesDNA in enriched samples, particularly skin tissue-derived and other tissue samples, with the levels being higher in enriched samples than in shotgun-sequenced samples.P. acnesreads were detected in most samples analyzed, though the proportions in most shotgun-sequenced samples were low. Our results show thatP. acnescan be detected in practically all sample types when molecular methods, such as next-generation sequencing, are employed. The possibility of contamination from the patient or other sources, including laboratory reagents or environment, should therefore always be considered carefully whenP. acnesis detected in clinical samples. We advocate that detection ofP. acnesalways be accompanied by experiments validating the association between this bacterium and any clinical condition. PMID:26818667

  6. Propionibacterium acnes: Disease-Causing Agent or Common Contaminant? Detection in Diverse Patient Samples by Next-Generation Sequencing

    PubMed Central

    Friis-Nielsen, Jens; Vinner, Lasse; Hansen, Thomas Arn; Richter, Stine Raith; Fridholm, Helena; Herrera, Jose Alejandro Romero; Lund, Ole; Brunak, Søren; Izarzugaza, Jose M. G.; Mourier, Tobias; Nielsen, Lars Peter

    2016-01-01

    Propionibacterium acnes is the most abundant bacterium on human skin, particularly in sebaceous areas. P. acnes is suggested to be an opportunistic pathogen involved in the development of diverse medical conditions but is also a proven contaminant of human clinical samples and surgical wounds. Its significance as a pathogen is consequently a matter of debate. In the present study, we investigated the presence of P. acnes DNA in 250 next-generation sequencing data sets generated from 180 samples of 20 different sample types, mostly of cancerous origin. The samples were subjected to either microbial enrichment, involving nuclease treatment to reduce the amount of host nucleic acids, or shotgun sequencing. We detected high proportions of P. acnes DNA in enriched samples, particularly skin tissue-derived and other tissue samples, with the levels being higher in enriched samples than in shotgun-sequenced samples. P. acnes reads were detected in most samples analyzed, though the proportions in most shotgun-sequenced samples were low. Our results show that P. acnes can be detected in practically all sample types when molecular methods, such as next-generation sequencing, are employed. The possibility of contamination from the patient or other sources, including laboratory reagents or environment, should therefore always be considered carefully when P. acnes is detected in clinical samples. We advocate that detection of P. acnes always be accompanied by experiments validating the association between this bacterium and any clinical condition. PMID:26818667

  7. Deciphering the molecular mechanisms underlying the binding of the TWIST1/E12 complex to regulatory E-box sequences.

    PubMed

    Bouard, Charlotte; Terreux, Raphael; Honorat, Mylène; Manship, Brigitte; Ansieau, Stéphane; Vigneron, Arnaud M; Puisieux, Alain; Payen, Léa

    2016-06-20

    The TWIST1 bHLH transcription factor controls embryonic development and cancer processes. Although molecular and genetic analyses have provided a wealth of data on the role of bHLH transcription factors, very little is known on the molecular mechanisms underlying their binding affinity to the E-box sequence of the promoter. Here, we used an in silico model of the TWIST1/E12 (TE) heterocomplex and performed molecular dynamics (MD) simulations of its binding to specific (TE-box) and modified E-box sequences. We focused on (i) active E-box and inactive E-box sequences, on (ii) modified active E-box sequences, as well as on (iii) two box sequences with modified adjacent bases the AT- and TA-boxes. Our in silico models were supported by functional in vitro binding assays. This exploration highlighted the predominant role of protein side-chain residues, close to the heart of the complex, at anchoring the dimer to DNA sequences, and unveiled a shift towards adjacent ((-1) and (-1*)) bases and conserved bases of modified E-box sequences. In conclusion, our study provides proof of the predictive value of these MD simulations, which may contribute to the characterization of specific inhibitors by docking approaches, and their use in pharmacological therapies by blocking the tumoral TWIST1/E12 function in cancers. PMID:27151200

  8. Deciphering the molecular mechanisms underlying the binding of the TWIST1/E12 complex to regulatory E-box sequences

    PubMed Central

    Bouard, Charlotte; Terreux, Raphael; Honorat, Mylène; Manship, Brigitte; Ansieau, Stéphane; Vigneron, Arnaud M.; Puisieux, Alain; Payen, Léa

    2016-01-01

    The TWIST1 bHLH transcription factor controls embryonic development and cancer processes. Although molecular and genetic analyses have provided a wealth of data on the role of bHLH transcription factors, very little is known on the molecular mechanisms underlying their binding affinity to the E-box sequence of the promoter. Here, we used an in silico model of the TWIST1/E12 (TE) heterocomplex and performed molecular dynamics (MD) simulations of its binding to specific (TE-box) and modified E-box sequences. We focused on (i) active E-box and inactive E-box sequences, on (ii) modified active E-box sequences, as well as on (iii) two box sequences with modified adjacent bases the AT- and TA-boxes. Our in silico models were supported by functional in vitro binding assays. This exploration highlighted the predominant role of protein side-chain residues, close to the heart of the complex, at anchoring the dimer to DNA sequences, and unveiled a shift towards adjacent ((-1) and (-1*)) bases and conserved bases of modified E-box sequences. In conclusion, our study provides proof of the predictive value of these MD simulations, which may contribute to the characterization of specific inhibitors by docking approaches, and their use in pharmacological therapies by blocking the tumoral TWIST1/E12 function in cancers. PMID:27151200

  9. In trangenic rice, alpha- and beta-tubulin regulatory sequences control GUS amount and distribution through intron mediated enhancement and intron dependent spatial expression.

    PubMed

    Gianì, Silvia; Altana, Andrea; Campanoni, Prisca; Morello, Laura; Breviario, Diego

    2009-04-01

    The genomic upstream sequence of the rice tubulin gene OsTub6 has been cloned, sequenced and characterized. The 5'UTR sequence is interrupted by a 446 bp long leader intron. This feature is shared with two other rice beta-tubulin genes (OsTub4 and OsTub1) that, together with OsTub6, group in the same clade in the evolutionary phylogenetic tree of plant beta-tubulins. Similarly to OsTub4, the leader intron of OsTub6 is capable of sustaining intron mediated enhancement (IME) of gene expression, in transient expression assays. A general picture is drawn for three rice alpha-tubulin and two rice beta-tubulin genes in which the first intron of the coding sequence for the formers and the intron present in the 5'UTR for the latters, are important elements for controlling gene expression. We used OsTua2:GUS, OsTua3:GUS, OsTub4:GUS and OsTub6:GUS chimeric constructs to investigate the in vivo pattern of beta-glucuronidase (GUS) expression in transgenic rice plants. The influence of the regulatory introns on expression patterns was evaluated for two of them, OsTua2 and OsTub4. We have thus characterized distinct patterns of expression attributable to each tubulin isotype and we have shown that the presence of the regulatory intron can greatly influence both the amount and the actual site of expression. We propose the term Intron Dependent Spatial Expression (IDSE) to highlight this latter effect. PMID:18668337

  10. Full Mitochondrial Genome Sequence of the Sugar Beet Wireworm Limonius californicus (Coleoptera: Elateridae), a Common Agricultural Pest

    PubMed Central

    New, Daniel D.; Robison, Barrie D.; Rashed, Arash; Hohenlohe, Paul; Forney, Larry; Rashidi, Mahnaz; Wilson, Cathy M.; Settles, Matthew L.

    2016-01-01

    We report here the full mitochondrial genome sequence of Limonius californicus, a species of click beetle that is an agricultural pest in its larval form. The circular genome is 16.5 kb and contains 13 protein-coding genes, 2 rRNA genes, and 22 tRNA genes. PMID:26798113

  11. Existence of microsatellites in expressed sequence tags of common carp ( Cyprinus carpio L.) available in GenBank dbEST database

    NASA Astrophysics Data System (ADS)

    Jingjie, Hu; Xiaolong, Wang; Xiaoli, Hu; Zhenmin, Bao

    2006-01-01

    Common carp expressed sequence tags (ESTs) were analyzed for the existence of microsatellites, or simple sequence repeats (SSRs). In the NCBI dbEST database, a total of 10612 sequences were registered before December 31, 2004. A complete search of 2-6 nucleotide microsatellites resulted in the identification of 513 SSR-containing ESTs, accounting for 4.8% of the total. Cluster analysis indicated that 73 sequences of SSR-containing ESTs fell into 27 groups and the remaining 440 ESTs were indenpendent. A total of 467 unique SSR-containing ESTs were identified. These EST-SSRs contained a variety of simple sequence types, and di- and tri-nucleotide repeats were the most abundant, accounting for 42.1% and 27.9% of the whole, respectively. Of the dinucleotide repeats, CA/TG was the most abundant, followed by GA/TC. BLASTx search showed that 38.1% of the SSR loci could be associated with genes or proteins of known or unknown function. BLASTx searches of SSR-containing ESTs also showed high frequencies (98/179) of hits on zebrafish sequences.

  12. Existence of microsatellites in expressed sequence tags of common carp ( Cyprinus carpio L.) available in GenBank dbEST database

    NASA Astrophysics Data System (ADS)

    Hu, Jingjie; Wang, Xiaolong; Hu, Xiaoli; Bao, Zhenmin

    2006-01-01

    Common carp expressed sequence tags (ESTs) were analyzed for the existence of microsatellites, or simple sequence repeats (SSRs). In the NCBI dbEST database, a total of 10612 sequences were registered before December 31, 2004. A complete search of 2 6 nucleotide microsatellites resulted in the identification of 513 SSR-containing ESTs, accounting for 4.8% of the total. Cluster analysis indicated that 73 sequences of SSR-containing ESTs fell into 27 groups and the remaining 440 ESTs were indenpendent. A total of 467 unique SSR-containing ESTs were identified. These EST-SSRs contained a variety of simple sequence types, and di- and tri-nucleotide repeats were the most abundant, accounting for 42.1% and 27.9% of the whole, respectively. Of the dinucleotide repeats, CA/TG was the most abundant, followed by GA/TC. BLASTx search showed that 38.1% of the SSR loci could be associated with genes or proteins of known or unknown function. BLASTx searches of SSR-containing ESTs also showed high frequencies (98/179) of hits on zebrafish sequences.

  13. De novo sequencing of root transcriptome reveals complex cadmium-responsive regulatory networks in radish (Raphanus sativus L.).

    PubMed

    Xu, Liang; Wang, Yan; Liu, Wei; Wang, Jin; Zhu, Xianwen; Zhang, Keyun; Yu, Rugang; Wang, Ronghua; Xie, Yang; Zhang, Wei; Gong, Yiqin; Liu, Liwang

    2015-07-01

    Cadmium (Cd) is a nonessential metallic trace element that poses potential chronic toxicity to living organisms. To date, little is known about the Cd-responsive regulatory network in root vegetable crops including radish. In this study, 31,015 unigenes representing 66,552 assembled unique transcripts were isolated from radish root under Cd stress based on de novo transcriptome assembly. In all, 1496 differentially expressed genes (DEGs) consisted of 3579 transcripts were identified from Cd-free (CK) and Cd-treated (Cd200) libraries. Gene Ontology and pathway enrichment analysis indicated that the up- and down-regulated DEGs were predominately involved in glucosinolate biosynthesis as well as cysteine and methionine-related pathways, respectively. RT-qPCR showed that the expression profiles of DEGs were in consistent with results from RNA-Seq analysis. Several candidate genes encoding phytochelatin synthase (PCS), metallothioneins (MTs), glutathione (GSH), zinc iron permease (ZIPs) and ABC transporter were responsible for Cd uptake, accumulation, translocation and detoxification in radish. The schematic model of DEGs and microRNAs-involved in Cd-responsive regulatory network was proposed. This study represents a first comprehensive transcriptome-based characterization of Cd-responsive DEGs in radish. These results could provide fundamental insight into complex Cd-responsive regulatory networks and facilitate further genetic manipulation of Cd accumulation in root vegetable crops. PMID:26025544

  14. Differentiated evolutionary relationships among chordates from comparative alignments of multiple sequences of MyoD and MyoG myogenic regulatory factors.

    PubMed

    Oliani, L C; Lidani, K C F; Gabriel, J E

    2015-01-01

    MyoD and MyoG are transcription factors that have essential roles in myogenic lineage determination and muscle differentiation. The purpose of this study was to compare multiple amino acid sequences of myogenic regulatory proteins to infer evolutionary relationships among chordates. Protein sequences from Mus musculus (P10085 and P12979), human Homo sapiens (P15172 and P15173), bovine Bos taurus (Q7YS82 and Q7YS81), wild pig Sus scrofa (P49811 and P49812), quail Coturnix coturnix (P21572 and P34060), chicken Gallus gallus (P16075 and P17920), rat Rattus norvegicus (Q02346 and P20428), domestic water buffalo Bubalus bubalis (D2SP11 and A7L034), and sheep Ovis aries (Q90477 and D3YKV7) were searched from a non-redundant protein sequence database UniProtKB/Swiss-Prot, and subsequently analyzed using the Mega6.0 software. MyoD evolutionary analyses revealed the presence of three main clusters with all mammals branched in one cluster, members of the order Rodentia (mouse and rat) in a second branch linked to the first, and birds of the order Galliformes (chicken and quail) remaining isolated in a third. MyoG evolutionary analyses aligned sequences in two main clusters, all mammalian specimens grouped in different sub-branches, and birds clustered in a second branch. These analyses suggest that the evolution of MyoD and MyoG was driven by different pathways. PMID:26505406

  15. Cloning and sequencing of the blood meal-induced late trypsin gene from the mosquito Aedes aegypti and characterization of the upstream regulatory region.

    PubMed

    Barillas-Mury, C; Wells, M A

    1993-01-01

    A 4.1 kb genomic clone of the late trypsin gene from the mosquito Aedes aegypti was isolated, mapped and subcloned. A 1.6 kb subclone, corresponding to 1.1 kb of upstream regulatory region and 0.5 kb of coding region, was sequenced. The gene has no introns within the coding region. The 5' end of the mature mRNA was mapped using primer extension analysis. A TATA box consensus sequence (TATAAA) was found at position -31 from the 5' end of the mature mRNA. A cluster of five repeat sequences homologous to the yeast GCN4 DNA binding site was found within 200 nucleotides upstream of the cap site. GCN4 is required for derepression mediated control of general amino acid biosynthesis in response to amino acid starvation in yeast. It activates the transcription of at least twenty different genes coding for enzymes involved in amino acid biosynthesis. The presence of this cluster of consensus sequences suggests that a protein similar to GCN4 might regulate expression of the late trypsin gene in the mosquito. Southern blot analysis of genomic DNA indicates that late trypsin is a single copy gene. PMID:9087537

  16. cDNA sequence and deduced primary structure of an alpha-amylase inhibitor from a bruchid-resistant wild common bean.

    PubMed

    Suzuki, K; Ishimoto, M; Kitamura, K

    1994-06-12

    alpha-Amylase inhibitor-2 (alpha AI-2), a seed storage protein present in a bruchid-resistant wild common bean (Phaseolus vulgaris), inhibits the growth of bruchid pests. The authors isolated and determined the sequence of an 852 nucleotide cDNA, designated as alpha ai2, and found it to contain a 720 base open reading frame (ORF). This ORF encodes a 240 amino-acid alpha AI-2 polypeptide 75.8% identical with alpha-amylase inhibitor-1 (alpha AI-1) and 50.6-55.6% with arcelin-1, phytohemagglutinin (PHA)-L and PHA-E of common bean. The high degree of sequence homology suggests that there is an evolutionary relationship among these genes. PMID:8003534

  17. Integration of small RNAs, degradome and transcriptome sequencing in hyperaccumulator Sedum alfredii uncovers a complex regulatory network and provides insights into cadmium phytoremediation.

    PubMed

    Han, Xiaojiao; Yin, Hengfu; Song, Xixi; Zhang, Yunxing; Liu, Mingying; Sang, Jiang; Jiang, Jing; Li, Jihong; Zhuo, Renying

    2016-06-01

    The hyperaccumulating ecotype of Sedum alfredii Hance is a cadmium (Cd)/zinc/lead co-hyperaccumulating species of Crassulaceae. It is a promising phytoremediation candidate accumulating substantial heavy metal ions without obvious signs of poisoning. However, few studies have focused on the regulatory roles of miRNAs and their targets in the hyperaccumulating ecotype of S. alfredii. Here, we combined analyses of the transcriptomics, sRNAs and the degradome to generate a comprehensive resource focused on identifying key regulatory miRNA-target circuits under Cd stress. A total of 87 721 unigenes and 356 miRNAs were identified by deep sequencing, and 79 miRNAs were differentially expressed under Cd stress. Furthermore, 754 target genes of 194 miRNAs were validated by degradome sequencing. A gene ontology (GO) enrichment analysis of differential miRNA targets revealed that auxin, redox-related secondary metabolism and metal transport pathways responded to Cd stress. An integrated analysis uncovered 39 pairs of miRNA targets that displayed negatively correlated expression profiles. Ten miRNA-target pairs also exhibited negative correlations according to a real-time quantitative PCR analysis. Moreover, a coexpression regulatory network was constructed based on profiles of differentially expressed genes. Two hub genes, ARF4 (auxin response factor 4) and AAP3 (amino acid permease 3), which might play central roles in the regulation of Cd-responsive genes, were uncovered. These results suggest that comprehensive analyses of the transcriptomics, sRNAs and the degradome provided a useful platform for investigating Cd hyperaccumulation in S. alfredii, and may provide new insights into the genetic engineering of phytoremediation. PMID:26801211

  18. The positive regulatory function of the 5'-proximal open reading frames in GCN4 mRNA can be mimicked by heterologous, short coding sequences.

    PubMed Central

    Williams, N P; Mueller, P P; Hinnebusch, A G

    1988-01-01

    Translational control of GCN4 expression in the yeast Saccharomyces cerevisiae is mediated by multiple AUG codons present in the leader of GCN4 mRNA, each of which initiates a short open reading frame of only two or three codons. Upstream AUG codons 3 and 4 are required to repress GCN4 expression in normal growth conditions; AUG codons 1 and 2 are needed to overcome this repression in amino acid starvation conditions. We show that the regulatory function of AUG codons 1 and 2 can be qualitatively mimicked by the AUG codons of two heterologous upstream open reading frames (URFs) containing the initiation regions of the yeast genes PGK and TRP1. These AUG codons inhibit GCN4 expression when present singly in the mRNA leader; however, they stimulate GCN4 expression in derepressing conditions when inserted upstream from AUG codons 3 and 4. This finding supports the idea that AUG codons 1 and 2 function in the control mechanism as translation initiation sites and further suggests that suppression of the inhibitory effects of AUG codons 3 and 4 is a general consequence of the translation of URF 1 and 2 sequences upstream. Several observations suggest that AUG codons 3 and 4 are efficient initiation sites; however, these sequences do not act as positive regulatory elements when placed upstream from URF 1. This result suggests that efficient translation is only one of the important properties of the 5' proximal URFs in GCN4 mRNA. We propose that a second property is the ability to permit reinitiation following termination of translation and that URF 1 is optimized for this regulatory function. Images PMID:3065626

  19. Common sequence motifs coding for higher-plant and prokaryotic O-acetylserine (thiol)-lyases: bacterial origin of a chloroplast transit peptide?

    PubMed

    Rolland, N; Job, D; Douce, R

    1993-08-01

    A comparison of the amino acid sequence of O-acetylserine (thiol)-lyase (EC 4.2.99.8) from Escherichia coli and the isoforms of this enzyme found in the cytosolic and chloroplastic compartments of spinach (Spinacia oleracea) leaf cells allows the essential lysine residue involved in the binding of the pyridoxal 5'-phosphate cofactor to be identified. The results of further sequence comparison of cDNAs coding for these proteins are discussed in the frame of the endosymbiotic theory of chloroplast evolution. The results are compatible with a mechanism in which the chloroplast enzyme originated from the cytosolic enzyme and both plant genes originated from a common prokaryotic ancestor. The comparison also suggests that the 5'-non-coding sequence of the bacterial gene was transferred to the plant cell nucleus and that it has been used to create the N-terminal portions of both plant enzymes, and possibly the transit peptide of the chloroplast enzyme. PMID:7916619

  20. Common Missense Variant in the Glucokinase Regulatory Protein Gene Is Associated With Increased Plasma Triglyceride and C-Reactive Protein but Lower Fasting Glucose Concentrations

    PubMed Central

    Orho-Melander, Marju; Melander, Olle; Guiducci, Candace; Perez-Martinez, Pablo; Corella, Dolores; Roos, Charlotta; Tewhey, Ryan; Rieder, Mark J.; Hall, Jennifer; Abecasis, Goncalo; Tai, E. Shyong; Welch, Cullan; Arnett, Donna K.; Lyssenko, Valeriya; Lindholm, Eero; Saxena, Richa; de Bakker, Paul I.W.; Burtt, Noel; Voight, Benjamin F.; Hirschhorn, Joel N.; Tucker, Katherine L.; Hedner, Thomas; Tuomi, Tiinamaija; Isomaa, Bo; Eriksson, Karl-Fredrik; Taskinen, Marja-Riitta; Wahlstrand, Björn; Hughes, Thomas E.; Parnell, Laurence D.; Lai, Chao-Qiang; Berglund, Göran; Peltonen, Leena; Vartiainen, Erkki; Jousilahti, Pekka; Havulinna, Aki S.; Salomaa, Veikko; Nilsson, Peter; Groop, Leif; Altshuler, David; Ordovas, Jose M.; Kathiresan, Sekar

    2008-01-01

    OBJECTIVE—Using the genome-wide association approach, we recently identified the glucokinase regulatory protein gene (GCKR, rs780094) region as a novel quantitative trait locus for plasma triglyceride concentration in Europeans. Here, we sought to study the association of GCKR variants with metabolic phenotypes, including measures of glucose homeostasis, to evaluate the GCKR locus in samples of non-European ancestry and to fine- map across the associated genomic interval. RESEARCH DESIGN AND METHODS—We performed association studies in 12 independent cohorts comprising >45,000 individuals representing several ancestral groups (whites from Northern and Southern Europe, whites from the U.S., African Americans from the U.S., Hispanics of Caribbean origin, and Chinese, Malays, and Asian Indians from Singapore). We conducted genetic fine-mapping across the ∼417-kb region of linkage disequilibrium spanning GCKR and 16 other genes on chromosome 2p23 by imputing untyped HapMap single nucleotide polymorphisms (SNPs) and genotyping 104 SNPs across the associated genomic interval. RESULTS—We provide comprehensive evidence that GCKR rs780094 is associated with opposite effects on fasting plasma triglyceride (Pmeta = 3 × 10−56) and glucose (Pmeta = 1 × 10−13) concentrations. In addition, we confirmed recent reports that the same SNP is associated with C-reactive protein (CRP) level (P = 5 × 10−5). Both fine-mapping approaches revealed a common missense GCKR variant (rs1260326, Pro446Leu, 34% frequency, r2 = 0.93 with rs780094) as the strongest association signal in the region. CONCLUSIONS—These findings point to a molecular mechanism in humans by which higher triglycerides and CRP can be coupled with lower plasma glucose concentrations and position GCKR in central pathways regulating both hepatic triglyceride and glucose metabolism. PMID:18678614

  1. Uncommon HLA alleles identified by hemizygous ultra-high Sanger sequencing: haplotype associations and reconsideration of their assignment in the Common and Well-Documented catalogue.

    PubMed

    Voorter, Christina E M; Groeneweg, Mathijs; Groeneveld, Lisette; Tilanus, Marcel G J

    2016-02-01

    Although the number of HLA alleles still increases, many of them have been reported being uncommon. This is partly due to lack of full length gene sequencing, especially for those alleles belonging to an allele ambiguity in which the first discovered allele has been assigned as the most frequent one. As members of the working group on Common and Well Documented (CWD) alleles and since we implemented full length group-specific sequencing as standard method routinely, we have investigated the presence of presumably rare alleles in our collection of HLA typing data. We identified 50 alleles, that were not previously encountered as Common or Well Documented. Sixteen of them should be added to the CWD catalogue, since we encountered them in 5 or more unrelated individuals. Another 11 could be added, based upon our results and the data present in the IMGT database and the rare allele section of the allele frequencies database. Furthermore, tight associations were observed between several different alleles even at the level of synonymous and non-coding sequences. In addition, in several cases the uncommon allele was found to be more frequent than its common counterpart. PMID:26610902

  2. Genome Sequence of Complex HIV-1 Unique Recombinant Forms Sharing a Common Recombination Breakpoint Identified in Malaysia

    PubMed Central

    Cheong, Hui Ting; Ng, Kim Tien; Ong, Lai Yee; Takebe, Yutaka; Chan, Kok Gan; Koh, Clayton; Al-Darraji, Haider Abdulrazzaq Abed; Kamarulzaman, Adeeba

    2015-01-01

    Three strains of HIV-1 unique recombinant forms (URFs) descended from subtypes B, B′, and CRF01_AE were identified among people who inject drugs in Kuala Lumpur, Malaysia. These three URFs shared a common recombination breakpoint in the reverse transcriptase region, indicating frequent linkage within the drug-injecting networks in Malaysia. PMID:26543107

  3. Comparison of Muscle Onset Activation Sequences between a Golf or Tennis Swing and Common Training Exercises Using Surface Electromyography: A Pilot Study.

    PubMed

    Vasudevan, John M; Logan, Andrew; Shultz, Rebecca; Koval, Jeffrey J; Roh, Eugene Y; Fredericson, Michael

    2016-01-01

    Aim. The purpose of this pilot study is to use surface electromyography to determine an individual athlete's typical muscle onset activation sequence when performing a golf or tennis forward swing and to use the method to assess to what degree the sequence is reproduced with common conditioning exercises and a machine designed for this purpose. Methods. Data for 18 healthy male subjects were collected for 15 muscles of the trunk and lower extremities. Data were filtered and processed to determine the average onset of muscle activation for each motion. A Spearman correlation estimated congruence of activation order between the swing and each exercise. Correlations of each group were pooled with 95% confidence intervals using a random effects meta-analytic strategy. Results. The averaged sequences differed among each athlete tested, but pooled correlations demonstrated a positive association between each exercise and the participants' natural muscle onset activation sequence. Conclusion. The selected training exercises and Turning Point™ device all partially reproduced our athletes' averaged muscle onset activation sequences for both sports. The results support consideration of a larger, adequately powered study using this method to quantify to what degree each of the selected exercises is appropriate for use in both golf and tennis. PMID:27403454

  4. Comparison of Muscle Onset Activation Sequences between a Golf or Tennis Swing and Common Training Exercises Using Surface Electromyography: A Pilot Study

    PubMed Central

    Shultz, Rebecca; Fredericson, Michael

    2016-01-01

    Aim. The purpose of this pilot study is to use surface electromyography to determine an individual athlete's typical muscle onset activation sequence when performing a golf or tennis forward swing and to use the method to assess to what degree the sequence is reproduced with common conditioning exercises and a machine designed for this purpose. Methods. Data for 18 healthy male subjects were collected for 15 muscles of the trunk and lower extremities. Data were filtered and processed to determine the average onset of muscle activation for each motion. A Spearman correlation estimated congruence of activation order between the swing and each exercise. Correlations of each group were pooled with 95% confidence intervals using a random effects meta-analytic strategy. Results. The averaged sequences differed among each athlete tested, but pooled correlations demonstrated a positive association between each exercise and the participants' natural muscle onset activation sequence. Conclusion. The selected training exercises and Turning Point™ device all partially reproduced our athletes' averaged muscle onset activation sequences for both sports. The results support consideration of a larger, adequately powered study using this method to quantify to what degree each of the selected exercises is appropriate for use in both golf and tennis. PMID:27403454

  5. CRISPR MultiTargeter: A Web Tool to Find Common and Unique CRISPR Single Guide RNA Targets in a Set of Similar Sequences

    PubMed Central

    Prykhozhij, Sergey V.; Rajan, Vinothkumar; Gaston, Daniel; Berman, Jason N.

    2015-01-01

    Genome engineering has been revolutionized by the discovery of clustered regularly interspaced palindromic repeats (CRISPR) and CRISPR-associated system genes (Cas) in bacteria. The type IIB Streptococcus pyogenes CRISPR/Cas9 system functions in many species and additional types of CRISPR/Cas systems are under development. In the type II system, expression of CRISPR single guide RNA (sgRNA) targeting a defined sequence and Cas9 generates a sequence-specific nuclease inducing small deletions or insertions. Moreover, knock-in of large DNA inserts has been shown at the sites targeted by sgRNAs and Cas9. Several tools are available for designing sgRNAs that target unique locations in the genome. However, the ability to find sgRNA targets common to several similar sequences or, by contrast, unique to each of these sequences, would also be advantageous. To provide such a tool for several types of CRISPR/Cas system and many species, we developed the CRISPR MultiTargeter software. Similar DNA sequences in question are duplicated genes and sets of exons of different transcripts of a gene. Thus, we implemented a basic sgRNA target search of input sequences for single-sgRNA and two-sgRNA/Cas9 nickase targeting, as well as common and unique sgRNA target searches in 1) a set of input sequences; 2) a set of similar genes or transcripts; or 3) transcripts a single gene. We demonstrate potential uses of the program by identifying unique isoform-specific sgRNA sites in 71% of zebrafish alternative transcripts and common sgRNA target sites in approximately 40% of zebrafish duplicated gene pairs. The design of unique targets in alternative exons is helpful because it will facilitate functional genomic studies of transcript isoforms. Similarly, its application to duplicated genes may simplify multi-gene mutational targeting experiments. Overall, this program provides a unique interface that will enhance use of CRISPR/Cas technology. PMID:25742428

  6. Paleoecology of the Devonian-Mississippian black-shale sequence in eastern Kentucky with an atlas of some common fossils

    SciTech Connect

    Barron, L.S.; Ettensohn, F.R.

    1981-04-01

    The Devonian-Mississippian black-shale sequence of eastern North America is a distinctive stratigraphic interval generally characterized by low clastic influx, high organic production in the water column, anaerobic bottom conditions, and the relative absence of fossil evidence for biologic activity. The laminated black shales which constitute most of the black-shale sequence are broken by two major sequences of interbedded greenish-gray, clayey shales which contain bioturbation and pyritized micromorph invertebrates. The black shales contain abundant evidence of life from upper parts of the water column such as fish fossils, conodonts, algae and other phytoplankton; however, there is a lack of evidence of benthic life. The rare brachiopods, crinoids, and molluscs that occur in the black shales were probably epiplanktic. A significant physical distinction between the environment in which the black sediments were deposited and that in which the greenish-gray sediments were deposited was the level of dissolved oxygen. The laminated black shales point to anaerobic conditions and the bioturbated greenish-gray shales suggest dysaerobic to marginally aerobic-dysaerobic conditions. A paleoenvironmental model in which quasi-estuarine circulation compliments and enhances the effect of a stratified water column can account for both depletion of dissolved oxygen in the bottom environments and the absence of oxygen replenishment during black-shale deposition. Periods of abundant clastic influx from fluvial environments to the east probably account for the abundance of clays in the greenish-gray shale as well as the small amounts of oxygen necessary to support the depauparate, opportunistic, benthic faunas found there. These pulses of greenish-gray clastics were short-lived and eventually were replaced by anaerobic conditions and low rates of clastic sedimentation which characterized most of black-shale deposition.

  7. RNA sequencing of laser-capture microdissected compartments of the maize kernel identifies regulatory modules associated with endosperm cell differentiation.

    PubMed

    Zhan, Junpeng; Thakare, Dhiraj; Ma, Chuang; Lloyd, Alan; Nixon, Neesha M; Arakaki, Angela M; Burnett, William J; Logan, Kyle O; Wang, Dongfang; Wang, Xiangfeng; Drews, Gary N; Yadegari, Ramin

    2015-03-01

    Endosperm is an absorptive structure that supports embryo development or seedling germination in angiosperms. The endosperm of cereals is a main source of food, feed, and industrial raw materials worldwide. However, the genetic networks that regulate endosperm cell differentiation remain largely unclear. As a first step toward characterizing these networks, we profiled the mRNAs in five major cell types of the differentiating endosperm and in the embryo and four maternal compartments of the maize (Zea mays) kernel. Comparisons of these mRNA populations revealed the diverged gene expression programs between filial and maternal compartments and an unexpected close correlation between embryo and the aleurone layer of endosperm. Gene coexpression network analysis identified coexpression modules associated with single or multiple kernel compartments including modules for the endosperm cell types, some of which showed enrichment of previously identified temporally activated and/or imprinted genes. Detailed analyses of a coexpression module highly correlated with the basal endosperm transfer layer (BETL) identified a regulatory module activated by MRP-1, a regulator of BETL differentiation and function. These results provide a high-resolution atlas of gene activity in the compartments of the maize kernel and help to uncover the regulatory modules associated with the differentiation of the major endosperm cell types. PMID:25783031

  8. Analysis of BAC-end sequences in common bean (Phaseolus vulgaris L.) towards the development and characterization of long motifs SSRs.

    PubMed

    Müller, Bárbara Salomão de Faria; Sakamoto, Tetsu; de Menezes, Ivandilson Pessoa Pinto; Prado, Guilherme Souza; Martins, Wellington Santos; Brondani, Claudio; de Barros, Everaldo Gonçalves; Vianello, Rosana Pereira

    2014-11-01

    The increasing volume of genomic data on the Phaseolus vulgaris species have contributed to its importance as a model genetic species and positively affected the investigation of other legumes of scientific and economic value. To expand and gain a more in-depth knowledge of the common bean genome, the ends of a number of bacterial artificial chromosome (BAC) were sequenced, annotated and the presence of repetitive sequences was determined. In total, 52,270 BESs (BAC-end sequences), equivalent to 32 Mbp (~6 %) of the genome, were processed. In total, 3,789 BES-SSRs were identified, with a distribution of one SSR (simple sequence repeat) per 8.36 kbp and 2,000 were suitable for the development of SSRs, of which 194 were evaluated in low-resolution screening. From 40 BES-SSRs based on long motifs SSRs (≥ trinucleotides) analyzed in high-resolution genotyping, 34 showed an equally good amplification for the Andean and for the Mesoamerican genepools, exhibiting an average gene diversity (H E) of 0.490 and 5.59 alleles/locus, of which six classified as Class I showed a H E ≥ 0.7. The PCoA and structure analysis allowed to discriminate the gene pools (K = 2, FST = 0.733). From the 52,270 BESs, 2 % corresponded to transcription factors and 3 % to transposable elements. Putative functions for 24,321 BESs were identified and for 19,363 were assigned functional categories (gene ontology). This study identified highly polymorphic BES-SSRs containing tri- to hexanucleotides motifs and bringing together relevant genetic characteristics useful for breeding programs. Additionally, the BESs were incorporated into the international genome-sequencing project for the common bean. PMID:25164100

  9. Variations in the coding and regulatory sequences of the angiogenin (ANG) gene are not associated to ALS (amyotrophic lateral sclerosis) in the Italian population.

    PubMed

    Corrado, Lucia; Battistini, Stefania; Penco, Silvana; Bergamaschi, Laura; Testa, Lucia; Ricci, Claudia; Giannini, Fabio; Greco, Giuseppe; Patrosso, Maria Cristina; Pileggi, Simona; Causarano, Renzo; Mazzini, Letizia; Momigliano-Richiardi, Patricia; D'Alfonso, Sandra

    2007-07-15

    Potentially causative missense variations in the ANG gene and a positive association with the synonymous rs11701-G substitution was detected mainly in Irish and Scottish ALS patients. We screened 262 Italian SOD1 negative ALS patients (250 sporadic) and 415 matched controls for sequence variations in the coding, 3'/5' UTR and 5' flanking (642 bp) regions of the ANG gene. We identified 53 sequence variations of which 46 new, 20 with a minor allele frequency (MAF) >or=0.01 and only three localised in the coding sequence, namely the missense I46V, identified in one patient and two controls, and the synonymous G86G and T97T corresponding to rs11701 and rs2228653. None of the detected SNPs or of their haplotypic combinations was significantly associated with ALS susceptibility or clinical features. In conclusion, we did not detect the association with rs11701-G or with any other newly detected variation in the ANG regulatory region. Furthermore we did not identify potentially causal mutations in the coding region. PMID:17462671

  10. De novo Sequencing of the Leaf Transcriptome Reveals Complex Light-Responsive Regulatory Networks in Camellia sinensis cv. Baijiguan

    PubMed Central

    Wu, Quanjin; Chen, Zhidan; Sun, Weijiang; Deng, Tingting; Chen, Mingjie

    2016-01-01

    Tea plants (Camellia sinensis L.) possess high genetic diversity that is important for breeding. One cultivar, Baijiguan, exhibits a yellow leaf phenotype, reduced chlorophyll (Chl) content, and aberrant chloroplast structures under high light intensity. In contrast, under low light intensity, the flush shoot from Baijiguan becomes green, the Chl content increases significantly, and the chloroplasts exhibit normal structures. To understand the underlying molecular mechanisms for these observations, we performed de novo transcriptome sequencing and digital gene expression (DGE) profiling using Illumina sequencing technology. De novo transcriptome assembly identified 88,788 unigenes, including 1652 transcription factors from 25 families. In total, 1993 and 2576 differentially expressed genes (DEGs) were identified in Baijiguan plants treated with 3 and 6 days of shade, respectively. Gene Ontology (GO) and pathway enrichment analyses indicated that the DEGs are predominantly involved in the ROS scavenging system, chloroplast development, photosynthetic pigment synthesis, secondary metabolism, and circadian systems. The light-responsive gene POR (protochlorophyllide oxidoreductase) and transcription factor HY5 were identified. Quantitative real-time PCR (qRT-PCR) analysis of 20 selected DEGs confirmed the RNA-sequencing (RNA-Seq) results. Overall, these findings suggest that high light intensity inhibits the expression of photosystem II 10-kDa protein (PsbR) in Baijiguan, thus affecting PSII stability, chloroplast development and chlorophyll biosynthesis. PMID:27047513

  11. De novo Sequencing of the Leaf Transcriptome Reveals Complex Light-Responsive Regulatory Networks in Camellia sinensis cv. Baijiguan.

    PubMed

    Wu, Quanjin; Chen, Zhidan; Sun, Weijiang; Deng, Tingting; Chen, Mingjie

    2016-01-01

    Tea plants (Camellia sinensis L.) possess high genetic diversity that is important for breeding. One cultivar, Baijiguan, exhibits a yellow leaf phenotype, reduced chlorophyll (Chl) content, and aberrant chloroplast structures under high light intensity. In contrast, under low light intensity, the flush shoot from Baijiguan becomes green, the Chl content increases significantly, and the chloroplasts exhibit normal structures. To understand the underlying molecular mechanisms for these observations, we performed de novo transcriptome sequencing and digital gene expression (DGE) profiling using Illumina sequencing technology. De novo transcriptome assembly identified 88,788 unigenes, including 1652 transcription factors from 25 families. In total, 1993 and 2576 differentially expressed genes (DEGs) were identified in Baijiguan plants treated with 3 and 6 days of shade, respectively. Gene Ontology (GO) and pathway enrichment analyses indicated that the DEGs are predominantly involved in the ROS scavenging system, chloroplast development, photosynthetic pigment synthesis, secondary metabolism, and circadian systems. The light-responsive gene POR (protochlorophyllide oxidoreductase) and transcription factor HY5 were identified. Quantitative real-time PCR (qRT-PCR) analysis of 20 selected DEGs confirmed the RNA-sequencing (RNA-Seq) results. Overall, these findings suggest that high light intensity inhibits the expression of photosystem II 10-kDa protein (PsbR) in Baijiguan, thus affecting PSII stability, chloroplast development and chlorophyll biosynthesis. PMID:27047513

  12. A widely expressed transcription factor with multiple DNA sequence specificity, CTCF, is localized at chromosome segment 16q22.1 within one of the smallest regions of overlap for common deletions in breast and prostate cancers.

    PubMed

    Filippova, G N; Lindblom, A; Meincke, L J; Klenova, E M; Neiman, P E; Collins, S J; Doggett, N A; Lobanenkov, V V

    1998-05-01

    The cellular protooncogene MYC encodes a nuclear transcription factor that is involved in regulating important cellular functions, including cell cycle progression, differentiation, and apoptosis. Dysregulated MYC expression appears critical to the development of various types of malignancies, and thus factors involved in regulating MYC expression may also play a key role in the pathogenesis of certain cancers. We have cloned one such MYC regulatory factor, termed CTCF, which is a highly evolutionarily conserved-11-zinc finger transcriptional factor possessing multiple DNA sequence specificity. CTCF binds to a number of important regulatory regions within the 5' noncoding sequence of the human MYC oncogene, and it can regulate its transcription in several experimental systems. CTCF mRNA is expressed in cells of multiple different lineages. Enforced ectopic expression of CTCF inhibits cell growth in culture. Southern blot analyses and fluorescence in situ hybridization (FISH) with normal human metaphase chromosomes showed that the human CTCF is a single-copy gene situated at chromosome locus 16q22. Cytogenetic studies have pointed out that chromosome abnormalities (deletions) at this locus frequently occur in many different human malignancies, suggesting the presence of one or more tumor suppressor genes in the region. To narrow down their localization, several loss of heterozygosity (LOH) studies of chromosome arm 16q in sporadic breast and prostate cancers have been carried out to define the most recurrent and smallest region(s) of overlap (SRO) for commonly deleted chromosome arm 16q material. For CTCF to be considered as a candidate tumor suppressor gene associated with tumorigenesis, it should localize within one of the SROs at 16q. Fine-mapping of CTCF has enabled us to assign the CTCF gene to about a 2 centiMorgan (cM) interval of 16q22.1 between the somatic cell hybrid breakpoints CY130(D) and CY4, which is between markers D16S186 (16AC16-101) and D16S496

  13. SNP Assay Development for Linkage Map Construction, Anchoring Whole-Genome Sequence, and Other Genetic and Genomic Applications in Common Bean

    DOE PAGESBeta

    Song, Qijian; Jia, Gaofeng; Hyten, David L.; Jenkins, Jerry; Hwang, Eun-Young; Schroeder, Steven G.; Osorno, Juan M.; Schmutz, Jeremy; Jackson, Scott A.; McClean, Phillip E.; et al

    2015-08-28

    A total of 992,682 single-nucleotide polymorphisms (SNPs) was identified as ideal for Illumina Infinium II BeadChip design after sequencing a diverse set of 17 common bean (Phaseolus vulgaris L) varieties with the aid of next-generation sequencing technology. From these, two BeadChips each with >5000 SNPs were designed. The BARCBean6K_1 BeadChip was selected for the purpose of optimizing polymorphism among market classes and, when possible, SNPs were targeted to sequence scaffolds in the Phaseolus vulgaris 14× genome assembly with sequence lengths >10 kb. The BARCBean6K_2 BeadChip was designed with the objective of anchoring additional scaffolds and to facilitate orientation of largemore » scaffolds. Analysis of 267 F2 plants from a cross of varieties Stampede × Red Hawk with the two BeadChips resulted in linkage maps with a total of 7040 markers including 7015 SNPs. With the linkage map, a total of 432.3 Mb of sequence from 2766 scaffolds was anchored to create the Phaseolus vulgaris v1.0 assembly, which accounted for approximately 89% of the 487 Mb of available sequence scaffolds of the Phaseolus vulgaris v0.9 assembly. A core set of 6000 SNPs (BARCBean6K_3 BeadChip) with high genotyping quality and polymorphism was selected based on the genotyping of 365 dry bean and 134 snap bean accessions with the BARCBean6K_1 and BARCBean6K_2 BeadChips. The BARCBean6K_3 BeadChip is a useful tool for genetics and genomics research and it is widely used by breeders and geneticists in the United States and abroad.« less

  14. SNP Assay Development for Linkage Map Construction, Anchoring Whole-Genome Sequence, and Other Genetic and Genomic Applications in Common Bean

    PubMed Central

    Song, Qijian; Jia, Gaofeng; Hyten, David L.; Jenkins, Jerry; Hwang, Eun-Young; Schroeder, Steven G.; Osorno, Juan M.; Schmutz, Jeremy; Jackson, Scott A.; McClean, Phillip E.; Cregan, Perry B.

    2015-01-01

    A total of 992,682 single-nucleotide polymorphisms (SNPs) was identified as ideal for Illumina Infinium II BeadChip design after sequencing a diverse set of 17 common bean (Phaseolus vulgaris L) varieties with the aid of next-generation sequencing technology. From these, two BeadChips each with >5000 SNPs were designed. The BARCBean6K_1 BeadChip was selected for the purpose of optimizing polymorphism among market classes and, when possible, SNPs were targeted to sequence scaffolds in the Phaseolus vulgaris 14× genome assembly with sequence lengths >10 kb. The BARCBean6K_2 BeadChip was designed with the objective of anchoring additional scaffolds and to facilitate orientation of large scaffolds. Analysis of 267 F2 plants from a cross of varieties Stampede × Red Hawk with the two BeadChips resulted in linkage maps with a total of 7040 markers including 7015 SNPs. With the linkage map, a total of 432.3 Mb of sequence from 2766 scaffolds was anchored to create the Phaseolus vulgaris v1.0 assembly, which accounted for approximately 89% of the 487 Mb of available sequence scaffolds of the Phaseolus vulgaris v0.9 assembly. A core set of 6000 SNPs (BARCBean6K_3 BeadChip) with high genotyping quality and polymorphism was selected based on the genotyping of 365 dry bean and 134 snap bean accessions with the BARCBean6K_1 and BARCBean6K_2 BeadChips. The BARCBean6K_3 BeadChip is a useful tool for genetics and genomics research and it is widely used by breeders and geneticists in the United States and abroad. PMID:26318155

  15. SNP Assay Development for Linkage Map Construction, Anchoring Whole-Genome Sequence, and Other Genetic and Genomic Applications in Common Bean

    SciTech Connect

    Song, Qijian; Jia, Gaofeng; Hyten, David L.; Jenkins, Jerry; Hwang, Eun-Young; Schroeder, Steven G.; Osorno, Juan M.; Schmutz, Jeremy; Jackson, Scott A.; McClean, Phillip E.; Cregan, Perry B.

    2015-08-28

    A total of 992,682 single-nucleotide polymorphisms (SNPs) was identified as ideal for Illumina Infinium II BeadChip design after sequencing a diverse set of 17 common bean (Phaseolus vulgaris L) varieties with the aid of next-generation sequencing technology. From these, two BeadChips each with >5000 SNPs were designed. The BARCBean6K_1 BeadChip was selected for the purpose of optimizing polymorphism among market classes and, when possible, SNPs were targeted to sequence scaffolds in the Phaseolus vulgaris 14× genome assembly with sequence lengths >10 kb. The BARCBean6K_2 BeadChip was designed with the objective of anchoring additional scaffolds and to facilitate orientation of large scaffolds. Analysis of 267 F2 plants from a cross of varieties Stampede × Red Hawk with the two BeadChips resulted in linkage maps with a total of 7040 markers including 7015 SNPs. With the linkage map, a total of 432.3 Mb of sequence from 2766 scaffolds was anchored to create the Phaseolus vulgaris v1.0 assembly, which accounted for approximately 89% of the 487 Mb of available sequence scaffolds of the Phaseolus vulgaris v0.9 assembly. A core set of 6000 SNPs (BARCBean6K_3 BeadChip) with high genotyping quality and polymorphism was selected based on the genotyping of 365 dry bean and 134 snap bean accessions with the BARCBean6K_1 and BARCBean6K_2 BeadChips. The BARCBean6K_3 BeadChip is a useful tool for genetics and genomics research and it is widely used by breeders and geneticists in the United States and abroad.

  16. SNP Assay Development for Linkage Map Construction, Anchoring Whole-Genome Sequence, and Other Genetic and Genomic Applications in Common Bean.

    PubMed

    Song, Qijian; Jia, Gaofeng; Hyten, David L; Jenkins, Jerry; Hwang, Eun-Young; Schroeder, Steven G; Osorno, Juan M; Schmutz, Jeremy; Jackson, Scott A; McClean, Phillip E; Cregan, Perry B

    2015-11-01

    A total of 992,682 single-nucleotide polymorphisms (SNPs) was identified as ideal for Illumina Infinium II BeadChip design after sequencing a diverse set of 17 common bean (Phaseolus vulgaris L) varieties with the aid of next-generation sequencing technology. From these, two BeadChips each with >5000 SNPs were designed. The BARCBean6K_1 BeadChip was selected for the purpose of optimizing polymorphism among market classes and, when possible, SNPs were targeted to sequence scaffolds in the Phaseolus vulgaris 14× genome assembly with sequence lengths >10 kb. The BARCBean6K_2 BeadChip was designed with the objective of anchoring additional scaffolds and to facilitate orientation of large scaffolds. Analysis of 267 F2 plants from a cross of varieties Stampede × Red Hawk with the two BeadChips resulted in linkage maps with a total of 7040 markers including 7015 SNPs. With the linkage map, a total of 432.3 Mb of sequence from 2766 scaffolds was anchored to create the Phaseolus vulgaris v1.0 assembly, which accounted for approximately 89% of the 487 Mb of available sequence scaffolds of the Phaseolus vulgaris v0.9 assembly. A core set of 6000 SNPs (BARCBean6K_3 BeadChip) with high genotyping quality and polymorphism was selected based on the genotyping of 365 dry bean and 134 snap bean accessions with the BARCBean6K_1 and BARCBean6K_2 BeadChips. The BARCBean6K_3 BeadChip is a useful tool for genetics and genomics research and it is widely used by breeders and geneticists in the United States and abroad. PMID:26318155

  17. Identification of sequence similarity between 60 kDa and 70 kDa molecular chaperones: evidence for a common evolutionary background?

    PubMed Central

    Flores, A I; Cuezva, J M

    1997-01-01

    Recent findings support the premise that chaperonins (60 kDa stress-proteins) and alpha-subunits of F-type ATPases (alpha-ATPase) are evolutionary related protein families. Two-dimensional gel patterns of synthesized proteins in unstressed and heat-shocked embryonic Drosophila melanogaster SL2 cells revealed that antibodies raised against the alpha-subunit of the F1-ATPase complex from rat liver recognize an inducible p71 member of the 70 kDa stress-responsive protein family. Molecular recognition of this stress-responsive 70 kDa protein by antibodies raised against the F1-ATPase alpha-subunit suggests the possibility of partial sequence similarity within these ATP-binding protein families. A multiple sequence alignment between alpha-ATPases and 60 kDa and 70 kDa molecular chaperones is presented. Statistical evaluation of sequence similarity reveals a significant degree of sequence conservation within the three protein families. The finding suggests a common evolutionary origin for the ATPases and molecular chaperone protein families of 60 kDa and 70 kDa, despite the lack of obvious structural resemblance between them. PMID:9065788

  18. ICO amplicon NGS data analysis: a Web tool for variant detection in common high-risk hereditary cancer genes analyzed by amplicon GS Junior next-generation sequencing.

    PubMed

    Lopez-Doriga, Adriana; Feliubadaló, Lídia; Menéndez, Mireia; Lopez-Doriga, Sergio; Morón-Duran, Francisco D; del Valle, Jesús; Tornero, Eva; Montes, Eva; Cuesta, Raquel; Campos, Olga; Gómez, Carolina; Pineda, Marta; González, Sara; Moreno, Victor; Capellá, Gabriel; Lázaro, Conxi

    2014-03-01

    Next-generation sequencing (NGS) has revolutionized genomic research and is set to have a major impact on genetic diagnostics thanks to the advent of benchtop sequencers and flexible kits for targeted libraries. Among the main hurdles in NGS are the difficulty of performing bioinformatic analysis of the huge volume of data generated and the high number of false positive calls that could be obtained, depending on the NGS technology and the analysis pipeline. Here, we present the development of a free and user-friendly Web data analysis tool that detects and filters sequence variants, provides coverage information, and allows the user to customize some basic parameters. The tool has been developed to provide accurate genetic analysis of targeted sequencing of common high-risk hereditary cancer genes using amplicon libraries run in a GS Junior System. The Web resource is linked to our own mutation database, to assist in the clinical classification of identified variants. We believe that this tool will greatly facilitate the use of the NGS approach in routine laboratories. PMID:24227591

  19. Genetic divergence analysis of the Common Barn Owl Tyto alba (Scopoli, 1769) and the Short-eared Owl Asio flammeus (Pontoppidan, 1763) from southern Chile using COI sequence

    PubMed Central

    Colihueque, Nelson; Gantz, Alberto; Rau, Jaime Ricardo; Parraguez, Margarita

    2015-01-01

    Abstract In this paper new mitochondrial COI sequences of Common Barn Owl Tyto alba (Scopoli, 1769) and Short-eared Owl Asio flammeus (Pontoppidan, 1763) from southern Chile are reported and compared with sequences from other parts of the World. The intraspecific genetic divergence (mean p-distance) was 4.6 to 5.5% for the Common Barn Owl in comparison with specimens from northern Europe and Australasia and 3.1% for the Short-eared Owl with respect to samples from north America, northern Europe and northern Asia. Phylogenetic analyses revealed three distinctive groups for the Common Barn Owl: (i) South America (Chile and Argentina) plus Central and North America, (ii) northern Europe and (iii) Australasia, and two distinctive groups for the Short-eared Owl: (i) South America (Chile and Argentina) and (ii) north America plus northern Europe and northern Asia. The level of genetic divergence observed in both species exceeds the upper limit of intraspecific comparisons reported previously for Strigiformes. Therefore, this suggests that further research is needed to assess the taxonomic status, particularly for the Chilean populations that, to date, have been identified as belonging to these species through traditional taxonomy. PMID:26668551

  20. Genetic divergence analysis of the Common Barn Owl Tyto alba (Scopoli, 1769) and the Short-eared Owl Asio flammeus (Pontoppidan, 1763) from southern Chile using COI sequence.

    PubMed

    Colihueque, Nelson; Gantz, Alberto; Rau, Jaime Ricardo; Parraguez, Margarita

    2015-01-01

    In this paper new mitochondrial COI sequences of Common Barn Owl Tyto alba (Scopoli, 1769) and Short-eared Owl Asio flammeus (Pontoppidan, 1763) from southern Chile are reported and compared with sequences from other parts of the World. The intraspecific genetic divergence (mean p-distance) was 4.6 to 5.5% for the Common Barn Owl in comparison with specimens from northern Europe and Australasia and 3.1% for the Short-eared Owl with respect to samples from north America, northern Europe and northern Asia. Phylogenetic analyses revealed three distinctive groups for the Common Barn Owl: (i) South America (Chile and Argentina) plus Central and North America, (ii) northern Europe and (iii) Australasia, and two distinctive groups for the Short-eared Owl: (i) South America (Chile and Argentina) and (ii) north America plus northern Europe and northern Asia. The level of genetic divergence observed in both species exceeds the upper limit of intraspecific comparisons reported previously for Strigiformes. Therefore, this suggests that further research is needed to assess the taxonomic status, particularly for the Chilean populations that, to date, have been identified as belonging to these species through traditional taxonomy. PMID:26668551

  1. Analysis of the coding-complete genomic sequence of groundnut ringspot virus suggests a common ancestor with tomato chlorotic spot virus.

    PubMed

    de Breuil, Soledad; Cañizares, Joaquín; Blanca, José Miguel; Bejerman, Nicolás; Trucco, Verónica; Giolitti, Fabián; Ziarsolo, Peio; Lenardon, Sergio

    2016-08-01

    Groundnut ringspot virus (GRSV) and tomato chlorotic spot virus (TCSV) share biological and serological properties, so their identification is carried out by molecular methods. Their genomes consist of three segmented RNAs: L, M and S. The finding of a reassortant between these two viruses may complicate correct virus identification and requires the characterization of the complete genome. Therefore, we present for the first time the complete sequences of all the genes encoded by a GRSV isolate. The high level of sequence similarity between GRSV and TCSV (over 90 % identity) observed in the genes and proteins encoded in the M RNA support previous results indicating that these viruses probably have a common ancestor. PMID:27260536

  2. Next-generation sequencing of genomic DNA fragments bound to a transcription factor in vitro reveals its regulatory potential.

    PubMed

    Kurihara, Yukio; Makita, Yuko; Kawashima, Mika; Hamasaki, Hidefumi; Yamamoto, Yoshiharu Y; Matsui, Minami

    2014-01-01

    Several transcription factors (TFs) coordinate to regulate expression of specific genes at the transcriptional level. In Arabidopsis thaliana it is estimated that approximately 10% of all genes encode TFs or TF-like proteins. It is important to identify target genes that are directly regulated by TFs in order to understand the complete picture of a plant's transcriptome profile. Here, we investigate the role of the LONG HYPOCOTYL5 (HY5) transcription factor that acts as a regulator of photomorphogenesis. We used an in vitro genomic DNA binding assay coupled with immunoprecipitation and next-generation sequencing (gDB-seq) instead of the in vivo chromatin immunoprecipitation (ChIP)-based methods. The results demonstrate that the HY5-binding motif predicted here was similar to the motif reported previously and that in vitro HY5-binding loci largely overlapped with the HY5-targeted candidate genes identified in previous ChIP-chip analysis. By combining these results with microarray analysis, we identified hundreds of HY5-binding genes that were differentially expressed in hy5. We also observed delayed induction of some transcripts of HY5-binding genes in hy5 mutants in response to blue-light exposure after dark treatment. Thus, an in vitro gDNA-binding assay coupled with sequencing is a convenient and powerful method to bridge the gap between identifying TF binding potential and establishing function. PMID:25534860

  3. Transposable Elements and DNA Methylation Create in Embryonic Stem Cells Human-Specific Regulatory Sequences Associated with Distal Enhancers and Noncoding RNAs.

    PubMed

    Glinsky, Gennadi V

    2015-06-01

    Despite significant progress in the structural and functional characterization of the human genome, understanding of the mechanisms underlying the genetic basis of human phenotypic uniqueness remains limited. Here, I report that transposable element-derived sequences, most notably LTR7/HERV-H, LTR5_Hs, and L1HS, harbor 99.8% of the candidate human-specific regulatory loci (HSRL) with putative transcription factor-binding sites in the genome of human embryonic stem cells (hESC). A total of 4,094 candidate HSRL display selective and site-specific binding of critical regulators (NANOG [Nanog homeobox], POU5F1 [POU class 5 homeobox 1], CCCTC-binding factor [CTCF], Lamin B1), and are preferentially located within the matrix of transcriptionally active DNA segments that are hypermethylated in hESC. hESC-specific NANOG-binding sites are enriched near the protein-coding genes regulating brain size, pluripotency long noncoding RNAs, hESC enhancers, and 5-hydroxymethylcytosine-harboring regions immediately adjacent to binding sites. Sequences of only 4.3% of hESC-specific NANOG-binding sites are present in Neanderthals' genome, suggesting that a majority of these regulatory elements emerged in Modern Humans. Comparisons of estimated creation rates of novel TF-binding sites revealed that there was 49.7-fold acceleration of creation rates of NANOG-binding sites in genomes of Chimpanzees compared with the mouse genomes and further 5.7-fold acceleration in genomes of Modern Humans compared with the Chimpanzees genomes. Preliminary estimates suggest that emergence of one novel NANOG-binding site detectable in hESC required 466 years of evolution. Pathway analysis of coding genes that have hESC-specific NANOG-binding sites within gene bodies or near gene boundaries revealed their association with physiological development and functions of nervous and cardiovascular systems, embryonic development, behavior, as well as development of a diverse spectrum of pathological conditions

  4. Transposable Elements and DNA Methylation Create in Embryonic Stem Cells Human-Specific Regulatory Sequences Associated with Distal Enhancers and Noncoding RNAs

    PubMed Central

    Glinsky, Gennadi V.

    2015-01-01

    Despite significant progress in the structural and functional characterization of the human genome, understanding of the mechanisms underlying the genetic basis of human phenotypic uniqueness remains limited. Here, I report that transposable element-derived sequences, most notably LTR7/HERV-H, LTR5_Hs, and L1HS, harbor 99.8% of the candidate human-specific regulatory loci (HSRL) with putative transcription factor-binding sites in the genome of human embryonic stem cells (hESC). A total of 4,094 candidate HSRL display selective and site-specific binding of critical regulators (NANOG [Nanog homeobox], POU5F1 [POU class 5 homeobox 1], CCCTC-binding factor [CTCF], Lamin B1), and are preferentially located within the matrix of transcriptionally active DNA segments that are hypermethylated in hESC. hESC-specific NANOG-binding sites are enriched near the protein-coding genes regulating brain size, pluripotency long noncoding RNAs, hESC enhancers, and 5-hydroxymethylcytosine-harboring regions immediately adjacent to binding sites. Sequences of only 4.3% of hESC-specific NANOG-binding sites are present in Neanderthals’ genome, suggesting that a majority of these regulatory elements emerged in Modern Humans. Comparisons of estimated creation rates of novel TF-binding sites revealed that there was 49.7-fold acceleration of creation rates of NANOG-binding sites in genomes of Chimpanzees compared with the mouse genomes and further 5.7-fold acceleration in genomes of Modern Humans compared with the Chimpanzees genomes. Preliminary estimates suggest that emergence of one novel NANOG-binding site detectable in hESC required 466 years of evolution. Pathway analysis of coding genes that have hESC-specific NANOG-binding sites within gene bodies or near gene boundaries revealed their association with physiological development and functions of nervous and cardiovascular systems, embryonic development, behavior, as well as development of a diverse spectrum of pathological conditions

  5. Common chromosomal fragile site FRA16D sequence: identification of the FOR gene spanning FRA16D and homozygous deletions and translocation breakpoints in cancer cells.

    PubMed

    Ried, K; Finnis, M; Hobson, L; Mangelsdorf, M; Dayan, S; Nancarrow, J K; Woollatt, E; Kremmidiotis, G; Gardner, A; Venter, D; Baker, E; Richards, R I

    2000-07-01

    Fluorescence in situ hybridization of a tile path of DNA subclones has previously enabled the cyto-genetic definition of the minimal DNA sequence which spans the FRA16D common chromosomal fragile site, located at 16q23.2. Homozygous deletion of the FRA16D locus has been reported in adenocarcinomas of stomach, colon, lung and ovary. We have sequenced the 270 kb containing the FRA16D fragile site and the minimal homozygously deleted region in tumour cells. This sequence enabled localization of some of the tumour cell breakpoints to regions which contain AT-rich secondary structures similar to those associated with the FRA10B and FRA16B rare fragile sites. The FRA16D DNA sequence also led to the identification of an alternatively spliced gene, named FOR (fragile site FRA16D oxidoreductase), exons of which span both the fragile site and the minimal region of homozygous deletion. In addition, the complete DNA sequence of the FRA16D-containing FOR intron reveals no evidence of additional authentic transcripts. Alternatively spliced FOR transcripts (FOR I, FOR II and FOR III) encode proteins which share N-terminal WW domains and differ at their C-terminus, with FOR III having a truncated oxidoreductase domain. FRA16D-associated deletions selectively affect the FOR gene transcripts. Three out of five previously mapped translocation breakpoints in multiple myeloma are also located within the FOR gene. FOR is therefore the principle genetic target for DNA instability at 16q23.2 and perturbation of FOR function is likely to contribute to the biological consequences of DNA instability at FRA16D in cancer cells. PMID:10861292

  6. Common methods for fecal sample storage in field studies yield consistent signatures of individual identity in microbiome sequencing data.

    PubMed

    Blekhman, Ran; Tang, Karen; Archie, Elizabeth A; Barreiro, Luis B; Johnson, Zachary P; Wilson, Mark E; Kohn, Jordan; Yuan, Michael L; Gesquiere, Laurence; Grieneisen, Laura E; Tung, Jenny

    2016-01-01

    Field studies of wild vertebrates are frequently associated with extensive collections of banked fecal samples-unique resources for understanding ecological, behavioral, and phylogenetic effects on the gut microbiome. However, we do not understand whether sample storage methods confound the ability to investigate interindividual variation in gut microbiome profiles. Here, we extend previous work on storage methods for gut microbiome samples by comparing immediate freezing, the gold standard of preservation, to three methods commonly used in vertebrate field studies: lyophilization, storage in ethanol, and storage in RNAlater. We found that the signature of individual identity consistently outweighed storage effects: alpha diversity and beta diversity measures were significantly correlated across methods, and while samples often clustered by donor, they never clustered by storage method. Provided that all analyzed samples are stored the same way, banked fecal samples therefore appear highly suitable for investigating variation in gut microbiota. Our results open the door to a much-expanded perspective on variation in the gut microbiome across species and ecological contexts. PMID:27528013

  7. Common methods for fecal sample storage in field studies yield consistent signatures of individual identity in microbiome sequencing data

    PubMed Central

    Blekhman, Ran; Tang, Karen; Archie, Elizabeth A.; Barreiro, Luis B.; Johnson, Zachary P.; Wilson, Mark E.; Kohn, Jordan; Yuan, Michael L.; Gesquiere, Laurence; Grieneisen, Laura E.; Tung, Jenny

    2016-01-01

    Field studies of wild vertebrates are frequently associated with extensive collections of banked fecal samples—unique resources for understanding ecological, behavioral, and phylogenetic effects on the gut microbiome. However, we do not understand whether sample storage methods confound the ability to investigate interindividual variation in gut microbiome profiles. Here, we extend previous work on storage methods for gut microbiome samples by comparing immediate freezing, the gold standard of preservation, to three methods commonly used in vertebrate field studies: lyophilization, storage in ethanol, and storage in RNAlater. We found that the signature of individual identity consistently outweighed storage effects: alpha diversity and beta diversity measures were significantly correlated across methods, and while samples often clustered by donor, they never clustered by storage method. Provided that all analyzed samples are stored the same way, banked fecal samples therefore appear highly suitable for investigating variation in gut microbiota. Our results open the door to a much-expanded perspective on variation in the gut microbiome across species and ecological contexts. PMID:27528013

  8. Organization, regulatory sequences, and alternatively spliced transcripts of the mucosal addressin cell adhesion molecule-1 (MAdCAM-1) gene

    SciTech Connect

    Sampaio, S.O.; Mei, C.; Butcher, E.C.

    1995-09-01

    The mucosal addressin cell adhesion molecule-1 (MAdCAM-1) is expressed selectively at venular sites of lymphocyte extravasation into mucosal lymphoid tissues and lamina propria, where it directs local lymphocyte trafficking. MAdCAM-1 is a multifunctional type I transmembrane adhesion molecule comprising two distal Ig domains involved in {alpha}4{beta}7 integrin binding, a mucin-like region able to display L-selectin-binding carbohydrates, and a membrane-proximal Ig domain homologous to IgA. We show in this work that the MAdCAM-1 gene is located on chromosome 10 and contains five exons. The signal peptide and each one of the three Ig domains are encoded by a distinct exon, whereas the transmembrane, cytoplasmic tail, and 3{prime}-untranslated region of MAdCAM-1 are combined on a single exon. The mucin-like region and the third Ig domain are encoded together on exon 4. An alternatively spliced MAdCAM-1 mRNA is identified that lacks the mucin/IgA-homologous exon 4-encoded sequences. This short variant of MAdCAM-1 may be specialized to support {alpha}4{beta}7-dependent adhesion strengthening, independent of carbohydrate-presenting function. Sequences 5{prime} of the transcription start site include tandem nuclear factor-KB sites; AP-1, AP-2, and signal peptide-1 binding sites; and an estrogen response element. Our findings reinforce the correspondence between the multidomain structure and versatile functions of this vascular addressin, and suggest an additional level of regulation of carbohydrate-presenting capability, and thus of its importance in lectin-mediated vs. {alpha}4{beta}7-dependent adhesive events in lymphocyte trafficking. 46 refs., 6 figs., 1 tab.

  9. RNA sequencing and functional analysis implicate the regulatory role of long non-coding RNAs in tomato fruit ripening

    PubMed Central

    Zhu, Benzhong; Yang, Yongfang; Li, Ran; Fu, Daqi; Wen, Liwei; Luo, Yunbo; Zhu, Hongliang

    2015-01-01

    Recently, long non-coding RNAs (lncRNAs) have been shown to play critical regulatory roles in model plants, such as Arabidopsis, rice, and maize. However, the presence of lncRNAs and how they function in fleshy fruit ripening are still largely unknown because fleshy fruit ripening is not present in the above model plants. Tomato is the model system for fruit ripening studies due to its dramatic ripening process. To investigate further the role of lncRNAs in fruit ripening, it is necessary and urgent to discover and identify novel lncRNAs and understand the function of lncRNAs in tomato fruit ripening. Here it is reported that 3679 lncRNAs were discovered from wild-type tomato and ripening mutant fruit. The lncRNAs are transcribed from all tomato chromosomes, 85.1% of which came from intergenic regions. Tomato lncRNAs are shorter and have fewer exons than protein-coding genes, a situation reminiscent of lncRNAs from other model plants. It was also observed that 490 lncRNAs were significantly up-regulated in ripening mutant fruits, and 187 lncRNAs were down-regulated, indicating that lncRNAs could be involved in the regulation of fruit ripening. In line with this, silencing of two novel tomato intergenic lncRNAs, lncRNA1459 and lncRNA1840, resulted in an obvious delay of ripening of wild-type fruit. Overall, the results indicated that lncRNAs might be essential regulators of tomato fruit ripening, which sheds new light on the regulation of fruit ripening. PMID:25948705

  10. A Common Missense Variant in the Glucokinase Regulatory Protein Gene (GCKR) Is Associated with Increased Plasma Triglyceride and C-Reactive Protein but Lower Fasting Glucose Concentrations

    Technology Transfer Automated Retrieval System (TEKTRAN)

    OBJECTIVE-Using the genome-wide-association approach, we recently identified the glucokinase regulatory protein gene (GCKR, rs780094) region as a novel quantitative trait locus for plasma triglyceride concentration in Europeans. Here, we sought to study the association of GCKR variants with metaboli...

  11. Identification of DNA methylation changes at cis-regulatory elements during early steps of HSC differentiation using tagmentation-based whole genome bisulfite sequencing

    PubMed Central

    Lipka, Daniel B; Wang, Qi; Cabezas-Wallscheid, Nina; Klimmeck, Daniel; Weichenhan, Dieter; Herrmann, Carl; Lier, Amelie; Brocks, David; von Paleske, Lisa; Renders, Simon; Wünsche, Peer; Zeisberger, Petra; Gu, Lei; Haas, Simon; Essers, Marieke Ag; Brors, Benedikt; Eils, Roland; Trumpp, Andreas; Milsom, Michael D; Plass, Christoph

    2014-01-01

    Epigenetic alterations during cellular differentiation are a key molecular mechanism which both instructs and reinforces the process of lineage commitment. Within the haematopoietic system, progressive changes in the DNA methylome of haematopoietic stem cells (HSCs) are essential for the effective production of mature blood cells. Inhibition or loss of function of the cellular DNA methylation machinery has been shown to lead to a severe perturbation in blood production and is also an important driver of malignant transformation. HSCs constitute a very rare cell population in the bone marrow, capable of life-long self-renewal and multi-lineage differentiation. The low abundance of HSCs has been a major technological barrier to the global analysis of the CpG methylation status within both HSCs and their immediate progeny, the multipotent progenitors (MPPs). Within this Extra View article, we review the current understanding of how the DNA methylome regulates normal and malignant hematopoiesis. We also discuss the current methodologies that are available for interrogating the DNA methylation status of HSCs and MPPs and describe a new data set that was generated using tagmentation-based whole genome bisulfite sequencing (TWGBS) in order to comprehensively map methylated cytosines using the limited amount of genomic DNA that can be harvested from rare cell populations. Extended analysis of this data set clearly demonstrates the added value of genome-wide sequencing of methylated cytosines and identifies novel important cis-acting regulatory regions that are dynamically remodeled during the first steps of haematopoietic differentiation. PMID:25483069

  12. Secondary structure model of the Mason-Pfizer monkey virus 5' leader sequence: identification of a structural motif common to a variety of retroviruses.

    PubMed Central

    Harrison, G P; Hunter, E; Lever, A M

    1995-01-01

    A stable secondary structure model is presented for the region 3' of the primer-binding site to 130 bases into the gag sequence of the prototype type D retrovirus Mason-Pfizer monkey virus. Using biochemical probing of RNA from this region in association with free energy minimization, we have identified a stem-loop structure in the region, which from other studies has been shown to be important for genomic RNA encapsidation. The structure involves a highly stable stem of five G-C pairs terminating in a heptaloop. Comparison of the Mason-Pfizer monkey virus structure with one predicted for squirrel monkey retrovirus demonstrates an identical stem and a common ACC motif in the loop. Free energy studies of the secondary structure of the 5' regions of eight other retroviruses predict stem loops which have similar GAYC motifs. We believe this may represent a common structural and sequence motif which among other functions may be involved in genomic RNA packaging in these viruses. PMID:7884866

  13. Helena, the hidden beauty: Resolving the most common West Eurasian mtDNA control region haplotype by massively parallel sequencing an Italian population sample.

    PubMed

    Bodner, Martin; Iuvaro, Alessandra; Strobl, Christina; Nagl, Simone; Huber, Gabriela; Pelotti, Susi; Pettener, Davide; Luiselli, Donata; Parson, Walther

    2015-03-01

    The analysis of mitochondrial (mt)DNA is a powerful tool in forensic genetics when nuclear markers fail to give results or maternal relatedness is investigated. The mtDNA control region (CR) contains highly condensed variation and is therefore routinely typed. Some samples exhibit an identical haplotype in this restricted range. Thus, they convey only weak evidence in forensic queries and limited phylogenetic information. However, a CR match does not imply that also the mtDNA coding regions are identical or samples belong to the same phylogenetic lineage. This is especially the case for the most frequent West Eurasian CR haplotype 263G 315.1C 16519C, which is observed in various clades within haplogroup H and occurs at a frequency of 3-4% in many European populations. In this study, we investigated the power of massively parallel complete mtGenome sequencing in 29 Italian samples displaying the most common West Eurasian CR haplotype - and found an unexpected high diversity. Twenty-eight different haplotypes falling into 19 described sub-clades of haplogroup H were revealed in the samples with identical CR sequences. This study demonstrates the benefit of complete mtGenome sequencing for forensic applications to enforce maximum discrimination, more comprehensive heteroplasmy detection, as well as highest phylogenetic resolution. PMID:25303789

  14. Negative regulatory element associated with potentially functional promoter and enhancer elements in the long terminal repeats of endogenous murine leukemia virus-related proviral sequences.

    PubMed Central

    Ch'ang, L Y; Yang, W K; Myer, F E; Yang, D M

    1989-01-01

    Three series of recombinant DNA clones were constructed, with the bacterial chloramphenicol acetyltransferase (CAT) gene as a quantitative indicator, to examine the activities of promoter and enhancer sequence elements in the 5' long terminal repeat (LTR) of murine leukemia virus (MuLV)-related proviral sequences isolated from the mouse genome. Transient CAT expression was determined in mouse NIH 3T3, human HT1080, and mink CCL64 cultured cells transfected with the LTR-CAT constructs. The 700-base-pair (bp) LTRs of three polytropic MuLV-related proviral clones and the 750-bp LTRs of four modified polytropic proviral clones, in complete structures either with or without the adjacent downstream sequences, all showed very little or negligible activities for CAT expression, while ecotropic MuLV LTRs were highly active. The MuLV-related LTRs were divided into three portions and examined separately. The 3' portion of the MuLV-related LTRs that contains the CCAAC and TATAA boxes was found to be a functional promoter, being about one-half to one-third as active as the corresponding portion of ecotropic MuLV LTRs. A MboI-Bg/II fragment, representing the distinct 190- to 200-bp inserted segment in the middle, was found to be a potential enhancer, especially when examined in combination with the simian virus 40 promoter in CCL64 cells. A PstI-MboI fragment of the 5' portion, which contains the protein-binding motifs of the enhancer segment as well as the upstream LTR sequences, showed moderate enhancer activities in CCL6 cells but was virtually inactive in NIH 3T3 cells and HT1080 cells; addition of this fragment to the ecotropic LTR-CAT constructs depressed CAT expression. Further analyses using chimeric LTR constructs located the presence of a strong negative regulatory element within the region containing the 5' portion of the enhancer and the immediate upstream sequences in the MuLV-related LTRs. Images PMID:2542587

  15. Comparisons of Ribosomal Protein Gene Promoters Indicate Superiority of Heterologous Regulatory Sequences for Expressing Transgenes in Phytophthora infestans

    PubMed Central

    Khachatoorian, Careen; Judelson, Howard S.

    2015-01-01

    Molecular genetics approaches in Phytophthora research can be hampered by the limited number of known constitutive promoters for expressing transgenes and the instability of transgene activity. We have therefore characterized genes encoding the cytoplasmic ribosomal proteins of Phytophthora and studied their suitability for expressing transgenes in P. infestans. Phytophthora spp. encode a standard complement of 79 cytoplasmic ribosomal proteins. Several genes are duplicated, and two appear to be pseudogenes. Half of the genes are expressed at similar levels during all stages of asexual development, and we discovered that the majority share a novel promoter motif named the PhRiboBox. This sequence is enriched in genes associated with transcription, translation, and DNA replication, including tRNA and rRNA biogenesis. Promoters from the three P. infestans genes encoding ribosomal proteins S9, L10, and L23 and their orthologs from P. capsici were tested for their ability to drive transgenes in stable transformants of P. infestans. Five of the six promoters yielded strong expression of a GUS reporter, but the stability of expression was higher using the P. capsici promoters. With the RPS9 and RPL10 promoters of P. infestans, about half of transformants stopped making GUS over two years of culture, while their P. capsici orthologs conferred stable expression. Since cross-talk between native and transgene loci may trigger gene silencing, we encourage the use of heterologous promoters in transformation studies. PMID:26716454

  16. Complete genome sequence and transcriptomics analyses reveal pigment biosynthesis and regulatory mechanisms in an industrial strain, Monascus purpureus YY-1

    PubMed Central

    Yang, Yue; Liu, Bin; Du, Xinjun; Li, Ping; Liang, Bin; Cheng, Xiaozhen; Du, Liangcheng; Huang, Di; Wang, Lei; Wang, Shuo

    2015-01-01

    Monascus has been used to produce natural colorants and food supplements for more than one thousand years, and approximately more than one billion people eat Monascus-fermented products during their daily life. In this study, using next-generation sequencing and optical mapping approaches, a 24.1-Mb complete genome of an industrial strain, Monascus purpureus YY-1, was obtained. This genome consists of eight chromosomes and 7,491 genes. Phylogenetic analysis at the genome level provides convincing evidence for the evolutionary position of M. purpureus. We provide the first comprehensive prediction of the biosynthetic pathway for Monascus pigment. Comparative genomic analyses show that the genome of M. purpureus is 13.6–40% smaller than those of closely related filamentous fungi and has undergone significant gene losses, most of which likely occurred during its specialized adaptation to starch-based foods. Comparative transcriptome analysis reveals that carbon starvation stress, resulting from the use of relatively low-quality carbon sources, contributes to the high yield of pigments by repressing central carbon metabolism and augmenting the acetyl-CoA pool. Our work provides important insights into the evolution of this economically important fungus and lays a foundation for future genetic manipulation and engineering of this strain. PMID:25660389

  17. Complete genome sequence and transcriptomics analyses reveal pigment biosynthesis and regulatory mechanisms in an industrial strain, Monascus purpureus YY-1.

    PubMed

    Yang, Yue; Liu, Bin; Du, Xinjun; Li, Ping; Liang, Bin; Cheng, Xiaozhen; Du, Liangcheng; Huang, Di; Wang, Lei; Wang, Shuo

    2015-01-01

    Monascus has been used to produce natural colorants and food supplements for more than one thousand years, and approximately more than one billion people eat Monascus-fermented products during their daily life. In this study, using next-generation sequencing and optical mapping approaches, a 24.1-Mb complete genome of an industrial strain, Monascus purpureus YY-1, was obtained. This genome consists of eight chromosomes and 7,491 genes. Phylogenetic analysis at the genome level provides convincing evidence for the evolutionary position of M. purpureus. We provide the first comprehensive prediction of the biosynthetic pathway for Monascus pigment. Comparative genomic analyses show that the genome of M. purpureus is 13.6-40% smaller than those of closely related filamentous fungi and has undergone significant gene losses, most of which likely occurred during its specialized adaptation to starch-based foods. Comparative transcriptome analysis reveals that carbon starvation stress, resulting from the use of relatively low-quality carbon sources, contributes to the high yield of pigments by repressing central carbon metabolism and augmenting the acetyl-CoA pool. Our work provides important insights into the evolution of this economically important fungus and lays a foundation for future genetic manipulation and engineering of this strain. PMID:25660389

  18. Rearrangements of the transcriptional regulatory networks of metabolic pathways in fungi

    PubMed Central

    Lavoie, Hugo; Hogues, Hervé; Whiteway, Malcolm

    2013-01-01

    Growing evidence suggests that transcriptional regulatory networks in many organisms are highly flexible. Here, we discuss the evolution of transcriptional regulatory networks governing the metabolic machinery of sequenced ascomycetes. In particular, recent work has shown that transcriptional rewiring is common in regulons controlling processes such as production of ribosome components and metabolism of carbohydrates and lipids. We note that dramatic rearrangements of the transcriptional regulatory components of metabolic functions have occurred among ascomycetes species. PMID:19875326

  19. Lung Parenchymal Signal Intensity in MRI: A Technical Review with Educational Aspirations Regarding Reversible Versus Irreversible Transverse Relaxation Effects in Common Pulse Sequences

    PubMed Central

    MULKERN, ROBERT; HAKER, STEVEN; MAMATA, HATSUHO; LEE, EDWARD; MITSOURAS, DIMITRIOS; OSHIO, KOICHI; BALASUBRAMANIAN, MUKUND; HATABU, HIROTO

    2014-01-01

    Lung parenchyma is challenging to image with proton MRI. The large air space results in ~l/5th as many signal-generating protons compared to other organs. Air/tissue magnetic susceptibility differences lead to strong magnetic field gradients throughout the lungs and to broad frequency distributions, much broader than within other organs. Such distributions have been the subject of experimental and theoretical analyses which may reveal aspects of lung microarchitecture useful for diagnosis. Their most immediate relevance to current imaging practice is to cause rapid signal decays, commonly discussed in terms of short T2* values of 1 ms or lower at typical imaging field strengths. Herein we provide a brief review of previous studies describing and interpreting proton lung spectra. We then link these broad frequency distributions to rapid signal decays, though not necessarily the exponential decays generally used to define T2* values. We examine how these decays influence observed signal intensities and spatial mapping features associated with the most prominent torso imaging sequences, including spoiled gradient and spin echo sequences. Effects of imperfect refocusing pulses on the multiple echo signal decays in single shot fast spin echo (SSFSE) sequences and effects of broad frequency distributions on balanced steady state free precession (bSSFP) sequence signal intensities are also provided. The theoretical analyses are based on the concept of explicitly separating the effects of reversible and irreversible transverse relaxation processes, thus providing a somewhat novel and more general framework from which to estimate lung signal intensity behavior in modern imaging practice. PMID:25228852

  20. Lung Parenchymal Signal Intensity in MRI: A Technical Review with Educational Aspirations Regarding Reversible Versus Irreversible Transverse Relaxation Effects in Common Pulse Sequences.

    PubMed

    Mulkern, Robert; Haker, Steven; Mamata, Hatsuho; Lee, Edward; Mitsouras, Dimitrios; Oshio, Koichi; Balasubramanian, Mukund; Hatabu, Hiroto

    2014-03-01

    Lung parenchyma is challenging to image with proton MRI. The large air space results in ~l/5th as many signal-generating protons compared to other organs. Air/tissue magnetic susceptibility differences lead to strong magnetic field gradients throughout the lungs and to broad frequency distributions, much broader than within other organs. Such distributions have been the subject of experimental and theoretical analyses which may reveal aspects of lung microarchitecture useful for diagnosis. Their most immediate relevance to current imaging practice is to cause rapid signal decays, commonly discussed in terms of short T2 (*) values of 1 ms or lower at typical imaging field strengths. Herein we provide a brief review of previous studies describing and interpreting proton lung spectra. We then link these broad frequency distributions to rapid signal decays, though not necessarily the exponential decays generally used to define T2 (*) values. We examine how these decays influence observed signal intensities and spatial mapping features associated with the most prominent torso imaging sequences, including spoiled gradient and spin echo sequences. Effects of imperfect refocusing pulses on the multiple echo signal decays in single shot fast spin echo (SSFSE) sequences and effects of broad frequency distributions on balanced steady state free precession (bSSFP) sequence signal intensities are also provided. The theoretical analyses are based on the concept of explicitly separating the effects of reversible and irreversible transverse relaxation processes, thus providing a somewhat novel and more general framework from which to estimate lung signal intensity behavior in modern imaging practice. PMID:25228852

  1. Transgenic LacZ under control of Hec-6st regulatory sequences recapitulates endogenous gene expression on high endothelial venules

    PubMed Central

    Liao, Shan; Bentley, Kevin; Lebrun, Marielle; Lesslauer, Werner; Ruddle, Frank H.; Ruddle, Nancy H.

    2007-01-01

    Hec-6st is a highly specific high endothelial venule (HEV) gene that is crucial for regulating lymphocyte homing to lymph nodes (LN). The enzyme is also expressed in HEV-like vessels in tertiary lymphoid organs that form in chronic inflammation in autoimmunity, graft rejection, and microbial infection. Understanding the molecular nature of Hec-6st regulation is crucial for elucidating its function in development and disease. However, studies of HEV are limited because of the difficulties in isolating and maintaining the unique characteristics of these vessels in vitro. The novel pClasper yeast homologous recombination technique was used to isolate from a BAC clone a 60-kb DNA fragment that included the Hec-6st (Chst4) gene with flanking sequences. Transgenic mice were generated with the β-galactosidase (LacZ) reporter gene inserted in-frame in the exon II of Hec-6st within the isolated BAC DNA fragment. LacZ was expressed specifically on HEV in LN, as indicated by its colocalization with peripheral node vascular addressin. LacZ was increased in nasal-associated lymphoid tissue during development and was reduced in LN and nasal-associated lymphoid tissue by LTβR-Ig (lymphotoxin-β receptor human Ig fusion protein) treatment in a manner identical to the endogenous gene. The transgene was expressed at high levels in lymphoid accumulations with characteristics of tertiary lymphoid organs in the salivary glands of aged mice. Thus, the Hec-6s-LacZ construct faithfully reproduces Hec-6st tissue-specific expression and can be used in further studies to drive expression of reporter or effector genes, which could visualize or inhibit HEV in autoimmunity. PMID:17360566

  2. In planta analysis of a cis-regulatory cytokinin response motif in Arabidopsis and identification of a novel enhancer sequence.

    PubMed

    Ramireddy, Eswarayya; Brenner, Wolfram G; Pfeifer, Andreas; Heyl, Alexander; Schmülling, Thomas

    2013-07-01

    The phytohormone cytokinin plays a key role in regulating plant growth and development, and is involved in numerous physiological responses to environmental changes. The type-B response regulators, which regulate the transcription of cytokinin response genes, are a part of the cytokinin signaling system. Arabidopsis thaliana encodes 11 type-B response regulators (type-B ARRs), and some of them were shown to bind in vitro to the core cytokinin response motif (CRM) 5'-(A/G)GAT(T/C)-3' or, in the case of ARR1, to an extended motif (ECRM), 5'-AAGAT(T/C)TT-3'. Here we obtained in planta proof for the functionality of the latter motif. Promoter deletion analysis of the primary cytokinin response gene ARR6 showed that a combination of two extended motifs within the promoter is required to mediate the full transcriptional activation by ARR1 and other type-B ARRs. CRMs were found to be over-represented in the vicinity of ECRMs in the promoters of cytokinin-regulated genes, suggesting their functional relevance. Moreover, an evolutionarily conserved 27 bp long T-rich region between -220 and -193 bp was identified and shown to be required for the full activation by type-B ARRs and the response to cytokinin. This novel enhancer is not bound by the DNA-binding domain of ARR1, indicating that additional proteins might be involved in mediating the transcriptional cytokinin response. Furthermore, genome-wide expression profiling identified genes, among them ARR16, whose induction by cytokinin depends on both ARR1 and other specific type-B ARRs. This together with the ECRM/CRM sequence clustering indicates cooperative action of different type-B ARRs for the activation of particular target genes. PMID:23620480

  3. Inactivation, sequence, and lacZ fusion analysis of a regulatory locus required for repression of nitrogen fixation genes in Rhodobacter capsulatus.

    PubMed Central

    Kranz, R G; Pace, V M; Caldicott, I M

    1990-01-01

    Transcription of the genes that code for proteins involved in nitrogen fixation in free-living diazotrophs is typically repressed by high internal oxygen concentrations or exogenous fixed nitrogen. The DNA sequence of a regulatory locus required for repression of Rhodobacter capsulatus nitrogen fixation genes was determined. It was shown that this locus, defined by Tn5 insertions and by ethyl methanesulfonate-derived mutations, is homologous to the glnB gene of other organisms. The R. capsulatus glnB gene was upstream of glnA, the gene for glutamine synthetase, in a glnBA operon. beta-Galactosidase expression from an R. capsulatus glnBA-lacZ translational fusion was increased twofold in cells induced by nitrogen limitation relative to that in cells under nitrogen-sufficient conditions. R. capsulatus nifR1, a gene that was previously shown to be homologous to ntrC and that is required for transcription of nitrogen fixation genes, was responsible for approximately 50% of the transcriptional activation of this glnBA fusion in cells induced under nitrogen-limiting conditions. R. capsulatus GLNB, NIFR1, and NIFR2 (a protein homologous to NTRB) were proposed to transduce the nitrogen status in the cell into repression or activation of other R. capsulatus nif genes. Repression of nif genes in response to oxygen was still present in R. capsulatus glnB mutants and must have occurred at a different level of control in the regulatory circuit. Images FIG. 4 FIG. 5 PMID:2152916

  4. The kil-kor regulon of broad-host-range plasmid RK2: nucleotide sequence, polypeptide product, and expression of regulatory gene korC.

    PubMed Central

    Kornacki, J A; Burlage, R S; Figurski, D H

    1990-01-01

    Broad-host-range plasmid RK2 encodes several kil operons (kilA, kilB, kilC, kilE) whose expression is potentially lethal to Escherichia coli host cells. The kil operons and the RK2 replication initiator gene (trfA) are coregulated by various combinations of kor genes (korA, korB, korC, korE). This regulatory network is called the kil-kor regulon. Presented here are studies on the structure, product, and expression of korC. Genetic mapping revealed the precise location of korC in a region near transposon Tn1. We determined the nucleotide sequence of this region and identified the korC structural gene by analysis of korC mutants. Sequence analysis predicts the korC product to be a polypeptide of 85 amino acids with a molecular mass of 9,150 daltons. The KorC polypeptide was identified in vivo by expressing wild-type and mutant korC alleles from a bacteriophage T7 RNA polymerase-dependent promoter. The predicted structure of KorC polypeptide has a net positive charge and a helix-turn-helix region similar to those of known DNA-binding proteins. These properties are consistent with the repressorlike function of KorC protein, and we discuss the evidence that KorA and KorC proteins act as corepressors in the control of the kilC and kilE operons. Finally, we show that korC is expressed from the bla promoters within the upstream transposon Tn1, suggesting that insertion of Tn1 interrupted a plasmid operon that may have originally included korC and kilC. Images PMID:2160936

  5. Cis-regulatory mutations in human disease

    PubMed Central

    2009-01-01

    Cis-acting regulatory sequences are required for the proper temporal and spatial control of gene expression. Variation in gene expression is highly heritable and a significant determinant of human disease susceptibility. The diversity of human genetic diseases attributed, in whole or in part, to mutations in non-coding regulatory sequences is on the rise. Improvements in genome-wide methods of associating genetic variation with human disease and predicting DNA with cis-regulatory potential are two of the major reasons for these recent advances. This review will highlight select examples from the literature that have successfully integrated genetic and genomic approaches to uncover the molecular basis by which cis-regulatory mutations alter gene expression and contribute to human disease. The fine mapping of disease-causing variants has led to the discovery of novel cis-acting regulatory elements that, in some instances, are located as far away as 1.5 Mb from the target gene. In other cases, the prior knowledge of the regulatory landscape surrounding the gene of interest aided in the selection of enhancers for mutation screening. The success of these studies should provide a framework for following up on the large number of genome-wide association studies that have identified common variants in non-coding regions of the genome that associate with increased risk of human diseases including, diabetes, autism, Crohn's, colorectal cancer, and asthma, to name a few. PMID:19641089

  6. Push it to the limit: Characterizing the convergence of common sequences of basis sets for intermolecular interactions as described by density functional theory

    NASA Astrophysics Data System (ADS)

    Witte, Jonathon; Neaton, Jeffrey B.; Head-Gordon, Martin

    2016-05-01

    With the aim of systematically characterizing the convergence of common families of basis sets such that general recommendations for basis sets can be made, we have tested a wide variety of basis sets against complete-basis binding energies across the S22 set of intermolecular interactions—noncovalent interactions of small and medium-sized molecules consisting of first- and second-row atoms—with three distinct density functional approximations: SPW92, a form of local-density approximation; B3LYP, a global hybrid generalized gradient approximation; and B97M-V, a meta-generalized gradient approximation with nonlocal correlation. We have found that it is remarkably difficult to reach the basis set limit; for the methods and systems examined, the most complete basis is Jensen's pc-4. The Dunning correlation-consistent sequence of basis sets converges slowly relative to the Jensen sequence. The Karlsruhe basis sets are quite cost effective, particularly when a correction for basis set superposition error is applied: counterpoise-corrected def2-SVPD binding energies are better than corresponding energies computed in comparably sized Dunning and Jensen bases, and on par with uncorrected results in basis sets 3-4 times larger. These trends are exhibited regardless of the level of density functional approximation employed. A sense of the magnitude of the intrinsic incompleteness error of each basis set not only provides a foundation for guiding basis set choice in future studies but also facilitates quantitative comparison of existing studies on similar types of systems.

  7. Genetic structure of the widespread and common Mediterranean bryophyte Pleurochaete squarrosa (Brid.) Lindb. (Pottiaceae) - evidence from nuclear and plastidic DNA sequence variation and allozymes.

    PubMed

    Grundmann, Michael; Ansell, Stephen W; Russell, Stephen J; Koch, Marcus A; Vogel, Johannes C

    2007-02-01

    The Mediterranean Basin as one the world's most biologically diverse regions provides an interesting area for the study of plant evolution and spatial structure in plant populations. The dioecious moss Pleurochaete squarrosa is a widespread and common bryophyte in the Mediterranean Basin. Thirty populations were sampled for a study on molecular diversity and genetic structure, covering most major islands and mainland populations from Europe and Africa. A significant decline in nuclear and chloroplast sequence and allozyme variation within populations from west to east was observed. While DNA sequence data showed patterns of isolation by distance, allozyme markers did not. Instead, their considerable interpopulation genetic differentiation appeared to be unrelated to geographic distance. Similar high values for coefficients of gene diversity (G(ST)) in all data sets provided evidence of geographic isolation and limited gene flow among populations (i) within islands, (ii) within mainland areas, and (iii) between islands and mainland. Notably, populations in continental Spain are strongly genetically isolated from all other investigated areas. Surprisingly, there was no difference in gene diversity and G(ST) between islands and mainland areas. Thus, we conclude that large Mediterranean islands may function as 'mainland' for bryophytes. This hypothesis and its implication for conservation biology of cryptogamic plants warrant further investigation. While sexually reproducing populations were found all over the Mediterranean Basin, high levels of multilocus linkage disequilibrium provide evidence of mainly vegetative propagation even in populations where sexual reproduction was observed. PMID:17284206

  8. Vision from next generation sequencing: Multi-dimensional genome-wide analysis for producing gene regulatory networks underlying retinal development, aging and disease

    PubMed Central

    Yang, Hyun-Jin; Ratnapriya, Rinki; Cogliati, Tiziana; Kim, Jung-Woong; Swaroop, Anand

    2015-01-01

    Genomics and genetics have invaded all aspects of biology and medicine, opening uncharted territory for scientific exploration. The definition of “gene” itself has become ambiguous, and the central dogma is continuously being revised and expanded. Computational biology and computational medicine are no longer intellectual domains of the chosen few. Next generation sequencing (NGS) technology, together with novel methods of pattern recognition and network analyses, has revolutionized the way we think about fundamental biological mechanisms and cellular pathways. In this review, we discuss NGS-based genome-wide approaches that can provide deeper insights into retinal development, aging and disease pathogenesis. We first focus on gene regulatory networks (GRNs) that govern the differentiation of retinal photoreceptors and modulate adaptive response during aging. Then, we discuss NGS technology in the context of retinal disease and develop a vision for therapies based on network biology. We should emphasize that basic strategies for network construction and analyses can be transported to any tissue or cell type. We believe that specific and uniform guidelines are required for generation of genome, transcriptome and epigenome data to facilitate comparative analysis and integration of multi-dimensional data sets, and for constructing networks underlying complex biological processes. As cellular homeostasis and organismal survival are dependent on gene-gene and gene-environment interactions, we believe that network-based biology will provide the foundation for deciphering disease mechanisms and discovering novel drug targets for retinal neurodegenerative diseases. PMID:25668385

  9. Vision from next generation sequencing: multi-dimensional genome-wide analysis for producing gene regulatory networks underlying retinal development, aging and disease.

    PubMed

    Yang, Hyun-Jin; Ratnapriya, Rinki; Cogliati, Tiziana; Kim, Jung-Woong; Swaroop, Anand

    2015-05-01

    Genomics and genetics have invaded all aspects of biology and medicine, opening uncharted territory for scientific exploration. The definition of "gene" itself has become ambiguous, and the central dogma is continuously being revised and expanded. Computational biology and computational medicine are no longer intellectual domains of the chosen few. Next generation sequencing (NGS) technology, together with novel methods of pattern recognition and network analyses, has revolutionized the way we think about fundamental biological mechanisms and cellular pathways. In this review, we discuss NGS-based genome-wide approaches that can provide deeper insights into retinal development, aging and disease pathogenesis. We first focus on gene regulatory networks (GRNs) that govern the differentiation of retinal photoreceptors and modulate adaptive response during aging. Then, we discuss NGS technology in the context of retinal disease and develop a vision for therapies based on network biology. We should emphasize that basic strategies for network construction and analyses can be transported to any tissue or cell type. We believe that specific and uniform guidelines are required for generation of genome, transcriptome and epigenome data to facilitate comparative analysis and integration of multi-dimensional data sets, and for constructing networks underlying complex biological processes. As cellular homeostasis and organismal survival are dependent on gene-gene and gene-environment interactions, we believe that network-based biology will provide the foundation for deciphering disease mechanisms and discovering novel drug targets for retinal neurodegenerative diseases. PMID:25668385

  10. Traditional and targeted exome sequencing reveals common, rare and novel functional deleterious variants in RET signaling complex in a cohort of living US patients with urinary tract malformations

    PubMed Central

    Chatterjee, Rajshekhar; Ramos, Enrique; Hoffman, Mary; VanWinkle, Jessica; Martin, Daniel R; Davis, Thomas K; Hoshi, Masato; Hmiel, Stanley P; Beck, Anne; Hruska, Keith; Coplen, Doug; Liapis, Helen; Mitra, Robi; Druley, Todd; Austin, Paul; Jain, Sanjay

    2013-01-01

    Signaling by the glial cell line-derived neurotrophic factor (GDNF)-RET receptor tyrosine kinase and SPRY1, a RET repressor, is essential for early urinary tract development. Individual or a combination of GDNF, RET and SPRY1 mutant alleles in mice cause renal malformations reminiscent of congenital anomalies of the kidney or urinary tract (CAKUT) in humans and distinct from renal agenesis phenotype in complete GDNF or RET null mice. We sequenced GDNF, SPRY1 and RET in 122 unrelated living CAKUT patients to discover deleterious mutations that cause CAKUT. Novel or rare deleterious mutations in GDNF or RET were found in 6 unrelated patients. A family with duplicated collecting system had a novel mutation, RETR831Q, which showed markedly decreased GDNF dependent MAPK activity. Two patients with RET-G691S polymorphism harbored additional rare non-synonymous variants GDNF-R93W and RET-R982C. The patient with double RET-G691S/R982C genotype had multiple defects including renal dysplasia, megaureters and cryptorchidism. Presence of both mutations were necessary to affect RET activity. Targeted whole exome and next-generation sequencing revealed a novel deleterious mutation G443D in GFRα1, the co-receptor for RET, in this patient. Pedigree analysis indicated that the GFRα1 mutation was inherited from the unaffected mother and the RET mutations from the unaffected father. Our studies indicate that 5% of living CAKUT patients harbor deleterious rare variants or novel mutations in GDNF-GFRα1-RET pathway. We provide evidence for the coexistence of deleterious rare and common variants in genes in the same pathway as a cause of CAKUT and discovered novel phenotypes associated with the RET pathway. PMID:22729463

  11. Push it to the limit: Characterizing the convergence of common sequences of basis sets for intermolecular interactions as described by density functional theory.

    PubMed

    Witte, Jonathon; Neaton, Jeffrey B; Head-Gordon, Martin

    2016-05-21

    With the aim of systematically characterizing the convergence of common families of basis sets such that general recommendations for basis sets can be made, we have tested a wide variety of basis sets against complete-basis binding energies across the S22 set of intermolecular interactions-noncovalent interactions of small and medium-sized molecules consisting of first- and second-row atoms-with three distinct density functional approximations: SPW92, a form of local-density approximation; B3LYP, a global hybrid generalized gradient approximation; and B97M-V, a meta-generalized gradient approximation with nonlocal correlation. We have found that it is remarkably difficult to reach the basis set limit; for the methods and systems examined, the most complete basis is Jensen's pc-4. The Dunning correlation-consistent sequence of basis sets converges slowly relative to the Jensen sequence. The Karlsruhe basis sets are quite cost effective, particularly when a correction for basis set superposition error is applied: counterpoise-corrected def2-SVPD binding energies are better than corresponding energies computed in comparably sized Dunning and Jensen bases, and on par with uncorrected results in basis sets 3-4 times larger. These trends are exhibited regardless of the level of density functional approximation employed. A sense of the magnitude of the intrinsic incompleteness error of each basis set not only provides a foundation for guiding basis set choice in future studies but also facilitates quantitative comparison of existing studies on similar types of systems. PMID:27208948

  12. Epithelial and endothelial expression of the green fluorescent protein reporter gene under the control of bovine prion protein (PrP) gene regulatory sequences in transgenic mice

    NASA Astrophysics Data System (ADS)

    Lemaire-Vieille, Catherine; Schulze, Tobias; Podevin-Dimster, Valérie; Follet, Jérome; Bailly, Yannick; Blanquet-Grossard, Françoise; Decavel, Jean-Pierre; Heinen, Ernst; Cesbron, Jean-Yves

    2000-05-01

    The expression of the cellular form of the prion protein (PrPc) gene is required for prion replication and neuroinvasion in transmissible spongiform encephalopathies. The identification of the cell types expressing PrPc is necessary to understanding how the agent replicates and spreads from peripheral sites to the central nervous system. To determine the nature of the cell types expressing PrPc, a green fluorescent protein reporter gene was expressed in transgenic mice under the control of 6.9 kb of the bovine PrP gene regulatory sequences. It was shown that the bovine PrP gene is expressed as two populations of mRNA differing by alternative splicing of one 115-bp 5' untranslated exon in 17 different bovine tissues. The analysis of transgenic mice showed reporter gene expression in some cells that have been identified as expressing PrP, such as cerebellar Purkinje cells, lymphocytes, and keratinocytes. In addition, expression of green fluorescent protein was observed in the plexus of the enteric nervous system and in a restricted subset of cells not yet clearly identified as expressing PrP: the epithelial cells of the thymic medullary and the endothelial cells of both the mucosal capillaries of the intestine and the renal capillaries. These data provide valuable information on the distribution of PrPc at the cellular level and argue for roles of the epithelial and endothelial cells in the spread of infection from the periphery to the brain. Moreover, the transgenic mice described in this paper provide a model that will allow for the study of the transcriptional activity of the PrP gene promoter in response to scrapie infection.

  13. Interferon regulatory factor 4b (IRF4b) in Japanese flounder, Paralichthys olivaceus: Sequencing, ubiquitous tissue distribution and inducible expression by poly(I:C) and DNA virus.

    PubMed

    Liu, Dahai; Chen, Jinjing; Zhang, Haiyan; Hu, Mengzhu; Lou, Huimin; Liu, Qiuming; Zhang, Shicui; Hu, Guobin

    2016-09-01

    Interferon regulatory factor 4 (IRF4) in mammals is known to be critical in regulation of development and functions of lymphomyeloid cell lineages. Recent studies have demonstrated its involvement in immune responses to bacterial and viral challenges in teleosts. In this study, an IRF4 gene was cloned from Japanese flounder (Paralichthys olivaceus) and its expression in response to polyinosinic:polycytidylic acid [poly(I:C)] and lymphocystis disease virus (LCDV) stimulations was studied in vivo. The cloned gene spans over 5.9 kb, comprises eight exons and seven introns and encodes a putative protein of 456 amino acids. The deduced amino acid sequence possesses a conserved DNA-binding domain (DBD), an IRF-association domain (IAD) and a nuclear localization signal (NLS). Phylogenetic analysis clustered it into the teleost IRF4b clade and, thus, it was named Paralichthys olivaceus (Po)IRF4b. The constitutive expression of PoIRF4b transcripts was detectable in all examined organs, with highest levels found in lymphomyeloid-rich tissues. They were induced by both poly(I:C) and LCDV with a similar inducibility in immune or non-immune organs. Two waves of induced expression of PoIRF4b were observed with the two stimuli during a 7-day time course in the immune organs, with the early-phase induction being stronger. The maximum increases of PoIRF4b transcript levels ranged from 1.3 to 4.0-fold and appeared at day 1-5 post-injection depending on different organs and stimuli. In both stimulation cases, the strongest induction was detected in spleen and the weakest in muscle. These results indicate that PoIRF4b may participate in regulation of immune responses of flounders to both RNA and DNA virus infections. PMID:27084058

  14. Using invasion process theory to enhance the understanding and management of introduced species: a case study reconstructing the invasion sequence of the common myna (Acridotheres tristis).

    PubMed

    Grarock, Kate; Lindenmayer, David B; Wood, Jeff T; Tidemann, Chris R

    2013-11-15

    Research indicates that invasion is a multi-step process, where each stage is contingent on the stage that precedes it. Numerous hypotheses addressing the factors that influence each stage of the invasion process have been formulated, but how well does this theory match what occurs in the natural world? We created a general conceptual model for the invasion process based on invasion theory. Using a composite 41-year data set, we then reconstructed the invasion sequence of the common myna (Acridotheres tristis) to investigate the similarities between invasion theory and this observed invasion. We observed a lag period before population growth of 2.7 (±0.3) years, a maximum rate of population growth of 24.1 (±6.4) birds per km(2) per year, a lag period before spreading of six years and an average spreading rate of 0.4 km per year. The length and duration of these stages correspond closely with what invasion process theory would anticipate. We suggest that a conceptual model, coupled with basic species, environment and event information, could be a useful tool to enhance the understanding and management of invasions. PMID:23995141

  15. Dynamic control of nutrient-removal from industrial wastewater in a sequencing batch reactor, using common and low-cost online sensors.

    PubMed

    Dries, Jan

    2016-01-01

    On-line control of the biological treatment process is an innovative tool to cope with variable concentrations of chemical oxygen demand and nutrients in industrial wastewater. In the present study we implemented a simple dynamic control strategy for nutrient-removal in a sequencing batch reactor (SBR) treating variable tank truck cleaning wastewater. The control system was based on derived signals from two low-cost and robust sensors that are very common in activated sludge plants, i.e. oxidation reduction potential (ORP) and dissolved oxygen. The amount of wastewater fed during anoxic filling phases, and the number of filling phases in the SBR cycle, were determined by the appearance of the 'nitrate knee' in the profile of the ORP. The phase length of the subsequent aerobic phases was controlled by the oxygen uptake rate measured online in the reactor. As a result, the sludge loading rate (F/M ratio), the volume exchange rate and the SBR cycle length adapted dynamically to the activity of the activated sludge and the actual characteristics of the wastewater, without affecting the final effluent quality. PMID:26901715

  16. Comparative inter-strain sequence analysis of the putative regulatory region of murine psychostimulant-regulated gene GNB1 (G protein beta 1 subunit gene).

    PubMed

    Kitanaka, Nobue; Kitanaka, Junichi; Walther, Donna; Wang, Xiao-Bing; Uhl, George R

    2003-08-01

    We isolated a cDNA clone from a murine genomic library of C57BL/6 strain, carrying 13.8 kb of nucleotides including exon 1 of heterotrimeric GTP-binding protein beta 1 subunit gene (genetic symbol, GNB1) and 10.6 kb of the 5' flanking region. Sequence comparison with GNB1 gene locus from 129Sv strain revealed a 0.2% divergence in a 13.2 kb common region between these two strains. The divergence consisted of eight single nucleotide polymorphisms, three insertions and one deletion, with 129Sv used as the reference. The exon 1 and the putative regulation elements, such as cyclic AMP response element, AP1, AP2, Sp1 and nuclear factor-kappa B recognition sites, were perfectly conserved. The expression of GNB1 mRNA was significantly increased in mouse striatum 2 h after single methamphetamine administration with an approximately 150% expression level compared with the basal level. In contrast, no change in the expression level was observed in the cerebral cortex. After the chronic methamphetamine treatment regimen, the expression level of GNB1 mRNA did not change in any brain regions examined. These results suggest (1) that the 5' flanking nucleotide sequence of GNB1 gene was strictly conserved for its possible contribution to the same change in the expression level between the mouse strains in response to psychostimulants and (2) that the initial process of development of behavioral sensitization appeared to occur parallel to the significant increase in the expression level of GNB1 gene in the mouse striatum. PMID:14631649

  17. Sequence and Expression Analysis of Interferon Regulatory Factor 10 (IRF10) in Three Diverse Teleost Fish Reveals Its Role in Antiviral Defense

    PubMed Central

    Xu, Qiaoqing; Jiang, Yousheng; Wangkahart, Eakapol; Zou, Jun; Chang, Mingxian; Yang, Daiqin; Secombes, Chris J.; Nie, Pin; Wang, Tiehui

    2016-01-01

    Background Interferon regulatory factor (IRF) 10 was first found in birds and is present in the genome of other tetrapods (but not humans and mice), as well as in teleost fish. The functional role of IRF10 in vertebrate immunity is relatively unknown compared to IRF1-9. The target of this research was to clone and characterize the IRF10 genes in three economically important fish species that will facilitate future evaluation of this molecule in fish innate and adaptive immunity. Molecular Characterization of IRF10 in Three Fish Species In the present study, a single IRF10 gene was cloned in grass carp Ctenopharyngodon idella and Asian swamp eel Monopterus albus, and two, named IRF10a and IRF10b, in rainbow trout Oncorhynchus mykiss. The fish IRF10 molecules share highest identities to other vertebrate IRF10s, and have a well conserved DNA binding domain, IRF-associated domain, and an 8 exon/7 intron structure with conserved intron phase. The presence of an upstream ATG or open reading frame (ORF) in the 5’-untranslated region of different fish IRF10 cDNA sequences suggests potential regulation at the translational level, and this has been verified by in vitro transcription/translation experiments of the trout IRF10a cDNA, but would still need to be validated in fish cells. Expression Analysis of IRF10 In Vivo and In Vitro Both trout IRF10 paralogues are highly expressed in thymus, blood and spleen but are relatively low in head kidney and caudal kidney. Trout IRF10b expression is significantly higher than IRF10a in integumentary tissues i.e. gills, scales, skin, intestine, adipose fin and tail fins, suggesting that IRF10b may be more important in mucosal immunity. The expression of both trout IRF10 paralogues is up-regulated by recombinant IFN-γ. The expression of the IRF10 genes is highly induced by Poly I:C in vitro and in vivo, and by viral infection, but is less responsive to peptidoglycan and bacterial infection, suggesting an important role of fish IRF10

  18. Regulatory RNAs

    PubMed Central

    Vazquez-Anderson, Jorge; Contreras, Lydia M

    2013-01-01

    RNAs have many important functional properties, including that they are independently controllable and highly tunable. As a result of these advantageous properties, their use in a myriad of sophisticated devices has been widely explored. Yet, the exploitation of RNAs for synthetic applications is highly dependent on the ability to characterize the many new molecules that continue to be discovered by large-scale sequencing and high-throughput screening techniques. In this review, we present an exhaustive survey of the most recent synthetic bacterial riboswitches and small RNAs while emphasizing their virtues in gene expression management. We also explore the use of these RNA components as building blocks in the RNA synthetic biology toolbox and discuss examples of synthetic RNA components used to rewire bacterial regulatory circuitry. We anticipate that this field will expand its catalog of smart devices by mimicking and manipulating natural RNA mechanisms and functions. PMID:24356572

  19. Sequencing of candidate genes in Dominican families implicates both rare exonic and common non-exonic variants for carotid intima-media thickness at bifurcation.

    PubMed

    Wang, Liyong; Beecham, Ashley; Dueker, Nicole; Blanton, Susan H; Rundek, Tatjana; Sacco, Ralph L

    2015-10-01

    Through linkage and tagSNP-based association studies in 100 Dominican Republic (DR) families, we previously identified ANLN and AOAH (7p14.3) as candidate genes for carotid intima-media thickness at bifurcation (bIMT). Introns, exons, and flanking regions of ANLN and AOAH were re-sequenced in 151 individuals from nine families with evidence for linkage at 7p14.3. For common variants [CV, minor allele frequency (MAF) ≥5 %], single variant-based analysis was performed. For rare variants (RV, MAF < 5 %), gene-based analysis aggregating all RVs within a gene was performed. CV analysis revealed the strongest signal at rs3815483 (P = 0.0003) in ANLN and rs60023210 (P = 0.00005) in AOAH. In ANLN, RV analysis found suggestive evidence for association with exonic RVs (P = 0.08), and in particular non-synonymous RVs (P = 0.04) but not with all RVs (P = 0.15). The variant alleles of all non-synonymous RVs segregated with the major allele of rs3815483 and were associated with lower bIMT while a novel synonymous RV segregated with the minor allele of rs3815483 and was associated with greater bIMT. Additional analysis in 561 DR individuals found suggestive evidence for association with all ANLN non-synonymous RVs (P = 0.08). In AOAH, no evidence for association with RVs was detected. Instead, conditional analysis revealed that multiple independent intronic CVs are associated with bIMT in addition to rs60023210. We demonstrate the utility of using family-based studies to evaluate the contribution of RVs. Our data suggest two modes of genetic architecture underlying the linkage and association at ANLN (multiple exonic RVs) and AOAH (multiple intronic CVs with uncharacterized functions). PMID:26319989

  20. PCR detection and DNA sequence analysis of the regulatory region of lymphotropic papovavirus in peripheral blood mononuclear cells of an immunocompromised rhesus macaque

    NASA Technical Reports Server (NTRS)

    Lednicky, John A.; Halvorson, Steven J.; Butel, Janet S.

    2002-01-01

    A lymphotropic papovavirus (LPV) archetypal regulatory region was amplified from DNA from the blood of an immunocompromised rhesus monkey. We believe this is the first nonserological evidence of LPV infection in rhesus monkeys.

  1. Common Variants at Putative Regulatory Sites of the Tissue Nonspecific Alkaline Phosphatase Gene Influence Circulating Pyridoxal 5′-Phosphate Concentration in Healthy Adults123

    PubMed Central

    Carter, Tonia C; Pangilinan, Faith; Molloy, Anne M; Fan, Ruzong; Wang, Yifan; Shane, Barry; Gibney, Eileen R; Midttun, Øivind; Ueland, Per M; Cropp, Cheryl D; Kim, Yoonhee; Wilson, Alexander F; Bailey-Wilson, Joan E; Brody, Lawrence C; Mills, James L

    2015-01-01

    Background: Vitamin B-6 interconversion enzymes are important for supplying pyridoxal 5′-phosphate (PLP), the co-enzyme form, to tissues. Variants in the genes for these enzymes [tissue nonspecific alkaline phosphatase (ALPL), pyridoxamine 5′-phosphate oxidase, pyridoxal kinase, and pyridoxal phosphatase] could affect enzyme function and vitamin B-6 status. Objectives: We tested whether single-nucleotide polymorphisms (SNPs) in these genes influence vitamin B-6 status markers [plasma PLP, pyridoxal (PL), and 4-pyridoxic acid (PA)], and explored potential functional effects of the SNPs. Methods: Study subjects were young, healthy adults from Ireland (n = 2345). We measured plasma PLP, PL, and PA with liquid chromatography–tandem mass spectrometry and genotyped 66 tag SNPs in the 4 genes. We tested for associations with single SNPs in candidate genes and also performed genome-wide association study (GWAS) and gene-based analyses. Results: Seventeen SNPs in ALPL were associated with altered plasma PLP in candidate gene analyses (P < 1.89 × 10−4). In the GWAS, 5 additional ALPL SNPs were associated with altered plasma PLP (P < 5.0 × 10−8). Gene-based analyses that used the functional linear model β-spline (P = 4.04 × 10−15) and Fourier spline (P = 5.87 × 10−15) methods also showed associations between ALPL and altered plasma PLP. No SNPs in other genes were associated with plasma PLP. The association of the minor CC genotype of 1 ALPL SNP, rs1256341, with reduced ALPL expression in the HapMap Northern European ancestry population is consistent with the positive association between the CC genotype and plasma PLP in our study (P = 0.008). No SNP was associated with altered plasma PL or PA. Conclusions: In healthy adults, common variants in ALPL influence plasma PLP concentration, the most frequently used biomarker for vitamin B-6 status. Whether these associations are indicative of functional changes in vitamin B-6 status requires more investigation

  2. PrimerSNP: a web tool for whole-genome selection of allele-specific and common primers of phylogenetically-related bacterial genomic sequences

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The increasing number of genomic sequences of bacteria makes it possible to select unique SNPs of a particular strain/species at the whole genome level and thus design specific primers based on the SNPs. The high similarity of genomic sequences among phylogenetically-related bacteria requires the id...

  3. Epigenomic annotation of gene regulatory alterations during evolution of the primate brain.

    PubMed

    Vermunt, Marit W; Tan, Sander C; Castelijns, Bas; Geeven, Geert; Reinink, Peter; de Bruijn, Ewart; Kondova, Ivanela; Persengiev, Stephan; Bontrop, Ronald; Cuppen, Edwin; de Laat, Wouter; Creyghton, Menno P

    2016-03-01

    Although genome sequencing has identified numerous noncoding alterations between primate species, which of those are regulatory and potentially relevant to the evolution of the human brain is unclear. Here we annotated cis-regulatory elements (CREs) in the human, rhesus macaque and chimpanzee genomes using chromatin immunoprecipitation followed by sequencing (ChIP-seq) in different anatomical regions of the adult brain. We found high similarity in the genomic positioning of rhesus macaque and human CREs, suggesting that the majority of these elements were already present in a common ancestor 25 million years ago. Most of the observed regulatory changes between humans and rhesus macaques occurred before the ancestral separation of humans and chimpanzees, leaving a modest set of regulatory elements with predicted human specificity. Our data refine previous predictions and hypotheses on the consequences of genomic changes between primate species and allow the identification of regulatory alterations relevant to the evolution of the brain. PMID:26807951

  4. Draft Genome Sequences of Streptomyces scabiei S58, Streptomyces turgidiscabies T45, and Streptomyces acidiscabies a10, the Pathogens of Potato Common Scab, Isolated in Japan

    PubMed Central

    Nishi, Yatsuka; Sakai, Masao; Ikenaga, Makoto; Okubo, Takashi; Ikeda, Seishi

    2016-01-01

    The draft genome sequences of the three pathogens of potato common scab, Streptomyces scabiei S58, Streptomyces turgidiscabies T45, and Streptomyces acidiscabies a10, isolated in Japan, are presented here. The genome size of each strain is >10 Mb, and the three pathogenic strains share genes located in a pathogenicity island previously described in other pathogenic Streptomyces species. PMID:26941144

  5. Characterization of DNA sequences that mediate nuclear protein binding to the regulatory region of the Pisum sativum (pea) chlorophyl a/b binding protein gene AB80: identification of a repeated heptamer motif.

    PubMed

    Argüello, G; García-Hernández, E; Sánchez, M; Gariglio, P; Herrera-Estrella, L; Simpson, J

    1992-05-01

    Two protein factors binding to the regulatory region of the pea chlorophyl a/b binding protein gene AB80 have been identified. One of these factors is found only in green tissue but not in etiolated or root tissue. The second factor (denominated ABF-2) binds to a DNA sequence element that contains a direct heptamer repeat TCTCAAA. It was found that presence of both of the repeats is essential for binding. ABF-2 is present in both green and etiolated tissue and in roots and factors analogous to ABF-2 are present in several plant species. Computer analysis showed that the TCTCAAA motif is present in the regulatory region of several plant genes. PMID:1303797

  6. LEU3 of Saccharomyces cerevisiae activates multiple genes for branched-chain amino acid biosynthesis by binding to a common decanucleotide core sequence

    SciTech Connect

    Friden, P.; Schimmel, P.

    1988-07-01

    LEU3 of Saccharomyces cerevisiae encodes an 886-amino-acid polypeptide that regulates transcription of a group of genes involved in leucine biosynthesis and has been shown to bind specifically to a 114-base-pair DNA fragment of the LEU2 upstream region. The authors show here that, in addition to LEU2, LEU3 binds in vitro to sequences in the promoter regions of LEU1, LEU4, ILV2, and, by inference, ILV5. The largely conserved decanucleotide core sequence shared by the binding sites in these genes is CCGGNNCCGG. Methylation interference footprinting experiements show that LEU 3 makes symmetrical contacts with the conserved bases that lie in the major groove. Synthetic oligonucleides (19 to 29 base pairs) which contain the core decanucleotide and flanking sequences of LEU1, LEU2, LEU4, and ILV2 have individually been placed upstream of a LEU3-insensitive test promoter. The expression of each construction is activated by LEU3, although the degree of activation varies considerably according to the specific oligonucleotide which is introduced. A promoter construction with substitutions in the core sequence remains LEU3 insensitive, however. One of the oligonucleotides (based on a LEU2 sequence) was also tested and shown to confer leucine-sensitive expression on the test promoter. The results demonstrate that only a short sequence element is necessary for LEU3-dependent promoter binding and activation and provide direct evidence for an expanded repertoire of genes that are activated by LEU3.

  7. Deep transcriptome sequencing reveals the expression of key functional and regulatory genes involved in the abiotic stress signaling pathways in rice

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Drought, salt and cold are the major abiotic stresses that limit the rice production and cause serious threat to food security. The identification of the key functional and regulatory genes in the abiotic stress signaling pathways is important for understanding the molecular basis of abiotic stress ...

  8. Commonly-occurring polymorphisms in the COMT, DRD1 and DRD2 genes influence different aspects of motor sequence learning in humans.

    PubMed

    Baetu, Irina; Burns, Nicholas R; Urry, Kristi; Barbante, Girolamo Giovanni; Pitcher, Julia B

    2015-11-01

    Performing sequences of movements is a ubiquitous skill that involves dopamine transmission. However, it is unclear which components of the dopamine system contribute to which aspects of motor sequence learning. Here we used a genetic approach to investigate the relationship between different components of the dopamine system and specific aspects of sequence learning in humans. In particular, we investigated variations in genes that code for the catechol-O-methyltransferase (COMT) enzyme, the dopamine transporter (DAT) and dopamine D1 and D2 receptors (DRD1 and DRD2). COMT and the DAT regulate dopamine availability in the prefrontal cortex and the striatum, respectively, two key regions recruited during learning, whereas dopamine D1 and D2 receptors are thought to be involved in long-term potentiation and depression, respectively. We show that polymorphisms in the COMT, DRD1 and DRD2 genes differentially affect behavioral performance on a sequence learning task in 161 Caucasian participants. The DRD1 polymorphism predicted the ability to learn new sequences, the DRD2 polymorphism predicted the ability to perform a previously learnt sequence after performing interfering random movements, whereas the COMT polymorphism predicted the ability to switch flexibly between two sequences. We used computer simulations to explore potential mechanisms underlying these effects, which revealed that the DRD1 and DRD2 effects are possibly related to neuroplasticity. Our prediction-error algorithm estimated faster rates of connection strengthening in genotype groups with presumably higher D1 receptor densities, and faster rates of connection weakening in genotype groups with presumably higher D2 receptor densities. Consistent with current dopamine theories, these simulations suggest that D1-mediated neuroplasticity contributes to learning to select appropriate actions, whereas D2-mediated neuroplasticity is involved in learning to inhibit incorrect action plans. However, the

  9. Exploration of noncoding sequences in metagenomes.

    PubMed

    Tobar-Tosse, Fabián; Rodríguez, Adrián C; Vélez, Patricia E; Zambrano, María M; Moreno, Pedro A

    2013-01-01

    Environment-dependent genomic features have been defined for different metagenomes, whose genes and their associated processes are related to specific environments. Identification of ORFs and their functional categories are the most common methods for association between functional and environmental features. However, this analysis based on finding ORFs misses noncoding sequences and, therefore, some metagenome regulatory or structural information could be discarded. In this work we analyzed 23 whole metagenomes, including coding and noncoding sequences using the following sequence patterns: (G+C) content, Codon Usage (Cd), Trinucleotide Usage (Tn), and functional assignments for ORF prediction. Herein, we present evidence of a high proportion of noncoding sequences discarded in common similarity-based methods in metagenomics, and the kind of relevant information present in those. We found a high density of trinucleotide repeat sequences (TRS) in noncoding sequences, with a regulatory and adaptive function for metagenome communities. We present associations between trinucleotide values and gene function, where metagenome clustering correlate with microorganism adaptations and kinds of metagenomes. We propose here that noncoding sequences have relevant information to describe metagenomes that could be considered in a whole metagenome analysis in order to improve their organization, classification protocols, and their relation with the environment. PMID:23536879

  10. Exploration of Noncoding Sequences in Metagenomes

    PubMed Central

    Tobar-Tosse, Fabián; Rodríguez, Adrián C.; Vélez, Patricia E.; Zambrano, María M.; Moreno, Pedro A.

    2013-01-01

    Environment-dependent genomic features have been defined for different metagenomes, whose genes and their associated processes are related to specific environments. Identification of ORFs and their functional categories are the most common methods for association between functional and environmental features. However, this analysis based on finding ORFs misses noncoding sequences and, therefore, some metagenome regulatory or structural information could be discarded. In this work we analyzed 23 whole metagenomes, including coding and noncoding sequences using the following sequence patterns: (G+C) content, Codon Usage (Cd), Trinucleotide Usage (Tn), and functional assignments for ORF prediction. Herein, we present evidence of a high proportion of noncoding sequences discarded in common similarity-based methods in metagenomics, and the kind of relevant information present in those. We found a high density of trinucleotide repeat sequences (TRS) in noncoding sequences, with a regulatory and adaptive function for metagenome communities. We present associations between trinucleotide values and gene function, where metagenome clustering correlate with microorganism adaptations and kinds of metagenomes. We propose here that noncoding sequences have relevant information to describe metagenomes that could be considered in a whole metagenome analysis in order to improve their organization, classification protocols, and their relation with the environment. PMID:23536879

  11. Targeted next generation sequencing identifies novel mutations in RP1 as a relatively common cause of autosomal recessive rod-cone dystrophy.

    PubMed

    El Shamieh, Said; Boulanger-Scemama, Elise; Lancelot, Marie-Elise; Antonio, Aline; Démontant, Vanessa; Condroyer, Christel; Letexier, Mélanie; Saraiva, Jean-Paul; Mohand-Saïd, Saddek; Sahel, José-Alain; Audo, Isabelle; Zeitz, Christina

    2015-01-01

    We report ophthalmic and genetic findings in families with autosomal recessive rod-cone dystrophy (arRCD) and RP1 mutations. Detailed ophthalmic examination was performed in 242 sporadic and arRCD subjects. Genomic DNA was investigated using our customized next generation sequencing panel targeting up to 123 genes implicated in inherited retinal disorders. Stringent filtering coupled with Sanger sequencing and followed by cosegregation analysis was performed to confirm biallelism and the implication of the most likely disease causing variants. Sequencing identified 9 RP1 mutations in 7 index cases. Eight of the mutations were novel, and all cosegregated with severe arRCD phenotype, found associated with additional macular changes. Among the identified mutations, 4 belong to a region, previously associated with arRCD, and 5 others in a region previously associated with adRCD. Our prevalence studies showed that RP1 mutations account for up to 2.5% of arRCD. These results point out for the necessity of sequencing RP1 when genetically investigating sporadic and arRCD. It further highlights the interest of unbiased sequencing technique, which allows investigating the implication of the same gene in different modes of inheritance. Finally, it reports that different regions of RP1 can also lead to arRCD. PMID:25692139

  12. Integration of bioinformatics and synthetic promoters leads to the discovery of novel elicitor-responsive cis-regulatory sequences in Arabidopsis.

    PubMed

    Koschmann, Jeannette; Machens, Fabian; Becker, Marlies; Niemeyer, Julia; Schulze, Jutta; Bülow, Lorenz; Stahl, Dietmar J; Hehl, Reinhard

    2012-09-01

    A combination of bioinformatic tools, high-throughput gene expression profiles, and the use of synthetic promoters is a powerful approach to discover and evaluate novel cis-sequences in response to specific stimuli. With Arabidopsis (Arabidopsis thaliana) microarray data annotated to the PathoPlant database, 732 different queries with a focus on fungal and oomycete pathogens were performed, leading to 510 up-regulated gene groups. Using the binding site estimation suite of tools, BEST, 407 conserved sequence motifs were identified in promoter regions of these coregulated gene sets. Motif similarities were determined with STAMP, classifying the 407 sequence motifs into 37 families. A comparative analysis of these 37 families with the AthaMap, PLACE, and AGRIS databases revealed similarities to known cis-elements but also led to the discovery of cis-sequences not yet implicated in pathogen response. Using a parsley (Petroselinum crispum) protoplast system and a modified reporter gene vector with an internal transformation control, 25 elicitor-responsive cis-sequences from 10 different motif families were identified. Many of the elicitor-responsive cis-sequences also drive reporter gene expression in an Agrobacterium tumefaciens infection assay in Nicotiana benthamiana. This work significantly increases the number of known elicitor-responsive cis-sequences and demonstrates the successful integration of a diverse set of bioinformatic resources combined with synthetic promoter analysis for data mining and functional screening in plant-pathogen interaction. PMID:22744985

  13. Integration of Bioinformatics and Synthetic Promoters Leads to the Discovery of Novel Elicitor-Responsive cis-Regulatory Sequences in Arabidopsis1[C][W][OA

    PubMed Central

    Koschmann, Jeannette; Machens, Fabian; Becker, Marlies; Niemeyer, Julia; Schulze, Jutta; Bülow, Lorenz; Stahl, Dietmar J.; Hehl, Reinhard

    2012-01-01

    A combination of bioinformatic tools, high-throughput gene expression profiles, and the use of synthetic promoters is a powerful approach to discover and evaluate novel cis-sequences in response to specific stimuli. With Arabidopsis (Arabidopsis thaliana) microarray data annotated to the PathoPlant database, 732 different queries with a focus on fungal and oomycete pathogens were performed, leading to 510 up-regulated gene groups. Using the binding site estimation suite of tools, BEST, 407 conserved sequence motifs were identified in promoter regions of these coregulated gene sets. Motif similarities were determined with STAMP, classifying the 407 sequence motifs into 37 families. A comparative analysis of these 37 families with the AthaMap, PLACE, and AGRIS databases revealed similarities to known cis-elements but also led to the discovery of cis-sequences not yet implicated in pathogen response. Using a parsley (Petroselinum crispum) protoplast system and a modified reporter gene vector with an internal transformation control, 25 elicitor-responsive cis-sequences from 10 different motif families were identified. Many of the elicitor-responsive cis-sequences also drive reporter gene expression in an Agrobacterium tumefaciens infection assay in Nicotiana benthamiana. This work significantly increases the number of known elicitor-responsive cis-sequences and demonstrates the successful integration of a diverse set of bioinformatic resources combined with synthetic promoter analysis for data mining and functional screening in plant-pathogen interaction. PMID:22744985

  14. Accurate Identification of Common Pathogenic Nocardia Species: Evaluation of a Multilocus Sequence Analysis Platform and Matrix-Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry.

    PubMed

    Xiao, Meng; Pang, Lu; Chen, Sharon C-A; Fan, Xin; Zhang, Li; Li, Hai-Xia; Hou, Xin; Cheng, Jing-Wei; Kong, Fanrong; Zhao, Yu-Pei; Xu, Ying-Chun

    2016-01-01

    Species identification of Nocardia is not straightforward due to rapidly evolving taxonomy, insufficient discriminatory power of conventional phenotypic methods and also of single gene locus analysis including 16S rRNA gene sequencing. Here we evaluated the ability of a 5-locus (16S rRNA, gyrB, secA1, hsp65 and rpoB) multilocus sequence analysis (MLSA) approach as well as that of matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) in comparison with sequencing of the 5'-end 606 bp partial 16S rRNA gene to provide identification of 25 clinical isolates of Nocardia. The 5'-end 606 bp 16S rRNA gene sequencing successfully assigned 24 of 25 (96%) clinical isolates to species level, namely Nocardia cyriacigeorgica (n = 12, 48%), N. farcinica (n = 9, 36%), N. abscessus (n = 2, 8%) and N. otitidiscaviarum (n = 1, 4%). MLSA showed concordance with 16S rRNA gene sequencing results for the same 24 isolates. However, MLSA was able to identify the remaining isolate as N. wallacei, and clustered N. cyriacigeorgica into three subgroups. None of the clinical isolates were correctly identified to the species level by MALDI-TOF MS analysis using the manufacturer-provided database. A small "in-house" spectral database was established incorporating spectra of five clinical isolates representing the five species identified in this study. After complementation with the "in-house" database, of the remaining 20 isolates, 19 (95%) were correctly identified to species level (score ≥ 2.00) and one (an N. abscessus strain) to genus level (score ≥ 1.70 and < 2.00). In summary, MLSA showed superior discriminatory power compared with the 5'-end 606 bp partial 16S rRNA gene sequencing for species identification of Nocardia. MALDI-TOF MS can provide rapid and accurate identification but is reliant on a robust mass spectra database. PMID:26808813

  15. Nucleotide diversity of a genomic sequence similar to SHATTERPROOF (PvSHP1) in domesticated and wild common bean (Phaseolus vulgaris L.).

    PubMed

    Nanni, L; Bitocchi, E; Bellucci, E; Rossi, M; Rau, D; Attene, G; Gepts, P; Papa, R

    2011-12-01

    Evolutionary studies in plant and animal breeding are aimed at understanding the structure and organization of genetic variations of species. We have identified and characterized a genomic sequence in Phaseolus vulgaris of 1,200 bp (PvSHP1) that is homologous to SHATTERPROOF-1 (SHP1), a gene involved in control of fruit shattering in Arabidopsis thaliana. The PvSHP1 fragment was mapped to chromosome Pv06 in P. vulgaris and is linked to the flower and seed color gene V. Amplification of the PvSHP1 sequence from the most agronomically important legume species showed a high degree of interspecies diversity in the introns within the Phaseoleae, while the coding region was conserved across distant taxa. Sequencing of the PvSHP1 sequence in a sample of 91 wild and domesticated genotypes that span the geographic distribution of this species in the centers of origin showed that PvSHP1 is highly polymorphic and, therefore, particularly useful to further investigate the origin and domestication history of P. vulgaris. Our data confirm the gene pool structure seen in P. vulgaris along with independent domestication processes in the Andes and Mesoamerica; they provide additional evidence for a single domestication event in Mesoamerica. Moreover, our results support the Mesoamerican origin of this species. Finally, we have developed three indel-spanning markers that will be very useful for bean germplasm characterization, and particularly to trace the distribution of the domesticated Andean and Mesoamerican gene pools. PMID:21830108

  16. Cofunctional Subpathways Were Regulated by Transcription Factor with Common Motif, Common Family, or Common Tissue

    PubMed Central

    Su, Fei; Shang, Desi; Xu, Yanjun; Feng, Li; Yang, Haixiu; Liu, Baoquan; Su, Shengyang; Chen, Lina; Li, Xia

    2015-01-01

    Dissecting the characteristics of the transcription factor (TF) regulatory subpathway is helpful for understanding the TF underlying regulatory function in complex biological systems. To gain insight into the influence of TFs on their regulatory subpathways, we constructed a global TF-subpathways network (TSN) to analyze systematically the regulatory effect of common-motif, common-family, or common-tissue TFs on subpathways. We performed cluster analysis to show that the common-motif, common-family, or common-tissue TFs that regulated the same pathway classes tended to cluster together and contribute to the same biological function that led to disease initiation and progression. We analyzed the Jaccard coefficient to show that the functional consistency of subpathways regulated by the TF pairs with common motif, common family, or common tissue was significantly greater than the random TF pairs at the subpathway level, pathway level, and pathway class level. For example, HNF4A (hepatocyte nuclear factor 4, alpha) and NR1I3 (nuclear receptor subfamily 1, group I, member 3) were a pair of TFs with common motif, common family, and common tissue. They were involved in drug metabolism pathways and were liver-specific factors required for physiological transcription. In short, we inferred that the cofunctional subpathways were regulated by common-motif, common-family, or common-tissue TFs. PMID:26688819

  17. Accurate Identification of Common Pathogenic Nocardia Species: Evaluation of a Multilocus Sequence Analysis Platform and Matrix-Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry

    PubMed Central

    Chen, Sharon C-A.; Fan, Xin; Zhang, Li; Li, Hai-Xia; Hou, Xin; Cheng, Jing-Wei; Kong, Fanrong; Zhao, Yu-Pei; Xu, Ying-Chun

    2016-01-01

    Species identification of Nocardia is not straightforward due to rapidly evolving taxonomy, insufficient discriminatory power of conventional phenotypic methods and also of single gene locus analysis including 16S rRNA gene sequencing. Here we evaluated the ability of a 5-locus (16S rRNA, gyrB, secA1, hsp65 and rpoB) multilocus sequence analysis (MLSA) approach as well as that of matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) in comparison with sequencing of the 5’-end 606 bp partial 16S rRNA gene to provide identification of 25 clinical isolates of Nocardia. The 5’-end 606 bp 16S rRNA gene sequencing successfully assigned 24 of 25 (96%) clinical isolates to species level, namely Nocardia cyriacigeorgica (n = 12, 48%), N. farcinica (n = 9, 36%), N. abscessus (n = 2, 8%) and N. otitidiscaviarum (n = 1, 4%). MLSA showed concordance with 16S rRNA gene sequencing results for the same 24 isolates. However, MLSA was able to identify the remaining isolate as N. wallacei, and clustered N. cyriacigeorgica into three subgroups. None of the clinical isolates were correctly identified to the species level by MALDI-TOF MS analysis using the manufacturer-provided database. A small “in-house” spectral database was established incorporating spectra of five clinical isolates representing the five species identified in this study. After complementation with the “in-house” database, of the remaining 20 isolates, 19 (95%) were correctly identified to species level (score ≥ 2.00) and one (an N. abscessus strain) to genus level (score ≥ 1.70 and < 2.00). In summary, MLSA showed superior discriminatory power compared with the 5’-end 606 bp partial 16S rRNA gene sequencing for species identification of Nocardia. MALDI-TOF MS can provide rapid and accurate identification but is reliant on a robust mass spectra database. PMID:26808813

  18. Chicken interferon consensus sequence-binding protein (ICSBP) and interferon regulatory factor (IRF) 1 genes reveal evolutionary conservation in the IRF gene family.

    PubMed Central

    Jungwirth, C; Rebbert, M; Ozato, K; Degen, H J; Schultz, U; Dawid, I B

    1995-01-01

    Members of the IRF family mediate transcriptional responses to interferons (IFNs) and to virus infection. So far, proteins of this family have been studied only among mammalian species. Here we report the isolation of cDNA clones encoding two members of this family from chicken, interferon consensus sequence-binding protein (ICSBP) and IRF-1. The predicted chicken ICSBP and IRF-1 proteins show high levels of sequence similarity to their corresponding human and mouse counterparts. Sequence identities in the putative DNA-binding domains of chicken and human ICSBP and IRF-1 were 97% and 89%, respectively, whereas the C-terminal regions showed identities of 64% and 51%; sequence relationships with mouse ICSBP and IRF-1 are very similar. Chicken ICSBP was found to be expressed in several embryonic tissues, and both chicken IRF-1 and ICSBP were strongly induced in chicken fibroblasts by IFN treatment, supporting the involvement of these factors in IFN-regulated gene expression. The presence of proteins homologous to mammalian IRF family members, together with earlier observations on the occurrence of functionally homologous IFN-responsive elements in chicken and mammalian genes, highlights the conservation of transcriptional mechanisms in the IFN system, a finding that contrasts with the extensive sequence and functional divergence of the IFNs. Images Fig. 3 Fig. 4 Fig. 5 PMID:7536924

  19. Common formin-regulating sequences in Smy1 and Bud14 are required for the control of actin cable assembly in vivo

    PubMed Central

    Eskin, Julian A.; Rankova, Aneliya; Johnston, Adam B.; Alioto, Salvatore L.; Goode, Bruce L.

    2016-01-01

    Formins comprise a large family of proteins with diverse roles in remodeling the actin cytoskeleton. However, the spatiotemporal mechanisms used by cells to control formin activities are only beginning to be understood. Here we dissected Smy1, which has dual roles in regulating formins and myosin. Using mutagenesis, we identified specific sequences in Smy1 critical for its in vitro inhibitory effects on the FH2 domain of the formin Bnr1. By integrating smy1 alleles targeting those sequences, we genetically uncoupled Smy1’s functions in regulating formins and myosin. Quantitative imaging analysis further demonstrated that the ability of Smy1 to directly control Bnr1 activity is crucial in vivo for proper actin cable length, shape, and velocity and, in turn, efficient secretory vesicle transport. A Smy1-like sequence motif was also identified in a different Bnr1 regulator, Bud14, and found to be essential for Bud14 functions in regulating actin cable architecture and function in vivo. Together these observations reveal unanticipated mechanistic ties between two distinct formin regulators. Further, they emphasize the importance of tightly controlling formin activities in vivo to generate specialized geometries and dynamics of actin structures tailored to their physiological roles. PMID:26764093

  20. Common formin-regulating sequences in Smy1 and Bud14 are required for the control of actin cable assembly in vivo.

    PubMed

    Eskin, Julian A; Rankova, Aneliya; Johnston, Adam B; Alioto, Salvatore L; Goode, Bruce L

    2016-03-01

    Formins comprise a large family of proteins with diverse roles in remodeling the actin cytoskeleton. However, the spatiotemporal mechanisms used by cells to control formin activities are only beginning to be understood. Here we dissected Smy1, which has dual roles in regulating formins and myosin. Using mutagenesis, we identified specific sequences in Smy1 critical for its in vitro inhibitory effects on the FH2 domain of the formin Bnr1. By integrating smy1 alleles targeting those sequences, we genetically uncoupled Smy1's functions in regulating formins and myosin. Quantitative imaging analysis further demonstrated that the ability of Smy1 to directly control Bnr1 activity is crucial in vivo for proper actin cable length, shape, and velocity and, in turn, efficient secretory vesicle transport. A Smy1-like sequence motif was also identified in a different Bnr1 regulator, Bud14, and found to be essential for Bud14 functions in regulating actin cable architecture and function in vivo. Together these observations reveal unanticipated mechanistic ties between two distinct formin regulators. Further, they emphasize the importance of tightly controlling formin activities in vivo to generate specialized geometries and dynamics of actin structures tailored to their physiological roles. PMID:26764093

  1. High density genome wide genotyping-by-sequencing and association identifies common and low frequency SNPs, and novel candidate genes influencing cow milk traits

    PubMed Central

    Ibeagha-Awemu, Eveline M.; Peters, Sunday O.; Akwanji, Kingsley A.; Imumorin, Ikhide G.; Zhao, Xin

    2016-01-01

    High-throughput sequencing technologies have increased the ability to detect sequence variations for complex trait improvement. A high throughput genome wide genotyping-by-sequencing (GBS) method was used to generate 515,787 single nucleotide polymorphisms (SNPs), from which 76,355 SNPs with call rates >85% and minor allele frequency ≥1.5% were used in genome wide association study (GWAS) of 44 milk traits in 1,246 Canadian Holstein cows. GWAS was accomplished with a mixed linear model procedure implementing the additive and dominant models. A strong signal within the centromeric region of bovine chromosome 14 was associated with test day fat percentage. Several SNPs were associated with eicosapentaenoic acid, docosapentaenoic acid, arachidonic acid, CLA:9c11t and gamma linolenic acid. Most of the significant SNPs for 44 traits studied are novel and located in intergenic regions or introns of genes. Novel potential candidate genes for milk traits or mammary gland functions include ERCC6, TONSL, NPAS2, ACER3, ITGB4, GGT6, ACOX3, MECR, ADAM12, ACHE, LRRC14, FUK, NPRL3, EVL, SLCO3A1, PSMA4, FTO, ADCK5, PP1R16A and TEP1. Our study further demonstrates the utility of the GBS approach for identifying population-specific SNPs for use in improvement of complex dairy traits. PMID:27506634

  2. High density genome wide genotyping-by-sequencing and association identifies common and low frequency SNPs, and novel candidate genes influencing cow milk traits.

    PubMed

    Ibeagha-Awemu, Eveline M; Peters, Sunday O; Akwanji, Kingsley A; Imumorin, Ikhide G; Zhao, Xin

    2016-01-01

    High-throughput sequencing technologies have increased the ability to detect sequence variations for complex trait improvement. A high throughput genome wide genotyping-by-sequencing (GBS) method was used to generate 515,787 single nucleotide polymorphisms (SNPs), from which 76,355 SNPs with call rates >85% and minor allele frequency ≥1.5% were used in genome wide association study (GWAS) of 44 milk traits in 1,246 Canadian Holstein cows. GWAS was accomplished with a mixed linear model procedure implementing the additive and dominant models. A strong signal within the centromeric region of bovine chromosome 14 was associated with test day fat percentage. Several SNPs were associated with eicosapentaenoic acid, docosapentaenoic acid, arachidonic acid, CLA:9c11t and gamma linolenic acid. Most of the significant SNPs for 44 traits studied are novel and located in intergenic regions or introns of genes. Novel potential candidate genes for milk traits or mammary gland functions include ERCC6, TONSL, NPAS2, ACER3, ITGB4, GGT6, ACOX3, MECR, ADAM12, ACHE, LRRC14, FUK, NPRL3, EVL, SLCO3A1, PSMA4, FTO, ADCK5, PP1R16A and TEP1. Our study further demonstrates the utility of the GBS approach for identifying population-specific SNPs for use in improvement of complex dairy traits. PMID:27506634

  3. Sequence-based introgression mapping of WM7.1 and WM8.3, two major QTL affecting white mold tolerance in Common Bean (Phaseolus vulgaris L.)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    White mold disease, caused by the necrotrophic fungus Sclerotinia sclerotiorum (Lib.) de Bary, is a major pathogen of common bean (Phaseolus vulgaris L.). More than 20 QTL were reported using multiple bi-parental populations. To study the disease in more detail, advanced back-cross populations seg...

  4. Common Space, Common Time, Common Work

    ERIC Educational Resources Information Center

    Shank, Melody J.

    2005-01-01

    The most valued means of support and learning cited by new teachers at Poland Regional High School in rural Maine are the collegial interactions that common workspace, common planning time, and common tasks make possible. The school has used these everyday structures to enable new and veteran teachers to converse about curricular and pedagogical…

  5. Two regulatory proteins that bind to the basic transcription element (BTE), a GC box sequence in the promoter region of the rat P-4501A1 gene.

    PubMed Central

    Imataka, H; Sogawa, K; Yasumoto, K; Kikuchi, Y; Sasano, K; Kobayashi, A; Hayami, M; Fujii-Kuriyama, Y

    1992-01-01

    The cDNAs for two DNA binding proteins of BTE, a GC box sequence in the promoter region of the P-450IA1(CYP1A1) gene, have been isolated from a rat liver cDNA library by using the BTE sequence as a binding probe. While one is for the rat equivalent to human Sp1, the other encodes a primary structure of 244 amino acids, a novel DNA binding protein designated BTEB. Both proteins contain a zinc finger domain of Cys-Cys/His-His motif that is repeated three times with sequence similarity of 72% to each other, otherwise they share little or no similarity. The function of BTEB was analysed by transfection of plasmids expressing BTEB and/or Sp1 with appropriate reporter plasmids into a monkey cell line CV-1 and compared with Sp1. BTEB and Sp1 activated the expression of genes with repeated GC box sequences in promoters such as the simian virus 40 early promoter and the human immunodeficiency virus-1 long terminal repeat promoter. In contrast, BTEB repressed the activity of a promoter containing BTE, a single GC box of the CYP1A1 gene that is stimulated by Sp1. When the BTE sequence was repeated five times, however, BTEB turned out to be an activator of the promoter. RNA blot analysis showed that mRNAs for BTEB and Sp1 were expressed in all tissues tested, but their concentrations varied independently in tissues. The former mRNA was rich in the brain, kidney, lung and testis, while the latter was relatively abundant in the thymus and spleen.(ABSTRACT TRUNCATED AT 250 WORDS) Images PMID:1356762

  6. Multiple ESBL-Producing Escherichia coli Sequence Types Carrying Quinolone and Aminoglycoside Resistance Genes Circulating in Companion and Domestic Farm Animals in Mwanza, Tanzania, Harbor Commonly Occurring Plasmids.

    PubMed

    Seni, Jeremiah; Falgenhauer, Linda; Simeo, Nabina; Mirambo, Mariam M; Imirzalioglu, Can; Matee, Mecky; Rweyemamu, Mark; Chakraborty, Trinad; Mshana, Stephen E

    2016-01-01

    The increased presence of extended-spectrum beta-lactamase (ESBL)-producing bacteria in humans, animals, and their surrounding environments is of global concern. Currently there is limited information on ESBL presence in rural farming communities worldwide. We performed a cross-sectional study in Mwanza, Tanzania, involving 600 companion and domestic farm animals between August/September 2014. Rectal swab/cloaca specimens were processed to identify ESBL-producing Enterobacteriaceae. We detected 130 (21.7%) animals carrying ESBL-producing bacteria, the highest carriage being among dogs and pigs [39.2% (51/130) and 33.1% (43/130), respectively]. The majority of isolates were Escherichia coli [93.3% (125/134)] and exotic breed type [OR (95%CI) = 2.372 (1.460-3.854), p-value < 0.001] was found to be a predictor of ESBL carriage among animals. Whole-genome sequences of 25 ESBL-producing E. coli were analyzed for phylogenetic relationships using multi-locus sequence typing (MLST) and core genome comparisons. Fourteen different sequence types were detected of which ST617 (7/25), ST2852 (3/25), ST1303 (3/25) were the most abundant. All isolates harbored the bla CTX-M-15 allele, 22/25 carried strA and strB, 12/25 aac(6')-lb-cr, and 11/25 qnrS1. Antibiotic resistance was associated with IncF, IncY, as well as non-typable plasmids. Eleven isolates carried pPGRT46-related plasmids, previously reported from isolates in Nigeria. Five isolates had plasmids exhibiting 85-99% homology to pCA28, previously detected in isolates from the US. Our findings indicate a pan-species distribution of ESBL-producing E. coli clonal groups in farming communities and provide evidence for plasmids harboring antibiotic resistances of regional and international impact. PMID:26904015

  7. Multiple ESBL-Producing Escherichia coli Sequence Types Carrying Quinolone and Aminoglycoside Resistance Genes Circulating in Companion and Domestic Farm Animals in Mwanza, Tanzania, Harbor Commonly Occurring Plasmids

    PubMed Central

    Seni, Jeremiah; Falgenhauer, Linda; Simeo, Nabina; Mirambo, Mariam M.; Imirzalioglu, Can; Matee, Mecky; Rweyemamu, Mark; Chakraborty, Trinad; Mshana, Stephen E.

    2016-01-01

    The increased presence of extended-spectrum beta-lactamase (ESBL)-producing bacteria in humans, animals, and their surrounding environments is of global concern. Currently there is limited information on ESBL presence in rural farming communities worldwide. We performed a cross-sectional study in Mwanza, Tanzania, involving 600 companion and domestic farm animals between August/September 2014. Rectal swab/cloaca specimens were processed to identify ESBL-producing Enterobacteriaceae. We detected 130 (21.7%) animals carrying ESBL-producing bacteria, the highest carriage being among dogs and pigs [39.2% (51/130) and 33.1% (43/130), respectively]. The majority of isolates were Escherichia coli [93.3% (125/134)] and exotic breed type [OR (95%CI) = 2.372 (1.460–3.854), p-value < 0.001] was found to be a predictor of ESBL carriage among animals. Whole-genome sequences of 25 ESBL-producing E. coli were analyzed for phylogenetic relationships using multi-locus sequence typing (MLST) and core genome comparisons. Fourteen different sequence types were detected of which ST617 (7/25), ST2852 (3/25), ST1303 (3/25) were the most abundant. All isolates harbored the blaCTX-M-15 allele, 22/25 carried strA and strB, 12/25 aac(6′)-lb-cr, and 11/25 qnrS1. Antibiotic resistance was associated with IncF, IncY, as well as non-typable plasmids. Eleven isolates carried pPGRT46-related plasmids, previously reported from isolates in Nigeria. Five isolates had plasmids exhibiting 85–99% homology to pCA28, previously detected in isolates from the US. Our findings indicate a pan-species distribution of ESBL-producing E. coli clonal groups in farming communities and provide evidence for plasmids harboring antibiotic resistances of regional and international impact. PMID:26904015

  8. Regulatory circuit based on autogenous activation-repression: roles of C-boxes and spacer sequences in control of the PvuII restriction-modification system

    PubMed Central

    Mruk, Iwona; Rajesh, Preeti; Blumenthal, Robert M.

    2007-01-01

    Type II restriction-modification (R-M) systems comprise a restriction endonuclease (REase) and a protective methyltransferase (MTase). After R-M genes enter a new cell, MTase must appear before REase or the chromosome will be cleaved. PvuII and some other R-M systems achieve this delay by cotranscribing the REase gene with the gene for an autogenous transcription activator (the controlling or ‘C’ protein C.PvuII). This study reveals, through in vivo titration, that C.PvuII is not only an activator but also a repressor for its own gene. In other systems, this type of circuit can result in oscillatory behavior. Despite the use of identical, symmetrical C protein-binding sequences (C-boxes) in the left and right operators, C.PvuII showed higher in vitro affinity for OL than for OR, implicating the spacer sequences in this difference. Mutational analysis associated the repression with OR, which overlaps the promoter −35 hexamer but is otherwise dispensable for activation. A nonrepressing mutant exhibited poor establishment in new cells. Comparing promoter-operator regions from PvuII and 29 R-M systems controlled by C proteins revealed that the most-highly conserved sequence is the tetranucleotide spacer separating OL from OR. Any changes in that spacer reduced the stability of C.PvuII-operator complexes and abolished activation. PMID:17933763

  9. Regulatory circuit based on autogenous activation-repression: roles of C-boxes and spacer sequences in control of the PvuII restriction-modification system.

    PubMed

    Mruk, Iwona; Rajesh, Preeti; Blumenthal, Robert M

    2007-01-01

    Type II restriction-modification (R-M) systems comprise a restriction endonuclease (REase) and a protective methyltransferase (MTase). After R-M genes enter a new cell, MTase must appear before REase or the chromosome will be cleaved. PvuII and some other R-M systems achieve this delay by cotranscribing the REase gene with the gene for an autogenous transcription activator (the controlling or 'C' protein C.PvuII). This study reveals, through in vivo titration, that C.PvuII is not only an activator but also a repressor for its own gene. In other systems, this type of circuit can result in oscillatory behavior. Despite the use of identical, symmetrical C protein-binding sequences (C-boxes) in the left and right operators, C.PvuII showed higher in vitro affinity for O(L) than for O(R), implicating the spacer sequences in this difference. Mutational analysis associated the repression with O(R), which overlaps the promoter -35 hexamer but is otherwise dispensable for activation. A nonrepressing mutant exhibited poor establishment in new cells. Comparing promoter-operator regions from PvuII and 29 R-M systems controlled by C proteins revealed that the most-highly conserved sequence is the tetranucleotide spacer separating O(L) from O(R). Any changes in that spacer reduced the stability of C.PvuII-operator complexes and abolished activation. PMID:17933763

  10. Deep Sequencing-Based Transcriptome Analysis Reveals the Regulatory Mechanism of Bemisia tabaci (Hemiptera: Aleyrodidae) Nymph Parasitized by Encarsia sophia (Hymenoptera: Aphelinidae)

    PubMed Central

    Wang, Ran; Li, Fei; Zhang, Fan; Wang, Su

    2016-01-01

    The whitefly Bemisia tabaci is a genetically diverse complex with multiple cryptic species, and some are the most destructive invasive pests of many ornamentals and crops worldwide. Encarsia sophia is an autoparasitoid wasp that demonstrated high efficiency as bio-control agent of whiteflies. However, the immune mechanism of B. tabaci parasitization by E. sophia is unknown. In order to investigate immune response of B. tabaci to E. Sophia parasitization, the transcriptome of E. sophia parasitized B. tabaci nymph was sequenced by Illumina sequencing. De novo assembly generated 393,063 unigenes with average length of 616 bp, in which 46,406 unigenes (15.8% of all unigenes) were successfully mapped. Parasitization by E. sophia had significant effects on the transcriptome profile of B. tabaci nymph. A total of 1482 genes were significantly differentially expressed, of which 852 genes were up-regulated and 630 genes were down-regulated. These genes were mainly involved in immune response, development, metabolism and host signaling pathways. At least 52 genes were found to be involved in the host immune response, 33 genes were involved in the development process, and 29 genes were involved in host metabolism. Taken together, the assembled and annotated transcriptome sequences provided a valuable genomic resource for further understanding the molecular mechanism of immune response of B. tabaci parasitization by E. sophia. PMID:27332546

  11. Exome sequencing identifies DYNC2H1 mutations as a common cause of asphyxiating thoracic dystrophy (Jeune syndrome) without major polydactyly, renal or retinal involvement

    PubMed Central

    Schmidts, Miriam; Arts, Heleen H; Bongers, Ernie M H F; Yap, Zhimin; Oud, Machteld M; Antony, Dinu; Duijkers, Lonneke; Emes, Richard D; Stalker, Jim; Yntema, Jan-Bart L; Plagnol, Vincent; Hoischen, Alexander; Gilissen, Christian; Forsythe, Elisabeth; Lausch, Ekkehart; Veltman, Joris A; Roeleveld, Nel; Superti-Furga, Andrea; Kutkowska-Kazmierczak, Anna; Kamsteeg, Erik-Jan; Elçioğlu, Nursel; van Maarle, Merel C; Graul-Neumann, Luitgard M; Devriendt, Koenraad; Smithson, Sarah F; Wellesley, Diana; Verbeek, Nienke E; Hennekam, Raoul C M; Kayserili, Hulya; Scambler, Peter J; Beales, Philip L; Knoers, Nine VAM; Roepman, Ronald; Mitchison, Hannah M

    2013-01-01

    Background Jeune asphyxiating thoracic dystrophy (JATD) is a rare, often lethal, recessively inherited chondrodysplasia characterised by shortened ribs and long bones, sometimes accompanied by polydactyly, and renal, liver and retinal disease. Mutations in intraflagellar transport (IFT) genes cause JATD, including the IFT dynein-2 motor subunit gene DYNC2H1. Genetic heterogeneity and the large DYNC2H1 gene size have hindered JATD genetic diagnosis. Aims and methods To determine the contribution to JATD we screened DYNC2H1 in 71 JATD patients JATD patients combining SNP mapping, Sanger sequencing and exome sequencing. Results and conclusions We detected 34 DYNC2H1 mutations in 29/71 (41%) patients from 19/57 families (33%), showing it as a major cause of JATD especially in Northern European patients. This included 13 early protein termination mutations (nonsense/frameshift, deletion, splice site) but no patients carried these in combination, suggesting the human phenotype is at least partly hypomorphic. In addition, 21 missense mutations were distributed across DYNC2H1 and these showed some clustering to functional domains, especially the ATP motor domain. DYNC2H1 patients largely lacked significant extra-skeletal involvement, demonstrating an important genotype–phenotype correlation in JATD. Significant variability exists in the course and severity of the thoracic phenotype, both between affected siblings with identical DYNC2H1 alleles and among individuals with different alleles, which suggests the DYNC2H1 phenotype might be subject to modifier alleles, non-genetic or epigenetic factors. Assessment of fibroblasts from patients showed accumulation of anterograde IFT proteins in the ciliary tips, confirming defects similar to patients with other retrograde IFT machinery mutations, which may be of undervalued potential for diagnostic purposes. PMID:23456818

  12. Swine Leukocyte Antigen (SLA) class I allele typing of Danish swine herds and identification of commonly occurring haplotypes using sequence specific low and high resolution primers.

    PubMed

    Pedersen, Lasse Eggers; Jungersen, Gregers; Sorensen, Maria Rathmann; Ho, Chak-Sum; Vadekær, Dorte Fink

    2014-12-15

    The swine major histocompatibility complex (MHC) genomic region (SLA) is extremely polymorphic comprising high numbers of different alleles, many encoding a distinct MHC class I molecule, which binds and presents endogenous peptides to circulating T cells of the immune system. Upon recognition of such peptide-MHC complexes (pMHC) naïve T cells can become activated and respond to a given pathogen leading to its elimination and the generation of memory cells. Hence SLA plays a crucial role in maintaining overall adaptive immunologic resistance to pathogens. Knowing which SLA alleles that are commonly occurring can be of great importance in regard to future vaccine development and the establishment of immune protection in swine through broad coverage, highly specific, subunit based vaccination against viruses such as swine influenza, porcine reproductive and respiratory syndrome virus, vesicular stomatitis virus, foot-and-mouth-disease virus and others. Here we present the use of low- and high-resolution PCR-based typing methods to identify individual and commonly occurring SLA class I alleles in Danish swine. A total of 101 animals from seven different herds were tested, and by low resolution typing the top four most frequent SLA class I alleles were those of the allele groups SLA-3*04XX, SLA-1*08XX, SLA-2*02XX, and SLA-1*07XX, respectively. Customised high resolution primers were used to identify specific alleles within the above mentioned allele groups as well as within the SLA-2*05XX allele group. Our studies also suggest the most common haplotype in Danish pigs to be Lr-4.0 expressing the SLA-1*04XX, SLA-2*04XX, and SLA-3*04XX allele combination. PMID:25457547

  13. Diversity in junctional sequences associated with the common human V gamma 9 and V delta 2 gene segments in normal blood and lung compared with the limited diversity in a granulomatous disease

    PubMed Central

    1990-01-01

    The T cell receptor (TCR) junctional regions (N regions) of the common human V gamma 9 and V delta 2 gene segments were sequenced from the blood and lung of normal individuals (195 transcripts) and a group of individuals with sarcoidosis (220 transcripts), a granulomatous disease in which increased numbers of V gamma 9+ gamma/delta + T cells are often observed. In normal individuals, the vast majority (86%) of blood V gamma 9 transcripts used the J gamma P gene segment. In contrast to this restriction of J region usage, there was a large diversity of the junctional region, with less than 20% of blood V gamma 9 junctional regions showing identical sequences for any one normal individual. For the blood V delta 2 transcripts in normal individuals, there was restriction of J region usage, with 93% using J delta 1. The junctional regions were even more diverse than for V gamma 9, with a unique sequence observed in each transcript examined. Compared with blood, sequences from the normal lung showed a small increase in identical junctional regions, particularly in one individual where 46% of V gamma 9 transcripts examined were identical, suggesting a response of some gamma/delta T cells to antigens found in the lung in the normal state. In marked contrast to normals, some individuals with sarcoidosis had large numbers of V gamma 9 transcripts, as well as V delta 2 transcripts, sharing identical sequences. For V gamma 9 blood transcripts, two individuals showed 84 and 56% of junctional region sequences to be identical, respectively. Similarly, blood V delta 2 transcripts showed 43, 33, and 25% identical junctional region sequences in three individuals. In the sarcoid patient with the most striking over-representation of blood V gamma 9 junctional sequences, lung V gamma 9 transcripts showed increased (67%) use of the same junctional region sequence as in blood. This limited diversity of TCR junctional regions among some individuals with sarcoidosis suggests a response from

  14. Circuitry and dynamics of human transcription factor regulatory networks

    PubMed Central

    Neph, Shane; Stergachis, Andrew B.; Reynolds, Alex; Sandstrom, Richard; Borenstein, Elhanan; Stamatoyannopoulos, John A.

    2012-01-01

    SUMMARY The combinatorial cross-regulation of hundreds of sequence-specific transcription factors defines a regulatory network that underlies cellular identity and function. Here we use genome-wide maps of in vivo DNaseI footprints to assemble an extensive core human regulatory network comprising connections among 475 sequence-specific transcription factors, and to analyze the dynamics of these connections across 41 diverse cell and tissue types. We find that human transcription factor networks are highly cell-selective and are driven by cohorts of factors that include regulators with previously unrecognized roles in control of cellular identity. Moreover, we identify many widely expressed factors that impact transcriptional regulatory networks in a cell-selective manner. Strikingly, in spite of their inherent diversity, all cell type regulatory networks independently converge on a common architecture that closely resembles the topology of living neuronal networks. Together, our results provide the first description of the circuitry, dynamics, and organizing principles of the human transcription factor regulatory network. PMID:22959076

  15. Generation and Molecular Mapping of a Sequence Characterized Amplified Region Marker Linked with the Bct Gene for Resistance to Beet curly top virus in Common Bean.

    PubMed

    Larsen, Richard C; Miklas, Phillip N

    2004-04-01

    ABSTRACT A random amplified polymorphic DNA (RAPD) marker directly linked (0.0 cM) with a resistance gene was identified in a snap bean recombinant inbred population (Moncayo x Primo) consisting of 94 F(5:7) recombinant inbred lines that had uniform segregation for disease reaction to Beet curly top virus (BCTV) across three field locations. Resistance was conditioned by a single dominant allele tentatively designated Bct. Seven hundred and fifty decamer primers were screened to obtain the linked RAPD marker that was then converted to a sequence characterized amplified region (SCAR) marker SAS8.1550. The SCAR mapped within a cluster of resistance genes on linkage group B7 of the core map. A survey of 103 BCTV-resistant and -susceptible snap and dry bean genotypes was conducted using SAS8.1550. Results showed that the SCAR would be highly useful for marker-assisted selection of Bct in snap and dry bean originating from the Andean gene pool. Marker-assisted selection for Bct will expedite the development of BCTV-resistant cultivars and minimize the need for cumbersome pathogen tests. PMID:18944106

  16. Interstitial telomeric sequences in human chromosomes cluster with common fragile sites, mutagen sensitive sites, viral integration sites, cancer breakpoints, proto-oncogenes and breakpoints involved in primate evolution

    SciTech Connect

    Adekunle, S.S.A.; Wyandt, H.; Mark, H.F.L.

    1994-09-01

    Recently we mapped the telomeric repeat sequences to 111 interstitial sites in the human genome and to sites of gaps and breaks induced by aphidicolin and sister chromatid exchange sites detected by BrdU. Many of these sites correspond to conserved fragile sites in man, gorilla and chimpazee, to sites of conserved sister chromatid exchange in the mammalian X chromosome, to mutagenic sensitive sites, mapped locations of proto-oncogenes, breakpoints implicated in primate evolution and to breakpoints indicated as the sole anomaly in neoplasia. This observation prompted us to investigate if the interstitial telomeric sites cluster with these sites. An extensive literature search was carried out to find all the available published sites mentioned above. For comparison, we also carried out a statistical analysis of the clustering of the sites of the telomeric repeats with the gene locations where only nucleotide mutations have been observed as the only chromosomal abnormality. Our results indicate that the telomeric repeats cluster most with fragile sites, mutagenic sensitive sites and breakpoints implicated in primate evolution and least with cancer breakpoints, mapped locations of proto-oncogenes and other genes with nucleotide mutations.

  17. Heterozygous triplication of upstream regulatory sequences leads to dysregulation of matrix metalloproteinase 19 (MMP19) in patients with cavitary optic disc anomaly (CODA)

    PubMed Central

    Hazlewood, Ralph J.; Roos, Benjamin R.; Solivan-Timpe, Frances; Honkanen, Robert A.; Jampol, Lee M.; Gieser, Stephen C.; Meyer, Kacie J.; Mullins, Robert F.; Kuehn, Markus H.; Scheetz, Todd E.; Kwon, Young H.; Alward, Wallace L.M.; Stone, Edwin M.; Fingert, John H.

    2015-01-01

    Patients with a congenital optic nerve disease, cavitary optic disc anomaly (CODA), are born with profound excavation of the optic nerve resembling glaucoma. We previously mapped the gene that causes autosomal dominant CODA in a large pedigree to a chromosome 12q locus. Using comparative genomic hybridization and quantitative PCR analysis of this pedigree, we report identifying a 6Kbp heterozygous triplication upstream of the matrix metalloproteinase 19 (MMP19) gene, present in all 17 affected family members and no normal members. Moreover, the triplication was not detected in 78 control subjects or in the Database of Genomic Variants. We further detected the same 6Kbp triplication in 1 of 24 unrelated CODA patients and in none of 172 glaucoma patients. Analysis with a luciferase assay showed that the 6Kbp sequence has transcription enhancer activity. A 773bp fragment of the 6Kbp DNA segment increased downstream gene expression 8-fold, suggesting that triplication of this sequence may lead to dysregulation of the downstream gene, MMP19, in CODA patients. Lastly, immunohistochemical analysis of human donor eyes revealed strong expression of MMP19 in optic nerve head. These data strongly suggest that triplication of an enhancer may lead to overexpression of MMP19 in the optic nerve which causes CODA. PMID:25581579

  18. A root chicory MADS box sequence and the Arabidopsis flowering repressor FLC share common features that suggest conserved function in vernalization and de-vernalization responses.

    PubMed

    Périlleux, Claire; Pieltain, Alexandra; Jacquemin, Guillaume; Bouché, Frédéric; Detry, Nathalie; D'Aloia, Maria; Thiry, Laura; Aljochim, Pierre; Delansnay, Martin; Mathieu, Anne-Sophie; Lutts, Stanley; Tocquin, Pierre

    2013-08-01

    Root chicory (Cichorium intybus var. sativum) is a biennial crop, but is harvested to obtain root inulin at the end of the first growing season before flowering. However, cold temperatures may vernalize seeds or plantlets, leading to incidental early flowering, and hence understanding the molecular basis of vernalization is important. A MADS box sequence was isolated by RT-PCR and named FLC-LIKE1 (CiFL1) because of its phylogenetic positioning within the same clade as the floral repressor Arabidopsis FLOWERING LOCUS C (AtFLC). Moreover, over-expression of CiFL1 in Arabidopsis caused late flowering and prevented up-regulation of the AtFLC target FLOWERING LOCUS T by photoperiod, suggesting functional conservation between root chicory and Arabidopsis. Like AtFLC in Arabidopsis, CiFL1 was repressed during vernalization of seeds or plantlets of chicory, but repression of CiFL1 was unstable when the post-vernalization temperature was favorable to flowering and when it de-vernalized the plants. This instability of CiFL1 repression may be linked to the bienniality of root chicory compared with the annual lifecycle of Arabidopsis. However, re-activation of AtFLC was also observed in Arabidopsis when a high temperature treatment was used straight after seed vernalization, eliminating the promotive effect of cold on flowering. Cold-induced down-regulation of a MADS box floral repressor and its re-activation by high temperature thus appear to be conserved features of the vernalization and de-vernalization responses in distant species. PMID:23581257

  19. Regulatory polymorphisms underlying complex disease traits.

    PubMed

    Knight, Julian C

    2005-02-01

    There is growing evidence that genetic variation plays an important role in the determination of individual susceptibility to complex disease traits. In contrast to coding sequence polymorphisms, where the consequences of non-synonymous variation may be resolved at the level of the protein phenotype, defining specific functional regulatory polymorphisms has proved problematic. This has arisen for a number of reasons, including difficulties with fine mapping due to linkage disequilibrium, together with a paucity of experimental tools to resolve the effects of non-coding sequence variation on gene expression. Recent studies have shown that variation in gene expression is heritable and can be mapped as a quantitative trait. Allele-specific effects on gene expression appear relatively common, typically of modest magnitude and context specific. The role of regulatory polymorphisms in determining susceptibility to a number of complex disease traits is discussed, including variation at the VNTR of INS, encoding insulin, in type 1 diabetes and polymorphism of CTLA4, encoding cytotoxic T lymphocyte antigen, in autoimmune disease. Examples where regulatory polymorphisms have been found to play a role in mongenic traits such as factor VII deficiency are discussed, and contrasted with those polymorphisms associated with ischaemic heart disease at the same gene locus. Molecular mechanisms operating in an allele-specific manner at the level of transcription are illustrated, with examples including the role of Duffy binding protein in malaria. The difficulty of resolving specific functional regulatory variants arising from linkage disequilibrium is demonstrated using a number of examples including polymorphism of CCR5, encoding CC chemokine receptor 5, and HIV-1 infection. The importance of understanding haplotypic structure to the design and interpretation of functional assays of putative regulatory variation is highlighted, together with discussion of the strategic use of

  20. Fast and Efficient Cloning of Cis-Regulatory Sequences for High-Throughput Yeast One-Hybrid Analyses of Transcription Factors.

    PubMed

    Kelemen, Zsolt; Przybyla-Toscano, Jonathan; Tissot, Nicolas; Lepiniec, Loïc; Dubos, Christian

    2016-01-01

    Yeast one-hybrid (Y1H) assay has been proven to be a powerful technique to characterize in vivo the interaction between a given transcription factor (TF), or its DNA-binding domain (DBD), and target DNA sequences. Comprehensive characterization of TF/DBD and DNA interactions should allow designing synthetic promoters that would undoubtedly be valuable for biotechnological approaches. Here, we use the ligation-independent cloning system (LIC) in order to enhance the cloning efficiency of DNA motifs into the pHISi Y1H vector. LIC overcomes important limitations of traditional cloning technologies, since any DNA fragment can be cloned into LIC compatible vectors without using restriction endonucleases, ligation, or in vitro recombination. PMID:27557765

  1. Muscle-specific activity of the skeletal troponin I promoter requires interaction between upstream regulatory sequences and elements contained within the first transcribed exon.

    PubMed Central

    Nikovits, W; Mar, J H; Ordahl, C P

    1990-01-01

    Expression of the skeletal troponin I (sTnI) gene is regulated transcriptionally in a muscle-specific fashion. We show here that the region of the sTnI gene between -160 and +61 (relative to the transcription initiation site) is able to direct expression of the bacterial chloramphenicol acetyltransferase (CAT) gene is muscle cultures at a level approximately 100 times higher than in fibroblast cultures. RNA analysis demonstrated that transcription of the CAT gene was initiated at the same site as transcription of the endogenous sTnI gene and that CAT activity levels were approximately proportional to CAT mRNA levels. Deletion analysis demonstrated that the region between nucleotides -160 and -40 contained sequences essential for full promoter activity. Surprisingly, 3' deletion analysis indicated that the first exon (-6 to +61) of the sTnI gene was also required for full activity of the sTnI promoter in skeletal muscle cells. Chimeric promoter experiments, in which segments of the sTnI and the herpes simplex virus thymidine kinase promoter were interchanged, indicated that reconstitution of a muscle-specific promoter required inclusion of both the upstream and exon I regions of the sTnI gene. Exon I, and the region immediately upstream, showed DNase protection over sequence motifs related to those found in other genes, including the tar region of human immunodeficiency virus type 1. These results demonstrate that expression of the sTnI promoter in embryonic skeletal muscle cells requires complex interaction between two separate promoter regions, one of which resides within the first 61 transcribed nucleotides of the gene. Images PMID:2355914

  2. CD4+CD25+Foxp3+ T regulatory cells, Th1 (CCR5, IL-2, IFN-γ) and Th2 (CCR4, IL-4, Il-13) type chemokine receptors and intracellular cytokines in children with common variable immunodeficiency.

    PubMed

    Kutukculer, Necil; Azarsiz, Elif; Aksu, Guzide; Karaca, Neslihan Edeer

    2016-06-01

    Common variable immunodeficiency (CVID) is a heterogeneous group of primary antibody deficiencies characterized by decreased serum immunoglobulin G along with a decrease in serum IgA and/or IgM, defective specific antibody production, and recurrent bacterial infections. Abnormal lymphocyte trafficking, dysregulated cellular responses to chemokines, and uncontrolled T cell polarization may be involved in the pathogenesis and may help to understand the clinical complications. We evaluated T helper cell subsets (chemokine receptors CCR4, CCR5, and CCR7), expressions on T lymphocytes, intracellular cytokines - IL-2, IL-4, IL-13, IFN- γ-on CD4(+) T cells, and expression of CD4(+)CD25(+)Foxp3(+) regulatory T cells of 20 CVID patients and 26 healthy controls. Autoimmune clinical findings and other complications were also determined. Percentages and absolute numbers of CD4(+)CD25(+) Foxp3(+) cells did not show any significant difference between CVID cases and healthy controls nor between severe and moderate disease patients. The only significant difference regarding Th1 and Th2 type intracellular cytokines was the decreased absolute numbers of CD3(+)CD4(+)IL4(+) cells in CVID cases. There were some findings about T helper cell type dominance in CVID patients such as positive correlation between hepatomegaly and high IL-2 and IFN-γ in CD3(+)CD4(+) cells and very high expression of CCR5 (Th1) on CD3(+)CD4(+) cells in patients with granuloma. Th1 (CCR5) and Th2 (CCR4) type chemokine receptors did not show any dominance in CVID cases. However, frequencies of CCR7 expressing CD3(+) T cells, CD3(+)CD4(+) T helper cells and CD3(+)CD8(+) T cytotoxic cells were significantly lower in severe CVID patients. In addition, presence of autoimmune clinical findings was negatively correlated with CCR7(+) cells. As CCR7 is a key mediator balancing immunity and tolerance in the immune system, the abnormality of this mediator may contribute to the profound immune dysregulation seen in CVID

  3. Geochemical similarities between volcanic units at Yucca Mountain and Pahute Mesa: evidence for a common magmatic origin for volcanic sequences that flank the Timber Mountain Caldera

    SciTech Connect

    Warren, R.G.

    1983-12-31

    Chemical compositions have been determined for sanidine, plagioclase, biotite, and hornblende phenocrysts by electron microprobe for a comprehensive set of samples of Crater Flat Tuff and tuffs of Calico Hills. Most of these samples were obtained from drill holes at Yucca Mountain. Samples of tuffs and lavas of Area 20, obtained from locations at Pahute Mesa, have similarly been subjected to microprobe analysis. Complete modal petrography has been determined for all samples. Biotite and hornblende in the samples from both Yucca Mountain and Pahute Mesa have Fe-rich compositions that contract strikingly with Fe-poor compositions in the overlying Paintbrush Tuff and the underlying Lithic Ridge Tuff at Yucca Mountain. Each unit from Yucca Mountain has distinctive compositions for both sanidine and plagioclase that very closely match compositions for a corresponding unit identified within the lower, middle and upper portions of the Area 20 tuffs and lavas from Pahute Mesa. Each of these paired units probably originated from a common parental magma and was eruptd contemporaneously or nearly so. Each pair of units with matching phenocryst chemistries has a similar, but not identical set of petrographic characteristics. The petrographic differences, as well as small differences in phenocryst chemistry, result from a sonal distribution of phenocrysts within the parent magma chamber and eruption through earlier units that differ markedly between Yucca Mountain and Pahute Mesa.

  4. Common Schools for Common Education.

    ERIC Educational Resources Information Center

    Callan, Eamonn

    1995-01-01

    A vision of common education for citizens of a liberal democracy warrants faith in common schools as an instrument of social good. Some kinds of separate schooling are not inconsistent with common schooling and are even desirable. Equal respect, as defined by J. Rawls, is a basis for common education. (SLD)

  5. Common Cold

    MedlinePlus

    ... News & Events Volunteer NIAID > Health & Research Topics > Common Cold Skip Website Tools Website Tools Print this page ... Help people who are suffering from the common cold by volunteering for NIAID clinical studies on ClinicalTrials. ...

  6. Common Cold

    MedlinePlus

    ... coughing - everyone knows the symptoms of the common cold. It is probably the most common illness. In ... people in the United States suffer 1 billion colds. You can get a cold by touching your ...

  7. Exploring the miRNA-mRNA regulatory network in clear cell renal cell carcinomas by next-generation sequencing expression profiles.

    PubMed

    Müller, Sören; Nowak, Katharina

    2014-01-01

    Altered microRNA (miRNA) expression is a hallmark of many cancer types. The combined analysis of miRNA and messenger RNA (mRNA) expression profiles is crucial to identifying links between deregulated miRNAs and oncogenic pathways. Therefore, we investigated the small non-coding (snc) transcriptomes of nine clear cell renal cell carcinomas (ccRCCs) and adjacent normal tissues for alterations in miRNA expression using a publicly available small RNA-Sequencing (sRNA-Seq) raw-dataset. We constructed a network of deregulated miRNAs and a set of differentially expressed genes publicly available from an independent study to in silico determine miRNAs that contribute to clear cell renal cell carcinogenesis. From a total of 1,672 sncRNAs, 61 were differentially expressed across all ccRCC tissue samples. Several with known implications in ccRCC development, like the upregulated miR-21-5p, miR-142-5p, as well as the downregulated miR-106a-5p, miR-135a-5p, or miR-206. Additionally, novel promising candidates like miR-3065, which i.a. targets NRP2 and FLT1, were detected in this study. Interaction network analysis revealed pivotal roles for miR-106a-5p, whose loss might contribute to the upregulation of 49 target mRNAs, miR-135a-5p (32 targets), miR-206 (28 targets), miR-363-3p (22 targets), and miR-216b (13 targets). Among these targets are the angiogenesis, metastasis, and motility promoting oncogenes c-MET, VEGFA, NRP2, and FLT1, the latter two coding for VEGFA receptors. PMID:24977165

  8. Exploring the miRNA-mRNA Regulatory Network in Clear Cell Renal Cell Carcinomas by Next-Generation Sequencing Expression Profiles

    PubMed Central

    Nowak, Katharina

    2014-01-01

    Altered microRNA (miRNA) expression is a hallmark of many cancer types. The combined analysis of miRNA and messenger RNA (mRNA) expression profiles is crucial to identifying links between deregulated miRNAs and oncogenic pathways. Therefore, we investigated the small non-coding (snc) transcriptomes of nine clear cell renal cell carcinomas (ccRCCs) and adjacent normal tissues for alterations in miRNA expression using a publicly available small RNA-Sequencing (sRNA-Seq) raw-dataset. We constructed a network of deregulated miRNAs and a set of differentially expressed genes publicly available from an independent study to in silico determine miRNAs that contribute to clear cell renal cell carcinogenesis. From a total of 1,672 sncRNAs, 61 were differentially expressed across all ccRCC tissue samples. Several with known implications in ccRCC development, like the upregulated miR-21-5p, miR-142-5p, as well as the downregulated miR-106a-5p, miR-135a-5p, or miR-206. Additionally, novel promising candidates like miR-3065, which i.a. targets NRP2 and FLT1, were detected in this study. Interaction network analysis revealed pivotal roles for miR-106a-5p, whose loss might contribute to the upregulation of 49 target mRNAs, miR-135a-5p (32 targets), miR-206 (28 targets), miR-363-3p (22 targets), and miR-216b (13 targets). Among these targets are the angiogenesis, metastasis, and motility promoting oncogenes c-MET, VEGFA, NRP2, and FLT1, the latter two coding for VEGFA receptors. PMID:24977165

  9. The regulatory network of cluster-root function and development in phosphate-deficient white lupin (Lupinus albus) identified by transcriptome sequencing.

    PubMed

    Wang, Zhengrui; Straub, Daniel; Yang, Huaiyu; Kania, Angelika; Shen, Jianbo; Ludewig, Uwe; Neumann, Günter

    2014-07-01

    Lupinus albus serves as model plant for root-induced mobilization of sparingly soluble soil phosphates via the formation of cluster-roots (CRs) that mediate secretion of protons, citrate, phenolics and acid phosphatases (APases). This study employed next-generation sequencing to investigate the molecular mechanisms behind these complex adaptive responses at the transcriptome level. We compared different stages of CR development, including pre-emergent (PE), juvenile (JU) and the mature (MA) stages. The results confirmed that the primary metabolism underwent significant modifications during CR maturation, promoting the biosynthesis of organic acids, as had been deduced from physiological studies. Citrate catabolism was downregulated, associated with citrate accumulation in MA clusters. Upregulation of the phenylpropanoid pathway reflected the accumulation of phenolics. Specific transcript expression of ALMT and MATE transporter genes correlated with the exudation of citrate and flavonoids. The expression of transcripts related to nucleotide degradation and APases in MA clusters coincided with the re-mobilization and hydrolysis of organic phosphate resources. Most interestingly, hormone-related gene expression suggested a central role of ethylene during CR maturation. This was associated with the upregulation of the iron (Fe)-deficiency regulated network that mediates ethylene-induced expression of Fe-deficiency responses in other species. Finally, transcripts related to abscisic acid and jasmonic acid were upregulated in MA clusters, while auxin- and brassinosteroid-related genes and cytokinin receptors were most strongly expressed during CR initiation. Key regulations proposed by the RNA-seq data were confirmed by quantitative real-time polymerase chain reaction (RT-qPCR) and some physiological analyses. A model for the gene network regulating CR development and function is presented. PMID:24635386

  10. Interferon (IFN) Consensus Sequence-binding Protein, a Transcription Factor of the IFN Regulatory Factor Family, Regulates Immune Responses In Vivo through Control of Interleukin 12 Expression

    PubMed Central

    Giese, Nathalia A.; Gabriele, Lucia; Doherty, T. Mark; Klinman, Dennis M.; Tadesse-Heath, Lekidelu; Contursi, Christina; Epstein, Suzanne L.; Morse, Herbert C.

    1997-01-01

    Mice with a null mutation of the gene encoding interferon consensus sequence-binding protein (ICSBP) develop a chronic myelogenous leukemia-like syndrome and mount impaired responses to certain viral and bacterial infections. To gain a mechanistic understanding of the contributions of ICSBP to humoral and cellular immunity, we characterized the responses of control and ICSBP−/− mice to infection with influenza A (flu) and Leishmania major (L. major). Mice of both genotypes survived infections with flu, but differed markedly in the isotype distribution of antiflu antibodies. In sera of normal mice, immunoglobulin (Ig)G2a antibodies were dominant over IgG1 antibodies, a pattern indicative of a T helper cell type 1 (Th1)-driven response. In sera of ICSBP−/− mice, however, IgG1 antibodies dominated over IgG2a antibodies, a pattern indicative of a Th2-driven response. The dominance of IgG1 and IgE over IgG2a was detected in the sera of uninfected mice as well. A seeming Th2 bias of ICSBP-deficient mice was also uncovered in their inability to control infection with L. major, where resistance is known to be dependent on IL-12 and IFN-γ as components of a Th1 response. Infected ICSBP-deficient mice developed fulminant, disseminated leishmaniasis as a result of failure to mount a Th1-mediated curative response, although T cells remained capable of secreting IFN-γ and macrophages of producing nitric oxide. Compromised Th1 differentiation in ICSBP−/− mice could not be attributed to hyporesponsiveness of CD4+ T cells to interleukin (IL)-12; however, the ability of uninfected and infected ICSBP-deficient mice to produce IL-12 was markedly impaired. This indicates that ICSBP is a deciding factor in Th responses governing humoral and cellular immunity through its role in regulating IL-12 expression. PMID:9348311

  11. RNA Synthesis in Cells Infected with Herpes Simplex Virus VII. Control of Transcription and of Transcript Abundancies of Unique and Common Sequences of Herpes Simplex Virus 1 and 2

    PubMed Central

    Frenkel, Niza; Silverstein, Saul; Cassai, Enzo; Roizman, Bernard

    1973-01-01

    Analysis of the kinetics of hybridization in liquid of labeled herpes simplex virus (HSV) 1 and 2 DNAs with excess unlabeled RNA extracted at 2 (early) and 8 (late) h postinfection revealed the following. (i) The RNA transcripts present in the HSV-1-infected cells at 2 and 8 h postinfection are complementary to 44 and 48% of HSV-1 DNA. The RNA transcripts present in the HSV-2-infected cells at 2 and 8 h postinfection are complementary to 21 and 50% of HSV-2 DNA. (ii) The transcripts present in 2-h HSV-1- or HSV-2-infected cells treated with cycloheximide are complementary to 44 and 45% of the respective DNAs. (iii) The RNA transcripts present in the HSV-1-infected cells at 2 h postinfection and in HSV-2-infected cells at 8 h postinfection form 2 classes, abundant and scarce, differing in molar concentrations. The RNA transcripts present in the HSV-2-infected cells at 2 h postinfection form only one abundance class. (iv) The transcripts present in the HSV-1-infected cells at 8 h postinfection are complementary to 24% of HSV-2 DNA and therefore 50% of the transcribed HSV-1 sequences are shared by the two viruses. Of the RNA sequences complementary to HSV-2 DNA, 13% arise from HSV-1 templates specifying abundant RNA and 11% arise from HSV-1 templates specifying scarce RNA. Thus, the DNA sequences shared in common by HSV-1 and HSV-2 DNAs constitute 71% of the HSV-1 templates specifying abundant RNA and 39% of sequences specifying scarce RNA. PMID:4351458

  12. Analysis of ORF5 and Full-Length Genome Sequences of Porcine Reproductive and Respiratory Syndrome Virus Isolates of Genotypes 1 and 2 Retrieved Worldwide Provides Evidence that Recombination Is a Common Phenomenon and May Produce Mosaic Isolates

    PubMed Central

    Martín-Valls, G. E.; Kvisgaard, L. K.; Tello, M.; Darwich, L.; Cortey, M.; Burgara-Estrella, A. J.; Hernández, J.; Larsen, L. E.

    2014-01-01

    ABSTRACT Recombination is currently recognized as a factor for high genetic diversity, but the frequency of such recombination events and the genome segments involved are not well known. In the present study, we initially focused on the detection of recombinant porcine reproductive and respiratory syndrome virus (PRRSV) isolates by examining previously published data sets of ORF5 sequences (genotypes 1 and 2) obtained worldwide. We then examined full-length genome sequences in order to determine potential recombination breakpoints along the viral genome. For ORF5, 11 sets of genotype 1 sequences from different geographical areas, including 2 Asian, 1 American, and 7 European regions, and three sets of genotype 2, including sets from China, Mexico, and the United States, were analyzed separately. Potential recombination breakpoints were detected in 10/11 genotype 1 sets, including 9 cases in which the clustering of at least one isolate was different before and after the breakpoints. In genotype 2, potential breakpoints and different tree clustering of at least one strain before and after the breakpoint were observed in 2 out of 3 sets. The results indicated that most of the ORF5 data sets contained at least one recombinant sequence. When the full-length genome sequences were examined, both genotype 1 and 2 sets presented breakpoints (10 and 9, respectively), resulting in significantly different topologies before and after the breakpoints. Mosaic genomes were detected in genotype 1 sequences. These results may have significant implications for the understanding of the molecular epidemiology of PRRSV. IMPORTANCE PRRSV is one of the most important viruses affecting swine production worldwide, causing big economic losses and sanitary problems. One of the key questions on PRRSV arises from its genetic diversity, which is thought to have a direct impact on immunobiology, epidemiology, diagnosis, and vaccine efficacy. One of the causes of this genetic diversity is

  13. Clays, common

    USGS Publications Warehouse

    Virta, R.L.

    1998-01-01

    Part of a special section on the state of industrial minerals in 1997. The state of the common clay industry worldwide for 1997 is discussed. Sales of common clay in the U.S. increased from 26.2 Mt in 1996 to an estimated 26.5 Mt in 1997. The amount of common clay and shale used to produce structural clay products in 1997 was estimated at 13.8 Mt.

  14. Accelerated Evolution of Conserved Noncoding Sequences in theHuman Genome

    SciTech Connect

    Prambhakar, Shyam; Noonan, James P.; Paabo, Svante; Rubin, EdwardM.

    2006-07-06

    Genomic comparisons between human and distant, non-primatemammals are commonly used to identify cis-regulatory elements based onconstrained sequence evolution. However, these methods fail to detect"cryptic" functional elements, which are too weakly conserved amongmammals to distinguish from nonfunctional DNA. To address this problem,we explored the potential of deep intra-primate sequence comparisons. Wesequenced the orthologs of 558 kb of human genomic sequence, coveringmultiple loci involved in cholesterol homeostasis, in 6 nonhumanprimates. Our analysis identified 6 noncoding DNA elements displayingsignificant conservation among primates, but undetectable in more distantcomparisons. In vitro and in vivo tests revealed that at least three ofthese 6 elements have regulatory function. Notably, the mouse orthologsof these three functional human sequences had regulatory activity despitetheir lack of significant sequence conservation, indicating that they arecryptic ancestral cis-regulatory elements. These regulatory elementscould still be detected in a smaller set of three primate speciesincluding human, rhesus and marmoset. Since the human and rhesus genomesequences are already available, and the marmoset genome is activelybeing sequenced, the primate-specific conservation analysis describedhere can be applied in the near future on a whole-genome scale, tocomplement the annotation provided by more distant speciescomparisons.

  15. Student Commons

    ERIC Educational Resources Information Center

    Gordon, Douglas

    2010-01-01

    Student commons are no longer simply congregation spaces for students with time on their hands. They are integral to providing a welcoming environment and effective learning space for students. Many student commons have been transformed into spaces for socialization, an environment for alternative teaching methods, a forum for large group meetings…

  16. Genetic relatedness of Clostridium difficile isolates from various origins determined by triple-locus sequence analysis based on toxin regulatory genes tcdC, tcdR, and cdtR.

    PubMed

    Bouvet, Philippe J M; Popoff, Michel R

    2008-11-01

    A triple-locus nucleotide sequence analysis based on toxin regulatory genes tcdC, tcdR and cdtR was initiated to assess the sequence variability of these genes among Clostridium difficile isolates and to study the genetic relatedness between isolates. A preliminary investigation of the variability of the tcdC gene was done with 57 clinical and veterinary isolates. Twenty-three isolates representing nine main clusters were selected for tcdC, tcdR, and cdtR analysis. The numbers of alleles found for tcdC, tcdR and cdtR were nine, six, and five, respectively. All strains possessed the cdtR gene except toxin A-negative toxin B-positive variants. All but one binary toxin CDT-positive isolate harbored a deletion (>1 bp) in the tcdC gene. The combined analyses of the three genes allowed us to distinguish five lineages correlated with the different types of deletion in tcdC, i.e., 18 bp (associated or not with a deletion at position 117), 36 bp, 39 bp, and 54 bp, and with the wild-type tcdC (no deletion). The tcdR and tcdC genes, though located within the same pathogenicity locus, were found to have evolved separately. Coevolution of the three genes was noted only with strains harboring a 39-bp or a 54-bp deletion in tcdC that formed two homogeneous, separate divergent clusters. Our study supported the existence of the known clones (PCR ribotype 027 isolates and toxin A-negative toxin B-positive C. difficile variants) and evidence for clonality of isolates with a 39-bp deletion (toxinotype V, PCR ribotype 078) that are frequently isolated worldwide from human infections and from food animals. PMID:18832125

  17. Common cold

    MedlinePlus

    ... are the most common reason that children miss school and parents miss work. Parents often get colds ... other children. A cold can spread quickly through schools or daycares. Colds can occur at any time ...

  18. The distribution of SNPs in human gene regulatory regions

    PubMed Central

    Guo, Yongjian; Jamison, D Curtis

    2005-01-01

    Background As a result of high-throughput genotyping methods, millions of human genetic variants have been reported in recent years. To efficiently identify those with significant biological functions, a practical strategy is to concentrate on variants located in important sequence regions such as gene regulatory regions. Results Analysis of the most common type of variant, single nucleotide polymorphisms (SNPs), shows that in gene promoter regions more SNPs occur in close proximity to transcriptional start sites than in regions further upstream, and a disproportionate number of those SNPs represent nucleotide transversions. Additionally, the number of SNPs found in the predicted transcription factor binding sites is higher than in non-binding site sequences. Conclusion Current information about transcription factor binding site sequence patterns may not be exhaustive, and SNPs may be actively involved in influencing gene expression by affecting the transcription factor binding sites. PMID:16209714

  19. Regulation of next generation sequencing.

    PubMed

    Javitt, Gail H; Carner, Katherine Strong

    2014-01-01

    Next generation sequencing raises new questions within the context of an existing and still evolving regulatory landscape for device manufacturers and clinical laboratories. FDA cleared the first NGS sequencing platform in November 2013, but it is unclear what lies ahead for this technology. NGS will require new types of training and expertise to interpret the vast quantities of genetic data so as to provide meaningful clinical information to physicians and patients. This paper will describe the current regulatory landscape for NGS technologies, identify the regulatory challenges they present, and consider whether new regulatory paradigms are needed to accommodate NGS technologies and services. PMID:25298288

  20. Qualitative De Novo Analysis of Full Length cDNA and Quantitative Analysis of Gene Expression for Common Marmoset (Callithrix jacchus) Transcriptomes Using Parallel Long-Read Technology and Short-Read Sequencing

    PubMed Central

    Uno, Yasuhiro; Uehara, Shotaro; Inoue, Takashi; Murayama, Norie; Onodera, Jun; Sasaki, Erika; Yamazaki, Hiroshi

    2014-01-01

    The common marmoset (Callithrix jacchus) is a non-human primate that could prove useful as human pharmacokinetic and biomedical research models. The cytochromes P450 (P450s) are a superfamily of enzymes that have critical roles in drug metabolism and disposition via monooxygenation of a broad range of xenobiotics; however, information on some marmoset P450s is currently limited. Therefore, identification and quantitative analysis of tissue-specific mRNA transcripts, including those of P450s and flavin-containing monooxygenases (FMO, another monooxygenase family), need to be carried out in detail before the marmoset can be used as an animal model in drug development. De novo assembly and expression analysis of marmoset transcripts were conducted with pooled liver, intestine, kidney, and brain samples from three male and three female marmosets. After unique sequences were automatically aligned by assembling software, the mean contig length was 718 bp (with a standard deviation of 457 bp) among a total of 47,883 transcripts. Approximately 30% of the total transcripts were matched to known marmoset sequences. Gene expression in 18 marmoset P450- and 4 FMO-like genes displayed some tissue-specific patterns. Of these, the three most highly expressed in marmoset liver were P450 2D-, 2E-, and 3A-like genes. In extrahepatic tissues, including brain, gene expressions of these monooxygenases were lower than those in liver, although P450 3A4 (previously P450 3A21) in intestine and P450 4A11- and FMO1-like genes in kidney were relatively highly expressed. By means of massive parallel long-read sequencing and short-read technology applied to marmoset liver, intestine, kidney, and brain, the combined next-generation sequencing analyses reported here were able to identify novel marmoset drug-metabolizing P450 transcripts that have until now been little reported. These results provide a foundation for mechanistic studies and pave the way for the use of marmosets as model animals

  1. RNA Sequencing Analysis of the msl2msl3, crl, and ggps1 Mutants Indicates that Diverse Sources of Plastid Dysfunction Do Not Alter Leaf Morphology Through a Common Signaling Pathway

    PubMed Central

    Luesse, Darron R.; Wilson, Margaret E.; Haswell, Elizabeth S.

    2015-01-01

    Determining whether individual genes function in the same or in different pathways is an important aspect of genetic analysis. As an alternative to the construction of higher-order mutants, we used contemporary expression profiling methods to perform pathway analysis on several Arabidopsis thaliana mutants, including the mscS-like (msl)2msl3 double mutant. MSL2 and MSL3 are implicated in plastid ion homeostasis, and msl2msl3 double mutants exhibit leaves with a lobed periphery, a rumpled surface, and disturbed mesophyll cell organization. Similar developmental phenotypes are also observed in other mutants with defects in a range of other chloroplast or mitochondrial functions, including biogenesis, gene expression, and metabolism. We wished to test the hypothesis that the common leaf morphology phenotypes of these mutants are the result of a characteristic nuclear expression pattern that is generated in response to organelle dysfunction. RNA-Sequencing was performed on aerial tissue of msl2msl3 geranylgeranyl diphosphate synthase 1 (ggps1), and crumpled leaf (crl) mutants. While large groups of co-expressed genes were identified in pairwise comparisons between genotypes, we were only able to identify a small set of genes that showed similar expression profiles in all three genotypes. Subsequent comparison to the previously published gene expression profiles of two other mutants, yellow variegated 2 (var2) and scabra3 (sca3), failed to reveal a common pattern of gene expression associated with superficially similar leaf morphology defects. Nor did we observe overlap between genes differentially expressed in msl2msl3, crl, and ggps1 and a previously identified retrograde core response module. These data suggest that a common retrograde signaling pathway initiated by organelle dysfunction either does not exist in these mutants or cannot be identified through transcriptomic methods. Instead, the leaf developmental defects observed in these mutants may be achieved

  2. The miRNAs and their regulatory networks responsible for pollen abortion in Ogura-CMS Chinese cabbage revealed by high-throughput sequencing of miRNAs, degradomes, and transcriptomes

    PubMed Central

    Wei, Xiaochun; Zhang, Xiaohui; Yao, Qiuju; Yuan, Yuxiang; Li, Xixiang; Wei, Fang; Zhao, Yanyan; Zhang, Qiang; Wang, Zhiyong; Jiang, Wusheng; Zhang, Xiaowei

    2015-01-01

    Chinese cabbage (Brassica rapa ssp. pekinensis) is one of the most important vegetables in Asia and is cultivated across the world. Ogura-type cytoplasmic male sterility (Ogura-CMS) has been widely used in the hybrid breeding industry for Chinese cabbage and many other cruciferous vegetables. Although, the cause of Ogura-CMS has been localized to the orf138 locus in the mitochondrial genome, however, the mechanism by which nuclear genes respond to the mutation of the mitochondrial orf138 locus is unclear. In this study, a series of whole genome small RNA, degradome and transcriptome analyses were performed on both Ogura-CMS and its maintainer Chinese cabbage buds using deep sequencing technology. A total of 289 known miRNAs derived from 69 families (including 23 new families first reported in B. rapa) and 426 novel miRNAs were identified. Among these novel miRNAs, both 3-p and 5-p miRNAs were detected on the hairpin arms of 138 precursors. Ten known and 49 novel miRNAs were down-regulated, while one known and 27 novel miRNAs were up-regulated in Ogura-CMS buds compared to the fertile plants. Using degradome analysis, a total of 376 mRNAs were identified as targets of 30 known miRNA families and 100 novel miRNAs. A large fraction of the targets were annotated as reproductive development related. Our transcriptome profiling revealed that the expression of the targets was finely tuned by the miRNAs. Two novel miRNAs were identified that were specifically highly expressed in Ogura-CMS buds and sufficiently suppressed two pollen development essential genes: sucrose transporter SUC1 and H+-ATPase 6. These findings provide clues for the contribution of a potential miRNA regulatory network to bud development and pollen engenderation. This study contributes new insights to the communication between the mitochondria and chromosome and takes one step toward filling the gap in the regulatory network from the orf138 locus to pollen abortion in Ogura-CMS plants from a mi

  3. Common-Cause Failure Analysis in Event Assessment

    SciTech Connect

    Dana L. Kelly; Dale M. Rasmuson

    2008-09-01

    This paper describes the approach taken by the U. S. Nuclear Regulatory Commission to the treatment of common-cause failure in probabilistic risk assessment of operational events. The approach is based upon the Basic Parameter Model for common-cause failure, and examples are illustrated using the alpha-factor parameterization, the approach adopted by the NRC in their Standardized Plant Analysis Risk (SPAR) models. The cases of a failed component (with and without shared common-cause failure potential) and a component being unavailable due to preventive maintenance or testing are addressed. The treatment of two related failure modes (e.g., failure to start and failure to run) is a new feature of this paper. These methods are being applied by the NRC in assessing the risk significance of operational events for the Significance Determination Process (SDP) and the Accident Sequence Precursor (ASP) program.

  4. 77 FR 8020 - Semiannual Regulatory Agenda

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-02-13

    ... Agenda for the agency. SBA's last semiannual regulatory agenda was published on July 7, 2011, at 76 FR... Administration--Completed Actions Regulation Sequence No. Title Identifier No. 474 Implementation of Military... individual SBDC client data. Timetable: Action Date FR Cite ANPRM 06/00/12 Regulatory Flexibility...

  5. Analysis of common bean expressed sequence tags identifies sulfur metabolic pathways active in seed and sulfur-rich proteins highly expressed in the absence of phaseolin and major lectins

    PubMed Central

    2011-01-01

    Background A deficiency in phaseolin and phytohemagglutinin is associated with a near doubling of sulfur amino acid content in genetically related lines of common bean (Phaseolus vulgaris), particularly cysteine, elevated by 70%, and methionine, elevated by 10%. This mostly takes place at the expense of an abundant non-protein amino acid, S-methyl-cysteine. The deficiency in phaseolin and phytohemagglutinin is mainly compensated by increased levels of the 11S globulin legumin and residual lectins. Legumin, albumin-2, defensin and albumin-1 were previously identified as contributing to the increased sulfur amino acid content in the mutant line, on the basis of similarity to proteins from other legumes. Results Profiling of free amino acid in developing seeds of the BAT93 reference genotype revealed a biphasic accumulation of gamma-glutamyl-S-methyl-cysteine, the main soluble form of S-methyl-cysteine, with a lag phase occurring during storage protein accumulation. A collection of 30,147 expressed sequence tags (ESTs) was generated from four developmental stages, corresponding to distinct phases of gamma-glutamyl-S-methyl-cysteine accumulation, and covering the transitions to reserve accumulation and dessication. Analysis of gene ontology categories indicated the occurrence of multiple sulfur metabolic pathways, including all enzymatic activities responsible for sulfate assimilation, de novo cysteine and methionine biosynthesis. Integration of genomic and proteomic data enabled the identification and isolation of cDNAs coding for legumin, albumin-2, defensin D1 and albumin-1A and -B induced in the absence of phaseolin and phytohemagglutinin. Their deduced amino acid sequences have a higher content of cysteine than methionine, providing an explanation for the preferential increase of cysteine in the mutant line. Conclusion The EST collection provides a foundation to further investigate sulfur metabolism and the differential accumulation of sulfur amino acids in seed

  6. Making the Common Good Common

    ERIC Educational Resources Information Center

    Chase, Barbara

    2011-01-01

    How are independent schools to be useful to the wider world? Beyond their common commitment to educate their students for meaningful lives in service of the greater good, can they educate a broader constituency and, thus, share their resources and skills more broadly? Their answers to this question will be shaped by their independence. Any…

  7. Combinatorial Gene Regulatory Functions Underlie Ultraconserved Elements in Drosophila

    PubMed Central

    Warnefors, Maria; Hartmann, Britta; Thomsen, Stefan; Alonso, Claudio R.

    2016-01-01

    Ultraconserved elements (UCEs) are discrete genomic elements conserved across large evolutionary distances. Although UCEs have been linked to multiple facets of mammalian gene regulation their extreme evolutionary conservation remains largely unexplained. Here, we apply a computational approach to investigate this question in Drosophila, exploring the molecular functions of more than 1,500 UCEs shared across the genomes of 12 Drosophila species. Our data indicate that Drosophila UCEs are hubs for gene regulatory functions and suggest that UCE sequence invariance originates from their combinatorial roles in gene control. We also note that the gene regulatory roles of intronic and intergenic UCEs (iUCEs) are distinct from those found in exonic UCEs (eUCEs). In iUCEs, transcription factor (TF) and epigenetic factor binding data strongly support iUCE roles in transcriptional and epigenetic regulation. In contrast, analyses of eUCEs indicate that they are two orders of magnitude more likely than the expected to simultaneously include protein-coding sequence, TF-binding sites, splice sites, and RNA editing sites but have reduced roles in transcriptional or epigenetic regulation. Furthermore, we use a Drosophila cell culture system and transgenic Drosophila embryos to validate the notion of UCE combinatorial regulatory roles using an eUCE within the Hox gene Ultrabithorax and show that its protein-coding region also contains alternative splicing regulatory information. Taken together our experiments indicate that UCEs emerge as a result of combinatorial gene regulatory roles and highlight common features in mammalian and insect UCEs implying that similar processes might underlie ultraconservation in diverse animal taxa. PMID:27247329

  8. Combinatorial Gene Regulatory Functions Underlie Ultraconserved Elements in Drosophila.

    PubMed

    Warnefors, Maria; Hartmann, Britta; Thomsen, Stefan; Alonso, Claudio R

    2016-09-01

    Ultraconserved elements (UCEs) are discrete genomic elements conserved across large evolutionary distances. Although UCEs have been linked to multiple facets of mammalian gene regulation their extreme evolutionary conservation remains largely unexplained. Here, we apply a computational approach to investigate this question in Drosophila, exploring the molecular functions of more than 1,500 UCEs shared across the genomes of 12 Drosophila species. Our data indicate that Drosophila UCEs are hubs for gene regulatory functions and suggest that UCE sequence invariance originates from their combinatorial roles in gene control. We also note that the gene regulatory roles of intronic and intergenic UCEs (iUCEs) are distinct from those found in exonic UCEs (eUCEs). In iUCEs, transcription factor (TF) and epigenetic factor binding data strongly support iUCE roles in transcriptional and epigenetic regulation. In contrast, analyses of eUCEs indicate that they are two orders of magnitude more likely than the expected to simultaneously include protein-coding sequence, TF-binding sites, splice sites, and RNA editing sites but have reduced roles in transcriptional or epigenetic regulation. Furthermore, we use a Drosophila cell culture system and transgenic Drosophila embryos to validate the notion of UCE combinatorial regulatory roles using an eUCE within the Hox gene Ultrabithorax and show that its protein-coding region also contains alternative splicing regulatory information. Taken together our experiments indicate that UCEs emerge as a result of combinatorial gene regulatory roles and highlight common features in mammalian and insect UCEs implying that similar processes might underlie ultraconservation in diverse animal taxa. PMID:27247329

  9. A genomic regulatory network for development

    NASA Technical Reports Server (NTRS)

    Davidson, Eric H.; Rast, Jonathan P.; Oliveri, Paola; Ransick, Andrew; Calestani, Cristina; Yuh, Chiou-Hwa; Minokawa, Takuya; Amore, Gabriele; Hinman, Veronica; Arenas-Mena, Cesar; Otim, Ochan; Brown, C. Titus; Livi, Carolina B.; Lee, Pei Yun; Revilla, Roger; Rust, Alistair G.; Pan, Zheng jun; Schilstra, Maria J.; Clarke, Peter J C.; Arnone, Maria I.; Rowen, Lee; Cameron, R. Andrew; McClay, David R.; Hood, Leroy; Bolouri, Hamid

    2002-01-01

    Development of the body plan is controlled by large networks of regulatory genes. A gene regulatory network that controls the specification of endoderm and mesoderm in the sea urchin embryo is summarized here. The network was derived from large-scale perturbation analyses, in combination with computational methodologies, genomic data, cis-regulatory analysis, and molecular embryology. The network contains over 40 genes at present, and each node can be directly verified at the DNA sequence level by cis-regulatory analysis. Its architecture reveals specific and general aspects of development, such as how given cells generate their ordained fates in the embryo and why the process moves inexorably forward in developmental time.

  10. Regulatory Physiology

    NASA Technical Reports Server (NTRS)

    Lane, Helen W.; Whitson, Peggy A.; Putcha, Lakshmi; Baker, Ellen; Smith, Scott M.; Stewart, Karen; Gretebeck, Randall; Nimmagudda, R. R.; Schoeller, Dale A.; Davis-Street, Janis

    1999-01-01

    As noted elsewhere in this report, a central goal of the Extended Duration Orbiter Medical Project (EDOMP) was to ensure that cardiovascular and muscle function were adequate to perform an emergency egress after 16 days of spaceflight. The goals of the Regulatory Physiology component of the EDOMP were to identify and subsequently ameliorate those biochemical and nutritional factors that deplete physiological reserves or increase risk for disease, and to facilitate the development of effective muscle, exercise, and cardiovascular countermeasures. The component investigations designed to meet these goals focused on biochemical and physiological aspects of nutrition and metabolism, the risk of renal (kidney) stone formation, gastrointestinal function, and sleep in space. Investigations involved both ground-based protocols to validate proposed methods and flight studies to test those methods. Two hardware tests were also completed.

  11. Regulatory Anatomy

    PubMed Central

    2015-01-01

    This article proposes the term “safety logics” to understand attempts within the European Union (EU) to harmonize member state legislation to ensure a safe and stable supply of human biological material for transplants and transfusions. With safety logics, I refer to assemblages of discourses, legal documents, technological devices, organizational structures, and work practices aimed at minimizing risk. I use this term to reorient the analytical attention with respect to safety regulation. Instead of evaluating whether safety is achieved, the point is to explore the types of “safety” produced through these logics as well as to consider the sometimes unintended consequences of such safety work. In fact, the EU rules have been giving rise to complaints from practitioners finding the directives problematic and inadequate. In this article, I explore the problems practitioners face and why they arise. In short, I expose the regulatory anatomy of the policy landscape. PMID:26139952

  12. 47 CFR 27.10 - Regulatory status.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... 47 Telecommunication 2 2012-10-01 2012-10-01 false Regulatory status. 27.10 Section 27.10 Telecommunication FEDERAL COMMUNICATIONS COMMISSION (CONTINUED) COMMON CARRIER SERVICES MISCELLANEOUS WIRELESS COMMUNICATIONS SERVICES Applications and Licenses § 27.10 Regulatory status. The following rules apply...

  13. 47 CFR 27.10 - Regulatory status.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... 47 Telecommunication 2 2014-10-01 2014-10-01 false Regulatory status. 27.10 Section 27.10 Telecommunication FEDERAL COMMUNICATIONS COMMISSION (CONTINUED) COMMON CARRIER SERVICES MISCELLANEOUS WIRELESS COMMUNICATIONS SERVICES Applications and Licenses § 27.10 Regulatory status. The following rules apply...

  14. 47 CFR 27.10 - Regulatory status.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... 47 Telecommunication 2 2013-10-01 2013-10-01 false Regulatory status. 27.10 Section 27.10 Telecommunication FEDERAL COMMUNICATIONS COMMISSION (CONTINUED) COMMON CARRIER SERVICES MISCELLANEOUS WIRELESS COMMUNICATIONS SERVICES Applications and Licenses § 27.10 Regulatory status. The following rules apply...

  15. 47 CFR 27.10 - Regulatory status.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... 47 Telecommunication 2 2011-10-01 2011-10-01 false Regulatory status. 27.10 Section 27.10 Telecommunication FEDERAL COMMUNICATIONS COMMISSION (CONTINUED) COMMON CARRIER SERVICES MISCELLANEOUS WIRELESS COMMUNICATIONS SERVICES Applications and Licenses § 27.10 Regulatory status. The following rules apply...

  16. 47 CFR 27.10 - Regulatory status.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... 47 Telecommunication 2 2010-10-01 2010-10-01 false Regulatory status. 27.10 Section 27.10 Telecommunication FEDERAL COMMUNICATIONS COMMISSION (CONTINUED) COMMON CARRIER SERVICES MISCELLANEOUS WIRELESS COMMUNICATIONS SERVICES Applications and Licenses § 27.10 Regulatory status. The following rules apply...

  17. Noncoding Regulatory RNAs in Hematopoiesis.

    PubMed

    Jeong, M; Goodell, M A

    2016-01-01

    Hematopoiesis is a dynamic process in which blood cells are continuously generated from hematopoietic stem cells (HSCs). The regulatory mechanisms controlling HSC fate have been studied extensively over the past several decades. Although many protein-coding genes have been shown to regulate hematopoietic differentiation, additional levels of HSC regulation are not well studied. Advances in deep sequencing have revealed many new classes of regulatory noncoding RNAs (ncRNAs), such as enhancer RNAs and antisense ncRNAs. Functional analysis of some of these ncRNAs has provided insights into the molecular mechanisms that regulate hematopoietic development and disease. In this review, we summarize recent advances in our understanding of functional regulatory ncRNAs associated with hematopoietic self-renewal and differentiation, as well as those dysregulated ncRNAs involved in hematologic malignancies. PMID:27137659

  18. Plant nitrogen regulatory P-PII genes

    DOEpatents

    Coruzzi, Gloria M.; Lam, Hon-Ming; Hsieh, Ming-Hsiun

    2001-01-01

    The present invention generally relates to plant nitrogen regulatory PII gene (hereinafter P-PII gene), a gene involved in regulating plant nitrogen metabolism. The invention provides P-PII nucleotide sequences, expression constructs comprising said nucleotide sequences, and host cells and plants having said constructs and, optionally expressing the P-PII gene from said constructs. The invention also provides substantially pure P-PII proteins. The P-PII nucleotide sequences and constructs of the

  19. Automated DNA sequencing.

    PubMed

    Wallis, Yvonne; Morrell, Natalie

    2011-01-01

    Fluorescent cycle sequencing of PCR products is a multistage process and several methodologies are available to perform each stage. This chapter will describe the more commonly utilised dye-terminator cycle sequencing approach using BigDye® terminator chemistry (Applied Biosystems) ready for analysis on a 3730 DNA genetic analyzer. Even though DNA sequencing is one of the most common and robust techniques performed in molecular laboratories it may not always produce desirable results. The causes of the most common problems will also be discussed in this chapter. PMID:20938839

  20. Discovering cis-regulatory RNAs in Shewanella genomes by Support Vector Machines.

    PubMed

    Xu, Xing; Ji, Yongmei; Stormo, Gary D

    2009-04-01

    An increasing number of cis-regulatory RNA elements have been found to regulate gene expression post-transcriptionally in various biological processes in bacterial systems. Effective computational tools for large-scale identification of novel regulatory RNAs are strongly desired to facilitate our exploration of gene regulation mechanisms and regulatory networks. We present a new computational program named RSSVM (RNA Sampler+Support Vector Machine), which employs Support Vector Machines (SVMs) for efficient identification of functional RNA motifs from random RNA secondary structures. RSSVM uses a set of distinctive features to represent the common RNA secondary structure and structural alignment predicted by RNA Sampler, a tool for accurate common RNA secondary structure prediction, and is trained with functional RNAs from a variety of bacterial RNA motif/gene families covering a wide range of sequence identities. When tested on a large number of known and random RNA motifs, RSSVM shows a significantly higher sensitivity than other leading RNA identification programs while maintaining the same false positive rate. RSSVM performs particularly well on sets with low sequence identities. The combination of RNA Sampler and RSSVM provides a new, fast, and efficient pipeline for large-scale discovery of regulatory RNA motifs. We applied RSSVM to multiple Shewanella genomes and identified putative regulatory RNA motifs in the 5' untranslated regions (UTRs) in S. oneidensis, an important bacterial organism with extraordinary respiratory and metal reducing abilities and great potential for bioremediation and alternative energy generation. From 1002 sets of 5'-UTRs of orthologous operons, we identified 166 putative regulatory RNA motifs, including 17 of the 19 known RNA motifs from Rfam, an additional 21 RNA motifs that are supported by literature evidence, 72 RNA motifs overlapping predicted transcription terminators or attenuators, and other candidate regulatory RNA motifs

  1. The limits of regulatory toxicology

    SciTech Connect

    Carrington, Clark D.; Bolger, P. Michael

    2010-03-01

    The Acceptable Daily Intake (ADI) has been used by regulatory and public health organizations (e.g., the U.S. Food and Drug and Administration, and the World Health Organization) for chemicals for more than 50 years. The ADI concept was also initially employed at the U.S. Environmental Protection Agency at its inception in 1971, although with the adoption of newer terminology, it later became known as the Reference Dose (RfD). It is clear from the literature that both were first devised as instruments of regulatory policy. In the intervening years, it has become common to use language that implies that these standards are statements of scientific fact. Similarly, some of the discretionary or default values that are used to derive regulatory standards are represented as scientific assumptions when in fact they also represent regulatory policy. This confusion impedes both the best use of the available science and informed public participation in policy making. In addition, the misconception of the ADI or the RfD as statements of scientific fact may impede the consideration of alternative means to reduce exposure to chemicals that may be harmful, including regulatory measures that do not involve prescribing a regulatory concentration limit.

  2. The common ancestry of life

    PubMed Central

    2010-01-01

    Background It is common belief that all cellular life forms on earth have a common origin. This view is supported by the universality of the genetic code and the universal conservation of multiple genes, particularly those that encode key components of the translation system. A remarkable recent study claims to provide a formal, homology independent test of the Universal Common Ancestry hypothesis by comparing the ability of a common-ancestry model and a multiple-ancestry model to predict sequences of universally conserved proteins. Results We devised a computational experiment on a concatenated alignment of universally conserved proteins which shows that the purported demonstration of the universal common ancestry is a trivial consequence of significant sequence similarity between the analyzed proteins. The nature and origin of this similarity are irrelevant for the prediction of "common ancestry" of by the model-comparison approach. Thus, homology (common origin) of the compared proteins remains an inference from sequence similarity rather than an independent property demonstrated by the likelihood analysis. Conclusion A formal demonstration of the Universal Common Ancestry hypothesis has not been achieved and is unlikely to be feasible in principle. Nevertheless, the evidence in support of this hypothesis provided by comparative genomics is overwhelming. Reviewers this article was reviewed by William Martin, Ivan Iossifov (nominated by Andrey Rzhetsky) and Arcady Mushegian. For the complete reviews, see the Reviewers' Report section. PMID:21087490

  3. Indexing Similar DNA Sequences

    NASA Astrophysics Data System (ADS)

    Huang, Songbo; Lam, T. W.; Sung, W. K.; Tam, S. L.; Yiu, S. M.

    To study the genetic variations of a species, one basic operation is to search for occurrences of patterns in a large number of very similar genomic sequences. To build an indexing data structure on the concatenation of all sequences may require a lot of memory. In this paper, we propose a new scheme to index highly similar sequences by taking advantage of the similarity among the sequences. To store r sequences with k common segments, our index requires only O(n + NlogN) bits of memory, where n is the total length of the common segments and N is the total length of the distinct regions in all texts. The total length of all sequences is rn + N, and any scheme to store these sequences requires Ω(n + N) bits. Searching for a pattern P of length m takes O(m + m logN + m log(rk)psc(P) + occlogn), where psc(P) is the number of prefixes of P that appear as a suffix of some common segments and occ is the number of occurrences of P in all sequences. In practice, rk ≤ N, and psc(P) is usually a small constant. We have implemented our solution and evaluated our solution using real DNA sequences. The experiments show that the memory requirement of our solution is much less than that required by BWT built on the concatenation of all sequences. When compared to the other existing solution (RLCSA), we use less memory with faster searching time.

  4. Common Cause Failure Modeling

    NASA Technical Reports Server (NTRS)

    Hark, Frank; Britton, Paul; Ring, Rob; Novack, Steven D.

    2015-01-01

    Common Cause Failures (CCFs) are a known and documented phenomenon that defeats system redundancy. CCFS are a set of dependent type of failures that can be caused by: system environments; manufacturing; transportation; storage; maintenance; and assembly, as examples. Since there are many factors that contribute to CCFs, the effects can be reduced, but they are difficult to eliminate entirely. Furthermore, failure databases sometimes fail to differentiate between independent and CCF (dependent) failure and data is limited, especially for launch vehicles. The Probabilistic Risk Assessment (PRA) of NASA's Safety and Mission Assurance Directorate at Marshall Space Flight Center (MFSC) is using generic data from the Nuclear Regulatory Commission's database of common cause failures at nuclear power plants to estimate CCF due to the lack of a more appropriate data source. There remains uncertainty in the actual magnitude of the common cause risk estimates for different systems at this stage of the design. Given the limited data about launch vehicle CCF and that launch vehicles are a highly redundant system by design, it is important to make design decisions to account for a range of values for independent and CCFs. When investigating the design of the one-out-of-two component redundant system for launch vehicles, a response surface was constructed to represent the impact of the independent failure rate versus a common cause beta factor effect on a system's failure probability. This presentation will define a CCF and review estimation calculations. It gives a summary of reduction methodologies and a review of examples of historical CCFs. Finally, it presents the response surface and discusses the results of the different CCFs on the reliability of a one-out-of-two system.

  5. Common Cause Failure Modeling

    NASA Technical Reports Server (NTRS)

    Hark, Frank; Britton, Paul; Ring, Rob; Novack, Steven D.

    2016-01-01

    Common Cause Failures (CCFs) are a known and documented phenomenon that defeats system redundancy. CCFS are a set of dependent type of failures that can be caused by: system environments; manufacturing; transportation; storage; maintenance; and assembly, as examples. Since there are many factors that contribute to CCFs, the effects can be reduced, but they are difficult to eliminate entirely. Furthermore, failure databases sometimes fail to differentiate between independent and CCF (dependent) failure and data is limited, especially for launch vehicles. The Probabilistic Risk Assessment (PRA) of NASA's Safety and Mission Assurance Directorate at Marshal Space Flight Center (MFSC) is using generic data from the Nuclear Regulatory Commission's database of common cause failures at nuclear power plants to estimate CCF due to the lack of a more appropriate data source. There remains uncertainty in the actual magnitude of the common cause risk estimates for different systems at this stage of the design. Given the limited data about launch vehicle CCF and that launch vehicles are a highly redundant system by design, it is important to make design decisions to account for a range of values for independent and CCFs. When investigating the design of the one-out-of-two component redundant system for launch vehicles, a response surface was constructed to represent the impact of the independent failure rate versus a common cause beta factor effect on a system's failure probability. This presentation will define a CCF and review estimation calculations. It gives a summary of reduction methodologies and a review of examples of historical CCFs. Finally, it presents the response surface and discusses the results of the different CCFs on the reliability of a one-out-of-two system.

  6. Divergent allosteric patterns verify the regulatory paradigm for aspartate transcarbamylase.

    PubMed

    Wales, M E; Madison, L L; Glaser, S S; Wild, J R

    1999-12-17

    The native Escherichia coli aspartate transcarbamoylase (ATCase, E.C. 2.1.3.2) provides a classic allosteric model for the feedback inhibition of a biosynthetic pathway by its end products. Both E. coli and Erwinia herbicola possess ATCase holoenzymes which are dodecameric (2(c3):3(r2)) with 311 amino acid residues per catalytic monomer and 153 and 154 amino acid residues per regulatory (r) monomer, respectively. While the quaternary structures of the two enzymes are identical, the primary amino acid sequences have diverged by 14 % in the catalytic polypeptide and 20 % in the regulatory polypeptide. The amino acids proposed to be directly involved in the active site and nucleotide binding site are strictly conserved between the two enzymes; nonetheless, the two enzymes differ in their catalytic and regulatory characteristics. The E. coli enzyme has sigmoidal substrate binding with activation by ATP, and inhibition by CTP, while the E. herbicola enzyme has apparent first order kinetics at low substrate concentrations in the absence of allosteric ligands, no ATP activation and only slight CTP inhibition. In an apparently important and highly conserved characteristic, CTP and UTP impose strong synergistic inhibition on both enzymes. The co-operative binding of aspartate in the E. coli enzyme is correlated with a T-to-R conformational transition which appears to be greatly reduced in the E. herbicola enzyme, although the addition of inhibitory heterotropic ligands (CTP or CTP+UTP) re-establishes co-operative saturation kinetics. Hybrid holoenzymes assembled in vivo with catalytic subunits from E. herbicola and regulatory subunits from E. coli mimick the allosteric response of the native E. coli holoenzyme and exhibit ATP activation. The reverse hybrid, regulatory subunits from E. herbicola and catalytic subunits from E. coli, exhibited no response to ATP. The conserved structure and diverged functional characteristics of the E. herbicola enzyme provides an opportunity

  7. The last common bilaterian ancestor

    NASA Technical Reports Server (NTRS)

    Erwin, Douglas H.; Davidson, Eric H.

    2002-01-01

    Many regulatory genes appear to be utilized in at least superficially similar ways in the development of particular body parts in Drosophila and in chordates. These similarities have been widely interpreted as functional homologies, producing the conventional view of the last common protostome-deuterostome ancestor (PDA) as a complex organism that possessed some of the same body parts as modern bilaterians. Here we discuss an alternative view, in which the last common PDA had a less complex body plan than is frequently conceived. This reconstruction alters expectations for Neoproterozoic fossil remains that could illustrate the pathways of bilaterian evolution.

  8. A Provisional Gene Regulatory Atlas for Mouse Heart Development

    PubMed Central

    Chen, Hailin; VanBuren, Vincent

    2014-01-01

    Congenital Heart Disease (CHD) is one of the most common birth defects. Elucidating the molecular mechanisms underlying normal cardiac development is an important step towards early identification of abnormalities during the developmental program and towards the creation of early intervention strategies. We developed a novel computational strategy for leveraging high-content data sets, including a large selection of microarray data associated with mouse cardiac development, mouse genome sequence, ChIP-seq data of selected mouse transcription factors and Y2H data of mouse protein-protein interactions, to infer the active transcriptional regulatory network of mouse cardiac development. We identified phase-specific expression activity for 765 overlapping gene co-expression modules that were defined for obtained cardiac lineage microarray data. For each co-expression module, we identified the phase of cardiac development where gene expression for that module was higher than other phases. Co-expression modules were found to be consistent with biological pathway knowledge in Wikipathways, and met expectations for enrichment of pathways involved in heart lineage development. Over 359,000 transcription factor-target relationships were inferred by analyzing the promoter sequences within each gene module for overrepresentation against the JASPAR database of Transcription Factor Binding Site (TFBS) motifs. The provisional regulatory network will provide a framework of studying the genetic basis of CHD. PMID:24421884

  9. Identification and arrangement of the DNA sequence recognized in specific transformation of Neisseria gonorrhoeae.

    PubMed Central

    Goodman, S D; Scocca, J J

    1988-01-01

    DNA segments from Neisseria gonorrhoeae, cloned and propagated in Escherichia coli, were tested for the ability to competitively inhibit gonococcal transformation. The nucleotide sequences of active segments were determined and compared; these sequences contained the sequence 5' GCCGTCTGAA 3' in common. Subcloning studies confirmed the identity of this sequence as the gonococcal DNA recognition site. The three instances of the recognition sequence isolated from N. gonorrhoeae chromosomal DNA contain the sequence in the immediate neighborhood of its inverted repeat. Because a single copy of the sequence functions as a recognition site, the inverted duplication is not required for specific binding. The dyad symmetric arrangements of the chromosomal recognition sequences may form stable stem-loop structures that can function as terminators or attenuators of transcription. These inverted repeats are located at the boundaries of long open reading frames. The recognition sequence also constitutes part of two other probable terminators of gonococcal genes. We conclude that the signal for recognition of transforming DNA by gonococci is a frequent component of transcriptional terminator sequences. This regulatory function might account for the origin and maintenance of recognition sequences in the chromosomes of Gram-negative transformable bacteria. PMID:3137581

  10. A phylogenetic Gibbs sampler that yields centroid solutions of cis-regulatory sites

    SciTech Connect

    Newberg, Lee A.; Thompson, William A.; Conlan, Sean; Smith, Thomas M.; McCue, Lee Ann; Lawrence, Charles E.

    2007-07-15

    Identification of functionally conserved regulatory elements in sequence data from closely related organisms is becoming feasible, due to the rapid growth of public sequence databases. Closely related organisms are most likely to have common regulatory motifs, however the recent speciation of such organisms results in the high degree of correlation in their genome sequences, confounding the detection of functional elements. Additionally, alignment algorithms that use optimization techniques are limited to the detection of a single alignment that may not be representative. Comparative-genomics studies must be able to address the phylogenetic correlation in the data and efficiently explore the alignment space, in order to make specific and biologically relevant predictions. Results: We describe here a Gibbs sampler that employs a full phylogenetic model and reports an ensemble centroid solution. We describe regulatory motif detection using both simulated and real data, and demonstrate that this approach achieves improved specificity, sensitivity, and positive predictive value over non-phylogenetic algorithms, and over phylogenetic algorithms that report a maximum likelihood solution.

  11. Fungal Genes in Context: Genome Architecture Reflects Regulatory Complexity and Function

    PubMed Central

    Noble, Luke M.; Andrianopoulos, Alex

    2013-01-01

    Gene context determines gene expression, with local chromosomal environment most influential. Comparative genomic analysis is often limited in scope to conserved or divergent gene and protein families, and fungi are well suited to this approach with low functional redundancy and relatively streamlined genomes. We show here that one aspect of gene context, the amount of potential upstream regulatory sequence maintained through evolution, is highly predictive of both molecular function and biological process in diverse fungi. Orthologs with large upstream intergenic regions (UIRs) are strongly enriched in information processing functions, such as signal transduction and sequence-specific DNA binding, and, in the genus Aspergillus, include the majority of experimentally studied, high-level developmental and metabolic transcriptional regulators. Many uncharacterized genes are also present in this class and, by implication, may be of similar importance. Large intergenic regions also share two novel sequence characteristics, currently of unknown significance: they are enriched for plus-strand polypyrimidine tracts and an information-rich, putative regulatory motif that was present in the last common ancestor of the Pezizomycotina. Systematic consideration of gene UIR in comparative genomics, particularly for poorly characterized species, could help reveal organisms’ regulatory priorities. PMID:23699226

  12. Evidence for a common evolutionary rate in metazoan transcriptional networks

    PubMed Central

    Carvunis, Anne-Ruxandra; Wang, Tina; Skola, Dylan; Yu, Alice; Chen, Jonathan; Kreisberg, Jason F; Ideker, Trey

    2015-01-01

    Genome sequences diverge more rapidly in mammals than in other animal lineages, such as birds or insects. However, the effect of this rapid divergence on transcriptional evolution remains unclear. Recent reports have indicated a faster divergence of transcription factor binding in mammals than in insects, but others found the reverse for mRNA expression. Here, we show that these conflicting interpretations resulted from differing methodologies. We performed an integrated analysis of transcriptional network evolution by examining mRNA expression, transcription factor binding and cis-regulatory motifs across >25 animal species, including mammals, birds and insects. Strikingly, we found that transcriptional networks evolve at a common rate across the three animal lineages. Furthermore, differences in rates of genome divergence were greatly reduced when restricting comparisons to chromatin-accessible sequences. The evolution of transcription is thus decoupled from the global rate of genome sequence evolution, suggesting that a small fraction of the genome regulates transcription. DOI: http://dx.doi.org/10.7554/eLife.11615.001 PMID:26682651

  13. 3 CFR - Regulatory Review

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... 3 The President 1 2010-01-01 2010-01-01 false Regulatory Review Presidential Documents Other Presidential Documents Memorandum of January 30, 2009 Regulatory Review Memorandum for the Heads of Executive Departments and Agencies For well over two decades, the Office of Information and Regulatory Affairs (OIRA) at the Office of Management...

  14. Characterization and identification of cis-regulatory elements in Arabidopsis based on single-nucleotide polymorphism information.

    PubMed

    Korkuc, Paula; Schippers, Jos H M; Walther, Dirk

    2014-01-01

    Identifying regulatory elements and revealing their role in gene expression regulation remains a central goal of plant genome research. We exploited the detailed genomic sequencing information of a large number of Arabidopsis (Arabidopsis thaliana) accessions to characterize known and to identify novel cis-regulatory elements in gene promoter regions of Arabidopsis by relying on conservation as the hallmark signal of functional relevance. Based on the genomic layout and the obtained density profiles of single-nucleotide polymorphisms (SNPs) in sequence regions upstream of transcription start sites, the average length of promoter regions in Arabidopsis could be established at 500 bp. Genes associated with high degrees of variability of their respective upstream regions are preferentially involved in environmental response and signaling processes, while low levels of promoter SNP density are common among housekeeping genes. Known cis-elements were found to exhibit a decreased SNP density than sequence regions not associated with known motifs. For 15 known cis-element motifs, strong positional preferences relative to the transcription start site were detected based on their promoter SNP density profiles. Five novel candidate cis-element motifs were identified as consensus motifs of 17 sequence hexamers exhibiting increased sequence conservation combined with evidence of positional preferences, annotation information, and functional relevance for inducing correlated gene expression. Our study demonstrates that the currently available resolution of SNP data offers novel ways for the identification of functional genomic elements and the characterization of gene promoter sequences. PMID:24204023

  15. Regulatory affairs administration as regulatory policy determinant

    SciTech Connect

    Forcier, J.R.

    1984-05-10

    It is the thesis of this article that the processing of a utility company's regulation-related work, the supporting tasks and the manner in which they are completed, can and does have a significant impact on the final results or work product of the regulatory affairs function, including even, potentially, the action of the regulatory agency. The article is therefore full of practical pointers on how the interface with the regulatory authority can best be organized, managed, and carried through to the attainment of optimum results for the utility. 2 references.

  16. Comparative genomics reveals functional transcriptional control sequences in the Prop1 gene

    PubMed Central

    Ward, Robert D.; Davis, Shannon W.; Cho, MinChul; Esposito, Constance; Lyons, Robert H.; Cheng, Jan-Fang; Rubin, Edward M.; Rhodes, Simon J.; Raetzman, Lori T.; Smith, Timothy P. L.

    2007-01-01

    Mutations in PROP1 are a common genetic cause of multiple pituitary hormone deficiency (MPHD). We used a comparative genomics approach to predict the transcriptional regulatory domains of Prop1 and tested them in cell culture and mice. A BAC transgene containing Prop1 completely rescues the Prop1 mutant phenotype, demonstrating that the regulatory elements necessary for proper PROP1 transcription are contained within the BAC. We generated DNA sequences from the PROP1 genes in lemur, pig, and five different primate species. Comparison of these with available human and mouse PROP1 sequences identified three putative regulatory sequences that are highly conserved. These are located in the PROP1 promoter proximal region, within the first intron of PROP1, and downstream of PROP1. Each of the conserved elements elicited orientation-specific enhancer activity in the context of the Drosophila alcohol dehydrogenase minimal promoter in both heterologous and pituitary-derived cells lines. The intronic element is sufficient to confer dorsal expansion of the pituitary expression domain of a transgene, suggesting that this element is important for the normal spatial expression of endogenous Prop1 during pituitary development. This study illustrates the usefulness of a comparative genomics approach in the identification of regulatory elements that may be the site of mutations responsible for some cases of MPHD. PMID:17557180

  17. Target activation by regulatory RNAs in bacteria

    PubMed Central

    Papenfort, Kai; Vanderpool, Carin K.

    2015-01-01

    Bacterial small regulatory RNAs (sRNAs) are commonly known to repress gene expression by base pairing to target mRNAs. In many cases, sRNAs base pair with and sequester mRNA ribosome-binding sites, resulting in translational repression and accelerated transcript decay. In contrast, a growing number of examples of translational activation and mRNA stabilization by sRNAs have now been documented. A given sRNA often employs a conserved region to interact with and regulate both repressed and activated targets. However, the mechanisms underlying activation differ substantially from repression. Base pairing resulting in target activation can involve sRNA interactions with the 5′ untranslated region (UTR), the coding sequence or the 3′ UTR of the target mRNAs. Frequently, the activities of protein factors such as cellular ribonucleases and the RNA chaperone Hfq are required for activation. Bacterial sRNAs, including those that function as activators, frequently control stress response pathways or virulence-associated functions required for immediate responses to changing environments. This review aims to summarize recent advances in knowledge regarding target mRNA activation by bacterial sRNAs, highlighting the molecular mechanisms and biological relevance of regulation. PMID:25934124

  18. Genomics in the land of regulatory science.

    PubMed

    Tong, Weida; Ostroff, Stephen; Blais, Burton; Silva, Primal; Dubuc, Martine; Healy, Marion; Slikker, William

    2015-06-01

    Genomics science has played a major role in the generation of new knowledge in the basic research arena, and currently question arises as to its potential to support regulatory processes. However, the integration of genomics in the regulatory decision-making process requires rigorous assessment and would benefit from consensus amongst international partners and research communities. To that end, the Global Coalition for Regulatory Science Research (GCRSR) hosted the fourth Global Summit on Regulatory Science (GSRS2014) to discuss the role of genomics in regulatory decision making, with a specific emphasis on applications in food safety and medical product development. Challenges and issues were discussed in the context of developing an international consensus for objective criteria in the analysis, interpretation and reporting of genomics data with an emphasis on transparency, traceability and "fitness for purpose" for the intended application. It was recognized that there is a need for a global path in the establishment of a regulatory bioinformatics framework for the development of transparent, reliable, reproducible and auditable processes in the management of food and medical product safety risks. It was also recognized that training is an important mechanism in achieving internationally consistent outcomes. GSRS2014 provided an effective venue for regulators andresearchers to meet, discuss common issues, and develop collaborations to address the challenges posed by the application of genomics to regulatory science, with the ultimate goal of wisely integrating novel technical innovations into regulatory decision-making. PMID:25796433

  19. Conserved cis-regulatory modules in promoters of genes encoding wheat high-molecular-weight glutenin subunits

    PubMed Central

    Ravel, Catherine; Fiquet, Samuel; Boudet, Julie; Dardevet, Mireille; Vincent, Jonathan; Merlino, Marielle; Michard, Robin; Martre, Pierre

    2014-01-01

    The concentration and composition of the gliadin and glutenin seed storage proteins (SSPs) in wheat flour are the most important determinants of its end-use value. In cereals, the synthesis of SSPs is predominantly regulated at the transcriptional level by a complex network involving at least five cis-elements in gene promoters. The high-molecular-weight glutenin subunits (HMW-GS) are encoded by two tightly linked genes located on the long arms of group 1 chromosomes. Here, we sequenced and annotated the HMW-GS gene promoters of 22 electrophoretic wheat alleles to identify putative cis-regulatory motifs. We focused on 24 motifs known to be involved in SSP gene regulation. Most of them were identified in at least one HMW-GS gene promoter sequence. A common regulatory framework was observed in all the HMW-GS gene promoters, as they shared conserved cis-regulatory modules (CCRMs) including all the five motifs known to regulate the transcription of SSP genes. This common regulatory framework comprises a composite box made of the GATA motifs and GCN4-like Motifs (GLMs) and was shown to be functional as the GLMs are able to bind a bZIP transcriptional factor SPA (Storage Protein Activator). In addition to this regulatory framework, each HMW-GS gene promoter had additional motifs organized differently. The promoters of most highly expressed x-type HMW-GS genes contain an additional box predicted to bind R2R3-MYB transcriptional factors. However, the differences in annotation between promoter alleles could not be related to their level of expression. In summary, we identified a common modular organization of HMW-GS gene promoters but the lack of correlation between the cis-motifs of each HMW-GS gene promoter and their level of expression suggests that other cis-elements or other mechanisms regulate HMW-GS gene expression. PMID:25429295

  20. Evolution of Cis-Regulatory Elements and Regulatory Networks in Duplicated Genes of Arabidopsis1[OPEN

    PubMed Central

    Guo, Xu Qiu; Adams, Keith L.

    2015-01-01

    Plant genomes contain large numbers of duplicated genes that contribute to the evolution of new functions. Following duplication, genes can exhibit divergence in their coding sequence and their expression patterns. Changes in the cis-regulatory element landscape can result in changes in gene expression patterns. High-throughput methods developed recently can identify potential cis-regulatory elements on a genome-wide scale. Here, we use a recent comprehensive data set of DNase I sequencing-identified cis-regulatory binding sites (footprints) at single-base-pair resolution to compare binding sites and network connectivity in duplicated gene pairs in Arabidopsis (Arabidopsis thaliana). We found that duplicated gene pairs vary greatly in their cis-regulatory element architecture, resulting in changes in regulatory network connectivity. Whole-genome duplicates (WGDs) have approximately twice as many footprints in their promoters left by potential regulatory proteins than do tandem duplicates (TDs). The WGDs have a greater average number of footprint differences between paralogs than TDs. The footprints, in turn, result in more regulatory network connections between WGDs and other genes, forming denser, more complex regulatory networks than shown by TDs. When comparing regulatory connections between duplicates, WGDs had more pairs in which the two genes are either partially or fully diverged in their network connections, but fewer genes with no network connections than the TDs. There is evidence of younger TDs and WGDs having fewer unique connections compared with older duplicates. This study provides insights into cis-regulatory element evolution and network divergence in duplicated genes. PMID:26474639

  1. Regulatory gene networks and the properties of the developmental process

    NASA Technical Reports Server (NTRS)

    Davidson, Eric H.; McClay, David R.; Hood, Leroy

    2003-01-01

    Genomic instructions for development are encoded in arrays of regulatory DNA. These specify large networks of interactions among genes producing transcription factors and signaling components. The architecture of such networks both explains and predicts developmental phenomenology. Although network analysis is yet in its early stages, some fundamental commonalities are already emerging. Two such are the use of multigenic feedback loops to ensure the progressivity of developmental regulatory states and the prevalence of repressive regulatory interactions in spatial control processes. Gene regulatory networks make it possible to explain the process of development in causal terms and eventually will enable the redesign of developmental regulatory circuitry to achieve different outcomes.

  2. Automated conserved non-coding sequence (CNS) discovery reveals differences in gene content and promoter evolution among grasses

    PubMed Central

    Turco, Gina; Schnable, James C.; Pedersen, Brent; Freeling, Michael

    2013-01-01

    Conserved non-coding sequences (CNS) are islands of non-coding sequence that, like protein coding exons, show less divergence in sequence between related species than functionless DNA. Several CNSs have been demonstrated experimentally to function as cis-regulatory regions. However, the specific functions of most CNSs remain unknown. Previous searches for CNS in plants have either anchored on exons and only identified nearby sequences or required years of painstaking manual annotation. Here we present an open source tool that can accurately identify CNSs between any two related species with sequenced genomes, including both those immediately adjacent to exons and distal sequences separated by >12 kb of non-coding sequence. We have used this tool to characterize new motifs, associate CNSs with additional functions, and identify previously undetected genes encoding RNA and protein in the genomes of five grass species. We provide a list of 15,363 orthologous CNSs conserved across all grasses tested. We were also able to identify regulatory sequences present in the common ancestor of grasses that have been lost in one or more extant grass lineages. Lists of orthologous gene pairs and associated CNSs are provided for reference inbred lines of arabidopsis, Japonica rice, foxtail millet, sorghum, brachypodium, and maize. PMID:23874343

  3. Regulatory region of the divergent Klebsiella pneumoniae lac operon.

    PubMed Central

    Buvinger, W E; Riley, M

    1985-01-01

    The chromosomal DNA that lies between the lacI and lacZ genes of Klebsiella pneumoniae constitutes a 196-base pair intercistronic region that contains regulatory sequences for both genes. The probable locations of specific regulatory elements for both lacI and lacZ genes were determined by analogy with the corresponding Escherichia coli sequences. A recombinational event in ancestral DNA evidently has inverted the transcriptional direction of lacI in K. pneumoniae relative to the transcriptional direction of lacI in E. coli. One end of the inversion was located within a 19-base pair sequence in the K. pneumoniae regulatory region. Sequences partially homologous to these 19 base pairs were found in two locations on either side of the E. coli lacI gene. The nucleotide sequence of the lac regulatory region in K. pneumoniae exhibits more than one possibility for folded tertiary structures. The spatial relationships of transcriptional binding sites differ in two possible structures. Associations of regulatory and transcriptional proteins with the DNA might affect conformation of the regulatory sequences and, as a consequence, transcription of the lac genes. PMID:3897197

  4. Activation of enhancer sequences in type II human T-cell leukemia virus and bovine leukemia virus long terminal repeats by virus-associated trans-acting regulatory factors.

    PubMed Central

    Rosen, C A; Sodroski, J G; Kettman, R; Haseltine, W A

    1986-01-01

    The ability of the sequences present in the long terminal repeats (LTRs) of human T-cell leukemia viruses type I and II (HTLV-I and HTLV-II) and of bovine leukemia virus to function as enhancer elements was investigated. Recombinant plasmids that contained the HTLV-I, HTLV-II, and bovine leukemia virus LTRs at a distance from a simian virus 40 promoter element located 5' to the bacterial gene encoding chloramphenicol acetyltransferase (EC 2.3.1.28) were constructed. We report that all three LTR sequences contain enhancer elements capable of increasing the level of gene expression directed from a distal heterologous promoter. The enhancer present in the HTLV-I LTR was active in uninfected cells of lymphoid and nonlymphoid origin. In contrast, the enhancer activity of the HTLV-II and bovine leukemia virus LTR sequences was evident only in virus-infected cells. This activity is likely due to virus-associated trans-acting transcriptional factors previously shown to be present in HTLV- and bovine leukemia virus-infected cells. The implication of these observations for virus replication and transforming activity are discussed. Images PMID:3005624

  5. No Common Opinion on the Common Core

    ERIC Educational Resources Information Center

    Henderson, Michael B.; Peterson, Paul E.; West, Martin R.

    2015-01-01

    According to the three authors of this article, the 2014 "EdNext" poll yields four especially important new findings: (1) Opinion with respect to the Common Core has yet to coalesce. The idea of a common set of standards across the country has wide appeal, and the Common Core itself still commands the support of a majority of the public.…

  6. Burst diaphragm sequence valve

    NASA Astrophysics Data System (ADS)

    Wisneskie, Bradley D.; Hyman, Sheldon; Hallum, Charles E.

    1991-11-01

    A burst diaphragm sequence valve which effectively combines the structure of a burst diaphragm with that of an ordinary swing check valve, the pivot of the ordinary swing check valve being replaced by an integral flexural hinge. The sequence valve provides a way to sequentially burn solid propellant hot gas generators which exit into a common gas manifold, thereby enabling gas-powered devices to operate for a longer time than the duration of one gas generator burn.

  7. High-Throughput Sequencing Technologies

    PubMed Central

    Reuter, Jason A.; Spacek, Damek; Snyder, Michael P.

    2015-01-01

    Summary The human genome sequence has profoundly altered our understanding of biology, human diversity and disease. The path from the first draft sequence to our nascent era of personal genomes and genomic medicine has been made possible only because of the extraordinary advancements in DNA sequencing technologies over the past ten years. Here, we discuss commonly used high-throughput sequencing platforms, the growing array of sequencing assays developed around them as well as the challenges facing current sequencing platforms and their clinical application. PMID:26000844

  8. Effects of species combination on comparative analyses of conserved regulatory elements

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Cross-species DNA sequence comparison is the primary approach to discover regulatory elements by identifying highly conserved sequences due to evolutionary constraints. Previously, we reported that a systematic approach, combining position-specific weight matrixes (JASPAR) and phylogenetic footprint...

  9. Systematic identification of conserved regulatory elements in upstream promoter regions of the cattle genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Cross-species DNA sequence comparison is the primary approach to discover regulatory elements by identifying highly conserved sequences due to evolutionary constraints. Previously, we reported that a systematic approach, combining position-specific weight matrixes (JASPAR) and phylogenetic footprint...

  10. Prediction of conserved regulatory elements in promoter regions of the cattle genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Cross-species DNA sequence comparison is the primary approach to discover regulatory elements by identifying highly conserved sequences due to evolutionary constraints. Previously, we reported that a systematic approach, combining position-specific weight matrixes (JASPAR) and phylogenetic footprint...

  11. [Clonage of the "malA" region of "Escherichia coli" K12: nucleotide sequence of the regulatory region and the promoters, identification and purification of the MalT-activator protein (author's transl)].

    PubMed

    Raibaud, O; Débarbouillé, M; Cossart, P

    1982-01-01

    A 5,800-bp (base pair) HindIII-EcoRI DNA fragment containing malT, the positive regulator gene of the maltose regulon, and most of malP, the structural gene for maltodextrin phosphorylase, was cloned into pBR322. A sequence of 802 bp was established in a DNA segment containing the promotor for malPQ and the promoter for malT. A total of 611 bp separates the initiation codons for these two genes, which are transcribed in opposite directions. The malT product was identified as a 94,000 dalton polypeptide. PMID:6462088

  12. Large-scale identification of sequence variants impacting human transcription factor occupancy in vivo

    PubMed Central

    Maurano, Matthew T.; Haugen, Eric; Sandstrom, Richard; Vierstra, Jeff; Shafer, Anthony; Kaul, Rajinder; Stamatoyannopoulos, John A.

    2015-01-01

    The function of human regulatory regions depends exquisitely on their local genomic environment and cellular context, complicating experimental analysis of the expanding pool of common disease- and trait-associated variants that localize within regulatory DNA. We leverage allelically resolved genomic DNaseI footprinting data encompassing 166 individuals and 114 cell types to identify >60,000 common variants that directly impact transcription factor occupancy and regulatory DNA accessibility in vivo. The unprecedented scale of these data enable systematic analysis of the impact of sequence variation on transcription factor occupancy in vivo. We leverage this analysis to develop accurate models of variation affecting the recognition sites for diverse transcription factors, and apply these models to discriminate nearly 500,000 common regulatory variants likely to affect transcription factor occupancy across the human genome. The approach and results provide a novel foundation for analysis and interpretation of noncoding variation in complete human genomes, and for systems-level investigation of disease-associated variants. PMID:26502339

  13. Repetitive Sequences

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Repetitive sequences, or repeats, account for a substantial portion of the eukaryotic genomes. These sequences include very different types of DNA with respect to mode of origin, function, structure, and genomic distribution. Two large families of repetitive sequences can be readily recognized, ta...

  14. Complementation of nitrogen-regulatory (ntr-like) mutations in Rhodobacter capsulatus by an Escherichia coli gene: cloning and sequencing of the gene and characterization of the gene product.

    PubMed Central

    Allibert, P; Willison, J C; Vignais, P M

    1987-01-01

    In vivo genetic engineering by R' plasmid formation was used to isolate an Escherichia coli gene that restored the Ntr+ phenotype to Ntr- mutants of the photosynthetic bacterium Rhodobacter capsulatus (formerly Rhodopseudomonas capsulata; J. F. Imhoff, H. G. Trüper, and N. Pfenning, Int. J. Syst. Bacteriol. 34:340-343, 1984). Nucleotide sequencing of the gene revealed no homology to the ntr genes of Klebsiella pneumoniae. Furthermore, hybridization experiments between the cloned gene and different F' plasmids indicated that the gene is located between 34 and 39 min on the E. coli genetic map and is therefore unlinked to the known ntr genes. The molecular weight of the gene product, deduced from the nucleotide sequence, was 30,563. After the gene was cloned in an expression vector, the gene product was purified. It was shown to have a pI of 5.8 and to behave as a dimer during gel filtration and on sucrose density gradients. Antibodies raised against the purified protein revealed the presence of this protein in R. capsulatus strains containing the E. coli gene, but not in other strains. Moreover, elimination of the plasmid carrying the E. coli gene from complemented strains resulted in the loss of the Ntr+ phenotype. Complementation of the R. capsulatus mutations by the E. coli gene therefore occurs in trans and results from the synthesis of a functional gene product. Images PMID:3025172

  15. metagene Profiles Analyses Reveal Regulatory Element's Factor-Specific Recruitment Patterns.

    PubMed

    Joly Beauparlant, Charles; Lamaze, Fabien C; Deschênes, Astrid; Samb, Rawane; Lemaçon, Audrey; Belleau, Pascal; Bilodeau, Steve; Droit, Arnaud

    2016-08-01

    ChIP-Sequencing (ChIP-Seq) provides a vast amount of information regarding the localization of proteins across the genome. The aggregation of ChIP-Seq enrichment signal in a metagene plot is an approach commonly used to summarize data complexity and to obtain a high level visual representation of the general occupancy pattern of a protein. Here we present the R package metagene, the graphical interface Imetagene and the companion package similaRpeak. Together, they provide a framework to integrate, summarize and compare the ChIP-Seq enrichment signal from complex experimental designs. Those packages identify and quantify similarities or dissimilarities in patterns between large numbers of ChIP-Seq profiles. We used metagene to investigate the differential occupancy of regulatory factors at noncoding regulatory regions (promoters and enhancers) in relation to transcriptional activity in GM12878 B-lymphocytes. The relationships between occupancy patterns and transcriptional activity suggest two different mechanisms of action for transcriptional control: i) a "gradient effect" where the regulatory factor occupancy levels follow transcription and ii) a "threshold effect" where the regulatory factor occupancy levels max out prior to reaching maximal transcription. metagene, Imetagene and similaRpeak are implemented in R under the Artistic license 2.0 and are available on Bioconductor. PMID:27538250

  16. metagene Profiles Analyses Reveal Regulatory Element’s Factor-Specific Recruitment Patterns

    PubMed Central

    Samb, Rawane; Lemaçon, Audrey; Bilodeau, Steve; Droit, Arnaud

    2016-01-01

    ChIP-Sequencing (ChIP-Seq) provides a vast amount of information regarding the localization of proteins across the genome. The aggregation of ChIP-Seq enrichment signal in a metagene plot is an approach commonly used to summarize data complexity and to obtain a high level visual representation of the general occupancy pattern of a protein. Here we present the R package metagene, the graphical interface Imetagene and the companion package similaRpeak. Together, they provide a framework to integrate, summarize and compare the ChIP-Seq enrichment signal from complex experimental designs. Those packages identify and quantify similarities or dissimilarities in patterns between large numbers of ChIP-Seq profiles. We used metagene to investigate the differential occupancy of regulatory factors at noncoding regulatory regions (promoters and enhancers) in relation to transcriptional activity in GM12878 B-lymphocytes. The relationships between occupancy patterns and transcriptional activity suggest two different mechanisms of action for transcriptional control: i) a “gradient effect” where the regulatory factor occupancy levels follow transcription and ii) a “threshold effect” where the regulatory factor occupancy levels max out prior to reaching maximal transcription. metagene, Imetagene and similaRpeak are implemented in R under the Artistic license 2.0 and are available on Bioconductor. PMID:27538250

  17. Chromatin insulators: regulatory mechanisms and epigenetic inheritance

    PubMed Central

    Bushey, Ashley M.; Dorman, Elizabeth R.; Corces, Victor G.

    2008-01-01

    Enhancer-blocking insulators are DNA elements that disrupt the communication between a regulatory sequence, such as an enhancer or a silencer, and a promoter. Insulators participate in both transcriptional regulation and global nuclear organization, two features of chromatin that are thought to be maintained from one generation to the next through epigenetic mechanisms. Furthermore, there are many regulatory mechanisms in place that enhance or hinder insulator activity. These modes of regulation could be used to establish cell-type specific insulator activity that is epigenetically inherited along a cell and/or organismal lineage. This review will discuss the evidence for epigenetic inheritance and regulation of insulator function. PMID:18851828

  18. POEM: Identifying Joint Additive Effects on Regulatory Circuits

    PubMed Central

    Botzman, Maya; Nachshon, Aharon; Brodt, Avital; Gat-Viks, Irit

    2016-01-01

    Motivation: Expression Quantitative Trait Locus (eQTL) mapping tackles the problem of identifying variation in DNA sequence that have an effect on the transcriptional regulatory network. Major computational efforts are aimed at characterizing the joint effects of several eQTLs acting in concert to govern the expression of the same genes. Yet, progress toward a comprehensive prediction of such joint effects is limited. For example, existing eQTL methods commonly discover interacting loci affecting the expression levels of a module of co-regulated genes. Such “modularization” approaches, however, are focused on epistatic relations and thus have limited utility for the case of additive (non-epistatic) effects. Results: Here we present POEM (Pairwise effect On Expression Modules), a methodology for identifying pairwise eQTL effects on gene modules. POEM is specifically designed to achieve high performance in the case of additive joint effects. We applied POEM to transcription profiles measured in bone marrow-derived dendritic cells across a population of genotyped mice. Our study reveals widespread additive, trans-acting pairwise effects on gene modules, characterizes their organizational principles, and highlights high-order interconnections between modules within the immune signaling network. These analyses elucidate the central role of additive pairwise effect in regulatory circuits, and provide computational tools for future investigations into the interplay between eQTLs. Availability: The software described in this article is available at csgi.tau.ac.il/POEM/. PMID:27148351

  19. Mapping the Shh long-range regulatory domain.

    PubMed

    Anderson, Eve; Devenney, Paul S; Hill, Robert E; Lettice, Laura A

    2014-10-01

    Coordinated gene expression controlled by long-distance enhancers is orchestrated by DNA regulatory sequences involving transcription factors and layers of control mechanisms. The Shh gene and well-established regulators are an example of genomic composition in which enhancers reside in a large desert extending into neighbouring genes to control the spatiotemporal pattern of expression. Exploiting the local hopping activity of the Sleeping Beauty transposon, the lacZ reporter gene was dispersed throughout the Shh region to systematically map the genomic features responsible for expression activity. We found that enhancer activities are retained inside a genomic region that corresponds to the topological associated domain (TAD) defined by Hi-C. This domain of approximately 900 kb is in an open conformation over its length and is generally susceptible to all Shh enhancers. Similar to the distal enhancers, an enhancer residing within the Shh second intron activates the reporter gene located at distances of hundreds of kilobases away, suggesting that both proximal and distal enhancers have the capacity to survey the Shh topological domain to recognise potential promoters. The widely expressed Rnf32 gene lying within the Shh domain evades enhancer activities by a process that may be common among other housekeeping genes that reside in large regulatory domains. Finally, the boundaries of the Shh TAD do not represent the absolute expression limits of enhancer activity, as expression activity is lost stepwise at a number of genomic positions at the verges of these domains. PMID:25252942

  20. Mapping the Shh long-range regulatory domain

    PubMed Central

    Anderson, Eve; Devenney, Paul S.; Hill, Robert E.; Lettice, Laura A.

    2014-01-01

    Coordinated gene expression controlled by long-distance enhancers is orchestrated by DNA regulatory sequences involving transcription factors and layers of control mechanisms. The Shh gene and well-established regulators are an example of genomic composition in which enhancers reside in a large desert extending into neighbouring genes to control the spatiotemporal pattern of expression. Exploiting the local hopping activity of the Sleeping Beauty transposon, the lacZ reporter gene was dispersed throughout the Shh region to systematically map the genomic features responsible for expression activity. We found that enhancer activities are retained inside a genomic region that corresponds to the topological associated domain (TAD) defined by Hi-C. This domain of approximately 900 kb is in an open conformation over its length and is generally susceptible to all Shh enhancers. Similar to the distal enhancers, an enhancer residing within the Shh second intron activates the reporter gene located at distances of hundreds of kilobases away, suggesting that both proximal and distal enhancers have the capacity to survey the Shh topological domain to recognise potential promoters. The widely expressed Rnf32 gene lying within the Shh domain evades enhancer activities by a process that may be common among other housekeeping genes that reside in large regulatory domains. Finally, the boundaries of the Shh TAD do not represent the absolute expression limits of enhancer activity, as expression activity is lost stepwise at a number of genomic positions at the verges of these domains. PMID:25252942

  1. Plant nitrogen regulatory P-PII polypeptides

    DOEpatents

    Coruzzi, Gloria M.; Lam, Hon-Ming; Hsieh, Ming-Hsiun

    2004-11-23

    The present invention generally relates to plant nitrogen regulatory PII gene (hereinafter P-PII gene), a gene involved in regulating plant nitrogen metabolism. The invention provides P-PII nucleotide sequences, expression constructs comprising said nucleotide sequences, and host cells and plants having said constructs and, optionally expressing the P-PII gene from said constructs. The invention also provides substantially pure P-PII proteins. The P-PII nucleotide sequences and constructs of the invention may be used to engineer organisms to overexpress wild-type or mutant P-PII regulatory protein. Engineered plants that overexpress or underexpress P-PII regulatory protein may have increased nitrogen assimilation capacity. Engineered organisms may be used to produce P-PII proteins which, in turn, can be used for a variety of purposes including in vitro screening of herbicides. P-PII nucleotide sequences have additional uses as probes for isolating additional genomic clones having the promoters of P-PII gene. P-PII promoters are light- and/or sucrose-inducible and may be advantageously used in genetic engineering of plants.

  2. Effect of temperature and solvent composition on acid dissociation equilibria, I: Sequenced (s)(s)pKa determination of compounds commonly used as buffers in high performance liquid chromatography coupled to mass spectroscopy detection.

    PubMed

    Padró, Juan M; Acquaviva, Agustín; Tascon, Marcos; Gagliardi, Leonardo G; Castells, Cecilia B

    2012-05-01

    A new automated and rapid potentiometric method for determining the effect of organic-solvent composition on pK(a) has been developed. It is based on the measurements of pH values of buffer solutions of variable solvent compositions using a combined glass electrode. Additions of small volumes of one precisely thermostated solution into another, both containing exactly the same analytical concentrations of the buffer components, can produce continuous changes in the solvent composition. Two sequences of potential measurements, one of increasing and the other of decreasing solvent content, are sufficient to obtain the pK(a) values of the acidic compound within the complete solvent-composition range in about 2h. The experimental design, procedures, and calculations needed to convert the measured pH into the thermodynamic pK(a) values are thoroughly discussed. This rapid and automated method allows the systematic study of the effect of solvent compositions and temperatures on the pK(a). It has been applied to study the dissociation constants of two monoprotic acids: formic acid and triethylamine:HCl in acetonitrile/water mixtures within the range from 0 to 90% (v/v) at temperatures between 20°C and 60°C. These volatile compounds are frequently used to control the pH of the mobile phase in HPLC, especially in methods coupled to mass-spectrometry detection. The obtained pK(a) values are in excellent agreement with those previously reported. The results were fitted to empirical functions between pK(a) and temperature and composition. These equations, which can be used to estimate the pK(a) of these substances at any composition and temperature, would be highly useful in practical work during chromatographic method development. PMID:22502616

  3. Quantitative Sequencing for the Determination of Kdr-type Resistance Allele (V419L, L925I, I936F) Frequencies in Common Bed Bug (Hemiptera: Cimicidae) Populations Collected from Israel.

    PubMed

    Palenchar, Daniel J; Gellatly, Kyle J; Yoon, Kyong Sup; Mumcuoglu, Kosta Y; Shalom, Uri; Clark, J Marshall

    2015-09-01

    Human bed bug infestations have dramatically increased worldwide since the mid-1990s. A similar phenomenon was also observed in Israel since 2005, when infestations were reported from all over the country. Two single nucleotide polymorphisms (V419L and L925I) in the bed bug voltage-sensitive sodium channel confer kdr-type resistance to pyrethroids. Using quantitative sequencing (QS), the resistance allele frequencies of Israeli bed bug populations from across the country were determined. Genomic DNA was extracted from samples of 12 populations of bed bugs collected from Israel and DNA fragments containing the V419L or L925I and I936F mutations sites were PCR amplified. The PCR products were analyzed by QS and the nucleotide signal ratios calculated and used to predict the resistance allele frequencies of the unknown populations. Results of the genetic analysis show that resistant nucleotide signals are highly correlated to resistance allele frequencies for both mutations. Ten of the 12 tested populations had 100% of the L925I mutation and 0% of the V419L mutation. One population was heterogeneous for the L925I mutation and had 0% of the V419L mutation and another population was heterogeneous for the V419L mutation and had 100% of the L925I mutation. I936F occurred only at low levels. These results indicate that bed bugs in Israel are genetically resistant to pyrethroids. Thus, pyrethroids should only be used for bed bug management with caution using effective application and careful monitoring procedures. Additionally, new and novel-acting insecticides and nonchemical means of controlling bed bugs should be explored. PMID:26336243

  4. Variation in vertebrate cis-regulatory elements in evolution and disease.

    PubMed

    Douglas, Adam Thomas; Hill, Robert D

    2014-01-01

    Much of the genetic information that drives animal diversity lies within the vast non-coding regions of the genome. Multi-species sequence conservation in non-coding regions of the genome flags important regulatory elements and more recently, techniques that look for functional signatures predicted for regulatory sequences have added to the identification of thousands more. For some time, biologists have argued that changes in cis-regulatory sequences creates the basic genetic framework for evolutionary change. Recent advances support this notion and show that there is extensive genomic variability in non-coding regulatory elements associated with trait variation, speciation and disease. PMID:25764334

  5. Variation in Vertebrate Cis-Regulatory Elements in Evolution and Disease.

    PubMed

    Douglas, Adam T; Hill, Robert E

    2014-05-01

    Much of the genetic information that drives animal diversity lies within the vast non-coding regions of the genome. Multi-species sequence conservation in non-coding regions of the genome flags important regulatory elements and more recently, techniques that look for functional signatures predicted for regulatory sequences have added to the identification of thousands more. For some time, biologists have argued that changes in cis-regulatory sequences creates the basic genetic framework for evolutionary change. Recent advances support this notion and show that there is extensive genomic variability in non-coding regulatory elements associated with trait variation, speciation and disease. PMID:24802895

  6. Variation in Vertebrate Cis-Regulatory Elements in Evolution and Disease

    PubMed Central

    Douglas, Adam Thomas; Hill, Robert E

    2014-01-01

    Much of the genetic information that drives animal diversity lies within the vast non-coding regions of the genome. Multi-species sequence conservation in non-coding regions of the genome flags important regulatory elements and more recently, techniques that look for functional signatures predicted for regulatory sequences have added to the identification of thousands more. For some time, biologists have argued that changes in cis-regulatory sequences creates the basic genetic framework for evolutionary change. Recent advances support this notion and show that there is extensive genomic variability in non-coding regulatory elements associated with trait variation, speciation and disease. PMID:25764334

  7. Nuclear Regulatory Commission Semiannual Regulatory Agenda

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-12-20

    ...) 2010, less the amounts appropriated from the Nuclear Waste Fund, amounts appropriated for Waste... agenda on April 26, 2010 (75 FR 21960). For this edition of the NRC's regulatory agenda, the most... publication of the last NRC semiannual agenda on April 26, 2010 (75 FR 21960). Within each group, the...

  8. NRC regulatory agenda

    SciTech Connect

    Not Available

    1991-04-01

    The NRC Regulatory Agenda is a compilation of all rules on which the NRC has recently completed action or has proposed, or is considering action and all petitions for rulemaking which have been received by the Commission and are pending disposition by the Commission. The Regulatory Agenda is updated and issued each quarter.

  9. NRC regulatory agenda

    SciTech Connect

    Not Available

    1992-05-01

    The NRC Regulatory Agenda is a compilation of all rules on which the NRC has recently completed action, or has proposed action, or is considering action, and all petitions for rulemaking which have been received by the Commission and are pending disposition by the Commission. The Regulatory Agenda is updated and issued each quarter.

  10. NRC Regulatory Agenda

    SciTech Connect

    Not Available

    1991-10-01

    The NRC Regulatory Agenda is a compilation of all rules on which the NRC has recently completed action, or has proposed action, or is considering action, and all petitions for rulemaking which have been received by the Commission and are pending disposition by the Commission. The Regulatory Agenda is updated and issued each quarter.

  11. NRC Regulatory Agenda

    SciTech Connect

    Not Available

    1991-08-01

    The NRC Regulatory Agenda is a compilation of all rules on which the NRC has recently completed action or has proposed, or is considering action and all petitions for rulemaking which have been received by the commission and are pending disposition by the Commission. The Regulatory Agenda is updated and issued each quarter.

  12. NRC regulatory agenda

    SciTech Connect

    Not Available

    1993-04-01

    The NRC Regulatory Agenda is a compilation of all rules on which the NRC has recently completed action, or has proposed action, or is considering action, and all petitions for rulemaking which have been received by the Commission and are pending disposition by the Commission. The Regulatory Agenda is updated and issued each quarter.

  13. NRC regulatory agenda

    SciTech Connect

    Not Available

    1990-07-01

    The NRC Regulatory Agenda is a compilation of all rules on which the NRC has proposed or is considering action and all petitions for rulemaking which have been received by the Commission and are pending disposition by the Commission. The Regulatory Agenda is updated and issued each quarter.

  14. Plant Regulatory Organizations

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The chapter on Plant Regulatory Organizations is part of a book titled Pest Management and Phytosanitary Trade Barriers authored by Neil Heather (Australia) and Guy Hallman. It covers the role of plant regulatory organizations from the international to state level in protecting plant health. At on...

  15. Migraine and Common Morbidities

    MedlinePlus

    ... headaches . Home > Migraine and Common Morbidities Print Email Migraine and Common Morbidities ACHE Newsletter Sign up for ... newsletter by entering your e-mail address below. Migraine and Common Morbidities For many patients, migraine is ...

  16. Common Cause Failure Modeling

    NASA Technical Reports Server (NTRS)

    Hark, Frank; Britton, Paul; Ring, Robert; Novack, Steven

    2015-01-01

    Space Launch System (SLS) Agenda: Objective; Key Definitions; Calculating Common Cause; Examples; Defense against Common Cause; Impact of varied Common Cause Failure (CCF) and abortability; Response Surface for various CCF Beta; Takeaways.

  17. Sequence logos: a new way to display consensus sequences.

    PubMed Central

    Schneider, T D; Stephens, R M

    1990-01-01

    A graphical method is presented for displaying the patterns in a set of aligned sequences. The characters representing the sequence are stacked on top of each other for each position in the aligned sequences. The height of each letter is made proportional to its frequency, and the letters are sorted so the most common one is on top. The height of the entire stack is then adjusted to signify the information content of the sequences at that position. From these 'sequence logos', one can determine not only the consensus sequence but also the relative frequency of bases and the information content (measured in bits) at every position in a site or sequence. The logo displays both significant residues and subtle sequence patterns. PMID:2172928

  18. Single-Nucleotide Mutations in FMR1 Reveal Novel Functions and Regulatory Mechanisms of the Fragile X Syndrome Protein FMRP

    PubMed Central

    Suhl, Joshua A.; Warren, Stephen T.

    2015-01-01

    Fragile X syndrome is a monogenic disorder and a common cause of intellectual disability. Despite nearly 25 years of research on FMR1, the gene underlying the syndrome, very few pathological mutations other than the typical CGG-repeat expansion have been reported. This is in contrast to other X-linked, monogenic, intellectual disability disorders, such as Rett syndrome, where many point mutations have been validated as causative of the disorder. As technology has improved and significantly driven down the cost of sequencing, allowing for whole genes to be sequenced with relative ease, in-depth sequencing studies on FMR1 have recently been performed. These studies have led to the identification of novel variants in FMR1, where some of which have been functionally evaluated and are likely pathogenic. In this review, we discuss recently identified FMR1 variants, the ways these novel variants cause dysfunction, and how they reveal new regulatory mechanisms and functionalities of the gene. PMID:26819560

  19. Cement sequence stratigraphy in carbonates

    SciTech Connect

    Braithwaite, C.J.R. )

    1993-03-01

    Conventional paragenesis analysis commonly fails to describe the subtleties of regional variation found in carbonate cement sequences. Application of the concept of sequence stratigraphy to diagenetic studies allows grouping of the products of depositional (precipitation or crystallization) events into sequences. The boundaries to these sequences are surfaces reflecting erosion (dissolution), non-deposition (renucleation), compaction or imposed fracture events. The pattern of chemical changes within a diagenetic sequence provides a fingerprint which allows that sequence to be recognized among others. Diagenetic sequences may be grouped into temporal series to provide an overall view of the diagenesis of the unit which takes account of local unconformities. The belief that lateral stratigraphical equivalence can serve as a proxy for diagenetic time equivalence is evidently mistaken; diagenetic boundaries may cross stratigraphic boundaries. The application of sequence stratigraphy concepts provides a valuable tool for interpreting regional diagenesis in carbonates and potentially offers insight into the pathways of hydrocarbon or ore fluid migration.

  20. Common themes and differences in SAM recognition among SAM riboswitches

    PubMed Central

    Price, Ian R.; Grigg, Jason C.; Ke, Ailong

    2014-01-01

    The recent discovery of short cis-acting RNA elements termed riboswitches has caused a paradigm shift in our understanding of genetic regulatory mechanisms. The three distinct superfamilies of S-adenosyl-L-methionine (SAM) riboswitches are the most commonly found riboswitch classes in nature. These RNAs represent three independent evolutionary solutions to achieve specific SAM recognition. This review summarizes research on 1) modes of gene regulatory mechanisms, 2) common themes and differences in ligand recognition, and 3) ligand-induced conformational dynamics among SAM riboswitch families. The body of work on the SAM riboswitch families constitutes a useful primer to the topic of gene regulatory RNAs as a whole. PMID:24863160

  1. Common Career Technical Core: Common Standards, Common Vision for CTE

    ERIC Educational Resources Information Center

    Green, Kimberly

    2012-01-01

    This article provides an overview of the National Association of State Directors of Career Technical Education Consortium's (NASDCTEc) Common Career Technical Core (CCTC), a state-led initiative that was created to ensure that career and technical education (CTE) programs are consistent and high quality across the United States. Forty-two states,…

  2. Uncovering drug-responsive regulatory elements

    PubMed Central

    Luizon, Marcelo R; Ahituv, Nadav

    2016-01-01

    Nucleotide changes in gene regulatory elements can have a major effect on interindividual differences in drug response. For example, by reviewing all published pharmacogenomic genome-wide association studies, we show here that 96.4% of the associated single nucleotide polymorphisms reside in noncoding regions. We discuss how sequencing technologies are improving our ability to identify drug response-associated regulatory elements genome-wide and to annotate nucleotide variants within them. We highlight specific examples of how nucleotide changes in these elements can affect drug response and illustrate the techniques used to find them and functionally characterize them. Finally, we also discuss challenges in the field of drug-responsive regulatory elements that need to be considered in order to translate these findings into the clinic. PMID:26555224

  3. Post-transcriptional control of nuclear-encoded cytochrome oxidase subunits in Trypanosoma brucei: evidence for genome-wide conservation of life-cycle stage-specific regulatory elements

    PubMed Central

    Mayho, Matthew; Fenn, Katelyn; Craddy, Paul; Crosthwaite, Susan; Matthews, Keith

    2006-01-01

    Trypanosomes represent an excellent model for the post-transcriptional regulation of gene expression because their genome is organized into polycistronic transcription units. However, few signals governing developmental stage-specific expression have been identified, with there being no compelling evidence for widespread conservation of regulatory motifs. As a tool to search for common regulatory sequences we have used the nuclear-encoded components of the cytochrome oxidase (COX) complex of the trypanosome respiratory chain. Components of this complex represent a form of post-transcriptional operon because trypanosome mitochondrial activity is unusual in being developmentally programmed. By genome analysis we identified the genes for seven components of the COX complex. Each mRNA exhibits bloodstream stage-specific instability, which is not mediated by the RNA silencing pathway but which is alleviated by cycloheximide. Reporter assays have identified regulatory regions within the 3′-untranslated regions of three COX mRNAs operating principally at the translational level, but also via mRNA stability. Interrogation of the mapped regions via oligonucleotide frequency scoring provides evidence for genome-wide conservation of regulatory sequences among a large cohort of procyclic-enriched transcripts. Analysis of the co-regulated subunits of a stage-specific enzyme is therefore a novel approach to uncover cryptic regulatory sequences controlling gene expression at the post-transcriptional level. PMID:17012283

  4. Regulatory guidance document

    SciTech Connect

    1994-05-01

    The Office of Civilian Radioactive Waste Management (OCRWM) Program Management System Manual requires preparation of the OCRWM Regulatory Guidance Document (RGD) that addresses licensing, environmental compliance, and safety and health compliance. The document provides: regulatory compliance policy; guidance to OCRWM organizational elements to ensure a consistent approach when complying with regulatory requirements; strategies to achieve policy objectives; organizational responsibilities for regulatory compliance; guidance with regard to Program compliance oversight; and guidance on the contents of a project-level Regulatory Compliance Plan. The scope of the RGD includes site suitability evaluation, licensing, environmental compliance, and safety and health compliance, in accordance with the direction provided by Section 4.6.3 of the PMS Manual. Site suitability evaluation and regulatory compliance during site characterization are significant activities, particularly with regard to the YW MSA. OCRWM`s evaluation of whether the Yucca Mountain site is suitable for repository development must precede its submittal of a license application to the Nuclear Regulatory Commission (NRC). Accordingly, site suitability evaluation is discussed in Chapter 4, and the general statements of policy regarding site suitability evaluation are discussed in Section 2.1. Although much of the data and analyses may initially be similar, the licensing process is discussed separately in Chapter 5. Environmental compliance is discussed in Chapter 6. Safety and Health compliance is discussed in Chapter 7.

  5. Regulatory principles governing Salmonella and Yersinia virulence

    PubMed Central

    Erhardt, Marc; Dersch, Petra

    2015-01-01

    Enteric pathogens such as Salmonella and Yersinia evolved numerous strategies to survive and proliferate in different environmental reservoirs and mammalian hosts. Deciphering common and pathogen-specific principles for how these bacteria adjust and coordinate spatiotemporal expression of virulence determinants, stress adaptation, and metabolic functions is fundamental to understand microbial pathogenesis. In order to manage sudden environmental changes, attacks by the host immune systems and microbial competition, the pathogens employ a plethora of transcriptional and post-transcriptional control elements, including transcription factors, sensory and regulatory RNAs, RNAses, and proteases, to fine-tune and control complex gene regulatory networks. Many of the contributing global regulators and the molecular mechanisms of regulation are frequently conserved between Yersinia and Salmonella. However, the interplay, arrangement, and composition of the control elements vary between these closely related enteric pathogens, which generate phenotypic differences leading to distinct pathogenic properties. In this overview we present common and different regulatory networks used by Salmonella and Yersinia to coordinate the expression of crucial motility, cell adhesion and invasion determinants, immune defense strategies, and metabolic adaptation processes. We highlight evolutionary changes of the gene regulatory circuits that result in different properties of the regulatory elements and how this influences the overall outcome of the infection process. PMID:26441883

  6. 78 FR 44279 - Regulatory Agenda

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-07-23

    ...The Department of Justice is publishing its spring 2013 regulatory agenda pursuant to Executive Order 12866, ``Regulatory Planning and Review,'' 58 FR 51735, and the Regulatory Flexibility Act, 5 U.S.C. sections 601 to 612...

  7. The ensembl regulatory build.

    PubMed

    Zerbino, Daniel R; Wilder, Steven P; Johnson, Nathan; Juettemann, Thomas; Flicek, Paul R

    2015-01-01

    Most genomic variants associated with phenotypic traits or disease do not fall within gene coding regions, but in regulatory regions, rendering their interpretation difficult. We collected public data on epigenetic marks and transcription factor binding in human cell types and used it to construct an intuitive summary of regulatory regions in the human genome. We verified it against independent assays for sensitivity. The Ensembl Regulatory Build will be progressively enriched when more data is made available. It is freely available on the Ensembl browser, from the Ensembl Regulation MySQL database server and in a dedicated track hub. PMID:25887522

  8. Gene regulatory mechanisms underpinning prostate cancer susceptibility.

    PubMed

    Whitington, Thomas; Gao, Ping; Song, Wei; Ross-Adams, Helen; Lamb, Alastair D; Yang, Yuehong; Svezia, Ilaria; Klevebring, Daniel; Mills, Ian G; Karlsson, Robert; Halim, Silvia; Dunning, Mark J; Egevad, Lars; Warren, Anne Y; Neal, David E; Grönberg, Henrik; Lindberg, Johan; Wei, Gong-Hong; Wiklund, Fredrik

    2016-04-01

    Molecular characterization of genome-wide association study (GWAS) loci can uncover key genes and biological mechanisms underpinning complex traits and diseases. Here we present deep, high-throughput characterization of gene regulatory mechanisms underlying prostate cancer risk loci. Our methodology integrates data from 295 prostate cancer chromatin immunoprecipitation and sequencing experiments with genotype and gene expression data from 602 prostate tumor samples. The analysis identifies new gene regulatory mechanisms affected by risk locus SNPs, including widespread disruption of ternary androgen receptor (AR)-FOXA1 and AR-HOXB13 complexes and competitive binding mechanisms. We identify 57 expression quantitative trait loci at 35 risk loci, which we validate through analysis of allele-specific expression. We further validate predicted regulatory SNPs and target genes in prostate cancer cell line models. Finally, our integrated analysis can be accessed through an interactive visualization tool. This analysis elucidates how genome sequence variation affects disease predisposition via gene regulatory mechanisms and identifies relevant genes for downstream biomarker and drug development. PMID:26950096

  9. Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters

    SciTech Connect

    Santini, Simona; Boore, Jeffrey L.; Meyer, Axel

    2003-12-31

    Due to their high degree of conservation, comparisons of DNA sequences among evolutionarily distantly-related genomes permit to identify functional regions in noncoding DNA. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involved in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either A or A clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.

  10. Overview Article: Identifying transcriptional cis-regulatory modules in animal genomes

    PubMed Central

    Suryamohan, Kushal; Halfon, Marc S.

    2014-01-01

    Gene expression is regulated through the activity of transcription factors and chromatin modifying proteins acting on specific DNA sequences, referred to as cis-regulatory elements. These include promoters, located at the transcription initiation sites of genes, and a variety of distal cis-regulatory modules (CRMs), the most common of which are transcriptional enhancers. Because regulated gene expression is fundamental to cell differentiation and acquisition of new cell fates, identifying, characterizing, and understanding the mechanisms of action of CRMs is critical for understanding development. CRM discovery has historically been challenging, as CRMs can be located far from the genes they regulate, have few readily-identifiable sequence characteristics, and for many years were not amenable to high-throughput discovery methods. However, the recent availability of complete genome sequences and the development of next-generation sequencing methods has led to an explosion of both computational and empirical methods for CRM discovery in model and non-model organisms alike. Experimentally, CRMs can be identified through chromatin immunoprecipitation directed against transcription factors or histone post-translational modifications, identification of nucleosome-depleted “open” chromatin regions, or sequencing-based high-throughput functional screening. Computational methods include comparative genomics, clustering of known or predicted transcription factor binding sites, and supervised machine-learning approaches trained on known CRMs. All of these methods have proven effective for CRM discovery, but each has its own considerations and limitations, and each is subject to a greater or lesser number of false-positive identifications. Experimental confirmation of predictions is essential, although shortcomings in current methods suggest that additional means of validation need to be developed. PMID:25704908

  11. NRC regulatory agenda

    SciTech Connect

    Not Available

    1990-10-01

    The Regulatory Agenda is a quarterly compilation of all rules on which the NRC has recently completed action or has proposed, or is considering action and of all petitions for rulemaking that the NRC has received that are pending disposition.

  12. NRC Regulatory Agenda

    SciTech Connect

    Not Available

    1989-10-01

    The Regulatory Agenda is a quarterly compilation of all rules on which the NRC has recently completed action or has proposed, or is considering action and of all petitions for rulemaking that the NRC has received that are pending disposition.

  13. NRC regulatory agenda

    SciTech Connect

    Not Available

    1990-04-01

    The Regulatory Agenda is a quarterly compilation of all rules on which the NRC has recently completed action or has proposed, or is considering action and of all petitions for rulemaking that the NRC has received that are pending disposition.

  14. Canadian drug regulatory framework.

    PubMed

    Kelly, L; Lazzaro, M; Petersen, C

    2007-03-01

    The role of regulatory drug submission evaluators in Canada is to critically assess both the data submitted and the sponsor's interpretation of the data in order to reach an evidence-, and context-based recommendation as to the potential benefits and potential harms (i.e., risks) associated with taking the drug under the proposed conditions of use. The purpose of this document is to outline the regulatory framework in which this assessment occurs, including: defining what "authorization to market a drug in Canada" means, in terms of the role of the sponsor, the responsibility of Health Canada in applying the Food and Drugs Act prior to and after marketing authorization, and the distinction between regulatory authorization versus physician authorization; highlighting organizational, process and legal factors within Health Canada related to authorization of clinical trials and authorization to market a drug; considerations during the review process, such as regulatory and scientific issues related to the drug, patient populations and trial designs; application of international guidelines, and decisions from other jurisdictions; regulatory realities regarding drug authorization, including the requirement for wording in the Product Monograph to accurately reflect the information currently available on the safe and effective use of a drug, and that hypothesis-confirming studies are essential to regulatory endorsement; current issues related to the review of therapies for dementia, such as assessing preventative treatments, and therapies that have symptomatic versus disease-modifying effects, statistical issues regarding missing data, and trial design issues. PMID:17469674

  15. Dna Sequencing

    DOEpatents

    Tabor, Stanley; Richardson, Charles C.

    1995-04-25

    A method for sequencing a strand of DNA, including the steps off: providing the strand of DNA; annealing the strand with a primer able to hybridize to the strand to give an annealed mixture; incubating the mixture with four deoxyribonucleoside triphosphates, a DNA polymerase, and at least three deoxyribonucleoside triphosphates in different amounts, under conditions in favoring primer extension to form nucleic acid fragments complementory to the DNA to be sequenced; labelling the nucleic and fragments; separating them and determining the position of the deoxyribonucleoside triphosphates by differences in the intensity of the labels, thereby to determine the DNA sequence.

  16. Ranking and Sequencing Model

    Energy Science and Technology Software Center (ESTSC)

    2009-08-13

    This database application (commonly called the Supermodel) provides a repository for managing critical facility/project information, allows the user to subjectively an objectively assess key criteria , quantify project risks, develop ROM cost estimates, determine facility/project end states, ultimately performing risk-based modeling to rank facilities/project based on risk, sequencing project schedules and provides an optimized recommended sequencing/scheduling of these projects which maximize the S&M cost savings to perform closure projects which benefit all stakeholders.

  17. SRD: a Staphylococcus regulatory RNA database.

    PubMed

    Sassi, Mohamed; Augagneur, Yoann; Mauro, Tony; Ivain, Lorraine; Chabelskaya, Svetlana; Hallier, Marc; Sallou, Olivier; Felden, Brice

    2015-05-01

    An overflow of regulatory RNAs (sRNAs) was identified in a wide range of bacteria. We designed and implemented a new resource for the hundreds of sRNAs identified in Staphylococci, with primary focus on the human pathogen Staphylococcus aureus. The "Staphylococcal Regulatory RNA Database" (SRD, http://srd.genouest.org/) compiled all published data in a single interface including genetic locations, sequences and other features. SRD proposes novel and simplified identifiers for Staphylococcal regulatory RNAs (srn) based on the sRNA's genetic location in S. aureus strain N315 which served as a reference. From a set of 894 sequences and after an in-depth cleaning, SRD provides a list of 575 srn exempt of redundant sequences. For each sRNA, their experimental support(s) is provided, allowing the user to individually assess their validity and significance. RNA-seq analysis performed on strains N315, NCTC8325, and Newman allowed us to provide further details, upgrade the initial annotation, and identified 159 RNA-seq independent transcribed sRNAs. The lists of 575 and 159 sRNAs sequences were used to predict the number and location of srns in 18 S. aureus strains and 10 other Staphylococci. A comparison of the srn contents within 32 Staphylococcal genomes revealed a poor conservation between species. In addition, sRNA structure predictions obtained with MFold are accessible. A BLAST server and the intaRNA program, which is dedicated to target prediction, were implemented. SRD is the first sRNA database centered on a genus; it is a user-friendly and scalable device with the possibility to submit new sequences that should spread in the literature. PMID:25805861

  18. SRD: a Staphylococcus regulatory RNA database

    PubMed Central

    Sassi, Mohamed; Augagneur, Yoann; Mauro, Tony; Ivain, Lorraine; Chabelskaya, Svetlana; Hallier, Marc; Sallou, Olivier; Felden, Brice

    2015-01-01

    An overflow of regulatory RNAs (sRNAs) was identified in a wide range of bacteria. We designed and implemented a new resource for the hundreds of sRNAs identified in Staphylococci, with primary focus on the human pathogen Staphylococcus aureus. The “Staphylococcal Regulatory RNA Database” (SRD, http://srd.genouest.org/) compiled all published data in a single interface including genetic locations, sequences and other features. SRD proposes novel and simplified identifiers for Staphylococcal regulatory RNAs (srn) based on the sRNA's genetic location in S. aureus strain N315 which served as a reference. From a set of 894 sequences and after an in-depth cleaning, SRD provides a list of 575 srn exempt of redundant sequences. For each sRNA, their experimental support(s) is provided, allowing the user to individually assess their validity and significance. RNA-seq analysis performed on strains N315, NCTC8325, and Newman allowed us to provide further details, upgrade the initial annotation, and identified 159 RNA-seq independent transcribed sRNAs. The lists of 575 and 159 sRNAs sequences were used to predict the number and location of srns in 18 S. aureus strains and 10 other Staphylococci. A comparison of the srn contents within 32 Staphylococcal genomes revealed a poor conservation between species. In addition, sRNA structure predictions obtained with MFold are accessible. A BLAST server and the intaRNA program, which is dedicated to target prediction, were implemented. SRD is the first sRNA database centered on a genus; it is a user-friendly and scalable device with the possibility to submit new sequences that should spread in the literature. PMID:25805861

  19. Common Interventional Radiology Procedures

    MedlinePlus

    ... of common interventional techniques is below. Common Interventional Radiology Procedures Angiography An X-ray exam of the ... into the vertebra. Copyright © 2016 Society of Interventional Radiology. All rights reserved. 3975 Fair Ridge Drive • Suite ...

  20. How Common Is the Common Core?

    ERIC Educational Resources Information Center

    Thomas, Amande; Edson, Alden J.

    2014-01-01

    Since the introduction of the Common Core State Standards for Mathematics (CCSSM) in 2010, stakeholders in adopting states have engaged in a variety of activities to understand CCSSM standards and transition from previous state standards. These efforts include research, professional development, assessment and modification of curriculum resources,…

  1. Structure of Proximal and Distant Regulatory Elements in the Human Genome

    NASA Astrophysics Data System (ADS)

    Ovcharenko, Ivan

    Clustering of multiple transcription factor binding sites (TFBSs) for the same transcription factor (TF) is a common feature of cis-regulatory modules in invertebrate animals, but the occurrence of such homotypic clusters of TFBSs (HCTs) in the human genome has remained largely unknown. To explore whether HCTs are also common in human and other vertebrates, we used known binding motifs for vertebrate TFs and a hidden Markov model-based approach to detect HCTs in the human, mouse, chicken, and fugu genomes, and examined their association with cis-regulatory modules. We found that evolutionarily conserved HCTs occupy nearly 2% of the human genome, with experimental evidence for individual TFs supporting their binding to predicted HCTs. More than half of promoters of human genes contain HCTs, with a distribution around the transcription start site in agreement with the experimental data from the ENCODE project. In addition, almost half of 487 experimentally validated developmental enhancers contain them as well - a number more than 25-fold larger than expected by chance. We also found evidence of negative selection acting on TFBSs within HCTs, as the conservation of TFBSs is stronger than the conservation of sequences separating them. The important role of HCTs as components of developmental enhancers is additionally supported by a strong correlation between HCTs and the binding of the enhancer-associated co-activator protein p300. Experimental validation of HCT-containing elements in both zebrafish and mouse suggest that HCTs could be used to predict both the presence of enhancers and their tissue specificity, and are thus a feature that can be effectively used in deciphering the gene regulatory code. In conclusion, our results indicate that HCTs are a pervasive feature of human cis-regulatory modules and suggest that they play an important role in gene regulation in the human and other vertebrate genomes.

  2. The New Common School.

    ERIC Educational Resources Information Center

    Glenn, Charles L.

    1987-01-01

    Horace Mann's goal of creating a common school that brings our society's children together in mutual respect and common learning need not be frustrated by residential segregation and geographical separation of the haves and have-nots. Massachusetts' new common school vision boasts a Metro Program for minority students, 80 magnet schools, and…

  3. The Common Core.

    ERIC Educational Resources Information Center

    Boyer, Ernest L.

    Current curricula in institutions of higher education are criticized in this speech for their lack of a common core of education. Several possibilities for developing such a common core include education centered around our common heritage and the challenges of the present. It is suggested that all students must be introduced to the events,…

  4. Knowledge representation for commonality

    NASA Technical Reports Server (NTRS)

    Yeager, Dorian P.

    1990-01-01

    Domain-specific knowledge necessary for commonality analysis falls into two general classes: commonality constraints and costing information. Notations for encoding such knowledge should be powerful and flexible and should appeal to the domain expert. The notations employed by the Commonality Analysis Problem Solver (CAPS) analysis tool are described. Examples are given to illustrate the main concepts.

  5. Automated Identification of Nucleotide Sequences

    NASA Technical Reports Server (NTRS)

    Osman, Shariff; Venkateswaran, Kasthuri; Fox, George; Zhu, Dian-Hui

    2007-01-01

    STITCH is a computer program that processes raw nucleotide-sequence data to automatically remove unwanted vector information, perform reverse-complement comparison, stitch shorter sequences together to make longer ones to which the shorter ones presumably belong, and search against the user s choice of private and Internet-accessible public 16S rRNA databases. ["16S rRNA" denotes a ribosomal ribonucleic acid (rRNA) sequence that is common to all organisms.] In STITCH, a template 16S rRNA sequence is used to position forward and reverse reads. STITCH then automatically searches known 16S rRNA sequences in the user s chosen database(s) to find the sequence most similar to (the sequence that lies at the smallest edit distance from) each spliced sequence. The result of processing by STITCH is the identification of the most similar well-described bacterium. Whereas previously commercially available software for analyzing genetic sequences operates on one sequence at a time, STITCH can manipulate multiple sequences simultaneously to perform the aforementioned operations. A typical analysis of several dozen sequences (length of the order of 103 base pairs) by use of STITCH is completed in a few minutes, whereas such an analysis performed by use of prior software takes hours or days.

  6. The Common Law Threesome: Libel, Slander, and Invasion of Privacy.

    ERIC Educational Resources Information Center

    Anapol, Malthon M.

    Unlike most of the regulatory constraints which have impact on the media, libel, slander, and invasion of privacy are common law concepts developed from the precedents of previous court decisions and from reasoning employed in the written judicial opinions of appellate courts. Since common law is thus both traditional in nature and subject to…

  7. Development of candidate gene markers associated to common bacterial blight resistance in common bean

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Common bacterial blight (CBB), caused by Xanthomonas axonopodis pv. phaseoli (Xap), is a major yield-limiting factor of common bean (Phaseolus vulgaris L.) production around the world. Two major CBB-resistant quantitative trait loci (QTL), linked to the sequence characterized amplified region marker...

  8. Automatic Command Sequence Generation

    NASA Technical Reports Server (NTRS)

    Fisher, Forest; Gladded, Roy; Khanampompan, Teerapat

    2007-01-01

    Automatic Sequence Generator (Autogen) Version 3.0 software automatically generates command sequences for the Mars Reconnaissance Orbiter (MRO) and several other JPL spacecraft operated by the multi-mission support team. Autogen uses standard JPL sequencing tools like APGEN, ASP, SEQGEN, and the DOM database to automate the generation of uplink command products, Spacecraft Command Message Format (SCMF) files, and the corresponding ground command products, DSN Keywords Files (DKF). Autogen supports all the major multi-mission mission phases including the cruise, aerobraking, mapping/science, and relay mission phases. Autogen is a Perl script, which functions within the mission operations UNIX environment. It consists of two parts: a set of model files and the autogen Perl script. Autogen encodes the behaviors of the system into a model and encodes algorithms for context sensitive customizations of the modeled behaviors. The model includes knowledge of different mission phases and how the resultant command products must differ for these phases. The executable software portion of Autogen, automates the setup and use of APGEN for constructing a spacecraft activity sequence file (SASF). The setup includes file retrieval through the DOM (Distributed Object Manager), an object database used to store project files. This step retrieves all the needed input files for generating the command products. Depending on the mission phase, Autogen also uses the ASP (Automated Sequence Processor) and SEQGEN to generate the command product sent to the spacecraft. Autogen also provides the means for customizing sequences through the use of configuration files. By automating the majority of the sequencing generation process, Autogen eliminates many sequence generation errors commonly introduced by manually constructing spacecraft command sequences. Through the layering of commands into the sequence by a series of scheduling algorithms, users are able to rapidly and reliably construct the

  9. Transcription factor trapping by RNA in gene regulatory elements

    PubMed Central

    Sigova, Alla A.; Abraham, Brian J.; Ji, Xiong; Molinie, Benoit; Hannett, Nancy M.; Eric Guo, Yang; Jangi, Mohini; Giallourakis, Cosmas C.; Sharp, Phillip A.; Young, Richard A.

    2016-01-01

    Transcription factors (TFs) bind specific sequences in promoter-proximal and distal DNA elements in order to regulate gene transcription. RNA is transcribed from both of these DNA elements, and some DNA-binding TFs bind RNA. Hence, RNA transcribed from regulatory elements may contribute to stable TF occupancy at these sites. We show that the ubiquitously expressed TF YY1 binds to both gene regulatory elements and also to their associated RNA species genome-wide. Reduced transcription of regulatory elements diminishes YY1 occupancy whereas artificial tethering of RNA enhances YY1 occupancy at these elements. We propose that RNA makes a modest but important contribution to the maintenance of certain TFs at gene regulatory elements and suggest that transcription of regulatory elements produces a positive feedback loop that contributes to the stability of gene expression programs. PMID:26516199

  10. Transcription factor trapping by RNA in gene regulatory elements.

    PubMed

    Sigova, Alla A; Abraham, Brian J; Ji, Xiong; Molinie, Benoit; Hannett, Nancy M; Guo, Yang Eric; Jangi, Mohini; Giallourakis, Cosmas C; Sharp, Phillip A; Young, Richard A

    2015-11-20

    Transcription factors (TFs) bind specific sequences in promoter-proximal and -distal DNA elements to regulate gene transcription. RNA is transcribed from both of these DNA elements, and some DNA binding TFs bind RNA. Hence, RNA transcribed from regulatory elements may contribute to stable TF occupancy at these sites. We show that the ubiquitously expressed TF Yin-Yang 1 (YY1) binds to both gene regulatory elements and their associated RNA species across the entire genome. Reduced transcription of regulatory elements diminishes YY1 occupancy, whereas artificial tethering of RNA enhances YY1 occupancy at these elements. We propose that RNA makes a modest but important contribution to the maintenance of certain TFs at gene regulatory elements and suggest that transcription of regulatory elements produces a positive-feedback loop that contributes to the stability of gene expression programs. PMID:26516199

  11. The Prokineticins: A Novel Pair of Regulatory Peptides

    PubMed Central

    Zhou, Qun-Yong

    2009-01-01

    Secreted peptides play broad regulatory roles in brain function and elsewhere in the body. Prokineticins are a pair of newly identified regulatory peptides that signal through two highly homologous G protein–coupled receptors. Prokineticins possess a unique structural motif of five disulfide bonds and a completely conserved N-terminal hexapeptide sequence that is essential to biological activity. Diverse biological functions, including roles in development and cell differentiation, have been assigned to the prokineticins. A network of genes, subject to various transcriptional factors, may functionally converge on the prokineticins as regulatory targets. PMID:17200460

  12. Cis- and Trans-Regulatory Mechanisms of Gene Expression in the ASJ Sensory Neuron of Caenorhabditis elegans

    PubMed Central

    González-Barrios, María; Fierro-González, Juan Carlos; Krpelanova, Eva; Mora-Lorca, José Antonio; Pedrajas, José Rafael; Peñate, Xenia; Chavez, Sebastián; Swoboda, Peter; Jansen, Gert; Miranda-Vizuete, Antonio

    2015-01-01

    The identity of a given cell type is determined by the expression of a set of genes sharing common cis-regulatory motifs and being regulated by shared transcription factors. Here, we identify cis and trans regulatory elements that drive gene expression in the bilateral sensory neuron ASJ, located in the head of the nematode Caenorhabditis elegans. For this purpose, we have dissected the promoters of the only two genes so far reported to be exclusively expressed in ASJ, trx-1 and ssu-1. We hereby identify the ASJ motif, a functional cis-regulatory bipartite promoter region composed of two individual 6 bp elements separated by a 3 bp linker. The first element is a 6 bp CG-rich sequence that presumably binds the Sp family member zinc-finger transcription factor SPTF-1. Interestingly, within the C. elegans nervous system SPTF-1 is also found to be expressed only in ASJ neurons where it regulates expression of other genes in these neurons and ASJ cell fate. The second element of the bipartite motif is a 6 bp AT-rich sequence that is predicted to potentially bind a transcription factor of the homeobox family. Together, our findings identify a specific promoter signature and SPTF-1 as a transcription factor that functions as a terminal selector gene to regulate gene expression in C. elegans ASJ sensory neurons. PMID:25769980

  13. Mapping open chromatin with formaldehyde-assisted isolation of regulatory elements.

    PubMed

    Nammo, Takao; Rodríguez-Seguí, Santiago A; Ferrer, Jorge

    2011-01-01

    Noncoding regulatory genomic elements are central for cellular function, differentiation, and disease, but remain poorly characterized. FAIRE (formaldehyde-assisted isolation of regulatory elements) has emerged as a simple method to identify and analyze active regulatory sequences based on their decreased nucleosomal content. More recently FAIRE was combined with high-throughput sequencing (FAIRE-seq) to locate tissue-specific regulatory elements at a genome scale in purified human pancreatic islets. Here we describe the implementation of the FAIRE method in human pancreatic islet cells. PMID:21913087

  14. Common Transcriptional Mechanisms for Visual Photoreceptor Cell Differentiation among Pancrustaceans

    PubMed Central

    Mahato, Simpla; Morita, Shinichi; Tucker, Abraham E.; Liang, Xulong; Jackowska, Magdalena; Friedrich, Markus; Shiga, Yasuhiro; Zelhof, Andrew C.

    2014-01-01

    A hallmark of visual rhabdomeric photoreceptors is the expression of a rhabdomeric opsin and uniquely associated phototransduction molecules, which are incorporated into a specialized expanded apical membrane, the rhabdomere. Given the extensive utilization of rhabdomeric photoreceptors in the eyes of protostomes, here we address whether a common transcriptional mechanism exists for the differentiation of rhabdomeric photoreceptors. In Drosophila, the transcription factors Pph13 and Orthodenticle (Otd) direct both aspects of differentiation: rhabdomeric opsin transcription and rhabdomere morphogenesis. We demonstrate that the orthologs of both proteins are expressed in the visual systems of the distantly related arthropod species Tribolium castaneum and Daphnia magna and that their functional roles are similar in these species. In particular, we establish that the Pph13 homologs have the ability to bind a subset of Rhodopsin core sequence I sites and that these sites are present in key phototransduction genes of both Tribolium and Daphnia. Furthermore, Pph13 and Otd orthologs are capable of executing deeply conserved functions of photoreceptor differentiation as evidenced by the ability to rescue their respective Drosophila mutant phenotypes. Pph13 homologs are equivalent in their ability to direct both rhabdomere morphogenesis and opsin expression within Drosophila, whereas Otd paralogs demonstrate differential abilities to regulate photoreceptor differentiation. Finally, loss-of-function analyses in Tribolium confirm the conserved requirement of Pph13 and Otd in regulating both rhabdomeric opsin transcription and rhabdomere morphogenesis. Taken together, our data identify components of a regulatory framework for rhabdomeric photoreceptor differentiation in Pancrustaceans, providing a foundation for defining ancestral regulatory modules of rhabdomeric photoreceptor differentiation. PMID:24991928

  15. Rationales for regulatory activity

    SciTech Connect

    Perhac, R.M.

    1997-02-01

    The author provides an outline which touches on the types of concerns about risk evaluation which are addressed in the process of establishing regulatory guides. Broadly he says regulatory activity serves three broad constituents: (1) Paternalism (private risk); (2) Promotion of social welfare (public risks); (3) Protection of individual rights (public risks). He then discusses some of the major issues encountered in reaching a decision on what is an acceptable level of risk within each of these areas, and how one establishes such a level.

  16. Identification of transcriptional regulatory nodes in soybean defense networks using transient co-transactivation assays

    PubMed Central

    Wang, Yongli; Wang, Hui; Ma, Yujie; Du, Haiping; Yang, Qing; Yu, Deyue

    2015-01-01

    Plant responses to major environmental stressors, such as insect feeding, not only occur via the functions of defense genes but also involve a series of regulatory factors. Our previous transcriptome studies proposed that, in addition to two defense-related genes, GmVSPβ and GmN:IFR, a high proportion of transcription factors (TFs) participate in the incompatible soybean-common cutworm interaction networks. However, the regulatory mechanisms and effects of these TFs on those induced defense-related genes remain unknown. In the present work, we isolated and identified 12 genes encoding MYB, WRKY, NAC, bZIP, and DREB TFs from a common cutworm-induced cDNA library of a resistant soybean line. Sequence analysis of the promoters of three co-expressed genes, including GmVSPα, GmVSPβ, and GmN:IFR, revealed the enrichment of various TF-binding sites for defense and stress responses. To further identify the regulatory nodes composed of these TFs and defense gene promoters, we performed extensive transient co-transactivation assays to directly test the transcriptional activity of the 12 TFs binding at different levels to the three co-expressed gene promoters. The results showed that all 12 TFs were able to transactivate the GmVSPβ and GmN:IFR promoters. GmbZIP110 and GmMYB75 functioned as distinct regulators of GmVSPα/β and GmN:IFR expression, respectively, while GmWRKY39 acted as a common central regulator of GmVSPα/β and GmN:IFR expression. These corresponding TFs play crucial roles in coordinated plant defense regulation, which provides valuable information for understanding the molecular mechanisms involved in insect-induced transcriptional regulation in soybean. More importantly, the identified TFs and suitable promoters can be used to engineer insect-resistant plants in molecular breeding studies. PMID:26579162

  17. Extensive conservation of ancient microsynteny across metazoans due to cis-regulatory constraints

    PubMed Central

    Irimia, Manuel; Tena, Juan J.; Alexis, Maria S.; Fernandez-Miñan, Ana; Maeso, Ignacio; Bogdanović, Ozren; de la Calle-Mustienes, Elisa; Roy, Scott W.; Gómez-Skarmeta, José L.; Fraser, Hunter B.

    2012-01-01

    The order of genes in eukaryotic genomes has generally been assumed to be neutral, since gene order is largely scrambled over evolutionary time. Only a handful of exceptional examples are known, typically involving deeply conserved clusters of tandemly duplicated genes (e.g., Hox genes and histones). Here we report the first systematic survey of microsynteny conservation across metazoans, utilizing 17 genome sequences. We identified nearly 600 pairs of unrelated genes that have remained tightly physically linked in diverse lineages across over 600 million years of evolution. Integrating sequence conservation, gene expression data, gene function, epigenetic marks, and other genomic features, we provide extensive evidence that many conserved ancient linkages involve (1) the coordinated transcription of neighboring genes, or (2) genomic regulatory blocks (GRBs) in which transcriptional enhancers controlling developmental genes are contained within nearby bystander genes. In addition, we generated ChIP-seq data for key histone modifications in zebrafish embryos, which provided further evidence of putative GRBs in embryonic development. Finally, using chromosome conformation capture (3C) assays and stable transgenic experiments, we demonstrate that enhancers within bystander genes drive the expression of genes such as Otx and Islet, critical regulators of central nervous system development across bilaterians. These results suggest that ancient genomic functional associations are far more common than previously thought—involving ∼12% of the ancestral bilaterian genome—and that cis-regulatory constraints are crucial in determining metazoan genome architecture. PMID:22722344

  18. Returning common sense to regulations

    SciTech Connect

    Fox, M.R.

    1995-10-01

    While these sessions of the November 1995 meeting of the American Nuclear Society are being devoted to the Linear Theory of harm from radiation, it must be realized that the low-level radiation issue, as important as it may be, is but a subset of an entire body of environmental issues running afoul of common sense. Cellular phones, electromagnetic fields, asbestos, dioxin, acid rain, and others especially in their public portrayals, some in their regulatory treatment, are based upon exaggerated or misunderstood risks. One must recognize that what lies ahead is an immense effort to revisit the underlying science of the existing regulations of radiation exposures. New evidence has been published, and most importantly, it is now recognized that many of these regulations--promulgated with the best of intentions--have been extraordinarily harmful to the public. In many cases, the harm has been exaggerated, and has created in the public policy arena the notion that the public is at great risk from the smallest sources of radiation. The national cost of compliance with these regulations has been enormous. To the extent that existing environmental regulations are not being moderated, they pose major economic threats to present and future industries involving nuclear materials and technology. These would include the pharmaceutical industries as well as those seeking U.S. isotope markets in separations, purification, labeling, and manufacturing of new radiopharmaceuticals for cancer therapy, diagnosis, pain mitigation, treatment of arthritis, and other new applications. For those who are not aware of the results of recent advances in radiopharmaceuticals, clinical trials have demonstrated an 80% remission rate in the treatment of b-cell lymphoma and leukemia. New isotopes and new isotope technology promise greater effectiveness in the treatment of cancer and other diseases. The regulatory problems and their enormous costs exist at all stages in nuclear medicine, from the

  19. Increasing the dynamic control space of mammalian transcription devices by combinatorial assembly of homologous regulatory elements from different bacterial species.

    PubMed

    Bacchus, William; Weber, Wilfried; Fussenegger, Martin

    2013-01-01

    Prokaryotic transcriptional regulatory elements are widely utilized building blocks for constructing regulatory genetic circuits adapted for mammalian cells and have found their way into a broad range of biotechnological applications. Prokaryotic transcriptional repressors, fused to eukaryotic transactivation or repression domains, compose the transcription factor, which binds and adjusts transcription from chimeric promoters containing the repressor-specific operator sequence. Escherichia coli and Chlamydia trachomatis share common features in the regulatory mechanism of the biosynthesis of l-tryptophan. The repressor protein TrpR of C. trachomatis regulates the trpRBA operon and the TrpR of E. coli regulates the trpEDCBA operon, both requiring l-tryptophan as a co-repressor. Fusion of these bacterial repressors to the VP16 transactivation domain of Herpes simplex virus creates synthetic transactivators that could bind and activate chimeric promoters, assembled by placing repressor-specific operator modules adjacent to a minimal promoter, in an l-tryptophan-adjustable manner. Combinations of different transactivator and promoter variants from the same or different bacterial species resulted in a multitude of regulatory systems where l-tryptophan regulation properties, background noise, and maximal gene expression levels were significantly diverse. Different l-tryptophan analogues showed diverse regulatory capacity depending on the promoter/transactivator combination. We believe the systems approach to rationally choose promoters, transactivators and inducer molecules, to obtain desired and predefined genetic expression dynamics and control profiles, will significantly advance the design of new regulatory circuits as well as improving already existing ones. PMID:23178502

  20. [Sequencing babies?].

    PubMed

    Jordan, Bertrand

    2015-10-01

    An extension of newborn screening to genome sequencing is now feasible but raises a number of scientific, organisational and ethical issues. This is being explored in discussions and in several funded trials, in order to maximize benefits and avoid some identified risks. As some companies are already offering such a service, this is quite an urgent matter. PMID:26481033

  1. Toxicogenomics in Regulatory Ecotoxicology

    EPA Science Inventory

    The potential utility of toxicogenomics in toxicological research and regulatory activities has been the subject of scientific discussions, and as with any new technology, there is a wide range of opinion. The purpose of this feature article is to consider roles of toxicogenomic...

  2. The regulatory horizon

    NASA Technical Reports Server (NTRS)

    Cook, ED

    1987-01-01

    The author briefly discusses the FAA's position as it relates to cockpit resource management. For example, if Cockpit Resource Management (CRM) is a positive concept, why isn't everyone required to implement it? The regulatory practice of the FAA is discussed and questions and answers are presented.

  3. EPAct Transportation Regulatory Activities

    SciTech Connect

    2011-11-21

    The U.S. Department of Energy's (DOE) Vehicle Technologies Program manages several transportation regulatory activities established by the Energy Policy Act of 1992 (EPAct), as amended by the Energy Conservation Reauthorization Act of 1998, EPAct 2005, and the Energy Independence and Security Act of 2007 (EISA).

  4. REFINE WETLAND REGULATORY PROGRAM

    EPA Science Inventory

    The Tribes will work toward refining a regulatory program by taking a draft wetland conservation code with permitting incorporated to TEB for review. Progress will then proceed in developing a permit tracking system that will track both Tribal and fee land sites within reservati...

  5. REGULATORY AIR QUALITY MODELS

    EPA Science Inventory

    Appendix W to 40CFR Part 51 (Guideline on Air Quality Models) specifies the models to be used for purposes of permitting, PSD, and SIPs. Through a formal regulatory process this modeling guidance is periodically updated to reflect current science. In the most recent action, thr...

  6. NRC regulatory agenda

    SciTech Connect

    Not Available

    1993-02-01

    This document is a compilation of all rules on which the NRC has recently completed action, or has proposed action, or is considered action, and all petitions for rulemaking which have been received by the Commission and are pending disposition by the Commission. The Regulatory Agenda is updated and issued each quarter.

  7. NRC regulatory agenda

    SciTech Connect

    Not Available

    1992-11-01

    This document provides a compilation of all rules on which the NRC has recently completed action, or has proposed action, or is considering action, and all petitions for rulemaking which have been received by the Commission and are pending disposition by the Commission. The Regulatory Agenda is updated and issued each quarter.

  8. NRC Regulatory Agenda

    SciTech Connect

    Not Available

    1992-07-01

    This document compilation of all rules on which the NRC has recently completed action, or has proposed action, or is considering action, and all petitions for rule making which have been received by the Commission and are pending disposition by the Commission. The Regulatory Agenda is updated and issued each quarter.

  9. Genome Sequence of the Immunomodulatory Strain Bifidobacterium bifidum LMG 13195

    PubMed Central

    Gueimonde, Miguel; Ventura, Marco; Margolles, Abelardo

    2012-01-01

    In this work, we report the genome sequences of Bifidobacterium bifidum strain LMG13195. Results from our research group show that this strain is able to interact with human immune cells, generating functional regulatory T cells. PMID:23209243

  10. The role of cis regulatory evolution in maize domestication.

    PubMed

    Lemmon, Zachary H; Bukowski, Robert; Sun, Qi; Doebley, John F

    2014-11-01

    Gene expression differences between divergent lineages caused by modification of cis regulatory elements are thought to be important in evolution. We assayed genome-wide cis and trans regulatory differences between maize and its wild progenitor, teosinte, using deep RNA sequencing in F1 hybrid and parent inbred lines for three tissue types (ear, leaf and stem). Pervasive regulatory variation was observed with approximately 70% of ∼17,000 genes showing evidence of regulatory divergence between maize and teosinte. However, many fewer genes (1,079 genes) show consistent cis differences with all sampled maize and teosinte lines. For ∼70% of these 1,079 genes, the cis differences are specific to a single tissue. The number of genes with cis regulatory differences is greatest for ear tissue, which underwent a drastic transformation in form during domestication. As expected from the domestication bottleneck, maize possesses less cis regulatory variation than teosinte with this deficit greatest for genes showing maize-teosinte cis regulatory divergence, suggesting selection on cis regulatory differences during domestication. Consistent with selection on cis regulatory elements, genes with cis effects correlated strongly with genes under positive selection during maize domestication and improvement, while genes with trans regulatory effects did not. We observed a directional bias such that genes with cis differences showed higher expression of the maize allele more often than the teosinte allele, suggesting domestication favored up-regulation of gene expression. Finally, this work documents the cis and trans regulatory changes between maize and teosinte in over 17,000 genes for three tissues. PMID:25375861

  11. MSLICE Sequencing

    NASA Technical Reports Server (NTRS)

    Crockett, Thomas M.; Joswig, Joseph C.; Shams, Khawaja S.; Norris, Jeffrey S.; Morris, John R.

    2011-01-01

    MSLICE Sequencing is a graphical tool for writing sequences and integrating them into RML files, as well as for producing SCMF files for uplink. When operated in a testbed environment, it also supports uplinking these SCMF files to the testbed via Chill. This software features a free-form textural sequence editor featuring syntax coloring, automatic content assistance (including command and argument completion proposals), complete with types, value ranges, unites, and descriptions from the command dictionary that appear as they are typed. The sequence editor also has a "field mode" that allows tabbing between arguments and displays type/range/units/description for each argument as it is edited. Color-coded error and warning annotations on problematic tokens are included, as well as indications of problems that are not visible in the current scroll range. "Quick Fix" suggestions are made for resolving problems, and all the features afforded by modern source editors are also included such as copy/cut/paste, undo/redo, and a sophisticated find-and-replace system optionally using regular expressions. The software offers a full XML editor for RML files, which features syntax coloring, content assistance and problem annotations as above. There is a form-based, "detail view" that allows structured editing of command arguments and sequence parameters when preferred. The "project view" shows the user s "workspace" as a tree of "resources" (projects, folders, and files) that can subsequently be opened in editors by double-clicking. Files can be added, deleted, dragged-dropped/copied-pasted between folders or projects, and these operations are undoable and redoable. A "problems view" contains a tabular list of all problems in the current workspace. Double-clicking on any row in the table opens an editor for the appropriate sequence, scrolling to the specific line with the problem, and highlighting the problematic characters. From there, one can invoke "quick fix" as described

  12. Toxicogenomics and the Regulatory Framework

    EPA Science Inventory

    Toxicogenomics presents regulatory agencies with the opportunity to revolutionize their analyses by enabling the collection of information on a broader range of responses than currently considered in traditional regulatory decision making. Analyses of genomic responses are expec...

  13. Insertion Sequences

    PubMed Central

    Mahillon, Jacques; Chandler, Michael

    1998-01-01

    Insertion sequences (ISs) constitute an important component of most bacterial genomes. Over 500 individual ISs have been described in the literature to date, and many more are being discovered in the ongoing prokaryotic and eukaryotic genome-sequencing projects. The last 10 years have also seen some striking advances in our understanding of the transposition process itself. Not least of these has been the development of various in vitro transposition systems for both prokaryotic and eukaryotic elements and, for several of these, a detailed understanding of the transposition process at the chemical level. This review presents a general overview of the organization and function of insertion sequences of eubacterial, archaebacterial, and eukaryotic origins with particular emphasis on bacterial elements and on different aspects of the transposition mechanism. It also attempts to provide a framework for classification of these elements by assigning them to various families or groups. A total of 443 members of the collection have been grouped in 17 families based on combinations of the following criteria: (i) similarities in genetic organization (arrangement of open reading frames); (ii) marked identities or similarities in the enzymes which mediate the transposition reactions, the recombinases/transposases (Tpases); (iii) similar features of their ends (terminal IRs); and (iv) fate of the nucleotide sequence of their target sites (generation of a direct target duplication of determined length). A brief description of the mechanism(s) involved in the mobility of individual ISs in each family and of the structure-function relationships of the individual Tpases is included where available. PMID:9729608

  14. Transcriptome Analysis of an Insecticide Resistant Housefly Strain: Insights about SNPs and Regulatory Elements in Cytochrome P450 Genes

    PubMed Central

    Asp, Torben; Kristensen, Michael

    2016-01-01

    Background Insecticide resistance in the housefly, Musca domestica, has been investigated for more than 60 years. It will enter a new era after the recent publication of the housefly genome and the development of multiple next generation sequencing technologies. The genetic background of the xenobiotic response can now be investigated in greater detail. Here, we investigate the 454-pyrosequencing transcriptome of the spinosad-resistant 791spin strain in relation to the housefly genome with focus on P450 genes. Results The de novo assembly of clean reads gave 35,834 contigs consisting of 21,780 sequences of the spinosad resistant strain. The 3,648 sequences were annotated with an enzyme code EC number and were mapped to 124 KEGG pathways with metabolic processes as most highly represented pathway. One hundred and twenty contigs were annotated as P450s covering 44 different P450 genes of housefly. Eight differentially expressed P450s genes were identified and investigated for SNPs, CpG islands and common regulatory motifs in promoter and coding regions. Functional annotation clustering of metabolic related genes and motif analysis of P450s revealed their association with epigenetic, transcription and gene expression related functions. The sequence variation analysis resulted in 12 SNPs and eight of them found in cyp6d1. There is variation in location, size and frequency of CpG islands and specific motifs were also identified in these P450s. Moreover, identified motifs were associated to GO terms and transcription factors using bioinformatic tools. Conclusion Transcriptome data of a spinosad resistant strain provide together with genome data fundamental support for future research to understand evolution of resistance in houseflies. Here, we report for the first time the SNPs, CpG islands and common regulatory motifs in differentially expressed P450s. Taken together our findings will serve as a stepping stone to advance understanding of the mechanism and role of P450s

  15. Discovery and verification of functional single nucleotide polymorphisms in regulatory genomic regions: Current and developing technologies

    PubMed Central

    Chorley, Brian N.; Wang, Xuting; Campbell, Michelle R.; Pittman, Gary S.; Noureddine, Maher A.; Bell, Douglas A.

    2008-01-01

    The most common form of genetic variation, single nucleotide polymorphisms or SNPs, can affect the way an individual responds to the environment and modify disease risk. Although most of the millions of SNPs have little or no effect on gene regulation and protein activity, there are many circumstances where base changes can have deleterious effects. Non-synonymous SNPs that result in amino acid changes in proteins have been studied because of their obvious impact on protein activity. It is well known that SNPs within regulatory regions of the genome can result in disregulation of gene transcription. However, the impact of SNPs located in putative regulatory regions, or rSNPs, is harder to predict for two primary reasons. First, the mechanistic roles of non-coding genomic sequence remain poorly defined. Second, experimental validation of the functional consequences of rSNPs is often slow and laborious. In this review, we summarize traditional and novel methodologies for candidate rSNPs selection, in particular in silico techniques that aid in candidate rSNP selection. Additionally we will discuss molecular biological techniques that assess the impact of rSNPs on binding of regulatory machinery, as well as functional consequences on transcription. Standard techniques such as EMSA and luciferase reporter constructs are still widely used to assess effects of rSNPs on binding and gene transcription; however, these protocols are often bottlenecks in the discovery process. Therefore, we highlight novel and developing high-throughput protocols that promise to aid in shortening the process of rSNP validation. Given the large amount of genomic information generated from a multitude of re-sequencing and genome-wide SNP array efforts, future focus should be to develop validation techniques that will allow greater understanding of the impact these polymorphisms have on human health and disease. PMID:18565787

  16. Common Conditions in Newborns

    MedlinePlus

    ... Prenatal Baby Bathing & Skin Care Breastfeeding Crying & Colic Diapers & Clothing Feeding & Nutrition Preemie Sleep Teething & Tooth Care Toddler Preschool Gradeschool Teen Young Adult Healthy Children > Ages & Stages > Baby > Common Conditions in ...

  17. Common clay and shale

    USGS Publications Warehouse

    Virta, R.L.

    2004-01-01

    Part of the 2003 industrial minerals review. The legislation, production, and consumption of common clay and shale are discussed. The average prices of the material and outlook for the market are provided.

  18. Genomic Data Commons launches

    Cancer.gov

    The Genomic Data Commons (GDC), a unified data system that promotes sharing of genomic and clinical data between researchers, launched today with a visit from Vice President Joe Biden to the operations center at the University of Chicago.

  19. Barry Commoner Assails Petrochemicals

    ERIC Educational Resources Information Center

    Chemical and Engineering News, 1973

    1973-01-01

    Discusses Commoner's ideas on the social value of the petrochemical industry and his suggestions for curtailment or elimination of its productive operation to produce a higher environmental quality for mankind at a relatively low loss in social benefit. (CC)

  20. Nuclear Regulatory Commission information digest

    SciTech Connect

    None,

    1990-03-01

    The Nuclear Regulatory Commission information digest provides summary information regarding the US Nuclear Regulatory Commission, its regulatory responsibilities, and areas licensed by the commission. This is an annual publication for the general use of the NRC Staff and is available to the public. The digest is divided into two parts: the first presents an overview of the US Nuclear Regulatory Commission and the second provides data on NRC commercial nuclear reactor licensees and commercial nuclear power reactors worldwide.

  1. Common clay and shale

    USGS Publications Warehouse

    Virta, R.L.

    2011-01-01

    The article discusses the latest developments in the global common clay and shale industry, particularly in the U.S. It claims that common clay and shale is mainly used in the manufacture of heavy clay products like brick, flue tile and sewer pipe. The main producing states in the U.S. include North Carolina, New York and Oklahoma. Among the firms that manufacture clay and shale-based products are Mid America Brick & Structural Clay Products LLC and Boral USA.

  2. Common clay and shale

    USGS Publications Warehouse

    Virta, R.L.

    2006-01-01

    At present, 150 companies produce common clay and shale in 41 US states. According to the United States Geological Survey (USGS), domestic production in 2005 reached 24.8 Mt valued at $176 million. In decreasing order by tonnage, the leading producer states include North Carolina, Texas, Alabama, Georgia and Ohio. For the whole year, residential and commercial building construction remained the major market for common clay and shale products such as brick, drain tile, lightweight aggregate, quarry tile and structural tile.

  3. Predicting the impact of promoter variability on regulatory outputs

    PubMed Central

    Kreamer, Naomi N.; Phillips, Rob; Newman, Dianne K.; Boedicker, James Q.

    2015-01-01

    The increased availability of whole genome sequences calls for quantitative models of global gene expression, yet predicting gene expression patterns directly from genome sequence remains a challenge. We examine the contributions of an individual regulator, the ferrous iron-responsive regulatory element, BqsR, on global patterns of gene expression in Pseudomonas aeruginosa. The position weight matrix (PWM) derived for BqsR uncovered hundreds of likely binding sites throughout the genome. Only a subset of these potential binding sites had a regulatory consequence, suggesting that BqsR/DNA interactions were not captured within the PWM or that the broader regulatory context at each promoter played a greater role in setting promoter outputs. The architecture of the BqsR operator was systematically varied to understand how binding site parameters influence expression. We found that BqsR operator affinity was predicted by the PWM well. At many promoters the surrounding regulatory context, including overlapping operators of BqsR or the presence of RhlR binding sites, were influential in setting promoter outputs. These results indicate more comprehensive models that include local regulatory contexts are needed to develop a predictive understanding of global regulatory outputs. PMID:26675057

  4. Predicting the impact of promoter variability on regulatory outputs.

    PubMed

    Kreamer, Naomi N; Phillips, Rob; Newman, Dianne K; Boedicker, James Q

    2015-01-01

    The increased availability of whole genome sequences calls for quantitative models of global gene expression, yet predicting gene expression patterns directly from genome sequence remains a challenge. We examine the contributions of an individual regulator, the ferrous iron-responsive regulatory element, BqsR, on global patterns of gene expression in Pseudomonas aeruginosa. The position weight matrix (PWM) derived for BqsR uncovered hundreds of likely binding sites throughout the genome. Only a subset of these potential binding sites had a regulatory consequence, suggesting that BqsR/DNA interactions were not captured within the PWM or that the broader regulatory context at each promoter played a greater role in setting promoter outputs. The architecture of the BqsR operator was systematically varied to understand how binding site parameters influence expression. We found that BqsR operator affinity was predicted by the PWM well. At many promoters the surrounding regulatory context, including overlapping operators of BqsR or the presence of RhlR binding sites, were influential in setting promoter outputs. These results indicate more comprehensive models that include local regulatory contexts are needed to develop a predictive understanding of global regulatory outputs. PMID:26675057

  5. Conservation of trans-acting networks during mammalian regulatory evolution

    PubMed Central

    Stergachis, Andrew B.; Neph, Shane; Sandstrom, Richard; Haugen, Eric; Reynolds, Alex P.; Zhang, Miaohua; Byron, Rachel; Canfield, Theresa; Stelhing-Sun, Sandra; Lee, Kristen; Thurman, Robert E.; Vong, Shinny; Bates, Daniel; Neri, Fidencio; Diegel, Morgan; Giste, Erika; Dunn, Douglas; Hansen, R. Scott; Johnson, Audra K.; Sabo, Peter J.; Wilken, Matthew S.; Reh, Thomas A.; Treuting, Piper M.; Kaul, Rajinder; Groudine, Mark; Bender, M.A.; Borenstein, Elhanan; Stamatoyannopoulos, John A.

    2014-01-01

    The fundamental body plan and major physiological axes have been highly conserved during mammalian evolution, despite const